This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
(exp tX)<j)(h) 6 L'L\ for some t 6 R, h £ H. Thus g = (exptX)hk for some k € K C H; thus g £ exp(RX)H. Suppose that g = exp(tiX)hi = exp(t2X)h,2. Then <j>(g) = )(p) = V(p), for each point p in M. The derived flag of a distribution V is the sequence of modules of vector • • •, 1>{q,p) =
= 4>(exptiX) = 0(expt 2 X) = exp (d(j>(t2X)).
and by injectivity of expod<^> on MX, we conclude t\ = ti- Then by cancellation h\ =b,2, so the representation is unique. The same argument works for .H'exp(RX'). Thus by Proposition 3.6 f) is not (IFT). A reason for presenting the alternate proof is that this second proof gives us the following new characterization of maximal semigroups. T h e o r e m 6.11. Let G be a simply connected solvable Lie group and let S := S(h + ) = ( e x p h + ) be a maximal semigroup of G, where t)+ is a half-space semialgebra, and f) is a codimension one subalgebra which is not an ideal. Then there exists X £ (h + \ h) f~l g' and for any such X, S = H(expRX) = (expMX)H with unique factorization in each case, where H := (exp h) We remark that the proof remains valid in the more general connected case. 6.2
Considering
(IFC)
The notion of irrelevance for controllability (IFC) is the natural one to look at if one's starting point is control systems on manifolds rather than the semigroups of smooth invertible maps. So we start with the vector fields generating flow maps
70 rather than the flow maps themselves. Actually this was our original motivation to introduce the irrelevancy notion, which proved more broadly powerful. For detailed proofs we refer to 17 . As a consequence of Theorem 5.5 and Proposition 5.6, controllability questions on g can be reduced to a problem on the factor algebra JJ/ IFC(g). Of course whether this information is very useful or not depends on the following two aspects: (D) Determination: How hard is the problem of determining IFC(g)? (S) Simplification: How complicated is the structure of the quotient g/IFC(g)? Is it much simpler that that of a general solvable g? For nilpotent g we obtain the best possible answers one can hope for: T h e o r e m 6.12. If g is nilpotent, then IFC(g) = A(g) = g'. A subalgebra f) C g is QFC) iff t) C g,'. Proof. An (IFT)-subalgebra f) is contained in g' by Lemma 6.5, and thus A(g) C g' by Corollary 6.3. By Proposition 5.3 we conclude that every (IFC)-subalgebra is contained in g', in particular IFC(g) C g'. The converse containments in the first statement are proved in 17 , and the converse implication in the second statement then follows from Theorem 5.5. • For g solvable, not nilpotent, the answer to (D) is, roughly speaking, that it is as hard as finding the Jordan normal form of a linear map. And concerning (S) one finds that g/ IFC(g) is only slightly more complicated than the basic-metabelian algebras mentioned in the previous section. The structure of these algebras (in terms of brackets) is not very complicated and actually easily understood, the major task one has to perform is introducing a well-suited notation. One might be disappointed by our anwswer to (D), in particular after one finds out that it is also necessary to pick a "generic" element in g, so this is not a straightforward procedure for finding IFC(g). But actually finding IFC(g) itself is not necessarily the task that must be solved. There are smaller ideals which are easily determined and which already provide most of the possible reduction: Proposition 6.13. If Q is solvable, then D2Q = [g',g'] is an (IFC)-ideal. See 17 for a proof. The factor algebra g = Q/D2Q is metabelian, i.e., its bracket algebra [g, g] is abelian. For practical purposes Proposition 5.6 can be quite useful. It allows one to proceed inductively: having found some (IFC)-ideal, pass to the quotient and look for (IFC)-ideals there. Finally there is a nice characterization of (IFC)-ideals in algebraic terms. Definition 6.14. We call a subalgebra f) C g an (NCA)-subalgebTa if it has the following property: If a C Q is a subalgebra and a + I) = g, then a = g. The acronym (NCA) stands for no complementary algebra: f) being an (NCA)subalgebra means there is no proper subalgebra o with 0 + f) = g. Of course it makes sense to speak of (NCA)-ideals, too. Proposition 6.15. An ideal i o/g is (IFC) if and only if it is an (NCA)-ideal.
71 We cannot expect that a sum of (IFC)-algebras fji,h2 is (IFC). Nevertheless, we obtain an analogue of the previous proposition. Its proof requires a few extra arguments, though. Proposition 6.16. If the subalgebra I J C J generates a closed subgroup (exp f)) and is (NCA), then it is (IFC). Every (NCA)-algebra t) is contained in A(g). Proof. Suppose that t) is (NCA). Then h C A(fl) must hold because if I) \ A(g) ^ 0 we can actually find a hyperplane subalgebra f) such that i) + t) = 0, cf. Lemma 6.5. We claim that f) is (IFC), so we consider M = G/H and $ C g such that the system on M given by {x | x e $} is controllable. We must show that S ( $ ) = G holds true. As usual 7r: G —> M denotes the canonical map, pa = 7r(e) the base point, and pr,, = d7r(e): a —> fl/f) is the projection with kernel I). Now let a = (($)) C 0, the subalgebra generated by $ . Then controllability on M implies TP0M = < X(po) | X 6 o !• which is equivalent to pr1)(o) = 0, hence a+ f) = 0. As f) is (NCA), o = £| follows, so int(5(*)) # 0. Since f) C A(g), H C A(G). Thus 5 ( $ ) is transitive on G/A(G). As A(G) is (IFT) and int(S($)) # 0, we deduce S ( $ ) = G, finishing the proof. • Now we ask if the reverse implication holds true, so is every (IFC)-algebra (NCA)? Example 6.6 and the discussion leading to Proposition 6.8 show that this is a nontrivial problem. Take an t) which is not (NCA). Take a proper subalgebra o 7^ g such that a + t) = Q. Then it is clear that the system induced by $ = a on M = G/H will be transitive at least on a neighborhood of the base point po = 7r(e) of M, but we cannot expect in general that the semigroup S = A = (exp o) is transitive on M, for that is equivalent to G = HA, and the latter we know for sure only if Ad(G)fj n o = {0} is trivial. Example 6.17. Again we consider the algebra of motions of the euclidean plane 0 = s p a n { A , X , Y } with brackets: [A,X] = Y, [A,Y] = -X. Which subalgebras f) of fl are not (NCA)? Let us look at dim h: 3-dim: Of course 0 is not (NCA), but it isn't (IFC) either. 2-dim: There's only one hyperplane subalgebra: 0'. Of course 0' is not (NCA), but it is not (IFC) either. 1-dim: The line RX is not (NCA) iff X £ g' because 0' is the only possible choice we have for a complementary subalgebra. As Ad(G)0 ; = 0', RX n Ad(G)$' = {0}, so G = exp(RX)G' holds true, i.e., RX is not (IFC). So in this case -i(NCA) implies -i(IFC). In particular we observe that, for example, the subalgebra f) = RX is (NCA), so it is (IFC), too. But we also observe that f) g IFC(0) = {0}. So the inclusion f) C A(o) if t) is (IFC) cannot be improved any further. An (IFC)-algebra need not be contained in the ideal IFC(0). We conjecture that-.(NCA) =$• -.(IFC), resp., (IFC) =$> (NCA) so we state this as an open
72 Problem 6.18. Prove or disprove: an (IFC)-algebra is also (NCA). Now we return to our discussions of the benefits of passing to the quotient g/IFC(g). 6.3
The controllability
normal
form
Reduction modulo A(g) led us to A-reduced algebras which actually turned out to be the ones we called basic-metabelian. The algebra g/IFC(g) also deserves a special name, we call it the controllability normal form (CNF)o/g. If IFC(g) = {0} we say that g is in (CNF). Although the structure (bracket-wise) of such a g in (CNF) is not really complicated, having a reasonable terminology is indispensable for analyzing it. Roughly speaking, such a g is made out of two components, a real vector space V and a set (actually a vector subspace) o C g£(V) of linear maps V -*V such that (1) every A G a is diagonalizable over C, and (2) any two maps A,A'(Ea
commute: A A' = A' A.
These conditions are tantamount to saying that a is an abelian subalgebra of g£(V), and V may be decomposed into common eigenspaces of A, A'. Formalizing this statement it turns out that one can decompose g in the following way:
fl = h e 0 g A © ® g u , AeA
(6.1)
wen
which has to be interpreted as follows: • h is an abelian subalgebra of g corresponding to o plus the center 3(g). • The sets A, Q stand for the so-called roots signifying the common eigenvalues and g , g" are common eigenspaces also called root-spaces. • A is the set of real roots; each A G A is a nonzero linear form A: f) —> R, and [H,XX] = \{H)XX,
holds true for all H G h, Xx G g A .
• Q is the set of complex roots—actually only one half of it since complex eigenvalues always appear in conjugate pairs—so an w G fi is an R-linear map from h -> C such that w(h) % R. The root space g" is isomorphic (as a vector space) to some Cr considered as a real vector space, and for H € h, Xu G gw
[H,Xu]=u(H)Xu. • Each root space gA, g" is an abelian ideal and [g a ,g^] = 0 for all a,/3 G A u Q . At this point one clearly sees the difference between the (CNF) g/IFC(g) and the basic metabelian g/A(g): in the latter case there are no complex roots (eigenvalues) .
73 Proposition 6.19. Let g be as in (CNF). Then
fl' = ©8 A ©©8 W > A€A
"* A(g) = © g " .
wgn
wen
A hyperplane subalgebra a in g has the form 0- = cii, + 8 ,
wi£/i a hyperplane Of, C h, or
a = f) © ^ f t g a © a,,
6.^
Generic controllability
un£/a a hyperplane oM C g^.
rank and type
Now we reap some of the benefits of reduction because we will be able to answer the following questions: • What is the minimum number of elements Xi,... ,Xr necessary to generate a given g as a Lie algebra, i.e., {{X\,... ,Xr)) = 8? • Given a Lie algebra g, some "drift vector fields" Ai,... ,Aj and some "input vector fields" Bi,... ,Bi, what are the minimal numbers (i,d) such that U = {Ai ,...,Ad, ±B%,..., ±Bi) is controllable? • Will controllability or noncontrollability of the system given by U = {Ai ,...,Ad, ± B i , . . . , ±Bi} C jj be stable under perturbations (to Aj, Bk € fl)? Is it a generic property? For the first two questions we observe that existence of such minimal numbers r, resp (i,d) is not really an issue, but it is not at all clear how to compute them. We call r the generic controllability rank of g, GCR(g), and (i,d) the generic controllability type of g, GCT(g). This choice of terminology has something to do with the last question because once certain conditions can be expressed in terms of some polynomials (say in Xi,..., Xr) not vanishing (like a rank condtion), they are either void or generic, so they hold either never or in almost every case. Looking at the (CNF) of a solvable g we can immediately read off its controllability rank and type. With the notation of (CNF) we let d\ = max < diniR g | A 6 A > ,
and
du = max {dime g" | u; € fi} ,
and obtain: Proposition 6.20. Let g be in (CNF).
Then
GCR(g) = max {dim t), 1 + dA, 1 + dn} , and GCT(g) = (i, d) with i + d = GCR(g), i = max {dim f), 1 + ^ A } ,
and
d = max{0,1 + dn — i) •
74 The proof requires two separate arguments. One must find a set of generators of the appropriate size, and one must show that one cannot generate g with less elements. In both cases the root decomposition as in (6.1) is instrumental. The details can be found in 1T. A particularly nice result is the next theorem which simplifies Sachkov's work on controllability of solvable Lie algebras, cf. 2 2 , also 2 3 . Definition 6.21. We call a Lie algebra g SID-controllable, if there exist A,B € g such that S(A,±B) = G, i.e., the single input plus drift system g = g(A + uB), u 6 R is controllable. It turns out that this is a generic property of the Lie algebra g rather than the particular pair (A,B): the set of such pairs is either empty or open and dense in g x g. T h e o r e m 6.22. For a solvable Lie algebra the following statements are equivalent: (i) g is SID-controllable. (ii) every factor algebra g/i (i C g an ideal) is SID-controllable. (hi) g/£>2g is SID-controllable (iv) GCT( 0 ) = (1,1), (v) g/IFC(g) is isomorphic to a semidirect product Ck x$j R with Q diag(a>i,...,uik) 6 g£{k, C) and w i , . . . ,Wfc € C \ R pairwise distinct.
=
If g is SID-controllable, then the foloowing statements hold true: (vi) $j has no ideals of codimension 2. (vii) Q' is a hyperplane and A(g) = $)'. Alternatively, in view of (iii), one can reduce the problem of classifying SIDcontrollable algebras to the metabelian case [fl',g'] = 0. This is a basic case in Sachkov's work which allows a very short classification where the conditions involved are similar to familiar criteria from linear control theory. Proposition 6.23. A metabelian g is SID-controllable iff it is isomorphic to a semidirect product M.k x ^ R where A € g£(k,M) has nonreal spectrum spec(A) C C\R and every eigenvalue has geometric multiplicity 1. The latter is equivalent to iank(A + XI) > k - 1 for all A 6 C. Comparison with the previous theorem reveals how far away from g/ IFC(g) the possibly bigger algebra Q/D2Q actually may be. The metabelian g/D 2 g has a root decomposition similar to the (CNF):
g/D2g = h © 0 g A e 0 g „ . A€A
u6d
The difference is that the root spaces Q\, gw need not be eigenspaces, they may be generalized eigenspaces; and the subalgebra t) is nilpotent but not necessarily abelian.
75
7
Irrelevant groups and algebras: the semisimple case
The highly developed structure theory for semisimple Lie groups and algebras allows one to derive quite powerful results concerning irrelevancy in the semisimple setting. This project has been carried out by L. San Martin and co-workers over a period of several years. We do not undertake to explain the details of these developments here, but only highlight a few principle results. Let G be a connected semisimple Lie group with finite center and let a denote its Lie algebra. Then the Lie group and the Lie algebra admit Iwasawa decompositions of the form G = KAN, g = t + a + n, where K = exp(t), A = exp(a), and N = exp(n), and where the product factorization of each element is unique and the Lie algebra decomposition is into direct summands. The Iwasawa decompositions arise by taking an Cartan decomposition t + p of 5, picking a maximal abelian subalgebra o of p, letting II be a set of positive roots for a, and setting n equal to the sum of the root spaces for the positive roots. Set M equal to the centralizer of A in K. Then P = MAN is a group, called a minimal parabolic group. The coset space G/P is called a maximal boundary of G. The first major result of San Martin appeared in 2 4 . We cite it in the language of this paper, not the language that he used. T h e o r e m 7 . 1 . Let G be a connected semisimple Lie group with finite center, and let P be a minimal parabolic subgroup. Then P is (IFT). Much stronger results are available if G is simple 25 . T h e o r e m 7.2. Let G be a simple Lie group and let H he a closed subgroup such that 0 < dim(H) < dim(G) and the coset space G/H is compact. Then H is (IFT).
References 1. J. Hilgert, K. H. Hofmann, and J. D. Lawson, Lie Groups, Convex Cones, and Semigroups, (Oxford Press, Oxford, 1989). 2. J. Hilgert and K.-H. Neeb, Basic Theory of Lie Semigroups and Applications, (Springer Lecture Notes in Mathematics 1552, Springer, 1993). 3. —, "Hyperplane subalgebras in real Lie algebras," Geom. Dedicata 36, 207-224 (1990). 4. K. H. Hofmann, J. D. Lawson, and E. B. Vinberg, Editors, Semigroups in Algebra, Geometry and Analysis, (de Gruyter, Berlin, 1995). 5. V. Jurdjevic, Geometric Control Theory, (Cambridge Press, Cambridge, 1997). 6. V. Jurdjevic and H. Sussmann, Control systems on Lie groups J. Diff. Eq. 12, 313-329 (1972). 7. J. D. Lawson, Maximal subsemigroups of Lie groups that are total, Proceedings of the Edinburgh Math. Soc. 87 , 479-501 (1987). 8. —, "Ordered Manifolds, Invariant Cone Fields, and Semigroups," Forum Math. 1 , 273-308 (1989). 9. —, "Semigroups of Ol'shanskii type," Semigroups in Algebra, Geometry and
76 Analysis, (de Gruyter, Berlin, 1994). 10. —, "Semigroups in Mobius and Lorentzian geometry," Geom. Dedicata 70, 139-180 (1998). 11. —, "Geometric control and Lie semigroup theory," In: Differential Geometry and Control, Proceedings of Symposia in Pure Mathematics, 64, Editors: G. Ferreyra, R. Gardner, H. Hermes, H. Sussmann, (Amer. Math. Soc, 207-221 1999). 12. C. Loewner, "On some transformation semigroups," J. Rat. Mech. and Anal. 5, 791-804 (1956). 13. —, "On some transformation semigroups invariant under Euclidean or nonEuclidean isometries," J. Math, and Mech. 8, 393-409(1959) . 14. —, "On semigroups in analysis and geometry," Bulletin A.M.S. 70, 1-15 (1964). 15. D. Mittenhuber, "Dubins' problem inhyperbolic space," In: Geometric Control and Non-holonomic Mechanics, CMS Conference Proceedings, Volume 25, Editors: V. Jurdjevic, R.W. Sharpe, (Can. Math. Soc, 101-114 1998). 16. —, "Controllability of solvable Lie algebras," J. Dyn. Contr. Systems 6, 453459 (2001). 17. —, "Controllability of systems on solvable Lie groups: the generic case," J. Dyn. Contr. Systems 7:1, 61-75 (2001) . 18. F. Monroy-Perez, "Non-euclidean Dubins' problem," J. Dyn. Contr. Systems 4:2, 249-272 (1998). 19. G. I. Ol'shanskii, "Convex cones in symmetric Lie algebras, Lie semigroups, and invariant causal structures on pseudo-Riemannian symmetric spaces," Sov. Math. Dokl. 26 , 97-101 (1982). 20. —/'Invariant cones in Lie algebras, Lie semigroups, and the holomorphic discrete series," Fund. Anal, and Appl. 15 , 275-285 (1982). 21. R. S. Palais, "A global formulation of the Lie theory of transformation groups," (Memoirs of the Amer. Math. Soc, Providence, 1957). 22. Y. Sachkov, "Controllability of right-invariant systems on solvable Lie groups," J. Dyn. Contr. Systems 3:4, 531-564 (1997). 23. —, "Survey on Controllability of Invariant Systems on Solvable Lie Groups," In: Differential Geometry and Control, Proceedings of Symposia in Pure Mathematics, Volume 64, Editors: G. Ferreyra, R. Gardner, H. Hermes, H. Sussmann, (Amer. Math. Soc, 297-317 1999). 24. L. San Martin, "Invariant control sets on flag manifolds," Math, of Control, Signals, and Systems 6 , 41-61 (1993). 25. L. San Martin and P. Tonelli, "Transitive actions of semigroups in semisimple Lie groups," Semigroup Forum 58, 142-151 (1999). 26. H. Sussmann, "The bang-bang problem for certain control systems in GL(ra, R), SIAM J. Control 10, 470-476 (1972).
77 CANONICAL CONTACT SYSTEMS FOR CURVES: A SURVEY
WITOLD RESPONDEK Laboratoire de mathematiques de VINSA de 16 131 Mont Saint Aignan, France E-mail: [email protected]
Rouen
WILLIAM PASILLAS-LEPINE Laboratoire des signaux et systemes, Supelec 91 129 Gif-sur-Yvette, France E-mail: [email protected] We start with a brief overview of results characterizing distributions (equivalently Pfaffian systems) that are locally equivalent to a canonical contact system. Then we concentrate on distributions equivalent to the canonical contact system on 7 n (IR,]R m ), that is the canonical contact system for curves. We give various necessary and sufficient geometric conditions for a distribution (or a Pfaffian system) to be locally equivalent to the canonical contact systems for curves. We study the geometry of that class of systems, in particular, the existence of corank one involutive subdistributions contained in the successive elements of the derived flag. We give a new characterization of distributions locally equivalent to the canonical contact systems for curves that uses only one involutive subdistribution in which all information about the geometry and about the regularity is encoded. We also distinguish regular points, at which the system is equivalent to the canonical contact system, and singular points, at which we propose a new normal form that generalizes the canonical contact system on Jn(R, R m ) in a way analogous to that how Kumpera-Ruiz normal form generalizes Goursat normal form, that is, the canonical contact system on Jn(R, M.). Finally, we briefly discuss how the geometry of distributions equivalent to the canonical contact systems for curves is reflected by their flatness and illustrate some of our results by a model of a simple nonholonomic control system.
1
Introduction
Consider Jn(Rk ,Rm), and denote by (qi,...
,qk,ui,...
the space of n-jets of smooth maps from Kfc into E m , ,um,Pi),
for 1 < i < m and for 1 < \a\ < n,
the canonical coordinates, also called natural coordinates, on this space (see e.g. 3 , 25 , and 3 2 ) , where qj, for 1 < j < k, represent independent variables and Ui, for 1 < i < m, represent dependent variables; the vector of non-negative integers a = (<TI, . . . ,o~k) is a multi-index such that \o~\ = o~\ + • • • + o~k
78
Denote p° — ut, for 1 < i < m. Any smooth map ip from Efc into Rm defines a submanifold in Jn(Rk ,Rm) by the relations Pi = - ^ — ( g i , - - - , % ) ,
for 1 < i < m and 0 < \a\ < n. This submanifold is called the n-graph of ip. It turns out that all n-graphs are integral submanifolds, of dimension k, of a distribution called the canonical contact system on Jn (Rk, Rm) or the Cartan distribution 32 on J n (M f c ,E m ). The Pfaffian system that anihilates this distribution, which is also called the canonical contact system (see e.g 3 and 2 5 ) , is given in the canonical coordinates of Jn(Rk, Rm) by ft
dp? - YjPi
.j.
'dqj = 0,
for 1 < i < m and for 0 < |cr| < n — 1,
where a + \j = {a\,... , Oj + 1 , . . . ,ak). The above description explains the importance of contact systems in geometric theory of (partial) differential equations and in differential geometry. In the former, a (partial) differential equation is interpreted as a submanifold in J"(K fc , IRm) and thus it is natural to study the geometry of pairs consisting of a contact system and a submanifold, see e.g. 3 , 19 , and 3 2 . In the latter, contact systems allow to describe, for instance, diffeomorphisms which preserve the n-graphs of appplications (for example, n-graphs of curves in the case k = 1), see e.g. 3 and 2 5 . A natural problem which arises is to characterize those distributions which are (locally) equivalent to a canonical contact system. This problem was posed by Pfaff 29 in 1814 and seems still to be open in its full generality although many important particular solutions have been obtained. In the case n = 1, with m = 1 and an arbitrary k the final solution has been obtained by Darboux 7 in his famous theorem generalizing earlier results of Pfaff 29 and Frobenius n . The case n = 2, m = 1 and A; = 1 was solved by Engel in 8 . The case n > 2, m = 1 and k = 1 was solved by E. von Weber 33 , Cartan 5 and Goursat 14 (at generic points) and by Libermann 18 , Kumpera and Ruiz 17 , and Murray 24 (at an arbitrary point). The case n = 1, with k and m arbitrary has been studied and solved by Bryant 2 (see also 3 ) . This paper is devoted to the problem of when a given distribution is locally equivalent to the canonical contact system in the case k = 1, n and m arbitrary, that is to the canonical contact system for curves. This problem has been studied by Gardner and Shadwick 12 , Murray 24 , and Tilbury and Sastry 31 (as a particular case of the the problem of equivalence to the
79
so-called extended Goursat normal form). Their solutions are based on a result of 12 that assures the equivalence provided that a certain differential form satisfies precise congruence relations. The problem of how to verify the existence of such a form had apparently remained open. This difficulty was solved by Aranda-Bricaire and P o m e t 1 , who proposed an algorithm which determines the existence of such a form. Their solution, although being elegant and checkable, uses the formalism of infinite dimensional manifolds and thus goes away from classical results characterizing contact systems. Recently, the authors 27 gave a geometric characterization of canonical contact systems in the case k = 1, and n and m arbitrary. That characterization of the canonical contact system on Jn(R,Rm), recalled as Theorem 2.1, turns out to be a natural combination of that given for J 1 (E f c ,R m ) by Bryant (see 2 and 3 ) and that given for J"(M,M) by Murray 2 4 . The aim of this paper is to analyze various possible characterizations of the canonical contact system on J"(M, K m ) . We start with stating, as Theorem 2.1, the result of 2 7 . That result involves two conditions: an essential geometric condition stating that each element 2)W of the derived flag (see the definition in the next section) contains an involutive subdistribution of corank one and the regularity condition T>^ = T>i (see the definition of the Lie flag in the next section). We will discuss other equivalent formulations of both conditions using, in particular, the notion of characteristic distributions (Theorem 5.4) and the language of Pfaffian systems (Theorem 3.7 and Corollary 5.5). All those descriptions lead to the following characterization, stated as 5.6, which is the main result of the paper. Main Theorem A rank m + 1 distribution V, with m > 2, on a manifold M of dimension (n + l)m + 1 is equivalent, in a small enough neighborhood of any point p in M, to the canonical contact system on J"(M, M.m) if and only if the following conditions hold. (i) £>(") = TM. (ii) £>(™-1) is of constant rank nm + 1 and contains an involutive subdistribution Cn-i that has constant corank one in (iii) 2>(°>(p) g £„_!(?). This result implies that all information containing both the geometry and the regularity of the problem is completely encoded in the involutive subdistribution £„_i of corank one in D(™-1). In particular, its existence and the regular intersection with 2?(°) (p) imply the existence of an involutive subdistribution of corank 1 in each X>W and, moreover, imply all regularity conditions X>W — T>i. It is important to observe that if conditions (i) and (ii) of the theorem are satisfied then the involutive subdistribution jCn-i is necessarily
80
unique (see Lemma 4.2), which allows to check easily condition (iii). In the case of Goursat structures, that is for m = 1, we give Theorem 5.7, an analogue of the above result, in which all information is, this time, encoded in the characteristic distribution Cn-\ of £>(" - 1 '. The paper is organized as follows. In Section 2 we define the canonical system for curves and we give Theorem 2.1, a first characterization of distributions that are locally equivalent to the canonical contact system for curves. In Section 3 we sketch briefly the history of results describing various canonical contact systems. In Section 4 we discuss the problem of whether a given distribution possesses a corank one involutive subdistribution and we recall Bryant's solution of that problem. Section 5 contains other characterizations of the canonical contact systems for curves. One of them, Theorem 5.4, is based on the notion of characteristic distributions while another, Theorem 5.6, uses only one involutive sudistribution in which all information about the geometry and about the regularity is encoded. The presented proof gives a method of constructing canonical coordinates for any distribution equivalent to the canonical contact system. Still in Section 5, we present an analogue of Theorem 5.6 for Goursat structures that uses, this time, only one characteristic distribution. Flatness of control systems equivalent to a canonical contact system for curves is discussed in Section 6. In particular, we show how the geometry of that class of systems helps to describe their flat outputs. We give a model of a mechanical nonholonomic system equivalent to a canonical contact system in Section 7. In Section 8 we introduce extended Kumpera-Ruiz normal forms and give a result, Theorem 8.1, stating that any contact system, regular or singular, is locally equivalent to an extended Kumpera-Ruiz normal form.
2
The Canonical Contact System for Curves
A rank k distribution V on a smooth manifold M is a map that assigns smoothly to each point pin M a, linear subspace V(p) C Tp M of dimension k. In other words, a rank A; distribution is a smooth rank k subbundle of the tangent bundle TM. Such a field of tangent fc-planes is spanned locally by k pointwise linearly independent smooth vector fields / i , . . . , fk on M, which will be denoted by X> = ( / i , . . . , / * ) . Two distributions V and V denned on two manifolds M and M, respectively, are equivalent if there exists a smooth diffeomorphism ip between M and M such that (
81
fields D<°) C 2? (1) C • • • denned inductively by D(0)=£>
P< i + 1 ) = X>(i) + [ D ( i ) , P w ] ,
and
for t > 0.
(2.1)
The Lie /Jag is the sequence of modules of vector fields Do C T>\ C • • • defined inductively by V0=V
and
P i + i =Vt + [T>o,T>i], for t > 0.
(2.2)
In general, the derived and Lie flags are different; though for any point p in the underlying manifold the inclusion T>i(p) C X>W(p) clearly holds, for i > 0. For a given distribution V, defined on a manifold M, we will say that a point p of M is a regular point of V if all the elements T>i of its Lie flag have constant rank in a small enough neighborhood of p. A distribution V is said to be completely nonholonomic if, for each point p i n M , there exists an integer N(p) such that VN^(p) = TPM. A distribution V is said to be involutive if its first derived system satisfies V^ = £>(°'. An alternative description of the above defined objects can also be given using the dual language of differential forms. A Pfaffian system 1 of rank s on a smooth manifold M is a map that assigns smoothly to each point p in M a linear subspace l(p) c T*M of dimension s. In other words, a Pfaffian system of rank s is a smooth subbundle of rank s of the cotangent bundle T*M. Such a field of cotangent s-planes is spanned locally by s pointwise linearly independent smooth differential 1-forms w i , . . . ,ws on M, which will be denoted by I = (UJI,... ,w s ). Sometimes we will denote the Pfaffian system I = ( w i , . . . ,CJ S ) by ui = 0 , . . . , ws = 0. In the case of a rank one Paffian system I = (w) we will speak also about a Pfaffian equation u = 0. Two Pfaffian systems I and 1 defined on two manifolds M and M, respectively, are equivalent if there exists a smooth diffeomorphism ip between M and M such that X(p) = (ip*l)(p), for each point pin M. For a Pfaffian system I , we can define its derived flag 1^ D 2' 1 ) D • • • by the relations I<°) = I and l( i + 1 > = {a e I ( i ) : da = 0 mod ! « } , for t > 0, provided that each element X^ of this sequence has constant rank. In this case, it is immediate to see that the derived flag of the distribution V = 1± coincides with the sequence of distributions that anihilate the elements of the derived flag of I , that is v(i)
=
(i«)-L ;
for i > o.
For a given Pfaffian system X, we will say that a point p of M is a regular point if p is a regular point for the distribution V =2±, that is if all elements T>i of the Lie flag are of constant rank in a small enough neighborhood of p.
82
Consider the space J n ( E , R m ) of jets of order n > 1 of functions from R into R m . This space is diffeomorphic to R ( " + 1 ) m + 1 . The canonical coordinates associated to R (denoted by x°) and to R m (denoted by x^,... , x ^ ) can be used to define the canonical coordinates on J n (R, R m ) , which will be denoted by 0 T° r° r1 r1 T" J-OJ-^-I)--- j i f n j J - j , . . . , x m , . . . , x j , . . . x
T"
,x
m
,
with obvious indentifications org = 9 and x° = p° = u*, for 1 < i < m, and xj = p3i, for 1 < i < m and 1 < j < n (see the beginning of the Introduction). Observe that any smooth map tp from R into R m defines a curve in J"(R, R m ) by the relations x{ = tpy\x°), for 0 < i < m and 0 < j < n, where ip^ denotes the j - t h derivative with respect to zfj °f the z'-th component of (p. This curve is called the n-graph of (p. It is clear that not all curves in J"(R, R m ) are n-graphs of maps. In order to distinguish the "good" curves from the "bad" ones, we should introduce a set of constraints on the velocities of curves in J n (R, R m ) . In other words, we should endow J"(R, R m ) with a nonholonomic structure. The canonical contact system on J n ( R , R m ) is the completely nonholonomic distribution spanned by the following family of vector fields: /
n—1 m
( d^f > • • • > 5 i ^ " » 9 4 + E T,xi
\
^7) •
(2.3)
In a dual way, we will also call the canonical contact system on J " ( R , R m ) the Pfaffian system (dx{ - x{+1dx%, 0 < j < n - 1, 1 < i < m) annihilating the above distribution. For the canonical contact system on J"(R, R m ) we will also use the name canonical contact system for curves. By definition, if a curve in J"(R, R m ) is the n-graph of some map then it is an integral curve of the canonical contact system. More precisely, a section a : R -> J n (R, R m ) is the n-graph of a curve ip : R -> RTO if and only if it is an integral curve of the canonical contact system on J"(R, R m ) (see e.g. 3 and 2 5 ) . The aim of our paper is to analyze various possible answers to the question "Which distributions are locally equivalent to the canonical contact system on J"(R, R m ) ? " . The following result, obtained by the authors 2 7 , will be the starting point of our study. Theorem 2.1 (regular contact systems, first version). A rank m + 1 distribution V on a manifold M of dimension (n + l)m + 1 is equivalent,
83
in a small enough neighborhood of any point p in M, to the canonical contact system on Jn(IR, E m ) if and only if the two following conditions hold, for 0^1' that has constant corank one in (ii) Each element T>i of the Lie flag has constant rank (i + \)m + 1. Remarks 1. This result yields a constructive test for the local equivalence to the canonical contact system for curves, provided that we know how to check whether or not a given distribution admits a corank one involutive subdistribution. We give in Section 4 a checkable necessary and sufficient condition for the existence of such a distribution. 2. Item (i) describes the essential geometric property of distributions equivalent to canonical contact systems for curves while the condition (ii) distiguishes regular points p at which X>'*'(p) = T^i(p) from singular points, where this last condition is violated. Different roles of (i) and (ii) justify Definition 2.2 below. 3. Theorem 2.1 was proved in 27 , as a special case of a more general result that gives a normal form for any system satisfying (i) (see Remark 2 above). In Section 5 we will present a direct proof of Theorem 5.6, which is another version, under the weakest possible assumptions, of Theorem 2.1. Definition 2.2 (contact systems). A rank m + 1 distribution T> on a manifold M of dimension (n + \)m + 1 is said to be a contact system if, for any 0 < i < n, each element T>^ of the derived flag, has constant rank (i + l)m + 1 and contains an involutive subdistribution Ci C X>W that has constant corank one in A contact system is said to be a regular contact system at p G M if it is, locally at p, equivalent to the canonical contact system on Jn(R, Rm). Otherwise it will be called a singular contact system at p e M. Theorem 2.1 implies that a contact system is a regular contact system at a point p if and only if the point p is regular. Other, than that of Remark 2 above, equivalent characterizations of regular points of contact systems will be given in Theorems 5.4 and 5.6 , and in Corollary 5.5. 3
History
In this section we give a brief histry of the following problem: Which Pfaffian systems are locally equivalent to the canonical contact system on the space Jn(Rk,Wn) of n-jets of maps from Rk to Mm ? This problem is a natural
84
generalization of the follwing classical problem stated in 1814 by Pfaff: Which Pfaffian systems are locally equivalent to the canonical contact system on the space J1 (Rk,Rm) of 1-jets of maps from Rk to R? Studies of that problem have mainly concentrated on the four following cases of equivalence to the canonical contact system on the space • J^Rfc.R): Pfaff (1814), Frobenius (1877), and Darboux (1882); • J"(M,M): Engel (1890), E. von Weber (1898), Cartan (1914), Goursat (1923), Giaro, Kumpera and Ruiz (1978), Libermann (1978), Murray (1994), Mormul (1998-2001), Pasillas-Lepine and Respondek (1998-2001), Montgomery and Zhitomirskii (1999), • J ^ I f ^ i r ) : Gardner (1972), Bryant (1979); • Jn(R,Rm): Garnder and Shadwick (1992), Murray (1994), Tilbury and Sastry (1995), Aranda-Bricaire and Pomet (1995), Pasillas-Lepine and Respondek (1999-2001). In the following subsections we will describe solutions in each of the four above cases. Throughout this section we will use the notation p° = Uj, for 1 < i < m (see Introduction). 3.1
Pfaff-Darboux normal form
In this subsection we recall the classical solution given by Frobenius and Darboux to the original Pfaff question: which Pfaffian systems are locally equivalent to the canonical contact system on the space J 1 (Rk, E) ? We will need the following notion introduced by Frobenius n and Darboux 7 Definition 3.1. The class, at a point p, of the Pfaffian equation u = 0 is the largest integer p such that
Using the notion of class, Darboux 7 gave the following celebrated answer to the problem. Theorem 3.2 (Darboux, 1882). A Pfaffian equation w = 0, defined on a manifold of dimension 2k + 1, is locally equivalent to the equation given by the following canonical contact form dp0 - pldqi
phdqk = 0
on J1(Rk, R), if and only if its class is constant and equals k. The above canonical contact form is called Pfaff-Darboux normal form.
85
3.2
Bryant normal form
In this subsection we will recall a solution to the generalization of Pfaff's problem to the space of 1-jets of arbitrary maps. The following question has been studied by Gardner since 1972, and by Bryant (who gave a solution in 1979): which Pfaffian systems are locally equivalent to the canonical contact system on the space Jl(Wk, W1) ? In order to give an answer we will need the following generalization of the notion of class of a Pfaffian equation. Definition 3.3. The Engel rank, at a point p, of a Pfaffian system I — [uii,... ,uis) is the largest integer p such that there exists a 1-form a inX for which {{da)p A w i A - - - A w s ) ( p ) / 0. Obviously, in the case of a single Pfaffian equation u = 0, the notion of Engel rank reduces to that of class. A characteristic vector field of a distribution P is a vector field / that belongs to V and satisfies [/, V] C T>. The characteristic distribution of V, which will be denoted by C, is the module spanned by all its characteristic vector fields. It follows directly from the Jacobi identity that the characteristic distribution is always involutive but, in general, it needs not be of constant rank. If for a constant rank Pfaffian system I , the characteristic distribution C of the distribution V = l1- is of constant rank, then its anihilator (C)1is a Pfaffian system, called the Cartan system of the Pfaffian system I and denoted by C(X). We refer the reader to 3 for a definition of the Cartan system given in the language of Pfaffian systems. The following result was proved by Bryant in his Ph.D. Thesis 2 (see also 3 ) . Theorem 3.4 (Bryant, 1979). Assume that m ^ 2. A Pfaffian system 2=
(oJi,...,ojm),
defined on a manifold of dimension m + k + mk and such that I^1' = 0, is locally equivalent to the canonical contact system p\dqk = 0
Pm ~ Pmdqi
Pkmdqk = 0
d
dp\ - p\dqi
on J1 (Rk, lRm), if and only if its Engel rank is constant and equals k, and the rank of its Cartan system is constant and equals m + k + mk. The above normal form of a Pfaffian system is called Bryant normal form. Reasons for which the case m = 2 is excluded will be explained in Section 4, see Lemma 4.2 and Corollary 4.3.
86
3.3
Goursat normal form
Starting with the work of Engel 8 and von Weber 33 , the following question has been intensively studied: which Pfaffian systems are locally equivalent to the canonical contact system on the space J"(R, R) ? The case n — 1 was solved by Pfaff in his pioneering paper 29 ; it is also the simplest case of Darboux theorem given above. The case n = 2 was solved by Engel 8 . In order to give a solution for an arbitrary n, recall, see Introduction, the following fundamental notion. The derived system I' of a Pfaffian system 1 = (UJI, ...,u!s) is the Pfaffian system generated by all 1-forms a in I such that da AOJI A • • • A ws = 0.
Starting from a Pfaffian system 1 and its (first) derived system I ' we define the flag of derived systems 1^ D 1^ D • • • D 1^ D • • • inductively by I<°) = J a n d I < i + 1 ) = ( ! « ) ' . The following characterization of Pfaffian systems, that are locally equivalent to the canonical contact system on the space J"(R, R), was found by von Weber and has been intensively studied by Cartan. Theorem 3.5 (van Weber, 1898 ; Cartan, 1914). A Pfaffian system X = (uii,...,LJn) of rank n, defined on a manifold of dimension n + 2, is is locally equivalent (on an open and dense subset) to the canonical contact system dp0 - pldq = 0 , • • • , dpn~l - pndq = 0 on Jn(M., E) if and only if the elements 1^ of the flag of derived systems are of constant rank n — i, for 0 < i < n. The canonical form present in the above theorem is called Goursat normal form, mainly because it was intensively used by Goursat in his book 14 . We will call a Goursat structure any distribution V (resp. any Pfaffian system I) on a manifold of dimension n + 2 such that the elements of the derived flag X>(") are of constant rank i + 2 (resp. the elements j M 0 f the flag of derived systems are of constant rank n — i, for 0 < i < n). Therefore the above theorem says that a Goursat structure is locally equivalent to a Goursat normal form on an open and dense subset of the underlying manifold. Notice, see Proposition 5.1, that a Goursat structure is just a contact system of rank 2 (a contact system of corank 2, if we use the language of Pfaffian systems). Theorem 3.5 is thus a special case of a more general Theorem 2.1 stating that a contact system is equivalent to the canonical contact system on an open and dense subset. Applying the latter theorem to the case of Goursat
87
structures, the open and dense subset of Theorem 3.5 can be described as points p at which X>W(p) = T>i(p) (as shown earlier by Murray 24 directly for Goursat structures). Other equivalent descriptions of the open and dense set of regular points of Goursat structures will follow from characterizations of regular points of contact systems given in Theorems 5.4 and 5.6 and in Corollary 5.5, stated in Section 5. Here we would like only to emphasize that in the cases n = 1 and n = 2 we do not have to restrict ourselves to an open and dense set. In fact, the conditions dim X^\q) — 0, in the case n = 1, and dim l{1)(q) = 1, dim l(2)(q) = 0, in the case n = 2, satisfied for any q in a neighborhood of a given point p, guarantee that the Pfaffian system I on a manifold, respectively of dimension 3 and 4, is locally equivalent around p to Goursat normal form. Giaro, Kumpera, and Ruiz 13 were the first ones who observed that starting from n = 3, singularities may appear for Goursat structures. Example 3.6 (Giaro et al, 1978). The following Pfaffian system of five variables dx2 — xzdxi = 0 dx$ — Xidxi = 0 dx\ — x^dxi = 0 is a Goursat structure, that is, it satisfies the conditions of the above theorem but is not locally equivalent to the canonical contact system on J 3 (E, E) if x5=0. It was this example with which an increasing interest in the problem of singularities of Goursat structures started (for more results see e.g. 17 , 6 , 22 , 23
28 \
21
!
3.4
>
I'
Canonical contact systems for curves
In this section we will discuss the following question: which Pfaffian systems are locally equivalent to the canonical contact system on the space J"(M, E m ) , that is, to the canonical contact system for curves? We will put the answer, given to that question as Theorem 2.1 in Section 2, in a form analogous to that of Theorems 3.4 and refthm-goursat. Theorem 3.7 (regular contact systems, second version). A Pfaffian system I of constant rank nm, where m ^ 2, defined on a manifold of dimension (n + \)m + 1, is locally equivalent (on an open and dense subset)
88
to the canonical contact system dp0! - p\dq = 0 , • • • , dp7?'1 - p^dq = 0 dp°m -pldq
= 0 , • • • , dp^-1 -p^dq
=0
on J"(E, R m ) if and only if each element j W of the flag of derived systems satisfies the following conditions, for 0 < i < n. (i) 1^ is of constant rank (n — i)m; (ii) The Engel rank ofl^ equals one; (iii) The rank of the Cartan system of I^1' is constant and equals (n — i + l)m + l. The above theorem is given, as Corollary 2.4, in 2 7 . Its proof follows directly from Theorem 2.1 and a characterization, via the notion of Engel rank, of the existence of an involutive subdistribution of corank one in a given distribution. Such a characterization was obtained by Bryant 2 and will be discussed in Section 4. Notice that the formulation of Theorem 3.7 can be considered as a natural combination of, on the one hand, Weber-Cartan's Thereom 3.5 and, on the other hand, of Bryant's Theorem 3.4. Indeed, like Thereom 3.5, it involves the derived flag and requires to exclude some singular points. On other hand, like Theorem 3.4, it expresses the basic geometry of the problem through the notion of Engel rank. As we said, we will discuss links between the geometry and Engel rank in Section 4. What concerns regular points, one characterization as points p such that V^ (p) = T>i (p) was already given in Theorem 2.1. We would like, however, to express that regularity condition in terms of the Pfaffian system I ; but is not clear how to represent the elements T>i of the derived flag in terms of X. For that reason, we will present in Section 5 other equivalent descriptions of regular points, in particular, in terms of Pfaffian systems. 4
Involutive s u b d i s t r i b u t i o n s of corank one
The aim of this Section is to provide checkable conditions to verify (i) of Theorem 2.1 and to justify the statement of Theorem 3.7. This will be done by answering the following question: "When does a given constant rank distribution V contain an involutive subdistribution C C T> that has constant corank one in T>T\ In fact, the answer to this question is an immediate consequence of a result contained in Bryant's Ph.D. thesis 2 . Links between Bryant's result and the characterization of the canonical contact system for
89 curves have also been observed by Aranda-Bricaire and Pomet 1. Recall that in Subsection 3.2 we defined the Engel rank of a Pfaffian system X, at a point p, as the largest integer p such that there exists a 1-form u in X for which we have (dw)p(p) ^ OmodX. The Engel rank of a constant rank distribution V will be, by definition, the Engel rank of its anihilator V-1. Obviously, the Engel rank p of a distribution equals zero at each point if and only if the distribution is involutive. We will now give an equivalent definition of the Engel rank, see 2 7 , in the language of vector fields, in the particular case when p — 1, which will be important in the paper. Let V be a distribution such that Z>(°) and T>^ have constant ranks d0 and di, respectively, and denote ro(p) — d\ (p) — do(p). Assume that d0 > 2 and r 0 > 1. Take a family of vector fields ( / l , - - • , / d 0 ) 5 l ) - - - >#r 0 )
such that 2>(°) = ( / ! , . . . , / d o ) and V& = ( A , . . . ,fdo,gu . . . ,gro). The structure functions ck- associated to those generators, for 1 < i < j < d0 and 1 < k < ro, are the smooth functions defined by the following relations: [/* . / > ] = £ 4 9k mod P<°>,
for 1 < i < j < d0.
It is important to point out that the structure functions are not invariantly related to the distribution V, since they depend on the choice of generators. Assume that the Engel rank p of D is constant and that ro > 1. It is easy to check that p = 1 if and only if either do = 2, or do = 3, or do > 4 and the structure functions satisfy the relations
4-c« - <44 + &U + %A - >,& + « , • = o-
(4-i)
for each sextuple (i,j,k,l,p,q) of integers such that l 1. Then the two following conditions are equivalent: (i) The characteristic distribution CofT> has constant rank CQ = do—ro — 1 and the Engel rank p of V is constant and equals 1; (ii) The distribution T> contains a subdistribution B CD that has constant corank one in T> and satisfies [B,B] C T>.
90
Observe that if the first condition is satisfied then we must necessarily have r 0 < do — 1. The following result is included in the proof of Bryant's 2 normal form Theorem (see also 3 ) . An alternative proof of its surprising item (iii) is given in Appendix A of 27 , where the role of the assumption ro > 3 is explained by relating it to the Jacobi identity. Lemma 4.2 (Bryant). LetV be a distribution such thatV^ andV^ have constant ranks do and d\, respectively. Assume that the distribution T> contains a subdistribution B C T> that has constant corank one in T> and satisfies
[B,B]cV. (i) If ro = 1 then the distribution T> contains an involutive subdistribution C C T> that has constant corank one in T>; (ii) Ifr0>2 then B is unique; (iii) If ro > 3 then B is involutive. Observe that, in the first item of the above Lemma, the involutive subdistribution C can be different from B, which is not necessarily involutive. The following result is a direct consequence of Bryant's work. In order to avoid the trivial case ro = 0, for which the existence of a corank one involutive subdistribution is obvious, we will assume that r 0 > 1. Corollary 4.3 (corank one involutive subdistributions). Let V be a distribution such that X>'0' and T>^ have constant ranks do and d\, respectively. Assume that ro > 1. Then, the distribution V contains an involutive subdistribution C C T> that has constant corank one in D if and only if the three following conditions hold: (i) The characteristic distribution C ofT> has constant rank Co — do—ro — 1; (ii) The Engel rank pofDis constant and equals 1; (iii) If ro = 2 then, additionally, the unique corank one subdistribution B C T> such that [B, B] C D must be involutive. We would like to to emphasize that the above conditions are easy to be checked, and, as a consequence, so are the conditions of Theorems 2.1 and 3.7. Indeed, for any distribution, or the corresponding Pfafnan system, we can compute the characteristic distribution C and check whether or not the Engel rank equals 1 using, respectively, the formula (4.2) and the condition (4.3) below. This gives the solution if ro ^ 2 (equivalently, if m / 2). If r 0 = 2 we have additionally to check the involutivness of the unique distribution B satisfying [B, B] C V, whose explicit construction is also given by (4.4) below. In the sequel, we will show, following Bryant 2 , how to verify whether or not the Engel rank equals 1, and how to construct explicitly — if it exists — the unique corank one subdistribution B C V satisfying [B, B] C V (which, in the case r 0 > 2, is the unique involutive subdistribution of corank one in V).
91 Consider a distribution T> of constant rank d0, defined on a manifold of dimension N. Let w i , . . . ,w So , where so = N — do, be differential 1-forms locally spanning VL, the annihilator of V, that is V± =
(ui,...,uao).
We will denote by I the Pffafian system generated by u\,..., For any form ui 6 V-1, we put W(w) = {/GZ> :
wSo.
/jdweP1}.
Clearly, the characteristic distribution C of V is given by C=f)W(oJi).
(4.2)
Now assume that V^ is of constant rank di > do, that is ro > 1, or, equivalently, that the first derived system 2^ is of constant rank smaller than soBy a direct calculation we can check (see e.g. 3 ) that the Engel rank of the distribution V, or of the corresponding Pfaffian system I, equals 1 at p if and only if (dui A db}j)(p) = 0
modi,
(4.3)
for any 1 < i < j < soNow let us choose a family of differential 1-forms LJi,...,u)ro,Wr0+i,...,uSo such that I = (V^)1= (OJX,. .. ,wSo) and i W = (D^)1= (uv 0 +i,... ,w So ). When condition (i) of Lemma 4.1 is satisfied, independently of the value of ro > 2, the unique distribution B satisfying [B, B] C V is given, as shown by Bryant 2 , by B=Y:\V(LJi).
(4.4)
i=l
In fact, Bryant has also proved that it is enough to take in the above sum only two terms corresponding to any 1 < i < j < ro. In order to verify, in the case ro = 2, the conditions of Corollary 4.3 we have additionally to check the involutivity of this explicitly calculable distribution. Notice that, if the distribution V satisfies Lemma 4.1 (i) and r 0 > 3, then the formula (4.4) gives the unique involutive subdistribution C of corank one in V.
92 5
5.1
Contact systems, characteristic distributions, and involutive subdistributions Contact systems and characteristic
distributions
Let V be a contact system, that is, according to Definition 2.2, each distribution Z?W contains a unique involutive subdistribution d of corank one. It turns out that those involutive subdistributions are unique and, moreover, can be expressed in terms of characteristic distributions. Indeed, we have the following result. Proposition 5.1. Let T> be a rank m + 1 contact system, defined on a manifold of dimension (n + l)m + 1. Each distribution P ^ ' — either for 0 < i < n — 2, if m = 1, or for 0 2 — contains a unique involutive subdistribution Ci C P ' 4 ' that has constant corank one in P « and that, for 0 < i < n — 2, coincides with the characteristic distribution Cj+i o/p( J + 1 >. The following picture summarizes the above proposition. £>(°) c P ^ C • • • C X>("-2) C Pf"- 1 ) C P ( n ) = TM U U U U Co C C\ C • • • C Cn-2 C Cn-\ n n ii C\ C C2 C • • • C C n - i
n £>(i)
n c
(5.1)
n
£>(2) C • • • C D^""1)
The inclusions between corresponding successive elements of the first and second row are those of the definition of contact systems. The inclusions between the elements of the third and fourth row express simply the fact that Ci, being a a characteristic distribution of V^\ is contained in 2?W. The equalities between the corresponding elements of the second and third row express a basic property of the sequence of involutive subdistributions Ci. In the case m = 1, that is for Goursat structures, Proposition 5.1 seems to be already known to Cartan 5 . For a proof we refer the reader to Montgomery and Zhitomirskii 21 (see also 4 , 17 , and 2 0 ) . Incidence relations of distributions V^'s and Cj's have been intensively used by the authors 28 and by Montgomery and Zhitomirskii 21 to study the geometry of Goursat structures at singular points. In the case m > 2, to the best of our knowledge, Proposition 5.1, has never been stated. Notice a substantial difference between the cases m = 1 and m > 2, which is the presence of the canonical involutive subdistribution Cn-i in the second row for m > 2. In the case of Goursat structures, that
93 is if m = 1, the distribution p ( " - 1 ) contains many different involutive sudistributions of corank one but non of them has any canonical properties and therefore in the case m = 1, the second row of (5.1) terminates with £„_2This basic difference between the cases m = 1 and m > 2 will be reflected in different statements of Theorems 5.6 and 5.7, that describe, respectively, regular contact systems for m > 2 in terms of £„_i and, for m = 1, in terms of C n _i = £„-2Definition 5.2. We will call the corank one involutive subdistribution d given in the previous result — either for 0 < i < n — 2, if m = 1, or for 0 2 — the canonical subdistribution o / p W . When proving Proposition 5.1 we will use the following result proved in 2 7 . Lemma 5.3 {Co C CI). Let T> be a distribution such that p ( ° ' , T>^\ and T>W have constant ranks do, di > do + 2, and c?2 > d\ + 2, respectively. Assume that each distribution p W , for i = 0 and 1, contains an involutive subdistribution Ci C P ^ that has constant corank one in p W . Then Co CCiProof: (of Proposition 5.1). In the case m = 1, that is for Goursat structures, the result was proven in 21 (see also 4 , 17 , and 2 0 ) . Consider the case m > 2. By Lemma 4.2, we conclude that all involutive distributions d, for 0 < i < n—1, are unique. In order to show that d = Cj+i, for 0 < i < n — 2, apply Lemma 5.3, with T>(°\ T>^1\ and V^2\ being replaced, respectively, by V^\ T>(i+1\ and X>('+2). We conclude that d C Ci+1. On the one hand, the rank of d equals m{i +1) while, on the other hand, the rank of Ci+i is also equal to m(i +1), because V^+1^ contains a subdistribution C,+i that satisfies [Ci+i,d+i] C X>('+1) (see Lemma 4.1). We thus have d = Ci+iD In the case of Goursat structures, that is m = 1, using the canonical subdistributions we can distinguish singular points from regular ones, see 21 and 2 8 . Below we show that the geometry of incidence of canonical subdistributions describes completely also regular contact systems in the general case m > 2. Indeed, we have the following result. Theorem 5.4 (regular contact systems, third version). A rank m + 1 distribution T>, with m > 2, on a manifold M of dimension (n + l)m + 1 is equivalent, in a small enough neighborhood of a point p in M, to the canonical contact system on J"(R, E m ) if and only if the following conditions hold. (i) Each element P ^ of the derived flag, for 0 < i < n, has constant rank (i + \)m + 1. (ii) Each element T>^> of the derived flag, for 0 < i < n — 2, contains the characteristic distribution C,+i of V^l+1' that has constant corank one in P « .
94 (iii) X>(™-1) contains an involutive subdistribution £n-i that has constant corank one in Z)( ra_1 ). (iv) For any 0 < i < n — 3, we have 1)W (p) is not contained in Ci+2 (p) and, moreover, X>(n_2)(p) is not contained in £„_i(p). Remarks 1. Let us look at items (i)-(iv) in the case of of Goursat structures, that is m = 1. Firstly, item (i) implies, according to Proposition 5.1, item (ii). Secondly, in the case m = 1, as shown in 21 and 2 8 , a Goursat structure (that is a distribution of rank two satisfying item (i)) is locally at p equivalent to Goursat normal form under the regularity conditions V^l\p) ^ d+2(p), for 0 < i < n — 3. Thirdly, being equivalent to Goursat normal form, it clearly satisfies (iii) because it admits infinitely many involutive subdistributions of corank one in Z>(™_1), most of which fulfilling the second part of item (iv). 2. Notice that any distribution T> satisfying (i), (ii), and (iii) is a contact system. Indeed, since the characteristic distributions C; are involutive, we put d = Ci+i and thus each distribution 2?W, for 0 < i < n — 2, contains an involutive distribution £j C V^ of corank one because of (i) and (ii). The one before the last distribution 2?(™_1) contains, by (iii), an involutive subdistribution £ n - i , which is unique and it is the canonical involutive subdistribution in P ^ - 1 ) . 3. As we argued in Remark 2 above, (i)-(iii) imply that T> is a contact system. Item (iv) is thus another way of expressing the regularity condition 2?W(p) = T>i(p). Contrary to this last condition, however, item (iv), as well as items (i)-(iii), can be immediately expressed in terms of Pfaffian systems, which we will state below as Corollary 5.5. 4. We will omit a proof of Theorem 5.4 because it follows from a more general result, Theorem 5.6, which will be stated in Subsection 5.2 and proven in Subsection 5.3. Corollary 5.5 (regular contact systems, fourth version). Let I be a Pfaffian system of rank nm, defined on a manifold M of dimension (n + l)m + 1. If m ^ 2, the Pfaffian system I is locally equivalent, at a given point p of M, to the canonical contact system on J n (M, E m ) if and only if (i) The rank of each derived system ZW is constant and equals (n — i)m, for 0 < i < n. (ii) The rank of each Cartan system C(I^) is constant and equals (n + 1 — i)m + 1, for 0 < i < n — 1. (iii) The Engel rank ofl^n~^ is constant and equals 1. (iv) For any 0 < i < n - 3, C(l^+2^){p) is not contained in l^{p) and, moreover, Jn-\{p) is not contained in T^n~2^[p), where Jn-\ is the unique integrable Pfaffian system such that Z(™_1) is of constant corank
95
one in Jn-\, and whose existence is given by (ii) and (iii). Notice two differences between Corollary 5.5 and Theorem 3.7. Firstly, as we have mentioned in Remark 3 just above, item (iv) of Corollary 5.5 gives a description of the open and dense set of regular points of Theorem 3.7. Secondly, when using Corollary 5.5 we have to calculate the Engel rank of l ' " - 1 ) only (equivalently, to verify the existence of an involutive subdistribution of corank one in the distribution X>(™_1) only). 5.2
Contact systems and involutive subdistribution
£n-i
The aim of this subsection is to show that all information about a distribution V, which is locally equivalent to the canonical contact system on J n (K, R m ) , that is, about a regular contact system, is encoded completely in the canonical involutive subdistribution £ n _ i . This information concerns, simultaneously, the existence of all canonical distributions d = C;+i, for 0 < i < n — 2, the regularity of the point p, as well as the desired ranks of all intermediate distributions 2?W. In fact, we have the following result. Theorem 5.6 (regular contact systems, fifth version). A distribution V of rank m + 1, with m >2, on a manifold M of dimension (n + l)m + 1 is equivalent, in a small enough neighborhood of a point p in M, to the canonical contact system on Jn(R, K m ) if and only if the following conditions hold. (i) £>(") = TM. (ii) XH" -1 ' is of constant rank nm + 1 and contains an involutive subdistribution £n-i that has constant corank one in (iii) T>(°)(p) is not contained in Cn-i{p). If the conditions (i)-(iii) are satisfied then, of course, the distribution £ „ _ i , whose existence is claimed by (ii), is unique and it is with the canonical subdistibution of p f " " 1 ) . It is interesting to see how much information is encoded just in the existence of the canonical involutive subdistribution Cn-i and its regular intersection with Z>(°). In this case, the distribution £n-i "knows" about the existence of all canonical distributions £, = Cj+i, as well as about the regularity of all intersections of Z>(*) with C i + 2 . Notice, however, that although Cn-\ "knows" how to distinguish regular points from singular ones, it does not "know" about the geometry of the incidence of various DW's and C,-'s at singular points corresponding to different extended Kumpera-Ruiz normal forms (see Section 8). Theorem 5.6 was stated for m > 2, thus excluding m = 1, for the obvious reason: in the case of Goursat structures, that is for m = 1, we do not have the
96 canonical involutive subdistribution £ „ - i . The second row of (5.1) terminates at £„_2 = Cn-\. It can be noticed, however, that a "non-canonical" version of Theorem 5.6 holds also for m = 1: a rank-two distribution is equivalent to a Goursat normal form if and only if there exists a distribution £„_i satisfying the conditions (i),(ii), and (iii) of Theorem 5.6. Necessity is seen immediately from Goursat normal form while sufficiency follows from the sufficiency part of the presented proof of Theorem 5.6, which is the same independently of the value of m. What is important to emphasize, however, is that in the case m = 1, the condition (iii) is, in general, non veryfiable. In fact, if ranks of D( n _ 1 ) and T>^ equal, respectively, n + 1 and n + 2 then X>("-1) contains an infinity of involutive subdistributions of corank one. In order to check condition (iii) we have to calculate (some of) them and it is not clear how to do it because the method presented in Section 4 and based on Engel rank does not apply. Moreover, even if we would be able to find one such an involutive subdistribution, it may happen that it does not satisfy (iii) while others do. We will show now that an analogue of Theorem 5.6 holds for Goursat structures, that is for m = 1, with Hn-\ replaced by Cn-\. In fact we have the following result. Theorem 5.7 (Goursat normal form). A distribution T> of constant rank two, on a manifold M of dimension n + 2, is equivalent, in a small enough neighborhood of a point p in M, to Goursat normal form, that is to the canonical contact system on J n (R, E), if and only if the following conditions hold. (i) £>(") = TM. (ii) J)(n~2> is of constant rank n; moreover, the characteristic distribution Cn-i of X>(ra_1) is contained in £>(" -2 ) and has constant corank one in T>(n~2K (iii) p(°)(p) is not contained in Cn-i(p). Notice that we assume neither that T>^, for 0 < i < n — 3, are of constant rank nor that X>(n_1) is of constant rank nor that the ranks of 2?W grow by one at any point: the characteristic distribution C n _i "knows" about all that. 5.3
Proof of Theorem 5.6
Proof: Necessity is obvious. We prove sufficiency by giving a sequence of local diffeomorphisms that bring the distribution V to the canonical contact system on J"(E, R m ) . For simplicity, we will use the notation
dx>'
K
dx{,'"'dxL
97
Since £ n - i is an involutive distribution of constant corank m + 1 in TM we can choose m + 1 smooth functions >o, <j>i, . . . ,0 m such that (d(j)0,d(l)i,...,d(f>m) = (^n-i)- 1 in a neighborhood of p. Item (iii) implies that there exists a vector field g~o € T>(°> such that go(p) £ £n-i(p). Moreover, there exists a function fa such that LgQ
r
- JL
( ~ - - dz1'"m'dx»)
J n 1 (
A-)
and d ''O
^
A l
d
,. d
d .
i=l
Since p(™ ^ is of constant rank and corank of £ n _ i in 2?(" ^ equals one, item (iii) implies that 2?( n_1 ) = £ „ _ i + (#0)- Hence Z>(") = pf"" 1 ) + [ZXn-^ZX"-!)] is given by X > ( n ) = £ „ _ i + [9d,£„_i] + (S0). This relation and item (i) imply that we can choose among - ^ j , for 1 < i < m, 1 < j < n, vector fields / i , . . . , fm, for such that P<"> = £ „ _ i + ([po, /i], 1 < » < m) + (go). In particular, this implies that the differentials dfa^,, d
98
and d
V^
i
d
.,
d
d ^
We claim that
Suppose that there exists a vector field / £ X>(n_2) such that / ^ J"„-2- Since go € D("-2> and P< n - 2 ) c X*^"1) = ( 5o ) + £ „ - i , we can suppose that
for suitable smooth functions /3j, 1 < i < m, where at least one function, say /3k, is not identically zero in a neighborhood of p. Since / € X>(n_2) and 2 but [/, 5o ] = E " i Agftr ffo G Z><°) C P ( " - ) , it follows that [f,g0] € V^V _1 mod£„_!, which clearly is not in X>(™ ) = £ n _ i + (g0) thus yielding a contradiction. Hence it follows that V(n~2) C ( § § * , . . . , ^ ) + (0)We will repeat analogous reasoning for the general step and the proof will follow by an induction argument. Suppose that we have introduced a local coordinate system (XQ,X0,... ,xn), where xj = {x{,... ,x3m), for 0 < j < n, and such that
A =4>\ iorO<j
=Lgo
— l and 1 < i < m. Suppose, moreover, that in those coordinates
as well as 2 ? ( " - * ) c ^ „ _ ? ) = ( ^ , . . . , ^ ) + («*,)
(5.3)
and ^-
+ 1 )
=(5o)
+
(^zr)mod(A,...,^_).
(5.4)
Notice that we do not assume the ranks of £>(""«) and of 1)("-
99
We will prove that equations (5.2), (5.3), and (5.4) hold with q being replaced by q + 1. To start with, observe that X>("-«+1) = Z>(n-«) + [£>("-»),£>("-«)] together with (5.3) imply that we can find m vector fields h, • • •, fm G X» (n_9) such that P<"-' + 1 > = &>) + ([/<,»>], l < i < m )
mod(^,...,^).
After relabelling xj, for q < j < n and 1 < i < m, if necessary, we can suppose, because of (5.3) that d d d t A< ^ /i = ^ ™ ° d ( ^ T I , . . . , — ) .
G T>(n~9^, the above expression implies that
Since fi,...,fm ^
^
M
+ ^ m o d ^ , . . . , ^ ) .
(5.5)
Moreover, it follows that the differentials d(j>Q, dcjyf, for 0 < j < q and 1 < i < m, are linearly indpendent at p. Introduce new coordinates by replacing x\ by
for 1 < i < m. In new coordinates, denoted still by x\, the distribution (— — ) v d a ; « ' ' " ' dxn' is preserved while the vector field g0 becomes
for some smooth functions 4>J+1, 1
<m.
T
, . . . , A ) + (»,).
(5.7)
Suppose that there exists a vector field / e X>("-«_1) such that / ^ ^ r ( n _,_ 1 ). Since y0 G £>("-) and P * " - ' " 1 ) c £>(""«), we can suppose, because of (5.5), that ^
Q
Q
Q
100
for suitable smooth functions /?,, 1 < i < m, where at least one function, say Pk, is not identically zero in a neighborhood of p. Since / e D t " - ? - 1 ) and g0 6 E H i f t ^ P (o) c 2?(n-9-i), it follows that [f,go] E 2>("-»> but [/, 5o ] = m o d ( g f 7 , . . . , gf^-), which clearly is not in p("-») because of (5.5). This yields a contradiction thus proving that (5.7) holds. Now observe that the three just proved conditions (5.6), (5.7), and (5.5) correspond, respectively, to (5.2), (5.3), and (5.4), with q being replaced by q + 1. Hence by an induction argument we arrive after n steps at Vi0)
C
*» =
(
}
^
+
(5o)
'
where r±
w=
n—1
m
p.
+
+
r\
a4 gS^ 'S|
mod( )
^-
Since r>(0) is of constant rank m + 1 it follows that X>(°) = T0 = ( ^ ) + (g0), which ends the proof. • 5.4
Constructing canonical coordinates
The presented proof gives a method to calculate a diffeomprphism bringing a regular contact system V of rank m + 1 to the canonical contact system on Jn(Rk, Rm), starting from m + 1 independent functions (j>%, 0 ° , . . . , >£, whose differentials annihilate £ n _ i , that is
(d0g,d0;,...,d^) = (£n_1)-L. Choose a vector field gQ G V^ such that go(p) $ Cn-\. Without loss of generality (see the beginning of the proof), we can suppose that Lgo(f>Q(p) / 0. Define x° = <$ and
J
= &
L
9Q^i
=
for 0 < j < n and 1 < i < m. It follows from the presented proof that in the coordinates (XQ,X°, ..., xn), the contact system T> takes the form
T>-(JL
—
A
rv
i+'i)
101
5.5
Proof of Theorem 5.7
Proof: Necessity is obvious. In order to prove sufficiency, choose a local coordinate system (xi,... ,xn+2) such that x(p) = 0 £ R™+2 and that
c
-(d
-?-)
ax\
oxn-i
We will identify the point p with 0 € R"+ 2 . Since Z>(°) (0) is not contained in C„_i(0), choose a vector field g0 6 V^ such that gQ(0) ^ C„_i(0). By multiplying go, if necessary, by a nonvanishing function, we can assume that 9o = 9o 5I7 + 9o+1 asf^T + s^f^ m o d c « - i • Because C„_i is of corank one in X>("-2) and g0(0) $ C n _i(0), it follows that D<"- 2 ) = C n _! + (50). Hence
*~" = <»•£•••• • 5 b k ' s r ' - - ^ s ^ » -
(5 8)
-
We claim that dim Z>(n_1)(g) = n + 1 for any q in some neighborhood of p = 0. In order to prove it, we will firstly show that dim V^n~1'(q) < n + 1 for any q in some neighborhood of p = 0. If not, there is a sequence of points qi, i = 1,2,... such that qi -> p and such that dim V^n~1\qi) = n + 2. Take a sequence of open neighorhoods Ui, such that qi € Ui and dim Vl^n~1\q) = n + 2 for any q £ \J Ui. Hence on any Ui, the characteristic distribution C„_i of X^"" 1 ) coincides with D*""1) = TM, which contradicts the fact that C„_i is of rank n — 1. Now w are going to prove that dim V^n~l\q) = n +1 in a neighborhod of p = 0. Suppose that dim V^n~1\p) = n. Denote Cj = g | - , for 1 < j < n — 1. There exist an integer 1 < k < n — 1 and a sequence of points qt, i = 1,2,... such that qi -» p and such that [ck,9o](qi) £ 1>(n~2\qi); otherwise, in view of (5.8), we would have, in a neighborhood of p, that £>(™_1) = £>("~2), which contradicts V^ = TM. Take a sequence of open neighborhoods Ui, such that qi 6 t/j and [ck,go](l) $ X^ n _ 2 '(g) for any q &\JUi. From the fact that dim V^n~1\q) < n + 1 it follows that there exist smooth functions a_,-, for 1 < j < n — 1, defined on |J Ui and such that [CJ > 9o] = aj [cfc, go] mod 2?("- 2) .
(5.9)
on (JUi. Now recall that the vector fields Cj, for 1 < j < n - 1, are characteristic vector fields of Z>("_1) and hence [a, [cj,go]] 6 Z>' n_1 ), for any 1 < i, j , < n — 1. Thus we have 2>(n) = 2?(»-D +
( [ 5 o , [c
^
1 < j < n - 1).
102
Notice that (5.9) implies that [30, [9,So]] = ctj[g0, [ck,go]]mod'D<-n~1^ on \jUi. Since dim V^n-^(p) = n, it follows that dim X>
oxn-i
We define an involutive distribution £„_i = (gf~, • • • , gf-) and the remaining part of the proof follows the same line as that of Theorem 5.6. • 6
Flatness of contact systems
In this Section we will discuss flatness of nonholonomic control systems, that are feedback equivalent to a canonical contact system for curves. Consider two control systems, with m + 1 controls, m
S:
x — 2_]fi(x)ui
= f(x)u
and
(6.1)
i=0 m
t : x = ^2fi(x)
Ui = f(i) u,
(6.2)
i=0
evolving respectively on two open sets X and X of RN, where u = (u0,...,umy, u - (u0,...,um), / = (fo, • • •, fmf, and / = (fo,...,fm). We say that the systems E and E are feedback equivalent if there exits a smooth diffeomorphism ip : X -> X and a feedback u = /3u, where the matrix P, whose entries /3j are smooth functions, is invertible at any x 6 X, such that
MfP) = f, where for any vector field g on X we denote by
103
Since the distribution V remains invariant under any invertible transformation u = (iu, all objects that we construct with the help of V can be considered as feedback invariant objects attached to E. The concept of flatness was introduced to control theory by Fliess, Levine, Martin, and Rouchon 9 , 10 in order to characterize control systems that are linearizable by endogenous feedback. This concept goes back to Hilbert 1 5 and Cartan 5 It is easy to observe that contact systems are flat. In this section we explain how the geometry of contact systems, described in previous Sections, provides additional information on various aspects of the flatness of contact systems. Consider a nonlinear control system E of the form (6.1). We say that E is x-flat at (x*,u*) E M.N x E m + 1 (see 10 , 16 , 30 ) if there exits a neighborhood V0 = X0 x Uo of (x*,u*) and smooth functions (pt on X0, for 0 < i < m, called x-flat outputs, having the following property: there exits an integer k and smooth functions •ji, 1 < i < N, and Si, 0 < i < m, such that we have Xi = 7. (>,¥>, •••,V (fe) ), Ui = Si(tp,tp,...,ip^),
l
where
104
canonical contact system on J"(K,M m ). Then we have. (i) £ is x-flat at any (x*,u*) such that (u*)°nv(x*) ^ 0. (ii) The dynamic precompensator zx
= z2 ;
Zn
=
(6.3) V0
linked with the system via the dynamic feedback law « — Hmv I
vfh
J J
where vm = ( « i , . . . ,vmY and 7 is a suitably chosen nonvanishing function, renders the overall system, controlled by {vo,vi,...,vm)t, static feedback linearizable at any point (x*,z*) such that z\ ^ 0. (iii) As x-flat outputs we can choose any m + 1 smooth functions (pa, ipi,...,
105
7
A n Example
In this Section we will consider a simple example given by a kinematical model of a car for which both the front and rear wheels are steerable, see 31 . We denote by (xo,yo) (resp. by (xi,j/i)) the coordinates of the midpoint between the two front (resp. rear) wheels and by a0 (resp. by c*i) the angle of the front (resp. rear) axle with respect to the horizontal. Finally, by tp we denote the angle of the bar connecting the two axles with respect to the horizontal. The two nonholonomic constraints reflecting the fact that the front and rear wheels roll without sleeping are given, respectively, by fi0 = 0 and Qi = 0 where differential 1-forms ilo = sin aodxo — cos aodyo J7i = smaidxi — cosaidyo. The holonomic constraints %i —
XQ
+ c o s ij}
2/i = 2/o +sinV> define a map tp from (S 1 ) 3 x E 2 , equipped with the coordinates (tp,a0,ai,xo,yo) into (5 1 ) 3 x R 4 , equipped with the coordinates (ip,ao,a{^xo,,yo,xi,yi). The pull-backs wo =
= 7r/2 + kn, k 6 Z} C X
and
S2 = {ct\ — ao = kit, k € Z} C X. The differential 1-forms UJQ and uj\ are linearly independent at p 6 X if and only if p ^ S\ n S2- Therefore outside Si fl S2, the nonholonomic constraints define a control system of the form e = /o(0«o + / i ( 0 « i + / 2 ( 0 « 2 , where f = (^,oio,oii,xo,yo) are coordinates on X = (S 1 ) 3 x R 2 , and the distribution V = ( / 0 , / i , / 2 ) , where / 0 = -£-, h = 3^7, and / 2 = cos(ai - ip) ( c o s ( a 0 ) ^ + s i n ( a 0 ) ^ J + sin(ai - a 0 ) ^ .
106 It is interesting to notice that at points of S\ D S2, integral curves of the distribution V = (wo,Wi)x do not represent all trajectories of the system subject to the nonholonomic constraints (uo,uii). Indeed, through any point p £ Si f)S2 there pass trajectories of the system whose velocities do not belong to V(p). We thus consider the model of a car outside Si fl S2. Clearly, the distribution V contains the involutive subdistribution *
I da0 ' Sen / "
of corank one in V. Since n = 1, the distribution C = (~^-, ~^-) is the canonical involutive subdistribution £ n _ i . By a direct calculation we check that V^(p) is of rank five if and only if p £ Si U 53, where S 3 = {a0 - ip = TT/2 + fcTr, k € Z} C X. Therefore at any configuration point outside Si n S3, the model is feedback equivalent to the canonical contact system for curves on J X (E, R 2 ). It is interesting to observe that at points of S2 that are outside Si U S3, the system, although not being equivalent to a cononical contact system, is flat. It can, moreover, be transformed via a local feedback to the following normal form ±1 = x 5 u 0 ±2 =
X1U1
X3 =
U0
X4 =
Ui
x5 = u2, which generates a nilpotent Lie algebra. As flat outputs we can choose X2, X3, and Xi. Notice that the form of the set of singular controls at which the system ceases to be flat changes: it is formed by the union of the planes u0 = 0 and «i = 0 in E 3 . Finally, we would like to mention the following difference between kinematical model of a car towing trailers and that of a multisteerable car. The former defines a Goursat structure (that is a contact system of rank two) at any point of its congfiguration space and, conversly, any Goursat structure can be locally equivalent to that kinematical model at a well choosen point of its configuration space (see 28 and 2 1 ) . This correspondence is no longer valid for multisteerable cars. They define a contact system of rank m +1 > 3 at almost all configurations but, in general, they fail to do so at some configurations (like at p € Si U S3 for the considered example).
107
8
Singular points and extended Kumpera-Ruiz normal forms
The aim of this Section is to study the class of distributions that satisfy condition (i) of Theorem 2.1 but fail to satisfy condition (ii) of that theorem. The fist condition describes the geometry of the canonical contact system while the second condition characterizes regular points. Recall that a distribution that satisfies the former but fails to satisfy the latter at a point p is called a singular contact system at p (see Definition 2.2). In this Section we will recall a result of 27 stating that any such distribution can be brought to a normal form for which we propose the name extended Kumpera-Ruiz normal form. Those forms generalize the canonical contact system on J 1 (R, Rm) in a way analogous to that how Kumpera-Ruiz normal forms (see e.g. 17 , 2 1 , 2 3 , and 28 ) generalize the canonical system on J1(M,1R), which is also called Goursat normal form. Consider the family of vector fields K1 = (K\, ... , K^, KJ) that span the canonical contact system on JX(IR, M.m), where K1
i
- -AK1 - -Aaxp • • • ' "-m — aT£
—
and the family of vector fields AC2 = (K 2 , . . . , K^, KQ) that span the canonical contact system on J 2 (M, K m ) , where 2
-
K
a -A K-2 — H£[' • • • > ^m — o ^
"'l
—
K2
_ ji
a
,
, 2
a_ ,
a
I
,
,
d_ ,
I
a
Loosely speaking, we can write 2 9 d *2 K *"1 — 5 x f ' • • • ' ^m — a ^ T K0 = XxKt + |- XmKm + KQ.
In order to make this precise we will adopt the following natural notation. Consider an arbitrary vector field / given on Jn~1(R, E m ) by n —1 m
/ = E E / / ( ^ - 1 ) ^ + /o°(^-1)4> j=0i=l
u
i
where J " " 1 denotes the coordinates XQ,X°,... , x°m, x\,... , xlm,... , a;" - 1 ,... n_1 m , x ^ of J ( E , E ) . We can lift the vector field / to a vector field on Jn(R, Rm), which we also denote by / , by taking n—1 m f=
v w ^ - ^ j=0i=l
+^ n - ^ a
+Q
.
a +-.. + 0 - 4 - .
108
That is, we lift / by translating it along the directions -^,...
, -J^r-
Notation(lifts of vector fields) . From now on, in any expression of the form n% = S f c o a « ( x ) K r _ 1 ' t h e vector fields K ^ - 1 , . . . , / C ^ - 1 should be considered as their above defined lifts. Let K n _ 1 = ( K ? - 1 , . . . , K ^ - 1 , K ™ _ 1 ) denote a family of vector fields defined on Jn~1(M.,Mm). A regular prolongation, with a parameter c", of K n _ 1 , denoted by Kn = Rc„ ( K " _ 1 ) , is a family of vector fields nn = « , . . . , K £ , K£) defined on J " ( E , E m ) by „ra _
d
n
_
d
(8.1) where cn = ( c " , . . . , cJJ,) is a vector of m real constants. A singular prolongation, with a parameter c™, of /c n _ 1 , denoted by K™ = S C n (/e n - 1 ), is a family of vector fields Kn = (K?, . . . , KJJ,, K#) defined on J n (R, E™) by a
„n _
„n
«0" = (*? + W *
_
+ • • • + « - l + C_ 1 )«»-_ 1 1 + K""1 + XnmK%-\
n
^ " ^
where c = ( c " , . . . , c|J l _ 1 ,0) is a vector of m real constants, the last one being zero. A family of vector fields K" on J " ( E , R m ) , for n > 1, will be called an extended Kumpera-Ruiz normal form if K" = CT„ O • • • o ^ ( K 1 ) , where for each 2 < i < n the map Oi equals either Rct or Sci, for some vector parameters c1. In other words, a Kumpera-Ruiz normal form is a family of vector fields obtained by successive prolongations from the family of vector fields that spans the canonical contact system on J 1 (M, R m ) . The above defined prolongations and prolongations-based definition of extended Kumpera-Ruiz normal forms generalize for contact systems analogous operations introduced by the authors 28 for Goursat structures. Let x : M -> R(»+i)™+i =* J " ( R , R m ) be a local coordinate system on a manifold M, in a neighborhood of a given point pin M. We will say that an extended Kumpera-Ruiz normal form on J"(R, Rm), defined in x-coordinates, is centered at p if we have x(p) = 0. For example, on J 2 (R, E 2 ), we have the two following extended Kumpera-Ruiz normal forms -r2 d.
( JL-
-2,
\d^'
8if ) Xl 8^
I r2 +
a^
9
.
I -r1
9
.. I T 1
-+• X2 \X1 gjs
9
.. I
-TX2Q^
d "\ -+- Q^)
J ,
109
defined by R^^K1) and 5(0,o)(«*), respectively. These two normal forms are obviously centered at zero. Recall, see Definition 2.2 that we define a contact system as a distribution V such that each element 2?W of its derived flag has constant rank (i + l)m + l and contains an involutive subdistribution A C V^ that has constant corank one in PW, for 0 < i < n, that is as a distribution satisfyng the condition (i) of Theorem 2.1. Moreover, we say that a contact system is regular at p if it satisfies at p the regularity condition (ii) of that theorem and is singular at p if it fails to satify that condition. The following theorem from 27 asserts that extended Kumpera-Ruiz normal forms serve as local normal forms for all, regular or singular, contact systems for curves. Theorem 8.1 (extended Kumpera-Ruiz normal forms). A contact system T> of rankm+1 on a manifold M of dimension (n + l)m + 1 is equivalent, in a small enough neighborhood of any point p in M, to a distribution spanned by an extended Kumpera-Ruiz normal form, centered at p and defined on a suitably chosen neighborhood of zero. Acknowledgments This survey is based on lectures delivered by the authors at the Conference on Geometric Control Theory and Applications, organized by Alfonso Anzaido Meneses, Bernard Bonnard, Jean-Paul Gauthier, and Felipe Monroy Perez, at Universidad Autonoma Metropolitana (Unidad Azcapotzalco), Mexico City. The authors are grateful to the local organizers, in particular to Alfonso Anzaido Meneses and Felipe Monroy Perez, for having organized the conference and for their hospitality. Moreover, they are grateful to Jean-Baptiste Pomet for discussions on his results 1. References 1. E.Aranda-Bricaire and J.-B. Pomet, Some explicit conditions for a control system to be feedback equivalent to extended Goursat normal form. In Proceedings of the IF AC Nonlinear Control Systems Design Symposium, (Tahoe City California, 1995). 2. R.Bryant, Some aspects of the local and global theory of Pfaffian systems. Ph.D. thesis (University of North Carolina, Chapel Hill, 1979). 3. R.Bryant, S-S. Chern, R. Gardner, H. Goldschmidt, and P. Griffiths, Exterior Differential Systems. Mathematical Sciences Research Institute Publications. (Springer-Verlag, New York, 1991).
110
4. M.Canadas-Pinedo and C.Ruiz, Pfaffian systems with derived length one. The class of flag systems. Transactions of the American Mathematical Society, 353, 1755-1766, (2001). 5. E.Cartan. Sur I'equivalence absolue de certains systemes d'equations differentielles et sur certaines families de courbes. Bulletin de la Societe Mathematique de France, 42, 12-48. (Euvres completes, Part. II, Vol. 2, (Gauthiers-Villars, Paris, 1914). 6. M.Cheaito and P.Mormul, Rank-2 distributions satisfying the Goursat condition: All their local models in dimension 7 and 8. ESAIM Control, Optimisation, and Calculus of Variations, 4, 137-158 (1999). 7. G.Darboux, Sur le probleme de Pfaff. Bulletin des Sciences mathematiques, 2(6), 14-36, 49-68 (1882). 8. F.Engel, Zur Invariantentheorie der Systeme Pfaff'scher Gleichungen. Berichte Verhandlungen der Koniglich Sachsischen Gesellschaft der Wissenschaften Mathematisch-Physikalische Klasse, Leipzig, 41,42, 157-176; 192-207 (1889), (1890). 9. M.Fliess, J.Levine, P.Martin, and P. Rouchon, Sur les systemes non lineaires differentiellement plats. Comptes Rendus de I'Academie des Sciences, 315, 619-624 (1992). 10. M.Fliess, J.Levine, P.Martin, and P.Rouchon, Flatness and defect of nonlinear systems: Introductory theory and examples. International Journal of Control, 6 1 , 6, 1327-1361 (1995). 11. G.Frobenius, Uber das Pfaff'sche problem. Journal fiir die reine und angewandte Mathematik, 82, 230-315 (1877). 12. R.Gardner and W.Shadwick, The GS algorithm for exact linearization to Brunovsky normal form. IEEE Transactions on Automatic Control, 37, 2, 224-230 (1992). 13. A.Giaro, A.Kumpera, and C.Ruiz, Sur la lecture correcte d'un resultat d'Elie Cartan. Comptes Rendus de I'Academie des Sciences de Paris, 287, 241-244 (1978). 14. E.Goursat, Lecons sur le probleme de Pfaff, ( Hermann, Paris, 1923). 15. D. Hilbert, Uber den Begriff der Klasse von Differentialgleichungen. Mathematische Annalen, 73, 95-108 (1912). 16. B. Jakubczyk, Invariants of dynamic feedback and free systems. In Proceedings of the European Control Conference, Groningen, (1993). 17. A.Kumpera and C.Ruiz, Sur I'equivalence locale des systemes de Pfaff en drapeau. In F. Gherardelli, editor, Monge-Ampere equations and related topics, (Instituto Nazionale di Alta Matematica Francesco Severi, Rome, 201-247, 1982). 18. P.Libermann, Sur le probleme d'equivalence des systemes de Pfaff non
111
completement integrables. Publications Paris VII, 3, 73-110 (1977). 19. V. Lychagin, Local classification of non-linear first order partial differential equations. Russian Mathematical Surveys, 30, 1, 105-175 (1975). 20. P.Martin and P.Rouchon, Feedback linearization and driftless systems. Mathematics of Control, Signals, and Systems, 7, 235-254 (1994). 21. R.Montgomery and M.Zhitomirski Geometric approach to Goursat flags. (Preprint, University of California Santa Cruz 1999). 22. P.Mormul, Rank-2 distributions satisfying the Goursat condition: All their local models in dimension 9. (Preprint, Institute of Mathematics Polish Academy of Sciences, 1997). 23. P.Mormul, Goursat flags: classification of codimension-one singularities. (Preprint, Warsaw University, 1999). 24. R.Murray, Nilpotent bases for a class of nonintegrable distributions with applications to trajectory generation for nonholonomic systems. Mathematics of Control, Signals, and Systems, 7, 58-75 (1994). 25. P.Olver, Equivalence, Invariants, and Symmetry. (Cambridge University Press, 1995). 26. W.Pasillas-Lepine and W.Respondek, On the geometry of control systems equivalent to canonical contact systems: regular points,singular points, and flatness, in Proceedings of the 39th IEEE Conference on Decision and Control, 5151-5156 (Sydney, Australia, 2000). 27. W.Pasillas-Lepine and W.Respondek, Contact systems and corank one involutive subdistributions. To appear in Acta Applicand&Mathematicas (2001). 28. W.Pasillas-Lepine and W.Respondek, On the geometry of Goursat structures. ESAIM Control, Optimisation, and Calculus of Variations, 6, 119-181 (2001). 29. J-F. Pfaff, Methodus generalis, aequationes differentiarum partialum, nee non aequationes differentiales vulgares, utrasque primi ordinis, inter quotcunque variabiles, completi integrandi. Abhandlungen der KoniglichPreufiischen Akademie der Wissenschaften zu Berlin, Mathematische Klasse, 76-136 (1814-1815). 30. J.-B. Pomet, A differential geometric setting for dynamic equivalence and dynamic linearization. In B. Jakubczyk, W. Respondek, and T. Rzezuchowski, editors, Geometry in Nonlinear Control and Differential Inclusions, 32, 319-339. (Banach Center Publications, Warszawa, 1995). 31. D.Tilbury and S. Sastry, The multi-steering n-trailer system: A case study of Goursat normal forms and prolongations. International Journal of Robust and Nonlinear Control, 5, 4, 343-364 (1995). 32. A.Vinogradov, I.Krasil'shchik, and V.Lychagin, Geometry of Jet Spaces
112
and Nonlinear Partial Differential Equations. (Gordon and Breach, New York, 1986). 33. E. von Weber, Zur Invariantentheorie der Systeme Pfaff'scher Gleichungen. Berichte Verhandlungen der Koniglich Sachsischen Gesellschaft der Wissenschaften Mathematisch-Physikalische Klasse, Leipzig, 50, 207-229 (1898). 34. M.Zhitomirski, Normal forms of germs of distributions with a fixed segment of growth vector. Leningrad Mathematics Journal, (English translation), (2), 1043-1065 (1991).
113
THE BRACHISTOCHRONE PROBLEM A N D MODERN CONTROL THEORY HECTOR J. SUSSMANN Department of Mathematics Rutgers University Hill center, Busch Campus Piscataway, NJ 08854, USA E-mail: [email protected]. edu URL: http://www.math.rutgers.edu/~ sussmann JAN C. WILLEMS Department of Mathematics University of Groningen P.O. Box 800, 9700 AV Groningen, The Netherlands URL: http://wvjw.math.rug.nI/~ willems Dedicated to Velimir Jurdjevic on his 60"1 birthday. 1
Introduction
The purpose of this paper is to show that modern control theory, both in the form of the "classical" ideas developed in the 1950s and 1960s, and in that of later, more recent methods such as the "nonsmooth," "very nonsmooth" and "differential-geometric" approaches, provides the best and mathematically most natural setting to do justice to Johann Bernoulli's famous 1696 "brachistochrone problem." (For the classical theory, especially the smooth version of the Pontryagin Maximum Principle, see, e.g., Pontryagin et al. 31 , Lee and Markus 2 6 , Berkovitz 3 ; for the nonsmooth approach, and the version of the Maximum Principle for locally Lipschitz vector fields, cf., e.g., Clarke 12 '', Clarke et al. 14 ; for very nonsmooth versions of the Maximum Principle, see Sussmann 35-36,39,4o,4i,42,43,44. for t n e differential-geometric approach, cf., e.g., Jurdjevic 24 , Isidori 19 , Nijmeijer and van der Schaft 3 0 , Jakubczyk and Respondek 2 0 , Sussmann 37 - 40 .) We will make our case in favor of modern control theory in two main ways. First, we will look at four approaches to the brachistochrone problem, presenting them in chronological order, and comparing them. Second, we will look at several "variations on the theme of the brachistochrone," that is, at several problems closely related to the one of Johann Bernoulli. The first of our two lines of inquiry will be pursued in sections 3, 4, 5, and 6, devoted, respectively, to
114
1. Johann Bernoulli's own solution based on an analogy with geometrical optics, 2. the solution based on the classical calculus of variations, 3. the optimal control method, and, finally, 4. the differential-geometric approach. We will show that each of the three transitions from one of the methods in the above list to the next one leads to real progress for the brachistochrone problem, by making it possible to derive stronger conclusions about it than those that were obtainable by means of previously existing techniques. Specifically, we will point out that l.a the differential equation derived by Johann Bernoulli has spurious solutions (i.e., solutions other than the cycloids), as was first noticed by Taylor in 1715, but l.b the application of the Euler-Lagrange condition of the calculus of variations eliminates these solutions and leaves only the cycloids, and that 2.a with the calculus of variations approach, the existence of solutions is a delicate problem, due to the non-coercivity of the Lagrangian, but 2.b the optimal control control method renders this question trivial, reducing the proof of existence to a straightforward application of the Ascoli-Arzela theorem, and, moreover, 3.a the usual calculus of variations formulations require that one postulate that the optimal curve is the graph of a function x >-> y(x), but 3.b with the optimal control control method this postulate becomes a provable conclusion.
115
These observations show that the first two of the three transitions brought improvements to the analysis of Johann Bernoulli's problem. But the core of our argument is 6, where we look at the third transition and the advent of differential-geometric control theory. We will argue that, on the one hand 4.a previous methods require that we assume extra knowledge of physics (in the form of the law of conservation of energy) in order even to be able to translate Johann's problem into mathematics, and 5.a with those methods an anomalous situation appears to arise when one looks for a state space formulation of the problem and asks for its "Hamilton principal function" V (that is, the value function regarded as a function of both the initial and terminal states), since V ought to be a function of an even number v of variables, but for the brachistochrone problem the best choice of v seems to be five; but on the other hand 4.b in the differential-geometric framework one does not need to bring in conservation of energy in order to formulate the problem; instead, one writes the equations as a control system in ffi4, and then computes the accessibility distribution, which turns out to be three-dimensional at each point; this shows that a nontrivial conserved quantity exists locally, and direct computation shows that it exists globally and is equal to the energy, and 5.b the anomaly described in 5.a disappears, and the principal function V turns out to be defined on E 4 x K4 (although V(q, q') < +00 if and only if q and q' belong to the same leaf of the foliation of R4 by the level submanifolds of the energy function); the method provides a full explanation for the apparent anomaly, by showing that the full optimal synthesis requires impulse controls that lead to instantaneous jumps. The discussion in 6 will also show that the differential-geometric approach brings into the problem interesting connections with other mathematical ideas. To begin with, the analysis leads naturally to a minimum time optimal control problem with a four-dimensional state space and an unbounded scalar control entering linearly into the dynamics. Moreover, this problem is quite degenerate, because its accessibility distribution happens to be three-dimensional, as
116
pointed out above. Since the problem is affine in the control but not linear, complete controllability on each leaf is in principle problematic. The accessibility Lie algebra turns out to be the "diamond Lie algebra," a well known four-dimensional solvable Lie algebra that occurs, for example, in the study of the quantized harmonic oscillator. To establish controllability on each leaf one needs additional arguments, and it turns out that the issue can be settled by computing the Jurdjevic-Kupka Lie saturate (cf. Bonnard et al. 8 ' 9 , Jurdjevic-Kupka 21>22; Jurdjevic 24 ) of the family of system vector fields. The minimum time problem then makes sense in principle for every pair of points belonging to the same leaf. But, since the control is unbounded, it is not at all obvious that an optimal control joining A to B exists for every pair (^4, B) of points belonging to a given leaf L. An application of the Maximum Principle yields a vector field Z tangent to the leaves such that all optimal arcs are integral curves of Z. Since, in general, two arbitrarily chosen points A, B of a leaf L do not lie on the same integral curve of Z, it follows that for general pairs (A, B) € L x L an optimal control joining A to B does not exist. On the other hand, when a control system is completely controllable and a suitable properness condition is satisfied a , then most mathematicians would agree that optimal controls ought to exist, and if they do not then this can only happen because we have not made a good choice of our space U of admissible controls, so U has to be enlarged by adding suitably defined "generalized controls." (For a trivial example, consider the problem of moving from A to B in minimum time with the dynamics x = u, u arbitrary, so in particular no bound is prescribed for the velocity u. Clearly, the "minimum time," is 0, and to attain it one needs to allow instantaneous jumps, which means that the class of ordinary controls has to be enlarged to allow for Delta functions. We will show in 6 that something like this happens for the brachistochrone.) In our case, the true complete optimal synthesis of the problem turns out to require impulse controls, a fact known to be common in problems with "cheap" unbounded controls, and studied in other (linear quadratic) contexts by Jurdjevic and Kupka, cf. Jurdjevic-Kupka 23 , Kupka 2 5 . Our second group of arguments in favor of modern control is presented in 7, and consists of "five modern variations on the theme of the brachistochrone." Following a suggestion made by M. Hestenes in 1956, we "speculate on what type of brachistochrone problem the Bernoulli brothers might have formulated had they lived in modern times" (Hestenes 18 , p. 64), and we "See Definition 6.3 for a precise statement of the properness condition, and Proposition 6.4 for the proof that the condition is satisfied for the brachistochrone problem.
117
extend Hestenes' idea by speculating not only on "what type of . . . problem the . . . brothers might have formulated" but also on what type of answer they might have liked for such problems. We deal with five questions. The first one is Hestenes' own "variation of the brachistochrone problem," which turns out to be a problem solvable by optimal control but not by classical calculus of variations techniques. The second "variation" is the "reflected brachistochrone," a very natural extension of Johann's problem, in which the state space is the whole plane rather than a half plane. This problem is no longer solvable by means of traditional smooth or nonsmooth optimal control methods—because it is a minimum time optimal control problem with a non-Lipschitzian right-hand side—but turns out to lend itself to the "very nonsmooth" approach, which yields a solution with a simple geometric interpretation that, in our view, the Bernoulli brothers would have liked. The third "variation" is the brachistochrone problem with friction, which corresponds, mathematically, to "unfolding" the original Johann Bernoulli question by embedding it in a one-parameter family of problems, the parameter being, of course, the friction coefficient p. These problems turns out to be nondegenerate, in the sense that, for example, their accessibility distribution has rank four at each point. Consequently, their time-optimal controls behave qualitatively like the time-optimal controls for generic control-affine systems in R4 with scalar unbounded controls. In particular, through every point there pass a one-parameter family of optimal curves, since the optimal control is given in feedback form as a function of the state and of an additional scalar parameter that stays constant along each trajectory. The analysis clarifies the way in which the original problem of Johann Bernoulli is degenerate: the accessibility Lie algebra is eight-dimensional for p ^ 0 but four-dimensional for p = 0; the accessibility distribution has rank four everywhere if p ^ 0 but has rank three everywhere if p = 0; the optimal "feedback" control is a function of the state and an extra parameter, but this parameter only occurs in a term that is multiplied by p, so that when p = 0 the optimal control is a function of the state only. The fourth "variation" deals with the derivation of Snell's law of refraction from general necessary conditions for an optimum. Since Snell's law plays a crucial role in Johann's solution of the brachistochrone problem, we speculate that a natural question for him to have asked "if he had lived in modern times" is whether the general theories about minimizing curves that were created in the three centuries after Johann's work actually cover the Snell law case. Our answer is negative if only the classical calculus of variations and pre-1990 smooth and nonsmooth optimal control are considered, but becomes positive
118
if we add to the list our recent very nonsmooth methods To conclude the paper, we take the liberty of including a fifth "variation" that we find amusing but does not really have much to do with control theory. In this "variation"—which is a digression on a topic of great importance and related to geometry, to the brachistochrone, and to Johann Bernoulli—we "speculate" that Johann Bernoulli would have found it very interesting to know b that with his work on the brachistochrone he had been extremely close to solving one of the most important open problems in Geometry, namely, that of the independence of Euclid's fifth postulate from the other ones. (The problem had intrigued mathematicians since ancient times—cf., e.g., Bonola 10 —and was very much in the forefront of research in geometry in Johann's times, as shown by the publication of Saccheri's book 32 in 1733. A definitive solution was found by Beltrami in 1868, cf. Beltrami 2 and Stillwell 33 .) Remark 1.1. An earlier, related version of the argument presented here appeared in 3 8 . That paper also dealt with the evolution from the brachistochrone to the calculus of variations and optimal control, but focused on a different set of issues, especially on the search for the correct Hamiltonian formulation of the necessary condition for an optimum and how opportunities were missed before the "control Hamiltonian" was finally discovered in the 1950s. Q 2
Johann Bernoulli and the brachistochrone problem
In the June 1696 issue of the journal Acta Eruditorum, Johann Bernoulli—a Swiss-born professor of mathematics at the University of Groningen, in the Netherlands0—challenged mathematicians to solve the "problema novum" of finding the curve that—following Leibniz' suggestion—he named the "brachistochrone," from the Greek /3paxicroq: shortest, and xP^vcx;: time d . The journal's May 1697 issue contains articles on this problem by six of the most renowned mathematicians of the time: Johann Bernoulli, Johann's elder brother Jakob, Leibniz, Newton, the marquis de l'Hopital, and Tschirnhaus. Leibniz called Johann Bernoulli's question a "splendid" problem. The fact that he and five other thinkers of such unparalleled distinction chose to * Although he probably would not have thought of asking the question. c T h e Bernoulli family was originally from Antwerp in Flanders, where they lived until 1583. Like many other Flemish Protestants, they fled Flanders to escape religious oppression by the Spanish rulers. They spent some time in Frankfurt, and finally settled in Basel, Switzerland, early in the seventeenth century. d So the brachistochrone is the solution of a minimum time problem, i.e., of a genuine optimal control problem in the modern sense of the term!
119 become actively engaged in the search for its solution shows that at the time of its formulation the problem was perceived by the leaders of the mathematical community as a major challenge, lying at the very frontiers of their research. It would not have been easy at that point to predict the future impact on science of Johann Bernoulli's challenge. Yet, the six famous authors of the 1697 papers must have had their own reasons to sense that an exceptionally promising line of inquiry was being opened, and the developments of the 300 years that followed have lent ample support to their intuition. Today, it is clear that the events of 1696-97 represented a critical turning point in the history of mathematics, and it is widely agreed that with them an important new field, later to be called the "calculus of variations," was born. R e m a r k 2 . 1 . The words "calculus of variations" were first used by Euler in a 1760 paper, cf. 1 5 . Euler had received, in 1755, a letter from a 19-year-old mathematician from Turin, called Ludovico de la Grange Tournier, where a new method was proposed for the study of variational problems, by using what we would now call "variations" (cf. Goldstine ie, p. 110.). This was quite different from the approach Euler had been using until then, and presented in his 1744 book entitled Methodus inveniendi maximi minimeve proprietate gaudentes sive solutio problematis isoperimetrici latissimo sensu accepti, that is "A method for discovering curved lines having a maximum or minimum property or the solution of the isoperimetric problem taken in its widest sense." Euler's earlier method was based on a time-discretization followed by a passage to the limit?, as the size h of the time intervals goes to 0. Euler was so enthusiastic about the new idea of the unknown youngster from Turin that he dropped his own method, adopted that of Lagrange instead, and renamed the subject the calculus of variations. In the summary to his first paper using variations, Euler says "Even though the author of this [Euler] had meditated a long time and had revealed to friends his desire yet the glory of first discovery was reserved to the very penetrating geometer of Turin LA GRANGE, who having used analysis alone, has clearly attained the same solution which the author had deduced by geometrical considerations." £> T h e newborn subject was destined t o become a major area of m a t h e matics. Moreover, its implications would soon extend t o physics, whose very foundations would b e shown in t h e 18th century—by Euler, Maupertuis, Lagrange, and others—to depend on principles formulated in t h e language of the calculus of variations. So it is not surprising t h a t t h e story of t h e brachistochrone is told in e
Euler himself talked about "letting h be infinitesimal."
120
many books, and that the importance of this curve is stressed in most histories of mathematics. The brachistochrone has received honors not commonly bestowed on mathematical curves. For example, it is depicted in a stainedglass window of the Academy building (the main venue of the university) in Groningen. Moreover, the brachistochrone is probably the only mathematical curve that has been deemed worthy of a monument in its honor. In 1994, on the occasion of the 375th anniversary of its foundation, the University of Groningen decided to honor in various ways the most famous former members of its faculty. In the case of Johann Bernoulli, who had been a professor from 1695 to 1705, a decision was made to honor him by erecting a monument to the brachistochrone, one of the most significant discoveries—and one of which he was particularly proud—that he made during his Groningen period. The unveiling of the brachistochrone monument, located in the Zernike complex of the University, took place in 1996, coinciding with the 300th anniversary of the publication of Johann's challenge in Acta Eruditorum. 2.1
Johann Bernoulli's challenge
The June 1696 challenge took the form of an "Invitation to all Mathematicians to solve a new problem": If in a vertical plane two points A and B are given, then it is required to specify the orbit AMB of the movable point M, along which it, starting from A, and under the influence of its own weight, arrives at B in the shortest possible time. So that those who are keen of such matters wiJJ be tempted to solve this problem, is it good to know that it is not, as it may seem, purely speculative and without practical use. Rather it even appears, and this may be hard to believe, that it is very useful also for other branches of science than mechanics. In order to avoid a hasty conclusion, it should be remarked that the straight line is certainly the line of shortest distance between A and B, but it is not the one which is traveled in the shortest time. However, the curve AMB—which I shall disclose if by the end of this year nobody else has found it—is very well known among geometers. Later, Johann Bernoulli changed his original plan to disclose the solution by the end of 1696 and, following a suggestion by Leibniz, extended the deadline until Easter 1697.
121
Responses came from the best mathematical minds of the time. In addition to Johann's own solution, there was one by Leibniz, who had communicated it to Johann in a letter dated June 16, 1696; another one by Jakob; one by Tschirnhaus; one by l'Hopital and, finally, one by Newton. Newton's solution was presented to the Royal Society on February 24, 1697 and published, anonymously and without proof, in the Philosophical Transactions. (The identity of the anonymous author was clear to Johann Bernoulli, since he thought that you can know ex ungue leonem, i.e., that you can know "the lion from its claws.") The May 1697 issue of Acta Eruditorum contains articles with the solutions by Johann 5 and Jakob 6 , as well as contributions by Tschirnhaus and l'Hopital, a short note by Leibniz, remarking that he would not reproduce his own solution, since it was similar to that of the Bernoullis, and Newton's anonymous paper, reprinted from the Philosophical Transactions. 2.2
The cycloid's double role as the brachistochrone and the tautochrone
As is apparent from Johann Bernoulli's announcement, he was under the impression that the problem was new. However, Leibniz knew better: Galileo in his book on the Two New Sciences in 1638 formulated the brachistochrone problem and even suggested that the solution was a circle. (Galileo had showed—correctly—that an arc of a circle always did better than a straight line—except, of course, when B lies on the same vertical line as A—but he then wrongly concluded from this that the circle is optimal.) Johann Bernoulli considered the fact that Galileo had been wrong on two counts, in thinking that the catenary was a parabola, and that the brachistochrone was a circle, as definitive evidence of the superiority of differential calculus (or the Nova Methodus, as he called this new set of techniques, following Leibniz). He was thrilled by his discovery that the brachistochrone was a cycloid. This curve had been introduced by Galileo who gave it its name: related to the circle. Huygens had discovered a remarkable fact about the cycloid: it is the only curve with the property that, when a body oscillates by falling under its own weight while being guided by this curve, then the period of the oscillations is independent of the initial point where the body is releasedJ Therefore Huygens called this curve, the cycloid, the tautochrone (from TOLVTCX;: equal, and xpovcx;: time). Johann Bernoulli was amazed and somewhat puzzled, it seems, by the fact that the same curve had these two remarkable properties related to the 'Contrary to what Galileo thought, the circle has this property only approximately: the period of oscillation of a pendulum is a function of its amplitude.
122
time traveled on it by a body falling under its own weight: Before I end I must voice once more the admiration that I feel for the unexpected identity of Huygens' tautochrone and my brachistochrone. I consider it especially remarkable that that this coincidence can take place only under the hypothesis of Galilei [HJS
2.3
Why was the brachistochrone so important in 1696-97?
Perhaps the crucial fact about Johann Bernoulli's question is that he is telling us very clearly that the solution is a curve that everybody knows, and yet there appears to be no way to guess which curve it is by means of some intuitive geometric argument. This situation should be compared with that arising in another famous and much older question about curve minimization, the isoperimetric problem. In this problem, we are asked to find the simple closed rectifiable curve that encloses the largest possible area among all simple closed rectifiable curves of a given length, and the answer also turns out to be a curve that is certainly "very well known among geometers," namely, a circle. But in this case it is fairly easy to guess directly that the answer is a circle, even though it is a delicate matter to give a rigorous proof. For example, it suffices to observe that the question is rotationally symmetric, so the answer ought to be a rotationally symmetric curve, and the only such curve is the circle. Although this argument does not amount to a completely rigorous proof9, it is undoubtedly rather convincing, and makes it quite obvious that the solution must be a circle. Therefore, if Johann Bernoulli had challenged "the best mathematical minds" of his time to solve the isoperimetric problem, it would not have made s
T h e argument assumes the existence of a solution, as well as its uniqueness up to translations. Indeed, it might conceivably happen that the solution is not unique, and in that case the only conclusion one can draw from rotational symmetry is that every rotation will map a solution 5 to a solution S', which may be different from S.
123
sense for him to say, as he did in his brachistochrone challenge, that "the curve—which I shall disclose if by the end of this year nobody else has found it—is very well known among geometers," since everybo,y would have guessed immediately that the mysterious curve was the circle. In the case of the brachistochrone, there is no way to guess h. The problem was posed and solved at a time when the Calculus was being invented, and cannot be solved without using Calculus. So it provides an impressive illustration of the power of the Calculus to solve problems that cannot be solved in any other way, and to discover truths about the world that can be stated and understood in "well known" terms ', but can only be arrived at by using this new mathematical tool. 3
The standard formulation and Johann Bernoulli's solution
We now discuss the "standard formulation" of the brachistochrone problem in modern mathematical language, as it is usually found in modern books, and Johann Bernoulli's ingenious solution based on an analogy with geometrical optics. In our view, the standard formulation is questionable. We will explain our objections in 6 below, where we will make a case for a different approach, based on differential-geometric control. But first it is necessary to present the classical point of view in detail. The key fact is that the "movable point M" is supposed to move "under the influence of its own weight," which presumably means "under the influence of its own weight and nothing else other than whatever force is needed to keep M on the given curve." This means, in particular, that energy is conserved. 3.1
Formulation of the brachistochrone problem
We choose x and y axes in the plane with the y axis pointing downwards, use (a, a) and (b, j3) to denote, respectively the coordinates of the endpoints A fe Even now, 300 years later, we are not aware of any simple geometric argument that might lead to the cycloid without using Calculus. ' T h i s is an important point. Compare the brachistochrone, for example, with Newton's 1685 work on the shape of a body with minimal drag. In this case, the solution also requires the Calculus, but once we have solved the problem all we find is a curve defined by some strange formula. This curve was not "well known to geometers," and would not have had any particular significance to mathematicians, who would only retain from going through Newton's solution the fact that the solution is "some curve." In the case of the brachistochrone, one does not need the Calculus to explain what the solution is, since cycloids were already well understood curves, but one does need it to discover that the solution is a cycloid.
124
and B of our curve, and fix a number B e l , thought of as the total energy of the particle. The condition characterizing the feasible paths [0,T] B t H- ^(t) = (*(*)»!/(*)) e K2
(3.3.1)
will then be—using m, g to denote, respectively, the particle's mass and the gravitational acceleration—that the sum of the kinetic energy ^{xty)2 +y(t)2) and the potential energy —gmy(t) is equal to E. We can certainly choose units so that m = 1. The condition then becomes l
-(i(t)2
+ y{t)2)-gy{t)
= E.
So we formally define an {A, B, E)-feasible trajectory (or (A, B, E)-feasible path) to be a map (3.3.1), defined on some interval [0, T], such that (i) t(0) = A, £(T) = B, (ii) £ is absolutely continuous, (iii) \{x{t)2 + y(t)2) = E + gy{t) for almost all t G [0,T]. Notice that Condition (iii) implies in particular that the derivatives x and y are bounded, so a feasible path is in fact Lipschitz continuous. Condition (i) states that the path £ must start at A and end at B. The number E + ga is the initial kinetic energy of the body. (Often, only the case E + ga = 0 is considered, corresponding to the body being dropped at time 0 with zero initial speed.) Condition (ii) is of course a modern technicality that would have made no sense to Johann Bernoulli, who was far from imagining that there could exist curves that cannot be recovered by integrating their derivative. Condition (iii) reflects conservation of energy. (The law that the kinetic energy of a body which has fallen from a height h is increased by an amount proportional to h was due to Galileo, and was well known in Bernoulli's time.) An (A, B, .E)-feasible path £ : [0, T] -> E 2 is said to be optimal if there exists no (A, B, £)-feasible path f: [0, f ] -> E 2 for which f < T. An (A, B, E)brachistochrone is either (a) an optimal (A, B, E)-feasib\e path, or (b) the geometric locus in ffi2 of such a path, that is, a subset B of E 2 such that B = {(x,y) G E 2 : there exists t G [0,T*], such that (x,y) = £*(*)} , for some optimal (A, B, .E)-ieasible path £* : [0, T*] i-> E 2 . A brachistochrone is an object which is an (A, i?,.E)-brachistochrone for some A,B,E.
125
Obviously, a feasible path" (3.3.1) must be entirely contained in the closed half-plane H
+(v) = {(x,y) : y > -n] ,
where
n= —,
(3.3.2)
since, for y(t) to be < — n, the kinetic energy would have to be negative, which is of course impossible. It is clear that the solution cannot always be a straight line, as Johann Bernoulli rightly points out in his "Invitation." For example, consider the extreme case when E = 0 and a = ct = (3 = Qj£b. (That is, when the endpoints of the curve are at the same height but do not coincide, and the initial kinetic energy vanishes.) We will show that in this situation there is a feasible path that goes from A to B in finite time. Since the straight-line segment S from A to B is horizontal, (iii) implies that the speed of motion along S vanishes, and then it will follow that S cannot be an optimal path, because the motion along S takes infinite time. To construct a feasible path from A to B we follow a circle C with center on the x axis. The circle can be parametrized by x, taking values in the interval [0, b]. The speed v of motion along C will be proportional to ^/y, and hence bounded away from 0 as long as x is bounded away from 0 and b\. Near A, we can also use y as a parameter, and then y \, 0 as x ], 0, and v behaves like -^ and also like y/y, so ^ ~ -j=, showing that t stays finite as x and y approach 0, since the function y H-> -4= is integrable near y = 0. A similar argument shows that t stays finite near B. So C joins A to B in finite time. 0 3.2
Johann Bernoulli's
solution
It turns out that the brachistochrone is a cycloid. When E = 0, it is the curve described by a point P in a circle that rolls without slipping on on the x axis, in such a way that P passes through A and then through B, without hitting the x axis in between. (It is easy to see that this defines the cycloid uniquely.) Johann Bernoulli's ingenious derivation of the brachistochrone has been the subject of numerous accounts, but since this event plays a crucial role in our own tale, we will briefly tell the story again. Johann Bernoulli based his derivation on Fermat's minimum time principle. If we imagine for a moment that instead of dealing with the motion of a moving body we are dealing with a light ray, Condition (iii) above gives us a °FVom now on, as long as A, B and E are fixed, we just talk about "feasible paths" and "brachistochrones," without explicitly specifying A, B and E.
126
formula for the "speed of light" c(x,y) as a function of the position (x,y): c(xiy)
= ^/2E + 2gy.
We now change coordinates so that E = 0. (This amounts to moving the x axis upward to a height rj.) Then r) is now equal to 0, so our feasible paths live in the half plane H+ = H+(0). Also, let us rescale—or, if the reader so prefers, "change our choice of physical units"—so that 2g = 1. Then our problem is exactly equivalent to that of determining the light rays—i.e., the minimum-time paths—in a plane medium where the speed of light c varies continuously as a function of position according to the formula c[x,y) = y/y. It is then reasonable to expect that, if we use a suitable discretization of our problem, then the optimal paths of the discretized problem will approach those of the original problem as the discretization parameter 5 goes to 0. Johann Bernoulli did this as follows. Let us define ysk=kS
for
S > 0 and it = 0 , 1 , . . . ,
and then divide the half-plane H+ into horizontal strips Sskd={(x,y)eR2:yi
for
k = 0,1,...,
of height S. Johann's discretization is then obtained by replacing c by a constant csk in each strip Sk. The constants c\ are defined by
4 = yvi+i
for
each
k
•
He then computed the light rays for the original problem by taking the limit of the light rays for the discretized problem as 6 j . 0. The light rays of the discretized problem can be studied using Snell's law. Clearly, the paths will be straight-line segments within each individual strip, and all that needs to be done is to determine how these rays bend as they cross the boundary between two strips. The answer is provided by the laws of optics as developed by Snell, Descartes, Fermat, Leibniz and Huygens. Snell had observed that, if the speeds of light on the two sides Hi, Hi of a line L in the plane are different constants «i, i>2, and a light ray R traverses Hi and then enters H2 after being refracted at L, then the ratio fj^V" °^ ^ e s m e s of the incidence and refraction angles is a constant, independent of 61. (By definition, 9i is, for i = 1,2, the angle formed by the part Ri of R that lies in Hi with the line perpendicular to L. It is clear that the Ri are straight lines.) Fermat subsequently showed that this is precisely what happens when light
127
is assumed to follow a minimum-time path. The law relating the incidence angles to the velocities of propagation is due to Leibniz and Huygens, and implies the law of Snell. It says that sin 0i vi ——— = — sin0 2 v2
. or, equivalently,
sin 0i sin0 2 = , vi v2
,,,,>
v(3.3.3)
Johann Bernoulli used (3.3.3) to study the light rays of the discretized problem. If R is such a ray, and we let Rk be the part of R that lies in the strip S|, and use 8sk to denote the angle of Rk with a vertical line, then (3.3.3) enabled him to conclude that the quantity
sin0f
t1/in is a constant, since in each strip S% the speed of our light ray is t/yjfe+1. Passing to the limit as 5 \. 0, we conclude that the sine of the angle 0 between the tangent to the brachistochrone and the vertical axis must be proportional to ^/y. Since dx
sin 0 = sjdx
1
+ dy2
,
we find that dx2 dx + dy2 ~ 2
Vy
'
where v is a constant. Then dx2 4- dy2 _ 1 dx2 vy I.
e., 1 + y'(x)2
C 1 - — , where C = - .
So the curve described by expressing the y-coordinate of the brachistochrone as a function of its x-coordinate will satisfy the differential equation
y'(x) = J^f^, V y(x) for some constant C.
(3.3.4)
128
The parametrized curves tp i-> (x((p),y(
C y(ip) = —(1-cos
0<
satisfy (3.3.4). It is easily seen that these equations specify the cycloid generated by a point P on a circle of diameter C that rolls without slipping on the horizontal axis, in such a way that P is at {XQ, 0) when ip = 0. Moreover, it is also easy to check that (*) given two points A and B in H+ there is exactly one curve passing through A and B and belonging to the family of curves (3.3.5). (The parameter count for (*) is obviously right: (3.3.5) describes a 2parameter family of curves; requiring that the curve pass through A fixes one of the parameters, and then asking that it pass through B fixes the other. Proving (*) requires a bit more work, but is still quite easy.) 4 4-1
Spurious solutions and the calculus of variations approach Remarks on Johann Bernoulli's
argument
The argument that we have presented is the one of Johann Bernoulli, and Equation (3.3.4) is the one that he wrote in his paper, followed by the statement "from which I conclude that the Brachistochrone is the ordinary Cycloid." (He actually wrote dy = dx./-^^, but he was using x for the vertical coordinate and y for the horizontal one. Cf. 34 , p. 394.) In present day notation, it is customary to adopt the convention that the symbol y/r stands for the nonnegative square root of r, but it is obvious that Johann Bernoulli did not have this in mind. What he meant was, clearly, what we would write today as
or, equivalently, y(x)(l + y'(x)2) = constant.
(4.4.2)
In particular, the solution curves should be allowed to have a negative slope, contrary to what a "modern" interpretation of (3.3.4) might suggest. But y' should stay continuous, so that a switching from a + to a — solution of (4.4.1) is not permitted.
129 Even with the more accurate rewriting (4.4.2), the differential equation derived by Johann Bernoulli also has spurious solutions, not given by (3.3.5). Indeed, for any y > 0, the constant function y(x) = y is a solution, corresponding to C = y. More generally, one can take an ordinary cycloid given by (3.3.5), follow it up to y>=7r—so that dy/dx=Q—then follow the constant solution y(x)=C for an arbitrary time T, and then continue with a cycloid given by (3.3.5). Such paths are, indeed, compatible with Huygens' law of refraction. Remark 4.1. The existence of the spurious solutions was pointed out by Brook Taylor (1685-1731) in 1715. (Taylor is known to mathematicians as the discoverer of "Taylor's expansion," the inventor of integration by parts, and the creator of the calculus of finite differences. Like many other mathematicians of his time, he was involved in a priority dispute with Johann Bernoulli, involving Taylor's 1708 solution of the problem of the center of oscillation, that went unpublished until 1714.) <£ 4-2
The calculus of variations approach
The spurious solutions, and all the other problems, such as the apparent arbitrariness of the requirement that y' be continuous, can be eliminated in a number of ways. For example, one can prove directly that the spurious trajectories are not optimal. Or one can use, as an alternative to Johann Bernoulli's method, the calculus of variations approach, based on writing the Euler-Lagrange equation. This equation gives a necessary condition for a function y* to minimize the integral
J= /
L(y(x),y'(x),x)dx
Ja
within the class y of all functions [a,b] B xt-^ y(x) e R that satisfy some appropriate technical conditions (for example, the requirement that y(-) is Lipschitz continuous) and are such that y(a) and y(b) have given values a, j3. Here, the "Lagrangian" 0 x R x [a, b] 3 {y,u,x) >->• L(y,u,x)
GR
is a given function, and Q is an open interval in R. Under suitable technical
130
conditions on L, yt, and the set y, if j/* is a minimizer then the condition
dx \ du X
V* (x) ,y'.(x),z))
=-Q- (v* (x), y', (x), x)
(4.4.3)
must be satisfied for almost all x. It is easy to see that the above result can be applied to the brachistochrone problem, provided that we postulateb that it suffices to consider curves in the x, y plane that are graphs of functions x t-> y(x) denned on the interval [a, b]. Then Constraint (iii) can be written (taking E = 0 and 2g = 1 as before) as dx2 +dy2 = y dt2 , which gives dt =
v/cte2 + dy2 — = L(y, y )dx.
where
L(y,u) = ^ ± Z .
(4.4.4)
Vy So Johann Bernoulli's problem becomes that of minimizing the integral J = /
L{y(x),y'{x))dx,
J a
subject to y(a) = a and y(b) = /?, where L is given by (4.4.4). This gives the equation l + y'(x)2+2y(x)y"(x)
= 0,
(4.4.5)
which is stronger than (4.4.2), since (4.4.2) is equivalent to y' + y' +2yy'y" = 0, i.e., to j/'(l + y' + 2yy") = 0, whose solutions are those of (4.4.5) plus the spurious solutions found earlier. It is easy to verify that the solutions of the Euler-Lagrange equation (4.4.5) are exactly the curves given by (3.3.5), without any extra spurious solutions, showing that, for the brachistochrone problem, the Euler-Lagrange method gives better results than Johann Bernoulli's approach. We now show that optimal control is even better. ''With optimal control, this "postulate" becomes a provable conclusion, as will be shown in 5 below.
131
5
The optimal control approach
Having shown that the Euler-Lagrange equation gives better results than Johann Bernoulli's method, we now look at another important step in the evolution of the theory of minimizing curves, namely, optimal control theory. We show that optimal control provides even better results in two ways: 1. With optimal control it is no longer necessary to postulate that the optimal curve is the graph of a function x i-» y(x). 2. In the optimal control setting, the problem of the existence of an optimum becomes trivial. 5.1
The brachistochrone as an optimal control problem
We can formulate Johann Bernoulli's question as an optimal control problem, in which the motion takes place in the x, y plane, and the dynamical behavior is given by x = u^/\y\,
y = vy/\y~\.
(5.5.1)
Here the control is a 2-dimensional vector (u, v) taking values in the set U = {(u,v):u2+v2
= 1}.
(5.5.2)
(Actually, for the "true" brachistochrone problem the motion is restricted to the upper half plane, so we could have written y/y rather than y/\y\. We choose to use the more general expression y\y\ because later, in the study of the reflected brachistochrone, we will want to work in the whole plane.) Let us use p, q, p0 to denote, respectively, the momentum variables conjugate to x and y, and the abnormal multiplier. Then the control theory Hamiltonian H(x,y,u,v,p,q,po,t) is given (using a = sgny) by the formula H(x,y,u,v,p,q,p0,t)
= (pu + qv)y/ay - p 0 .
The Pontryagin Maximum Principle then tells us that if a curve [0,T] 9 i .->£(*) = (*(*),»(*)) is optimal then there exist absolutely continuous functions t »->• p(t) and 11-> q{t), and a nonnegative constant po, such that, if we let u = x, v — y, and write p to denote the momentum vector (p, q), and \\p\\ to denote its Euclidean norm y/p2 + q2, then the Hamiltonian maximization conditions
• ( 0 -iwoii' "<,, = »
(5 5 3)
'-
132
as well as the "adjoint system" of differential equations
«„=„,
W-_*'*M±%M«)__-4ai 2
2 vow)
,
(5.5.4)
v<w)
are satisfied for almost all t. (Notice that \\plt)\\ ^ 0. Indeed, the Maximum Principle also gives the condition that H = 0. So ||p|| = 0 would imply p0 = 0, contradicting the nontriviality of the triple (p(t),q(t),p0).) If the constant p vanishes, then x = 0, so we get a vertical line. Otherwise, x is always ^ 0, showing that we can use x to parametrize our solution. Since y>{x)
=
!k ax
=
i = l = lt x u p
(5.5.5)
we have l+y'(x)2
= ^ -
(5.5.6)
and y
„{x)
=
l.*L p dx
i.. px
=
(s.5.7)
But (5.5.1) and (5.5.3) imply that a; =
*
Ibll
,
(5.5.8)
and then (5.5.4) and (5.5.7) yield »"(*) = -
2 ML & • 2
<5-5-9)
2t/P So 2yy" = - ^
= - ( 1 + y'2), and then l + y'2+2yy"
= 0,
(5.5.10)
which is exactly Equation (4.4.5). As before, this leads to the cycloids, with no "spurious solutions." Notice that this argument does not involve any discretization or any use of Snell's law of refraction. Notice also that in our control argument we have not postulated that the solution curves could be represented as graphs of functions y(x). We have proved it! This should be contrasted with the analysis based on the calculus of variations, where this fact had to be postulated.) This is one example showing that, for the brachistochrone problem, the optimal control method gives better results than the classical calculus of variations.
133
5.2
Optimality proofs and the existence problem
For a second example showing the advantages of the optimal control method over the classical calculus of variations for the specific case of the brachistochrone problem, we turn to the question of the rigorous proof of the optimality of Bernoulli's cycloids. Clearly, no argument based solely on necessary conditions for optimality will ever prove that a trajectory is optimal, because it could happen, for example, that there are no optimal trajectories, in which case the statement that "every optimal trajectory is a cycloid" would be vacuously true. If we really want to prove the optimality of Bernoulli's cycloids, an extra step is needed. To give a rigorous proof of the optimality of the cycloids, valid for all choices of A and B in the closed half-plane H+ = {x,y) : y > 0}, one can proceed in two ways. One possibility (cf. Bliss 7 ) is to use Hamilton-Jacobi theory. Another, much simpler, approach, is to prove first the existence of an optimal trajectory. Indeed, once this existence result is known for all A, B 6 H+, then it is clear that any optimal trajectory £» : [0, T] t-> H+ going from A to B must be such that its restriction £, \[a, /?] to every subinterval [a, /?] of [0, T] is also optimal for the problem P(a, /?) with endpoints £»(«), £*(/?). (This is a rather obvious point, known in optimal control theory as the "principle of optimality," and noticed in the case of the brachistochrone by Jakob Bernoulli.) If one assumes that £» (a) and £* (/?) are interior points of H+ for 0 < a < /3 < T, then the necessary conditions for optimality give a unique candidate for the solution of P(a,/3), namely, the unique curve in the family given by (3.3.5) that goes through £*(a) and £»(/?). So £* is such that ^»[[a,/?] is given by (3.3.5) whenever 0 < a < /? < T, and this easily implies that £* itself is a cycloid given by (3.3.5). To complete the argument, one has to exclude the possibility of a solution of the problem with endpoints A, B which intersects the x axis at some other point. This can be done by a simple qualitative argument, using the following lemma. L e m m a 5.1. Suppose L is a horizontal line contained in the interior of the upper half-plane H+. Let P, Q be distinct points of L. Then the straight-line segment joining P to Q is strictly faster than any trajectory C, from P to Q such that £ lies entirely in the lower closed half-plane determined by L and at least one point of C, lies strictly below L. Proof. Let T^, T^ be the corresponding times. Let y be the y coordinate of the points of L, so y > 0. Write P = (p,y), Q = (q,y). Let £ : [0,T] >-> H+
134 be the horizontal segment from P to Q. Then Te = Now let [a,b] 3 t i-> ((t) = (x(t),y(t)) in our statement. Then
\q-p\
Vy ' G H+ be a trajectory from P to Q as
m< warn = Vv(d
< ( 6 - o ) v ^ = T c V».
JO
Therefore T, > i| g^ ~- ^Pil = r € , and our proof is complete.
0
d
The following is an immediate consequence of the lemma. Corollary 5.2. Lei A, B be points of H+ such that A^ B, and assume that £ : [a, b] i-+ if+ is h'me optimal. Then £(t) is an interior point of H+ whenever a
135
So everything hinges upon the existence result. It turns out that this is not at all trivial if the brachistochrone problem is reformulated in terms of the classical calculus of variations 0 , because (a) the Lagrangian L given by L(y, u) =
t^
has a singularity at y = 0,
and, even more importantly, (b L is not sufficiently coercive for the usual existence theorems to apply. (The precise meaning of (b) is that, as a function of u, L just grows like a constant times |u|, whereas for the usual existence theorems to apply L would have to grow like a constant times |u| r for some r such that r > 1. More sophisticated existence theorems for classical calculus of variations problems that include the brachistochrone are stated in Cesari n , 14.3 and 14.4, but these conditions are not at all simple, and in addition they depend on other results by Cesari that were to appear in a book that was never published.) In the control theory setting, however, the existence question is easily settled by a completely trivial application of the Ascoli-Arzela theorem: given A, B £ H+, let T be the infimum of the times of all trajectories going from A to B; let {£,-} be a sequence of trajectories from A to B, defined on intervals [0, Tj], such that Tj I T. Then the £y are all contained in a fixed compact subset of H+, as can easily be shown by applying Gronwall's inequality and using the fact that the right-hand side of (5.5.1) has less than linear growth as a function of x and y. Then (5.5.1) implies that the derivatives £, are uniformly bounded as well. So we can use the Ascoli-Arzela theorem to extract a subsequence that converges uniformly to a curve £ : [0,T] \-¥ H+. In principle, £ is just a trajectory of the "convexified" system, in which the evolution equations are those of (5.5.1) but the control constraint is u2 + v2 < 1 rather than u2 + v2 = 1. However, it is easy to see that £ is a true trajectory of the nonconvexified system because, if the inequality u(t)2 + v(t)2 < 1 was strict on a set of positive measure, then it would be possible to find an even faster trajectory from A to B by just reparametrizing £. So we see, once again, that in the control setting a question that is hard from the point of view of the calculus of variations can become easy or even trivial. The key point in the above argument is that there is at least one trajectory going from A to B in finite time, as can easily be verified. (If one of the points J4, B belongs to dH+, and A ^ B, then the argument would not work if the speed of light was y rather than ^/y, for in that case there would be no path joining A and B in finite time.) c
We thank F. H. Clarke for bringing this point to our attention.
136
6 6.1
The differential-geometric connection Is the standard formulation of the brachistochrone problem satisfactory?
The mathematical formulation of the brachistochrone problem, as presented in 3.1, is not completely natural, because it takes it for granted that we know that energy is conserved. More precisely, "energy conservation" is "assumed" in at least two conceptually different ways. First, the fact of energy conservation is a prerequisite for even stating the problem. This is clearly not intellectually satisfactory, because it would be much better to state the problem first, and then let the mathematical analysis discover energy conservation for us, after which we could use this new fact to find the solution. Second, the problem of "finding the minimum time curve from A to B" does not make sense for general A and B, unless the value of the energy E is specified. The problem usually studied in textbooks is another one, namely, that of "finding the minimum-time curve from A to B subject to an extra constraint E = E, for a given E." If this constraint is not given, then we really have, for each given pair (^4, B) of endpoints, a one-parameter family of problems, one for each value of E. (Alternatively, we may want to look at the unconstrained problem, in which we are also minimizing over all values of E. But this is clearly not what Johann Bernoulli or anyone else who studied the problem had in mind, and in addition is easy to solve: the infimum of the times is zero, and there is no minimizer.)
6.2
Hamilton's principal function
At this point, we digress to discuss an important idea of Hamilton, namely, the "principal" or "characteristic" function. This idea was not truly appreciated for more than century, until optimal control theory emerged to provide the right setting that would do it justice. So we will only describe it here in broad outline, following the 1944 article 4 5 by J. L. Synge. For the specific case of the brachistochrone, we will show below that the search for Hamilton's principal function leads directly to the discovery of the differential-geometric structure underlying the problem. Synge wrote that To Hamilton, optics and dynamics were merely two aspects of the calculus of variations. He was not interested in experiments. To him, optics was the investigation of the mathematical properties of of curves giving
137
stationary values to an integral of the type
v= v x y z
j ( > ' '%%d£)du-
(6 6 1}
--
Hamilton was, in fact, a great contributor—probably the gratest single contributor of all time—to the calculus of variations. Consider two points, A' with coordinates x',y',z' and A with coordinates x,y,z. Consider all possible curves connecting A' and A. To each curve corresponds a value of the above integral. [Note by HJS&JCW: Hamilton was working with Lagrangians that are homogeneous of degree one with respect to the velocities. In that case, the choice of the interval of definition of the curves is immaterial, since every curve can be reparametrized to be defined on [0,1] without changing the value of the cost integral.] Now compare these values. Is there one curve that gives a smaller value to the integral than all others? It would seem that there must be, but it would be a rash conclusion. Suffice it to say that "in general" there is a curve giving a smallest value to the integral. Then such a curve is a ray in optics. In the language of the calculus of variations, it is an extremal. Now here is Hamilton's great central idea. Regard the minimum (or stationary) value of the integral as a function of the six coordinates x',y',z',x,y,z of A' and A. This function is Hamilton's characteristic or principal function. There is nothing hard to understand about that. What is hard to follow is Hamilton's plan to develop all the properties of the extremals from the characteristic function. In fact, it is so hard that few mathematicians in the ensuing century have given serious consideration to this plan. With the rise of optimal control, "Hamilton's plan to develop all the properties of the extremals from the characteristic function" has become one of the best and most widely used tools in the modern analysis of optimal control problems, under the name of the "dynamic programming" approach. Synge's complaint that "few mathematicians in the ensuing century have given serious consideration to this plan" may have been true in 1944 but is certainly not true today. 6.3
What is Hamilton's principal function for the brachistochrone?
Let us look at the brachistochrone problem from the modern dynamic programming point of view. The natural questions to ask are
138
1. what is the state space for the brachistochrone problem, and 2. what is the problem's "principal function" in Hamilton's sense? More precisely, we would like to know 3. what, exactly, is the controlled dynamical system q=f(q,u),
q&Q,
uGU
(6.6.2)
that corresponds to the brachistochrone problem ? (and, in particular, what is the system's state space Q ?) 4. what is the cost functional
J?
and 5. what is the value function V, regarded as a function QxQ3
fe,
qt) •->• V(qi,
? i
) € l U {-00, +00}
of the initial and terminal points, as in Hamilton's
definition?
Remark 6.1. Naturally, Hamilton's definition and Synge's version of it have to be modified to adjust them to contemporary standards of mathematical precision. Synge's text quoted above seems to require the existence of an optimum for the principal function to be well defined. We take the point of view that V(qi,qt) is always well defined as an extended real number—i.e., a member o / I U {—oo, +oo}. Precisely, V(qi, qt) is the infimum of the costs of all the trajectories going from qt to qt. In particular, V(qi,qt) = +oo if and only if there exist no trajectories from qi to qt with cost < +oo. 0 We want to address these questions taking seriously the facts that energy conservation need not be known for it to be possible to state the problem mathematically, and that, consequently, there should not be a different brachistochrone problem for each value E of the energy, but only one brachistochrone problem. It is clear that J should be time. But what exactly should we choose for Q, U, and / ? Can we, for example, take Q = E 2 ? This would mean that one can specify arbitrary points qi and qt in R2 and define V(qi,qt) unambiguously. But we know that this is not so. Given qt and qt, the minimization problem as formulated in 3.1 only makes sense for a fixed value E of the energy, and only if both qt and qt belong to the E-dependent closed half-plane H+l — I.
139
Therefore there is a well defined value VE(qi,qt) for each value of E such that qt € H+ (jj and qt € H+ (—), but these values are in general different, and the infimum of all of them is zero. A common way to get around this difficulty is by stipulating that we are only interested in dropping the particle with zero initial kinetic energy. (One can even interpret Johann Bernoulli's statement of the problem, quoted in 2.1, as suggesting this possibility.) This, however, will not do, because it would have the following unnatural consequence: if we pick two points q, q' in an optimal arc T going from qi to qt, and use T(q,q') to denote the piece of T from q to q', then T(q,q') will not in general be an optimal arc from q to q' (because the particle that started falling with zero velocity at qt will not in general have zero velocity at q). In modern terminology, the problem with zero initial kinetic energy does not satisfy the principle of optimality.d Perhaps the state space is bigger than the plane? In other words, it might be the case that one can specify the position and something else at the starting time and a similar object at the terminal time. This will not do either. We cannot, for example, specify the initial and terminal positions and velocity vectors, or even just the positions and the lengths of the velocities. What can be specified is qi, qt, and the energy—or something yielding equivalent information, e.g. the length of the initial velocity vector—and from this there is no way to derive a state space S such that V is defined on S. The key problem is that Hamilton's principal function must be a function of an even number of parameters, since it is a function of the initial and terminal states, but the natural number of variables for the value function of the brachistochrone problem seems to be five, namely, two coordinates for the initial position, two for the terminal position, and one for the energy. 6.4
Lie brackets, integrals of motion, and the differential-geometric perspective
A much better way to pose the problem is to write down the equations of motion that truly correspond to Johann Bernoulli's problem as he stated it in June 1696. One should then let the mathematical analysis lead us to the discovery of energy conservation and the simplification resulting from it. This will lead us naturally to a four-dimensional problem whose analysis is best carried out from the differential-geometric perspective, by means of Lied
J a k o b Bernoulli's solution of the brachistochrone problem makes heavy use of the fact that a portion of a minimizer is itself a minimizer. So Jakob cannot possibly have been thinking that one should only consider the case of zero initial kinetic energy.
140
algebraic calculations. The true equations of motion for the brachistochrone are X =
v, w, V = —uw w = uv y=
(6.6.3)
Here x, y, v and w are the state variables, and u is the control. The values of u are allowed to be arbitrary real numbers. The variables x and y are the horizontal and vertical coordinates of our moving point, and v, w are the components of the velocity vector. The requirement that the point is freely falling "under the influence of its own weight" means that the force effectively acting on the point should be equal to a vector proportional to [0, — g] plus a "virtual force" that does no work, i.e., is perpendicular to the velocity. Equation (6.6.3) captures these requirements, by introducing a virtual force vector of the form [uw, — uv], where u is an arbitrary "control," taking values in E. Using "t" to denote "transpose," and writing q = [x, y, v, w]^, we get a familiar equation, namely, q = F(q) + uG(q),
(6.6.4)
where F = [v,w,0, —gtf, and G = [0,0, — w,v]^. This is a 4-dimensional system, with state space Q equal to M4, and control space U equal to E. 6.5
The diamond Lie algebra
Having expressed our equations of motion in the form (6.6.4), the first thing that one should do, from the differential-geometric perspective, is to compute the iterated Lie brackets of F and G, and determine the Lie algebra structure of our problem. Precisely, we use L to denote the "accessibility Lie algebra" of our system, that is, the Lie algebra of vector fields generated by F and G. Our first task will then be to compute L. It turns out that the four vector fields F, G, [F, G] and [F, [F, G}} are linearly independent over M. (that is, there exists no nontrivial linear combination of them with constant coefficients that vanishes identically), and in addition the relations
141
[G,[F,G]]=F,
(6.6.5)
[F, [F, [F, G}}} = [G, [F, [F, G}]\ = 0
(6.6.6)
hold. It then follows easily that L is four-dimensional. As an abstract Lie algebra, L is isomorphic to the diamond Lie algebra G4, a solvable Lie algebra that plays an important role in representation theory (cf. 4 , p. 59). This Lie algebra has a basis H,P,Q, E, with the commutation relations [H,P}= [H,Q] =
Q, -P,
[P,Q]= [HtE\=
E, [P,E} = [Q,E} = 0.
(To construct an isomorphism $ from G4 to L, define * ( P ) = [F,G], *(Q) = F, *(ff) = G,*(E)
= ~[F, F,G\] •
A concrete realization of G4 as an algebra of differential operators—acting on smooth functions on the real line—can be defined by "quantizing the harmonic oscillator," i.e., by taking P Q E H
dx ' multiplication by x, identity,
i<-PW) = i(-^ + * ! ),
so that H is the Hermite operator. 6.6
Energy conservation
We now compute the accessibility distribution A generated by L. Formally, q i-> A(q) is the map that assigns to each state q the linear space A(q) = {X(q)
:XEL}.
To find A we first observe that, even though the vector fields F, G, [F, G] and [F, [F, G]} are linearly independent over R, a simple computation shows that G = <;1F + <;2[F,G} +
<;3[F,[F,G}},
(6.6.7)
142
where the Q are smooth functions of q, given by v
Ci=—,
9
w
C2 =
9
, Cs =
v2 +w2 5-2-. V
6.6.8
From these facts one can easily show that every iterated Lie bracket of F and G is a linear combination of F, [F,G], and [F,[F,G]] with smooth coefficients. Since F(q), [F,G](q), and [F, [F,G]](q) are linearly independent at each point q E E 4 , we can conclude that A is 3-dimensional at each point. This means—using, for example, Probenius' theorem on the existence of integral manifolds—that, at least locally, there is a nontrivial integral of motion, i.e., a function with nonzero gradient which is constant along all integral curves of F and G, and then also along all solutions of (6.6.4). This integral of motion can be easily computed and turns out to be the energy E, given by E{x,y,v,w)
v2 + w2 = — - — + gy.
(6.6.9)
Notice that E is a smooth function on E 4 , with nowhere vanishing gradient. Therefore the level hypersurfaces of E—i.e., the sets SE^
{q : E(q) = E] ,
for E € E—are smooth 3-dimensional manifolds. (The set SE is given by the equation V=
2E - v2 - w2 Tg '
which shows that each SE is the graph of a smooth function (x,v,w) 6.7
Naive application of the Maximum
H-> y.)
Principle
The fact that our 4-dimensional state space E 4 is foliated by 3-dimensional "leaves" SE has profound implications for the optimization problem. Indeed, if we try to find the optimal trajectories by applying the Maximum Principle, we get nothing. This is a consequence of more general result, namely, that, ( # ) whenever a control system satisfies a nontrivial "holonomic constraint" such as E = constant, then the Maximum Principle is uninformative, because every trajectory is an extremal. (The word "nontrivial" here means that VE never vanishes.)
143
Remark 6.2. The reason for ( # ) is as follows. The necessary condition given by the Maximum Principle for a trajectory-control pair (£, n) to be optimal is that (£, n) be either a "normal extremal" or an "abnormal extremal." Moreover, being an abnormal extremal is a necessary condition for the pair (£, n) to be such that the terminal point of £ does not belong to the interior of the reachable set from the initial point. If a nontrivial constraint such as E = constant is satisfied, then the reachable set from any initial point q is contained in a submanifold of the state space of positive codimension. Hence the reachable set has empty interior. Therefore all trajectory-control pairs are abnormal extremals. ^ 6.8
Controllability
We now investigate the controllability properties of our system. First of all, it is clear that two points qi, qt cannot be joined by a trajectory of (6.6.3) unless they belong to the same leaf SE- We now show that the converse is true, that is, any two points qi, qt belonging to the same leaf of (6.6.3) can be joined by a trajectory. In other words, the restriction of (6.6.3) to each leaf is completely controllable. To prove our statement, we use the technique of "Lie saturates" introduced by Jurdjevic and Kupka. Fix E and let (6.6.3.E) denote the restriction of (6.6.3) to SE- By construction, the system (6.6.3..E) has the accessibility property. So to prove complete controllability it suffices to establish complete controllability of the Lie saturate family of vector fields C. It is clear that F, G and — G belong to C. So the desired conclusion will follow if we show that — F G C. But this follows easily, because the identity [G, [G, F]\ = — F implies e*GFe-*G
6.9
=
_
p
The existence problem
A precise definition of Hamilton's principal function V was given in Remark 6.1. It is clear that, if qi and qt belong to R 4 , then V(qt, qt) < +00 if and only if qi and qt belong to the same leaf SEIt is then reasonable to ask whether an optimal control steering qi to qt exists whenever V(qt,qt) < +00. Equivalently, we want to know if the infimum that occurs in the definition of V is in fact a minimum whenever it is finite. The usual way to prove existence of an optimum is to pick a sequence {??j}j£N of controls steering qi to qt with costs Cj that converge to the infi-
144
mum, look at the corresponding trajectories £,, and then try to produce a subsequence {r)j(e)}e€T$ of {^}jgN that converges in some sense to a limiting control TJOO and is such that the £,•(£) converge to a trajectory £00 of rjco and the costs Cj(^) converge to the cost CQQ of (£oo,»?oo)This method may fail for various reasons. The simplest one is that the £j may "go off to infinity." That is, there may not exist a subsequence {£j(e)}teN that stays in a fixed compact set. In the following definition, we give a name to the property that this particular obstruction to the existence of optimal controls does not occur. Definition 6.3. We say that an optimal control problem with dynamics (6.6.2) and Lagrangian cost functional with Lagrangian L satisfies the properness condition if for every pair qi,qt of states such that V(qi,qt) < +00 there exist a number c € l and a compact subset K of Q such that (1) V(quqt)
(2) whenever rj is a control steering qi to qt with cost < c, and £ is the corresponding trajectory, it follows that £ is entirely contained on K. 0 When a problem satisfies the properness condition, it still does not follow that optimal controls exist, because the minimizing sequences {T]J}J€N need not have subsequences that converge in any reasonable sense. On the other hand, it is commonly agreed that in this case optimal controls really exist if one enlarges the original class U of admissible controls by adding to it some "generalized controls." We will show that this is indeed what happens for our brachistochrone problem, and that in this case the generalized controls can be easily described in concrete terms, and turn out to be none other than "impulse controls." To begin with, we prove the following. Proposition 6.4. The minimum time problem for the brachistochrone system (6.6.3) satisfies the properness condition. Proof. Let qi, qt be two points of M4 having the same energy E. Fix an arbitrary c € E such that c > V(qi,qt). We have to find a constant A such that ||£(i)|| < A for all t € [0,r] whenever r < c and £ : [0,r] (->• K4 is a trajectory of (6.6.3) for some control t 1-4 r)(t) such that £(0) = qi and &T) = qt. Write £(*) = [x{t),y(t), v(t),w(t)]*. Let
9(t) = [v(t)Mt)V, at) = Kt)M*)V-
145
Then (6.6.3) implies that dt
|| C (t)|| 2 = - 2
9
•«,(*).
Therefore dt
2 2 IIC(*)IIS = 2g\w(t)\<9 +w(t)
+\\at)\\
and then, if we write
V>(*) = |llC(*)ll 2 -IK(o)|| 2 |, we find that < [\g2 + \K(s)\\2) ds < c(g2 + ||C(0)||2) + / V(*) ds. Jo Jo Then Gronwall's inequality implies that m
W)
< c(g2 + | | C ( 0 ) | | V < c{g2 + ||C(0)|| 2 )e c .
It then follows that
IICWII 2
0 6.10
• Application of the Maximum Principle on manifolds
Using the integral of motion E, we can regard E 4 as foliated by the 3dimensional level hypersurfaces of E, and conclude that every trajectory is contained in one of the leaves Sg. If E G E, then the Maximum Principle on manifolds (cf. 3T ) can be applied to the problem restricted to Sg. The conclusion turns out to be exactly the same as that for the unrestricted problem, except that the nontriviality condition for the momentum is slightly stronger. This extra strength is actually fundamental for, as explained in 6.7, without it the conclusion would be completely uninformative. If [a, b] 3 t H-> £„(£) € E 4 is an optimal trajectory for our problem, corresponding to a control [a, b] 3 t i-> n* (t) € E, then the usual version of the Maximum Principle yields the existence of a nontrivial momentum vector field [a, b] 3 t !->• 7r(t) 6 E 4
146
(often called an "adjoint vector") that satisfies the adjoint equation, the Hamiltonian maximization condition, and the additional condition that the maximized value of the Hamiltonian is a nonnegative constant 6 . The principle applied to the restriction to the leaves yields the stronger conclusion that n(t) cannot be orthogonal to the leaf Sg that contains £*. (In differential-geometric terms, -?v(t) is really a covectoron Sg at £*(£), and has to be nontrivial as such. Equivalently, if one insists on regarding 7r(i) as a vector in R 4 , then it should not be orthogonal to the tangent space to Sg at £*(£).) (Naturally, one could also avoid using the Maximum Principle on manifolds by invoking instead the fact that Sg is identified with M3 in an obvious way, since it is the graph of a smooth function (x,v,w) i-> y(x,v,w). This would amount to eliminating y from the equations by substituting for it its expression in terms of the other variables, for each fixed E.) The Hamiltonian is the function R4 x E x M4 3 (q,u,p) >-> R defined by H(q,u,p)
=
(6.6.10)
where the functions ip, ip are defined by
•
Since u is completely unrestricted, H cannot have a maximum as a function of u unless tp(q,p) = 0. So V"(6(*).T(*)) = 0
for
a11
*•
(6.6.11)
t,
(6.6.12)
Differentiation of (6.6.12) yields p(^(t),Tr{t))
= 0 for all
where p{q,p)d=
147
A further differentiation—together with the fact that [G, [F, G]] = F—yields 0(M*)Mt))
+u(t)
= 0,
(6.6.13)
where e(q,p)^{p,[F,[F,G]](q)). This, together with the equalities ip = p = 0, determine u as a function of q, i.e., provide an optimal control in the form of a state space feedback law u = u(q), as we now show. First of all, (6.6.7) implies that *l> =
(6.6.14)
along our trajectory. Next, we use the crucial fact that ¥>(*(«),&(t)) 5* 0,
(6.6.15)
which follows from the stronger version of the Maximum Principle. (The precise argument is as follows. The covector n(t) must be nontrivial as a linear functional on the tangent space at £*(<) of the leaf containing £*(t). Since this space is spanned by the three vectors F(^(t)), [F,G](£*(£)) and [F,[F,G]](Ut)), one of the numbers ¥>(*(*),&(*)). p{*(t),t.{t)), 0(TT (*),£,(*)) must be nonzero. We know that p(ir(t),t;*(t)) = 0. If <^(7r(t), £*(*)) = 0, then (6.6.13) would imply that 6{n(t), £» (£)) = 0, and we would get a contradiction. Hence (6.6.15) holds.) Therefore (6.6.13) implies u{t)
(6 6 16)
—
--
that is U{t)
~
(«(t),F(Ut)))
(6 6 17)
•
- -
Moreover, it follows from (6.6.14), together with the fact that I/J = p = 0 along our trajectory, that Ci(Ut)M^(t),^(t)) Then (6.6.16) implies
+ C3(Ut))0(At),Ut))
= 0.
(6.6.18)
148
as long as C3 (£*(*)) ^ 0. Therefore u{t) = u(Ut)),
(6.6.20)
where , def
Ivg
uj(q) = u{x, y, v, w) =
(6.6.21) 2 2 . v2 + w'z It is then clear that u is a smooth function on the complement of the set S = {(x,y,v,w)
:v2+w2
= 0} .
(6.6.22)
The "singular" set £ corresponds, naturally, to the points where the kinetic energy vanishes, which as we know from our other analyses is where some singular behavior occurs. If we define Z(q)=F(q)+Lj(q)G(q), we see that Z is a smooth vector field extremals in the sense of the Maximum The resulting trajectories are easily use the feedback u = u(q), the dynamic
(6.6.23) 4
on M \E whose trajectories are the Principle. computed. First, observe that, if we equations become
x = v, Vw, v = —2g(v2 + w2)~1vw , 2 1 2 L ...2w..2 w = 2g(v22_+w )~ v —„ g 2 2 1 2 = {v + w )' {2gv - g{v2 + w2)) = g{v2 +w2)~1(v2 -w2).
(6-6-24)
If we introduce a complex variable z denned by z = v + iw, then the last two equations of (6.6.24) say that
i = ^ 1
that is, z=9—.
(6.6.25)
z
\\ '* The formulas of (6.6.25) define a well known dynamical system of the complex plane C. The right-hand side of the equations is a smooth vector field on C\{0}. The integral curves are circles passing through the origin and having center on the real axis (plus the imaginary axis, parametrized by E 3 t —> (0, — gt), which is also a solution since along it (6.6.25) just says that i = — gi). The parametric equation of the circles is z(t) = r{l + a e^), where a £ { + 1 , - 1 } and r is a positive real number.
(6.6.26)
149
Once it has been shown that the velocity vector {v,w) evolves according to (6.6.26), a trivial integration shows that the trajectories described by the position vector (x, y) are cycloids. 6.11
The complete synthesis and Hamilton's principal function for the 4-dimensional brachistochrone problem
We haven't yet completely answered all our earlier questions and determined Hamilton's principal function V. Let qi, qt be two points of R4. It is clear that V(qi, qt) = +oo unless both <7i and qt belong to the same energy hypersurface, because if the energies are different then the points cannot be joined by a trajectory, and the infimum of the empty set is +oo. On the other hand, if the points qi and qt belong to the same level hypersurface Sg, then we know that they can be joined by a trajectory, so V(qi, qt) will be finite. Moreover, it is reasonable to expect that such a curve will exist, and if it does not then we ought to be able to explain why. Write qi = (Ai,a,i), qt = (At,at), where Ai} at, At, at, belong to R 2 . The classical analysis carried out earlier seems to suggest that, once Ai, At and E are specified, there will be only one optimal arc joining them. So, if we specify Ai and At but do not fix the energy, then there should be a 1parameter family of such arcs (one arc for each value of E). Yet we now seem to need a 3-parameter family, because we want to accommodate all possible pairs (oj, at) (i.e., four parameters) as long as they give rise to the same energy (one constraint, and 4 — 1 = 3 ) . Equivalently, given E, we ought to be able to choose the directions 0i, Ot, of the velocity vectors ai and at in a completely arbitrary way. (Their magnitudes are determined by the fact that the energy at qi and qt must equal E.) But given Ai, At and E there is a unique cycloid from Ai to At corresponding to the energy E, and in particular the directions 6i, 6t at Ai, At of this cycloid are completely determined by Ai, At and E. In other words, we ought to be able to choose freely the two angles 9i, 6t, but it appears that we cannot do so. The solution of this apparent paradox is as follows. Suppose we fix an energy level E. Given any Ai, the set C(Ai, E) of those a's such that (Ai, a) € Sg is either empty or a circle, except for the "borderline" case when (Ai,0) £ Sg and C(Ai,E) consists of a single point. If C(Ai,E) is a circle, then through each point of {Ai} x C(Ai,E) there passes an integral curve of Z. The set of all points lying on such curves is 2-dimensional, and intersects the 1-dimensional set {At} x C(At,E) at just one point. This means that there is a unique pair (a.i,at) such that (Ai,a,i) and (At,at) lie on the same integral
150
curve of Z. The portion of that curve joining (At, en) to (At,at) is, of course, our famous cycloid. What is then the optimal trajectory joining (Ai, at) to (At, at) for general ai, at"? What is the corresponding value of V(Ai,ai,At,at)'? The answer is quite simple: the "optimal trajectory" is obtained by 1. applying an impulse control to rotate a; to a; instantaneously, 2. then following the cycloid from (Ai,cn) to
(At,at),
3. finally applying another impulse control to rotate at to at. (These are not true trajectories, of course, but it is easy to translate the above into a statement about a minimizing sequence of true trajectories.) The value V(Ai,cii, At,at) is therefore equal to V(Ai,a,i,At,dt). 7 7.1
Five modern variations on the theme of the brachistochrone Hestenes' "variation of the brachistochrone problem"
In 1956, M. Hestenes wrote (in
18
, p. 64):
... it is interesting to speculate on what type of brachistochrone problem the Bernoulli brothers might have formulated had they lived in modern times. An interesting variation of the brachistochrone problem can be described as follows: Consider an idealized rocket ship moving in a vertical plane. The ship is to be considered as a particle acted upon by a gravitational force g and a thrust of constant magnitude F and variable inclination (p ... There are no other forces acting on this ship. The pilot steers the ship by controlling the inclination
y = y(t)
0
subject to the constraints (with unit mass) x — F cos tp
y = F sin (p — g
and with x, y, x, y, prescribed at t = 0 and t — T, one which minimizes the time of flight T.
151
It can be shown that the path of least time has the following interesting property: there is a line L with direction fixed in space, and with position fixed relative to the ship, such that if a particle M is moving along L with constant speed, the ship will traverse a path of least time by keeping the thrust aimed at the particle M. Hestenes later on proceeds to solve the problem. Let us present a solution based on optimal control. The dynamical equations can be rewritten as x =z
z = F cos ip
y =w
w = F sin (p — g.
If [0,T] 9 ( 4 (x*(t), z*(t),yt(t),w*(t)) e I 2 is an optimal trajectory, corresponding to a control function [0, T] 3 t t-> ip* (t) € M, then the Maximum Principle yields a nonnegative constant -KQ and and a vector-valued function [0,T]3t^n(t)
=
(nx(t),irz(t),Try(t),Trw(t)).
The Hamiltonian is given by H(x,z,y,w,ip,px,pz,py,pw,po)
= px z+pz Fcosip+py w+pw (Fsmtp-g)-p0
.
The adjoint equation becomes •kx = 0
7rz = — 7rx
Then, if we write u)(t) — {nz(t),Ttw(t)), u(t) =
-ky = 0
nw = — •Ky .
we have tui,
(JQ +
where u>o and u>i are constant two-dimensional vectors. The Hamiltonian maximization condition then says that the vector u(i) = (cos ip(t), sin ip(t)) is given by
""» = »
•
<"•»
The Hamiltonian must vanish along our trajectory. Therefore, ir(t) can never vanish, for if 7r(t) = 0 it would follow that TT0 = 0, contradicting the nontriviality condition. Hence u>(t) cannot vanish identically. So u is a linear function which is not identically equal to zero. Therefore there is at most one value of t such that uj(t) = 0. It follows that the function u of (7.7.1) is well denned except possibly at one point i. Moreover, if such a i exists, then cj(t) = (t — i)cji, so if t < t u(t) = {—wi ui\ if t > i.
152
The "line L" of Hestenes' "interesting property" is the line Lt with direction <jj\, with a point At having a "position fixed relative to the ship" given by At = (x*{t),y*(t))
+u}0.
So in fact Lt is moving along with the ship. If a point qt on Lt moves with constant velocity UJ\, starting at Ao at time 0, then the point's position at time t is A* + toj\. Therefore the vector from the ship's position to qt is precisely u)(t). We have shown that the direction of the thrust is uj(t), from which Hestenes' "geometric property" follows. 1.2
The reflected brachistochrone problem
In 3, our analysis of the brachistochrone problem was carried out in a halfplane, by looking only for optimal trajectories that are entirely above the x axis, as in Johann Bernoulli's original problem. Let us pursue Hestenes' idea that "it is interesting to speculate on what type of brachistochrone problem the Bernoulli brothers might have formulated had they lived in modern times." An obvious question for the brothers to ask would have been that of finding out what happens if we work in the whole plane. This is in fact a much more natural mathematical setting for the brachistochrone problem. We shall refer to this new question as the reflected brachistochrone problem. (It is because we were thinking all along of this extension that we wrote y/\y~\ rather that ^Jy in (5.5.1)). For a recent study of the reflected brachistochrone problem from the dynamic programming perspective, see Malisoff 2 8 ' 2 9 . Solving this more general problem amounts to finding the light rays when the medium is the whole plane, and the speed of light is ^/\y\. In other words, we want to find the minimum time paths for the control system (5.5.1) with control set U given by (5.5.2) and state space equal to the whole plane. Notice that this problem is "completely controllable," in the sense that (C) any two points A, B of R2 can be joined by a feasible path, even if they lie on opposite sides of the x axis. To prove (C), observe first that the desired conclusion is trivial if A and B both lie in the open upper half plane or in the open lower half plane. So it clearly suffices to prove the conclusion if A lies in the open upper half plane and B belongs to the x axis. Moreover, we can assume that A lies on the same vertical line as B, for otherwise we can pick A' on the same vertical as B but strictly above B, and join A to A' first in finite time. So let us suppose
153
that A - (a,a) and B{a,0). Then the curve [0,a] 9 s 4 ( a , a - s ) certainly goes from A to B. Along this curve, dy = -^/ydt, so dy
ds
Therefore the total time T required to move from A to B along this curve is given by fa J0
ds
n
r-
yfa-s
so T < +oo, and (C) is proved. The crucial point here is that the right-hand side of (5.5.1) vanishes along the x axis, but this does not prevent the existence of feasible paths crossing the x-axis in finite time, because the "speed of light function" (x,y) <-+c(x,y) = y/\y] is not Lipschitz near the x axis. Remark 7.1. If the function c was Lipschitz-continuous, then the well known uniqueness theorem for ordinary differential equations would imply that every solution passing through a point on the x axis is a constant curve. This would obviously render it impossible for feasible curves starting at a point not on the x axis to reach the x axis in finite time. An important example of this situation is the case when the speed o light is \y\ rather than y/\y\. That case was also studied by Johann Bernoulli, as is clearly seen in the text we quoted at the end of 2.2, where he says that if, for example, the velocities were as the altitudes, then both curves [HJS & JCW: that is, the brachistochrone and the tautochrone] would be algebraic, the one a circle, the other one a straight line. It turns out that the "brachistochrones" for the case when the speed of light is \y\ are half circles whose center lies on the x axis. Moreover, these circles take infinite time to reach the x axis, precisely because the function J/ — ' >• |y| is Lipschitz. ^ We have seen that the complete controllability of (5.5.2) depends essentially on the non-Lipschitzian character of its speed of light function. However, this same non-Lipschitzian character also renders the Maximum Principle inapplicable, in its classical and nonsmooth versions (including the so-called "Lojasiewicz version," cf. Sussmann 35), since all these results require a Lipschitz reference vector field. Can we still use necessary conditions to find the solution, perhaps by invoking a more general version of the Maximum Principle that would be applicable to this nonsmooth, non-Lipschitzian case?
154
Suppose, for example, that we want to find an optimal trajectory £ going from A to B, where A lies in the upper half-plane and B is in the lower halfplane. Then one can easily show, first of all, that an optimal trajectory £ exists, using the Ascoli-Arzela theorem as before (cf. 5.2). Next, using the usual necessary conditions for optimality, e.g. the Euler-Lagrange equation or the classical version of the Maximum Principle, one shows that any portion of an optimal curve which is entirely contained in the closed upper half plane or in the closed lower half plane is a cycloid given by (3.3.5), or a reflection or such a cycloid with respect to the x axis. Next, one sees that £ cannot traverse the x axis more than once. (This is a very straightforward consequence of Lemma 5.1.) Therefore £ must consist of a cycloid T\ going from A to a point X in the x axis, followed by a second cycloid 1^ going from X to B. It remains to find X. Naturally, X is the only point on £ in a neighborhood of which the usual smooth and nonsmooth versions of the Maximum Principle are inapplicable. On the other hand, it is easy to produce a nonrigorous formal argument yielding the missing condition, as we now show. Let us apply the Maximum Principle formally, even though the technical conditions required for the validity of the usual versions are not verified. The maximized value of the Hamiltonian is clearly equal to ||p||\/[y| — po, and we know that this number is in fact equal to zero. So \\p\t)\\V\y\=Po
for all
t.
(7.7.2)
If we multiply (5.5.6) by \y(t)\, and use (7.7.2), we find
|y(*)l ( l + ^ ) » ) = M * ^ K = ^ .
(7.7.3)
The adjoint system (5.5.4) tells us that p is constant on each time interval that does not contain the point r such that £(r) = X. If we make the Ansatz that p is actually constant on the whole time interval, i.e., that the constant values of p on both sides of T are equal, we find that the cycloids Ti, T2 must be such that the quantity C=\y(x)\(l+y'(x)2).
(7.7.4)
which is constant along each Tj, actually has the same value for both curves. This extra condition determines X. The above argument is, of course, not rigorous. It turns out, however, that the recent "very nonsmooth" versions of the Maximum Principle developed by one of us—cf. Sussmann 39>4o,4i,42,43,44—apply to this problem and provide a
155
rigorous justification for our formal argument. The reason for this is that the new results do not require Lipschitz continuity—or even continuity—of the reference vector field. All that is needed is that the flow maps generated by the reference flow be differentiable in an appropriate sensed Remarkably, our "extra condition" has a simple geometric meaning. Recall that the cycloids of Johann Bernoulli's problem are generated by a circle rolling along the x axis. The extra condition then says that
(C) The rolling circles that generate the upper and lower cycloids must have equall radii. 7.3 A modest claim If we are going to "speculate" as Hestenes did, it is natural to ask not only what questions the Bernoulli brothers would have asked if they had lived in modern times, but also what kinds of answers they would have liked. This is of course a hard question to answer, and anything we say will have to be highly subjective. Granted this, we are firmly convinced that 1. The Bernoulli brothers would have liked the "interesting property" of the solution of Hestenes' problem (which is almost certainly the reason why Hestenes chose to emphasize this property) and also that 2. they would have found elegant the geometric condition of our solution of the reflected brachistochrone problem. This seems clear to us but, since de gustibus non est disputandum, we shall not pursue this particular argument further. 7.4
The brachistochrone with friction
In the problem studied in 6, we assumed that there was no friction. This implied that energy was conserved and that the four-dimensional state space was foliated by three-dimensional orbits of the system's accessibility distribution. ' F o r the specific case of the reflected brachistochrone, classical differentiability suffices. For more general problems, one needs other theories of "generalized differentiation," as e x p l a i n e d in 39,40,41,42,43,44
156 It is then natural to consider the more general case of the brachistochrone with friction, and to expect that for this problem there will be no nontrivial conserved functions of the state, so the accessibility distribution will be fourdimensional, and the system will have the accessibility property. We now outline some of the results, without carrying out a complete analysis. The dynamical behavior of the system is given by the equations x—
v, w, v = —uw — pv, w = uv — pw — g, y =
(7.7.5)
where, as before, u is an unbounded scalar control. The parameter p is a real number, the friction coefficient. The problem with friction corresponds to p > 0. When p = 0 we get the frictionless case considered earlier. If we write
F(q) =
v w -pv -pw
G(q) =
0 0 -w v
then (7.7.5) takes the form q=
F(q)+uG(q).
(7.7.6)
From now on, we assume that p ^ 0, since the case when p — 0 has already been discussed. Let L be the Lie algebra of vector fields generated by F and G. In order to analyze the structure of L, it will be convenient to introduce five new vector fields Z, V, W, X, and Y. The following eight formulas express F, G, and [G, F] in terms of the usual differential operator notation for vector fields, and define Z, V, W, X, and V:
F = vdx + wdv — pvdv - {pw + g)du vd„ G= - wdv + [G,F] = -wdx + vdy — gdv wdu Z= vdv + V = dv
w = X = Y =
dx 8y
157
Let S be the linear span of the eight vector fields F, G, [G,F], Z, V, W, X, and Y. It then turns out that S C I , because Z = a(F + [G,[G,F}}), W = 7 ([F, Z] + F + PZ), V = [G,W], X = pV-[F,V], Y = PW- [F, W], where 1 , 1 a = - and 7 = — . P 29 On the other hand, the formulas [F, [G, F]] = -p[G, F] + 2gX - 2pgV, [G,[G,F]] = -F-pZ, [F,Z] = [F,V} [F,W] [F,X] [G,Z] [G,V] [G,W]
= = = = = =
-F-pZ-2gW, -X + pV, -Y + pW, [F,Y] = 0, [G,X] = [G,Y] = 0, -W, V,
show that [F, S] C S and [G, S]CS. So L = S. A straightforward calculation shows that F, G, [G,F], Z, V, W, X, and Y are linearly independent over E. So L is an S-dimensional Lie algebra. Let A be the accessibility distribution of our system, so A(q) = {h(q) : h 6 L} if q 6 K4. Since the vector fields X, Y, V and W belong to L, it is clear that A(q) = M4 for every q. So our system has the accessibility property. An analysis similar to the one carried out in the previous section for the frictionless problem yields an optimal "feedback control" 2gv _Cy2 ~ vw u = — - — 5 - + 2 np — 25 - — r2> v + w involving an additional parameter £, given by _ TVy_
TTx
{7.7.7)
158
(Here irx, ny are the X and Y components of the momentum, which are easily seen to be constant along extremals.) In other words, through every point q € E 4 there passes a one-parameter family of optimal trajectories, depending on the parameter C,. 7.5
The classical brachistochrone as a degenerate problem
The family of brachistochrone problems with friction coefficient p constitutes an "unfolding" of the classical frictionless problem. For p ^ 0, the situation is generic, in the sense that the accessibility distribution is four-dimensional. The generic behavior of the time-optimal trajectories for such problems is precisely the one described here: the control is given in "feedback" form, depending on the state and an extra parameter. Formula (7.7.7) shows that the extra parameter only occurs as part of a term that is multiplied by p. It follows that when p = 0 the extra parameter disappears, and we get a state space feedback as determined earlier. 7.6
Deriving Snell's law from the Maximum Principle
Our third "modern variation" will involve going back to ancient times, long before 1696, and considering Fermat's minimum time principle and Snell's law of refraction. We have already explained how Snell's law is intimately related to the brachistochrone problem, since it played a crucial role in Johann Bernoulli's solution. So, if we pursue Hestenes' line of inquiry further, and speculate on other questions that the Bernoulli brothers might have asked "had they lived in modern times," it is easy to imagine them asking the following. (Q) Given that in these modern times powerful general theories have been developed, going far beyond the particular examples that were dealt with by ad hoc methods up to the 17th and early 18th centuries, do these general theories actually cover all these special examples whose study preceded and begat them? As an important special case of this question, the brothers would certainly want to know whether the necessary conditions of the calculus of variations and optimal control theory apply, in particular, to the derivation of Snell's law. So let us consider a medium such as the one we described when discussing Snell's law, in which the speed of light is constant on each of the two sides H-, H+ of a hyperplane H in K n , but the constant values c_, c+, are different.
159
Let us ask the obvious question, namely, whether Snell's law be derived from the Maximum Principle. It is easy to see that there is no way to formulate this question as a minimum time problem with a controlled dynamics q = f(q,u,t) in which / is continuous with respect to q, since | | / | | has to jump as we cross H. In the usual versions of the Maximum Principle, it is not required that / be differentiate, or even continuous, with respect to the control u. But here the situation is different, since we need to be able to deal with an / which is discontinuous with respect to the state variable q. Once again, it turns out that the recent versions the Maximum Principle developed in Sussmann 39>4o,4i,42,43,44 apply to this problem as well, and lead to Snell's law. The reason is that in these version all that is needed is that the reference trajectory £* arise from a flow {$t',t} consisting of differentiable maps, and there is no need for this flow to be generated by a continuous vector field. 7.7
Was Johann Bernoulli close to discovering non-Euclidean geometry?
Given a "speed of light function" (x,y) •-* c{x,y) on some plane region, the time along a curve is given by _ y/dx2 + dy2 c(x,y) since y ^f + -^ = c(x,y). This means that, if we adopt a slightly different point of view, and think of time as "length," then the function c defines a Riemannian metric on the set fi c = {{x,y) : c(x,y) > 0}, provided that c is sufficiently smooth. Let us assume that c(x,y) > 0 whenever y > 0, as was the case in our two examples. Then c gives rise to a Riemannian metric 2
_ dx2 + dy2 c(x,y)2
on the upper half plane. The "brachistochrones" are then the minimum length curves, that is, the geodesies. In the case of the "true brachistochrone," when c(x,y) = y/\y~\, these geodesies are cycloids (plus vertical segments) but maximal geodesies have finite length. On the other hand, in the case when c{x, y) = \y\, so that the Riemannian metric on the upper half plane is given by ds2=y-2(dx2+dy2), the geodesies are, as we have seen, half circles of infinite length.
(7.7.8)
So, using modern terminology, the circles that occur as solutions of this modified version of Johann Bernoulli's brachistochrone problem are the geodesies of a Riemannian metric on the upper half plane given by (7.7.8). We now know that this Riemannian metric defines the so called Poincare half plane, which is the simplest model of non-Euclidean geometry. This raises an obvious question: how close did Johann Bernoulli get to solving the famous problem of the provability of Euclid's fifth postulate? What renders the question particularly interesting is that 1. Johann Bernoulli lived at a time when the problem of Euclid's parallel postulate was very much at the center of mathematical research. For example, G. Saccheri published in 1733 his book 3 2 , entitled Euclides ab omni naevo vindicatus (that is, "Euclid vindicated from every flaw"), where he pushed very far the development of non-Euclidean geometry, only to end up dismissing it by claiming that a particular conclusion he had reached had to be rejected because it was "repugnant to the nature of a straight line." 2. The cycloids that occur in the original brachistochrone problem have many of the incidence properties of Euclidean lines, and therefore come close to satisfying all Euclid's postulates other than the fifth. But they do not yet provide a good model for non-Euclidean geometry because they "go to infinity" (i.e., approach the x axis) with finite length. 3. The half-circles of the Poincare model have all the desired incidence properties and also have infinite length, so they furnish a perfect model where all the Euclidean axioms other than the fifth are true. 4. Johann Bernoulli did study these half circles, as we pointed out in Remark 7.1. So in a sense he did discover the Poincare model. 5. Johann Bernoulli was definitely interested in the question, since he made significant contributions to the early stages of the development of the idea of geodesies on curved surfaces, by studying the curves of minimum length on a surface, which later 9 came to be known as "geodesies." 6. The solution of the problem of the independence of Euclid's postulate is extremely simple by today's standards, even though mathematicians took about 2200 years to find it. Actually, the solution is so simple that it can be explained to any undergraduate with a moderate understanding of plane analytic geometry. All that is required is to take any 9
T h e word "geodesic" was coined much later by Liouville (1809-1892).
161
reasonable way to compute "time" along curves, rename "time" by calling it "length," and then call the length-minimizing curves (that is, the "brachistochrones") "lines." For example, the case when c = \y\ leads us to taking the half circles with center on the x axis (plus the vertical half lines) and calling them "lines." Once this trivial renaming has been carried out, it is easy to verify that all of Euclid's postulates other than the fifth hold, but the fifth postulate fails. So Johann Bernoulli had essentially all the tools he needed to solve the problem and prove that Saccheri was wrong, by showing that Euclid's fifth postulate does not follows from the other ones. Why didn't he do it? One can surmise that he lacked the understanding of the geometric significance of the cycloids and half circles that he had found as solutions of his minimization problems. He certainly saw these curves as "geometric," but failed to see that they could serve as the "lines" of a different geometry. In modern terms, this is a trivial remark, namely, that one can turn around the well known fact that the segment is the shortest path between two points, and make it the definition of the notion of segment, while at the same time using a more general prescription to measure "length." But the idea that minimization problems can be the source of new geometries evolved very slowly over two millennia, and took a long time to crystallize, reaching final form in the nineteenth century, in the era of Gauss and Riemann, when it revolutionized mathematics and then moved on, at the beginning of the twentieth century, to bring a a fundamental change in our understanding of the physical universe. One can already find the idea in a rudimentary form in Fermat's least time principle'1, which can be interpreted as saying that light rays follow the shortest paths, with the proviso that "length" must now be defined in terms of a new and more refined geometry, in which the physical properties of space influence the geometry, and shortest paths can be curved or bent. This interpretation is suggested, for example, by G. Lochak, in 27 , p. 57-58 [our translation]: [Fermat's principle] anticipated the greatest revolution to be undergone by geometry in the future, namely, the discovery of non-Euclidean geometries, because it showed for the first time that the physical properties of a medium might cause the shortest paths not to be straight lines. ''which, as we have seen, was a direct ancestor of Johann Bernoulli's solution of the brachistochrone problem
162
In fact, Leibniz definitely uses the word "shortest" in his analysis of the refraction problem in his 1684 Nova methodus paper, showing that he was at least entertaining the possibility that light rays in a medium with variable speed of light truly were "shortest paths" in some new geometry. But it would appear that, in spite of these earlier glimpses of what would become one of the greatest scientific ideas of all times, the era of Johann Bernoulli was not yet ready to take the trivial step of turning around the lenght minimization property of segments. Johann Bernoulli passionately wanted to succeed and be famous, and was obsessed by the desire to surpass his contemporaries, especially his brother Jacob and his son Daniel*. It is thus sadly ironic that he, of all people, got so close to solving an important 2000-year old problem but missed because he failed to take a small step, a step so simple that today it can easily be explained to beginners. Had Johann Bernoulli "lived in modern times," he probably would have been torn between the curiosity to find out about non-Euclidean geometry and the desire not to hear about the tragedy of his missed opportunity to reach eternal fame and glory. Acknowledgements The writing of this paper began while the first author was Johann Bernoulli Visiting Professor at the University of Groningen, in May, June and July 1999, and continued while he was back at Rutgers University, partially supported by the National Science Foundation under NSF Grant DMS 98-03411. It was concluded while he was Weston Visiting Professor at the Weizmann Institute of Science in Rehovot, Israel, from October 2000 to January 2001. He is grateful to the University of Groningen, the Weizmann Institute, and NSF, for their help and support. References 1. Bell, E. T., Men of Mathematics. (Simon and Schuster, New York, 1937). 2. Beltrami, E., Saggio di interpretazione della geometria noneuclidea.Giornale di Matematiche VI, 284-312 (1868) (in Italian). "Johann was very jealous of Daniel's success. He once threw Daniel out of the house for having won a French Academy of Sciences prize for which Johannn had also been a candidate (cf. Bell l , p. 134). Probably with the help of his loyal disciple Euler (a student of Johann in Basel and a colleague of Daniel in Saint Petersburg), Johann plagiarized Daniel's pathbreaking work on hydrodynamics (cf. Guillen 1 7 ) .
163 (Eng. transl. in Stillwell, 33 , 7-34). 3. Berkovitz, L. D., Optimal Control Theory. (Springer-Verlag, New York, 1974). 4. Berna, P., N. Conze, M. Duflo, M. Levy-Nahas, M. Rais, P. Renouard, M. Vergne, Representations des groupes resolubles. (Dunod, Paris, 1972). 5. Bernoulli, Johann, Curvatura radii in diaphanis nonuniformibus . . . [The curvature of a ray in nonuniform media . . . ]. Acta Eruditorum, May, 206-211 (1697). {Opera omnia I, 187-193.) 6. Bernoulli, Jakob, Solutio problematum fraternorum . . . una cum propositione reciproca aliorum [Solution of problems of my brother . . . together with the proposition of others in turn]. Acta Eruditorum, May, 211-217 (1697). (Opera I, 768-778.) 7. Bliss, G. A., Calculus of Variations. (Open Court Publishing Co., Chicago, 1925). 8. Bonnard, B., V. Jurdjevic, I. A. Kupka and G. Sallet, Transitivity of families of invariant vector fields on the semidirect products of Lie groups. Trans. Amer. Math. Soc. 271, no. 2, 525-535 (1982). 9. Bonnard, B., V. Jurdjevic, I. A. Kupka and G. Sallet, Systemes de champs de vecteurs transitifs sur les groupes de Lie semi-simples et leurs espaces homogenes (French). In Systems Analysis Conf. (Bordeaux, 1978), Asterisque, Soc. Math. France, Paris, 75-76 (1980). 10. Bonola, R., Non-Euclidean Geometry. (Italian original: La Geometria non-Euclidea, The Open Court Publishing Co., 1912.) Eng. transl. by H. S. Carslaw, (Dover, New York, 1955). 11. Cesari, L., Optimization—Theory and Applications. (Springer-Verlag, New York, 1983). 12. Clarke, F. H., The maximum principle under minimal hypotheses. SI AM J. Control Optim. 14, 1078-1091 (1976). 13. Clarke, F. H., Optimization and Nonsmooth Analysis. (Wiley Interscience, New York, 1983). 14. Clarke, F. H., Yu. S. Ledyaev, R. J. Stern, and P. R. Wolenski, Nonsmooth Analysis and Control Theory. Grad. Texts in Math. 178, (Springer-Verlag, Berlin, 1998). 15. Euler, L., Methodus inveniendi . . . accepti. Lausanne, Geneva, 1744; (Opera omnia, ser. I, vol. 25, 1952). 16. Goldstine, H. H., A History of the Calculus of Variations from the 17th to the 19th Century. (Springer-Verlag, New York, 1981). 17. Guillen, M., Five equations that changed the world: the power and poetry of mathematics. (New York, Hyperion, 1995).
164
18. Hestenes, M. R., Elements of the calculus of variations. In Modern Mathematics for the Engineer, (E. F. Beckenbach Ed., McGraw-Hill, New York, 1956). 19. Isidori, A., Nonlinear Control Systems. (Springer-Verlag, Berlin, 1995). 20. Jakubczyk, B., and W. Respondek eds., Geometry of Feedback and Optimal Control. (Marcel Dekker, New York, 1997). 21. Jurdjevic, V., and I. A. K. Kupka, Control systems on semisimple Lie groups and their homogeneous spaces. Ann. Inst. Fourier (Grenoble) 31, no. 4, vi, 151-179. 22. Jurdjevic, V., and I. A. K. Kupka, Control systems subordinated to a group action: accessibility. J. Differential Equations 39, no. 2, 186-211 (1981). 23. Jurdjevic, V., and I. A. K. Kupka, Linear systems with degenerate quadratic costs. Compt. Rend. Math. Rep. Acad. Sci. Canada 10, no. 2, 101-106 (1988). 24. Jurdjevic, V. Geometric Control Theory. (Cambridge U. Press, New York, 1997). 25. Kupka, I. A. K., Degenerate linear systems with quadratic cost under finiteness assumptions. In New trends in nonlinear control theory (Nantes, 1988), Lecture Notes in Control and Inform. Sci. 122, (Springer, Berlin, 231-239, 1989). 26. Lee, E. B., and L. Markus, Foundations of Optimal Control Theory. (Wiley, New York, 1967). 27. Lochak, G., La Geometrisation de la Physique. (Flammarion, Paris, 1994). 28. Malisoff, M., On the Bellman equation for optimal control problems with exit times and unbounded cost functionals. In Proc. 38th IEEE Conference on Decision and Control, Phoenix, AZ, Dec. 1999, (IEEE Publications, New York, 23-28, 1999). 29. Malisoff, M., A remark on the Bellman equation for optimal control problems with exit times and noncoercing dynamics. In Proc. 38th IEEE Conference on Decision and Control, Phoenix, AZ, Dec. 1999, (IEEE Publications, New York, 877-881, 1999). 30. Nijmeijer, H., and A. van der Schaft, Nonlinear Dynamical Control Systems. (Springer-Verlag, New York, 1990). 31. Pontryagin, L. S., V. G. Boltyanskii, R. V. Gamkrelidze, and E. F. Mischenko, The Mathematical Theory of Optimal Processes. (Wiley, New York, 1962). 32. Saccheri, G., Euclides ab omni naevo vindicatus: sive conatus geometricus quo stabiliuntur prima ipsa universale Geometriae Principia. (Milan,
165 Italy, 1733). 33. Stillwell, J., Sources of Hyperbolic Geometry. History of Mathematics, Vol. 10 (Amer. Math. Society, Providence, RI, 1996). 34. Struik, D. J., Ed., A Source Book in Mathematics, 1200-1800. (Harvard U. Press, Cambridge, Mass., 1969). 35. Sussmann, H. J., A strong version of the Lojasiewicz maximum principle. In Optimal Control of Differential Equations, N. H. Pavel Ed., (Marcel Dekker, New York, 293-309, 1994). 36. Sussmann, H. J., A strong version of the Maximum Principle under weak hypotheses. In Proc. 33rd IEEE Conference on Decision and Control, Orlando, FL, 1994, (IEEE publications, New York, 1950-1956, 1994). 37. Sussmann, H. J., An introduction to the coordinate-free maximum principle. In Geometry of Feedback and Optimal Control, B. Jakubczyk and W. Respondek Eds., (Marcel Dekker, New York, 463-557, 1997). 38. Sussmann, H. J., and J. C. Willems, 300 years of optimal control: from the brachystochrone to the maximum principle. (IEEE Control Systems Magazine, 32-44, June 1997). 39. Sussmann, H. J., Multidifferential calculus: chain rule, open mapping and transversal intersection theorems. In Optimal Control: Theory, Algorithms, and Applications, W. W. Hager and P. M. Pardalos Editors, (Kluwer Academic Publishers, 436-487, 1998). 40. Sussmann, H. J., Geometry and optimal control. In Mathematical Control Theory, J. Baillieul and J. C. Willems Eds., (Springer-Verlag, New York, 140-198, 1998). 41. Sussmann, H. J., Transversality conditions and a strong maximum principle for systems of differential inclusions. In Proc. 37th IEEE Conference on Decision and Control, Tampa, FL, Dec. 1998. (IEEE publications, New York, 1-6, 1998). 42. Sussmann, H. J., Resultats recents sur les courbes optimales. In Quelques aspects de la theorie du controle, Journee annuelle de la Societe Mathematique de France, (Paris, 1-50, June 2000). 43. Sussmann, H. J., A nonsmooth hybrid maximum principle. In Stability and Stabilization of Nonlinear Systems (Proceedings of the First European Community Nonlinear Control Network Workshop on Stability and Stabilization held in Ghent, Belgium, on March 15-16, 1999), D. Aeyels, F. Lamnabhi-Lagarrigue and A. J. van der Schaft Eds. Lect. Notes Control and Information Sciences 246, (SpringerVerlag, 325-354, 1999). 44. Sussmann, H. J., New theories of set-valued differentials and new versions of the maximum principle of optimal control theory. To appear in
166
Nonlinear Control in the Year 2000 (Proceedings of the European Control Network Workshop on Nonlinear Control, Paris, France, June 5-7, 2000), A. Isidori, F. Lamnabhi-Lagarrigue and W. Respondek Eds., (SpringerVerlag, 487-526,2000). 45. Synge, J. L., The life and early work of Sir William Rowan Hamilton. Scripta Mathematica X, 13-25 (1944).
Part II Contributed Chapters
This page is intentionally left blank
169
SYMPLECTIC M E T H O D S FOR S T R O N G LOCAL OPTIMALITY I N T H E B A N G - B A N G CASE ANDREI A. AGRACHEV Steklov Mathematical Institute, Russian Academy of Sciences, Gubkina ul.8 - 117966 Moscow, Russia and SISSA, Via Beirut 4 - 34014 Trieste, Italy E-mail: [email protected] GIANNA STEFANI Dip. di Matematica Applicata, G. Sansone, Via S. Marta 3 -50139 Firenze, Italy E-mail: [email protected] PIERLUIGI ZEZZA Dip. di Matematica per le Decisioni Via C. Lombroso 6/17 - 50134 Firenze, Italy E-mail:[email protected], URL: http://www.dmd.unifi.it/zezza/ We announce a new sufficient condition for a bang-bang extremal to be a strong local optimum for a control problem in the Mayer form. The controls appear linearly and take values in a polyhedron and the state space and the constraints are smooth, finite dimensional manifolds.
1
Introduction
We state a sufficient condition for a bang-bang extremal to be a strong local optimum for a control problem in the Mayer form. The controls appear linearly and they take values in a polyhedron and the state space and the constraints are smooth, finite dimensional manifolds; here we give only a sketch of the proof, a complete version of the result will be published elsewhere, see 7
The sufficient optimality condition contains two different sets of assumptions: the first (Assumption 1 and 2) concerns the regularity of the Hamiltonian flow while the second is the positivity of a suitable second variation. In the case of bang-bang extremals the control values belong to the boundary of the control set so that the kernel of the first variation does not contain any subspace and hence the usual second variation is not defined. To overcome this problem we consider the finite dimensional subproblem generated by perturbing the switching times. We prove that the sufficient second order
170
optimality conditions for this finite dimensional subproblem yield local strong optimality. We follow the Hamiltonian approach to strong optimality consisting in constructing a field of state extremals covering a neighborhood of the reference trajectory. This field of extremals is obtained by projecting, on the base manifold M, the flow of the maximized Hamiltonian emanating from the Lagrangian submanifold of the initial transversality conditions, which belongs to T*M; if this projection admits a Lipschitz continuous inverse, then we can estimate the variation of the cost at a neighboring trajectory by a function which depends only on the final point and it is hence independent on the control differential equation, the Lipschitz continuity of the inverse is guaranteed by the surjectivity of the tangent map. Let us point out that if the final point is fixed then the manifold of the admissible final points reduces to a singleton and hence, to obtain optimality, it is sufficient the surjectivity of the projection which corresponds to the classical non self-intersecting property of the state extremals, when the final point is not fixed, the optimality will also be guaranteed by the positivity of the second variation. The relations existing between the second variation and the symplectic properties of the Hamiltonian flow can be used to show that if the second variation is positive then the tangent map of the flow is surjective and the sufficient optimality conditions for the above mentioned function of the final point are satisfied. A crucial idea underlying the general approach is that the linear Hamiltonian flow of this linear quadratic problem has to be the tangent map of the flow of the maximized Hamiltonian. Some issues has already been addressed. In 5 we stated sufficient conditions for strong local optimality for an optimal control problem in Rn with unbounded controls, while in 4 we gave an intrinsic expression of the accessory problem and we studied the relations between the Hamiltonian flow and the index of the second variation. The geometric properties of the field of extremals needed to prove sufficient conditions for strong optimality has been studied in 6 . Similar conditions for time-optimality of a bang-bang trajectory have been obtained by A. Sarychev in 9 and 8 .
171
2
Main results
Let Xi, i = 1 , 2 , . . . , m be m distinct C°° vector fields defined on the C°° manifold M and let us define the multivalued function X:M^>TM,
X:x*+co{X1(x),...,Xm[x)},
where co denotes the convex hull. We consider an optimal control problem stated in the Mayer form Minimize
J(£) := co(£(0)) + cT{Z{T))
(2.1)
subject to i{t) e x{i(t)) ttO) 6 N0 , £(T) 6 JVr
(2-2) (2.3)
where N0, NT are C°° submanifolds of M. We are given, as a candidate optimal solution, a reference bang-bang Pontryagin extremal £; that is an absolutely continuous function £ : [0, T] - • M which is a solution of system (2.2)-(2.3), satisfies the Pontryagin maximum principle (PMP) and it is such that we can find a partition of the interval [0, T], {0 = t0 < tx < t2 < • • • < tr < U+i = T} with the property that, in each subinterval, the function £ is a solution of i(t) = XjMt)),
te[ti-i,n]
(2.4)
for some ji 6 { 1 , 2 , . . . , m } ; the values ti, i = 1,2,... ,r will be called switching times. We assume that Xj{ ^ Xji+1, for i = 1,2,..., r, for not introducing unnecessary switching times. Corresponding t o t h i s reference trajectory we define the time-dependent reference vector field h : [0, T] x M -> TM and we set hi ~ % 4 . _ l j t . ) : x >->•
Xj^x).
The time dependent reference vector field ht defines, by lifting to the cotangent bundle, the reference time dependent Hamiltonian H : [0,T] x T*M - • E, (t,£) ^
(£,ht(n£)),
where n : T*M -> M is the canonical projection and we set Hi := The maximized Hamiltonian H :T*M-*R,
l^
max
(l,v),
H^t._lit.y
172
is well defined and Lipschitz. For simpler notations we set x0 := |(0), xT := £(T). Recalling that any Hamiltonian Ht : T* M ->• R defines a Hamiltonian vector field, denoted by Ht, we can express the PMP by saying that there exist po £ {0,1} and a lift A of £ which is a solution of X(t) = Ht(X(t))
(2.5)
A(0) - po dco(x0) = 0 on TX0N0 \{T)+podcT(xT)=0 on TXTNT
(2.6) (2.7)
such that bo| + | | A | | ^ 0 Ht{X(t)) =
max (X(t),v) = H(X(t)). vEX(i(t))
The functions po CQ and po ex are denned on N0 and Nx, respectively, but they can be extended to the whole manifold M in such a way that the transversality conditions (2.6) and (2.7) hold on the whole tangent space. Let us denote by a, j3 : M —> E two functions such that a -p0c0 /3 — po ex
on N0, on Nx
A(0) = da(xo), X(T) =
-dP(xT).
In the normal case (po = 1) Q, /3 are cost functions equivalent to the original ones while in the abnormal case (po = 0) they are extensions of the zero function. When po = 0 all the costs disappear from our conditions and indeed we will study a problem with a zero cost. Proving that £ is a strict strong minimizer will imply that it is isolated among the admissible trajectories. The transversality conditions of the PMP can be stated saying that the end points of the adjoint covector belong to some specific Lagrangian manifolds, if we define A0 := {da(x) + T^No | x 6 N0} , A T := {-d/3(x) + T^Nx | x € NT} , then the transversality conditions (2.6)-(2.7) can be expressed as A(0) 6 A 0 ,
X(T) G A T .
173 The sufficient conditions will be stated analyzing the problem Minimize
J(fl := a(£(0)) + /?(£(T))
(P)
subject to (2.2) and (2.3). The points ti:=\(U),
i=0,l,...,r + l
will be called the switching points of the adjoint covector A. From the PMP we can deduce the following consequences which represent necessary optimality conditions for i — 1,2,..., r Hi(li) = Hi+i(£i) (d(Hi+1-Hi),Hi+1)(ei)>0. To state sufficient condition for f to be a strong local minimizer we need to strengthen these two conditions. We will assume that the following hold Assumption 1. (d(Hi+1 - Hi),Sw){ti)
> 0, * = 1,2,... ,r.
Assumption 2. The maximum max (\(t),v), t€[0,T] v€Xtf(t)) is attained at a vertex of X(£(t)) for all t ^ ti,t2,- • -,tr or along an onedimensional edge for t = t\, t2, • • •, tr. These conditions are strictly related to the properties of the flow of the maximized Hamiltonian H and they can be interpreted as regularity conditions on the maximized Hamiltonian near the adjoint covector. In fact the PMP implies that each switching point Hi belongs to the level set i?j+i — Hi = 0 and Assumption 1 yields that this level set is an hypersurface called switching surface. Moreover we can prove Lemma 2 . 1 . Under Assumption 1 there exists a neighborhood U of IQ such that we can define recursively, for i = 0 , 1 , . . . , r, the C°°-maps Ti : U - • M, and fa : U -> T*M
in the following way, set To : = 0,
the T{ 's are implicitly defined by (Hi - Hi+1) (expnWHiifa-iit))) Ti("i) = ti,
=0
174
while the fa 's are C°° symplectic diffeomorphism defined as <j>i{t) := exp(-Ti(e)Hi+1)
expn(e)Hi{4>i-i{l)).
Under Assumptions 1 and 2, by Lemma 2.1, we can say that the map "K : [0,T] x U - > [ 0 , r ] x T*M given by •K{t,t) = (t,exptHi+1(
t £ [Ti(0,r i + i(/)],
is the unique solution of the Hamiltonian system \(t) = H(X(t)) A(0) = L Moreover the flow "K is C°° on each 2 n + 1 dimensional C°° manifold with boundary Oj, given by Oi := { ( M ) | / € U, Ti_i(0 < * < Ti(l)} C [0,T] x T*M. Let us remark that every solution of this system has the same number of switches as the adjoint covector A. Remark 2.2. Assumption 1 plays a role analogous to the one of the strengthened Legendre condition in the case of unbounded controls; in fact Lemma 2.1 says that we can define locally, in a tube around the adjoint covector, a timedependent Hamiltonian as {t,l)t-¥Hi(t,e),
for
M~1(t,t)£Oi.
Assumption 2, which concerns the uniqueness of the maximum point, ensures that with this choice we obtain the maximized Hamiltonian, for analogous conditions in the case of unbounded controls see 5 . In the case of bang-bang extremals, the kernel of the first variation of the problem is trivial and hence the usual second variation, which is defined on the kernel of the first one, does not exist. To define an appropriate second variation we proceed starting from the following trivial fact: any choice of a subset of admissible variations for problem ( P ) would lead to a subproblem and hence to necessary conditions. Let us choose an appropriate r—dimensional family of variations corresponding to bang-bang trajectories, they are constructed by taking variations of the switching times. The optimality with respect to these variations will be sufficient (under the above stated assumptions) to prove strong local optimality of the reference trajectory. For a given a > 0 such that min i=l,...,r+l
(U — tj_i) > 2a,
175
let e G B(0, a) C W, set e 0 = £V+i = 0 and consider the time-dependent vector field obtained from the reference one in the following way (e,x) t-> ht(e,x) = hi(x) , t 6 (£,-_i + £•,•-!, *i +£i),
i = 1,2,.. .,r + 1.
In other words we have moved the switching time U by £j. Remark 2.3. Notice that a small e corresponds to a control variation which is small in the L1 norm but not in the L°° norm. We denote the flow of £(t) = ht(e,£(t)) by St : M x B ( 0 , a ) - » M and we consider the following finite dimensional subproblem of problem (P) Minimize
j(x,e)
:= a(x) + P(Sr(x,e))
(sub-P)
subject to x £ N0 , ST(x,e)
e NT.
Note that St : x ^
St(x,0)
is the flow of the reference vector field and hence St(xo, 0) = £(£)• Thanks to the choice of a,0 we have that the PMP implies that (xo,0) is a critical point for 7, that is dj(xo,0)=0, so that the second derivative of 7 at (xo,0) is well defined and J":=^27(xo,0) is a quadratic form on TXo M x E r which gives the second order approximation of 7. Thus the second variation of (sub-P) is the restriction of J " to the linearization of the constraints, namely if we set Ji = {{6x,e) e TXON0 x W : ST*{5x,e)
e
TXTNT},
then the second variation of (sub-P) is J,'^. Our main result is given by the following theorem. It states that under the regularity conditions on the maximized Hamiltonian, the positivity of the second variation of (sub-P) it is sufficient to prove that the reference trajectory is a strict strong local minimizer for the original problem. Theorem 2.4. Assume that a given bang-bang Pontryagin extremal £ satisfies Assumptions 1 and 2. If the second variation J!^ of (sub-P) is positive definite then £ is a strict strong local minimizer. In the abnormal case £ is an isolated admissible solution of the constrained control system.
176
3
Sketch of the proof
The proof is very technical, here we give an idea of how to construct the field of extremals and of the relations existing between the Hamiltonian flow and the second variation. A general introduction to symplectic methods and their applications to optimal control is in 1 and 2 . When the initial point is not free, it is not possible to cover, by the projection, a neighborhood of the initial point. In the calculus of variations this problem was solved by perturbing the initial time, but this method does not work in our case because the projection is singular on [£o,*i]- We can achieve the same result by perturbing the initial cost with a penalty term and constructing an equivalent problem with a free initial point. Let Q be any non negative quadratic form on TXoM, such that kerQ = TX0N0, we extend it to TXoM x E r by setting Q[6x,e}2 = Q[6x}2. If the quadratic form J" is positive on N then we can find p > 0 such that J" + PQ > 0 on {(Sx,e) G TXoM x W : ST*(Sx,e)
€ TXTNT} ,
as can be easily proved by elementary arguments of linear algebra. Let us choose a function ap such that ap = a on iVo dotp — da on TXo N0 D2(ap-a){xQ) = pQ, and consider the problem Minimize
a p (£(0)) + j8(£(T))
(3.1)
subject to
iit) G x(m) £(T) e NT. Since the reference trajectory satisfies the initial boundary conditions then proving that it is optimal for this new problem yields its optimality for the original one; therefore without loss of generality we can assume that the original problem has already the initial point free (iVo = M and a = ap) and hence the initial Lagrangian submanifold is horizontal and its projection covers a neighborhood of the initial point xoRemark 3 . 1 . This reduction is possible because the new cost on the initial point contains an exact penalty which can be constructed assuming that the second variation is positive definite.
177
Let s be the canonical one form in T*M, we denote by er = da the canonical symplectic two form on T*M, see 3 for the definitions. In the following we will use some of the properties of the Cartan form u> = s — H dt on I x T*M, associated to the Hamiltonian H. Namely • u> evaluated along a lift of a solution of (2.2) is non positive and it is zero along A. • w is exact on the manifold generated by the flow of H emanating from a Lagrangian manifold. By taking the restriction to a neighborhood of xo we can assume that A Q C U and that it is a smooth simply connected Lagrangian submanifold. We define r+1 4=1
The fi;'s are n + 1 dimensional C°°-manifolds with boundary. When the map -Kt := 7T o 0it : A 0 M- M
is nonsingular for all t € [0, T] then, by relating the properties of the Cartan form u> and those of this map, we can estimate the variation of the cost at a neighboring trajectory by a function which depends only on the final point and it is hence independent on the control differential equation. The optimality properties of | can be described through those of the symplectic map "Kt* '• Tf0Ao —> T^,t-.M, for simpler notations we set LQ := T(0 Ao- In particular let us consider, for i = 1,2,..., r + 1, the tangent map to the pull-back of the restriction of the flow "K to fli-i is given by the maps ^ : L 0 -»• TtoT*M ipi :=5ii.](exptiHi)tf(j)i-.it
=
ji^l^iexpti-iHi)*^-!*.
The functions ^,'s can be used to characterize the singularity of 7rt T h e o r e m 3.2. The map n* "Kt* • L0 -»• T^M is onto for each t £ [0,T], i.e.,
for each t 6 [0, T], if and only if
178
1. For i = 1,2,..., r -+-1 we have that ir*ipiL0 = TX0M. 2. For i = 1,2, ...,r
if Sli, Sl2 € L0 are such that 7r» tpi 5ti = 7T* rpi+iSl2
then adjide1,tpi+1S£2)>0. The above theorem states that, in the bang-bang case, a conjugate point can occur only at a switching time and, differently from the case of unbounded controls, the second variation may have a positive index without having positive nullity for a smaller t. When the final point is not fixed the following Theorem describes the optimality of £. Theorem 3.3. Assume that the map 7r» "Kt* '• LQ —> T;,t,M is onto for each t G [0, X1]. If for every non zero 6(.\ 6
LQ
7r» ipr+iS£i
and % £
^•T*^X(T)^T
SUC
^
^hat
= 7T» SI2
we have that o-(V»r+i«i, W2) > 0 then £ is a strict strong local minimizer for the problem (P). To show the relations between the second variation and the properties of the Hamiltonian flow, we first reduce (sub-P) to a problem which is linear in the control and where the admissible control maps are piecewise constant with the ti's as switching times. This goal can be achieved by a reparametrization of time. Consider the following control boundary value problem on E (p{r) = 1 + y>(0) = 0,
V(T)
where the control u takes values in (—1,1). Any solution of this boundary value problem is an increasing isomorphism of the interval [0, T] onto itself. If we set £i : = y - 1 ( * i ) ~U,
i=
l,2,...,r
179
then we have that ht(e,x) Moreover Sv^(x,e)
=
hv-i(t)(x).
is the solution of the differential equation C(T) = [1 + K T ) ] M C ( T ) ) ,
C(0)=X.
(3.2)
Let us notice that, since the behaviour of the trajectory is fully described by the integral of the control v, to obtain our variations it is sufficient to take v as a piecewise constant control with the same switching times of the reference trajectory. If i/; is the value of the control on [U-i, t{) and we set Ui ~
Vi (ti — i j - x )
then, from the definition and from the boundary conditions, we have that i
r+1
3=1
3=1
the last relation yields that our control space U has dimension r U:= ^ ( u i , u 2 , . . . , « r + i ) € # r + 1 | 5 T « i = o L For each u € U we denote by uu the corresponding control. In our finite dimensional subproblem (sub-P) we take the state defined by equation (3.2) and than the reference control will be the zero one. If we define the second variation of this problem, as in 4 , through the pull-back system, we obtain a linear-quadratic problem on TXoM x U. Thanks to the special structure of the problem the associated Hamiltonian is of the following type
(«,./)-• Gi'(MMt) where G" : T*QM x TXQM -> ffi is a piecewise constant linear Hamiltonian. We denote by Q"(6l,u) the solution of the associated Hamiltonian system \{t) = G'l{\{t))vu{t),
A(0) = St
The Lagrangian subspaces of the transversality conditions are L'J := {{-D2j(x0)(Sx,-),6x)
where 7 = a + f3 o St-
\ Sx G TXoM}
(3.3)
180
We study the changes of the signature of the second variation recursively, for k = 0 , 1 , . . . , r, on each
xk = xnTX0MnUk, where Uk := {u e U | Uj = 0 for j = k + 2 , . . .r + 1} , notice that UQ = {0} and Ur = U. This corresponds to consider admissible controls that are zero from a certain switching time on. Following the same ideas as in the case of unbounded controls, see 4 , we obtain that ker J!^ is described by those S£ G LQ and u G Uk such that %'^{5(.,u) £ L? and f G'i (S" (W,«)) i/„ (*) d* = 0,
for all veUk.
(3.4)
Thanks to Assumption 1, equation (3.4) defines a linear feedback laww : LQ —> U and S"(<W, (w, SI)) corresponds to the solution of the classical Jacobi system, since it is the solution of the Hamiltonian system of the second variation with the feedback control defined by equation (3.4). To conclude the proof we show that the linear maps
S$.(M):=S5r-(M,<w,M»), contain the necessary information about the behaviour of tangent map to the flow "K at the switching point £;. This information can be described through the anti-symplectic isomorphism i : T*oM x TX0M -> T(oT*M,
(w, 6x) •-> - u +
d(-0)t5x,
as S§. = »-Vfc+i * for * = 0 , 1 , . . . , r.
(3.5)
It is important to notice that (3.5) states that, modulo isomorphisms, the flow of the Hamiltonian of the second variation is the tangent map to the flow of the maximized Hamiltonian. References 1. Andrei A. Agrachev and Revaz V. Gamkrelidze, Symplectic geometry for optimal control, in Nonlinear Controllability and Optimal Control, Hector J. Sussmann, Ed., Pure and Applied Mathematics, vol. 133 (Marcel Dekker, 1990).
181
2. Andrei A. Agrachev and Revaz V. Gamkrelidze, Symplectic methods for optimization and control, in Geometry of Feedback and Optimal Control, New York, New York, B. Jacubczyk and W. Respondek, Eds., Pure and Applied Mathematics, 1-58 (Marcel Dekker, 1997). 3. Vladimir I. Arnold, Mathematical methods in classical mechanics (Springer, New York, 1980). 4. Andrei A. Agrachev, Gianna Stefani and PierLuigi Zezza, An invariant second variation in optimal control, International Journal of Control, 7 1 , 5, 689-715 (1998). 5. Andrei A. Agrachev, Gianna Stefani and PierLuigi Zezza, Strong minima in optimal control, Proceeding of the Steklov Mathematical Institute, 220, 4-26 (1998); translation from Tr. Mat. Inst. Steklova 220, 8-22 (1998). 6. Andrei A. Agrachev, Gianna Stefani and PierLuigi Zezza, A Hamiltonian approach to strong minima in optimal control, in Differential Geometry and Control. Proceedings of the Summer Research Institute, Boulder, CO, USA, June 29-July 19, 1997, Providence, RI, Ferreyra et al., Eds., 11-22 (American Mathematical Society, 1999). 7. Andrei A. Agrachev, Gianna Stefani and PierLuigi Zezza, Strong optimality for a bang-bang trajectory, Tech. report 08 (Dipartimento di Matematica per le Decisioni, Firenze, 2000). 8. Andrei V. Sarychev, Sufficient optimality conditions for Pontryagin extremals, Syst. Control Lett. 19, 6, 451-460 (1992). 9. Andrei V. Sarychev, First and second order sufficient optimality conditions for bang-bang controls, SI AM J. Control Optimization, 1, 35, 315-340 (1997).
This page is intentionally left blank
183
C H A R G E S I N M A G N E T I C FIELDS A N D S U B - R I E M A N N I A N GEODESICS ALFONSO ANZALDO-MENESES AND FELIPE MONROY-PEREZ Departamento de Ciencias Bdsicas, Universidad Autonoma Metropolitana-azcapotzalco, 02200, Mexico D.F. E-mail: [email protected], [email protected] In this paper we study the action of certain nilpotent Lie groups under which a differential system given in terms of a rank-two distribution remains invariant. Such systems describe the dynamics of electric charges in static inhomogeneous magnetic fields. The action of the Lie group associated to the sub-Riemannian structures in the simplest non-uniform cases, contains naturally the Heisenberg group which corresponds to a uniform field. A linear magnetic field yields the nilpotent group associated with the so-called Martinet case and the group corresponding to systems of Engel type. The Cartan five dimensional system is also the natural setting for the study of linear magnetic fields acting on classical electric charges. Our results generalize previous works on this class of non-holonomic systems.
1
Introduction
Let A be a rank-two distribution denned on a differential manifold. We study the action of certain nilpotent Lie groups that leave the differential system determined by A invariant. The motivation for this study is the geometry of dynamical systems describing the motion of classical electric charges in the Euclidean plane, under the influence of external, perpendicular and inhomogeneous magnetic fields. Sub-Riemannian geometry provides a natural setting for the study of certain aspects in classical particle and plasma physics 5 , this approach is not new and has been in the literature at least in the last ten years, see for instance, 13 . The dynamics of a classical electric charge in the plane under the influence of a perpendicular inhomogeneous magnetic field can be described by means of a rank-two distribution, which together with a flat Riemannian metric, determines a sub-Riemannian structure. The Lie algebra generated by the distribution turns out to be nilpotent. The study of the action of the corresponding nilpotent Lie group plays an important role in our analysis, and it contains as the simplest nilpotent subgroup the well-known Heisenberg group. The so-called Martinet case, studied by R. Montgomery, 14 , and B. Bonnard et al, x, turns out, under our approach, to be a particular example
184
corresponding to linear magnetic fields. Our work is related with that of Y.L. Sachkov 17 and that of the two authors mentioned above. In contrast with those references, we present explicit and detailed calculations of trajectories (sub-Riemannian geodesies), and a more general approach. The PfafHan system associated to the inhomogeneous magnetic field leads us to consider Goursat and Cartan distributions. The non-holonomic constraints encoded in the set of 1—forms, motivates the extension of the standard two dimensional Lagrangian to higher dimensions, even in this generality, certain constants of motion are explicitly exhibited. For this presentation we apply variational principles and perform an explicit integration process in terms of hyper-elliptic functions. Abnormal extremals appear, however they turn out to be non strictly abnormal. In a separate work 12 , we apply Pontryagin Maximum Principle to carry out the study of a time optimal control problem in certain nilpotent Lie groups which contain some of the groups described here. Apart from this introduction, this paper contains four sections. In the second section we establish our notation, and provide basic definitions and standard results in sub-Riemannian geometry and distributions. In section three we apply the Lagrange multipliers technique and explicitly integrate the extremal equations in terms hyper-elliptic integrals. In section four, we discuss low dimensional cases. In particular we study the five dimensional Cartan system, which has attracted a lot of interest in differential geometry and control theory, see 3 . Also in this case we present an explicit integration in terms of the Weierstrass p-function. 2
Sub-Riemannian geometry and classical particles
As we mentioned before, some problems in classical particle physics find a natural setting within a general geometric framework which goes in the literature under different names: singular Riemannian geometry 2 , sub-Riemannian geometry 19 , or Carnot-Caratheodory geometry 7 . 2.1
Sub-Riemannian geodesies and distributions
A sub-Riemannian structure on a manifold P consists of a couple (A, g), where A is a distribution satisfying the so-called full-rank condition (Lie(A)(g) = TqP, for all q € P), and g is a Riemannian metric on A, see 8 . Let us denote the corresponding scalar product as (-,-) 9 . An admissible curve for A, also called horizontal, is an absolutely continuous curve
185
q : [0, Tg] -> P such that q(t) £ A(g(£)) almost everywhere. The length of an admissible curve q(t) is given as
L{q{t)) = J
9
^J(q(t),q(t))g dt,
whereas its energy is given as follows
E(q{t)) = \ j '(q(t),q(t))g dt. Given two points qo,qi 6 P, a minimizer connecting qo with q\ is an admissible curve q : [0, Tq] —> P with minimal length satisfying q(0) = qo and q(Tq) = qi. This defines the sub-Riemannian distance between qo and q\. It is well known that if all the curves considered are defined in the same interval, then the length and energy minimizing problems are equivalent. By the Chow-Rashevsky's theorem 6 , 14 , if the distribution satisfies the full-rank condition, and the manifold is connected, then any two points can be connected by a smooth horizontal path. A non-constant admissible curve q : [0,Tq] —¥ P is called a subRiemannian geodesic, if its restriction to any small enough subinterval of [0, Tq] is a minimizer, and ||<7(£)||9 does not depend on the parameter t. In contrast with the Riemannian case, sub-Riemannian geodesies are not determined by its initial tangent vector, but by its initial covector, that is, if p 6 T*P is an initial covector, then the corresponding initial tangent vector v is denned by means of the following formula (w,v)g
=p{w),
for all w S A(TT(P)), where IT denotes the canonical projection of the cotangent bundle T*P. We shall consider in this paper bracket generating and regular distributions. A distribution A on a manifold P is said to be bracJcet generating, if there is an integer n such that A™ = TqP for each q € P. Here, A i + 1 = [A,A 4 ], i = l,..., which lead to the flag A , c A^ c • • • C TqP
186
As R. Montgomery has pointed o u t 1 5 , the growth vector is the most basic numerical invariant associated with a distribution. The growth vector nq of A at the point q, is defined as nq = (niq,n2q,...
,nnq),
where n ^ = dim Aq. The distribution is said to be regular at q, if the growth vector is constant in a neighborhood of the point q. The notion of symmetry in sub-Riemannian geometry has been studied since the early paper of R. Brockett 2 . More recently symmetries have been analyzed for higher dimensional Heisenberg groups n , and for some relevant cases of low dimensional Lie algebras 17 . We shall follow our paper mentioned above to describe the symmetries of A by means of the action on P of a Lie group under which A remains invariant. 2.2
Geometry of non-holonomic
constraints
The dynamics of a point under non-holonomic constraints when these are given by a certain Pfaffian system, can be written as a differential control system. The corresponding variational problem of minimizing the action, can be formulated either as an optimal control problem, or in the language of connections on principal bundles. Let M be the base manifold, a simply connected and complete twodimensional Riemannian manifold and take a coordinate chart with coordinates (x,y). Let {d6i — {vidx + fiidy)}i be a set of linearly independent 1—forms, the kernel of this Pfaffian system is encoded in the differential control system q = uX1{q)
+vX2{q),
where the control parameters u = x, v — y and the state variable q = (x,y,6\,...) are taken in the appropriate spaces denoted as U and P respectively. If A = {Xi,X2} turns out to be a regular and bracket generating distribution, then for all q € P, we can complete a basis {Xi(q),X2(q),X3(q),... ,XdimP(Q)} for each tangent space TqP. By declaring this basis orthonormal we obtain an inner product (-, -)q whose restriction to A yields a sub-Riemannian structure. The sub-Riemannian geodesies correspond to absolutely continuous solution curves [0, Tq] ->• P, t H» q(t) of the above control system which minimize the energy functional
187
E(q(t)) = \jq<«(*),«(*)>, dt=^J
\2+v2
dt.
In summary, we obtain under this setting an optimal control problem on the manifold P. The above formulation leads us naturally to the framework of principal bundles. Because in our case (not true in general), the Lie group G, corresponding to the Lie algebra go generated by the vector fields {X3,... , Xdim p } , yields an action • : G x P -> P satisfying {9192)-q = 9i- (92 • q) for each q € P, which together with the natural projection TT : P -> M, q H* Tr(q) = (x, y), determines the principal bundle over M with total space P and structure group G. Furthermore, since for each q 6 P one has the splitting TqP = Horq © Verq, where Horq = Aq and Verq = go(q), then a unique connection on the principal bundle (P, M, n, G, •) is determined, see 18 for details. Remark 2.1. The control system mentioned before determines horizontal lifts Xi and X2 of tangent vectors on M. This lifting provides a smoothly varying family of linear maps aq : T(x>y)M —*TqP satisfying dirq o aq — idM and a(g-q) = g- a{q) 2.3
The variational Problem
As explained above, the physical problem of a classical particle evolving on the manifold M under the influence of an external static magnetic field normal to M, can be formulated indistinctly either as a variational problem, as an optimal control problem or as a problem of connections on principal bundles. In this paper we take the calculus of variations' approach to discuss the problem. Let us choose the coordinate chart (x, y) on M in such a way that the magnetic field B is written locally as (0,0, Bz). We shall consider two cases, namely Polynomial magnetic fields The function Bz (x) is a polynomial in x Linear magnetic fields The function Bz(x,y)
is linear in both x and y
If m denotes the particle's mass, the geometrical problem consists in the calculation of the extremum of the action
S=jJ(x>+y>)dt,
188 subject to nonholonomic constraints which correspond to particular gauge choices for the magnetic field vector potential. 3
Polynomial Magnetic Fields
Several choices are possible to define a polynomial Bz(x), in particular we consider nonholonomic constraints defined by means of the following Pfaffian system ojk=d9k-l-xkdy,
k = l,...,N
+ l,
(3.1)
where N is the degree of Bz(x). These 1—forms have a well defined degree. Setting deg(x) = deg(y) = 1, we get deg(uik) = k + 1. Particular cases for which some of these forms are absent could be of some interest. This variational problem lead us to a rank-two distribution A defined on the N + 3-dimensional manifold P, with coordinates q = (x,y, 6\,... , #;v+i), as explained before. The distribution A is given by the vector fields
Xi = dx, N+l
X2 = dy+Y^
-^dOi.
(3.2)
This distribution is bracket generating and its growth vector is generically written as ( 2 , . . . , N + 3); the last entry can be smaller under some circumstances. The Lie brackets lead to a nilpotent Lie algebra denoted as Q, with the following commuting relations [Xi,X2] = X3, [Xi,X3] = X4,
[XI,XN+2]
with
= Xtf+3,
189
N+l
1
The vector field XN+3 — 86N+I is central, and the subalgebras generated by the vector fields {Xk, k ^ 1} are Abelian. The subalgebra generated by {Xi,X3,Xi,... ,XN+3} is a non-Abelian ideal, whereas those generated by {Xi,Xi+\,... ,XN+3} are Abelian ideals. Let * be the product on P defined as ( a i , a 2 , . . . ,aN+3)*(a[,a'2,...
,a'N+3)
= (a'{,a2',...
,"#+3)
where a" = a i + a[, "2 = a 2 + a 2 ,
fc-i
ai' = a*+aj;+ V 71
1
^a?~X>
k = 3,...,N + 3
The manifold P becomes a Lie group with this product and we have Theorem 3.1. The group (P, *) acts on itself by left multiplication leaving the vector fields {X\,..., Xw+3} invariant Proof. It follows directly from the definition of *-left invariant vector fields.
• From the physical point of view we can write the following Corollary 3.2. The dynamical system defined by the metric x2 + y2 and the non-holonomic constraints u>i = dBi — \xldy, i = 1 , . . . , N + 1 is invariant under the left action of P. Remark 3.3. Observe that the vector fields {X{} are not left invariant with respect to left translations of the group law locally obtained by the exponential map, that is a" = ai + a[,
a'2' = a2 + o>'2, a'l = a3 + a'3 + -(ctict'2 — 0.^2), • • • ,etc.
190 However, any linearly independent set of vector fields left invariant with respect to this group law, can be expressed as linear combination of the basis {Xi} above. These linear combinations determine a map from the tangent space to itself and are directly related to the gauge invariance of the magnetic field vector potential. Clearly, the set of all linear maps of the corresponding vector space forms a group. Accordingly, there is set of related Pfaffian systems corresponding to the same dynamical system. We are dealing here with the group of automorphisms Aut(g) of our nilpotent Lie algebra g.
3.1
Goursat systems
The Goursat normal form for a Pfaffian system has been in the control theory literature under the name of chained form systems. This normal form has been used in the study of planning problems with non-integrable velocity constraints, see 16 . A Pfaffian system on a N + 3 dimensional manifold P system is in the Goursat normal form, if there exist a coordinate chart q = (x,y,ui,... ,UN+I) on which the system is written as follows
du\ — ydx du2 — u\dx \
(3.3)
dv,N+l ~ Ujydx
This system, together with a flat Riemannian metric on P, is equivalent the system discussed above, in the sense that it leads to the same equations of motion and it is possible to give an analytic map from one set of variables to the other. We have the following Proposition 3.4. The Lie algebra determined by system (3.3) is isomorphic to the Lie algebra g. Proof. In fact, the kernel of the above Pfaffian system is encoded in the following differential system
q=
xZ1(q)+yZ2(q),
defined by the following vector fields
191 N
Z\ = dx + ydui + 22 Uidui+x t=i
Z2=dy The non-zero Lie brackets are the following [Z1,Z2] =
-du1=:Z3
[ZuZk]
= (-l)* + 1 0u f c _i =: Zk+1
[Z1:ZN]
= (-l)N+1duN
=: ZN+1
Therefore, the Lie algebra coincides with Q above as expected. 3.2
(3.4)
•
Sub-Riemannian geodesies for polynomial fields
In this section we shall integrate the extremal equations. This equations are written in the total space P, and project down to the base manifold M. That is, consider a curve C : [0, Tc] -+ M, t »-> C(t), with initial point {xo,Vo) = C(0) and with p0 in the fiber 7r -1 (xo,yo). The parallel transport of po along C is given by the curve defined by dC{t)/dt = apdC(t)/dt, with (7(0) = po and C(t) = p. The curve C projects by 7r onto the curve C. As usual in optimal control problems, there are two types of extremal curves, normal and abnormal, in the present context the latter correspond to those extremals which do not depend on the kinetic energy. For details on abnormality we refer the reader to R. Montgomery's survey 14 . Proposition 3.5. The normal extremals project to the curve t i-> C(t) = (x(t),y(t)) which satisfies the system x = jj =
qBz(x)y -qBz{x)x
with qBz(x) a polynomial in x. And consequently can be integrated by quadratures. Proof. For the Pfaffian system (3.1) and the metric given by the kinetic energy, the Lagrangian is given by
192
C = A 0 y ( x 2 + y2) + j ; \k0k
-
-xky),
k=i
where the A*, A; = 1,...,JV + 1, are Lagrange parameters, in general time dependent, and the parameter Ao is a constant, which can be taken equal to 1. The Euler Lagrange equations readily yield Aj
Et JTWX
mx = my =
. (i-iyx
j_i .
y
'
x
'
Ai = 0, consequently, all Lagrange parameters A; are constant. The above equations are just Lorentz' equations for a particle charge q with mass m, under the influence of the magnetic field given by B = (0,0, Bz)1 and qBz = — ]T^ (i-\)\x%~1 •> which i s a polynomial of degree N in x. The horizontal trajectory C(t) is given by the solutions x(t) and y(t) and by the solutions of the Pfaffian system 0* — \xly = 0, for i = 1 , . . . , N + 1 , as required. In order to integrate the equations we parameterize the trajectories by arclength, write the field as ^qBz = ^ bix1, and set, without loss of generality, the coordinate chart in such a way that (xo,yo) is the origin in the plane, we have then N
bixly,
x = ^2 i=0
d
TT-^
bi
i+1
«=0
so that integrating N
h °i
E TT\ i=0
and then
X
i+1
'
193 h a2N+2X27V+2
£ = a0 + a\x +
with £ = x and a set of known constants a i5 i = 1 , . . . , 2N + 3. Some of these constants are the following
a0 = (iof, ai = 26 0 «/o, a-2 = hy0 - b%,
2bNbM-i
&2N+1
a2N+2
N(N + 1) _
b%
The above equation can be formally integrated by means of the following hyper-elliptic integral
t(x) =
/ (0) y/a0 H
h a2N+2C2N+2
'
the associated inverse functions are thus the solutions of the above algebraic equation and the integration is complete.
•
Remark 3.6. The coefficients Oj can lead to singular algebraic curves for certain initial conditions. For example, for XQ = 0 and bo = 0, i.e., ao = 0 = a\, the curve has at least a double point for N > 0. Since qBz{x) = bo + b\x + • • •, this example corresponds to a trajectory along a line (the yaxis) for which the magnetic field is zero. Since in this case the remaining parallel lines on which the magnetic field is zero can be at any other values of x, then the same applies for all straight lines on which B is zero. We now discuss abnormal extremals for polynomial fields. We consider in this case the Goursat normal form for the Pfaffian system which is equivalent to our system as was mentioned above. We take Liu and Sussmann's definition of strict abnormality, see 10 Definition 3.7. A strictly abnormal extremal of a sub-Riemannian manifold, is an abnormal extremal that is not normal.
194
Proposition 3.8. Abnormal extremals for polynomial magnetic fields in one variable correspond to straight lines on M, and consequently they are not strictly abnormal. Proof. In this case we have the Lagrangian JV+l
£ abnormal G
\
"* \ / -
=2-,
Afc Ufc
'
„.
*\
~uk+lx),
i=l
and the Euler-Lagrange equations reduce to qBz(x)y
= 0,
qBz(x)x
= 0,
where again B = (0,0, BZY and qBz = \N+I, with constants of integration a. The non-trivial solutions are precisely the set of points for which Bz(x) = 0, which corresponds to straight lines, i.e., x = y •=• 0, in base space. Therefore, the abnormal extremals are not strictly abnormal. D Remark 3.9. We observe that, the equations of motion resulting from other gauges are equivalent to these obtained equations above. Furthermore, the extremal for the case n = 1 corresponds to the famous example of R. Montgomery and I. Kupka, see 9 . 3.3
Engel and Martinet systems
There are two particular cases which has been extensively studied by Y. Sachkov 17 , R. Montgomery 14 and B.Bonnard et al x , they are known as Engel and Martinet systems. Engel case In this case one considers the polynomial gauge for N = 1 and writes z = d\ and w = 82 in (3.2). The resulting vector fields are the following
195
Xi = dx, x2 X2 = dy + xdz + —dw, X3 = dz + xdw, Xi = dw, these vector fields yield a basis for the so-called Engel algebra
[X\, X2] = Xz,
[Xi, Xz] = Xi.
This algebra is four dimensional and the associated one-forms are of Goursat type. The growth vector of the distribution {X\,X2} is (2,3,4). The corresponding magnetic field is ^-Bz = b^x + boMartinet case In this case, one takes ^Bz = bix, that is, zero constant magnetic field contribution. From the two 1—forms of the Goursat system resulting for N = 1 in (3.1) only that corresponding to the single Martinet 1—form dw = dz — \x2dy is considered. The growth vector is here (2,3) for x 7^ 0 and (2,2,3) for x = 0. Notice that the reduced vector fields still satisfy the Engel algebra. Remark 3.10. R.Montgomery uses also the approach of classical particles in magnetic fields to describe the Martinet case, see 1 4 . 4
Linear magnetic fields, and Cartan's five dimensional case
We study now the system corresponding to Bz(x,y) being a linear function in the two variables x and y. This apparently simple case, leads naturally to the famous Cartan five dimensional Pfaffian system 4 . According to the last section, the points on which the magnetic field is zero are always straight lines parallel to the y—axis. Such a selection is in fact particularly simple, however, the generic case would be straight lines which are not necessarily parallel to the vertical axis, but have a nontrivial slope. For this purpose we take the function ^Bz with a2 + P2 = 1.
= b1(ax + l3y) + bo,
196
This case does not reduce to the case TV = 1 of the Goursat system (3.3) d>i = du — zdx u>2 = dz — ydx,
(4.1)
since we must add the following 1-form
Q3 = dv - -zdy.
(4.2)
The vector fields of the rank-two distribution denoted as V, corresponding to these forms are the following Xi — dx + zdu + ydz X2 = dy+
Z
-dv.
(4.3)
Let us denote as ns the nilpotent five dimensional Lie algebra generated by T>, the non-trivial Lie brackets are as follows [xltX2]
=^dv-dz=:X3
[x1,X3]=du=:Xi
(4.4)
[x2,X3] =8v=:X5. The growth vector of T> is (2,3,5). This system is usually called a system of Cartan type, it has been largely studied beginning with E. Cartan 4 , who showed their relation with the exceptional Lie algebra Q2- A modern treatment can be found for instance in 3 . The Lie algebra ns is more general than the algebra (3.4) corresponding to N = 1, since it contains the additional (central) generator X&. Let us denote as N5 Lie group corresponding to ns. This group can be represented by M5 endowed with the following group law
(ai,a 2 ,a3,0!4,a:5)(/3i,/?2,/33,/?4,/?5) = («i + / ? i , a 2 + /32,a'3,a'4,a'5), here
197 a3 = a3 + fa + - ( Q I / ? 2 -
a2Pi)
a'i = a 4 + Pi + 2("i^3 - a3Pi) - — (/?i - a i ) ( a 1 ^ 2 - a 2 f t ) a^ = a 5 + ^5 + - ( a ^ - 0:3^2) - y ^ C ^ ~ a2)(ai/S 2 -
a20i).
The group JV5 acts on itself by left multiplication leaving the following two vector fields invariant
x,=fc-|«,-(j| + i)a.-£*,. These vector fields span the kernel of the Pfaffian system wi = dz + %(ydx — xdy), Z
X
u2 = du + -dx + jz(ydx Z
- xdy),
(4.5)
1J
u3 = dv + -dy + jziydx
-
xdy),
that we call the Cartan Pfaffian system. A direct calculation shows that
[xl,x2] = dz + ^du + y-dv.= x3. Therefore taking X4 := X4 and X5 := X5, we recover the commuting relations (4.4) for the vector fields Xi, i = 1 , . . . , 5. In summary we have the following Proposition 4.1. The group N$ acts on itself by left multiplication leaving the vector fields Xi,i = 1 , . . . , 5 invariant. Moreover the Lie algebra generated by {Xi) is isomorphic to ns 4-1
The action of the semi-direct product N$ x S 0 2
As in the Heisenberg case there is a rotational action on N5, which is encoded in the left invariant vector field W = xdy — ydx + udv — vdu,
198 which satisfies the following commuting relations [XUW}= X2, [X2,W] = -X1, [X4,W}= X5, [X5,W]=-Xi, these brackets together with (4.4), allow us to consider the semi-direct sum ns ® s so 2 , from which the action of iV5 x SO2 becomes transparent. As usual, we take the product (x, z, w, R) • (x1, z',, w', R!) to be equal to (x+Rx*,z + z'+^x*JRxH,w with R,R' e S 0 2 ,
+ Rw' + hz'x-zRxH)
&-{x,y),
uf = (u,v),
+
^x4JRx'(x-R&),RR')
z £ E, and
We have the following Theorem 4.2. The group (iV5 x SO2, •) acts on N$ leaving the Cartan Pfaffian system (4-5) invariant. 4-2
Sub-Riemannian geodesies for linear magnetic field
As in the last section we take a curve C : [0, Tc] -> E 2 , t i-> C(t), with initial point (a;0, yo) = C(0) and with p0 in the fiber 7r_1 (XO, yo)- The parallel transport determines the curve C which projects by it onto C. Proposition 4.3. The normal extremals project to the curve t t-^ C(t) — (x(t),y(t)) which satisfies the system x =
m
—Bz(x,y)y
V=
Bz(x,y)x m with qBz(x,y) linear in x andy. The complete system in E 5 can be integrated by quadratures. Furthermore there are not strictly abnormal extremals. Proof. The Lagrangian for the non-holonomic constraints in this case is given as follows
199
Cc = A o y ( i 2 + f) + ^(u+-x
+
+ Ai(i + - ( i / i - xy)) ^x-^)
+
X3(v+-y
+
—
--y),
we take first the normal case, Ao = 1, to obtain easily x =
—Bz{x,y)y
V
Bz(x,y)x m with ^-Bz = bi(ax +/3y) + bo as required. Furthermore, the remaining EulerLagrange equations yield
z .
x .
z .
y
This system can be written in a simpler way by means of the following rotations
" P\ fx\ -fx\
( a 0\ [A
-fu\
-/3a)\yJ-\Y)'
\-/3aj{vJ
\V J
to obtain X = {hX + b0)Y, Y = -{hX + b0)X, Z=±(XY-YX),
Z V
Y
•
(4.6)
Y +
•
= -2 -6ZThe equation for X is just an elementary elliptic curve and its solution can be easily given in terms of doubly periodic functions. The corresponding algebraic equation takes the form
200
£2 = a 0 + a i X + a2X2 + a 3 X 3 + a 4 X 4 , here ao = X$, a\ = 2boYo, a2 = biYo — b\, a 3 = — &0&1 and 04 = —b\/A. The algebraic equation can be given in terms of the Weierstrass p-function by a linear fractional transformation X =
ap + b ;, cp + d
ad — be = 1,
where the p-function satisfies (p') 2 = 4 ( P - ei)(p - e 2 )(p - e 3 ) for ei, e 2 and e 3 constants. The function y ( i ) can be also obtained in terms of elliptic functions as follows
= Y0t- f (bo + h—^T.
2 the integration can be performed again analytically. For the old variables (x,y), the straight lines for which ^Bz = b\{ax + 0y) + bo vanishes are particular solutions of the equations of motion for the normal extremals. Since for the abnormal extremals, Ao = 0, the same straight line solutions are obtained, we conclude that also in this case the abnormal extremals are not strictly abnormal. Further, every horizontal vector is tangent to an abnormal extremal for this distribution.
•
Remark 4.4. Not surprisingly, in the coordinates (x,y,z,u) the system (4-6) corresponds to the case of an original Engel structure. In general, if the magnetic field does not variates strongly as a function of the coordinates, this case corresponds to the first order approximation. Remark 4.5. A simple characterization of the above elliptic curve is obtained in terms of the cross-ratio
ei - e 3 Most important is the fact that the modular invariant
201
tf_ (A2 - A + l ) 3 32 ' A 2 ( A - 1 ) 2 ' is a true modulus, that is, it can be used to define an equivalence between curves having the same J. If e? — ez,e\ = e-i or e\ — e$, the cross-ratio takes the values (0,1, oo) and J = oo. The curve degenerates in equivalent stable singular cubic curves. Topologically, these curves are associated with a sphere with two points identified. In our problem, if we let b\ —> 0 the curve degenerates in a rational curve. Acknowledgements Part of this work was done during a sabbatical stay of the first Author at the Arnold Sommerfeld Institute, Clausthal, Germany with support of DAAD. References 1. Bonnard B., Chyba M. and Kupka I., Lieu conjugue en geometrie sousRiemannienne. Le cas Martinet plat, Rapport de Recherche No.90, University de Bourgogne, Laboratoire de Topologie, Dijon, Prance, (1996). 2. Brockett R.W., Control theory and singular Riemannian geometry, in New directions in applied mathematics, Hilton, P.J. and Young G.S. Eds., (Springer-Verlag, 11-27, 1981). 3. Bryant R., Hsu L., Rigidity of Integral Curves of Rank-two distributions, Invent. Math., 114, 435-461 (1993) 4. Cartan E., Les systemes de Pfaff a cinque variables et les equations aux derivees partielles du second ordre, Ann. Sci. Ecole Normale, 27, No.3, 109-192 (1910). 5. Chandrasekhar, S., Plasma Physics, (The University of Chicago Press, Chicago, 1960). 6. Chow, W.L., Uber Systeme von Linearen partiellen Differentialgleichungen erster Ordnung, Math. Ann., 117, 98-105 (1939). 7. Gromov M., Carnot-Caratheodory spaces seen from within, in SubRiemannian geometry, Eds. A. Bellaiche, J.J. Risler, (Birkhauser, 79323, 1996). 8. Kupka I., Geometrie sous-Riemanniene, Seminaire Bourbaki, (1996). 9. Kupka I., Abnormal extremals, preprint, (1992). 10. Liu W., Sussmann H., Shortest paths for sub-Riemannian metrics on rank-two distributions, Memoirs of the American Mathematical Society,
202
118, No.564, (1995). 11. Monroy-Perez F. and Anzaldo-Meneses A., Optimal control on the Heisenberg group, Jour. Dyn. Control. Syst, 5, No.4, 473-499 (1999). 12. Monroy-Perez F. and Anzaldo-Meneses A., Optimal control on certain nilpotent groups, preprint, (1999). 13. Montgomery, R., The Isoholonomic Problem and some Applications, Comm. Math. Phys., 128, 565-592 (1990). 14. Montgomery, R., A Survey of Singular Curves in Sub-Riemannian Geometry, Jour. Dyn. Control. Syst, 1, 49-90 (1995). 15. Montgomery, R., A tour of Sub-Riemannian Geometries, Geodesies and Applications, to be published, (2000). 16. Tilbury D., Murray R., Sastry S., Trajectory Generation for the N-trailer Problem Using Goursat Normal Form, IEEE Transactions on Automatic Control, 40, 5, 802-819 (1995). 17. Y.L. Sachkov, Symmetries of Flat Rank Two Distributions and SubRiemannian Structures, Rapport de Recherche No.151, Universite de Bourgogne, Laboratoire de Topologie, Dijon, France, (1998). 18. Spivak M., A comprehensive Introduction to Differential Geometry, Vol. II, (Publish or Perish, 1979). 19. Strichartz R.S., Sub-Riemannian Geometry, Journal of Differential Geometry, 24, 221-263 (1986).
203
TOPOLOGICAL V E R S U S S M O O T H LINEARIZATION OF CONTROL SYSTEMS LAURENT BARATCHART INRIA, B.P. 93, 06902 Sophia Antipolis cedex, France E-mail: Laurent. [email protected]
Dept. of Mathematics,
MONIQUE CHYBA Univ. of California, Santa Cruz, CA 95064, U. S. A. E-mail: [email protected]
JEAN-BAPTISTE POMET INRIA, B.P. 93, 06902 Sophia Antipolis cedex, France E-mail: [email protected] This note deals with "Grobman-Hartman like" theorems for control systems (or in other words under-determined systems of ordinary differential equations). The main results (proved elsewhere) is that when a control system is topologically conjugate to a linear controllable one, then it is also "almost" differentiably conjugate. We focus on the meaning of this result, and on an open question resulting from it.
1
Introduction
In this note, we discuss the local behavior of a nonlinear control system x = f(x,u),
xeW1
, u£Rm
,
(1.1)
say around (0,0) € E n + m . For general control systems (as opposed e.g. to affine in the control), "local" has to be understood with respect to both state and control. The first reaction when dealing with local properties is to compute the linear approximation of (1.1). When this linear control system happens to be controllable, all the local usual control objectives can be met using linear control, based on the linear approximation. For instance, a linear control that asymptotically stabilizes the linear approximation will also stabilize the nonlinear system, locally; minimizing a quadratic cost can also be achieved up to first order based on the linear approximation only. Hence, the linear approximation is a good enough model for the purpose of designing controllers achieving a desired behavior for small states and controls. We believe that all control engineers or control theorists agree on this statement, arising from practice, although we would welcome some contradiction.
204
Rephrasing the above statement without reference to control objectives leads to an imprecise statement, grounded mostly on some necessarily subjective intuition, and that should rather be taken as an opening sentence to launch a debate than as a conjecture : (nothing distinguishes qualitatively the behavior of a nonlinear control ]system from the one of its linear approximation if the latter is controllable. (1.2) It is natural to try to formalize this statement, as a prerequisite to any proper theory of nonlinear modeling and identification of control systems, in a very preliminary manner since it only deals with local phenomena. A nice way to turn that belief into a sound, and correct, assertion would be to find some equivalence relation between control systems (or models) that preserves at least "qualitative" behavior, and for which these two systems (a nonlinear system and its controllable linear approximation) are in general equivalent. We assume controllability of the linear approximation because when this fails none of the above is correct, at least in most common case when the nonlinear system is itself controllable. Indeed, (non-)controllability is a qualitative phenomenon : for instance, feeding a linear non controllable system with "random" inputs, one observes that the state is confined in leafs of positive codimension, while for a controllable system the whole state space is explored.. To enlighten the discussion on local behavior of control systems, let us recall the situation for ordinary differential equations x = F(x) (particular case of (1.1) where the control u has dimension 0) : • If F(0) ^ 0, the "flow-box theorem" (see e.g. 1 ) , gives local coordinates,
(o\ smooth if F is smooth, in which F is of the form
:
w
• If F(Q) = 0 and the square matrix F'(0) has no pure imaginary eigenvalue (hyperbolic equilibrium), then Grobman-Hartman Theorem 5 tells us that the flow of the differential equation is locally conjugate to the flow of its linear approximation via a homeomorphism that need not, in general, be smooth if F is smooth (and in fact smooth conjugation requires more assumption, resonances are obstructions to it) (see e.g. 2 ) . • If -F(O) = 0 and the square matrix -F'(O) has some pure imaginary eigenvalue, then the situation is more intricate even locally, namely the phase
205
portrait of the nonlinear dynamical system x = F(x) can be very different locally from the one of a linear system. This case is of high interest in the theory of dynamical systems, but can be considered as "degenerate", in the same way as non controllability of the linear approximation for control systems. Since conjugation of flows does preserve qualitative phenomena like the overall aspect of the phase portrait, one can indeed assert that, locally around all points except non hyperbolic equilibria, a differentiable dynamical system "behaves like" a linear one, and this is translated by conjugation via a homeomorphism, although conjugation via a smooth diffeomorphism preserves some more subtle local invariants (resonances, e t c . ) . Coming back to control systems, first of all, the equivalent of conjugation by a smooth change of coordinates is (smooth) feedback equivalence, whose study was initiated in 4 , see a survey in 7 . In fact this is conjugation via a smooth diffeomorphism on the state and control, forced to have a triangular structure (see Proposition 2.5 below). The conditions under which a control system (1.1) is smoothly feedback equivalent to a linear controllable one are well known 8 ' 6 (and contrary to the case of ordinary differential equations, they are very simple), but they imply that very few nonlinear systems are locally feedback equivalent to a linear one, even when the linear approximation is controllable. This remark and the review of the situation for ordinary differential equations naturally brings about the question whether for control systems, relaxing the regularity of the conjugating maps, i.e. considering conjugacy by homeomorphisms instead of smooth diffeomorphisms would make more systems equivalent to a linear one. After recalling some basic facts in section 2, we give in section 3 an essentially negative answer to the question evoked above, based on quoting a result to appear in 3 , that topological conjugacy to a linear controllable system implies conjugacy by "almost" smooth feedback (but the gap is really small). Section 4 recalls, also from 3 , a technical open question that would allow a nicer result and a nicer description of that "almost" smooth conjugacy, and finally section 5 extends the discussion of the results from section 3, their implications, and the questions they raise in nonlinear modeling.
206
2 2.1
Preliminaries on equivalence of control systems Definitions
Consider two smooth control systems with state x (resp. z) and input u (resp. v):
x = f(x,u), x e i " , u e r , 1
z = g{z,v),
1
zeW '
, veW '
(2.1)
,
(2.2)
or, expanded in coordinates, Xi=fi(xi,...
,xn,ui,...
,um),
Zj=gj(zi,...
,zn-,vi,...
,vm<),
1 < i < n, 1 < j < n', with the /j's and g^s some smooth (i.e. C°°) maps. We assume that / and g are defined respectively on the whole of E n x Km and E" x E m because it simplifies many of the statements below; this is actually no loss of generality to us for all the results we prove are local with respect to x,u,z,v, so that / and g can be extended using partitions of unity outside some neighborhoods of the arguments under consideration without affecting the results. Definition 2.1. By a solution of (2.1) that remains in an open set ft C jjn+m ^ we mean a mapping j defined on a real interval : 7
: 7
~* ^
(2 3)
with 7i(t) € E" and 71 (t) £ Rm, such that 7 is measurable, locally bounded, 71 is absolutely continuous and, whenever [Ti,T2] C / , we have : 7i(T 2 ) - 71(^1)
=
/
2
f(li(t), Mt))
dt .
Solutions of (2.2) that remain in Q,' C M" + m are likewise defined to be mappings 7' : / —> ft' having the corresponding properties with respect to g. We now define the notion of conjugacy for control systems. Definition 2.2. Let X :
U
^n> (x,u) i-> x(x,u)
(24) = (xi(x,u),
XTL{X,U))
be a bijective mapping between two open subsets of ffira+m and K™ + m respectively. We say that \ conjugates 7 : 7 — ^ 0 and 7' : I —• ft' if and only if i = x ° 7-
207
We say that \ conjugates systems (2.1) and (2.2) if, for any real interval I, a map 7 : / —> SI is a solution of (2.1) that remains in SI if, and only if, X ° 7 is a solution of (2.2) that remains in SI'. We say that systems (2.1) and (2.2) are locally topologically conjugate at (0,0) if we can chose SI and SI' to be neighborhoods of the origin and x a homeomorphism. We say that they are locally smoothly conjugate if, in addition, x and X"1 are smooth. Here the word smooth means C°°. In case there is no control, so that m = m' = 0 and we omit u and xn, Definition 2.2 coincides with the classical notion of local topological conjugacy for non controlled differential equations, and may serve as a definition in this case too. Let us write more formally the classical local results on ordinary differential equations that we recalled in the introduction : Theorem 2.3 (Flow-box theorem). If m = 0 and /(0) ^ 0, system (2.1) is locally smoothly conjugate at 0 to the linear system z\ = 1, z^ = • • • = zn = 0. Theorem 2.4 (The Grobman-Hartman theorem). If m = 0, /(0) = 0, and /'(0) has no pure imaginary eigenvalue, system (2.1) is locally topologically conjugate at 0 to the linear system z = f'(0)z. Prom now on, we consider control system, i.e. we assume m > 1. 2.2
Some properties of conjugating maps
It turns out that conjugating homeomorphisms preserve the dimension of both the state and the control and must have a triangular structure : Proposition 2.5. With the notations of Definition 2.2, suppose that (2.1) and (2.2) are topologically conjugate via a homeomorphism x '• fi —> fJ'. Then n = n', m = m', and xi depends only on x: X{x,u)
=
( x i ( z ) , Xn(x,u))
.
(2.5)
Moreover, xi : ^R" —>• ^R™ is a homeomorphism. Proof. Let x, u, v! be such that (x, u) and (x, u') belong to fl. Let further x(t) be the solution to (2.1) with a;(0) = x and u(t) = u for t < 0 and u(t) = u' for t > 0. By conjugacy, z(t) = xi{x(t),u(t)) is a solution to (2.2) with v given by v(t) = xn(x(t),u(t)), for t 6 (—e,c) and some e > 0. In particular Xi(x(t),u(t)) is continuous in t so its values at 0 + and 0 - are equal. Hence Xi(x,u) = xi{x,u') so that xi : ^K" -> 0^„< is well defined and continuous. Similarly, ( x - 1 ) j induces a continuous inverse SI' , -» OR». • In view of Proposition 2.5, we will only consider conjugacy between systems having the same number of states and inputs. Hence the distinction between
208
(n,m) and (n',m') from now on disappears. Taking into account the triangular structure of \ m Proposition 2.5, one may describe conjugation as the result of changing coordinates in the statespace (by setting z = xi(x)) a n d feeding the system with a function both of the state and of a new control variable v (by setting u = (x - 1 )n(z,^)), in such a way that the correspondence (x, u) i-> (z,v) is invertible. In the language of control, this is known as a static feedback transformation, and two conjugate systems in the sense of Definition 2.2 would be termed equivalent under static feedback. This notion has received much attention, although only in the differentiable setting (i.e. when the triangular transformation ^ is a diffeomorphism), see e.g. 4 ' 7 . 2.3
Linearization
Recall that / is assumed to be smooth (of class C°°). Let us make a formal definition of topological and smooth linearizability. Definition 2.6. The system (2.1) is said to be locally topologically linearizable at (x, u) € R n + m if it is locally topologically conjugate, in the sense of Definition 2.2, to a linear controllable system z = Az + Bv. Definition 2.7. The system (2.1) is said to be locally smoothly linearizable at (x,u) € R" + m if it is locally smoothly conjugate, in the sense of Definition 2.2, to a linear controllable system z — Az + Bv. Explicit necessary and sufficient conditions for a nonlinear system to be locally smoothly linearizable at a point were given in 8 ' 6 , and also in 9 (the previous two references dealt with control affine systems only), and is recalled in many nonlinear control textbooks. Without mentioning these conditions, let us simply say that they require a certain number of distributions to be involutive, and that this is a very non-generic property. 3
Main result on topological linearization
Let us now give a —basically negative— answer to the natural question raised at the end of section 1 : for control systems, removing the differentiability requirement on the conjugacy does not allow many more control systems to be (topologically) conjugate to a linear controllable system, contrary to the situation of ordinary differential equations (without control), (see section 1 and Theorems 2.3 and 2.4). Recall that / is assumed to be smooth (of class C°°).
Theorem 3.1 ( 3 ). System (2.1) is locally topologically linearizable at (0,0) if, and only if there exists an open neighborhood ft of (0,0) in Rn+m and a
209 homeomorphism
x:
ft
-> fv
_
/ 3 Xx
(a;, w) (->• x(x,u) = ( x i ( z ) , Xn(z,")) (possibly different from the homeomorphism defining topological linearizability of the system) such that 1. x conjugates system (2.1) to a linear controllable system z = Az + Bv, in the sense of Definition 2.2, 2. Xi : H n —> Q'n defines a smooth (C°°) diffeomorphism. This does not state that topological linearizability implies smooth linearizability for x need not be a diffeomorphism even though xi is. In 3 , the conclusion of the theorem is called quasi smooth linearizability. A thorough discussion as well as the proof of Theorem 3.1 is given there. Let us recall here what is necessary to make this theorem clearer. Proposition 3.2. The conclusions of Theorem 3.1 imply that 1. BXTL • 0 -*• Km is smooth,
2. the rank of B is the maximum rank of df/du the origin.
in small neighborhoods of
Proof. Computing z at the origin of a trajectory starting from (x,u) 6 0 implies, by the smoothness of xi, ^-(x)f(x,u)
=
Axi(x) + Bxn(x,u) .
(3.2)
This gives an obviously smooth expression of Bxn- The second point is proved using Corollary 3.5° at points close to the origin where the rank of df/du is maximum, and hence locally constant. • If B is left invertible (i.e. has rank m), the first point implies that xn itself is smooth, and we have the following immediate corollary : Corollary 3.3. If there are points arbitrarily close to (0,0) where the rank of df/du is m (i.e. where this linear map is injective), then x in Theorem 3.1 is a smooth mapping. Of course if B has rank strictly less that m, xn need not be smooth. This is discussed in section 4. " T h e proof of Corollary 3.5 does not use Proposition 3.2.
210
Note that the assumption of Corollary 3.3 is very "reasonable": for instance for single input systems, the only case where it is not met is when / does not depend on u in a neighborhood of (0,0), but then the system cannot be topologically conjugate to a controllable linear system : Corollary 3.4. If m = 1, i.e. if (2.1) is a single input system, then \ »" Theorem 3.1 is a smooth mapping. This is however still not "smooth linearizability" because even though \ is smooth, its inverse might fail to be differentiable at the point of interest. The simplest example is the system x = u3, x e E , i i £ l
,
(3.3)
3
clearly conjugate by (z, v) = \(x, u) = (x, u ) to the linear controllable system z = v. Obviously, x ls smooth, x _ 1 is continuous, xi is the identity smooth diffeomorphism, but the inverse of x itself fails to be differentiable at the origin. In fact, no smooth diffeomorphism can conjugate these two systems. This can easily be proved but is also a consequence of the necessity part of the following result that tells us exactly when smooth linearizability is implied by topological linearizability : Corollary 3.5. When f is of class C°°, system (2.1) is locally smoothly linearizable at (0,0) if and only if it is locally topologically linearizable at (0,0) and the rank of df/du is constant around (0,0). Proof. Smooth linearizability is a particular case of topological linearizability, and it implies constant rank of df/du because differentiability of the smooth diffeomorphism and its inverse allow one to get a formula for df/du(x,u). Let us prove the converse. Suppose that the rank of df/du is r < m in a neighborhood of (0,0) and that system (2.1) is locally topologically linearizable at (0,0). From Theorem 3.1, this implies that there exists a triangular homeomorphism (x,u) >-)• (z,v) = x(x,u) = (xi(x),xn(x,u)) that conjugates system (2.1) to a linear controllable system z = Az + Bv with the additional property that xi defines a smooth diffeomorphism from a neighborhood of 0 € M™ onto its image. Let r' < m be the rank of the matrix B. There are invertible n x n and m x m matrices P and Q such that ' Ir, Bc
= PBQ
=
(3.4) 0
V where Iri is the r' x r' identity matrix.
o\ 0
211
Computing z at the origin of a trajectory starting from (x, u) £ Q implies (3.2) by the smoothness of xi- Hence the map BcQ~xXi is smooth where it is denned, and differentiating (3.2) with respect to u yields :
Since P^-(x) is invertible and the rank of df/du is r, both sides have constant rank r. This implies r < r'. This also implies that the mapping Rn+m _j. K n+ r ' defined by (x,u)
H->
(x,
BCQ~1XTL{X,U))
,
has a constant rank n + r i n a neighborhood of the origin, hence, by the constant rank theorem applied to this mapping, there i s a r x m matrix K of rank r (selects r lines that are independent among the m lines of (x,u)), 9u n+m r a neighborhood ft of (0,0) in R , two smooth mappings a : K"+ -> Rn+m and ft : Rn+m -> Mr ~ r (in fact they only need to be defined in suitable neighborhoods of the origin) such that x is defined on fi, and BcQ~lU-,u)
=
(«(*,Jrfleg-ix.(z,«))J
(3.5)
for all (a;, u) £ fi and (x,u)
*-> ( a r . J f B c Q ^ x i t a r . u J . ^ i . u ) ) ,
(3.6)
defines a smooth diffeomorphism from ft onto its image. This implies that r = r' because from (3.5, r < r' would prevent x from being one-to-one. Hence K can be taken the identity matrix. Define x : fi -* IR"+m by \ — L ° V' with ^(x,w)
(Fxi(a;), i f 5 c Q _ 1 x n ( 3 ; , u ) ,
=
1
0(x,u))
1
and L(z,v) = (P~ z,Q~ v). ijj is a smooth diffeomorphism because (3.6) is one, and L is obviously a (linear) smooth diffeomorphism. Setting (z, v) — x(x, u) conjugates system (2.1) to z = Az + Bv. D 4
A n open question
It is a reasonable question to ask whether the conclusion of Corollary 3.3 holds in general, namely whether Theorem 3.1 can be strengthened so as to state that x is, on top of its other properties, a smooth mapping (when the rank of df/du is not locally constant, x would fail to be be differentiable, from the necessity part of Corollary 3.5).
212
Let us examine the case where the assumptions of Corollaries 3.3, 3.4 and 3.5 fail (these three corollaries already state the desired conclusion), namely the case of systems with m < 2 controls where the rank of df/du is everywhere strictly smaller than m, studied locally around a point where this rank is not constant (i.e. the rank at the point is strictly less than the maximum rank in arbitrary small neighborhoods of this point, itself strictly smaller than m. The smallest dimensions where this occurs is n = 1, m = 2, i.e. systems x = f(x,ui,U2) with x, ui and u2 scalar. In order to state our open question in the smallest dimension possible, let us drop the dependence on the righthand side on x and consider systems x = a(ui,u2),
i £ l , u = (tti,u 2 ) £ E 2 ,
(4.1)
2
where a : E -» E is smooth. Let us assume that this system is locally topologically linearizable around (x,u) — (0,0,0). The only canonical controllable linear system with one state z € IK and two controls (v\,v2) 6 E 2 is z = vi, hence local topological linearizability means existence of a homeomorphism X : (x,u1,u2)
>-• (z,v1,v2)
= (Xi(x),
\2(x,ui,u2), an
ls
X3(x,ui,u2))
(4.2)
tnat
(in the terms of Definition 2.6, xi is xi d Xi (X2,X3)) conjugates (4.1) to the linear system z — v\. From Theorem 3.1, this implies existence of another homeomorphism x of the same triangular form, that we denote by X instead of x, such that xi is a smooth diffeomorphism (from a real interval containing zero onto an open interval) and X2 is a smooth mapping from an open neighborhood of the origin in E 3 to E, while our results do not grant that X3 has any more regularity than continuity. In fact the conjugation reads dxi -jr— (x)a(ui,u2) - X2(x,ui,u2). (4.3) This implies in particular that X2 does not depend on x, and then one can replace v2 = X3(^7^1,1*2) with v2 = X 3 ( 0 J M I > U 2 ) without changing the conjugating property. Composing x given by (4.2) with (z,Vi,v2) H* (XI _ 1 iz), i l r (Xi_1 (•z)) _1 ^i. v2), one finally gets a conjugating homeomorphism of the form (x,u1:u2)
H-» (x, a(ui,u2),
P(ui,u2))
(4.4)
where f3(u\,u2) = X3(0,ui,U2)- Hence local topological linearizability amounts to existence of a continuous mapping /? from an open neighborhood of the origin in E 2 to E such that (u\,u2) i-> (a(ui,u2),/3(ui,u2)) defines a homeomorphism from a neighborhood of the origin in E 2 onto its image (we just proved it is necessary, but conversely, it makes (4.4) a local homeomorphism, that obviously conjugates (4.1) to z = vi). Similarly, conjugacy via
213
a homeomorphism that is a smooth map amounts to existence of a smooth mapping having the same property. Hence the question whether \ c a n D e taken a smooth mapping in Theorem 3.1 reduces to the following Open Question 4.1. Let a and /3 be two mappings ] — e,e[2 —> E, e > 0, such that a is smooth, j3 is continuous, and (111,112) *-> {a(ui,U2),fi(ui,U2)) defines a homeomorphism from ] — e,e[2 onto its image. Does there exist a smooth mapping b :] - £',e'[2-t R, 0 < e' < e, such that (1^,1*2) >-> (a(ui,U2),b(ui,ii2)) defines a homeomorphism from]—e' ,e'[2 onto its image ? This question in differential topology can be posed in higher dimension of course, see below. It is of interest in its own right and seems to have no answer so far, even for p — q = 1. Open Question 4.2. Let O be a neighborhood of the origin in W+q and F : O —»• W a smooth map. Suppose there is a continuous map G : O —• K? such that F x G : O —>• Rp x l ' is a local homeomorphism at 0. Does there exist another neighborhood of the origin O' C O and a smooth mapping H : O' ->• W such that F x H : O' ->• E p x W is again a local homeomorphism atO? 5
Implications in Control Theory
Let us come back to the discussion we started in the Introduction. Consider a control system (1.1), assume for simplicity that we work around an equilibrium, i.e. /(0,0) = 0, and let us write its linear approximation, i.e. f(x,u)
= Ax + Bu + F(x,u) dF dF with F(0,0) = — (0,0) = — (0,0) = 0 ,
(5.1) (5.2)
so that the nonlinear system (1.1) reads x
= Ax + Bu + F(x,u)
.
(5.3)
From the remarks made in the introduction, the relevant situation is the one where the linear system i
=
Az + Bv
(5.4)
is controllable. Let us assume slightly more to stay away from the pathologies evoked in the previous section. The additional assumption is very mild and is, for instance, always true when the constant n x m matrix B has rank m; it is implied by linear controllability for single input systems. Assumption 1. The pair {A,B) is controllable and the rank of B + dF — (x,u) is equal to the rank of B for small (x,u).
214
The question raised in the introduction was the one of finding a reasonable equivalence relation that would make the two systems (5.3) and (5.4) locally equivalent. Comparing the situation of ordinary differential equations (without control), a candidate was local topological conjugacy as in Definitions 2.2 and 2.6, and if that candidate was successful, we would have a result making precise the vague statement (1.2). Corollary 3.5 implies that, for A, B and F satisfying Assumption 1, systems (5.3) and (5.4) are locally topologically conjugate if and only if they are locally smoothly conjugate, and it is known from 8-6 that this is false for a generic F, even satisfying (5.2). This discards topological conjugacy as a candidate for the above mentioned equivalence relation, but this does not contradict the basic belief behind statement (1.2). A way to contradict that statement would be to find at least one example satisfying the assumption, but where the nonlinear system (5.3) displays some local "qualitative" phenomenon that do not occur for the linear system (5.4). In the qualitative theory of dynamical systems (without control), the phase portrait gives a picture of the behavior, on which phenomena like attractors, invariant set, (stable) closed orbits can just be "seen". A control system is more complex : it describes how the behavior of the state (at least in the state space representation) is linked to the control. It is not very clear what a qualitative phenomenon should be for a control system. The least to require is that it be invariant by topological conjugacy as defined here. In the introduction, we pointed out that (non-)controllability is a qualitative property, but it is of no help here since (1.2) only refers to controllable systems. We do believe that clarifying the status of a statement like (1.2) is very relevant to control theory and modeling. Our negative results (section 3) say that topological conjugacy is not the right tool to answer this. A looser equivalence could be a way to state (1.2) properly. It could also be that the intuition behind (1.2) is totally wrong and that some nonlinearities F allow system (5.3) to display some qualitative phenomena locally that cannot occur on a linear system (5.4).
Acknowledgements The authors wish to thank Prof. I. Kupka, from Universite Pierre et Marie Curie (Paris VI) for many enriching discussions. Thanks are also due to Prof. C.T.C. Wall from the University of Liverpool for his comments on the "Open Question 4.2" in section 4.
215
References 1. V. I. Arnold, Equations Differentielles Ordinaires, 3 ed. (MIR, 1974). 2. V. I. Arnold, Chapitres supplementaires de la theorie des equations differentielles ordinaires (MIR, 1980). 3. L. Baratchart, M. Chyba, and J.-B. Pomet, On the Grobman-Hartman theorem for control systems, in preparation. 4. R. W. Brockett, Feedback invariants for nonlinear systems, IFAC World Congress (Helsinki), 1115-1120 (1978). 5. P. Hartman, Ordinary differential equations, 2 ed., (Birkhauser, 1982). 6. L. R. Hunt, R. Su, and G. Meyer, Design for multi-input nonlinear systems, in Differential Geometric Control Theory, R.W. Brockett, Ed., 258298, (Birkhauser, 1983). 7. B. Jakubczyk, Equivalence and invariants of nonlinear control systems, in Nonlinear Controllability and Optimal Control, Hector J. Sussmann, Ed., (Marcel Dekker, New-York, 1990). 8. B. Jakubczyk and W. Respondek, On linearization of control systems, Bull. Acad. Polonaise Sci. Ser. Set. Math., 28, 517-522 (1980). 9. A. J. van der Schaft, Linearization and input-output decoupling for general nonlinear systems., Syst. & Control Lett., 5, 27-33 (1984).
This page is intentionally left blank
217
LOCAL A P P R O X I M A T I O N OF T H E R E A C H A B L E SET OF CONTROL PROCESSES ROSA MARIA BIANCHINI Department of Mathematics U. Dini, University of Florence, Viale Morgagni 67/A, 50134 Firenze, Italy E-mail: bianchiniQmath. unifi. it This paper concerns the variational approach to the study of controls problems. We construct local approximations of the reachable set from which sufficient conditions for a trajectory to be locally controllable and necessary conditions for a reference pair to be a weak solution of a Mayer control problem with end constraints can be derived.
1
Introduction
The knowledge of the geometry of the reachable set can help greatly in several control problems. In particular what is really useful is to find conditions which the boundary points have to fulfill. For example a reference trajectory is locally controllable if its final point is not a boundary point, which surely happens if the point does not satisfies the necessary conditions. An other example is given by Mayer optimization problems without constraints on the final point: under standard hypothesis, if a reference trajectory solves the optimization problem then its final point is a boundary point of the reachable set and hence has to satisfies the necessary conditions. This is the basis of Pontriagin Maximum Principle and of its generalizations 1, 2 etc. Since it is difficult to compute directly the reachable set, one tries to approximate it near a point x by means of tangent vectors, objects which can be computed easier. But the collection of tangent vectors provides a good approximation of the set only in the case in which the boundary is sufficiently regular; for an example it works if the set is locally convex. Unfortunately the boundary of the reachable set may be irregular even in the analytic case, see 1. This has been the starting point for searching subsets of tangent vectors which could provide the informations one needs in concrete problems, see 3 , 1 and the references therein. In this paper I will start with reviewing some local approximations of the reachable set of a control process pointing out their relations with the study of some control problems. Later I will construct a local approximation of the reachable set of an affine control process near the final point of the trajectory
218
relative to the drift term; this local approximation can be used to test either the local controllability of the trajectory or its optimality respect to a Mayer control problem with constraints on the end points. In the last section I will show by means of some examples how the theory can be applied. The natural setting for this variational approach, is the one of differential geometry; the control process are family of differential equation on a differential manifold, M, finite dimensional, sufficiently regular and the tangent vectors at y is an element of the tangent space of M at y. However for simplicity sake I am going to examine the particular case in which the manifold is an Euclidean space. 2
Tangent cones
Let us consider a control process x = f(t,x,u)
u(-)EU
(2.1)
where the state x belongs to M" , the function / and the set of admissible control U are such that the following standard assumption is verified A) for each admissible control u(-) e U the Cauchy problem (x =
f(t,x,u(t))
\ X(t0) = X0
has an unique maximal solution whatever is the initial condition. We will denote by S(t,to,x0,u) the value at time t of the solution of (2.1) relative to the control u which at time to takes the value xo and by R(t, to, xo) the reachable set at time t from the initial conditions (to,£o) R(t,t0,x0)
= | S(t,t0,x0,u)
:
ueU>.
We are interested in the following problems: LC Local controllability of a trajectory, x* : t -» [£o>£i], i-e. x*(ti) € m«i?(ti,t 0 ,'a;*(
Vt > 0.
Mayer optimization problem, i.e. locally minimize the value of a C 1 functional, /o, over the set R(ti,t0,x0).
219 COP
Mayer optimization problem with constraints on the final point, i.e. locally minimize the value of a C 1 functional, /o, over the intersection of the set R(ti,t0,x0) with a given subset C.
A possible approach to these problems is to derive either necessary or sufficient conditions from a local approximation of the reachable set at the final point of the trajectory 1 4 5 6 7 . The local approximations used in the quoted papers are suitable subsets of the intermediate tangent cone, a subset of the contingent tangent cone; the difference among these papers depends on the subsets considered. I want to make some comments about this approach and I want to state the results I am going to use in the next paragraph. Let us start with the definition of the intermediate tangent cone to a subset of an eucleadian vector space, 8 : Definition 2.1. Let K be a subset of Mn and let y E K. The itermediate, or adjacent, tangent cone to K at y, IK{V) , is the set of vectors tangent at y to the curves contained in K passing from y IK(V) = [v
: y + ev + o(e) £K
ee[0,e]|.
The vectors belonging to IK{V) are named tangent vectors to K at y. For studying the LC property, the intermediate tangent cone to the reachable set at the final point of the trajectory is a too large set because it can be the whole space even if the property does not hold, see the example in x. However one can derive sufficient conditions for this property to hold by considering only subsets, H, of lR(tltt0,x0)(x*(Ti)) chosen in such a way that the following property holds: A) if the set H contains a positive basis of tangent vectors, then the point is an interior point of the reachable set. I recall that a set of vectors is positive basis of W1 if 0 belongs to the interior of convex hull of the set. Notice that condition A) could be replaced by the weaker condition: A ' ) if the set H is the whole space, then the point is an interior point of the reachable set. however property A) is more useful in the applications. Let us consider the optimal problems. The first thing to notice concerning these problems, is that the set /fi(t1,t0,a:o)(2/) is constructed by means of the points close to y which can be reached from (t0,x0) at time t\ independetly from the trajectories by means of which this can be done, meanwhile in the
220
definition of the O P and of C O P problems, locally has to be understood in the sense of the calculus of variations, i.e. either strong or weak solutions of the problem. Let me state the definitions of strong and of weak solutions which I am going to use. An admissible pair of a control process in a given interval [to,*i], is a couple (x* ,u*), x* solution relative to u* on [to>*i] • Definition 2.2. A pair (x*,u*) is a strong solutions of O P if the final point of the reference trajectory, x*(ti), is a point of minimum of the functional /o on the set of points reachable from the initial point by means of a trajectory which are near the trajectory x* in the C°([£o;*i]] topology. Weak solution of O P may be defined if it is given a topology on the set of control with respect to which the solution map is continuous. In the sequel we will assume that such a topology, II, is a data of the problem. Definition 2.3. A reference pair (x*,u*) is a weak solutions of O P if the final point of the reference trajectory, x*(ti), is a point of minimum of the functional /o on the set of points reachable from the initial point by means of pairs (x, u) belonging to a neighborhood of the reference pair in the C°([to, ti]) x U topology. For taking in account this fact, only particular tangent vectors to R(ti,to, XQ>) at x*(ti) have been considered; roughly speaking we look for the tangent vectors to the curves obtained by slightly modifying the control u*. Tangent vectors of this type are the ones introduced by A. Bressan 5 for the affine systems. I will name these tangent vectors, weak trajectory variations because they depend on the reference pair considered, not only on the final point of the trajectory. The definition of the weak trajectory variations for affine system is the following Definition 2.4. Let t \-t x*{t) be a solution of the C1 control process m
X = fo(x) + Y2Uifi(X) on the interval [to,ti] relative to the control u*. A vector v is a weak trajectory variation of (x*,y*) if there exist a family of admissible control maps, u(e) , e G [0,e] continuous in the L1 topology, such that 1. l i m ^ 0 + / £ \\u{e){s) - u*(s)\\ ds = 0 2. 5(*i,i0,a;*(*o),M(e)) = x*{tx) + ev + o(e). Notice that since the solution map of an affine control process depends continuously from the control in the L1 topology, from property 1 it follows that lime_+0+ \\S(-,t0,x*(t0),u(s)) -x*(-)))\\ = 0 in the C° norm, and hence
221
the weak trajectory variations are tangent vectors at x*(*i) to the set of points which can be reached from (to,xo) by means of trajectories which lay near the reference one. The definition of weak trajectory variations can be generalized to any control process if there exists a topology on the set of controls with respect to which the solution map restricted to a compact interval is continuous. If (x*,u*) is a weak solution of the O P problem, then Dfo{x*{h))v
>0
Vu weak variation of {x*, u*).
(2.2)
This implies that if we find a positive basis whose elements are weak trajectory variations of (x*,u*), then (x*,u*) is not a weak solution of O P . An example provides by A. Bressan in 5 proves that if there are constraints on the final point, then condition (2.2) is no longer a necessary condition for (x*,u*) to be a weak minimum. In fact in that example the reference pair is given by x*(t) = 0, t € [0,3], u* = 0; the reachable set from the origin at time 3 is i?(3,0,0) = { ( x , 2 / , 2 ) e K 3
:
xy>0}
the set of weak trajectory variations of (x*,u*) contains a positive basis, but if C = {x — —y, z = 0 } then (x*,u*) is a weak minimum of C O P independetly from the cost / 0 . The condition which a weak solution of C O P has to fulfill is £>/o(x*(ti))w>0
(2-3)
for each weak trajectory variation w , which belongs to the intermediate tangent cone of the intersection of the reachable set with the set of constraints, C. The intermediate tangent cone to the intersection of two sets does not coincide with the intersection of the tangent cones of the sets considered. This implies that (2.3) holds only for the vectors belonging to a collection of trajectory variations, H, which have the following property 1,3: B) if the convex closure of the set H cannot be separated from a convex cone V, and if coH and V are not two complementary subspaces, then the set V n co H contains at least one half line. The previous arguments lead to search for subsets of weak trajectory variations which have either property A) or property B ) . Bressan, in the quoted paper, has single out subsets of weak trajectory variations which have both the properties. A point to underline about this result, is that these subsets are particular collections of weak trajectory variations and they are not a collection of particular weak trajectory variations.
222
Other authors have chosen a different way to select weak trajectory variations with the required properties. They introduce a more restrictive definition of trajectory variations, 6 , 7 , 9 ecc, the needle-like variations. We will assume that it is given a topology on the set of controls with respect to which the solution map is continuous and continuously differentiable with respect to the initial state. This hypothesis is not necessary for introducing the needle-like variations, but it let the definition simpler. For conditions which insure that this hypothesis is fulfilled see 10 . Definition 2.5. Let (x*,u*) be a reference pair on the interval [to,ti] and let T € [io^i] • A vector v is a needle-like trajectory variation of x* at r if for each e £ [0,e] there exists an admissible control map u(e) with the following properties: 1. e >-> u(e) is continuous 2. u(e) coincides with the reference control outside the interval, [a(e),b(e)\ 3. lime_^0+ He) = lime_>0+ a(e) = r 4. S(ti,to,x*{to)),u(e))=x*(ti)
+ ev + o(e).
More general needle-like variations can be denned. I have chosen this definition because it is rather simple but sufficiently general to illustrate the main properties of this type of variations. The needle-like variations at r indicates the controllable directions of the reference trajectory from X*(T) . Notice that, if the values of the control u(e) are uniformly bounded, then from 3 it follows that u(e) tends to u* in any Lv norm, therefore for a large class of control processes 10 the needle-like variations are weak trajectory variations. It is an open question if the set of needle-like variations has property A)or property B)but both the properties has been proved for some subsets of them. It is known that any collection of needle-like variations each occurring at a time different from all the others has the properties we are speaking about. This implies that also any set of the needle-like variations which are limit point of sequence of variations each occurring at a different time, have the properties A)and B ) , see 6 , 7 . We will call this type of variations, K-variations. More precisely the definition of K-variation at time r is the following: Definition 2.6. A vector v is a K-variation atr if there exists a non stationary sequence {r„} converging to r and for each r n a needle-like variation, vn at rn, such that {vn} converges to v. The K-variations depends on the behavior of the control process along an arc of trajectory. In 4 G. Stefani and myself have single out a subsets of needle-
223
like trajectory variations which are of local nature and which have property A and property B. In the next section I am going to use these variations in a simplified version which can be found in u . Let us start with the definition of good variations. Definition 2.7. A vector v is a right good-variation, (left good-variation) of the pair (x*,u*) at r if there exists positive numbers k,c,e and for each e € [0, e] a family of admissible control maps, ue (c), c € [0, c] with the following properties: 1. we(c) coincides with the reference control outside the interval [r + e ,T + (1 + a)ek] 2. for each e, c H-> ue(c) is continuous 3. S(T+(a+l)ek,
r+ek,
(S(r-ek, r-{a+l)ek, uniformly w.r.t. c.
x*(r+ek),
ue(c)) = x*(T + {a+l)ek) + cev + o(e)
x*(T-(a+l)ek),
us(c)) = x*{T~ek)+cev+o(t)
)
The good variations will be simpler named g-variations. Let G(t, T) denote the transport along the flow generated by the reference control from x*(r) to x*(t); G(t,r) is the evolution matrix of the linearization along the reference trajectory. If v is a g-variation at r, then G(ti,r)v is a needle-like variation at r. Definition 2.8. The varational cone of the reference pair (x*,u*), /C, is given by /C = co< G(ti,r)v
,
v g-variation at
T\
It n it is proved that K, has both the properties A)and B)and that, under mild hypothesis, it contains the set of K-variations. Properties A)and B)imply the following theorems: Theorem 2.9. Let x* be a trajectory relative to the control u* defined in the interval [^o^i] • If the set of g-variations of (x*,u*) at to contains a positive basis of the state space, then the trajectory is STLC. / / the variational cone coincides with all the tangent space at x*{t{), then the trajectory x* is LC in the interval [to,ti]. I want to remark that it can be proved that if the variational cone is the whole space, then the trajectory x* is LC also in the interval [£o, *i — e], for e sufficiently small. This last property imply, for a large class of control process, that if x* is time optimal in this interval, then its variational cone cannot be the whole tangent space.
224
Theorem 2.10. Let {x*,u*) be a weak solution of COP and let N be the Dubovitskij-Miljutin tangent conea 8 of the constraints set C at x* (t\). Under these hypothesis there exists Ao < 0 and a linear functional v at least one of which different from zero, such that vw>0, (XoDfoix*^))
s
+ V) y < 0
Vw<EN
(2.4)
Vy e K
(2.5)
Prom the previous theorem one can derive high order Maximum Principle. Let me give an example of this type of results. I recall that if the control process is C 1 and if the values of the controls are bounded, then the "right" topology for the set of controls is the Lp([t0,ti]) one. This implies that the Pontriagin variations are g-variations. From Theorem 2.10 it follows: Theorem 2.11. Let
x = f(t,x,u)
u € {/
be a C1 control process and let U be a bounded set. If (x*,u*) is a weak solution of COP and if Dfo(x*(ti)) ^ 0, then there exist a non trivial solution X(t) of the adjoint equation
X=
-Xfx(t,x*(t),u*(t))
such that X(ti) = AoD fo(x* (ti)) + v X(t) v < 0
Ao
and
v
as in Theorem 2.10 W, g-variationat T
and
X(t)f(t,x*(t),u*(t))
=maxX(t)f(t,x*(t),u)
a.e. t£
[t0,ti].
Notice that a strong solution of a COP is also a weak one therefore the previous theorem is also a necessary condition for the pair be a strong solution. a
t h e vector v belongs to this tangent cone if for all s sufficiently small I * ( ( I ) + £ » G C
225
3
Examples of g-variations
In this section I am going to consider a single input, C°°, affine control process (£)
x = f{x) +ug(x)
|u| < S
(3.1)
the admissible controls are the locally integrable maps satisfying the constraints ||u(i)|| < S endowed with the L\oc topology. In the following the reference pair will be (x*,0) with x* : [0,T] -> W1 , t !->• exp t / o O , i.e. the reference trajectory is the one relative to the drift term of the process and the reference control is the null one. g-variations of this pair have been constructed by means of the "neutralization of the obstructions". To recall this result I need to introduce some notations. Let LieX denote the free algebra generated by two indeterminate .Xo, X\; if Z,Y £ LieX, [Z,Y] denotes the Lie bracket of Z and Y and adkzY is defined inductively by ad°zY = Y,
ad%+1Y = [Z, adzY].
Let A € UeX; |A|j is the number of times that Xi appears in A. Definition 3.1. Sussmann 12, Bianchini-Stefani 13 The bad brackets are the brackets in Lie X which contain XQ an odd number of times and X\ an even number of times. Let B be the set of bad brackets B = {A , |Ajo is odd |A|i is even }. The set of the obstructions is the set B* = Lie (X0, B) \ {aX0 , a G R}. The name of obstructions come from the fact that if all these brackets are equal to zero at the initial point of the reference trajectory, then the trajectory is STLC and hence the final point of the reference trajectory is an interior point of the reachable set. The same result holds if the obstructions which are different from 0, are neutralized by means of a "weight". Definition 3.2. An admissible weiaht for the process (3.1) is a couple of non negative numbers, I— (lo, l\), which verify the relations 0 < lo < l\ . Definition 3.3. Let 1 = {lo,h} be an admissible weight, the \-weight of a bracket $ is given by ||*||i = 'o|*|o + *i|*|iAn element 0 £ Lie X is said l-homogeneous if it is linear combination of brackets with the same \-weight, which we name the l-weight of the element.
226
The weight of a bracket, $ , with respect to the standard weight 1 = {1,1} coincides with its length and it is denoted by | | $ | | . Let 0 6 Lie X; with 0 s we denote the element in the Lie algebra generated by the vector fields / , g obtained from 0 substituting to XQ the vector field / and to Xi the vector field g. Definition 3.4. Let 0 € Lie X; following Sussmann 12 we say that 0 is ly.-neutralized at a point y if the value at y of 0 s is linear combination of the values of brackets with less l-weight, i.e. &T,(y) — 5Z ai ^s(2/) > 11 ^ J 111 <
lieili. The result in 4 states that the directions individuate by evaluating at r those elements of the Lie algebra associated to the system with the property that each obstruction which has less or equal l-weight is ls-neutralized at r , is a g-variation at r. Theorem 3.5. Letr € [0, T] and let A € LieX ; if there exists an admissible weight 1 such that each obstruction whose l-weight is less than or equal to \\A\\\ is ly.-neutralized at X*(T) then ±As(a;*(r)) are g-variations at r . This theorem implies that to each admissible weight 1 and to each time r we can associate a subspace of g-variations at r, the subspace spanned by AY,(X*(T)), A 1-homogeneous element whose weight is not greater than a positive number, p(l, r ) . In 14 it is proved that if the neutralization of the obstructions holds on an arc of trajectory containing x* ( r ) , then the set of g-variations which can be associated to 1 and r, is a larger set since to the subspace described above one can add an other g-variation whose direction is given by obstructions; this further variation, v, is of unilateral type, i.e. —v may be not a g-variation. Notice that variations of unilateral type are very important in optimization theory since they distinguish between minimization and maximization problems. I refer to 14 for the statement of the general result because it is very technical; I limit myself to quote two results which can be deduced from it. Further investigation are necessary to derive from the general result other explicit computable g-variations. Proposition 3.6. Let 1 be an admissible weight such that each obstruction whose l-weight is less than (2kli + lo) is Is -neutralized on an arc of trajectory containing x* ( r ) , then
(adlkf)(x*(t)) is a g-variation at T . Let me remark that since whatever is 1 there are no obstruction of weight
227
less than (2Zi + lo), then (ad2g)f(x*(t)) is a g-variation at each t £ [0, T]. Proposition 3.7. If there exists an admissible weight 1 such that each bad bracket of l-weight less than (2n + l)Zo + 2Zi is l-neutralized on an arc containing x* (T) , then (-ir+i[g,adf+'g](x*(T)) is a g-variation at T. The Legendre-Clebsch condition can be deduced from the previous proposition. In fact the neutralization by means of the weight 1 = (0,1) is given by brackets which contain a less number of g so that the following proposition holds. Proposition 3.8. Let Li(x*{t)) = span {Ax(x*(t)) : ||A||i = 1} and let p be an integer with the property that for each t in a non degenerate interval, J, contained in [0, T] ; [5,[a4,5]](x*(i))€Li(x*(t))
:t = l,...,p-l
for each t E J. If \g, [adpp] {x*(t)) $ Li(x*(t)), and
(3.2)
then p is an odd integer
(-l)ip+1)/2[g,[^f,g)]{x*(t)) is a g-variation at t for each t £ J. Remark 3.9. It can be proved u that the neutralization on an arc of trajectory implies that the integer p has the property that if A 6 Lie X contains two Xi and i times Xo, i < p, then A, is Is -neutralized on the interval. In Propositions 3.6 and 3.7 it is required the neutralizations of some obstructions on an arc of the reference trajectory, meanwhile in Theorem 3.5 only the neutralization in one point of the trajectory is required. I don't know if the neutralization in only one point by means of any admissible weight is sufficient for constructing unilateral g-variations, but in a forthcoming paper I am going to prove that, if the neutralization is done by means of the weight (0,1), then to the subspace of g-variations given by Theorem 3.5, one can add an unilateral g-variation. The analogous of Proposition 3.6 is Theorem 3.10. If r 6 [0,T] and each obstructions which contains (2p— 2) times g is neutralized at x* (T) by means of brackets which either contain a less number of g or contain an equal number of g but a less number of f, then
(adff){x*{T))
228
is a g-variation at r . Up to now it is not clear what is the analogous of Proposition 3.7. 4
Applications
In this section I want to show how the theory developed in the previous sections can be applied. Let me start with a L C problem. We want to know if there exists T > 0 such that the stationary trajectory x* (t) = 0 of the system ' i\ = u X2=X! X3 = X4
^ X4 = -X3 + x\ \u\ < 1, is locally controllable in the interval [0, T]. Let us calculate the variational cone of the pair (x*, 0) in the interval [0, T]. Since the trajectory is a stationary one any neutralization holds on the whole trajectory and the only obstructions which have to be neutralized are the bad brackets, 13 . The only bad brachets non vanishing at 0 contain two g and three / ; they cannot be neutralized by any weight, g = d/dxi and [f>d] = —d/dx2 have less 1-weight of these bracketts whatever is the weight 1, therefore the subspace span{ei, e 2 } is a subspace of g-variations at each time; [/,g], [[f,g],f]\x*(t) ^ 0 implies that the number p of Proposition 3.7 is equal to 3; therefore v = 9J [/>[/>[/>d]]] \x*(t) = 2d/8x4 is an unilateral g-variation at each r. It can be proved that the trajectory cannot be STLC; however if T > TT then the conic convex hull of the set
U
GT T
(>>
T€[0,T]
is the subspace spanned by {e 3 , e^] . Therefore if T > ix, then the variational cone is equal to M4 . We can conclude that the trajectory is locally controllable in any interval larger than IT. Let me notice that we cannot a priori exclude that the trajectory is locally controllable in a shorter interval of time, but it is really probably that it is not.
229
Let us examine now an application to an optimization problem with constraints on the end point. Consider the following C O P : minimize the functional fo(xi,X2,x3,X4,x5,X6) = -x5 + x\ over the set of points reachable in time 1 by means of the trajectories of the control process ±1 = 1 X2 = U X3 = X4
xh XQ
X2
Xz Xo
Xn
X^ -p X3
—
starting at 0 and satisfying the constraint x5+x6
(4.1)
= 0.
Let us calculate the variational cone of the pair (x*(t) = 0, u*(t) = 0) in the interval [0,1]. The obstructions which do not vanishing along the trajectory contain two g and at least three / , therefore span < e%, e3, a \ , the subspace spanned by brackets which contains only one g and at most 2 / , is a subspace of g-variations at any t € [0,1]. The obstruction adj. -,f(x*(t)) is proportional to the bracket adzgf{x*{t)); therefore each obstructios which contain two g and at most four / is 1-neutralized along the trajectory by means of the weight (1,1), 14 . By Proposition 3.7, ±ad3gf{x*(t)) = ±6d/dx5, -[g,[ad5fg\] = —2d/dxe are variations at any t. The linearization of the process along the reference pair is given by: 2/1=0 2/2 = 0 2/3 = 2 / 2 2/4 = 2 / 3
2/5 = 0 2/6=0 therefore the variational cone of the reference pair is the half-space: { x$ < 0} . If the reference pair is a weak solution of C O P , then by Proposition 3.7 there exist y € C1- and A < 0 such that (AV/ 0 (a;*(l)) + 2/, v) < 0 T
(4.2)
v G /C. T
Since Vf0(x*(h)) = (0,0,0,0, - 1 , 0 ) and y = ( 0 , 0 , 0 , 0 , a , a ) , for (4.2) to hold there exists a number a > 0 and a number A < 0 such that the vector (0, 0, 0, 0, - A + a, a)T
230
is orthogonal to span {e 2 , e%, e\, e 5 }. Since this is impossible, the reference pair is not a weak solution of the problem. 5
Open problems
The main open problem of this theory is the following: is it necessary to limit ourselves to consider only very particular tangent vectors to the reachable set for the variational approach either of COP or of LC problem? Taking in account what is known, I suggest to investigate to these specific points: 1. The weak trajectory variations has not the property A)and hence cannot have the property B)). Does any its convex subset has property B)? Is it true that if the set of weak trajectory variations is the whole tangent space, then the trajectory is LC? 2. Is the set of needle-like variations at a given time a convex set? 3. Have any convex subset of needle-like variations property A) and/or property B)? References 1. Bianchini, R.M. , Variational Approach to some Optimization Control Problems, in Geometry in Nonlinear Control and Differential Inclusions, Banach Center Publication, Warszawa, 32, 83- 94 (1995). 2. Mirica, S., New proof and some generalizations of the minimum principle in optimal control J. Optim.Theory Appl., 74, 487-508 (1992). 3. Mirica, S., Intersection Properties of Tangent Cones and Generalized Multipler rules, Optimization, 46, 135-163 (1999). 4. Bianchini, R.M., and Stefani, G., Controllability along a Reference Trajectory: a Variational Approach, SIAM Journal on Control and Optimization, 3 1 , 900-927 (1993). 5. Bressan, A., A high order test for optimality of bang-bang controls, SIAM Journal on Control and Optimization, 23, 38-48 (1985). 6. Krener, A. , The High Order Maximal Principle and its Applications to Singular Extremal, SIAM Journal on Control and Optimization, 15, 256-292 (1977). 7. Knobloch, H.W. , Higher Order Necessary Conditions in Optimal Control Theory, Lecture Notes in Control and Informations Sciences, 34, (Springer-Verlag, Berlin, 1981).
231
8. Aubin, J.P. , Frankowska, H., Set-valued Analysis (Birkhauser, Boston, Base, Berlin, 1990). 9. Agrachev, A.A. , Newton Diagrams and Tangent Cones to Attainable Sets, in Colloque international sur I'analyse des systemes dynamiques controles, 3-6 Juillet, Lyon, France, 1-12 (1990). 10. Bianchini, R.M. , and Margheri, A., First-Order Differentiability of the flow of a System with LP Controls, J. Optim. Theory Appi, 89, 293-309 (1996). 11. Bianchini, R.M. , Good needle-like variations, in Proceedings of Symposia in Pure Mathematics, 64, 91-101 (1999). 12. Sussmann, H. , Lie Brackets and Local Controllability: a Sufficient Condition for Single Input system, SIAM J. Control Optim., 2 1 , 686-713 (1983). 13. Bianchini, R.M. , and Stefani, G. , Graded Approximations and Controllability Along a Trajectory, SIAM Journal on Control and Optimization, 28, 903-924 (1990). 14. Bianchini, R.M., High Order Optimality Conditions, to appear in Rendiconti del seminario matematico di Torino.
This page is intentionally left blank
233
GEOMETRIC OPTIMAL CONTROL OF T H E A T M O S P H E R I C A R C FOR A SPACE SHUTTLE B. BONNARD,E. BUSVELLE, AND G. LAUNAY Laboratoire d'Analyse Appliquee et d'Optimisation Universite de Bourgogne, UFR des Sciences, 21000 Dijon, France E-mail: [email protected], [email protected], We give preliminary remarks concerning the optimal control of the atmospheric arc for a space shuttle (earth re-entry or Mars sample return project). The system governing the trajectories is 6-dimensional, the control is the bank angle, the costintegrand is the thermal flux and we have state constraints on the thermal flux and the normal acceleration. Our study is geometric and founded on the analysis of the solutions of a minimum principle and direct evaluation of the small-time reachable set for the problem taking into account the state constraints.
1
Introduction
The objective of this article is to make a preliminary analysis of the optimal control of the atmospheric arc for a space shuttle where the cost is the total thermal flux. The control is the bank angle (the angle of attack being hold fixed) and we have state constraints on the thermal flux and the normal acceleration. A pure numerical approach to the problem is presented in 2 where the analysis is also simplified because the terminal condition is relaxed to a condition on the modulus of the speed. Our aim is to analyze the problem with fixed end-point conditions which leads to a complex control law due to the number of switchings (or the number of rotations) we need to match the boundary conditions. This article is only a first step in the analysis in order to introduce the geometric tools to handle the problem and the necessary optimality conditions. In particular we shall restrict our computations to a 3 dimensional subsystem where the state variables are the modulus of the velocity, the altitude and the flight path angle. Also we shall localize the analysis to a small neighborhood of any point in the flight domain. This will allows to give local bounds to the number of switchings. It must be completed by numerical simulations to get a global bound. Our approach is geometric and use necessary optimality conditions and direct evaluation of the small time reachable set in the spirit of n but using normal forms as in 2 where the constraints are taken into account. It is well illustrated by the following planar example. Consider the time optimal
234
control problem for the system q = X (q) + uY (q), q = (x,y), \u\ < 1. Let 7+ (resp. 7_) be an arc corresponding to u — + 1 (resp. u = — 1) and denote 7172 an arc 71 followed by 72. Take a generic point q0, then the small time reachable set starting from q0 is a cone bounded by arcs 7+ and 7_ and each optimal trajectory is of the form 7 + 7 - or 7_7+, see figure 1; moreover along a trajectory the time can be measured using Miele's form: UJ = pdq where p is given by (p,X) = 1, (p,Y) = 0.
Figure 1. Reachable set with and without constraints
Assume qo = 0 and the trajectories constrained to the domain C : y > 0. Let 76 (t), t € [0,T] be a boundary arc starting from q0 = 0 and contained in the frontier y — 0; assume that the corresponding control Ub is admissible and not saturating. Let B — 7;, (£), T > 0 small enough. Consider the arcs 7+7_ and 7-7+ joining 0 to B , one is time minimal (and the other is time maximal) for the problem without state constraint, and we have two possibilities for the constrained problem, see figure l,(b). Assume it is 7 + 7 - , then if it is contained in y > 0, it is admissible and the boundary arc is not optimal, the optimal synthesis near qo for the constrained system being 7 + 7 - . If 7 + 7 - is not contained in y > 0 the boundary arc is time optimal and the optimal synthesis is 7 + 7(,7_. The analysis can be carried out in full details using the model x = 1 + ay, y = c + u and not the Miele's form w defined only for planar systems.
235
A major problem when analyzing optimal control problems with state constraints is to derive necessary optimality conditions. Indeed the constraints can be penalized in the cost in several manners and this leads to introduce the concept of order of the constraints. Also it is the basic concept to construct normal forms and evaluate the reachable sets for the system with the constraints. We shall formulate a minimum principle due to 8 ' 1 2 , adapted to analyze the optimal trajectories for the space shuttle. It concerns single input control systems and we need regularity assumptions. It is much more precise than the general minimum principle of 12 , where an optimal trajectory is the projection of a trajectory in cotangent bundle depending of a measure supported by the constraints. 2
The model
The problem is to control the atmospheric arc nearby a planet which can be the Earth (re-entry problem) or Mars (sample return project). In both cases the equations are the same, except for constants related to the planet (radius, mass, angular velocity, atmosphere). In our computations we shall assume that the planet is the Earth. In order to modelize the problem, we use the laws of classical mechanics, a model of the gravitational force, a model of atmosphere and a model of the aerodynamic force which decomposes into a drag force and a lift force. The equations are simplified by choices of orthonormal moving frames that we explain below. 2.1
Moving frames
We denote by E = (ej,e2,e3) a standard Galilean frame whose origin O is the center of the Earth and let i?i = (/, J, K) be a rotating frame centered at 0 where K is the axis N-S of rotation of the Earth, the angular velocity being f2 and I is chosen to intersect Greenwich meridian. Let R be the Earth radius and let G be the center of mass of the shuttle. We denote by R[ = (e r ,e(,e£,) the frame associated to spherical coordinates of G = (r,l,L), r > R being the distance OG and /, L being respectively the longitude and latitude. We introduce the following moving frame R2 = {i,j, k) whose center is G. Let C : t -> (x (t), y (t), z (t)) be the trajectory of G measured in the frame Ri and let ~& be the relative speed v = xl + y J + zK. To define i , we set ~& = \v\ i . The vector j is a vector in the plane (i, er), j is perpendicular to i and oriented by j.er > 0. We take k = i A j . The vector i is parametrized
236
(a)
(b)
Figure 2. Moving frames: flight path angle and azimuth
in the frame R[ = (er,ei,ei)
by two angles:
• 7: flight path angle • E: azimuth defined on figure 2. 2.2
Model of the forces
For the atmospheric arc we assume the following Assumption 1 There is no thrust: the shuttle is a glider. Assumption 2 The speed of the atmosphere is the speed of the Earth, i.e. the relative speed of the shuttle with respect to the atmosphere is the speed it. We must consider two types of forces acting on the shuttle. • Gravitational force. We assume that the Earth is spherical so that the gravitational force is oriented along e r .It is written in the moving frame R2 P = — mg (i sin 7 + j cos 7) where g = % .
237
• Aerodynamic force. The effect of the atmosphere on the shuttle is on aerodynamic force which decomposes into
— A drag force colinear to the speed ~& and of the form
2* = - (^pSCDvA i
— A lift force perpendicular to ~& and given by
~P T = -pSCiv2
(j cos p + k sin p)
and p is called the bank angle, where p — p{r) is the atmospheric density, S is a constant and CD, CL are respectively the drag and lift coefficient.
Assumption 3 Both coefficients Co and CL are depending upon the angle of attack a which parametrized the orientation of the speed v with respect to the normal of an element of area of the shuttle. We assume that for the atmospheric arc the angle of attack is kept constant. This is very restrictive but it is worth to point out that in the numerical simulations of 2 where a is a control, in the optimal solution it is a constant. Hence the only control is the angle of bank p.
2.3
System equations
The atmospheric arc is governed by the following system
238
dv — =t/sin(7) d v
•
(2.1a) !
I \
$ CD
o
— = —g sin (7) — -p .£
C*E
n.2
r , •
T
•
T
v + s i r cos L (sin 7 cos L - cos 7 sin L cos s ) TIL
(2.1b) — = cos (7) ( at
\ 2
+ fl dL
v
1- - ) + -zP v
r>
2
wcos(u) + 2ficosLsinE;
(2.1c)
m
(cos 7 cos L + sin 7 sin L cos E) „
(2-ld) ,
-7- = - cos 7 cos = (2.1e) (2.1f) at r dl v cos 7 sin H d=. 1 5(7^ u v , . _ . , x — — = -/> 1—cos 7 tan L sin ^ (2.1g) dt r cossmu L dt 2 m cos 7 r „~ / - ,—s ^9 r sin i cos L sin 5 ,„ , . + 2H (sin L — tan 7 cos L cos ,=,) + f r (2.1h) v cos 7 where the control is the bank angle fi and the state space is q = (r, u, 7, L, I, E) 2.4
Atmospheric model
Atmospheric density is tabulated for Earth, Mars and Venus and we take an exponential model P = Poe~0r 3 3.1
The control problem Control and control bounds
The control can be either /x or /t. In the first case we can have the following bounds: /J, £ [— | , | ] or ft 6 [—7r,7r]. We set Ui = cos/x and «i is a direct control on the flight path angle 7. We let «2 = sin^j and U2 control the azimuth, the sign of u?, allows the glider to turn left or right. 3.2
State constraints.
There are several state constraints but in the first step of our analysis we shall consider two constraints:
239
• Constraint on the thermal flux
< y>max
(3.1)
where Cq is a given constant • Constraint on the normal acceleration 7n=7n0(«)^2<7rx 3.3
(3-2)
Optimal cost
Several choices are allowed and we make the analysis for T
J ( / i ) = f Cq^v3dt (3.3) Jo which represents the total thermal flux, the duration T of the atmospheric arc being not fixed. We introduce the differential equation ^
= C , ^
s
(3-4)
with go (0) = 0. 3.4
Boundary conditions
The transfer time T is free and we have two choices for the boundary conditions: • Fixed boundary conditions at t — 0 and t = T for q —
(r,v,j,L,l,E).
• 7 £ [7min> 7max] at t = 0 with the constraint that a preliminary maneuver on the Keplerian arc allows this possibility. 3.5
State domain for the atmospheric arc
The flight domain D for the Earth re-entry of the shuttle is the following: • Altitude: h = r - R G [40 km, 120 km] • Velocity amplitude v 6 [2000 m / s , 8000 m / s ] • The flight path angle domain i s 0 > 7 > — 1 5 ° .
240
Assumption 4 (controllability assumption) The Earth angular velocity ft is small and hence
|-«(7)(-|
+
J) + i^.«-M
(3.5)
We shall denote by Dc the subset of D where the lift force can at each point compensated the gravitational force that is 1 SCL g -p v > 2 m v and (3.5) is feedback linearizable in the domain. 4
The minimal principle without state constraints — Extremal curves
4-1
Problem statement and notations
Let the single-input control system q = F(q,u)
(4.1)
and a cost to be minimized of the form J (u) = / if (q) dt (4.2) Jo where the transfer time T is free and tp is not depending upon u. The set of admissible controls is the set U of measurable mappings u : [0, T] —> U. The state domain is a subset of W1 with the state constraints: • Constraint on the thermal flux ci (q) =
(4.3)
• Constraint on the normal acceleration C2 (q) = In (?) < oa
(4.4)
The boundary conditions are of the form: • q (0) =
then 7(0) G [71,72], 7i < 72 < 0.
We denote by R (q0, t) the reachable set at time t > 0 fixed and R (q0) = U small enough R (*>, *) the small time reachable set.
241
4-2
Minimum principle
We recall the minimum principle 13 which allows to parametrize the boundaries of the reachable sets n . We introduce the Hamiltonian H (q, p, u) = {p,F(q, u)) + pQtp {q) where q = (r,v,j,L,l,E) and p = (pr,Pv,P-y,PL,PhPs) is the adjoint vector and po is a constant such that (p,po) j1 0. Definition 4.1 If po ^ 0 we are in the normal case and ifpo =0 we are in the abnormal case. Definition 4.2 We call extremal a triplet (q,p,u) solution of the minimum principle q
= F(q,u)
dF _ dip - ^q-- °-dq= min H (q,p,w)
P= p
H{q,p,u)
= —
p
(4.5) =
8H -dq-
^ (4.7)
wEU
Proposition 4.3 An optimal solution for the problem without state constraint is a projection on the state space of an extremal solution. Moreover po > 0. Since the transfer time T is free it is exceptional, that is H = 0. If moreover 7 is free at t = 0, the adjoint vector p satisfy the transversality condition p7(0)=0i/7(0)€]7l,72[ 4-3
(4.8)
Definition of subsystem (I)
Observe that fi is small with respect to the velocity of the shuttle. Hence if we neglect the transport terms O (fi2) and the Coriolis terms O (Q) our system can be decomposed with q\ = (r, ^,7) and qi — (L, i,H) into Qi = Fi
(9i>"i)
where u\ — cos/z, w2 = sin^, u = (1(1,^2) and u
= \u\ + ul - 1 a n d ui > 0 if /a G ~, ^1 j
242
The adjoint system (4.6) is the decomposed into
I f 0 0\ {PlP2 0)=-{PlP2Po)
| |§f fff 0
fe 0 0 If we relax the end-point condition on q2 = (£,'>£) we obtain using the transversality condition p2 (T) = 0 and hence p2 (t) = 0. The analysis of extremals reduces to the analysis of the solutions of 9i =
Fi(q1,u1) dFi _ dip
Pi = -Pi-E oqi
Po^— dqi
It is associated to the optimal control of system (I) given below: dr (I)
= v sin (7)
dv . . . 1 SCD 1 — =-gsm{rt)--p——v at 1 m d-y . . / g v\ 1 SCL ,N — = c o s ( 7 ) ( - - + - ) + - p — — «cos(/i)
Note that q\ = (r, ^,7) appears only in the state-constraints. We shall concentrate in a first step our analysis on the subsystem (I). It is related to the numerical simulation of 2 . 4-4
Analysis of extremals of system (I)
4-4-1- Problem reduction and definitions Consider a single input affine system q = X + uY,
\u\<\
(4.9)
and a cost to be minimized of the form J ( « ) = /
(4.10)
Assume moreover that tp(q) > 0 in the state domain. Introduce the equation / % =¥> (q) X 9o (0) = 0
243
and q = (q,qo) is the enlarged state. Hence (4.9), (4.10) can be written q = X(q) + uY(q),\u\
(4.11)
and let s be the new time parameter defined by ds =
(4.12)
and if q' denote the derivative of q with respect to s, (4.9) can be written q' = X(q) + uY(q),\u\
(4.13)
where X = ipX, Y = tpY and ip = -. The optimal control problem becomes a time minimum control problem. Definition 4.4 Consider q = X + uY. A singular trajectory of the system (X, Y) is a projection of the following equations dH q = dp dH i> = dq uq (p,Y)=0
(4.14)
where H = (p,X + uY), p ^ 0. It is called exceptional if H = 0 and, admissible if \u\ < 1 and strictly admissible if u £ ] —1, +1[. Notation 1 IfXi and X2 are two smooth vector fields, we denote by [Xi,X2] the Lie bracket computed with the convention [X, ,X2](q) = ^
(q) X, (q) - ^
(q) X2 (q)
Assumption 5 Throughout this article we shall assume that g = ^ is constant. Proposition 4.5 In the domain cos 7 7^ 0 there is no exceptional singular arc for the system (X, Y). Proof. The singular extremals are located on (p, Y (q)) = 0. Differentiating twice with respect to t one gets (p,[X,Y](q)) = 0 (p, [X, [X, Y}} (q)) + u{p, [Y, [X, Y]] ()> = 0
244
We must compute the Lie brackets Y, [X,Y], [Y,[X,Y]] and [X,[X,F]] where . d , . , o\ d / g v\ d X = vsin7(gsin7 + kpv ) — + cos 7 h - TTv or ov \ v r) 07
and k , k are defined by the equations (2.1b) and (2.1c). Since the concept of singular arc is feedback invariant we can replace in our computations X and Fby v . d X = vsin7or
,
.
v a sin
, 2, d 7 + kpv ) — ' ov
We have then [X,Y\ = - t ; c o s 7 — +g cos 7 — [r, [X, y]] = v sin 7 — - 5 sin 7 — hence [X, Y] and [y, [X, Y]] are colinear. Moreover [X, [X, Y]] = kpv2 cos 7-5- + (-kv3p' or
cos7 + 2fcpt/t; cos7) — ov
The singular extremals are located on £': {p, Y) = {p, [X, Y]) = 0 that is P-r = Pv5 - prv = 0. We introduce D = D' = D" =
det(Y,[X,Y],[Y,[X,Y}}) det(Y,[X,Y),[X,[X,Y}]) det{Y,[X,Y],X)
From our previous computations singular arcs are located on D = D' = 0 and moreover if they are exceptional they satisfy D" = 0. We have £> = 0 D' = kv2 cos2 j (p v2 — 3pg) D" = kpv3 cos 7 Since C0S7 7^ 0 the proposition is proved. Moreover we have for system (4.9)
D
245
Lemma 4.6 If cos^f ^ 0 then 1. Y and \X, Y] are independent; 2.
[Y,[X,Y}]eSpan{Y,[X,Y]}
4-4-%- Analysis of extremals Consider the time minimum control problem for system (4.13): q'=X(q)+uY(q),
|u|
We introduce the following definitions Definition 4.7 The set S : {p,Y (q)) = 0 is called the switching surface. Let (q,p,u) be an extremal defined on [0, T]; it is called singular if it is contained in S, bang if u — + 1 or u = — 1 and bang-bang if u(t) is piecewise constant and given a.e. by u(t) = — sign (p (t), Y (q (t))). We denote respectively by 7+ (resp. 7_, 7 S ) a smooth arc associated to u = + 1 (resp. u = —1, u singular control) and 7 + 7 - represents an arc j + followed by an arc 7_. Let us calculate Lie brackets. We have , ( • d , . , 9, d X = i> ^ s i n 7 ^ : - (<7Sin7 + kpv)fo+
/ g v\ d \ c o s 7 (-- + -) g^J
-*F
Y = ipkpv— 07 where tp = ip~l. Since X = ipX, Y = ipY using for fi, J2 smooth functions the formula [hX, f2Y] = fxf2 [X, Y] + h (Xf2) Y-h
(Yh)
X
where Zf = | £ z (q) is the Lie derivative we get \X,Y]
=*l>2[X,Y] +
i>(Xilj)Y-iP(YiP)X
Since Y = kpv-^- and ip = f(p,v) we have Yip — 0. Hence \X, Y] — ip2 [X, Y]+I/J (Xi/)) Y. Computing [F, [X,F]] as before we obtain Lemma 4.8 1. The set £ ' : (p,Y)
= f/f3\Y,[X,Y]] €Span{r,[X,y]}.
= (p, \X,Y]) mod
= 0 is given by (p,Y) =
Span{F, [X,Y]}
and
hence
246 Moreover,
D = det(F,[X,F],[F,[x,F]])=o and P" = det(F,[X,F],X)
=
^
I
and hence Y, [ X , F ] , X are a frame in the flight domain where cos 7 7^ 0. So there exists a, b, c such that \X, \X,Y]]
= a~X + bY + c \X,Y]
(4.15)
Long computations give us the crucial result Lemma 4.9 If cos 7 ^ 0 we have 1. D' = det ( F , [ X , F ] , [X, [ X , F ] ] ) = - | ^ - c o s
2
7 *0
Corollary 4.10 If cos 7 7^ 0 t/sere exists no singular trajectory. 4-4-3. Application to the classification of extremals near E Let (q,p,u) be a smooth extremal on [0,T]. Differentiating the switching function $ : t -> (p (t), Y (g (£))) we have
*(t) = (p(t),[x,F] (?(*))> $(t) = (p(t), {X, \X,Y\] (q(t))+u(t) [F, [X,F]] (g(t))> We use the results of 10 to classify the extremals near a point ZQ = (qo,Po)1. Ordinary points. If z0 belong to (p, Y) = 0, (p, [ X , F ] ) 7^ 0, the point 00 is called of order 1 or ordinary and each extremal curve is locally of the form 7 + 7 - or 7-7+. 2. Points of order 2. Let z0 E £': (p,Y) = (p,\X,Y])= 0. Then if (q,p,u) is a smooth extremal through ZQ the switching function satisfies at ZQ: *(*) = $ (t) = 0
247
and * ( t ) = (p(t), =
\X, \X,Y]]
(q(t))+u(t)
[F, \X,Y\]
(«(*)))
(p(t),\X,{X,Y]](q(t)))
from lemma 4.8 which is non zero from lemma 4.9. Hence both curves corresponding to u — + 1 and u = — 1 have a contact of order 2 with respect to £ and the extremal solutions are represented on figure 3. Ac-
Figure 3. extremal solutions (a > 0)
cording to the classification of 10 the point ZQ is a parabolic point and each extremal is locally bang-bang and of the form 7 + 7 _ 7 + or 7 _ 7 + 7 _ . Prom this analysis and from the minimum principle we can conclude about small time optimal policy. Theorem 4.11 If cos 7 ^ 0 each small time optimal policy is of the form 7_7+7_ where 7+ is an arc corresponding to u = cos/z = + 1 and 7_ an arc corresponding to u = — 1 (or u — 0 if n £ [— | , | ] ) . Proof. __ According to the time minimum principle, an optimal arc has to satisfy H — 0, p0 > 0 where H — {p,X (q) + uY (q)) + p0 = 0. Hence p can be oriented at z0 € £ ' according to (p,X (q)) < 0. Write [X, \X,Y]]
=alC + bY + c
\X,Y]
248
and a < 0 from lemma 4.9. Hence from figure 3, only an extremal 7_7 + 7_ can be optimal. The assertion is proved. • 4-5
Geometry of the small time reachable set
Consider again system in 3-dimension
^=X(q)+uY(q) and its time extension in 4-dimension by adding the cost ^ j - = 1. We denote respectively by R(qo) the small time reachable set and by R(qo,0) the small time reachable set for the extended system. One major research program undertake in 1 4 ' n using original ideas from Lobry is to evaluate in small dimensions the small time reachable set and its boundary. In particular the following result is basic: Lemma 4.12 Consider system (X,Y) in dimension 3 and let gi = X + Y and g2 = X — Y. Assume g\, g2 and [#1,(72] linearly independent at qo then R (qo) is bounded by the two surfaces 7 + 7 - (go) and 7-7+ (go) and moreover
R(qo) = U 7+7-7+ (go) (or U 7-7+7- {qo)) Actually in theorem 4.11 we proved more (see also 1 4 ) : Lemma 4.13 If cos 7 ^ 0 the boundary of the small time reachable set for the extended system R (qo, 0) is an union ofj-j+ 7_ (go, 0) and 7 + 7 _ 7 + (g 0 ,0) where 7 denotes the time extended trajectory. 4-6
Optimal control of the atmospheric arc
If we consider the complete set of equations it can be written as a time optimal control problem for a 6-dimension system of the form
q'=X(q)+u1Y1(q)+u2Y2(q) where 111 = cosju and u2 = sin/i. We can use two points of view related to the control device: • We set jj, = w where w is taken as a control bounded by M. The system is then a single input affine control system on the 7-dimensional state space (g,£t). • We can consider the original system on the 6-dimensional state space. The control u = (MI,W 2 ) satisfies u\ + u\ = l. If fi € [0,27r], the optimal control problem is equivalent to a sub-Riemannian problem with drift.
249 Indeed if we set ipi = (p, Yi{q)), given by u= ^ ( ^ 1 , ^ 2 ) -
i = 1,2 an extremal normal control is
We must analyze the existence of abnormal extremals and the number of oscillations and switchings of optimal trajectories. 4-7
Conclusion about this section
Using minimum principle, Lie brackets and geometric methods we have evaluated the small time reachable set and solved the small time optimal control problem for system I. In particular we have obtained bounds on the number of switchings. The main property of system I is that [Y, [ X , F ] ] belong to Span {Y, [X, Y]}. This is connected to the feedback linearizability if system I. For the global aspect one needs to analyze the global proportion of the switchings function $ using convexity analysis and Rolle theorem. Our study is a preliminary step in order to evaluate the reachable set for the full system of equation without the state constraints and nearby the state constraints. We shall analyze the structure of the reachable set for system nearby the constraints in the next section. 5
Optimal control with state constraints
In this section we analyze the optimal control problem for system I, taking into account the constraints. We recall a minimum principle from 12 adapted to our situation. Our contribution is to make a direct evaluation of the small time reachable set for the constrained system using the previous computations of section 4 and a normal form. When dealing with constrained systems the main concept is the concept of order of the constraint that we define next before to state the minimum principle adapted to our analysis. 5.1
A minimum
principle
We consider the single input affine control system q = f{q)+ug
{q)
\u\ < 1
and a cost to be minimized of the form
J(u)=G{x{T)) where the transfer time T is fixed and q is constrained to c(q)<0
250
The boundary conditions are g(0) = go $ [x (T)) = 0 The problem is denoted by (Vo) and can be imbedded into the one parameter family of problems (Va) where the constraints set is taken as c
(Q) < a, a small The important concept is the concept of order of the constraint. Definition 5.1 The absolute (or generic) order of the constraint is the first integer n + 1 such that g{foc)=g{flc)=---=g(fn-1c)=0
fl(/nc)#0 where the vector Definition 5.2 tained in c — 0. by differentiating
fields f, g acts on c by Lie derivative. A boundary arc t >-> jb (t) is a solution of the system con/ / the constraint is of order n it can be generically computed n times the constraint and solving the linear equation c<"> = fnc + ug {r~lc)
= 0
(5.1)
A boundary arc is contained in c = c=-=
c(n_1) = 0
(5.2)
-1
and the constraints c = • • • = c'™ ' = 0 are called the secondary constraints. We denote by Ub the feedback control rSL f ^ which allows to remain in the constrains set. Let's now formulate the Maurer minimum principle 12 : Assumption 6 (General assumption) We assume that the following conditions hold on a boundary arc s H-> 71, (s), s € [0, t}: (He) g (/ r a _ 1 c) |7i,7^ 0 (n being the order) (H7) |w&(£)| < 1 i.e. the boundary feedback control is admissible and not saturating. Necessary conditions. Define the Hamiltonian by H(q,u,p,n)
= (p,f + ug)+nc
(5.3)
where 77 is a Lagrange multiplier of the constraint set. The necessary conditions of the minimum principle are the following:
251
Condition 1 1. There exists r}(t) > 0, a real number r]o > 0 and 5 such that the adjoint (row) vector satisfies
p(T)=Vo^(q(T))+6~(x(T))
(5.5)
2. The function r) (t) satisfies n (t) c (q (t)) = 0 Vi € [0, T] and is continuous on the interior of the boundary arc. 3. The jump condition at a contact point or a junction time t\ is Br P (tf) = P (*r) - "i Q- (9 (*i)), "i > 0 4- The optimal control u(t) minimizes the Hamiltonian, H (q(t) ,u(t) ,p(t) ,ij(«)) = min H (q(t) ,u,p(t)
(5.6)
i.e. ,n(t))
(5.7)
|u|
Remark 5.3 In this minimum principle, only the constraint c is penalized in H; others choices are possible using the secondary constraints, see 8 ' 1 3 . Remark 5.4 There exist a general minimum principle without assumption (He), see for instance 9 where the adjoint equation (5-4) takes the form
P(«) = - / P ( » ) ( ^ + « | ) * - / | ^ where \ii is a measure supported on the set c = 0. Our principle is much more precise because from (5.4) the measure is of the form dfii = T) (t) dt where n is C°. This additional regularity comes from assumption (HQ) and at non generic point where g (fn~1c) vanishes n can blow up. The case where T is not fixed can be deduced from the case where T is fixed. We introduce a new variable z — T and the system dt _ ds
fs=(f(q)+ug(q))z
252
We have s = £ and the trajectories are parametrized by s € [0,1]. The new transfer time is 1. An important research program is to analyze the solutions of the minimum principle with constraints. This analysis is outlined in 12 . An interesting point of view is to analyze the open loop solution deduced from the problem without constraints by analyzing the bifurcation of an unconstrained optimal solution when the constraint c (g) < a becomes active. Next we adopt a different approach based on the evaluation of the small time reachable set near the constraints. It will provide necessary and sufficient optimality conditions. 5.2
A direct approach
5.2.1. Order of the constraint Consider in the shuttle problem the constraint on thermal flux ci = Cgy/pv3
poe~0r
= 0 is a secondary constraint. Moreover
(r,v)cos7 where
Similarly for the normal acceleration c2 = 7n 0 p^ 2 we get ci = -7n 0 (2fy>V + {Ppvz + 2gpv) sin 7) i.e. c2 = iphir,v) + tp6 (r, v) sin 7 = 0,
253
constraint is of order 2, hence Yc = 0 near 0 and Y is tangent to all the surfaces c = a. Hence | | = 0. • Normalization 2. Since c is not depending upon z using a diffeomorphism preserving Y ~ ^ we can normalize c t o c ( x , j / ) = x. The system can be written x = Xi (q) V = X2 (q) z X3(q)+u and c = x. The secondary constraint is x = 0 and we assume that x = x = 0 is an arc a passing through qo = 0- If we keep the affine approximation sufficient for our analysis we obtain a system which can be written x = aix + a2y + a3z y = b0 + hx + b2y + b3z z = co + c\x + c2y + c3z + u where a is approximated by the straight line x = a2y + a3z. If bo ^ 0 (generic case) we can assume bo = 1. • Normalization 3. Changing z into — z and u into — u if necessary and using a transformation of the form Z = ay + z one can identify a to x = z — 0 and the system can be written x = aix + a3z y=l + b1x + b2y + b3z z = Co + C\X + c2y + c3z + u
(5.8)
where a 3 > 0. If moreover the boundary arc is admissible and not saturating (assumption (H7)) we have the condition |co| < 1. Theorem 5.6 Consider the problem of time minimization in q = X (q) + U Y (?)j ? € M3 subject to c (q) < 0. Let q0 £ {c = 0} and assume the following:
1. Nearqo, [F, \X,Y\] € Span{F, \X,Y]} 2. X,Y,
\3(,Y]
are linearly independent at qo and
\X, \X,Y]] with a < 0
(q0) = aX(q0) + bY (q0) + c{X,Y]
(q0)
254
3. The constraint c = 0 is of order 2 and assumption fH6/) and (H7) are satisfied at qo then the boundary arc through qo is small-time optimal if and only if 7_ (qo) is contained in the domain c > 0. Proof. From lemma 4.12, we know that each small time reachable point from qo can be reached by an arc 7+7-7+ and 7 - 7 + 7 - and from theorem 4.11 we know that the small time optimal arc is of the form 7 - 7 + 7 - for the unconstrained system. Let the constrained system written as (5.8) in the normal coordinates where q0 = 0 and the boundary arc •jb (t) is identified to (0, t, 0). Let B = 76 (t), t > 0 small enough. Let u = + 1 or u = —1. For a trajectory with q (0) = 0 we have the following approximations: z (t) = (co + u) t + o (t) t2 x (t) = a3 (co + u) — + o (t) Hence the projections of the arcs 7+7-7+ and 7 - 7 + 7 - joining 0 to B in the plane (x,z) are loops denoted 7+7-7+ and 7 - 7 + 7 - and are represented on figure 4.
Figure 4. Projection of the arcs 7+7-7+ and 7 - 7 + 7 -
In particular, we proved the following.
255
Lemma 5.7 The loops 7 - 7 + 7 - (resp. 7+7-7+,) are contained in the domain x < 0 (resp. x > 0). We can now end the proof of the theorem (the assertions concern system
(X,Y)). If the arc 7 - 7 + 7 - to join 0 to B is contained in the domain c < 0 it is time minimal and the boundary arc is not optimal. If the arc 7 - 7 + 7 - is contained in c > 0 then we can join 0 to B by an arc 7+7-7+ in c < 0. But the analysis of section 4 replacing min t by max t shows that such an arc is time maximal. Hence a bang-bang arc 7+7-7+ in the domain c < 0 joining 0 to B cannot be optimal. Then the boundary arc 76 is optimal. • Moreover Corollary 5.8 / / a boundary arc % is small time optimal then there exist optimal trajectories of the form 7-7+767+7-. 5.2.3. Application to the shuttle For the shuttle we have a < 0, so we have to consider loops 7 - 7 + 7 - where 7_ corresponds to cos fi = 0 or cos fj, = — 1. From the computations of section 5.2 we have for both constraints c\, c-i: '6i = $ (r, v, 7) + u cos 7 $ (r, v) where $ < 0. Hence c, is minimal when /i = 0 ° i.e. u — + 1 . Assume that the parameters of the problem are such that assumption (H7) is satisfied. Then the arcs 7_ through the boundary points are contained in the non admissible domain and the boundary arc is optimal. 6
Conclusion
We have outlined the geometric research program to analyze the optimal control of the atmospheric arc for the space shuttle. Our tools are necessary optimality conditions and evaluation of the small time reachable set. Near the constraints the evaluation is related to the classification of pairs of vector fields near a surface. This problem is common to several problems met in optimal control: classification of extremals near the switching surface, optimal control with targets and so on. References 1. O. Bolza, Calculus of variations (Chelsea, 1973).
256
2. F. Bonnans and G. Launay, Large scale direct optimal control applied to the re-entry problem, Journal of guidance, control and dynamics, 2 1 , 6, 996-1000 (1998). 3. B. Bonnard, G. Launay, Time minimal control of batch reactors, COCVESAIM, 3, 407-467 (1998). 4. B. Bonnard and I. Kupka, Theorie des singularites de l'application entreesortie et optimalite des singulieres, Forum Math., 5, 11-159 (1993). 5. A. E. Bryson and Y. C. Ho, Applied optimal control (Hemisphere Pub. Corporation, 1975.) 6. CNES, Mecanique spatiale, tome 1/2 (Cepadues Eds., 1995). 7. J.-M. Coron and L. Praly, Guidage en rentree atmospherique, Rapports 4/5 (CNES, Octobre 2000). 8. D. H. Jacobson et al., New necessary conditions of optimality for control problems with state-variable inequality constraints, J. Math and Appl., 35, 255-284 (1971). 9. A. D. Ioffe and V. M. Tikhomirov, Theory of extremal problems (North Holland, 1979). 10. I. Kupka, Geometric theory of extremals in optimal control problems, TAMS, 299, 225-243 (1973). 11. A. J. Krener and H. Schattler, The structure of small time reachable sets in small dimensions, SIAM J. on Control and Op., 27, 120-147 (1989). 12. H. Maurer, On optimal control problems with bounded state variables and control appearing linearly, SIAM J. Control Optimization, 15, 345362 (1977). 13. V. Pontriaguine et al, Methodes mathematiques des processus optimaux (Ed. MIR, 1974). 14. J. H. Schattler, The local structure of time optimal trajectories in dim. 3 under generic conditions, SIAM J. Control Optimization, 26, 4, 899-918 (1988). 15. H. Sussmann, The structure of time-optimal trajectories for single input systems in the plane: the C°° non singular case, SIAM J. Control Opt., 25, 856-905 (1987).
257
H I G H - G A I N A N D N O N - H I G H - G A I N OBSERVERS FOR N O N L I N E A R SYSTEMS E. BUSVELLE A N D J.P. G A U T H I E R Departement de Mathematiques, Universite de Bourgogne, Laboratoire d'Analyse Appliquee et Optimisation, BP 47870, 21078, Dijon cedex, France E-mail: [email protected] In this paper, following ideas already developped in 1 0 , we construct an observer for nonlinear systems that looks like the extended Kalman filter. In fact, it is asymptotically (in time) exactly the deterministic version of the extended Kalman filter, and when the "innovation" is large, it is an high gain observer. In the context of the theory developped in 1 0 , we show that it works for "all observable systems". In the paper, we prove convergence of the estimation error, we give several estimates of this error, and we show a convincing illustrative application (a distillation column)
D e d i c a t e d t o Velimir Jurdjevic
1 1.1
Introduction, systems under consideration Systems under consideration
We consider nonlinear systems of the following form (1.1), on Rn. The control space U, is a closed subset of Rd. Only for simplicity of the exposition of the proof of the main result, the observation is taken to be single-valued: it is a u— dependant linear form on Rn. dx — =A(u)x UT
+ b(x,u),
y = C(u)x. A(u) , C(u) are matrices: C(«) = (oi(u),0,....,0),
(1.1)
258
( 0,a 2 (u),0,....,0 \ 0,0,a 3 (u),0,...,0 A(u) 0,
,0,a n (u)
V°.
»o/
where aj(.), i = l,...,n, are positive smooth functions, bounded from above and from below: 0 < am
\-b2(x1,x2,u)OXi
\- ... + OX2
bn(x1,...,xn,u)-—. OXn
These assumptions look very strong. In fact, under either genericity hypotheses or observability hypotheses, for the purpose of synthesis of observers, it is sufficient t o restrict to these systems, under the normal form (1.1) (or similar multi-output normal forms), and meeting these assumptions. This will be discussed in the next section 2.
We stress again that in all the paper, the single output assumption can be removed everywhere, and we leave this to the reader, but, in Section 4, we will deal with a 2— outputs system, in a similar normal form. 1.2
Presentation of the paper
Our purpose herein is to construct observers, for the observable systems described above. In fact, for these systems, several types of nonlinear observers can be constructed. We will focus on two types of construction that both turn around the "extended Kalman filter", in either its deterministic or its stochastic form: 1. First construction: The Extended Kalman Filter itself, 2. Second construction: The High Gain Extended Kalman Filter, 3. Our construction in this paper: a mixing of 1. and 2. Let us just give some details now, to explain where we want to go.
259 1. The extended Kalman Filter. For long, the engineers introduced and successfully used the extended Kalman filter (EKF), either in its stochastic or its deterministic form. The EKF is just the standard Kalman filter for linear time-dependant systems, applied to the linearized system along the estimate trajectory. We will give precise equations later on. It is easy to see that it is a non-intrinsic object (depending on coordinates). It would be intrinsic if it was dealing with the linearized along the real trajectory, but this trajectory is unknown. It is known that, under observability conditions, the Extended Kalman filter, has good properties: (i) In its deterministic form, it is a local observer in the following sense. For sufficiently small initial error on the estimate of the state, the estimation error converges exponentially to zero. The prototype of these results can be found in 2 for instance. For our systems (1.1), with the assumptions of Section 1.1, it is not hard to check that the linearized systems along any trajectory are uniformly observable, (in the classical sense of the linear theory, and with uniform bounds on the Gramm observability matrices). Hence, this result applies. (ii) In its stochastic form, except for the linear case, where the EKF is the "optimal" filter, there is no general theoretical result that applies. Even for good observable systems in our normal form 1.1, for small noise, small initial variance and dimension 1: there is a counterexample of such a system, in 16 for instance, where the EKF doesn't work at all. Nevertheless, despite the lack of these theoretical justifications, people use it in practice for nonlinear filtering and it may give very good results (even for systems that have much weaker observability properties than those considered here). In the application of our techniques, presented in section 4 below, we will show a (family of) practical examples which is very interesting because, it seems that, the results of 16 on the EKF for small noise, apply in general, and that the "small parameter" has a physical interpretation. We will not say more about that because this is beyond the scope of this paper. But it is one more justification of the use of our method developed here to this application. 2. The High Gain Extended Kalman Filter. The following results have been proved in 4 , 5 , 10 .
260
We consider the equations of the extended Kalman filter, in which the "covariance matrix Q" depends on a real parameter 6,6 > 1, in the following way: Qij=Oi+i+1Qlr For 6 = 1, it is exactly the EKF. For 6 large enough, it is what we call here the "High gain extended Kalman filter" (HGEKF). (i) In the deterministic setting, the estimation error has arbitrarily large exponential decay (depending on 6). ( 10 , for instance). This holds whatever the initial error is, (that is, this is a global result). (ii) In the stochastic setting, it is a nonlinear filter with "bounded variance" (the variance is bounded in 6n, which is not that good, but it is bounded anyway). ( 4 , for instance). 3. What we want to do in this paper. The idea in this paper is the very simple following one: we give the parameter 6 in the HGEKF an exponential decay from 60 large, to 1. What is expected, (and what happens) is the following: (i) The beginning of the transient of the estimation error is the one of the high gain extended Kalman observer: there is an exponential decay that can be made arbitrarily large. (ii) There is a global exponential decay of the estimation error (but, of course, it cannot be controlled). (iii) The asymptotic behavior is the one of the standard "extended Kalman filter", (that people like in practice, as stated above). Our main result, Theorem 3.1 in Section 3 proves (i) and (ii). The proof is more or less an improvement of the proof of convergence of the high gain Kalman observer, as given in 10 . Of course, this construction has a terminal defect: it is time dependant. In deterministic terms, it will work for large initial estimation errors, but not for big "jumps" of the state at intermediate times. In the section 3.3, we propose a very simple practical way to make the observer "recursive". In the section 4, we show the application of this procedure to a binary distillation column in which the "quality of the feed" is unknown, an subject to large changes. It was already noticed in the book 10 that this application is a nontrivial nice application of the observability theory, and of high gain observers. Here, it is even much more convincing: when the feed changes, (a big "state jump"), the behavior of the observer is the one of a high-gain observer:
261
recovering arbitrarily fast the quality of the feed, and when the feed does not move, the asymptotic behavior of the observer is the one of the extended Kalman filter, almost optimal with respect to small noise in that case (but we do not prove anything about this optimality in this paper). For first applications of "high gain observers" to distillation columns, see 19
20
2
Justification of the assumptions and observability
2.1
Justification of the normal form
Let us recall here the main results of an observability theory summarized in 10 , and developed in 6 , 7 , 8 , 9 , 13 . This theory leads to the consideration of systems under the normal form (1.1), or similar multi-output normal forms such as (2.1) below, that meet the assumptions of the section 1.1. Here, by "observability", we mean "observability for every fixed input function u(t)n. For details, see 10 . The main results of this theory are as follows. They concern general nonlinear smooth systems of the form:
y=
h(x,u),
on a smooth manifold X, n dimensional, y £ W, u € U, subset of Rd. Basically, there are two cases. Case 1. p < d. In that case, observability is a non generic property. It is even a property of infinite codimension, at the level of germs of systems. This high degeneracy leads to the fact that, in the control affine case, all observable systems can be put under normal forms similar to (1.1). (moreover, one can take Oj = 1, i = l , . . . , n ) .
This result is only a local result in the state space, but it is a global result with respect to the control variable. Moreover, in most of the practical cases we know, it is also global in x. In particular, it will be global in the application of Section 4. In the non control affine case, there is another result, that we don't want to recall here. It leads naturally to high gain observers of another type ("Luenberger type"). Let us just say that the results herein can be easily generalized to this normal form and these observers.
262
Case 2. p > d. In that case, the situation is completely opposite. Observability becomes a generic property, and generically, a system can be put globally under a normal form similar to (1.1), but the dimension of the state in the normal form is bigger than the dimension of the state of the original system: it is at most double plus one. Also, the control in the normal form contains a certain number of derivatives of the control of the initial system. But this is more or less unimportant for observation problems, where the control, and hence its derivatives, are known. In fact, generically, the systems can be put in a form which is a very special case of the form (1.1), called the "phase variable representation":
y^=V{y,y,....JN-l\u,u,....MN-1)\
(2.1)
N < 2n + 1.
Other cases: there are also other (nongeneric) interesting cases where the original system can be put under the "phase variable representation" (2.1). For instance, systems without control that are such that the mapping: initial — sate —>• derivatives
of y :
{M)
x0^(y,y,....,y ), has "finite multiplicity" for a certain integer M. (See
10
, and originally
13
).
N o t e 1. The reasons for which we make the matrix A(u) depend on u in the normal form (1.1) may look not clear, because, in all the cases described above, it doesn't. In fact, the only reason to consider this dependance is the following: the formal computations we do in the proof of our main result, work for that type of systems. Moreover, in the application we describe in Section 4, the matrix A actually does depend on u. Note 2. In that case were Oj depends on u, the following should also be noticed: even the high gain version of the extended Kalman filter is much better in practice than the "high gain Luenberger observer" mentioned above: the high gain observers both kill the nonlinearities contained in the vector field b. But the extended Kalman filter takes into account the variations of u, through the matrix A(u). The standard high gain observers in Luenberger form don't do this. This is the case in the application, Section 4 below.
263
2.2
Justification of the technical assumptions
Let us consider successively the two technical assumptions we made in the section 1.1: A. 0 < am < a,i(u) < a,M, i = 1, •••, n, B. The functions bi are compactly supported. In fact, the assumption A is always satisfied in the cases 1., 2. of the previous section 2.1: the a; are constant and equal to 1. In the application of section 4, this assumption is also satisfied, as we shall see. Let us just notice the following. Al. The Assumption a,i(u) ^ 0 just implies observability of systems in the normal form (1.1): - If the output y(t) is known, the input being also known, the fact that a\(u) is nonzero implies that we can compute xi(t) from y(t). - The fact that 02 (u) ^ 0 implies that we can compute X2 (t) from the knowledge of xi(t), and by induction, we can reconstruct the whole state x(t) from the knowledge of y(t). Modulo a trivial change of variables, the condition a,i(u) ^ 0 is equivalent to a,i(u) > 0. A2. The aj being smooth, restricting to a compact subset of the set of values of control implies that we can find the am, OM, of assumption A. The assumption B above can be trivially realized, by multiplying by a cut-off function, compactly supported, leaving the original vector field b unchanged on an arbitrarily large compact subset of R". We cannot expect more than that. As explained in the book 10 , the problem of synthesis of observers is an ill-posed problem outside compact sets of the state space. This is easily understandable: on noncompact sets, it can happen that the estimation error goes to zero for certain metrics, but to infinity for others. So that, reasonable observers work only as long as the state trajectory x(t) of the system remains in a given compact set, or they work for semi trajectories {x(t),t > 0} that are entirely contained in a given compact set. To finish, let us mention that this restriction to compact sets (unavoidable in a general observation theory), has not so important consequences: for instance, the high gain observers can be used in general for global dynamic output stabilization (again, see 1 0 ) .
264
3
Statement and proof of the theoretical result
The observer we propose, is based upon the High gain extended Kalman filter, proposed in 10 , 4 , 5 . For computational details about the Riccati matrix equation, we refer to 3 , or 1 0 . 3.1
The observer and the statement of the theorem
The equation of the observer is: (i) f = A(u)z + b(z, u) - Sitter-1 (Cz - y(t)), ( « ) # = - U ( « ) + b*(z,u))'S - S(A(u) + b*(z,u))+ C'r-^C-SQeS,
^
1 J
f =A(l-0), where C = (oi(u),0,...,0) > Qe = 92\-1QA~\ A = diag(l, | , . . . , ( I ) " " 1 ) . Here, b*(z,u) denotes the Jacobian matrix of b(z, u) w.r.t. z, and r,X are positive scalars. Q is a symmetric positive definite matrix. Comments: 1. Q,r, in the stochastic context, are the covariances of the state noise and output noise respectively. 2. If A = 0 and 6Q = 1, or if A > 0, but t is large, this is exactly the (deterministic version of) the extended Kalman filter. 3. If #o is large, and if r < T, then, this equation is almost the equation of the high gain extended Kalman filter with gain 9(T). Hence, for T < T, setting e(-r) = Z(T) — X(T), (e is the estimation error), we can expect the following, for #o large enough in front of T: \HT)\\2
< e(T) 2 ( B - 1 >H(c)e-< O l '^- O 2 ) r ||e(0)|| 2 .
(3.2)
Here, 0,1,0,2 are positive constants, H(c) is a decreasing positive function of c, where 5(0) > c Id. Also, 9{T) = 1 + (0O - l)e~XT. In particular, this implies that the error e(t) can be made arbitrarily small, in arbitrarily short time, increasing #o- For 8 constant, this is the behavior of the "high gain extended Kalman filter. In that case (6 constant), this estimate follows from 1 0 , 5 . We will prove it below for 6 nonconstant. Our main result herein will be the following: Theorem 3.1. 1. For all 0 < A < Ao, (Xo = $^*2\' w^ere Q ^ Qmld and a comes from Lemma 3.2 below), for all 9Q large enough, depending on A, for all
265
SQ > c Id, for all K C IRn, K a compact subset, for all £o = ZQ — xo, £o € K, the following estimation holds, for all r > 0 : \\e(r)\\2 < R(X,c)e-a A(eo,T,\),
=
T
||e o || 2 A(0o,r,A),
2 n
+
(3.3)
XT
6o ( -V U-^-*- \
where a > 0. R(\, c) is a decreasing function of c. 2. Moreover the short term estimate (3.2) holds for allT > 0, r < T, for allOo > 0O, So = e A T ( ^ - l ) + l, where L'is the sup of the partial derivatives of b w.r.t. x. Comments. a. Note that the function A(0O,T, A) is a decreasing function of r, and that, for all r > 0, A > 0, A(60,T, A) can be made arbitrarily small, increasing 00-
b. This means that, provided that A is smaller than a certain constant Ao, and 0o is large in front of A, the estimation error goes exponentially to zero, and can be made arbitrarily small in arbitrary short time. c. The asymptotic behavior of the observer is the one of the extended Kalman filter, d. The "short term behavior" is the one of the "high gain extended Kalman filter". 3.2
Proof of Theorem 3.1
3.2.1 Preparation for the proof Let us recall that: 6(T) = 1 + (0O - l)e~XT,
(3.4)
and let us set F = diag(0,1,2, ...,n — 1). Then: d(|) _ dr
dA-1 dr
A(l - 0) 02 '
A(l-fl) 0
(3.5)
266
The equations under consideration are: (t) % = A(u)e + b(z, u) - b(x,u) - S W - ' C ' r ^ C e , ( « ) f = -(A(u) + b*(z,u))'S - S(A(u) + b*(z,u)) + C'r~lC -
SQeS, (3.6)
We make the following changes of variables, with P = S - l . x = Ax, z = Az, e = z - x, e = As, S = OA^SA'1, ? = §-'
= \APA,b{z)
= Ab(A-1z),b*(z)
=
(3.7) Ab*(A~1z)A-1.
u
Remark : It should be noted that the Lipschitz constant of b is the same as the one of b, and the maximum of ||6*|| is the same as the one of ||6*|| (recall that the component bi of b is compactly supported with respect to all of its arguments (xi,...,Xi,u), and that 6 > 1). An obvious computation gives: ±(e)
= 6[(A - PC'r-lC)e
l(S)
= 8[-{A + \l*[z) -(*-§+
+ \{b{z) - b(x)) - ^ ^ F e \ ,
(3.8)
F)^1)'S
- S(A + \b*{z) - ( y + F)^^L)
+ C'r-lC - SQS],
(3.9)
Important comment. At this place, we used the observability properties: the normal form (1.1) is crucial in the computation above. Now, we can make a time rescaling. We set: dt =
9(T)(1T,
e(r) = e(t),S(r)
oit=
6{v)dv, Jo = S(t),P(T) = P(t),e(r)
= 6(t),
267
to get the final set of equations: (i) | ( e ) = [(A - PC'r-lC)e («) Jt&
+ l(b(z) - b(x)) - %~$-Fe],
= [~(A + \b*{z) -{%
(3.10)
+ F)^=^)'5
- 5 ( ^ + \b*{z) - ( ^ + F ) ^ ^ )
+ C V ^ C - SQS],
{
" ° dt - X 0 ' First, there are some classical results allowing to bound the solutions of the Ricatti equation (3.10), (ii), for do > lj a n d A < 1. To apply these results, one has to notice that the linear time dependant systems: % = (A(u(t)) y=
+
\b*(,)-(^+F)X-^)x(t),
C(u(t))x(t),
are uniformly observable (in the sense of linear systems), for all bounded measurable functions ai(u(t)), b*j(z(t)),9(t), with o « > aj > am > 0. Precisely, we have: Lemma 3.2. If the functions a,i(u(t)), \b*j(z(t))\, 9(t), are all smaller than o-M > 0, and if a,i(u(t)) > am > 0, (which is the case by our assumptions), if 0 < A < 1, and 1 < 9(t) then, the solution of the Ricatti equation 3.10, (ii), satisfies the following inequality, a Id< S(t) < 0 Id, for all TQ > 0, for all t > To, where a and (3 depend on To, am,a,M (but do not depend on c, So > c Id !) This result is more or less classical. It is contained in 3 for instance. A detailed proof is given in 10 , because there are several mistakes in many textbooks. The key point for a simple proof is the precompactness of the weak-* topology on L°°[0, T], and the continuity of the input-state mapping of a control-affine system, for the weak-* topology on controls, and the uniform topology on trajectories x(t), t 6 [0, T]. Straightforward computations with (3.10) give: jt(e(t)'S(t)e(t))
< -Qm e'S(t)2e + 2e'S(t)(l(b(z) A ( l - 0 ) ,=,, ,
- b(x) - b*(z)e)) (3.11)
268
where Q > Qm Id. In particular, if t > T0, with a given by Lemma 3.2, this gives: jt(e(t)'S(t)e(t)) < HQma + ~ ^ - ) e'S(t)e +
(3.12)
2e'S(t)(i(b(z)-b(x)-b*(z)e))Using this equation, and again Lemma 3.2, we will now prove the theorem. 3.2.2. Proof of the short term estimation 3.2 This proof is in two steps. We will first prove an estimation for T > t > To > 0, and after for t < To. Gluing them together, we get the short term estimation 3.2. This is the standard high gain reasoning, and it is done in details in 10 for 6 constant. We omit the computational details. Step 1, T > t > T0. Straightforward computations using (3.12), Lemma 3.2 and the remark in Section 3.2 give: < e(T0)'5(To)e(To)e-(Qma-^))(t'To).
e(t)'S(t)e(t)
(3.13)
< /?||e(T 0 )|| 2 e~ ( c ? '" a _ ^ ; ) ) ( '" T o ) , and finally:
Therefore e(t)'S(t)e(t)
T > t > To :
(3.14)
\\e(t)\\2 < - e - ( Q - " Q - ^ ) ) ( t - r o ) | | e ( T o ) | | 2 . a Step 2, t < T0. We need a more straightforward estimation here. A very rough one is obtained just using Gronwall's identity. For certain s, k > 0, we have: ||P(i)ll<(l|P(0)||+*)e'T°.
(3.15)
We assume that 5(0) = 5o lies in the compact set: c Id < So < d Id. As a consequence, P(0) < -Id. By the equation (3.10), we have, for t < T0 : £(e) = (A- PC'r~1C)e + i(6(z) - b(x)) - m~^Fe, hence: ||e(t)H2 < l|e(0)|| 2 + f
l k - ( r ) | | 2 ( 2 P | | + 2||C|| 2 ||r- 1 || | | P | | + l)dr,
269 and by 3.15, we know that ||P(t)|| <
< ||e(0)|| 2 + P(T 0 ,c) f
\\e(r)\\2dr,
Jo
and P(To, c) is a positive decreasing function of c. Gronwall's inequality implies that:
|| £ -(t)|| 2 <*(r 0 ,c)|| £ -(o)|| 2 , with: \J>(T0,c) = epT°, ^(To,c) is also a decreasing function of c. In particular, ||e(T 0 )|| 2 < #(T 0 ,c)||e(0)|| 2 . Plugging this in (3.14), we get: HeWII2 < - e " ( 0 m a " ^ ) ( t _ T o ) * ( r 0 , c ) | | £ ( 0 ) | | 2 , for T > t > T0.
(3.16)
Hence, for T > t > T0, \\£(t)\\2 < - e _ ( Q m a " ^ ) t e ° " ' a T o * ( T 0 , c ) | | £ ( 0 ) | | 2 .
(3.17)
Going back to t < To, we have: l|e(*)H2 < *(To,c)|| £ -(0)|| 2 < *(ro,c)^|| £ -(0)|| 2 a Hence, in all cases (either t < To or To < t), we have: ||e(i)|| 2 < / / ( T o , c ) e _ ( Q m a " ^ > ) t | | e ( 0 ) | | 2 , 0 < t < T,
(3.18)
with H(T0, c) = f *(T 0 , c)eQmCcT°, a decreasing function of c. Therefore, going back to the initial time r, since t = J^ 6(v)dv, and t < T, then, r < T(T), and t >
6(T(T))T:
\\e(r)\\2 < F(To,c)e-(°'" a e ^( T »-' L ') r ||e(0)|| 2 ,T(T) > r > 0, if C = Qma6(T{T))
- L' > 0, which is implied by 6o>eXT{-T\-^-\)
indeed, if (3.19) holds, since
0{T{T))
+ l,
(3.19)
= 6{T) = 1 + (0O - l) e - A T < T ) > ^
.
270
Since e
=
A^i,
1, ||e(r)|| 2
and 9 >
<
IKA-1)!!2^)!!2
<
^ ( " - ^ H ^ T ) ) ! 2 , we get, for all r 0 > T > 0 :
||e(r)|| 2 < ^ ( " - ^ ( ^ ^ ( T c ^ e - W - ^ ^ - ^ ^ H e ^ ) ! ! 2 , fore0>eAro(?^
1) + 1,
jj
or equivalently, 6{TQ) > —
.
H(T0, c) is a decreasing function of c. This is the short term estimation (3.2). If A = 0, it gives the standard high gain estimation. 3.2.3. Proof of the long term estimation Going back to (3.12), and using Lemma 5.2, in Section 5, we get, for all A, 0 < A < 1, t > T0, < - * i e'S(t)e + k2 9(t){n-2)\\S\\
jt(e(t)'S(t)e(t))
\\e\\\
where k\ = Qma, k2 is a positive constant. Lemma 3.2, applied to the Riccati equation in (3.10), implies: jt(e{t)'S{t)e{t))
< -h
e'S{t)e + k'2 #"- 2 > \\e(t)'S{t)e{t)\fi,
(3.20)
for another positive constant k'2. Now, we apply Lemma 5.1, in Section 5, to get that, for t > T > To: e(t)'S(t)e(t)
< 4e-klit-T^e(T)'S(T)e(T),
(3.21)
as soon as m
e(T)'S{T)e{T)6{T)^-V
Setting, q = e(T)'S(T)e(T)6(T)2(-n-V, tion (3.18). It gives q < /?F(T o ,c)e~ q<
(Qma
<
\2
^ - .
let us use the short term estima~^ ) T ||e(O)|| 2 0(T) 2 ("- 2 >,
PH(To,c)e-{Q™a-&)T\\e(0)\\%{n-2)-
If: 0o > eXT(-
IV
1) + 1,
(3.22)
271
then 2 a a _ _L_ > 22L^1 . indeed, in that case, 6{T) > 6(T) = 1 + (80 Then, let us chose T = T* - Log( ffi,"1 )* > T 0 (in order to get the equality in (3.22)). This is possible, since we can assume from the very beginning that J ^ - 1 > 0 (we can increase V for this) and ffir1 - > eT° > eXT° (we can take 0O large enough). iV
_ j
< ^(r 0 , c )(fe- r )^|!e-(o)||X 2(ri - 2)
q
< /m(To,c)|| e -(0)|| 2 (2(-^ - l ) )
2
^^-
2
)-
9
^.
Then, if:
(3 23)
"<W^Y
-
for 0O large enough, for ||e 0 || bounded, q is arbitrarily small. This means that the property (?#) above is met at T = T*(00, A), as soon as A satisfies (3.23) and 8Q is large enough. In that case, (3.21) above holds, for t > T* (> T0) : e(t)'S(t)e{t)
< 4e-* 1 ('- T *>e(T*)'5(:r)e(T*), <
4e-klt(^j2^-)^e(T*yS(T*)e(T*).
This implies, with (3.18): '|e(t)H2 <
de-^(^~L)^\\e(T*^ \ $ HIT
„\„L'T*-k!t/
#0-1
<4^H(T0,c)e^'e-^(^^-)^\\s0\\2, < A^-H{T0,c)e-^{^
2L'
ti
-)>\\e0
Qma
for t > T* (> T 0 ). For £ < T*, using (3.18), and the fact that k\ = Qma : || £ -(i)|| 2 < J ff(To, C )e-^ t e L ''|| e o || 2 ,
272
because /fl
1
> 1.
Therefore, for alU > 0 :
\m\f
-
ff(To>c,A)e-*'i0oil*£:||eo||2, e~Xr),
Finally, IKr)||2 < H(To,c,A)e-fc"|ko||2«'oil^+2(n-1)e-*1*(1-e-*T), where H(To,c, A) is a decreasing function of c. This is the long term estimation. It holds as soon as A satisfies (3.23), and for 6Q large, depending on A. 3.3
Practical implementation:
making the observer "recursive"
We consider a one parameter family {OT,T > 0} of observers of type (3.1), indexed by the time, each of them starting from So, 9g, at the current time T. In fact, in practice, it will be sufficient to consider, at time r, a slipping window of time, [r — T, r[, and a finite set of observers {Ot{, T -T
273
OtN- Then, the error will be given by the "long term" and "short term" estimates at time T : \\e(T + \\E(T + T)\\2 <
T)\\2
a. If T is large enough, the asymptotic behavior will be the one of the "extended Kalman filter". b. At the beginning, the transient is the one of the HGEKF. c. the error can be made arbitrarily small in arbitrary short time, provided that 6Q is large enough. 2. If at a certain time we have a "jump" of the state, then, the innovation of the "old observers" will become large. The "youngest" one will be chosen, and the transient will be the same as the one of the HGEKF, first, and of the EKF, after T. This looks very promising. We show on an example in the next section, that it works very well. 4 4-1
Application: observation of a binary distillation column The constant molar overflow model
The model we consider is the classical "constant molar overflow" (CMO) model. It is one of the most simple distillation models, and it is used by many process engineers (for instance, even in its static form, it is used for simple short-cut distillation calculations). Since everything here follows from the very special " tridiagonal" structure of this model, and since any reasonable distillation model possesses such a structure, all what we do in this paper can certainly be extended to more precise distillation models. The equations are based upon: a. a thermodynamical relation describing the thermal equilibria for each tray. b. Material balances on each plate. Thermal balance on each plate is replaced by the "Lewis hypotheses", that lead to the fact that the liquid and vapor flowrates along the column are constant in the "stripping" (above the feed) and "rectification" (below the feed) zones. For justification of these assumptions, see 1 5 .
274
The equations of this model are: Total condenser: dx\ Hi^j-= (V + FV)(y2 -
Xl).
(4.1)
Rectifying section, j = 2, • • • , / — 1 : dx'
(4.2)
Feed tray: Hf^-=
FL(ZF + L(xf-!
Xf)
+ FV(k(ZF)
- Xf) + V(yf+1
-
- yf)
(4.3)
yf).
Stripping section, j = f + 1, • • • , n — 1 : Hj-±
Vj).
(4.4)
Bottom of the column: dx Hn-jr = (L + F I ) ( i „ _ i - xn) + V{xn - yn). at The parameters have the following physical meaning:
(4.5)
n
f Hi Xj
Vi FL,FV,L,V ZF
= (L +
FL){XJ-!
-
XJ)
+ V(yj+1
-
number of trays, index of the feed tray, liquid hold up on the j t h tray (a geometric constant), liquid composition on the j t h tray, vapor composition on the j t h tray, feed (liquid and vapor), reflux and vapor flow, feed composition (molar fraction of light component in feed).
On each tray the liquid and vapor compositions, Xj and yj, are linked by the liquid/vapor equilibrium law yj = k(xj). We assume that the function k is monotonic, i.e. we do not consider azeotropic distillation. Each of the equations is relative to a tray. It just expresses the accumulation of the liquid on the corresponding tray, and the thermodynamical equilibrium. The condenser and the bottom of the column are assimilated to tray 1 and tray n respectively. The state of the model is the liquid composition profile of the more volatile component on each tray, denoted by (XJ).
275
The top and bottom product compositions x\ and the two observed variables. In practice, they are also the two variables that one wants to control: they are the "qualities" of the products going out of the column. The two control variables are the reflux flow-rate L and the vapor flowrate V. There are also two disturbances to be counteracted: a. changes in the feed rate F = FL + FV. In general this is a "measured disturbance", (a flowrate measurement), b. the in-feed composition Zp. In general, it is unknown, and it is practically very expensive to "observe it". Moreover, it may change brutally. We will consider this feed composition Zp as an unknown (constant) state variable. When Zp changes, the consequence is a jump of the state of the system. The qualitative properties of this model are very nice (see 10 , 18 , 1 7 ) : a. For positive control variables L and V, (negative doesn't physically makes sense), the "physical" domain D = [0,1]" is positively invariant under the dynamics. This means that all the state variables Xj remain between 0 and 1. b. In the domain D, all other variables (than the ar^'s and the t/j's) being constant, there is a single equilibrium, which is globally asymptotically stable. c. It has very nice observability properties, as will be discussed later on. Our goal in this section is to construct an estimator of the state x, and more specifically of the feed composition Zp, by using the results of the previous sections. 4-2
Observability of the model and synthesis of the observer
A complete analysis of observability and observer synthesis has been carried out in 10 in the general case. It happens that, even if the feed is considered as an unknown state variable (meeting the equation ^~f- = 0), the model is observable in the strongest possible sense. In particular, as we shall see, it can be put in a normal form similar to (1.1). Our purpose here is just to apply the observer described in the previous sections. Hence, we will fix a special case of distillation column. But all what we show works in general. We will chose:
276 • n = 5 and / = 3, • The function k is a diffeomorphism from [0,1] into itself and is given by, ax
ur \ k(X) v
=
;
T7—.
' 1 + (a - 1) x Here a is the "relative volatility" of the mixture. It is a physical parameter larger than 1 (but close to 1). The closer to one, the most difficult distillation. If a = 1, the two products are thermodynamically identical, and cannot be distillated (the model is not controllable). • Let us observe that k is a diffeomorphism from
—J^I,+OO
to
]-°°.=£r[• The feed is assumed to enter the column at its "bubble point". As a consequence, F = FL. Let us make the following change of state variables: £i = x\, £2 = k (x 2 ), £3 = x3, £4 = i 4 , & = x5 and £6 = ZF. Then, the system can be rewritten as:
H2$- = k' {k-1 ( 6 ) ) (L(& - fc-1 (&)) + v(k ( 6 ) - & ) ) , , # 3 # = Ffa - 6 ) + L(k~l ( 6 ) - 6 ) + V(k (&) - k ( 6 ) ) , S ff4% = (L + F ) ( 6 - £4) + V(k (&) - A (&)),
(4-6)
tf5ff = (L + F)(& - &) + V(& - * (&)), #B% rf< = 0
or: d6 = A(L,V)Zt dt
+
b(L,V,tt),
where,
A(L,V)
0 0 = 0 0 \0
0 0 0 0 0
0 0 L+F
0 0
0 0
0 0 \ 0 0
0 0 0 ^ 0 0 0 0 0/
(4.7)
277
and,
I
-—£ 1
\
1
V (AT fa)) (Lfa - fc- fa)) + V(k fa) - &)) / t f 2
(-F& + ^(fc-1 (6) - 6) + V(kfa)- k fa))) /tf 3
b(L,V,0 =
(-(L + F)£4 + V{k fa) - A fa))) / # 4 (-(£ + F)& + V(& - A fa)))/H5 0
\ /
6i(V,6)
/
\
b2(L,V, £!,..., e5) h(L,V, &,&,&) h{L,V,(ji) 0 \ The observations are then given by V =
J
100000 000010
f = Cf.
Now, since in fact the only pertinent (and positively invariant) part of the state space is D' = [0, l ] 6 , we can manage the things for b be compactly supported, as in section 1.1, and unchanged on D'. Let us change b (L, V, £) in the following way outside [0,1] : replace b (L, V, £) by b(L,V,0 = b(L,V,*fa) where $ fa,. . . , & ) = (if fa),... ,
^ , —^r • This modification does not change the "physical
J
a —
2
a
2 I
trajectories". Our system has the property to be observable for any input, as soon as the control variables L and V are > 0. Here, we assume that L, V are bounded from below (and from above) by > 0 constants: LM > L(t) > ei > 0, VM> V(t) > e 2 > 0. This assumption is the analog of the assumption 0 < am < ai(u) < aM, in section 1.1. It is a realistic requirement from the physical point of view. To finish, let us point out the fact that we are in case 1 of section 2.1 above (i.e. the nongeneric case): The number of observations is equal to the number of control variables (it is 2).
278
Due to these observability properties, we will be able to apply the observer of the previous section 3.3. In fact, it will be an adaptation of the results of section 3, Theorem 3.1, to this multi-output case. We leave the reader to check (this is really straightforward) that all the reasoning in the proof of Theorem 3.1 can be strictly repeated, and that the statements of this theorem are valid for the distillation column. Of course, in practice, we didn't compute the theoretical bounds Ao and #o(A). We have just got some values for them by experimentation. Also, the number TV of "parallel" observers, and the "sampling times" U of section 3.3 have been chosen experimentally. Finally, the state of our observer is the collection of the states of TV independent observers (zi, 5j, #i) i = 1 N. Each observer is a set of three equations of the following form: f =A(u)z + b(u,z)-S (ty1 CTR-6X [Cz - y (t)) H = - (A(«) + b* (z,«))'S-S(A(u) + b* (z,«)) + C'R^C
§=A(l-0)
-
SQ9S
where u = (L, V). Due to the multi-output structure, with "Brunovsky-like" blocks of different dimensions (4 and 2), a way to make the proof of Theorem 3.1 work, is to take a matrix R depending also on 6, as shown below. This could be avoided by increasing the dimension of the state as explained in 10 . It is not hard to check that a good choice is to set: A = d i a g ^ — , ^ , — ,g,l, with Qe = 92A~1QA~1
and Re = (CA^C)
R
p J (CA^C).
In practice, we have chosen TV — 5 observers, and we have taken a regular sampling ^ . That is to say, at each time step k^, the oldest observer is replaced by a new one (with 6 = #o and a new guess of state and covariance matrix). At the beginning of the simulation, we chose an initial value #o of 6 for each observer, such that the ith observer has #; = 1 + e~ A — ^ (0O - 1), see figure 3, where "crosses" represent reinitializations. We have implemented our observer as described in the previous section. Since the state has dimension 6, each observer requires to solve 28 ordinary
279
differential equations (for the state, the Riccati matrix, and the very simple equation for 6). Finally, our observer is a set of 140 ODE's. We have solved it in conjunction with the model (6 equations) using LSODAR from ODEPACK ( 14 ), without taking into account the possibility of decoupling these equations (which are indeed equivalent to five systems of 34 equations, including the model into each system). A simulation of 3 hours of real time takes about 40 seconds on a Pentium III machine.
4-3
Simulation results
We have chosen the following constant parameters: • Hold-up Hi = 40, Hj = 10 for j = 2,3,4 and H5 = 80, • Relative volatility a = 2. We have applied the following scenario: - During the simulation, the state noise is simulated by the sum of several sine functions at some random frequencies representing a band limited noise with an amplitude of 1 0 - 8 before the time *2 = 116 mn 40 s and 10~ 2 after this time, - Moreover, at time t\ = 66 mn 40 s, we simulate a step in the feed quality Zp from 0.45 to 0.60. Hence we can consider that there is no perturbation before time t\, where a large "jump of the state" occurs, - after that, nothing happens until time t 4.3 which is running.
280
Finally, R is equal to 10 2 times the 2 x 2-identity matrix and Q is 10 times the 6 x 6-identity matrix.
9
First of all, the behaviour of the observer is very good during the unmodelled transient as well as during smooth operation, see Figure 4.3: top and bottom quality measurements are plotted, as well as the unknown feed quality, each curve being represented by a continuous line. The overall estimation of the feed quality, corresponding to the estimation of the feed quality provided by the observer with the smallest innovation, is represented by a dashed line. It is very close to the actual feed quality.
20
40
60
80
100
120
140
160 180
Figure 1. Measured output and estimation of the feed quality.
A more accurate plot is presented on Figure 2 where we have only shown the relative estimation error of the feed quality. The estimation provided by the best observer (in our sense, that is to say, the observer with minimal innovation) is the continuous line. The crosses represent the estimation of Zp provided by other observers every minute. One can see that our criteria on the innovation to select the right observer is a good choice, at least in this case. Moreover, the behavior of the observer is very close to what we expected from the theoretical results:
281
Figure 2. Relative error between the actual feed quality and its estimation by the selected observer (continuous line) and the others.
- When no perturbation arises, the best observer (that is to say the observer with the smallest innovation) is the one with the smallest value of 6 i.e. the oldest observer which is also the observer which is the closest to the pure extended Kalman observer. -If a large perturbation occurs (such as the feed change at time t\ = 66mn40s), the best observer becomes the youngest one, i.e. the observer with the highest 9. -Of course, small perturbations are well corrected by oldest or intermediate observers. This is very clear on the figure 4. Our conclusion, from these simulations, is that even if the use of several observers in parallel requires the introduction of new tuning parameters (80, A, N and T), the choice of these new parameters is very easy, due to their very clear effect on the results. ^From a practical point of view, 60, A, N and T have to be chosen such that at any time, there is an HGEKF and an EKF-like observer running at the same time, that is to say such that 1 + e~x^ (6Q — 1) is large enough (to ensure that at least one observer is a HGEKF) and such that l + e _ A T (60 - 1) is close to 1.
282
0
20
40
60
80
100
120
140
160
180
Figure 3. T h e 5 observers. Time of reinitialization of each observer ( x ) , and the best one (continuous line).
0
20
40
60
80
100
120
140
160
180
Figure 4. Various values of 6 versus time (dotted lines), and best observer (continuous line).
283
Also, an important point, for people that are used to tune Kalman's observers, is that the choice of the Q and R matrices is less crucial than with a single observer which has to be tuned in order to be efficient both with and without perturbations. Moreover, this approach allows us to obtain a diagnosis of abnormal behavior: if the smallest innovation is provided by the last reinitialized observer then one can conclude that the model has encountered a perturbation. If this happen for a long time then one can conclude that the model has some difficulties to deal with certain unmodelled perturbations. Indeed, the scenario that we have applied in our simulations can be easily deduced from the figure 4. 5
Appendix. Technical lemmas
Lemma 5.1. Let {x(t) > 0, t > 0} C R n be absolutely continuous, and satisfying: — < —Xx + kxi/x, at for almost all t > 0, for X,k > 0. Then, as soon as x(0) < jjp, x(t) < 4x(0)e-'\ Proof. We make the successive following changes of variables: y = y/x, z = 1/y, w(t) = e~2(z(t). Then, all the quantities y(t), z(t), w(t) are positive and absolutely continuous, on any finite time interval [0,T]. We denote by ' the derivatives with respect to time. Straightforward computations give, for almost all t > 0 :
y'<-\y ,
A - 2
+ \y\ k 2
w' > - e ~ ^ ' - . 2 Moreover, w(0) =
* . Then, for all t > 0,
(5.1)
284
If
}
— j > 0, then w(t) > 0,and we can go backwards in the previous
inequalities: - -(1 - e - ^ )
w(t) > - .
y(t) <
x(t) <
**'{-&* -i)+r /x(0) x(0)e~xt (1-V^(0)!)2'
Hence, if z(0) < ^ , or 1 - y/xJU)^ > §, then: x(t)
<4x(0)e~xt. D
Lemma 5.2. Let B — b{z) — b{x) — b*(z)e be as in Section 3: e = z—x, b(x) = A6(A _1 a;), b*(z) = A6*(A _ 1 x)A _ 1 , where b*(x) is the Jacobian matrix of b at x, and where b is compactly supported. A = diag{\, |,—, fphrr), 9 > 1. Then, \\B\\ < K <9"-111£:||2, for some K > 0. Proof. Let us consider a smooth expression E(z, x) of the form: E(z,x)
= f(z) — f(x) — df(z)e, with e = z — x,
where / : W —> R is compactly supported. We have, for t > 0: f(z - te) = f(z) - »
/ |£(z -
re)dr,
and: 3/ 9a;
>-"> = £«-£•> jf sis*-*)*
Hence, ,1
f(z-e)
=/
M-S>£«
rr
a 2 /
+ ! > * / [j!&f-«*»*•
285
Since / is compactly supported, we get: M p \f(z) - f(z - e) - df(z)e\ < y £
l^l>
whereM = s u P J 5 ^ - ( x ) | . Now, we take f = bk, and we use the facts that 6* depends only on xi, •••,Xk, and t h a t 0 > 1 :
'dxidxj ^ ' W I ^ ' - ' l ^dxidxj lA-'x),. This gives the result.
•
References 1. M. Balde, P. Jouan , Observability of control affine systems, ESAIM/COCV, 3, 345-359 (1998). 2. J.S. Baras, A. Bensoussan, M.R. James, Dynamic observers as asymptotic limits of recursive filters: special cases, SIAM J. Appl. Math., 48, 11471158 (1988). 3. R. Bucy, P. Joseph, Filtering for stochastic processes with applications to guidance (Chelsea publishing company, 1968, second edition, 1987). 4. F. Deza, Contribution to the synthesis of exponential observers (Ph.D. thesis, INSA de Rouen, France, June 1991). 5. F.Deza, E.Busvelle, J.P.Gauthier, High-gain estimation for nonlinear systems, Systems and Control Letters 18, 295-299 (1992). 6. J.P. Gauthier, H. Hammouri, I. Kupka, Observers for nonlinear systems; IEEE CDC Conference, Brighton, England, december, 1483-1489 (1991). 7. J.P Gauthier, H. Hammour i S. Othman, A simple observer for nonlinear systems. IEEE Trans. Aut. Control, 37, 875-880 (1992). 8. J.P. Gauthier , I. Kupka, Observability and observers for nonlinear systems. SIAM Journal on Control, 32, 4, 975-994 (1994). 9. J.P. Gauthier , I. Kupka, Observability for systems with more outputs than inputs. Mathematische Zeitschrift, 223, 47-78 (1996). 10. J.P. Gauthier , I. Kupka, Deterministic observation theory and applications, (book to appear at Cambridge University Press). 11. A. Jaswinsky, Stochastic processes and filtering theory, (Academic Press, New York, 1970). 12. P. Jouan, Singularites des systemes non lineaires, observabilite et observateurs (Ph.D. thesis, Universite de Rouen, 1995).
286
13. P. Jouan, J.P. Gauthier, Finite singularities of nonlinear systems. Output stabilization, observability and observers, Journal of Dynamical and Control Systems, 2, 2, 255-288 (1996). 14. A. C. Hindmarsch, Odepack: a systematized collection of ode solvers, in Scientific Computing, R.S. Stepleman et al. (Eds.),55-64, (NorthHolland, Amsterdam, 1983). 15. C D . Holland, Multicomponent Distillation, (Englewood Cliffs, NewJersey, USA: Prentice Hall, 1963). 16. J. Picard, Efficiency of the extended Kalman filter for nonlinear systems with small noise, SIAM J. Appl. Math., 5 1 , 3, 843-885 (1991). 17. H.H. Rosenbrock, A Lyapunov function with applications to some nonlinear physical systems, Automatica, 1, 31-53, (1962). 18. P. Rouchon, Simulation dynamique et commande non lineaire des colonnes a distiller (These de l'ecole des mines de Paris, 1990). 19. F. Viel, Stabilite des systemes non lineaires controles par retour d'etat estime. Application aux reacteurs de polymerisation et aux colonnes a distiller (These de l'universite de Rouen, 1994). 20. F. Viel, E. Busvelle, J.P. Gauthier, A stable control structure for binary distillation columns, International Journal on Control, 67, 4, 475-505 (1997).
287
LIE SYSTEMS I N CONTROL THEORY JOSE F. CARINENA AND ARTURO RAMOS Departamento de Fisica Teorica. Facultad de Ciencias, Universidad de Zaragoza, 50009, Zaragoza, Spain E-mail: [email protected], [email protected] We show that the theory of systems of differential equations allowing a rule for expressing the general solution in terms of some particular ones, the so-called Lie systems, are relevant in Control Theory. These systems are related to some particular cases of control systems on Lie groups, whose exact solution is usually found by means of a method generalizing the one proposed by Wei and Norman. We establish some results concerning these systems. The general theory is illustrated with some examples.
Dedicated to Professor Velimir Jurdjevic on the occasion of his 60"" birthday.
1
Introduction
There exist systems of first order ordinary differential equations such that the general solution can be expressed in terms of a number of fundamental solutions and several constants determining each particular solution. The characterization of such systems is due to Lie 19 and henceforth will be called Lie systems. They are receiving a lot of attention during these last years, because they appear very often in a variety of problems in mathematics, physics and other branches of science and engineering 6 2S ' . The two best known examples are the linear systems and the scalar Riccati equations, but Lie systems also appear when studying evolution in Lie groups. More specifically, as far as Control Theory is concerned, drift-free right-invariant affine control systems in Lie groups 3 ' 1 6 are another example of Lie systems. The important fact we want to stress here is that Lie systems in Lie groups are in some sense prototypical, because any other Lie system can be related with one of such models. The aim of this paper is therefore twofold: to expose how Lie systems are interrelated with systems on Lie groups, and how a generalization of the method proposed by Wei and Norman 23 ' 24 can be useful for solving this last problem. There also exists reduction techniques for these problems 7 which will not be considered here. We feel that a careful study of Lie systems from a geometric viewpoint may be useful for a better understanding and treatment of these systems, which appear quite often in important problems of geometric Control Theory. The paper is organized as follows. Next section is devoted to the description of Lie systems and the formulation of the Theorem which characterize them, along with some simple but important examples. Then, we relate each of these systems with a similar system but denned in an associated Lie group. We follow in Section 3 with some questions of interest in Control Theory concerning these systems. In
288 Section 4 we show how a generalization of the Wei-Norman method applies in these problems. Finally, we illustrate in Section 5 the theory developed by means of several examples. 2
Systems of differential equations admitting a superposition rule
An interesting result by Lie is the characterization of time-dependent systems of first order differential equations ^M=Xi{x,t),
i = l,...,n,
(2.1)
for which it is possible to write the general solution in terms of a set of m arbitrary but independent particular solutions { x ( i ) , . . . , £( m )} by means of a function also depending on a set of n constants, X(j) = F(x(i),... , X ( m ) , c i , . . . ,c„). The function F will be called a superposition function. From the geometric viewpoint the system (2.1) can be considered as the one determining the integral curves of the timedependent vector field n
X(x,t) = J2xi(x,t)^-.
(2.2)
i—l
For more details about the geometry of these objects see, for example, 5 ' 6 . The characterization by Lie 1 9 of such systems, hereafter called Lie systems, amounts to say that they correspond to time-dependent vector fields which can be written as a linear combination r
X(x,t) = Y,bct(t)YM(x) a= l
with time-dependent coefficients ba(t) of vector fields Y^a\x) closing on a real rdimensional Lie algebra, i.e. { Y ^ } are linearly independent and there are r 3 real numbers, cal3 7 , such that r
[ y ( a ) , y , w ] = J2 c Q " 7 ^ ( 7 ) •
(2.3)
7=1
Moreover, the number r satisfies r < mn. For a geometric proof of this Theorem, see 6 . Simple examples of Lie systems are the linear systems ^ - = £ " L i A* j(t)x3, whose superposition function is linear, F(x(i-), . . . , X(ny,k\, . . . , k„) =fciX(i)+ - • • + A;„X(n), and the scalar Riccati equation x = /2(i) x2 + fi(t) x + fo(t), for which the superposition function reads as Xi(x3-x2) F(xi, x2, x3; k) = —
+
.
kx2(xi-X3) r— .
(X3 - X2) + k(Xl - X3)
289 See 7 ' 8 for details. Another remarkable example occurs when one takes a Lie group G and considers vector fields Xa in G which are either left-invariant or right-invariant, corresponding to the Lie algebra g of G or to the opposite algebra, respectively. We refer to standard references as 9 ' 1 7 for the properties of Lie groups and of their actions on manifolds. Let us choose a basis { a i , . . . , a r } for the tangent space TeG at the neutral element e 6 G. If Xa denotes the right-invariant vector field in G such that X
ff(t) = - £ > « ( * ) * « OK*))-
(2-4)
a=l
Applying (-R9(t)-i )»9(t) to both sides of this equation we obtain the equation on TeG r
(J*,(t)-i).,(«)($(*)) = " E M « K ,
(2.5)
a= l
which we will write with a slight abuse of notation as (S3 _ 1 )(t) = — 5Za=i ba(t)aa, although only in the case of matrix groups .R 9 (t)-i» 9 (t)3W becomes the matrix product g(t) <j(t) _1 . Analogous expressions are obtained if left-invariant vector fields Xt in G are considered, defined by X ' ( e ) = o 0 , that is, g(t) = ^ = 1 ba{t)X%(g(t)) and (i s (i)-i).j(()(j(*)) = E U i ba(t)aa. Now, let H be a closed subgroup of G and consider the homogeneous space M = G/H. Then, G can be seen as the total space of a principal bundle (G, ir, G/H) over G/H, where 7r : G —> G/H is the canonical projection n : g \-¥ gH. It is known 6 T ' that the right-invariant vector fields Xa are 7r-projectable and that the 7r-related vector fields in M are the fundamental vector fields —Xa = —Xaa corresponding to the natural left action $ of G on M, i.e. n*g(Xa(g)) = —Xa(gH). In this way we have an associated Lie system on M corresponding to the time-dependent vector field r
X{x,t)
= ^ba{t)Xa{x),
(2.6)
a=l
where we have taken x = gH. Therefore, see 6 ' 7 , the solution of (2.6) starting from xo is x(t) = $(g(t),xo), being g(t) the solution of (2.5) with g(0) = e. In other words, the curves x(t) so defined satisfy T
x(t) = Y,ba{t)Xa{x{t))
(2.7)
a= l
and the given initial condition. Conversely, given a system of type (2.6) such that Xa be complete vector fields and close on a Lie algebra g, we can regard them as
290 fundamental vector fields relative to an action defined by the composition of the flows of the vector fields Xa. Then, the restriction to an orbit will provide an homogeneous space of the type described above. As a consequence, we can reduce the problem of finding the general solution of the system giving the integral curves of (2.6) to the one of finding one solution of an equation of type (2.4) on the group G. This may give rise to more complicated equations than the original ones. However, there are two main advantages in doing this. Firstly, because of the right-invariance of (2.4), one only needs to find the particular solution starting from the neutral element, g(0) = e. Secondly, the solutions for all Lie systems corresponding to all possible (effective) actions of the same group G on the same or different manifolds, are obtained at once from the solution in the group.
3
Control and controllability of systems on Lie groups
Equations of type (2.6), where the functions ba(t) play the role of the control functions, appear often in Control Theory. These functions are to be determined in such a way that, for example, the trajectory passes through one or two specific points in the configuration space, or maybe gives some cost functional a stationary value. If the manifold in which the vector fields are defined is a Lie group the equation will be r
*(«) = £ &«(*)*-(*(*))•
(3-1)
Q= l
When the vector fields Xa in the Lie group G are right-invariant (respectively leftinvariant) the control system is said to be right-invariant (resp. left-invariant). As an example, the system (2.4) can be regarded as a right-invariant control system. The important question of controllability of control systems on Lie groups like (3.1) has been analyzed by Brockett 3 and Jurdjevic and Sussmann 16 . They have shown that controllability of such systems can be determined by studying algebraic properties of the corresponding Lie algebra g. We offer next a simple proof of one of their results. T h e o r e m 3.1. A drift-free right invariant system of the form (3.1) on a connected Lie group G is controllable if and only if the Lie algebra generated by {Xi, • .. ,Xr] is 0Proof. If h is the Lie algebra generated by { X i , . . . ,Xr}, then the Lie algebra of the Lie system is not g but the subalgebra f). The orbit of the neutral element e € G is the subgroup H of G with Lie subalgebra f). It is then clear that if h is a proper subalgebra of G, the system is not controllable, while it is so when h = g. • Note also that affine control systems on a Lie group like T
g(t) = X0(g(t)) + J2 ba(t)Xa(9(t)), a= l
(3.2)
291
with {Xo, Xi, . . . , X,.} right- or left-invariant vector fields, can be obtained (locally) from
^• = J2da(r)Xa(h(r))
(3.3)
Q=0
by means of a reparametrization given by dt/dr = do(r), ba(t) = da(T(t))/do(r(t)) and g(t) = h(r(t)). Therefore, the result is applicable for systems like (3.2) such that {Xo, X\, ..., Xr} close on a Lie algebra.
4
The Wei—Norman method
The equation in the group (2.4), or more precisely, in TeG given by (2.5), can be solved using a generalization of the method proposed by Wei and Norman 2 3 ' 2 4 for linear systems of type dU(t)/dt = H(t)U(t), with U(0) — I. It is based on the following property 7 : If g(t), gi(t) and gi{i) are differentiable curves in G such that g(t) = gi(t)g2(t),Vt <=R, then,
= ii M (0-i. 9 1 («)(9i(*)) + Ad(»i(0){iZ w (t)-i. M ( o(S2(*))} • (4-1) The generalization of this property to several factors is as follows. Let now g(t) be a curve in G given by a product of other I curves g(t) = gi{t)g2(t) • • • gi(t) = Ili=i 9i(t)- Then, denoting hs(t) = I l L . + i 9i(t) for s e {1, . . . , / - 1} and applying (4.1) we have Rgw-i
. 9( «)(fl(0) = R9lW-i
.„(«)(si(0) + A d ( S i ( t ) ) { i i k l ( t ) - i . h l ( t ) ( f t i ( t ) ) } .
Since hs(t) = ga+i(t)ha+i(t) for s 6 {1, . . . , I — 2}, the process can be iterated, and using that Ad(gg') = Ad(g) Ad(#') for all g, g' 6 G we obtain
« 9 (t)-i .,(«)($(*)) = E
A d
( n * - ( * ) ) {R9iM-*
*9iit)(9i(t))}
= E ( I I Ad (*(*)) ) {*«(*>-! .«(t)(9i(*))} .
(4-2)
where it has been taken go(t) = e for all t. The generalization of the Wei-Norman method consists on writing the curve g(t), solution of (2.5), in terms of the second kind canonical coordinates w.r.t. a basis of the Lie algebra g, {m, ..., ar}, for all t, g(t) = YYa=1 e x p ( - u a ( t ) a a ) , and transforming the differential equation (2.5) into a differential equation system for the va(t), with initial conditions va(0) = 0 for all a = 1, . . . , r. The minus signs in the exponentials have been introduced for computational convenience. Then, we use
292 the result (4.2), taking I = r — dimG and ga(t) = exp(—v a (t)a a ) for all a. A simple calculation shows that Rga^-i „ 9Q ( ( )(pa(*)) = — va(t)aa, so that (4.2) reduces to R
g(t)-i *9(t)(g(t)) = - ^ Z * " I I I Ad(exp(-«^(t)o^)) J aa
= ~ ^2 Vac I Yl exp(-V0(t)
ad(a0)) J aa ,
a=l \0
a
I I I exP(-v0(t)a.d(a0)) \0
J aa = ^ 6 a ( t ) a a , J a=l
(4.3)
with va(0) = 0, a = 1, . . . , r. The resulting differential equation system for the functions va(t) is integrable by quadratures if the Lie algebra is solvable 2 3 , 2 4 , and in particular, for nilpotent Lie algebras. 5 5.1
Illustrative examples Model of a mixed-culture
biorreactor
Our first example is inspired in the Example 2.44, p. 44 of 22 , related with the model of a mixed-culture biorreactor. The state space will be lRn, where we take coordinates (xi, . . . , x„). The control system of interest is ii — bi (t)xi
if
i/ j ,
±j = 6i (t)xj + b2,
(5.1)
with j 6 {1, . . . , n} arbitrary but fixed. Let us consider the vector fields
dxi '
dx
The control system (5.1) therefore determines the integral curves of the timedependent vector field b\(t)Xi(x) + b2{t)X2(x). Taking the Lie bracket of X\, X2, 71
[Xi,X2]
Q
Xi
^ dxi'dxj
we see that {Xi,X2} close on a Lie algebra isomorphic to the Lie algebra of the affine group in one dimension Ai. The elements of this group can be represented by a pair of real numbers (ri, r2) with r\ > 0, with the product law ( n , r2)(r\, r2) =
293 ( n r ' i , r\r2 + r2). The corresponding Lie algebra has a basis {ai, a2} in which [ai, 02] = — <X2. The adjoint representation of this Lie algebra in this basis takes the form ad(a1)=(°_°1),
ad(aa)=(5J)
and therefore exp(-wi ad(aO) = ( 0 e «i J >
exp(—u2 ad(a 2 )) = [ _v
l
)•
If we express the solution of the associated problem in the group A\ as the product g(t) = exp(—vi(t)ai)exp(—V2{t)a,2), then (4.3) reduces in this case to vi ai + i ) 2 e x p ( - w i ( i ) a d ( o i ) ) a 2 = viai + v2 e" l ( t ) a 2 = 6i(t)ai + b2(t)a2 , and then, i2=e-Vl(t)b2(t),
vi=bi(t),
jointly with «i(0) = v2(0) = 0. The integration is immediate, giving vi(t) = b J*bi(s)da and v2(t) = J* e~ K ^'^ds'b2(s)ds. Going back to the initial manifold, we know that X\, X2 are the generators of dilations in R n and translations in the j direction, respectively, so the action can be written as $ ( ( M I , U2), (XI, ...,
Xj, . . . , xn))
= ( e u i x\, . . . , e" 1 (XJ +
1
u2), . . . , e" xn), where the group has been parametrized now by the second class canonical coordinates, in the same factorization order as the one chosen for the solution curve in the group, i.e. ui = l o g n , u2 = r2, so that the group law reads as ( « i , u2)(u[, u2) = (u\ + u\, eui u2 + u2). Then, the general solution in R" is $((vi(t),v2(t)),(xio,...,xjo,...,xno)) = ( e B l ( t ) xio, . . . , e B l ( t ) (xj0 + Bl< 3) Bl(4) f*e~ - b2(s)ds), ..., e x „ 0 ) , where Bi(t) = f*bi(s)ds. The result is exactly the same as the one obtained by direct integration of the system (5.1). 5.2
Model of maneouvring
an automobile
or of a robot
unicycle
22
This example is inspired in the Example 2.35, p. 52 of , related with a simplified model of maneouvring an automobile. Essentially the same control system arises as well in the problem of path planning for a robot unicycle, see e.g. 18 and Eq. (4) of 2 0 . The configuration space is R 2 x S 1 , where we take coordinates (xi, x2, xz). The control system can be written as x\ =b2{t)smxi
,
x2 =b2(t) cos xz ,
giving the integral curves of bi(t) X\(x) + 62(f) X2{x), Xi = -— , oxz
xz = bi(t),
(5-2)
where X\ and X2 are now
X2 = sin x3 -5 h cos x3 -z— • ox 1 dx2
(5.3)
294 The Lie bracket of both vector fields, d . d Xz = [Xi, X2] = •5—, smx3-r 0x3 ox\
d I-cos £3 -r— 0x2
d d • cos x3 — sin x3 -— ax\ 0x2 is linearly independent from Xi, X2, and {Xi, X2, X3} satisfy [X\, X2] = X3, [X2, X3] = 0 and [Xi, X3] = —X2, therefore closing on a Lie algebra isomorphic to the Lie algebra se(2) of the Euclidean group in the plane SE(2). This Lie algebra has a basis {ai, 02, 03} for which [ai, 02] = 03, [02, a 3 ] = 0 and [ai, 03] = —02. The adjoint representation takes the form ad(oi) =
/00 0 \ / 0 00\ 00-1 , ad(a 2 ) = I 0 0 0 , \0 1 0 / \-l 00/
ad(a 3 )
and as a consequence /l 0 0 \ exp(—vi ad(oi)) = I 0 coswi sinvi I , \ 0 — sin t>i cos vi J exp(—V2 ad(o2)) = 1
/ I 00\ 0 10),
exp(—v3 ad(os)) =
If we write the solution of the associated problem in the group SE(2) as the product g(t) — exp(—v\{€)a{) exp(—^2(^)02) exp(—v 3 (t)a 3 ), then (4.3) reduces in this case to vi ai+t>2 exp(—vi ad(ai))a2+t)3exp(—vi ad(ai))exp(—V2 ad(a2))o3 = 61 01+6202 • Performing the operations results the system vi = 61 ,
V2 = 62 c o s « i ,
i)3 = 62 sini>i,
(5.4)
with initial conditions i>i(0) = «2(0) = 113(0) = 0. Denoting Bi(t) = / 0 61 (s) ds, the solution is vi(t) = Bi(t),
v2(t) = I b2(s) cosBi(s)ds, Jo
v3(t) = I 6 2 (s) sinBi(s) ds . Jo (5.5)
An action of SE(2) on the original manifold R2 x S 1 such that the associated fundamental vector fields be Xi, X2, X3 is obtained by simply composing their integral flows. We parameterize the group SE(2) by the second class canonical coordinates obtained with the same factorization order as the one of the solution curve in the group, i.e. by three real numbers (9, o, b) with the composition law (9, a, b)(9', a', 6') = (9 + 9', a' +acos9' - bsinfl', b' +bcos9'+ asm9'). Therefore, the above mentioned action will read as $((#, o, b), (xi, X2, x3)) = (xi + bcosx3 + a s i n x 3 , X2 — bsinx3 + a cos2:3,2:3 + 9).
295 As a consequence, the general solution of (5.2) is $((«1, V2, V3),(X10, X20, X30)) = (xio + v3 cos X30 + V2 sin X30, X20 — v3 sin X30 + i>2 cos X30, X30 + « i ) , where vi = vi(t), V2 = V2(t) and v3 = v3(t) are given by (5.5). The direct integration of (5.2) gives exactly the same result. On the other hand, the vector fields X2 and X3 commute, so there exist coordinates (2/1,2/2,3/3) such that X2 = d/dy2 and X3 = d/dy3. Then, 1/2 will satisfy -X22/2 = 1 and -X32/2 = 0, and similarly 2/3 is such that X22/3 = 0 and X3y3 = 1. Particular solutions are 2/2 = Xi sinx3 + X2 COSX3 ,
2/3 = xi COSX3 — X2 sinx3 ,
which can be completed with y\ = x3. In this new coordinates X i takes the form Xi = -r 1-2/35 02/1 oyi
2/2-5— • oy3
Needless to say, these vector fields have to satisfy [Xi, X2] = X3l [X2, X3] = 0 and [Xi, X3] = —X2, as can be checked immediately. The control system of interest will be now the one giving the integral curves of the time-dependent vector field h(t)X!(y) + b2(t) X2(y), that is, yi=bi{t),
y2 = bi(t)y3 + b2(t),
y3 = - 6 i ( t ) y 2 •
(5.6)
The associated system in SE(2) is again (5.4). Now, X i , X2 and X3 can be regarded as the fundamental vector fields of the action of SE(2) on R 2 x 5 l given by /1 0 0 \ /yi\ / e ${{9, a, 6), (2/1, 2/2, 2/3)) = I 0 cos9 sinO I I y2 I + J a cos 6 + b sin 9 \ 0 — sin 9 cos 9 J \ 2/3 / \b cos # — a sin 6 Therefore, the general solution of (5.6) is given by $((wi, t>2, ^3), (2/10, 2/20, 1/30)), i.e. 2/i = 2/io + «i ,
1/2 = 2/20 cos t»i + 2/30 sin i>i + V2 cos ui + U3 sin vi , 2/3 = 2/30 cos «i — 1/20 sin v\ + v3 cos v\ — V2 sin «i , where v\ = i>i((), V2 = «2(<) and W3 = v3(t) are given by (5.5). However, the direct integration of (5.6) is, from the computational viewpoint, slightly more involved than the integration of (5.4).
296 5.3
Lie systems Hamiltonian
of elastic problem of Euler and related systems
integrable
This example is inspired by the recent works of Prof. Jurdjevic on the study of integrable Hamiltonian systems on homogeneous spaces of constant curvature embedded in a three dimensional Euclidean space, and the so called elastic problem of Euler n>12>13>14. These problems have important generalizations in 15 . However, we will treat here only the aspects of the problem related with Lie systems. The configuration space will be now R 3 , where we take coordinates (xi, x 2 , X3). The control system of interest is ii =-bi(t)x2-b2(t)x3,
±2 = bi(t)x\ + b3(t)x3 ,
£3 = e(b2(t)xi -
b3(t)x2), (5.7)
where e = ± 1 , 0 . This control system gives the integral curves of the time-dependent vector field b^t) Xi{x) + b2(t) X2(x) + b3(t) X3(x), where v
Xi = xi
d 0x2
d x2 - — , ox\
X2 = exi-
d dx3
d x3 -—, dxi
X3=x3-
d dx2
d ex2-—• ox3 (5.8)
These vector fields satisfy the Lie brackets [Xi, X2] = X3, [X2, X3] = eX\ and [X3, Xi] = X2, hence isomorphic to the Lie algebra ge of the Lie group G e , given by Go = S £ ( 2 ) , Gi = 5 0 ( 3 ) and G_i = SO(2,1). Therefore, the case e = 0 essentially reduces to the previous example. We take a basis {01, 02, 03} of ge in which the Lie products read [01, 02] = 03, [02, 03] = eai and [03, ai] = 02. Then, the adjoint representation is /00 0 \ ad(ai) = 0 0 - 1 ,
ad(a 2 ) =
\oi 0 I
/ 0 0 e\ 0 0 0 , ad(o 3 )
\-100y
/0-e0\ = 1 0 0 .
\o 0 0/
We define the signature-dependent trigonometric functions (see, e.g., 2 ) Ce(x), andT e (a;) by ( cos x Cc(x)=\l coshx
e= 1 e= 0 e = —1
{
( six sin x
Sc(x)=\x
^ ssii inhx
which have the properties Ce(x + y) = Ce(x)Ce(y)
e= 1 e= 0 e = —1
Se(x)
_ , . e(X)
T £ (x) =
— tSc(x)Sc(y),
e
^ '
Se(x + y)
297 Ce(x)Se(y)
+ Se(x)Ce(y)
and Ce2(x) + t S ( 2 ( i ) = 1. Then, we have
/I 0 0 \ exp(—t^i ad(oi)) = I 0 cosvi sintn I , \ 0 — sin vi cos vi J exp(—v2 ad(a2)) = I
0
1
0
J ,
\Se(v2) 0 Ce(v2) J exp(-«3ad(o3)) =
( Ct(v3) -Sc(v3)
V
e5«(«3)0\ C,(v3) 0 .
0
0
1/
Writing the solution of the problem associated to (5.7) in the group Ge as the product g(t) = exp(—vi(t)ai) exp(—v2(t)a2) exp(—v3(t)a3), and using (4.3), we obtain the differential equation system for vi(t), v2(t) and v3(t): vi = bi + eTe(v2)(b3 cosui + 62 sini;i), v2 = b2 cos v\ — 63 sin vi , 63 cos v\ + b2 sin v\ V3 = Cc(v2) with ui(0) = v2(0) = 1)3(0) = 0. If we take instead the factorization g(t) = exp(—v2a2) exp(—1)303) exp(—tnai), we obtain Vl =
6 i a ( f 2 ) + e635e(w2)
c&T)
'
v2 = b2 + Te(v3){b! Cc(v2) + i>3 = b3Cc(v2) -
eb3Se(v2)),
biSe(v2);
and the factorization g(t) = exp(—11303) exp(—uiai)exp(—v 2 a 2 ) gives vi = biCe(v3) -eb2Sc(v3)
,
v2 = (b2 Ce(v3) + 6i5 £ (?;3))secwi , v3 =b3 + (b2Ce(v3)
+biSe{v3))ta,nvi,
both of them with the initial conditions ui(0) = v2(0) = v3(0) = 0 as well. The other three remaining possibilities give rise to similar equation systems. For e = ± 1 the group Ge is not solvable and none of the previous systems can be integrated by quadratures in a general case. For the particular case treated by Prof. Jurdjevic we must put (with our notation) 61 (t) = 1, b2(t) = 0 and b3(t) = k(t). In his work a comprehensive study of certain related Hamiltonian systems is carried out, but this is out of the scope of this article. We refer to n > 1 2 ' 1 3 ' 1 4 ' 1 5 for a detailed exposition.
298 5.4
Brockett's
systems
There are certain control systems in the Control Theory literature known as Brockett's systems, see e.g. 4 ' 1 8 , which are known to be related with the tridimensional Heisenberg group H(3). We will study this fact with detail. The first form of Brockett's system we will consider is the control system in R 3 with coordinates (x, y, z) x = bi(t),
y = b2(t),
z = xb2{t) - ybi{t),
(5.9)
see, e.g., . This system gives the integral curves of the time-dependent vector field b1(t)Xi(x) + b2(t)X2(x), with Xi = £ - y £ and X 2 = ^ + z £ . The Lie bracket X3 = [Xi, X2] = 2 ^ is linearly independent from X\, X2, and {X\, X2, X3} close on the Lie algebra defined by [Xi,Xt]
= Xa,
[Xi,X3]
= 0,
[X2,X3]
= 0,
(5.10)
isomorphic to the Lie algebra h(3) of the Heisenberg group H(3). The Lie algebra h(3) has a basis {ai, a2, 03} for which the Lie products are [01, a2] = 03, [ai, 03] = 0 and [a2, 03] = 0. The adjoint representation of f)(3) is then /0 0 0 \ ad(ai) = 0 0 0 ) ,
ad(o 2 ) = |
0 0 0 ) , ad(a 3 ) =
and therefore exp(—vi ad(ai)) =
/ I 0 0\ 0 1 0 , \0-wi 1/
exp(—v2 ad(o 2 ))
exp(—V3 ad(a3)) = Id . Writing the solution of the associated problem in H(3) as the product of exponentials g(t) = exp(—ui(£)ai)exp(—v2(t)a2)exp(—1)3(^)03) and applying (4.3), we find the differential equation system vi = bi ,
v2=b2,
i>3 = b2vi,
(5-11)
with initial conditions «i(0) = ^2(0) = ^3(0) = 0. The solution can be immediately found: vi(t)=
[ bi(s)ds, Jo
v2(t)=
/ b2(s)ds, Jo
v3(t) = / b2{s) / Jo Jo
bi(r)drds. (5.12)
The elements of the Heisenberg group H(3) can be represented by three real numbers (a, b, c) with the composition law (a, b, c)(a ,b,c) = (a + a,b + b,c +
299 c' + 6a'). In a similar way as in previous examples, the vector fields Xi, X2 and X3 can be shown to be the fundamental vector fields of the action of H(3) on R 3 given by $((a, b, c), (x, y, z))
Then, the general solution of (5.9) is * ( ( « i , v2, V3), (xo, yo, zo)), i.e. x = xo + «i, y = yo + V2 and z = zo + xo«2 — J/o«i + 2t>3 — «i«2, where «i = v\(t), V2 = V2{t), and v3 = v3(t) are given by (5.12). The direct integration of (5.9) gives the same result. Other form of Brockett's system 18 is the control system in R 3 with coordinates (*, V, z) i = tn(t),
y = b2(t),
z = -bi(t)y.
(5.13)
This system gives the integral curves of the time-dependent vector field 61 (t)X\ + b2{t)X2, where X x = £ - y§-z and X2 = ±. If we take X3 = [Xly X2] = £, it is easy to see that {Xi, X2, X$} satisfy the Lie brackets (5.10). Moreover, they can be regarded as the fundamental vector fields of the action of H(3) on R 3 given by
*((a, b, c),(x, y, z)) =
The general solution of (5.13) is now $ ( ( « i , ^2, U3), (%o, yo, zo)), i.e. x — xo + vi, y = yo+V2 and z = zo — yovi+V3 — viV2, where vi = vi(t), V2 = V2(t), and V3 = V3(t) are given by (5.12). Direct integration of (5.13) gives the same result. Finally, another realization of the same system can be obtained from the natural action on R 3 of the representation of the group H(3) as upper triangular 3 x 3 real matrices. Taking coordinates (x, y, z) on R 3 , this action reads as
H(a, b, c), (x, y, z)) =
The associated fundamental vector fields are now X\ = z-g-, X2 = y£ and X3 = z£, which satisfy (5.10) as well. Then, the general solution of the control system giving the integral curves of 6i(t)Xi + &2(£)X2, that is, x = b2(t)y,
jr = 6 1 (t)z,
i = 0,
(5.14)
is $ ( ( m , V2, V3), (xo, yo, z0)), i.e. x = xo + yovi + z0V3, y = yo + z0vi and z = z0, where v\ = vi(t), V2 = V2{t) and 113 = V3(t) are given by (5.12). Once again, the direct integration of (5.14) gives exactly the same result.
300 5.5
Front-wheel
driven kinematic
car
This example is based on a model for an automobile proposed by Murray and Sastry . The configuration space is R2 xS1 xS1, where we take coordinates (x, y,
y = a(t)t&n0
,
<$> — 02(f),
8 = ci(t) tan cf> sec 9,
(5.15)
which gives the integral curves of the time-dependent vector field c\(t)X\ + c2(i)X2, where X\ = - 5 - + t a n 0 — +tan> s e c 0 — , ox oy 09
X2 = — . dtp
Taking Lie brackets we can try to find the minimal Lie algebra containing these two vector fields. It is not difficult to see that such an algebra is infinite-dimensional so the system is not a Lie system. However, it can be approximated by a Lie system as follows. Making the local feedback transformation proposed in 21 (although it seems that there are some misprints there) xi=x,
X2 = sec 3 9 tan <j>, 2:3 = tan 0,
c\=b\
C2 = — 3sin <j>sec 9sin9bi+
x\ = y, cos 8 cos <j>b2,
we obtain the control system on R 4 , with coordinates (xi, X2, X3, Xi) i\ = 61 ,
£2 = 62,
xs = X2b\,
±4 = 2:361,
(5.16)
which is a system in chained form, according to the terminology of 2 1 . It is interesting to note that in this case, the system (5.16) can be essentially obtained approximating all of the trigonometric functions appearing in (5.15) by their zero order Taylor series, see also 18 . Then, the control system (5.16) gives the integral curves of the vector field bi(t)Xi + b2(t)X2, where now Xi = ^ + x 2 ^ + x 3 g f j and X2 = g f j . Taking the Lie brackets d X3 = [Xi, X2] = - -r— , 0x3
d Xi = [Xi, X 3 ] = - — , 0x4
we obtain two other linearly independent vector fields such that {Xi, X2, X3, X4} close on the Lie algebra defined by [Xu X2\ = X3 ,
[Xi, X3] = X4 ,
(5.17)
being all other Lie brackets zero. The corresponding abstract Lie algebra has a basis {ai, a2, 03, 04} for which the only non-vanishing Lie products read as [01, 02] = 03,
301 [oi, 03] = 04. The adjoint representation takes the form
0000\ 0000 0100 00 1 0 /
ad(ai)
0 000\ I 0 000
AI \ ad fl2)=
(
I -1000 0 0 00/
/ 0 000\ 0 000 ad(a3) = 0 000
/oooo 0000
ad(a4) =
V-10 1 0/
0000
\0000,
and therefore
vl
exp(—v\ ad(ai)) = Id —vi ad(ai) + — ad(ai) o a d ( a i ) , exp(—w 2 ad(a2)) = Id— V2 ad(o 2 ), exp(—V3 ad(o3)) = Id— 1)3 ad(a3),
exp(—V4 ad(a4)) = Id .
Writing the solution of the problem associated to (5.16) in the Lie group as g(t) = exp(—viOi)exp(—i;2ffl2)exp(—^303) exp(—1)404), and applying (4.3), we obtain the system
Vl = 6 l ,
i)2 = 62 ,
V3 = b2V\ ,
1)4 = 62
vl
(5.18)
with initial conditions tn(0) = 1)2(0) = 1)3(0) = 114(0) = 0, which is easily integrable by quadratures. Denoting B\{t) = fQbi(s)ds, £?2(0 = fQb2(s)ds, the solution reads vi(t) = B1(t),
V2{t) = B2(t),
v3(t)=
f
b2(s)B1(s)ds,
Jo
Vi{t)
-\f.«
s)BJ{s)ds.
(5.19)
The Lie group whose Lie algebra is the one described above is four dimensional, and its elements can be represented by four real numbers (a, b, c, d) with the composition law (a, 6, c, d)(a, b', c , d!) = (a + a , b + b', c + c + ba, d + d' +ca' + b(a'f / 2 ) . Now, the vector fields Xi, X2, X3 and X4 can be shown to be the fundamental
302 vector fields of t h e action t h e previous Lie g r o u p on R 4 given by
$ ( ( a , b, c, d ) , ( x i , x 2 , x 3 , x 4 ) ) =
n0L 01 00 00 \
fxA X2
0 a 10
X3
\0^-al) /
\x4J
a b ab —c
-
\
: + dJ
V
T h e n , t h e general solution of (5.16) is $ ( ( w i , v2, v3, i>4), (xio, X20, £30, X40)), i.e. Xl = Xio + Ul ,
X2 = X20 + «2 ,
^3 = X30 + X20«l + «ll>2 — V3 , V? v\v2 X4 = X40 + :E30«1 + £20"^" H ^~
' V\V3 + Vi ,
where vi = vi(t), V2 = V2(t), V3 = V3(t) a n d V4 = V4(t) are given by (5.19). Direct integration of (5.16) gives t h e s a m e result. O t h e r realization of t h e control s y s t e m (5.16) can b e o b t a i n e d w h e n one considers t h e n a t u r a l action o n R 4 of a r e p r e s e n t a t i o n of t h e previous Lie g r o u p as u p p e r t r i a n g u l a r 4 x 4 real m a t r i c e s . Taking again c o o r d i n a t e s ( x i , X2, X3, X4) on R 4 , t h i s action r e a d s
$ ( ( a , b, c, d), ( x i , x 2 , x 3 , x 4 ) )
/16 c Ola 00 1 \000
d \ Xl ( \ ^ X2 0 X3 1 ) \X4j
T h e associated f u n d a m e n t a l vector fields are now Xi = X3Tp— + X4-J*—, X2 = X2 •dxi
>
X3 = x3^r a n d X4 = X4-^-, whose non-vanishing Lie b r a c k e t s are given by (5.17). T h e control s y s t e m giving t h e integral curves of t h e vector field b\(t)X\ + 62(4)^2 is X1=62(*)X2,
X2=bl(£)x3,
X3 = 6 l ( * ) x 4 ,
X4 = 0 ,
(5.20)
whose general solution is j u s t 3>((wi, V2, V3, V4), (xio, X20, X30, X40)), i.e. Xl = Xio + l20«2 + X30H3 + £40^4 1
X2 = X20 + X30UI + £40 —
X3 = 2:30 + X40W1 ,
X4 = X40 ,
where vi = vi(t), V2 = V2(t), V3 = V3(t) a n d V4 = V4(t) a r e given by (5.19). s a m e result c a n b e o b t a i n e d i n t e g r a t i n g directly (5.20).
The
303
Acknowledgments A.R. is supported by FPI Grant PB96-0717 of the Spanish Ministerio de Educacion y Cultura. A.R. also thanks the organizers of the Velfest, GCTAA-2000 for their kind invitation and support. Support of the Spanish DGES (PB96-0717) is also acknowledged.
References 1. J. Baillieul and J.C. Willems Eds., Mathematical Control Theory, (Springer Verlag, New York, 1999). 2. A. Ballesteros, F.J. Herranz, M.A. del Olmo and M. Santander, Quantum structure of the motion groups of the two-dimensional Cayley-Klein geometries, J. Phys. A: Math. Gen., 26, 5801-5823 (1993). 3. R.W. Brockett, System theory on group manifolds and coset spaces, SIAM J. Contr., 10, 265-284 (1972). 4. R.W. Brockett and L. Dai, Nonholonomic kinematics and the role of elliptic functions in constructive controllability, in Nonholonomic Motion Planning, Li Z.X. and Canny J.F. Eds., (Kluwer, Norwell, MA, 1993). 5. J.F. Carinena, Sections along maps in Geometry and Physics, Rend. Sem. Mat. Univ. Pol. Torino, 54, 245-256 (1996). 6. J.F. Carinena, J. Grabowski and G. Marmo, Lie-Scheffers systems: a geometric approach, (Bibliopolis, Napoli, 2000). 7. J.F. Carinena, J. Grabowski and A. Ramos, Reduction of time-dependent systems admitting a superposition principle, Acta App. Math., in press. 8. J.F. Carinena and A. Ramos, Integrability of the Riccati equation from a group theoretical viewpoint, Int. J. Mod. Phys., A 14, 1935-1951 (1999). 9. S. Helgason, Differential Geometry, Lie Groups and Symmetric Spaces, (Academic Press, New York, 1978). 10. V. Jurdjevic, The geometry of the plate-ball problem, Arch. Rat. Mech. Anal, 124, 305-328 (1993). 11. V. Jurdjevic, Optimal control problems on Lie groups: crossroads between geometry and mechanics, in Geometry of Feedback and Optimal Control, Jakubczyk B. and Respondek W. Eds., (Marcel Dekker, New York, 1993). 12. V. Jurdjevic, Non-euclidean elastica, Amer. J. Math., 117, 93-124 (1995). 13. V. Jurdjevic, Geometric Control Theory, (Cambridge University Press, New York, 1997). 14. V. Jurdjevic, Optimal control, geometry, and mechanics, reference 1 , 227-267. 15. V. Jurdjevic, Integrable Hamiltonian systems on Lie groups: Kowalewski type, Ann. Math., 150, 605-644 (1999). 16. V. Jurdjevic and H.S. Sussmann, Control systems on Lie groups, J. Diff. Eq., 12, 313-329 (1972). 17. S. Kobayashi and K. Nomizu, Foundations of Differential Geometry, Vol. I,
304 II. (Wiley, New York, 1963). 18. N.E. Leonard, Averaging and Motion Control of systems on Lie Groups, Ph. D. Thesis, (University of Maryland, 1994). 19. S. Lie, Vorlesungen uber continuierliche Gruppen mit geometrischen und anderen Anwendungen, revised and edited by Dr. G. Scheffers (B.G. Teubner, Leipzig, 1893). 20. V. Manikonda, P.S. Krishnaprasad and J. Hendler, Languages, behaviors, hybrid architectures, and motion control, reference 1, 199-226. 21. R.M. Murray and S.S. Sastry, Nonholonomic motion planning: steering using sinusoids, IEEE Trans. Auto. Control, 38, 700-716 (1993). 22. H. Nijmeijer and A.J. van der Schaft, Nonlinear dynamical control systems, (Springer Verlag, New York, 1990). 23. J. Wei and E. Norman, Lie algebraic solution of Linear Differential Equations, J. Math. Phys., 4, 575-581 (1963). 24. J. Wei and E. Norman, On global representations of the solutions of linear differential equations as a product of exponentials, Proc. Amer. Math. Soc, 15, 327-334 (1964). 25. P. Winternitz, Lie groups and solutions of nonlinear differential equations, in Nonlinear Phenomena, K.B. Wolf Ed., Lecture Notes in Physics, Vol. 189, (Springer Verlag, New York, 1983).
305
FROM THE GEOMETRY TO T H E ALGEBRA OF NONLINEAR OBSERVABILITY
SETTE DIOP Laboratoire des Signaux & Systemes CNRS-Sup elec-Univ. Paris Sud Plateau de Moulon 91192 gif sur Yvette cedex France E-mail: [email protected] This contribution goes along the lines of the belief that nonlinear observability has a deep geometric content. We try to set up that geometry, its algebraic translation all the way down to the algorithms needed to do some of the computations. The basic geometric picture of observability is that of a projection map, that of the system trajectories onto the data trajectories. Observability is then seen as the condition for this map to be finite in the sense that its fibres are generically finite. It is this definition which easily passes to differential algebraic geometric characterizations. Next, most of the picture is kept by defining the notion of singular observation data, that is, the special data for which the generic observability property is lost. The work is incomplete at least in that we hardly touch the so-called real case where we have to leave the abstract differential algebraic geometry to enter the differential real algebraic geometry. This is under investigation.
Dedicated to Velimir Jurdjevic 1
Introduction
A system x = f(t,u,x), y = h(t,u,x), is a Um the Um
{
>
set X of trajectories (u,x,y) satisfying equations (1) in some affine space x Un x Up, where U is the function space where the control, the state, and output take their values. An external trajectory 1 6 (u,y) is an element of x Uv for which their is at least an x such that (u, x, y) £ X. The set of external trajectories is called the external behavior 1 6 of the system and denoted by Xu
306 there is at least one such a state x. The word determine rather refers to the number of possible values of x. Formally, we have a projection map IT of the set X onto the set Xu,y which sends a trajectory (u, x,y) into its corresponding external trajectory (u, y) according to the following picture.
Figure 1. Variety projection picturing observability notion.
Given an external trajectory (u,y) in some time interval we shall be in one of the following three situations. The first situation is when -K~X(U, y) contains infinitely many elements. The second one is when 7r _1 (S,y) is finite with cardinality greater than 1. The last one is when 7r - 1 (u,|/) is a singleton. The word determine in the above setting of the observation problem also has some practical content which refers to the ability to explicitly compute or approximate all the possible values of x corresponding to (u,y). This want leads to the statement that the observation problem is solvable, we say that the system is observable, if TT"1 (u,y) is finite. Finiteness, or even cardinality 1, of 7r _1 (u, 1/) for every external trajectory (u,y) is actually the ultimate need from a practical standpoint. But simple characterizations of observable systems requires a weaker condition. We shall thus rather qualify a system as observable if the projection map IT is generically finite. Denoting by (u,y) a generic external trajectory this means that we actually require 7r -1 (u,2/) to be finite for the system to be observable. The eventual external trajectories (u, y) for which ir~l{u, y) is infinite are then considered as singular ones for the observability of the system.
307
To have a theory of observability we need to specify the class of function spaces U over which the trajectories take place. The differential algebraic geometric approach of observability that is to be presented here aims at providing a mathematical sound language to the previous intuitive interpretation of observability. Observability theory for nonlinear systems has been a long term research interest for many authors. Its pertinence being sufficiently explained by the germaneness of estimation problems in control system designs. Following the work of R. E. Kalman for linear systems, the notion of distinguishability of state space points has been used to define observability. Referring to our previous abstract presentation of the observation problem we may say that, the classical theory would rather state observability as the condition where 7r _1 (u, y) contains only 1 element for every external trajectory (u,y). The reader may refer to 7 for more details on this approach. The book 6 by J. P. Gauthier and I. A. K. Kupka provides a thorough study of nonlinear observability and observer design. The differential algebraic approach of nonlinear observability dates back to 1986 with a paper by J. F. Pommaret n . But the approach was first explicitly developed in 5 ' 4 . The present work is an extension of the previous two papers. More specifically, we consider general systems described by Pi(w,z,0=0, Q(w,z,O^0,
* = 1,2,...,
,s {Z)
where the Pi's and Q are finitely many polynomials in w, z, £ and their derivatives. The variable w stand for the data (in the classical observation problem, w is (u, y), the input and the output), z is the variable to be observed (or estimated), and £ is an extra variable which may eventually be present in the system's description and which is neither part of data nor being observed. The coefficients of the system's equations are supposed to be in a differential, field, k, of functions of one variable, the time, whose derivation is the standard derivation with respect to the time. The above mentioned function space U over which the system variables take their values is here a differential algebraic closure, k, of k. The set, X, of trajectories of system (2) is thus a subset of an affine space over k. A basic assumption we make throughout this approach is that X is irreducible. Its projection XWjZ along the variables w and z (i.e. the set of (w, 1) in the ambient affine space for which there is £ such that (w, ~z, Q £ X) is thus irreducible, too. Similarly for the projection, Xw, of Xw z along the variable z. Let •n :
308
%w,z -> Xw the latter projection map. Let XWjZ and Xw be the respective closures of XWiZ and ^ for the differential Zariski topology on the ambient affine space. These sets have generic points, that is, points which are dense in the respective set. Let (w, z) be a generic point of XWtZ. Our formal definition of observability reads as: z is observable with respect to w if n~1(w) is finite. The previous geometric definition of observability now has a purely algebraic translation: z is observable with respect to w iff k(A'tU]2} is algebraic over k(Xw) where the latter are differential field extensions of the field of coefficients. Computationally, z is observable with respect to w iff for each component, Zi of z there is a polynomial equation Hi(Zi,w,w,...)=0
(3)
in Zi, and finitely many time derivatives of the data w, with coefficients in k. It turns out that the situation where the Pi's and Q are not necessarily polynomial may be coped with by merely relaxing the polynomial requirement on Hi (this function still has to depend on only finitely many data time derivatives). But we do not enter this generalization. This definition raises at least three basic questions. First, what if the data assume some value w such that all existing relations (3) for Z{ degenerate into the trivial equation 0 = 0? Second, what if all such relations as (3) are known to have more than one solution in zp. Third, does the definition require more regularity of the data than the latter are in effect? The first question is related to the singularity of the observability notion with respect to the data. We shall address it precisely in 2.3. The second question is about a so-called local character of the observability of the system we have at hand. A partial answer is provided through the notion of rational observability addressed in 2.2. And the third question asks for some extended version of the present differential algebraic theory. We discuss it in 2.4. We have tried to make this contribution as self contained as possible. We thus have provided appendix sections giving key details on the differential algebraic tools necessary to read the paper for someone not familiar with them. Most of the technical differential geometric notions appearing in the next two sections are explained in the appendices. 2
The differential algebraic geometric approach
In this approach we call (differential) (algebraic) system with s variables, and with coefficients in k a proper differential quasi-affine variety X C k defined
309
over k where k is a differential closure of k. The differential dimension of X over k is the (minimum) number of input components. An input is a differential transcendence basis of k(X) over k. Once an input is chosen the other components of the system variable are differentially algebraic over k{u). In observation problems, the system variable is partioned into the data, or observations, w = wi,... ,w^, the variable being observed (or estimated) z — z\,... ,zn and the remaining variables, £. In what follows we assume n > 1. In the classical observation problem, the data consist of the control u and the measurements y. The present approach allows to consider more general observation problems. When the variable £ is present, we consider the projection Xw
Irreducibility of rational state systems
It is important to be able to deal with systems (1) where / and h are rational functions of u and x with coefficients in some differential field k. Such systems read as
Xi
Pi{u,x) qi(u,x) fj(u,x) 9j(u,x)
(l
where u stands for ux, u 2 , . . . ,um, and Pi, qt, fi and p; are differential polynomials of order zero in x with coefficients in k. Written by means of equations
310
and inequations the previous system is equivalently described by the following ' qi(u,x)xi=pi(u,x) 9j («, x) Vj = fj (u, x) <
n
(l
p
Y[qi(u,x)Y[gj(u,x) , i
^0.
i
The set X of trajectories of this system is defined to be the elements (u, x, y) £ k x k x k which satisfy Pi{u,x,y) IA(u,x,y)
= 0 ^ 0,
{l
where J^ is the multiplicative set generated by q\ (1 < i < n) and gj (1 < j < p), and where k is a differential closure of the differential field of coefficients k, and where Pi(U,X,Y)
= qi(U,X)X{i1)
Pn+j (U, X, Y) = fj(U,X)-
-Pi(U,X) 9j(U,
(1 < i < n),
X)Yj
(l<j
We need to establish the irreducibility of X so that (1) defines a system in the sense of the present approach. Having done this we shall call such systems rational state systems. Let k {[/, X, Y} be equipped with the ranking {{UuU2,...,Um},{X1,X2,...
,X„,y1,Y2,...
,YP}},
see Notation 1. Then the set A consisting of Pi,P2, • • • , Pn+P is readily an orthonomic autoreduced set. By Lemma 2.10, p = [A]: I™ is a prime differential ideal with characteristic set A. Let p define the affine variety X', (u,x,y) € X, and P € p . There is v e !N n + p such that I\P € [A] so that I^(u,x,y)P(u,x,y) = 0. Since (u,x,y) £ X implies that I^(u,x,y) ^ 0, we have P(u,x,y) = 0. Therefore X C X'. Next, let X" consist of the elements of X' which do not annihilate IVA. It is clear that X C X" because, as we just have proved, X is contained in X'. Conversely, let (u, x,y) £ X". Since Pj 6 p we have Pi(u,x,y) = 0. Therefore, X" C X. It results that X = X", that is, X is an open subset of X'. The fact that A is a characteristic set of p also implies that
31 1
lAf][A] =
Definitions and properties
Definition 2.3. Let X be a system with variables w, z, and (, and with coefficients in k. The variable z is said to be (algebraically) observable with respect to w if the projection map TT : Xw>z —> Xw (sending every trajectory (w,~z) of XWyZ into the corresponding observation w) is generically finite. If z is observable with respect to w then the degree of 7r is called the observability degree of z with respect to w, and is denoted by d^z. The variable z is said to be rationally observable with respect to w if it is observable with respect to w with observability degree one. For state systems of the form (1) we say that the system is observable if its state variable is observable with respect to u and y. Theorem 2.4. Let X be a system with variables w, z, and C,, and with coefficients in k. The variable z is observable with respect to w if, and only if, k(A'u,]Z) is algebraic over\s.{w). This follows directly from Lemma 1.4. Theorem 2.5. Let X be a system with variables w, z, £, and £, and with coefficients in k. If Q is observable with respect to (w,z), and z is observable with respect to w then £ is observable with respect to w. This transitivity of the observability property follows immediately from that of the algebraicity property. Theorem 2.6. Let X be a system with variables w, z, and £, and with coefficients in k. If z is observable with respect to w we have d°wz = A°wzl • d°WZlz2
•w,z\,Z2,... ,zn-izn J
312
where d^, Zl
where hi and qi are differential polynomials with coefficients in k. The proposition is thus immediately proved. For example, the following system
x2=u, y = xi,
(4)
is rationally observable since xi=y
and
x2 = - — 2u
But the system Xi
— —X2 ,
x2 = ux2 , y =
(5)
xi,
is not rationally observable since xi = y
and
x\ =
-y,
313
and there is no means to reduce the degree of the observability condition of x2. For rational state systems observability is equivalent to the fact that the number of state components is equal to the order of the system. To prove this we need the following result. Lemma 2.8. For the system (1) the (nondifferential) field extension k(u, x, y) of k(u) is purely transcendental. The (nondifferential) field k(X) =
k(u,x,y)
= k(u)(a; 1 ,a;2,... ,xn,ii,...
,xn,...
, j / i , . . . ,yP,yi,---
,yP,---)
is equal to k(u)(xi,X2,• • • ,xn). The result then follows from Lemma 2.10. Proposition 2.9. For a rational state system X the following two conditions (i) X is observable, (ii) the number of components of the state variable x in (1) is equal to the order, oX, of X, are equivalent. The order of X is defined to be the transcendence degree of k(^f„i2/) = k(u,y) over k(u). We have dk(«>k
+
d^{u)k(u,y).
By Lemma 2.8, d k , ,k(u)(x) is equal to the number of state components, while by the observability assumption, d k , >k(u,x,y) = 0. This proves the proposition. 2.3
Regular observability
Let X be a system with variables w, z, and £, and with coefficients in k. Recall the observation projection map ir : XWtZ —> Xw sending any trajectory (uJ,z) onto W. We assume z to be observable with respect to w, that is, the inverse image of the generic point w of Xw is with finite cardinality. It is a fact that, for special observations w, 7r~1(w) may contain infinitely many elements, leading to a singularity of our generic notion of observability. An example of such situations is the following
314
±i = xxx2 , x2 = u + x2 , y = xi.
(6)
x is observable with respect to u, y since x\—y
and x2 = — • V But in practice, in any time interval where y is identically zero (or, merely, small), the observability of x2 is singular in the sense that it is lost. Definition 2.10. Let X be a system with variables w, z, and £, and with coefficients in k, and such that z is observable with respect to w. An observation w £ Xw is said to be singular for the observation of z with respect to w ifTr~1(w) is infinite. Observations w € Xw which are not singular are called regular. The variable z is said to be regularly observable with respect to w if there is no singular observation for its observability with respect to w. For instance, system (4) is regularly observable. R e m a r k 2.11. Our definition of singular observations certainly copes with basic needs in practice. However, since the finiteness requirement on the fibres of the projection map n is no more generic we have to care about the conditions on passing from the geometric objects to corresponding algebraic ones. Recall that the differential algebra k {w} refers to the closure of Xw which may strictly contain the latter set. Likewise, the differential algebra k {w, z} refers to the closure of XWiZ which, also, may strictly contain the latter set. Given w € Xw it may be that 7r_1 (w) is finite when it is considered as a map XWtZ —> Xw while n~1(w) is infinite for ir : XWjZ —»• Xw. But, of course, for W £ Xw, if n~1(w) is finite for -K : XWjZ —> Xw then 7T-1(wJ) is finite, too, for IT : Xw
315
±i —XiX^ +X2 , X2 = U + X2 , y = zi.
(7)
This system is observable given the equations for y and xi. We may see that its observability degree is actually 1. The variable x2 is not integral over \i{u,y}, but the system may be seen as regularly observable since when y is not identically zero we have y x\ + x2 - y = 0 while, when y is identically zero, we have x2=y
= 0.
The situation may be more complex as shown by the following simple example Xi = UX2 , (8)
x2 = xi,
y =differential xi. The system is not closed for the Zariski topology, its closure is described by
The system is observable since
x\—y
and
y
x2 = —, u x is not integral over k {u, y}, but the system is regularly observable since its external behavior is described by uy — iiy — u2y = 0 ,
316
so that there is no singular external trajectory for the observation of x. We proceed to extend the result in Theorem 2.12. Let k{w,z} be a differential algebra generated by the elements in w and z and let k {w, z} be an integral domain. An element £ of k {w, z} is said to be primitive over k {w} if it is a zero of a polynomial ad id + a d _i £ d _ 1 + • • • + a 0 = 0 such that (i) the aj's are in k {w}, (ii) the perfect differential ideal {ad(w),ad-i(w),... unit ideal.
,a0(w)} of k.{w} is the
We have the following result whose proof is also immediate but which seems to be the best one can get for the characterization of regular observability from the algebraic properties of the differential algebras associated with the system. Theorem 2.13. Let X be a system with variables w, z, and C, and with coefficients in k. The variable z is regularly observable with respect to w if z is primitive over\a{w}. This theorem allows to see that system (7) is regularly observable since X2 is primitive over k{u,y}. But the theorem also fails to see that system (8) is regularly observable. 2.4
Observability and data differentiability
The discussion we enter here is about the extension of this differential algebraic geometric theory of observability to systems whose variables are allowed to live in spaces of functions which, say, are not smooth as required by differential algebraic affine spaces. The discussion is mostly motivated by systems whose input u may be not smooth. In such cases we cannot blindly apply the usual rules on operations of differential algebras. To be more specific, let us consider the following simple linear example ±1 = X2 + u , ±2 = -xi - x2 ,
y = x\ + u. This system is observable since
(9)
317
x\ = y — u
and
x2 = y — ii — u.
If the control, u, is Lebesgue-measurable, but not continuous, say, u is piecewise continuous with finite jumps then, in the usual sense of derivatives, u is not differentiable, and the expression x2 = y - u — u is not valid. If u is not differentiable then we note that the observability of x 2 reads as X2 = Xi — U ,
given the first equation in (9). Since xi = y - u is differentiable, we rather write x2 in the following form x2 = (y-u)'
-u,
which evacuates the need to differentiate u, and, at the same time, sheds light on how the data differentiability question did enter the scene. It basically results from the application of the following rule (a + b)' =a + b. This rule is of course not applicable on function spaces which are not closed under derivation. In nonlinear examples we may encounter the other usual "distributivity" rule which may not apply in the derivation of the observability conditions: (a • b)' = ba + ab. For constant-coefficients linear system we may write down the observability condition without violating the differentiability of the observations. Theorem 2.14. For constant-coefficients linear systems (± = Fx + Gu, \y = Hx + Eu,
K
'
where F, G, H, and E are real matrices of appropriate sizes the observability of x with respect to u, y when u is taken over a differential closure o/k = H is equivalent to the observability of x with respect to u, y when u is taken over the
318
space of piecewise continuous real-valued functions. Moreover, the differential algebraic observability condition may be rewritten so as it does not explicitly invoke the proper derivative of u. The equations of y in system (10) define a finite type algebra generated by u, y and x that we denote by k[u,x,y]Q. We abuse notation by using the same symbols u, y and x since, here, the latter are rather the residues of corresponding indeterminates modulo the equations of the output (and not the equations of the whole system), but there will be no fear of confusion. Let X 0 be a maximal subset (in the sense of inclusion) of xi, X2, • •., xn which is algebraic over k[u,x,y]0. Let no be the cardinality of Xo- no > 1 since otherwise the system is not observable. If no = n then the theorem is proved. We next assume that 1 < no < n. To simplify the notations we assume that the members of Xo are x\, X2, •. •, xno. For each i < no we have zo,i — Xi = aiy + biU where ctj and fcj are real constant row vectors. By definition of the system the XiS are differentiable. We differentiate the Xj's in X 0 to obtain relations z\ = i 0 = Hix + E\u where ZQ is the vector with components .zo,i, 1 < i < no, Hi is a submatrix of F and E\ is a submatrix of B. The equations ( y = H x + Eu, \ z\ — H\x + Eiu, again define a finite type algebra generated by u, y, x and z\ that we denote by k[u,x,y,zi]0. Let Xi be the maximal subset ofxi, X2,..., xn which contains X 0 and which is algebraic over \a[u,x,y,zi]0. Let n\ be the cardinality of X i . We have n 0 < n\. If n\ = no then the system is not observable. If m = n then the theorem is proved. Next we assume 1 < n 0 < n\ < n. We repeat the previous construction to obtain an increasing sequence of integers 1 < no < ni < • • • < n which has to stop. This proves the theorem. This result clearly generalizes to nonconstant-coefficients linear systems as long as the coefficients are in a ring of functions where they have inverses. For nonlinear systems one may see that the same procedure outlined in the previous proof applies as long as, at each step, the state components which are algebraic over the algebra generated by u, y, x, z\, etc. are actually rational over the same algebra.
319 3
Computing
We have collected here some algorithms which may be used as computing devices in nonlinear observability. The first one is an observability test valid for polynomial systems, i.e., those defined by polynomial differential equations. fi{w,w,...
,z,z,...)
= 0,
i = 1,2,... .
This test reduces to a general rank condition which is a necessary and sufficient condition for the observability of z with respect to w. When specialized to the observability (of x with respect to (u,y)) of rational state systems we obtain a rank condition which is similar to (but, a priori, not the same as) the well-known observability rank condition in 7 . 3.1
A rank condition for polynomial systems
For polynomial systems, a quite general observability test may be derived in terms of the rank of a Jacobian matrix. The basic tools are (Kdhler) differentials for which we include the following brief account. Let R be a commutative ring with unit element. Let A be an R-algebra with structural morphism p : R —> A. Let M be an A-module. An Rderivation of A into M is a map D : A —> M which satisfies the following three axioms: (i) D(x + y) = Dx + Dy (x, y € A), (ii) D(xy) = yDx + xDy (x, y £ A), and (iii) D(p(a)) =
0(aeH).
The basic property of differentials of an R-algebra A is the following. There exist an A-module, usually denoted by O R (A) (or by Q A / R ) > and an R-derivation, C?A/R •' A —• J)R (A) (called the canonical R-derivation of A), such that, for any A-module M , any R-derivation D of A into M , there is one, and only one, A-module morphism D of H R (A) into M such that D = DdA/R (when no confusion may occur, the sub-index with d will be dropped). For example, let (T*) i6l be a family of indeterminates over the ring R. Let o be an ideal of R {(Ti)i€i\ and A = R [(Ti)iei] / o = R [(*i) ie i]. Then fiR(A) is the quotient of the free A-module with basis ((dtj) i e I ) by the submodule generated by the A-linear forms
320
Y,gjr(t)dti
(Pea).
The main fact we shall make use of is that, if R = k is a field (of characteristic zero), and if A = K is a field extension of k, then the transcendence degree d k K of K over k is equal to the dimension [ftk(K): K] of the K-vector space fik(K). More precisely, a family (ti)i€l of elements of K is algebraically independent over k if, and only if, the family (dti)iel of elements of fik(K) is K-linearly independent; and, for K to be algebraic over k ((ii)iei), it is necessary and sufficient that the K-vector space fik(K) is generated by (dti)i€l. In particular, let the field extension K of k be of finite type, generated by n,... ,r„ (that is, K = k ( r i , . . . ,rli)). Let Pi, P 2 , . . . , Pa be a set of generators of the ideal of definition of K over k, i.e., k
[TI.TS,
...
,T„]
= k [TltT2,...
,r„]/(Pi,P2,
...,P„)
(where the T"s stand for indeterminates). Then d k K = [fi k (K) :K] = v - r k K J ( P i , P 2 , . • • ,P„) where J ( P i , P 2 , . . . ,Pa) denotes the Jacobian matrix of P i , P 2 , . . . ,P f f , i.e., the matrix
\dPi (
0
where 1 < i < a, and 1 < j < v, and where rkK (Pi, P2, • • • , P
+ (at - l)ai, Q i _izf i _ 2 + • • • + a u ) ii + a i ; Q i _ i 2 t a i _ 1 + • • • + a i , 0 = 0,
321
that is, ii is in k(u/)(zj) since, in characteristic zero, by the minimality of at , we have ctiz"'-1 + (a, - l ) a i i a j _ i z ? i - 2 + • • • + Oi,i ^ 0. Since k(w, z) = k(w) I \z\3') \\
J, our claim is proved by an immedi/ l<»
ate induction on j . Actually, we obtained a more general fact: for any given rj € 1N(1 < j < n), the algebraicity of z over k(w) is equivalent to that of
k(w)((z[h\4h),...,z^A \ \
)
'
/0<ji
over k(w). Notation 1. If z = (zj)i<,-< n and r = (rj)i<j< n € IN" then the family V
/0<ji
will be denoted by z^. What precedes proves the following result. T h e o r e m 3 . 1 . Let X be a system with variables w, z and £. Let rj S IN (1 < j < n) be given. Let Pi, P2, • - • ,Pa € k{u>) [ZM] be a set of generators of the ideal of definition ofli(w)^r') over k(w). Then z is observable with respect to w if, and only if, the Jacobian matrix
dP dz(ki)
77(2)
(where 1 < i', < a, and 1 < kj < rj, and 1 < j < n) with a rows and r = Y^j=i(rj + 1) columns is of rank r over \t(w)(z^). R e m a r k 3.2. The determination of the polynomials Pi,--- ,Pa may be not immediate even if the choice of the integers rj £ IN (1 < j < n) is arbitrary as stated by the theorem. These polynomials are the equations of the system X with respect to zM. They result from some elimination procedure, namely, the elimination of £ and (Jjx) yzl
iz2
(h)
(j„)\ 1 • • • izn Jj i > n , - - - ,jn>r„
322
from the original equations of the system (and all their derivatives). To proceed with such an elimination problem we have to differentiate the system original equations some number of times, and then use standard (nondifferential) elimination procedures to eliminate all unwanted derivatives of the variable z. How many times do we have to differentiate the system equations to make sure that we shall get all relations between w and zM by eliminating the higher derivatives of z? This is a longstanding elimination problem still unsolved in full generality. Here is where J. F. Ritt's differential algebraic decision methods come into play. In the next section we show that there is a finite set A consisting of polynomials A\,... , Aat which may replace the polynomials P\,. . . , PCT in the above rank condition, and which are easier to compute. 3.2
The rank condition for rational state systems
In the present section we derive a constructive rank condition for the rational state system (1) with coefficients in a differential field k which, for the sake of simplicity, we assume to be of constants. Let p be the ideal of definition of k(u, y) (x) over k(u, y). Let
Pj(U,X,Y)
= qi{U,X)xf)-pj{U,X)
Qi(U,X,Y)
= fi(U,X)-gi(U,X)Yi
(1 < j < n), (1 < i < p).
The set A consisting of Pi (1 < i < n) is an autoreduced set of k{[/, X, Y} with respect to the ranking
{{Ui,U2, • • • , Um} , { X i , X 2 , . . . ,Xn, Yi,Y2,...
,5^}}-
See Notation 1 for the setup of this notation for rankings. Let Q\ = Qi(U,X, Y) (1 < i < p), and let Qf' be the remainder (see appendix 2 section 2.2.3) of Qy' with respect to A. Q\ is merely the derivative, (Q{°])', of Q{°} in which x\1] (1 < j < n) are eliminated by multiplying n 0)
(<9i Y by Y[qh
and then substituting Pj + pj for qjXf\\
<j<
n). The
(=i
linear combination of Pj (1 < j < n) which appears reduces to zero when the remainder is taken. Explicitly,
323 ( n
\
m
n
nr
at
n
I
n
\
m
a
n.£§^'+E|^n« -* (n.) E ^ 1 ' w
d
9i.Tl„.
i=l
„.VW
' 1=1
If the ground field k were not assumed to be of constants the latter formula would contain more terms. Now this formula is nothing but a counterpart of Lie derivatives yielding the rational expression of yi in terms of y, y, x, u and ii.
We 0
have
Q\ '(u,x,y)
=
0, and Q\ '(u,x,y)
=
0 hence
1)
Q< VA-,l/_))QJ (U,X,»)ep. Let Q\3 ' be the remainder of Q]3' with respect to A. Again, Qi (u, x, y) = 0, hence Q3 (it, X, y) G p. This construction is iterated in order to produce the sequence Q\3' (U, X, Y) such that Q]3' (u, X, y) € p(i
1 < i < p, j G IN): 1% C p.
Let P be a nonzero element of p. Denote by the same symbol P the differential rational fraction obtained by replacing u and y in P by U and Y, respectively. There is S in k { t / , Y } such that S(u,y) jt 0, and SP € k {[/, Y} [X], so that S P G l(X) f] k {U, Y} [X]. As such, P verifies
^SP=
Yl
A P J)
^i
(!)
l
jew +P
for some t' G IN" , and Aitj rewritten in the form
G k { [ / , X , Y } . Clearly, equation (1) may be
PASP= J2 AijPlJ)+ £ l
BitjQ?
1<*
for some t G IN" +P , and BUj (i G IN) are in k {U, Y}[X]. Since the differential ideal of k {U, X, Y} generated by A has no nonzero element in common with
324
I(X) f\k{U, Y}[X] (this results from an obvious degree argument based on the fact that P; (1 < i < n) are with degree 1 in their respective leaders X W ) , the first sum in the previous equality must be zero; showing that P is in (Qi(u,X,y),i 6 IM):/^ We have proved the following result. Lemma 3.3. The ideal of definition of the field k(u, y) (x) over k(w, y) is given by p = (Q\j)(u,X,y), 1 < i < p, j £ 1 N ) : / ^ . We denote
by Q[™\ and
(Q\J)) ^
' jeK
by Q® lor I e IN, I > 1,
(QW) V
/ o<j
and we proceed to show that only finitely many Q\' are needed. Lemma 3.4. Let d ° ( u x y i (1 < i < p), be y i _ i > k<«,a:,i/i,... ,yt) = n\ the transcendence degree oflc(u,x,yi,y2,-• • , j/i) overk(u,x,yi,y2, • • • , J/J_I), p
and let 2_jn'i
=
n
' • ^
e
^ave
n
' — n>
an
^
P = (^i
\uiX,y), 1 < i < p).
The definition of n\ is meaningful since each i/j is differentially algebraic over k(u). Recall that d k(«,y 1 , y 2 ,...,j, i _ 1 ) k ( u ' 2 / 1 ' y 2 '- •• ->Vi) is the the order (minus 1) of the minimal differential polynomial of yi over k(u,yi,y2,. • • ,2/i-i)- Therefore, n
'i ^ d k ( « , y 1 , y 2 , . . . , ! , i _ 1 ) k ( u . 2 / i . 2 / 2 , . - -
,yt).
Moreover, the order, o(X) = d£->k(u,y), of X verifies p
n' < J2 dk<«,»i,»,... ,y i -i) k ( u - 2/1,2/2, • • - , 2/i} = o(X) . Now, from the rationality of x and y over k(u)(x), and from the following fact n
= dk(u)k(u, x, y) = d£
it follows o(X) < n, and the first assertion of the lemma, too. To complete the proof of the lemma, we proceed to show that Q\J)(u,X,y) Let j >n'i, and let Q\J'
G (Q[fyu,X,y):I%
(j > n\).
be the remainder of Q\3' with respect to Pl,P2i
• • • ,Pn,Qi
' •
325
As may be seen from its construction, Q\3' does not involve Y„ if a ^ i or a = i and /? > j , and it may also easily be seen that Q\3' is of degree 1 in Y>3'. Therefore, Q}J' (u,X,y) is free of y derivatives. If it were not the (identically) null polynomial, the fact that Q\J' (U, X, y) — 0 would read as a dependence relation of x over k(w) which would be contradictory. The lemma is thus proved. Remark 3.5. It is usually understood that, when computing the observability rank only the first n — 1 proper derivatives of the output equations need be performed. This lemma provides a proof of this matter of fact, perhaps for the first time. Note that more precisely than stated in this lemma, when computing the observability rank, the ith component of y need be differentiated no more than n\ — \ times. The following lemma is there to save us the need for an explicit knowledge of a basis of p. Lemma 3.6. Let K = k(r) = k ( T I , T 2 , . . . , T „ ) be a finitely generated field extension of k. Let p C k [T] = k [Ti, T 2 , . . . , T„] be the ideal of definition of K over k with generators Pi,P2,... ,Pa. Let a ranking of k [T] be fixed, and let A be an autoreduced set with elements A\,A2,... , Aai £ p, and such that its set, IA , of initials does not meet p. If p = (A): 1^ then
rkK(Pi,P2,... ,P
,Aa.).
For any i G IN, 1 < i < a, there are it 6 IN1"', and a'u € k [T] such that
w = i=iE
i=i
Since r is a generic point of p, and I 4 f| p = 0, we have IAi (T) ^ 0 for all 1 < i < a'. Denoting the value of
2, 01 j
c*f j ( r ) , we have
at r by — , and a - i i ( r ) / / ^ ( r ) by UTi
326
dPi 1
^
dAi
1=1
3
and
This shows that the vectors (dPx
dP2
dP^
. . . . . .
are linear combination over K of the vectors
and vice-versa. Therefore these two sets of vectors have the same rank over K, and the lemma is proved. What just precedes shows the following fundamental result. Theorem 3.7. A rational state system is observable if, and only if, the Jacobian matrix of the vector (Q^](u,x,y)) with respect to x is of rank n over 3.3
h(u,y)(x).
Computing with characteristic sets
The following theorem is a new test of observability based on the construction of a characteristic set of the defining differential ideal of X. Theorem 3.8. Let a ranking ofk{W, Z,T} (where T stands for the differential indeterminate corresponding to £) be fixed, let it be such that any derivative of the components of W is lower than Z\, Z2, • • •, and Zn whose derivatives are all lower that T. Let A be a characteristic set ofl(X). If z is observable with respect to w, then each Zi is (effectively) introduced in A by a differential polynomial of order zero and with degree < d^Zi in Zi (1 < i < n). Conversely, if each Zi is introduced in A by a differential polynomial of order
327
zero and degree d, in Zi then z is observable with respect to w, d^Zj > di, and d^Zi divides di • • • d^ • d\ (hence d°wz\ — d\). Assume that z is observable with respect to w. Let Pi be the polynomial of k{W}[Zi] obtained by substituting W for w in the minimal polynomial of Zi over k(w) and by multiplying by the least common multiple of the denominators of the coefficients in k(W). By multiplying Pi by the product of some powers of the initials and separants of the elements of . 4 Q k { W } , the simultaneous remainders of the coefficients of Pi may be substituted for these coefficients, and then consider Pi as reduced with respect to .4P|k{VF}. Having done that transformation, Pi is still with degree d£,Zi in Zi since the initial of Pi could not have been reduced to zero by the fact that it is not in 1(X) P|k{W}. Now, if Zi is not introduced in A by a differential polynomial of order 0 and degree < d£,zi then Pi would be reduced with respect to A, which would contradict the nonnullity of Pi. Conversely, assume that each Zi is introduced in A by a differential polynomial of order 0 and degree di. It is then clear that z\ is algebraic over k(iy), z-i is algebraic over k(w) (zi), etc., andz„ is algebraic over k(w) ( z i , . . . , z n _i). This implies that z is observable with respect to w. The rest of the proof is classical in the theory of algebraic extensions. By the primality of I(X), and the fact that A is a characteristic set of I(X), the differential polynomials in A which introduce z\,... ,zn, respectively, are irreducible over the fields k(w), k(w) (zi), ..., k(w) ( z i , . . . , z n _ i ) , respectively, hence the degrees over k(w) of k(w) (zi), k(iu) (zi,z 2 ), . . . , k{u;) ( z i , . . . , z n _i) are given by di, dz -d\, • • • ,dn- • • d2-di, respectively. This ends the proof.
3.4
Computing regular observations
Let X be a system with variables w, z and £ such that z is observable with respect to w. Given the discussions in 2.3 we won't be able to compute all the regular observations. Here is a partial result. Theorem 3.9. Let X be a system with variables w, z and £ such that z is observable with respect to w. Let A be a characteristic set of X as in Theorem 3.8. Then an observation which, for each member of A, does not annihilate all its coefficients is regular for the observation of z with respect to w.
328
Appendix: Basic differential algebraic geometry The reader may refer to the books 2 ' 9 for more details on the material of this section. Let R be a commutative ring of functions of one variable, with unit, and which contains the field of rationals, Q, as a subring, and let R be equipped with a single derivation, that is, a map of R into R such that (a + b)'=a
+ b,
and
(ab)'=ba
+ ab
for all a, b 6 R where d is the derivative of a. We call R an ordinary differential ring of characteristic zero. All differential rings that will be considered in the sequel will be assumed to be ordinary and of characteristic zero. When R is a field k it is said to be a differential field. The standard fields Q, M and C are differential rings (and fields) of constants. The set, C°°(I, H ) , of real-valued smooth functions defined on a given interval I is a differential ring when equipped with the usual derivation. It is well-known that C°°(I, H) is not an integral domain, hence cannot be embedded in a differential field with the usual addition and multiplication of functions. But the set, C"(1,1R), of real-valued analytic functions is a differential ring and an integral domain. Its quotient field is the differential field of meromorphic functions. The differential polynomial algebra R{T} = R{Ti, T 2 , . . . , TM} is defined to be the polynomial algebra over R in the family of indeterminates (Tj V
)
. The order of an element P G R { T } \ R is
/l
l
JV
denoted by o(P). The object of differential algebraic geometry is the study of the set of solutions (or zeros) of sets of differential polynomials by means of the properties of associated algebraic objects. We shall give to the notion of zero a special meaning here. 1.1
On the admissible trajectories
First we restrict ourselves to differential polynomial equations and inequations (^) with coefficients in a differential field. Next, we look for their solutions (or zeros) in differential field extensions of the ground field of coefficients. These restrictions to fields are for the simplicity of the theory. Proceeding as in algebraic geometry where solutions of polynomial equations and inequations are considered in affine spaces over the algebraically closed field C, we use the notion of differentially algebraically closed field introduced by A. Robinson. A differential field K is said to be closed if it contains zeros of an arbitrary set of differential polynomials with coefficients in K whenever that set of
329 polynomials has a zero in some field extension of K. More precisely, L. Blum provided the following axiomatic definition of differentially algebraically closed fields: for any P,Q € K{T} (here T is single) such that Q ^ 0, o(Q) < o(P), there is r € K such that P(T) = 0 and Q(T) ^ 0. A differential field extension k of k is said to be a differential algebraic closure of k if k may be embedded into any differential field extension of k that is differentially closed. The existence of differential algebraic closures was first proved by L. Blum. As is clear, k is much larger than k, but these two fields have the same cardinality. For more details see 10 . A. Seidenberg 13 ' 14 has shown the following Theorem 1.1. Embedding theorem. Any differential field extension, K = Q ( r i , r 2 ) . . . ,r„), of finite type of the field of rational numbers is isomorphic to a differential field, Q ( r i , T 2 , . . . , T V ) , of meromorphic functions in one variable defined in a domain (open and connected) of the field of complex numbers. This result, a priori, would force us, even for systems with real coefficients, to consider their trajectories with complex time if its real analytic version did not hold. This was obtained by M. F. Singer 15 . Theorem 1.2. Embedding theorem: The real analytic case. Any real differential field extension, K = Q ( T I , T 2 , . . . , T„), of finite type of the field of rationals is isomorphic to a differential field Q ( r i , T 2 , . . . , T V ) , of real meromorphic functions in one variable defined in a neighborhood domain of 0. Of course, as the algebraic closure of the field of reals is not a real field, there is no hope that a differential field possesses a real differential closure. In other words, the Embedding Theorem does not avoid us the need to look at the trajectories of even real systems in abstract differential closures, it rather calls for a decidability result on whether a real system has a real meromorphic trajectories. Such a result was obtained by M. F. Singer 15 : Given a system of differential polynomial equations and inequations with coefficients which are rational functions of the time, the existence of real meromorphic trajectories defined on some open subset of M is decidable. 1.2
Differential algebraic sets
Let fi G IN, [i > 1, let k be a differential field, k a differential closure of k, and let k {T} = k {Ti,T2,... ,TM} be the differential polynomial k-algebra in the differential indeterminates T = (Ti, T 2 , . . . , T^).
330
To any subset £ of k {T} is associated the subset V(S) of kM consisting of the zeros of £ in k , i.e., the elements t = (ti,t2,... ,t^) G kM, such that P(t) = 0 (P G £ ) . It follows V(S) =_V([E]) = V({£}). Conversely, to any subset X of k is associated the subset I(X) of k {T} of elements P such that P(t) =0(teX). 1{X) is a differential ideal of k {T} equal to its radical. A differential algebraic set (ofk ) is a subset X of k such that V(I(X)) = X. The maps, X t-> I(X), and a i-> V(a), between the set of differential algebraic sets of k and the set of the differential ideals of k {T} equal to their radicals, are bijective and inverse to each other. For V I (the composition map of I by V) is the identity map by the definition of differential algebraic sets. The fact that I V is the identity map follows from the Nullstellensatz (see 3 of 12 , and IV.3 of 9)_. A map / : X -» k of a differential algebraic set into k is called a differential polynomial function on X if it is the restriction to X of a differential polynomial map ofk into k, i.e., if there is P in k {T} such that f(t) = P(t) (t G X). The set of differential polynomial functions on X is denoted by k{A"}. k{A'} is easily seen to be a differential k-algebra (for any / G k - ^ } , then / is the differential polynomial function which is the restriction of P to X where P is an element of k {T} whose restriction to X is / ) . Moreover, k {X} is reduced (i.e., with no nonzero nilpotent element) and is isomorphic to k {T}/1 (X) as a differential k-algebra. Let px be the canonical (or natural) surjection k {T} —> k {X} which sends P G k {T} into the differential polynomial function on X defined by P. Clearly, px is a morphism of differential k-algebras, surjective, and with kernel I (X) (the images, through px, of T = (TX,T2,... ,TM) are denoted by T — (TI,T2, . . . , T M ) , and called the differential coordinate functions on X). Hence, as k{T}/I(X), k { ^ } is a reduced differential k-algebra differentially of finite type (i.e., k {X} is differentially algebraically generated by finitely many elements, these elements being, say, r = ( n , T 2 , . . . , T M ) ) . kjA'} is called the differential coordinate ring oi X. To any differential algebraic set X is, thus, associated a reduced differential k-algebra differentially of finite type, k {X}. Conversely, if A is such a differential k-algebra, then A is of the form A = k {T} and the substitution map k {T} —> A which sends P into P(T) is a surjective morphism of differential k-algebras whose kernel, OA, is equal to its radical (since A is reduced). Therefore, to A is associated the differential algebraic set X\ — V ( O A ) (for which Ti,T2,... ,T M are the differential coordinate functions).
331
Roughly, the preceding correspondences, a:X H> k{A^} and /3:A i-> AA, between the class of differential algebraic sets and the class of reduced differential k-algebras differentially of finite type are bijective and reciprocal. For j3a is clearly the identity map, and a/3 is the identity map by the Nullstellensatz. A map / : X —>• y between two differential algebraic sets (X C k , and y C k ) is called a morphism of differential algebraic sets if there are Pi, P 2 , . . . , Pv € k {T} such that f(t) = (P1(t),...,P„(t))
(teX).
It is easily seen that the identity map of a differential algebraic set is a morphism of differential algebraic sets, and that the composition of two morphisms of differential algebraic sets is one (that is, the class of differential algebraic sets together with the morphisms of differential algebraic sets is a category). Now, to a morphism of differential algebraic sets / : X —> y is associated a differential k-algebra morphism / * : k {y} —»• k {X} defined by: /*(a) =
af(aek{y}). The map / i-» /*, thus considered, is clearly injective. Conversely, to a differential k-algebra morphism, >:k {y} -» k {X}, is associated a morphism of differential algebraic sets / : X —> y such that
The differential Zariski topology
The empty set and k itself are differential algebraic sets of k . The intersection of an arbitrary family (Xi) of differential algebraic sets is one since
n i ^ = niV(i(-vi)) = v(u i i(^)).
332
The union of two differential algebraic sets X\ and X2 is one again since Xx U X2 = Y{l{Xl)-l{X2)) where l(Xy) -l(X2) is the set of differential polynomials of the form Pj • P2 (Pi £l(X1),P2£l (X?)). Therefore, the set of differential algebraic sets of k is the set of closed subsets of a topology on k called the differential Zariski topology of k**. A differential algebraic set inherits the differential Zariski topology of its ambient affine space. Let a be a differential ideal of k { T } . K is said to be a differential field of definition of a if k • (a PI K{T}) = a; it will also be said that a is defined over K. If K is a differential field of definition of a then every differential field between K and k is a differential field of definition of o. In what follows, a D K{T} is denoted by o K _ Lemma 1.3. Any differential ideal o / k { T } has a differential field of definition. The smallest differential field of definition of a prime differential ideal is differentially of finite type over Q, hence k is a differential closure of it. If b is a differential ideal of k {T} equal to its radical, with differential field of definition K then V(b) = V ( 6 K ) - If a is a differential ideal o/K{T} equal to its radical, then {a}w T i = k-a, (k • o) DK {T} = a, hence {a}jr T \ is defined over K. See 9 , mainly, IV.4 for a proof. This lemma makes clear the following definition. Let X be & differential algebraic set. It will be said that X is a differential K-algebraic set, or is K-closed, or is defined over K, if I (X) is defined over K, which amounts to saying that X = V(I(X)K) = V ( S ) for some subset £ of K { T } . As above, it may be checked that the differential K-algebraic sets of k are the closed sets of a topology on k , which is called the differential Zariski K.-topology. Topological terms which are used relatively to the differential Zariski K-topology will be prefixed by K—, unless K = k. To a differential K-algebraic set are associated a differential ideal of K {T} equal to its radical, and a reducec' c ifferential K-algebra differentially of finite type which is denoted by K {X}. If T\ , r 2 , . . . , r^ are the differential coordinate functions on X, then K{A"} is isomorphic to the differential K-subalgebra of k {X} generated by K and TI,T2,.
.. , rM.
Roughly, the correspondence between the set of differential K-algebraic sets of k , and the set of differential ideals of K{T} equal to their radicals, is bijective, and that the correspondence between the class of differential Kalgebraic s^ts and the class of reduced differential K-algebras differentially of finite type is bijective. A morphism / : X —> y of differential algebraic sets is said to be defined
333
over K if X and y are both defined over K and if / is defined by some differential polynomials with coefficients in K. To such an / corresponds one, and only one, morphism g: K { ^ } -> K{
Differential quasi-affine varieties.
Let k be a differential field, k a differential closure of k. The topological notions used in this section will be relative to the differential Zariski k-topology and we will drop the prefix k-. A differential affine (algebraic) variety is an irreducible differential algebraic set of an affine space over k with respect to the differential Zariski topology. A differential quasi-affine (algebraic) variety is an open subset of a differential affine variety. A differential rational function on a differential quasi-affine variety X is the restriction to X of one on the closure of X. A differential rational morphism of a differential quasi-affine variety X into another differential quasiaffine variety y is defined to be a differential rational morphism of X into
y.
_
For a differential quasi-affine variety X we denote the algebra k {X} by k {X}, and analogously for the fields. A differential quasi-affine variety is said to be defined over a given differential subfield of k if its closure is defined over that field. Lemma 1.4. Let f : X —t y be a dominant differential rational morphism defined over k. Let x be a generic point of X. f^1(f(x)) is finite if and only if[k(X):k(y)] is so. _ Assume that a: is a generic point of X. Since any point in f~1{f(x)) satisfies any equation verified by the generators xi,X2,... , xM of k(X), if the latter field is finite over k(}>) then f~1{f(x)) is finite, too. Conversely, if k(X) is infinite over k(y), then at least one xt is nonalgebraic over k(y). These nonalgebraic components then take infinitely many values, hence, f~1(f(x)) is infinite. The lemma is thus proved. Remark 1J>. According to this lemma, if f~1(f(x)) is finite for some generic point x of X then f~1(f(x')) is finite for any other generic point x' of X. This allows the following definition of generically finite differential rational morphisms. A dominant differential rational morphism f:X—>y is said to be generically finite if f~1(f(x)) is finite for some generic point of x of X. The integer \k(X):]s.(y)] is then called the degree of f.
334
Appendix: Characteristic sets Calculations on a ring usually invoke a basic procedure known as the reduction procedure. In order to provide differential polynomial algebras with such a procedure, it is necessary to define the notion of a differential polynomial reduced with respect to another one as this notion is defined for usual polynomials by means of their degrees. The concept of ranking is the first step towards that goal. Given a differential ideal by means of one of its sets of generators, does there exist some subset of that ideal the computations on which will make easier the answers to questions on the ideal? A characteristic set of a differential ideal aims to play that role. It is not a set of generators of the ideal but it characterizes the ideal, at least when the ideal is prime. The concept of characteristic set goes back to van der Waerden (who called it basic set) and is most known from the work by J. F. Ritt. 2.1
Preorders on differential polynomial algebras
Let R be a nonzero differential ring. Let R{T} = R { T i , . . . ,TM} be the differential polynomial R-algebra in the differential indeterminates T = (2i,...,TM). A ranking of R{T} is a total order on the set of indeterminate derivatives T< N ), consisting of T^j) (j € IN, 1 < i < /x), which satisfies (i) Ti < i f > (1 < i < M ); (ii) T^
< T^
=> Tf + 1 ) < T^'+l)
(1 < i,i' < »,j,j'
G IN).
By means of the bijective correspondence, T( N ) _> I N ; x
IN
the rankings of R{T} are bijectively in correspondence with the total orders on the set 1N£ x IN which satisfy the following conditions: (i) ( i , 0 ) < ( i , l ) (ii) (i,j) < (i',f)
(l<*
(1 < *,*' < H,3,j' £ W).
Lemma 2.1 (Kolchin 9 ) . The product order is clearly not a well-order on INt x IN, but it verifies the following property: every sequence of elements of
335
IN* x IN possesses an increasing subsequence whose elements all have the same first component. The lexicographic order on IN* x IN with respect to (j, i) (i.e., the order on IN* x IN induced by the lexicographic order on IN x IN* via the bijective map (*\ J) •-*• UJ) of IN* x IN into IN x IN*) clearly verifies the above properties (i) and (ii), hence corresponds to a ranking of R{T}. R e m a r k 2.2. A ranking o / R { T } is a well-order of the set of indeterminate derivatives T^. This results from the following basic lemma. Lemma 2.3 (Kolchin 9 ) . With respect to any total order on IN* x IN which verifies the above property (i), IN* x IN is well-ordered. A total order on IN* x IN is a well-order if every sequence in IN* x IN has an increasing subsequence. Since, by Lemma 2.1, any sequence of elements of IN* x IN has a subsequence whose elements all have the same first component and which is increasing with respect to the product order, it will suffice to show that such an increasing sequence is also increasing with respect to any order < on IN* x IN which verifies property (i). Let (i,j),(i,j') € IN* x IN be such that (i,j) is less than or equal to (i,f) with respect to the product order, and let e = j ' - j . By property (i), (*> j) < (h j + e)> which proves the lemma. N o t a t i o n 1. We agree in the notation
{{Ti.ieiiMTi.ieM,...} to designate a ranking such that the elements of (Ti)j e /j and all their derivatives are lower than the elements of (Tj)j € j 2 , and the latter and all their derivatives are lower than the elements of the succeeding subset, and so on; in each subset (Tj)^^ the indeterminate derivatives are ranked orderly, i.e., according to the lexicographic order of couples (j, i) corresponding to the indeterminate derivative T^ . It is clear that the rankings of k { T } cannot all be put in the previous form, but this notation will be useful. Let P 6 R{T} - R, and let a ranking of R{T} be given. The leader of P is defined to be the greatest (with respect to the given ranking) indeterminate derivative 7^ which appears in P, and is denoted by up. If dp = d ° p P (the degree of P as a polynomial in up) then P can be put, in a unique way, into the form: dp
P = YJWP i=0
336
where It (0 < i < dp) are in R{T} and are free of up, IdP # 0 and every derivative TJ present in J, is lower than up. The initial of P 6 R{T} - R is denned to be its differential polynomial coefficient IdP in the previous decomposition, and is denoted by Ip. The separant of P is the differential polynomial Y^i=i HiUp-1 (= dP/dup), and is denoted by Sp. It is clear in this context of characteristic zero that the separant of P & R is never the zero polynomial. The set of initials of the elements of a subset V C R{T} will be denoted by I-p, and the set of separants of elements of V will be denoted by S-p, while H-p will stand for the union of Ip and Sp. Leaders, initials and separants are, of course, relative to the particular ranking being used. The preorders that will be considered on R{T} are those which extend the total orders that are defined by rankings. With any differential polynomial P of R{T} — R is associated a couple ui(P) = (up,dp) consisting of its leader and its degree in its leader. By convention, w(P) = (0,0) (P 6 R ) , and 0 is less than any element of T^. The set of couples (u,d) (u = Ooru 6 T ^ ) andd £ IN), is lexicographically ordered. The differential polynomials are ordered according to their associated couples, i.e., it will be written P < Q, and said that P is of lower rank than Q, if u){P) < u(Q). When w(P) = w(Q) it will be said that P and Q are of the same rank. The preorder on R{T} thus defined is of course not an order. Given a differential polynomial P of R{T} - R, IP
Autoreduced sets. Reduction procedure
Let R be a nonzero differential ring, R{T} a differential polynomial algebra over R, and suppose given a ranking of R{T}A differential polynomial F 6 R f T } is said to be partially reduced with respect to a differential polynomial P G R{T} — R if -F is free of every proper derivative of u p . F is said to be reduced with respect to P if F is partially reduced with respect to P, and either F is free of up or d ° p (F) < dp. F is said to be partially reduced (reduced) with respect to a given subset E of R{T} — R if F is partially reduced (reduced) with respect to each element of S. A subset S of R{T} — R is said to be autoreduced if each element of S is reduced with respect to all the others. Examples of autoreduced sets are
337
given by sets of single differential polynomials of R{T} - R. The empty set is an autoreduced set, too. In an autoreduced set any two elements must have distinct leaders. Lemma 2.5 (J. F. Ritt). An autoreduced set is necessarily finite, and (in this context of ordinary differential polynomials) its cardinal number cannot exceed \i. If there is an infinite autoreduced set A then the set u^ of the leaders of the elements of A is infinite since the leaders of two elements of A must be distinct. It follows that, with respect to the order on T^ induced by the product order on IN* xlN, it may be found in UA, by Lemma 2.1, an increasing sequence of elements which are derivatives of a unique Tj. All the elements of this sequence are proper derivatives of the first one; this is contradictory. The lemma is thus proved. Lemma 2.6 (J. F. Ritt, Kolchin 9 ) . The leader of P^ is Up'; precisely, if P £ R{T} - R and if j e IN, j > I, then P^ = SPu(p + P* with P* < u(p. Writing P in the form P = Y?i=o huP then P = SPuP + £ ? f 0 UuP. The differential polynomial 5Zi=o ^« u p i s °f lower rank than up since every derivative of Tj present in some U is lower than up. The lemma is thus proved for j = 1; it easily follows by induction on j € IN, j > 1. 2.2.1. Euclidean remainder Let R be a ring, R [T] the polynomial R-algebra in the single indeterminate T, let P,Q 6 R[T] ,Q ^ 0. The Euclidean remainder, P*1, of P with respect to Q, and a corresponding natural integer, t, generalizing the notion of remainder when R is a field are about to be defined. If P = 0 or P ^ 0 and dp < dQ then let to = 0 and PQ = P ; otherwise, let i 0 = 1, d0 = dp - dQ, and P 0 = I$P- IPTd°Q. This is the first step of an induction which leads to P^, and i. Let i £ IN, and assume dj, tj and P; to be defined. If Pj = 0, or Pj ^ 0 and dpt < dg then let n+\ = t, and P ; + 1 = P;; otherwise, let tj+i = n + 1, di+i = dp{ - dQ, and Pj+i = Iqi+1Pi IpiTdi+lQ. We have d; + i < d,. Since the sequence (di) of natural integers is strictly decreasing the above procedure must stop, i.e., there is a least i such that either P = 0 or P,• ^ 0 and dpt < dQ. By definition, P1* = Pj is the Euclidean remainder of P with respect to Q. It is straightforward to check that, assuming i = H, then
IQP = P*
mod (Q), with either P* = 0 or P h / 0 and d p6 < dQ.
338
The computation of P^ and i involves only the operations (addition and multiplication) of the ring R. If factorization is constructively performable in R [T] then the above algorithm may slightly be improved as following: if IQ is seen as a factor of Ipt, say, I p. = Ip^, for some a in R then rather let t j + 1 be Li and Pi+1 be Pt aTdi^Q. If R is an integral domain then the classical Euclidean remainder algorithm may also be performed over the quotient field of R, and then return to polynomials over R by clearing the denominator of the quotient in an obvious way. 2.2.2 Partial remainder Let R{T} be a differential polynomial algebra with a given ranking, and let A be an autoreduced set, A\ < A2 < ... < A„, and let Uj,Ij,Sj and dj be the leader, initial, separant and degree of Aj (1 < j < v). The partial remainder F* of any F £ R { r } with respect to A, and corresponding natural integers o-j (1 < j < u) are about to be defined. If F is partially reduced with respect to A then let FQ = F andCT^O= 0(1 < j < v). Otherwise the set of indeterminate derivatives which occur in F and which are proper derivatives of a leader of at least one A € A is nonempty and finite. Let VQ be its greatest element. The set of elements A of A such that vo is a proper derivative of their leaders also has a greatest element, Aj0. Let 60 be such that v0 = «j0° • Regarding F and A^°' as polynomials in vo, let .Fo be the Euclidean remainder of F with respect to Aj , and 0j-Olo the corresponding integer and
V).
Let i 6 IN, and assumeCT^J(1 < j < u),ji,Vi, and Fi to be defined. If Fi is partially reduced with respect to A, then let cr^i+i =CTJ^(1 < j < v) and Fi+i = F». Otherwise the set of indeterminate derivatives which occur in Ft and which are proper derivatives of a leader of at least one A 6 A is nonempty and finite. Let Uj+i be its greatest element. We have vt+i < u,. The set of elements A of A such that Vi+i is a proper derivative of their leaders also has a greatest element, Aji+1. Let 6i+i be such that vi+\ = Uj.£ . Regarding Ft and Aj.'*1 as polynomials in Uj+i let Fi+± be the Euclidean remainder of Fi with respect to A,.iJ[ , and 0j i+1 ,i+i the corresponding integer, and o-j^i+i = CTji.i, • • • , <7jo,t+l = ffjo,*> a n d aJ,i+l ~ 0 for a11 J ^ J»+l> a n d J 7^ Ji> • • • » a n d
339 3 ¥=• Jo-
Since the sequence (v,) of indeterminate derivatives is strictly decreasing the above procedure must stop, i.e., there is a least i such that Fj is partially reduced with respect to A. By definition, F # = F* is the partial remainder of F with respect to A. It is straightforward to check that, assuming o~j = &j,i (1 < 3 < " ) , t n e n
• F # is partially reduced with respect to A; • S^F = F#
mod [A];
• F* < F , where a = ( o i , . . . ,
More precisely, S^F — F # is a linear combination, over R{T}, of the elements of A^ with leaders of lower rank than up; here, A^ denotes the set of derivatives of the elements of A. The determination of the partial remainder of F with respect to A and of the corresponding natural integers o~i (1 < i < v) involves only the operations (addition, multiplication and derivation) on R. 2.2.3.
Remainder
The remainder F* of any F € R{T} with respect to A, and corresponding natural integers ij,o~j (1 < j < u) are about to be defined. Let F * be the partial remainder of F with respect to A, with OJ (1 < j < v) the corresponding integers. If F* is reduced with respect to A, then let FQ = F#, and Lj$ = 0 (1 < j < v). Otherwise, let j0 be the greatest integer such that F * is not reduced with respect to Aj0. Let F0 be the Euclidean remainder of F * with respect to Aj0 (when F * and Aj0 are considered as polynomials in Uj0), and ijofi the corresponding integer, and ij,o = 0 for all 3 # JoThis is the first step of an induction which leads to the determination of F*, and of the integers Lj (1 < j < v). Let i £ IN, and assume i^i (1 < j < v), ji, and Fi to be defined. If Fj is reduced with respect to A, then let tj,i+i = tj,i (1 < j < v), and Fi+i = Fj. Otherwise, let ji+i be the greatest integer such that Fi is not reduced with respect to Aji+1.
340
We have jj+i < j ; . Let Fj+i be the Euclidean remainder of Fj with respect to Aji+1 (when Fi and Aji+1 are considered as polynomials in Uji+1), and iji+ui+i the corresponding integer, and tjiti+i = ij{,»,... ,ij0,i+i = ij0,u and ijti+1 = 0 for all j ^ j i + 1 , and j ^ ji, - -., and j ^ j 0 Since A is finite the above procedure must stop, i.e., there is a least i such that F, is reduced with respect to A. By definition, F* = Fi is the remainder ofF with respect to A. It is straightforward to check that, assuming ij = t,jti (1 < j ' < u), then
• F* is reduced with respect to A; • H\F • F*
= F* mod [A];
where n = ( t i , . . . ,iv,o\,
... ,av), and V
HZ =
Y[II/S;>. 3= 1
More precisely, H%F — F* is a linear combination over R{T} of the elements of A^ with leaders of lower rank than up. 2.3
Differential quotient ideals
Let R be a (nondifferential) ring. A multiplicative subset m of R is a subset containing 1 and such that a b G m whenever a and b are in m. A multiplicative subset m is said to be generated by its finite subset g if its elements can be written as a\la^2 • • • a*" for some natural integers i\,.-- ,is, and where a i , . . . , a s are the elements of g. Let o be an ideal of R. The set o: m is defined to be the elements a of R such that there is m € m such that ma E a. If m is generated by Q (consisting of a i , . . . ,a s ) then an element o o f o: g°° is one for which there are natural integers ii,... , is such that Oj1 a%2 ••• a\'a 6 o. For the sake of notation compactness we shall write the previous relation as gla € a, where g is the set consisting of a\,... , a s and i stands for ( i i , . . . , i s ) . Now a: m is an ideal whenever o is one. The notation a: a°° is used when m is generated by the single element a. The ideal a:a°° is equal to its radical if a is so. If o is prime and a $. o then o: o°° = o. In differential rings, o:m is a differential ideal if o is so. See Corollary of Lemma 1 of 1.2 of 9 for a proof. Let k be an ordinary differential field and k{T} a differential polynomial algebra equipped with a ranking.
341 Lemma 2.7. If a nonempty subset A of k{T} is autoreduced then any element of [A]:H™ which is partially reduced with respect to A is in (A):H^. Let P € [A] :H%. Then there are Pitj 6 k{T}, 1 < i < u,j £ IN which are all zero but a finite number of them such that
HnAP= £
PiAU)
l
for some n G IV". Let M, be the leader of A{. Let j > 1. The degree of Af' in its leader v.]3' is 1. If Pi>j>, {i',j') ^ {i,j), effectively involves u\3', we reduce it with respect to Af' to obtain
where mi'ji € IN, and Pi',j>,o is free of u\3'. In the above expression of H^P, upon modification of n, we may thus replace the terms Pi'tjiA\J ' and PijA\3' by Pi',j',oA\'"> a n d (pi,i + Qi'j'Ai"*)^' respectively. We repeat this type of substitution as many times as necessary to be able to consider Py j>, in the above expression of H%P, as free of uf' whenever
(i',J')7*(M).' Hence, looking at the degree in u\3' of both sides of the expression of H^P, we use the fact that P does not involve uf' (it is partially reduced with respect to A), and the fact that (if (i',j') ^ (i,j)) to conclude that Pij must be zero. The expression of H^P then reduces to
H$P = J2
Pit0Ai,
l
that is, P is in (A):H^, which ends the proof of the lemma. This lemma may also be seen as a consequence of a lemma due to A. Rosenfeld. 2-4
Characteristic sets
Let R{T} be a differential polynomial algebra with a given a ranking. Let A and A' be two autoreduced sets with respective elements Ai,... , Av and A[,... , A'v, numbered increasingly. A preorder on the set of autoreduced sets of R { 7 i , . . . , T^} is defined by assuming that A < A' (it is then said
342
that A is of lower rank than A') if one of the following two conditions is satisfied: • There is some natural integer j such that 1 < j < min(i/, u'), and Ai and A\ are of the same rank (1 < i < j), and Aj < A^; • v > v', and Ai and A\ are of the same rank (1 < i < v'). A and A' are said to be of the same rank if v = v' and if Ai and A\ are of the same rank (1 < i < v). If A < A' or A and A' are of the same rank then it will be written A < A'. The relation < thus defined, clearly, is a preorder. Any two autoreduced sets may be compared, i.e., one of the relations A < A', A' < A holds. Moreover, the following result holds. L e m m a 2.8 (J. F. Ritt). A nonempty set E of autoreduced sets contains a least element, i.e., an element A such that A < A' (A' 6 E). The elements of an autoreduced set are supposed to be increasingly numbered. Let E 0 = E, and define E, {i £ IN, i > 0), to be the set of elements A of Ej_i such that A is with cardinal number greater than or equal to i, and such that Ai is a least element of the set of ith differential polynomials of the elements of E»_i. This decreasing sequence of subsets of E must terminate by E; = 0 for some i, since otherwise the leaders Vi of the ith differential polynomials of the elements of Ej would be a sequence of derivatives of indeterminates which are not proper derivatives of one of them; this would contradict Lemma 2.1. Since Eo = E is nonempty then there is j € IN such that Ej + i = 0 and any element of Ej is a least element of E. This shows the lemma. This proof is due to Kolchin 9 . If a differential polynomial P 6 R{?"} — R is reduced with respect to an autoreduced set A then P and the elements A of A which are reduced with respect to P form an autoreduced set which is lower than A. It results from this that an autoreduced set A of a family V of differential polynomials (i.e., an autoreduced set with elements in V) is a least element of the set of autoreduced sets of V if, and only if, V does not contain any element of R{T} — R which is reduced with respect to A. Let a be a differential ideal of R{T}, A an autoreduced set with elements in a. If A is a least element of the set of autoreduced sets of o, then for any AeA,
SA&a=>
IA&a.
343
Indeed, if A is in A, and if IA € a, then A-IAUAA € a. Since A-IAUAA reduced with respect to A, it results from the minimality of A that A—IAUAA R. Now
0 = -£-{A ouA
- IAudAA) = SA-
dAUu^1
is €
€ a,
hence SA £ a; which proves the assertion. It follows that if a is a proper differential ideal of R{T}, and if A is a least autoreduced set of o then the separant of an element of A cannot be in a if it is not a nonzero noninvertible element of R. Consequently, if R = k is a differential field then the separant of an element of a least autoreduced set of a proper differential ideal of k{T} is not in this ideal. Definition 2.9. A characteristic set of a differential ideal a o / R { T } is defined to be a least element of the set of autoreduced sets A of a such that
SA
iip = Y,PiAi, »=i
344
for some n € IN". If P;<, i' ^ i, effectively involves Wj, we reduce it with respect to Ai to obtain
IA? Pi' =Qi'M
+ PVfi
where m? € IN, and /V, 0 is free of Wj. The fact that Pj0 is free of ut results from the fact that Ai is of degree 1 in Uj which, in turn, reflects the assumption that A is orthonomic. In the above expression of I^\P, upon modification of n, we may thus replace the terms Pi
345
6. J. P. Gauthier, I. A. K. Kupka, Deterministic Observation Theory and Applications, (Cambridge University Press, Cambridge, United Kingdom), to appear. 7. ft. Hermann, A. J. Krener, Nonlinear controllability and observability, IEEE Trans. Automat. Control, 22, 728-740 (1977). 8. N. Jacobson, Basic Algebra. II, (W. H. Freeman and Company, San Francisco, 1980). 9. E. R. Kolchin, Differential Algebra and Algebraic Groups, (Academic Press, New York, 1973). 10. E. R. Kolchin, Constrained extensions of differential fields, Adv. in Math., 12, 141-170 (1974). 11. J. F. Pommaret, Geometrie differentielle algebrique et theorie du controle, C. R. Acad. Sci. Paris Ser. I, 302, 547-550 (1986). 12. A. Seidenberg, An elimination theory for differential algebra, Univ. California Publ. Math. (N.S.), 3, 31-65 (1956). 13. A. Seidenberg, Abstract differential algebra and the analytic case, Proc. Amer. Math. Soc, 9, 159-164 (1958). 14. A. Seidenberg, Abstract differential algebra and the analytic case II, Proc. Amer. Math. Soc, 23, 689-691 (1969). 15. M. F. Singer, The model theory of ordered differential fields, J. Symbolic Logic, 43, 82-91 (1979). 16. J. C. Willems, System theoretic models for the analysis of physical systems, Ricerche Di Automatica, 10, 71-106 (1979).
This page is intentionally left blank
347
E X I S T E N C E T H E O R E M S IN N O N L I N E A R REALIZATION THEORY A N D A C A U C H Y - K O W A L E W S K I T Y P E T H E O R E M BRONISLAW JAKUBCZYK Institute of Mathematics, Polish Academy of Sciences, 00- 950 Warsaw, Sniadeckich 8, Poland E-mail: [email protected]
1
Introduction
The realization problem, stated vaguely, is the problem of reconstruction of a system given in the form of a "black box". Representing the input-output behaviour of the black box by a nonlinear causal operator
F-.u-^y which maps functions of time u(-) into functions of time ?/(•), the problem consists of two sub-problems: (a) Find conditions on F which guarantee that this operator can be realized as the input-output operator of a finite dimensional controlled and observed system r : x = f(x,u),
x(0) = xo
y = h(x),
where the state variable x belongs to a differentiable manifold X. (b) Reconstruct the system V, called realization of F, from the operator F. The main difficulty in the problem is to construct the state space X which is not given a priori and can be an arbitrary differential manifold. It was shown by Sussmann 10 that, if the realization exists and is complete, there exists a minimal (i.e. transitive and observable) realization which is unique up to a diffeomorphism of X. Existence criteria for global complete realizations were given in 2 . We refer the reader to the survey 4 for more information concerning other versions of the problem. In this paper we recall some results concerning the existence problem (a) (Section 2). The following result is their consequence (Section 3).
348
If a formal power series is converging along a family of formal vector fields {Xi}iej, then it is converging along any family of vector fields in the Lie algebra Lie{Xi}i^i. Here we call a formal power series h on W1 converging along a family of formal vector fields {Xi}i&i on K." if it converges with respect to the variables h,...,tk, when composed with the flows: hoexp(tkXik)o• -oexpitiXi^iO), for any choice of k > 1 and i i , . . . , i * . In Section 4 we show that the above result has implications useful in PDE's. Following an idea of T. Morimoto, we prove a theorem on existence of converging solutions for analytic systems of nonlinear PDE's. (This theorem can be considered as generalization of a Malgrange version of Cartan-Kahler theorem 7 .) 2
Existence of analytic realizations
Let U and Y be fixed sets, called input and output spaces. We fix an interval [0, T), where 0 < T < oo. Given two families of functions [0,T) -> U and [0,T) —> Y denoted by It (input signals) and y (output signals), respectively, we consider an operator F :U -» y. Recall that F is called causal (respectively, strictly causal) if (Fu)(t) depends on the values of u on the interval [0, t] (respectively, on [0,£)), only, i.e. on the function a = «|[o,t] (respectively, a = u|[ 0i t)). If F is strictly causal, then we shall denote by (F, a) the value of y = Fu at the end of action of a, i.e., (F, a) = y(t) = (Fu)(t)
where
a = u [0>t ).
For the aim of this article it is enough to assume that U consists of piecewise constant functions, left-continuous in the discrete topology of U, where U has at least two elements (see 2 for a more general case). For simplicity, we will take Y = W and y equal to all functions y : [0, T)->Y. Let X be a connected real analytic differential manifold. Consider a control system on X r : x = f(x,u),
a;(0) = x 0
y = h(x),
where x(t) € X, u(t) € U, y(t) £ Y and / , h are analytic. Assuming the initial point XQ fixed, the system T defines a unique strictly causal operator Fr : u(-) -> 2/(-)
349 provided that all the solutions are well denned on the interval [0, T). Namely given a control function u(-), there exists a unique trajectory x(-) of system T starting from XQ, and y(-) = F(u(-)) is defined as y(t) = h(x(t)). This operator is called the input-output operator of T. Given a causal operator F : U —> y, a system T is called realization of F if the input-output map Fr of T coincides with F. Our aim is to present conditions on F which guarantee existence of realizations. We denote by a=
(ti,ui)---(tk,uk)
the piecewise constant function defined on the open interval [0, \a\) by a(t) = Ui G U for t G [Ti-i, Ti), where T0 = 0, Tt = ti H 1- U for i = 1 , . . . , k, and \a\ = Tk- The set of such functions forms a semigroup, denoted by S, with concatenation as product. Namely, for two functions a € S and b £ S their product is defined as (ab)(t) = a(t) for t € [0, |a|)
and
(ab){t) = b(t - \a\) for t G [|a|, |a| + |6|).
or, in our notation, ab = (*i,ui) • • • (tk,uk)(si,
vi) • • • {sm,vm)
if 6 = (si,«i)---(s T O ,u m ). The fact that T is a realization of F can be expressed by the equalities (F,a) =hoexp(tkfUk)
o---oexp(hfUl)(xo)
satisfied for any u\,...,uk G U and t2 H \-tk
->
(F,{ti,u1)---{tk,uk)),
are analytic on the sets ££ = {(*!,...,tk): *i,...,tfc>0,
«! + • • • + « * < T } ,
350
for any ui,...,v,k 6 U and k > 1. This means that these maps have analytic extensions to neighbourhoods of these sets, in particular, they are defined and analytic in a neighborhood of 0 6 R*. Thus the expressions ( F , ( t i , u i ) • • • (tk,Uk)) are well defined for small negative t\,...,tk, too. If U is a subset of Rm, then the same operator is called jointly analytic if the maps
(ti,...,tk,ui,...,uk)
->
(F,(ti,ui)---(tk,v,k))
are analytic on the sets E^ x Uk, for any k > 0. Suppose that Y = M. We define the rank of the strictly causal operator i^by
r a n k F = suprank{ — ( F , a 6 j ) } * j = 1 , C/Ej
where {•}* - =1 denotes a k x k matrix, and the supremum is taken over all k > 1, all elements a = (£i, ui) •••(**, u*) 6 5 and over all sequences b\,..., bk £ S such that \a\ + \bj\ < T. If Y = W,thenF = ( F i , . . . , F , ) and the definition of the rank is the same, with (F,abj) changed for (Fs^,abj) and the supremum is taken also over all sequences s ( l ) , . . . , s(k) with values in { 1 , . . . , q}. Consider a realization T of F represented by a quadruple T = (X, f, h, x0). We call this realization analytic (respectively, jointly analytic) if the differential finite dimensional manifold X is real analytic and Hausdorff, and f(-,u) and h are analytic with respect to x, for any u G U (respectively, analytic jointly with respect to (x,u) € X x U). The realization is called minimal if it is transitive (i.e. the orbit of any point is the whole X) and observable (i.e. for any two different initial points on X there exists a control which produces two diferent output functions). We call the realization complete if the vector fields f(-,u) are complete. Theorem 1. (a) 5 . If U is a compact subset of W1 then a strictly causal operator F has a jointly analytic realization T if and only if F is jointly analytic and rank F is finite. Then there exists a minimal realization of F with X paracompact and dimX = r a n k F . (b)1,5. IfU is a finite set, then the same is true with F and F "analytic" instead of "jointly analytic". (c) 2. There exists a complete analytic realization of strictly causal operator F (with X possibly not paracompact) if and only if the maps ( (F, (ti,ui) • • • (tk,Uk)) have analytic extensions to Rk and r a n k F is finite.
351 Remark 1. The theorem also holds for causal F, then one should choose the observation function y = h(x,u) depending on u, cf. 5 . Remark 2. Analysing the proof in 1 or 4 one can see that in the case of finite U (statement (b)) it is enough to assume that any of the maps (ti,...,tk) —> (F,(ti,ui)---(tk,Uk)) is analytic on some neighbourhood of 0 € K* and this (together with finiteness of the rank) implies that there exists a T > 0 such that all these maps have analytic extensions to E^. In order to provide some intuition to the notion of rank we will show that the inequality rank F < dim X holds for any realization of F (see 3 for more details). For a given realization we denote $a{x) = exp(£fc/„fc) o • • • o exp(tifUl)(x), for any a = (-> ((F, ubi),... , (F, ubk)) is analytic, if the realization is analytic, and it is equal to the composition of two maps Rk -» X ->• Rk, namely of (*i,..., **) -> x = $ a (^o) and x -¥ (h o <j>6l ( x ) , . . . , h o $bk (a;)). Therefore, the rank of the Jacobian matrix of this map can not exceed dimX. In this way we see that if F has a realization V = (X, f,h,x0), then r a n k F < dimX. 3
Convergence along vector fields and their commutators
We will use statement (b) of Theorem 1 in order to show that if a formal power series is converging along a family of vector fields then it is also converging along their commutators. More precisely, we have the following result. Theorem 2. Let h be a formal power series in R[[xi,..., xn}} and let X\,..., Xr be formal vector fields on W1, with coefficients in E [ [ x i , . . . , xn]], such that the estimate (A)
\(Xil---Xikh)(0)\
holds with some positive constants C and p independent of k and i\,... ,ikThen for any formal vector fields Z\,...,Z^ in Lie{Xi,..., Xr} there exist constants C\ > 0 and pi > 0 such that the following estimate holds (B)
\(Zjl---Zjmh)(0)\
352
for any 1 < j \ , . . . ,jm < N and m > 1. Proof. Let our control set be U = { 1 , . . . , r } . We define a strictly causal operator F by the formulas (with X? denoting the p-th power of the differential operator Xu) fPk
(F,(t1,u1)---(tk,uk))
(XZl--.X%h)(0)tp11----±-\---PkL
=
From the estimate (A) it follows that the functions of ( t i , . . . , tk) are well defined and analytic in a neighbourhood of 0 € Rk. Moreover, it follows from the formula ^2P+n=s ^ ^ ( P ^ O - 1 = (*» + * j ) s ( s 0 _ 1 t n a t w e have the identities (F, ( t ! , u i ) • • • ( f , « ) ( * " , u) • • • (**,«*)> = (F, ( * i , « i ) • • • ( « ' + « " , « ) • • • ( « * , « * ) > .
so the left hand side does not depend on the representation of the piecewise constant function and the expression {F,(ti,ui)---(tk,Uk)) is well defined if the switching times ti,...,tk are small enough (possibly negative). This means that the strictly causal operator F is well defined and analytic for ti,... ,tk small enough and we can write (F, (*i,«i) • • • (tk,uk))
= (exp(i!.X Ul ) •
••exp(tkXUh)h)(0),
p
where exp(iX„) = J2pX^t /p\ is the formal series of differential operators. In order to be able to use Theorem 1 (statement (b)) we have to show that F has finite rank. Note that the entries of the matrix defining the rank of F can be written in the form (for simplicity of notation we assume 7 = 1) — (F,abj) = —{F^i^u^a^abj)^
=
-^(F,ai{e)bj)\t=0,
with a = {h,ui)---(tk,uk),ai = (h, «i) ••• (£*,«*), ajl = (—ti,Ui)---(—t\,Ui),ai(e) = a,i(e,Ui)a~ , and bj — abj (here we use the fact that the expressions (F, (h,ui) • • • (tk,uk)) are well defined for ti,...,tk possibly negative and small. Given a vector field Y, we denote (adY)(X) = [Y, X] - the commutator of Y with X. Then we have the formula exp(tY)X exp(—tY) = exp(t&dY)X It can be seen from the definition of the operator F and the above formula that the matrix appearing in the definition of rank F has the entries
a
ij = frj-(F>abj) =
((exp(tiadX U l ) • • •exp(t i _ 1 adX„ i _ 1 )(X Ui )(exp(s^X t) i) • • •exp(s^X j) i > )/i)(0),
353
where we write for simplicity bj = (s{,v{) • • • (s£,,w4) € S (m depends on j). Now we will show that
d = det{ a i j } = det{ — (F,abj)}^=1
=0
if k > n. This will imply that r a n k F < n. Note that the coefficient aydepends analyticaly on the parameters, a»j = aij{ti,... ,ti,s\,... ,sJm) (where m depends on j). It is enough to show that any iterated partial derivative of d = det{ajj} with respect to parameters is equal to zero, when the parameters are equal to zero. Since the determinant is multilinear function of the rows of the matrix (and also multilinear function of columns), we can differentiate it easily with respect to the parameters, by differentiating the corresponding rows (when we differentiate with respect to t\, £2, , or tk) or columns (when we differentiate with respect to the remaining parameters). Thus it is enough to show that the determinants of the form det{ajj(0)} are equal to zero, where Sy(0) = {DfD^aij){Q)
= dHj(0) Yi(0)
and Df denote monomials of partial derivatives with respect t — (ti,..., tk) while DJ denote monomials of partial derivatives with respect to the variables si = (s{,..., sJm) such that Yi = (adX U l )°i • • •
(&dXUi_y<-iXUi
and
H^iX^-.-iX^h. The k x k matrix of the form {ajj(O)} = {dHj(0) 5^(0)} has rank at most n since it is a product of two matrices of size k x n and nx k with real entries, thus its determinant vanishes when k > n. This implies that det{ajj} = 0 and so r a n k F < n. From statement (b) of Theorem 1 it follows that F has an analytic realization (X, / , h, XQ) (we denote its output function by h in order to distinguish it from the original formal power series h). Thus we have fPl
(F,(h,u1)---(tk,uk))
= (fZl---f%h)(x0)''
fPk
*
354
for * i , . . . , i/fc close to zero. Comparing with the definition of F we get
{Xl\.--Xllh){Q) = {Pu\...rulh){x0), for all k > 1 and ui,...,Uk G { 1 , . . . , r } . Here X„ ; and fUi are formal and, respectively, analytic vector fields, while h is a formal power series and h a local analytic function. Each of the formal vector fields Z\,..., ZN is obtained from the formal vector fields X\,..., Xr by a finite number of operations of taking linear combination and commutator (Lie bracket). Define the vector fields Z\,..., Z/v using the same operations in the same order, but starting with the vector fields / i , . . . , / r of our realization. Then Z\,..., ZN are analytic as / i , . . . , fr are. It follows from the above equalities that (Z%---Z%h)(0)
=
(Z%-.-Z%h)(xo).
Since the vector fields ZUl,..., ZUk and the function h are analytic, the Cauchy type estimate (B) holds which can be shown by a classic argument. The proof is complete. Another less direct proof of Theorem 2 was given in 6 . 4
Existence of analytic solutions of P D E ' s
The result stated in the preceeding section can be used to improve regularity statements for solutions of a class of nonlinear partial differential equations (possibly singular) on nilpotent Lie groups as considered in Morimoto 8 , 9 and in 6 . We begin with recalling a Malgrange version of the Cartan-Kahler theorem 7 . Let z € C" and u denote an unknown function of z with values in C™. We consider the following differential equation on C" (E)
^(x,{DIu}lI]
where I = (ii,...,in), D1 = (dZl)n • • • (<9Z„)*" and * is a vector valued function, holomorphic in a neighbourhood of a point (ZQ, { M ° } | / | < P ) in C X enJVfp). Consider a polynomial u = T,\i\
355
strongly prolongable if for any s > q and any s-jet extension jsu of jqu which is a s-jet solution of (E) there exists a (s+l)-jet solution js+1u of (E) whose s-jet is jsu. Theorem 3. 7 If up is a strongly prolongable p-jet solution of (E), then there exists a solution u = ^2u(j(I\)~1(z — ZQ)1 converging in a neighbourhood of ZQ such that jpu = up. This theorem was extended by T. Morimoto to the case of noncommuting partial differential operators on nilpotent Lie groups by introducing power series in formal Gevrey class. Consider a graded nilpotent Lie algebra g = gi © 92 © • • • © gs of finite dimension n, with weighting w(X) = i if X € #,. Let G be a corresponding nilpotent Lie group. For any element X 6 g we denote by X the corresponding left invariant vector field on G. Let us fix a basis X±,. ..,Xn of g adapted to the direct sum, i.e. so that Xi € 9j(i)i f° r a n y i = 1, . . . , n . Consider the corresponding vector fields X±,...,Xn on G. Note that any left invariant differential operator on G can be written as a polynomial of the first order differential operators Xi,... ,Xn. For (a germ) of m functions y = h(x) = (hi(x),... ,hm(x)) on G and a multiindex / = (ii,... ,i^) we denote yj{x)
= hi(x)
= Xjh
= (Xh
•
••Xikh)(x),
(we take the derivatives componentwise) and w(I) = w{Xil) + • • • + w(Xik). After Morimoto 8 , we define weighted jets of h as follows. Two functions h,h' : G —> Mm (denned locally around the identity e) are called q-equivalent at e if (Xih)(e) = (Xjh)(e) for any I such that w(I) < q. The equivalence class of h is called the (weighted) jet of order q of h at e and is denoted by jqh. We consider the following differential equation on G *(x,{yi}w(i)
with real vector valued * . A q-jet j h, q>p,is if, after plugging it to the equation, we have (Xj^(x,{yi}w(I)
=0
(4.1)
called a q-jet solution of (4.1)
=0
356
for all multiindices J such that w(J) < q — p. Similarly as earlier we call a q-jet solution yq = jgh of (4.1) strongly prolongable if for any s-jet extension jsh of j9h, s > q, which is a s-jet solution of (4.1) there exists a (s+l)-jet solution js+1h of (4.1) whose s-jet is j'h. We have the following theorem which is Theorem EA in 9 and Theorem 3 in 6 (in fact, Theorem 3 in 6 is slightly stronger). Theorem 4. Let \P be in formal Gevrey class with respect to x and analytic with respect to yj at (e,yp). If yp = {yi}w(i)
(4.2)
for any 1 < i i , . . . ,ik < n andfc> 1, where w(I) = w(Xi1) + • • -w{Xik). The main result of 8 says that if * in the equation (4.1) is analytic with respect to yi and in formal Gevrey class with respect to x then, for any m>p and any finite jet solution jmh of the equation which is strongly prolongable, there exists a solution h in formal Gevrey class with jmh = jmh. Proof Theorem 3. It follows from 8 that there exists an infinite jet solution h which is in formal Gevrey class. This implies, in particular, that this solution satisfies the estimate (A) in Theorem 2 with respect to the vector fields X\,...,Xr generating linearly gi (since w{X) = 1 if X € #i). Since #i generates g as Lie algebra, it follows that there exists a linear basis Z\,..., ZM of g such that Zi 6 Lie{Xi,... ,Xr}, i = 1,...,N. Applying Theorem 2 we see that the Cauchy type estimate (B) holds for h and the vector fields Zi,..., Z^. Since Z i , . . . , ZN form a basis of the tangent space at identity e, it follows that h is converging. The proof is complete. Theorem 3 was conjectured (and proved in the case where g was the Heisenberg algebra) by T. Morimoto during the Banach Center Symposium on Differential Geometry and Mathematical Physics in April 1995. It was later published in 9 , without detailed proof (cf. 6 for more details on Theorems 2 and 3).
357
Acknowledgments Supported by Polish KBN grant 2P03A 035 16. References 1. F. Celle, J.-P. Gauthier, Realizations of nonlinear analytic input-output maps, Math. Systems Theory, 19, 227-237 (1987). 2. B. Jakubczyk, Existence and uniqueness of realizations of nonlinear systems, SIAM J. Control and Optimiz. 18, 4, 455-471 (1980). 3. B. Jakubczyk, Local realizations of nonlinear causal operators, SIAM J. Control and Optimiz. 24, 2, 230-242 (1986). 4. B. Jakubczyk, Realizations of Nonlinear Systems; Three Approaches, in Algebraic and Geometric Methods in Nonlinear Control Theory, M. Fliess, M. Hazewinkel Eds., (Reidel, 3-31, 1986). 5. B. Jakubczyk, Existence of global analytic realizations of nonlinear causal operators, Bull. Pol. Acad. Sci., Ser. Math., 36, 729-735 (1986). 6. B. Jakubczyk, Convergence of power series along vector fields and their commutators; a Cartan-Kahler type theorem, Ann. Polonici Mathematici, 74, 117-132 (2000). 7. B. Malgrane, Equation de Lie II, J. Differential Geometry, 7, 117-141 (1972). 8. T. Morimoto, Theoreme de Cartan-Kahler dans une class de fonctions formelles Gevrey, C.R. Acad. Sci. Paris, 311, serie A, 433-436 (1990). 9. T. Morimoto, Theoreme d'existence de solutions analytiques pour des systemes d'equations aux derivees partielles non-lineaires avec singularity, C.R. Acad. Sci. Paris, 321, I, 1491-1496 (1995). 10. H.J. Sussmann, Existence and uniqueness of minimal realizations of nonlinear systems, Math. Systems Theory, 10, 263-284 (1977).
This page is intentionally left blank
359 N O R M A L I T Y , LOCAL CONTROLLABILITY A N D N O C FOR MULTIOBJECTIVE OPTIMAL CONTROL P R O B L E M S ABDERRAHIM JOURANI Departement de Mathematiques Analyse Appliquee et Optimisation BP 47 870, 21078 Dijon Cedex France E-mail: [email protected]
1
Introduction
The main intention of this paper is devoted to normality, seminormality and local controllability of optimal control problems of unbounded differential inclusions with nonconvex admissible velocity sets. We consider here systems of the form
x(t) e F(t,x(t))
a.e. t € [a, b],
(x(a),x(b))
€ S
(1.1)
where F: [a,b] x E" t-» E n is a multivalued mapping and S C E" x R" is a nonempty closed set. The domain over which the study of system (1.1) occurs is typically one of the functions W 1 ' 1 ([a, 6],E n ) (abbreviated W1,1) consisting of all absolutely continuous functions x: [a,b] H> E n for which | i | is integrable on [a, b] (x denotes the derivative (almost everywhere) of a;). An arc is a function in W1'1. The space W1'1 is endowed with the norm ||a;|| = \x(a)\ + J
\x(t)\dt
Ja
where | • | denotes the euclidean norm of EP. Consider the multivalued mapping G: E™ x E n i-> W 1 ' 1 defined by G(u, v) = {x G W1'1:
x(t) e F(t,x(t))
a.e., (x(a) + u,x(b) +v)e
5}(1.2)
The distance function on W1'1, E" or E" x W1 will be denoted by d(-, •). Let z b e a solution of system (1.1). This system is said to be semi-normal at z if there exist a > 0 and r > 0 such that d(x, G(u, v)) < a{d((x(a) + u,x(b) +v);S)+
/ d(x(t); Ja
F(t,x(t))dt}(1.3)
360
for all x € B(z, r) and u, v € r l . Here B stands for the closed unit ball in W1 and B(z,r)
= {xeW1'1:
||x-z|| < r}.
Finally, system (1.1) is said to be locally controllable at z if there exist a > 0 and r > 0 such that G(u,v)DB(z,a(\u\
+ \v\))^
Vu,v€rM
(1.4)
It is obvious that seminormality implies local controllability Our aim here is to present sufficient to develop necessary optimality conditions problems with a general preference. These terms of the Euler-inclusion in the refined
conditions for seminormality and for multiobjective optimal control condtions are in part expressed in form
p(t) £ coD*F(t, z(t), z(t))(-p(t)
a.e. t e [a, b]
(1.5)
where D*F(t, •) means the coderivative (24) of F(t, •) in x at the point (z(t),z(t)) and "co" stands for convex hull. The conditions presented are also given, in addition to (1.5), in terms of the transversality condition (p(a),-p(b))6N(S;(z(a),z(b)) and the maximum (p(t),z(t))
(1.6)
condition = max{
6 [a,b].
(1.7)
Here N(C; x) refers to the limiting normal cone to C at x. System (1.1) is termed to be normal at z if there is nonzero p € W1'1 satisfying (1.5)-(1.7). We prove that, under a Lipschitz assumption, normality implies seminormality and hence local controllability. These questions, in addition to being of interest in their own right, have an important and underestimated bearing on other aspects of the subject, like sensitivity analysis and necessary conditions. Consider, for example, necessary optimality conditions (or the maximum principle) of optimal control problems of unbounded nonconvex differential inclusions of the form min f(x(a),x{b)) over all arcs x satisfying : (x(a),x(b)) E S, x(t) G F(t,x(t))
(1.8) a.e. t e [a, b]
361 Suppose F satisfies measurability requirements and that / is locally Lipschitzian around (z(a),z(b)), where z is a local solution of the problem (1.8). If system (1.1) is semi-normal at z then z is a local solution of the problem min{f(x{a),x(b))
+ kfa{d((x(a),x(b));S)
+ f
d(x(t));F(t,x(t)))dt}}
Ja
where a > 0 is as in (1.3) and kf is a Lipschitz constant of / at (z(a),z(b)). This problem is of Bolza type, and the theory of necessary optimality conditions (which has been developed by several authors including Clarke 5 - 6 , Ioffe 13 , Loewen-Rockafellar 2 1 - 2 3 , Ioffe-Rockafellar 14 , Mordukhovich 26 - 27 , Jourani 15 ) yields the existence of p € W1'1 satisfying, in addition to (1.5) and (1.7), the following transversality condition (p(a), -p(b)) e df(z(a),z(b))
+ N{S; (z(a), z{b)).
Now if system (1.1) is not semi-normal then it is not normal, that is, there exists p € W1'1, with p ^ 0, satisfying (1.5)-(1.7). Thus in either case there is a nontrivial pair (A,p), with p € W1'1 and A 6 {0,1}, satisfying together with (1.5) and (1.7) the following condition (p(a), -p{b)) e Xdf(z(a), z(b)) + N(S; (z(a), z{b)).
(1.9)
Relations (1.5), (1.7) and (1.9) are referenced as necessary optimality conditions for problem (1.8) at the point z. There are differents approaches and various results on necessary conditions for (1.8), expressed in terms of various generalized derivatives of F including Clarke's derivative 6 , limiting subgradient which is also known under other names : limiting subgradient set in Clarke 7 , approximate subdifferential in Ioffe 12 , subdifferential in Mordukhovich 27 , subgradient set in the general sense in Rockafellar 3 1 . Most of these results are obtained for Lipschitz, integrably sub-Lipschitz, bounded or unbounded differential inclusions. Our aim is to use the class of multivalued mappings introduced by Loewen and Rockafellar in 21 to get normality, local controllability and necessary optimality conditions for multiobjective optimal control problems with a general prefence. The remainder of the paper is arranged as follow : In section 1 we describe notations and background material that will be used. In section 2 we state our first result in which we establish that n o r m a l i t y implies
seminormality
provided that F is merly "Lipschitzian" in the sense of Loewen and Rockafellar 21 around the reference point.
362
The previous conditions of normality are expressed in terms of the coderivative. In section 3 we express them in terms of the Hamiltonian, and we call them Hamiltonian normality. We show that, under convexity assumptions, Hamiltonian normality implies
seminormality
In section 4, we treat normality, seminormality and local controllability of systems of Mayer type x(t) = f{t, x(t),w(t))
a.e.,
w(t) € W(t) a.e.,
(x(a),x(b))
£ S
n
where f:[a,b]xR x R™ i->- E" is a function, W is a measurable multivalued mapping from [a, b] into Rm and 5 is a closed set in E n x l " . To conclude, we develop in the last section necessary optimality conditions for multiobjective optimal control problems of the form (1.8) (i.e., / takes values, for example, in ffim) related to a general preference which includes preferences for weak Pareto optimum, generalized weak Pareto optimum, preference determined by an utility function as well as preference determined by the Lexicographical order. Note that there are many other types and refinements of necessary optimality conditions for real-valued objective (or weak Pareto for multiobjective) optimal control problems. Some relevant references are 13 , 21 23 34 15 33 , , , . A detailed discussion of the historical background of conditions akin to the Hamiltonian conditions is proved in a number of papers and books („ „
6
28 36
10 11
8
26
27
35-i
In , Zhu has substentially unified the various Hamiltonian optimality conditions that had previously been derived for "uniformly" Lipschitz bounded and convex-valued differential inclusions related to a general preference. The obtained conditions are expressed in terms of the Clarke's generalized derivative which is larger than the limiting Frechet subdifferential. In this paper, we obtain necessary optimality conditions for nonconvex-valued differential inclusions, which belong to the large class by Loewen-Rockafellar 21 , in terms of the limiting Frechet subdifferential and then we deduce the corresponding Hamiltonian conditions for convex-valued differential inclusions. We provide alternative proofs of these conditions. Those of Zhu 36 are based on recent progress in nonsmooth analysis, in particular, calculus for smooth subdifferentials of lower semicontinuous functions (4, 9 ) . The proofs in the present paper employ the results established in sections 1-3 related to normality, seminormality and local controllability.
363
2
Background
We begin by recalling some basic tools of generalized differentiation needed for our main purpose. Details may be found in 2 4 - 2 5 . Let C be a closed subset of W1 containing some point c. The e—normal cone to C at c is the set Ne(C;c) := k l " : liminf ^~},X ~f* > -e\ . \
x€C->c
\\x-c\\
~
J
The normal cone to C at c is the set N{C;c) := limsupN e (C,x). x£C->c
«->0+ n
Now let / : R 4 l U {00} be a lower semicontinuous (l.s.c.) function and let c G 1 " be such that /(c) < 00. The limiting FVechet subdifferential of / at c is the set df(c) := { ( £ ! " : (£, - 1 ) € N(epif; (c,/(c)))} where epif denotes the epigraph of / . We have the following analytic characterization of df(c) : df(c) = lim sup
d£f(x)
«->o+
where a,m
- (*• € X- : lim,nf /(» + * ) - / W - ^ . » > _A . h I -*° II h II J
Next we consider a multivalued mapping G from W1 t o E m of the closed graph GrG := {(x,y) : y £ G(x)} . The multivalued mapping D*G(x,y) D*G(x,y)(y*)
: Rm >->• W1 defined by
:= K € R n : (**,-*) € iV(GrG; (z,2/))}
is called the coderivative of G at the point (x,y) G GrG. From the definition of the limiting Frechet subdifferential we obtain the following result.
364
Lemma 2.1. Let G be pseudo-Lipschitzian (*•, 2, 32) around (x0,yo) G GrG with modulus K , i.e., there exists r > 0 such that for all x,u £ xo + rW> G(x) n (?/o + rB) C G(u) + K\x - u|B, Then for all y* e E " , with D*G(x0,y0)(y*) sup{|x*| :
^ 0, one has
x* G D*G(x0,y0)(y*)}
< K\y'\.
If in addition G is closed-valued, then for all (x, y) € (XQ + -j^B) x (yo + f^B), with (x,y) $ GrG, and all (x*,y*) 6 dd{-; G(-))(x,y) we have \y*\ = 1, and\x*\
Proof. It suffices to establish the second part, the first one follows from the definition of limiting Frechet subdifferential. Let (x,y) £ (xo + fjB) x (j/o + f^B), with (x,y) £ GrG, and let {x*,y*) € dd(-;G(-))(x,y). Then there are sequences Xk —>• x, yk —> y, x*k —> x*, yk —> y*, Ek —> 0 + and rk —> 0 + suc that d(v;G(u))~d(yk;G(xk))-(xl,u-Xk)-(y*k,v-yk)+ek[\u-Xk\
+ \v-yk\] > 0
for all u E Xk + r^B and v 6 yk + ffcB. For each integer k, there exists Vk 6 G(xk) such that d(yk;G(xk))
= \yk
-vk\.
So l 2 / ' - v | - \yk ~Vk\ - (x*k,u-xk)
- (yt,v-yk)
+£k[\u-xk\
+ \v-yk\]
>0
for all u E. xk + rkM, v 6 yk + rkW> and y' £ G(u). Consider the function g defined by g(u,y',v)
= \y' -v\-
{x*k,u-xk)
-{y*k,v-yk)
+ek[\u-xk\
+
\v-yk\].
Then (0,0,0) G dg(xk,vk,y,)
+ N(GrG;(xk,vk))
x {0}.
As for k large enough yk ^ Vk then dg(xk,vk,yk)
C {(0,«*,-«*): |«*| = 1} + ( - z J . O . - y J ) +<=*! x {0} x efcB
and hence we obtain («£>uit)
€
N(GrG; (xk,Vk)), with |ujj| = 1, such that
\x*k ~ ul\ < £fc Now since d(yk;G(xk)))
and
\Vk ~vt\<
£k-
= \yk — vk\, we get for k sufficiently large \Vk -vk\
< -
365
and hence \xo ~xk\
+ \yo-vk\
< -g--
Thus for all u, u' £ xk + | B F(u) n (vk + £ B ) C F(u') + K\u - u'|B. 6 So the first part of the lemma ensures that
\<\
Vx G V.
Hi) G is convex valued. Then G is upper semi-continuous (u.s.c.) on V, that is, for all u G V and all e > 0 there exists a neighbourhood U of u in V such that G(x) c G(u) + e l ,
Vx € U.
With the help of the last lemma, we can prove the following one. Lemma 2.3. Suppose that the mapping f: (xo,yo) + rW> i-t M. is Lipschitzian with constant K. Define the multivalued mapping V: {XQ , yo) + rM x E" x l n E n by T(x,y,p,s)
= co{q: (q,p) G df(x,y)
+ sM}.
Then for all A G]0,1[, all (x,y,s) G (xo,yo,0) + ArB and all p G Kra, with T(x,y,p,s) ^ 0 r is u.s.c. at (x,y,p,s) in the sense of Lemma 2.2. Proof. Note that ii) and Hi) of Lemma 2.2 are satisfied. It is not difficult to show that T is of closed graph and to apply Lemma 2.2.0 The following lemma is a small modification of the Clarke's one 5 . For the sack of completeness, its proof is included. Lemma 2.4. Lete>0 andT : [a, b]xE" x E " x E " x E H-> E n be a multivalued mapping such that for almost all t G [a, b], T(t, •) has nonempty, compact and convex values around (z(t),z(t),p,s), with s G [0,e] and T(z(t),z(t),p, s) ^ 0. For sequences (zk) and (pk) inW1'1, ((f>k) in Lx{[a,b],]0, +oo[), (Q^) and(sk)
366
in K+ with zk -> z in W1'1, <j>k ->
+oo[), for some integrable
s'): t£ [a, b], x' G z{t) + e l , y' G z{t) + e l , s 'e[o, e ],r(t,x', 2 /',y, s ')^0}
£/ie multivalued mapping t' t-> T(t',x,y,p, s) is measurable. ii) For all k, \pk(t)\ < (f>k(t) for almost all t G [a, b]. Hi) For all k, pk(t) e r(t,zk{t),zk(t),pk(t),sk) + akM a.e. t€ [a,b]. iv) For almost all t € [a, b], for every p 6 E" with T(t, z(t), z(t),p, 0) ^ 0, the multivalued mapping (x',y',p',s') i-> T(t,x',y',p',s') is upper semicontinuous at (z(t),z(t),p,0). v) The sequence (pk(a)) is bounded. vi) There exists an integrable function ip such that sup max \y\ < ip{t) a.e. {(p',s'):s'e[o,e], r(t,z(t),i(t),p',s')^0}ver(t,2(t),z(«),p',s') Then there is a subsequence of (pk) which converges uniformly to an arc p satisfying p(t) G T(t,z(t),z(t),p(t),0)
a.e.t G [a,b].
Proof. By ii) and v), the sequence (pk) is bounded and equicontinuous, so there is a subsequence (we do not relabel) converging uniformly to a limit p. Using Dunford-Pettis criterion, we may extract a subsequence (we do not relabel) converging weakly to a limit p' in L1([a,b],Mn). From pk(t) = pk(a) + Ja pk(s)ds we deduce p(t) = p(a) + Ja p'(s)ds and hence p = p'. Since (zk) converges to z in W 1 ' 1 , we may assume that (zk(t)) converges to z(t) almost every t G [a, b]. Let J C [a, b] be a set such that meas(J) = 0 and for all t G [a, b] ("1 Jc \imzk(t)
= z(t)
and for all k \Pk(t) < \Mt)\,
Pk(t) € T(t,zk(t),zk(t),pk(t),sk)
+ akM.
Consider the function h defined by h(t,x,y,p',q,s)
= max{(q,w):
w G
T(t,x,y,p',s)}.
It is easy to see (via vi)) that t •->• h(t, z(t), z(t),p(t),q, s) is integrable. Let q G W1 and let t G [a, b] (1 Jc. Since F(t, •) is upper semicontinuous at (z(t),z(t),p(t),0) and h(t,zk(t),zk(t),pk{t),q,sk) > (q,pk(t)) - ak\q\ one has limsup(q,pk(t)) k
<
h(t,z(t),z(t),p(t),q,0).
367
Note that, by ii), \(q,Pk(t))\ < 0fc(*)M- We then invoke Fatou lemma to get for any measurable set I in [a, b], the integrability of the function limsupfc(<7,pfc(i£)) and the inequality limsup / (q,pk(t))dt < / k Ji Ji
\imsup(q,pk(t))dt. k
As (p'fc) converge weakly to p in L 1 ([a, 6],lRn), one has / (q>p(t))dt < / Ji Ji
\imsup(q,pk(t))dt. k
So for each measurable set J in [a, b] one obtains
J{q,p{t))dt < J h(t,z{t),z(t),p(t),q,0)dt. It follows that {q,p(t)) < h(t,z(t),z(t),p(t),q,0) almost everywhere. Since h is continuous in q, this inequality can be obtained for all q, except for t in a set of measure 0. Thus p(t) 6 T(t, z(t),z(t),p(t),0) almost everywhere.^ We conclude this section by recalling necessary optimality conditions for the following generalized problem of Bolza m i n ^ ( i ( a ) , a ; ( 6 ) ) + / L(t,x(t),x(t))dt\
(PB)
where the functions L : [a,b] x Rn x M" 4 l U {+00} and l i f x l M R U {+00} are such that for each t £ [a, b], the functions L(t, •, •) and I are l.s.c. o n l " x W1. The function L is epi-Lipschitz at an arc z if there exist an integrable function k : [a, b] i-> R and a positive e satisfying the following conditions : for almost all t £ [a, b], given two points z\ and Z2 within e of z(t) and ui G Rn such that L(t, zi, ui) is finite, there exist a point 112 6 Rn and 8 > 0 such that L(t,Z2,U2) is finite and |"i -u2\
+ \L(t,z1,u1)
- L(t,z2,u2)
-S\<
k{t)\zi - z2\.
This is equivalent to saying that the multivalued mapping E(t, s) = {{u,r) e E" x R : L{t, s,u) < r} is Lipschitzian in x on z(t) + eM. L is said to be epi-measurable (in t) if for each s € E", the multivalued mapping E{t, s) is Lebesgue measurable in t.
368
The notation dL will denote the limiting Frechet subdifferential of the function L(t,.,.). Now we may state a variant of the necessary conditions for the generalized problem of Bolza established in Jourani 15 . Theorem 2.5. Let z solves locally the generalized problem of Bolza (PB) (in W1'1). Suppose that L(t,x,u) is epi-measurable in t, and epi-Lipschitzian at z and £ is locally Lipschitzian around (z(a),z(b)). Then there exists an arc p such that one has : p(t) £ co{q :
(q,p(t)) 6 dL(t,z(t),z(t))}
a.e.t € [a, b]
(p(a),-p(b))Ed£(z(a),z(b)) (p{t),z{t))-L(t,z{t),z{t)) 3
= max{(p(t),v)
-L(t,z{t),v)
: w e H"}.
N o r m a l i t y implies s e m i n o r m a l i t y
In developing the theory on normality and seminormality of the system (1.1) and on necessary optimality conditions for the multiobjective control problem (1.8), one property on the differential inclusion, which intervenes in (1.1) and (1.8), turns out to be particulary important: the sub-Lipschitz condition. Definition 3.1. F is said to be sub-Lipschitzian in the sense of LoewenRockafellar 21 at z if there exist /3 > 0, e > 0 and a summable function k : [a, b] H» E such that for almost all t £ [a,b], for all N > 0, for all x,x' 6 z(t) + e l and y € z(t) + NM one has d(y, F(t, x)) - d(y, F(t,x'))
< (k(t) + 0N)\x - x'\.
This is not the original definition by Loewen-Rockafellar 21 , but both concepts are equivalent. Note that this definition is very similar to the notion of sub-Lipschitzian behaviour introduced by Rockafellar 3 2 , the only difference being that here we consider multivalued mapping with explicit timedependence, with linear Lipschitz constant (in N) and e not depending on N. Rockafellar offers a detailed discussion of (autonomous) sub-Lipschitzian multivalued mappings and the relationship between this property and the pseudo-Lipschitz continuity introduced by Aubin l. Here and throughout the paper we assume the following basic hypothesis: ( H i ) for each x, the multivalued mapping t i-> F(t,x) the values of F are closed.
is measurable and
369 We begin with the following lemma which gives us a necessary condition for nonseminormality. Lemma 3.2. Let G be defined as in (1.2), N > 0 and let z be a solution of system (1.1). Suppose that i) F is sub-Lipschitzian at z. ii) System (1.1) is not semi-normal at z. Then there are sequences sk —> 0 + and (zk,Uk,Vk) —• (z,0,0), with Zk ^ G(uk,Vk) for all k, and (pk) in W1'1, with max |pfc(i)| = 1, such that t€[a,b]
iPk(a),-pk(b))
edd((zk(a)+Uk,zk(b)+vk);S)+skM
(3.1)
Pk{t) £ co {q: (q,pk(t)) £ dd(-;F(t,-))(zk(t),zk(t)) {Pk(t),h(t)) - d(zk(t));F(t,zk(t))) = max {(pk(t),v) - d(v;F(t,zk(t)))
+ s*B} a.e.
(3.2)
- sk\v - zk(t)\} a.e. (3.3)
»ei(t)+(JV+|i t (t)|)B
Proof. Consider the function h denned by h(x,u,v) = d{{x(0)+u,x{l)+v);S)+ / d(x{t);F(t,x(t)))dt Ja and recall that the multivalued mapping G is defined by G(u,v) = {xe
W1'1:
(x(a)+u,x(b)+v)
£ S, x(t) €
F(t,x(t))a.e.}.
Note that the values of G are closed. Suppose that system (1.1) is not semi-normal at z. Then there are sequence (xk,uk,vk) —> (z,0,0) in W1'1 x 1 W x M" such that, for k large enough d(xk,G{uk,vk))
>
kh(xk,uk,vk).
Define the function fk(x) =
h{x,uk,vk)
By (i), fk is lower semicontinuous on the set V = B(z,e).
Set ek = V'fk{xk)
>
2
0, \k = min(efc,fce^) and sk = ^-. Then ek -> 0 + and sk -> 0 + . Therefore one has fk(xk)
< inf
fk(x)+e2k.
xEV
So Ekeland variational principle produces a n z ^ E V satisfying \\zk-xk\\<\k
(3.4)
370
fk(zk) < fk(x) + sk\\x - zk\\,
VxEV
(3.5)
Observe that for k sufficiently large zk is an internal point of V, by (3.4), Zk ^ G(uk,Vk), and that equation (3.5) implies that Zk is a local solution of the following Bolza problem mm{ek(x(a),x(b))+
where lk(u,v)
/ Lk(t,x(t),x(t))dt} Ja = d((u + uk,v + vk);S) + sk\u - zk(a)\ and
Lk(t,x,y) = I d(V>F^x)) + S*\V ~ **(*)! if (x.tf) e A(t), othewise.
where A(t) = {zk(t) + e l ) x (z(t) + (N + |i*(t)|)B). Apply Theorem 2.5 to this problem, this produces an pk in W1'1 satisfying pk(t) E co{q :
(q,Pk(t)) € dLk(t,zk(t),zk(t))} (Pk(a),-pk{b))
e
a.e.t € [a,b]
dlk{zk{a),zk{b))
{pk(t),zk(t)) - Lk(t,zk(t),zk{t)) =max{(pk(t),v) - Lk(t,zk(t),v) : v € En}. These three relations imply (3.1)-(3.3) and the proof of the lemma is complete.^ Now we combine the previous efforts to prove that normality condition implies seminormality without boundedness and Lipschitz continuity assumptions. Theorem 3.3. Suppose that F is sub-Lipschitzian at a point z which is a solution to system (1.1). Then the normality of the system at z implies its seminormality at z and hence its local controllability. Proof. Suppose the contrary. Then, by Lemma 3.2, there are sequences sk ->• 0+ and (zk,uk,vk) -> (z,0,0), with zk £ G(uk,vk) for all k, and (pk) in W1'1 satisfying (3.1)-(3.3). Then there exists z\ € dd((zk(a) + uk,zk(b) +vk);S) with either | ^ | = 1 if {zk(a) + uk,zk(b)+vk)
i S
(3.6)
or (because of Lemma 2.1) |P*(*))I = 1 (3-7) for all t of a set of a positive measure on which zk{t) $. F(t, zk{t)) such that (Pk(a),-pk(b))-zleskM
(3.8)
371
pk(t)eT(t,zk(t),zk(t),pk(t),sk)
+ skM
a.e.
(3.9)
a.e. (3.10)
where r : [a, 6] x E" x E" x E n x E+ (-• E n is the multivalued mapping defined by T(t,x,y,w,s)=
co {q: (q,w) £ dd(-;F(t,-))(x,y)
+ sM} .
Extracting subsequences if necessary we may suppose that z\ —> z* for some z* in cW(a;(a),a;(fr));S) with |z*| = l i f (zk(a) + uk,zk(b)
+ vk) $ S.
On the other hand, by Lemma 2.3, the multivalued mapping T(t, •) is upper semicontinuous with compact convex values and by the definition of the limiting Frechet subdifferential and the sub-Lipschitz condition we have for all k \pk(t)\
+ P(N + \zk(t)\)
a.e.
(3.11)
By Lemma 2.4 there exists a subsequence of (pk) converging uniformly to an arc p satisfying p(t)er(t,z(t),z(t),p(t),0)
a.e.
(3.12)
and hence we obtain, by passing to the limit in (3.8) and (3.10), (p(o), -p(b)) G dd(S; (z(a),z(b)))
(3.13)
(j>{t),z(t))=
max (p(t),v)a.e. (3.14) veF(t,z(t))n(m+(N+\m\W) Now because of (3.6)-(3.7) the pair (z*,p) must be nonzero. In fact we have max|p(t)| = l.
(3.15)
t£[a,b]
Now not forget that p depends on N. So we obtain that the sequence (p^) satisfies (3.11)-(3.15). Again Lemma 2.4 produces a subsequence of (JPN) converging uniformly to some p which satisfies, by (3.11)-(3.15), the following p{t)£F{t,z(t),z(t),p(t),0) (p(a),-p(b))
e
a.e.
dd(S;(z(a),z(b)))
372
{p(t),z(t))=
max
(p(t),v).
v£F(t,z(t))
max \p(t)\ = 1. t€[a,b]
'
This contradicts the normality of our system at z and the proof is complete. 0 R e m a r k 3.4. Note that in the proof of Theorem 3.3 we only used the following condition p(t) 6 co {q: (q,p(t)) G dd(-;F(t,-))(z(t),z(t))},
a.e.
(3.16)
instead of (1.5) The following corollary extends the result by Kaskosz and Lojasiewicz 20 from the Lipschitz case to the sub-Lipschitz one. Corollary 3.5. Let F be as in Theorem 3.3 and let S be of the form Sa x W1. Then system (1.1) is normal at z and hence it is semi-normal and locally controllable at z.
R e m a r k 3.6. To incorporate the constraint g{x(a),x(b))
£K
in our problem, it suffices to see that for S = {(u,v) GC : g(u,v) € K} and (u*,v*) £ dd(S;(uo,Vo)) with (A,y*) ^ 0, such that
there exist A > 0 and y* €
\(u*,v*) € d(y*og)(u0,v0)+N{C; provided g is locally Lipschitzian around 4
dd(K;g(uo,Vo)),
(u0,v0))
{UQ,VQ).
Hamiltonian normality implies seminormality
All the results presented in the previous section may be expressed in terms of the Hamiltonian instead of the coderivative provided that F is convex-valued. The Hamiltonian associated to the multivalued mapping F is given by H(t,x,p)=
sup
(p,v).
v€F(t„x)
System (1.1) satisfies the Hamiltonian normality condition at z (or simply H-normal at z) if there is no p 6 W1'1, p^Q, with the following properties p(t) £ co {q : (-q,z(t))
G dH(t,z{t),p(t))}
a.e. t € [a,b],
(4.1)
373
(p(a),-p(b))
£
N(S;(z(a),z(b))),
(p(t),z(t)) = H{t,z{t),p(t))
a.e. t £ [a,b}.
Note that as in the proof of Theorem 3.3 we only used (3.16). But we now that (31 and 3 ) for convex-valued sub-Lipschitz multivalued mapping F, relations (4.1) and (3.16) are equivalent (for max t6 [ ai6 ] \p(t)\ < 1). With the help of the previous comments, we may state the following theorem. Theorem 4.1. Suppose that F is convex-valued and sub-Lipschitzian at z where system (1.1) is H-normal at z. Then the system is semi-normal at z and hence locally controllable at this point. 5
Normality implies seminormality for systems of Mayer type
In this section, we consider the following system of Mayer type x(t) = f(t,x(t),w(t))a.e., 1
w{t) £W(t)a.e.,
(x(a),x{b))
£S
(5.1)
m
where / : [a, b] x W x ffi n 1 " is a mapping, W is a measurable multivalued mapping from [a, b] into E m with closed values and S is a closed set in l n x l n . We note L1 the set of all integrable mapping from [a, b] into ffim endowed with the usual norm ||w||i = J" \w(t)\dt. The space W1'1 x L1 is equiped with the norm||(a:,«;)|r = ||a:|| + | H | i . We consider the sets G{u,v) = {(x,w) G W1'1 x L1: x{t) = f{t,x(t),w(t))a.e.,w(t) 6 W(t)a.e., (x{a) +u,x(b) + v) e S} and R = {(x,w) eW1'1
x L1: x(t) = f(t,x{t),w{t))a.e.,
w{t)
eW(t)a.e.}.
We say that system (5.1) is semi-normal at (z, w) if there exist a > 0 and r > 0 such that d{(x,q),G(u,v))
<
ad((x(a),x(b));S)
for all x € B{z,r) and q £ B(w,r), with (x,q) £ R, and all u,v £ r l . The system is locally controllable at {z,w) if there exist a > 0 and r > 0 such that G(u, v) n B{{z, w), a(\u\ + \v\)) ^ 0,
W, v £ r l .
374
The system is said to be normal at z if there is nonzero p £ W1,1 satisfying p ( i ) 6 c o { q : ( 9 ,0) € d[-(p(t), f(t,.,.)) (p(a),-p(b)) (p(t),f(t,z(t),w(t)))
G
+ ^u{t)(-)](z(t),w(t))}
a.e.
dd((z(a),z(b));S)
=
mBxMt)J(t,z(t),q))a.e. q€U(t)
We have the following theorem whose proof is in the spirit of that of Theorem 3.3. Theorem 5.1. Suppose that - f(t, x, q) is measurable in t and continuous in q - there exist e > 0 and an integrable function k : [a, b] i-> E such that, for almost all t 6 [a,b], given two points z\ and z2 within e of z(t), q\,q2 € w(t) + eB and q 6 U(t) we have \f(t,Z!,q) \f(t,Z1,q1)-f(t,z2,q2)\
- f(t,z2,q)\
< k{t)\zi - z2\
<*(*)[l*l ~ Z2\ + \qi - q2\].
Then the normality of (5.1) at z implies its seminormality and hence its local controllability. The proof of this theorem is based on the following lemma whose proof is similar to that of Lemma 3.2. Lemma 5.2. If the assumptions of Theorem 5.1 are satisfied and that system (5.1) is not semi-normal at z, then there are sequences s^ -> 0 + , u t -» 0, Vk —>• 0 in W1, u>k —> w in L1 and Zk —> z in W1,1 such that hi) {zk(a) + ufc, zk{b) + vk) $ S h2) (zk,uik) € R ^3) {z,k,Wk) is a local solution of the control problem min (x>g) fk(x(a),x(b)) + fa Lk(t,x(t),q(t))dt over all (x, q) satisfying: ±(t) = f{t, x(t), q(t)) a.e., q{t) € W(t) a.e. (5.2) where fk(u,v) = d((u + uk,v + vk);S) + ek\u - zk(a)\ ek[\f{t,u,v)-f{t,zk{t),wk{t))\ + \u-uk(t)\}.
and Lk(t,u,v)
=
375
6
N O C for multiobjective optimal control problems
In this section, we apply the results developed in the previous sections to produce necessary optimality conditions for the following multiobjective optimal control problem min f(x(a),x(b)) over all arcs x satisfying : (x(a),x(b)) e S, x(t) £ F(t,x{t))
(6.1) a.e.t £ [a, b]
where / : E n x W1 i-* E m is a mapping, F: [a, b]xRn >->• E" is a closed-valued multivalued mapping which is measurable in t £ [a, b], S C M" x E" is a closed nonempty set. These conditions are related to a general preference. Let -< b e a nonreflexive preference for vector in E m . Let z be a feasible trajectory for (6.1). We say that z is a solution to problem (6.1) if there exists no feasible trajectory x for (6.1) such that f(x(a),x(b)) -< f(z(a),z(b)). For any r € E m , we denote C{r) = {s E E m :
s
36
As in , we need the following regularity assumptions on the preference. Definition 6.1. 36 We say that a preference -< is regular at r € E m provided that (Di) for any s € E m , s£ clC(s); (£>2) for any r -< s, t € clC(r) implies that t -< s; (D3) for any sequences rk,0k *-> r in W71 limsupN(clC{rk),0k)
C
N(clC(r),r).
k—>+oo
There are many examples of regular preferences, including the following ones 36 : 1) (m = 1) r -< s ^=> r < s. 2) (The weak Pareto optimum) r -< s <$=> r* < Sj, i = 1, - • • , m and at least one of the inequalities is strict. 3) (A generalized weak Pareto optimal) r -< s <$=> r - s € K, where K C E m is a closed cone. 4) (Utility function) r -< s <==> u(r) < u(s), where u is a continuous utility function. This preference is regular at r provided that lims_>r d(0,du(s)) > 0. 5) Preference determined by the lexicographical order.
376
In 36 , Zhu established Hamiltonian conditions for multiobjective optimal control problems involving Clarke's generalized gradient and assuming that F is "uniformly" Lipschitzian and bounded. The aim of the following theorem is to refine and extend the previous works to the large classes of sub-Lipschitz differential inclusions. First we recall that the Hamiltonian associated to F is defined by H(t,x,y)=
sup
(y,v).
vEF(t,x)
T h e o r e m 6.2. Let z be a local solution to the multiobjective optimal control problem (6.1). Suppose that F is sub-Lipschitzian at z and that the preference -< is regular at f(z(a),z(b)). Then there exist p € W 1 ' 1 , A > 0 and w £ N(cl£(f(z{a),z(b))),f(z(a),z(b))), with \w\ = 1, such that X+\p(t)\^0,Vte[a,b};
(6.2)
p(t) e coD*F(t,z(t),z(t))(-p(t))
a.e. t € [a,b];
(p(o), -p(6)) G Xd((w, / ( - , •)»(*(«), z{b)) + N(S; (z(a), z(b));
(6.3) (6.4)
(p(t),z(t)) = H(t,z{t),p(t))a.e.t e [a,b\. (6.5) If in addition F is convex-valued, then (6.3) may be replaced by the following one p(t)E co {q: (-q,z(t))
EdH(t,(z(t),p(t))}
a.e.te[a,b].
(6.6)
Proof. We distinguish two cases : 1) When system (1.1) is not semi-normal at z : Then, by Theorem 3.3, system (1.1) is not normal. Thus there exists p € W1'1, with p^O, such that relations (3.16), (1.6) and (1.7) hold and the proof is complete. 2) When system (1.1) is semi-normal at z : Let fcbea positive integer and choose 0* -< f(z(a),z(b)) such that \6k — f(z(a),z(b))\ < p- and define 0 := clC(0k) and T be the solution set of system (1.1). Finally, define the function h(xe)
=
l\f(^),x(b))-e\ \ +co
i£xeB(z,8l), otherwise.
where si is such that / is Lipschitzian on (z(a), z(b)) + siB with constant kf. Because of (.Di), {z,9k) € T x 0 and hence h(z,9k)<
inf
(x,$)€Txe
h(x,0) + —. 2 k
377
Note that T and 0 are closed in W1'1 and R m respectively, and that h is lower semicontinuous on T x 0 . By Ekelan d variational principle there exists (zfc,7fc) € T x 0 such that
||zk-z|| + l 7 * - 0 * l < ^
(6-7)
and Kzk,lk)
< h(x,9) + ±[\\zk-x\\
+ |7fc - 0\], V(x,6) € T x 0
(6.8)
^Prom (6.8) one gets M**,7fc) < Kx,lk)
+ -j^\\zk ~ x\\, Vx € r
(6.9)
M**,7fc) < h(zk,8) + i| 7 f c - 9\, V0 e 0
(6.10)
and
Since z is an optimal solution to problem (6.1), then, by (D2) and the choice of 0k, one has 7* ± f(zk(a),zk(b)). Set wk = ^ { ^ { ^ Z ^ - Extracting subsequence we may assume that (wk) converges to some w, with |u>| = 1. Thus, by (6.10) and (D3), one has w e limsupN(cl£{6 k ), l k ) C
N(clC(f(z(a),z(b))),f(z(a),z(b))).
fc—y+00
Now from (6.9) and the seminormality of (1.1) one gets the existence of a > 0 and s\ > s > 0 (both not depending on k) such that h(zk,-yk) < h(x,lk)
+ -\\zk - x\\ + a(kf + l)[d({x{a),x{b));S)
f
+
b
d(x(t),F(t,x(t))dt]
Ja
for all x £ B(zk,s). Define the functions h(u,v)
= \f(x{a),x(b))
- 7 / f c | + -\x(a) - zk(a)\+a(kf
+
l)d((x(a),x(b));S)
and
L
'
\+oo
othewise.
378
where A{t) = (zk(t) + eB) x (z{t) + (N + \zk(t)\)M). solution of the Bolza problem min{4(a;(a),x(6))-l- /
So that zk is a local
Lk(t,x(t),x(t))dt}
Ja
and the proof follows as in Theorem 3.3, by remarking that 0(1/0,') - lk\)(zk(a),
zk(b)) C d((wk,f(;
-)))(zk(a), zk(b)).<>
References 1. J. P. Aubin Lipschitz behaviour of solutions to convex minimization problems, Math. Oper. Res. 8, 87-111 (1984). 2. J. P. Aubin and H. Frankowska , On the inverse function Theorem, J. Math. Pures Appliquees 66, 71-89 (1987). 3. D. N. Bessis, Yu. S. Ledyaev and R. B. Vinter, Dualization of the Euler and Hamiltonian inclusions, to appear in Nonlinear Analysis TMA, (1998). 4. J. Borwein and A. Ioffe , Proximal analysis in smooth spaces, Set-Valued Analysis, 4, 1-24 (1996). 5. F.H. Clarke The generalized problem of Bolza, SIAM J. Control Optim., 14, 682-699 (1976). 6. F.H. Clarke Optimization and nonsmooth analysis, (Wiley-Interscience, New-York 1983). 7. F. H. Clarke Methods of dynamic and nonsmooth optimization, (CBMSNSF Regional Conference Series 57, SIAM, Philadelphia, 1989). 8. F.H. Clarke and P. R. Wolenski, Necessary conditions for functional differential inclusions, Appl. Math. Optim., 34, 51-78 (1996). 9. F.H. Clarke, Yu. S. Ledyayev, R. J. Stern and P. R. Wolenski, Nonsmooth analysis and control theory, (Graduate texts in Mathematics, Vol. 178, Springer-Verlag, New York, 1998). 10. H. Frankowska, The maximum principle for an optimal solution to a differential inclusion with endpoint constraints, SIAM J. Control Optim., 25, 145-157 (1987). 11. H. Frankowska, Local controllability and infinitesimal generators of semigroups of set-valued maps, SIAM J. Cont. Optim., 25, 412-432 (1987). 12. A. D. Ioffe, Approximate subdifferentials and applications I : The finite dimensional theory, Trans. Amer. Math. Soc, 281, 389-416 (1984). 13. A. D. Ioffe, Euler-Lagrange and Hamiltonian formalisms in dynamic optimization, Trans. Amer. Math. Soc, 349, 2871-2900 (1997).
379 14. A. D. Ioffe and R. T. Rockafellar, The Euler and Weierstrass conditions for nonsmooth variational problems, Calculus of Variations and PDE, 4, 59-87 (1996). 15. A. Jourani, Lagrangean and Hamiltonian necessary conditions for the generalized Bolza problem and applications, (Preprint Universite de Bourgogne, January 2000). 16. A. Jourani, Hoffman's error bound, local controllability and sensitivity analysis, SIAM J. Control Optim., 38, 947-970 (2000). 17. A. Jourani, Qualification conditions for multivalued functions in Banach spaces with applications to nonsmooth vector optimization problems, Math. Prog., 66, 1-23 (1994). 18. A. Jourani, L. Thibault, Verifiable conditions for openness and metric regularity of multivalued mappings in Banach spaces, Trans. Amer. Math. Soc, 347, 1255-1268 (1995). 19. A. Jourani, L. Thibault, Coderivatives of multivalued mappings, locally compact cones and metric regularity, Nonlinear Anal. Th. Meth. Appl., 35, 925-945 (1999). 20. B. Kaskosz, S. Lojasiewicz Jr, Lagrange-type extremal trajectories in differential inclusions, Systems Control Lett., 19, 241-247 (1992). 21. P.D. Loewen and R.T. Rockafellar, Optimal control of unbounded differential inclusions, SIAM J. Control Optim., 32, 442-470 (1994). 22. P.D. Loewen and R.T. Rockafellar, New necessary conditions for the generalized problem of Bolza, SIAM J. Control Optim., 34, 1496-1511 (1996). 23. P.D. Loewen and R.T. Rockafellar, Bolza problems with general time constraints, SIAM J. Control Optim., 35, 2050-2069 (1997). 24. B.S. Mordukhovich, Maximum principle in problems of time optimal control with nonsmooth constraints, J. Appl. Math. Mech., 40, 960-969 (1976). 25. B.S. Mordukhovich, Metric approximations and necessary optimality conditions for general classes of nonsmooth extremal problems, Soviet Math. Dokl., 22, 526-530 (1980). 26. B.S. Mordukhovich, Optimal control of difference, differential and differential-difference inclusions, J. Math. Sci., to appear (1999). 27. B.S. Mordukhovich, Discrete approximations and refined Euler-Lagrange conditions for nonconvex differential inclusions, SIAM J. Control Optim., 33, 882-915 (1995). 28. B.S. Mordukhovich , Approximation methods in problems of optimization and control, (Nauka, Moscow (Russian); English trans, to appear in Wiley-Interscience, 1988).
380
29. R. T. Rockafellar, Conjugate convex functions in optimal control and the calculus of variations, J. Math. Anal. Appl., 32, 174-222 (1970). 30. R. T. Rockafellar, Existence and duality theorems for convex problems of Bolza, Tran. Amer. Math. Soc, 159, 1-40 (1971). 31. R. T. Rockafellar, Equivalent subgradient versions of Hamiltonian and Euler-Lagrange equations in variational analysis, SI AM J. Control Optim., 34, 1300-1315 (1996). 32. R. T. Rockafellar, Lipschitzian properties of multifunctions, Nonlinear Anal. TMA, 9, 867-885 (1985). 33. H. Sussmann , A strong version of the Lojasiewicz maximum principle, in Optimal Control of Differential Equations, (N. H. Pavel, Ed., M. Dekker, 293-309, 1994). 34. R. Vinter and H. Zheng, The extended Euler-Lagrange condition for nonconvex variational problems, to appear in SIAM J. Control Optim., (1998). 35. Q. Zhu , Necessary optimality conditions for nonconvex differential inclusions with endpoint constraints, J. Differential Equations, 124, 186-204 (1996). 36. Q. Zhu, Hamiltonian necessary conditions for a multiobjective optimal control problem with endpoint constraints, CECM Research Report, 99, 129 (1999).
381
CONTROLLABILITY A N D C O O R D I N A T E S OF T H E FIRST KIND MATTHIAS KAWSKI Department of Mathematics Arizona State University, Tempe, Arizona 85287-1804, U.S.A. E-mail: [email protected], http://math.la.asu.edu/~ kawski Since the articles on nonlinear controllability of the early 1970s by Jurdjevic and Sussmann, Lobry, Brockett, Hermes and Haynes and others, one common thread of the differential geometric approach has been the desired conceptual separation of the time-varying features from the fixed geometrical structures. Integral manifolds are intrinsically related to Lie-brackets of vector fields, and they determine accessibility. Iterated integral functional of the controls carry a natural chronological algebra structure, and they carry the information that distinguishes controllability from accessibility - in the case of nonlinear systems. This article surveys some of the progress made in the study of nonlinear controllability since the 1970s, leading to a modern reformulation of the concept of obstructions to controllabilty (traditionally misnamed bad brackets) which takes advantage of newly found formulas for the logarithm of the Chen-Fliess series, i.e. using coordinates of the first kind, and comparing them to the established formulas using coordinates of the second kind.
1
Introduction
The study of "controllability of nonlinear systems"12 began in the early 1970s, when a sequel of path-breaking articles by Jurdjevic and Sussmann, Lobry, Brockett, Hermes and Haynes, and others pioneered differential geometric methods in system theory (a term coined in 4 ) , and thereby laid the ground for what since has a become a rich research field. The simple example considered in another early article 3 3
{
±i = u
IMOI <
x2 = xx X% — X2
c
ar(0) = 0
(1.1)
1 Xj
nicely illustrates some of the questions about nonlinear controllability that have been studied since, and some of which still remain open. The most basic question is, in terms of this example, whether by means of solutions curves x(-) corresponding to (e.g. measurable) controls u(-) it is possibly to reach every point x e R 3 (global controllability), or any point in an open neighborhood of x(0) = 0 (local controllability), whether this is possible only
382
for large times and/or large control bounds c > 0, or also in arbitrarily short times T > 0 or arbitrarily small control bounds. Alternatively, if the system is not controllable (about x = 0 in small time) then the reference trajectory x(t) = 0 is an extremal in the sense of optimal control. In this simple example, the {x\, a^-subsystem is linearly controllable, and the interest stems from the tug-of-war between the definite term x\ which might conceivably cause ±3 > 0 for all small times and small controls, and the indefinite term x\ which might provide nonlinear controllability. The situation is delicate as x\ is, in some well-defined sense, of lower order than x\ with respect to the control (bound), whereas the situation is reversed with respect to time. Work of the last three decades has led to ever more refined technical notions and solution techniques that are originally inspired by these simple questions. In this article we survey some of the progress made in the use of differential geometric methods in problems related to nonlinear controllability, culminating in the exploration of some of the newly available tools. The approach considered here is based on a fundamental paradigm of splitting the contributions of the time-varying control from the differential geometric structure defined by the time-invariant vector fields. As such, the formulae needed are basically continuous analogues of the famed Campbell-Baker-Hausdorff formula which dates back to the turn of the century. The results are applicable to time-varying differential equations in a much broader sense, which includes many recent applications in chemistry and physics, deformations and quantum groups, as well as novel algorithms for numerical integration on manifolds see e.g. 7>8>25. They basically just ask for interpreting the function u as e.g. a perturbation (or general time-varying function) rather than thinking of it as a control that may be freely chosen. Clearly, the solution formulas and differential geometric features are unaffected by such re-interpretation. However, for the sake of clarity, this article will throughout use the original control language as in the original article 12 . This article is organized as follows: Following the introduction we introduce the basic technical terminology and surveys key methodologies and results related to controllability without going into a deeper technical discussion. This is followed by a more precise treatment that involves liftings to free systems and an efficient combinatorial algebraic machinery. The objective is the subsequent discussion of the logarithm of the Chen Fliess series in the context of controllability, and, in particular, its utility to move from the vague term "bad brackets" to more precise notions of supporting hyperplanes (in the context of coordinates of the first kind). A short conclusion summa-
383
rizes the relative advantages of various descriptions for such different purposes as explicitly calculating solution curves or providing a concise geometric description. 2
Integral manifolds and controllability
This section introduces some basic technology and surveys some earlier developments in the area of controllability. The emphasis is on outlining the historical development of key ideas and strategies, and leaving technical detail to the original references (unless needed in the sequel), or to the later sections which also provide more precise definitions of some imprecise or misleading vocabulary of the 1980s. Suppose that /o, / i , • • • /m are real analytic vector fields on a manifold, which, since are interested in local properties, may be taken to be R n . These define an affine control system of the form m
x(t) = f0(x(t)) + Y, «*(*) /<(*(*))
(2-1)
where the set U of admissible controls u = ( u i , . . . u m ) consists of all measurable functions of time (defined on some interval [0,T] or [0, oo)) that take values in a set U C R m , usually taken to be compact and convex, and containing u = 0 in its interior. A typical choice, considered from here on, is the unit cube U = [—1, l ] m . For each control u(-) there is exists a unique solution denoted x(-) = x(-,u;p), defined on a maximal interval [0,Tfin) (with Tfin possibly infinite), of the initial value problem (2.1) with initial condition x(0) = p. We work with a fixed initial point, and thus usually take w.l.o.g. p = 0. The reachable set TZp(t) = {x(t,u;p) : u E U} (at time t > 0, from p) is the set of all endpoints x(t,u;p) of trajectories starting at t = 0 at x(0) = p and which correspond to admissible controls u € U. We note on the side that in case one is only interested in the interior points of the reachable set, it suffices to consider piecewise constant controls. However, points on the boundary of the reachable set, of most interest in optimal control, may not be reachable in the same time using such controls. Thus in order to avoid such technical difficulties, it is more convenient to work with the space of measurable controls. The most fundamental dichotomy is whether the reachable set 1Zp(t) has n-dimensional interior or whether it is contained in some lower dimensional
384
submanifold through p 6 R™. In the analytic category 31 a simple algebraic criterion for this integrability condition is provided by the Lie-algebra rank condition (LARC): Let L(fo,fi,...fm) denote the Lie algebra generated by the vector fields fk with the usual Lie product [/, g] = f o g - g o / of vector fields, considered as composition of partial differential operators. Then the reachable set TZp(t) has nonempty n-dimensional interior if and only if L(fo, • • • fm)(p) — {v(p) : v e L ( / 0 , . . . / m ) } is the entire tangent space T p R n . In this case the control system (2.1) is called accessible from p. A key feature of this algebraic rank condition is its geometric character. This may be considered a direct consequence of the fundamental fact that Lie brackets are geometric, i.e. invariant in the sense that 4>*[^,u>] = [$*v,$»w;] under changes $ of local coordinates. Unlike in the case of linear systems, accessibility does not imply controllability for nonlinear control systems as is illustrated in the most simple example l*1=\ (2.2) v \x2 = x* ' with k = 2. Here the Lie algebra L ( / o , / i ) generated by f0 = x\-^- and / i = a l l s P a n s r « R 2 a t e v e r v P ° i n t « e R 2 (using that [[/o,/i],/i] = 2 ^ ) and thus the system is accessible. Nonetheless, since X2(t,u; (pi,P2)) > P2 for any choice of the control, it is a priori clear that each reachable set ~R-P(t) (in forward time, from any initial point p) is contained in a half-space. A good starting point is the observation that the iterated Lie brackets of the vector fields fi, evaluated at p, loosely speaking, describe the directions in which the reachable sets extend from p. More precisely (see e.g. 15 ) one may define a (fc-th order) order tangent vector to the family {TZp(t): t > 0} of reachable sets at p to be any vector v £ ToR" for which there exists a sequence of times ti \ 0 and a parameterized family of control variations u^ : [0,U] i-» U such that x(ti,u^;p) —p + t^v + o{t\) (conveniently identifying T P R" with R n in this definition). It can be shown that the set of such tangent vectors is almost a truncated convex cone which approximates the reachable sets in a suitable sense 15 . The key observation is that certain iterated Lie brackets such as [h) [/1! [/1 > /o]]] give one both forward and backward controllability, while others such as [/1, [/1, /o]] only give one-directional accessibility. In the simple example (2.2) one easily calculates that /i(0) = gf- and (ad fc /i,/o)(0) = k\-^(defined by (&dt+1v,w)
= [v, (ad\w)])
are (up to Jacobi-identity etc.) the
385
only iterated Lie brackets of the / , that do not vanish at p = 0. It is apparent that one needs to distinguish the Lie brackets with k odd from those with k even, the first providing full control whereas the latter might be considered obstructions to controllability. In order to translate such observations into precise technical result, formally define the system (2.1) to be small-time locally controllable (STLC) (about p) if the reachable set TZp(t) contains the initial point p in its interior for every time t > 0. To facilitate the statement of typical conditions of STLC for single-input systems, i.e. m — 1, write S1 for the linear span of all iterated Lie brackets of the form ( a d J / 0 , / i ) with i > 0. Inductively define Sk+1 for the linear span of all Lie brackets [v,w] with v S S1 and w 6 Sk. The best-known sufficient condition for STLC if m = 1 is the Hermes condition, proved originally by Hermes for the case n = 2 and for the general case by Sussmann 3 2 : If the system (2.1) satisfies the LARC and <S2* (0) C S2k~l (0) for every integer k > 0 then the system is STLC about p = 0. Conversely, Stefani 29-30 showed that if (ad 2 */i,/o)(0) 0 S 2 * " 1 ^ ) for some k > 0 then the system is not STLC about p — 0. These conditions earned the respective iterated Lie brackets the suggestive, though potentially misleading, names "good brackets" and "bad brackets" (this author shares much of the blame for these misnomers, e.g. 1 7 ) . Loosely speaking good brackets are those that, when nonzero at the starting point, provide full (forward and backward) controllability, whereas bad brackets are potential obstructions to controllability, in the sense that, unless neutralized by lower order brackets will force the reachable set to lie on one side of some smooth hypersurface passing through p. We shall come back to these notions in the fourth section. In the quest for narrowing the gap between necessary and sufficient for STLC, the game-plan in the 1980s appeared to characterize a maximal set of bad brackets, and to find the most general notions for a set of brackets to neutralize a nonzero-bad brackets. Notable successes in this direction included Sussmann's general theorem for controllability 3 5 , and a necessary condition demonstrating that the iterated Lie bracket [[/ 0 , / i ] , [[/o, / i ] , /o]]] 14 is indeed an obstruction unless neutralized by a specified subset of S3. Related results such as the balancing of two bad brackets will be discussed in section four. These conditions may be derived by analyzing the endpoints x(T, uW,; 0) of trajectories arising from parameterized families of control variations u(s>: [0, T] i-> U . (This shows the close relation to conditions for optimall y of the reference trajectory x* = 0.) Basic families of control variations use the magnitude (|| • ||oo, inspired by classical calculus of variations), or the
386
length of the time subinterval on which u ^ is different from u* = 0 (needle variations), while more complicated constructions may e.g. be parameterized by their frequency of oscillations (increasing to infinity) 1 6 . A sufficiently smooth such family yields a higher order tangent vector to the reachable sets. The main technical analysis concerns the ways in which such variations can be combined, and through adapted open mapping theorems, one obtains notions of approximating cones for the reachable sets. One intuitive approach to exhibit good brackets is based on constructing families of control variations {u^}s>o that they generate a desired the iterated Lie bracket /^(O) as a tangent vector to the reachable sets. For some brackets fn one can show that replacing the family {u' s '} s >o by the family {u^}s>0 may generate the tangent vector —fn(0), while for other brackets both families generate the same tangent vector. For example, in the case of the system (2.2) the families of control variations defined on an interval [0, T] (for s > 0 sufficiently small) by „ f n . , . T J J : ' - ^ ; ; ^i ii »
«(ll,)(t)=
.
( 0 ifO
are easily seen to generate (positive scalar multiples) of the tangent vectors /i(0) = gf^-(O) and (ad*/i,/o)(0) = gf^(0). The family of control variations {—u(1,s)}„>o clearly generates the tangent vector —/i(0) = - af-(0), but the family {-u( 2 ' s ^} s >o generates the tangent vector (-)*/i(0) = (—)*gf-(0), i.e. only in the case that k is odd does it provide a new direction. Through careful technical arguments one can indeed make this into a rigorous argument, establishing one part of the Hermes' condition: Loosely speaking, formal brackets containing an odd number of factors of the controlled vector field / i are good brackets, i.e. they always yield both directions. In a similar spirit, one can use the input symmetry u —> u~ that replaces each control by its time-reversal u~(t) = u(T — t), inspired by an example of Stefani (1985), to further narrow the set of potentially bad brackets by establishing, loosely speaking that also all totally even brackets, that is formal brackets that contain an even number of factors of each vector field fa, are good brackets, or are not obstructions to controllability, see the original reference 3 5 for a precise statement. The second parts of these sufficiency theorems use partial orderings of formal brackets to specify in which way a potential obstruction may be neu-
387
tralized by lower order brackets. Such partial orders of formal brackets may be defined in a natural way which reflects the possible order of the tangent vectors generated by a family of control variations. The basic families of control variations mentioned above (parameterized by their size || • ||oo or by the length of the time subinterval on which M*S) is different from zero) yield two basic partial orders. Technically these are defined on formal brackets (see the next section). The first partial order is based on counting the number of factors of the (each) controlled vector field f\ (or fi, i > 1), the second is based on the total length of the formal bracket. The general theorem on controllability by Sussmann 35 also allows one to take convex combinations of these weightings. An example in 16 demonstrated that these partial orders do not exhaust all possible ways of neutralizing bad brackets, but the partial orders in 35 remain the most important general conditions. 3
Separating invariant flows from time-varying functionals
This section surveys technical tools that implement a desired separation of the underlying invariant geometric features from the time-varying control input (which may also interpreted as disturbances in other applications). More specifically, for a control input u and time T > 0, the endpoint x(T,u) of the associated solution curve x(-,u) of (2.1) is determined by both the geometry of the system, characterized via the Lie relations between the vector fields fi, and by the characteristic properties of the controls Ui. For the purposes of this article (with emphasis on formulas suitable for computations) it suffices to consider a fixed set of vector fields /*. However, on a deeper level, the geometry is determined by the distributions spanned by the vector fields fi, rather than the specific choice of basis for the distribution. (A basis change for the distribution corresponds to a feedback transformation v — $(ar, u) (invertible for each fixed a;) of the controls. Consequently, for fixed vector fields fi, the geometry is determined by the linear relations among all iterated Lie brackets of the vector fields fi. Among these relations one is only interested in those that are nontrivial, i.e. that are not consequences of the anticommutativity and Jacobi identity that hold in every Lie algebra. Thus one readily sees that one only needs to consider the relations between the vector fields that are images of a basis of the free Lie algebra L{X\,... Xm) under a Lie algebra (evaluation) homomorphism that sends the generator Xi to the vector field fi. For practical purposes of both analysis and computation it is more suitable to capture the geometry in terms of the flows (t,q) i-» exp(tfi)(q) of the vector fields fi, that is by the solution curves of the system (2.1) that correspond
388
to controls u = ( 0 , . . . , 0 , 1 , 0 . . . 0). Clearly these flows also determine the flows the flows (t, q) H-»- exp(tfpi)(q) of iterated Lie brackets fpi, the flows (t,q) >-¥ exp(t(52(Qfi))(q) of constant linear combinations, and thus e.g. via piecewise constant controls the solution curves of (2.1) for arbitrary (locally) integrable controls u. The main advantage of working with these flows is that not only the flows exp(/j) (q) of the system fields fi but also e.g. the flows e x p ( ^ Ci/i) () of constant linear combinations and/or iterated Lie brackets may, in principle, be computed off-line. This observation has been exploited in a dramatic way in the Crouch-Grossman algorithms 7 for numerical integration of "time-varying nonlinear differential equations with algebraic constraints" (i.e. on manifolds). This approach is even more attractive in the case of well-understood Lie groups, e.g. when the Lie algebra L(fi,f2) is isomorphic to e.g. the Lie algebra of a rotation group SO(n) 8 . On the other side one needs an effective way to characterize the relevant properties controls of the controls Uj that is compatible with the encoding of the geometry. One may think of this as similar to ways to characterize the controls Uj by their moments J 0 tkUj(t)dt, their Fourier coefficients J 0 e~2nkt/TUj(t)dt etc. As is recapitulated in the following, an effective way to extract the relevant information from the controls in a way that matches the description of the geometry relies on selected iterated integrals of the controls. For some representations these iterated integrals are determined by very simple recursive formulae that lend themselves to further analysis, and that are suitable for applied problems such as path planning (effectively reducing this problem to partially inverting the map from controls to their iterated integrals using pre-computed coefficients that characterize the geometry). With this motivation it is natural to factor the system, considered as map from control inputs to solution curves of (2.1) through a free system. It proves convenient 18 to focus in some places on the primitives Ui = J0 w,(s) ds € U (instead of Uj), indeed one may well consider the U — i as controls". These functions are elements of U = .4Ci o c ([0,oo),R m ), with the added constraint of having bounded derivatives (e.g. bounded by 1). It is also convenient to extend the system (2.1) by allowing for outputs y = h(x) whose images under an analytic map h: Mn i-» N are curves y(-) 6 y in some space TV, usually taken to be a Euclidean space MP. A special case is y = x. Thus the system (2.1) with added output, denoted by E is a map from U to y, and we factor it as a composition of a free system denoted by Efree as a map from U into the subspace of curves on a free associative algebra generated by
389
m indeterminates Xo, • • • Xm, followed by a linear projection TT. The analysis of the continuity and convergence properties of these maps is intricate (see e.g. 32>33), and far beyond the scope of this article which is focused on the algebraic, geometric and combinatorial features. A first explicit formula for this factorization £ = n o £ f r e e is given by the Chen Fliess series 5 ' 9 . (for a slightly different set-up compare the Magnus expansion 2 2 ) . Consider a set Z = {XQ, ... ,Xm}, also called an alphabet, of indeterminates (or letters) X0,... ,Xm. On the free associative algebra A{Z) (over E, consider the free system m
S{t) = S(t) • ( Xo + Y2 «»(*) Xi(t) )
initialized at
Using the chronological product *: AC x AC ACioc([0,T],R)) defined by
5(0) = 1
(3.1)
i-» AC (with AC
=
(F*G)(t)=
f F(s)G'(s)ds, (3.2) Jo and which naturally carries over to curves in A{Z) with coefficients in AC, the system may be written in the compact {integrated) form m
S{t) = S{t) * U
with U = ] T Ui{t) Xi{t)
and
U0{t) = t.
(3.3)
i=0
The solution is obtained easily by iteration of (3.3) (compare 18 > 20 . 32 . 3 3 ) yielding OO
S{t) = ^V*k
(3.4)
k=i
where the right chronological powers are defined inductively by f*(*:+1) = F*h * F. Upon specification of analytic vector fields fi the familiar form of the Chen series for the system (2.1) is the image of (3.4) (in expanded form) under the chronological algebra {evaluation) homomorphism that maps the indeterminate Xi to the vector field fi, when applied to an analytic output map h and evaluated at the initial point p = x{0):
y{T,u) = J2 ?w{u){T) • {fwh){p)
(3.5)
390 where the sum extends over all words (noncommutative monomials) in the free monoid Z* generated by Z, fw is a partial differential operator (identify fx{ with fi), and iterated integral functionals are defined recursively by Twa(u) = Tw(u) * T a and Ta(u) = Ua for a £ Z. 5.9.33,32 For later convenience we briefly introduce some notation and terminology to use the natural graded structure of the free associative algebra A(Z). The subspaces of interest are the level sets of the (associative) algebra homomorphism 6 = (l,8i,...Sm): A(Z) *-> (Zo~) 1+m whose components Si: A(Z) (->• ZQ" are defined by Si(Xj) = 5ij (Kronecker-delta on the right). This multi-grading carries over to the free Lie algebra L(Z) over Z, yielding a grading of formal brackets, and dually, yields a grading on the iterated integral functionals T„,. Specifically, if u = ( l , u i , . . .um) = : [0,T] i-> Rm+1 and a{u): [0,cT] i-> l r o + 1 are related by {o(u))i{ct) = SiU^t), then T(
ef T(u) (T)
(3.6)
This graded structure underlies the algorithms for obtaining nilpotent approximating systems (e.g. 30 and n ) underlies most proofs of conditions for small-time local controllability (e.g. 17>35), and governs which tangent vectors to the reachable sets will be generated by which families of control-variations 15
The Chen Fliess series separates the time- and input-dependent iterated integrals from the products of the vector fields which correspond to the geometry of the system. In order to improve the formula one notes that the system (3.3), alas (3.1), actually evolves on the formal group G C A(Z) 3 5 ' 1 8 of exponential Lie series - known classically as Ree's theorem 2 6 , basically a consequence of a continuous version of the Campbell-Baker-Hausdorff formula, and algebraically elegantly captured by the fact that the map T is a chronological algebra homomorphism 18 ' 20 . Thus it is a priori clear that the Chen Fliess series van be expressed as the exponential of a Lie series 5 ^ Tw ® w = exp I ] T w€Z*
CH
® [H])
\HEU
or as a directed infinite product of exponentials
(3.7)
) 34
^ Tw ® w = Y[ exp(6f ® [H]) wez* Hen
(3.8)
In either case, the sum on the right extends over any basis for the free Lie algebra L(Z) - technically T-L is any Hall-Viennot set 36 over Z, and [H] € L(Z)
391
denotes the canonical image of the binary tree H € %. This notational abuse is justified by the fact that the restriction of the deparenthezation map (from the free magma over Z to A(Z) to any Hall-Viennot H is injective and maps into L(Z) considered as a subset of A(Z). In order to of practical value one wants explicit formulas for the coefficients £H and Oy in either expression. Explicit formulas for the coordinates of the second kind £# were obtained in 3 4 , and independently, using substantially different terminology and settings, in 10 ' 23 > 28 . Arguably the most elegant recursive formula is given by 18 ' 21 £HK=€H*ZK
if
H,K,HKeH
(3.9)
and if the left factor of K is not equal to H (otherwise there is factorial present 18,21}
No similarly elegant formula has been found for £h- However, in 2 0 a substantial number of these iterated integral functionals have been calculated and tabulated, and realized through a cascade polynomial control system. The calculation of these basically coincides with calculating a formal series expansion of the logarithm of the Chen Fliess series using the chronological products. 4
Coordinates of the 1 st kind
Before proceeding with the logarithm of the Chen series we work out one simple example in detail. This shall provide a better feeling about what is involved and how to do calculus with the change to coordinates of the first kind, and in particular, how this compares with familiar expansions based on coordinates of the second kind, we . Consider the vector fields fi(x) = -^- + a^gf- and h{x) — -^- on R 2 . The Lie algebra L ( / i , / 2 ) is a two dimensional since [ / I , ^ ] = fa. Define new coordinates C, = (Ci,C2) (in a neighborhood of p = 0) in R 2 via the inverse of the map $ _ 1 : C •-> exp(d,/i + C2/2XO). To find an explicit formula calculate the solution curves of the differential equation x = Cifi{x) + (,2f2{x) (with i(0) = 0) as x
i(t) = Ci*
and
e Clt - 1 x2(t) = — C2-
Evaluating at t = 1 yields the coordinate change x = $ _ 1 o £ and its inverse explicitly as e"*1 — 1 xi = Ci, 2 = —7 C2, x
Ci
#i and
Ci = xi,
C2 = —
7 x2,
eXl - 1
392 The apparent singularity is removable and this is indeed an analytic coordinate change. A straightforward calculation of the Jacobian £ ) $ _ 1 and its inverse yield formulas for the new coordinate vector fields JL - JL d(,i dxi
/ l ~ e X l +xieXl \ exi - 1
X2\ _d_ xi) dx2
_d_ _ feXl - 1 \ d dC,2 \ xx J dx2
Thus the second coordinate vector fields differ only by their scaling. However, the first coordinate vector fields are substantially changed. By construction [•£-, •&-] = 0, but it is an easy and instructive calculation to verify this directly. This simple example already suggests that in general one may not expect to calculate partial derivatives with respect to coordinates of the first kind almost by inspection (as is commonly practiced for coordinates of the second kind), but rather that such coordinate changes require substantial careful calculations. We contrast this with coordinates of the second kind, starting with the same vector fields. Define new coordinates £ = (£1,62) (in a neighborhood of p = 0) in R 2 via the inverse of the map \ P _ 1 : £ H-> exp(£ 2 /2) ° e x P(£i/i)(0)To find an explicit formula calculate and concatenate solution curves of the differential equations x = fi(x) (with x(0) = 0) and x = f2{x) (with x(0) — exp(£i/i)(0)). Evaluating at U = & yields the coordinate change x = * _ 1 o£ and its inverse explicitly as £i=6,
x2=eil-&,
and
£1 = x1}
& = e~Xl • x2,
Using the Jacobian £>\l/ -1 and its inverse one obtains the new coordinate vector fields — - — — nd — - e X l — d£i dx\ dx2 d£2 dx2 As expected the first coordinate vector field agrees with the field /1 used to construct the new coordinates, while the second coordinate vector field may be considered as arising from the infinite series with terms
SJ
k=i
These simple formulas are a special case of the familiar formula for calculating partial derivatives of products of exponentials: If f\,... fm are smooth vector fields, p e Mn and * : t= ( t i , . . . ,tm)
i-»exp(Wm)o...oexp(£j/j)o...oexp(ii/i)(p)
393 is denned for sufficiently small |i|, then repeated use of the chain-rule yields —
= exp(i m / m )«o • • o e x p ( t i + i / i + 1 ) » / 4 x p ( t . / . ) o . . o e x p ( t i / ] ) ( p ) ( 4 . 1 )
= exp(t m / m )» o • • • o e x p ( t i + i / i + 1 ) , / 4 x p ( _ t . + i / . + i ) o . . . o e x p ( _ t m / m ) $ ( ^ 4 . 2 )
= E ^•••tf^-(^kmfm,...(adk^fi+1,fi))..)
.
(4.3)
We return to the logarithm of the Chen series, or the coordinates 0? of the first kind in (3.8). It is clear that the £//• are well-defined by (3.8), but this formula does not provide, per se, an effective algorithm for computing more explicit formulas. For the case of the Hall bases as defined in Bourbaki 3 (which is strictly narrower than the Hall-Viennot bases 36 ) explicit formulae for the case m = 2 have been computed in 2 0 . The calculations were carried out in the computer algebra system MAPLE to perform work in various nonassociative, noncommutative algebras. Rather than directly considering explicit formulas for the iterated integrals C,H is appears to be more useful to consider recursive algorithms similar to consider realizations as polynomial cascade systems similar to the elegant formula (3.9). Due to the underlying chronological algebra structure it is indeed possible to do this - i.e. go from the purely algebraiccombinatorial formulae to formulae which take a dynamic perspective 18 . Such a realization via a cascade polynomial system is tabulated in table 1 for the case of m = 2 and a standard Hall basis in the narrow sense of 3 . Several comments are in order. First of all the formulas appear quite complicated, especially when compared to the elegant formulas (3.9) for the corresponding coordinates of the second kind. The details of the calculations in 20 suggest that this may be due to the way that Hall sets, Lazard elimination, and exponential products are all very intimately related via the chronological product structure, whereas the coordinates of the second kind don't seem to fit well together with the construction underlying the Hall Viennot sets (these are the only effective known algorithms for generating explicit bases for free Lie algebras). It appears that one needs to start with a completely different construction of bases in order to obtain nicer formulas for the coordinates of the first kind. Nonetheless, for implementations of e.g. numerical integration algorithms, or for path planning algorithms these formulas are just fine as they are still suitable for a table look-up, and where suitable, an off-line integration to obtain the flows corresponding to the basic fields.
394
Table 1. Realization as affine nilpotent system'20
Va
—
ib
-
ua u
y
~h b""
iab
Vaab
T2y"yb)
(-jVob -
Vbab
+
"a
=
(-J^oob - A y »i> ya ) u »
Vbaab
=
(-|»6ab -
ibbab
=
y
baaab
A»ab»b) "a
~ TSyayaab+0yayab
755»1"»
ibbbab
= ("i^boab -
n>«*l' y
+
Vaaab
y Vaaaab = (~h aaab
J»°"6
+
-H»l"»
~ ^yl
+
{-? ab
0y
( - Jl/aab ~ A ^ a b ^ a ) "
+
(-2" y bab -
y
u
b)
a
~ A»a!/bab -
~
~ Tiybybab
a b + ° yaybVab +
Vabaab = ( " i ^ b a a b + TZybyaab
T5Vayb)ub
+
+
{-hybbab
E » l » " ' ~ A^iabab +
b
+0ybyab
b
T2yabyb)"b
T^S^b
~
T^yayl)"b
+ s f c i ' a i ' b ) ««
( " J Vaaab ~ i > « » . . l + ° » a » a l -
^fc^a^b)
TjVab + ° Va »b »ab + 3 B 5 » 5 » l ) »<•
+
("Vaaab -
\vaVaab
+ "vlvab
~ 3goyayb)
u
b
U
b
Vabbab = ( " I b b b a b ± 'rVi'bWbab + ° I ' b ! ' a b + 7§o *<• *b) " a
+
ibbaab
= (-JVbbab -
(0»baab +
0
T2ybybab
+°ybyab
+
(-I»6aab -
»bl/aab ~ jVai/bab -
H » « i + ° » « » t » « » ~ 7jjJ»ol'b)"b
+ 235 B a " b ) ™«
^VbVaab
~ VlVaVbab + ° " l b + ^yaVbVab
~ ^Bylyb)
However, for the sake of analyzing e.g. controllability and optimality properties these formulas are a disappointment as they do not readily lend themselves to obtaining structural insights. The hope, of course, was that via
u
b
395
the exp map the explicit representation of the free system (3.3) (resp. (3.1)) yields a very tangible view on the linear space TeG of how the system evolves on the group G C A(Z), yet allows one to take advantage of its linear structure. One obvious key objective is to facilitate the analysis of approximating cones of the reachable sets, which traditionally have been liberally confusing objects in the manifold with objects in its tangent space at the identity (or initial point x(0), see e.g. 15 for typical such offenses). The next section will a little further elaborate the reachable set and its preimage under the exp map. 5
Supporting hyperplanes
The basic construction underlying conditions for small-time local controllability (STLC) (and, dually, optimality) for system (2.1) relies on approximating cones for the reachable sets. These are, roughly speaking, sets of (generally higher order) tangent vectors to the reachable sets, usually resulting from oneparameter families of control variations, together with some open-mapping theorem, see e.g. 15 ' 32 ' 35 . The approximating cones for systems of form (2.1) are images of analogous sets for the free system (3.3), and thus inherit some filtered structure resulting from (3.6). In particular, if the system (2.1) is not STLC, then there exists a supporting hyperplane for the approximating cones of all orders, i.e. the approximating cones all lie on one (the same) side of this hyperplane. Conversely, if the origin lies in the interior of some approximating cone for some order, then the system is STLC. Since the reachable set of (2.1) is the image (under a linear map TT ) of the reachable set of (3.3) (assuming the same set of admissible controls), every supporting hyperplane of the approximating cones of (2.1) can be pulled back by TT* to a supporting hyperplane of the approximating cone of (3.3). A major simplification - and major motivation behind using coordinates of the first kind - is that one can further pull back these supporting hyperplanes to the tangent space TeG at the identity via the exponential map. The key advantage is that the (pre)image of the reachable set of (3.3) has a natural additive structure, thus obviating in part the need to work with approximating cones, and instead allowing one to work directly with the (pre)image of the reachable set of (3.3) itself. (Note that this is basically also possible when working with the coordinate representation of (3.3) induced by (3.9), and more generally when working with any system that admits a dilation group as in (3.6). A technical discussion of associated technical details, including convergence, lack of completeness etc. will be the subject of a fourthcoming article.) Before proceeding, it is helpful to recall that the reachable sets for non-
396
w ¥
Figure 1. A cusp with supporting hyperplanes and one controllable direction
linear non-STLC systems generally have cusps at the origin, compare figure 1, and thus their approximating cones (for small t > 0) have arbitrarily small interior, or almost every hyperplane is a supporting hyperplane for some sufficiently small time t. Thus it is natural to utilize the graded structure of (3.3), and first consider the supporting hyperplanes for each homogeneous subspace separately. Let HfTee(t) Q TeG denote the preimage under the exp map of the reachable set of (3.3) at time *. Also, for any multi-index a € ( Z f ) 1 + m let La(Z) denote the subspace of TeG, identified with L(Z), that is generated by all Hall words H € % that satisfy 6(H) = a (loosely speaking, by all formal brackets containing a, times the indeterminate Xi as a factor). One has several options. If interested only in STLC for systems of form (2.1) one may consider the slices that are the intersections ofHfTee(t) with the with the subspaces La(Z) of TeG, identified with L(Z), that is generated by all Hall words H € V. that satisfy 6(H) = a. However, one expects that (e.g. for single-input systems, m = 1) these slices generally consist of a singleton only: Consider e.g. the system (2.2) with k — 2 (e.g. as a nilpotent approximation it may be considered a subsystem of (3.3). Then X2(t,u) ^ 0 for every u ^ 0, and hence %fTee{t) f\La = {0} for all a ^ (1,2). Alternatively, one may try to consider only maximal polynomial cascade systems of (3.9) that are STLC, e.g. by starting with the trivially STLC system i< = Ui and inductively adding only such components xu that preserve STLC. However, at this time there does not appear enough knowledge available about e.g. inhomogeneous obstructions to STLC as to make it clear that this is indeed a viable approach. (Compare e.g. the example in 16 that defied all prior expectations of the iterated integral functionals T„, compared to the natural partial ordering on the sets of multi-indices
397
w
Vt/
Figure 2. Two supporting hyperplanes, two controllable directions
{8(H): Hen} C(Z+)1+m.) Thus one is led to instead consider the projections 7£free(*) C TeG of the reachable sets Tl{iee(t) onto the homogeneous subspaces La(Z) of TeG, identified with L(Z). Note that due to the graded structure of L(Z) this projection is well-defined, and, in particular, independent of any choice of basis for L(Z). It is convenient to consider this set H%ee(t) as the (output-)reachable set, i.e. the set of all y(t,u) = h(S(t,u)) of system (3.3) with output y = h(S). Here the components of the map h may be taken as all (H for which 8(H) = a and H € %. For practical calculations it is useful to note that this set ^freeW depends only on a finite dimensional subsystem of the free system (3.3), e.g. one might consider the free nilpotent system that consisting of all those equations of (3.6) for which ||<5(.ff)||i < ||a||i. We note on the side that if dim La(Z) = s then, there are always at least (s — 1) linearly independent functions g/i,... ys~i: G(Z) C A(Z) •-> H, which may be taken as linear combinations of the coordinates Cw( of the first kind , where Hi are elements of a Hall set and {[Hi],.. .]HS]} is a basis of dim La(Z). This is a direct consequence of the homogeneity properties (3.6) and accessibility (which guarantees that H"ree(t) has interior points). Compare figures 2 and 3 for suggestive illustrations. Consequently, it is of little interest to look for maximal STLC systems with output, but instead the natural focus is on the maximal collection of (homogeneous) uncontrollable outputs. With this set-up the objective is to find (for each homogeneous component, specified by a) a maximal set of (linearly independent) supporting hyperplanes of Tlfree(t). Such supporting hyperplanes make precise what the
398
Figure 3. Three supporting hyperplanes, two controllable directions
term "bad brackets" intended to capture. Of course, in the case of one dimensional subspaces (e.g. when a = (1,2k) for some integer k) there is no need to make such distinction. However, in general any coordinate filed g § - does not depend only on H, but rather on the entire choice of the basis ft. More succinctly, algebraically supporting hyperplanes of (approximating cones of) the reachable sets are by elements of the dual space, i.e. are linear functionals on the space La(Z) C TeG. In general, one may consider for fixed a € (ZQ)" all linear functionals u>: TeG = L(Z) H> R, that vanish identically on each homogeneous subspace L®(Z) with /3 -fi a. (However, working with linear functionals w: La(Z) i-» R has the advantage that one need not introduce infinite-dimensional machinery, especially make choices of a topology and consider consequent continuity properties). Consider a fixed a € (ZQY and suppose {[UTi],^].-••[#«]} is a basis for La(Z). Consider as output of the free system (3.3) the projection of the solution S(t, u) onto this homogeneous subspace, i.e. y(T,u) = ( d r , ( T , U ) , . . . O r . ( T , « ) )
=
C#i(T,u)-[H t ]
+ ... + CH,(T,U) • [H,] (5.1)
The precise analogue of a "bad bracket" is a linear functional s
u = ] T uHi • dHi such that s
<w,S(T,u)> = J2 ^Hi<m{T,u) i=l
>0
399 for every admissible control u and all small times T (here S( •, u) is the solution of the free system (3.3), i.e. basically the Chen Fliess series). Each confirmed supporting hyperplane basically may give rise to a new necessary condition for STLC (or sufficient condition for optimality). (Where possible, one also would like to specify for each such supporting hyperplane the minimal set of directions CH that can possibly neutralize it. This can also be nicely phrased in this set-up of projections, but goes beyond the scope of this article.) The most obvious examples of homogeneous supporting hyperplanes are given by d£n (here considering coordinates of the second kind) where H G % is of the form H = (sA2i'Hia,...
{ad2i2Hi2,(a.d2ilHh,X0))...))
(5.2)
with Hij+1 >- Hij G calH which are common elements in Hall-Viennot bases. The corresponding Lie brackets / # are always even in each controlled direction, i.e. 6j(H) is even for j = l , . . . m , while their total length |<S(i?)| is odd. These are precisely the potential obstructions singled out in Sussmann's general theorem for controllability 35 . In particular, these include the formal brackets (ad 2 Xi,X0) which are addressed in Stefani's necessary condition 30 (no problems here as the subspaces L^'2k\X0,Xi) are of dimension one). This also includes a supporting hyperplane corresponding to the formal bracket (ad 2 [X 0 ,Xi],X 0 ) G L< 3 ' 2 )(X 0 ,Xi) studied in 14 . However, this homogeneous subspace is of dimension two, and thus always contains at least one controllable direction, compare figure 2 for illustration. Indeed, there is a second independent homogeneous supporting hyperplane for 7£( 3,2 )(X 0 ,Xi)(£) determined by d£# with K = (ad 2 X 0 ,ad 2 Xi,X 0 )). This leads to the well-known observation that iterated integral functionals TK with K = XoXo... XQH G H basically have the same controllability properties as T//- Thus when investigating STLC for systems of the form (2.1) such K will only be obstructions to STLC of a particular system (that cannot be neutralized) if already H causes loss of STLC. However, when considering STLC of systems with output, one needs to include such formal brackets in the analysis. As a practical example, consider single-input systems (i.e. m = 1) and a = (2,4). A typical Hall word may include the formal brackets Hi = [Xo,(ad 4 X!,X 0 )] and H2 = [[X^Xo], (ad3XuX0)]. The iterated integral functional THl is clearly positive definite (i.e. ^ ( T , u) > 0 for all u,T, with equality only if u = 0. This is clearly a direct consequence of the definiteness of TJJ-J with K\ = (ad XI,XQ). On the other hand Stefani showed in 29 that
400
YH2 is indefinite by analyzing the system X\
= U
x2 = xi x3 = x\x2
(5.3)
This raises the question whether this is the one (1 = dimL' 2 ' 4 )(Z) — 1) guaranteed controllable direction in the reachable set 7?/ 2 ' 4 ' (t) of the free system with output and there are two linearly independent supporting hyperplanes (like in figure 2). The alternative scenario is that there is only one supporting hyperplane, and the reachable set TZ^2'4^(t) looks qualitatively like a disk (with 0 G L( 2 ' 4 '(Z) on its boundary, e.g. with boundary differentiable at 0 G L' 2 , 4 '(Z) in a suitable sense). The first scenario is plausible: It might be that the functional Y ^ is so strongly positive definite that its integrals T^jKi will dominate all Tw with S(w) = (s + 1,4). In general these are very delicate questions about integral inequalities. However, in this special case a little calculation establishes that basically the converse is true: By judicious choice of the control u(t) = (l,ui)(t) G { l } x [s,e] one can reach points y{T,u) = [TH1(T,U),TH2(T,U)] such that the ratio JO (/O 3 JO'2 UICO dt^ dti)
TH2(T,U) 7742 =
=
THl(T,u)
"
(/o" u iCO dt2) -J
/o / o 2 ( / o a « i C O d * i )
dt3 (5.4)
dt2dt3
is arbitrarily small positive. Consequently, the free system (3.3) with scalar output y(T,u) = Ci • rHl(T,u) + C2 • TH2(T,u) is STLC for all choices (Ci,C 2 ) E l x ( R \ {0})- This result may be obtained in a rather straightforward computation assisted by a computer algebra system. First, due to homogeneity it is clear that one may fix T = e = 1 (recall, scaling time and/or control bounds cannot distinguish between homogeneous iterated integral functional T ^ and T,„2 if 6(w\) = S(w2)). Then consider piecewise constant controls m(t) = l i f t G [0,a]U[l-/?,l] andui(t) = Oift G (a, 1-/3), and consider the ratio 7742 as a function of (a,/?). It is easy to see that this ratio can assume all values in some interval containing zero. On the other hand one readily calculates that the ratio can also take arbitrarily large positive and negative values by considering the behavior of 7741 (a, /3) as both a and /3 approach zero from above, albeit at carefully related rates. On the side, we briefly note that one last hope for saving the notion of a bad bracket might be a natural inner product structure on L(Z), which would obviate the need to distinguish between formal Lie brackets and elements of
401
the dual space. Indeed, there is a natural inner product on A(Z) (and thus also on L(Z)) - which is such that the set Z* of all words (noncommutative monomials) is an orthonormal basis. Thus the homogeneous components LaZ are automatically mutually orthogonal subspaces. Consequently formal Lie brackets of different multi-degrees are orthogonal. However, in general there is no reason why formal Lie brackets of the same multi-degree that are elements of a Hall-Viennot basis should be orthogonal with respect to this inner product - nonetheless, maybe there is some way of using such orthogonality in the construction of new bases, bases that may be better adapted to the construction of coordinates of the first kind. Finally, the discussion in this section has focused on homogeneous supporting hyperplanes. A much harder problem is to obtain a complete description that also accounts for additional, inhomogeneous supporting hyperplanes. Finally, to complete the analysis of STLC it remains to describe the sets that can possibly neutralize any supporting hyperplane, and finally analyze the simultaneous neutralization of different supporting hyperplanes. This is best illustrated in the example (1.1) in which the formal bracket [XH] = {ad2[X0,Xi],X0) = e l/ 3 ' 2 >pfo,Xi) maps to f"(0) £ 0. While T # > 0, the system is still STLC since any linear combination Tjy + C • TK with K = {a,d2[X0,Xi],XQ) =e L(1'3)(-Xo,-Xi) is indefinite for any constant C ^ 0. In this case Sussmann's general theorem 35 yields STLC since ||<J(if)||i < ||<$(.ff)||i. However, much remains to be done to find all admissible such partial orderings. 6
Conclusion
Since the articles of the early 1970s differential geometric nonlinear control theory has developed many new tools, and continues to do so. Nonetheless, some of the original problems, such as finding necessary and sufficient conditions for controllability that are amenable to algorithmic implementations, are still far from being completely resolved. This article focused on the still relatively young combination of the chronological algebraic tools with geometry and combinatorics, in order to address the problem of nonlinear controllability, which at least partially owes its existence to in Jurdjevic' 1972 and 1973 articles 12 - 13 . We are forever grateful for this sequel of papers (together with those mentioned in the introduction and others more) to have initiated such a fruitful research, which has spun off many topics that are exciting in their own right. The coordinates of the first kind, or the logarithm of the Chen series, discussed here, while motivated by the problem of nonlinear controllability,
402
may eventually be much more useful in very different settings, e.g. in effective numerical integration algorithms for differential equations with algebraic constraints. References 1. A. Agrachev and R. Gamkrelidze, Exponential representation of flows and chronological calculus, Math. USSR Sbornik (Russian), 107, 4, 487532 (1978). Math. USSR Sbornik, 35, 727 (1978). 2. A. Agrachev and R. Gamkrelidze, Chronological algebras and nonstationary vector fields, Journal Soviet Math., 17, 1, 1650-1675 (1979). 3. N. Bourbaki, Lie Groups and Lie algebras, (Springer, 1989). 4. R. Brockett, Differential geometric methods in system theory, in Proc. IEEE Conf.Dec.Cntrl., 176-180 (1971). 5. K. T. Chen, Integration of paths, geometric invariants and a generalized Baker-Hausdorff formula, Annals of Mathematics, 65, 163-178 (1957). 6. P. Crouch and F. Lamnabhi-Lagarrigue, Algebraic and multiple integral identities, Acta Appl. Math., 15, 235-274 (1989). 7. P. Crouch and R. Grossman, The explicit computation of integration algorithms and first integrals for ordinary differential equations with polynomial coefficients using trees, Proc. Int. Symposium on Symbolic and Algebraic Computation, (ACM Press, 89-94, 1992). 8. P. Crouch and F. Leite, The dynamical interpolation problem: on Riemannian manifolds, Lie groups and symmetric spaces, J. Dynam.Control Systems, 1, 177 - 202 (1995). 9. M. Fliess, Fonctionelles causales nonlineaires et indeterminees noncommutatives, Bull. Soc. Math. France, 109, 3-40 (1981). 10. M. Grayson and R. Grossman, Models for free nilpotent algebras, J. of Algebra, 135, 177-191 (1990). 11. H. Hermes, Nilpotent and high-order approximations of vector field systems, SI AM Review, 33, 238-264 (1991). 12. V. Jurdjevic and H. Sussmann, Controllability of nonlinear systems, J. Diff. Eqns., 12, 95-116 (1972). 13. V. Jurdjevic and H. Sussmann, Control systems on Lie groups, J. Diff. Eqns., 12, 313-329 (1972). 14. M. Kawski, A new necessary condition for local controllability, AMS Contemporary Mathematics, 68, 143-156 (1987). 15. An angular open mapping theorem, in: Analysis and Optimization of Systems, A. Bensoussan and J. L. Lions, Eds., (Lect. Notes in Control and Information Sciences, 111, 361-371, 1988). 16. Control variations with an increasing number of switchings, Bull.Amer.Math.Soc, 18, 149-152 (1988). 17. High-order small-time local controllability, in: Nonlinear Controllability and Optimal Control, H. Sussmann, Ed., 441-477, (M. Dekker, 1990) 18. M. Kawski and H. J. Sussmann Noncommutative power series and for-
403
19. 20. 21. 22. 23. 24. 25. 26. 27. 28.
29. 30. 31. 32. 33. 34. 35. 36.
mal Lie-algebraic techniques in nonlinear control theory, in: Operators, Systems, and Linear Algebra, U. Helmke, D. Pratzel-Wolters and E. Zerz, Eds., 111-128, (Teubner, 1997). Controllability via chronological calculus, Proc. 38th IEEE Conf. Dec. Contr., 2920-2925, (1999). M. Kawski, Calculating the logarithm of the Chen Fliess series, submitted to Int. J. Control (special issue Proc. MTNS 2000 Perpignan). M. Kawski, Chronological algebras: combinatorics and control, Itogi Nauki i Techniki,(translation in J. Math. Sciences), 68, 144-178 (2000). W. Magnus, On the exponential solution of differential equations for a linear operator, Comm. Pure Appl. Math., VII, 649-673 (1954). G. Melancon and C. Reutenauer, Lyndon words, free algebras and shuffles, Canadian J. Math., XLI, 577-591 (1989). G. Melancon and C. Reutenauer C , Combinatorics of Hall trees and Hall words, J. Comb. Th. Ser. A, 59, 285-299 (1992). H. Munthe-Kaas and B. Owren, Computations in a Free Lie Algebra, report no. 148, (Dept. Informatics, Univ. Bergen, 1998). R. Ree, Lie elements and an algebra associated with shuffles, Annals of Mathematics, 68, 210-220 (1958). C. Reutenauer, Free Lie algebras, (Oxford, Clarendon Press, 1993). M. Schiitzenbergerj Sur une propriete combinatoire des algebres de Lie libres pouvant etre utilisee dans un probleme de mathematiques appliquees, Seminaire P. Dubreil, Algebres et Theorie des Nombres, (Faculte des Sciences de Paris 1958/59). G. Stefani. On the local controllability of a scalar-input system, in Theory and Applications of Nonlinear Control Systems, C. I. Byrnes and A. Lindquist Eds., (Elsevier, North-Holland, 167-179, 1986). G. Stefani, Polynomial approximations to control systems and local controllability, Proc. 25 t h IEEE, CDC, 33-38 (1985). H. J. Sussmann, An extension of a theorem of Nagano on transitive Lie algebras, Proc. Amer. Math. Soc, 45, 349-356 (1974). H. J. Sussmann, Lie brackets and local controllability: a sufficient condition for scalar-input systems, SIAM J. Cntrl. & Opt., 2 1 , 686-713 (1983). H. J. Sussmann, Lie brackets, Real Analyticity, and Geometric Control, in Differential Geometric Control, R. W. Brockett, R. S. Millman, H. J. Sussmann, Eds., 1-116 (1983). H. J. Sussmann, A product expansion of the Chen series, in Theory and Applications of Nonlinear Control Systems, C. I. Byrnes and A. Lindquist Eds., (Elsevier, North-Holland, 323-335, 1986). H. J. Sussmann, A general theorem on local controllability, SIAM J. Control & Opt, 25, 158-194 (1987) G. Viennot, Algebres de Lie Libres et Monoides Libres, ( Lecture Notes, Math., 692, Springer, Berlin, 1978).
This page is intentionally left blank
405 VARIATIONAL E Q U A T I O N S OF L A G R A N G I A N S Y S T E M S A N D HAMILTON'S P R I N C I P L E
H. N. NUNEZ-YEPEZ Departamento de Fisica Universidad Autonoma Metropolitana-Iztapalapa Apartado Postal 55-534, Iztapalapa 09340, Mexico D.F. E-mail: [email protected] JOAQUiN DELGADO Departamento de Matemdticas Universidad Autonoma Metropolitana-Iztapalapa Apartado Postal 55-534, Iztapalapa 09340, Mexico D.F. E-mail: [email protected] A. L. SALAS-BRITO Laboratorio de Sistemas Dindmicos Departamento de Ciencias Bdsicas Universidad Autonoma Metropolitana-Azcapotzalco Apartado Postal 21-726, Coyoacdn 04000, Mexico D.F. E-mail: [email protected] We discuss a recently proposed variational principle for deriving the variational equations associated to any Lagrangian system. The principle gives simultaneously the Lagrange and the variational equations of the system. We define a new Lagrangian in an extended configuration space —which we call D'Alambert's— comprising both the original coordinates and the compatible "virtual displacements" joining two solutions of the original system. The variational principle is Hamilton's with the new Lagrangian. We use this formulation to obtain constants of motion in the Jacobi equations of any Lagrangian system with symmetries. These constants are related to constants in the original system and so with symmetries of the original Lagrangian. We cast our approach in an intrinsic coordinate free formulation. Our results can be of interest for reducing the dimensions of the equations that characterize perturbations in a Lagrangian control system.
1
Introduction
Extremum principles have a great importance in dynamics and in control 1 , 2 , 3 . They can be used to attain general formulations of the theory and can be of assistance in the solution of particular problems 4 , 5 . The purpose of this contribution is to discuss an extremum principle which generates both the equations of motion and the so-called variational equations describing deviations between any two solutions of a Lagrangian system 6 . The variational
406
principle is Hamilton's but with a different Lagrangian. Since we are dealing with Lagrangian dynamical systems, in which such "least action" principle is assumed to be valid, the variational equations describe deviations between extremal curves in a certain configuration space so they, slightly abusing the language, may be called the Jacobi variational equations of the Lagrangian system as we have done in 6 . Note that, though we use a notation with a strong flavour of classical mechanics 7 , the formulation is not limited in any way and can be applied to field theories like Einstein's, or evolution equations like the Korteweg-DeVries or the Kamdomtsev-Petviashvili 8 - 1 1 . The intrinsic formulation of section 4 should make clear this matter. The variational equations can also be used to study the stability of solutions and as starting points to evaluate their Liapunov spectrum as we have succintly discussed in 6 . They are, besides, describable in the vertical extension of the Lagrangian formalism 12 . We should mention previous related work along the same lines 13 and an older contribution with different purposes 14 . The article is organized as follows: In section 2 we define the new Lagrangian, use it to formulate the variational principle whose extremals are solutions of the Lagrange and the variational equations of the system. In this section we also discuss basic features of the formulation. In section 3, we address the existence of constants of motion in the variational equations and its relation with constants in the original system. In section 4 we cast our approach in an intrinsic language, making connection with the symplectic manifold that can be associated with the prolongation of the original Lagrangian. If you are interested mainly in the mathematical setting of the formulation this is the section to read. In section 5 we give various examples in which the approach may be used. 2
T h e variational principle
Let us consider then a JV-degree of freedom dynamical system endowed with a Lagrangian function L(qaqa,t), a = 1 ...N; defined in the tangent bundle TQ of its configuration space Q, i.e. L : TQ -> E. From the Lagrangian we usually construct the action 4 , 5
S[q{t)}= J' L(qaqa)dt
(2.1)
Ju where we are using square brackets to indicate that S is a functional of the paths qa{t) joining two given points, qa{h) and qa{t2) (a = 1> • • -,N), m Q at two fixed instants of time ti and i2- The extremalization of S directly
407
leads to the Lagrangian equations of motion of the system [equations (2.6) below]. Let us now consider a different configuration space D, comprising both the original configurations of the system plus all the possible "virtual displacements" joining any two extremal paths of (2.1). With the help of L, we can define a new (the mathematical meaning of all this should become clear in sections 4.2 and 4.3) Lagrangian 7(q, q,e, e, t) 6 as 7(q, q, e, e, *) = — e a + — e „ oqa
(2.2)
OQa
here, as in all of the paper, the summation convention is implied for repeated indices. The function 7 : TD -> M, where TD is the tangent bundle of D, is central in our formulation. The 2iV-diniensional D'Alambert configuration space D is assumed, mostly in sections 2 and 3, to be coordinatized by (qa, ea), a = 1 , . . . , N. The iV-component object c = (ei, e 2 , • • •,CAT) stands for the displacement from an extremal path of (2.1) to another, and e = (ci, C2> • • •) CN) for its corresponding velocity. This means that e plays the role of the variational field associated with trajectories of the original system: q' = q + £ and q' = q + e, where both q and q' are extremal paths 1, 4 , 6 of the original action (2.1). A useful property of j(q,q,e,e,t) is that it is an explicit function of time only when L is non-autonomous. The function 7 is used to define the new ("displaced") action functional S[q(t), e(t)] = /
2
7(q,
q, C, £, t)dt
(2.3)
•ft!
of the paths joining two given configurations (q%, ei) and(g 2 , £2) of the varied system between two fixed instants of time t\ and t2. We are calling "time" the parameter appearing in (2.3), but it can be any other useful parameter, for example, the arc length, or even may be any finite set of parameters. The statement of the variational principle is just Hamilton's, that is, S is extremal when the system follows its actual path in D6 aE[q(i),e(t)] = 0.
(2.4)
The extremalization is done varying the path but mantaining the endpoints and the time fixed. The conditions for the functional T,[q(t), e(t)] to be an extremum are the 2./V Euler-Lagrange equations 1 , 4 , 1 4 d fdj\
57
n
d fdj\
dj
n
s u ) - ^ = ° - 5uy-5t=°-
o=i
"-
Wi
, , (25)
408
or, using the definition (2.2) in the preceding equations, we obtain the Lagrange equations of the original system
±(9L\_dL__ dt \dqaJ dqa
(2.6)
plus the linear variational equations: Mabeb + Cabeb + Kabeb = 0,
a = 1,
,N.
(2.7)
Equations (2.7) describe the evolution of the deviation, c, of a varied trajectory from an unperturbed one. The N x N matrices M, C and K, are d2L dqadqb d_ ( d2L = dt dqadqb d_ d2L — dt dqadqbJ
Mab = Cab
Kab
(2.8) d2L + dqadqb d2L dqadqb
d2L dqbdqa a,b = 1,
(2.9) ,N.
(2.10)
It shoud be clear that equations (2.7) are the variational equations of the original system 1, 6 , 8 . One of the interesting properties of these equations is that the matrices M, C, and K, and, hence, the variational equations (2.7), may be regarded as time-independent as long as L is autonomous. This property, that does not happen in the standard description, may have important consequences in analysing perturbations to optimal control problems 3 1 5 16 . At this point, it is worthwhile to emphasize four important features: 1. The function 7 : TD —> R plays the role of a new Lagrangian describing the original system plus its response to "virtual displacements" or perturbations. It therefore is also useful for studying stability 6 . 2. Equations (2.6) and (2.7) are invariant under arbitrary point transformations (i.e. changes of coordinates in D) : Xa = fa{x, t), a = 1 , . . . , 2N, where the fa are functions, fa : D -> D, assumed to be invertible (i.e. det(df(x)) 7^ 0, x € D) and at least C 1 . As happens in any Lagrangian description, this result is proved by the very existence of the variational principle (2.3); \ 4 , 1 7 . 3. The e-derivatives of 7 reduce to corresponding ^-derivatives of L
409 •^7- = ^ - , and — = — (2.11) ota oqa aea oqa 4. As a consequence of the previous property, 7 is a first-order homogeneous function of the virtual displacements ea and velocities ea
7 = p-*a + Piaoea 3
(2.12)
dea
Symmetries and constants of motion
One of the most important consequences of any Lagrangian description using a variational principle, is the close association between the symmetries of the Lagrangian and the existence of constants of motion 18 , that is, functions that are conserved along integral curves of the Lagrangian vector field. There is a control-theory version of such result, saying that every symmetry of a control system gives rise to a conservation law along biextremals 3 . This theorem makes the following results important for perturbed control systems. The association between symmetries and conservation laws is shared by the the function 7. The precise sense is the following: Any symmetry of L can be carried over to a symmetry of 7, where they imply the existence of constants of motion in the variational equations. Although a general proof can be given, we prefer to illustrate this result in what follows —though our discussion of what we have called inherited constants comes close to being an informal proof. 3.1
Invariance under time translations
If the Lagrangian of the system is autonomous, then there exist a well-known constant of motion 1, 4, H = ~qb - L. (3.1) oqb H can, in certain instances, be identified with the energy of the system. In such autonomous case, 7 is autonomous too and the variational equations thus admit an analogous constant of motion —which can be derived in an strictly similar fashion to H x , 4 , 6 — namely h = -~qb oqb
+ —h oeb
- 7.
(3.2)
410
Using definition (2.2), the constant h can be recasted as 9T
9T
aqb
deb
6
or, using (3.1), as
h = j—et + —eb. (3.4) dqb dqb If we interpret H as the energy, a related interpretation for h is that it is the first-order energy change in going from a solution to a displaced nearby one. 3.2
Invariance under space translations
The invariance of a system under space translations, the property frequently called by physicists the homogeneity of space, usually manifest itself in the independence of the Lagrangian on certain coordinates. When the Lagrangian does not depend on a specific coordinate qs, the coordinate is said to be ignorable (respect to the original Lagrangian L), then its conjugate momentum is conserved. That is, if 9L n -— = 0, dqs
then
dy dL . ps = -rr- = TTI- is a constant. des dqs
(3.5)
ltqs is ignorable in L, then, as shown in equation (3.5), it is also ignorable in 7, and e s is ignorable too, so the momentum ps —conjugated to eQ in 7— is conserved (as it should be since it is conserved in the original system!). Furthermore, qs is also ignorable in 7, hence the quantity (the momentum, TTS, conjugated to qs in 7) _ 07 d2L . , d2L ns = -— = eb + eb, dqs dqsoqb oqsdqb
(3.6)
is also a constant of motion in the variational equations. After having proved the existence of the constants h (equation 3.2) and •K (equation 3.6), it is worth formulating a more general result. As we show next such theorem follows from the close relationship between solutions to equations (2.6) and constants of motion in the variational equations (2.7). We call such first integrals, inherited constants of motion.
411 3.3
Inherited constants of motion
Let us consider any constant of motion, J(q, q), of the original set of equations (6). If we evaluate it on two nearby solutions of (6), q and q' = q+e, separated by the Jacobi field c, the difference, j(e, e) = J ( q ' , q') - J(q, q), between these constant quantities is also trivially a constant,
^ M
= | [ J ( q ' , q ' ) - J ( q , q ) ] = 0.
(3.7)
The constant, j[e], can be expressed as
•n
(dJ
+ £*.)•
0.8)
This is precisely the form of h in equation (3.4). Equation (3.8) tells us how, given both a solution of equations (2.6) and any one of its constants of motion, we can obtain a constant of motion in equations (2.7). Equation (3.8) can be directly proved to be a constant by computing its time derivative. This result establishes a direct relationship of constants in the variational equations, like j , to constants in the original Lagrangian. Related results are discussed in 7 , 1 9 . Furthermore, as in some non-linear evolution equations the constants of motion, J[q(t)], are functionals (and not functions) of the solutions of (2.6), we need to pinpoint that the constant j[c(t)] becomes a functional of the Jacobi fields e(t), which must be given thus by
W)\ = I<$2W, where 5J[q(t)]/5q(T) stands for the functional derivative of the constant functional J[q(t)} 2 0 2 1 . Having stablished the existence of the inherited constants (3.8) [or (3.9)], it should be clear that any symmetry of the original Lagrangian is reflected in the existence of another constant in the Jacobi variational equations. Thus, Noether's theorem also holds for the variational equations and can be used to reduce the dimensions of the variational system 6 ; 3 . Related results are discussed in 7 ; n .
412
4
Intrinsic formulation
In this section we formulate in an intrinsic manner our variational approach. First we consider an arbitrary vector field Y (at least C 1 ) defined on a manifold M and recall the constructions of the variational vector field T(Y) denned on the tangent bundle TM and the adjoint variational vector field T*(Y) defined on the cotangent bundle T*M. As it is well known 2 , 1 5 J 1 6 J 2 2 5 2 3 ) T*(YX) posseses a natural structure: it is a Hamiltonian vector field with respect to the the canonical symplectic form on T*M. On the other hand the variational vector field T(Y) does not have a natural Lagrangian structure due to the fact that the Hamiltonian for T* (Y) is linear in the momenta and so there is no natural Legendre transform TM —> T*M defined. Our main interest is in the case of M = TQ, i.e. the tangent bundle of a configuration space, and Y = YL, a Lagrangian vector field on TQ for the function L: TQ -> R. We recall that if L is not degenerate, the pull back of the canonical symplectic form wo in T*Q under the Legendre transformation C(L): TQ -> T*Q, UJL = C*GJ0, makes (TQ,UL) into a symplectic manifold and the Lagrangian vector field YL is Hamiltonian for the energy function H: TQ -> R. Also YL is a second order differential equation: TQ,YL =YL°T where TQ : TQ —> Q is the projection. From what was said in the first paragraph of this section, the adjoint variational vector field T*(YL), defined on T*(TQ), has a natural Hamiltonian form. We want to dilucidate what the structure of the variational vector field T(YL), defined on T(TQ), is. A naive guess is that T{YL) should be Lagrangian and the Lagrangian should be the prolongation 15 , 2 L: T(TQ) —» E of L: TQ -> R: L(£) = dL„{^), for £ e T(TQ). A moment of caveat shows that this cannot be true. In fact, taking coordinates (q,q,e,e) for T(TQ), and (q, q,p, n) in T*(TQ), the prolongation is given by dL, . .. dL dL — (q,q,e,e) = —{q,q)e+-jp-(q,q)e ana its ^egenare transiorm j:
^yj
P
>i
•(
dL dL ~ 8e ~ dq _dl de
(4.1)
_dL dq
but then the pull back of #o = pdq + ndq is
(4.2) (4.3)
413
— d — 6L = £(L)*90 = (poL)dq+{noL)dq=—dq+—dq dq dq
(4.4)
that is 91 = dL o TTQ, there TTQ : T(TQ) —>• TQ is the projection. This in turn implies d#^ = 0 (!). Thence the interpretation is untenable. 4-1
The variational and adjoint variational vector fields
For setting straight the formalism, let us recall briefly the appropriate construction: Let M be a manifold and let TM • TM -» M, T^ : T*M -» M be the tangent and cotangent bundles, respectively. Given a vector field on M, that is a section Y: M ->• TM, the variational vector field associated to Y, herein denoted by T(Y), is defined as follows: Given m G M there exists a neighborhood B of m and e > 0 such that the local flow $ ( : B$ (m) -> M is defined for * G ( - e , e). The lift T ( $ t ) : T M | f f - i ( B ) : 7r _1 (B) ->• TM then defines a local flow and its infinitesimal generator is the variational vector field T(Y). Similarly, the lifting to the cotangent bundle T*(
(4.5.a)
x = f(x), p = -pDf(x)
(4.5.6)
Actually the adjoint variational vector field (4.5.b) admits the Hamiltonian form
>--§£• where H(x,p)=pf(x).
(4.8)
This form is compatible with the standard symplectic form on T*M. As we mentioned earlier in this section, there is no natural Lagrangian structure for £• — 1 / ZJ\
(4.5.a) since the Legendre transformation (x,p) > (x,v) is not defined due to the linear dependence of H on the momenta (4.8).
414
4-2
Prolongation of a system,
There is a related concept of extending a given system known as its prolongation 3 , 15 , 16 . The motivation comes from the following argument: if the system is given by x = f(x), then by taking time derivatives along solutions one gets * = Df{x)x 2
x = D f(x)
= Df{x)f{x), • (Df(x),
(4.9)
f(x) + Df(x)
• (Df(x)
• f(x)),
etc. (4.10)
Considering the first prolongation x — Df(x)f(x), first order system
it can be written as the
± = V
(411)
v = Df(x)-f(x).
^-LL)
'
A natural question is if this system admits a Lagrangian structure. Proposition 4.1. Consider the system x = f(x) and suppose that J^- = -J^-, that is Df(x) be symmetric. Then the first prolongation of the system (4-11) is Lagrangian for the function L{x,v) = \\v\2 + \\f{x)\. The proof is elementary and will be omited.
(4.12) •
The symmetry condition can be rephrased in terms of the graph Gf = {(x, f(xf)
| x G Dom / } C Rn x (Rn)*
(4.13)
as being Lagrangian 2 4 , 15 (here the notation f(x)T means simply that f(x) is viewed as a covector). If / : M ->• N, its prolongation 3 1 5 is the map / : T(M) ->• T*(N) defined as follows: For £ £ TXM and a G T1,JM), let (,) denote the natural pairing, then
/(0(") = -
(4-14)
In local coordinates (x,v) e f x f for TM and (y,p) G ffi" x (W1)* for T*(N) the first prolongation is given by
415
V = f(x),
(4.15)
p = vTDf(x)T.
(4.16)
In the case of a function f:M->R, TjtM ~ R and the prolongation can be identified with the differential of the map df: TM -> R
f(L)=dfx(D4-3
(4.17)
Variational equations of Lagrangian systems
We now consider the case of M = TQ for some configuration space Q, and L: TQ - > E a non degenerate Lagrangian for the vector field Y on TQ. The prolongation is then the a map L: T{TQ) —> R. As was mentioned at the beginning of this section, the variational vector field T(Y) cannot be viewed as the Lagrangian vector field for the prolongation L in an obvious way since the pullback of the canonical 1-form in T*(TQ) to T(TQ) by the Legendre transform £(L) of the prolongation L is an exact differential. Instead we consider the D'Alambert configuration space consisting of pairs (q, e) where q is a given configuration and e is a "virtual displacement". More formally we consider a subbundle , D C Q then TD an be embeded in T(TQ) under the map (q,e,q,e) —> (q,q,e,e). Now view the prolongation as a map 7 : TD-*R, that is
l(q, e> Q, e) = -g- (V, ?)c + -QT (q, q)e
(4.18)
(in fact TD is the natural domain of definition of the prolongation, since the "virtual displacements" are defined by choosing the subbundle). Notice that the embeding a is well defined since the e's are elements of the tangent space TqQ, and that D is not necessarily a tangent bundle, that is, it is not necessarily an integrable distribution. Thus we can consider even holonomic or nonholonomic constraints in the specification of D. Theorem 4.2. Let w7 = C(L)*uio be the pullback of the canonical symplectic form on T*D under the Legendre transformation of the prolongation of L: TQ -> R, viewed as a map 7 : TD -¥ R. Then T(YL) viewed as a vector field on TD is a Lagrangian vector field for the energy function h: TD -> R. In local coordinates (q, e, q, e) for TD,
416
%,e,g>e)=0T«-|je,
(4-19)
which coincides with (3.4).
Proof. We carry on the proof in local coordinates. Let (q, e, q\ e) be coordinates in TD and (q, e, ix,p) coordinates in T*D. The canonical 1-form in T*D is given by 0O = ndq + pde. The Legendre transform £(7) is given by
TT = § ? ,
(4.20)
P = c*e 5 = <9q' ^-
(4-21)
d — = 1^J-dq+—de
(4.22)
then
ei
dq
dq
A straightforward computation shows t h a t d27 <927 927 1K I K fc A''dq.,dgfcdq-j-dq "" fc A " dqj ""^ +' % rfg % fc """ A ' ' dgj " ^ +' dekdqj . de "~^ "3 a27 927 327 + Q . Q . rfefc A dq, + -—x^dq f c A de., + (% A de.,-. oe*^ aqfcde.,0%^
(4.23)
Consider the second order equation (or spray)
j then
™=4+'!+c!+Dl
<4-24>
417
T(YL)jojy
=
d27 <927 {qkdqj - q.jdqk) - ». ». gj£% dqkdqj dqkdqj d27 . _,. 9e/fc0gj
527 . 9gfc9ej
J
927 g^efe* dekdqj
027 . j defcc%
d27 • ^ dftctej
927 - j . d27 ~ j 927 —ijdqk + a . Q. C'fcdgj + . , Cjferfej d^de, <%% <9<7/fc<9e <927 otkdqj (4.25) The differential of /i is JL
/ a27 \dqkdqj
J
a 2 7 J.
a27 ,
oqkdqj
dekdqj
a 2 7 J. . . dekoqj
2
(4.26)
9l .. ( &1 J d 7 ,.\ dj , + -^dqj dqk + ». „ dqk £j ^-dtj. dqj \dqkdej dqkdej J dti Equating coefficients of dek in the equation T(Yi,)i_w7 = — dh one gets
% [Wjqj
+
% {WJ Cj = Wk
(4 27)
-
by using the fact that Ck = q\ (here the dots means derivative with respect to time) wer recover the orignal Lagrange equations
! (£) - £• Equating coefficients of dik and dqk, sendous identities are obtained. Finally equating the coefficients of dqk lead to the linearized equations (2.7). D 5
Examples
A few examples will be presented where the above formulation can be applied. Geodesic flow. Here the Lagrangian is just the square of the length L = ^9abqaqb
(5.1)
418
where gab is the first fundamental form of the metric. The linearized equations are properly known as Jacobi equations or as equations of geodesic displacement. The prolonged Lagrangian is dgab . . , . . 7 = -£—£kqaqb + gabqa£boqk
,_ _. (5.2)
The Lagrange equations give the geodesic equations qa + ra&c<Mc = 0,
(5.3)
and the equations of geodesic deviation e'a = Rabcdqbqdtc,
(5.4)
where, Tabc is the affine connection or Christoffel's symbol, Rabcd is the Riemann tensor, and the dots stand in this case for derivatives respect the arc length s 8 2 5 . Newton's equations. Here
L = -mabqaqb - V(q),
(5.5)
where m a j, is a constant symmetric matrix. The prolonged Lagrangian is dV 7 = -^— eo + mabqaeb. oqa
(5.6)
The Lagrange equation
d ( dj \ _ dj dt\dh) ~ *••
(5 7)
'
leads to dV makqk = ~^—, oqk
(5-8)
419 which are Newton's equations. In the case of the solution being an equilibrium point with the configuration a minimum of the potential, these are the equations of small oscillations. Lagrange equation dt \dqkJ
aqk
then becomes makek + Kakek
= 0,
(5.10)
where d2V Kab = , — r - . oqaaqb
(5.11)
Plane motions in an arbitrary 2-potential. Choose a reference curve and let s = arc length, z = normal distance from the curve, and p = radius of curvature; then taking (s, z) as generalized coordinates the Lagrangian is 6,14,26
S2
V(z,s). (5.12) z2 + ^(p + z) P The linearized equations obtained by computing the prolonged Lagrangian yields ..
3 fs2
£ +
pd2V\
2h
+
< A7 3^h = 7-
/rnoN
(5 13)
'
Here is worth to say that we choose the "virtual displacements" es and e2 to be planar ones. The constant of motion h can be expressed as
/l=s
H7 + ^ J £ 2 + ^es-
(5 14)
-
We pinpoint that from equation (5.13) we can see the that a planar orbit is stable, against energy preserving perturbations (h = 0), only when the expression between parenthesis is positive 6 ' 2 6 . ? -Xi =- il then the Rotating Lagrangian systems. Let q = R{t)Q, RR~ Lagrangian of a mechanical system L(q,q) — h(q,q) - V(q) transforms into
420
L(Q, Q) = \ (Q, Q) + (SIQ, Q) - Vef (Q)
(5.15)
in a rotating frame, where Q is the rotation matrix and where the effective potential is Vef{Q) = V(Q) 4- -{XQ, Q),
1 = inertia matrix;
(5.16)
and (,) is the usual inner product in E". The prolonged Lagrangian yields the right linearized equations: Q + 2nQ + VVef(Q)
=0
e + 2Cle + Be = 0. where B =
(5.17) (5.18)
HessVef(Q).
Acknowledgements All authors contributed equally to this paper. The first author was partially supported by PAPIIT-IN122498, the second author was partially supported by Conacyt grant 32167-E. The third author ALSB wants to thank C. M. Arizmendi and G. Hentshel for very interesting conversations in Atlanta. Part of this work were done while visiting the Department of Physics, Emory University. The authors acknowledge with thanks the useful remarks of G. Sardanashvily of Moscow State University and the suggestions of E. Pina of UAM-Iztapalapa. Last but not least, HNNY and ALSB dedicate this work to the memory of their beloved friend M. Mec. References 1. V. I. Arnold,Mathematical Methods of Classical Mechanics (Springer, New York, 1978). 2. H. J. Sussmann, Lie Brackets, Real Analicity and Geometric Control. In Differential Geometric Control Theory, R.W. Brockett, R.S. Millman, H.J Sussmann, Eds., Progress in Mathematics (Birkhausser 1983). 3. H. J. Sussmann, Symmetries and integrals of motion in optimal control. In Geometry and Nonlinear Control and Differential Inclusions, A. Fryszkowski, B. Jakubczyk, W. Respondek, T. Rzezuchowski, Eds.,
421
4. 5.
6. 7. 8. 9. 10. 11. 12.
13.
14. 15.
16. 17. 18. 19.
Banach Center Publications, 32, (Institute of Mathematics of the Polish Academy of Sciences, Warsaw Poland, 379-393, 1995). L. D. Landau and E. M. Lifshitz, Mechanics (Pergamon, Oxford, 1976). I. C. Percival, Am. Inst. Phys. Conf. Proc. 57, 302 (1979); reprinted in R. S. Mackay and J. D. Meiss (compilators), Hamiltonian Dynamical Systems (Adam Hilger, Bristol, 367, 1987). H. N. Nunez-Yepez, and A. L. Salas-Brito, Phys. Lett, A275, 218-222 (2000). E. T. Whittaker, A Treatise on the Analytical Dynamics of Particles and Rigid Bodies (Cambridge University, Cambridge, fourth edition, 1989). M. P. Do Carmo, Riemannian Geometry (Birkhauser, Boston, 1992). D. E. Soper, Classical Field Theory (John Wiley, New York, 1976). B. Straughan, Explosive Instabilities in Mechanics (Springer, Berlin, 1998). K. M. Case, Proc. Nat. Acad. Sci. U.S.A. 8 1 , 5893 (1984). G. Giachetta, L. Mangiarotti and G. Sardanashvily, J. Math. Phys., 40, 1376 (1999); G. Giachetta, L. Mangiarotti and G. Sardanashvily, Differential geometry of time-dependent mechanics, arXiv: dg-ga/9702020 (Sections 4.7, 5.5); G. Giachetta, L. Mangiarotti and G. Sardanashvily, New Lagrangian and Hamiltonian Methods in Field Theory, Section 4.9, (World Scientific, Singapore, 1997); L. Mangiarotti and G. Sardanashvily, Gauge Mechanics, Section 5.11. (World Scientific, Singapore, 1998). B. Casciaro, M. Francaviglia and V. Tapia, The Second Variation and Jacobi Equations for Second-Order Lagrangians, preprint IC/95/37 (International Centre for Theoretical Physics, Trieste, Italia, 1995); B. Casciaro, M. Francaviglia and V. Tapia, Jacobi Equations as Lagrange Equations of the Deformed Lagrangian, preprint IC/95/38 (International Centre for Theoretical Physics, Trieste, Italia, 1995); B. Casciaro and M. Francaviglia, Nonlinear Anal, 30, 597 (1997). A. L. Salas-Brito, Am. J. Phys. 52, 1012 (1984). P. E. Crouch and A.J. van der Schaft. Variational and Hamiltonian Control Systems. Lecture Notes in Control and Information Sci. 101. (Springer-Verlag, 1987). H. Nijmeijer and A. J. van der Schaft, Nonlinear Dynamics Control Systems (Springer, 1990). E. C. G. Sudarshan and N. Mukunda, Classical Dynamics: A Modern Perspective (John Wiley, New York 1974). E. Noether, Nachr. Akad. Wiss. Gottingen, II, Math.-Physik. KL, 235 (1918). K. M. Case, Phys. Rev. Lett. 55, 445 (1985).
422
20. 21. 22. 23.
C. Nash, Relativistic Quantum Fields (Academic Press, New York, 1978). F. Riesz and B. Sz.-Nagy, Functional Analysis (Dover, New York, 1990). E. Pina, Contactos,in Spanish, 4, 57 (1991). J. E. Marsden. Lectures on Mechanics. London Math. Soc. Lect. Notes Ser. 174, (Cambridge University Press, 1992). 24. R. Abraham and J. E. Marsden. Foundations of Mechanics (Benjamin/Cummings, New York, 1978). 25. S. Weinberg, Gravitation and Cosmology (Wiley, New York, 1972). 26. S. Chandrasekhar, Principles of Stellar Dynamics, Appendix 2 (University of Chicago Press, Chicago, 1942).
423
CONTROL OF T H E H O V E R C R A F T VESSEL: A FLATNESS PLUS S E C O N D O R D E R SLIDING M O D E CONTROL APPROACH HEBERTT SIRA-RAMiREZ Centro de Investigation y Estudios Avanzados del Instituto Politecnico National, (CINVESTAV-IPN), Departamento Ingenieria Electrica, Avenida I.P.N. # 2508, Col. San Pedro Zacatenco, A.P. U-740, 07300 Mexico D.F., Mexico E-mail: hsira ©mail, cinvestav. mx A suitable combination of the differential flatness property and the second order sliding mode controller design technique is proposed for the specification of a robust dynamic feedback multivariable controller accomplishing prescribed trajectory tracking tasks for the earth coordinate position variables of a hovercraft vessel model
1
Introduction
Differential flatness is a structural property which makes a dynamic system (whether linear, non-linear, monovariable or multivariable) equivalent to a linear controllable system under endogenous feedback i.e. one which only requires internal variables to be synthesized. Differential flatness was first introduced from the viewpoint of differential algebra in the work of Fliess et al 6 . It soon became apparent, however, that a more general approach was possible from the viewpoint of differential geometry of infinite jet spaces. The geometric approach using differential varieties, or "diffieties", naturally views a dynamic controlled system as a controlled Cartan field (a diffieity) and system transformations, leading to equivalence, are then cast into the framework of Lie Backlund transformations (see the work of Fliess et al 7 ) . Higher order sliding modes appear as a natural generalization of first order sliding modes, thoroughly studied in the work of Utkin 21 , and of a vast, and still growing, list of authors (see the survey prepared by Professor Emely'anov 4 ). The basic idea, first proposed by Emely'anov and his coworkers, 3 , in the context of "second order sliding modes", is to impose, on a certain auxiliary stabilizing output differential function -or suitably defined tracking error- of arbitrary, but well defined, relative degree, the dynamic behavior of a higher order discontinuous (sliding) dynamics. The chosen sliding dynamics, usually of order higher or equal than two, is such that its trajectories globally, and robustly, converge towards the origin of phase coordinates in finite time.
424
This important robustness characteristic of the chosen "higer order sliding algorithm" is usually guaranteed to be preserved even in the presence of, unmodelled, absolutely bounded disturbances. The reader is referred to the works of Levant n , 12 , Fridman and Levant 9 , for interesting details, extensions, and generalizations of the second order sliding mode control idea. In the context of uncertain systems, the reader may also benefit from the contents in the recent articles by Bartolini and his coworkers 1,2. The article by Sira Ramirez et al20 contains an application example of second order sliding modes to the stabilization of the popular "TORA" system. In this article, we present an application of the differential flatness property in the controller synthesis of a nonlinear multivariable model of a hovercraft vessel. We propose a robust dynamic feedback control scheme for the hovercraft system based on off-line trajectory planning and dynamic feedback auxiliary trajectory tracking error stabilization to the origin of its phase space coordinates based on second order sliding mode contro. For both, the trajectory planning and the feedback controller design aspects, use is made of the fact that, contrary to the general surface vessel model 8 , the hovercraft system model is indeed differentially flat. The flat outputs are represented by the hovercraft position coordinates with respect to the fixed earth frame. The system is shown to be equivalent, under endogenous dynamic feedback, to two fourth order, independent, controllable linear systems in Brunovsky's form. The flatness of the hovercraft model was established in 19 . Section 2 revisits the hovercraft vessel model derivation performed in 5 , taking as the starting point the fully actuated, though simplified, ship model found in 8 and also in 13 . In section 2, it is shown that the obtained hovercraft system model is differentially flat. In Section 3, we pose the trajectory tracking problem and derive a robust dynamic feedback controller based on flatness and second order sliding modes. These modes are induced on a set of independent auxiliary polynomial differential functions of the flat outputs tracking errors. Section 4 contains the simulation results for a typical trajectory tracking maneuver and Section 5 is devoted to some conclusions and suggestions for further research. 2
The Hovercraft Model
The regulation of a ship vessel, by means of two independent thrusters located at the aft, has received sustained attention in the last few years. Reyhanoglu 17 uses a discontinuous feedback control law for exponential stabilization towards a desired equilibrium. A feedback linearization approach was proposed by Godhavn 10 for the regulation of the position variables. The scheme, how-
425
ever, did not allow for orientation control. In an article by Pettersen and Egeland 13 , a time-varying feedback control law is proposed which exponentially stabilizes the vessel state towards a given equilibrium point. Timevarying quasi-periodic feedback control, developed in Pettersen and Egeland 14 , has been proposed taking advantage of the homogeneity properties of a suitably transformed model achieving simultaneous exponential stabilization of the position and orientation variables. An interesting experimental setup has been built which is described in the work of Pettersen and Fossen 15 . In their work, the time-varying feedback control, used by 13 , is extended to include integral control actions, including excellent experimiental results. High frequency feedback control signals, in combination with averaging theory and backstepping, have also been proposed by Pettersen and Nijmeijer 16 , to obtain practical stabilization of the ship towards a desired equilibrium and also for trajectory tracking tasks. In 18 the ship trajectory tracking control problem was examined from the perspective of Liouvillian systems (a special class of non-differentially flat systems, i.e. systems which are not equivalent to linear controllable systems by means of endogenous feedback). The hovercract model we use is based on the recent work of Fantoni et al 5 where the vessel's dynamics is derived on the basis of the underactuated ship model extensively studied by Fossen 8 . In 5 , a series of interesting Lyapunovbased feedback controllers are derived for the stabilization and trajectory tracking of the hovercraft system. In the book by Fossen 8 the following model is proposed for a rather general surface vessel dynamics, v + C(y)v + Dv =
T
(2.1) where
o C(y) =
0 1TI22V
0 —rri22V 0 miiu -miiu 0
J(v) =
cos xjj — sin ip 0 sin ip cos I/J 0 0 0 1
(2.2)
with M = diag {mii,m 2 2,m 3 3}, D = diag {^11,^22,^33}
(2.3)
The vector v = [u,v,r]T denotes the linear velocities in surge, sway, and angular velocity in yaw. The vector r) = [x,y,ip] denotes the position and orientation in earth fixed coordinates. The vector r = [TI,T2,T 3 ] denotes the control forces in surge and sway and the control torque in yaw. The matrices
426
C(u) and D represent, respectively, the Coriolis and centripetal forces and the hydrodynamic damping. Consider the simplified version of the underactuated hovercraft shown in Figure 5. A model for such symmetric vessel can be directly derived, as already done in Fantoni et al 5 , from equations (2.1)-(2.3) by enforcing the following simplifying assumptions:
m n =TO22,n = mUTu,T2
=0,T3
= m 3 3 r r , dii = d33 =0,0
=
(2.4) m22
We thus obtain the following model of the underactuated hovercraft vessel system,
x = u cos ip — v sin i\) y = u sin ip + v cos ip xj) = r ii — vr + Tu v = — ur — /3v r = Tr
2.1
(2.5)
Differential flatness of the hovercraft system
We have the following proposition Proposition 2.1. The model (2.5) is differentially flat, with flat outputs given by x and y i.e., all system variables in (2.5) can be differentially parameterized solely in terms of x and y, as
427
w = arctan
— \x + j3xj x(x + fix) + y(y + (3y) u— y/(x + px)2 + (y + py)2 yx — xy V
=
r = Tu
=
^{x + fix)2 + (y + Py)2 yW(x + fix) - x(3\y + Py) + P2 (xy - yx) (x + P±)2 + (y + Py)2 x(x + fix) + y(y + Py) y/(x + px)2 + (y + Py)i 2/W (x + fix) - x^\y
+ Py) + P (y^x - x^y) (x + fix)2 + (y + py)* — [y(z\x
[(£ + px)2 + (y + PyYY [{x + px)(x^
+ fix) - x^{y
- P2 (x^y
-
y^x)
+ py) - P2(xy - yx)]
+ px) + (y + Py)(yW + py)} (2.6)
Proof. From the first two equations in (2.5) we readily obtain v = y cos ip — x sin ip u = xcosip+ ysinip
(2.7)
Differentiating the first two equations in (2.5) with respect to time, yields, after use of (2.5) and (2.7), x = u cos ip — uip sin ip — v sin ip — vip cos ip = TU cos if) + Pv sin ip y = u sin ip + uip cos ip + v cos ip — vip sin ip =
TU
sin ip — Pv cos ip
(2.8)
Multiplying the first equation in (2.8) by sinV> and the second equation by cosV> and then subtracting the obtained expressions we obtain, after use of (2.5), x sin ip — y cos ip = Pv
(2.9)
428
Similarly, multiplying the first equation in (2.8) by cos^ and the second by sint/> and adding, we obtain TU = x cos ip + y sin rp
(2-10)
Substituting the first of (2.7) into (2.9) one obtains, after some further algebraic manipulations tan V = t ^ r
- • if, = arctan (£±M)
x + Px
T
\x + /3xj
(2.11) K
'
Using (2.11) in (2.7) we obtain, y {x + P±) ~ x (y + Py) ^(x + f3x)2 + (y + py)2
yx - xy •\2 J(x + 0xf + (y + Py)
(2.12)
and _ U
~
i(x + fix) + y(y + fiy) y/(x + /3±)2 + (y + py)2
(2.13)
Substituting in (2.10) the value of ip, obtained in (2.11), leads to the expression for the force input, r u , given in the proposition. Finally, we make use of the fact that r = tp and rT=^>.
•
2.2
Invertibility of the control parameterization
The differential parameterization of the input torque rr depends on the flat outputs, and their time derivatives, up to the fourth order. Note, however, that the corresponding parameterization of the control input r„ only depends up to the second order time derivatives of x and y . This simple fact clearly reveals an " obstacle" to achieve static feedback linearization and points to the need for a second order dynamic extension of the control input r„ in order to exactly linearize the system. Use of (2.5) allows the following (simpler) expressions for the control inputs r r and T„, in terms of the system's state variables, the highest order derivatives of the flat outputs x and y, and a first order extension of the control input r„. rr =
?/(4'
fu = z
COST/'
- x^ sinV' - /3rru — 2rfu — 2j3r2v — /32ur + (33v (2-14) PU + TU
(4)
cos ip + y
(4)
sin ip + 2/3ur2 + 2/32rv - /3vrr + r2ru
(2.15) (2.16)
429 Using (2.6) we have that, fiu + TU = y/(x + fix)2 + (y + f3y)2. Clearly, we are interested in maneuvers for which this quantity is bounded (which is physically reasonable and natural) and it is also bounded away from zero (which somehow limits the class of desired trajectories). Assumption 1. We assume that the positive quantity, s/(x + fix)2 + (y + Py)2, is uniformly bounded by a constant for all times, and it is nowhere identically zero along the evolution of the system. The previous assumption specifically precludes us from considering trajectories that either lead, or contain, a resting point for the earth position coordinates, x, and y, of the hovercraft vessel. These may be treated by time reparameterizations, or "control of the clock" techniques (see 7 ) which are outside the scope of this paper. On the other hand, straight lines, circles, and many other types of trajectories, which are to be followed at constant speeds, can be handled by the method here proposed. Let £ and <j> denote two independent auxiliary control inputs. Under the above assumption, the, locally defined, input coordinate transformation Tr =
TU
<j> cos ip — £ sin i/> — /3rru — 2rtu — 2f3r2v — fi2ur + f3zv
= £ cos ip +
PVTT
+ r2Tu (2.17)
yields the following transformed system x< 4 )=£,
y
W=0
The hovercraft system is thus equivalent, under endogenous feedback, to a set of two independent linear systems in Brunovsky's controllable canonical form. 3 3.1
Trajectory Tracking for the Hovercraft System The unperturbed case
Suppose a desired trajectory is given for the position coordinates x and y in the form of specified time functions: x*(t) and y*(t), respectively. The following proposition gives a dynamic feedback solution to the trajectory tracking problem based on flatness and exact tracking error linearization through imposition of second order sliding modes on stable differential polynomials of the tracking errors. Proposition 3.1. Let the set of constant real coefficients {a2,cti,ao} and {72,7i,7o} represent independent sets of Hurwitz coefficients, so that the
430
polynomials in the complex variable s, p(s) = s2 + a2s + cti,
q(s) = s2 + j2s + 71
have symmetric roots located in the left portion of the complex plane. Let x* (t) and y*(t) de a given a set of desired trajectories for the position coordinates x and y which satisfies assumption 1. Associated with the polynomials, p(s) and q(s), define the following two auxiliary differential functions of the position tracking errors: p = (x - x* (t)) + a2(x - x* (t)) + an(x - x*(t)) r) = (y-y*
(*)) + l2(y-
y* {t)) + 71(2/ - »'(*))
Then, for any set of real parameters Ap, Bp and Av, Bv such that Ap > Bp and Av > Bn, the following dynamic second order sliding feedback controller 4> cos ip — £ sin ip - /3rru — 2rtu — 2/3r2v — fi2ur + /33v (3-1)
rr = Tu-r2Tu
= £cosxp + (j> sin ip + 2^ur2 + 2j32rv - Pvrr w
{3)
(3)
e = x* (t)-a2(x -x* (t))-ai(x-x*
(3.2) (*))
- \ sign{p)[Ap + Bp + {Ap - Bp)sign(p p)} 4> = y*(i)(t)
-
72 (2/
(3)
- y*i3)(t))
- 71(2/ - y* (t))
- \ sign(r))[Av + Bv + (A„ - Bv)sign(n V)] with x = u cos ip — v sin %l> y = u sin ip + v cos ip x = flv sin xp + TU cos ip y= x
(3)
TU
sin ip — j3v cos ip
= - [r (/3u + TU) + P2v\ sin xp + (Prv + fu) cos xp
t/(3) = [r (f3u +
TU)
+ /32v] cos xp + (/3rv + fu) sin xp
semi-globally stabilizes the auxiliary tracking errors p and r] and their first order time derivatives p, 1) to zero, in finite time. As a consequence, the trajectory tracking errors, ex = x — x*(t) and ey = y — y*(t) exponentially asymptotically converge to zero.
431
Proof. Subtracting the controller expression, for fu in (3.1), from the open loop expression in (2.15) we obtain, after some algebraic manipulations, P +-sign(p)[Ap
+
+ BP + {Ap - Bp)sign(p p)] cosip
V +-sign(TJ)[Ar, + Bv + (Av - Bn)sign{r) V)] sinV = 0
(3.3)
Proceeding in a similar fashion with respect to the corresponding closed and open loop expressions for r r , one finds:
+
P +-sign(p)[Ap
+ Bp + (Ap - Bp)sign(p p)] sin \j)
rj +-sign{r])[Av
+ Bn + {An - Bn)sign{r] V)} cosi/' = 0
(3.4)
Then, clearly, the tracking errors functions p and rj satisfy the ideal second order sliding dynamics P +^sign{p)[Ap
+ BP + {Ap - Bp)sign{p p)] = 0
t) +-sign(r))[An
+ Bn + (A„ - Bn)sign{r) i)] = 0
(3.5)
As a consequence, the variables p and r) , as well as their corresponding time derivatives, p and »7, converge to zero in finite time (see 20 for a proof of the convergence to the origin, in finite time, of the trajectories generated by following asymptotically exponentially stable dynamics on the flat outputs tracking errors, (x - x* (t)) + a2(x - x* (t)) +ax{x-
x*(t)) = 0
(y - v* (0) + 72(^ - v* (t)) + 7i(y - y* (0) = o
3.2
The perturbed case
A simple tracing of the influence of unmodelled perturbations, in the open loop system (2.5), reveals that a hypothesized perturbation in either the surge velocity equation, or the sway dynamics, affects all the state variables of the system, except the orientation angle tp. On the contrary, a similar perturbation affecting the yaw rate dynamics, propagates to all of the states in the system. We consider first the latter case.
432
Consider an unmodelled matched perturbation input, c(i), which is absolutely uniformly bounded by a strictly positive constant S, i.e. | <;(t) \ < S V t. This perturbation affects the ship's yaw rate dynamics in the form: x = u cos ip — v sin ip y = u sin ip + v cos ip ip = r u = vr + TU v = —ur — (3v r = rr+ <;(t)
(3.6)
The input coordinate transformation (2.14), (2.15) on the perturbed system (3.6) results now in the following set of perturbed Brunovsky canonical forms x ( 4 ) = £ - q(t) [(/3u + TU) sin ip + 0v cos ip], 2/(4) = (f> + q(t) [(fin + TU) cos ip + fiv sin ip]
(3.7)
Evidently, from assumption 1, and for absolutely uniformly bounded surge and sway velocities u and v, the perturbation terms affecting the transformed system are also absolutely uniformly bounded. a Thus, for some strictly positive constant parameters Q and R, the perturbed transformed system can be assumed to be of the form: x^ = £ + m(t), y^ = <j> + n(t) with I m(t) | < Q and | n(t) \ < R, for all t. Following the same steps in the proof of the previous proposition, we find that the perturbed version of the dynamics of the auxiliary tracking error functions p and TJ are now governed by, P +-^sign(p)[Ap + BP + (Ap - Bp)sign(p ij +-sign{rf)[An According the perturbed vided, Bp < R < m.m{Bn,
P)] = m(t)
+ Bn + (A„ - B„)sign(ri V)] = n(t)
(3.8)
to the robustness results of second order sliding modes 2 0 , evolutions of p and rj converge to zero in finite time, proAp, with Q < min{Bp, (Ap - Bp)/2} and Bn < An, with (Av — B^/2}. This implies that for a set of suitable controller
"Note that the sway velocity dynamics is a linear time-invariant dynamics, with a strictly negative eigenvalue, excited by the product of the surge velocity, u, and the yaw rate, r. Since it is physically plausible to assume that these two velocities are absolutely uniformly bounded, then it follows that the absolute value of the sway velocity is also uniformly bounded.
433
parameters, A^,B^, and for an absolutely uniformly bounded perturbation input signal, s(t), the trajectory tracking errors, x -x*(t), and y - y*(t), still asymptotically exponentially converge to zero in spite of the influence of the perturbations. A similar conclusion can be reached for the case of unmatched perturbations affecting the non-actuated sway velocity dynamics. For absolutely uniformly bounded perturbations \{t) with similarly bounded first order time derivatives, acting on the non-actuated sway velocity equation, as v = -ur — /3v + X(t), the input coordinate transformation (2.14), (2.15) yields the following perturbed Brunovsky canonical forms:
x(
4
) = £ - r [2A(t) - /3A(i)] cos^ + [/3A(*) + (r 2 - 0 2 )A(t)] sin V
yW
= cj) - \p\(t) + (r 2 - {32)\(t)\ cos V - r h\(t)
- /3A(t)] sinV'
The perturbations affecting the right hand sides of the Brunovsky forms are evidently absolutely bounded for bounded yaw rates, r, and absolutely bounded perturbation inputs, X(t), with an absolutely bounded first order time derivative. Thus, expressions similar to (3.8) are also valid. In the simulations presented below, we test the proposed nominal dynamic second order sliding mode controller of Proposition 3.1 with an unmatched perturbation input signal of the form just discussed. 4
Simulation Results
Simulations were carried out to evaluate the performance of the proposed dynamic feedback controller for a rather common trajectory tracking task: The tracking of a circular trajectory, defined in the earth fixed coordinate frame, of radius R, centered around the origin. 4-1
Tracking a circular trajectory
A circular trajectory, or radius R, is to be followed in a clockwise sense in the plane (y,x), with a given constant angular velocity of value w. In other words, the flat outputs are nominally specified as, x*(t) = R coswt,
y*(t) = R smut
(4.1)
For this particular choice of x and y, the nominal orientation angle ip* (t)
434
is given by ip*(t) = arctan
^-.
= arctan(tan(wt - 0)) = cut - 0 (4.2)
with 0 = arctan(/3/w). The nominal surge and sway velocities and the nominal yaw angular velocity are given, according to (2.7) and the fact that r = ip, by the following constant values u*(t) = -Rusin6,
v*(t) = RuicosO,
r*(t) = UJ
(4.3)
Similarly, using (2.10) and the fact that r r = xjj we obtain that the nominal applied inputs are given by the following constant values T*
(t) = -Ru2
cos 0,
T*
{t) = 0
(4.4)
Note that for the chosen trajectory, the nominal value of the quantity /3w + r u , appearing in the denominator of the controller expression for rr, is given by PU + TU = RU(LJ cos 0 + /3 sin 0) = RUJ^J(32 + UJ2 ^ 0
The only system parameter 0 was set to be /3 = 1.2. We have chosen the following parameters for the circular reference trajectory R=10,
UJ =
0.1,
which result in 0 = 1.4876 rad, T* = -8.304x 1 0 - 3 . The controller parameters were set to be: a2 = 72 = 1.4142, Qi = 7i = 1, Ap = An = 0.2, Bp = Bn = 0.05 Figure 5 depicts the controlled evolution of the hovercraft position coordinates when the vessel motions are started significantly far away from the desired trajectory. Figure 5 shows the corresponding surge, the sway, and the yaw angular velocities. Figure 5 depicts the applied external inputs, ru(t), Tr(t).
4-2
Robustness with respect to unmodelled, unmatched,
perturbations
In order to test the robustness of the proposed controller, we introduced in the non-actuated dynamics (i.e., in the sway acceleration equation) an unmodelled external perturbation force, simulating a rather strong "wave field" effect, of the form:
\{x{t)) = L s'm(fx(t)) + -
5
COS(TT
ur - /3v + X(x(t))
fx(t)) •
(
*
-
435
with L = 0.15 and / = 10. The parameters of the second order sliding dynamics and the auxiliary function p were set to be, A = 0.2, B = 0.05, Qi = 7i = 1.414, a0 = 70 = 1 In spite of the unmatched nature of the perturbation signal, the proposed dynamic feedback controller, using the same controller parameters used before, efficiently corrects the undesirable deviations, due to the persistent perturbation, and manages to accomplish the trajectory tracking task with satisfactory precision. 5
Conclusions
In this article, we have illustrated how the property of differential flatness can be advantageously combined with the robustness and simplicity of higher order sliding modes. We have carried out this combined controller design option in the context of the trajectory tracking regulation of an underactuated hovercraft system model, derived through some simplifying assumptions from the general surface vessel model. This model is shown to be differentially flat. The flatness property immediately allows to establish the equivalence of the model, by means of dynamic state feedback, to a set of two decoupled controllable linear systems. A trajectory planning, combined with a second order sliding mode trajectory tracking scheme , allows to obtain a direct feedback controller synthesis for arbitrary position trajectory following. The design was shown to be robust with respect to significant perturbation input forces even when they affect the non The characteristics, and simplicity, of higher order sliding mode controllers, beyond those of the second order type treated here, seem to be a natural, and remarkably robust, alternative for the efficient regulation and trajectory tracking tasks of perturbed nonlinear systems which are nominally differentially flat. The more difficult problem of hovercraft position regulation towards trajectories that include a resting equilibrium point is the object of ongoing research by many authors. The problem certainly deserves attention from the flatness viewpoint using time-reparameterizations, and other suitable techniques. Acknowledgments The author has greatly benefitted from discussions with Dr. G. Silva-Navarro of CINVESTAV-IPN and also from e-mail discussions with Dr. Arie Levant
and Professor L. Pridman. Their criticism and encouragement is gratefully acknowledged. This research was supported by the Centro de Investigation y Estudios Avanzados del Instituto Politecnico Nacional (CINVESTAV-IPN), Mexico and by the Consejo Nacional de Ciencia y Tecnologia (CONACYT) under Research Contract No. 32681-A. The author is on leave of absence from the Departamento de Sistemas de Control of the Universidad de Los Andes, Merida-Venezuela. References 1. G. Bartolini, P. Pidynowsky, An improved chattering free VSC scheme for uncertain dynamic systems IEEE Trans, on Auto. Contr., 4 1 , 1220-1226 (1996). 2. G. Bartolini, A. Ferrara, A. Levant, E. Usai, On second order sliding mode controllers, in Variable Structure Systems, Sliding Mode and Nonlinear Control, K.D. Young, and U. Ozgiiner (Eds). Lecture Notes in Control and Information Sciences, 247, 329-350. (Springer-Verlag, London, 1999). 3. S.V. Emel'yanov, S.K. Korovin and L.V. Levantovsky, Higher order sliding modes in the binary control systems, Soviet Physics, Dokady, 3 1 , 4, 291-293 (1986). 4. S. V. Emelyanov, Titles in the theory of variable structure control systems (International Research Institute for Management Sciences, Irmis, Moscow, 1990). 5. I. Fantoni, R. Lozano, F. Mazenc and K. Y. Pettersen, Stabilization of a nonlinear underactuated hovercraft, Int. Journal of Robust and Nonlinear Control, (to appear). 6. M. Fliess, J. Levine, Ph. Martin and P. Rouchon, Flatness and defect of nonlinear systems: Introductory theory and examples , International Journal of Control, 61 , 1327-1361 (1995). 7. M. Fliess, J. Levine, Ph. Martin and P. Rouchon, A Lie-Backlund approach to equivalence and flatness, IEEE Transactions on Automatic Control, 44, 5, 922-937 (1999). 8. T. I. Fossen, Guidance and Control of Ocean Vehicles, (John Wiley and Sons Ltd. Chichester, 1994). 9. L. Fridman, A. Levant, Higher order sliding modes as a natural phenomenon in control theory, in Robust Control via Variable Structure and Lyapunov Techniques, F. Garofalo and L. Glielmo (Eds.). Lecture Notes in Control and Information Science, 217, 107-133 (Springer-Verlag, Lon-
437
don, 1996). 10. J. M. Godhavn, Nonlinear tracking of underactuated surface vessels, in Proceedings of the 35th IEEE Conference on Decision and Control. Kobe, Japan, 987-991. 11. A. Levant, Sliding order and sliding accuracy in sliding mode control, International Journal of Control, 58, 6, 1247-1263 (1993). 12. A. Levant, Controlling output variables via higher order sliding modes, in Proc. of the European Control Conference ECC'97, CD Rom version (Brussels, 1-4, June 1997). 13. K. Y. Pettersen and O. Egeland, Exponential stabilization of an underactuated surface vessel, Proceedings of the 35th IEEE Conference on Decision and Control, Kobe, Japan, 967-971 (1996). 14. K. Y. Pettersen and 0 . Egeland, Robust control of an underactuated surface vessel with thruster dynamics, in Proceedings of the 1997 American Control Conference (Albuquerque, New Mexico, 1997). 15. K. Y. Pettersen and T. L. Fossen, Underactuated ship stabilization using integral control: Experimental results with Cybership I, in Proceedings of the 1998 IFAC Symposium on Nonlinear Control Systems, NOLCOS'98 Enschede, (The Netherlands, July 1998). 16. K. Y. Pettersen and H. Nijmeijer, Global Practical Stabilization and Tracking for an underactuated Ship- A Combined Averaging and Backstepping Approach, Proceedings of the 1998 IFAC Conference on Systems Structure and Control, Nantes, (France, 59-64, July 1998). 17. M. Reyhanoglu, Control and stabilization of an underactuated surface vessel, in Proceedings of the 35th IEEE Conference on Decision and Control, (Kobe, Japan. 2371-2376). 18. H. Sira-Ramirez, On the control of the underactuated ship : A trajectory planning approach , 37th IEEE Conference on Decision and Control, Phoenix, AZ, December 1999, (to appear). 19. H. Sira-Ramirez and C. Aguilar-Ibanez, On the control of the hovercraft system, Dynamics and Control, (accepted for publication, to appear). 20. H. Sira-Ramirez, A. Levant and L. Fridman, Regulation of the RTAC system via flatness and second order sliding, in Proc. of the Variable Structure Systems Workshop, Rockhampton, Australia, December 2000, (to appear). 21. V. I. Utkin, Sliding Modes and their Applications in the Theory of Variable Structure Systems (MIR Editors, Moscow, 1977).
Figure 1. The simplified hovercraft system
5
0
•5-
-10•
|
•10
•
|
-5
.
|
0
--T
|
5
,
p
10
Figure 2. Feedback controlled position coordinates for circular path tracking
439 0.5n
U(t)
0.0: •0.5: -1.0: •1.5+
0
20
40
60
80
100 120
time
v(t)
0.2 0.1 0.0 0
20
40
60
100
120
time 0.5 0.0 -0.5 -1.0
r(t) 0
20
40
60
80
100 120
time Figure 3. Feedback controlled velocity variables for circular path tracking
0.1
rt i\
-0.1
w U
-0.2-
0
^
V
20
40
60
80
100
120
80
100
120
time 1.50.0
-1.5
w rM
•3.0•4.5-600
20
40
60
time Figure 4. Applied control inputs for circular path tracking
440
-10 -5 0
5 10
0 20 40 60 80100120
0 20 40 60 80100120
time
time 1
4 r(t)
0.5|
W
0 20 40 60 80100120
0 20 40 60 80100120
0 20 40 60 80100120
time
time
time 2i
0.1 0.0!
r-4Jr~
-0.1 •0.2
0 20 40 60 80100120
0 20 40 60 80100120
time
time
Figure 5. Circular path tracking performance under unmodeled sustained perturbations
441
O P T I M A L I T Y OF S I N G U L A R T R A J E C T O R I E S A N D A S Y M P T O T I C S OF ACCESSIBILITY SETS U N D E R GENERIC ASSUMPTIONS E. T R E L A T Universite
Paris Sud,
France
We investigate minimization problems along a singular trajectory of a single-input affine control system with constraint on the control, and then as an application of a sub-Riemannian system of rank 2. Under generic assumptions we get necessary and sufficient conditions for optimality of such a singular trajectory. Moreover we describe precisely the contact of the accessibility sets at time T with the singular direction. As a consequence we obtain in sub-Riemannian geometry a new splitting-up of the sphere near an abnormal minimizer 7 into two sectors, bordered by the first Pontryagin's cone along 7, called the L°°-sector and the L 2 -sector.
1 1.1
Introduction Statement of the problems
Consider a control system on R n : xu(t) = /(*(*),«(*))> x«(0) = x0 m
n
(1)
n
where / : R " x M —> H is smooth, x0 € H , and the set of admissible controls U is made of measurable bounded functions u : [0, T(u)] —• fl C R m . Let f° : B n x ]Rm —• IR be a smooth function, T > 0, and set CT{u) = f0 f°(xu(t),u(t))dt : it is called the cost of the trajectory xu associated to the control u on [0, T]. Definition 1.1. Let T > 0. The end-point mapping at time T of system (1) is the mapping ^
. U —> R" ' u>-+xu{T)
where the trajectory associated to u. Definition 1.2. A control u on [0,T] (or the corresponding trajectory xu) is said to be singular if it is a singularity of the end-point mapping ET, that is if there exists a non trivial vector xjj in E, n such that ip.dEr(u) = 0. Definition 1.3. Let u be a singular control on [0,7"]. The subspace Jm dEr{u) is called the first Pontryagin's cone along u (or along xu). The control u is said to be of corank 1 if Im dEx{u) has codimension 1 in R n , that is if ip is unique up to a scalar.
442
1.2
Optimization problems
Let 7 be a solution of system (1) such that 7(0) = x0, -y(T) = x\. problem is the following :
The
Among all solutions of system (1) steering x0 to x\, is 7 minimizing the cost ? Here we consider a single-input affine system with constraint on the control : x(t) = f0(x(t))+u(t)f1(x(t)), x(0) = x0
|«(t)|
Let 7 be a reference singular trajectory of this system, associated to a corank 1 control u, and let ip G K n \ { 0 } such that ip.dEr(u) = 0. We suppose that u = 0. Definition 1.4. • If f0 and [/i,[/i,/o]] are on the same side with respect to the first Pontryagin's cone along 7, then 7 is said to be elliptic. • If they are on opposite sides, 7 is called hyperbolic. • /f/o 6 Im dEr(u) along 7, then 7 is said to be exceptional. Actually due to the well-known Legendre-Clebsh condition, elliptic trajectories are never time-minimizing. The basic object we have to study is the so-called intrinsic second-order derivative : Definition 1.5. The intrinsic second-order derivative along 7 is the real quadratic form : E';(v)=rp.d2ET(u).(v,v) where v € Ker dEr(u). Roughly speaking, if the latter quadratic form is positive (or negative) definite" then 7 is locally isolated and thus locally optimal. Conversely if 7 is optimal then E'^ is positive (or negative), see 5 . Actually this reasoning works for hyperbolic trajectories (see 1 8 ). For exceptional trajectories the situation is a bit more complicated, and we have to study the intrinsic second-order derivative on a larger domain (see the timex input/state mapping in 5 , reduced operator in 7 ) . a
A real quadratic form q(x) is said positive definite if x ^ 0 = > q{x) > 0, and if there exist i , t / / 0 such that q(x)q(y) < 0.
indefinite
443
1.3
Accessibility sets
Definition 1.6. Consider the general control system (1), and let T > 0. The accessibility set at time T, denoted by Acc(T) is the set of points that can be reached from XQ in time T by solutions of system (1), i.e. this is the image of the end-point mapping ETLet 7 be a reference trajectory on [0,T], solution of (1), associated to a singular control u. Using the formalism of 7 we are able to describe precisely the boundary of Acc(T) along 7 for the single-input affine system (2). 2
Optimality of singular trajectories
Consider the single-input affine control system with constraint on the control: x = X + uY(x),
\u\
(3)
Let 7 be a reference singular trajectory, defined on [0, T] and such that 7(0) = xo- If 7 is injective we may assume that it is associated to the control u = 0. In the sequel we make the following assumptions along 7 : (Ho) 7 is injective, associated to u = 0 on [0, T]. (Hx) Vt € [0,T] K(t) = Vect {adkX.Y(j(t)) / k 6 N } (first Pontryagin's cone along 7) has codimension 1, and is spanned by the first n —1 vectors : K(t) = Vect {adkX.Y(-f{t)) (H2) Vt e [0,T]
ad2Y.X(^(t))
/ k =
0...n-2}
$ K(t).
(H3) If n = 2 then : V* £ [0,T] X(j(t)) and Y(j(t)) are independant. If n = 3 then : V* £ [0,T] Xfr(t)) £ Vect {adkX.Y(j(t)) / k = 0...n-3}. In these conditions 7 is of corank 1, and moreover we get normal forms which allow to express easily the differential operator representating the intrinsic second-order derivative and make easier the computation of conjugate times (see 7 ) . We first investigate the time-optimal problem, and then the problem of minimizing some cost. 2.1
Time optimality
Definition 2.1. • The trajectory 7 is said C°-time-minimal on [0,T] if there exists a C°-neighborhood of 7 such that T is the minimal time to
444
steer 7(0) to 7(T) among the solutions of the system (3) that are entirely contained in this neighborhood. • Recall that 7 is associated to the control u = 0. Let 6 > 0. The trajectory 7 is said L°°-time-minimal on [0,T] if there exists a neighborhood of 0 in L°°([0,T + 5]) such that T is the minimal time to steer 7(0) to j{T) among trajectories associated to controls of this neighborhood. Obviously if 7 is C°-time-minimal then it is L°°-time-minimal. We have to distinguish between hyperbolic and exceptional cases. The following results generalize those of 7 which concern an afnne system (3) without any constraint on the control. Hyperbolic case Lemma 2.1. 7 Suppose that 7 is hyperbolic and n > 2. Then the system (3) is in a C°-neighborhood of 7 feedback-equivalent to :
i=2
i,j=2
dx„
where an,n(t) < 0 on [0,T] and R can be neglected in our work. In these conditions, the controllable part of the system is (X2, • • • , xn), the singular reference trajectory is j(t) = (t,0,... , 0), and the intrinsic secondorder derivative d 2 £'J r (0)/ Ker dET(o) along 7 is identified to : r
/ ^0
V
ay (*)&(*)&(*) dt , where
i,j=2 ,rr,
&2 = £3, • • • ! £n-l = £n, in = V Set y = £2- Then it can be written as
QT/G>
i-T
Qr(y) = / qr(y)dt Jo
where : n—2
and
qT(y) = V ] •• n i,j=0
hjy(l)y
with bi-2,j-2 = ^iiyiu-, and where G is the following space corresponding to the kernel of the first derivative : G = {y I 2/ ( 2 ( "" 2 ) ) € L 2 ([0,T]), 2/^(0) = y&(T) = 0, t = 0 . . .n - 2} Integrating by parts we get :
445
Lemma 2.2 ( 7 ) . The quadratic form QT is represented on G by the operator DT SO that : QT(V) =
(DTy,y)L*
where ( , )x,2 is the usual scalar product in L2([0,T]), is :
i=0
"
and the operator DT
i,j=0
Our aim is to study the sign of QT, thus we are lead to make a spectral analysis of DT- Unfortunately the spectrum of DT on G is empty. Hence we have to enlarge this space so that the spectrum is not trivial and that the representation lemma 2.2 is still valid. That's why we set : F = {y I y{n~2)
G L2([0, T}), j,W( 0 ) = t/«(T) = 0, i = 0 . . . n - 3}
Definition 2.2. We call T a conjugate time ofQ along 7 if there exists y £ F such that 2/(2(n-2)) e L2([0,T]) and DTy = 0. Lemma 2.3. For any f 6 L2([0,T]), ifT is not a conjugate time, there exists y E F unique such that yi2(-n~2^ e L 2 ([0,T]) and DTy = f. Let L denote the operator f *-> y considered as an operator from L2([0,T]) into L2([0,T]) ; it is selfadjoint and compact. Let tc be,the first conjugate time of the operator D. It is known (see for instance 7 , 18 , 4 ) that tc > 0 or tc = +00. We have the following result : Theorem 2.4. The trajectory 7 is C°-time-minimal if and only if T < tc. Moreover 7 is not L°°-time-minimal ifT>tc. The shape of accessibility sets at time T is represented on fig. 1. Remark 2.1. In dimension 2, the operator D is equal to bold, and thus tc — +00 (provided assumptions (HQ — H3) are fulfilled on K + ^, i.e. 7 is C°-time-minimal on R + . E x c e p t i o n a l case Lemma 2.5. 7 Consider the affine system q — X + wY, q(0) = 0 under the assumptions (HQ — Hz), and suppose 7 is exceptional. Then in a C°neighborhood of 7 the system (X, Y) is feedback-equivalent to :
^
8
d
"
d (4)
d
A = dXn-x where a„_ l j n _i(t) > 0 on [0,T] and R can be neglected in our work.
446 , ,3Tj
:\
xx
\JT
Acci(T)
T
T>tc Figure 1. Hyperbolic case
Set xi = i+f. The controllable part of the system is (£, x2, •. • , xn-i), the reference singular trajectory is j(t) = (t,Q,... ,0), and the intrinsic secondorder derivative cPET(0)KeT dET{o) along 7 is identified to : X n—1
L
y ^ a,ij(t)£i(t)£j(t)
dt , where
»,j=2
6 = 6 , - - - ,6i-2 = 6»-l,£n-l = «,
and
&(0) = &(T) = 0, i = 1 . . . , 71 - 1
Contrarily to the hyperbolic case where only one differential operator is in a natural way associated to the intrinsic second-order derivative, here in the exceptional case we get two natural operators in a natural way : 1. If £ = x\ — t, it can be written as Q/G, where : n-2
Q(0 = [ q(Z)dt and q(0 = £
U) b^P
with bi-ij-i = aii 2 a , i , and where G is the following space corresponding to the kernel of the first derivative : G = {£ I £<2(»-2» eL 2 ([0,T]), £ ( i ) ( 0 ) = £ ( i ) ( T ) = 0, i = 0 . . . n - 2 } Let D be the operator representing Q. We have :
447
where ra-2
,,•
ci
n—2
2. It can be expressed in function of x2 as Qi/o1» where : Qi{x2)=
l
qi(x2)dt
and
gi(ar2) = ^ Z i j + i . j + i a : ^ 1 ^
and where Gi is the space corresponding to the kernel of the first derivative : Gi = {x2 / 4 2 < " " 3 ) ) £L2([0,T}),
x{2i)(0)=xi2i)(T)=0,
t = 0...n-3,
rT
and / x2 dt = 0} Jo Let Z?i be the operator representing Q\. We have : Qi{x2)
= (x 2 ,-Dia;2)L2
where
2=0
W
J,j=0
Note that Q(£) = Qi(£) and D = — — -Di —. As previously the spectral study at at of these operators has to be made on larger spaces : • F= { £ / £ ( n - 2 ) € L2([0,T}), the operator D.
e w ( 0 ) = ^(T)
= 0, i = 0 . . . n - 3 } f o r
. Fi = {x2 I 4 " - 3 ) G L 2 ([0,T]), 4 ° ( 0 ) = 4\T) = 0, t = 0 . . .n - 4} for £>i if n > 4 (if n = 3, no condition is imposed). The following lemma is an improvement of 7 , where only a non strict inequality is proved : Lemma 2.6. Let tc (resp. tcc) denote the first conjugate time of Q on F (resp. Qi on Fi). Then : 0 < tcc < tc. We have the following result : Theorem 2.7. The trajectory 7 is C°-time-minimal if and only if T < tcc. Moreover 7 is not L°° -time-minimal ifT> tcc.
448
Figure 2. Exceptional case
The shape of accessibility sets in function of T is represented on fig. 2. Remark 2.2. If n = 3, we have tcc — +oo provided assumptions (Ho — H3) are fulfilled on R + . Hence in this case 7 is C°-time-minimal on E, + . Remark 2.3. In both hyperbolic and exceptional cases, the notion of conjugate time and the optimality of 7 do not depend on the constraint on the control. It comes from the fact that singular reference control belongs to the interior of the domain of constraints. 2.2
Optimality for some cost
Let us now investigate the problem of minimizing some cost C(T,u), also denoted by CT(U), where C is a smooth function satisfying the following additional assumption along the reference singular trajectory 7 : {Hi) VT
rank {dET(0),dCT(0))
= n
i.e. the singularity of the end-point mapping of the extended system has codimension 1, and in particular the cost is independant from the end-point mapping along 7. We investigate several optimization problems : 1. final time not fixed : the aim is to steer the system from XQ to xi in some time T (not preassigned) and minimizing the cost C. 2. final time fixed : let T > 0 ; the aim is to steer the system from x0 to x\ in time T and minimizing the cost CTFinal time not fixed.
449 Definition 2.3. • The trajectory 7 is said to be C°-cost-minimal on [0,T] if there exists a C°-neighborhood ofj such that for any trajectory q contained in this neighborhood, with g(0) = 7(0) and q(t) = 7(T), we have : C(t,v) > C(T,0), where v is the control associated to q. • Let 6 > 0. The trajectory 7 is said to be L°° -cost-minimal on [0, T] if there exists a neighborhood ofO in L°°([0,T-|-<5]) such that, for any trajectory q associated to a control v of this neighborhood, with q(0) = 7(0) and q(t) = 7(T), we have : C(t,v) > C(T,0). Obviously the C° -cost-minimality implies the L°° -cost-minimality. We have the following result (compare with 5 ) : Theorem 2.8. 1. If 7 is hyperbolic, 7 is never L°°-cost-minimal. 2. If 7 is exceptional, then 7 is C°-cost-minimal if and only if it is C°time-minimal. Actually, 7 is C°-cost-minimal if T < tcc, and is not L°°-cost-minimal ifT> tcc. Hence in the exceptional case, both problems of cost-minimization and time-minimization are equivalent. Final time fixed Definition 2.4. • The trajectory 7 is said to be C°-cost-minimal on [0,T] if there exists a C°-neighborhood ofj such that for any trajectory q contained in this neighborhood, with q(0) = 7(0) and q(T) = 7(T), we have : CT(V) > CT(0), where v is the control associated to q. ~» The trajectory 7 is said to be L°°-cost-minimal on [0,T] if there exists a neighborhood ofO in L°°([0, T]) such that, for any trajectory q associated to a control v of this neighborhood, with q(0) = 7(0) and q(T) = 7(T), we have : CT(V) > C T ( 0 ) . We have the following : Theorem 2.9. 1. If 7 is hyperbolic, then 7 is C°-cost-minimal if and only if it is C°-time-minimal. Actually, 7 is C°-cost-minimal ifT< tc, and is not L°°-cost-minimal if T > tc, where tc denotes the first conjugate time of 7 (see theorem 2-4). 2. If 7 is exceptional, then 7 is C°-cost-minimal if and only if T < tc. Moreover, 7 is not L°°-cost-minimal if T > tc (whereas 7 is C° -timeminimal if and only if T < tcc), where tcc and tc denote the two types of first conjugate times off (see lemma 2.6). Hence in the hyperbolic case, the times at which 7 ceases to be minimizing are the same in both time-optimal and cost-optimal problems. At the contrary in the exceptional case they are different : 7 ceases to be C°-time-optimal before it ceases to be C°-cost-optimal (since tcc < tc, see lemma 2.6).
450
2.3
Application to the sub-Riemannian case
Consider a smooth sub-Riemannian structure (M, A, g) where M is a Riemannian n-dimensional manifold, n > 3, A is a rank 2 distribution on M, and g is a metric on A. Let qo £ M ; our point of view is local and we can assume that M = R n and go = 0. Suppose there exists a smooth injective abnormal (or singular) trajectory 7 passing through 0. Up to changing coordinates and reparametrizing we can assume that : •
7 (t)
= (i,0,...,0),
• A = Span {X, Y} where X, Y are g-orthonormal, • 7 is the integral curve of X passing through 0. Under these assumptions, the sub-Riemannian problem is equivalent to the time-optimal problem for the system with constraint : q = vX + uY, o(0) = 0 v2+u2
(7) {>
sys(8)
where the control w satisfies a constraint of the form : \w\ < rj. In order to investigate the optimality of the trajectory 7 for the subRiemannian system (7), we compare this system with its associated affine system (8). The fact that the optimality of 7 for the affine system does not depend on the constraint is crucial. We obtain the following result : Theorem 2.10. Suppose that assumptions (H0 — Hz) are fulfilled along 7 for the system (X, Y). Then 7 is C°-optimal for the sub-Riemannian system (7) if and only if it is C°-time-minimal for its associated affine system (8). Moreover 7 is exceptional for this affine system ; actually 7 is C°-optimal if T < tcc and is not L°° -optimal ifT> tcc. In particular conjugate times are the same along 7 for both systems. Therefore the whole formalism that was introduced for affine systems (the differential operator Di) is still valid in sub-Riemannian geometry. Hence the conjugate time of the sub-Riemannian problem can be computed using an algorithm. This result makes a link between works of 7 and 3 , 4 . Example 2.1. The Martinet case (see section 3.2) is in dimension 3, hence tcc = +00 (see remark 2.2). The abnormal trajectory is optimal on ]R + .
451
Remark 2.4. As proved in 2 the C°'-optimality is in sub-Riemannian geometry equivalent to the optimality in the sense of L2 on controls. Remark 2.5. IfT is small enough (depending on the choice of the Riemannian structure, and lower than tcc), then as first noted by 3 7 is moreover globally optimal among all sub-Riemannian trajectories steering 0 to 7(T). Remark 2.6. It should be noted that the loss of optimality is in L°°. Hence controls L2-close to the reference abnormal control have no influence on the optimality of the abnormal trajectory (see splitting-up in sectors, section 3.2). 3
Asymptotics of the accessibility sets
In this section we describe very precisely the boundary of accessibility sets for a single-input affine system near a reference singular trajectory. These boundaries are the level sets of the value function associated to the timeoptimal problem. Then we apply our results to the sub-Riemannian case of rank 2, where we describe precisely the contact of the sphere with the abnormal direction. As a consequence we obtain a splitting-up of the sphere into two sectors near the abnormal minimizer. 3.1
Single-input affine control systems
Hyperbolic case In this case the shape of the accessibility set depends on the constraint, as shown in the following example : x = 1 + y2 y=u The accessibility set Acd1^)
, . . . where \u\ < n ~
is represented on fig. 3. It is easy to state that :
• Acc(T) = {(T,0)} U {(x,y) € R 2 / x > T} • In a neighborhood of x = T, y = 0, the boundary of Accv(T) is given by the curve : x = T + ^-\y\3. Exceptional case Contrarily to the previous case, in the exceptional case the boundary of Accn(T) does not depend on the constraint near 7(T). Precisely we have the following : Theorem 3.1. Consider the affine system with constraint (3) and suppose that assumptions (H0 — H3) are fulfilled along the reference singular trajectory 7 on [0,T]. We suppose that 7 is exceptional. Let tcc and tc denote the first conjugate times associated to 7, see Exceptional case. Then :
452
y
* = T + h\y\3 ' Acc(Tj ' • ;
T.
T
v =-
+00
u\
+00
Figure 3. Hyperbolic case
1. There exist coordinates (x\,... ,xn) locally along 7 such that in these coordinates : 7(2) = (t, 0 , . . . ,0), and the first Pontryagin's cone along 7 is: K(t)=Vect {£;,..., e£_}h. 2. If T is small enough then for any point (xi,... ,xn) of Ace"(T)\{(T,0,... ,0)} close to j(T) we have : xn > 0 (see fig.
4). 3. If T < tc, then in the plane (xi,xn), near the point (T,0), the boundary of Accn(T) does not depend on n, is a curve of class C2 tangent to the singular direction, and its first term is :
xn = AT{Xl - Tf + o((Xl - T)2) The function T i-> AT is continuous and strictly decreasing on [0,i c [. It is positive on [0, tcc] and negative on [tcc,tc[. Moreover, if n depends on X\—T then the result is still valid providing : x\ - T = o(n) as xi ->• T. The evolution of Acc7'(T) in function of T in the plane (ari,a;n) is represented on fig. 5. The contact with the singular direction is of order 2 ; the coefficient AT describes the concavity of the curve. Beyond tc the accessibility set is open. Remark 3.1. The coefficient AT can be computed in the following way (see 7 20 , ). Let D denote the operator (5) introduced in Exceptional case and Q the quadratic form associated to D, representing the intrinsic second-order derivative along 7. There exists a function J of class C 2 '™ -2 ' on [0,T] such that DJ = 0 and satisfying the limit conditions : Vke{0,...,n-3}
jW(0)=0, j(fc>(T)=^
453
Figure 4. Shape of Aa-n{T),
T small
Figure 5.
Then : AT
3.2
= Q(J)
(9)
Application to the sub-Riemannian case
Asymptotics of the sub-Riemannian sphere along an abnormal direction Let us consider the framework introduced in section 2.3, and let us now define a notion of constrained accessibility set : Definition 3.1. Let 0 < a < 1. We denote by AcCgR(T) the accessibility set at time T for the sub-Riemannian system (7) with the additional constraint
454
on the control : t;2+w2
l - a < u < l ,
|u| < a
(see fig. 6)
Figure 6.
Note that controls steering 0 to points of AccgR(T) are in a L°°neighborhood of the abnormal reference control v = 1, u = 0. Consider the associated affine system : x = X{x)+wY(w)
(10)
and denote by Acc\(T) the accessibility set at time T for this system with the constraint : \w\ < n. The reference singular trajectory 7 corresponds to w = 0, and is exceptional for this affine system. Theorem 3.2. Suppose assumptions (Ho — Hz) are fulfilled along the reference abnormal trajectory 7 for the system (X, Y). Let tcc and tc denote the first conjugate times of 7 for the associated affine system. Let a £]0,1[. Then : 1. There exist coordinates (xi,... ,xn) locally along 7 such that in these coordinates : 7(2) = (t, 0 , . . . ,0), and the first Pontryagin's cone along 7 is :K(t)=Vect {&,..., jJL-}^ 2. If T is small enough then for any point (xi,... to 7(T) we have xn > 0 (see fig. 4)-
,xn)
of AccgR(T)
close
3. IfT< tcc, then in the plane (xi,xn), close to the point (T,0), the boundary of Acdgn(T) does not depend on a, is a curve of class C2 outside (T,0), tangent to the abnormal direction, whose first term is : • if Xi < T then xn = 0. • ifxi>T
then xn = AT{xx
- T)2 + o((xi -
T)2).
455
The function T i-> AT is the same as in theorem 3.1. Figure 7 represents the evolution of AccgR(T) in function of T in the plane (xi,xn). It is open in a neighborhood of j(T) if T > tcc, contrarily to the affine case where it becomes open only beyond tc.
Xi
Figure 7.
Remark 3.2. To compare the system (7) with its associated affine system (10) we need the following reparametrizing : ds ~d~t
= v
which only holds if v does not vanish. This condition is satisfied when the control (v, u) is in a L°° -neighborhood of the abnormal reference control (1,0), for in this case v is close to 1 in the L°° sense. Hence using this method it is only possible to describe a constrained accessibility set, i.e. in a L°°neighborhood of the reference abnormal control. Splitting-up of the sphere near an abnormal direction Let T > 0 small enough so that properties 2 and 3 of theorem 3.2 are satisfied. In particular the reference abnormal trajectory 7 is minimizing. Then A = j(T) belongs to the sub-Riemannian sphere 5(0, T) with radius T. If controls steering 0 to points of the boundary of AcCgR(T) in xn > 0 (that are L°°-optimal) are actually globally optimal, then this boundary is included in the sphere 5(0, T). In this case the sphere splits into two sectors near j(T), bordered by the first Pontryagin's cone xn = 0 : • sector xn > 0 corresponding to the previous description,
456
• sector xn < 0. According to the previous results, final points at time T associated to controls which are L°° -close to the reference abnormal control are in the first sector : xn > 0. Obviously due to controllability of the system the sector xn < 0 is accessible. In fact a basic calculus shows : Lemma 3.3. For any neighborhood V of the point A in R n we have :
S(o,T)nvn(z„
[^-sector/-^ xn < 0
^ '
Figure 8.
Typical example : the Martinet case. Consider the two following vector fields in R 3 : dx
2 dz
dy
457
and endow the distribution spanned by these vector fields with an analytic metric g of the type : g = adx2 + cdy2 where a = (1+ay)2 and c = (l+ffx+jy)2. The abnormal reference control for the sub-Riemannian system x = vX(x)+uY(x) with constraint v2 +u2 < 1 is v = l , u = 0, and corresponds to the trajectory 7 : x(t) = t, y(t) = z(t) — 0. We have, see 9 : Lemma 3.5. Assumptions (H0—H3) are fulfilled alongj if and only if a ^ 0. In this case branches 1 et 2 (see fig. 8 with x\ = x,xn = z) have the following contacts with the abnormal direction : • branch 1 : x > T, z= ^?{x
- T)2 + o((x - T)2)
• branch 2 .- x < T, z ~ i ( l + 0(T))(x - T ) 3 Remark 3.3. The coefficient AT of the first branch can be computed directly or using formula (9) (see remark 3.1). As we are in dimension 3, results of theorem 3.2 are in fact available on R + , see remark 2.2. The L°°-sector is z > 0 and corresponds to controls that are globally minimizing. References 1. A. Agrachev, Quadratic mappings in geometric control theory, J. Soviet Math. 5 1 , 2667-2734 (1990). 2. A. Agrachev, Any smooth simple ifMocal length minimizer in the Carnot-Caratheodory space is a C°-local length minimizer, Preprint Labo. de Topologie, Dijon (1996). 3. A. Agrachev, A. V. Sarychev, Strong minimality of abnormal geodesies for 2-distributions, J. of Dynamical and Control Systems 1, 2 (1995). 4. A. Agrachev, A. V. Sarychev, Abnormal sub-Riemannian geodesies : Morse index and rigidity, Annales de 1'IHP 13, 635-690 (1996). 5. A. Agrachev, A. V. Sarychev, On abnormal extremals for Lagrange variational problems, Journal of Mathematical Systems, Estimation, and Control 8, 87-118 (1998). 6. B. Bonnard, M. Chyba, The role of singular trajectories in control theory, Math. Monograph, Springer-Verlag, to be published. 7. B. Bonnard, I. Kupka, Theorie des singularites de l'application entree/sortie et optimalite des trajectoires singulieres dans le probleme du temps minimal, Forum Math. 5, 111-159 (1993).
458
8. B. Bonnard, I. Kupka, Generic properties of singular trajectories, Annales de 1'IHP, Analyse non lineaire 14, 167-186 (1997). 9. B. Bonnard, E. Trelat, On the role of abnormal minimizers in SRgeometry, to appear in Annales de la Faculte des Sciences de Toulouse. 10. B. Bonnard, E. Trelat, Stratification du secteur anormal dans la sphere de Martinet de petit rayon, Lecture Notes in Mathematics, A. Isidori, F. Lamnabhi Lagarrigue, W. Respondek (Eds), Nonlinear Control in the Year 2000. 11. R. L. Bryant, L. Hsu, Rigidity of integral curves of rank 2 distributions, Invent, math. 114, 435-461 (1993). 12. M. R. Hestenes, Applications of the theory of quadratic forms in Hilbert space to the calculus of variations, Pacific J. Math. 1, 525-581 (1951). 13. E. B. Lee, L. Markus, Foundations of optimal control theory, John Wiley, New York (1967). 14. C. Lesiak, A. J. Krener, The existence and Uniqueness of Volterra Series for Nonlinear Systems, IEEE Transactions on Automatic Control 23, 6 (1978). 15. W. S. Liu, H. J. Sussmann, Shortest paths for sub-Riemannian metrics of rank two distributions, Memoirs AMS 118, 564 (1995). 16. R. Montgomery, Abnormal minimizers, SIAM J. on Control and Opt. 32, 6 (1997). 17. M. A. Naimark, Linear differential operators, Frederick U. Pub. Co (1967). 18. A. V. Sarychev, The index of the second variation of a control system, Math. USSR Sbornik 41, 3 (1982). 19. E. Trelat, Some properties of the value function and its level sets for affine control systems with quadratic cost, J. of Dynamical and Control Systems 6, 4 (2000). 20. E. Trelat, Optimality of singular trajectories and asymptotics of accessibility sets, ESAIM: COCV 6, 387-414 (2001). 21. Zhong Ge, Horizontal path space and Carnot-Caratheodory metric, Pacific J. Math. 161, 255-286 (1993).
459 CONTROL T H E O R Y A N D H O L O M O R P H I C DIFFEOMORPHISMS DROR VAROLIN Department of Mathematics University of Michigan 48109-1109 USA E-mail: varolin@@math.lsa.umich.edu To Velimir Jurdjevic on the occasion of his sixtieth birthday.
1
Introduction
This paper surveys some results about holomorphic diffeomorphisms on very symmetric complex manifolds. However, the true underlying objective is to show how this can be achieved using certain ideas which lie at the foundations of control theory. By symmetric complex manifolds we mean those whose holomorphic diffeomorphism groups are infinite dimensional and, even more, as large as they can possibly be. As a complex analyst might expect, the manifolds of greatest interest are Stein. However, much stronger hypotheses than these will be needed. In fact, having so many holomorphic diffeomorphisms is the nongeneric situation in complex analysis. This will be explained in more detail below. In what follows, a central role will be played by complete holomorphic vector fields. While this is to be expected, a new twist in complex analysis is that there are always very few complete holomorphic vector fields. (Contrast this with situations where nontrivial compactly supported functions exist.) However, there is a trick, going back to Euler, which allows one to approximate by diffeomorphisms the flow of a holomorphic vector field satisfying certain conditions. One is then lead naturally to the so-called Density Property, which we introduced in ir. One of the simplest ways to exploit the ideas alluded to in the previous paragraph is to study the possible jets of holomorphic diffeomorphisms using
460
these ideas. Here again, ideas from control theory make an appearance, since as we shall see, one wants to look at the orbits of the holomorphic diffeomorphism group in the jet space. With good control over the jets of holomorphic diffeomorphisms, classical techniques from complex dynamics can be used to reveal a lot about the underlying structure of these complex manifolds with very large holomorphic diffeomorphism groups. The organization of the paper is as follows. We begin in section 2 by explaining the geometry behind the reasons one expects few complex manifolds to have very large holomorphic diffeomorphism groups. The basic obstruction is a generalization of the classical phenomenon of "Schwarz's lemma", i.e., the nondegeneracy of the so called Kobayashi pseudometric. We then go on, in section 3, to discuss holomorphic vector fields and Euler's method, which lead naturally to the definition of the Density Property, a condition which rigorizes the notion of "largest possible" holomorphic diffeomorphism group. Finally, in section 4 we state results on the jets of holomorphic diffeomorphisms and numerous corollaries which are proved via techniques from complex dynamics. These corollaries reveal much about the underlying complex structure of these manifolds. As we shall see, although the situation where there are largest possible holomorphic diffeomorphism groups resembles in many ways the situation of (real) smooth manifolds, there still arise some rigidities, which again distinguish the complex case. Since the paper is primarily a survey, proofs are almost always omitted. There are three exceptions. The first is our proof of the Andersen-Lempert theorem (theorem 3 below), which appears in print here for the first time. The second is a sketch of the proof of the general jet theorem (theorem 11 below), which we have chosen to include because it is fundamentally control theoretic. Finally, we prove one of the corollaries, because it gives a sense of the role of dynamical systems in the application of the jet theorems. (The proof we include is based on an old idea, often attributed to Konig, but the complete proof is due to Rosay-Rudin 13 .) One glaring omission, due only to lack of space, is that of our general notion of shears 19 . This subject is based on the rigid existence criterion for functions which multiply complete holomorphic vector fields to complete holomorphic vector fields, and involves first integrals more directly than the real theory; it would have been nice to include here. 2
Complex analytic considerations
In the complex category it is generally more rare to have diffeomorphisms. As an illustrative example, let's look at the unit disk D. First, for any complex
461
number a with \a\ < 1, the map p a : £ *-> (a - C)/(l — " 0 is a holomorphic involution on the unit disc which carries 0 to a. It follows that the unit disc is a homogeneous space, and using the Schwarz lemma one can easily show that the isotropy group of a point is S1. In fact, it is possible, without much difficulty, to see that the Schwarz lemma says essentially that every holomorphic diffeomorphism preserves the Poincare line element ds = \dz\/(l — \z\2). There is an analogous construction in a more general context, which we now describe. Let M be a complex manifold, and let v € TMX. We define a (generally nonsmooth) Finsler pseudometric KM on M, called the Kobayashi-Royden pseudometric, as follows. With D r := {|£| < r} C C, let KM(v)
:= inf j - ; 3 holo/ : D r -> M s.t. /(0) = x and /'(0) = v
Vaguely, the length of a vector v is one over the radius of the largest possible disc which can be mapped into M such that the origin passes through x with velocity v. We leave it as an exercise for the interested reader to apply the Schwarz lemma to show that when M is the unit disc, the Finsler metric so obtained is the Poincare line element. A nontrivial result, due to Royden 14 , is that KM is upper semicontinuous. One can then define a pseudometric PM by pM{a,b) = i n f j / KM{j'(t))dt
; 7QO, 1],0,1) -> (M,a,b)
smooth
If pM is actually a metric, one says M is Kobayashi hyperbolic. Note that the metric PM is invariant under the action of Diffo(M). It then follows from general geometric principles that Diffo (M) is a locally compact topological group (see, e.g., 1 1 ) . This is the main step in proving that Diffo(M) is actually a Lie group. It is a strongly believed piece of folklore that most complex manifolds have Kobayashi pseudometrics with some sort of nondegeneracy. The trouble with proving something like that is that we don't have any idea what most complex manifolds means. However, there are various special situations where the philosophy has been verified. For example, for domains in C there are quite sophisticated results. For compact Riemann surfaces, the uniformization theorem realizes our claim, and for complex surfaces one can witness the Kodaira classification. Recent work of Siu, Demailly and others even states that the generic smooth algebraic variety in CP™ of sufficiently high degree (depending on n) is hyperbolic.
462
3
Holomorphic vector fields
In this section we want to study the Lie algebra of Diffo (M). In the hyperbolic case, since the latter is a Lie group, we will have the usual Lie theoretic picture provided by the exponential map, but this will not be so in general. For details on the geometry of holomorphic vector fields, we recommend the appropriate section of chapter 0 in 9 . 3.1
Approximating solutions of ODE
The approximation technique we will describe now is due to Euler. Suppose one is given a family Ft : M —> M of (holomorphic) maps on a (complex) manifold M which is C1 in both variables at once, such that F0 = idM, and let X := , dt =0 (In practice, one is given X, and constructs the family Ft.) This datum says that, for a very short time, Ft is a good approximation to the flow of X. Euler's idea was to use this approximation repeatedly for shorter and shorter times. He proved that the limit converged to the flow. Precisely, the result is as follows. Theorem 3.1. Let X and Ft be as above, and denote the flow of X by plx. Then Urn iv—t-oo
FfflW^p'xix), '
with the limit holding locally uniformly on the subset of E x M where either side is defined (the so-called fundamental domain of X). For a proof, one can see 1 where the family Ft is called an algorithm for X. Two particular algorithms which are of interest to us arise from the following differentiation formulas. Let X and Y be two vector fields. Then ^ p ' x o p ^ I + F, and ^ p / o p / o p ^ o p ^ = [jr,y]. at t_0 at t = 0 + These two algorithms, together with Euler's approximation theorem, lead immediately to the following, key observation. Observation / / a vector field X lies in a Lie algebra g of vector fields which is generated by the complete vector fields on a manifold M, then the time T map of this vector field, if and where it is defined, can be approximated, locally uniformly on its domain of definition, by diffeomorphisms which are time one maps of the complete vector fields in g.
463
We should point out that, while approximating the flows of real vector fields on compact sets by diffeomorphisms is easy (due to the existence of cutoff functions), this is not so for vector fields in Lie algebras which do not admit multiplication by compactly supported functions. 3.2
The density property
In light of what has been developed above, the next definition is quite natural. Definition 3.2. A Lie algebra g C XQ(M) of holomorphic vector fields on a complex manifold M is said to have the density property if the Lie subalgebra Cg C g generated by complete vector fields is dense in g (in the locally uniform topology). If XQ{M) has the density property, we shall say that M has the density property. Lastly, if (M, ui) is a calibrated complex manifold (meaning u is a nondegenerate holomorphic top form, or holomorphic volume element, on M, and if Xo{M,w) := {X € XQ(M) ; divu(X) = 0} has the density property, we say that (M, CJ) has the volume density property. Of course, for such a definition to be meaningful, one has to have examples. While compact manifolds give examples, they are less interesting, due to the fact that they support only finite dimensional Lie algebras of holomorphic vector fields. A more interesting class to consider is that of Stein manifolds. A complex manifold is called Stein if it can be embedded as a closed complex submanifold of C^ for some N. This condition guarantees existence of a lot of holomorphic functions and vector fields. Most Stein manifolds do not have the density property. As an example, note that all open Riemann surfaces have infinite dimensional Lie algberas of holomorphic vector fields, but only finite dimensional diffeomorphism groups. (In fact, most of them only have finite diffeomorphism groups.) The first results in the study of the density property predate the definitions, and were, in fact, the inspiration for the theory. These are the theorems of Andersen 3 and Andersen-Lempert 4 . Theorem 3.3. Let n > 2 and ui := dz\ A ... A dzn. Then 1. (C™,w) has the volume density property z, and 2. C™ has the density property 4 . Proof. We restrict to the case n — 2, since the higher dimensional case can be obtained by an obvious adaptation of this proof. 1. We will prove that given any polynomial divergence zero vector field, we can write it as a sum of Lie brackets of complete divergence zero vector fields. By linearity, it suffices to consider a vector field of the form
X(zltZ2)=z^dXl+g(z)dXa.
464
Consider the vector field Y(z) =
7nF)
-.rn+lg
Then X(z) — Y(z) = h(z)dZ2, and since X and Y have zero divergence, so does X — Y. Thus h is independent of z2, and hence X — Y \& complete. It follows that X — Y + (X — Y) is completely generated, which proves 1. 2. It suffices to show that if P is any polynomial vector field, then there exists a completely generated vector field Q such that div Q = div P, for then P = Q + (P — Q) is completely generated by 1. Moreover, it suffices by linearity to show that there is a completely generated vector field whose divergence is z™2?- To this end, notice that div
+1 $dZl,-!—-zr z2dz m + 1
^2 = z\1 z,
i
which completes the proof. • A similar method, together with some basic complex analysis, can be used to prove the following result. Theorem 3.4. 17 Let G be a complex Lie group. Then 1. G x C has the volume density property. 2. If G has the volume density property, then so does G x C*. 3. If G is Stein and nontrivial, then G x C has the density property. This theorem gives numerous examples of Stein manifolds having the density property. We note the following. • If we consider the case [G, UJ) = (C™, dz\ A... A dzn), this result recovers the Andersen-Lempert theorem. • We can apply 2 to (G,u) = ((C*)*, (zi •... • Zk)~ldz\ A ... A dzk) inductively (it is easy for k = 1). It follows that (C*)fc has the volume density property for k>l. • Every simply connected Lie group is Stein. In fact, this is the case for most Lie groups. For example, every semisimple Lie group is Stein. Theorem 3.5. 15 Every semisimple Lie group has the density property. Finally, a structure result. Theorem 3.6. 17 Let M and N be Stein manifolds. 1. If M and N have the density property, then so does M x N. 2. If M has the density property, then so do M x C and M x C * . 3. If (M x C, w A dz) has the volume density property, then M x C has the density property. It is clear that in order to have complete holomorphic vector fields, a lot of symmetry is needed. One might wonder, based on this and the above results,
465 whether the only possible examples are groups. This turns out not to be the case. In order to give more examples, we need the following definition. Definition 3.7. 19 An EMV manifold is a pair (M,CJ), where M is a complex manifold and u is a holomorphic volume element on M, with the property that for any V 6 XQ{M), compact K <§ M and e > 0 there are functions / i , . . . , / r € O(K) and divergence zero completely generated vector fields X\,...,Xr satisfying <e Examples (See
19
K
for details.)
1. Every complex Lie group G is EMV with respect to an invariant volume. 2. Every Stein complex homogeneous space is EMV, again with respect to an invariant volume. 3. M := {(x, y) € C2 | xy / 1} together with the volume form (xy — \)~xdx/\ dy is EMV, since the complete vector fields {xy — l)dx and (xy — \)dy parallelize the tangent bundle. 4. The space E3:={(o,6,c,rf)GC4
\a2d-bc=l},
which is a smooth subvariety of C4 and is also a branched double cover of SL(2, €) is EMV with respect to a certain volume element. Theorem 3.8. 19 / / (M,u) is an EMV manifold, then (M x C , w A dz) has the volume density property. It follows from theorem 3.6 above that if M is Stein, then M x C has the density property. This theorem gives a lot of "non-group" examples of the density property. There are also the following results. Theorem 3.9. 19 The space in example 3 has the volume density property. Theorem 3.10. 15 ' 16 The complex quadrics Qn:=\(x0,...,xn)eCn+1
£*? = !
have the density and volume density property, the latter with an SO(n + 1,C) left-invariant volume element. Finally, let us conclude this section by mentioning that there are some results about the density property for more general Lie algebras. Due to lack of space, we do not mention these here, but refer the interested reader to 17 .
466
4
Jets
4-1
The jet theorems
Behind all known applications is a key theorem which allows one to realize jets as those of holomorphic diffeomorphisms. To state this theorem, we need to define certain spaces of jets which, a priori, satisfy obvious conditions for being jets of holomorphic diffeomorphisms. Let M be a complex manifold. To recall, two germs f,g £ 0(M,M)XtV (the subscripts indicate that /(re) = g(x) = y) are equivalent if they have the same Taylor expansion to order k, and a fc-jet is simply an equivalence class. Let Jk(M)XiV denote the space of A;-jets of germs from re to y, and write
Jk(M)x,* := \J Jk(M)x,y
and Jk(M) := (J Jk(M)x,*.
We note that both of these spaces are actually manifolds. Given a map / from a neighborhood U of re in M into M, we denote by jx(f) the induced jet in Jk{M)x,, and by jk(f) : U -> Jk(M) the map jk(f)(x) := jk(f). Definition Let M be a complex manifold. 1. Let J°(M)2y := J°(M)Xty, and for k > 1 let Jk{M)*y be the set of all k-jets [/] with the property that Df(x) : TxM —> TyM is an isomorphism. 2. Let UJ be a holomorphic volume element on M. Then J0(M,w)x%y := J°(M)x^y and for k > 1 let Jk(M,u))x^y be the set of all k-jets [/] such that the uJ-Jacobian determinant Jf of f (defined by f*cj = J/(j) coincides to order k with the constant function p(rc) = 1. The jets in Jk(M)*y and Jk(M,cj)x,y might be thought of as jets of maps which satisfy minimal necessary conditions for being holomorphic diffeomorphisms, namely, one point conditions on derivatives. Let g C XQ{M) be a Lie algebra of holomorphic vector fields. Definition The orbit of g through p G M, denoted lZg(p), consists of all points q € M of the form
<7 = P x V " ° P x »
(!)
for some N £ N, X\, ...XN £ g, and ti, ...,£;v £ ^ such that (1) makes sense. Each X £ Xo{M) induces a vector field pk(X) £ Xo{Jh{M)) is defined by :
PWM)
= [PWI-
whose flow
467
Clearly pk maps complete vector fields to complete vector fields. It is not difficult to show that pk : Xo (M) ->• XQ (Jh (M)) is a Lie algebra isomorphism, and that
(pL(*))>*( y ))=M(Px)* y )Definition Let g be a Lie algebra of holomorphic vector fields on a complex manifold M, and let k > 0 be an integer. Then J*(M)x,,:=1lMB){jkx(idM)), and
Jk(M) := (J Jk(M)x,t. x£M
We note that when M is Stein, it is easy to show that 4 o ( M ) ( M k » = J*(M)*,
and
Jlo[M){M)x,.
=
Jk{M,ej)x,,.
However, this is of course false for a general complex manifold, as for example, a compact manifold would show. Finally, we let Autg (M) denote the subgroup of Diff0 (M) generated by time one maps of complete vector fields in g. The key results are now the following. Theorem 4.1. 18 Let g be a Lie algebra of holomorphic vector fields with the density property. Then for each 7 € j£(M) there exists $ € Autg(M) such that #(7)(*)=7Here and below, a and r are the source and target maps, respectively. Theorem 4.2. 18 Let M be a connected Stein manifold, and let K C M be a compact set. 1. If M has the density property and 7 6 Jh(M)x is a k-jet such that x := (7(7) and T("/) are not in the 0(M)-hull of K, then there exists $ S Diff o ( M ) such that
and such that j k ($) is as close to jk(id) as we like for all z € K. Furthermore, we can arrange that j k (<$) = j k (id) for z in some finite subset
ofK.
468
2. If (M,LJ) has the volume density property and 7 e Jk(M,cj) is a k-jet such that x := #(7) and r(7) are not in the 0(M)-hull of K, then there exists <J> £ Diffp(M) with the same properties as in 1, and such that $*u; = u. Let us give a sketch of the proof of theorem 4.1. The first step is to reduce to the case of zero jets; k = 0. To this end, note that since pk : XQ(M) -» Xo(Jg(M)) is just an invariant way of collecting X and its first k derivatives into a single object, it follows from the Cauchy inequalities that pk is continuous, and hence pk (g) has the density property if and only if g does. Consider next the map associating to each $ € Aut(M) an element $ # 6 Aut(Jk(M)) defined by $ # [ / ] = [$ o / ] . Then (Autg(M))#=AutPk(B){Jk(M)), and we are thus reduced to the case k = 0. That is to say, Theorem 4.1 follows immediately from the following theorem. T h e o r e m 4.3. If a Lie algebra g has the density property, then for allp € M, Autg(M) acts transitively on the orbit TZg(p). Now, given two points on the orbit of g, the density property easily implies (using Euler's approximation theorem) the existence of a holomorphic diffeomorphism which maps one of these points arbitrarily close to the other. Since by the orbit theorem 10 the orbit is a manifold, one can correct this approximation using an implicit function type argument. A more complete proof can be found in 18 . 4-2
Applications
Theorem 3.3 was quickly applied by many authors to the study of analytic geometry of C", n > 2. Many of the applications in that context are well documented in the surveys 7 and 12 , so we shall not discuss them here. Instead, we mention the results on more general (mostly Stein) manifolds with the density and volume density property. These results are corollaries of Theorems 4.1 and 4.2. Some of them are just generalizations of similar results in the case of C , but others are of interest only in this general context. All of the results of this section can be found in 18 . 4-3
The Fatou-Bieberbach
Phenomenon
The first consequence of Theorem 4.1 is the following. Corollary 4.4. Let M be a Stein manifold of complex dimension n with the density property. Then there is an open subset of M which is biholomorphic
tocr.
469 Proof. Fix p e M, and let F G Diffo(M) be such that F(p) = p, and that DF{p) = : A : TMP ->• TMP has eigenvalues Ai,...,A n (n := dimc(M)) satisfying |Ai| > |A2| > ... > |A n | > |Ai| 2 . Fix a holomorphic diffeomorphism X from a small neighborhood of p in M to a small neighborhood of 0 in TMP, and denote by U the basin of attraction to p by F. Then the map K :U -» TMP given by K := lim A~k o x o F(k) fc—>oo
is a well defined, injective holomorphic map on U, satisfying the functional equation K = A'1 o K o F. (The eigenvalues guarantee the convergence. See 13 , theorem 9.1.) Note that K conjugates F\U to a linear map on TMP, and that K is injective by the Hurwitz principle. It follows from the functional equation that the image of K is invariant by A~l. Since the latter is an expanding linear map, K must be surjective. This completes the proof. •
Figure 1. K conjugates F to the linear map A on the basin of attraction to p. With little more effort, one can also prove the following result. Corollary 4.5. Let M be an n dimensional Stein manifold with the density property. Then there are infinitely many disjoint domains in M which are biholomorphic toC1. One can also use the so-called "kick out" method of Dixon-Esterle 6 to prove the following result. Corollary 4.6. Let M be a Stein manifold with the density property. Then there exist proper open subsets of M which are biholomorphic to M.
470 We note that when M = C™, this corollary gives another construction of the classical Fatou-Bieberbach domains, i.e., proper open subsets of C™ which are biholomorphic to C . (This fact has been exploited in many results of analytic geometry in C 1 .) However, these corollaries show that the two methods (dynamical and kick out) might be "different". A natural question is whether every Fatou Bieberbach domain in C™ is the region of attraction of a holomorphic diffeomorphism. Corollary 4.6 suggests that the answer might not be very simple. If we consider now a calibrated Stein manifold (M,UJ) with the volume density property, one can show the following. Corollary 4.7. Let (M,ui) be a calibrated Stein manifold with the volume density property. Then there exists a proper open subsets of M which is biholomorphic to M. One can also construct nondegenerate maps of C™ into a calibrated Stein manifold (M, ui) with the volume density property. Corollary 4.8. Let (M,u>) be a calibrated Stein manifold of dimension n having the volume density property. Then there exists a map h : C 1 —> M such that h*ui is not identically zero. One can also get injective immersions of C _ 1 tangent to any given complex hyperplane in TM. Corollary 4.9. Let (M, w) be a calibrated Stein manifold of dimension n with the volume density property, and let Vp C TMP be a complex hyperplane. Then there is an injective holomorphic immersion g : C1'1 -»• M such that dg^C1'1) = Vp. Finally we have the following proposition. Proposition 4.10. Let (M,UJ) be a calibrated Stein manifold with the volume density property, and suppose there exists F € Aut(M) such that the u) Jacobian determinant JF of F has modulus different from 1 at some point p 6 M, then M has an open subset biholomorphic to C™ . The idea of the proof is to use holomorphic diffeomorphisms with jets in Jk(M,ui) to modify F so that p becomes an attracting fixed point, and then apply the same dynamical principle as above.
4-4
Completeness
of vector
fields
One of the consequences of corollaries 4.4 and 4.9 is that on a Stein manifold with the density or volume density property, all bounded plurisubharmonic functions are constant. Then the main theorem of 2 implies the following corollary. Corollary 4.11. Let M be a Stein manifold with the density or volume density property. Then every R + -complete holomorphic vector field on M is C-complete. A holomorphic vector field X is K + -complete if one can extend the flow of X to all of R + , and it is called C-complete if X and iX are complete (in the usual sense).
471 4-5
Interpolation
results
In this paragraph we note that for manifolds with the density property or volume density property, a given (proper, or closed) complex submanifold can be modified so as to interpolate any given discrete sequence. For the proof of the next result in the case M = C (which can easily be adapted to the more general case stated here) see 7 . Corollary 4.12. Let M be a Stein manifold of C-dimension n > 2 with the density or volume density property, S a Stein manifold of'C-dimension r < n, and {/ym; m > 1} C Jk(E,M) a sequence of k-jets such that {c(7m)} and {r(7m)} are discrete sequences in £ and M respectively. If £ admits a proper holomorphic embedding in M, then there exists a proper holomorphic embedding p : £ •—> M such that J*(7m)(p) =7">20
In a recent preprint , J. Winkelmann has constructed "non-tame sequences" in any Stein manifold. These can be used, together with corollary 4.12 to construct non-equivalent embeddings of a given complex manifold £ into a Stein manifold M with the density or volume density property, provided one such embedding exists. Precisely, one has the following. Corollary 4.13. Let M be a Stein manifold of' C-dimension n > 2 with the density or volume density property, and £ a Stein manifold of C-dimension r < n such that there exists a proper holomorphic embedding j : £ •—• M. Then there exists another proper holomorphic embedding j ' : £ •—• M such that for any $ 6 Aut(M), *oj(E)#/(E). In connection with the last corollary, we should mention that the analytically inequivalent embeddings constructed in the proof are all ambiently smoothly isotopic, in the sense that there exists a global smooth diffeomorphism of M which is isotopic to the identity and carries one embedding to the other. Thus the obstructions are not topological. In particular, even in the presence of extreme symmetry, complex rigidity persists. References 1. Abraham, R. Marsden, J., Foundations of Mechanics, Second edition, revised and enlarged (Benjamin/Cummings Publishing Co., Inc., Reading, Mass., 1978). 2. Ahern, P., Flores, M., Rosay, J.-P-, On R + and C complete holomorphic vector fields, Proc. Amer. Math. Soc, to appear. 3. Andersen, E., Volume-preserving automorphisms of C", Complex Variables Theory Appl., 14, 1-4 , 223-235 (1990). 4. Andersen, E., Lempert, L., On the group of holomorphic automorphisms of C \ Invent. Math.,110, 2, 371-388 (1992).
472 5. Buzzard, G., Fornaess, J.-E. An embedding of C in C2 with hyperbolic complement, Math. Ann.,306, 3, 539-546 (1996). 6. Dixon, P. G.; Esterle, J. Michael's problem and the Poincare-Fatou-Bieberbach phenomenon, Bull. Amer. Math. Soc. (N.S.), 15, 2, 127-187 (1986). 7. Forstneric, F., Holomorphic automorphisms of C " : a survey. Complex analysis and geometry , 173-199 (Trento, 1993). 8. Forstneric, F., Interpolation by holomorphic automorphisms and embeddings i n C . Preprint (1996). 9. Griffiths, P., Harris, J., Principles of Algebraic Geometry (Wiley Interscience, 1978). 10. Jurdjevic, V., Geometric control theory. Cambridge Studies in Advanced Mathematics, 52 (Cambridge University Press, Cambridge, 1997). 11. Kobayashi, S., Transformation groups in differential geometry. Ergebnisse der Mathematik und ihrer Grenzgebiete, Band 70 (Springer-Verlag, New YorkHeidelberg, 1972). 12. Rosay, J.-P., Automorphisms of C , a survey of Andersen-Lempert theory and applications. Complex geometric analysis in Pohang (1997), 131-145, Contemp. Math., 222 (Amer. Math. Soc, Providence, RI, 1999). 13. Rosay, J.P., Rudin, W. Holomorphic Maps from C" to C", Trans. AMS, 310, 47-86 (1988). 14. Royden, H., The extension of regular holomorphic maps, Proc. Amer. Math. Soc, 4 3 , 306-310 (1974). 15. Toth, A., Varolin, D., Holomorphic diffeomorphisms of complex semisimple Lie groups, Invent, math., 139, 2, 351-369 (2000). 16. Toth, A., Varolin, D., Work in progress. 17. Varolin, D., The Density Property for Complex Manifolds and Geometric Structures, to appear in J. Geom. Anal. 18. Varolin, D., The Density Property for Complex Manifolds and Geometric Structures II, to appear in Internat. J. Math. 19. Varolin, D., A general notion of shears, and applications, Michigan Math. J., 46, 3, 533-553 (1999). 20. Winkelmann, J., Large discrete sets in Stein manifolds (preprint 1998).
473
Index abnormal curve, 33 abnormal extremal, 20, 32, 143, 184, 193, 194, 198, 200, 201, 249 affine control, 217, 220, 225, 248, 249, 287, 290, 383, 441, 443, 451, 458 Agrachev, 7, 169, 180, 181, 231, 457 algebraic geometry, 4, 305, 328 Anzaldo, 7-9, 109, 183, 202 attainable set, 55, 56, 62, 231 bad brackets, 225, 228, 381, 382, 385 bang-bang, 76, 169-171, 174, 175, 178, 181, 230 Baratchart, 203, 215 Bernoulli, 7, 113-131, 133, 136, 139, 150, 152, 153, 155, 158-163 Bianchini, 5, 217, 225, 230, 231 biextremal, 409 bio-reactor, 292 Bolza, 255, 361, 367, 368, 370, 378-380 Bonnard, 8, 9, 109,116, 163, 183, 194, 233, 256, 457, 458 brachistochrone, 7, 113-120, 121-125, 127, 128, 130-133, 135-140, 144, 149, 150, 152, 153, 155, 156, 158-161 bracket generating, 185, 188 Brockett, 186, 201, 215, 290, 298, 299, 381 Busvelle, 6, 8, 233, 257, 285, 286 Caratheodory, 5
calculus of variations, 114, 117— 119, 128, 129, 132, 133, 137, 158, 163, 176, 187, 220, 255, 380 Campbell-Baker-Hausdorff, 382, 390 Carinena, 5 Cartan, 75, 78, 84-86, 88, 92, 94, 103, 110,177, 183, 184,195198, 201, 348, 354, 357, 423 Casimir functions, 24 Cauchy-Kowalewski, 347 chained form, 190, 300 Chen-Fliess, 381 Chow, 185, 201 chronological, 113, 381, 389-391, 393, 401-403 Chyba, 6, 201, 203, 215, 457 co-adjoint orbits, 23 conservation laws, 23, 25, 409 contact system, 77-80, 82-89, 92-96, 100, 102-104, 106109, 111 control function, 20, 151, 290, 349 cotangent bundle, 19, 20, 81, 171, 185, 235, 412, 413 Darboux, 78, 84 Delaunay, 3 Delgado, 405 diamond Lie algebra, 116, 140, 141 Diop, 6, 344 distillation column, 257, 260, 261, 273, 275, 278, 286 Dubins, 3, 15, 59
474
elastic problem, 3-6, 11, 17, 18, 35, 36, 44-46, 48-50, 296 end-point mapping, 441, 443, 448 Engel, 78, 85, 86, 88-91, 94-96, 183, 194, 195, 200 extremal curves, 19 feedback, 102-104,106, 109-111, 117, 147, 148, 157, 158, 180, 205, 208, 240, 244, 249, 250, 300, 387, 423-425,428-430, 433, 435, 444, 445 flatness, 423, 424, 426, 429, 435 framed Curves, 11 Gauthier, 6, 8, 9, 109, 110, 285, 286, 307, 345, 357 geodesic curvature, 11 geometric control, 1, 4, 5, 7, 8, 50, 51, 76, 109, 115, 123, 287 Goursat, 77-80, 84, 86, 87, 9296, 103, 106-108, 110, 111, 184, 190, 193, 195, 196, 202 Gramm-Schmidt, 12 Griffiths, 4 growth vector, 112, 186, 188, 195, 196 Hamilton-Jacobi, 133 Heisenberg, 183, 186, 197, 202, 298, 356 Hestenes, 116,117, 150-152,155, 158, 164, 458 high gain, 257-265, 268, 270 holomorphic, 8, 76, 354, 459463, 468-471 horizontal lift, 187 Hovercraft, 423-426, 435-437 hyper-elliptic integral, 184, 193
Jacobi, 7, 85, 90, 180, 384, 387, 405, 406, 411, 418, 421 Jakubczyk, 110, 113, 164,165, 215, 303, 357 jet, 77, 82-85, 112, 354-356, 423, 459, 460, 466 Jourani, 5, 359, 361, 368, 379 Jurdjevic, 2, 3, 6, 50, 51, 53, 75, 76, 113, 116, 143, 163, 164, 257, 287, 290, 296, 297, 303, 305, 381, 401, 402 Kalman, 6, 257-262, 264, 265, 273, 281, 283, 286, 307 Kawski, 5, 381, 402 Killing form, 21 Kupka, 116, 143, 163, 164, 194, 201, 214, 256, 285, 307, 345, 457, 458 Lagrangian submanifold, 170, 176, 177 Launay, 8, 233, 256 Lawson, 5, 53, 61, 75 left invariant, 20, 189 Legendre, 174, 227, 412, 413, 415, 416, 442 Lie system, 53, 287-290, 296, 300 local controllability, 218, 231, 359-362, 378, 381, 390, 395, 402, 403 magnetic field, 183, 184, 187, 188, 190, 192-195, 198, 200 Mars, 233, 235, 238 Marsden, 422, 471 Martinet, 183, 194, 195, 201, 450, 456, 458 Maximum Principle, 19, 53, 145, 217 Mayer form, 169
475
mechanical top, 3, 5, 6, 35 Mittenhuber, 5, 35, 50, 51, 53, 76 Monroy, 3, 60, 183 moving frame, 7, 40, 43, 60, 235 Nunez, 7, 421 needle variations, 386 nilpotent group, 183, 202 non-controllable system, 63 non-holonomic, 3, 20, 50, 51, 76, 183, 184, 186, 189, 198 non-linear control, 2, 4, 5, 8 normal form, 70, 72, 77, 79, 80, 83-87, 90, 94-96, 106112, 190, 193, 202, 233, 235, 249, 252, 258, 259, 261-263, 266, 275, 443 observability, 5,6, 8, 258-262, 263, 266, 275, 278, 286, 305309, 311, 313 obstructions, 204, 225, 226, 381, 385, 386, 399, 471 Pasillas, 8, 77, 84, 111 pendulum, 37, 38, 40-42, 49, 121 perturbation, 73, 279, 281, 283, 382, 405, 408, 432, 433, 435 Pfaff, 78, 84, 85, 111 Pfaffian system, 77-79, 81-91, 94, 110, 184, 186, 188, 190193, 195, 197, 198 polynomial systems, 3, 319 Pomet, 6, 79, 84, 89, 109, 111, 203, 215 principal bundle, 186, 187, 289 prolongation, 108, 111, 406, 412, 414, 415 Ramos, 5, 287, 303
reachable set, 56, 62, 143, 217219, 221, 225, 230, 233-235, 240, 241, 248, 249, 252, 255, 256, 383-386, 395-398, 400 realization, 8,141, 299, 302, 347350 reconstruction, 347 Respondek, 8, 77, 84, 111, 113, 164-166, 181, 215, 303, 420, 458 Rigid Body, 51 rigid body, 37, 42, 43 Salas, 7, 405, 421 seminormality, 359-362, 368370, 372-374, 377 Serret-Frenet, 5 simulation, 233, 237, 242, 278, 279, 281, 283, 424, 433 Sira, 8, 423, 424, 437 space shuttle, 6, 8, 233, 235, 252, 255 steering, 54-56, 111, 143, 144, 304, 442, 451, 454, 455 Stefani, 7, 169, 181, 222, 225, 230, 231, 385, 386, 399, 403 Stein, 463-465, 467-472 strong optimality, 170, 181 strong solution, 220, 224 sub-Riemannian, 3, 6, 7, 183186, 193, 198, 201, 202, 248, 441, 450, 451, 453, 455, 457, 458 superposition, 288, 303 supporting hyperplanes, 382, 395-397
476
Sussmann, 113, 153, 159, 165, 180, 193, 201, 215, 225, 231, 256, 290, 303, 347, 357, 380, 385, 387, 399, 401-403, 420 tangent bundle, 38, 80, 406, 407, 412, 415 Trelat, 441, 458 tracking, 423, 424, 429-433, 435, 437 trajectory variations, 220-223, 230 Varolin, 8, 459, 472 weak solution, 217, 220, 221, 224, 229, 230 Wei and Norman, 5, 287, 291 Willems, 7, 113, 165, 303, 345 Zezza, 7, 169, 181
Contemporary Trends m Nonlinear Geometric Control Theory and its Applications Mathematical control theory has evolved from the study of practical problems in engineering and sciences to the elaboration of deep, important concepts in mathematics and applied sciences. This volume concerns contemporary trends in nonlinear geometric control theory and its applications. It is a fine collection of papers presentingnew results, relevant open problems, and important applications regarding academic and real-world problems. The book is dedicated to Velimir Jurdjevic whose scientific activity has been influential in the research of many of the authors. It contains a ^ < < 5 ^ number of articles specially written ^§mW\ by colleagues and friends of Vel Jurdjevic, all of them leading applied mathematicians and control theorists. There is also place for surveys on topics of current research which present the state of the art of modern geometric control theory. Finally, the volume contains several new mathematical ideas generated by geometric control theory techniques, which may initiate new directions of research beyond control theory.
www. worldscientific.com 4879 he T
fi'-*-.*H