Flexible Multibody Dynamics
SOLID MECHANICS AND ITS APPLICATIONS Volume 176
Series Editor:
G.M.L. GLADWELL Department of Civil Engineering University of Waterloo Waterloo, Ontario, Canada N2L 3GI
Aims and Scope of the Series The fundamental questions arising in mechanics are: Why?, How?, and How much? The aim of this series is to provide lucid accounts written by authoritative researchers giving vision and insight in answering these questions on the subject of mechanics as it relates to solids. The scope of the series covers the entire spectrum of solid mechanics. Thus it includes the foundation of mechanics; variational formulations; computational mechanics; statics, kinematics and dynamics of rigid and elastic bodies: vibrations of solids and structures; dynamical systems and chaos; the theories of elasticity, plasticity and viscoelasticity; composite materials; rods, beams, shells and membranes; structural control and stability; soils, rocks and geomechanics; fracture; tribology; experimental mechanics; biomechanics and machine design. The median level of presentation is the first year graduate student. Some texts are monographs defining the current state of the field; others are accessible to final year undergraduates; but essentially the emphasis is on readability and clarity.
For other titles published in this series, go to www.springer.com/series/6557
O. A. Bauchau
Flexible Multibody Dynamics
1C
O. A. Bauchau Georgia Institute of Technology School of Aerospace Engineering Ferst Dr. 270 30332-0150 Atlanta Georgia USA
[email protected]
ISSN 0925-0042 ISBN 978-94-007-0334-6 e-ISBN 978-94-007-0335-3 DOI 10.1007/978-94-007-0335-3 Springer Dordrecht Heidelberg London New York Library of Congress Control Number: 2010938509 © Springer Science+Business Media B.V. 2011 No part of this work may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, microfilming, recording or otherwise, without written permission from the Publisher, with the exception of any material supplied specifically for the purpose of being entered and executed on a computer system, for exclusive use by the purchaser of the work. Cover Design: CREST Printed on acid-free paper Springer is part of Springer Science+Business Media (www.springer.com)
To my wife, Yi-Ling, and my family
Preface
Multibody dynamics analysis was originally developed as a tool for modeling rigid multibody systems with simple tree-like topologies, but has considerably evolved to the point where it can handle linearly and nonlinearly elastic multibody systems with arbitrary topologies. It is now used widely as a fundamental design tool in many areas of engineering. This textbook has emerged over the past two decades from efforts to teach graduate courses in advanced dynamics and ƀexible multibody dynamics to engineering students. Although this book reviews the basic principles of dynamics, it is assumed that students enrolling in these graduate courses have completed a comprehensive set of undergraduate courses in statics, dynamics, deformable bodies, energy methods, and numerical analysis. The advanced dynamics course is, of course, a prerequisite for the ƀexible multibody dynamics course. The book is divided into six parts. The ſrst part presents the basic tools and concepts that form the foundation for the other parts. It begins with a review of basic operations on vectors and tensors. The second chapter deals with coordinate systems. The differential geometry of both curves and surfaces is presented and leads to path and surface coordinates. Chapter 3 reviews the basic principles of dynamics, starting with Newton’s laws. The important concept of conservative forces is discussed. Systems of particles are then treated, leading to Euler’s ſrst and second laws. Chapter 4 concludes the ſrst part of the book with a detailed description of threedimensional rotation. For most graduate students, this chapter is not really a review. Indeed, many undergraduate dynamics courses focus primarily on two-dimensional systems. Problems involving three-dimensional rotation, if treated at all, are often rushed in the last few weeks of the semester, leaving most students with insufſcient time to absorb this difſcult material. Part 2 develops rigid body dynamics, the foundation of multibody dynamics. The analysis of the kinematics of rigid bodies is the focus of chapter 5. It starts with the analysis of the general displacement and velocity ſelds of a rigid body. The classical topics of relative velocities and accelerations are also addressed. The motion tensor and its properties are given an in-depth treatment.
VIII
Preface
Kinetics of rigid bodies is the focus of chapter 6. The various forms of Euler’s law governing the rotational motion of rigid bodies are presented. While the emphasis of the chapter is on three-dimensional problems, planar motion is also treated in details. Part 3 presents the fundamental concepts of analytical dynamics. Chapter 7 introduces the concepts of virtual displacement, virtual rotation, and virtual work. The principle of virtual work for static problems is given extensive coverage as this is an indispensable topic for the study of the variational and energy principles of dynamics presented in chapter 8. D’Alembert’s principle, Hamilton’s principle, and Lagrange’s formulation are derived and their use illustrated with numerous examples. Multibody systems are characterized by two distinguishing features: system components undergo ſnite relative rotations and these components are connected by mechanical joints that impose restrictions on their relative motion. The ſrst distinguishing feature is of a purely kinematic nature: in multibody systems, overall and relative motions are ſnite, leading to inherently nonlinear problems. The second distinguishing feature is the main culprit for the complexity of many multibody formulations. Each component of a ƀexible multibody system is a constrained dynamical system because of the restrictions imposed on it by the mechanical joints connecting it to others. The ſrst three parts of the book present background material on unconstrained dynamical systems, i.e., systems for which the number of generalized coordinated used to describe the system equals the number of degrees of freedom. In contrast, part 4 focuses on constrained dynamical systems. Chapter 9 presents Lagrange’s multiplier technique and the distinction between holonomic and nonholonomic constraints. The combination of the principle of virtual work with Lagrange’s multiplier technique is shown to provide a powerful tool for the analysis of constrained static problems. Chapter 10 reviews the classical formulations for constrained dynamical systems. D’Alembert’s principle, Hamilton’s principle, and Lagrange’s formulation are updated to accommodate the presence of both holonomic and nonholonomic constraints. The kinematic constraints associated with the lower pair joints are described in details. The advanced formulations presented in chapter 11 form the theoretical basis for the practical approaches to numerical solutions of multibody systems. Maggi’s, the index-1, the null space, and Udwadia and Kalaba’s formulations are presented and the chapter concludes with the geometric interpretation of constraints and Gauss’ principle. Finally, chapter 12 describes in a cursory manner the many numerical approaches used to treat constrained dynamical systems, most of which are rooted in the formulations presented in chapter 11. Chapter 12 is in fact a comprehensive review of the literature on methods of constrained dynamics applied to the solution of multibody systems. It is clearly impossible to treat each approach in detail. Rather, the salient features of each approach are given, and the relationships between them are underlined. The chapter concludes with a detailed description of scaling methods for Lagrange’s equations of the ſrst kind.
Preface
IX
Part 5 presents a comprehensive overview of the many approaches used to parameterize rotation and motion. The vectorial parameterizations of rotation and motion are given special emphasis as they provide a uniſed approach to this complex topic. Speciſc parameterizations widely used in multibody formulations are reviewed in details, whereas other are presented in a more cursory manner. The last part of the book focuses on ƀexible multibody dynamics problems, which are categorized into three groups: rigid multibody systems, linearly elastic multibody systems, and nonlinearly elastic multibody systems. The last three chapters of the book focus on the latter category, nonlinearly elastic multibody systems. Chapter 15 presents background material. The basic equations of linear elastodynamics are presented ſrst. Next, ſnite displacement kinematics are studied, with special emphasis on small strain problems. Chapter 16 develops the governing equations of ƀexible joints, cables, beams, and plates and shells. All formulations are geometrically exact, i.e., all structural components are allowed to undergo arbitrarily large displacements and rotations, although strains are assumed to remain small. Finally, chapter 17 presents details of the implementation of these elements within the framework of ſnite element formulations. For instance, interpolation of the rotation ſelds is an issue that requires special attention. The topics covered in the ſrst three parts of the book form the basis for a threecredit hour, graduate level course in advanced dynamics, typically taken by ſrst year graduate students. Topics selected from the last three parts provide an ample material for a follow-on, three-credit hour, graduate level course in ƀexible multibody dynamics. The advanced dynamics course is, of course, a prerequisite for the ƀexible multibody dynamics course. Typically, engineering students generally grasp concepts more quickly when presented ſrst with practical examples, which then lead to broader generalizations. Consequently, most concepts are ſrst introduced by means of simple examples; more formal and abstract statements are presented later, when the student has a better grasp of the signiſcance of the concepts. Numerous homework problems are included throughout the book. Some are straightforward applications of basic concepts, others are small projects that require the use of computers and mathematical software, and others involve conceptual questions that are more appropriate for quizzes and exams. The text also provides many examples that treat practical problems in great details. Some of the examples are re-examined in successive chapters to illustrate alternative or more versatile solution methods. Notation is a challenging issue in dynamics. Given the limitations of the Latin and Greek alphabets, the same symbol is sometimes used to indicate different quantities, but mostly in different contexts. Consequently, no attempt has been made to provide a comprehensive nomenclature, which would lead to even more confusion. It is traditional to use a bold typeface to represent vectors and tensors, but this is very difſcult to reproduce in handwriting, whether on a board or in personal notes. A notation that is more suitable for hand-written notes has been adopted here. Vectors and arrays are denoted using an underline, such as u or F . Unit vectors are used
X
Preface
frequently and are assigned a special notation using a single overbar, such as ¯ı1 , which denotes the ſrst Cartesian coordinate axis. The overbar notation also indicates non-dimensional scalar quantities, i.e., k¯ is a non-dimensional stiffness coefſcient. This is inconsistent, but the two uses are in such different contexts that it should not lead to confusion. Second-order tensors and matrices are indicated using a doubleunderline, i.e., R indicates a 3×3 rotation tensor or M a n × n mass matrix. Notations aT b, ab, and a bT indicate the scalar, vector, and tensor products, respectively, of two vectors, a and b. While many students voice their displeasure with this mnemonic convention that departs from the classical “dot” and “cross product” notations, they very rapidly recognize and appreciate its power and conciseness. Finally, I am indebted to the many students at Georgia Tech who have given me helpful and constructive feedback over the past decade as I developed the course notes that are the precursors of this book. The constructive use of their many questions and confusion has helped shape this book, and the treatment of many topics was modiſed numerous times before ſnding their ſnal form.
Atlanta, Georgia, July 2010
Olivier Bauchau
Contents
Part I Basic tools and concepts 1
Vectors and tensors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1 Free vectors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.1 Vector sum . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.2 Scalar multiplication . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.3 Norm of a free vector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.4 Angle between two vectors . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.5 The scalar product . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.6 Orthonormal bases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.7 The vector product . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.8 The tensor product . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.9 The mixed product . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.10 Tensor identities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.11 Solution of the vector product equation . . . . . . . . . . . . . . . . . . 1.1.12 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2 Bound vectors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2.1 The position vector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2.2 Reference frames . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.3 Geometric entities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.3.1 Lines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.3.2 Planes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.3.3 Circles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.3.4 Spheres . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.3.5 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.4 Second-order tensors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.4.1 Basic operations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.4.2 Eigenvalue analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.4.3 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.5 Tensor calculus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3 3 4 4 4 5 5 6 7 9 10 10 11 12 13 14 14 15 15 16 16 16 20 22 22 23 27 27
XII
Contents
1.6 Notational conventions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
29
2
Coordinate systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.1 Cartesian coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.2 Differential geometry of a curve . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.2.1 Intrinsic parameterization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.2.2 Arbitrary parameterization . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.3 Path coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.3.1 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.4 Differential geometry of a surface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.4.1 The ſrst metric tensor of a surface . . . . . . . . . . . . . . . . . . . . . . 2.4.2 Curve on a surface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.4.3 The second metric tensor of a surface . . . . . . . . . . . . . . . . . . . 2.4.4 Analysis of curvatures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.4.5 Lines of curvature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.4.6 Derivatives of the base vectors . . . . . . . . . . . . . . . . . . . . . . . . . 2.4.7 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.5 Surface coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.6 Differential geometry of a three-dimensional mapping . . . . . . . . . . . . 2.6.1 Arbitrary parameterization . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.6.2 Orthogonal parameterization . . . . . . . . . . . . . . . . . . . . . . . . . . 2.6.3 Derivatives of the base vectors . . . . . . . . . . . . . . . . . . . . . . . . . 2.7 Orthogonal curvilinear coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.7.1 Cylindrical coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.7.2 Spherical coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
31 31 32 32 34 39 39 40 40 41 42 43 44 44 48 49 49 50 51 52 53 54 55
3
Basic principles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.1 Newtonian mechanics for a particle . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.1.1 Kinematics of a particle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.1.2 Newton’s laws . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.1.3 Systems of units . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.1.4 The principle of work and energy . . . . . . . . . . . . . . . . . . . . . . . 3.2 Conservative forces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.2.1 Principle of conservation of energy . . . . . . . . . . . . . . . . . . . . . 3.2.2 Potential of common conservative forces . . . . . . . . . . . . . . . . 3.2.3 Non-conservative forces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.2.4 The principle of impulse and momentum . . . . . . . . . . . . . . . . 3.2.5 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.3 Contact forces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.3.1 Kinematics of particles in contact with a surface . . . . . . . . . . 3.3.2 Kinematics of particles in contact with a curve . . . . . . . . . . . . 3.3.3 Constitutive laws for tangential contact forces . . . . . . . . . . . . 3.3.4 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.4 Newtonian mechanics for a system of particles . . . . . . . . . . . . . . . . . . 3.4.1 The center of mass . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
57 57 57 58 60 61 62 65 66 70 74 79 85 86 87 88 92 94 95
Contents
3.4.2 3.4.3 3.4.4 3.4.5 3.4.6 3.4.7 4
XIII
The forces and moments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96 Linear and angular momenta . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 Euler’s laws for a system of particles . . . . . . . . . . . . . . . . . . . . 98 The principle of work and energy . . . . . . . . . . . . . . . . . . . . . . . 100 The principle of impulse and momentum . . . . . . . . . . . . . . . . 100 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
The geometric description of rotation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.1 The direction cosine matrix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.2 Planar rotations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.3 Non-commutativity of rotations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.4 Euler angles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.4.1 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.5 Euler’s theorem on rotations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.6 The rotation tensor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.7 Properties of the rotation tensor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.8 Change of basis operations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.8.1 Vector components in various orthonormal bases . . . . . . . . . . 4.8.2 Second-order tensor components in various orthonormal bases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.8.3 Tensor operations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.8.4 The concept of tensor analysis . . . . . . . . . . . . . . . . . . . . . . . . . 4.8.5 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.9 Composition of rotations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.9.1 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.10 Time derivatives of rotation operations . . . . . . . . . . . . . . . . . . . . . . . . . 4.10.1 The angular velocity vector: an intuitive approach . . . . . . . . . 4.10.2 The angular velocity vector: a rigorous approach . . . . . . . . . . 4.10.3 The addition theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.10.4 Angular acceleration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.11 Euler angle formulas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.11.1 Euler angles: sequence 3-1-3 . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.11.2 Euler angles: sequence 3-2-3 . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.11.3 Euler angles: sequence 3-2-1 . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.11.4 Euler angles: sequence 3-1-2 . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.11.5 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.12 Spatial derivatives of rotation operations . . . . . . . . . . . . . . . . . . . . . . . 4.12.1 Path coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.12.2 Surface coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.12.3 Orthogonal curvilinear coordinates . . . . . . . . . . . . . . . . . . . . . 4.12.4 The differential rotation vector . . . . . . . . . . . . . . . . . . . . . . . . . 4.13 Applications to particle dynamics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.13.1 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.14 Change of reference frame operations . . . . . . . . . . . . . . . . . . . . . . . . . . 4.15 Orientation of a unit vector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
107 107 108 109 109 111 112 113 115 116 116 117 119 121 122 124 127 129 129 131 135 137 137 138 139 139 140 141 143 143 144 145 148 149 152 155 157
XIV
Contents
Part II Rigid body dynamics 5
Kinematics of rigid bodies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.1 General motion of a rigid body . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.2 Velocity ſeld of a rigid body . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.2.1 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.3 Relative velocity and acceleration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.3.1 Point P is in motion with respect to the rigid body . . . . . . . . . 5.3.2 Point P is a material point of the rigid body . . . . . . . . . . . . . . 5.3.3 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.4 Contact between rigid bodies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.4.1 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.5 The motion tensor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.5.1 Transformation of a line of a rigid body . . . . . . . . . . . . . . . . . 5.5.2 Properties of the motion tensor . . . . . . . . . . . . . . . . . . . . . . . . . 5.5.3 Mozzi-Chasles’ axis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.5.4 Intrinsic expression of the motion tensor . . . . . . . . . . . . . . . . . 5.5.5 Properties of the generalized vector product tensor . . . . . . . . 5.5.6 Change of frame operation for linear and angular velocities . 5.5.7 Change of frame operation for forces and moments . . . . . . . . 5.6 Derivatives of ſnite motion operations . . . . . . . . . . . . . . . . . . . . . . . . . 5.6.1 The velocity vector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.6.2 The differential motion vector . . . . . . . . . . . . . . . . . . . . . . . . . . 5.6.3 Change of frame operations . . . . . . . . . . . . . . . . . . . . . . . . . . .
161 161 163 165 166 167 168 174 179 183 187 187 188 189 190 191 192 193 194 194 196 197
6
Kinetics of rigid bodies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.1 The angular momentum vector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.2 The kinetic energy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.3 Properties of the mass moment of inertia tensor . . . . . . . . . . . . . . . . . 6.3.1 The parallel axis theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.3.2 Change of basis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.3.3 Principal axes of inertia . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.3.4 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.4 Derivatives of the angular momentum vector . . . . . . . . . . . . . . . . . . . . 6.5 Equations of motion for a rigid body . . . . . . . . . . . . . . . . . . . . . . . . . . 6.5.1 Euler’s equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.5.2 The pivot equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.5.3 Equations of motion with respect to a material point of the rigid body . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.5.4 Equations of motion with respect to an arbitrary point . . . . . . 6.6 The principle of work and energy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.6.1 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.7 Planar motion of rigid bodies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.7.1 Euler’s equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
201 202 204 205 205 206 207 208 210 211 212 213 213 214 214 222 227 227
Contents
6.7.2 6.7.3
The pivot equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Equations of motion with respect to a material point of the body . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.7.4 Equations of motion with respect to an arbitrary point . . . . . . 6.7.5 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.8 Inertial characteristics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
XV
228 228 229 237 248
Part III Concepts of analytical dynamics 7
Basic concepts of analytical dynamics . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.1 Mathematical preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.1.1 Stationary point of a function . . . . . . . . . . . . . . . . . . . . . . . . . . 7.1.2 Stationary point of a deſnite integral . . . . . . . . . . . . . . . . . . . . 7.2 Generalized coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.3 The virtual displacement and rotation vectors . . . . . . . . . . . . . . . . . . . 7.3.1 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.4 Virtual work and generalized forces . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.4.1 Virtual work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.4.2 Generalized forces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.4.3 Virtual work done by internal forces . . . . . . . . . . . . . . . . . . . . 7.4.4 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.5 The principle of virtual work for statics . . . . . . . . . . . . . . . . . . . . . . . . 7.5.1 Principle of virtual work for a single particle . . . . . . . . . . . . . 7.5.2 Kinematically admissible virtual displacements . . . . . . . . . . . 7.5.3 Use of inſnitesimal displacements as virtual displacements . 7.5.4 Principle of virtual work for a system of particles . . . . . . . . . 7.5.5 The use of generalized coordinates . . . . . . . . . . . . . . . . . . . . . . 7.5.6 The principle of virtual work and conservative forces . . . . . . 7.5.7 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
253 253 254 255 257 263 268 268 268 268 269 271 271 272 277 282 284 287 289 292
8
Variational and energy principles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8.1 D’Alembert’s principle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8.1.1 Equations of motion for a rigid body . . . . . . . . . . . . . . . . . . . . 8.1.2 Equations of motion for the planar motion of a rigid body . . 8.1.3 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8.2 Hamilton’s principle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8.2.1 Use of physical coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8.2.2 Use of generalized coordinates . . . . . . . . . . . . . . . . . . . . . . . . . 8.2.3 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8.3 Lagrange’s formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8.3.1 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8.4 Analysis of the motion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8.4.1 General procedure for the analysis of motion . . . . . . . . . . . . . 8.4.2 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
295 295 297 299 304 305 306 308 320 322 335 336 337 345
XVI
Contents
Part IV Constrained dynamical systems 9
Constrained systems: preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.1 Lagrange’s multiplier method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.1.1 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.2 Constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.2.1 Holonomic constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.2.2 Nonholonomic constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.2.3 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.3 The principle of virtual work for constrained static problems . . . . . . 9.3.1 The principle of virtual work for a constrained particle . . . . . 9.3.2 The principle of virtual work and Lagrange multipliers . . . . . 9.3.3 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
351 358 362 362 362 364 370 371 372 378 382
10 Constrained systems: classical formulations . . . . . . . . . . . . . . . . . . . . . . . 10.1 D’Alembert’s principle for constrained systems . . . . . . . . . . . . . . . . . 10.1.1 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.2 Hamilton’s principle and Lagrange’s formulation with holonomic constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.2.1 Hamilton’s principle with holonomic constraints . . . . . . . . . . 10.2.2 Lagrange’s formulation with holonomic constraints . . . . . . . . 10.2.3 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.3 Hamilton’s principle and Lagrange’s formulation with nonholonomic constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.3.1 Hamilton’s principle with nonholonomic constraints . . . . . . . 10.3.2 Lagrange’s formulation with nonholonomic constraints . . . . 10.3.3 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.4 The lower pair joints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.4.1 Kinematics of a typical lower pair joint . . . . . . . . . . . . . . . . . . 10.4.2 Notational conventions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.4.3 Relative motions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.5 Generic constraints for lower pair joints . . . . . . . . . . . . . . . . . . . . . . . . 10.5.1 First constraint: vanishing relative rotation . . . . . . . . . . . . . . . 10.5.2 Second constraint: vanishing relative displacement . . . . . . . . 10.5.3 Third constraint: deſnition of relative rotation . . . . . . . . . . . . 10.5.4 Fourth constraint: deſnition of relative displacement . . . . . . . 10.6 Constraints for the lower pair joints . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.6.1 Revolute joints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.6.2 Prismatic joints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.6.3 Cylindrical joints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.6.4 Screw joints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.6.5 Planar joints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.6.6 Spherical joints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.6.7 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
385 385 390 392 393 394 398 402 403 403 405 405 406 406 407 408 409 410 411 411 412 412 414 415 416 416 417 418
Contents
XVII
10.7 Other joints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.7.1 Universal joints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.7.2 Curve sliding joints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.7.3 Sliding joints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.7.4 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
418 418 419 421 422
11 Constrained systems: advanced formulations . . . . . . . . . . . . . . . . . . . . . . 11.1 Lagrange’s equations of the ſrst kind . . . . . . . . . . . . . . . . . . . . . . . . . . 11.2 Algebraic elimination of Lagrange’s multipliers . . . . . . . . . . . . . . . . . 11.2.1 Maggi’s formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11.2.2 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11.2.3 Index-1 formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11.2.4 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11.2.5 The null space formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11.2.6 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11.2.7 Udwadia and Kalaba’s formulation . . . . . . . . . . . . . . . . . . . . . 11.2.8 Comparison of the ODE formulations . . . . . . . . . . . . . . . . . . . 11.2.9 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11.3 The geometric interpretation of constraints . . . . . . . . . . . . . . . . . . . . . 11.3.1 The orthogonal projection operator . . . . . . . . . . . . . . . . . . . . . 11.3.2 The projection operator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11.3.3 Projection of the equations of motion . . . . . . . . . . . . . . . . . . . 11.3.4 Elimination of Lagrange’s multipliers . . . . . . . . . . . . . . . . . . . 11.4 Gauss’ principle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11.5 Additional formulations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
425 426 427 428 435 438 441 442 444 444 445 447 447 448 450 453 455 457 461
12 Constrained systems: numerical methods . . . . . . . . . . . . . . . . . . . . . . . . . 12.1 Ordinary differential equation techniques . . . . . . . . . . . . . . . . . . . . . . . 12.1.1 “Maggi-like” formulations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12.1.2 Maggi’s formulations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12.1.3 Discussion of the methods based on Maggi’s formulation . . . 12.1.4 Null space formulations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12.1.5 Udwadia and Kalaba’s formulations . . . . . . . . . . . . . . . . . . . . . 12.1.6 The projective formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12.1.7 Modiſed phase space formulation . . . . . . . . . . . . . . . . . . . . . . 12.2 Index reduction techniques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12.3 Constraint violation stabilization techniques . . . . . . . . . . . . . . . . . . . . 12.3.1 Control theory based stabilization techniques . . . . . . . . . . . . . 12.3.2 Penalty based stabilization techniques . . . . . . . . . . . . . . . . . . . 12.4 Constraint violation elimination techniques . . . . . . . . . . . . . . . . . . . . . 12.4.1 Geometric projection approach to stabilization . . . . . . . . . . . . 12.4.2 The mass-orthogonal projection formulation . . . . . . . . . . . . . 12.5 Finite element based techniques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12.5.1 Floating frame of reference approach . . . . . . . . . . . . . . . . . . . . 12.5.2 Component mode synthesis methods . . . . . . . . . . . . . . . . . . . .
463 464 464 466 467 468 469 469 470 470 473 473 475 478 478 480 481 482 484
XVIII
Contents
12.5.3 Basic solution techniques for ſnite element models . . . . . . . . 12.5.4 Numerically dissipative schemes . . . . . . . . . . . . . . . . . . . . . . . 12.5.5 Nonlinear unconditionally stable schemes . . . . . . . . . . . . . . . . 12.5.6 Enforcement of the constraints . . . . . . . . . . . . . . . . . . . . . . . . . 12.5.7 The discrete null space approach . . . . . . . . . . . . . . . . . . . . . . . 12.6 Scaling of Lagrange’s equation of the ſrst kind . . . . . . . . . . . . . . . . . . 12.6.1 Scaling of the equations of motion . . . . . . . . . . . . . . . . . . . . . . 12.6.2 The augmented Lagrangian term . . . . . . . . . . . . . . . . . . . . . . . 12.6.3 Time discretization of the equations . . . . . . . . . . . . . . . . . . . . . 12.6.4 Relationship to the preconditioning approach . . . . . . . . . . . . . 12.6.5 Beneſts of the augmented Lagrangian formulation . . . . . . . . 12.6.6 Using other time integration schemes . . . . . . . . . . . . . . . . . . . 12.7 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
485 487 487 488 489 490 492 493 494 498 499 501 504
Part V Parameterization of rotation and motion 13 Parameterization of rotation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.1 Cayley’s rotation parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.2 Quaternion algebra . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.3 Euler parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.3.1 The rotation tensor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.3.2 The angular velocity vector . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.3.3 Composition of rotations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.3.4 Determination of Euler parameters . . . . . . . . . . . . . . . . . . . . . . 13.3.5 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.4 The vectorial parameterization of rotation . . . . . . . . . . . . . . . . . . . . . . 13.4.1 Fundamental properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.4.2 The rotation tensor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.4.3 The angular velocity vector . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.4.4 Determination of the rotation parameter vector . . . . . . . . . . . 13.4.5 Composition of rotations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.4.6 Linearization of the tangent tensor . . . . . . . . . . . . . . . . . . . . . . 13.4.7 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.5 Speciſc choices of generating function . . . . . . . . . . . . . . . . . . . . . . . . 13.6 The extended vectorial parameterization . . . . . . . . . . . . . . . . . . . . . . . 13.6.1 Singularities of the vectorial parameterization . . . . . . . . . . . . 13.6.2 The rescaling operation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.7 Speciſc parameterizations of rotation . . . . . . . . . . . . . . . . . . . . . . . . . . 13.7.1 The Cartesian rotation vector . . . . . . . . . . . . . . . . . . . . . . . . . . 13.7.2 The Euler-Rodrigues parameters . . . . . . . . . . . . . . . . . . . . . . . 13.7.3 The Cayley-Gibbs-Rodrigues parameters . . . . . . . . . . . . . . . . 13.7.4 The Wiener-Milenkovi´c parameters . . . . . . . . . . . . . . . . . . . . . 13.7.5 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
511 513 514 516 517 517 518 518 523 524 524 526 527 529 530 530 530 531 533 533 534 537 537 538 538 539 541
Contents
14 Parameterization of motion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14.1 Cayley’s motion parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14.2 Bi-quaternion algebra . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14.3 Euler motion parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14.3.1 The motion tensor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14.3.2 The velocity vector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14.3.3 Composition of ſnite motions . . . . . . . . . . . . . . . . . . . . . . . . . . 14.3.4 Determination of Euler motion parameters . . . . . . . . . . . . . . . 14.3.5 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14.4 The vectorial parameterization of motion . . . . . . . . . . . . . . . . . . . . . . . 14.4.1 Fundamental properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14.4.2 The motion parameter vector . . . . . . . . . . . . . . . . . . . . . . . . . . 14.4.3 The generalized vector product tensor . . . . . . . . . . . . . . . . . . . 14.4.4 The motion tensor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14.4.5 The velocity vector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14.4.6 Determination of the motion parameter vector . . . . . . . . . . . . 14.4.7 Composition of ſnite motions . . . . . . . . . . . . . . . . . . . . . . . . . . 14.5 Speciſc parameterizations of motion . . . . . . . . . . . . . . . . . . . . . . . . . . 14.5.1 Alternative choices of the motion parameter vector . . . . . . . . 14.5.2 The exponential map of motion . . . . . . . . . . . . . . . . . . . . . . . . 14.5.3 The Euler-Rodrigues motion parameters . . . . . . . . . . . . . . . . . 14.5.4 The Cayley-Gibbs-Rodrigues motion parameters . . . . . . . . . . 14.5.5 The Wiener-Milenkovi´c motion parameters . . . . . . . . . . . . . . 14.5.6 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
XIX
543 544 546 547 548 549 550 550 553 554 554 556 558 558 559 561 561 561 562 563 563 563 564 564
Part VI Flexible multibody dynamics 15 Flexible multibody systems: preliminaries . . . . . . . . . . . . . . . . . . . . . . . . 569 15.1 Classiſcation of multibody systems . . . . . . . . . . . . . . . . . . . . . . . . . . . 569 15.1.1 Linearly and nonlinearly elastic multibody systems . . . . . . . . 570 15.1.2 Shortcomings of modal analysis applied to nonlinear systems 571 15.1.3 Finite element based modeling of ƀexible multibody systems 575 15.2 The elastodynamics problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 579 15.2.1 Review of the equations of linear elastodynamics . . . . . . . . . 580 15.2.2 The principle of virtual work . . . . . . . . . . . . . . . . . . . . . . . . . . 583 15.2.3 Hamilton’s principle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 586 15.3 Finite displacements kinematics for ƀexible bodies . . . . . . . . . . . . . . 588 15.3.1 The engineering strain components . . . . . . . . . . . . . . . . . . . . . 592 15.3.2 The deformation gradient tensor . . . . . . . . . . . . . . . . . . . . . . . . 592 15.3.3 The metric tensor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 593 15.3.4 The Green-Lagrange strain tensor . . . . . . . . . . . . . . . . . . . . . . 593 15.4 Strain measures for various differential elements . . . . . . . . . . . . . . . . 594 15.4.1 Stretch of a material line . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 594 15.4.2 Angle between two material lines . . . . . . . . . . . . . . . . . . . . . . . 595
XX
Contents
15.4.3 Surface dilatation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15.4.4 Volume dilatation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15.4.5 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15.5 The formulation of small strain problems . . . . . . . . . . . . . . . . . . . . . . . 15.5.1 Decomposition of the deformation gradient tensor . . . . . . . . . 15.5.2 The small strain assumption . . . . . . . . . . . . . . . . . . . . . . . . . . .
595 596 596 597 597 598
16 Formulation of ƀexible elements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.1 Formulation of ƀexible joints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.1.1 Flexible joint conſguration . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.1.2 Flexible joint differential work . . . . . . . . . . . . . . . . . . . . . . . . . 16.1.3 The deformation measures . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.1.4 Change of reference frame . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.1.5 Deformation measure invariance . . . . . . . . . . . . . . . . . . . . . . . 16.1.6 Flexible joint constitutive laws . . . . . . . . . . . . . . . . . . . . . . . . . 16.2 Formulation of cable equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.2.1 The kinematics of the problem . . . . . . . . . . . . . . . . . . . . . . . . . 16.2.2 The small strain assumption . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.2.3 Governing equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.2.4 Extension to dynamic problems . . . . . . . . . . . . . . . . . . . . . . . . 16.2.5 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.3 Formulation of beam equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.3.1 Kinematics of the problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.3.2 Governing equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.3.3 Extension to dynamic problems . . . . . . . . . . . . . . . . . . . . . . . . 16.3.4 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.4 Formulation of plate and shell equations . . . . . . . . . . . . . . . . . . . . . . . 16.4.1 Kinematics of the shell problem . . . . . . . . . . . . . . . . . . . . . . . . 16.4.2 Governing equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.4.3 Extension to dynamic problems . . . . . . . . . . . . . . . . . . . . . . . . 16.4.4 Mixed interpolation of tensorial components . . . . . . . . . . . . .
601 601 603 604 605 606 607 608 613 613 614 615 616 616 617 618 620 622 628 628 629 631 633 634
17 Finite element tools . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 639 17.1 Interpolation of displacement ſelds . . . . . . . . . . . . . . . . . . . . . . . . . . . 639 17.2 Interpolation of rotation ſelds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 644 17.2.1 Finite element discretization . . . . . . . . . . . . . . . . . . . . . . . . . . . 646 17.2.2 Total versus incremental unknowns . . . . . . . . . . . . . . . . . . . . . 650 17.2.3 Interpolation of incremental rotations . . . . . . . . . . . . . . . . . . . 651 17.3 Governing equations and linearization process . . . . . . . . . . . . . . . . . . 657 17.3.1 Statics problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 657 17.3.2 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 659 17.3.3 Linear structural dynamics problems . . . . . . . . . . . . . . . . . . . . 660 17.3.4 Nonlinear structural dynamics problems . . . . . . . . . . . . . . . . . 661 17.3.5 Multibody dynamics problems with holonomic constraints . 662 17.3.6 Multibody dynamics problems with nonholonomic constraints663
Contents
XXI
17.4 The generalized-α time integration scheme . . . . . . . . . . . . . . . . . . . . . 664 17.4.1 Linear structural dynamics problems . . . . . . . . . . . . . . . . . . . . 665 17.4.2 Nonlinear structural dynamics problems . . . . . . . . . . . . . . . . . 667 17.4.3 Multibody dynamics problems with holonomic constraints . 669 17.4.4 Multibody dynamics problems with nonholonomic constraints669 17.5 Energy preserving and decaying schemes . . . . . . . . . . . . . . . . . . . . . . 670 17.5.1 The symmetric hyperbolic form . . . . . . . . . . . . . . . . . . . . . . . . 672 17.5.2 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 677 17.5.3 Practical time integration schemes . . . . . . . . . . . . . . . . . . . . . . 677 17.5.4 Enforcement of the constraints . . . . . . . . . . . . . . . . . . . . . . . . . 682 17.6 Implementation of cable elements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 685 17.6.1 Inertial forces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 686 17.6.2 Elastic forces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 686 17.6.3 Dissipative forces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 687 17.6.4 Gravity forces for cables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 687 17.6.5 Finite element formulation of cables . . . . . . . . . . . . . . . . . . . . 688 17.7 Finite element implementation of beam elements . . . . . . . . . . . . . . . . 689 17.7.1 Inertial forces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 689 17.7.2 Elastic forces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 691 17.7.3 Dissipative forces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 693 17.7.4 Gravity forces for beams . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 695 17.7.5 Finite element formulation of beams . . . . . . . . . . . . . . . . . . . . 695 18 Mathematical tools . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18.1 The singular value decomposition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18.2 The Moore-Penrose generalized inverse . . . . . . . . . . . . . . . . . . . . . . . . 18.2.1 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18.3 Gauss-Legendre quadrature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
697 697 698 699 699
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 703 Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 721
Part I
Basic tools and concepts
1 Vectors and tensors
Vectors and tensors are basic tools for the formulation of kinematics and dynamics problems. This chapter introduces notations and the fundamental operations on vectors and tensors that will be used throughout this book.
1.1 Free vectors Consider two points, denoted A and B, in three-dimensional space, as shown in ſg. 1.1. The line that connects point A to point B is called an oriented line segment, and denoted AB. In the following, the word “segment” will often be used to indicate an oriented line segment. Next, consider two points, A and B , such that ABB A forms a parallelogram. Segments AB and A B are then of identical length and are parallel to each other. Similarly, if two other points, A and B , are such that ABB A also forms a parallelogram, segments AB and A B are then of identical length and parallel to each other. Segments AB, A B , and A B are said to be equivalent. The ensemble of all equivalent segments deſne the free vector a: a given oriented line segment deſnes a free vector. B¢
B
a
a
a
b
b
c
A¢
C
B¢¢ a
A
A
a
B
A¢¢ Fig. 1.1. A free vector.
Fig. 1.2. Sum of two vectors c = a + b.
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_1 © Springer Science+Business Media B.V. 2011
4
1 Vectors and tensors
1.1.1 Vector sum The addition of two free vectors, a and b, is described in ſg. 1.2. Let segments AB and BC deſne vectors a and b, respectively, and point B is both the end of segment AB and the origin of segment BC. The vector sum of free vectors, a and b, is then free vector c deſned by all segments equivalent to AC. As ſg. 1.2 indicates, the vector sum is commutative, i.e., c = a + b = b + a.
(1.1)
1.1.2 Scalar multiplication
l
B a
A b = aa
B¢
l’ = al a>0
b = aa B¢¢
A¢¢
Figure 1.3 shows segment AB that deſnes vector a; the length of vector a is deſned as the distance, , between points A and B. The multiplication of a free vector, a, by a scalar, α, is depicted in ſg. 1.3 and is denoted as b = αa. (1.2)
l’ = |a| l a<0
If α is positive, vector b is deſned by segment A B parallel to AB, oriented in Fig. 1.3. Multiplying a vector by a scalar: the same direction, and of length = α. If α is negative, vector b is deſned by segb = αa. ment A B parallel to AB, oriented in the opposite direction, and of length = |α|. A¢
1.1.3 Norm of a free vector Segment AB shown in ſg. 1.3 deſnes a free vector, a. The norm of free vector, a, is deſned as the length, , of any segment deſning it. Notation a is used to express the norm of a vector, a = a = . (1.3) A null vector is a vector of vanishing length, i.e., a = 0 implies a = a = = 0. From these deſnitions, it follows that αa = |α| a,
(1.4)
and the triangular inequality implies a + b ≤ a + b.
(1.5)
¯ A unit A unit vector is a vector of unit norm and is indicated by an overbar, (·). vector can be constructed from any vector, a, by dividing it by its norm, a ¯= By construction, ¯ a = 1.
a . a
(1.6)
1.1 Free vectors
5
1.1.4 Angle between two vectors Figure 1.4 deſnes the angle, Φ, between two free vectors, a and b. Let segments AB and AC deſne the free vectors a and b, respectively. Angle Φ is that between segments AB and AC when these two segments share a common point A. The angle between two free vectors is denoted as (1.7)
Φ = (a, b).
Note that (a, b) = (b, a) = Φ. The angle between two vectors is such that 0 ≤ Φ ≤ π, i.e., sin Φ ≥ 0. c C b
b F
A
b+c a
B ||b||cos(a,b)
a
||c||cos(a,c)
||b + c||cos(a,b + c) Fig. 1.4. Angle between two vectors.
Fig. 1.5. The scalar product is distributive.
1.1.5 The scalar product The scalar product, σ, of two vectors, often called the dot product, is deſned as σ = aT b = ab cos(a, b).
(1.8)
Because cos(a, b) = cos(b, a), the scalar product is a commutative operation σ = aT b = bT a.
(1.9)
Furthermore, it is a distributive operation σ = aT (b + c) = aT b + aT c,
σ = (a + b)T c = aT c + bT c.
(1.10)
This property follows from the fact that b + c cos(a, b + c) = b cos(a, b) + c cos(a, c), as illustrated in ſg. 1.5. The scalar product of a vector by itself yields the square of its norm, aT a = a2 = a2 . Statement aT b = 0 implies that either a = 0 or b = 0, or a is orthogonal to b. The condition for the orthogonality of two vectors is aT b = 0,
(1.11)
provided that neither vector is null. The projection ρ of a vector a along a unit direction n ¯ is readily expressed in terms of the scalar product as ρ = aT n ¯ = a cos(a, n ¯ ).
(1.12)
6
1 Vectors and tensors
1.1.6 Orthonormal bases Figure 1.6 depicts an orthonormal basis, I, deſned by a set of three mutually orthogonal free unit vectors, ¯ı1 , ¯ı2 , and ¯ı3 . Orthonormal bases, also called Cartesian bases, will be indicated with the following notation, I = (¯ı1 , ¯ı2 , ¯ı3 ). In view of this deſnition, it is clear that ¯ıT1 ¯ı1 = ¯ıT2 ¯ı2 = ¯ıT3 ¯ı3 = 1 and ¯ıT1 ¯ı2 = ¯ıT2 ¯ı3 = ¯ıT3 ¯ı1 = 0. These relationships can be summarized as i3 ¯ıTi ¯ıj = δij , (1.13) a i a 2 3 I where δij is the Kronecker’s symbol deſned as a 1 i1 a2 1, i = j, (1.14) δij = 0, i = j. Fig. 1.6. An orthonormal basis I.
As shown in ſg. 1.6, an arbitrary vector, a, can be decomposed in the following manner a = (aT ¯ı1 )¯ı1 + (aT ¯ı2 )¯ı2 + (aT ¯ı3 )¯ı3 = a1¯ı1 + a2¯ı2 + a3¯ı3 ,
(1.15)
where a1 , a2 , and a3 are the projections, eq. (1.12), of vector a along unit vectors ¯ı1 , ¯ı2 , and ¯ı3 , respectively. The components of vector a resolved in orthonormal basis I are the projection of the vector along the unit vectors of the basis, ai = aT ¯ıi , i = 1, 2, 3. The following notation is used ⎧ ⎫ ⎨ a1 ⎬ a[I] = a2 . (1.16) ⎩ ⎭ a3 Notation a is used to indicate a free vector, and notation a[I] indicates the components of vector a resolved in basis I. The components of a vector consist of a set of three number, which are arranged in a column array, as shown in eq. (1.16). Braces are used to indicate a column array. The transpose of the column array is a row array and is denoted with a superscript, (·)T . The following notation will be used
a[I]T = a1 a2 a3 ,
T
or a[I] = a1 a2 a3 .
The components of the unit vectors ¯ı1 , ¯ı2 , and ¯ı3 resolved in basis I are ⎧ ⎫ ⎧ ⎫ ⎧ ⎫ ⎨1⎬ ⎨0⎬ ⎨0 ⎬ [I] [I] [I] ¯ı1 = 0 , ¯ı2 = 1 , ¯ı3 = 0 . ⎩ ⎭ ⎩ ⎭ ⎩ ⎭ 0 0 1
(1.17)
(1.18)
Using the properties of an orthonormal basis, eq. (1.13), the scalar product of two vectors becomes
1.1 Free vectors
⎧ ⎫ ⎨b1 ⎬
aT b = a1 b1 + a2 b2 + a3 b3 = a1 , a2 , a3 b2 ⎩ ⎭ b3
7
(1.19)
where bi , i = 1, 2, 3, are the components of b resolved in basis I. For eq. (1.19) to hold, the components of vectors a and b must be evaluated in the same orthonormal basis. The notation for the scalar product, aT b, is a mnemonic notion for the result expressed by eq. (1.19): the scalar product is obtained by multiplying the row array of the components of vector a resolved in basis I by the column array of the components of vector b resolved in the same basis. The operation of computing the components of a vector in a given basis is a fundamental operation. The following expressions are used interchangeably: “computing the components of vector a in basis I,” or “expressing vector a in basis I,”or “resolving vector a in basis I.” For sake of brevity, expressions such as “the components of a in I,” or “expressing a in I” will also be used.
1.1.7 The vector product The vector product, c, of two vectors, a and b, often called the cross product, is deſned as c= ab = ab sin(a, b) n ¯,
c=~ ab b
(1.20)
(a, b)
a
A = ||a|| ||b|| sin(a, b)
a where n ¯ is a unit vector normal to both a and b, and oriented according to the rightb hand rule, as depicted in ſg. 1.7. Note that A = ab sin(a, b) represents the area Fig. 1.7. The vector product of vectors a and b. of the parallelogram spanned by vectors a and b; hence, the norm of the vector product equals this area. The vector product is anti-commutative,
ab = −ba.
(1.21)
Indeed, the norms of the two vectors are equal, ab = ba = A, but according to the right-hand rule, unit vector n ¯ will point in opposite directions when the order of vectors a and b is reversed. Furthermore, the vector product is a distributive operation ( a + b)c = ac + bc,
a(b + c) = ab + ac.
(1.22)
This property follows from geometric considerations detailed in ſg. 1.8. Note that vectors ac, bc, and ( a + b)c are all in the plane normal to c. Furthermore, triangles OAB and OA B are similar.
8
1 Vectors and tensors
Plane normal to c
c
b ||a + b||sin(a+b, c)
a ~ ac
A¢
a+b
B
O
||b||sin(b, c) ~ bc
A ||a||sin(a, c) ~ ~ B¢ (a + b)c
Fig. 1.8. The vector product is distributive.
Statement ab = 0 implies that either a = 0 or b = 0, or a is parallel to b. The condition for the parallelism of two vectors is ab = 0,
(1.23)
provided that neither vector is null. The vector products of the unit vectors deſning an orthonormal basis are readily obtained from the deſnition of the vector product, eq. (1.20), to ſnd ı1¯ı2 = ¯ı3 , ı3¯ı1 = ¯ı2 , ı2¯ı3 = ¯ı1 , ı2¯ı1 = −¯ı3 , ı1¯ı3 = −¯ı2 , and ı3¯ı2 = −¯ı1 . Of course, the cross product of a vector by itself vanishes. These relationships can be summarized as follows ıi¯ıj = ijk¯ık , (1.24) where summation is implied over the repeated indices, and ijk is the Levi-Civita symbol or permutation symbol ⎧ ⎪ ⎨+1, for a cyclic permutation of the indices, (1.25) ijk = −1, for an acyclic permutation of the indices, ⎪ ⎩ 0, for all other cases. The above relationships implicitly assume that vector ¯ı1 , ¯ı2 , and ¯ı3 have been ordered in such a manner that eqs. (1.24) hold. Such bases are call right-hand bases and will be used exclusively in this book. If ai , bi , and ci , i = 1, 2, 3, are the components of vectors a, b, and c, respectively, resolved in a common basis I, the following relationship holds c = c1¯ı1 + c2¯ı2 + c3¯ı3 = ab = (a2 b3 − a3 b2 )¯ı1 + (a3 b1 − a1 b3 )¯ı2 + (a1 b2 − a2 b1 )¯ı3 , where eqs. (1.22) and (1.24) are used. Taking the scalar product of this expression by ¯ı1 , ¯ı2 , and ¯ı3 then yields
1.1 Free vectors
c[I]
9
⎧ ⎫ ⎧ ⎫ ⎡ ⎤⎧ ⎫ 0 −a3 a2 ⎨b1 ⎬ ⎨c1 ⎬ ⎨a2 b3 − a3 b2 ⎬ = c2 = a3 b1 − a1 b3 = ⎣ a3 0 −a1 ⎦ b2 = a[I] b[I] . (1.26) ⎩ ⎭ ⎩ ⎭ ⎩ ⎭ c3 a1 b 2 − a2 b 1 −a2 a1 0 b3
It is now clear that a is a second-order, skew-symmetric tensor whose components in basis I are ⎡ ⎤ 0 −a3 a2 a[I] = ⎣ a3 0 −a1 ⎦ . (1.27) −a2 a1 0 The notation for the vector product, ab, is a mnemonic notion for the result expressed by eq. (1.26): the vector product is obtained by multiplying the components of the skew-symmetric tensor a resolved in basis I by the column array of the components of vector b resolved in the same basis. 1.1.8 The tensor product The tensor product T of two vectors is a second-order tensor deſned as T = a bT .
(1.28)
The fundamental property of tensor T is T c = (bT c)a,
(1.29)
for any arbitrary vector c. By letting a = ¯ıi and b = ¯ıj , eq. (1.29) then implies ⎤ ⎤ ⎤ ⎡ ⎡ ⎡ 100 000 000 [I] [I]T [I] [I]T [I] [I]T ¯ı1 ¯ı1 = ⎣0 0 0⎦ , ¯ı2 ¯ı2 = ⎣0 1 0⎦ , ¯ı3 ¯ı3 = ⎣0 0 0⎦ , 000 000 001 ⎤ ⎤ ⎤ ⎡ ⎡ ⎡ 010 001 000 [I] [I]T [I] [I]T [I] [I]T ¯ı1 ¯ı2 = ⎣0 0 0⎦ , ¯ı1 ¯ı3 = ⎣0 0 0⎦ , ¯ı2 ¯ı3 = ⎣0 0 1⎦ , 000 000 000 ⎤ ⎤ ⎤ ⎡ ⎡ ⎡ 000 000 000 [I] [I]T [I] [I]T [I] [I]T ¯ı2 ¯ı1 = ⎣1 0 0⎦ , ¯ı3 ¯ı1 = ⎣0 0 0⎦ , ¯ı3 ¯ı2 = ⎣0 0 0⎦ . 000 100 010 [I] [I]T
Letting T [I] represent the components of tensor T [I] = ¯ıi ¯ıj kl can be summarized as T [I] = δki δj . k
, these relationships (1.30)
If ai and bi , i = 1, 2, 3, are the components of vectors a and b, respectively, in a common basis I, the following relationship holds ⎡ ⎤ a1 b 1 a1 b 2 a1 b 3 T [I] = ⎣a2 b1 a2 b2 a2 b3 ⎦ = a[I] b[I]T , (1.31) a3 b 1 a3 b 2 a3 b 3
10
1 Vectors and tensors
where eq. (1.30) was used. The notation for the tensor product, a bT , is a mnemonic notion for the result expressed by eq. (1.31): the tensor product is obtained by multiplying the column array of components of vector a in basis I by the row array of the components of vector b in the same basis. 1.1.9 The mixed product Let a, b, and c be three arbitrary vectors. The scalar cT ab is called the mixed product of these vectors. The geometric interpretation of this operation is illustrated in ſg. 1.9. The vector prodh uct ab = A¯ n is deſned by eq. (1.20), where A represents the area spanned by vectors a and b b and the orientation of unit vector n ¯ is selected ~ according to the right-hand rule. The mixed A = ||a b|| product then becomes cT ab = cA cos(¯ n, c), a where c cos(¯ n, c) = h is the projection of ¯ . It then follows Fig. 1.9. The mixed product of vec- vector c along the unit vector n that cT ab = Ah, where A is the area of the partors a, b, and c. allelogram spanned by vectors a and b and h the height of the parallelepiped deſned by vectors a, b, and c. Clearly, the mixed product represents the volume of this parallelepiped. The above interpretation assumes that vectors a, b, and c are ordered according to the right-hand rule. If this is not the case, it is easily veriſed that the mixed product yields the negative of the volume spanned by the three vectors. If ai , bi , and ci are the components of vectors a, b, and c, respectively, resolved in basis I, the mixed product can be written as ⎡ ⎤ a1 a2 a3 cT ab = det ⎣ b1 b2 b3 ⎦ , (1.32) c1 c2 c3 n c
ab = bT ca = aT bc, where eqs. (1.19) and (1.26) were used. It is now clear that cT since these operations correspond to permutations of lines of the determinant. Of course, due to the anti-commutativity property of the vector product, eq. (1.21), cT ba = bT ac = aT cb. 1.1.10 Tensor identities Important tensor identities will be used throughout this book. If a, b, and c are three arbitrary vectors, the following identities can be readily veriſed by painstakingly expanding the various products,
1.1 Free vectors
( a b) = a b − b a, T a b = b a − (aT b)I, a b − b a = ba − ab , T
T
a b) − (aT b)I, a b − a bT = ( ab c = (aT c)b − (b c)a, a b c = (aT c)b − (aT b)c, T
T
T
a b c = (b c)a, aT b c = bT c a = cT a b.
11
(1.33a) (1.33b) (1.33c) (1.33d) (1.33e) (1.33f) (1.33g) (1.33h)
If n ¯ is a unit vector and a an arbitrary vector, the following identities also hold (aT n ¯ )¯ n=a+n n a, n n n = − n, ˙ n n n = 0,
(1.34a) (1.34b) (1.34c)
where notation (·)· indicates a derivative with respect to time. 1.1.11 Solution of the vector product equation Let a, b, and x be three vectors such that ax = b. a If a and b are known vectors, is it possible to solve x for x? Equation ax = b can be viewed as a set of x¢ three linear equations for the components of x. Unfortunately, the matrix of the system of equations is x¢¢ singular because det( a) = 0; in fact, the null space b P of a is a since aa = 0. Hence, a solution only exists if the right-hand side of the system of equations is orthogonal to the the null space of a, i.e., if aT b = 0. Fig. 1.10. The solution of the Figure 1.10 gives a graphical illustration of the vector product equation. problem. The cross product equation, ax = b, implies that b is orthogonal to both a and x. Let plane P be normal to vector b. Because b is orthogonal to a, plane P contains vector a. Any vector in plane P will be normal to vector b. The solution of the problem must be in plane P and hence, can be written as x = μa + α ab, where μ and α are arbitrary scalars. Introducing this solution into the equation yields ax = a(μa + α ab) = b, or α a ab = b. With the help of identity (1.33b), this becomes α(a aT −a2 I)b = b. Because aT b = 0, the equation then reduces to −αa2 b = b, and ſnally, α = −1/a2 . The solution of the vector product equation is x = μa −
ab , a2
(1.35)
12
1 Vectors and tensors
where coefſcient μ remains undetermined. Clearly, the vector product equation possesses an inſnite number of solutions, because μ is arbitrary. Graphically, this corresponds to the various solution labeled as x or x in ſg. 1.10. To obtain a unique solution, an additional constraint must be enforced. For instance, the solution with the smallest norm is found by imposing the solution to be normal to vector a, leading to μ = 0 and ſnally x = − ab/a2 . 1.1.12 Problems Problem 1.1. Lagrange’s identity
Prove Lagrange’s identity: ab2 + (aT b)2 = a2 b2 .
Problem 1.2. Geometric interpretation of identity Prove identity (1.34a) and provide a geometric interpretation.
Problem 1.3. Geometric interpretation of identity
Prove the following identity cT ab = aT ca, based on (1) geometric arguments, and bc = bT (2) algebraic developments.
Problem 1.4. Jacobi’s identity
With the help of the identities of section 1.1.10, prove Jacobi’s identity ab c+ b c a+ ca b = 0.
Problem 1.5. Prove identity
Prove the following identity a b ba = b a aT b.
Problem 1.6. Prove identity If n ¯ is a unit vector and m an arbitrary vector such that λ = n ¯ T m, prove the following identity n n m +n m n +m n n = −m − 2λ n.
(1.36)
Problem 1.7. Criterion for linear independence Show that three vectors a, b, and c are linearly independent if and only if their mixed product does not vanish.
Problem 1.8. Criterion for parallelism Find the vector equation that expresses the fact that vectors a and b are parallel.
Problem 1.9. Criterion for orthogonality Find the vector equation that expresses the fact that vectors a and b are orthogonal.
Problem 1.10. Criterion for coplanarity Find the vector equation that expresses the fact that vectors a, b, and c are coplanar.
Problem 1.11. The projection tensor Consider a plane, P, deſned by its unit normal, n ¯ , and a free vector a, as depicted in ſg. 1.11. Vector a is decomposed as a = a + a , where a is in plane P and a normal to P. (1) Find the expression for the projection tensor, P , such that a = P a. (2) Find tensor Q such that a = Q a.
1.2 Bound vectors
A
n
a
n
13
a
a¢¢ P
a¢ P
a¢ A¢
Fig. 1.11. Projection of vector a onto plane n ¯.
Fig. 1.12. Reƀection of vector a onto plane n ¯.
Problem 1.12. The reƀection tensor Figure 1.12 depicts plane P deſned by its unit normal n ¯ and a free vector, a. Find the expression for the reƀection tensor R such that a = R a, where a is the reƀection of a with respect the plane P. Note that point A is the mirror image of point A with respect the plane P.
Problem 1.13. The covariant and contravariant components of a vector Consider three non-coplanar vectors a1 , a2 , a3 such that V = aT1 a2 a3 = 0. Deſne the a2 a3 , V a2 = a3 a1 , and V a3 = a1 a2 , called the reciprocal following three vectors, V a1 = vectors. Prove that (1) aTi aj = δij , (2) a1T a2 a3 = 1/V , and (3) V a2 a3 = a1 , V a3 a1 = a2 , 1 2 and V a a = a3 . Two arbitrary vectors, u and v, are now resolved in the following manner u = u1 a1 + u2 a2 + u3 a3 = u1 a1 + u2 a2 + u3 a3 ; v = v 1 a1 + v 2 a2 + v 3 a3 = v1 a1 + v2 a2 + v3 a3 . The components u and v i are called the contravariant components of vectors u and v, respectively, whereas the components ui and vi are called the covariant components of vectors u and v, respectively. Prove that (4) uT v = u1 v 1 + u2 v 2 + u3 v 3 = u1 v1 + u2 v2 + u3 v3 . v = (5) u v/V = (u2 v 3 − u3 v 2 )a1 + (u3 v 1 − u1 v 3 )a2 + (u1 v 2 − u2 v 3 )a3 . (6) V u (u2 v3 − u3 v2 )a1 + (u3 v1 − u1 v3 )a2 + (u1 v2 − u2 v1 )a3 . i
1.2 Bound vectors In section 1.1, free vectors were introduced as the ensemble of all segments equivalent to a given segment. In many practical applications, vectors are associated with a speciſc point in space; in that case they are called bound vectors. For instance, the description of a force applied to a rigid body requires knowledge of the force vector, f , (magnitude and orientation of the applied force), and the point of application of the force, xA . Figure 1.13 depicts a force vector, f , applied to a rigid body at point A; the force vector is a bound vector. On the other hand, a moment, m, applied to a rigid body is not attached to a speciſc point of the body; it is a free vector. Similarly, the angular velocity vector, Ω, is a property of the rigid body. It is not associated with a speciſc point of the body, it is a free vector. The velocity vector, v, describes the velocity at a speciſc point of the body; it is a bound vector.
14
1 Vectors and tensors
e03 A
i3 I
xA O
uP
e3
m
f
i2
Rigid body
i1 Fig. 1.13. A bound vector f , a free vector m, and the position vector xA .
P E0 e01 xP Initial configuration i1
e2 i3 e02 O I
B P
e1
XP xA A
i2 Present configuration
Fig. 1.14. A reference frame deſning the conſguration of a rigid body.
1.2.1 The position vector The position vector, xA , speciſes the position of point A in three-dimensional space with respect to a reference point O, as depicted in ſg. 1.13. The components of vector [I] xA resolved in basis I and denoted xA , are the coordinates of point A in Cartesian basis I. 1.2.2 Reference frames Orthonormal or Cartesian bases were introduced in section 1.1.6 as a set of three mutually orthogonal unit vectors, I = (¯ı1 , ¯ı2 , ¯ı3 ). The origin of this orthonormal basis, however, is not deſned because it consists of three free vectors. Let point O be the common origin of the three unit vectors of the basis. It is now possible to deſne a reference frame, denoted F = [O, I], consisting of an orthonormal basis, I, with its origin at point O, see ſg. 1.14. In dynamic problems, an inertial reference frame is always deſned; the origin and orientation of such frame are invariant in time. Reference frames are conveniently used to deſne position vectors. The position of an arbitrary point A is given by its position vector, xA , with respect to the origin of reference frame F, and the [I] components of this vector, xA , are resolved in basis I. Reference frames are closely related to the conſguration of rigid bodies: let point P be a material point of the rigid body, and orthonormal basis E0 = (¯ e01 , e¯02 , e¯03 ) a body attached basis deſning its orientation. Clearly, the initial conſguration of the rigid body is then completely deſned by reference frame F0 = [P, E0 ], see ſg. 1.14. If the rigid body tumbles in space, it will move to its present conſguration; the position vector of its reference point P is now X P , and its orientation is given by a new basis E = (¯ e1 , e¯2 , e¯3 ). Reference frame F = [P, E] now deſnes the present conſguration of the rigid body. The displacement vector, uP , of point P is such that X P = xP + uP . Clearly, a one to one correspondence exists between a reference frame and the conſguration of a rigid body.
1.3 Geometric entities
15
1.3 Geometric entities Geometric problems can be conveniently formulated using a vector formalism. Lines, planes, circles, and spheres are brieƀy described in the following sections. 1.3.1 Lines Figure 1.15 depicts a straight line is deſned by the position vector, xP , of an arbitrary ¯ along the direction of the line. A straight point P on the line, and the unit vector, , ¯ line, L, is denoted L = (xP , ). An arbitrary point Q on the line has a position vector, xQ , given by ¯ (1.37) xQ = xP + λ, where λ is an arbitrary scalar. An alternative deſnition of the line is in terms of its Pl¨ucker coordinates [1] deſned as follows k x P ¯ = ¯ . (1.38) Q= ¯ The ſrst part of the Pl¨ucker coordinates, k, deſnes a point of the line, and the second ¯ its orientation.1 Indeed, it is readily shown that x = k. The two vectors part, , P forming the Pl¨ucker coordinates must be orthogonal, i.e., k T ¯ = 0.
(1.39)
Clearly, αQ, where α is an arbitrary scalar such that α = 0, deſnes the same line, Q. n
L
Q Q
l xP
I
O
P
i2
i1 Fig. 1.15. The deſnition of a straight line.
1
d
i3
i3
I
xQ xP O
i1
P
i2
Fig. 1.16. The deſnition of a plane.
Some authors deſne the Pl¨ucker coordinates as QT = ¯T , kT .
16
1 Vectors and tensors
1.3.2 Planes Similarly, a plane is deſned by the position vector, xP , of an arbitrary point P of the plane, and the unit vector, n ¯ , normal to the plane, see ſg. 1.16. Plane P is denoted P = (xP , n ¯ ). An arbitrary point Q of the plane has a position vector, xQ , satisfying the following relationship n ¯ T (xQ − xP ) = 0. (1.40) This equation expresses the condition that vector xQ − xP must lie in plane P, and is therefore normal to n ¯ . The distance between point O and the plane is d = n ¯ T xP , and hence, the equation of a plane becomes n ¯ T xQ = d.
(1.41)
1.3.3 Circles ¯ , normal A circle is deſned by the position vector, xC , of its center, the unit vector, n to the plane of the circle, and its radius, ρ. Circle C is denoted C = (xC , n ¯ , ρ). An arbitrary point Q of the circle has a position vector, xQ , satisfying the following relationships n ¯ T (xQ − xC ) = 0, xQ − xC = ρ, (1.42) ¯ ) and the where the ſrst equation expresses the fact that point Q is in plane (xC , n second that is it at a distance ρ from the center of the circle. 1.3.4 Spheres A sphere is deſned by the position vector, xC , of its center, and its radius, ρ. Sphere S is denoted S = (xC , ρ). An arbitrary point Q of the sphere has a position vector, xQ , satisfying the following relationship xQ − xC = ρ.
(1.43)
Example 1.1. Intersection between two lines Find the point at the intersection of two lines, L1 = (x1 , ¯1 ) and L2 = (x2 , ¯2 ). What is the condition for this intersection to exist? Figure 1.17 shows the two lines and their intersection at point I, assuming, of course, that this intersection exists. Arbitrary points on lines L1 and L2 , denoted y 1 and y2 , respectively, are given by eq. (1.37) as y 1 = x1 + λ1 ¯1 and y 2 = x2 + λ2 ¯2 , respectively. If an intersection point exists, it must be on both lines, which implies the existence of scalars λ1 and λ2 such that xI = x1 + λ1 ¯1 = x2 + λ2 ¯2 , (1.44) where xI is the position vector of the intersection point. Let x21 = x2 − x1 be the position vector of the reference point of line L2 with respect to that of line L1 . Multiplying eq. (1.44) by xT21 1 and xT21 2 yields the two following conditions that must be satisſed for the intersection to exist,
1.3 Geometric entities
L2
l2
x2 i3 i1
i2 I O x1
17
I
x21 L1
l1
Fig. 1.17. Intersection between two lines.
λ1 (¯T1 x 21 ¯2 ) = 0,
λ2 (¯T1 x 21 ¯2 ) = 0.
The ſrst solution of these equations is λ1 = λ2 = 0, which implies xI = x1 = x2 : the reference points of the two lines are identical and this common point is the intersection of the two lines. The second solution is ¯T1 x 21 ¯2 = 0, the vanishing of the mixed product of vectors ¯1 , x21 , and ¯2 . Because the mixed product represent the volume spanned by these three vectors, the vanishing of the mixed product implies the coplanarity of the three vectors. As illustrated by ſg. 1.17, the existence of an intersection point of the two lines does indeed require the coplanarity of vectors ¯1 , x21 , and ¯2 . To determine the location of the intersection point, scalars λ1 and λ2 must be de1 ¯2 ) termined. Multiplying eq. (1.44) by xT1 1 and xT2 2 yields λ2 = (xT1 1 x2 )/(¯T1 x 2 ¯1 ), respectively. Point I is now found as and λ1 = (xT2 2 x1 )/(¯T2 x xT 2 x xT 1 x xI = x1 + ¯T2 ¯1 ¯1 = x2 + ¯T1 ¯2 ¯2 . 2 x 1 x 2 1 1 2 21 ¯2 , must vanish for the intersection to exist, it Because the mixed product, ¯T1 x T T ¯ ¯ ¯ ¯ follows that 1 x 1 2 = 1 x 2 2 . If ¯T1 x 1 ¯2 = ¯T1 x 2 ¯2 = 0, the denominators in the above expressions vanish and the intersection does not exist because the two lines are parallel, 1 ¯2 = 0. In summary, an intersection exists if ¯T1 x 21 ¯2 = 0, implying the coplanarity of vectors ¯1 , x21 , and ¯2 , and 1 ¯2 = 0, implying that the two lines are not parallel. A special case occurs ¯1 = ¯2 = ¯ and x 12 = 0: the two lines are coincident and all points on the line are intersection points. Example 1.2. Intersection between two lines Find the point at the intersection of two lines deſned by their Pl¨ucker coordinates, L1 = (k 1 , ¯1 ) and L2 = (k 2 , ¯2 ). What is the condition for this intersection to exist? Figure 1.17 shows the two lines and their intersection at point I, assuming, of course, that this intersection exists. The intersection point must be a point of both lines, and hence, by deſnition of the Pl¨ucker coordinates, k1 = x I ¯1 and k 2 = x I ¯2 . Multiplying the ſrst equation by ¯T2 and the second by ¯T1 , yields ¯T2 k1 = ¯T2 x I ¯1 and ¯T1 k 2 = ¯T1 x I ¯2 . Subtracting these two expressions then leads to
18
1 Vectors and tensors
k T1 ¯2 + k T2 ¯1 = 0. This is the condition that must be satisſed if an intersection exists. It is left to the reader to verify that the above condition is equivalent to that developed in example 1.1. Next, the location of the intersection point must be evaluated. By deſnition of the Pl¨ucker coordinates, k 1 = x I ¯1 , which can be recast as 1 xI = −k1 . This implies that the position vector of the intersection point is the solution of a vector product equation, see section 1.1.11. Because the solvability condition is satisſed, ¯T1 k1 = 0, the solution can be written as xI = α¯1 + 1 k1 , where α is an arbitrary scalar. Multiplying this equation by k T2 leads to kT2 xI = (k T2 ¯1 )α + k T2 1 k 1 = 0, because k 2 must be orthogonal to xI . Coefſcient α now becomes α = −(k T2 1 k 1 )/(k T2 ¯1 ). The intersection point is now ¯1 k T2 1 k 1 ¯1 k T2 k2 1 1 k 1 xI = − 1 k 1 = − ¯T + 1 k 1 = I − T , T¯ 1 k 2 k 2 1 k 2 ¯1 where identity (1.33b) was used to evaluate the bracketed term. Using this same identity once more leads to xI =
k2 (I − ¯1 ¯T1 )k 1 k2 k k1 k = ¯T 1 = ¯T 2 , T ¯ 1 k 2 1 k 2 2 k 1
where the second equality follows from the orthogonality of the Pl¨ucker coordinates, ¯T1 k 1 = 0. Of course, the existence of the intersection point requires ¯T1 k 2 = 0 or equivalently, ¯T2 k 1 = 0, which imply that the two lines are not parallel. It is left to the reader to verify that the solution found here is identical to that found in example 1.1. Comparing the solution obtained here with that found in example 1.1, it is clear that the use of the Pl¨ucker coordinates provides an elegant and compact solution of the problem. Example 1.3. Intersection between two planes Find the equation of the line at the intersection of two planes, P1 = (x1 , n ¯ 1 ) and P2 = (x2 , n ¯ 2 ). Does this line always exist? Under what conditions do the two planes ¯ coincide? Figure 1.18 shows the two planes and their intersection line, L = (xP , ), assuming, of course, that this intersection exists. Line L must be entirely in both planes P1 and P2 , and hence, must be normal to both n ¯ 1 and n ¯ 2 , which implies ¯2 n 1 n . ¯ = n1 n ¯2
(1.45)
To fully deſne the intersection line, it is also necessary to ſnd one of its point, say point P, as illustrated in ſg. 1.18. This point must belong to both planes, i.e., n ¯ T1 xP = p1 and n ¯ T2 xP = p2 , where p1 and p2 are the distances from point O to planes P1 and P2 , respectively. These two scalar equations are not sufſcient to
1.3 Geometric entities
n1
l
P1
n2 x1
xP
i3 I i1 O
19
P x2 P2
i2
Fig. 1.18. Intersection between two planes.
determine the position of point P unequivocally. A third condition can be added, for instance, imposing that point P be at the shortest distance from point O, implying ¯T xP = 0. ¯ where The position vector of point P will be written as xP = α¯ n1 + β n ¯ 2 + γ , T ¯ α, β, and γ are three unknown scalars. Multiplying this equation by and using the shortest distance condition leads to γ = 0 and hence, xP = α¯ n1 + β n ¯2 . Imposing the remaining two conditions leads to a set of two algebraic equations for coefſcients α and β, which are found as α = p1 − (¯ n1 n nT1 n ¯ 2 )p2 / ¯ 2 2 and T 2 n1 n ¯ 2 )p1 / ¯ 2 . The position vector of point P now becomes n1 n β = p2 − (¯ xP =
nT1 n ¯ 2 )p2 nT1 n ¯ 2 )p1 p1 − (¯ p2 − (¯ n ¯ + n ¯2. 1 2 n1 n ¯2 n1 n ¯ 2 2
¯ 2 = 0, and the intersection line does not exist. If the two planes are parallel, n 1 n Of course, if the two planes are coincident, i.e., n 1 n ¯ 2 = 0 and p1 = p2 , all lines in the plane are intersection lines. Example 1.4. Intersection between two planes Find the Pl¨ucker coordinates of the line at the intersection of two planes. Does this line always exist? Under what conditions do the two planes coincide? Figure 1.18 ¯ assuming, of course, that shows the two planes and their intersection, L = (k, ), this intersection exists. As discussed in example 1.3, the orientation of the intersection line is given by ¯ where x is any point eq. (1.45). By deſnition of the Pl¨ucker coordinates, k = x P , P ¯ on the line. It follows that k must be normal to , and hence, contained in the plane ¯ deſned by unit vectors n ¯ 1 and n ¯ 2 , i.e., k = α1 n ¯ 1 + α2 n ¯2 = x P . ¯ 1 + α2 n ¯ 2 ). This implies that the This equation can be recast as xP = −(α1 n position vector of the intersection point is the solution of a vector product equation, see section 1.1.11. Because the solvability condition is satisſed, ¯T (α1 n ¯ 1 +α2 n ¯2) = 1n ¯ 1 + α2 n ¯ 2 ), where μ is an arbitrary 0, the solution can be written as xP = μ¯+ (α scalar. Multiplying this equation by n ¯ T1 yields 1n n2 = −α2 ¯T n n ¯ T1 xP = p1 = n ¯ T1 (α ¯ 1 + α2 n ¯ 2 ) = α2 n ¯ T1 ¯ 1 n ¯ 2 = −α2 n1 n ¯ 2 .
20
1 Vectors and tensors
It then follows that α2 = −p1 / n1 n ¯ 2 . Multiplying the equation by n ¯ T2 and proceeding similarly yields α1 = p2 / n1 n ¯ 2 , and ſnally k = (p2 n ¯ 1 − p1 n ¯ 2 )/ n1 n ¯ 2 . If the two planes are parallel, n 1 n ¯ 2 = 0, and the intersection line does not exist. Of course, if the two planes are coincident, i.e., n 1 n ¯ 2 = 0 and p1 = p2 , all lines in the plane are intersection lines. Because the Pl¨ucker coordinates, Q, of a line are deſned within a constant, the Pl¨ucker coordinates of the intersection line can be written as ¯ 1 − p1 n ¯2 p2 n Q= . n 1 n ¯2 Comparing the solution obtained here with that found in example 1.3, it is clear that the use of the Pl¨ucker coordinates provides an elegant and compact solution of the problem. 1.3.5 Problems Problem 1.14. Position vector of a point on a line
¯ Find the position vector of an arbitrary A line is deſned by its Pl¨ucker coordinates, L = (k, ). point on line L.
Problem 1.15. Line deſned by two points Find the equation of a line passing through two points P1 and P2 . Does a solution always exist? Under what conditions do multiple solutions arise?
Problem 1.16. Distance from a point to a line
¯ Find the distance between an arbitrary point Q (of position vector xQ ) and a line L = (xP , ). Find the location of point R on line L that is at the shortest distance of point Q.
Problem 1.17. Distance from a point to a line
¯ Find the distance between an arbitrary point Q (of position vector xQ ) and a line L = (k, ). Find the location of point R on line L that is at the shortest distance of point Q.
Problem 1.18. Distance from a point to a plane Find the distance between an arbitrary point Q (of position vector xQ ) and a plane P = ¯ ). Find the location of point R on plane P that is at the shortest distance of point Q. (xP , n
Problem 1.19. Intersection of a line and a plane
¯ and a plane P = (x , n Find the point at the intersection of a line L = (xQ , ) P ¯ ). Does this point always exist? Under what conditions does the line lie in the plane?
Problem 1.20. Intersection of a line and a pane
¯ expressed in terms of Pl¨ucker coordiFind the point at the intersection of a line, L = (k, ), nates, and a plane, P = (xP , n ¯ ). Does this point always exist? Under what conditions does the line lie in the plane?
Problem 1.21. Distance between two lines
Find the distance between two lines L1 = (x1 , ¯1 ) and L2 = (x2 , ¯2 ). Find the locations of points R1 and R2 , on lines L1 and L2 , respectively that are at the shortest distance from each other.
1.3 Geometric entities
21
Problem 1.22. Distance between two lines
Find the distance between two lines L1 = (k1 , ¯1 ) and L2 = (k2 , ¯2 ), expressed in terms of Pl¨ucker coordinates. Find the locations of points R1 and R2 , on lines L1 and L2 , respectively that are at the shortest distance from each other.
Problem 1.23. Plane deſned by three points Find the equation of a plane passing through three points P1 , P2 , and P3 with position vectors x1 , x2 , and x3 , respectively. Does a solution always exist? Under what conditions do multiple solutions arise?
Problem 1.24. Plane deſned by the intersection of two planes Let point P with position vector xP be on the intersection of two given planes P1 = (xP , n ¯1) ¯ 2 ). Find the equation of plane P3 = (xP , n ¯ 3 ) passing through a given point and P2 = (xP , n Q with position vectors xQ and the intersection of planes P1 and P2 .
Problem 1.25. Circle deſned by three points Find circle C = (xC , n ¯ , ρ) deſned by three points P1 , P2 , and P3 with position vectors x1 , x2 , and x3 , respectively. Does a solution always exist?
Problem 1.26. Tangent to a circle Find the position vector xP of point P such that the tangent to circle C = (xC , n ¯ , ρ) at P passes through a given point Q with position vector xQ . Find the conditions for a solution to exist. Is the solution unique?
Problem 1.27. Distance between two circles Find the shortest distance d between two arbitrary circles C1 = (xC1 , n ¯ 1 , ρ1 ) and C2 = (xC2 , n ¯ 2 , ρ2 ). Hint: let xQ1 and xQ2 be the position vectors of points Q1 and Q2 belonging to circles C1 and C2 , respectively. If Q1 and Q2 are at the shortest distance, vector xQ2 − xQ1 is then normal to the tangent to C1 at point Q1 and to the tangent to C2 at point Q2 .
Problem 1.28. Intersection of a line and a sphere
¯ and sphere S = (x , ρ). Find the intersections between line L = (x, ) C
Problem 1.29. Distance from a disk to a plane
¯ ρ), as depicted in ſg. 1.19. Find the Consider plane P = (xP , n ¯ ) and circle C = (xC , k, shortest algebraic distance d between the disk and the plane. (A positive distance is deſned when the disk is in the direction of n ¯ ).
n P
k
aC r
C
v2
b
R d Q
e2
v1 P
Fig. 1.19. The distance between a disk and a plane.
b
e1
Fig. 1.20. Constructing an orthonormal basis from two vectors.
22
1 Vectors and tensors
Problem 1.30. Orthonormal basis constructed from two vectors Figure 1.20 shows two arbitrary vectors, v 1 and v 2 . Construct a set of three mutually orthogonal unit vectors, e¯1 , e¯2 , e¯3 , such that e¯1 , e¯2 , v 1 , and v 2 are four coplanar vectors, and angle β between e¯1 and v 1 is identical to that between e¯2 and v 2 . Find an expression for angle β.
1.4 Second-order tensors Second-order tensors were encountered in previous sections: the tensor product of two vectors, eq. (1.28), yields a second-order tensor, and the vector product of two vector is conveniently expressed in terms of the second-order, skew-symmetric tensor deſned by eq. (1.27). In general, the components of a second-order tensor, A, are denoted aij , where the indices i = 1, 2, 3 and j = 1, 2, 3. A second-order tensor is said to be a symmetric tensor if aij = aji . For instance, it is readily veriſed that the tensor product of a vector by itself, T = a aT , forms a symmetric tensor. A second-order tensor is said to be a skew-symmetric tensor if aij = −aji . This implies that the diagonal terms vanish, aii = 0, i = 1, 2, 3. Skew-symmetric tensors were encountered when dealing with the vector product, see eq. (1.27). The superscript (·)T is used to denote the transposition operation. If the components of A are aij , the components of AT are aji . 1.4.1 Basic operations The trace of a second-order tensor is a scalar deſned as tr(A) = a11 + a22 + a33 .
(1.46)
The determinant of a second-order tensor is also a scalar quantity deſned as det(A) = a11 a22 a33 + a12 a23 a31 + a13 a21 a32 − a31 a22 a13 − a12 a21 a33 − a11 a23 a32 .
(1.47)
An arbitrary tensor can always be decomposed into its symmetric part and skewsymmetric part A=
A − AT A + AT + = symm(A) + skew(A). 2 2
(1.48)
In this equation, symm(A) denoted the symmetric part of the tensor ⎤ ⎡ 2a11 a12 + a21 a13 + a31 A + AT 1⎣ = symm(A) = a12 + a21 2a22 a23 + a32 ⎦ , 2 2 a13 + a31 a23 + a32 2a33 and skew(A) its skew-symmetric part
(1.49)
1.4 Second-order tensors
skew(A) =
A − AT 2
⎤ 0 (a12 − a21 ) (a13 − a31 ) 1 = ⎣−(a12 − a21 ) 0 (a23 − a32 )⎦ . 2 −(a13 − a31 ) −(a23 − a32 ) 0
23
⎡
(1.50)
The axial vector, a, associated with a second-order tensor, A, is denoted a = axial(A). It is deſned as follows a = axial(A) ⇐⇒ a=
A − AT . 2
(1.51)
It is readily veriſed that
⎧ ⎫ ⎫ ⎧ ⎨a1 ⎬ 1 ⎨a32 − a23 ⎬ a13 − a31 . a = axial(A) = a2 = ⎩ ⎭ 2⎩ ⎭ a3 a21 − a12
(1.52)
A second-order tensor, T , is positive-deſnite if and only if uT T u > 0,
(1.53)
for any arbitrary vector u = 0. It is semi positive-deſnite if uT T u ≥ 0 for any vector u = 0. For instance, consider the tensor corresponding to the tensor product of a vector by itself, T = a aT , a = 0. This tensor is semi positive-deſnite because uT T u = (aT u)2 ≥ 0 for any choice of u = 0; the equality hold when a is normal to u. 1.4.2 Eigenvalue analysis More often than not, the complete analysis of a second-order tensor will require the evaluation of its eigenvalues and eigenvectors. The following relationship A u = λu,
(1.54)
is satisſed by eigenvector u corresponding to eigenvalue λ. This relationship can be recast as (A − λI)u = 0, where I is the identity tensor, i.e., Iij = δij . This means that the eigenvector is the solution of a homogeneous system of algebraic equations. In general, this solution is the trivial solution, u = 0. For a non-trivial solution to exist, the determinant of the set of linear equations must vanish, det(A − λI) = 0. This equation is called the characteristic equation satisſed by the eigenvalue of A; with the help of eq. (1.47), it expands to −λ3 + I1 λ2 − I2 λ + I3 = 0.
(1.55)
I1 , I2 , and I3 are the invariants of the tensor I1 = tr(A),
(1.56a)
I2 = a22 a33 + a33 a11 + a11 a22 − a23 a32 − a13 a31 − a12 a21 , I3 = det(A).
(1.56b) (1.56c)
24
1 Vectors and tensors
Equation (1.55) will yield three solutions λ1 , λ2 , and λ3 , called the eigenvalues of A. One eigenvalue is always real, the other two could be real, or a complex conjugate pair. To each eigenvalue corresponds an eigenvector, u; the eigenpairs are denoted (λ1 , u1 ), (λ2 , u2 ), and (λ3 , u3 ). Because the eigenvectors are the solution of a homogeneous, linear system, they are deſned within a multiplicative constant. If an eigenvalue is real, the corresponding eigenvector is also real. A complex conjugate pair of eigenvectors is associated with complex conjugate eigenvalues. Symmetric, positive-deſnite tensors In general, a real second-order tensor will have one real eigenvalue and the remaining two could be real, or form a complex conjugate pair. If the tensor is symmetric and positive-deſnite, however, all three eigenvalues must be real and positive. Indeed, assume λ is √ a complex eigenvalue and u = v + iw the corresponding eigenvector, where i = −1. The eigenproblem, eq. (1.54), now becomes A(v + iw) = λ(v + iw). Pre-multiplying by vector (v − iw)T leads to (v T A v + w T A w) + i(v T Aw − w T A v) = λ(v T v + wT w).
(1.57)
If tensor A is symmetric, v T A w = wT A v, and the term in the second parenthesis vanishes. It then follows that λ=
vT A v + wT A w . vT v + wT w
(1.58)
Because A, v, and w are real quantities, λ is also a real quantity. It follows that the eigenvalues of a real, symmetric tensor are all real. The original eigenproblem, A(v + iw) = λ(v + iw), splits into its real and imaginary parts, A v = λv and iA w = iλw, respectively. Clearly the two problems are identical and the imaginary part is redundant; nothing is lost by setting w = 0. The eigenvalue now becomes vT A v λ= T . (1.59) v v If A is a positive-deſnite tensor, the numerator is a positive number, see eq. (1.53). On the other hand, the denominator is always a positive number. This proves that the eigenvalues of a real, symmetric, positive-deſnite tensor are all real and positive. If the tensor is symmetric and semi positive-deſnite, its eigenvalues are null or positive. Similarity transformations Consider now a linear transformation of the form u = Q u ¯ , where Q is an orthogonal tensor, i.e., QT Q = I. This transformation is applied to the eigenproblem A u = λu to yield A Q u ¯ = λQ¯ u. Pre-multiplying by QT then leads to QT A Q u ¯ = λQT Q u ¯. Because Q is an orthogonal transformation, this becomes
1.4 Second-order tensors
where
25
¯ = λ¯ u, A¯ u
(1.60)
A¯ = QT A Q,
(1.61)
is a similarity transformation of the original tensor. The transformed eigenproblem has the same form as the original problem, and the relationships between the eigenvalues of the two problems is sought. The eigenvalues of the original and transformed problems are the solutions of the characteristic equations det(A − λI) = 0 and det(A¯ − λI) = 0, respectively. Because Q is an orthogonal tensor, the latter equation can be stated as det QT (A − λI)Q = 0. Since the determinant of a product equals the product of the determinant, this becomes det(QT ) det(A − λI) det(Q) = 0. The orthogonality of Q implies det(Q) = 1; indeed det(QT Q) = det2 (Q) = 1. Finally, the characteristic equation of the transformed problem becomes det(A − λI) = 0, the same as that of the original problem. Consequently, the eigenvalues of the two problems are identical; similarity transformations preserve the spectrum of eigenvalues and the corresponding eigenvectors are related as ui = Q u ¯i. Orthogonality of the eigenvectors Consider a symmetric tensor, A, and two of its eigenpairs, (λi , ui ) and (λj , uj ), satisfying relationships A ui = λi ui , and A uj = λj uj , respectively. Pre-multiplying the ſrst statement by uj and the second by uj leads to uTj A ui = λi uTj ui and uTi A uj = λj uTi uj , respectively. Subtracting these two equations results in (λi − λj )uTj ui = 0, where the symmetry of tensor A was invoked. If λi = λj , uTj ui = 0: the eigenvectors of a symmetric tensor associated with distinct eigenvalues are orthogonal to each other. The orthogonality of the eigenvectors also implies their orthogonality in the space of tensor A, uTj A ui = 0. If the symmetric tensor possesses three distinct eigenvalues, the corresponding eigenvectors form an orthogonal triad: P = [u1 , u2 , u3 ]. Because the eigenvectors are deſned within a multiplicative constant, it is possible to normalize this orthogonal triad and impose P T P = I. This does not completely remove the indeterminacy of the eigenvectors that could still be multiplied by ±1. It is customary to order the eigenvectors in such a way that they form a right-hand basis. With this normalization of the eigenvector, it follows that ⎡ ⎤ λ1 0 0 P T A P = P T λ1 u1 , λ2 u2 , λ3 u3 = ⎣ 0 λ2 0 ⎦ . (1.62) 0 0 λ3 The orthogonality of the eigenvectors is a very important property that has been proved, thus far, for distinct eigenvalues only. What happens if a tensor features repeated eigenvalues, a common occurrence? To be precise, let eigenvalue λ1 have a
26
1 Vectors and tensors
multiplicity of 2. First, an eigenvector, u1 , associated with this eigenvalue is evaluated; next, the following linear transformation is constructed Q = [u1 , qˆ], where Q is an orthogonal matrix; this implies uT1 qˆ = 0. Since u1 is an eigenvector, it is also true that qˆT u1 = qˆT A u1 = 0. A similarity transformation, see eq. (1.61), of the original problem is performed to ſnd T u1 A u1 uT1 A qˆ λ1 0 uT1 A¯ = T A u1 qˆ = T = . (1.63) qˆ 0 qˆT A qˆ qˆ A u1 qˆT A qˆ By construction, the eigenvalues of A¯ are identical to those of A. Hence, λ1 is an
¯T1 = 1, 0, 0 ) eigenvalue of A¯ with a multiplicity of 2. The ſrst eigenpair is (λ1 , u
¯ T and the second is (λ1 , u ¯ 2 is the eigenvector of the reduced ¯ T2 = 0, u 2 ), where u T tensor qˆ A qˆ associated with its single eigenvalue λ1 ; note that u ¯ T1 u ¯2 = 0. Two eigenvectors of the original problem are now u1 = Q u ¯1 = u1 , by construction, and u2 = Q u ¯2 = qˆ u ¯2 . Finally, the orthogonality of the eigenvectors of the transformed ¯2 = problem implies that of their counterparts for the original problem: 0 = u ¯T1 u T T T u ¯1 Q Q u ¯ 2 = u1 u2 . In summary, in the presence of repeated eigenvalues, orthogonal eigenvectors can be always extracted. For eigenvalues of multiplicity 3, the above development could be recursively applied to extract three orthogonal eigenvectors of the symmetric tensor. The orthogonal tensor, P = [u1 , u2 , u3 ], always exists and presents the important property of diagonalizing tensor A P T A P = diag(λi )
(1.64)
Example 1.5. Eigen analysis of the projection tensor Figure 1.11 depicts an arbitrary vector a and a plane P deſned by its unit normal n ¯. The projection tensor P is such that a = P a, where a is the projection of vector a onto plane P. Find the three eigenvalues of P and the corresponding eigenvectors. nT a)¯ n + a . It then follows that a = Inspection of ſg. 1.11 reveals that a = (¯ T T a−n ¯n ¯ a = (I − n ¯n ¯ )a, and hence, the projection tensor is ⎡
⎤ 1 − n21 −n1 n2 −n1 n3 P =I −n ¯n ¯ T = ⎣−n1 n2 1 − n22 −n2 n3 ⎦ . −n1 n3 −n2 n3 1 − n23
(1.65)
The projection tensor is symmetric and semi positive-deſnite. Indeed, aT (I − n ¯n ¯ T )a = aT a − (¯ nT a)2 = a2 − a2 cos2 α, where α is the angle between vectors n ¯ and a. It follows that aT P a = a2 sin2 α ≥ 0 for any arbitrary vector ¯ . It a = 0. Note that aT P a = 0 when α = 0, i.e., when vector a is parallel to n follows that the eigenvalues of P must be real and greater or equal to zero. One eigenvector of P can be found by inspection: P n ¯ = (I − n ¯n ¯ T )¯ n=n ¯ −n ¯= 0. This implies that vector n ¯ is an eigenvector of the projection tensor corresponding
1.5 Tensor calculus
27
to an eigenvalue λ = 0. The invariants of P are readily found as I1 = 2, I2 = 1 and I3 = 0. The eigenvalues then are the solutions of the characteristic equation −λ3 + 2λ2 − λ = 0 or λ(λ − 1)2 = 0. The eigenvalues are λ1 = 0, λ2 = +1 and λ3 = +1; note the multiplicity of two of the unit eigenvalue. As discussed above, the eigenvector corresponding to the null eigenvalue λ1 = 0 is the unit vector u1 = n ¯ . The eigenvectors corresponding to the double unit eigenvalue are the solution of the homogeneous linear problem (I − n ¯n ¯ T − I)x = 0 or T n ¯n ¯ x = 0. Clearly, any vector orthogonal to n ¯ will satisfy this equation. In other words, any vector u2 in plane P, i.e., such that n ¯ T u2 = 0, is an eigenvector. This implies P u2 = u2 , a result that is readily veriſed: (I − n ¯n ¯ T )u2 = u2 if n ¯ T u2 = 0. In geometric terms, this result is obvious: if vector u2 lie in plane P, the projection of that vector onto the plane is the vector itself. In view of the multiplicity of two of the unit eigenvalue, the eigenvector corresponding to λ3 is identical to that corresponding to λ2 : an arbitrary vector in plane P. It is, however, always possible to ſnd an orthogonal vector by selecting u3 = n u2 . In summary, the three eigenvectors of the projection tensor are u1 = n, the normal to plane P, u2 an arbitrary vector in P, and u3 = n u2 . Clearly, the eigenvectors capture the essence of the projection tensor: u ¯1 is the direction normal to the plane, and u2 and u3 are two orthogonal directions within this plane. The multiplicity of two of the unit eigenvector results in the fact that u2 can be chosen arbitrarily within plane P. Geometrically, this is related to the isotropy of the projection tensor: it behaves in the same manner in all direction within plane P. Finally, note that P = P P : once a vector has been projected onto the plane, any subsequent application of the projection tensor will leave the vector unchanged. 1.4.3 Problems Problem 1.31. Solve linear system Solve the following equation for x, x a = b − x.
Problem 1.32. Compute inverse
Show that (I + a)−1 = (I + a aT − a)/(1 + a2 )
Problem 1.33. Eigenvalues of the reƀection tensor Figure 1.12 depicts an arbitrary vector a and plane P, deſned by its unit normal n ¯ . (1) Find the expression for the reƀection tensor, R, such that a = R a, where vector a is the reƀection of vector a with respect the plane P. Note that point A is the mirror image of point A with respect the plane P. (2) Is the reƀection tensor positive-deſnite? (3) By inspection of R ſnd one of its eigenvectors and the corresponding eigenvalue. (4) Compute the three invariants of R. (5) Find the three eigenvalues of R and the corresponding eigenvectors.
1.5 Tensor calculus The derivative of a scalar function s(t) of a single variable, t, say time, is deſned in calculus textbooks (see, for instance, [2]), as
28
1 Vectors and tensors
ds s(t + Δt) − s(t) = s˙ = lim . Δt→0 dt Δt
(1.66)
˙ will be used throughout this book to represent a derivative with The notation (·) respect to time. The derivative of a vector u(t) is deſned in a similar manner as du u(t + Δt) − u(t) = u˙ = lim . Δt→0 dt Δt
(1.67)
The following results stem from elementary rules for derivatives. Derivative of a sum If u(t) and v(t) are two arbitrary vectors d (u + v) = u˙ + v. ˙ dt
(1.68)
Derivative of a product If s(t) is a scalar function of time, d (su) = su ˙ + su. ˙ dt
(1.69)
The derivative of the scalar product becomes d T (u v) = v T u˙ + uT v, ˙ dt
(1.70)
d ( uv) = u ˙ v + u v˙ = u v˙ − vu. ˙ dt
(1.71)
and that of the vector product
Chain rule for differentiation If vector u is a function of a scalar function s(t), the time derivative of this vector becomes du ds d du (u(s(t))) = = s˙ . (1.72) dt ds dt ds As an application of the above rules, consider the derivative of a unit vector, i.e., vector u such that uT u = 1. Equation (1.70) then implies d T (u u) = 2uT u˙ = 0. dt
(1.73)
In other words: the derivative of a unit vector is orthogonal to the vector itself. Next, consider two mutually orthogonal vector u and v, uT v = 0. A derivative of this expression then yields uT v˙ = −v T u. ˙ (1.74)
1.6 Notational conventions
29
1.6 Notational conventions Several notational conventions are used in the literature to denote vectors and tensors. Three widely used notations, the geometric notation, the matrix notation, and the index notation [3] are presented in table 1.1. The geometric notation is widely used in the literature, sometimes the boldface notation for vectors is replaced by a → speciſc “vector” superscript: − a . The index notation is frequently used, specially when higher-order tensors must be manipulated such as in the theory of elasticity. It is, however, less often used in kinematics and dynamics. The matrix notation is a convenient mnemonic notation and will be used exclusively in this book. Vectors are denoted with an underline, u, but unit vectors are simply denoted n ¯ , rather than the more cumbersome n ¯ . Tensors are denoted by a double underline, A, but skew-symmetric tensors are denoted a, rather than the more cumbersome a. Note that the tensor product, u v T , also yields a tensor. Table 1.1. The geometric, matrix, and index notations for vectors and tensors. Geometric Matrix Index notation notation notation vector a a ai tensor A A Aij scalar product u·v uT v ui vi vector product u × v u v ui vj ijk tensor product u ⊗ v u vT ui vj
In practical situations, such computer implementations, it will be necessary to work with the components of speciſc tensors resolved in various bases. In such cases, the following notation will be used ⎧ ⎫ ⎨ a1 ⎬ a[I] = a2 , ⎩ ⎭ a3 where a1 , a2 , and a3 are the components of vector a resolved in basis I. Because the notation a[I] is rather cumbersome, it will be used only when necessary; for instance, when the components of a vector in two different bases are used in the same context. When there is no possible confusion, the notation a[I] will be simpliſed as a, thereby blurring the distinction between a vector and its components in a given basis.
2 Coordinate systems
The practical description of dynamical systems involves a variety of coordinates systems. While the Cartesian coordinates discussed in section 2.1 are probably the most commonly used, many problems are more easily treated with special coordinate systems. The differential geometry of curves is studied in section 2.2 and leads to the concept of path coordinates, treated in section 2.3. Similarly, the differential geometry of surfaces is investigated in section 2.4 and leads to the concept of surface coordinates, treated in section 2.5. Finally, the differential geometry of three-dimensional maps is studied in section 2.6 and leads to orthogonal curvilinear coordinates developed in section 2.7.
2.1 Cartesian coordinates The simplest way to represent the location of a point in three-dimensional space is to make use of a reference frame, F = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )], consisting of an orthonormal basis I with its origin and point O, as described in section 1.2.2. The time-dependent position vector of point P is represented by its Cartesian coordinates, x1 (t), x2 (t), and x3 (t), resolved along unit vectors, ¯ı1 , ¯ı2 , and ¯ı3 , respectively, r(t) = x1 (t)¯ı1 + x2 (t)¯ı2 + x3 (t)¯ı3 ,
(2.1)
i3 P r I
x3
i2
O x1
i1 x2
Fig. 2.1. Cartesian coordinate system.
where t denotes time. Figure 2.1 depicts the situation: Cartesian coordinate x1 = ¯ıT1 r is the projection of the position vector of point P along unit vector ¯ı1 . Similarly, Cartesian coordinates x1 and x2 are the projections of the same position vector along unit vectors ¯ı2 and ¯ı3 , respectively. The components of the velocity vector are readily obtained by differentiating the expression for the position vector, eq. (2.1), to ſnd
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_2 © Springer Science+Business Media B.V. 2011
32
2 Coordinate systems
v(t) = x˙ 1 (t)¯ı1 + x˙ 2 (t)¯ı2 + x˙ 3 (t)¯ı3 = v1 (t)¯ı1 + v2 (t)¯ı2 + v3 (t)¯ı3 .
(2.2)
The Cartesian components of the velocity vector are simply the time derivatives of the corresponding Cartesian components of the position vector: v1 (t) = x˙ 1 (t), v2 (t) = x˙ 2 (t), and v3 (t) = x˙ 3 (t). Finally, the acceleration vector is obtained by taking a time derivative of the velocity vector to ſnd a(t) = x ¨1 (t)¯ı1 + x¨2 (t)¯ı2 + x ¨3 (t)¯ı3 = a1 (t)¯ı1 + a2 (t)¯ı2 + a3 (t)¯ı3 .
(2.3)
Here again, the Cartesian components of the acceleration vector are simply the derivatives of the corresponding Cartesian components of the velocity vector, or the second derivatives of the position components: a1 (t) = v˙ 1 (t) = x ¨1 (t), a2 (t) = v˙ 2 (t) = x ¨2 (t), and a3 (t) = v˙ 3 (t) = x¨3 (t). Cartesian coordinates are simple to manipulate and are the most commonly used coordinate system in computational applications that deal with problems presenting arbitrary topologies. On the other hand, several other coordinate systems, such as those discussed in the rest of this chapter, are often used because they can ease the solution process for speciſc problems. In such cases, a speciſc coordinate system is used solve a speciſc problem. For instance, polar coordinates are very efſcient to describe the behavior of a particle constrained to move along a circular path.
2.2 Differential geometry of a curve This section investigates the differential geometry of a curve, leading to the concept of path coordinates. Both intrinsic and arbitrary parameterizations will be considered. Frenet’s triad is deſned and its derivatives evaluated. 2.2.1 Intrinsic parameterization h s
t
i3 p0 I
b n
C
i2 i1
Fig. 2.2. Conſguration of a curve in space.
Figure 2.2 depicts a curve, denoted C, in threedimensional space. A curve is the locus of the points generated by a single parameter, such that the position vector, p0 , of such points can be written as p0 = p0 (s),
(2.4)
where s is the parameter that generates the curve. If parameter s is the curvilinear coordinate that measures length along the curve, it is said to deſne the intrinsic parameterization or natural parameterization of the curve.
Frenet’s triad A differential element of length, ds, along the curve is written as ds2 = dpT0 dp0 , and in follows that (dp0 /ds)T (dp0 /ds) = 1. The unit tangent vector to the curve is deſned as
2.2 Differential geometry of a curve
33
dp0
. (2.5) ds By construction, this is a unit vector because t¯T t¯ = 1. Taking a derivative of this relationship with respect to the curvilinear coordinate leads to t¯T dt¯/ds = 0. Vector dt¯/ds is normal to the tangent vector. The unit normal vector to the curve is deſned as dt¯ (2.6) n ¯=ρ , ds where ρ is the radius of curvature of the curve, such that t¯ =
1 dt¯ = . ρ ds
(2.7)
The quantity 1/ρ is the curvature of the curve, and ρ its radius of curvature. The two unit vector, t¯ and n ¯ , are said to form the osculating plane of the curve. An orthonormal triad is now constructed by deſning the binormal vector, ¯b, as the cross product of the tangent by the normal vectors, ¯b = tn ¯.
(2.8)
The unit tangent, normal, and binormal vectors form an orthonormal triad, called Frenet’s triad, depicted in ſg. 2.2. Derivatives of Frenet’s triad First, the derivative of the normal vector is resolved in Frenet’s triad as d¯ n/ds = αt¯+ β n ¯ + γ¯b, where α, β, and γ are unknown coefſcients. Pre-multiplying this relationship by n ¯ T yields β = n ¯ T d¯ n/ds = 0, because n ¯ is a unit vector. Pre-multiplying by t¯T yields α = t¯T d¯ n/ds = −¯ nT dt¯/ds = −1/ρ, where eq. (2.6) was used. Finally, pre-multiplying by ¯bT yields γ = ¯bT d¯ n/ds = 1/τ . Combining all these results yields d¯ n 1 1 = − t¯ + ¯b, (2.9) ds ρ τ where τ is the radius of twist of the curve, deſned as n 1 ¯T d¯ =b . τ ds
(2.10)
Next, the derivative of the binormal vector is resolved in Frenet’s triad as d¯b/ds = αt¯+ β n ¯ + γ¯b, where α, β, and γ are unknown coefſcients. Pre-multiplying this relationship by ¯bT yields γ = ¯bT d¯b/ds = 0, because ¯b is a unit vector. Premultiplying by t¯T yields α = t¯T d¯b/ds = −¯bT dt¯/ds = −¯bT n ¯ /ρ = 0. Finally, premultiplying by n ¯ T yields β = n ¯ T d¯b/ds = −¯bT d¯ n/ds = −1/τ , where eq. (2.10) was used. Combining all these results yields d¯b 1 =− n ¯. ds τ
(2.11)
34
2 Coordinate systems
It follows that the twist of the curve can also be written as d¯b 1 = . τ ds
(2.12)
If the binormal vector has a constant direction at all points along the curve, d¯b/ds = 0, and the curve entirely lies in the plane deſned by vectors t¯ and n ¯ , i.e., the osculating plane is the same at all points of the curve. The curve is then a planar curve, and eq. (2.12) implies that 1/τ = 0, i.e., the twist of the curve vanishes. The derivatives of Frenet’s triad can be expressed in a compact manner by combining eqs. (2.6), (2.9), and (2.11), ⎧ ⎫ ⎡ ⎤⎧ ⎫ 0 1/ρ 0 ⎨ t¯⎬ t¯ d ⎨ ⎬ ⎣ ¯ . n ¯ = −1/ρ 0 1/τ ⎦ n (2.13) ⎩¯ ⎭ ds ⎩ ¯ ⎭ b 0 −1/τ 0 b
2.2.2 Arbitrary parameterization The previous section has developed a representation of a curve based on its natural or intrinsic parameterization. In many instances, however, this parameterization is difſcult to obtain; instead, the curve is deſned in terms of a single parameter, η, that does not measure length along the curve, see ſg. 2.2. The position vector of a point on the curve is now p0 = p0 (η). The derivatives of the position vector with respect to parameter η will be denoted as p1 =
dp0 dη
,
p2 =
d2 p0 dη 2
,
p3 =
d3 p0 dη 3
,
p4 =
d4 p0 dη 4
.
A similar notation will be used for the tangent and normal vectors, di t¯ t¯i = i , dη
n ¯i =
di n ¯ . dη i
The differential element of length along the curve can be written as ds2 = (dp0 /dη)T (dp0 /dη) dη 2 . The ratio of the increment in length along the curve, ds, to the increment in parameter value, dη, is then ds T = p1 p1 = p1 . dη
(2.14)
Notation (·) will be used to indicate a derivative with respect to η, and hence, d/ds = (·) /p1 . The unit tangent vector to the curve is evaluated with the help of eq. (2.5) as p t¯ = 1 (2.15) p1 Next, the derivative of the tangent vector is found as
2.2 Differential geometry of a curve
t¯1 =
p1 p2 − p1 (pT1 p2 )/p1 p21
=
1 1 p2 − (t¯T p2 )t¯ . (1 − t¯t¯T )p2 = p1 p1
35
(2.16)
From eq. (2.7), the radius of curvature now becomes dt¯ 1 1 = = t¯1 . ρ ds p1 It follows that t¯1 = t1 = p1 /ρ. For a straight line, the tangent vector has a ſxed direction in space, t¯1 = 0. It follows that for a straight line 1/ρ = 0, i.e., its radius of curvature is inſnite. The curve’s curvature is found to be p21 p22 − (pT2 p1 )2 1 = (2.17) ρ p31 Higher-order derivatives of the tangent vector are found in a similar manner 1 p3 − (t¯T p3 + t¯T1 p2 )t¯ − 2(t¯T p2 )t¯1 , t¯2 = p1 and 1 p4 − (t¯T p4 + 2t¯T1 p3 + t¯T2 p2 )t¯ − 3(t¯T p3 + t¯T1 p2 )t¯1 − 3(t¯T p2 )t¯2 . t¯3 = p1 Next, the normal vector deſned in eq. (2.6) becomes n ¯=
1 t¯1 = t¯1 . t¯1 t1
(2.18)
For a straight line, t¯1 = 0, and hence, the normal vector is not deſned. In fact, any vector normal to a straight line is a normal vector. The derivative of the normal vector with respect to η then follows as 1 ¯ t2 − (¯ nT t¯2 )¯ n . t1
(2.19)
1 ¯ nT t¯3 + n ¯ T1 t¯2 )¯ n − 2(¯ nT t¯2 )¯ n1 . t3 − (¯ t1
(2.20)
n ¯1 = The second-order derivative is then n ¯2 =
The binormal vector is readily expressed as ρ 1 ¯ ¯b = t t1 = 3 p1 p2 . tn ¯= t1 p1
(2.21)
Because the normal vector is not deſned for a straight line, the binormal vector is not deſned in that case. In fact, any vector normal to a straight line is a binormal vector. The derivative of the binormal vector becomes
36
2 Coordinate systems
¯b1 = ( ρ ) p1 p + ρ p1 p . 2 3 p31 p31
(2.22)
Using eq. (2.10), the twist of the curve is found to be 1 T¯ 1 ρ =− n ¯ b1 = − 5 p21 pT2 − (pT1 p2 )pT1 ¯b1 . τ p1 p1 Finally, introducing eq. (2.22) leads to 1 ρ2 = − 6 pT2 p1 p3 . τ p1
(2.23)
The twist of the curve is closely related to the volume deſned by vectors p1 , p2 , and p3 . Note that a straight line has a vanishing twist, 1/τ = 0. Derivatives of the binormal vector are more easily expressed as ¯b1 = t1 n ¯ + tn ¯1 = t1 n ¯1 + tn ¯2 = n t¯2 + tn ¯ 2 , where eqs. (2.18) and (2.19) were used. tn ¯ 1 , and ¯b2 = Example 2.1. The helix Figure 2.3 depicts a helix, which is a three-dimensional curve deſned by the following position vector p0 (η) = a cos η ¯ı1 + a sin η ¯ı2 + kη ¯ı3 ,
(2.24)
where a and k are two parameters deſning the shape of the curve. The derivatives of the position vector are p1 = −a sin η ¯ı1 + a cos η ¯ı2 + k ¯ı3 , p2 = −a cos η ¯ı1 − a sin η ¯ı2 , and p3 = a sin η ¯ı1 − a cos η ¯ı2 . The curvature and twist of the helix are found with the help of eqs. (2.17) and (2.23), respectively, as 1 a , = 2 ρ a + k2
1 k . = 2 τ a + k2
Note that both curvature and twist are constant along the helix. The unit tangent vector is evaluated with the help of eq. (2.15) as 1 1 p1 = √ (−a sin η¯ı1 + a cos η¯ı2 + k¯ı3 ). t¯ = √ 2 2 2 a +k a + k2
(2.25)
The ratio between an increment √ in length along the curve and the increment in the parameter value is then ds = a2 + k 2 dη, see eq. (2.14). Next, the derivative of the tangent vector is computed with the help of eq. (2.16) as t¯1 = p2 /p1 and the normal vector then follows as n ¯ = − cos η ¯ı1 − sin η ¯ı2 . Finally, the binormal vector found from eq. (2.21) ¯b = √ 1 [k sin η ¯ı1 − k cos η ¯ı2 + a ¯ı3 ] . a2 + k 2 The derivatives of Frenet’s triad are found with the help of eq. (2.13) as a dt¯ = 2 n ¯, ds a + k2
a ¯ k ¯ d¯ n =− 2 b, t+ 2 2 ds a +k a + k2
k d¯b =− 2 n ¯. ds a + k2
2.2 Differential geometry of a curve
i2
i3
r = aq
C
r C
q
i1
i2
a i1
37
h
Fig. 2.3. Conſguration of a helix in threedimensional space.
Fig. 2.4. Conſguration of a planar linear spiral.
Example 2.2. The linear spiral Figure 2.4 depicts a linear spiral, which is a planar curve deſned by the following position vector p0 = aθ cos θ ¯ı1 + aθ sin θ ¯ı2 , (2.26) where a is a parameter deſning the shape of the curve. The derivatives of the position vector are p1 = a [(cos θ − θ sin θ)¯ı1 + (sin θ + θ cos θ)¯ı2 ], p2 = a [−(2 sin θ + θ cos θ)¯ı1 + (2 cos θ − θ sin θ)¯ı2 ]. It is readily veriſed that p21 = a2 (1 + θ2 ), p22 = a2 (4 + θ2 ) and pT1 p2 = a2 θ. The curvature of the linear spiral is found with the help of eq. (2.17) a 2 + θ2 = . ρ (1 + θ2 )3/2 Note that the curvature varies along the spiral. Of course, the twist is zero since the curve is planar. The unit tangent vector is evaluated with the help of eq. (2.15) as (cos θ − θ sin θ)¯ı1 + (sin θ + θ cos θ)¯ı2 √ t¯ = . 1 + θ2 Finally, the normal vector becomes − 2 sin θ + θ cos θ(2 + θ2 ) ¯ı1 + 2 cos θ − θ sin θ(2 + θ2 ) ¯ı2 . n ¯= 4 + θ2 (2 + θ2 )2 Example 2.3. Using polar coordinates to represent curves Cams play an important role in numerous mechanical systems: cam-follower pairs typically transform the rotary motion of the cam into a desirable motion of the follower. Figure 2.5 depicts a typical cam whose outer shape is deſned by a curve. It is convenient to deſne this curve using the polar coordinate system indicated on the ſgure: for each angle α, the distance from point O to point P is denoted r. The complete curve is then deſned by function r = r(α); angle α provides an arbitrary parameterization of the curve. If r(α) is a periodic function of angle α, the curve will be a closed curve, as expected for a cam.
2 Coordinate systems
g
e2
b
t ea
r a
1.6
er a
1.4
1.2
P e1
n
1/r
38
1
O
0.8
C
0.6
0.4 0
Fig. 2.5. Conſguration of a cam.
60
120
180
a
240
300
360
Fig. 2.6. Curvature distribution for the cam.
Vectors p0 , p1 , and p2 now become p0 = rCα e¯1 + rSα e¯2 ,
(2.27a)
p1 = (r Cα − rSα ) e¯1 + (r Sα + rCα ) e¯2 ,
(2.27b)
p2 = (r Cα − 2r Sα − rCα ) e¯1 + (r Sα + 2r Cα − rSα ) e¯2 ,
(2.27c)
where the notation (·) indicates a derivative with respect to α, Sα = sin α, and Cα = cos α. It then follows that p21 = r2 +r2 and p22 = (r −r)2 +4r2 . The various properties of the curve can then be evaluated; for instance, eqs. (2.15) and (2.17) yield the tangent vector and curvature along the curve, respectively. The curve depicted in ſg. 2.5 is deſned by the following equation, r(α) = 1.0 + 0.5 cos α + 0.15 cos 2α and ſg. 2.6 shows the curvature distribution as a function of angle α. Figure 2.5 shows the unit tangent vector, t¯, at point P of the curve and deſnes angles β = (¯ e1 , t¯) and γ = (¯ er , t¯); note that γ = β − α. The unit tangent vector can now be written as t¯ = Cβ e¯1 +Sβ e¯2 = p1 /p1 , where the second equality follows from eq. (2.15). Pre-multiplying this relationship by e¯T1 and e¯T2 yields p1 Cβ = r Cα −rSα and p1 Sβ = r Sα + rCα , respectively. Solving these two equations for r and r and using elementary trigonometric identities then leads to r = p1 sin(β − α) = p1 Sγ , r = p1 cos(β − α) = p1 Cγ ,
(2.28a) (2.28b)
where Sγ = sin γ, and Cγ = cos γ. The quotient of these two equations then yields the following relationship dr dα = tan γ . (2.29) r The derivative of the unit tangent vector with respect to the curvilinear coordinate along the curve is dt¯/ds = (−Sγ e¯1 + Cγ e¯2 )dβ/ds, and the curvature is then 1/ρ = |dβ/ds|. If the curve is convex, which is generally the case for cams, angle β is a monotonically increasing function of s, and hence, 1/ρ = dβ/ds. The chain rule
2.3 Path coordinates
39
for derivatives implies dβ = (1/ρ)(ds/dα)(dα/dr)dr and introducing eqs. (2.14), (2.28a), and (2.29) then yields dr dβ = . (2.30) ρCγ It is left to the reader to verify that eq. (2.30) yields an alternative, simpliſed expression for the curvature of the cam 1 2r2 − rr + r2 = . ρ p31
(2.31)
Finally, an increment in angle γ can be expressed as dγ = dβ − dα and introducing eqs. (2.30) and (2.29) yields 1 tan γ dγ = dr. (2.32) − ρCγ r
2.3 Path coordinates Consider a particle moving along a curve such that its position, s(t), is a given function of time. The velocity vector, v, of the particle is then v=
dp0 dt
=
dp0 ds ds dt
= v t¯,
(2.33)
where v = ds/dt is the speed of the particle, Clearly, the velocity vector of the particle is along the tangent to the curve. Next, the particle acceleration vector, a, becomes a=
dv ¯ v2 dv dt¯ ds = = v˙ t¯ + n ¯. t+v dt dt ds dt ρ
(2.34)
The acceleration vector is contained in the osculating plane, and can be written as a = at t¯ + an n ¯ , where at and an are the tangential and normal components of acceleration, respectively. The tangential component of acceleration, at = v, ˙ simply measures the change in particle speed. The normal component, an = v 2 /ρ, is always directed towards the center of curvature since v 2 /ρ is a positive number. This normal acceleration is clearly related to the curvature of the path; in fact, when the path is a straight line, 1/ρ = 0, and the normal acceleration vanishes. 2.3.1 Problems Problem 2.1. Prove identity
Prove that 1/ρ = p2 /p21 | sin α|, where p2 = p2 and α is the angle between vectors p1 and p2 .
40
2 Coordinate systems
Problem 2.2. Study of a curve Consider the following spatial curve: p0 = a(η + sin η)¯ı1 + a(1 + cos η)¯ı2 + a(1 − cos η)¯ı3 , where a > 0 is a given parameter. (1) Find the tangent, normal, and binormal vectors for this curve. (2) Determine the curvature, radius of curvature, and twist of the curve. Is this a planar curve? Is the tangent vector deſned at all points of the curve?
Problem 2.3. Study of a curve Consider the following spatial curve: p0 = ρ(cos αη)(cos η)¯ı1 + ρ(cos αη)(sin η)¯ı2 + ρ(sin αη)¯ı3 , where ρ > 0 and α are given parameters. (1) Find the tangent, normal, and binormal vectors for this curve. (2) Determine the curvature, radius of curvature, and twist of the curve.
Problem 2.4. Short questions (1) A particle of mass m is sliding along a planar curve. Find the component of the particle’s acceleration vector along the binormal vector of Frenet’s triad. (2) A particle of mass m is sliding along a three-dimensional curve. Find the component of the particle’s acceleration vector along the binormal vector of Frenet’s triad. (3) State the criterion used to ascertain whether a curve is planar or three-dimensional.
Problem 2.5. Study of a curve deſned in polar coordinates The outer surface of a cam is speciſed by the following curve deſned in polar coordinates, r(α) = 1.0 − 0.5 cos α + 0.18 cos 2α. (1) Plot the curve. (2) Plot the curvature distribution for α ∈ [0, 2π].
2.4 Differential geometry of a surface This section investigates the differential geometry of surfaces, leading to the concept of surface coordinates. The differential geometry of surfaces is more complex than that of curves. The ſrst and second metric tensors of surfaces are introduced ſrst, and the analysis of the curvature of surfaces leads to the concept of lines of curvatures and associated principal radii of curvature. Finally, the base vectors and their derivatives are evaluated, leading to Gauss’ and Weingarten’s formulæ. 2.4.1 The ſrst metric tensor of a surface Figure 2.7 depicts a surface, denoted S, in three-dimensional space. A surface is the locus of the points generated by two parameters, η1 and η2 , such that the position vector, p0 , of such points can be written as p0 = p0 (η1 , η2 ).
(2.35)
If η2 is kept constant, η2 = c2 , p0 = p0 (η1 , c2 ) deſnes a curve embedded into the surface; such curve is called an “η1 curve.” Figure 2.7 shows a grid of such curves for various values of c2 . Similarly, “η2 curves” can be deſned, corresponding to p0 = p0 (c1 , η2 ); a grid of η2 curves obtained for different constant c1 is also shown on the ſgure. In general, parameters η1 and η2 do not measure length along these
2.4 Differential geometry of a surface
41
embedded curves, and hence, they do not deſne intrinsic parameterizations of the curves. The surface base vectors are deſned as follows a1 =
∂p0 ∂η1
,
a2 =
∂p0 ∂η2
,
(2.36)
and are shown in ſg. 2.7. Clearly, vectors a1 and a2 are tangent to the η1 and η2 curves that intersect at point P, respectively. Consequently, they lie in the plane tann a2 h2 gent to the surface at this point. Since η1 Tangent and η2 do not form an intrinsic parameterplane P S ization, vectors a1 and a2 are not unit tana1 gent vectors. Furthermore, these two veci 3 tors are not, in general, orthogonal to each p0(h1, h2) other. h1 I The ſrst metric tensor of the surface, A, i2 is deſned as i1 T a11 a12 a1 a1 aT1 a2 A= T = , (2.37) Fig. 2.7. The base vectors of a surface. a2 a1 aT2 a2 a12 a22 and its determinant is denoted a = det(A). A differential element of length on the surface is found as " ! ds2 = dpT0 dp0 = aT1 dη1 + aT2 dη2 (a1 dη1 + a2 dη2 ) = dη T A dη. (2.38)
where dη T = dη1 , dη2 . Clearly, the ſrst metric tensor is closely related to length measurements on the surface. Because the base vectors deſne the plane tangent to the surface, the unit vector, n ¯ , normal to the surface is readily found as n ¯=
a1 a2 a1 a = √ 2. a1 a2 a
The area of a differential element of the surface then becomes √ da = a1 a2 dη1 dη2 = a1 a2 dη1 dη2 = a dη1 dη2 .
(2.39)
(2.40)
2.4.2 Curve on a surface Figure 2.8 depicts a curve, C, entirely contained within surface S. Let the curve be deſned by its intrinsic parameter, s, the curvilinear variable along curve C. The tangent vector, t¯, to curve C is deſned by eq. (2.5). This unit tangent vector clearly lies in the plane tangent to S, and hence, it can be resolved along the base vectors, t¯ = λ1 a1 + λ2 a2 . Because t¯ is a unit vector, it follows that
42
2 Coordinate systems
t¯T t¯ = λT A λ = 1,
(2.41)
where λT = λ1 , λ2 . On the other hand, eq. (2.38) can be recast as dη T dη A = 1. ds ds a2 h2
(2.42)
Because eqs. (2.41) and (2.42) must be identical for all curves on the surface,
P i3 I
S p0(h1, h2)
a1 C h1
i2
i1 Fig. 2.8. A curve, C, entirely contained within surface, S
λ=
dη . ds
(2.43)
This result is expected since ds is an increment of length along C, and t¯ is tangent to C. Angles θ1 = (t¯, a1 ) and θ2 = (t¯, a2 ) can be obtained by expanding the dot products t¯T a1 and t¯T a2 , respectively, to ſnd √ a cos θ1 √ 11 = A λ. (2.44) a22 cos θ2
2.4.3 The second metric tensor of a surface Consider once again a curve, C, entirely contained within surface S, as depicted in ſg. 2.8. The unit tangent vector clearly lies in the plane tangent to the surface, but the curvature vector dt¯/ds will have components in and out of this tangent plane, dt¯ = κn n ¯ + κg ρ¯, ds
(2.45)
where κn is the normal curvature, κg the geodesic curvature, and ρ¯ a unit vector belonging to the plane tangent to S. The normal curvature can be evaluated as κn = n ¯T
n dpT d¯ dt¯ d¯ n = −t¯T =− 02 , ds ds ds
(2.46)
where the normality condition, t¯T n ¯ = 0, was used. The numerator can be written as " ∂n ! T ¯ ∂n ¯ T T n = − a1 dη1 + a2 dη2 dη1 + dη2 −dp0 d¯ ∂η1 ∂η2 ¯ 2 ¯ 2 ¯ ¯ T ∂n T ∂n T ∂n T ∂n dη1 dη2 , dη + a2 dη + a1 + a2 = − a1 ∂η1 1 ∂η2 2 ∂η2 ∂η1 ∂a ∂a ∂a ∂a ¯ T 2 dη22 + n ¯T 1 + n ¯ T 2 dη1 dη2 , = n ¯ T 1 dη12 + n ∂η1 ∂η2 ∂η2 ∂η1 where the orthogonality conditions, n ¯ T a1 = 0 and n ¯ T a2 = 0, were used to obtain the last equality.
2.4 Differential geometry of a surface
The second metric tensor of the surface is deſned as ⎤ ⎡ ⎤ ⎡ ∂ 2 p0 ∂ 2 p0 T T T ∂a1 T ∂a1 n ¯ n ¯ n ¯ n ¯ b11 b12 ⎢ ∂η1 ⎥ ⎢ ∂η12 ∂η1 ∂η2 ⎥ ∂η 2 ⎥ ⎢ B = ⎣ ∂a ⎦=⎣ 2 2 ⎦ = b12 b22 , ∂ p ∂ p 2 T T ∂a2 0 0 n ¯ n ¯ n ¯T n ¯T ∂η1 ∂η2 ∂η1 ∂η2 ∂η22
43
(2.47)
and its determinant is denoted b = det(B). The second equality shows that the second metric tensor is a symmetric tensor. It follows that −dpT0 d¯ n = dη T Bdη, and the normal curvature, eq. (2.46), becomes κn =
dη T Bdη dη T dη B = λT B λ. = ds2 ds ds
(2.48)
2.4.4 Analysis of curvatures Figure 2.9 shows a plane, P, containing the normal, P n ¯ , to surface S. Let curve Cn be at the intersection n a2 h2 of plane P and surface S. Because curve Cn is a planar curve, its curvature vector is in plane P. P Next, let plane P rotate about n ¯ . For each new a1 Cn orientation of the plane, a new curve, Cn , is generS i3 ated with its own normal curvature κn . The followp0(h1, h2) h1 ing problem will be investigated: what is the orienI tation of plane P that maximizes the normal curi2 vature κn ? In mathematical terms, the maximum i1 value of κn = λT B λ is sought, under the normalFig. 2.9. Intersection of surface, ity constraint, λT A λ = 1. This constrained maximization problem will be S, with plane, P, that contains the solved with the help of Lagrange’s multiplier tech- normal to the surface. nique max λT B λ − μ(λT A λ − 1) , λ,μ
where μ is the Lagrange multiplier used to enforce the constraint. The solution of this problem implies (B − μA)λ = 0, and the normality condition λT A λ = 1. Pre-multiplying this equation by λT yields the physical interpretation of the Lagrange multiplier: λT B λ − μ λT A λ = 0 or, in view of the normality constraint, μ = λT B λ = κn , Hence, Lagrange’s multiplier can be interpreted as the normal curvature itself. The condition for maximum normal curvature can now be written as (B − κn A)λ = 0. This set of homogeneous algebraic equations admits the trivial solution λ = 0, but this solution violates the normality constraint. Non-trivial solutions correspond to the eigenpairs of the generalized eigenproblem B λ = κn A λ. Because A and B are symmetric and A is positive-deſnite, the eigenvalues are always real, and mutually orthogonal eigenvectors can be constructed.
44
or
2 Coordinate systems
The eigenvalues are the solution of the quadratic equation det(B − κn A) = 0,
b = 0, (2.49) a where κm = (a11 b22 + a22 b11 − 2a12 b12 )/2a. The solutions of this quadratic equation are called the principal curvatures κIn , κII κ2m − b/a. (2.50) n = κm ± κ2n − 2κm κn +
The mean curvature is deſned as κm =
a11 b22 + a22 b11 − 2a12 b12 κIn + κII n = , 2 2a
(2.51)
and the Gaussian curvature as
b . (2.52) a When b/a > 0, the principal curvatures have the same sign, corresponding to a convex shape; when b/a < 0, the principal curvatures are of opposite sign, corresponding to a saddle shape; ſnally, when b/a = 0, one of the principal curvatures is zero, the surface S has zero curvature in one of the principal curvature directions. κIn κII n =
2.4.5 Lines of curvature A line of curvature of a surface is deſned as a curve whose tangent vector always points along the principal curvature directions of the surface. Consider now a set of coordinates, η1 and η2 , such that a12 = b12 = 0. It follows that a = a11 a22 , b = b11 b22 and κm = (b11 /a11 + b22 /a22 )/2. The principal curvatures then simply become b11 b22 , κII . (2.53) κIn = n = a11 a22 On the other hand, in view of eq. (2.41),η1 or η2 curves are characterized by √ √ λT = 1/ a11 , 0 or λT = 0, 1/ a22 , respectively. Their normal curvature then follows from eq. (2.48) as κn = b11 /a11 and κn = b22 /a22 , respectively. It is now clear that when a12 = b12 = 0, the η1 and η2 curves are indeed the lines of curvatures. It is customary to introduce the principal radii of curvature, R1 and R2 , deſned as b11 1 b22 1 = , κII = . (2.54) κIn = n = a11 R1 a22 R2 2.4.6 Derivatives of the base vectors At this point, the discussion will focus exclusively on surface parameterizations deſning lines of curvatures. In this case, vectors a1 , a2 and n ¯ form a set of mutually orthogonal vectors, although the ſrst two are not necessarily unit vectors. An orthonormal triad can be constructed as follows
2.4 Differential geometry of a surface
e¯1 =
a1 , a1
e¯2 =
a2 , a2
e¯3 = n ¯.
45
(2.55)
To interpret the meaning of these unit vectors, the chain rule for derivatives is used to write ∂p ∂p ds1 ds1 a1 = 0 = 0 = e¯1 , ∂η1 ∂s1 dη1 dη1 where s1 is the arc length measured along the η1 curve. Because ∂p0 /∂s1 = e¯1 is the unit tangent vector to the η1 curve, see eq. (2.5), it follows that a1 = h1 =
ds1 , dη1
a2 = h2 =
ds2 . dη2
(2.56)
Notation h1 = a1 was introduced to simplify the writing. Clearly, h1 is a scale factor, the ratio of the inſnitesimal increment in length, ds1 , to the inſnitesimal increment in parameter η1 , dη1 , along the curve. It is interesting to compute the derivatives of the base vectors. To that effect, the following expression is considered ∂ 2 p0 ∂η1 ∂η2
=
∂a1 ∂a2 ∂(h1 e¯1 ) ∂(h2 e¯2 ) = = = . ∂η2 ∂η1 ∂η2 ∂η1
Expanding the derivatives leads to ∂¯ e1 ∂¯ e2 ∂h1 ∂h2 e¯1 + h1 = e¯2 + h2 . ∂η2 ∂η2 ∂η1 ∂η1
(2.57)
Pre-multiplying this relationship by e¯T1 yields the following identity e¯T1
∂¯ e2 1 ∂h1 = . ∂η1 h2 ∂η2
To obtain this result, the orthogonality of the base vectors, e¯T1 e¯2 = 0, was used; furthermore, e¯T1 ∂¯ e1 /∂η2 = 0, since e¯1 is a unit vector. In terms of intrinsic parameterization, this expression becomes e¯T1
∂¯ e2 ∂¯ e1 1 ∂h1 1 = −¯ eT2 = = , ∂s1 ∂s1 h1 ∂s2 T1
(2.58)
where T1 is the ſrst radius of twist of the surface. Next, eq. (2.57) is pre-multiplied e¯T2 to yield e¯T2
∂¯ e1 ∂¯ e2 1 ∂h2 1 = −¯ eT1 = = , ∂s2 ∂s2 h2 ∂s1 T2
(2.59)
where T2 is the second radius of twist of the surface. Since the parameterization deſnes lines of curvatures, b12 = 0, and eq. (2.47) then implies e¯T2
∂n ¯ ∂¯ e2 =n ¯T = 0, ∂s1 ∂s1
e¯T1
∂n ¯ ∂¯ e1 =n ¯T = 0. ∂s2 ∂s2
46
2 Coordinate systems
The deſnitions of the diagonal terms, b11 and b22 , of the second metric tensor, eq. (2.47), lead to e¯T1
∂n ¯ ∂¯ e1 1 = −¯ nT =− , ∂s1 ∂s1 R1
e¯T2
∂n ¯ ∂¯ e2 1 = −¯ nT =− , ∂s2 ∂s2 R2
where the principal radii of curvature, R1 and R2 , were deſned in eq. (2.54). The derivatives of the surface base vector e¯1 can be resolved in the following manner ∂¯ e1 = c1 e1 + c2 e2 + c3 n ¯, (2.60) ∂s1 where the unknown coefſcients c1 , c2 , and c3 are readily found by pre-multiplying the above relationship by e¯T1 , e¯T2 , and n ¯ T to ſnd ∂¯ e1 1 1 = − e¯2 + n ¯. ∂s1 T1 R1
(2.61)
A similar development leads to 1 ∂¯ e1 = e¯2 . ∂s2 T2
(2.62)
The derivatives of the surface base vector e¯2 are found in a similar manner ∂¯ e2 1 = e¯1 , ∂s1 T1
∂¯ e2 1 1 = − e1 + n ¯. ∂s2 T2 R2
(2.63)
These results are known as Gauss’ formulæ. Proceeding in a similar fashion, the derivatives of the normal vector are resolved in the following manner 1 ∂n ¯ = − e¯1 , ∂s1 R1
∂n ¯ 1 = − e¯2 . ∂s2 R2
(2.64)
These results are known as Weingarten’s formulæ. Gauss’ and Weingarten’s formulæ can be combined to yield the derivatives of the base vectors in a compact manner as ⎧ ⎫ ⎡ ⎤⎧ ⎫ e¯ 0 −1/T1 1/R1 ⎨e¯1 ⎬ ∂ ⎨ 1⎬ ⎣ e¯2 = 1/T1 0 0 ⎦ e¯2 , (2.65a) ⎩ ⎭ ∂s1 ⎩ ⎭ −1/R1 0 n ¯ 0 n ¯ ⎧ ⎫ ⎡ ⎤⎧ ⎫ e¯ 0 0 1/T2 ⎨e¯1 ⎬ ∂ ⎨ 1⎬ ⎣ e¯2 = −1/T2 0 1/R2 ⎦ e¯2 . (2.65b) ⎩ ⎭ ∂s2 ⎩ ⎭ n ¯ 0 −1/R2 0 n ¯ These equations should be compared to the derivatives of Frenet’s triad, eq. (2.13).
2.4 Differential geometry of a surface
47
Example 2.4. The spherical surface The spherical surface in three-dimensional space depicted in ſg. 2.10 is deſned by following position vector p0 = R (sin η1 cos η2 ¯ı1 + sin η1 sin η2 ¯ı2 + cos η1 ¯ı3 ), where R is the radius of the sphere. The surface base vectors are readily evaluated as a1 = ∂p0 /∂η1 = R (cos η1 cos η2 ¯ı1 + cos η1 sin η2 ¯ı2 − sin η1 ¯ı3 ), and a2 = ∂p0 /∂η2 = R (− sin η1 sin η2 ¯ı1 + sin η1 cos η2 ¯ı2 ). The ſrst metric tensor of the sphere now becomes 2 0 R A= . 0 R2 sin2 η1 √ Clearly, h1 = R, h2 = R sin η1 , and a = R2 sin η1 . The normal vector is then evaluated with the help of eq. (2.39), to ſnd n ¯=
a1 a2 = sin η1 cos η2¯ı1 + sin η1 sin η2¯ı2 + cos η1¯ı3 . a1 a2
h1
i3
i3
n
P i2
i1
i2 i1
h2
Fig. 2.10. Spherical surface conſguration.
r f
Fig. 2.11. Parabolic surface of revolution.
The second metric tensor of the spherical surface now follows from eq. (2.47) −R 0 . B= 0 −R sin2 η1 Note that since a12 = 0 and b12 = 0, the coordinates used here are lines of curvature for the spherical surface. The orthonormal triad to the surface is e¯1 = cos η1 cos η2 ¯ı1 + cos η1 sin η2 ¯ı2 − sin η1¯ı3 , e¯2 = − sin η2 ¯ı1 + cos η2 ¯ı2 , n ¯ = sin η1 cos η2 ¯ı1 + sin η1 sin η2 ¯ı2 + cos η1 ¯ı3 . These expressions are readily inverted to ſnd ¯, ¯ı1 = cos η1 cos η2 e¯1 − sin η2 e¯2 + sin η1 cos η2 n ¯, ¯ı2 = cos η1 sin η2 e¯1 + cos η2 e¯2 + sin η1 sin η2 n ¯. ¯ı3 = − sin η1 e¯1 + cos η1 n
48
2 Coordinate systems
The mean curvature, eq. (2.51), and Gaussian curvature, eq. (2.52), are R 1 1 R sin2 η1 R2 sin2 η1 1 − 2− 2 2 = − , κIn κII κm = = 2. n = 2 4 2 R R R R sin η1 R sin η1 Finally, the principal curvatures, eq. (2.53), become 1 1 , κII n =− . R R As expected, the principal radii of curvature R1 = R2 = −R are equal to the radius of sphere. The twists of the surface now follow from eqs. (2.58) and (2.59) κIn = −
1 1 ∂h1 = = 0, T1 h1 h2 ∂η2
1 1 ∂h2 cos η1 = = . T2 h1 h2 ∂η1 R sin η1
(2.66)
2.4.7 Problems Problem 2.6. The parabola of revolution Figure 2.11 depicts a parabolic surface of revolution. It is deſned by the following position vector p0 = r cos φ ¯ı1 + r sin φ ¯ı2 + ar 2¯ı3 , where r ≥ 0 and 0 ≤ φ ≤ 2π. The following notation was used η1 = r and η2 = φ. (1) Find the ſrst and second metric tensors of the ¯ . (3) Find the mean curvature, the Gaussian surface. (2) Find the orthonormal triad e¯1 , e¯2 , and n curvature, and the principal radii of curvature of the surface. (4) Find the twists of the surface.
Problem 2.7. Jacobian of the transformation Consider two parameterizations of a surface deſned by coordinates (η1 , η2 ) and (ˆ η1 , ηˆ2 ). Show that the base vectors in the two parameterizations are related as follows a ˆ1 = J11 a1 + J12 a2 and a ˆ2 = J21 a1 + J22 a2 , where J is the Jacobian of the coordinate transformation ⎤ ⎡ ∂η1 ∂η2 J11 J12 ⎢ ηˆ1 ∂ ηˆ1 ⎥ J= = ⎣∂ ∂η1 ∂η2 ⎦ . J21 J22 ∂ ηˆ2 ∂ ηˆ2 ˆ and If A and B are the ſrst and second metric tensors in coordinate system (η1 , η2 ) and A T ˆ the corresponding quantities in coordinate system (ˆ ˆ = J A J and B η1 , ηˆ2 ), show that A T ˆ B =JBJ .
Problem 2.8. Finding the line of curvature system Using the notations deſned in problem 2.7, let (η1 , η2 ) be a known coordinate system and (ˆ η1 , ηˆ2 ) the unknown line of curvature system. Find the Jacobian of the coordinate transforη1 , ηˆ2 ). Show that the mation that will bring (η1 , η2 ) to the desired line of curvature system (ˆ principal radii of curvature are 1 b11 + γ(2b12 + γb22 ) = , R1 a11 + γ(2a12 + γa22 ) Hint: write the Jacobian as
b11 + α(2b12 + αb22 ) 1 = . R2 a11 + α(2a12 + αa22 )
J=
1 γ , α1
and compute the coefſcients α and γ so as to enforce a ˆ12 = ˆb12 = 0. The solution of the problem is α = Cα /[Δ/(1+αγ)] and γ = −Cγ /[Δ/(1+αγ)] where Cα= a22 b12 −b22 a12 , Cγ = a11 b12 −b11 a12 , Δ = a11 b22 −b11 a22 , and Δ/(1+αγ) = Δ/2± (Δ/2)2 + Cα Cγ .
2.6 Differential geometry of a three-dimensional mapping
49
2.5 Surface coordinates A particle is moving on a surface and its position is given by the lines of curvature coordinates, η1 (t) and η2 (t). The velocity vector is computed with the help of the chain rule for derivatives v=
dp0 dt
=
∂p0 ∂η1
η˙ 1 +
∂p0 ∂η2
η˙2 = s˙ 1 e¯1 + s˙ 2 e¯2 .
(2.67)
Note the close similarity between this expression and that obtained for path coordinates, eq. (2.33). The velocity vector is in the plane tangent to the surface, and the speed of the particle is v = s˙ 21 + s˙ 22 . Next, the acceleration vector is computed as a = s¨1 e¯1 + s˙ 1 e¯˙ 1 + s¨2 e¯2 + s˙ 2 e¯˙ 2 ∂¯ e1 ∂¯ e2 ∂¯ e1 ∂¯ e2 = s¨1 e¯1 + s¨2 e¯2 + s˙ 1 s˙ 1 + s˙ 2 + s˙ 2 s˙ 1 + s˙ 2 . ∂s1 ∂s2 ∂s1 ∂s2 Introducing Gauss’ formulae, eq. (2.61) to (2.63), then yields 2 s˙ 1 s˙ 2 s˙ 22 s˙ 1 s˙ 2 s˙ 21 s˙ 1 s˙ 22 e¯1 + s¨2 + e¯2 + n ¯ . (2.68) − − + a = s¨1 + T1 T2 T2 T1 R1 R2 Note here again the similarity between this expression and that obtained for path coordinates, eq. (2.34). The acceleration component along the normal to the surface is related to the principal radii of curvatures, R1 and R2 . For a curve, the radius of curvature is always positive, see eq. (2.7), whereas for a surface, the radii of curvatures could be positive or negative, see eq. (2.54). Hence, the normal component of acceleration is not necessarily oriented along the normal to the surface. The components of acceleration in the plane tangent to the surface are related to the second time derivative of the intrinsic parameters, as expected. Additional terms, however, associated with the surface radii of twist also appear. Clearly, the acceleration of a particle moving on the surface is affected by the surface radii of curvature and twist; the particle “feels” the curvatures and twists of the surface as it moves.
2.6 Differential geometry of a three-dimensional mapping This section investigates the differential geometry of mappings of the threedimensional space onto itself. The differential geometry of such mappings is more complex than that of curves or surfaces. For simplicity, the analysis focuses on orthogonal mappings, leading to the deſnition of the curvatures of the coordinate system and orthogonal curvilinear coordinates. Two orthogonal curvilinear coordinate systems of great practical importance, the cylindrical and spherical coordinate systems are reviewed.
50
2 Coordinate systems
2.6.1 Arbitrary parameterization Consider the following mapping of the three-dimensional space onto itself in terms of three parameters, η1 , η2 , and η3 , p0 (η1 , η2 , η3 ) = x1 (η1 , η2 , η3 )¯ı1 + x2 (η1 , η2 , η3 )¯ı2 + x3 (η1 , η2 , η3 )¯ı3 .
(2.69)
This relationship deſnes a mapping between the parameters and the Cartesian coordinates x1 = x1 (η1 , η2 , η3 ),
x2 = x2 (η1 , η2 , η3 ),
x3 = x3 (η1 , η2 , η3 ).
(2.70)
Let η2 and η3 be constants whereas η1 only is allowed to vary: a general curve in three-dimensional space is generated. The analysis of section 2.2 would readily apply to this curve, called an “η1 curve.” Similarly, η2 and η3 curves could be deſned. Next, let η1 be a constant, whereas η2 and η3 are allowed to vary: a general surface in three-dimensional space is generated. The analysis of section 2.4 would readily apply to this surface, called an “η1 surface.” Here again, η2 and η3 surfaces could be similarly deſned. A point in space with parameters (η1 , η2 , η3 ) is at the intersection of three η1 , η2 , and η3 curves, or at the intersection of three η1 , η2 , and η3 surfaces. Furthermore, an η1 curve forms the intersection of η2 and η3 surfaces. The inverse mapping deſnes the parameters as functions of the Cartesian coordinates η1 = η1 (x1 , x2 , x3 ),
η2 = η2 (x1 , x2 , x3 ),
η3 = η3 (x1 , x2 , x3 ).
(2.71)
It is assumed here that eqs. (2.70) and (2.71) deſne a one to one mapping, which implies that the Jacobian of the transformation, ⎤ ⎡ ∂x1 ∂x1 ∂x1 ⎢ ∂η1 ∂η2 ∂η3 ⎥ ⎢ ∂x ∂x ∂x ⎥ ⎢ 2 2 2⎥ J =⎢ (2.72) ⎥, ⎢ ∂η1 ∂η2 ∂η3 ⎥ ⎣ ∂x3 ∂x3 ∂x3 ⎦ ∂η1 ∂η2 ∂η3 has a non vanishing determinant at all points in space. Next, the base vectors associated with the parameters are deſned as g1 =
∂p0 ∂η1
,
g2 =
∂p0 ∂η2
,
g3 =
∂p0 ∂η3
.
(2.73)
For an arbitrary parameterization, the base vectors will not be unit vectors, nor will they be mutually orthogonal. Consider the example of the cylindrical coordinate system deſned by the following parameterization
2.6 Differential geometry of a three-dimensional mapping
51
x1 = r cos θ, x2 = r sin θ, x3 = z, where r ≥ 0 and 0 ≤ θ < 2π. The following notation was used: η1 = r, η2 = θ and η3 = z. The inverse mapping is readily found as x2 r = x21 + x22 , θ = tan−1 , z = x3 . x1 Figure 2.12 depicts this mapping; clearly, the familiar polar coordinates are used in the (¯ı1 , ¯ı2 ) plane and z is the distance point P is above this plane. The Jacobian of the transformation becomes ⎤ ⎡ cos θ −r sin θ 0 J = ⎣ sin θ r cos θ 0⎦ . 0 0 1
g3 i3
g2 P r
I i1
z r
q
g1 i2
Note that det J = r, and hence, vanishes at r = 0. Fig. 2.12. The cylindrical coorIndeed, cylindrical coordinates are not deſned at the dinate system. origin since when r = 0, any angle θ maps to the same point, the origin. The base vectors of this coordinate system are g 1 = cos θ ¯ı1 + sin θ ¯ı2 , g 2 = −r sin θ ¯ı1 + r cos θ ¯ı2 , and g 3 = ¯ı3 . Note that g 1 is a unit vector, since g1 = 1, but g 2 is not, g 2 = r. Also note that for cylindrical coordinates, g T2 g 3 = g T1 g 3 = g T1 g2 = 0, the base vectors are mutually orthogonal, as shown in ſg. 2.12. 2.6.2 Orthogonal parameterization When the base vectors associated with the parameterization are mutually orthogonal, the parameters deſne an orthogonal parameterization of the three-dimensional space. The rest of this section will be restricted to such parameterization. In this case, it is advantageous to deſne a set of orthonormal vectors e¯1 =
1 g , g1 1
e¯2 =
1 g , g2 2
e¯3 =
1 g . g3 3
(2.74)
To interpret the meaning of these unit vectors, the chain rule for derivatives is used to write ∂p ∂p ds1 ds1 g1 = 0 = 0 = e¯1 , (2.75) ∂η1 ∂s1 dη1 dη1 where s1 is the arc length measured along the η1 curve. Because ∂p0 /∂s1 = e¯1 is the unit tangent to the η1 curve, see eq. (2.5), it follows that g1 = h1 =
ds1 , dη1
g 2 = h2 =
ds2 , dη2
g 3 = h3 =
ds3 . dη3
(2.76)
Notation h1 = g1 is introduced to simplify the notation. Clearly, h1 is a scale factor, the ratio of the inſnitesimal increment in length, ds1 , to the inſnitesimal increment in parameter η1 , dη1 , along the curve.
52
2 Coordinate systems
2.6.3 Derivatives of the base vectors Here again, the derivatives of the base vectors will be evaluated. To that effect, the following expression is considered ∂ 2 p0 ∂η1 η2
=
∂g 1 ∂η2
=
∂g 2 ∂η1
=
∂(h1 e¯1 ) ∂(h2 e¯2 ) = . ∂η2 ∂η1
(2.77)
Expanding the derivatives leads to ∂h1 ∂¯ e1 ∂¯ e2 ∂h2 e¯1 + h1 = e¯2 + h2 . ∂η2 ∂η2 ∂η1 ∂η1
(2.78)
Pre-multiplying this relationship by e¯T1 yields the following identity e¯T1
∂¯ e2 1 ∂h1 = . ∂η1 h2 ∂η2
(2.79)
To obtain this result, the orthogonality of the base vectors, e¯T1 e¯2 = 0, was used; furthermore, e¯T1 ∂¯ e1 /∂η2 = 0, since e¯1 is a unit vector. Next, eq. (2.78) is pre-multiplied e¯T2 to yield ∂¯ e1 ∂¯ e2 1 ∂h2 e¯T2 = −¯ eT1 = . (2.80) ∂η2 ∂η2 h1 ∂η1 Finally, pre-multiplication by e¯T3 leads to h1 e¯T3
∂¯ e1 ∂¯ e2 = h2 e¯T3 . ∂η2 ∂η1
(2.81)
e2 /∂η1 = −¯ eT2 ∂¯ e3 /∂η1 , this result can be manipulated as follows Since e¯T3 ∂¯ h1 e¯T3
∂¯ e1 ∂¯ e3 e1 h1 h2 T ∂¯ = −h2 e¯T2 =− e¯ , ∂η2 ∂η1 h3 2 ∂η3
(2.82)
where identity (2.81) was used with a permutation of the indices. Using the same identities once again leads to h1 e¯T3
∂¯ e1 e2 ∂¯ e3 ∂¯ e1 h1 h2 T ∂¯ = e¯1 = h1 e¯T1 = −h1 e¯T3 . ∂η2 h3 ∂η3 ∂η2 ∂η2
This result clearly implies ∂¯ e1 = 0. ∂η2 The derivatives of the base vector can be resolved as e¯T3
(2.83)
∂¯ e1 = c1 e¯1 + c2 e¯2 + c3 e¯3 , ∂η1 where the unknown coefſcients c1 , c2 , and c3 are found by pre-multiplying this expression by e¯1 , e¯2 , and e¯3 , respectively, and using identities (2.79), (2.80) and (2.83) to ſnd
2.7 Orthogonal curvilinear coordinates
53
∂¯ e1 1 ∂h1 1 ∂h1 =− e¯2 − e¯3 . ∂η1 h2 ∂η2 h3 ∂η3 Proceeding in a similar manner, the derivatives of base vector e¯1 with respect to η2 and η3 are found as ∂¯ e1 1 ∂h2 = e¯2 , ∂η2 h1 ∂η1
∂¯ e1 1 ∂h3 = e¯3 . ∂η3 h1 ∂η1
Similar expression are readily found for the derivatives of the unit base vectors e¯2 and e¯3 through index permutations and are summarized as ⎧ ⎫ ⎡ ⎤⎧ ⎫ e¯1 ⎬ 0 1/R13 −1/R12 ⎨e¯1 ⎬ ⎨ ∂ e¯2 = ⎣−1/R13 0 0 ⎦ e¯2 , (2.84a) ⎩ ⎭ ∂s1 ⎩ ⎭ 1/R12 e¯3 0 0 e¯3 ⎧ ⎫ ⎡ ⎤⎧ ⎫ e¯1 ⎬ 0 1/R23 0 ⎨ ⎨e¯1 ⎬ ∂ e¯2 = ⎣−1/R23 0 1/R21 ⎦ e¯2 , (2.84b) ⎩ ⎭ ∂s2 ⎩ ⎭ e¯3 0 −1/R21 0 e¯3 ⎧ ⎫ ⎡ ⎤⎧ ⎫ e¯1 ⎬ 0 0 −1/R32 ⎨e¯1 ⎬ ⎨ ∂ e¯2 = ⎣ 0 0 1/R31 ⎦ e¯2 , (2.84c) ⎩ ⎭ ∂s3 ⎩ ⎭ e¯3 1/R32 −1/R31 e¯3 where the curvatures of the system were deſned as 1 ∂h1 1 = , R12 h1 ∂s3 1 1 ∂h2 =− , R21 h2 ∂s3 1 1 ∂h3 = , R31 h3 ∂s2
1 1 ∂h1 =− , R13 h1 ∂s2 1 1 ∂h2 = , R23 h2 ∂s1 1 1 ∂h3 =− . R32 h3 ∂s1
(2.85a) (2.85b) (2.85c)
2.7 Orthogonal curvilinear coordinates Consider a particle moving in three-dimension space. The position of this particle can be deſned by eq. (2.69) in terms of an orthogonal parameterization of space. These parameter deſne a set of orthogonal curvilinear coordinates for the particle. The velocity vector is computed with the help of the chain rule for derivatives v=
dp0 dt
=
∂p0 ∂s1
s˙ 1 +
∂p0 ∂s2
s˙ 2 +
∂p0 ∂s3
s˙ 3 = s˙ 1 e¯1 + s˙ 2 e¯2 + s˙ 3 e¯3 .
(2.86)
The expression for the acceleration vector will involve term in s¨1 e¯1 and s˙ 1 e¯˙ 1 , and similar terms for the other two indices. The latter term is further expanded using the chain rule for derivatives, and expressing the derivatives of the base vectors using eqs. (2.84) then yields
54
2 Coordinate systems
a=
s¨1 − s˙ 22 /R23 + s˙ 23 /R32 − s˙ 1 s˙ 2 /R13 + s˙ 1 s˙ 3 /R12 e¯1 + s¨2 + s˙ 21 /R13 − s˙ 23 /R31 + s˙ 1 s˙ 2 /R23 − s˙ 2 s˙ 3 /R21 e¯2 + s¨3 − s˙ 21 /R12 + s˙ 22 /R21 − s˙ 1 s˙ 3 /R32 + s˙ 2 s˙ 3 /R31 e¯3 .
(2.87)
Note here again the similarity between this expression and that obtained for path or surface coordinates, eqs. (2.34) or (2.68), respectively. The acceleration components in each direction involve the second time derivative of the intrinsic parameters, as expected. Additional terms, however, associated with the radii of curvature of the curvilinear coordinate system also appear. 2.7.1 Cylindrical coordinates The cylindrical coordinate system, depicted in ſg. 2.13, is an orthogonal curvilinear coordinate system deſned as follows (2.88)
p0 = r cos θ ¯ı1 + r sin θ ¯ı2 + z ¯ı3 ,
where r ≥ 0 and 0 ≤ θ < 2π. The following notation was used: η1 = r, η2 = θ, and η3 = z. Note that if z = 0, the cylindrical coordinate system reduces to coordinates r and θ in plane (¯ı1 , ¯ı2 ) and are then often called polar coordinates. f e1
i3
i3 e3 e2
i1
e3 i2
e1
p0 r
r p0
z
q
i2
q
e2
i1
Fig. 2.13. The cylindrical coordinate system.
Fig. 2.14. The spherical coordinate system.
The following summarizes important formulæ in cylindrical coordinates. The scale factors are h1 = 1, h2 = r, and h3 = 1. The curvatures of the cylindrical coordinate system all vanish, except that R23 = r. The base vectors expressed in terms of the Cartesian system are e¯1 =
cos θ ¯ı1 + sin θ ¯ı2 ,
e¯2 = − sin θ ¯ı1 + cos θ ¯ı2 , ¯ı3 . e¯3 =
(2.89a) (2.89b) (2.89c)
The time derivatives of the based vectors resolved along this triad are θ˙ e¯2 , e¯˙ 2 = −θ˙ e¯1 , e¯˙ 3 = 0.
e¯˙ 1 =
(2.90a) (2.90b) (2.90c)
2.7 Orthogonal curvilinear coordinates
55
Finally, the position, velocity, and acceleration vectors, resolved along the base vectors of the cylindrical coordinate system are p0 = r e¯1 + z e¯3 , v = r˙ e¯1 + rθ˙ e¯2 + z˙ e¯3 , ˙ e¯2 + z¨ e¯3 . a = (¨ r − rθ˙2 ) e¯1 + (rθ¨ + 2r˙ θ)
(2.91a) (2.91b) (2.91c)
respectively. 2.7.2 Spherical coordinates The spherical coordinate system, depicted in ſg. 2.14, is an orthogonal curvilinear coordinate system deſned as follows p0 = r sin φ cos θ ¯ı1 + r sin φ sin θ ¯ı2 + r cos φ ¯ı3 ,
(2.92)
where r ≥ 0, 0 ≤ φ ≤ π, and 0 ≤ θ < 2π. The following notation was used: η1 = r, η2 = φ, and η3 = θ. The following summarizes important formulæ in spherical coordinates. The scale factors are h1 = 1, h2 = r, and h3 = r sin φ. The curvatures of the spherical coordinate system all vanish, except that R23 = r, R31 = r tan φ and R32 = −r. The base vectors expressed in terms of the Cartesian system are sin φ cos θ ¯ı1 + sin φ sin θ ¯ı2 + cos φ ¯ı3 ,
(2.93a)
e¯2 = cos φ cos θ ¯ı1 + cos φ sin θ ¯ı2 − sin φ ¯ı3 , sin θ ¯ı1 + cos θ ¯ı2 . e¯3 = −
(2.93b) (2.93c)
e¯1 =
The time derivatives of the based vectors resolved along this triad are φ˙ e¯2 + θ˙ sin φ e¯3 , e¯˙ 2 = −φ˙ e¯1 + θ˙ cos φ e¯3 , ˙ φ e¯1 + cos φ e¯2 ). e¯˙ 3 = −θ(sin
e¯˙ 1 =
(2.94a) (2.94b) (2.94c)
Finally, the position, velocity, and acceleration vectors, resolved along the base vectors of the spherical coordinate system are p0 = r e¯1 ,
(2.95a)
(2.95b) v = r˙ e¯1 + rφ˙ e¯2 + rθ˙ sin φ e¯3 , a = (¨ r − rφ˙ 2 − rθ˙2 sin2 φ) e¯1 + (rφ¨ + 2r˙ φ˙ − rθ˙2 sin φ cos φ) e¯2 + (rθ¨ sin φ + 2r˙ θ˙ sin φ + 2rφ˙ θ˙ cos φ) e¯3 .
(2.95c)
3 Basic principles
This chapter reviews the basic principles of dynamics. Newton’s laws are the foundation of mechanics and dynamics and deal with the behavior of particles subjected to forces. Section 3.1 presents Newton’s three laws and the principle of work and energy. Section 3.2 introduces the concept of conservative forces that play a fundamental role in dynamics. The principle of conservation of energy is discussed in section 3.2.1. The potentials of common conservative forces are given in section 3.2.2, which also introduces the concept of strain energy for rectilinear and torsional springs. The principle of impulse and momentum is discussed in section 3.2.4. Section 3.3 presents basic facts about contact forces because they play an important role in dynamics. Newton’s law only apply to a single particle; section 3.4 introduces Euler’s ſrst and second laws, which are applicable to very general systems of particles.
3.1 Newtonian mechanics for a particle Newton’s laws deal with the motion of a particle, i.e., a body of mass m that presents no physical dimension. This abstraction can be visualized by considering a body of mass m and ſnite dimensions. Next, the dimensions of the body are allowed to shrink, while the mass remains constant; at the limit, a particle of mass m is obtained that occupies a single point in space. As the particle moves, the locus of all positions it occupies in time describes a curve in three-dimensional space called the path of the particle. 3.1.1 Kinematics of a particle The position vector of particle P with respect to an inertial frame will be denoted as xP/O , meaning “position vector of particle P with respect to point O,” which is the origin of the inertial frame. Newton’s laws assume the existence of an inertial
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_3 © Springer Science+Business Media B.V. 2011
58
3 Basic principles
frame, that is, a frame that is stationary with respect to the distant stars. In many practical applications, a frame attached to the earth may be used as an inertial frame. For instance, when studying the dynamics of a jet engine on a test bench, a frame of reference attached to the test bench is appropriate. If the same engine is mounted on an aircraft wing, a frame attached to the wing would not be inertial, because the aircraft is itself moving; for such a problem, a frame attached to the surface of the earth could be considered to be inertial. Finally, when studying the motion of satellites, it becomes necessary to select an inertial frame attached to the sun. The inertial velocity vector or absolute velocity vector of the particle is the time derivative of its position vector with respect to the origin of the inertial frame v=
dxP/O = x˙ P/O , dt
(3.1)
where t indicates time. More often than not, the term “velocity vector” will be used instead of “inertial velocity vector.” The norm of the velocity vector is called the speed, v, of the particle v = v. (3.2) Finally, the particle inertial acceleration vector or absolute acceleration vector is deſned as the derivative of the absolute velocity vector a=
d2 xP/O dv = . dt dt2
(3.3)
3.1.2 Newton’s laws This section presents Newton’s three laws and Newton’s law of gravitation. These laws provide the foundation of dynamics and mechanics. Newton’s ſrst law Newton’s ſrst law of motion states that every object in a state of uniform motion tends to remain in that state of motion unless an external force is applied to it. The expression “state of uniform motion” means that the object moves at a constant velocity. If several forces are applied to the object, the “external force” is, in fact, the resultant, i.e., the vector sum, of all externally applied forces. Finally, the “object” mentioned in the law is to be understood as a particle, as deſned in the previous section. With all these clariſcations, Newton’s ſrst law can be restated: a particle moves at a constant velocity unless the sum of the externally applied forces does not vanish. This also implies that if the sum of the externally applied forces does not vanish, the particle no longer moves at a constant velocity. A more mathematical statement of Newton’s ſrst law is Law 1 (Newton’s ſrst law) A particle moves at a constant velocity if and only if the sum of the externally applied forces vanishes.
3.1 Newtonian mechanics for a particle
59
The expression “if and only if” is included in the statement because the vanishing of the externally applied forces is both a necessary and sufſcient condition for the particle to move at a constant velocity. For statics problems, is is customary to focus on particles at rest rather than moving at a constant velocity. Within this framework, Newton’s ſrst law becomes: a particle is at rest if and only if the sum of the externally applied forces vanishes. This statement provides the deſnition of static equilibrium and is the foundation of statics and structural mechanics. Newton’s second law Newton’s second law states that if a force is acting on a particle, its acceleration is proportional to this force; the constant of proportionality is the mass of the particle. Here again, the force acting on the particle is the vector sum of all externally applied forces. Both externally applied force and resulting acceleration must be understood as vector quantities, and furthermore, the acceleration vector is the inertial acceleration vector as deſned by eq. (3.3). Newton’s second law then states Law 2 (Newton’s second law) The inertial acceleration vector of a particle is proportional to the vector sum of the externally applied forces; the constant of proportionality is the mass of the particle. In mathematical terms, Newton’s second law becomes F = ma,
(3.4)
where F is the sum of the externally applied forces acting on the particle, a its inertial acceleration vector, and m its mass. Clearly, the Newton’s ſrst law is implied by the second. Newton’s second law provides the equations of motion for a particle; it relates the motion of the particle to the externally applied forces. Newton’s third law Newton’s third law is also of fundamental importance to dynamics. It states: if particle A exerts a force on particle B, particle B simultaneously exerts on particle A a force of identical magnitude and opposite direction. It is also postulated that these two forces share a common line of action. In a more compact manner, Newton’s third law states that Law 3 (Newton’s third law) Two interacting particles exert on each other forces of equal magnitude, opposite directions, and sharing a common line of action. Newton’s third law is most useful when dealing with systems of particles: it enables the appropriate modeling of the interaction forces among the particles. It also allows “isolating” or “disconnecting” a particle from its surroundings and replacing the connection by a set of forces of equal magnitudes, opposite directions, and sharing a common line of action. This technique is the basis for drawing free body diagrams of a particle or system of particles.
60
3 Basic principles
Newton’s law of gravitation Newton’s law of gravitation also plays an important role in dynamics. It states that Law 4 (Newton’s law of gravitation) Two particles attract each other in proportion to their masses and in inverse proportion to the square of their relative distance. The line of action of this attractive force joins the two particles. This implies
m1 m 2 , (3.5) r2 where F is the magnitude of the attractive force, m1 and m2 the masses of the two particles, r their relative distance, and G the constant of proportionality know as the universal constant of gravitation. Figure 3.1 shows the force, F 12 , that the second particle exerts on the ſrst, and the force, m2 r F 21 , that the ſrst exerts on the second. Forces F21 F 12 and F 21 have the same magnitude F = F12 F 12 = F 21 , opposite directions F 12 + m1 F 21 = 0, and share a common line of action that joins the two particles. Clearly, these two Fig. 3.1. Gravitation force acting beforces present an important example of Newtween two particles. ton’s third law. F =G
3.1.3 Systems of units The quantities involved in Newton’s three laws are length, mass, time, and force, denoted L, M , T , and F , respectively. In view of Newton’s second law, eq. (3.4), these three quantities are not independent, rather F = M L/T 2. This text uses the SI system of units exclusively. In this system of units, the three basic units are length, mass, and time, measured in meters, denoted “m,” kilograms, denoted “kg,” and seconds, denoted “s,” respectively. Force is then a derived unit measured in Newtons, denoted “N.” A force of 1 N imparts an acceleration of 1 m/s2 to a mass of 1 kg. Systems of units where mass is a basic unit are said to be absolute: the SI system is an absolute system of units. In this set of units, the universal constant of gravitation is G = 6.6732 10−11 m3 /(kg · s2 ).
(3.6)
In view of the small value of this constant, the attractive force acting between objects of small masses is very small. The attractive force between particles, however, is large if one of the particles has a large mass. The weight, w, of a particle at the surface of the earth is deſned as the gravitational force applied by the earth to the particle, w=
GM m, re2
(3.7)
3.1 Newtonian mechanics for a particle
61
where M = 5.976 1024 kg is the mass of the earth, re = 6, 378 km its radius, and m the mass of a particle located at the surface of the earth. Using these constants, it follows that the weight of a particle is w = 9.803 m = gm, where g = 9.803 m/s2 is the gravitational constant at the surface of the earth. Because the earth is not a perfect sphere and its mass distribution is not uniform, small variations of the gravitational constant should be expected from point to point. For most dynamics problems, g = 9.81 m/s2 will be a sufſciently accurate value of the gravitational constant. At the surface of the earth, the weight of an 80 kg person is w = 9.81 × 80 = 785 N. In the US customary system of units, the three basic unit are length, time, and force, measured in feet, denoted “ft,” seconds, denoted “s,” and pounds, denoted “lbs,” respectively. In this system, mass is then a derived unit measured in slugs, denoted “slug.” A mass of 1 slug weighs 1 lb when subjected to a gravitational acceleration of 1 ft/s2 . Systems of units where force is a basic unit are said to be gravitational: the US customary system is a gravitational system. In the US customary system, g = 32.17 ft/s2 , and the mass of a particle at the surface of the earth is then found as m = w/g. It should be noted that in the US customary system, length is sometimes measured in inches rather than feet; in this case, g = 386 in/s2 . 3.1.4 The principle of work and energy m Figure 3.2 depicts a particle of mass m whose dr(t) ti position is described by position vector r(t) i3 with respect to an inertial frame, F I = r(t) q [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. While moving along its tf F(t) path, the particle is acted upon by forces, the reO sultant of which is F (t). These forces are called i2 I externally applied forces, or impressed forces. i1 The differential work, dW , the resultant force performs on the particle as it moves by an Fig. 3.2. Force acting on a particle. differential distance, dr, is deſned as the scalar product of the force vector by the differential displacement vector of its point of application dW = F T dr. (3.8)
In view of the deſnition of the scalar product, this differential work can be written as dW = F dr cos θ, where θ is the angle between the force and the differential displacement vectors, see ſg. 3.2. If the force is normal to the differential displacement, the differential work vanishes, although the force is of ſnite magnitude. The notation dW is used to indicate the differential work, but it does not imply the existence of a work function, W , such that d(W ) is the differential work. Introducing Newton’s second law, eq. (3.4), into the deſnition of the differential work leads to dW = F T dr = maT dr = m
dv T dv T dr dt = m v dt = m v T dv. dt dt dt
(3.9)
62
3 Basic principles
The kinetic energy, K, of the particle is deſned as 1 K = mv T v. (3.10) 2 The differential change in kinetic energy is d(K) = m v T dv, and it follows that (3.11)
dW = d(K).
Consider now two arbitrary instants during the motion of the particle, say times ti and tf , as illustrated in ſg. 3.2. The work done by the force over this period is denoted Wti →tf and can be evaluated as follows % tf % tf Wti →tf = F T dr = d(K) = K(tf ) − K(ti ) = Kf − Ki = ΔK. (3.12) ti
ti
This result is known as the principle of work and energy. Principle 1 (Principle of work and energy for a particle) The work done by the external forces acting on a particle equals the change in the particle’s kinetic energy.
3.2 Conservative forces Figure 3.2 depicts a particle of mass m whose position is described by position vector r(t) with respect to an inertial frame, F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. Conservative forces are a class of forces that depend only upon the position of the particles on which they act, F = F (r). Although these forces may vary with time as the particle moves, they do not depend explicitly on time or velocity. Figure 3.3 shows two arbitrary paths, denoted ACB and ADB, along which the particle moves in space from point A to point B. Deſnition By deſnition, force F is conservative if and only if the work it performs along any path joining the same initial and ſnal points is identical. This is expressed by the following equation % % T WA→B = F dr = F T dr. (3.13) Path ACB
Path ADB
Since reversing the limits of integration simply changes the sign of the integral, the work done by the force along path ADB is equal in magnitude and opposite in sign to that along path BDA. Equation (3.13) then implies the vanishing of the work done by the force over the closed path ACBDA. Because path ACB and ADB are arbitrary paths joining points A and B, it follows that a force is conservative if and only if the work it performs vanishes over any arbitrary closed path, & & W = F T dr = F T dr = 0, (3.14) Any path
where C is an arbitrary closed curve.
C
3.2 Conservative forces
B
Path ACB C m D
F(r)
n Path ADB
A
r i1
63
C
i3 O I
S i2
Fig. 3.3. Paths ACB and ADB join the same two points, A and B.
Fig. 3.4. Path enclosing a surface of area S with a normal n ¯.
Potential of a conservative force Based on the deſnition of conservative forces, eq. (3.14), Stokes’ theorem [2] then implies that & % dS = 0, F T dr = n ¯ T ∇F (3.15) C
S
where S is a surface bounded by curve C, n ¯ the outward normal to surface S, as = curl(F ). If the force is conservative, the surface integral shown in ſg. 3.4, and ∇F must vanish for any surface, S, and this can only occur if the integrand vanishes, = 0 for any curve, C, and surface, S. Textbooks on vector algebra [2], leading to ∇F prove the following identity: ∇∇V = 0, where V is an arbitrary scalar function and = 0 is ∇V = grad(V ). It can then be shown that the solution of equation ∇F simply F = −∇V, (3.16) where ∇ is the gradient operator. If a vector ſeld, F , can be derived from a scalar function, V , this function is called a potential, and the vector function is said to “be derived from a potential.” Because the potential is an arbitrary scalar function, the minus sign is redundant, but is, however, a convention that will be justiſed later. It has now been established that if a force is conservative, it can be “derived from a potential.” In more mathematical terms, a conservative force must be the gradient a scalar function, called the potential of the force. If I = (¯ı1 , ¯ı2 , ¯ı3 ) is an orthonormal basis, conservative forces can be expressed as F = −∇V = −
∂V ∂V ∂V ¯ı1 − ¯ı2 − ¯ı3 . ∂x1 ∂x2 ∂x3
(3.17)
The work done by a conservative force over an arbitrary path joining point 1 to point 2, with position vectors r 1 and r 2 , respectively, is then % r2 % r2 % r2 ∂V ∂V ∂V F T dr = − ∇T V dr = − dx1 + dx2 + dx3 W1→2 = ∂x ∂x ∂x 1 2 3 r1 r1 r1 % r2 =− dV = V (r 1 ) − V (r 2 ). r1
64
3 Basic principles
Thus the work done by a conservative force along any path joining point 1 to point 2 depends only on the positions of these points and can be evaluated as the difference between the values of the potential function expressed at these two points, W1→2 = V (r 1 ) − V (r 2 ) = −ΔV.
(3.18)
If point 1 and 2 are an inſnitesimal distance apart, dW = V (r 1 ) − V (r 1 + dr) = −d(V ).
(3.19)
The differential work is now the true derivative of the potential function. Summary Conservative forces enjoy a number of remarkable properties. Initially, conservative forces are deſned as forces that perform the same work along any path joining the same initial and ſnal points, as expressed by eq. (3.13). Simple calculus reasoning is then used to prove that a force is conservative if and only if the work it performs vanishes over any arbitrary closed path, see eq. (3.14). Finally, conservative forces are shown to be derivable from a potential, as expressed by eq. (3.16). Consequently, the work done by a conservative force along any path joining two points can be evaluated as the difference between the potential function evaluated at these two points, see eq. (3.18). Examples of conservative forces To illustrate these concepts, consider the gravity force acting on a particle of mass m located at the surface of the earth. It can easily be shown that this force is conservative. Therefore, the scalar potential, V , of the gravity forces is V = mg r·¯ı3 = mgx3 , where r = x1¯ı1 + x2¯ı2 + x3¯ı3 is the position vector of the particle. The gravity force, F g , acting on the particle can be obtained from this potential using eq. (3.17) to ſnd F g = −∇V = −∂V /∂x3 ¯ı3 = −mg¯ı3 , and the gravity forces is said to be “derived from a potential.” The work done by the' gravity force as the ' x3bparticle moves from elevation x3a to x3b x3b then becomes W = x3a F g · dr = − x3a ∂V /∂x3 dx3 = V (x3a ) − V (x3b ). Clearly, this work depends on the initial and ſnal elevations only, but not on the particular path followed by the particle as it moved from the initial to the ſnal elevation. If the particle moves along a closed path starting and ending at the same elevation, the work done by the gravity force vanishes. As another example, consider the restoring force of an elastic spring of stiffness constant k. If the spring is stretched by an amount u, the restoring force is −ku, and can be derived from a potential of the form V (u) = 1/2 ku2 . Indeed, using eq. (3.17), the elastic force in the spring becomes Fs = −∂V /∂u = −ku. This relationship is the constitutive law for the spring because it relates the force in the spring to its elongation.
3.2 Conservative forces
65
Quantity V (u) is called the strain energy and it can be viewed as a “potential of the elastic forces” in the spring. Hence, the strain energy function implicitly deſnes the constitutive behavior of the component. Finally, the work' done by the u elastic restoring force as the spring stretches from ua to ub is W = uab Fs du = ' ub − ua ∂V /∂u du = V (ua ) − V (ub ). Here again, the work depends only on the initial and ſnal positions. At ſrst glance, the potential of a gravity force and the strain energy of an elastic spring seem to be distinct, unrelated concepts. Both quantities, however, share a common property: forces can be derived from these scalar potentials. Consider a particle of mass m connected to an elastic spring of stiffness constant k and subjected to a gravity force acting in the direction of the spring. The downward displacement, u, of the mass measures both the spring stretch and the elevation of the particle. The externally applied gravity force can be derived from the potential, V = mgu, as Fg = −∂V /∂u = −mg; the restoring force in the spring can be derived from the strain energy, V = 1/2 ku2 , which can also be viewed as the potential of the internal forces, as Fs = −∂V /∂u = −ku. The two forces acting on the particle can therefore be derived from a potential. 3.2.1 Principle of conservation of energy The forces applied to a particle can be divided into two categories: the conservative forces, which can be derived from a potential, and the non-conservative forces, for which no potential function exists. The principle of work and energy, eq. (3.11), now becomes dW = dWc + dWnc = −d(V ) + drT F nc = d(K), (3.20) where dWc and dWnc indicate the differential work done by the conservative and non-conservatives forces, respectively, and F nc denotes the non-conservative forces. The work done by these forces over the period from time ti to tf now becomes %
tf
%
tf
F Tnc dr = Kf − Ki .
(3.21)
The ſrst term of this expression readily integrates to yield % tf F Tnc dr = (Kf + Vf ) − (Ki + Vi ),
(3.22)
ti
−d(V ) +
ti
ti
where Vi = V (ti ) and Vf = V (tf ) are the values of the potential function at the initial and ſnal times, respectively. The total mechanical energy, E, is deſned as the sum of the kinetic energy and potential function, E = K + V. (3.23) The principle of work an energy principle now becomes
66
3 Basic principles
%
tf
F Tnc dr = Ef − Ei .
(3.24)
ti
If the particle is acted upon by conservative forces only, the principle of work and energy reduces to Ef = Ei . (3.25) This statement is known as the principle of conservation of energy. Principle 2 (Principle of conservation of energy for a particle) If a particle is subjected to conservative forces only, the total mechanical energy is preserved. Clearly, the term “conservative forces” stems from the fact that in the sole presence of such forces, the total mechanical energy of the particle is conserved. In view of the principle of work and energy, work, kinetic energy, potential energy, and total mechanical energy all share the same units, force times distance, N·m. A Joule is deſned as 1 J = 1 N·m. Although the moment of a force has the same units, N·m, Joules are used only when dealing with energy; in other words, a 10 N·m moment should not be referred to as a 10 J moment. The work done by force over a period of time from ti to tf , see eq. (3.12), can be written as % tf % tf % tf T T dr dt = Wti →tf = F dr = F F T v dt. (3.26) dt ti ti ti The last integrand, F T v, is the power of the externally applied forces; it is a measure of the work done by the forces per unit time. Power has units of work divided by time, J/s. A Watt is deſned as 1 W = 1 J/s = 1 N·m/s. 3.2.2 Potential of common conservative forces In the previous section, it was shown that conservative forces are associated with special functions called potential functions, from which they can be derived. A few commonly used potential functions will be derived in this section. Work done by a central force First, the work done by a central force will be evaluated. A central force is such that its line of action passes m through a ſxed inertial point in space and its magniF tude depends on the sole distance r between the particle i2 I O and the ſxed point. Figure 3.5 shows a particle of mass m subjected to a central force F whose line of action i1 passes through point O, the origin of an inertial frame. Fig. 3.5. A central force. Because distance r between the origin and the particle is inherent to the deſnition of the central force, it seems natural to use the spherical coordinate system deſned in section 2.7.2 to express the position of the particle. i3
r
e1
3.2 Conservative forces
67
The velocity of the particle expressed in spherical coordinates, see eq. (2.95b), is v = r˙ e¯1 + rφ˙ e¯2 + rθ˙ sin φ e¯3 . Multiplying this relationship by dt reveals the relationship between an increment in particle position, dr, and increments in coordinates dr, dθ, and dφ, as dr = dr e¯1 + rdφ e¯2 + rdθ sin φ e¯3 . On the other hand, the central force is expressed as F = −f (r)¯ e1 , where f (r) is its magnitude that depends on r only, and e¯1 its line of action, always passing through point O. The differential work done by the central force now becomes dW = F T dr = −f (r)¯ eT1 (dr e¯1 +rdφ e¯2 +rdθ sin φ e¯3 ) = −f (r)dr. The potential, V , of the central force is deſned as dV . (3.27) f (r) = dr With this deſnition, the differential work done by the central force becomes dW = −
dV dr = −d(V ). dr
Because the differential work can be expressed as an exact differential, the central force is a conservative force, and its potential is the integral of the magnitude of the central force. The potential is deſned within a constant: adding a constant to the potential does not alter the magnitude of the central force. The potential of gravity forces An important example of central forces are gravitational forces, as described by Newton’s gravitation law. The magnitude of the gravitational force is given by eq. (3.5) as f (r) = GM m/r2 . The gravitational force acts on an particle of mass m due to the presence of another particle of mass M assumed to be ſxed with respect to an inertial frame; ſg. 3.1 shows that such force is a central force. The potential function for the gravity forces then follows from eq. (3.27) as V (r) = −G
Mm . r
(3.28)
This potential is called the potential of gravity forces. Consider now a particle located at a height h above the surface of the earth; this implies r = re + h, where re is the radius of the earth. If the particle is close to the surface of the earth, h re and 1/r = 1/[re (1 + h/re )] ≈ (1 − h/re )/re . The potential function now becomes: V (r) = −GM m/re + GM mh/re2 . Because the ſrst term of this expression is a constant, it can be omitted to yield the potential function as M mh V (r) = G 2 = mgh. (3.29) re This potential function is the potential of gravity forces for particles located near the surface of the earth. The height, h, of the particle is measured from a reference elevation, called the datum, which is selected in an arbitrary manner. Indeed, changing the datum is equivalent to adding a constant to the potential function, leaving the gravitation forces unchanged.
68
3 Basic principles
The strain energy function of an elastic spring Consider now a particle of mass m connected to a rectilinear spring; the other end of the spring is attached to inertial point O, as depicted in ſg. 3.6. The spring can stretch elastically, but is massless; it practice, this means that the mass of the spring is negligible with respect to that of the particle. Clearly, the situation is similar to that shown e1 i3 in ſg. 3.5: the particle is subjected to a central r force F (r) = −f (r)¯ e1 . The magnitude of the cenm tral force is related to the stretch of the spring, k I Δ = r − r0 , where r0 is the un-stretched length i2 O of the spring. For a linearly elastic spring, the force i1 in the spring is proportional to its stretch, f (r) = Fig. 3.6. Particle connected to an k(r − r0 ) = kΔ, where k is the spring stiffness constant. The units of the spring stiffness constant are elastic spring. N/m. The potential function of the elastic forces in the spring then follows from eq. (3.27) as 1 V (r) = k Δ2 . (3.30) 2 This work function is often called the strain energy function for the elastic spring. The present formulation is not limited to linearly elastic springs: the magnitude of the elastic force in the spring could be a nonlinear function of the stretch, such as f (r) = k1 Δ + k3 Δ3 . In this case, the strain energy function of the nonlinearly elastic spring is V = 1/2 k1 Δ2 + 1/4 k3 Δ4 . The principle of work and energy affords a description of the kinetics of a particle in terms of energies rather than displacements and accelerations. Consider the system depicted in ſg. 3.6, at time t0 , the particle is at rest and the spring is un-stretched: the velocity of the particle vanishes, implying K0 = 0, and V0 = 0, because the spring is un-stretched. External forces are applied to the particle that bring it to a new rest conſguration at time t1 , hence K1 = 0. Because the system is conservative, ext the work done by the external forces is W0→1 = E1 − E0 = V1 . For this simple case, the principle of work and energy implies that the work done by the externally applied force equals the strain energy in spring. This work is stored in the system in the form of strain energy: no energy has been lost, but its nature has changed from potential to strain energy. In this description, the trajectory of the particle from time t0 to time t1 is irrelevant; the only important quantity is the stretch, Δ1 , of the spring at time t1 , which determines the strain energy, V1 . This is a characteristic of conservative forces: the work they perform does not depend on the particular path followed from time t0 to t1 , but only on the initial and ſnal conſgurations of the system that determine the initial and ſnal stretch of the spring. Next, the set of external forces that maintained the steady deformation Δ1 of the spring is released; the particle evolves along a certain trajectory and at time t2 , the stretch of the spring vanishes, Δ2 = 0. Because no external forces are applied
3.2 Conservative forces
69
ext between time t1 and t2 , the principle of work and energy implies W1→2 = 0 = E2 − E1 = K2 − V1 , where K2 is the kinetic energy of the particle at time t2 . No energy has been lost: the energy transformed from strain to kinetic energy, K2 = V1 . The speed v2 of the particle at time t2 is v2 = k/m Δ1 . Here again, the speciſc trajectory followed by the particle is not relevant. Both kinetic and strain energy functions are positive-deſnite functions, i.e., K = 1/2 mv 2 > 0 for any arbitrary speed of the particle v = 0 and V = 1/2 kΔ2 > 0 for any stretch of the elastic spring Δ = 0. Consider a strain energy function of the form V = 1/2 k0 Δ2 + 1/3 k1 Δ3 ; this strain energy function vanishes for Δcr = −3/2 k0 /k1 . For stretches Δ < Δcr , the strain energy becomes negative, hence this strain energy function is invalid because it is not positive-deſnite. For Δ < Δcr , the spring will add energy to the system; energy is being created, a physical impossibility for a passive device.
The strain energy function of a torsional spring Consider the planar problem depicted in ſg. 3.7: e2 e1 a particle of mass m is connected to a rigid rod i3 m r of length . The rod pivots about inertial point O, l where a torsional spring of stiffness constant k is loF k q cated. The torsional spring applies a moment to the i2 O rigid rod about point O, which is then transmitted to the particle in the form of a force F , acting in the direction normal to the rod; this force is clearly not Fig. 3.7. Particle subjected to a force generated by a torsional a central force. The position of the particle will be spring. represented by polar coordinates, r and θ, see section 2.7.1. The velocity of the particle is v = r˙ e¯1 + rθ˙ e¯2 , see eq. (2.91b). Because the rod is rigid, r˙ = 0, and multiplying the velocity relationship by dt implies dr = dθ e¯2 . The force vector, F , has a line of action along e¯2 and its magnitude is a function of the sole angle θ: F = −f (θ)¯ e2 . The differential work done by this force now becomes dW = F T dr = −f (θ)¯ eT2 dθ¯ e2 = −f (θ)dθ. (3.31) Clearly, M (θ) = f (θ) is the moment the torsional spring applies to the rigid rod and hence, dW = −M (θ)dθ. For a linearly elastic torsional spring, M (θ) = k(θ − θ0 ), where θ0 is the angular position of the rigid rod for which the torsional spring is unstretched. The units for the stiffness constant k are N·m/rad. The potential function for the torsional spring now becomes V (θ) =
1 k(θ − θ0 )2 . 2
(3.32)
This potential function is called the strain energy function of the torsional spring. It is also possible to deſne nonlinearly elastic torsional springs, for which the elastic moment is a nonlinear function of angle θ; for instance, if M (θ) = k1 (θ − θ0 ) + k3 (θ − θ0 )3 , the strain energy function is then
70
3 Basic principles
V (θ) = k1 (θ − θ0 )2 /2 + k3 (θ − θ0 )4 /4. 3.2.3 Non-conservative forces Consider now a particle of mass m connected to a rectilinear dashpot; the other end of the dashpot r m is attached to an inertial point O, as depicted in ſg. 3.8. The dashpot can slide axially and it is massc I i2 less; in practice, this means that its mass is negligiO i1 ble compared to that of the particle. For a linear dashpot, the magnitude of viscous Fig. 3.8. Particle connected to a force it generates is proportional to the time rate of dashpot. change of its length, i.e., f (r) ˙ = cr. ˙ The coefſcient c is called the dashpot constant and it units are N·s/m. Although the line of action of the force generated by the dashpot passes through an inertial point, it is not a central force because its magnitude does not depend on the sole distance between the particle and the inertial point. If the position of the particle is expressed in terms of spherical coordinates, differential displacements are then dr = dr e¯1 + rdφ e¯2 + rdθ sin φ e¯3 . The differential work done by the dashpot force now becomes dW = F T dr = −f (r)¯ ˙ eT1 (dr e¯1 + rdφ e¯2 + rdθ sin φ e¯3 ) = −cr˙ dr. Because of the r˙ dependency of the viscous force, the differential work cannot be cast in the form of an exact differential; there exist no potential function, V (r), such that dV /dr = −crdr. ˙ The force in the dashpot is a non-conservative force. The work done by the viscous forces in the dashpot is % tf % tf % tf dr Wti →tf = − dt = − cr˙ dr = − cr˙ cr˙ 2 dt < 0. (3.33) dt ti ti ti i3
e1
The presence of the r˙ 2 term implies that the work done by the viscous forces is always negative, i.e., they are dissipative forces. For the system depicted in ſg. 3.8, the principle of work and energy implies that Wti →tf = Ef − Ei , or Ef = Ei + Wti →tf . Because the work is a negative quantity, the total mechanical energy of the system monotonically decreases in time; furthermore, the change in total mechanical energy exactly equals the work done by the viscous forces in the dashpot. This result explains the term “dissipative forces” or “non-conservative forces” used to qualify the viscous forces in the dashpot. Of course, dashpots are not always linear; the magnitude of the viscous force could be a nonlinear function of velocity, such as f (r) ˙ = c1 r˙ + c3 r˙ 3 , for instance. Function f (r) ˙ r, ˙ however, must be a positive-deſnite function of r˙ to guarantee the dissipative nature of the resulting viscous force. Finally, it is also possible to encounter torsional dashpots; in ſg. 3.7, the torsional spring would be replaced by a dashpot that applies to the rigid bar a moment whose magnitude is a function of the time rate of change of angle θ. The differential work done by the viscous forces in the torsional is then ˙ ˙ ˙ = cθ, ˙ where dW = −f (θ)dθ = −M (θ)dθ; for a linear torsional dashpot, M (θ)
3.2 Conservative forces
71
the dashpot constant now has units of N·m·s. The energy closure equation
't Consider the work and energy principle given by eq. (3.24), written as tif F Tnc dr = Ef − Ei . In this expression, the initial and ſnal time instants can be selected arbitrarily; in particular, let the ſnal time be an arbitrary time, tf = t, during the evolution of the system. The principle of work and energy now becomes % t E(t) − F Tnc dr = Ei . (3.34) ti
In the absence of non-conservative forces, this equation reduces to E(t) = Ei , the statement of conservation of the total mechanical energy of the system. Even in the presence of non-conservative forces, however, equation (3.34) implies the conservation a scalar quantity, the difference between the total mechanical energy and the cumulative work done by the non-conservative forces, must remain constant. This relationship is known as the energy closure equation. Example 3.1. bungee jumping A man of mass m is jumping off a bridge while attached to a bungee cord of unstretched length d0 . An inertial frame, F = [O, I = (¯ı1 , ¯ı2 )], is attached to the bridge. The man is jumping from point O with an initial velocity, v0 , oriented along horizontal axis ¯ı2 , and the acceleration of gravity is acting along vertical axis ¯ı1 . During the ſrst part of his fall, the man is in free ƀight under the effect of gravity, and at some instant in time, the bungee becomes taut. During the second portion of his fall, the man is subjected to the combined effects of gravity and the elastic force of the bungee. The potential of the bungee is of the following form: Vb = ¯ where Δ¯ = (d − d0 )/d0 = Δ/d0 is the non-dimensional 1/2 k0 d20 ln2 (1 + Δ), stretch of the bungee, and d the distance from point O to the man. The magnitude of ¯ ¯ the force the bungee applied to the man is Fb = dV /dΔ = k0 d0 ln(1 + Δ)/(1 + Δ). Determine the trajectory of the fall. Free fall Let the man’s trajectory be denoted r(t) = x1 (t) ¯ı1 + x2 (t) ¯ı2 . During free fall, Newton’s second law writes m¨r = mg¯ı1 , where g is the acceleration of gravity. Integration yields 1 v¯ = g¯τ¯ı1 + ¯ı2 , ¯r = g¯τ 2¯ı1 + τ¯ı2 . (3.35) 2 The following non-dimensional quantities were introduced: ¯r = r/d0 , v¯ = r˙ /v0 , τ = v0 t/d0 , and g¯ = gd0 /v02 . The bungee cord becomes taut when r = d0 , or ¯r(τt )2 = 1, where τt denotes the instant at which the bungee becomes taut. Introducing this condition in eq. (3.35) and solving for τt yields √ 2 τt = 1 + g¯2 − 1. (3.36) g¯
72
3 Basic principles
Trajectory when the bungee cord is taut Once the bungee is taut, Newton’s second law implies m¨r = mg¯ı1 + Fb u ¯, were Fb is the magnitude of the elastic force the bungee applies on the man and u ¯ the unit vector pointing from the man to point O. The distance from the man to point O is d = d0 + Δ = x21 + x22 , and u ¯ = (x1 ¯ı1 + x2 ¯ı2 )/d. In non-dimensional form, the equation of motion becomes ¯ ln(1 + Δ) ¯r = g¯ ¯ı1 − k¯0 (¯ x1 ¯ı1 + x ¯2 ¯ı2 ), ¯ (1 + Δ)2
(3.37)
where (·) indicates a derivative with respect to the non-dimensional time τ , Δ¯ = x ¯21 + x¯22 − 1, k¯0 = k0 d20 /(mv02 ), x¯1 = x1 /d0 , and x ¯2 = x2 /d0 . Because the equations of motion are nonlinear, their solution can only be obtained by means of numerical methods, which often require recasting the governing equations in ſrst-order form. In the present case, the ſrst-order form of the equations is ⎫ ⎧ v¯1 ⎪ ⎧ ⎫ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ v ¯ x ¯ ⎪ ⎪ ⎪ ⎪ 2 1⎪ ⎪ ⎪ ⎪ ⎨ ⎬ ⎬ ⎨ ¯ ln(1 + Δ) x ¯2 ¯ x ¯ = g¯ − k0 1 ¯ 2 ⎪, v¯1 ⎪ ⎪ ⎪ (1 + Δ) ⎪ ⎪ ⎪ ⎩ ⎪ ⎪ ⎪ ⎭ ¯ ⎪ ⎪ ln(1 + Δ) v¯2 ⎪ ⎪ ⎪ ⎪ ¯ ⎩ − k0 x ¯ 2 ¯2 ⎭ (1 + Δ) where v¯1 = v1 /v0 and v¯2 = v2 /v0 are the non-dimensional components of vertical and horizontal velocity, respectively. The ſrst two equations, x ¯1 = v¯1 and x ¯2 = v¯2 , simply deſne the velocity components, v¯1 and v¯2 , and the last two equations are the actual equations of motion. In this form, many standard time integration methods such as Runge-Kutta integrators, among many others, can be used. Extensive discussion of these integrators can be found in many textbook on numerical analysis, see refs. [4, 5], for instance. The following non-dimensional parameters are used for the simulation: g¯ = 12, and k¯0 = 50. The end of the free fall phase occurs at time τt = 0.4254. Figure 3.9 show the man’s trajectory during free fall and when the bungee cord is taut. For all times τ < τt the bungee cord is slack and its stretch vanishes; ſg. 3.10 shows the ¯ for τ ≥ τt . bungee’s non-dimensional stretch, Δ, At time τ = 1.5018, the bungee becomes slack again, and equation of motion, eq. (3.37), is no longer valid because it include the force stemming from the bungee cord. To continue the simulation past that time, the equation of motion for free fall under gravity, m¨r = mg¯ı1 , would be used again, with initial conditions corresponding to the man’s position and and velocity at the end of the previous phase, i.e., at time τ = 1.5018. Figure 3.11 depicts the bungee non-dimensional force, F¯b = Fb /(k0 d0 ), versus ¯ The apparent stiffness, k, of the bungee cord is the its non-dimensional stretch, Δ. tangent to the force-stretch curve,
3.2 Conservative forces 0
73
1.5
BUNGEE STRETCH D/d0
-0.5
x1/d0
-1
-1.5
-2
-0.4 -0.2
0
0.2
1
0.5
0 0.4
0.4
x2/d0
Fig. 3.9. Man’s trajectory. The symbols ◦ indicate the free fall portion of the trajectory.
k=
0.6
0.8
1
t
1.2
1.4
1.6
1.8
Fig. 3.10. Non-dimensional stretch of the ¯ versus time τ . bungee, Δ,
¯ 1 − ln(1 + Δ) dFb = k0 ¯2 . dΔ (1 + Δ)
(3.38)
As the stretch of the cord increases, its stiffness decreases and vanishes when ln(1 + ¯ = 1, or Δ¯ ≈ 1.718. Clearly, the parameters selected for the present simulation Δ) result in a very large stretching of the bungee cord, which would threaten the safety of the jumper. 15
25
10
SYSTEM ENERGIES
20
Fb/k0
15
10
5
0
5
0
-5
-10
-15 0
0.5
D/d0
1
1.5
Fig. 3.11. Magnitude of the non-dimensional ¯ force, Fb , in the bungee versus stretch, Δ.
0
0.5
t
1
1.5
Fig. 3.12. System energies: kinetic energy, solid line; potential energy, dashed line; total mechanical energy, dashed-dotted line.
The kinetic energy of the system is K = 1/2 m˙r2 . The potential of the gravity forces is Vg = −mgx1 , and the potential of the elastic bungee cord Vb = ¯ In non-dimensional form, the total mechanical energy of the 1/2 k0 d20 ln2 (1 + Δ). system becomes ¯ = E = K + V = 1 ¯r 2 − g¯x¯1 + E mv02 mv02 mv02 2
1¯ 2 ¯ k0 ln (1 + Δ). 2
74
3 Basic principles
Figure 3.12 depicts the evolution of the system’s energies versus time τ . Because the forces acting on the system are conservative forces, the total mechanical energy remains constant during the simulation. This observation provides a validation of the derivation of the equation of motion and of its numerical solution. Effect of drag forces. The developments presented in the previous paragraphs have ignored the effect of air friction on the man’s trajectory. These forces can be taken into account in an approximate manner applying to the man a drag force, F d = −1/2 Cd ρA vv, where Cd is the non-dimensional drag coefſcient, ρ the air density, and A the man’s cross-sectional area. This drag force is at all times proportional to the square of the speed, aligned with the velocity vector, and oriented in the direction opposite to this vector. During free fall, the equation of motion is m¨r = mg¯ı1 − 1/2 Cd ρA vv; as before, the bungee cord will become taut when r(τt ) = 1. Because the governing differential equation is now a nonlinear differential equation, a numerical process must be used for its solution and time τt must be determined numerically. A closed form analytical solution such as that given by eq. (3.36) no longer exists. When the bungee cord is taut, the differential equation governing the problem becomes ¯ ln(1 + Δ) 1 ¯r = g¯ ¯ı1 − k¯0 μ ¯ C (¯ x ¯ ı + x ¯ ¯ ı ) − v¯12 + v¯22 (¯ v1 ¯ı1 + v¯2 ¯ı2 ), 1 1 2 2 d ¯2 2 (1 + Δ) where μ ¯ = ρAd0 /m. Here again, the equation of motion is nonlinear, and its solution can be obtained only by means of numerical methods 3.2.4 The principle of impulse and momentum The principle of impulse and momentum involves two sets of new quantities. First, the linear and angular momentum vectors of a particle are introduced; the angular momentum is the moment of the linear momentum vector. Next, the linear and angular impulse vectors of the externally applied forces are introduced. Principle of linear impulse and momentum Figure 3.13 shows a particle of mass m in motion with respect to an inertial frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. The inertial velocity vector of the particle is denoted v. The linear momentum vector of a particle is deſned as the product of its mass by its inertial velocity vector p = mv. (3.39) Taking a time derivative of the linear momentum vector yields p˙ = ma. Comparing this result with Newton’s second law, eq. (3.4), leads to F = p. ˙
(3.40)
3.2 Conservative forces
75
This result implies that the time derivative of the linear momentum vector of a particle equals the sum of the externally applied forces. Clearly, this result is a direct corollary of Newton’s second law. It is interesting to integrate the above equation in i3 p = mv time, between an initial and a ſnal time, denoted ti and tf , respectively. These two instants are chosen arbitrarv h = ~r mv m ily, but ti < tf , r % tf % tf I i2 F (t) dt = p˙ dt = p(tf ) − p(ti ). (3.41) O ti
i1 Fig. 3.13. Linear and angular momenta vectors of a particle.
ti
The term on the left-hand side is called the linear impulse of the externally applied forces, and has units of mass times velocity, or N·s. Equation (3.41) expresses the principle of linear impulse and momentum for a particle.
Principle 3 (Principle of linear impulse and momentum for a particle) The linear impulse of the externally applied forces equals the change in linear momentum. In the absence of external forces, this principle implies p(tf ) = p(ti ), i.e., the linear momentum remains constant at all times, since ti and tf are instants chosen arbitrarily. In other words, the linear momentum vector of a particle remains a constant when the externally applied forces vanish. Principle of angular impulse and momentum Next, the moment of the particle’s linear momentum vector is computed with respect to point O. This quantity if more often called the angular momentum vector of the particle, hO , where the subscript, (·)O indicates that the angular momentum is computed with respect to point O. As illustrated in ſg. 3.13, the moment of the linear momentum vector is expressed as the cross product of the particle’s inertial position vector, r, by its linear momentum vector, mv, to ſnd hO = r mv.
(3.42)
Taking a time derivative of the angular momentum vector yields h˙ O = r˙ mv + rma. The time derivative of the inertial position vector, r˙ , equals the inertial velocity vector, v, eq. (3.1); it then follows that h˙ O = vmv + rma. Finally, since v mv = 0, the time derivative of the angular momentum vector reduces to h˙ O = rma The moment of Newton’s second law computed with respect to the origin of the inertial frame implies rF = rma. Comparing these two results then leads to rF = h˙ O , where the left-hand side term can be interpreted as the moment of the externally applied forces evaluated with respect to point O, denoted M O . In summary, M O = h˙ O
(3.43)
76
3 Basic principles
This result implies that the time derivative of the angular momentum vector of a particle computed with respect to an inertial point equals the sum of the externally applied moments computed with respect to the same point. Here again, this result is a direct corollary of Newton’s second law. As in the case of the linear momentum, the above equation can be integrated in time between two arbitrary instants to yield %
%
tf
tf
M O (t) dt = ti
h˙ O dt = hO (tf ) − hO (ti ).
(3.44)
ti
The term on the left-hand side is called the angular impulse of the externally applied forces, and has units of N·m·s. Equation (3.44) expresses the principle of angular impulse and momentum for a particle. Principle 4 (Principle of angular impulse and momentum for a particle) The angular impulse of the externally applied forces equals the change in angular momentum when both angular impulse and momentum are computed with respect to the same inertial point. In the absence of external moments, this principle implies hO (tf ) = hO (ti ), i.e., the angular momentum remains constant at all times. In other words, the angular momentum vector of a particle remains a constant when the externally applied moments vanish. Example 3.2. Particle in a pinned tube Figure 3.14 depicts a particle of mass m connected to inertial point A by means of a spring of stiffness k and dashpot of constant c. At the initial time, the particle is located at θ = 0, φ = π/2, and r = r0 , which corresponds to the un-stretched conſguration of the spring; r, φ, and θ form a spherical coordinate system, see section 2.7.2. The initial velocity vector of the particle is v 0 . Derive the equations of motion of the system.
i3
f
A
c
k r I O i1
q
e3 m e2
i2 e1
Fig. 3.14. Particle subjected to a central force due to a spring and dashpot.
First, Newton’s second law is used to obtain the desired equations of motion as
3.2 Conservative forces
77
m (¨ r − rφ˙ 2 − rθ˙2 sin2 φ)¯ e1 + (rφ¨ + 2r˙ φ˙ − rθ˙2 sin φ cos φ)¯ e2 + (rθ¨ sin φ + 2r˙ θ˙ sin φ + 2rφ˙ θ˙ cos φ)¯ e3 = − [k(r − r0 ) + cr] ˙ e¯1 , where the components of the acceleration vector in the spherical coordinate system are given by eq. (2.95c). Projecting this equation along the unit vectors e¯1 , e¯2 , and e¯3 , ¨ r˙ φ−r ˙ θ˙2 sin φ cos φ = then yields m(¨ r −rφ˙ 2 −rθ˙2 sin2 φ) = −k(r −r0 )−cr, ˙ rφ+2 0, and rθ¨ sin φ + 2r˙θ˙ sin φ + 2rφ˙ θ˙ cos φ = 0, respectively. These three nonlinear differential equations can be solved for the coordinates of the particle, r, φ, and θ. Although this approach will indeed yield the solution of the problem, much information about the nature of the particle’s motion can be obtained from the application the principle of angular impulse and momentum. Because the line of action of the forces applied to the particle passes through point A, the moment of these forces with respect to point A vanishes. The principle of angular impulse and momentum, eq. (3.44), then implies that the angular momentum must remain constant, H A = H A0 = r0ı1 mv 0 . It follows that r e1 mv = H A0 . This vector product equation, see section 1.1.11, affords a solution if and only if the particle’s position vector, r¯ e1 , and velocity vector, v, are both contained in the plane normal to the initial angular momentum vector, H A0 . Because the particle’s position and velocity vectors are contained in the same plane, the particle’s motion is contained entirely in the plane normal to the angular momentum vector. This result is quite general: if a particle is subjected to forces with a line of action passing through a ſxed inertial point, its trajectory is contained in the plane normal to the initial angular momentum vector. In particular, if a particle is subjected to a central force, its trajectory lies in the plane normal to the initial angular momentum vector. In the present example, the force associated with the elastic spring is a central force, whereas that associated with the dashpot is not. The solution of the problem is now considerably simpliſed. Without loss generality, axis ¯ı3 is selected to be along H A0 and hence, φ = π/2, φ˙ = 0, v 0 = r˙0¯ı1 + r0 θ˙0¯ı2 , and H A0 = mr02 θ˙0¯ı3 . The constancy of the angular momentum then ˙ ı3 = mr2 θ˙0¯ı3 , or r2 θ˙ = r2 θ˙0 . The ſrst equation of motion now implies H A = mr2 θ¯ 0 0 becomes m(¨ r −rθ˙2 ) = −k(r−r0 )−cr, ˙ whereas the last two are identically satisſed. It is convenient to introduce the following parameters: Ω = k/m, the frequency of the spring mass system, ζ = cm/(2Ω), the damping ratio of the dashpot and τ = Ωt, the non-dimensional time. The equations of motion then reduce to r − rθ2 = −(r − r0 ) − 2ζr , and 2 r θ = r02 θ0 , where notation (·) indicates a derivative with respect to τ . Finally, the non-dimensional position of the particle is introduced, r¯ = r/r0 , and the equations of motion simply become r¯ = θ02 /¯ r3 − (¯ r − 1) − 2ζ r¯ and θ = θ0 /¯ r2 , respectively; T the initial conditions are r¯(t = 0) = 1, r¯ (t = 0) = (¯ı1 v 0 )/(Ωr0 ), and θ0 = (¯ıT2 v 0 )/(Ωr0 ). Example 3.3. Particle sliding on a helix Consider the motion of a particle sliding without friction along the helix depicted in ſg. 2.3. Gravity acts down, in the opposite direction of axis ¯ı3 . Since the particle is
78
3 Basic principles
constrained to move along the helix, a constraint force is applied to the particle. This force acts in the plane normal to the curve, i.e., it has components along the normal and binormal vectors, but not along the tangent vector. Newton’s second law then implies −mg¯ı3 + Fn n ¯ + Fb¯b = m(¨ st¯ +
s˙ 2 n ¯ ), ρ
where Fn and Fb are the components of the reaction force in the normal and binormal directions, respectively, and the components of the acceleration vector are expressed in terms of path coordinates, eq. (2.34). The application of Newton’s second law requires the consideration of all externally applied forces acting on the particle, including the reaction forces. Projecting this equation along the tangent direction and using eq. (2.25) yields kg s¨ = −g t¯T ¯ı3 = − √ , a2 + k 2 The√particle slides down the helix acted upon by an “apparent gravity,” kg/ a2 + k 2 . The equation of motion is expressed in terms of the curvilinear coordinate s; it can be readily modiſed to be expressed in terms of parameter √ η deſned in ſg.√ 2.3. Indeed, using the results established in example 2.1, s˙ = η˙ a2 + k 2 and s¨ = η¨ a2 + k 2 . It follows that η¨ = −kg/(a2 + k 2 ). Projecting Newton’s law along the normal and binormal vectors yields Fn =
mas˙ 2 , a2 + k 2
mga Fb = √ , a2 + k 2
respectively. The normal component of the constraint force stems from the normal component of acceleration. Because the component of acceleration in the binormal direction vanishes, the corresponding component of the constraint force is solely due to the gravity component in that direction. Example 3.4. Particle sliding on a spherical surface Consider the motion of a particle sliding on the spherical surface depicted in ſg. 2.10. Gravity acts down, in the opposite direction of axis ¯ı3 . Since the particle is constrained to move on the spherical surface, a constraint force is applied to the particle. This force acts in the direction normal to the surface, i.e., it has a single component along the surface normal. Using the surface coordinates introduced in section 2.5 for a sphere, see example 2.4, Newton’s second law states that s˙ 2 s˙ 1 s˙ 2 s˙ 2 + s˙ 22 −mg¯ı3 + Fn n n ¯ , (3.45) ¯ = m (¨ s1 − 2 )¯ e1 + (¨ s2 + )¯ e2 − 1 T2 T2 R where Fn is the magnitude of the reaction force in the normal direction. For a sphere, the following results were derived in example 2.4, 1/R1 = 1/R2 = −1/R and
3.2 Conservative forces
79
1/T1 = 0. Projecting this equation along unit vectors e¯1 and e¯2 yields s¨1 − s˙ 22 /T2 = g sin η1 and s¨2 + s˙ 1 s˙ 2 /T2 = 0, respectively. The equations of motion are expressed in terms of the curvilinear coordinates, s1 and s2 , but can be readily modiſed to be expressed in terms of the surface coordinates η1 and η2 . Indeed, s˙ 1 = Rη˙ 1 and s˙ 2 = Rη˙ 2 sin η1 . Introducing the expression for the twist of the spherical surface, eq. (2.66), then yields η¨1 − η˙22 sin η1 cos η1 =
g sin η1 , R
η¨2 sin η1 + 2η˙ 1 η˙ 2 cos η1 = 0.
Projecting Newton’s law along the normal vector yields s˙ 21 + s˙ 22 . Fn = m g cos η1 − R
(3.46)
(3.47)
The magnitude of the constraint force stems from the normal component of acceleration and from the component of the gravity force in that direction. For small motions of the particle near the lowest point on the sphere, i.e for η1 = π + ηˆ1 , ηˆ1 1. The equations of motion can be linearized as η¨ ˆ1 + g ηˆ1 /R = 0, the well known equation governing the small amplitude motion of a pendulum under gravity. The same results could have been obtained using spherical coordinates, see section 2.7.2, instead of surface coordinates; the fact that the particle is moving on the surface of a sphere then implies r˙ = 0. 3.2.5 Problems Problem 3.1. Simple spring mass system Consider a simple spring mass system: a particle of mass m is connected to a spring of stiffness k and a gravity ſeld with an acceleration g is acting on the system. At time t0 , the system is at rest and the spring is un-stretched. Consider the following two scenarios. Scenario 1: the mass is released from rest and oscillates freely thereafter. Scenario 2: the mass is slowly brought to its static equilibrium position. (1) Find the maximum displacement of the particle for scenario 1. (2) Find the maximum displacement of the particle for scenario 2. (3) If there exist any difference in the maximum displacements for scenarios 1 and 2, give work and energy arguments to justify the discrepancy.
Problem 3.2. Work done by conservative forces Prove that the work done by a conservative force applied to a particle between times ti and tf is independent of the path of the particle during that time.
Problem 3.3. Is a constant force a conservative force? Is a constant force a conservative force? If yes, ſnd the potential of this force.
Problem 3.4. Particle subjected to friction forces Consider a particle of mass m = 1 kg subjected to a friction force F f = −kvv, where k is the friction coefſcient and v = v. The particle is also subjected to gravity forces (g = 9.81 m/s2 ), see ſg. 3.15. At time t = 0, the particle is launched with an initial speed v0 = 100 m/s with an angle θ = 30 deg with respect to the horizontal. (1) Write the equations of motion for
80
3 Basic principles
the particle. (2) Solve these equations for k = 0, 0.001, and 0.002 kg/m. (3) Plot the trajectory of the particle for the three cases on the same graph. (4) Determine the distance d and the maximum height h from the computed trajectory. (5) Plot d and h as a function of the friction coefſcient k ∈ [0, 0.003] kg/m.
i2
i2 v0
Fd q
m g
x g
h i1
O W
d Fig. 3.15. Particle subjected to friction.
c
m k
f i1
Fig. 3.16. Particle in a slot on a rotating disk.
Problem 3.5. Particle in a slot on a rotating disk
Figure 3.16 depicts a disk rotating in a vertical plane at a constant angular speed, φ˙ = Ω, around inertial point O. Mass m is free to slide in a radial slot on the disk and is connected to the center of the disk by means of a spring of stiffness constant k and a dashpot of constant c. The system is subjected to gravity and a torque, Q, is applied to the disk. The spring’s unstretched length is denoted x0 . (1) Derive the equation of motion of the system in terms of distance x from point O to the particle. (2) Find the horizontal and vertical components of the reaction force at point O. (3) Find the applied torque, Q, required to maintain this constant angular speed.
Problem 3.6. Free falling parachute Figure 3.17 shows a payload of mass m = 120 kg attached to a parachute. The payload is dropped from an altitude h = 1000 m with a horizontal velocity of magnitude v0 = 100 m/s. The payload is subjected to a drag force F d = −1/2 Cd ρA vv, where Cd = 1.42 is the drag coefſcient, ρ = 1.23 kg/m3 the air density, A = πD2 /4 the cross-sectional area of the parachute, and D its diameter. The velocity vector is denoted v and the speed is v = v. The payload is also subjected to gravity forces (g = 9.81 m/s2 ), see ſg. 3.17. (1) Write the equations of motion for the payload. (2) Solve these equations numerically for parachutes of diameter D = 3, 4, and 6 m. (3) Plot the horizontal position of the payload for the three cases on the same graph. (4) Plot the vertical position of the payload for the three cases on the same graph. (5) Plot the horizontal velocity of the payload for the three cases on the same graph. (6) Plot the vertical velocity of the payload for the three cases on the same graph. (7) Find an analytical expression for the constant horizontal velocity that is eventually reached by the payload. (8) Find an analytical expression for the constant vertical velocity that is eventually reached by the payload. (9) Based on this constant vertical velocity, ſnd an analytical expression for the time it takes for the payload to reach the ground. (10) Compute the time to reach the ground as a function of parachute diameter D ∈ [3, 6] m. On the same graph, plot the numerical and analytical solutions. (11) Compute the ſnal vertical velocity as a function of parachute diameter D ∈ [3, 6] m. On the same graph, plot the numerical and analytical solutions
Problem 3.7. Pendulum under gravity forces Consider a pendulum with a bob of mass m = 1.5 kg, length = 0.75 m and subjected to gravity forces (g = 9.81 m/s2 ). The pendulum is released from rest with θ = 0, see ſg. 3.18.
3.2 Conservative forces
81
v0
i2
Fd
i1 q
m
h
g g
i1
m i2
d Fig. 3.17. Parachute subject to drag force.
Fig. 3.18. Pendulum under gravity forces.
(1) Write the equations of motion for the pendulum. (2) Solve these equations numerically. (3)Plot the angular motion θ and angular velocity θ˙ as functions of time on two separate graphs. (4) Compute and plot the tension in the cord as a function of time. (5) On one graph, plot the kinetic energy, potential energy, and total mechanical energy of the system versus time. (6) Plot the total mechanical energy of the system versus time. Does it remain constant? Comment on your results
Problem 3.8. Inverted pendulum Consider an inverted pendulum with a bob of mass m = 1 kg. A massless, rigid bar of length = 1 m supports the bob. Gravity, g = 9.81 m/s2 , acts in the direction indicated on ſg. 3.19. A torsional spring of stiffness k = 10 N·m/rad is located at point O and applies a moment M = −kθ on the rigid bar. The pendulum is released from θ = 0, with an initial speed v0 = 2 m/s to the right, see ſg. 3.19. (1) Write the equations of motion for the system. (2) Solve these equations numerically. (3) Plot the angular motion θ as a function of time. (4) Plot the angular velocity θ˙ as a function of time. (5) Plot the load in the rigid bar as a function of time. (6) On one graph, plot the kinetic energy, potential energy, strain energy, and total mechanical energy of the system versus time. (7) On one graph, plot the total mechanical energy of the system versus time. Does it remain constant? Comment on your results. (8) Consider two states ot the system: the initial conſguration, (θ = 0), and a ſnal conſguration, (θ = θf ), where the angle θf is maximum. Find the maximum angular deƀection, θf . Check your answer against the numerical simulation.
i2 q l k O
m
Present configuration
i2 r c
q v0
k m
g i1
O r0
Fig. 3.19. Inverted pendulum under gravity forces.
i1 Initial configuration
Fig. 3.20. Particle connected to the ground by a spring and damper.
82
3 Basic principles
Problem 3.9. Particle connected to the ground with spring and damper A particle of mass m is connected to the ground by a spring of stiffness k and a damper of constant c. The initial conſguration of the system is indicated on ſg. 3.20, and the initial velocity vector is v 0 . The following quantities are deſned: Ω 2 = k/m and c = 2mΩζ. For this problem, it is convenient to use the polar coordinate system indicated on the ſgure. (1) Set up the equations of motion of the system. (2) Plot r¯ = r/r0 as a function of the nondimensional time τ = Ωt for τ ∈ [0, 20π]. (3) Plot θ(τ ). (4) Plot the trajectory of the particle ˙ ¯ ) = θ/Ω. in space. (5) Plot the history of the non-dimensional angular velocity Ω(τ (6) Plot the history of the components of the velocity vector in the inertial frame, v¯x = vx /(Ωr0 ) and v¯y = vy /(Ωr0 ). (7) Plot the history of the non-dimensional total mechanical energy of ¯ ) = E/(mΩ 2 r02 ); comment your result. (8) Compute the non-dimensional the system E(τ ¯ (τ ) = W/(mΩ 2 r02 ). (9) Plot the history of cumulative energy dissipated in the damper W ¯ )+W ¯ (τ ); comment your result. Use the following data: ζ = 0.05; at time the quantity E(τ τ = 0, v 0 /(Ωr0 ) = 1.2 ¯ı1 + 0.8 ¯ı2 , θ0 = 0 and r(t = 0)/r0 = 1; the un-stretched length of the spring is r0 .
Problem 3.10. Particle sliding along a curve A particle of mass m freely slides along a given curve C in three-dimensional space. A point on the curve has a position vector p0 (s). Find the equation of motion for the particle if it is subjected to externally applied forces F (t). What are the components of the constraint force acting on the particle.
Problem 3.11. Particle sliding along a helix A particle slides along a helix and is subjected to a gravity force acting along the ¯ı1 direction, see ſg. 2.3. Find the equation of motion for the particle in terms of the parameter η. If the √ initial condition at t = 0 are η = 0 and η˙ = v0 / a2 + k2 , ſnd the minimum value of v0 such that the particle proceeds along the helix with η˙ > 0 at all time.
Problem 3.12. Particle sliding along a circular ring Figure 3.21 depicts a particle of mass m sliding along a circular ring under the effect of gravity. The ring rotates on two bearing about an axis parallel to ¯ı3 ; a torque Q(t), acting about axis ¯ı3 , is applied to the ring. (1) Find the equations of motion for the particle. (2) Write the expression for the potential of the gravity forces. (3) Write the expression for the kinetic energy of the particle. (4) Write the expression for the work done by the applied torque Q(t).
Problem 3.13. Particle sliding along a circular ring Figure 3.21 depicts a particle of mass m sliding along a circular ring under the effect of gravity. The ring rotates on two bearing about an axis parallel to ¯ı3 ; a torque Q(t), acting about axis ¯ı3 , is applied to the ring. (1) Find the equations of motion for the particle based on the principle of impulse and momentum.
Problem 3.14. Particle in a massless tube Figure 3.22 shows a particle of mass m sliding in a massless tube is connected to a spring of stiffness kr and a damper of constant cr . The un-stretched length of the spring is r0 . A spherical coordinate system r, φ and θ with corresponding unit vectors e¯1 , e¯2 and e¯3 will be convenient to use. The spring/damper assembly is attached to the ground at point A by means of a joint that allows rotation about axis e¯3 . This joint features a torsional spring of stiffness kφ and a torsional damper of constant cφ . The torsional spring is un-stretched when φ = π/2. The angle θ has a prescribed schedule θ(t) = ωt. The following quantities are deſned: the non-dimensional time τ = ωt, the axial spring frequency Ωr = kr /m, and
3.2 Conservative forces
i3
i3
kf cf
Q(t)
g f
kr r i2
e3 m
O I
q
i2 e2
i1
i1
Fig. 3.21. Particle sliding on a circular ring.
f cr
A
m
r I
83
q
e1
Fig. 3.22. Particle connected to the ground by a spring, damper and revolute joint.
its critical damping ratio ζr = cr /(2mΩr ), the torsional spring frequency Ωφ = kφ /mr02 2 and its critical damping ratio ζφ = cφ /(2mr0 Ωφ ). (1) Set up the equations of motion of the system. (2) Plot r¯ = r/r0 as a function of the non-dimensional time τ ∈ [0, 20π]. (3) Plot φ(τ ). (4) Plot the trajectory of the particle in three-dimensional space. (5) Plot the history of the non-dimensional force F¯3 = F3 /(mr0 ω 2 ) that the massless tube applies on the particle. (6) Plot the history of the non-dimensional total mechanical energy of the sys¯ ) = E/(mr02 ω 2 ); comment your result. (7) Plot the history of the cumulative nontem E(τ ¯ d (τ ) = W d /(mr02 ω 2 ). (8) Plot the history dimensional energy dissipated in the dampers W ¯ M = W M /(mr02 ω 2 ) done by the torque required to of cumulative non-dimensional work W ¯ )+W ¯ d (τ ) − W ¯ M (τ ); comment prescribe θ(t) = ωt. (9) Plot the history of the quantity E(τ ¯ ¯ your result. Use the following data: Ωr = Ωr /ω = 5, ζr = 0.05; Ωφ = Ωφ /ω = 1.5, ζφ = 0.05; at time τ = 0, v 0 /(ωr0 ) = 0.6 ¯ı1 + 1 ¯ı2 + 0.75 ¯ı3 , φ0 = π/2 and r/r0 = 1.
Problem 3.15. Particle moving on a track Figure 3.23 shows particle of mass m moving on a track deſned by a curve C while constrained to remain within a slot inside a massless arm. The massless arm is prescribed to move at a constant angular speed, θ˙ = Ω. (1) Plot the radial location, r/R, of the particle as a function of θ. (2) Plot the moment m ¯ = M/(mR2 Ω 2 ) necessary to drive the system at a constant angular speed. (3) Plot the non-dimensional normal force F¯n = Fn /(mRΩ 2 ) the curved track applies on the particle. (4) Determine the minimum stiffness of the spring, ¯ for which the particle remains on the track i.e., the minimum non-dimensional frequency Ω, at all times. The curve is deſned in the polar coordinate system as p0 (θ) = r(θ)¯ e1 , where r(θ) = R − b cos N θ. It will be convenient to deſne the normal to the curve as n ¯ = ı3 t¯, where t¯ is the tangent to the curve. Use the following data: ¯b = b/R = 0.25; N = 6; ¯ = ω/Ω = 3. The spring is un-stretched when r = 0. At time t = 0, θ = 0. ω 2 = k/m; Ω
Problem 3.16. Particle moving on a track Figure 3.23 shows particle of mass m moving on a track deſned by a curve C while constrained to remain within a slot inside a massless arm. A moment, M , is applied to the arm ˙ at point O. (1) Plot the time history of the angle θ. (2) Plot the angular speed θ/ω. (3) Plot the normal force F¯n = Fn /(mRω 2 ) the curved track applies on the particle. (4) Plot the ¯ = E/(mR2 ω 2 ). Discuss your results. (5) Comtotal mechanical energy of the system, E ¯ 0 = 0.75 and 0.80. Explain your results. The curve pare the responses of the system at M
84
3 Basic principles
e1 , where r(θ) = R − b cos N θ. is deſned in the polar coordinate system as p0 (θ) = r(θ)¯ It will be convenient to deſne the normal to the curve as n ¯ = ı3 t¯, where t¯ is the tangent to the curve. Use the following data: ¯b = b/R = 0.25; N = 6; ω 2 = k/m. The spring is un-stretched when r = 0. At time t = 0, θ = 0 and θ˙ = 0. The applied moment is given as ¯ = M/(mR2 ω 2 ) = M ¯ 0 (1 − cos τ ) for τ ≤ 2π and m M ¯ = 0 for τ > 2π, where τ = ωt is ¯ 0 = 0.75. Simulate the system for τ ∈ [0, 6π]. the non-dimensional time and M
i2 e1
k m
C
A
r
h q
e2 O
i1
Fig. 3.23. Particle moving on a track.
i2 g k I O
m
i1
Fig. 3.24. Particle connected to a spring with unilateral contact to a horizontal plane.
Problem 3.17. Particle with unilateral contact The particle of mass m depicted in ſg. 3.24 is subjected to a gravity ſeld of acceleration g and is connected to a spring of stiffness k and un-stretched length h. The spring is attached to inertial point A, located a distance h above point O. A unilateral contact condition is imposed on the particle by the horizontal plane P = (O, ¯ı2 ); this means that the particle can only move in the half-space above this plane. At the initial time, the particle is at point O and has a velocity v(t = 0) = v0¯ı1 . (1) Write the equation of motion for the particle while it is in contact with the plane. (2) Plot the non-dimensional position of the particle as a function of nondimensional time. (3) Plot the non-dimensional velocity of the particle as a function of nondimensional time. (4) Find the time at which the particle leaves the plane and its corresponding position and velocity. (5) Under what condition will the particle always remain on the plane for any magnitude of the initial velocity v0 ? (6) Find the time at which the particle will ſrst hit the plane after leaving it. (7) Plot the trajectory of the particle during its free ƀight. Use the following data: v¯0 = v0 /(ωh) = 1; g¯ = mg/(kh) = 0.25; ω 2 = k/m. Use the following non-dimensional time τ = ωt. All lengths are non-dimensionalized by h, velocities by ωh.
Problem 3.18. Particle moving on a parabolic surface of revolution Figure 2.11 shows a particle sliding on a parabolic surface of revolution and subjected to a gravity force acting along the negative ¯ı3 direction. This surface is deſned by the position vector of one of its points, p0 = r cos φ ¯ı1 +r sin φ ¯ı2 +ar 2¯ı3 , where r ≥ 0 and 0 ≤ φ ≤ 2π. The following notation was used η1 = r and η2 = φ. (1) Find the equation of motion for the particle in terms of the surface coordinates r and φ. (2) Find the constraint force acting on the particle.
Problem 3.19. Particle sliding on a linear spiral A particle of mass m is sliding along a linear spiral, as deſned in example 2.2, under the effect of gravity acting down along the ¯ı2 axis, see ſg. 2.4. (1) Derive the governing equation of motion using Newton’s law. (2) Plot the angle θ as a function of time. (3) Plot the θ˙ as a function of time. (4) Plot the time history of the magnitude of the normal reaction force that the spiral applies to the particle. (5) On one graph, plot the time history of the kinetic energy
3.3 Contact forces
85
of the particle, its potential energy, and its total mechanical energy. (6) Derive the governing equation of motion of the particle from the principle of work and energy. Use the following data: m = 2.5 kg; a = 0.2 m; g = 9.81 m/s2 . At time t = 0, θ = 0 and θ˙ = 50 rad/s. Present all your results for t ∈ [0, 15] s.
Problem 3.20. Bungee jumping A man of mass m is jumping off a bridge while attached to a bungee cord of un-stretched length d0 , as described in example 3.1. An inertial frame, F = [O, I = (¯ı1 , ¯ı2 )], is attached to the bridge. The man is jumping from point O with an initial velocity, v0 , oriented along horizontal axis ¯ı2 , and the acceleration of gravity is acting along vertical axis ¯ı1 . In the developments presented in example 3.1, the effect of air friction on the man was ignored. In this problem, these forces will be taken into account in an approximate manner applying to the man a drag force, F d = −1/2 Cd ρA vv, where Cd is the non-dimensional drag coefſcient, ρ the air density, and A the man’s cross-sectional area. This drag force is at all times proportional to the square of the speed, aligned with the velocity vector, and oriented in the direction opposite to the velocity vector. (1) Derive the equation of motion for the free fall portion of the man’s trajectory. Solve the equations numerically to ſnd the time τt at which the bungee becomes taught. (2) Derive the equations of motion once the bungee is taut. Solve the equations numerically. (3) On one graph, plot the components x ¯1 and x ¯2 of the man’s position vector as functions of τ . (4) Plot the trajectory of the man. (5) On one graph, plot the components v¯1 and v¯2 of the velocity vector as functions of τ . (6) Plot the stretch of the bungee as a function of τ . (7) On one graph, plot the non-dimensional kinetic energy, ¯ = K/(mv02 ), potential energy, V¯ = V /(mv02 ), and total potential energy E ¯ =K ¯ + V¯ . K (8) Determine the non-dimensional time at which the bungee becomes slack again. Use the ¯2 = x2 /d0 ; v¯1 = v1 /v0 , v¯2 = v2 /v0 ; following non-dimensional quantities: x ¯1 = x1 /d0 , x use the non-dimensional time τ = v0 t/d0 . Use the following data: g¯ = gd0 /v02 = 10, ¯0 = k0 d20 /(mv02 ) = 60, Cd = 0.47 and μ ¯ = ρAd0 /m = 0.03. Present all your results for k τ ∈ [0, 3.5τt ].
3.3 Contact forces When dealing with particle dynamics, it is often the case that the particle is in contact with another body. Contact can be of a continuous nature; for instance, a particle is moving while in continuous contact with a curve or a surface, see example 3.3 or 3.4, respectively. Contact could also be of an intermittent nature, such as, for instance, the impact of a particle on an obstacle. These contact forces are forces acting on the particle, which must therefore be included in the statement of Newton’s second law when studying the dynamic response of the particle. Both magnitude and direction of these forces must be studied to properly state Newton’s second law. The kinematics of contact of a particle with a surface and a curve will be studied ſrst in sections 3.3.1 and 3.3.2, respectively; contact forces are categorized into normal and tangential contact forces. Next, the magnitudes of these forces will be studied in section 3.3.3. Typically, constitutive laws are postulated that relate the magnitude of the contact forces to contact parameters. For instance, Coulomb’s friction law relates the friction force to both normal force and relative velocity of the particle with respect to surface it is in contact with.
86
3 Basic principles
3.3.1 Kinematics of particles in contact with a surface Figure 3.25 depicts a particle of mass m in continuous contact with surface S at point P. The differential geometry of a surface was studied in details in section 2.4, and it is assumed here that parameters η1 and η2 deſne the lines of curvature presented section 2.4.5. Unit vectors e¯1 and e¯2 given by eq. (2.55) deſne the plane tangent to the surface at point P. The normal to the surface is now deſned as n ¯ = e1 e¯2 and E = (¯ e1 , e¯2 , n ¯ ) forms an orthonormal basis. n
n S
e2 h2
P
F Tangent plane
n
Fn
e2 F2t
e1
F2t F1t
h1
Surface’s free body diagram
e1
m F1t Particle’s free body diagram
Fig. 3.25. Particle moving on a surface.
The contact force, F c , between the particle and the surface is conveniently divided into two components, the normal contact force, F n = F n n ¯ , which acts along the normal to the surface, and the tangential contact force, F t = F1t e¯1 +F2t e¯2 , which acts in the plane tangent to the surface. Hence, the contact force is written as F c = F n + F t = F nn ¯ + (F1t e¯1 + F2t e¯2 ).
(3.48)
Imagine ſrst that the particle slides over the surface without any friction: the tangential contact forces vanish in eq. (3.48). Further assume that the surface on which the particle slides is a plane. If the particle slides on this plane under the effect of externally applied forces acting in the same plane, F a = F1a e¯1 +F2a e¯2 , the normal contact force also vanishes. Indeed, Newton’s law now reduces to F1a e¯1 + F2a e¯2 + 0¯ n = m(¨ x1 e¯1 + x ¨2 e¯2 + 0¯ n): both externally applied forces and accelerations vanish along the normal direction. Consider now the same particle sliding on a curved surface under the effect of external forces applied in the plane tangent to the surface; Newton’s law now becomes F1a e¯1 + F2a e¯2 + F n n ¯ = m(a1 e¯1 + a2 e¯2 + an n ¯ ), where the acceleration components are given by eq. (2.68). Since the particle has to follow the curvature of the surface, the acceleration component in the normal direction, an , does not vanish, and the normal contact force, F n , is necessary to equilibrate the corresponding inertial forces. The normal contact force can be interpreted as the constraint force that constrains the particle to remain on the surface. If the interface between the particle and the surface is rough, friction forces acting in the plane tangent to the surface will appear in addition to the normal contact
3.3 Contact forces
87
force. Figure 3.25 also gives free body diagrams for the particle and surface. Force components F n , F1t , and F2t are the forces the surface applies to the particle. Note that according to Newton’s third law, the particle applies equal and opposite forces to the surface. 3.3.2 Kinematics of particles in contact with a curve Figure 3.26 shows a common situation where a particle moves along a curve. This would be, for instance, the case of a train moving along its rails, or of a roller coaster car moving along its track. The differential geometry of a curve was studied in details in section 2.2. At point P of the curve, it is possible to deſne Frenet’s triad consisting of the tangent, normal, and binormal vectors, denoted t¯, n ¯ , and ¯b, respectively. b
b s
b
P m
F
Fb m
t
F
n
F
t n C
n
t
F
n
F Curve’s free body diagram
t
Particle’s free body diagram
Fig. 3.26. Particle moving on a curve.
The contact force, F c , between the particle and the curve is conveniently divided into two components, the normal contact force, F n = F n n ¯ + F b¯b, which acts in the plane normal to the curve, and the tangential contact force, F t = F t t¯, which acts along the tangent to the curve. Hence, the contact force is written as F c = F t t¯ + (F n n ¯ + F b¯b).
(3.49)
Note the different expressions for the contact forces between a particle and a surface, eq. (3.48), and those between a particle and a curve, eq. (3.49). For a surface, the normal contact force has a single component along the normal to the surface, whereas for a curve, the normal contact force has two components along the normal and binormal to the curve. On the other hand, for a surface, the tangential contact force has two components in the plane tangent to the surface, whereas for a curve, the tangential contact force has a single component along the tangent to the curve. It is important to distinguish the difference between unilateral and bilateral contact. For instance, a train is in unilateral contact with its rails: the train cannot go through the rails, nothing, however, prevents the train from moving off the rails in the upward direction. Of course, gravity forces are, in general, sufſcient to keep the train on its rails. This contrasts with roller coasters: in this case, cars are connected to the track by a set of wheels that prevent them from running off track in any direction.
88
3 Basic principles
When dealing with unilateral contact, it is often important to determine when a particle will loose contact with the surface or curve. Consider, at ſrst, the case of a particle on a surface and assume the particle can freely move in the direction of the normal to the surface. In that case, F n is positive when the particle is on the surface and the unilateral contact condition cannot support a negative normal force. Clearly, the particle is about to leave the surface when F n = 0. In the case of a particle on a curve, the normal to the curve is always pointing to the concave side of the curve: the normal ƀips direction at an inƀection point of the curve. Due to this discontinuity, the condition F n = 0 must be applied with care when dealing with particle moving along a curve. For more details about the complex problems associated with unilateral contact conditions can be found in the textbook by Pfeiffer [6]. 3.3.3 Constitutive laws for tangential contact forces The tangential contact forces are friction forces between the particle and surface or curve it moves on. Coulomb’s friction law is commonly used to evaluate the friction forces, and sometimes, friction forces are assumed to be of a viscous type. Coulomb’s friction law Coulomb’s friction law has been extensively used to model friction forces. It postulates that the friction force between the particle and surface is proportional to the absolute value of the normal contact force. The empirical coefſcient of proportion0.4 ality, μk , is called the coefſcient of kinetic 0.3 m friction. The friction force always acts in 0.2 the direction opposing the relative velocity 0.1 of the particle with respect to the surface, FRICTION FORCE [N]
k
0
v0
F f = −μk F n
-0.1 -0.2
v rel , v rel
(3.50)
where v rel is the relative velocity of the particle with respect to the surface and RELATIVE VELOCITY [m/s] F n = |F n |, see eq. (3.48). Fig. 3.27. Coulomb’s friction law: solid In the case of a particle moving along line; continuous friction law: dashed line. a curve, the relative velocity of the particle with respect to the curve is along the tangent to the curve, i.e., F f = F f t¯, where -mk
-0.3
-0.4 -5
-4
-3
-2
-1
0
1
2
3
4
5
F f = −μk F n sign(vrel ),
(3.51)
where vrel is the speed of the particle with respect to the curve, v rel = vrel t¯, and F n 2 = F n2 + F b2 , see eq. (3.49). Sliding gives way to sticking when the relative velocity vanishes. In that case, the magnitude of the friction force must be smaller than that of the normal contact force times an empirical coefſcient μs , the coefſcient of static friction, or
3.3 Contact forces
F f ≤ μs F n .
89
(3.52)
Figure 3.27 depicts the friction force as a function of the relative velocity, for the simple case of a constant friction coefſcient μk = 0.3 and unit normal force F n = 1. Coulomb’s friction law presents a discontinuity of the friction force at zero relative velocity, as shown in ſg. 3.27. This discontinuity causes numerical difſculties in computer simulations and hence, various approximations to Coulomb’s law have been proposed in the literature [7, 8, 9, 10, 11]. These various approximations can be viewed as continuous friction laws that replace the discontinuity at zero relative velocity by a smooth, rapidly varying function of the relative velocity. A typical expression for continuous friction laws is F f = −μk F n sign(vrel ) (1 − e−|vrel |/v0 ),
(3.53)
where v0 is a characteristic relative velocity typically chosen to be small compared to the maximum relative velocity encountered during the simulation. Figure 3.27 shows the friction force corresponding to the continuous friction law for v0 = 0.5 m/s. The continuous friction law replaces both kinetic and static friction laws. Viscous friction law It is sometimes assumed that friction forces are of a viscous type, i.e., the friction forces are proportional to the relative velocity of the particle with respect to the surface or curve it moves on. The coefſcient of proportionality, c, is called the coefſcient of viscous friction, and hence F f = −cv rel . (3.54) Note that the normal force does not appear in this expression. Coulomb’s friction is sometimes called “dry friction,” as opposed to the present “viscous friction” phenomenon. For cases of friction between a particle and a lubricated surface, a combination of dry and viscous frictions forces is often observed. Both dry and viscous friction laws are approximations to the experimentally observed friction forces. In fact, friction is a very complex phenomenon that involves many, often poorly understood physical processes; the following references give detailed descriptions of the friction process and a wealth of experimental observations Rabinowicz [12], or Oden and Martins [13]. Example 3.5. Particle elastically suspended to a straight track Consider a particle of mass m suspended to a straight track by means of a spring in parallel with a dashpot of constant c, as depicted in ſg. 3.28. The magnitude of the force in the spring, Fs , is a nonlinear function of its stretch, Fs = k1 Δ + k3 Δ3 , where Δ = r − r0 is the stretch of the spring, r the distance from the particle to point A, and r0 the un-stretched length of the spring. The spring-dashpot system is connected at point A to a massless slider that moves along a straight track, which makes an angle α with respect to the horizontal. The motion of the slider along the track is prescribed as s(t) = s0 sin ωt. Considering
90
3 Basic principles
i2
e2 A
g
e1
e2 D
s(t) I
e1
c
F
a
c i1
O x
k
r
- Fsd Fsd
u m
-mg i2 m
Fig. 3.28. Particle on a straight track.
the free body diagram for the particle illustrated on the right portion of ſg. 3.28, the equation of motion of the system is found as −mg¯ı2 + F sd = m¨ x,
where xT = x1 , x2 are the components of the position vector of the particle resolved in the inertial frame F I = [O, I = (¯ı1 , ¯ı2 )], and F sd those of the the force vector applied to the particle by the spring-dashpot system. The force in the spring-dashpot system has a line of action that joins the particle to point A; the unit vector along this line of action is u¯ = [s(t)¯ e1 − x]/r, where r = s(t)¯ e1 − x and e¯1 = cos α ¯ı1 + sin α ¯ı2 is the unit vector along the track. The force vector generated by the spring-dashpot system is now readily expressed as ˙ u¯, where Δ˙ = r˙ = u F sd = [k1 Δ + k3 Δ3 + cΔ] ¯T (s¯ ˙ e1 − x) ˙ is the stretch rate, i.e., the projection of the relative velocity of the two ends of the dashpot, (s¯ ˙ e1 − x), ˙ along unit vector u ¯ pointing from one end of the damper to the other. Because the time history of the slider motion is given, the equation of motion for the particle can be integrated numerically to yield the response of the system. The following physical parameters are used: α = π/6; s0 = 0.45 m; ω = 2 rad/s; m = 1.5 kg; r0 = 0.25 m; k1 = 50 N/m, k3 = 20 kN/m3 ; c = 2.6 N·s/m; and 2 g = 9.81
m/s . At the initial time, the particle is at rest at the following position: T x = 0, −r0 . Figure 3.29 shows the inertial position and velocity of the particle as a function of time. The spring stretch and force, and the damper stretch rate and force are depicted in ſg. 3.31 and 3.32, respectively. The kinetic energy of the system is readily evaluated as K = 1/2 m x˙ T x. ˙ The potential energy of the system consists of two terms, the potential of the gravity forces, mgx2 , and the strain energy in the spring, leading to 1 1 V = mgx2 + k1 Δ2 + k3 Δ4 . 2 4 The second term of the strain energy expression, 1/4 k3 Δ4 , is associated with the nonlinear force term in the spring, k3 Δ3 . Figure 3.30 shows the time histories of the kinetic, potential and total mechanical energies of the system. As expected, the total mechanical energy does not remain constant, because the present system is not conservative.
3.3 Contact forces 4
2 0
SYSTEM ENERGIES [J]
x1 and x2
1
-1
v1 and v2
2
0
0
-2
-4
-6
-2 0
2
4
6
8
10
-8 0
TIME [s]
2
4
6
8
10
TIME [s]
Fig. 3.29. Top ſgure. Particle position: x1 , solid line and x2 , dashed line. Bottom ſgure. Particle velocity: v1 = x˙ 1 , solid line and v2 = x˙ 2 , dashed line.
Fig. 3.30. System energies: kinetic energy: (◦); potential energy: (+); total mechanical energy: ( ).
2
D [N]
dD/dt [m/s]
0.4
0.3
0
0.2
-2
60
5
40
Fd[N]
Fs [N]
91
0
20
0
0
2
4
6
8
10
TIME [s]
Fig. 3.31. Top plot: spring stretch, Δ. Bottom plot: spring force, Fs .
-5 0
2
4
TIME [s]
6
8
10
˙ Fig. 3.32. Top plot: damper stretch rate, Δ. Bottom plot: damper force, Fd .
To verify the energy closure condition for this system, the work done by two additional forces, the driving force and the damper force, must be brought into the picture. The statement of the problem speciſes that “The motion of the slider along the track is prescribed as s(t) = s0 sin ωt.” Clearly a driving force must be applied to the slider, if this desired motion is to be achieved. It is implicitly assumed that a device applies this force and is sufſciently powerful to instantaneously generate the required force that achieves the desired motion of the slider. The right portion of ſg. 3.28 shows the free body diagram for the slider, leading to the following equilibrium equation D¯ e1 + F c e¯2 − F sd = 0, where D(t) is the magnitude of the driving force acting along the direction of axis e¯1 , and F c the magnitude of the force that the track applies to the slider along the direction normal to the track, e¯2 . Because the slider is massless, the right-hand side of the equation vanishes, leading to a static equilibrium condition.
92
3 Basic principles
Pre-multiplying this equation by e¯T1 yields the magnitude of the driving force: D = e¯T1 F sd . The cumulative work done by this force is now found as %
%
t
WD =
(D¯ e1 ) (ds¯ e1 ) = 0
%
t
T
Ds˙ dt = 0
0
t
(¯ eT1 F sd )s˙ dt.
Figure. 3.33 shows the history of the required driving force together with the work it performs.
5
20
Fd [N]
D [N]
40
0
-20
-5 0
Wnc [J]
WD [J]
5
0
-5
0
0
2
4
6
8
10
-0.5
-1 -1.5 0
2
4
6
8
10
TIME [s]
TIME [s]
Fig. 3.33. Top plot: history of the driving force, D(t). Bottom plot: cumulative work done by the driving force, WD .
Fig. 3.34. Top plot: history of the damper force, Fd . Bottom plot: cumulative work done by the damper force, Wnc .
As discussed in section 3.2.3, the cumulative work done by the damper force is % t % t (−cΔ˙ u¯)T (¯ u dΔ) = − cΔ˙ 2 dt. Wnc = 0
0
This quantity represents the energy dissipated in the dashpot. Figure. 3.34 depicts the history of the damper force and the work it performs. As expected, this is a monotonically decreasing function of time, which represents the amount of energy dissipated in the damper in the form of heat. Finally, the energy closure equation for this problem writes E(t) − WD − Wnc = Ei . Figure 3.35 show the various quantities in this equation: the total mechanical energy, the work done by the driving force, and the energy dissipated in the damper. As expected, the sum of these three energies remains constant during the evolution of the system, expressing the energy closure equation. 3.3.4 Problems Problem 3.21. Particle sliding along a slot in a rotating disk Figure 3.36 shows a particle of mass m sliding along a slot in a rotating disk. The disk rotates at a constant angular velocity, Ω, while the time-dependent position of the particle along the slot
3.3 Contact forces
93
6
SYSTEM ENERGIES [J]
4
2
i2 0
x(t)
m
f = Wt
-2
O -4
i1
Q
-6
-8
0
2
4
6
8
10
TIME [s]
Fig. 3.35. Total mechanical energy (◦), work done by the driving force (2); energy dissipated in the damper (); energy closure equation ( ).
Fig. 3.36. Particle sliding along a slot in a rotating disk.
is prescribed as x(t). (1) Find the driving torque, Q(t), required to keep the angular velocity of the disk constant. (2) Find the driving force, F d , required to prescribe to position of the particle along the slot. (3) Assume now that a friction force, F f , acts between the particle and the slot; repeat questions (1) and (2).
Problem 3.22. Particle sliding along a helix with friction A particle slides along a helix as deſned by eq. (2.24) and is subjected to a gravity force see ſg. 2.3. The particle is also subjected to a friction force acting along the −¯ı3 direction, of magnitude Ff = μk Fn2 + Fb2 , where Fn and Fb are the components of the constraint force acting on the particle in the normal and binormal directions, respectively, and μk the kinetic coefſcient of friction. (1) Find the equation of motion for the particle. (2) Plot the time history of the particle curvilinear coordinate. (3) Plot the speed of the particle versus time. (4) Find an analytical expression for the limit velocity of the particle, i.e., the velocity reached by the particle after it has been sliding along the helix for a long time. (5) Find an analytical expression for the limit magnitude of the acceleration vector. (6) What condition must the satisſed by the kinetic coefſcient of friction is the particle does not remain stuck. Use the following parameters: non-dimensional time τ = a/g t; lengths are non-dimensionalized √ ¯ = k/a = 0.35, μk = 0.3. by a, velocities by ag, accelerations by g. k
Problem 3.23. Motion of a particle on a track Figure 3.37 depicts a particle sliding along a planar track under the effect of gravity forces. The constraint force between the particle and the track is unilateral, i.e., the particle cannot go through the track, but it can leave it moving upwards. (1) Find the condition that must be satisſed by the kinetic energy of the particle if it is about to leave the track. (2) Could this condition be satisſed at point A? or at point B? (3) If a friction force (friction coefſcient μ) is present between the particle and the track, what is the condition that must be satisſed by the kinetic energy of the particle if it is about to leave the track.
Problem 3.24. Particle on circular track The particle of mass m is sliding on a circular track under the effect of gravity forces, as depicted in ſg. 3.38. The particle is connected to ſxed point A by means of a spring of stiffness constant k in parallel with a dashpot of constant c. The spring has an un-stretched
94
3 Basic principles
i2 i2
q n A m
A
k
B p0(s) t O I
m
c
g
O
d
q
g i1
R
i1
Fig. 3.37. Particle sliding on a track under the effect of gravity.
Fig. 3.38. Particle on a circular track.
length Δ0 . A viscous friction force F f = μs˙ is acting between the particle and the track. (1) Derive the equation of motion of the system. (2) Plot angle θ as a function of time. (3) ˙ Plot the history of the angular rate θ = θ/ω. (4) Plot the history of the normal force, F c , c c ¯ the track applies on the particle, as F = F /(mRω 2 ). (5) Plot the non-dimensional ki¯ = K/(mR2 ω 2 ), potential energy, V¯ = V /(mR2 ω 2 ), energy dissipated in netic energy, K ¯ c = W c /(mR2 ω 2 ), and energy dissipated at the viscous friction interface, the damper, W ¯ f = W f /(mR2 ω 2 ). Verify that the energy closure equation is satisſed. Use the following W non-dimensional quantities: non-dimensional time τ = ωt, where ω 2 = k/m. Use the fol¯0 = Δ0 /R = 1; g¯ = g/(Rω 2 ) = 2.5; lowing data: d¯ = d/R = 2, ζ = c/(2mω) = 0.1; Δ μ ¯ = μ/(mω) = 0.1. At the initial time, the system is at rest and θ = π/2. Present all your results for τ ∈ [0, 20].
3.4 Newtonian mechanics for a system of particles Newton’s laws, as presented in section 3.1.2, are concerned with a single particle. For many practical engineering applications, these laws must be extended to deal with systems of particles, rather than a single particle. Figure 3.39 depicts a system of N parC F2 ticles. The particles are of mass mi , i = 2 Fi f 1, 2, . . . , N . Each particle is subjected to f23 21 f rC 2i forces that can be divided into two catesi f gories: the externally applied forces and the fi1 i f i2 f12 i3 f1i internal forces. The words “internal” and f3i f32 i3 “external” should be understood with reF1 1 f13 q i spect to the system of particles. Internal f31 ri 3 F forces act and are reacted within the sys3 P I tem, whereas external forces act on the sysi2 O tem but are reacted outside the system. i1 The externally applied force acting on particle i, also called impressed force, is deFig. 3.39. A system of particles. noted F i . The origin of these forces is external to the set of particles; for instance, if the system is subjected to a gravity ſeld or an electromagnetic ſeld, the resulting gravity or electromagnetic forces, respectively,
3.4 Newtonian mechanics for a system of particles
95
would be external forces. Interaction forces between the particles of the system and other material particles would also give rise to external forces. The internal forces, denoted f ij , correspond to the forces that the various particles of the system apply on each other. According to Newton’s third law, these forces appear in pairs of forces of equal magnitude and opposite sign, sharing a common line of action. Force f ij is the force exerted by particle j on particle i, and the companion force, f ji , is that applied by particle i on particle j. Newton’s third law then implies f ij + f ji = 0. The system of particles under scrutiny is very general; it could be a rigid body, a ƀexible body, or a large number of sand particles. For the rigid body, the internal forces are the cohesion forces that make the body a “rigid body.” Of course, there exist no truly rigid body; all bodies will exhibit some amount of elastic deformation under load. For an elastic body, the internal forces are the stresses acting between the particles of the body. The deformable body could also exhibit internal energy dissipation mechanisms; such would be the case of two deformable bodies connected by dashpots, or a single body with internal material damping. If two contacting bodies are taken to form a single system of particles, the contact forces between the bodies are internal forces. If one of the two bodies, however, constitute the system of particles, the contact forces applied on that body will be external forces. 3.4.1 The center of mass As shown in ſg. 3.39, the inertial position vector of particle i is denoted ri and its mass mi . The total mass of the system, denoted m, is then found by summing up the masses of individual particles N ( m= mi . (3.55) i=1
The center of mass of the system of particles will play an important role in the analysis. The location of the center of mass is deſned as follows 1 ( mi r i . m i=1 N
rC =
(3.56)
Let si denote the relative position vector of particle i with respect to the center of mass, see ſg. 3.39. It follows that r i = r C + si , and hence, 1 ( 1 ( mi (r C + si ) = r C + mi s i . m i=1 m i=1 N
rC =
N
This result reveals an important property of the center of mass: Successive time derivatives then yield
)N i=1
mi si = 0.
96
3 Basic principles N (
mi si = 0,
(3.57a)
mi s˙ i = 0,
(3.57b)
mi ¨si = 0.
(3.57c)
i=1 N ( i=1 N ( i=1
3.4.2 The forces and moments The forces applied on each particle can be divided into two categories: the externally applied forces and the internal forces. The sum of all the forces externally applied on the system is N ( F = F i. (3.58) i=1
On the other hand, the sum of all internal forces applied on the system vanishes because all the internal forces can be grouped in pairs (f ij + f ji ) that individually vanish due to Newton’s third law; hence N (
N (
(3.59)
f ij = 0.
i=1 j=1, j=i
The sum of the moments of all the forces externally applied to the system evaluated with respect to the origin of the inertial frame, point O, is MO =
N (
ri F i .
(3.60)
i=1
The subscript, (·)O , indicates the point about which the moments are evaluated. Indeed, the moments could have been evaluated with respect to any arbitrary point. For instance, ) the moments evaluated with respect to the center of mass would N write M C = i F i , and those evaluated with respect to point P are M P = i=1 s )N q F . These various quantities are not independent; indeed, i=1 i i MP =
N ( i=1
qi F i =
N ( i=1
( rP C + si )F i = rP C F +
N (
si F i ,
i=1
where r P C is the relative position vector of the center of mass with respect to point P. It the follows that M P = M C + rP C F . (3.61) Finally, the sum of the moments of all the internal forces of the system evaluated with respect to point O vanishes. First, the moments of all internal forces are grouped
3.4 Newtonian mechanics for a system of particles
97
in pairs ( ri f ij + rj f ji ). Next, it is clear that ri f ij = r⊥ f ij and rj f ji = r⊥ f ji , where r⊥ is the vector that joins point O to the point on the common line of action of the internal force pair that is at the shortest distance from point O. It then follows that ri f ij + rj f ji = r⊥ (f ij + f ji ) = 0, by virtue on Newton’s third law. In summary, N (
N (
ri f ij = 0.
(3.62)
i=1 j=1, j=i
Note that the sum of these moments vanishes when computed with respect to any arbitrary point. 3.4.3 Linear and angular momenta The linear momentum of the system, P , is the sum of the linear momenta of the individual particles N ( P = mi v i . (3.63) i=1
A time derivative of expression r i = r C + si leads to r˙ i = r˙ C + s˙ i = v C + s˙ i , where v C = r˙ C is the velocity vector of the center of mass. The system’s linear momentum ) ) r C + s˙ i ) = m˙rC + N ˙ i = mv C , where the now becomes P = N i=1 mi (˙ i=1 mi s second property of the center mass, eq. (3.57b), was used. The linear momentum of the system then simply becomes (3.64)
P = mv C .
The angular momentum of the system computed with respect to the origin of the inertial frame, denoted H O , is the sum of the corresponding angular momenta of all particles the system N ( HO = ri mi v i . (3.65) i=1
The subscript, (·)O , indicates the point about which the angular momentum is evaluated. The angular momentum vector can be computed with respect to any point; )N for instance, H C = i=1 si mi v i is the angular momentum vector computed with )N respect to the center of mass and H P = i=1 qi mi v i the corresponding quantity evaluated with respect to an arbitrary point P, see ſg. 3.39. These various quantities are not independent of each other; indeed HP =
N ( i=1
qi mi v i =
N (
( rP C + si ) mi v i = rP C P +
i=1
N (
si mi v i .
(3.66)
i=1
It follows that H P = H C + rP C P .
(3.67)
98
3 Basic principles
3.4.4 Euler’s laws for a system of particles Euler’s ſrst law Newton’s second law applied to each of the N particles writes N (
Fi +
f ij = mi ai ,
i = 1, 2, 3, . . . N.
(3.68)
j=1, j=i
Although these equations are all correct, they are difſcult to manipulate because, in general, the system comprises a very large number of particles. To circumvent this problem, the equations of motion of all particles are added together to yield N N N N ( ( ( ( Fi + f ij = mi a i . (3.69) i=1
i=1 j=1, j=i
i=1
The ſrst term represents the sum of all externally applied forces on the system, see eq. (3.58). The second term vanishes in view of eq. (3.59). The ) last term is simpliſed by introducing the expression for the center of mass: N i=1 mi ai = )N )N r C + ¨si ) = rC = m¨rC = maC , where the property of the i=1 mi (¨ i=1 mi ¨ center mass, eq. (3.57c), was used. It follows that F = maC .
(3.70)
This result is known as Euler’s ſrst law [14, 15]. Law 5 (Euler’s ſrst law) The inertial acceleration vector of the center of mass of a system of particles is proportional to the vector sum of all externally applied forces; the constant of proportionality is the total mass of the system. Note the striking resemblance between eq. (3.70) and Newton’s second law for a single particle, eq. (3.4). It appears that Newton’s second law can be applied to a ſctitious particle of mass m located at the center of mass of the system and subjected to all the forces externally applied on the system. Equation (3.70) is much more convenient to use than the N equations of motion for each individual particle; it gives information about the overall response of the system in terms of the motion of its center of mass. Much information, however, has been lost: the N individual vector equations, eqs. (3.68), gave rise to a single vector equation of motion for the system, eq. (3.70). In fact, this latter equation cannot predict the motion of individual particles, nor does it allow to predict the internal forces in the system. In view of eq. (3.64), the time derivative of the linear momentum is P˙ = maC , and hence F = P˙ . (3.71) Clearly, this equation is identical to Euler’s ſrst law, eq. (3.70). Law 6 (Alternative statement of Euler’s ſrst law) The time derivative of the linear momentum vector of a system of particles equals the sum of all externally applied forces.
3.4 Newtonian mechanics for a system of particles
99
Euler’s second law To extract additional information about the response of the system, the moments of the equations of motion for individual particles, eqs. (3.68), with respect to the origin of the inertial frame are evaluated and summed up for all particles to yield N (
ri F i +
i=1
N (
N (
ri f ij =
i=1 j=1, j=i
N (
mi ri ai .
(3.72)
i=1
The ſrst term represents the moment of the externally applied forces computed with respect to point O, see eq. (3.60). The second term vanishes in view of eq. (3.62). Equation (3.72) now reduces to MO =
N (
mi ri ai .
(3.73)
i=1
The right-hand side of this equation can be expressed in a simpler manner in terms of the angular momentum vector; indeed, a time derivative of eq. (3.65) yields ˙ = H O
N ( i=1
mi vi v i +
N (
mi ri ai =
i=1
N (
mi ri ai .
(3.74)
i=1
Comparing the last two equations then leads to ˙ . MO = H O
(3.75)
This result is known as Euler’s second law [14, 15]. Law 7 (Euler’s second law) The time derivative of the angular momentum vector of a system of particles equals the sum of all moments externally applied to the system, when these quantities are evaluated with respect to a common inertial point. Introducing eqs. (3.61) and (3.67) into eq. (3.75) leads to ˙ + vC P + rC P˙ = H ˙ + rC F , M C + rC F = H C C which reduces to ˙ C. MC = H
(3.76)
This is another form of Euler’s second law for a system of particles. Law 8 (Alternative statement of Euler’s second law) The time derivative of the angular momentum vector of a system of particles equals the sum of all moments externally applied to the system, when these quantities are evaluated with respect to the system’s center of mass. It would be erroneous to believe that this statement holds when moments and angular momentum vectors are evaluated with respect to an arbitrary point P. Indeed, ˙ −( introducing eqs. (3.61) and (3.67) into eq. (3.75) leads to M P − rP C F = H vC − P ˙ + vP P − rP C F , and ſnally vP )P − rP C P˙ = H P ˙ P + vP P . MP = H
(3.77)
100
3 Basic principles
3.4.5 The principle of work and energy In section 3.1.4, the principle of work and energy was developed for a single particle, see eq. (3.12). When dealing with a system of particles, this principle can be applied to each individual particle, leading to %
tf
N (
(F Ti +
ti
f Tij ) dri = Ki (tf ) − Ki (ti ).
(3.78)
j=1, j=i
In this expression, Ki = 1/2 mi vi2 represents the kinetic energy of particle i. As was done in the previous sections, the equations for each individual particle are added together to ſnd %
N tf (
ti
% (F Ti dr i ) +
N tf (
ti
i=1
N (
(f Tij dr i ) =
i=1 j=1, j=i
N ( i=1
Ki (tf ) −
N (
Ki (ti ). (3.79)
i=1
This ſrst term clearly represents the work done by all externally applied forces. The second term is a complex double summation over the work done by all internal forces. It would be erroneous to believe that this term vanishes; indeed, consider two internal forces that obey Newton’s third law: f ij + f ji = 0. The differential work done by these two forces is f Tij dr i + f Tji dr j = f Tij (dr i − dr j ) = 0, since the two particles have two distinct differential displacements along their distinct paths. Finally, the terms on the right-hand side represent the difference between the total kinetic energies of the system at the ſnal and initial times. The total kinetic energy of the system, )N K, is found by summing up the contributions of each individual particle, K = i=1 Ki . The principle of work and energy for a system of particles now becomes % Wti →tf =
N tf (
ti
i=1
% (F Ti
dr i )+
N tf (
ti
N (
(f Tij dri ) = K(tf )−K(ti ). (3.80)
i=1 j=1, j=i
Although this statement is correct, it is of limited practical use because it requires the evaluation of the work done by all internal forces. This contrasts with the equations of motion derived in the previous section, eqs. (3.71) and (3.75), which do not involve the internal forces. For speciſc systems of particles it will be possible to prove that the term involving the work done by all internal forces does indeed vanish; this is the case for a rigid body, for instance. In such case, the internal forces do not appear in the statement of the principle of work and energy that can then be used conveniently. 3.4.6 The principle of impulse and momentum It is interesting to integrate Euler’s ſrst law, eq. (3.71), over a period of time from ti to tf , to ſnd
3.4 Newtonian mechanics for a system of particles
%
%
tf
tf
F (t) dt = ti
P˙ dt = P (tf ) − P (ti ).
101
(3.81)
ti
The term on the left-hand side is called the linear impulse of all externally applied forces. Equation (3.81) expresses the principle of linear impulse and momentum for a system of particles. Principle 5 (Principle of linear impulse and momentum for a system) The linear impulse of all externally applied forces equals the change in linear momentum of the system of particles. In the absence of external forces, this principle implies P (tf ) = P (ti ), i.e., the system’s linear momentum remains constant at all times, since ti and tf are instants chosen arbitrarily. A similar treatment of Euler’s second law, eq. (3.75), leads to % tf % tf ˙ dt = H (tf ) − H (ti ). M O (t) dt = (3.82) H O O O ti
ti
The term on the left-hand side is called the angular impulse of all externally applied forces. Equation (3.82) expresses the principle of angular impulse and momentum for a system of particles. Principle 6 (Principle of angular impulse and momentum for a system) The angular impulse of all externally applied forces equals the change in angular momentum of the system when both angular impulse and momentum are computed with respect to the same inertial point. In the absence of external moments with respect to point O, this principle implies H O (tf ) = H O (ti ), i.e., the system’s angular momentum remains constant at all times. Of course, a similar principle can be derived from eq. (3.76); in this case, both angular impulse and momentum must be evaluated with respect to the center of mass of the system of particles. 3.4.7 Problems Problem 3.25. Particles interconnected by a massless link Consider the dumbbell consisting of two particles of mass m1 and m2 , respectively, connected by a massless arm of constant length , as depicted in ſg. 3.40. (1) Show that the work done by all internal forces in the system vanishes. (2) Write the principle of work and energy for the system. Hint: the differential displacements of the two particles dr 1 and dr 2 , respectively, are not independent; they must satisfy the constraint imposed by the constant length bar.
Problem 3.26. Particles linked by an inextensible cable The system depicted in ſg. 3.41 consists of two particles of mass m1 and m2 , respectively, linked by an inextensible cable. (1) Show that the work done by all internal forces in the system vanishes. (2) Write the principle of work and energy for the system.
102
3 Basic principles
l
m2 m1
m2
m1 Fig. 3.40. Particles interconnected by a massless link.
Fig. 3.41. Two particles linked by an inextensible cable.
Problem 3.27. System of three rigidly connected particles Figure 3.42 depicts three particles of masses m1 , m2 , and m3 , respectively, located at the vertices of an equilateral triangle with sides of length . Particle m1 touches a ſxed plane at point A at all times and the system is subjected to a gravity ſeld as indicated on the ſgure. (1) Derive the equation of motion of the system based on Euler’s second law. (2) Show that the same equation can be obtained from the principle of conservation of total mechanical energy. (3) Plot the time history of angle θ. (4) Plot the time history of angular velocity θ . (5) Find the reaction forces at point A. (6) On one graph, plot the time histories of the non-dimensional normal contact force, F¯ n , and friction force, F¯ f , at point A. (7) If the static friction coefſcient at point A is μs = 0.5, for what value of angle θ will particle m1 start sliding? Use the following data: m1 = 10, m2 = 2, and m3 = 10 kg. Use the non-dimensional time τ = t g/, and non-dimensional forces F¯ = F/(mg), where m = m1 + m2 + m3 . At time t = 0, θ = 2π/3 radians and θ = −1, where (·) indicates a derivative with respect to the non-dimensional time τ . Present all your results for τ ∈ [0, 2].
m2 60
m1
l
o
60
m3
o
k2
i2 k3
l 60
o
m1 A
l
k1
g
q
Fig. 3.42. Three interconnected particles touching a plane at point A.
m3
c2 c3 c1
m2
i1
Fig. 3.43. Three interconnected particles.
Problem 3.28. System of three interconnected particles Figure 3.43 shows a system of three particles of masses m1 , m2 , and m3 , respectively. The particles are linked by springs of stiffness constants k1 , k2 , and k3 , respectively, and dashpots of constants c1 , c2 , and c3 , respectively. The un-stretched lengths of the springs are 1 , 2 , and 3 respectively. (1) Draw a free body diagram of each particle. (2) Derive the equations of motion of the system. (3) Solve these equations numerically for a period of 50 s. (4) On one graph, plot the coordinates of particle 1 relative to the center of mass as a function of time. (5) Plot the relative coordinates of particle 2 versus time. (6) Plot the relative coordinates of particle 3 versus time. (7) Plot the magnitude of the forces in the three spring/dashpot systems. (8) Plot the components of the linear momentum vector of the system. Comment on
3.4 Newtonian mechanics for a system of particles
103
your results. (9) Plot the components of the angular momentum vector of the system computed with respect to the center of mass. Comment on your results. (10) Plot the kinetic, and strain energies of the system. (11) Compute the energy dissipated in the dashpots. (12) Demonstrate that the principle of work and energy is veriſed by your solution. Use the following data: m1 = 5, m2 = 3 and m3 = 7 kg; k1 = 12, k2 = 25 and k3 = 6 N/m; c1 = 0.25, time, c2 = 0.12 and c3 = 0.16 N·s/m; 1 = 0.6, 2 = 0.4 and 3 = 0.9m. At the initial 0, 0 and r = 0, 0.4, 0 the position vector of the particles are r1 = 0, 0, 0 , r 2 = 0.9, 3 m, respectively. The initial velocities of the particles are v 1 = −25, 25, 0 and v 2 = v 3 = 0, 0, 0 .
Problem 3.29. System of three interconnected particles Figure 3.43 shows a system of three particles of masses m1 , m2 , and m3 , respectively. The particles are linked by springs of stiffness constants k1 , k2 , and k3 , respectively, and dashpots of constants c1 , c2 , and c3 , respectively. The un-stretched lengths of the springs are 1 , 2 , and 3 respectively. The system of particle evolves freely in two-dimensional space. (1) Is the linear momentum of the system preserved? (2) Is the angular momentum of the system preserved. (3) Is the total mechanical energy of the system preserved. (4) Write an energy related quantity that is preserved during the evolution of the system.
Problem 3.30. Particles interconnected by a spring and damper Figure 3.44 shows two particles of mass m1 and m2 connected together by a spring of stiffness k and a damper of constant c. The initial conſguration of the system is indicated on the ſgure and the initial velocity vectors of the two particles are v 10 and v 20 , respectively. The following quantities are deſned: Ω 2 = k/m and c = 2mΩζ, where m = m1 + m2 . In the present conſguration, r1 and r2 measure the distance from the center of mass to particles m1 and m2 , respectively. For this problem, it is convenient to use the polar coordinate system indicated on the ſgure with its origin at the center of mass of the system. (1) When applying Newton’s second law to this problem, can the accelerations of the particles with respect to the center of mass be used? Justify your answer. (2) Are r1 and r2 independent variables? (3) Set up the equations of motion of the system. (4) Plot r1 as a function of the non-dimensional time τ = Ωt for τ ∈ [0, 10π]. (5) Plot θ(τ ). (6) Plot the trajectory of the particle in space. (7) Plot ˙ ¯ ) = θ/Ω. the history of the non-dimensional angular velocity Ω(τ (8) Plot the history of the components of the velocity vector of particle m1 in the inertial frame, v¯1x = v1x /(Ωr10 ) and v¯1y = v1y /(Ωr10 ). Plot the corresponding quantities for the velocity vector of the second particle. (9) Plot the history of the non-dimensional total mechanical energy of the system 2 ¯ ) = E/(mΩ 2 r10 E(τ ); comment your result. (10) Compute the non-dimensional cumulative 2 ¯ (τ ) = W/(mΩ 2 r10 ) as a function of τ . (11) Plot the energy dissipated in the damper W ¯ ¯ history of the quantity E(τ ) + W (τ ); comment your result. Use the following data: μ1 = m1 /m = 0.3; ζ = 0.02; v 1 (t = 0)/(Ωr10 ) = −0.1¯ı1 − 0.5 ¯ı2 ; v 2 (t = 0)/(Ωr10 ) = 2.2 ¯ı1 + 0.6 ¯ı2 ; θ0 = 0; r1 (t = 0)/r10 = 1;
Problem 3.31. Particle suspended from a circular track Figure 3.45 shows a particle of mass M sliding along a track deſned by a curve p0 (s) under the effect of gravity. A particle of mass m is suspended from the ſrst particle by means of a spring of stiffness constant k in parallel with a dashpot of constant c. The un-stretched length of the spring is Δ0 . A viscous friction force F f = μs˙ is acting between particle M and the track. (1) Derive the three equations of motion of the system for a curve of arbitrary shape. (2) Particularize the equations of motion to the case where the curve is a circle of radius R, as depicted in the right portion of ſg. 3.45. (3) Solve these equations numerically. (4) On one ¯2 = x2 /R, of particle m. (5) Plot the history of graph, plot the coordinates, x ¯1 = x1 /R and x
104
3 Basic principles
i2 v20 m2
Initial configuration v10
s p0(s) i2
m1
C r20
r10
Present configuration r1 c e2 k r2
k x
i1 e1
c
g m i1
O
s
q
M
i2
R
q
M
i1 c
C
k
Fig. 3.44. Particles interconnected by a spring and damper.
m
Fig. 3.45. Particle suspended from a circular track.
angle θ. (6) On one graph, plot the velocity components, v¯1 = v1 /(Rω) and v¯2 = v2 /(Rω), ˙ (8) Plot the history of the of particle m. (7) Plot the history of the angular rate θ = θ/ω. normal force, F c , the track applies on the particle, as F¯ c = F c /(mRω 2 ). (9) Plot the non¯ = K/mR2 ω 2 , potential energy, V¯ = V /(mR2 ω 2 ), energy dimensional kinetic energy, K ¯ c = W c /(mR2 ω 2 ), and energy dissipated at the viscous friction dissipated in the damper, W ¯ f = W f /(mR2 ω 2 ). Verify that the energy closure equation is satisſed. Use interface, W the following non-dimensional quantities: θ = s/R, and non-dimensional time τ = ωt, ¯0 = Δ0 /R = 0.5; where ω 2 = k/m. Use the following data: ζ = c/(2mω) = 0.2; Δ g¯ = g/(Rω 2 ) = 2.5; μ ¯ = μ/(M ω) = 0.2; m ¯ = m/M = 0.25. At the initial time, the ¯0 ), 0 and θ = 0. Present system is at rest, the position vector of particle m is x = −(1 + Δ all your results for τ ∈ [0, 20]
Problem 3.32. Two particles linked by an elastic spring Consider the system depicted in ſg. 3.46 that consists of two particles of mass m1 and m2 , respectively, connected by a massless spring of stiffness k. (1) Show that the work done by the force in the elastic spring, can be derived from a potential. (2) What is the expression of the strain energy function of the spring if it is a linearly elastic spring of stiffness constant k.
u
m2
i2 k
q
m
x m1 Fig. 3.46. Two particles linked by an elastic spring.
g
l k
M
i1
Fig. 3.47. Inverted pendulum mounted on a track.
3.4 Newtonian mechanics for a system of particles
105
Problem 3.33. Inverted pendulum mounted on a track Figure 3.47 shows an inverted pendulum of length with a tip mass m. The pendulum is mounted on a cart of mass M free to translate along a horizontal track. A torsional spring of stiffness constant k restrains the pendulum at its attachment point. The spring is un-stretched when angle θ = 0. (1) Derive the two equations of motion of the system. (2) Solve these equations numerically. (3) Plot the cart’s position, x ¯ = x/ versus τ . (4) Plot angle θ. (5) ¯ . (8) Plot θ . (9) Plot the cart’s velocity, x ¯ . (6) Plot θ . (7) Plot the cart’s acceleration, x 2 2 2 2 ¯ ¯ Plot the system’s kinetic, K = K/m ω , potential, V = V /(m ω ), and total mechanical energies. Use the following data: μ = M/m = 1.5. Use non-dimensional time τ = ωt, where ω 2 = k/(m2 ) and (·) denotes a derivative with respect to τ . At the initial time, x ¯ = 0, x ¯ = 1, θ = π/4, θ = 0. Present all your results for τ ∈ [0, 20]. Study two cases, g¯ = g/(ω 2 ) = 0.8 and g¯ = 4, and comment on the differences.
Problem 3.34. Flexible pendulum on a slider Figure 3.48 depicts a slider of mass M constrained x M e2 i2 to move along a horizontal track. A bob of mass m O is attached to the slider at point A by means of a A k r g spring of stiffness constant k and un-stretched length Fd r0 . The displacement of the slider is denoted x, and m i1 the position of the bob is expressed by its polar coe1 ordinates, r and θ. Gravity acts on the system as inq dicated in the ſgure. The bob is subjected to a drag force F d = −Acd v m v m , where is the ƀuid Fig. 3.48. Flexible pendulum mass density, A the cross-sectional area of the bob, mounted on a slider. cd is the drag coefſcient, and v m the velocity vector of the bob. (1) Derive the equations of motion of the system using the coordinates x, r, and θ. (2) Solve these equations numerically. (3) Plot the cart’s position, x ¯ = x/r0 versus τ . (4) Plot distance r¯ = r/r0 . (5) Plot angle θ. ¯ . (10) (6) Plot the cart’s velocity, x ¯ . (7) Plot r¯ . (8) Plot θ . (9) Plot the cart’s acceleration, x 2 ¯ d = Wd /(mr0 ω 2 ). (13) Plot r¯ . (11) Plot θ . (12) Plot the cumulative dissipated energy, W 2 2 2 2 ¯ ¯ Plot the system’s kinetic, K = K/mr0 ω , potential, V = V /(mr0 ω ), and total mechanical energies. Check the energy closure equations. Use the following data: μ = M/m = 1.5, ¯ d = 0.01, where ¯ = r03 /m and A ¯ = A/r02 . Use g¯ = g/(r0 ω 2 ) = 0.2, and ζ = ¯Ac 2 non-dimensional time τ = ωt, where ω = k/m and (·) denotes a derivative with respect to τ . At the initial time, x ¯ = 0, r¯ = 2, θ = π/4, x ¯ = 1, r¯ = 1, θ = 0. Present all your results for τ ∈ [0, 20].
4 The geometric description of rotation
The most natural way of describing rotations is rooted in their geometric representation, which is the focus of this chapter. More abstract approaches, however, also exist and will be presented in chapter 13. Consider an orthonormal basis I = (¯ı1 , ¯ı2 , ¯ı3 ). The rotation operation brings orthonormal basis I to a new orthonormal basis E = (¯ e1 , e¯2 , e¯3 ). In section 4.1, the rotation operation is characterized by expressing the unit vectors of basis E in terms of those of basis I. This leads to the concept of direction cosine matrix. The simplest rotation operation consists of a rotation of basis I about one of its unit vectors. This operation, called a planar rotation, is discussed in section 4.2. The fact that successive planar rotations in distinct planes do not commute is emphasized in section 4.3, and leads to the representation of arbitrary rotations in terms of three successive planar rotations. The resulting Euler angle representation is described in section 4.4. Euler’s theorem on rotations presented in section 4.5 states that any arbitrary rotation that leave a point ſxed can be viewed as a single rotation about a unit vector. This fundamental result leads to the concept of rotation tensor presented in section 4.6; a formal deſnition of tensors follows. Important rotation operations are examined in details: the composition of rotations is presented in section 4.9, and time and space derivatives of rotations in sections 4.10 and 4.12, respectively. Applications to particle dynamics are presented in section 4.13.
4.1 The direction cosine matrix Consider the two orthonormal bases I = (¯ı1 , ¯ı2 , ¯ı3 ) and E = (¯ e1 , e¯2 , e¯3 ) shown in ſg. 4.1. A rotation is deſned as the operation that brings basis I to basis E. Unit vector e¯1 can be expressed as a linear combination of the vectors of basis I e¯1 = D11¯ı1 + D21¯ı2 + D31¯ı3 .
(4.1)
The coefſcients of this linear combination are readily expressed as Dk1 = ¯ıTk e¯1 . Proceeding similarly with the three unit vectors deſning basis E yields the terms of
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_4 © Springer Science+Business Media B.V. 2011
108
4 The geometric description of rotation
the direction cosine matrix, D, as Dk = ¯ıTk e¯ . e3 E
i3
e2
(4.2)
Using eq. (1.8), and observing that vectors ¯ık and e¯ are unit vectors yields an alternative expression for the direction cosine matrix is obtained Dkl = cos (¯ık , e¯ ) .
(4.3)
e1
This expression gives its name to the direction cosine matrix: its entries are the cosine of the angle between ¯ık i1 I and e¯ , the unit vectors deſning bases I and E, respectively. Each component of the direction cosine matrix is Fig. 4.1. rotation from basis a scalar quantity. The direction cosine matrix, however, I to E. is not a second-order tensor, see section 4.8.2. The matrix of direction cosines provides a simple description of rotations. Each term of the direction cosine matrix is a scalar quantity representing the cosine of the angle between two vectors, eq. (4.2). As will be shown in the following sections, rotations can be represented by as few as three parameters. This basic property of rotation is not apparent in this description. i2
4.2 Planar rotations A simple example of a rotation is a planar rotation deſned as a rotation of angular magnitude e3 φ about one of the axes deſning basis I, say f ¯ ı 1 , as depicted in ſg. 4.2. The direction cosine i3 matrix corresponding to this planar rotation can be readily obtained from eq. (4.3) and inspecf i2 e2 tion of ſg. 4.2. The coefſcients of the direction cosine matrix are obtained from elementary Fig. 4.2. Planar rotation of magnitude trigonometry as φ about axis ¯ı1 . ⎧ ⎤ ⎡ 1 0 0 ⎨ e¯1 = ¯ı1 e¯2 = cos φ ¯ı2 + sin φ ¯ı3 ⇐⇒ D1 (φ) = ⎣0 cos φ − sin φ ⎦ . (4.4) ⎩ 0 sin φ cos φ e¯3 = − sin φ ¯ı2 + cos φ ¯ı3 i1 = e1
The direction cosine matrix corresponding to planar rotation of magnitude φ about axis ¯ı2 is found in a similar manner as ⎧ ⎤ ⎡ cos φ 0 sin φ ⎨ e¯1 = cos φ ¯ı1 − sin φ ¯ı3 1 0 ⎦. e¯2 = ¯ı2 (4.5) ⇐⇒ D2 (φ) = ⎣ 0 ⎩ − sin φ 0 cos φ e¯3 = sin φ ¯ı1 + cos φ ¯ı3 The corresponding matrix for a planar rotation of magnitude φ about axis ¯ı3 becomes
4.4 Euler angles
⎧ ⎤ ⎡ cos φ − sin φ 0 ⎨ e¯1 = cos φ ¯ı1 + sin φ ¯ı2 e¯2 = − sin φ ¯ı1 + cos φ ¯ı2 ⇐⇒ D3 (φ) = ⎣ sin φ cos φ 0⎦ . ⎩ 0 0 1 e¯3 = ¯ı3
109
(4.6)
4.3 Non-commutativity of rotations Rotation operations do not commute. 90° about i3 90° about i2 This means that the order in which successive rotations are performed is imi3 portant. This point is most easily understood by looking at the simple example i2 depicted in ſg. 4.3. A rigid block is rotated by 90◦ i1 about ¯ı2 , then by 90◦ about ¯ı3 . The ſ90° about i3 nal conſguration of the block is shown 90° about i2 in the top portion of ſg. 4.3. The same rigid block is now rotated by 90◦ about Fig. 4.3. Two successive planar rotations do not ¯ı3 , then by 90◦ about ¯ı2 . The ſnal con- commute. ſguration, depicted in the bottom portion of ſg. 4.3, is clearly different from that obtained when the two successive rotations were performed in the reverse order. In this example, the two successive rotations are performed about axes ſxed in space. If the two rotations are performed about body ſxed axes, the same conclusion is reached: the ſnal conſguration depends on the order of the rotation operations. In the next section, it will be shown that an arbitrary rotation can be viewed as a succession of three planar rotations. The fact that rotation operations about distinct axes do not commute implies that the order in which these three successive planar rotations are performed is important. More generally, when several rotations are involved in a problem, the order of application of these rotations must be carefully speciſed.
4.4 Euler angles An arbitrary rotation from I = (¯ı1 , ¯ı2 , ¯ı3 ) to E = (¯ e1 , e¯2 , e¯3 ) can be viewed as a succession of three planar rotations about three different axes [16]. Figure 4.4 shows one possible set of three planar rotations, which can be described as follows. 1. A planar rotation of magnitude φ, called precession, about axis ¯ı3 brings basis I to basis A = (¯ a1 , a ¯2 , a ¯3 ). Equation (4.6) gives the corresponding direction cosine matrix ⎧ ⎤ ⎡ ¯1 = cos φ ¯ı1 + sin φ ¯ı2 , cos φ − sin φ 0 ⎨a a ¯2 = − sin φ ¯ı1 + cos φ ¯ı2 , ⇐⇒ D3 (φ) = ⎣ sin φ cos φ 0⎦ . (4.7) ⎩ 0 0 1 a ¯3 = ¯ı3 .
110
4 The geometric description of rotation
2. A planar rotation of magnitude θ, called nutation, about axis a ¯1 brings basis A to basis B = (¯b1 , ¯b2 , ¯b3 ). Equation (4.4) gives the corresponding direction cosine matrix ⎧ ⎤ ⎡ 1 0 0 ¯1 , ⎨ ¯b1 = a ¯b2 = cos θ a ¯2 + sin θ a ¯3 , ⇐⇒ D1 (θ) = ⎣0 cos θ − sin θ ⎦ . (4.8) ⎩¯ 0 sin θ cos θ b3 = − sin θ a ¯2 + cos θ a ¯3 . 3. A planar rotation of magnitude ψ, called spin, about axis ¯b3 brings basis B to basis E. Once again, eq. (4.6) gives the corresponding direction cosine matrix ⎧ ⎤ ⎡ cos ψ − sin ψ 0 ⎨ e¯1 = cos ψ ¯b1 + sin ψ ¯b2 , e¯2 = − sin ψ ¯b1 + cos ψ ¯b2 , ⇐⇒ D3 (ψ) = ⎣ sin ψ cos ψ 0⎦ . (4.9) ⎩ ¯b3 . 0 0 1 e¯3 = The relationship between bases I and E is obtained by combining the three successive rotations described by eqs. (4.7) to (4.9) to ſnd ⎧ ⎨ e¯1 = ( Cφ Cψ − Sφ Cθ Sψ ) ¯ı1 +( Sφ Cψ + Cφ Cθ Sψ ) ¯ı2 +Sθ Sψ ¯ı3 , e¯2 = (−Cφ Sψ − Sφ Cθ Cψ ) ¯ı1 +(−Sφ Sψ + Cφ Cθ Cψ ) ¯ı2 +Sθ Cψ ¯ı3 , (4.10) ⎩ e¯3 = Sφ Sθ ¯ı1 −Cφ Sθ ¯ı2 +Cθ ¯ı3 , where the following short-hand notations were used: Cφ = cos φ, Sφ = sin φ, etc. The three rotation angles, φ, θ, and ψ, are called the Euler angles. The direction cosine matrix expressed in terms of Euler angles becomes ⎡ ⎤ Cφ Cψ − Sφ Cθ Sψ −Cφ Sψ − Sφ Cθ Cψ Sφ Sθ D3-1-3 = ⎣Sφ Cψ + Cφ Cθ Sψ −Sφ Sψ + Cφ Cθ Cψ −Cφ Sθ ⎦ . (4.11) Sθ Sψ Sθ Cψ Cθ It is often important to perform the inverse operation: given a direction cosine matrix, ſnd the corresponding Euler angles. The following process will yield the desired angles. Assuming D32 = 0, (4.12a)
tan ψ = D31 /D32 , sin θ = D31 sin ψ + D32 cos ψ, sin φ = D21 cos ψ − D22 sin ψ,
cos θ = D33 , cos φ = D11 cos ψ − D12 sin ψ.
(4.12b) (4.12c)
To remove the ambiguity associated with inverse trigonometric functions, both sine and cosines of the angles are derived, leading to a deſnition of each angle in the range [−π, +π].1 When θ = 0 or π, a singularity occurs. In fact, the process then reduces to a single rotation of magnitude (φ + ψ) or (φ − ψ) for θ = 0 or π, respectively, because the direction cosine matrix reduces to 1
In computer implementations, these operations are conveniently performed with the help of the function atan2(y, x) = tan−1 (y/x), yielding an angle in the range [−π, +π].
4.4 Euler angles
⎡
111
⎤
cos(φ ± ψ) − sin(φ ± ψ) 0 D = ⎣ sin(φ ± ψ) cos(φ ± ψ) 0⎦ . 0 0 1
(4.13)
Clearly, angles φ and ψ cannot be determined individually, the sole combination φ ± ψ can be evaluated. The Euler angles introduced above correspond to the following sequence of planar rotations: a rotation of magnitude φ, about axis ¯ı3 , then, a rotation of magnitude θ about axis a ¯1 , and ſnally, a rotation of magnitude ψ about axis ¯b3 . This sequence will be called the “3-1-3 sequence” to indicate the sequence of body axes about which the three successive rotations are taking place. Clearly, Euler angles could be deſned i3 = a3 e2 q in several different manners: the ſrst rotay b tion could occur about either of the three e = b 2 3 3 axes, ¯ı1 , ¯ı2 , or ¯ı3 , offering three choices. q Because two consecutive rotations cannot a2 take place about the same axis, two alterf natives are possible for the second rotation. i2 Two choices are again possible for the last rotation. In all, 3×2 ×2 = 12 possible choices ex- i1 ist, corresponding to sequences labeled 1-2y e1 f 1, 1-2-3, 1-3-1, 1-3-2, 2-1-2, 2-1-3, 2-3-1, a1 = b1 2-3-2, 3-1-2, 3-1-3, 3-2-1 and 3-2-3. Three of these sequences, 3-2-3, 3-2-1 and 3-1- Fig. 4.4. An arbitrary rotation viewed as 2 will be the focus of problems below. A three successive planar rotations. summary of expressions and formulæ involving Euler angles appears in section 4.11. The representation of rotation in terms of three Euler angles shows that the direction cosine matrix can be expressed in terms of three parameters only. This representation, however, presents several drawbacks. First, Euler angles can be deſned in several different manners, and the choice of the rotation sequence is entirely arbitrary. Furthermore, the expression for the direction cosine matrix, as seen for this example in eq. (4.11), is rather complicated and involves the evaluation of numerous trigonometric functions. Finally, singularities will occur in the evaluation of Euler angles from a direction cosine matrix for all 12 possible sequences. 4.4.1 Problems Problem 4.1. Euler angles, sequence 3-2-3 A popular choice of Euler angles is the 3-2-3 sequence that corresponds to the following sequence of planar rotations: a rotation of magnitude ψ, called precession, about axis ¯ı3 , then, a rotation of magnitude θ, called nutation, about axis a ¯2 , and ſnally, a rotation of magnitude φ, called spin, about axis ¯b3 . (1) Find the rotation matrix in terms of this Euler angle sequence. (2) Determine the singularities associated with this choice of Euler angles.
112
4 The geometric description of rotation
Problem 4.2. Euler angles, sequence 3-2-1 A popular choice of Euler angles for airplane ƀight mechanics is the 3-2-1 sequence that corresponds to the following sequence of planar rotations: a rotation of magnitude ψ, called ¯2 , and heading, about axis ¯ı3 , then, a rotation of magnitude θ, called attitude, about axis a ſnally, a rotation of magnitude φ, called bank, about axis ¯b1 . (1) Find the rotation matrix in terms of this Euler angle sequence. (2) Determine the singularities associated with this choice of Euler angles.
Problem 4.3. Euler angles, sequence 3-1-2 A possible choice of Euler angles is the 3-1-2 sequence that corresponds to the following sequence of planar rotations: a rotation of magnitude φ, about axis ¯ı3 , then, a rotation of magnitude θ about axis a ¯1 , then ſnally, a rotation of magnitude ψ about axis ¯b2 . (1) Find the rotation matrix in terms of this Euler angle sequence. (2) Determine the singularities associated with this choice of Euler angles.
4.5 Euler’s theorem on rotations Euler’s theorem [17] on rotations states the following. Theorem 4.1 (Euler’s theorem on rotations). Any arbitrary rotation of a rigid body that leaves on of its point ſxed can be viewed as a single rotation of magnitude φ about a unit vector n ¯. To prove this statement, consider two frames, F1 = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )] and F2 = e3 [O, E = (¯ e1 , e¯2 , e¯3 )], shown in ſg. 4.5, and C¢ n associated with two conſgurations of a rigid B¢ body that its material point O ſxed. Because e2 the vectors deſning bases I and E are unit O i2 vectors, they all are radii of a sphere of unit B radius and center O. Vector ¯ı1 can be brought to vector e¯1 by a single rotation about axis n ¯ 1 . This axis passes A through point O and belongs to plane P1 that i1 is normal to segment AA and passes through e1 A¢ point O, as shown in ſg. 4.5. Fig. 4.5. An arbitrary rotation viewed as On the other hand, vector ¯ı2 can be a single rotation about axis n ¯. brought to vector e¯2 by a single rotation about axis n ¯ 2 , passing through point O and belonging to plane P2 , which is normal to the segment BB and passes through point O. If both operations must be achieved by a single rotation, axis n ¯ about which this common rotation takes place must be at the intersection of planes P1 and P2 . Let point P be the intersection of axis n ¯ with the unit sphere. Figure 4.5 shows the great circle segments PA, PA , PB, PB , AB, and A B . By construction, PA = PA because point A can be brought to point A by a rotation i3
C
P
4.6 The rotation tensor
113
about n ¯ . Similarly, PB = PB . Finally, AB = A B since both segments correspond to a 90 degree rotation. Consequently, the spherical triangles APB and A PB are equal. This, in turn, implies the equality of angles ∠APB and ∠A PB . Subtracting from these two angles their common part, ∠A PB, yields the following result: ∠APA = ∠BPB = φ, where φ is now the magnitude of the rotation about axis n ¯ that simultaneously brings ¯ı1 to e¯1 and ¯ı2 to e¯2 . A rotation of magnitude φ about axis n ¯ has been shown to bring ¯ı1 to e¯1 , and ¯ı2 to e¯2 simultaneously. It remains to prove that a rotation of the same magnitude will bring ¯ı3 to e¯3 . Let the rotation of magnitude φ about axis n ¯ bring ¯ı3 to vector ¯ı3 . Rea soning as before, it is clear that PC = PC’, PA = PA , and by construction ∠APC = ∠A PC = φ + ∠APC . This shows the equality of spherical triangles APC and A PC . This in turns implies the equality of segments AC and A C . Since segment AC corresponds to a 90 degree rotation, so does segment A C , implying the orthogonality of ¯ı3 and e¯1 . A similar reasoning on spherical triangles BPC and B PC leads to the orthogonality of ¯ı3 and e¯2 . Finally, since ¯ı3 is orthogonal to both e¯1 and e¯2 , it is clear that ¯ı3 = e¯3 . In summary, basis I can be brought to basis E by a single rotation of magnitude φ about axis n ¯ , which proves Euler theorem on rotations.
4.6 The rotation tensor Euler’s theorem on rotations leads a compact expression for the rotation tensor. Consider an arbitrary vector a and let the rotation of magnitude φ about unit vector n ¯ bring this vector to b. The rotation tensor, R, relates these two vectors, b = R a Basic expression for the rotation tensor Figure 4.6 depicts the conſguration of the problem. A s n Vector b is the sum of segments OC and CB, and C elementary geometry then yields b = OC + CB = i3 b cos α n ¯ + b sin α [¯ s cos φ + t¯sin φ]. Unit veca f tor t¯ is along the vector product of vectors n ¯ and a t a B a, t¯ = n a/ na, and unit vector s¯ is s¯ = tn ¯ = b ( na)¯ n/ na. O i2 The fundamental property of rotation is to preserve length, i.e., the norms of vectors a and b i1 must be identical, leading to n ¯ T a = a cos α = b cos α and na = a sin α = b sin α. With Fig. 4.6. A rotation of magnitude φ about axis n ¯. the help of these relationships, vector b becomes b = (¯ nT a) n ¯ + ( na)¯ n cos φ + ( na) sin φ. Applying identity (1.34a) then leads to b = a + sin φ ( na) + (1 − cos φ) n n a = R a,
(4.14)
114
4 The geometric description of rotation
where the rotation tensor [15], R, is deſned as R = I + sin φ n + (1 − cos φ) nn .
(4.15)
This result is known as Rodrigues’ rotation formula. This fundamental result expresses the rotation tensor in terms of a unit vector n ¯, and a rotation of magnitude φ about this unit vector. It is a direct consequence of Euler’s theorem on rotations, theorem 4.1. In view of eq. (4.15), vector n ¯ can be expressed as ⎧ ⎫ ⎫ ⎧ R − R23 ⎬ ⎨1/(R32 + R23 )⎬ 1 ⎨ 32 R13 − R31 = n1 n2 n3 (1 − cos φ) 1/(R13 + R31 ) . n ¯= ⎩ ⎭ 2 sin φ ⎩R − R ⎭ 1/(R21 + R12 ) 21 12 Hence, the orientation of this vector is ⎧ ⎫ ⎨R32 − R23 ⎬ n ¯ R13 − R31 , ⎩ ⎭ R21 − R12
⎧ ⎫ ⎨1/(R32 + R23 )⎬ n ¯ 1/(R13 + R31 ) , ⎩ ⎭ 1/(R21 + R12 )
where symbol indicates the parallelism of two vectors. Relating the rotation tensor to the matrix of direction cosines The rotation tensor and matrix of direction cosines are closely related to each other. e1 , e¯2 , e¯3 ) and Consider a rotation that brings basis I = (¯ı1 , ¯ı2 , ¯ı3 ) to basis E = (¯ let the matrix of direction cosines, D , eq. (4.1), deſne this rotation. Resolving the vector quantities in basis I then yields [I]
[I]
e¯1 = D ¯ı1 ,
(4.16)
[I]T
[I]T = 1, 0, 0 , ¯ı2 = 0, 1, 0 , and where the following identities were used: ¯ı1
[I]T ¯ı3 = 0, 0, 1 . On the other hand, if rotation tensor R rotates vector ¯ı1 to e¯1 , eq. (4.14) implies e¯1 = R ¯ı1 . Resolving this tensor relationship in basis I then yields [I]
[I]
e¯1 = R[I]¯ı1 .
(4.17)
Identifying eqs. (4.16) and (4.17) yields the relationship between the direction cosine matrix and the rotation tensor as D = R[I] .
(4.18)
The entries of the direction cosine matrix describing the rotation from I to E are identical to the components of the rotation tensor describing the same rotation and resolved in basis I.
4.7 Properties of the rotation tensor
115
Multiplicative decompositions of the rotation tensor Two multiplicative decompositions of the rotation tensor are now presented. The ſrst is the “square root,” G, of the rotation tensor, R, deſned as R = G G.
(4.19)
φ φ + (1 − cos ) n n . G = I + sin n 2 2
(4.20)
It is readily veriſed that
It is interesting to note that the “square root” of the rotation tensor corresponds to a rotation of φ/2 about axis n ¯ , and hence, is itself an orthogonal tensor, G GT = I. The following results then follow R − I = G G − G GT = G(G − GT ) = 2 sin
φ φ Gn G. = 2 sin n 2 2
(4.21)
The second multiplicative decomposition of the rotation tensor is φ R = I + tan 2 φ = I + tan 2 Note that
−T φ n I + tan n 2 −T φ n . I + tan n 2
−T " φ 1! I + tan n R+I . = 2 2
(4.22)
(4.23)
4.7 Properties of the rotation tensor Inspection of equation (4.15) reveals that symm(R) = I cos φ + (1 − cos φ)¯ nn ¯T , and skew(R) = n sin φ. It then follows that axial(R) = n ¯ sin φ. The invariants of the rotation tensor can also be directly evaluated from eq. (4.15) as I1 = tr(R) = 1 + 2 cos φ, I2 = 1 + 2 cos φ, and I3 = det(R) = 1. The characteristic equations, eq. (1.55), now becomes −λ3 + (1 + 2 cos φ)λ2 − (1 + 2 cos φ)λ + 1 = 0. Eigenvalues and eigenvectors of the rotation tensor The ſrst fundamental property of the rotation tensor is that it possesses a unit eigenvalue, λ = +1, associated with eigenvector n ¯ . Indeed, it follows from eq. (4.15) that Rn ¯=n ¯. (4.24)
116
4 The geometric description of rotation
This indicates that (λ − 1) should be a factor of the characteristic equation, which can indeed be written as (λ − 1)(λ2 − 2λ cos φ + 1) = 0. The other two eigenvalues √ of the rotation tensor are complex conjugate roots, cos φ ± i sin φ, where i = −1. In summary, the eigenvalues of the rotation tensor are λ1 = 1, λ2,3 = cos φ ± i sin φ = e±iφ . (4.25) Consider now two mutually orthogonal, unit vectors u ¯ and v¯, which lie in the plane normal to n ¯ , such that E = (¯ n, u ¯, v¯) forms an orthonormal basis. It is easily veriſed that Ru ¯ = cos φ u ¯ + sin φ v¯,
R v¯ = − sin φ u ¯ + cos φ v¯.
Clearly, vectors u ¯ and v¯ undergo a planar rotation of magnitude φ in the plane normal to the axis of rotation, n ¯ . Linear combinations of these two equations then leads to R(¯ u − iv) = (cos φ + i sin φ)(¯ u − i¯ v ),
R(¯ u + iv) = (cos φ − i sin φ)(¯ u + i¯ v ).
This reveals that (¯ u ∓ i¯ v) are the complex conjugate eigenvectors associated with the complex conjugate eigenvalues cos φ ± i sin φ, respectively. Orthogonality of the rotation tensor The second fundamental property of the rotation tensor is that it is an orthogonal tensor. Using eq. (4.15), it is readily veriſed that R RT = RT R = I,
(4.26)
which implies det(R) = ±1. in general, orthogonal tensors have a determinant of ±1. Equation (4.25) shows, however, that det(R) = λ1 λ2 λ3 = +1: the rotation tensor belongs to the class of proper orthogonal tensors for which det(R) = +1.
4.8 Change of basis operations 4.8.1 Vector components in various orthonormal bases Consider an orthonormal basis B 1 = (¯ı11 , ¯ı12 , ¯ı13 ), and an arbitrary vector a1 . Next, consider a rotation of magnitude φ about a unit vector n ¯ . The corresponding rotation tensor is denoted R. Let vectors ¯ı21 , ¯ı22 , ¯ı23 , and a2 be the vectors resulting from the application of rotation R to vectors ¯ı11 , ¯ı12 , ¯ı13 , and a1 , respectively. It is clear that vectors ¯ı21 , ¯ı22 , and ¯ı23 deſne a new orthonormal basis B 2 . Vectors a1 and a2 are related by eq. (4.14), i.e., a2 = R a1 . This tensor relationship is now resolved in basis B 1 to 1
1
1
ſnd a2[B ] = R[B ] a1[B ] . By construction, the components of vector a1 resolved in basis B 1 are identical 1 2 those of a2 resolved in basis B 2 , a1[B ] = a2[B ] . It follows that the relationship
4.8 Change of basis operations
117 1
between the components of vector a2 expressed in bases B 1 and B 2 is a2[B ] = 1 2 R[B ] a2[B ] . Because vector a1 is arbitrary, this relationship holds for any vector v, i.e., 1
v [B ] = R[B v
[B2 ]
=R
1
]
2
v [B ] ,
(4.27a)
[B1 ]T [B1 ]
(4.27b)
v
.
These equations express the transformation laws for the components of a ſrst-order tensor. In fact, the rigorous deſnition of a ſrst-order tensor is as follows. Deſnition 4.1 (ſrst-order tensor). A ſrst-order tensor is a mathematical entity whose components resolved in two bases are related by eqs. (4.27). 1
The component of vector n ¯ resolved in bases B 1 and B 2 are identical, n ¯ [B ] = [B2 ] n ¯ . Consequently, in view of eq. (4.15), the components of the rotation tensor that bring basis B 1 to B 2 , resolved in those two bases, are also identical 1
2
R[B ] = R[B ] .
(4.28)
4.8.2 Second-order tensor components in various orthonormal bases Consider now a second-order tensor such as T = a bT , where a and b are two arbitrary vectors. The components of this tensor in two distinct bases are T [B 1
[B1 ] [B ]T
2
[B ]
1
]
=
2
[B2 ] [B ]T
and T = a b . Using the transformation law for ſrst-order a b tensors, eq. (4.27), the transformation laws for the components of second-order tensors are found to be 2
T [B ] = R[B T
[B1 ]
1
]T
[B1 ]
=R
1
1
T [B ] R[B ] ,
(4.29a)
[B2 ]
(4.29b)
T
R
T [B1 ]
.
The rigorous deſnition of second-order tensors is as follows. Deſnition 4.2 (second-order tensor). A second-order tensor is a mathematical entity whose components resolved in two bases are related by eqs. (4.29). Example 4.1. First- and second-order tensors The previous sections have given precise deſnitions of ſrst- and second-order tensors as mathematical entities whose components resolved in two bases are related by eqs. (4.27) and (4.29), respectively. Consider a vector (ſrst-order tensor) whose components in two bases, B and B ∗ , are denoted a and a∗ , respectively. In the notation of the above sections, a = a[B] and ∗ a∗ = a[B ] . The simpliſed notation, a and a∗ , is clearly much simpler, provided that all symbols are clearly deſned. If the components of the rotation tensor that brings basis B to B ∗ , resolved in B, are denoted R, eq. (4.27) implies that a∗ = RT a. Consider now the skew symmetric operators a and a∗ formed with the components
118
4 The geometric description of rotation
of vector a resolved in bases B and B ∗ , respectively. Prove that a and a∗ are the components of a skew-symmetric, second-order tensor, a. If a is a second-order tensor, its components resolved in two bases must be related by eq. (4.29). Hence, the tensorial nature of a will be established if and only if a∗ = R T aR ⇐⇒ a∗ = RT a.
(4.30)
This statement can be proved based on simple, but tedious algebraic manipulations, taking into account the fact that R is an orthogonal tensor. Example 4.2. The rotation tensor The rotation tensor was introduced in section 4.6 and was called a “tensor.” Prove that the rotation tensor is indeed a second-order tensor. Euler’s theorem deſnes the rotation tensor in terms of a unit vector n ¯ about which a rotation of magnitude φ is taking place. Let n ¯ be the components of this unit vector resolved in basis B; the components of the rotation tensor, S, resolved in the same basis are then given by eq. (4.15) as S = I + sin φ n + (1 − cos φ) n n . Consider now an arbitrary basis B ∗ and let the components of unit vector n ¯, resolved in this basis, be denoted n ¯ ∗ = RT n ¯ , where R are the components of the rotation tensor that brings basis B to B ∗ , resolved in basis B. It then follows that n∗ RT + (1 − cos φ) R n ∗ R RT n ∗ RT S = I + sin φ R = R I + sin φ n ∗ + (1 − cos φ) n ∗ n ∗ RT = R S ∗ RT , where the orthogonality property of the rotation tensor was used together with eq. (4.30). Clearly, S ∗ = I + sin φ n ∗ + (1 − cos φ) n ∗ n ∗ are the components ∗ of the rotation tensor resolved in basis B , and the above result then provides the transformation rule for the components of the rotation tensor. This transformation rule is, as expected, the rule that characterizes the transformation of components of second-order tensors, see eq. (4.29). Hence, the tensorial nature of the rotation tensor is established. The proof of the tensorial nature of the rotation tensor rests on the deſnition of the rotation tensor provided by Rodrigues’ rotation formula, eq. (4.15), and on the tensorial nature of the unit vector about which the rotation is taking place, expressed as n ¯ = Rn ¯ ∗ . Consequently, the deſnition of the rotation tensor by Rodrigues’ rotation formula guarantees the following equivalence S = R S ∗ RT ⇐⇒ n ¯ = Rn ¯∗.
(4.31)
The components of the ſrst-order tensor, unit vector n ¯ , transform according to eqs. (4.27), and the components of the second-order tensor, S, transform according to eqs. (4.29). Example 4.3. Canonical basis for the rotation tensor In section 4.7, the following orthonormal basis was introduced E = (¯ n, u ¯, v¯),
(4.32)
4.8 Change of basis operations
119
where u ¯ and v¯ are mutually orthogonal unit vectors in the plane normal to n ¯ . Such basis is called a canonical basis for the rotation tensor, R. Let S be the rotation tensor that brings basis I to basis E; the components of this tensor resolved in basis I are S [I] = (¯ n[I] , u ¯[I], v¯[I] ). Because the rotation tensor is a second-order tensor, eq. (4.15) yields its components in basis E as R[E] = S [I]T R[I] S [I] ¯ [I] , cos φ u = S [I]T n ¯[I] + sin φ v¯[I] , − sin φ u ¯[I] + cos φ v¯[I] . Finally, the components of the rotation tensor in the canonical basis become ⎤ ⎡ 1 0 0 [E] (4.33) R = ⎣0 cos φ − sin φ ⎦ . 0 sin φ cos φ In this canonical form, the rotation tensor takes the expected form of the direction cosine matrix for a planar rotation, see eq. (4.4). When resolved in the same canonical basis, the components of rotation tensor G deſned by eq. (4.20) become ⎤ ⎡ 1 0 0 (4.34) G[E] = ⎣0 cos φ/2 − sin φ/2 ⎦ . 0 sin φ/2 cos φ/2 In this canonical form, rotation tensor G takes the expected form of the direction cosine matrix for a planar rotation of half angle, φ/2. 4.8.3 Tensor operations Sections 4.8.1 and 4.8.2 give formal deſnitions of ſrst- and second-order tensors. For completeness of the discussion, a formal deſnition of zeroth order tensors is also given. Deſnition 4.3 (Zeroth order tensor). A zeroth order tensor is a mathematical entity that remains invariant under a change of basis operation. Take, for instance, the mass of a particle. This scalar quantity is invariant under a change of basis operation and hence, is a zeroth order tensor. The length of a vector or the angle between two vectors are two other examples of scalar quantities that remain invariant under a change of basis and hence, are also zeroth order tensor. Chapter 1 deſnes a number of operations between vectors: the scalar product, the vector product and the tensor product, among others. A tensor operation is an operation using two or more tensors and resulting in another tensor. As a ſrst example of a tensor operation, consider the differential work deſned in eq. (3.8) as the scalar product of the force vector by the differential displacement of its point of application, dW = F T dr. Two analysts working with two
120
4 The geometric description of rotation
different bases, B and B ∗ , will write this differential work as dW = F T dr and dW ∗ = F ∗T dr∗ , respectively. The three numbers, F , representing the force vector in basis B are different from the three numbers, F ∗ , representing the same force vector in basis B ∗ . Similarly, the numbers representing the components of the differential displacement vector resolved in the two bases, dr and dr∗ , differ. Because the force and differential displacement vectors are ſrst-order tensors, their components in the two bases are related by eq. (4.27), i.e., F ∗ = RT F and dr ∗ = RT dr, respectively. It then follows that dW ∗ = F ∗T dr ∗ = F T R RT dr = F T dr = dW.
(4.35)
This well known results stems from the orthogonality of the rotation tensor, eq. (4.26). Because dW ∗ = dW , the differential work is a zeroth order tensor, i.e., a quantity that remains invariant under a change of basis. The same conclusion can be reached by looking at the deſnition of the scalar product, eq. (1.8), dW = F dr cos(F , dr) = F ∗ dr∗ cos(F ∗ , dr ∗ ) = dW ∗ . In this case, the invariance of the differential work under a change of basis stems from the fact that the length of a vector and the angle between two vectors are zeroth order tensors. In summary, the scalar product is an operation based on two ſrst-order tensors, which produces a zeroth order tensor. This proves that the scalar product is a tensor operation. While this proof seems rather technical, it has fundamental physical implications. Because the differential work is obtained from a scalar product, i.e., from a tensor operation, it is invariant under a change of basis and hence, is a physically meaningful quantity. Indeed, if the value of the differential work were to depend on the basis in which the force and differential displacement vectors are resolved, this quantity would have no physical meaning because two analysts using two different bases to represent the same vectors would ſnd two different values of the differential work. A second example of tensor operation is the moment of a force, deſned as the vector product of the position vector of the point of application of a force by the force vector itself, M = rF . Two analysts working with two different bases, B and B ∗ , will write this moment as M = rF and M ∗ = r∗ F ∗ , respectively. Here again, because the position and force vectors are ſrst-order tensors, their components in the two bases are related by eq. (4.27), i.e., r ∗ = RT r and F ∗ = RT F . The components of the moment are as follows, T M ∗ = r∗ F ∗ = R r RT F = RT rR RT F = RT rF = RT M ,
(4.36)
where eq. (4.30) and the property of orthogonality of the rotation tensor were used. The result of the vector product operation is a quantity, M , whose components obey the rules of transformation for ſrst-order tensors, eq. (4.27), M ∗ = RT M ; hence, the vector product is a tensor operation. The same conclusion can be reached by looking at the deſnition of the vector product, eq. (1.20), M = r F sin(r, F ) n ¯ = r ∗ F ∗ sin(r ∗ , F ∗ ) R n ¯∗ = ∗ R M . This is a tensor operation because the length of a vector and the angle between
4.8 Change of basis operations
121
two vectors are invariant under a change of basis. Furthermore, unit vector n ¯ , normal to vectors r and F , is a ſrst-order tensor, implying the following transformation rule for its components, n ¯ = Rn ¯∗. In summary, the vector product is an operation based on two ſrst-order tensors, which produces a ſrst-order tensor. This proves that the vector product is a tensor operation. As a corollary, the moment of a force, the vector product of the position vector of the point of application of a force by the force vector itself, is a physically meaningful quantity because its a ſrst-order tensor. It is left to the reader to verify that the various operations deſned in chapter 1 are indeed tensor operations, i.e., operations that are invariant under a change of basis, see problem 4.7. As a last example, consider the product of a zeroth order by a ſrstorder tensor, which deſnes the linear momentum vector, p = mv. The mass of the particle is a zeroth order tensor and its inertial velocity a ſrst-order tensor, implying v ∗ = RT v. It then follows that p∗ = mv ∗ = mRT v = RT mv = RT p.
(4.37)
Because the components of the linear momentum obey the rules of transformation for ſrst-order tensors, eq. (4.27), it is a ſrst-order tensor and hence, the product of a zeroth order by a ſrst-order tensor is a tensor operation. It follows that the linear momentum is a physically meaningful quantity. 4.8.4 The concept of tensor analysis Zeroth-, ſrst-, and second-order tensors are mathematical entities whose components resolved in different bases transform according to strict rules. Manipulation of tensors through tensor operations lead to new tensors. For instance, the vector product of the position vector of the point of application of a force by the force vector itself produces a new vector, the moment of the force. These is a rather abstract mathematical concepts have important physical implications. In fact, the use of tensors expresses the invariance of the laws of physics with respect to change of basis operations [3]. Consider, for instance, Newton’s second law, which states that the force and acceleration vectors must be parallel to each other and the ratio of their lengths must equal the mass of the particle. Clearly, Newton’s second law is invariant under a change of basis. Indeed, the condition of parallelism between the force and acceleration vectors is invariant under a change of basis. Furthermore, because the mass of the particle and the length of the force and acceleration vectors are three invariant quantities, the equality of the length ratio with the particle’s mass is also invariant under a change of basis. Using the vector formalism, Newton’s second law is written as F = ma. This law involves three tensors: a zeroth order tensor, the particle’s mass, and two ſrst-order tensors, the externally applied force vector and the particle’s acceleration vector. Furthermore, Newton’s second law uses tensor operations only: the product of the mass by the acceleration vector is indeed a tensor operation, the product of a zeroth order by a ſrst-order tensor, see eq. (4.37). The combined use of tensor quantities and
122
4 The geometric description of rotation
tensor operations guarantees the invariance of Newton’s second law under change of basis operations. Two analysts working with two different bases, B and B ∗ , will write Newton’s second law as F = ma and F ∗ = ma∗ , respectively. Yet both analysts express the same physical law: the force and acceleration vectors must be parallel and the ratio of their lengths must equal the mass of the particle. In summary, the laws of physics should be expressed in terms of tensors exclusively and should only involve tensor operations. When these two conditions are met, the invariance of the laws of physics under a change of basis is achieved. 4.8.5 Problems Problem 4.4. Geometric interpretation of tensor Prove eq. (4.19), where G is given by eq. (4.20). Give the geometric interpretation of this result.
Problem 4.5. Orthogonality of the rotation tensor Prove the orthogonality of the rotation tensor, eq. (4.26), based on its expression based on Euler theorem, eq. (4.15).
Problem 4.6. Base transformation for skew symmetric tensor
Prove eq. (4.30). Hint: remember that the rotation tensor is orthogonal, R−1 = RT .
Problem 4.7. Tensor operations (1) Prove that the product of a zeroth order tensor by a ſrst-order tensor is a tensor operation. (2) Prove that the product of a zeroth order tensor by a second-order tensor is a tensor operation. (3) Prove that the tensor product of two vectors, eq. (1.28), is a tensor operation. (4) Prove that the mixed product of three vectors is a tensor operation. (5) Let A be a second-order tensor and let A and A∗ its components in two bases, B and B∗ . Prove that the eigenvalues of A and A∗ are identical and that the eigenvectors of A are ſrst-order tensors.
Problem 4.8. Tensors R and G The components of rotation tensor R resolved in basis I are given as follows, ⎡ ⎤ 0.6272 −0.7305 0.2700 [I] R = ⎣−0.1268 −0.4379 −0.8900⎦ . 0.7684 0.5240 −0.3673 (1) Find the components of tensor G in the same basis such that R[I] = G[I] G[I] . (2) Verify that your answer is correct by evaluating the product G[I] G[I] .
Problem 4.9. Components of a vector in two bases Rotation tensor R brings basis I to basis E. The components of tensor R and vector a, both resolved in basis I, are given as follows, ⎫ ⎧ ⎡ ⎤ 0.2944 0.9433 −0.1536 ⎨ 7.54⎬ R = ⎣−0.9005 0.2199 −0.3751⎦ , a = −3.44 . ⎭ ⎩ −0.3200 0.2488 0.9142 1.77 (1) Find the components of vector a in basis E , denoted a∗ , as a∗ = RT a. (2) Verify that a∗ = R T aR.
4.8 Change of basis operations
123
Problem 4.10. Relationship among unit vectors of a basis The rotation tensor can be written as R = [¯ e1 , e¯2 , e¯3 ], where e¯1 , e¯2 , and e¯3 form an orthonormal basis. Show that e1 = e¯3 e¯T2 − e¯2 e¯T3 , e¯1 e¯T1
+
e¯2 e¯T2
+
e2 = e¯1 e¯T3 − e¯3 e¯T1 , e¯3 e¯T3
e3 = e¯2 e¯T1 − e¯1 e¯T2 ,
(4.38a) (4.38b)
= I.
Problem 4.11. Analysis of the projection operator Prove that the projection operator deſned in example 1.5 is a second-order tensor.
Problem 4.12. Analysis of the reƀection operator Prove that the reƀection operator deſned in problem 1.12 is a second-order tensor.
Problem 4.13. Rotation tensor in canonical form
(1) Compute the components of tensor R[E] in the canonical basis, E , deſned by eq. (4.32), i.e., verify eq. (4.33). (2) Compute the eigenvalues of R[E] , i.e., verify eq. (4.25).
Problem 4.14. Square root of rotation tensor (1) Compute the components of tensors R and G in the canonical basis, E , deſned by eq. (4.32), denoted R[E] and G[E] , respectively. (2) Verify eq. (4.19) by checking that R[E] = G[E] G[E] . (3) Find the kth root of the rotation tensor R, denoted Gk . Discuss the geometric meaning of this tensor.
Problem 4.15. Orthogonality in canonical form Verify the orthogonality property of the rotation tensor, R RT = I, by ſrst computing the components of R in the canonical basis, E, deſned by eq. (4.32), denoted R[E] , then checking that R[E] R[E]T = I.
Problem 4.16. Multiplicative decomposition of the rotation tensor Consider three rotation tensors, R, R1 , and R2 , corresponding to rotations of magnitude φ, ηφ, and (1 − η)φ, respectively, about the same unit vector, n ¯ , where η ∈ [0, 1]. Prove that R = R1 R2 = R2 R1 . Hint: write the three rotation tensors in their common canonical basis.
Problem 4.17. Properties of rotation tensors R and G Prove the following relationships. (R − I)(R + I)−1 = (R + I)−1 (R − I) = n tan φ/2, (G − I)(G + I)
−1
T −1
(I − R )(I + R ) T
T −1
(I − G )(I + G ) T
= (G + I)
−1
(G − I) = n tan φ/4,
(4.39a) (4.39b)
T −1
(I − R ) = n tan φ/2,
(4.39c)
T −1
(I − G ) = n tan φ/4.
(4.39d)
= (I + R )
= (I + G )
T
T
Problem 4.18. Multiplicative decomposition of rotation tensors R and G Prove the following relationships. R = (I − α n)−1 (I + α n) = (I + α n)(I − α n)−1 , n) G = (I − β
−1
(I + β n) = (I + β n)(I − β n)
−1
,
α = tan φ/2,
(4.40a)
β = tan φ/4.
(4.40b)
124
4 The geometric description of rotation
Problem 4.19. Properties of rotation tensors R and G Prove the following relationships. n) = 2I, (R + I)(I − α
α = tan φ/2,
(4.41a)
(G + I)(I − β n) = 2I,
β = tan φ/4.
(4.41b)
Problem 4.20. Properties of rotation tensors R and G Prove the following relationships. (R + I)−1 + (RT + I)−1 = I, (G + I)
−1
T
+ (G + I)
−1
(4.42a) (4.42b)
= I.
4.9 Composition of rotations Figure 4.7 shows three orthonormal bases, B 1 = (¯ı11 , ¯ı12 , ¯ı13 ), B 2 = (¯ı21 , ¯ı22 , ¯ı23 ), and B 3 = (¯ı31 , ¯ı32 , ¯ı33 ). Let i2 3 B the rotations of magnitude φ1 about a unit vector n ¯ 1 and 2 2 i i2 of magnitude φ2 about a unit vector n ¯ 2 , represented by R2 3 tensors R1 and R2 , respectively, express the rotations 3 i1 B2 from basis B 1 to B 2 and from basis B 2 to B 3 , respeci31 R tively. R1 i12 Application of the rotation operation, eq. (4.14), yields ¯ı21 = R1¯ı11 and ¯ı31 = R2¯ı21 . Eliminating ¯ı21 from 1 1 1 i1 B i2 these two expressions leads to ¯ı31 = R2 R1¯ı11 = R ¯ı11 , Fig. 4.7. Composition of ro- where R = R2 R1 is the rotation tensor that brings basis B 1 to B 3 . The operation that combines two rotations, tations. that from basis B 1 to B 2 and that from basis B 2 to B 3 , into a single rotation from basis B 1 to B 3 is called composition of rotations, a concept that was first addressed by Rodrigues [18]. Mathematically, the composition of two rotations is expressed by the multiplication of the corresponding rotation tensors. Finite rotations do not form a linear space: the expression “composition of finite rotation” is used to underline the fact these quantities are not additive. The tensor relationship, R = R2 R1 , can be resolved in any basis, in particular bases B 1 and B 3 , to find 3
i3
3
1
3
1
1
3
3
] [B ] R1 = R2[B ] R1[B ] . R[B ] = R[B ] = R[B 2
(4.43)
where eq. (4.28) was used to obtain the first equality. It is often convenient to resolve rotation tensor R2 in basis B 2 . The secondorder tensor component transformation law, eq. (4.29), relates the components of 1 1 ] [B2 ] [B1 ]T this tensor resolved in the two bases as R2[B ] = R[B R2 R1 . Introducing this 1 transformation into eq. (4.43) yields the additional result 1
3
1
2
R[B ] = R[B ] = R1[B ] R2[B ] .
(4.44)
4.9 Composition of rotations
125
Hence, the total rotation R from B 1 to B 3 can be expressed in two alternative ways 1
3
1
1
1
2
] [B ] R[B ] = R[B ] = R[B R1 = R1[B ] R2[B ] . 2
(4.45)
Note that the order in which the rotation tensors appear depends on the basis in which they are resolved. Example 4.4. Euler angles In section 4.4, Euler angles were deſned as the magnitudes of three successive planar rotations describing an arbitrary rotation, as illustrated in ſg 4.4. Considering the 3-1-3 sequence, rotation tensor Rφ rotates basis I to basis A, next, tensor Rθ brings basis A to basis B, and ſnally, tensor Rψ rotates basis B to E. The operations ¯1 , and e¯1 = R ¯b1 . Eliminating the can be summarized as: a ¯1 = R ¯ı1 , ¯b1 = R a φ
θ
ψ
intermediate bases then yields e¯1 = Rψ Rθ Rφ¯ı1 = R ¯ı1 , where R is the tensor that brings basis I to E. The statement R = Rψ Rθ Rφ is a tensor relationship that is true when expressed in any basis, provided that all tensors are resolved in the same basis; for instance, one could write R[I] = R[I] R[I] R[I] . In this expression, Rφ represents a planar rotation ψ θ φ and the components of this tensor resolved in basis I, denoted R[I] , are in the form φ of the direction cosine matrix given by eq. (4.6). Tensor Rθ also represents a planar
, are not of the form rotation, but its components resolved in basis I, denoted R[I] θ of a direction cosine matrix as given in eq. (4.4). However, the components of this tensor resolved in basis A, denoted R[A] , would be of the form given in eq. (4.4). θ The same remarks can be made about tensor Rψ : its components in basis B, denoted
R[B] , are of the form of the direction cosine matrix for a planar rotation as given by ψ eq. (4.6), whereas its components in basis I, denoted R[I] , are not. ψ The above discussion indicates that the evaluation of rotation tensor R will be easier if the tensor relationship R = Rψ Rθ Rφ is expressed in component form as R[I] = R[I] R[A] R[B] , a recursive application of eq. (4.44). This yields φ θ ψ R
[I]
⎤⎡ ⎤⎡ ⎡ cos ψ − sin ψ 0 cos φ − sin φ 0 1 0 = ⎣ sin φ cos φ 0⎦ ⎣0 cos θ − sin θ ⎦ ⎣ sin ψ cos ψ 0 0 0 0 1 0 sin θ cos θ
⎤ 0 0⎦ . 1
Performing the triple matrix multiplication yields the components of the rotation tensor in basis I; the result yields the entries of the direction cosine matrix deſned in eq. (4.74), as expected from eq. (4.18). Example 4.5. Time-dependent motion of a rigid body Consider a rigid body moving in three-dimensional space. In many computational schemes, it is necessary to track down the motion of the body by determining its actual position and orientation in space at various instants in time, as depicted in ſg. 4.8.
126
4 The geometric description of rotation
a3
e3 B0
e2 F
F
ui, Ri
E
x0, R 0
F O
a2
a1 I
i2
b3
u, R
Bi
i3
e1
A
b1
Bf B
b2
r, S
i1 Fig. 4.8. Time-dependent motion of a rigid body.
The following frames will be used in this problem: F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )], is an inertial frame, F E = [B0 , E = (¯ e1 , e¯2 , e¯3 )], a body attached frame that deſnes the conſguration of the rigid body in its reference conſguration (say at time t = 0), F A = [Bi , A = (¯ a1 , a ¯2 , a ¯3 )], a bodyattached frame that deſnes the conſguration of the body at ti , and ſnally F B = Bf , B = (¯b1 , ¯b2 , ¯b3 ) , a body attached frame that deſnes the conſguration of the body at tf . Typically, ti and tf would be the initial and ſnal times, respectively, for a time step of the computation that proceeds in an incremental manner. The position vector of point B0 of the rigid body with respect to point O is denoted x0 , and the orientation of the body is determined by rotation tensor R0 that brings basis I to basis E. Next, the position vector of point Bi with respect to point B0 is denoted ui and the corresponding orientation of the body is determined by rotation tensor Ri that brings basis E to basis A. Note that ui and Ri deſne the conſguration of the rigid body at time ti relative to that at time t = 0. The conſguration of the body with respect to the inertial frame would have to be obtained from a composition of the partial displacements and rotations. Finally, the incremental motion of the body from time ti to tf is deſned by position vector u of point Bf with respect to point Bi and rotation tensor R that brings basis A to basis B. Determine the inertial position and orientation of the body at time tf . The inertial position of the body is readily found by adding the various displacements to ſnd r = x0 + ui + u. This vector equation can be resolved in any basis, for instance the inertial basis. The various bases are related to each other through the corresponding rotation tensors: e¯1 = R0¯ı1 , a ¯1 = Ri e¯1 , and ¯b1 = R a ¯1 . Eliminating the intermediate bases ¯ yields b1 = R Ri R0¯ı1 = S ¯ı1 , where S = R Ri R0 is the rotation tensor that brings basis I to basis B. This tensor relationship can be expressed in component form as follows: S [I] = R[I] R[I] R[I] , where all tensors have been expressed in a common i 0 basis I, see eq. (4.43). It is sometimes more convenient to express each rotation tensor in the local basis; in that case eq. (4.44) yields: S [I] = R[I] Ri[E] R[A] . Note the reversing of the order 0 of the individual rotations depending of the basis in which the tensors are expressed. This behavior is a consequence of the nonlinear nature of rotation operations.
4.9 Composition of rotations
127
The operation of composing displacements corresponds to a simple addition of vectors. In contrast, the corresponding operation for rotations is far more complex: rotations cannot be added by simply adding “rotation vectors.” Rather, the components of the corresponding rotation tensors are multiplied, and the order in which the tensors appear depends on the bases in which their components are resolved. This fundamental difference is reƀected in the vocabulary: displacement vectors are added, rotations are composed. 4.9.1 Problems Problem 4.21. Sequence of rotations
Consider a sequence of n orthonormal bases denoted B1 , B2 , . . . Bk , . . . Bn . Let rotation tensor Rk deſne the rotation from Bk to Bk+1 . Rotation tensors Rk , k = 1, 2, . . . n − 1, then deſne the successive rotations between these bases. (1) Prove the following tensor relationship R = Rn−1 Rn−2 . . . R2 R1 , where rotation tensor R deſnes the rotation from basis B1 to basis Bn . (2) Prove that R[B
R[B
1
]
= R[B
n
]
1
2
n−2
= R1[B ] R2[B ] . . . R[B n−2
]
1
]
= R[B n−1
[B Rn−1
]
n
]
1
1
1
1
[B ] [B ] = Rn−1 Rn−2 . . . R2[B ] R1[B ] , and
.
Problem 4.22. Composition of rotations
Consider three orthonormal bases B, B0 , and B∗ . Let rotation tensor R0 describe the rotation from basis B to B0 and R that from basis B0 to B∗ . The components of tensors R0 and R resolved in basis B are ⎡ ⎤ ⎡ ⎤ 0.3258 −0.9377 −0.1212 0.2944 0.9433 −0.1536 R[B] = ⎣0.8683 0.3474 −0.3540⎦ , R[B] = ⎣−0.9005 0.2199 −0.3751⎦ . 0 0.3740 0.0101 0.9274 −0.3200 0.2488 0.9142 ∗
] Let Ri = R R0 . Prove the following relationships: (1) R[B] = R[B . (2) R[B] = R[B] R[B] . i i i 0
(3) R[B] = R[B] R[B0 ] . (4) Verify each relationship numerically by performing the matrix i 0 multiplications.
Problem 4.23. Robotic system with spinning disk The system depicted in ſg. 4.9 consists of a shaft of height h rigidly connected to an arm of length La and of a spinning disk of radius R mounted at the free end of the s , s¯ , s¯ ) is attached to the shaft at point S, and frame arm. Frame F S = S, S + = (¯ 1 2 3 F D = C, B∗ = (¯b1 , ¯b2 , ¯b3 ) is attached to the disk at point C. Superscripts (.)+ and (.)∗ indicate components of tensors resolved in bases S + and B∗ , respectively. Angle α(t) and β(t) are the magnitudes of the planar rotations about axis ¯ı3 and s¯1 , respectively, that bring basis I to S + and basis S + to B∗ , respectively. Tensors Rα and Rβ are the rotation tensors associated with those two rotations, and R is the rotation tensor that brings basis I to B∗ . If angles α(t) and β(t) are given, write compact expressions for the following tensor components: (1) Rα , R+ , and R∗α ; (2) Rβ , R+ , and R∗β . (3) Express R in terms of Rα and Rβ , (4) α β . express R in terms of Rα and R+ β
Problem 4.24. Relative rotation at a revolute joint Figure 4.10 depicts two rigid bodies denoted with superscripts (·)k and (·) , respectively, linked together by a revolute joint. In the reference conſguration, the orientation of the rigid
128
4 The geometric description of rotation l
s3 S FS
e2
e2
La
e1
k
l
f
k
e1
+
S
s2
b3 B
s1
C
i3 h
b(t) R
b1 O
i1 a(t)
b2
*
F
R
l
k
e3 = e3
k
k
i3 l
k
R0 = R0 i1
l
e01 = e01
i2
Fig. 4.9. Spinning disk mounted on a rotating arm.
l
k
e02 = e02
Final configuration
I
s1
l
e03 = e03
Rl
k
I
i2
Reference configuration
Fig. 4.10. Revolute joint in the reference and ſnal conſgurations.
bodies is deſned by coincident bases B0k = B0 . In the deformed conſguration, the orientations of the bodies are deſned by two distinct bases Bk and B , respectively. No relative displacement is permitted between the bodies that are allowed to rotate with respect to each other in such a way that e¯k3 = e¯3 . Rotation tensor Rk0 = R0 describes the rotation from I to B0k = B0 , tensor Rk that from B0k to Bk , and tensor R that from B0 to B . If ⎡
⎤ 0.1043 0.5561 0.8245 = ⎣−0.1873 0.8252 −0.5329⎦ ; −0.9767 −0.0989 0.1902 ⎡ ⎤ ⎡ ⎤ 0.6311 −0.7492 0.2010 0.6272 −0.7305 0.2700 [I] = ⎣0.0140 −0.2480 −0.9687⎦ , R = ⎣−0.1268 −0.4379 −0.8900⎦ , 0.7756 0.6141 −0.1460 0.7684 0.5240 −0.3673 R[I] 0
Rk[I]
ſnd the relative rotation, φ, of the revolute joint.
Problem 4.25. Rigid bodies connected by torsional springs Figure 4.11 shows two rigid bodies denoted with superscripts (·)k and (·) , respectively, linked together by torsional springs at a point. In the reference conſguration, the orientation of the rigid bodies is deſned by coincident bases B0k = B0 . In the deformed conſguration, the orientations of the bodies are deſned by two distinct bases Bk and B , respectively. No relative displacement is permitted between the bodies that are allowed to rotate with respect to each other in an arbitrary manner. Rotation tensor Rk0 = R0 describes the rotation from I to B0k = B0 , tensor Rk that from B0k to Bk , and tensor R that from B0 to B . Let R be the rotation tensor from Bk to B . The deformation of the torsional springs will be measured by the following vector ⎫ ⎧ kT e¯ e¯ − e¯kT ¯3 ⎬ 2 e 1 ⎨ 3kT 2 e¯ e¯ − e¯kT ¯1 . s= 3 e 2 ⎩ 1kT 3 ⎭ e¯2 e¯1 − e¯kT e 1 ¯2
4.10 Time derivatives of rotation operations
129
k
(1) Find the relationship between components s[B ] and the components of tensor R resolved in an appropriate basis. Clearly deſne this appropriate basis, and give the components of R in k
that basis. (2) Find the relationship between components s[B ] and the magnitude φ and unit axis n ¯ characterizing the rotation tensor R expressed in the previously deſned basis. k
l
e3
l
e2
e3
k
k
e2
R
B Bk Final configuration
l
e02 = e02
l
e1 l
k
l
e03 = e03
l
R i3
k
e1
k k
l
k
R0 = R0
I i1
i2
l
l
e01 = e01
B = B0 Reference configuration k 0
Fig. 4.11. Rigid bodies linked by torsional springs. For clarity of the ſgure, the reference and ſnal conſgurations have been translated with respect to each other.
4.10 Time derivatives of rotation operations Consider a ſxed orthonormal basis I = (¯ı1 , ¯ı2 , ¯ı3 ) and a time-dependent orthonormal basis E = (¯ e1 , e¯2 , e¯3 ). It is often the case that the orientation of this moving orthonormal basis depend on a scalar variable, say time t. If R(t) is the time-dependent rotation tensor that bring I to E, e¯1 (t) = R(t)¯ı1 . The time derivative of this expres˙ indicates a time derivative. Clearly, R(t) ˙ ı1 , where notation (·) ˙ sion is e¯˙ 1 (t) = R(t)¯ can be evaluated directly by taking a time derivative of the rotation tensor, eq. (4.15). The concept of angular velocity vector, however, considerably simpliſes this operation and will be explored in the next sections. 4.10.1 The angular velocity vector: an intuitive approach Consider a constant norm, time-dependent vector, b(t), and a rotation operation characterized by an instantaneous unit vector, n ¯ (t), and an inſnitesimal rotation, Δφ. Figure 4.12 shows the effect of this inſnitesimal rotation on the orientation of vector b: at time t, its orientation is b(t), at time t + Δt, its orientation is b(t + Δt). Because vector b is of constant norm, it sweeps the outer surface of a cone, whose summit is at the origin and its basis is a circle in a plane normal to n ¯ . The radius of the circle is r = b sin α, where α is the angle between vectors b and n ¯ . The increment in b is Δb = b(t + Δt) − b(t), a vector that lies in a plane normal to n ¯. If Δφ → 0, vector Δb becomes tangent to the circle, and hence, normal to b(t). In this case, Δb is normal to both n ¯ and b(t), and hence, Δb = c n b. The unknown constant, c, can be determined by taking the norm of both sides of this equation to
130
4 The geometric description of rotation
ſnd Δb = cb sin α. Because Δb = rΔφ, it follows that rΔφ = cr, and ſnally c = Δφ. The incremental change in vector b now becomes Δb = Δφ n b. By deſnition, the time derivative of vector b is b(t + Δt) − b(t) Δb Δφ = lim = lim n b = φ˙ n b˙ = lim b. Δt→0 Δt→0 Δt Δt→0 Δt Δt ˙ n is the angular velocity vector; the time derivative of vector b now Vector ω = φ¯ becomes b˙ = ω b. (4.46) This important relationship implies that the time derivative of a constant norm vector equals the vector product of the angular velocity vector by the constant norm vector itself. Clearly, the angular velocity vector will play a fundamental role in computing the time derivatives of vectors.
b(t) a I i1
e3
n
Df
i3
i3
Db
b(t+Dt) O
i2
Fig. 4.12. Rotation of a constant norm vector b.
i1
E
R(t) i2
I
e2
e1
O
Fig. 4.13. Fixed and rotating orthonormal bases.
The above result can also be obtained from eq. (4.14) that describes the rotation an arbitrary vector a. Let φ = 0 and φ = Δφ at time t = 0 and t = Δt, respectively. It follows that b(0) = a and b(Δt) = a + Δφ n a, where higher order terms were neglected. The increment in vector b is then Δb = Δφ n b, a result identical to that obtained above. Clearly, this result is valid for inſnitesimal rotations about φ = 0. If the axis n ¯ about which the rotation is taking place has a constant direction ˙ n = d(φ¯ in time, the angular velocity vector can be written as ω = φ¯ n)/dt, i.e., the angular velocity is the time derivative of vector φ¯ n. The results obtained above, however, are not limited to the case where axis n ¯ is of constant direction. In the ˙ n(t), and because φ(t) and n general case, ω = φ¯ ¯ (t) are independent functions of time, there exist no vector such that its derivative equals the angular velocity vector. The angular velocity vector is a nonholonomic vector, i.e., a vector that cannot be integrated. This contrasts with the expression of the velocity vector of a particle: the position vector is u(t) and the velocity v(t) = u. ˙ In this case, the integral of the velocity vector is the position vector. When it comes to the angular velocity, there exist no vector x such that ω = x. ˙
4.10 Time derivatives of rotation operations
131
4.10.2 The angular velocity vector: a rigorous approach The development presented in the previous section is limited to rotations about φ = 0; consequently, the resulting expressions are not general. In the present section, a rigorous deſnition of the angular velocity vector is derived by considering the timedependent orthonormal basis, E(t) = (¯ e1 , e¯2 , e¯3 ), depicted in ſg. 4.13. Deſnition of the angular velocity vector Because e¯1 is a unit vector e¯T1 e¯1 = 1 and a time derivative of this equation yields e¯T1 e¯˙ 1 = 0, i.e., vectors e¯˙ 1 and e¯1 must be perpendicular to each other. This implies the existence of vector a1 , such that e¯˙ 1 = a1 e¯1 .
(4.47)
To determine a1 , this equation is recast as e1 a1 = −e¯˙ 1 , a vector product equation for unknown vector a1 . In view of eq. (1.35), the solution of this equation is a1 = ω1 e¯1 + e1 e¯˙ 1 , where ω1 is an arbitrary constant; this solution exists because the right-hand side of the equation, −e¯˙ 1 , is orthogonal to the the null space of e1 , i.e., e¯T1 e¯˙ 1 = 0. Vectors e¯2 an e¯3 are also unit vectors and a reasoning similar to that developed above leads to a1 = ω1 e¯1 + e1 e¯˙ 1 , a2 = ω2 e¯2 + e2 e¯˙ 2 , and a3 = ω3 e¯3 + e3 e¯˙ 3 , where ω1 , ω2 , and ω3 are arbitrary constants. The components of vector a1 in the rotating basis E are readily found as e¯T1 a1 = ω1 , e¯T2 a1 = −e¯˙ T1 e¯3 and e¯T3 a1 = e¯˙ T1 e¯2 . A similar reasoning applied to vectors a2 and a3 then leads to a1 =
ω1 e¯1 − (e¯˙ T1 e¯3 ) e¯2 + (e¯˙ T1 e¯2 ) e¯3 ,
(4.48a)
a2 =
(e¯˙ T2 e¯3 ) e¯1 + ω2 e¯2 − (e¯˙ T2 e¯1 ) e¯3 ,
(4.48b)
a3 =
−(e¯˙ T3 e¯2 )
e¯1 +
(e¯˙ T3 e¯1 )
e¯2 + ω1 e¯3 ,
(4.48c)
where the last two equations were obtained by evaluating the components of vectors a2 and a3 in rotating basis E. Unit vectors e¯2 and e¯3 are mutually orthogonal, i.e., e¯T2 e¯3 = 0; a time derivative of this orthogonality condition implies e¯˙ T2 e¯3 = −e¯˙ T3 e¯2 . Since ω1 is arbitrary, it is possible to select ω1 = e¯˙ T2 e¯3 = −e¯˙ T3 e¯2 . The three arbitrary constants are selected as follows, ω1 = e¯˙ T2 e¯3 = −e¯˙ T3 e¯2 ,
ω2 = e¯˙ T3 e¯1 = −e¯˙ T1 e¯3 ,
ω3 = e¯˙ T1 e¯2 = −e¯˙ T2 e¯1 ,
(4.49)
where the last two equations stem from the orthogonality conditions, e¯T1 e¯3 = 0 and e¯T1 e¯2 = 0, respectively. Inspection of eqs. 4.48 and 4.49 then reveals that the three vectors a1 , a2 , and a3 are equal to each other, i.e., a1 = a2 = a3 = ω, where ω = (e¯˙ T2 e¯3 ) e¯1 + (e¯˙ T3 e¯1 ) e¯2 + (e¯˙ T1 e¯2 ) e¯3 .
(4.50)
This relationship provides a formal deſnition of the angular velocity vector. Since the quantities in parentheses are scalar products of vector, the angular velocity vector is indeed a ſrst-order tensor, because it is a linear combination of ſrst-order tensors.
132
4 The geometric description of rotation
Properties of the angular velocity vector The fundamental property of the angular velocity vector is its relationship to the time derivative of an orthonormal basis; indeed, eq. (4.47) now becomes e¯˙ 1 = ω e¯1 ,
e¯˙ 2 = ω e¯2 ,
e¯˙ 3 = ω e¯3 .
(4.51)
Clearly, these results are identical to those obtained in the previous section, see eq. (4.46): the vector product of the angular velocity vector by a constant norm vector yields the time derivative of the vector itself. The concept of angular velocity is associated with the time derivative of orthonormal basis E. Let b be an arbitrary vector attached to basis E, i.e., b = α1 e¯1 + α2 e¯2 + α3 e¯3 , where α1 , α2 , and α3 are time independent constants. The time derivative of this vector then becomes b˙ = α1 e¯˙ 1 + α2 e¯˙ 2 + α3 e¯˙ 3 = ω (α1 e¯1 + α2 e¯2 + α3 e¯3 ) = ω b. The time derivative of any vector attached to basis E is b˙ = ω b, i.e., the angular velocity vector characterizes the time derivative of the angular motion of the basis, not just that of a single unit vector. Because a one to one correspondence exists between the angular motion of an orthonormal basis and that of a rigid body, the angular velocity vector characterizes the time derivative of the angular motion of a rigid body. The following alternative expression, which presents a higher symmetry in the indices, can also be used to deſne the angular velocity vector ω=
1 T (¯ e3 e¯˙ 2 − e¯T2 e¯˙ 3 ) e¯1 + (¯ eT1 e¯˙ 3 − e¯T3 e¯˙ 1 ) e¯2 + (¯ eT2 e¯˙ 1 − e¯T1 e¯˙ 2 ) e¯3 . 2
(4.52)
The components of the angular velocity vector resolved in the rotating basis, E, denoted ω ∗ are ⎫ ⎧ T ⎨e¯3 e¯˙ 2 − e¯T2 e¯˙ 3 ⎬ 1 e¯T e¯˙ 3 − e¯T3 e¯˙ 1 . ω∗ = (4.53) ⎭ 2 ⎩ 1T ˙ e¯2 e¯1 − e¯T1 e¯˙ 2 Relating the angular velocity vector to the rotation tensor Let R be the rotation tensor that bring basis I to basis E; the components of this tensor in basis I are R = [¯ e1 , e¯2 , e¯3 ] and it then follows that ⎡ ⎤ 0 e¯T1 e¯˙ 2 e¯T1 e¯˙ 3 ˙ = ⎣e¯T2 e¯˙ 1 0 e¯T2 e¯˙ 3 ⎦ RT R e¯T3 e¯˙ 1 e¯T3 e¯˙ 2 0 ⎤ ⎡ (4.54) eT1 e¯˙ 3 − e¯T3 e¯˙ 1 ) 0 −(¯ eT2 e¯˙ 1 − e¯T1 e¯˙ 2 ) (¯ 1⎣ (¯ eT2 e¯˙ 1 − e¯T1 e¯˙ 2 ) 0 −(¯ eT3 e¯˙ 2 − e¯T2 e¯˙ 3 )⎦ . = 2 T ˙ T ˙ T ˙ T ˙ −(¯ e1 e¯3 − e¯3 e¯1 ) (¯ e3 e¯2 − e¯2 e¯3 ) 0 Comparing eq. (4.53) and (4.54) then yields ˙ ω ∗ = RT R.
(4.55)
4.10 Time derivatives of rotation operations
133
Because the angular velocity vector is a tensor, its components in the ſxed basis I are then obtained from eq. (4.29) as ˙ RT . ˙ T =R ω = Rω ∗ RT = R(RT R)R
(4.56)
This equation deſnes the angular velocity in terms of the rotation tensor and its time derivative. Since it is a tensor relationship, it is true in all bases, and could be taken as the deſnition of the angular velocity vector, although it more abstract and algebraic than the deſnition given by eq. (4.50), which is rooted in more geometric arguments. The results derived above can be recovered from purely algebraic manipulation. Let R(t) be the time-dependent rotation tensor that brings basis I to basis ˙ ¯ı1 (t) = E(t), e¯1 (t) = R(t)¯ı1 . A time derivative of this expression yields e¯˙ 1 = R T ˙ R R e¯1 (t). A time derivative of the orthogonality property of the rotation tensor, ˙ RT = −(R ˙ RT )T , which shows that tensor R ˙ RT is skew eq. (4.26), leads to R symmetric, as implied by eq. (4.56), which is taken to be the deſnition of the angular velocity vector. The time derivative of unit vector e¯1 then becomes e¯˙ 1 (t) = ω e¯1 (t), as expected from earlier developments. The components of this vector in basis E are now RT e¯˙ 1 (t) = RT ω e¯1 = RT ω R RT e¯1 = ω ∗¯ı1 . Explicit expression of the angular velocity vector The angular velocity vector can be expressed in terms of quantities φ(t) and n ¯ (t) that characterize the rotation. Introducing the rotation tensor, eq. (4.15), into eq. (4.56) and using identity (1.34c) yields ˙ n + sin φ n ω = φ ˙ + (1 − cos φ)( nn ˙ − n ˙ n ).
(4.57)
The angular velocity vector now becomes ω = φ˙ n ¯ + sin φ n ¯˙ + (1 − cos φ) nn ¯˙
(4.58)
˙ n, the result obtained with the simpliſed approach of the Note that for φ = 0, ω = φ¯ previous section. The time derivative of unit vector n ¯ about which the rotation takes place explicitly appears in the rigorous expression of the angular velocity vector. Because eq. (4.58) cannot be integrated in general, the angular velocity vector is a nonholonomic vector. Example 4.6. Angular velocity in terms of Euler angles Find the angular velocity vector expressed in terms of Euler angles and their time derivatives; use the 3-1-3 sequence to deſne the Euler angles. The components of the angular velocity vector resolved in the moving basis will be evaluated ſrst. In section 4.4, the rotation tensor expressed in terms of Euler angles was found to be R[I] = R[I] R[A] R[B] . In the following, the superscripts will φ θ ψ be dropped to simplify the writing. Equation (4.55) then yields ˙ R R +R R ˙ = (R R R )T (R ˙ R +R R R ˙ ) ω ∗ = RT R φ θ ψ φ θ ψ φ θ ψ φ θ ψ ˙ )(R R ) + RT (RT R ˙ )R + (RT R ˙ ). = (Rθ Rψ )T (RTφ R ψ θ θ ψ ψ φ θ ψ ψ
134
4 The geometric description of rotation
˙ represents the angular velocity associated with the ſrst planar rotation of the RTφ R φ 3-1-3 Euler angle sequence. It can be readily evaluated using elementary trigonomet˙ = φ˙ ı[I] . ric formulæ to ſnd RTφ R 3 φ Proceeding in a similar manner with the other terms leads to ω ∗ = [B] T ˙ [A] T ˙ [I] (Rθ Rψ ) (φı3 )(Rθ Rψ ) + Rψ (θ a1 )Rψ + (ψ˙b3 ), and ſnally ˙ ı[I]) + R[B]T (θ¯ ˙ a[A] ) + (ψ˙ ¯b[B] ), ω ∗ = (R[A] R[B] )T (φ¯ 3 1 3 θ ψ ψ
[I]T [B]T [A]T where ¯ı3 = ¯b3 = 0, 0, 1 and a ¯1 = 1, 0, 0 . Performing the matrix multiplications and casting the result in a matrix form leads to ⎧ ⎫ ⎡ ⎤ Sθ Sψ Cψ 0 ⎨ φ˙ ⎬ ω ∗ = H ∗3-1-3 θ˙ , with H ∗3-1-3 = ⎣Sθ Cψ −Sψ 0⎦ . (4.59) ⎩ ˙⎭ C 0 1 θ ψ Operator H ∗3-1-3 is called the tangent operator because it is tangent to the rotation manifold. Of course, the components of the angular velocity vector resolved in the ſxed basis could also be evaluated. Starting from eq. (4.56), the desired components are [I] [I] ˙ ˙ R ¯ı[I] ; in matrix form, this becomes found as ω = φ˙ ¯ı3 + θR ¯ı + ψR φ 1 φ θ 3 ⎧ ⎫ ⎨ φ˙ ⎬ ω = H 3-1-3 θ˙ , ⎩ ˙⎭ ψ
with
H 3-1-3
⎡ ⎤ 0 Cφ Sφ Sθ = ⎣0 Sφ −Cφ Sθ ⎦ . 1 0 Cθ
(4.60)
Sometimes, the angular velocity components of a rigid body are known, or have been computed from dynamical equations of motion. The orientation of the rigid body is then obtained by integration the following kinematical equations ⎧ ⎫ ⎡ ⎤ Cψ 0 Sψ ⎨ φ˙ ⎬ 1 ⎣ Sθ Cψ −Sθ Sψ 0 ⎦ . ω ∗ , with H ∗−1 = θ˙ = H ∗−1 3-1-3 3-1-3 ⎩ ˙⎭ Sθ −Cθ Sψ −Cθ Cψ Sθ ψ ⎧ ⎫ ⎨ φ˙ ⎬ ω, θ˙ = H −1 3-1-3 ⎩ ˙⎭ ψ
⎤ −Sφ Cθ Cφ Cθ Sθ 1 ⎣ Cφ Sθ Sφ Sθ 0 ⎦ . = Sθ Sφ −Cφ 0 ⎡
with
H −1 3-1-3
These relationships become singular when Sθ = 0; as was noted in section 4.4, singularities occurs when using Euler angles to represent rotations, for all possible sequence choices. Because ω ∗ = RT ω, it follows that R H ∗3-1-3 = H 3-1-3 , or R = H 3-1-3 H ∗−1 . 3-1-3
4.10 Time derivatives of rotation operations
135
4.10.3 The addition theorem Consider now the problem of two time-dependent bases rotating with respect to an inertial frame of reference, I. The ſrst basis is denoted E ∗ = (¯ e1 , e¯2 , e¯3 ) and the second B = (¯b1 , ¯b2 , ¯b3 ), as depicted in ſg. 4.14. Rotation tensor R1 brings basis I to basis E ∗ , and tensor R2 brings E ∗ to B. Clearly, the instantaneous orientation of bab2 e3 sis B with respect to basis I is a function of R2(t) * both tensors R1 and R2 . Similarly, the angular B e1 E i3 velocity of basis B with respect to I depends on b3 e2 the angular velocities of both bases E ∗ and B. b1 If the components of tensors R1 and R2 are R1(t) i2 resolved in basis I, I i1 ¯b1 = R R ¯ı1 = R RT R R ¯ı1 = R R∗¯ı1 , 2 1 1 1 2 1 1 2 Fig. 4.14. Two time-dependent bases, E and B.
where R∗2 are the components of tensor R2 resolved in basis E , see eq. (4.29). Superscript (·)∗ is used here to indicate tensor components resolved in basis E ∗ . The time derivative of unit vector ¯b1 now becomes ∗
¯b˙ 1 = (R ˙ R∗ + R R ˙ RT + R R ˙ ∗ )(R R∗ )T ¯b1 = (R ˙ ∗ R∗T RT )¯b1 . 1 1 2 1 2 1 2 1 1 1 2 2
(4.61)
The ſrst term of the last equality is the angular velocity of basis E ∗ with respect ˙ RT ). The components of this angular velocity to basis I, denoted ω 1 = axial(R 1 1 ˙ ∗ R∗T ) are the components of vector are resolved in basis I. Next, ω∗2 = axial(R 2 2 the angular velocity vector of basis B with respect to basis E ∗ , resolved in E ∗ . The second term of the last equality involves the components of the angular velocity vector, ω 2 = R1 ω∗2 , of basis B with respect to basis E ∗ , resolved in basis I, because ˙ ∗ R∗T RT = R ω R1 R ∗ RT = R1 ω∗2 = ω 2 . 1 2 2 1 2 1 The derivative of the unit vector, eq. (4.61), now reduces to ¯b˙ 1 = ( ω1 + ω 2 )¯b1 = ω ¯b1 , where ω = ω 1 + ω2 . (4.62) Vector ω is the angular velocity of basis B with respect to I. This result is know as the addition theorem. Theorem 4.2 (Addition theorem). The angular velocity of basis B with respect to basis I is the sum of the angular velocities of basis A with respect to I and of basis B with respect to A, where A is an arbitrary basis. The angular velocity of basis B with respect to I is ω = ω 1 + ω2 , where ω1 is the angular velocity of basis E ∗ with respect to basis I, and ω 2 is the angular velocity of basis B with respect to basis E ∗ . This is a tensor relationship can be expressed in [I] [I] [B] [B] any basis; for instance, ω [I] = ω 1 + ω 2 or ω [B] = ω1 + ω 2 . It would appear that angular velocity vector ω 2 is more naturally resolved in basis E ∗ ; its components are then denoted ω ∗2 . It is clearly incorrect, however, to write
136
4 The geometric description of rotation
ω = ω1 + ω∗2 , because it is wrong to add the components of two vectors resolved in different bases. In many applications, the angular velocity of the second basis will be deſned by its components in that basis, ω ∗2 ; in that case, the addition theorem states ω = ω 1 + R1 ω∗2 . In applications of the addition theorem, it is important to correctly identify and evaluate the angular velocities of the two bases at hand. For instance, it would be ˙ RT . incorrect to believe that the angular velocity of basis B with respect to E ∗ is R 2 2 Indeed, because R2 = R1 R∗2 RT1 , it follows that ˙ RT = (R ˙ R∗ RT + R R ˙ ∗ RT + R R∗ R ˙ T )(R R∗ RT )T R 2 2 1 2 1 1 2 1 1 2 1 1 2 1
˙ RT + R R ˙ T R (R R∗ )T ˙ ∗ R∗T RT + (R R∗ )R =R 1 1 1 1 1 1 2 2 1 2 1 2 ˙ T R (R R )T =ω 1 + R1 ω 2∗ RT1 + (R2 R1 )R 1 1 2 1 T
˙ RT = ω =ω 1 + ω 2 + R2 R1 R 1 + ω 2 + R2 ω 1T RT2 1 2 =ω 2 + (I − R2 )ω 1 . ˙ RT = ω Clearly, R 2 . This is due to the fact that although R2 is the rotation tensor 2 2 that rotates basis E ∗ to basis B, the components of this tensor are resolved in basis I. The components of the angular velocity of basis B with respect to basis E ∗ , resolved ˙ ∗ R∗T ), because R∗ are the components of the rotation tensor in E ∗ , are ω ∗2 = axial(R 2 2 2 that rotates basis E ∗ to basis B, resolved in basis E ∗ . Example 4.7. Angular velocity in terms of Euler angles In section 4.4, Euler angles were deſned as the magnitudes of three successive planar rotations that produce an arbitrary rotation, as shown in ſg 4.4. In example 4.6, expressions were derived for the components of the angular velocity vector in terms of Euler angles and their time derivatives. Derive these expressions using the addition theorem. According to this theorem, the angular velocity of basis E with respect to basis I is simply ω = ω φ + ω θ + ωψ , where ω φ is the angular velocity associated with the planar rotation that brings basis I to basis A, ωθ that associated with the planar rotation from basis A to B, and ωψ that associated with the planar rotation from basis B to E. It follows that ω∗ = ω [E] = ω φ + ωθ + ωψ ; while correct, this expression is not convenient to use because the partial angular velocities are all expressed in the same basis, E. Using the rules of transformation for the components of ſrst-order tensors, [A] [B] [E] eq. (4.27), yields ω ∗ = (R[A] R[B] )T ω φ + R[B]T ωθ + ωψ . Because each partial θ ψ ψ [A] [I] ˙ ı[I] , ω [B] = ω [A] = θ¯ ˙a[A] , rotation is a planar rotation, it is clear that ω = ω = φ¯ [E]
[E]
φ
[E]
φ
3
θ
θ
1
[B] ˙ ı[I] )+R[B]T (θ¯ ˙ a[A] )+(ψ˙ ¯b[B] ). = = ψ˙ ¯b3 , and ſnally, ω∗ = (R[A] R[B] )T (φ¯ 3 1 3 θ ψ ψ This expression is identical to that found in example 4.6. [E] ωψ
[B] ωψ
4.11 Euler angle formulas
137
4.10.4 Angular acceleration The angular velocity vector enables the evaluation of the derivative of a unit vector, see eq. (4.51). The second derivative of a unit vector then becomes ˙ e¯1 + ω e¯˙ 1 = ω ˙ e¯1 + ω ω e¯1 = (ω ˙ + ω ω )¯ e1 . e¨¯1 = ω
(4.63)
The angular acceleration vector is deſned as α = ω; ˙ it then follows that α+ω ω )¯ e1 . e¨¯1 = (
(4.64)
Consider now the problem of two time-dependent bases rotating with respect to an inertial frame of reference, I, as depicted in ſg. 4.14. The addition theorem, eq. (4.62), implies ¯b˙ 1 = ω ¯b1 , where ω = ω1 + ω2 . The second derivative of the unit vector now becomes ¨¯b1 = ( α+ω ω )¯b1 , where the angular acceleration of basis B with respect to basis I is α = ω˙ 1 + ω˙ 2 . (4.65) This result corresponds to the addition theorem for angular acceleration, and echoes the corresponding result for angular velocities, eq. (4.62). The angular acceleration of basis B with respect to I is α = ω˙ = ω˙ 1 + ω˙ 2 , where ω 1 is the angular velocity of basis E ∗ with respect to basis I and ω 2 is the angular velocity of basis B with respect to basis E ∗ . This tensor relationship is true in any [I] [I] [B] [B] basis, α[I] = ω˙ 1 + ω˙ 2 or α[B] = ω˙ 1 + ω˙ 2 . Of course, it would be incorrect ∗ ∗ to write α = ω˙ 1 + ω˙ 2 , where notation (·) indicates tensor components resolved in basis E ∗ , because it is wrong to add the components of two vectors resolved in different bases. If the angular acceleration is to be written in terms of ω∗2 , the following expression should be used instead α = ω˙ 1 + (R1 ω∗2 )· = ω˙ 1 + R1 ω˙ ∗2 + ω 1 R1 ω∗2 .
4.11 Euler angle formulas This section gives a summary of formulas used for the manipulation of rotation operations expressed in terms of Euler angles. As mentioned in section 4.4, twelve different sequences of planar rotations can be used to express an arbitrary rotation; of those twelve possible sequences, the four sequences starting with a rotation about the third axis will be detailed in this section. Arbitrary rotations from basis I = (¯ı1 , ¯ı2 , ¯ı3 ) to basis E = (¯ e1 , e¯2 , e¯3 ) will be considered, with two intermediate bases, A = (¯ a1 , a ¯2 , a ¯3 ) and B = (¯b1 , ¯b2 , ¯b3 ). The complete sequence of bases is as follows: I → A → B → E. The three Euler angles are denoted q1 , q2 , and q3 , and the array of Euler angles is then q T = q1 , q2 , q3 . For each sequence, the following information is given. 1. The matrix of direction cosines expressed in terms of the Euler angles, D = D(q). See section 4.4 and eq. (4.11) for the 3-1-3 sequence.
138
4 The geometric description of rotation
2. The Euler angles expressed in terms of the components of the direction cosine matrix, q1 = q1 (D), q2 = q2 (D), and q3 = q3 (D). See section 4.4 and eq. (4.12) for the 3-1-3 sequence. For computer implementation, it is convenient to use the function φ = atan2(sin q, cos q) that deſnes angle φ ∈ [−π, π]. 3. The tangent operators, H(q) and H ∗ (q), which express the components of the angular velocity vector resolved in basis I and E, respectively, in terms of the time derivatives of the Euler angles, i.e., ω = H(q)˙q and ω ∗ = H ∗ (q)˙q , respectively. See example 4.6 and eqs. (4.60) and (4.59), respectively, for the 3-1-3 sequence. 4. The inverses of the tangent operators are also given. 4.11.1 Euler angles: sequence 3-1-3 Euler angles with the 3-1-3 sequence are deſned as follows. 1. A planar rotation of magnitude φ, called precession, about axis ¯ı3 brings I to A. 2. A planar rotation of magnitude θ, called nutation, about axis a ¯1 brings A to B. 3. A planar rotation of magnitude ψ, called spin, about axis ¯b3 brings B to E.
The array of Euler angles is now q T = φ, θ, ψ . 1) The direction cosine matrix is ⎡ ⎤ Cφ Cψ − Sφ Cθ Sψ −Cφ Sψ − Sφ Cθ Cψ Sφ Sθ D3-1-3 = ⎣Sφ Cψ + Cφ Cθ Sψ −Sφ Sψ + Cφ Cθ Cψ −Cφ Sθ ⎦ . (4.66) Sθ Sψ Sθ Cψ Cθ 2) Euler angle expressed in terms of the direction cosine matrix components are ψ = atan2(D31 , D32 ), if D32 = 0, θ = atan2(D31 sin ψ + D32 cos ψ, D33 ),
(4.67)
φ = atan2(D13 , −D23 ). A singularity occurs when θ = 0 or π, . 3) The tangent operators are ⎡ ⎡ ⎤ ⎤ 0 Cφ Sφ Sθ Sθ Sψ Cψ 0 H 3-1-3 = ⎣0 Sφ −Cφ Sθ ⎦ , H ∗3-1-3 = ⎣Sθ Cψ −Sψ 0⎦ , Cθ 1 0 Cθ 0 1
(4.68)
respectively. 4) The inverses of the tangent operators are ⎤ ⎡ ⎡ ⎤ Cψ 0 −Sφ Cθ Cφ Cθ Sθ Sψ 1 ⎣ 1 ⎣ Sθ Cψ −Sθ Sψ 0 ⎦ . (4.69) Cφ Sθ Sφ Sθ 0 ⎦ , H ∗−1 = = H −1 3-1-3 3-1-3 Sθ S θ −C S −C C S Sφ −Cφ 0 θ ψ θ ψ θ
4.11 Euler angle formulas
139
4.11.2 Euler angles: sequence 3-2-3 Euler angles with the 3-2-3 sequence are deſned as follows. 1. A planar rotation of magnitude ψ, called precession, about axis ¯ı3 brings I to A. 2. A planar rotation of magnitude θ, called nutation, about axis a ¯2 brings A to B. 3. A planar rotation of magnitude φ, called spin, about axis ¯b3 brings B to E.
The array of Euler angles is now q T = ψ, θ, φ . 1) The direction cosine matrix is ⎡ ⎤ Cψ Cθ Cφ − Sψ Sφ −Cψ Cθ Sφ − Sψ Cφ Cψ Sθ D3-2-3 = ⎣Sψ Cθ Cφ + Cψ Sφ −Sψ Cθ Sφ + Cψ Cφ Sψ Sθ ⎦ . (4.70) −Sθ Cφ Sθ Sφ Cθ 2) Euler angle expressed in terms of the direction cosine matrix components are φ = atan2( D32 , −D31 ), if D31 = 0, θ = atan2(−D31 cos φ + D32 sin φ, D33 ),
(4.71)
ψ = atan2( D23 , D13 ). It is clear that when θ = 0 or π, a singularity occurs. 3) The tangent operators are ⎡ ⎡ ⎤ ⎤ 0 −Sψ Cψ Sθ −Sθ Cφ Sφ 0 H 3-2-3 = ⎣0 Cψ Sψ Sθ ⎦ , H ∗3-2-3 = ⎣ Sθ Sφ Cφ 0⎦ , Cθ 1 0 Cθ 0 1
(4.72)
respectively. 4) The inverses of the tangent operators are ⎤ ⎡ ⎡ ⎤ −Cψ Cθ −Sψ Cθ Sθ −Cφ Sφ 0 1 1 ⎣ −Sψ Sθ Cψ Sθ 0 ⎦ , H ∗−1 = ⎣ Sθ Sφ Sθ Cφ 0 ⎦ . (4.73) = H −1 3-2-3 3-2-3 Sθ Sθ Cψ Cθ Cφ −Cθ Sφ Sθ Sψ 0 4.11.3 Euler angles: sequence 3-2-1 Euler angles with the 3-2-1 sequence are commonly used in airplane ƀight mechanics formulations and are deſned as follows. 1. A planar rotation of magnitude ψ, called heading, about axis ¯ı3 brings I to A. 2. A planar rotation of magnitude θ, called attitude, about axis a ¯2 brings A to B. 3. A planar rotation of magnitude φ, called bank, about axis ¯b1 brings B to E.
The array of Euler angles is now q T = ψ, θ, φ . 1) The direction cosine matrix is ⎡ ⎤ Cψ Cθ −Sψ Cφ + Cψ Sθ Sφ Sψ Sφ + Cψ Sθ Cφ D3-2-1 = ⎣ Sψ Cθ Cψ Cφ + Sψ Sθ Sφ −Cψ Sφ + Sψ Sθ Cφ ⎦ . (4.74) −Sθ Cθ Sφ Cθ Cφ
140
4 The geometric description of rotation
2) Euler angle expressed in terms of the direction cosine matrix components are φ = atan2( D32 , D33 ),
if D33 = 0, (4.75)
θ = atan2(−D31 , D32 sin φ + D33 cos φ), ψ = atan2( D21 , D11 ). It is clear that when θ = π/2 or 3π/2, a singularity occurs. 3) The tangent operators are ⎡ ⎡ ⎤ 0 −Sψ Cψ Cθ −Sθ 0 H 3-2-1 = ⎣0 Cψ Sψ Cθ ⎦ , H ∗3-2-1 = ⎣ Cθ Sφ Cφ Cθ Cφ −Sφ 1 0 −Sθ
⎤ 1 0⎦ , 0
(4.76)
respectively. 4) The inverses of the tangent operators are ⎤ ⎡ ⎡ ⎤ Cφ C S S S C 0 Sφ 1 ⎣ ψ θ ψ θ θ⎦ 1 ⎣ −1 ∗−1 −Sψ Cθ Cψ Cθ 0 , H 3-2-1 = 0 Cθ Cφ −Cθ Sφ ⎦ . (4.77) H 3-2-1 = Cθ Cθ Cψ Sψ 0 Cθ Sθ Sφ Sθ Cφ 4.11.4 Euler angles: sequence 3-1-2 Euler angles with the 3-1-2 sequence are deſned as follows. 1. A planar rotation of magnitude φ about axis ¯ı3 brings I to A. 2. A planar rotation of magnitude θ about axis a ¯1 brings A to B. 3. A planar rotation of magnitude ψ about axis ¯b2 brings B to E.
The array of Euler angles is now q T = φ, θ, ψ . 1) The direction cosine matrix is ⎡ ⎤ Cφ Cψ − Sφ Sθ Sψ −Sφ Cθ Cφ Sψ + Sφ Sθ Cψ D3-1-2 = ⎣Sφ Cψ + Cφ Sθ Sψ Cφ Cθ Sφ Sψ − Cφ Sθ Cψ ⎦ −Cθ Sψ Sθ Cθ Cψ
(4.78)
2) Euler angle expressed in terms of the direction cosine matrix components are ψ = atan2(−D31 , D33 ),
if D33 = 0;
θ = atan2( D32 , −D31 sin ψ + D33 cos ψ); φ = atan2(−D12 , D22 ). It is clear that when θ = π/2 or 3π/2, a singularity occurs. 3) The tangent operators are ⎡ ⎡ ⎤ ⎤ 0 Cφ −Sφ Cθ −Cθ Sψ Cψ 0 ∗ 0 1⎦ , H 3-1-2 = ⎣0 Sφ Cφ Cθ ⎦ , H 3-1-2 = ⎣ Sθ Cθ Cψ Sψ 0 1 0 Sθ
(4.79)
(4.80)
4.11 Euler angle formulas
141
respectively. 4) The inverses of the tangent operators are ⎤ ⎡ ⎡ ⎤ Cψ Sφ Sθ −Cφ Sθ Cθ −Sψ 0 1 1 ⎣Cφ Cθ Sφ Cθ 0 ⎦ , H ∗−1 = ⎣Cθ Cψ 0 Cθ Sψ ⎦ . (4.81) H −1 = 3-1-2 3-1-2 Cθ Cθ −Sφ Cφ 0 Sθ Sψ Cθ −Sθ Cψ 4.11.5 Problems Problem 4.26. Angular velocity for 3-2-3 Euler angles A popular choice of Euler angles is the 3-2-3 sequence that corresponds to the following sequence of planar rotations. First, a rotation of magnitude ψ about axis ¯ı3 , called precession, brings basis I to A. Second, a rotation of magnitude θ about axis a ¯2 , called nutation, brings basis A to B. Finally, a rotation of magnitude φ about axis ¯b3 , called spin, brings basis B to E. (1) Find the angular velocity vector associated with this rotation. (2) Determine the components of this vector in the ſxed and moving bases. (3) Discuss the occurrence of singularities.
Problem 4.27. Angular velocity for 3-2-1 Euler angles A popular choice of Euler angles for airplane ƀight mechanics is the 3-2-1 sequence that corresponds to the following sequence of planar rotations. First, a rotation of magnitude ψ about axis ¯ı3 , called heading, brings basis I to A. Second, a rotation of magnitude θ about axis a ¯2 , called attitude, brings basis A to B. Finally, a rotation of magnitude φ about axis ¯b1 , called bank, brings basis B to E. (1) Find the angular velocity vector associated with this rotation. (2) Determine the components of this vector in the ſxed and moving bases. (3) Discuss the occurrence of singularities.
Problem 4.28. Angular velocity for 3-1-2 Euler angles A choice of Euler angles is the 3-1-2 sequence that corresponds to the following sequence of planar rotations. First, a rotation of magnitude φ about axis ¯ı3 brings basis I to A. Second, a rotation of magnitude θ about axis a ¯1 brings basis A to B. Finally, a rotation of magnitude ψ about axis ¯b2 brings basis B to E. (1) Find the angular velocity vector associated with this rotation. (2) Determine the components of this vector in the ſxed and moving bases. (3) Discuss the occurrence of singularities.
Problem 4.29. Spinning disk on a rotating arm The system depicted in ſg. 4.9 consists of a shaft of height h rigidly connected to an of radius R mounted at the free end of the arm. arm of length La and of a spinning disk Frame F S = S, S + = (¯ s1 , s¯2 , s¯3 ) is attached to the shaft at point S, whereas frame F D = C, B∗ = (¯b1 , ¯b2 , ¯b3 ) is attached to the disk at point C. Superscripts (.)+ and (.)∗ will be used to denote tensor components in bases S + and B∗ , respectively. Angle α(t) and β(t) are the magnitudes of the planar rotations about axis ¯ı3 and s¯1 , respectively, that bring basis I to S + and basis S + to B∗ , respectively. (1) Find the angular velocity vector of basis B∗ with respect to basis I. (2) Find the components of this vector resolved in basis I, then resolved in B∗ . (3) Find the angular acceleration vector of basis B with respect to basis I. (4) Find the components of this vector resolved in basis I, then in basis B∗ .
Problem 4.30. Alternative expression of the angularvelocity vector
Show that the angular velocity vector can be written as ω = e1 e¯˙ 1 + e2 e¯˙ 2 + e3 e¯˙ 3 /2. Give a geometric interpretation of this result.
142
4 The geometric description of rotation
Problem 4.31. Components of the angular velocity vector in the rotating basis Based on eqs. (4.55) and (4.15), show that the components of the angular velocity vector in ¯ + sin φ n ¯˙ + (1 − cos φ)n ˙ n ¯ . Compare your result the rotating basis can be written as ω ∗ = φ˙ n with eq. (4.58).
Problem 4.32. Derivatives with respect to the rotation parameters This section has focused on the time derivatives of the rotation tensor. In some applications, derivatives of the rotation tensor with respect to the rotation parameters are needed. Let R = R(q), where q T = q1 , q2 , q3 are the rotation parameters that could be, for instance, Euler angles with a speciſc sequence of planar rotations, as discussed in section 4.11. Let ∂R = Ri , ∂qi
i = 1, 2, 3.
(1) Show that Ri RT = hi , i = 1, 2, 3, where hi are the columns of the tangent operator H, ˙ RT . (2) Show that RT R = i.e., H = [h , h , h ], ω = H q˙ and ω = R h∗i , i = 1, 2, 3, 1
2
3
i
where h∗i are the columns of the tangent operator H ∗ , i.e., H ∗ = [h∗1 , h∗2 , h∗3 ], ω ∗ = H ∗ q˙ and ˙ (3) If vector u is not a function of q and u∗ = RT u, show that ω ∗ = RT R. ∂u∗ =u ∗ H ∗ = RT u H. ∂q
(4.82)
(4) If vector u∗ is not a function of q and u = R u∗ , show that ∂u =u T H = R u ∗T H ∗ . ∂q
(4.83)
Problem 4.33. Derivatives of angular velocity with respect to the rotation parameters Prove the following two identities ∂ω ∗ ˙ = ∂ω + ω H H = R , ∂q ∂q
(4.84)
∗ ∂ω ˙ ∗ = ∂ω − ω H ∗ H ∗ = RT . ∂q ∂q
(4.85)
Hint: be familiar with the results of the previous problem. First show that h1 h2 = R2 RT1 − h1 h2 = (R21 RT + R2 RT1 ) − (R12 RT + R1 RT2 ). R1 RT2 , and because R12 = R21 , show that The following relationships result ∂h2 ∂h1 − , h1 h2 = ∂q1 ∂q2
∂h3 ∂h2 h2 h3 = − , ∂q2 ∂q3
∂h1 ∂h3 h3 h1 = − . ∂q3 ∂q1
Combining these equations then yields ∂h1 = H1 − h1 H, ∂q
∂h2 = H2 − h2 H, ∂q
∂h3 = H3 − h3 H, ∂q
where notation H i = ∂H/∂qi was introduced. Application of the chain rule for derivatives then leads to the desired identities.
4.12 Spatial derivatives of rotation operations
143
4.12 Spatial derivatives of rotation operations In the previous sections, time-dependent rotations were treated. Space-dependent rotations will be treated in a similar manner in this section. In fact, space-dependent rotations were encountered in chapter 2 when dealing with path coordinates, surface coordinates, and orthogonal curvilinear coordinates, as discussed in sections 2.3, 2.5, and 2.7, respectively. Consider now a space-dependent orthonormal basis E(s) = (¯ e1 , e¯2 , e¯3 ). If R(s) is the space-dependent rotation tensor that bring I to E(s), e¯1 (s) = R(s)¯ı1 . A spatial derivative of this expression yields e¯1 (s) = R¯ı1 (t) = R RT e¯1 (s) = κ e¯1 (s), where κ = R RT and notation (·) indicates a derivative with respect to the spatial variable s. κ is the curvature vector, and by analogy with eq. (4.58), is expressed as ¯ + sin φ n ¯ + (1 − cos φ) nn ¯. κ = φ n
(4.86)
This result is similar to that obtained for the angular velocity vector, eq. (4.58): ˙ is replaced by the spatial derivative, (·) . The curvature vector the time derivative, (·), resolved in basis E is κ∗ = RT κ, and κ ∗ = RT R . 4.12.1 Path coordinates Section 2.2 studies the differential geometry of curves in three-dimensional space. Unit vector t¯ was shown to deſne the tangent to the curve at a point, vector n ¯ to be normal to the curve at the same point, and the binormal vector ¯b was selected to be orthogonal to the two other vectors, see ſg. 2.2. In section 2.2.1, vectors t¯, n ¯ , and ¯b were shown to form an orthonormal basis, F = (t¯, n ¯ , ¯b), called Frenet’s triad. The following orthogonal tensor is now deſned ¯ , ¯b . F (s) = t¯, n (4.87) This tensor can be interpreted as the space-dependent rotation tensor that bring the reference triad I, to Frenet’s triad F. The curvature tensor, κ ∗ , of the curve is deſned as dF FT (4.88) =κ ∗ . ds With the help of eqs. (2.13), the curvature vector becomes 1 1 , 0, . (4.89) κ∗T = τ ρ Clearly, the twist and curvature of the curve, deſned in eqs. (2.12) and (2.7), respectively, are the two non-vanishing components of the curvature vector resolved in Frenet’s triad. The components of the curvature vector resolved in the reference frame I are then κ = F κ∗ .
144
4 The geometric description of rotation
Finally, let curvilinear variable s be a function of time. Frenet’s triad now becomes an implicit function of time, F (t) = F (s(t)). Using the chain rule for derivatives, the angular velocity of Frenet’s triad, resolved in basis F, is now ω ∗ = F T F˙ = F T
dF s˙ = s ˙ κ∗ . ds
(4.90)
This implies ω∗ = sκ ˙ ∗ : the angular velocity vector is parallel to the curvature vector. 4.12.2 Surface coordinates In the study of the differential geometry of surfaces in three-dimensional space, see section 2.4, unit vectors e¯1 and e¯2 were shown to deſne the plane tangent to the surface at a point and vector n ¯ to be normal to the surface at the same point, see ſg. 2.7. In section 2.4.6, vectors e¯1 , e¯2 , and n ¯ were shown to form an orthonormal basis B = (¯ e1 , e¯2 , n ¯ ), when using lines of curvature. The following orthogonal tensor is now deſned F (η1 , η2 ) = [¯ e1 , e¯2 , n ¯] .
(4.91)
This tensor can be interpreted as the space-dependent rotation tensor that brings the reference triad I to triad B. The curvature tensors of the surface are now deſned as FT
∂F =κ ∗1 , ∂s1
FT
∂F =κ ∗2 . ∂s2
(4.92)
With the help of Gauss’ and Weingarten’s formulæ, eqs. (2.65), the curvature vectors are found as 1 1 1 1 ∗T ∗T , , 0, , κ2 = . (4.93) κ1 = − 0, R1 T1 R2 T2 The principal radii of curvature, see eqs. (2.54), and the twists, eqs. (2.58) and (2.59), of the surface, are the components of the curvature vectors resolved in frame B. The components of the curvature vectors resolved in frame I are then κ1 = F κ∗1 and κ2 = F κ∗2 . If curvilinear coordinates s1 and s2 are functions of time, tensor F becomes an implicit function of time, F (t) = F (s1 (t), s2 (t)). The angular velocity of tensor F , resolved in basis B, is now ∂F ∂F ω ∗ = F T F˙ = F T s˙ 1 + s˙ 2 = s˙ 1 κ ∗1 + s˙ 2 κ ∗2 , (4.94) ∂s1 ∂s2 where the chain rule for derivatives was used. This implies ω ∗ = s˙ 1 κ∗1 + s˙ 2 κ∗2 : both curvature vectors contribute to the total angular velocity of basis B. In terms of the surface coordinates, eqs. (4.92) imply ∂F /∂η1 = F h1 κ ∗1 , and ∗ ∂F /∂η2 = F h2 κ 2 , where h1 and h2 are the scale factors introduced in eq. (2.56).
4.12 Spatial derivatives of rotation operations
145
Taking partial derivatives with respect to η2 and η1 of the ſrst and second equations, respectively, leads to h1 h 2 κ ∗2 κ ∗1 +
∂ ∂ (h1 κ∗1 ) = h1 h2 κ ∗1 κ∗2 + (h2 κ ∗2 ), ∂η2 ∂η1
because ∂ 2 F /∂η1 ∂η2 = ∂ 2 F /∂η2 ∂η1 . With the help of identity (1.33a), the Gauss-Codazzi conditions are then obtained 1 ∂ ∗ ∗ κ∗ = 1 ∂ (h κ (h2 κ∗2 ). 1 κ1 ) − 1 2 h1 ∂s2 h2 ∂s1
(4.95)
More explicitly, this vector condition gives rise to three scalar conditions ∂ h2 h2 ( )= , ∂s1 R2 R1 T2 ∂ h1 h1 ( )= , ∂s2 R1 R2 T1 1 ∂ h1 1 ∂ h2 1 ( )+ ( )+ = 0. h1 ∂s2 T1 h2 ∂s1 T2 R1 R2
(4.96a) (4.96b) (4.96c)
These equations express three conditions that must be satisſed by the radii of curvature, twists, and their spatial derivatives. 4.12.3 Orthogonal curvilinear coordinates In the study of the differential geometry of a mapping of the three-dimensional space onto itself, see section 2.6, vectors e¯1 , e¯2 , and e¯3 were deſned along the base vectors of the mapping. In section 2.6.2, these vectors were shown to form an orthonormal triad, E, in the case of orthogonal curvilinear coordinate systems. The following orthogonal tensor is now deſned F (η1 , η2 , η3 ) = [¯ e1 , e¯2 , e¯3 ] .
(4.97)
This tensor can be interpreted as the rotation tensor that brings the reference triad, I, to orthonormal triad E. The curvature tensors of the orthogonal curvilinear coordinate system are now deſned as ∂F ∂F ∂F FT =κ ∗1 , F T = κ∗2 , F T =κ ∗3 . (4.98) ∂s1 ∂s2 ∂s3 With the help of eqs. (2.84), these curvature vectors are found to be 1 1 1 1 1 1 , , 0, , , 0 , (4.99) , κ∗2 = , κ∗3 = κ∗1 = 0, R12 R13 R21 R23 R31 R32 where the radii of curvatures of the coordinate system were deſned in eqs. (2.85). The various derivatives of the scale factors are the components of the curvature vectors resolved in frame B. The components of the curvature vectors resolved in frame I are then κ1 = F κ∗ , κ2 = F κ∗2 , and κ3 = F κ∗3 .
146
4 The geometric description of rotation
If the curvilinear coordinates s1 , s2 , and s3 are functions of time, tensor F becomes an implicit function of time, F (t) = F (s1 (t), s2 (t), s3 (t)). The angular velocity of tensor F , resolved in basis B, is now ∂F ∂F ∂F ω ∗ = F T F˙ = F T s˙ 1 + s˙ 2 + s˙ 3 = s˙ 1 κ ∗1 + s˙ 2 κ ∗2 + s˙ 3 κ ∗3 , (4.100) ∂s1 ∂s2 ∂s3 where the chain rule for derivatives was used. This implies ω∗ = s˙ 1 κ∗1 + s˙ 2 κ∗2 + s˙ 3 κ∗3 : the three curvature vectors contribute to the total angular velocity of basis B. The various components of curvature are not independent of each other; following the process outlined in the previous section, relationships similar to the GaussCodazzi conditions can be readily derived. Example 4.8. Motion of a particle on a curve Figure 4.15 depicts a particle sliding along curve C embedded in a rigid body. The curvilinear variable along the curve is denoted s. The rigid body is moving with respect to an inertial frame of reference, F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. The conſguration of the rigid body is deſned by the body attached frame, F B = B, B ∗ = (¯b1 , ¯b2 , ¯b3 ) . Superscript (·)∗ indicates the components of tensors resolved in basis B ∗ . The components of the position vector of point B with respect to point O, resolved in basis I, are denoted r B , and R(t) are the components of the rotation tensor that brings basis I to B ∗ , resolved in basis I.
b3
b1
B B
i3
P + F
p n
rB(t), R(t) O
b
s
*
b2 rP
I
C
t
i2
i1 Fig. 4.15. Rigid body with an embedded curve.
Let point P be a point along curve C; the curvilinear variable at the location of point P is denoted s. The position vector of point P with respect to point B is denoted p, and the components of this vector resolved in basis B ∗ are denoted p∗ . Because curve C is embedded in the rigid body, its shape is deſned by the position vector of a point on the curve, p∗ = p∗ (s); clearly, the components of this position vector are most naturally resolved in the body attached basis, B ∗ . Find the position, velocity, and acceleration vector of point P. The inertial position vector of point P is r P = rB + p = r B + R p∗ , where r P are the components of the position vector of point P with respect to point O, resolved in basis I.
4.12 Spatial derivatives of rotation operations
147
The component of the inertial velocity vector of point P resolved in basis I are readily obtained by taking a time derivative of the position vector to ſnd ˙ p∗ + R p˙ ∗ = v + ω vP = vB + R R p∗ + sR ˙ t¯∗ , B 0 ˙ RT ) are the comwhere v B = r˙ B is the inertial velocity of point B, ω = axial(R ponents of the angular velocity vector of the rigid body resolved in basis I, and t¯∗ the component of the unit tangent vector to the curve resolved in basis B ∗ . The ſrst term of this expression stems from the translation of the rigid body, and the second from its rotation. The last term describes the velocity associated with the sliding of the particle along the curve. The components of the inertial velocity vector resolved in the body attached basis, B ∗ , now become RT v P = RT v B + ω ∗ p∗ + s˙ t¯∗ . Note that the expression for the components of the inertial velocity of point P is simpler when resolved in the body attached basis, B ∗ , than when resolved in the inertial basis, I. This is expected, because the quantities associated with curve C, p∗ and t¯∗ , are most naturally expressed in basis B ∗ . Next, the components of the inertial acceleration of point P resolved in basis I are obtained by taking a derivative of the velocity components in the same basis to ſnd ˙ p∗ + ω ˙ t¯∗ + sR aP = aB + ω ˙ R p∗ + ω R R p˙ ∗ + s¨R t¯∗ + s˙ R ˙ t¯˙∗ s˙ 2 Rn = aB + (ω ˙ + ω ω )R p∗ + 2s˙ ω R t¯∗ + s¨R t¯∗ + ¯∗, ρ ¯ ∗ are the component of where aB = v˙ B is the inertial acceleration of point B and n ∗ the unit normal vector to the curve resolved in basis B . Here again, the components of the inertial acceleration vector are simpler when expressed in the body attached basis, s˙ 2 ∗ T RT aP = RT aB + (R n ¯ . ω˙ + ω ∗ω ∗ )p∗ + 2s˙ ω ∗ t¯∗ + s¨t¯∗ + ρ The ſrst two terms represent the contributions of the translation and rotation of the rigid body, respectively. The third term is the Coriolis acceleration. Finally, the last two terms are the acceleration of the particle with respect to the rigid body, which in this case, are the acceleration of the particle obtained using path coordinates, see eq. (2.34). The components of the angular acceleration vector expressed in basis B ∗ , RT ω, ˙ T T T ˙ ∗ ∗ ∗ · ∗ ∗ ∗ are easily evaluated. Indeed, R ω˙ = R (R ω ) = R R ω + ω˙ = ω ω + ω˙ = ω˙ ∗ . A simpliſed expression for the components of the inertial acceleration vector expressed in the body attached basis is then ∗
RT aP = RT aB + (ω ˙ + ω ∗ω ∗ )p∗ + 2s˙ ω ∗ t¯∗ + s¨t¯∗ +
s˙ 2 ∗ n ¯ . ρ
148
4 The geometric description of rotation
It is possible to deſne a frame associated with the particle, F F = ¯ , ¯b) , where F + is the curve’s Frenet’s triad at point P. Superscript P, F + = (t¯, n + (·) indicates tensor components resolved in this basis. The components of the rotation tensor that brings basis B ∗ to basis F + , resolved in basis B ∗ , are denoted F ∗ . This tensor, like all other characteristics of the curve, is most naturally expressed in basis B ∗ , a basis attached to the body in which the curve is embedded. The components of the unit vector tangent to curve C, resolved in basis B ∗ , are t¯∗ = F ∗¯b∗1 = F ∗¯ı1 , and hence, its components resolved in basis I become t¯ = R F ∗¯ı1 . The angular velocity of basis F + with respect to basis I is now evaluated by taking a time derivative of this tangent vector to ſnd
˙ F ∗ + R F˙ ∗ )(R F ∗ )T t¯ t¯˙ = (R = ω + s(R ˙ F ∗ )F ∗T F ∗ (R F ∗ )T t¯ = ( F ∗ κ+ )t¯, ω + s˙ R where κ+ = F ∗T F ∗ are the components of the curvature vector of curve C resolved in basis F + , see eq. (4.88); it follows that κ = R F ∗ κ+ are its components resolved in basis I. The angular velocity of basis F + with respect to basis I, denoted Ω, is now Ω = ω + sR ˙ F ∗ κ+ = ω + sκ. ˙ The ſrst term represents the contribution of the angular velocity of the rigid body; the second term stems from the change in orientation of Frenet’s triad as the particle moves along the curve. Note that the above result could have been established more expeditiously with the help of the addition theorem. Finally, the angular acceleration of basis F + is the time derivative of the angular velocity ˙ = ω˙ + s¨κ + s( ˙ F ∗ κ+ + sR Ω ˙ R ˙ F ∗ κ+ + sR ˙ F ∗ κ+ ) = ω˙ + s¨κ + s( ˙ ω κ + sR ˙ F ∗κ + κ+ + sR ˙ F ∗ κ+ ). The second term inside the parentheses vanishes because κ + κ+ = 0. The ſnal expression for the angular acceleration is Ω˙ = ω˙ + s¨κ + s˙ ω κ + s˙ 2 R F ∗ κ+ . The inertial acceleration of Frenet’s triad depends on the curvature vector κ = R F ∗ κ+ , but also on its derivative along the curve, κ+ . Both quantities, κ+ and κ+ , are intrinsic properties of curve C because they are components of the curvature vector and its spatial derivative resolved in Frenet’s triad, F + . 4.12.4 The differential rotation vector Let time-dependent rotation tensor R(t) describe the rotation from basis I = (¯ı1 , ¯ı2 , ¯ı3 ), called the ſxed basis, to basis B = (¯b1 , ¯b2 , ¯b3 ), called the rotating basis. The differential rotation vector is deſned by analogy to the angular velocity vector, see eq. (4.56), as
4.13 Applications to particle dynamics
dψ = axial(dR RT ),
149
(4.101)
where dψ is called the differential rotation vector. Note that there exist no “rotation vector,” ψ, such that d(ψ) gives the differential rotation vector. To emphasize this important fact, notation dψ is used to indicate the differential rotation vector, rather than dψ. Taking a differential of eq. (4.56) and a time derivative of eq. (4.101) then yields ˙ T ˙ RT + RdR ˙ ˙ RT + dR R ˙ T , respectively. Subtracting these d ω = dR and dψ = dR two equations, and using the orthogonality of the rotation tensor, eq. (4.26), then leads to ˙ ˙ ˙ dRT − dR R ˙ RT )(R dRT ) − (dR RT )(R R ˙ T = dψ ˙ T ). d ω = dψ +R + (R With the help of the deſnition of the angular velocity vector, eq. (4.56), and of the differential rotation vector, eq. (4.101), this reduces to ˙ ˙ ω − ω = dψ ω) d ω = dψ + dψ dψ + (dψ where identity (1.33a) was used. Finally, a differential in the angular velocity vector ˙ −ω becomes dω = dψ dψ. This important result relates differentials in the angular velocity vector to the differential rotation vector and its derivatives. Differentials of the components of the angular velocity vector expressed in the rotating frame can also be obtained in a similar manner ˙ ∗, dω = R dψ
˙ −ω dω = dψ dψ, ∗
∗
∗
∗
˙ +ω dψ , dω = dψ
∗
˙ dω = R dψ. T
(4.102a) (4.102b)
4.13 Applications to particle dynamics The geometric description of rotation presented in the previous sections is used extensively when analyzing the dynamic behavior of systems of particles when rotations are required to describe the kinematics of the system. Furthermore, Newton’s laws will be expressed in various bases to ease the analysis and help understand the physical interpretation of the various quantities involved in the problem. Example 4.9. Pendulum with rotating mass Figure 4.16 depicts a pendulum of length and tip mass M featuring an additional rotating mass m located at a ſxed distance d from the tip mass. Frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )] is inertial and the pendulum is attached to the ground at point O where a bearing allows rotation about axis ¯ı3 ; gravity acts along axis ¯ı1 . A second frame, F E = [O, E + = (¯ e1 , e¯2 , e¯3 )], is deſned; tensor components resolved in basis E + are denoted with a superscript (·)+ . A planar rotation of magnitude φ about axis ¯ı3 brings basis I to basis E + . Axis e¯1 is aligned with the massless rigid arm OA of the pendulum.
150
4 The geometric description of rotation
A third frame, F B = A, B ∗ = (¯b1 , ¯b2 , ¯b3 ) , is also deſned; tensor components resolved in basis B ∗ are denoted with a superscript (·)∗ . At point A, a bearing allows rotation of the massless rigid bar AT about axis e¯1 . A planar rotation of magnitude θ about axis e¯1 brings basis E + to basis B ∗ . Axis ¯b2 passes through the rotating mass m. Derive the equation of motions of the system using Newton’s second law. e2 i3 = e3 O
l
I f
i1
d T g m M + E A e1 = b1
e3
b2
i2
b3 q
e2
m
d
T
A
B M e1 = b1 *
Fig. 4.16. Conſguration of a pendulum with a rotating mass.
Let Rφ and R+ be the components of the rotation tensors that bring basis I to θ + + E and basis E to B ∗ , respectively, resolved in basis I and E + , respectively. The components of the tensor that brings basis I to B ∗ , resolved in basis I, are then R = Rφ R+ , and hence θ ⎤⎡ ⎡ ⎤ ⎡ ⎤ Cφ −Sφ 0 1 0 0 Cφ −Sφ Cθ Sφ Sθ R = ⎣ Sφ Cφ 0⎦ ⎣0 Cθ −Sθ ⎦ = ⎣ Sφ Cφ Cθ −Cφ Sθ ⎦ , 0 0 1 0 Sθ Cθ 0 Sθ Cθ where the short-hand notation, Sφ = sin φ, Cφ = cos φ was used, with similar conventions for angle θ. The angular velocity vector of basis B ∗ with respect to basis I is found with the ˙ θ ¯b2 + φC ˙ θ ¯b3 . help of the addition theorem to be ω = φ˙ e¯3 + θ˙ ¯b1 = θ˙ ¯b1 + φS The components of the angular and acceleration vectors, resolved in basis B ∗ , now become ⎧ ⎧ ⎫ ⎫ θ¨ ⎨ θ˙ ⎬ ⎨ ⎬ ˙ θ , ω˙ ∗ = φS ¨ θ + φ˙ θC ˙ θ . ω∗ = φS ⎩˙ ⎭ ⎩¨ ˙ θ⎭ φCθ φCθ − φ˙ θS The position vector of particle m with respect to inertial point O is xm = ¯b1 + ¯ d b2 . The inertial velocity vector then becomes x˙ m = ω ¯b1 + d ω ¯b2 and ſnally, the ˙ ˙ ¯ ¯ +ω ω ) b1 + d(ω +ω ω ) b2 . The components of this acceleration vector is x ¨ m = (ω vector, resolved in basis B, then become ⎧ ⎫ ¨ θ + 2dφ˙ θS ˙ θ⎬ −φ˙ 2 − dφC ⎨ ∗ ∗ ¨ − dφ˙ 2 C 2 − dθ˙2 RT x ¨ m = (ω ˙ + ω ∗ω ∗ )¯b∗1 + d(ω ˙ + ω ∗ω ∗ )¯b∗2 = , φC θ ⎩ ¨ θ ⎭ −φSθ + dθ¨ + dφ˙ 2 Sθ Cθ
4.13 Applications to particle dynamics
151
¯∗T ¯ ¯ where ¯b∗T 1 = 1, 0, 0 and b2 = 0, 1, 0 are the components of vectors b1 and b2 , ∗ respectively, resolved in basis B . The position vector of particle M with respect to inertial point O is xM = ¯b1 . The velocity and acceleration vectors of mass M can be found by letting d = 0 in the corresponding expressions for the velocity and acceleration vectors of mass m. The left portion of ſg. 4.17 shows a free body diagram of the two mass particles; F M and F m are the reaction forces exerted by the rigid bars onto particles M and m, respectively. For particle M , the applied forces are F M and the gravity force, M g¯ı1 ; Newton’s second law, resolved in basis B ∗ , then yields ⎧ ⎫ ⎧ ∗ ⎫ + M gCφ ⎬ ⎨ −φ˙ 2 ⎬ ⎨ FM1 ¨ M = F ∗ − M gSφ Cθ , (4.103) φC ⎩ ¨ θ ⎭ ⎩ M2 ⎭ ∗ FM3 + M gSφ Sθ −φSθ
∗ ∗ ∗ ∗ where F ∗T M = FM1 , FM2 , FM3 are the components of vector F M in basis B . T The components of the gravity force vector, resolved in basis B ∗ , are M gR ¯ı1 .
i3
O
i2 l
i1
A Mg
T mg
O
i2 l
Fm
d FM
i3
-Fm
d
-FM i1
A
T e1 = b1
Fig. 4.17. Left portion: free body diagram of the two masses. Right portion: free body diagram of rigid bars OA and AT.
For particle m, the applied forces are F m and the gravity force mg¯ı1 ; Newton’s second law, resolved in basis B ∗ , then yields ⎧ ⎫ ⎧ ⎫ ¨ θ + 2dφ˙ θS ˙ θ ⎬ ⎨ F ∗ + mgCφ ⎬ ⎨ −φ˙ 2 − dφC m1 ¨ − d(φ˙ 2 C 2 + θ˙2 ) m = F ∗ − mgSφ Cθ , (4.104) φC θ ⎩ ¨ θ ⎭ ⎩ m2 ⎭ ∗ 2 ¨ ˙ F + mgS S φ θ −φSθ + d(θ + φ Sθ Cθ ) m3
∗ ∗ ∗ ∗ where F ∗T m = Fm1 , Fm2 , Fm3 are the components of vector F m in basis B . Equations (4.103) and (4.104) are the equations of motion of the system. They involve two kinematic unknowns, φ and θ, and six force unknowns, the components of the reaction forces F M and F m , for a total of eight unknowns. Since the system comprises two particles, Newton’s equations yield a total of six equations. Consequently, two additional relationships are required to solve the problem. The right portion of ſg. 4.17 illustrates the two massless bars OA and TA. In view of the presence of a bearing at point A, the moment of the forces applied to bar TA must vanish about axis ¯b1 , i.e., −¯bT1 db2 F m = 0; this condition reduces to
152
4 The geometric description of rotation
∗ −d¯bT3 F m = 0 and ſnally, Fm3 = 0. The third line of eq. (4.104) the yields the following equation ¨ θ + d(θ¨ + φ˙ 2 Sθ Cθ ) = mgSφ Sθ . m −φS (4.105)
Similarly, the moment of the forces applied to bars OA and TA must vanish about axis ¯ı3 , because a bearing is present at point O. This condition implies −¯ıT3 [b1 F M + (b1 +db2 )F m ] = 0, or, expressed in basis B ∗ , −¯ıT3 R[b∗1 F ∗M +(b∗1 +db∗2 )F ∗m ] = 0. Expanding this relationship leads to ⎧ ⎫ 0 ⎬
⎨ ∗ 0, Sθ , Cθ FM3 = 0. ⎩ ∗ ∗ ∗ ⎭ dFm1 − (FM2 + Fm2 ) Eliminating the reaction force components with the help of eqs. (4.103) and (4.104) leads to the following equation 2 M + m(2 + d2 )Cθ2 φ¨ − mdCθ (φ˙ 2 Cθ2 + θ˙2 ) (4.106) ˙ θ Cθ = −M gSφ − mgCθ (Sφ Cθ + dCφ ). +mdφ˙ 2 Cθ − 2md2 φ˙ θS Equations (4.105) and (4.106) are two nonlinear, coupled, ordinary differential equations for the two kinematic variables, φ and θ. Once these equations have been solved, eqs. (4.103) and (4.104) will yield the reaction forces, thereby completing the solution of the problem. 4.13.1 Problems Problem 4.34. Relationships between angular velocity and curvature Consider a rotation ſeld that is a function of both space and time, i.e., R = R(s, t). It is now ˙ RT ) possible to deſne the components of the angular velocity vector as ω(s, t) = axial(R ∗ T ˙ and ω (s, t) = axial(R R) resolved in the inertial and rotating frames, respectively. Similarly, the components of the curvature vector are κ(s, t) = axial(R RT ) and κ∗ (s, t) = axial(RT R ) in the inertial and rotating frames, respectively. Based on the developments preκω, ω ∗ = κ˙ ∗ + ω ∗ κ∗ , sented in section (4.12.4), prove the following results ω = κ˙ + ∗ ∗ T ˙ ω = R κ˙ , and ω = R κ.
Problem 4.35. Rigid body with a slot Figure 4.18 depicts a rigid body with a slot in its reference conſguration as deſned by frame F0 = B, B0 = (¯b01 , ¯b02 , ¯b03 ) , where basis B0 determines the orientation of the body. Position vector xO determines the location of a reference point O on the rigid body with respect to inertial frame I. In the ſnal conſguration, the rigid body is deſned by frame F = B, B = (¯b1 , ¯b2 , ¯b3 ) , where its orientation is determined by basis B. The displacement of point O is denoted uO . Point P moves along a slot ſxed with respect to the rigid body in such a way that the distance from point B to point P is a given function of time d(t). The unit ¯ vectors aligned with the slot in the reference and ſnal conſgurations are denoted s¯ and S, respectively. Let R0 and R be the rotation tensors that bring basis I to B0 and basis B0 to B,
4.13 Applications to particle dynamics
153
respectively. (1) Find the inertial velocity and acceleration of point P in terms of the velocity and acceleration of point B, the angular velocity and acceleration of the rigid body, and function d(t). Express this vector in inertial basis I. All tensor components should be resolved in basis I. (2) Express these inertial velocity and acceleration vectors in the body attached basis B. All tensor components should be resolved in basis B.
S
Initial configuration b03
b3
b2
P
u 0, R
B
b1
i3
b02 B0
Final Configuration I i2 O
s b01
xO, R0
l f
i3
P
i2
O
B B
I
i1
e2 s M
k m E A
g
e1
i1
Fig. 4.18. Rigid body with a slot in the reference and ſnal conſgurations.
Fig. 4.19. Pendulum with a sliding mass connected to a spring.
Problem 4.36. Pendulum with a sliding mass Figure 4.19 depicts a pendulum of mass M and length connected at point A to a massless tube in which a point mass m is sliding while restrained by a spring of stiffness constant k. Frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )] is inertial and the pendulum is attached to the ground at point O where a bearing allows rotation about axis ¯ı3 ; gravity acts along axis ¯ı1 . A second e1 , e¯2 , e¯3 )] is deſned. A planar rotation of magnitude φ about axis ¯ı3 frame F E = [A, E = (¯ brings basis I to basis E; axis e¯2 is aligned with the massless tube. The position of mass m with respect to point A is denoted s. (1) Using Newton’s second law, derive the equations of motion of the system for φ(τ ) and s(τ ). (2) Plot the time history of φ(τ ). (3) Plot the time history of s¯(τ ). (4) On one graph, plot the kinetic, potential and total mechanical energies of the system. (5) Plot the normalized components of the reaction force vector acting on particle M . (6) Plot the normalized components of the reaction force vector acting on particle m. (7) Compute the angular momentum vector of the system evaluated with respect to point O. From this expression, derive a differential equation that must be satisſed by φ and s. Show that this relationship can be derived from the equations of motion obtained in step 1. Use the following data: μ = M/m = 1, g¯ = g/(ω 2 ) = 0.6, s¯ = s/. The following non-dimensional time is deſned: τ = ωt, where ω 2 = k/m; derivatives with respect to τ are denoted (·) . Forces are normalized by mω 2 . At time t = 0, φ = π/2, s¯ = 0, φ = 0 and s¯ = 2. For all plot, τ ∈ [0, 200].
Problem 4.37. Mass particle moving in a tube Figure 4.20 shows a particle of mass m moving in a rigid slot under the effect of an actuator. The actuator is connected to the particle at point P and to the slot at point A; for clarity, the actuator is not shown on the ſgure. A rigid bar OA of length r rotates in plane P = (¯ı1 , ¯ı2 ) at a constant angular velocity, Ω. A planar rotation of magnitude ψ = Ωt about axis ¯ı3 brings e1 , e¯2 , e¯3 ); axis e¯1 is along rigid bar OA. The rigid slot basis I = (¯ı1 , ¯ı2 , ¯ı3 ) to axis E = (¯
154
4 The geometric description of rotation
e2 i2
i3 O
A
e3 y
b3
b2
P r
e3
e1
m
Rigid slot
q
e2
i1
P m
s A
e1 = b1
Fig. 4.20. Mass particle moving in a tube. is connected to bar OA at point A and is allowed to rotate with respect the bar about axis e1 , e¯2 , e¯3 ) to axis e¯1 . A planar rotation of magnitude θ about axis e¯1 brings basis E = (¯ B = (¯b1 , ¯b2 , ¯b3 ); axis ¯b2 is along the rigid slot. The position of the mass in the slot is deſned by the curvilinear variable s that is positive along axis ¯b2 . (1) Find the position, velocity and acceleration vectors of point P with respect to inertial point O. (2) Find the components of the acceleration vector resolved in basis B. (3) Write Newton’s second law for the particle, resolved in basis B. (4) Identify the nature of the forces applied on the particle. (5) On one graph, plot the components of the force vector acting on the particle, resolved in basis B. (6) Find the moment of the forces that the particle and actuator apply on the slot with respect to point O. (7) On one graph, plot the components of this moment vector resolved in basis E + . Use the following data: θ(t) = θ0 + θs sin Ωt + θc cos Ωt, where θ0 = 15, θs = −6 and θc = 8 degrees; s(t) = ss sin Ωt + sc cos Ωt, where ss = 0.05 and sc = −0.03 m. m = 10 kg, Ω = 27.02 rad/s, r = 0.8 m.
Problem 4.38. Pendulum with a rotating mass Figure 4.16 depicts a pendulum of length and tip mass M featuring an additional rotating mass m located at a ſxed distance d from the tip mass, as treated in example 4.9 on page 149. Frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )] is inertial and the pendulum is attached to the ground at point O where a bearing allows rotation about axis ¯ı3 ; gravity acts along axis ¯ı1 . A second e1 , e¯2 , e¯3 ) , is deſned. A planar rotation of magnitude φ about frame, F E = O, E + = (¯ axis ¯ı3 brings basis I to basis E +. Axis e¯1 is aligned with the massless rigid arm OA of the pendulum. A third frame, F B = A, B∗ = (¯b1 , ¯b2 , ¯b3 ) , is also deſned. At point A, a bearing allows rotation of the massless rigid bar AT about axis e¯1 . A planar rotation of magnitude θ about axis e¯1 brings basis E + to basis B∗ . Axis ¯b2 passes through the rotating mass m. Derive the equation of motions of the system using Newton’s second law. (1) Using Newton’s second law, derive the equations of motion of the system for φ(τ ) and θ(τ ). (2) On one graph, plot the time history of φ(τ ) and θ(τ ). (3) On one graph, plot the time history of φ and θ . (4) On one graph, plot the kinetic, potential and total mechanical energies of the system. (5) Plot the normalized components of the reaction force vector acting on particle M , resolved in basis B. (6) Plot the normalized components of the reaction force vector acting on particle m, in basis B. (7) Plot the normalized components of the reaction force vector at point O, in basis I. (8) Plot the normalized components of the reaction moment vector at point O, in basis I. (9) Compute the angular momentum vector of the system evaluated with respect to point O. From this expression, derive a differential equation that must be satisſed by φ and θ. Show that this relationship can derived from the equations of motion obtained in step 1. Use the following data: μ = m/M = 1, d¯ = d/ = 0.2. The following non-dimensional time is deſned: τ = ωt, where ω 2 = g/; a derivative with respect to τ is denoted (·) . Forces are
4.14 Change of reference frame operations
155
normalized by M ω 2 , moments by M 2 ω 2 . At time t = 0, φ = π/2, θ = 0, φ = 0 and θ = 0.1. For all plot, τ ∈ [0, 50].
4.14 Change of reference frame operations the conſguration of a rigid body characterized by frame F0A = FigureA4.21 depicts A A A A, E0 = (¯ e01 , e¯02 , e¯03 ) . In the reference conſguration, the position vector of point A P with respect to inertial point O is uA 0 and rotation tensor R0 bring basis I to basis E0A . B In its ſnal conſguration, the rigid A e3 B/A B/A e3 body is characterized by reference x , R B frame F A = A, E A = (¯ eA ¯A ¯A 1 ,e 2 ,e 3) . E A A A e B 2 The displacement vector of point A E B e2 Final from the reference to the ſnal conſgA configuration B uration of the rigid body is uA and roe1 e1 A B B A A tation tensor R bring basis E0A to bau ,R u ,R A B e A 03 e03 sis E . The position vector of point A Reference in the reference conſguration with reconfiguration B A A B spect to point O is uA and roA E0 0 + u E 0 e02 A tation tensor RA RA brings basis I to i B 3 B 0 e02 e01 B B basis E A . u , R0 0 A I e01 u0A, RA0 Figure 4.21 also shows a second O rigid body in its reference and ſnal i2 i1 conſgurations. All quantities belonging this second rigid body are denoted Fig. 4.21. Change of reference frame with superscript (·)B . All vectors and tensor are expressed by their components in the inertial frame I; i.e., all quantities are “viewed by an inertial observer.” The position vector of point B with respect to point A, denoted r B/A , is " ! A " ! B − u0 + uA . r B/A = uB 0 +u The components of this vector resolved in basis E A , denoted r ∗B/A , are +T ! * " ! A " B uB − u0 + uA . r ∗B/A = RA RA 0 +u 0 These components are often called the “components of the position vector of point B as viewed by an observer on frame F A .” Similarly, the rotation tensor that bring basis E A to basis E B , denoted RB/A , can be evaluated as follows +* +T * B B A A B/A A e¯B R = R R R e¯A e¯α . α α = R 0 0 The components of this tensor resolved in E A , denoted R∗B/A , are
156
4 The geometric description of rotation
+T * + * B B R . R∗B/A = RA RA R 0 0 These are often called the “components of the rotation tensor of basis E B as viewed by an observer on frame F A .” The position vector of point B with respect to point A in the reference conſguB/A A A ration is r 0 = uB 0 − u0 . The components of this vector resolved in basis E0 are ∗B/A AT B A r0 = R0 (u0 − u0 ). If body B is rigidly connected to body A, the components of the position vec∗B/A tor of point B as viewed by an observer on frame F A are still x0 in the ſnal conſguration. The components of displacement vector of point B with respect to point A as viewed by an observer on frame F A in the ſnal conſguration are +T ! * " ! A " ! B " B u∗B/A = RA RA uB − u0 + uA − RAT u0 − uA (4.107) 0 +u 0 . 0 0 The rotation tensor that brings basis E0A to basis E0B is RB/A = RB RAT . The 0 0 0
= RAT RB . components of this tensor resolved in E0A are R∗B/A 0 0 0 If body B is rigidly connected to body A, the components of the rotation tensor that bring basis E A to basis E B , resolved in basis E A , are still R∗B/A . Let QB/A be 0
the rotation tensor that measures the change in orientation of basis E B with respect to basis E A , between the reference and ſnal conſgurations. The components of this change in orientation of basis E B as viewed by an observer on frame F A in the ſnal conſguration are ! AT B " A Q∗B/A = R∗B/A R∗B/AT = RAT (4.108) R R R0 . 0 0 Let the ſnal conſguration of the system be time-dependent. The inertial velocities of points A and B, denoted v A and v B , respectively, are easily found as v A = u˙ A and v B = u˙ B , respectively. Similarly, the angular velocity vectors of bases E A and ˙ A RAT ) and E B , denoted ω A and ω B , respectively, are easily found as ω A = axial(R ˙ B RBT ), respectively. All these vectors are expressed by their compoω B = axial(R nents in the inertial frame I; i.e., all quantities are “viewed by an inertial observer.” The components of the velocity vector of point B as viewed by an observer on frame F A , denoted v ∗B/A , are readily found by taking a time derivative of eq. (4.107) to ſnd +T
* ! " ! A " ! " B v ∗B/A = RA RA ω AT uB − u0 + uA + v B − v A . (4.109) 0 +u 0 Similarly, the components of the angular velocity vector of basis E B as viewed ˙ ∗B/A Q∗B/AT ), and it the follows by an observer on frame F A are ω∗B/A = axial(Q from eq. (4.108) that +T ! * " ω ∗B/A = RA RA ωB − ωA . 0
(4.110)
4.15 Orientation of a unit vector
157
4.15 Orientation of a unit vector Consider a unit vector, ¯ı3 , called a director, that rotates to a ſnal orientation e¯3 , as depicted in ſg. 4.22. For convenience, this director is considered to be the third unit vector of a basis I = (¯ı1 , ¯ı2 , ¯ı3 ), rotating to a basis E = (¯ e1 , e¯2 , e¯3 ). The relationship between these two bases is (4.111)
e¯α = R ¯ıα ,
where R is an orthogonal rotation tensor. If attention solely focuses on the director, this rotation tensor is not uniquely deſned, because any rotation about the director leaves its orientation unchanged. A differential change in the director’s orientation is e3 d¯ e3 = eT3 dψ, where dψ is the differential rotation veci3 e1 tor deſned by eq. (4.101). The components of the differE ential change in director orientation resolved in basis E e2 I become ⎧ ⎫ i2 i1 ⎨ dψ2∗ ⎬ RT d¯ e3 = RT eT3 dψ = ıT3 RT dψ = −dψ1∗ , Fig. 4.22. Change of orien⎩ ⎭ 0 tation of the director ¯ı3 . where dψ ∗ are the components of the differential rotation vector resolved in basis E. This relationship demonstrates that differential changes in the orientation vector only depend on two components, dψ1∗ and dψ2∗ , of the differential rotation vector. Arbitrary values of dψ3∗ , corresponding to differential rotations of the director about its own orientation, will not affect differential changes in the director orientation, and hence, setting dψ3∗ = 0 is a valid choice. The following notation is adopted dψ ∗ = ¯ı1 dα∗1 + ¯ı2 dα∗2 = b dα∗ , where ⎡ 1 b = [¯ı1 , ¯ı2 ] = ⎣0 0
⎤ 0 1⎦ . 0
(4.112)
The “two parameter” differential rotation vector is denoted dα∗ . Array b simply expands this two parameter differential rotation vector, dα∗ , to the differential rotation vector, dψ ∗ , by imposing the condition dψ3∗ = 0. It follows that dψ = R dψ ∗ = R b dα∗ , and ſnally, differential changes in the orientation of the triad become d¯ eα = R ıTα b dα∗ .
(4.113)
Part II
Rigid body dynamics
5 Kinematics of rigid bodies
Newton’s laws deal with the dynamic behavior of a single particle and Euler’s laws generalize the analysis to the case of a system of particles. Rigid bodies form a special case of “systems of particles,” and their dynamic behavior is studied in depth in chapter 6. This chapter focuses on the kinematics of rigid bodies, i.e., the description of the motion of rigid bodies without consideration of the forces that create this motion. Sections 5.1 and 5.2 study the displacement and velocity ſelds, respectively, of rigid bodies undergoing arbitrary, time-dependent motion. The concept of relative velocity and acceleration is treated in section 5.3, while section 5.4 addresses the problem of contact between two rigid bodies. The chapter concludes with the analysis of the motion tensor.
5.1 General motion of a rigid body Figure 5.1 depicts a rigid body deſned in its reference conſguration by frame F0 = [A, E0 = (¯ e01 , e¯02 , e¯03 )]. The position vector of point A with respect to point O is denoted r 0 . Let r P be the position vector of a material point P of the rigid body with respect to inertial frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. The position vector of the same material point with respect to point A is denoted sP . Hence, r P = r 0 + sP . The rigid body now undergoes an arbitrary motion that brings it to a ſnal conſguration deſned by frame F = [A, E = (¯ e1 , e¯2 , e¯3 )]. Let R0 and R be the rotation tensors that bring basis I to E0 and basis E0 to E, respectively. Considering ſg. 5.1, the following vector relationship is easily established, uP = u + S P − sP ,
(5.1)
where S P is the position vector of material point P with respect to point A in the T ſnal conſguration. Let s∗P = RT0 sP and S + P = (R R0 ) S P denote the components of vector sP in basis E0 and of vector S P in basis E, respectively.
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_5 © Springer Science+Business Media B.V. 2011
162
5 Kinematics of rigid bodies
Because the body is assumed to be rigid, the components of vector sP in E0 are identical to those of S P in E, i.e., s∗P = S + P , and hence, S P = R sP . Equation (5.1) now becomes ! " uP = u + R − I sP . (5.2) This relationship describes the displacement of a material point P of the rigid body in terms of u, the displacement of its reference point, and tensor R that deſnes its orientation. Note that the choice of reference point A is arbitrary, and hence, eq. (5.2) is not an intrinsic relationship. Reference configuration
Q
e3
e2
n SQ
u, R
E
e03 uP E0 A
d
P
rP
sP P
I O
Q e01
e1
i3 e02
sQ
SP
A
r0, R0
Final Configuration i2
i1
Fig. 5.1. General motion of a rigid body.
To obtain a more general expression of the displacement ſeld, the following question can be asked: is it possible to ſnd a material point of the rigid body, say point Q, whose displacement is parallel to n ¯ , the axis deſning rotation tensor R? If point Q exist, its relative position vector, sQ , must satisfy the following relationship ! " uQ = u + R − I sQ = d¯ n. (5.3) Constant d can be evaluated by taking the scalar product this equation by n ¯ T to ſnd T d=n ¯ u. It then follows that " " ! T ! n−u= n ¯n ¯ − I u. (5.4) R − I sQ = d¯ In view of eq. (4.21) and identity (1.33b), this equation can be written as n 2 sin φ/2 G sQ − n ¯ , which u = 0. The bracketed must be parallel to unit vector n implies 2 sin φ/2 G sQ − n u = β n ¯ , where β is an arbitrary constant. The location of point Q is now readily found as sQ =
n GT β u+ n ¯. 2 sin φ/2 2 sin φ/2
This represents the equation of a line passing through point Q and parallel to n ¯ . The displacements of all points on this line are along n ¯.
5.2 Velocity ſeld of a rigid body
163
Point Q can be deſned uniquely by requiring sQ to be orthogonal to n ¯ , i.e., n ¯ T sQ = 0, and hence, β = 0. The location of point Q [19] now becomes sQ =
n GT u. 2 sin φ/2
(5.5)
By construction, the displacement of point Q is parallel to n ¯ , see eq. (5.3). Combining eqs. (5.2) and (5.3) now yields uP = d¯ n + (R − I)(sP − sQ ).
(5.6)
This relationship expresses the displacement of a material point P of the rigid body as a translation, d¯ n, parallel to axis n ¯ , followed by a rotation about that same axis. The displacement d=n ¯ T u, (5.7) is the intrinsic displacement of the rigid body: all points of the rigid body undergo the same displacement, d, followed by a rotation. If the rigid body undergoes a general planar motion, u lies in the plane of the motion, and n ¯ is perpendicular this plane. Hence, d = n ¯ T u = 0, the intrinsic displacement, d, of a rigid body in general planar motion always vanishes. If the rigid body undergoes a pure translation, axis n ¯ is along the displacement u of all the points of the body. The motion is then decomposed into a translation, d¯ n, followed by a rotation of vanishing magnitude about the same axis. Equation (5.6) expresses the general motion of a rigid body as screw motion about axis n ¯ . The pitch of the screw, , is deſned as =
2πd . φ
(5.8)
Mozzi-Chasles’ theorem [20, 21] states the results obtained here in a compact manner. Theorem 5.1 (Mozzi-Chasles’ theorem). The most general motion of a rigid body consists of a translation along an axis followed by a rotation about the same axis. The Mozzi-Chasles axis is deſned by its orientation, n ¯ , and the position of one of its points, sQ , given by eq. (5.5). Alternatively, this axis can be deſned by its Pl¨ucker coordinates [19, 22] ⎫ ⎧ n GT ⎬ ⎨ n QMC = − 2 sin φ/2 u (5.9) ⎭ ⎩ n ¯
5.2 Velocity ſeld of a rigid body The time-dependent motion of a rigid body, as depicted in ſg. 5.2, will now be investigated. The structure of the velocity ſeld of the entire rigid body is the focus of the analysis.
164
5 Kinematics of rigid bodies
The inertial velocity of an arbitrary point P is obtained from a time derivative of ˙ s = v+R ˙ RT S , where v = u˙ and v = u˙ are the inertial eq. (5.2), v P = v + R P P P P velocity vectors of point P and A, respectively. This equation becomes vP = v + ω SP ,
(5.10)
˙ RT ) is the angular velocity vector of the rigid body. This relawhere ω = axial(R tionship describes the velocity of an arbitrary point P of the rigid body in terms of v, the velocity of a reference point, and ω, the angular velocity vector of the rigid body. Here again, the choice of reference point A is arbitrary, and hence, eq. (5.10) is not an intrinsic relationship. Reference configuration
u, R
e3
e2
w
E
e03 uP E0 A e01
sP P
SQ
Q
xP
P
e1
i3 Final Configuration
e02 I O
sQ x0, R 0
SP
A
i2
i1
Fig. 5.2. Time-dependent motion of a rigid body.
To obtain a more general description of the velocity ſeld, the following question can be asked: is it possible to ſnd a material point of the rigid body, say point Q, whose velocity vector is parallel to the angular velocity vector? If such a point exists, the following relationship must hold vQ = v + ω S Q = μω,
(5.11)
where μ is an arbitrary scalar that can be found by taking the scalar product of this equation by ω T to ſnd μ = (ω T v)/ω 2 . Equation (5.11) now becomes ω S Q = (ω ω T /ω 2 − I)v = ω ω v/ω 2 , where v/ω 2 = 0. identity (1.33b) was used. This equation can be recast as ω SQ − ω The bracketed term is parallel to the angular velocity vector, which implies S Q − ω v/ω 2 = αω, where α is an arbitrary constant. The location of point Q is now found as ω S Q = αω + 2 v. ω The solution is the locus of points along a straight line parallel to ω, and hence, no unique solution exists for the location of point Q. To remove this ambiguity, point Q will be selected as that at the shortest distance from point A, i.e., ωT S Q = 0. It follows that α = 0, and
5.2 Velocity ſeld of a rigid body
SQ =
ω v. ω2
165
(5.12)
In summary, material point Q of the rigid body exists whose velocity vector is parallel to the angular velocity vector. The location of this point is given by eq. (5.12). Combining eqs. (5.10) and (5.11) now yields vP =
ωT v ω+ω (S P − S Q ) = v Q + ω (S P − S Q ) ω2
(5.13)
This relationship expresses the velocity of material point P of the rigid body as the velocity of point Q, v Q , which is parallel to angular velocity vector ω, followed by a rotation about that same axis. This is referred to as screw motion about axis ω. The screw axis is deſned as the line passing through point Q and parallel to ω. The Pl¨ucker coordinates, Q, of the screw axis are , ω ω − v (5.14) QSA = ω2 ω
5.2.1 Problems Problem 5.1. General motion of a rigid body Figure 5.1 depicts the general motion of a rigid body. Find material point Q of the rigid body whose displacement vector is of minimum norm. Is this point unique? Hint: The condition for minimization of the displacement norm is ||uQ ||2 = minsq [u + (R − I)sq ]T [u + (R − I)sq ]. The minimum displacement norm is found when (R−I)T [u+(R−I)sq ] = (R−I)T uQ = 0. n. The solution of this system then uQ = u + (R − I)sq = d¯
Problem 5.2. Time-dependent motion of a rigid body Figure 5.2 shows the time-dependent motion of a rigid body. Find material point Q of the rigid body whose velocity vector is of minimum norm. Is this point unique? Hint: The condition for minimization of the velocity norm is ||v Q ||2 = minS Q [v + ω S Q ]T [v + ω S Q ]. The minimum T T SQ] = ω v Q = 0. velocity norm is found when ω [v + ω
Problem 5.3. Location of the average velocity point Consider three material points, P, Q, and R, of a rigid body with position vectors xP , xQ , and xR , respectively, and velocity vectors v P , v Q , and v R , respectively. Find the location of point C of the rigid body whose velocity is v C = (v P + v Q + v R )/3.
Problem 5.4. Relating the velocity vectors of three points of a rigid body Consider two material points P and Q of a rigid body and their velocity vectors, v P and v Q , respectively. (1) Find the velocity vector of point R of the rigid body, assuming that points P, Q, and R are not collinear. (2) Is the velocity of point R fully determined?
Problem 5.5. Relating the velocity vectors of three points of a rigid body Consider two material points P and Q of a rigid body and their velocity vectors, v P and v Q , respectively. (1) Find the velocity vector of point R of the rigid body, assuming that points P, Q, and R are collinear. (2) Is the velocity of point R fully determined?
166
5 Kinematics of rigid bodies
Problem 5.6. Computing the angular velocity of a rigid body The velocity vectors of material points P, Q, and R of a rigid body are given as v P , v Q , and v R , respectively. (1) Find the angular velocity vector of the rigid body. (2) State the three scalar constraints that the given velocity vectors must satisfy.
Problem 5.7. Relating the velocity vectors of four points of a rigid body Consider three material points P, Q, and R of a rigid body and their velocities, v P , v Q , and v R , respectively. (1) Find the velocity of point S of the rigid body, assuming that points P, Q, and R are not collinear. (2) State the three scalar constraints that the given velocity vectors must satisfy.
Problem 5.8. Determination of Mozzi-Chasles axis Figure 5.3 depicts a cube of unit size. Point A is selected as the reference point of the body; its displacement vector is denoted u. The rotation of the rigid body is deſned as a rotation of magnitude φ about unit vector n ¯ . (1) Determine the coordinates of a point on the MozziChasles axis characterizing the motion of the rigid body. (2) Find the Pl¨ucker coordinates of the Mozzi-Chasles axis. (3) Compute the intrinsic displacement of the rigid body. (4) Using T eq. A, B, C, and D. Use the following data: u = (5.6), compute the displacements of points 3.2, 4.5, 0.76 m, φ = 1.25 rad, nT = 0.20, −0.26, 0.95 (normalize this vector to make it a unit vector).
i3
b1 D rB, R
i3 i2 A i1
C
i1
B
Fig. 5.3. Arbitrary motion of a rigid body.
O
b3 B
u
B rP
I
*
P b2
i2
Fig. 5.4. Motion of a point deſned with respect to frame F B .
5.3 Relative velocity and acceleration Figure 5.4 depicts a practical situation that occurs in many engineering problem. The motion of a point, P, is deſned with respect to a rigid body associated with frame F B = B, B ∗ = (¯b1 , ¯b2 , ¯b3 ) . Tensor components resolved in basis B ∗ are denoted with superscript (·)∗ . The motion of the rigid body is deſned with respect to an inertial frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. The components of the position vector of point P with respect to point B, resolved in basis B ∗ , are denoted u∗ . The motion of frame F B is deſned by the components of the position vector of point B with respect to point O, denoted r B , and the components of the rotation tensor that bring basis I to basis B ∗ , denoted R, both resolved in basis I.
5.3 Relative velocity and acceleration
167
Consider, for instance, a piece of rotating machinery such as a jet engine or a helicopter rotor. It makes sense to attach a frame at the hub of the rotor: point B is at the hub center point and basis B ∗ rotates with the rotor. The position vector of a point on the blade, say point P, is then most naturally expressed in terms of its components in the hub attached basis B ∗ ; in fact, if the rotor blade is rigid, the components of the position vector of a point on the blade resolved in basis B ∗ are constant. The motion of the hub attached frame, deſned by position vector r B and rotation tensor R, characterize the motion of the hub with respect to an inertial frame. Two important cases will be considered. In the most general case, point P is in relative motion with respect to the rigid body, i.e., the components of the position vector of point P with respect to point B, resolved in the body attached basis, are a function of time, u∗ = u∗ (t). In the second case, point P is a ſxed, or material point of the rigid body, which implies that the components of the position vector of point P with respect to point B, resolved in the body attached basis, are constant in time, u∗ = u∗ (t).
5.3.1 Point P is in motion with respect to the rigid body The inertial velocity and acceleration vectors of point P will now be evaluated assuming that this point is in motion with respect to the rigid body. The inertial position of point P, denoted r P , is expanded as r P = r B + u = rB + R u∗ ,
(5.15)
where u = R u∗ are the components of the position vector of point P with respect to point B resolved in basis I. The inertial velocity vector of point P, denoted v P , now becomes ˙ u∗ + R u˙ ∗ = v + R ˙ RT (r − r ) + R u˙ ∗ vP = vB + R B P B = vB + ω (r P − r B ) + R u˙ ∗ ,
(5.16)
where v B = r˙ B is the inertial velocity of point B and, eq. (5.15), written as u∗ = RT (r P − r B ), is used to eliminate u∗ . The ſrst term of eq. (5.16) represents the inertial velocity of the origin of the body attached frame, F B , and the second term accounts for the effects of its angular velocity. The last term is the relative velocity of point P with respect to point B, resolved in inertial basis I. Of course, the inertial velocity vector of point P could also be resolved in the body attached basis B ∗ ; multiplication by RT yields ∗ u∗ + u˙ ∗ . RT v P = RT v B + ω
(5.17)
Next, the inertial acceleration of point P, denoted aP , is obtained by taking a time derivative of the inertial velocity, eq. (5.16), to ſnd ˙ u˙ ∗ + R u aP = v˙ B + ω ˙ (r P − r B ) + ω (v P − v B ) + R ¨∗.
168
5 Kinematics of rigid bodies
The velocities appearing in the third term are eliminated using eq. (5.16), to ſnd ˙ RT R u˙ ∗ + R u ˙ (r P − r B ) + ω ω (r P − r B ) + R u˙ ∗ + R ¨∗ aP = a B + ω (5.18) = a + (ω ˙ + ω ω )(r − r ) + 2 ωR u˙ ∗ + R u ¨∗ , B
P
B
where aB = v˙ B is the inertial acceleration of point B. The ſrst term of this expression represents the inertial acceleration of the origin of the body attached frame F B , and the second term accounts for the effects of its angular acceleration and velocity. The third term is known as the Coriolis acceleration. Finally, the last term is the relative acceleration of point P with respect to point B, resolved in the inertial basis I. Here again, the inertial acceleration vector of point P could also be resolved in the body attached basis B ∗ ; multiplication by RT yields T ω) ˙ +ω ∗ω ∗ ]u∗ + 2 ω ∗ u˙ ∗ + u ¨∗. RT aP = RT aB + [(R
(5.19)
Term RT ω˙ represents the angular acceleration of the rigid body, resolved in body attached frame B ∗ ; this quantity is readily evaluated as RT ω˙ = RT (R ω ∗ )· = ˙ ω∗ + ω˙ ∗ = ω RT R ∗ ω ∗ + ω˙ ∗ , and ſnally RT ω˙ = ω˙ ∗ .
(5.20)
With this result at hand, the components of the inertial acceleration vector of point P, resolved in basis B ∗ , become ∗
RT aP = RT aB + (ω ˙ + ω ∗ω ∗ )u∗ + 2 ω ∗ u˙ ∗ + u ¨∗.
(5.21)
5.3.2 Point P is a material point of the rigid body If point P is a material point of the rigid body, the components of its position vector with respect to point B, resolved in the body attached basis, are constant in time, u∗ = u∗ (t). The velocity vector of point P, eq. (5.16), now reduces to v P = vB + ω (r P − r B ).
(5.22)
Points B and P are two arbitrary material points of the rigid body. This means that eq. (5.22) relates the velocity vectors of two arbitrary points of the same rigid body. Of course, this relationship is identical to that found earlier, see (5.13), using a different reasoning. When expressed in the body attached basis, the same relationship becomes RT v P = RT v B + ω ∗ u∗ . (5.23) The acceleration vector of point P, eq. (5.18), reduces to ˙ + ω ω )(r P − r B ). aP = aB + (ω
(5.24)
5.3 Relative velocity and acceleration
169
This equation relates the acceleration vectors of two arbitrary points of the same rigid body. When expressed in the body attached basis, the same relationship becomes ∗ RT aP = RT aB + (ω ˙ + ω ∗ω ∗ )u∗ . (5.25) Example 5.1. Velocities and acceleration of a robotic arm Figure 5.5 depicts a robotic system. The shaft is allowed to rotate about axis ¯ı3 with respect to inertial frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. The time-dependent rotation angle of unit vector s¯1 with respect to axis ¯ı1 is denoted α(t). Frame F S = [S, S + = (¯ s1 , s¯2 , s¯3 )] is attached to the shaft at a distance h from the origin of the inertial frame, as indicated on the ſgure; tensor quantities resolved in basis S + are denoted with superscript (·)+ . An arm of length La extends along the direction of axis s¯2 and is attached to the shaft at point S. Finally, a rigid manipulator of length Lb is connected to the arm at point B. The manipulator is allowed to rotate with respect to frame F S , about axis s¯1 . The time¯b2 with respect to axis s¯2 is denoted β(t). dependent rotation angle of unit vector Frame F B = B, B ∗ = (¯b1 , ¯b2 , ¯b3 ) is attached to the manipulator; tensor quantities resolved in basis B ∗ will be denoted with superscript (·)∗ . Determine the velocity and acceleration vectors of point P, located at the tip of the manipulator, at a distance Lb from point B.
b3
s3 = i3
rb
s2
S S
Lb B * B
+
P b2 b(t)
s1 La
b1 = s1
h i3 O I
i1 a(t)
i2
s1 Fig. 5.5. Robotic arm conſguration.
Let Rα and Rβ be the components of the rotation tensors that bring basis I to basis S + and basis S + to basis B ∗ , respectively, both resolved in basis I. This implies s¯1 = Rα¯ı1 and ¯b1 = Rβ s¯1 ; it follows that ¯b1 = Rβ Rα ¯ı1 = R ¯ı1 , where R = Rβ Rα are the components of the rotation tensor that brings basis I to basis B ∗ , resolved in basis I. It is more natural to work with the components of rotation tensor Rβ resolved in basis S + , R+ = RTα Rβ Rα , see eq. (4.29). The components β
170
5 Kinematics of rigid bodies
of rotation tensor R now become ⎤⎡ ⎤ ⎡ ⎤ 1 0 0 Cα −Sα 0 Cα −Sα Cβ Sα Sβ R = Rα R+ = ⎣ Sα Cα 0⎦ ⎣0 Cβ −Sβ ⎦ = ⎣ Sα Cα Cβ −Cα Sβ ⎦ , β 0 0 1 0 Sβ Cβ 0 Sβ Cβ ⎡
are planar rotations about axes ¯ı3 and s¯1 , respectively, see because Rα and R+ β eqs. (4.6) and (4.4), respectively. Notations Sα = sin α and Cα = cos α were used to simplify the writing; similar expressions are used for angle β. The angular velocity of basis B ∗ with respect to basis I, denoted ω, is readily found by using the addition theorem, eq. (4.62), as ω = α¯ ˙ s3 + β˙ ¯b1 = α(S ˙ β ¯b2 + ˙ ¯ ˙ ¯ ¯ ¯ ¯ ˙ β b2 + αC ˙ β b3 . Expressing this tensor relationship in basis Cβ b3 ) + β b1 = β b1 + αS B ∗ yields ω ∗ = β˙ ¯b∗1 + αS ˙ β ¯b∗2 + αC ˙ β ¯b∗3 , and hence, the components of the angular
˙ αS velocity vector expressed in basis B ∗ are ω ∗T = β, ˙ β , αC ˙ β . The components of
˙ α , βS ˙ α , α˙ . this vector in basis I are then evaluated as ω = R ω∗ , to ſnd ωT = βC The position vector of point P with respect to point O is r P = h¯ı3 + La s¯2 + Lb¯b2 = h¯ı3 + (Lb + La Cβ )¯b2 − La Sβ ¯b3 . The inertial velocity of point P, denoted ˙ β ¯b2 − La βC ˙ β ¯b3 + v P = r˙ P , is then obtained from time differentiation v P = −La βS ˙ ˙ ¯ ¯ (Lb + La Cβ )b2 − La Sβ b3 . Time derivatives of unit vectors ¯b2 and ¯b3 are readily evaluated as ¯b˙ 2 = ω ¯b2 ˙ ¯ ¯ and b3 = ω b3 , respectively, see eq. (4.51). Regrouping the terms then yields v P = −(La + Lb Cβ )α˙ ¯b1 + Lb β˙ ¯b3 . Expressing this tensor relationship in basis B ∗ yields RT v P = −(La + Lb Cβ )α˙ ¯b∗1 + Lb β˙ ¯b∗3 , and hence, the components of the
velocity vector expressed in basis B ∗ are (RT v P )T = −(La + Lb Cβ )α, ˙ 0, Lb β˙ . The components of this vector in basis I are then readily obtained as ⎫ ⎧ ⎧ ⎫ ⎨−(La + Lb Cβ )α˙ ⎬ ⎨−(La + Lb Cβ )Cα α˙ + Lb Sα Sβ β˙ ⎬ 0 RT v P = , v P = −(La + Lb Cβ )Sα α˙ − Lb Cα Sβ β˙ . ⎭ ⎩ ⎩ ⎭ Lb β˙ Lb Cβ β˙ Next, the inertial acceleration vector of point P is obtained from a time derivative of its inertial velocity vector aP = Lb Sβ α˙ β˙ ¯b1 − (La + Lb Cβ )¨ α¯b1 − (La + Lb Cβ )α˙ ¯b˙ 1 + Lb β¨¯b3 + Lb β˙ ¯b˙ 3 . Here again, the time derivatives of the unit vectors deſning basis B ∗ are evaluated with the help of eq. (4.51), to yield ˙ β ¯b1 − (La + Lb Cβ )α˙ 2 Cβ + Lb β˙ 2 ¯b2 aP = −(La + Lb Cβ )¨ α + 2Lb α˙ βS + (La + Lb Cβ )α˙ 2 Sβ + Lb β¨ ¯b3 . This expression reveals the components of the inertial acceleration vector, resolved in basis B ∗ , as
5.3 Relative velocity and acceleration
171
⎫ ⎧ ˙ β⎬ α + 2Lb α˙ βS ⎨−(La + Lb Cβ )¨ a∗P = −(La + Lb Cβ )α˙ 2 Cβ − Lb β˙ 2 , ⎭ ⎩ (La + Lb Cβ )α˙ 2 Sβ + Lb β¨ and the corresponding components in basis I are aP = R a∗P , ⎫ ⎧ ¨ β ) + 2Lb α˙ βC ˙ α Sβ ⎬ ¨ Cα ) − Lb Sα (β˙ 2 Cβ − βS ⎨ (La + Lb Cβ )(α˙ 2 Sα − α ¨ β ) + 2Lb α˙ βS ˙ α Sβ . aP = −(La + Lb Cβ )(α˙ 2 Cα + α ¨ Sα ) + Lb Cα (β˙ 2 Cβ − βS ⎭ ⎩ ¨ β) Lb (β˙ 2 Sβ + βC In this example, the components of various vectors were derived in both bases B ∗ and I. Of course, it is possible to work with the components of vectors in any basis, and hence, the choice of a speciſc basis is just a matter of convenience. For this problem, the body attached basis B ∗ is a good choice because the expressions for the components of the velocity and acceleration vectors appear to be simpler in that basis as compared to the corresponding expressions in basis I. Example 5.2. Velocities and acceleration of a spatial mechanism The spatial mechanism depicted in ſg. 5.6 consists of an arm of length La attached to the ground at point S and rotating about axis ¯ı1 of inertial frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]; the time-dependent rotation angle of unit vector s¯2 with respect to axis ¯ı2 is denoted θ(t). Frame F S = [S, S + = (¯ s1 , s¯2 , s¯3 )], is attached to the arm; tensor quantities resolved in basis S + will be denoted with superscript (·)+ . A rigid link connects point P, at the tip of the arm, to point Q that is free to slide along axis ¯ı1 . The link is of length Lb and the distance from point O to point Q is denoted x. Find the inertial velocity and acceleration of point Q and the angular velocity of the link. The inertial position vectors of points P and Q are readily found as r P = La cos θ ¯ı2 + (h + La sin θ)¯ı3 and rQ = x ¯ı1 , respectively. Vector sP Q , extending from point P to point Q, then becomes sP Q = x¯ı1 − La cos θ¯ı2 − (h + La sin θ)¯ı3 . The link is of length Lb , and hence, L2b = sP Q 2 . Expressing the norm of vector sP Q implies that L2b = x2 + L2a + h2 + 2hLa sin θ, which yields the position of point Q along axis ¯ı1 as x = [L2b − L2a − h2 − 2hLa sin θ]1/2 . A ſrst derivative x= of this expression yields xx˙ = −hLa θ˙ cos θ, and a second derivative leads to x¨ −hLa θ¨ cos θ + hLa θ˙2 sin θ − x˙ 2 . The inertial velocity and acceleration of point Q are then v Q = x˙ ¯ı1 and aQ = x ¨ ¯ı1 , respectively. Because points P and Q are two material points of the same rigid body, link PQ, eq. (5.22) implies v Q = v P + ω (r Q −r P ), where v P is the inertial velocity of point P, and ω the angular velocity of the link. This equation can be cast as sP Q ω = −s˙ P Q , where s˙ P Q = v Q − v P . In view of eq. (1.35), this vector product equation admits the following solution sP Q s˙ P Q ω = μsP Q + , (5.26) L2b
172
5 Kinematics of rigid bodies
where μ is an arbitrary constant. This solution exists if and only if sTP Q s˙ P Q , a condition that is always satisſed because vector sP Q is of constant length. The indeterminacy of the solution is due to the fact that the link is free to rotate about its own axis, because its end points rotate freely. s2
i3 s3
P
La
i2
q
P
i2
S
x
s
a1 q
+
h
S
C
Lb
p0
A A
s1
O I x(t)
O I
Q i 1
Fig. 5.6. Conſguration of the spatial mechanism.
a2 d
h i1
Fig. 5.7. Particle P sliding along curve C and through a slotted arm.
The ſrst term of eq. (5.26), μsP Q , reƀects this indeterminacy, which can be removed by assuming that the component of the angular velocity vector along the link vanishes, sTP Q ω = 0, i.e., the link is not allowed to rotate about its own axis. This condition leads to μ = 0, and hence, L2b ω = sP Q s˙ P Q . Expanding the vector product then leads to ˙ a + h sin θ)¯ı1 + xLa θ˙ cos θ − x(h ˙ + La sin θ) ¯ı2 L2b ω =La θ(L + La (xθ˙ sin θ + x˙ cos θ)¯ı3 . This expression gives the components of the angular velocity vector of the link in basis I. Example 5.3. Particle sliding on a curve Figure 5.7 shows particle P sliding along a planar curve ſxed with respect to an inertial frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. A slotted arm pivots about point A whose position vector is rA = d¯ı1 + h¯ı2 . Frame F A = [A, A = (¯ a1 , a ¯2 , a ¯3 )] is attached to the arm. The rotation angle between unit vector ¯ı1 and axis a ¯1 , denoted θ(t), is a given function of time. The particle slides along the curve and through the slot in the arm. Find the velocity and acceleration of the particle along the curve and the relative velocity and acceleration of the particle with respect to the arm. Let x denote the distance between the particle and point A. The inertial position vector of the particle then becomes p0 (s) = d¯ı1 + h¯ı2 + x¯ a1 . This equation involves two unknowns: the position of the particle along the curve, s, and the position of the particle along the arm, x. Projecting this equation along unit
5.3 Relative velocity and acceleration
173
vectors a ¯2 and a ¯1 yields two scalar equations a ¯T2 p0 (s) = −d sin θ + h cos θ, and T x=a ¯1 p0 − (d cos θ + h sin θ), respectively, that can be solved for s and x, respectively, as functions of angle θ. The ſrst equation is a nonlinear scalar equation; in general, several solutions might exist. The given initial conſguration of the system should remove any ambiguity in the solution process at the initial time; for subsequent times, the requirement of a continuous solution for s should remove any further ambiguities. Because particle P slides along the curve, its inertial velocity vector is v P = s˙ t¯, see eq. (2.33). On the other hand, the velocity vector of point P on the arm is v P = ˙a2 . Equating these two expressions yields x¯ ˙ a1 + xθ¯ ˙ a2 . s˙ t¯ = x¯ ˙ a1 + xθ¯
(5.27)
Let Rα be the components of Frenet’s triad, see eq. (4.87), at point P of curve C; hence, t¯ = Rα¯ı1 . Let Rθ be the components of the rotation tensor that bring basis I to basis A; hence, a ¯1 = Rθ¯ı1 . With the help of these deſnitions, eq. (5.27) becomes ⎧ ⎫ ⎧ ⎫ ⎡ ⎤⎧ ⎫ cos(α − θ) sin(α − θ) 0 ⎨ x˙ ⎬ ⎨s˙ ⎬ ⎨ x˙ ⎬ 0 = RTα Rθ xθ˙ = ⎣ − sin(α − θ) cos(α − θ) 0⎦ xθ˙ . (5.28) ⎩ ⎭ ⎩ ⎭ ⎩ ⎭ 0 0 0 1 0 0 The ſrst two scalar equations are readily solved to ſnd x˙ and s˙ as x˙ = xθ˙
cos(α − θ) , sin(α − θ)
s˙ = xθ˙
1 . sin(α − θ)
The relative velocity of the particle with respect to the arm is x, ˙ and s˙ is the speed of the particle along the curve. Both results depend on the angle (α − θ), which represents the relative rotation of Frenet’s triad with respect to basis A. When those two bases are parallel to each other, α = θ and the tangent to the curve is parallel to the arm. Clearly, the mechanism “locks” in such a case, as implied by the inſnite velocities x˙ and s˙ → ∞. The accelerations of the system are obtained by taking a time derivative of eq. (5.28) to ſnd ⎫ ⎧ ⎫ ⎤⎧ ⎤⎧ ⎫ ⎡ ⎡ ¨ ⎬ cos(α − θ) sin(α − θ) 0 ⎨ x 0 1 0 ⎨s˙ ⎬ ⎨s¨⎬ ˙ ⎣−1 0 0⎦ 0 + ⎣ − sin(α − θ) cos(α − θ) 0⎦ x˙ θ˙ + xθ¨ . 0 = (α˙ − θ) ⎩ ⎭ ⎩ ⎭ ⎩ ⎭ 0 0 1 0 000 0 0 ˙ = s Because RTα R ˙ κ∗ , see eq. (4.88), α˙ = s/ρ, ˙ where ρ is the radius of curvature α of curve C. Here again, the ſrst two scalar equations are readily solved to ſnd the desired accelerations, x¨ and s¨, leading to x ¨=
¨ cos(α − θ) − (α˙ − θ) ˙ s˙ (x˙ θ˙ + xθ) , sin(α − θ)
s¨ =
¨ − (α˙ − θ) ˙ s˙ (x˙ θ˙ + xθ) . sin(α − θ)
Several observation can be made concerning this example. This problem involves several bases: the inertial basis, I, the arm attached basis, A, and Frenet’s triad for
174
5 Kinematics of rigid bodies
curve C. The components of the velocity vector of the particle are most easily expressed in the arm attached basis, A. Because the particle slides along curve C, it is natural to use Frenet’s triad of the curve, since its properties are also expressed naturally in this triad. The analyst should always use the most appropriate basis to express the various kinematic characteristics of the system; typically, this implies selecting the basis that leads to the simplest, or most natural, analytical expressions. Once kinematics conditions have been expressed in different bases, it is often necessary to “reconcile” the various equations, i.e., express them in a common basis. This operation is most effectively achieved with the help of rotation tensors and the systematic use of their orthogonality property: the inverse of the rotation tensor equals its transpose, and the time derivative of the rotation tensor calls for the use of the angular velocity vector. 5.3.3 Problems Problem 5.9. Retraction of a landing gear Figure 5.8 depicts the extension of a simple landing gear. It consists of a link of length L = 1.2 m and of a wheel. The length (t) of the hydraulic actuator is a given function of time: (t) = h + g[1 − cos πt/T ], where g = [ (L2 /2 + hL + h2 ) − h]/2, h = 0.6 m and T = 1.5 s is the total time required for extending the landing gear. (1) Compute and plot the angular velocity of the link as a function of time. (2) Compute and plot the angular acceleration of the link as a function of time. (3) Compute the inertial velocity vector of point P at the tip of the link. Plot the components of this vector resolved in I. (4) Compute the inertial acceleration vector of point P. Plot the components of this vector resolved in I.
A L/2 i1
i2 O
I
L/2
R
L/2 h
T M
q Retracted position
Lc
q d i2
W
L/2 WExtended position
Fig. 5.8. Landing gear in retracted and extended conſgurations.
I
Lb P O
i1
Fig. 5.9. Quick return mechanism conſguration.
Problem 5.10. Quick return mechanism The quick return mechanism shown in ſg. 5.9 consists of a crank of length Lc = 0.30 m and of a bar of length Lb = 1.6 m. The crank is pinned at point R and the bar is pinned at point O. The distance between these two points is d = 0.35 m. At point P, a slider allows the tip
5.3 Relative velocity and acceleration
175
of the crank to slide along the bar. The time history of angle θ is θ(t) = ωt, where ω = 1.25 rad/s. (1) On the same graph, plot the time history of the angular velocities of the crank and bar for two revolutions of the crank. (2) On the same graph, plot the angular accelerations of the crank and bar. (3) Compute the inertial velocity vector of point T at the tip of the bar. Plot the components of this vector resolved in I. (4) Compute the inertial acceleration vector of point T. Plot the components of this vector resolved in I.
Problem 5.11. Crank-slider mechanism Figure 5.10 depicts a crank-slider mechanism. The crank of length 1 = 0.20 m rotates counterclockwise at a constant angular velocity ω1 = 200 rad/s and is connected to the ground at point O. At point A the crank connects to a linkage of length 2 = 0.6 m. Finally, at point B, this linkage connects of a piston that is constrained to move in the horizontal direction. The angular position of the crank is θ(t) = ω1 t. (1) Compute the angular velocity ω2 of the linkage and the velocity vp = x˙ of the piston. (2) Plot the horizontal position x of the piston as a function of time. (3) On one graph, plot the angles θ and φ as a function of time. (4) On one graph, plot the angular velocities ω1 and ω2 of the two bodies as a function of time. (5) Plot the velocity vp of the piston as a function of time. (6) Compute the angular acceleration ¨ of the piston. (7) On one graph, plot the angular α2 of the linkage and the acceleration ap = x acceleration α1 and α2 of the two bodies as a function of time. (8) Plot the acceleration ap of the piston as a function of time. For all plots, the time scale should cover a complete revolution of the crank, i.e., t ∈ [0, 2π/ω1 ].
i2
i2
A l1 O
l2 f
q x
i1 B
Fig. 5.10. Crank-slider mechanism rotating at a constant angular velocity.
O R
w L/2
P q
f i1
d
C
A k T
Fig. 5.11. Locking mechanism conſguration.
Problem 5.12. Locking mechanism Figure 5.11 shows a locking mechanism used in the deployment of large space structures. When the homogeneous disk of radius R rotates about its ſxed point O, bar PT of length L slides at point A through a collar that is allowed to swivel about the pin at point A. The bar has a length L, and w(t) denotes the part of the bar between point P and A. The time history of angle θ is prescribed as θ(t) = π(1 + cos πt/T )/4. (1) On the same graph, plot angles θ and ˙ (3) Plot θ¨ and φ. ¨ (4) Plot w. (5) Plot w. φ as a function of time. (2) Plot θ˙ and φ. ˙ (6) Plot w. ¨ Use the following data: R = 0.15 m; d = 0.2 m; L = 0.4 m; T = 2 s. Present the response of the system for a duration of 2 s.
Problem 5.13. Crank-slider mechanism Figure 5.12 depicts a crank-slider mechanism. The crank of length = 0.30 m rotates counterclockwise at a constant angular velocity ω1 = 200 rad/s and is connected to the ground at point O. At point B, the crank connects to a linkage that slides along point P, a ſxed point in space, located at a distance d = 0.6 m from point O. The angular position of the crank is
176
5 Kinematics of rigid bodies
θ(t) = ω1 t. Let w denote the distance from point B to point P. Point T is located at the tip of the linkage, at a distance b = 1.2 m from point B. (1) On one graph, plot the angles θ and φ as a function of time. (2) Plot the distance w as a function of time. (3) Compute the angular velocity ω2 of the linkage and the relative velocity, w, ˙ of point P with respect to the linkage. (4) On one graph, plot the angular velocities ω1 and ω2 of the two bodies as a function of time. (5) Plot the relative velocity, w, ˙ of point P with respect to the linkage as a function of time. ¨ of (6) Compute the angular acceleration, α2 , of the linkage and the relative acceleration, w, point P with respect to the linkage. (7) On one graph, plot the angular acceleration α1 and α2 of the two bodies as a function of time. (8) Plot the relative acceleration, w, ¨ of point P with respect to the linkage as a function of time. (9) On one graph, plot the horizontal and vertical components of the inertial velocity vector of point T. (10) On one graph, plot the horizontal and vertical components of the inertial acceleration vector of point T. For all plots, the time scale should cover a complete revolution of the crank, i.e., t ∈ [0, 2π/ω1 ].
i2
O
l
P
s
B
b
w d
P
C
b2
q f T
Fig. 5.12. Crank-slider mechanism rotating at a constant angular velocity.
O
I
L
p0
b1 a a 1
q
i1
A d
a2
Fig. 5.13. Rotating curve connected to link.
Problem 5.14. Rotating curve connected to link Planar curve C is embedded into a rigid body that rotates with respect to an inertial frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )], as depicted in ſg. 5.13. A frame F B = O, B = (¯b1 , ¯b2 , ¯b3 ) , is attached to the body and the rotation angle is a known quantity α(t). A link is attached to inertial point A whose position vector is rA = d¯ı1 . The other end of the link is connected to a particle that slides along curve C. A frame F A = [A, A = (¯ a1 , a ¯2 , a ¯3 )], is attached to the link; the rotation angle for frame F A is denoted θ. Let Fβ∗ (s) be the components of Frenet’s triad of curve C resolved in basis B and β the angle that brings basis B to Frenet’s triad. (1) Find a scalar equation to determine the location s of point P along curve C. Is the solution uniquely deſned? (2) Find a scalar equation to determine angle θ. (3) Determine the angular velocity of the link. (4) Determine the speed s˙ of the particle along curve C. (5) Determine the angular acceleration of the link. (6) Determine the acceleration s¨ of the particle along curve C. (7) Under what condition does the mechanism lock? Explain your answer in geometric terms. Express your answers in terms of the angles α, θ and β.
Problem 5.15. Spinning disk mounted on rotating arm The system depicted in ſg. 4.9 consists of a shaft of height h rigidly connected to an arm of length La and of a spinning disk of radius R mounted at the free end of the s1 , s¯2 , s¯3 ) is attached to the shaft at point S and frame arm. Frame F S = S, S + = (¯ F D = C, B∗ = (¯b1 , ¯b2 , ¯b3 ) is attached to the disk at point C. Superscripts (·)+ and (·)∗ will be used to denote tensor components resolved in bases S + and B∗ , respectively. Angle α(t) and β(t) are the magnitudes of the planar rotations about axis ¯ı3 and s¯1 , respectively,
5.3 Relative velocity and acceleration
177
that bring basis I to S + and basis S + to B∗ , respectively. (1) Find the angular velocity vector of basis B∗ with respect to basis I. (2) Find the components of this vector in basis I. (3) Find the components of this vector in basis B∗ . (4) Find the angular acceleration vector of basis B∗ with respect to basis I. (5) Find the components of this vector in basis I. (6) Find the components of this vector in basis B∗ .
Problem 5.16. Robotic system Figure 5.5 depicts a robotic system. The shaft is allowed to rotate with respect to an inertial frame F I , about axis ¯ı3 ; the time-dependent angle of rotation is denoted α(t). A frame F S = [S, S = (¯ s1 , s¯2 , s¯3 )] is attached to the shaft at a distance h from the origin of the inertial frame, as indicated on the ſgure. An arm of length La = 1.2 m, extending along the direction of axis s¯2 , is attached to the shaft at point S. Finally, a rigid manipulator of length Lb = 0.5 m is connected to the arm at point B. The manipulator is allowed to rotate with respect to B frame F S , about axis s¯1 ; the time-dependent angle of rotation is denoted β(t). Frame F = B, B = (¯b1 , ¯b2 , ¯b3 ) is attached to the manipulator. Angles α(t) and β(t) are prescribed as α(t) = π(1 − cos πt/T )/2 and β(t) = 2π (1 − cos πt/T ), where T = 2 s. (1) Compute the angular velocity vector ω of the manipulator with respect to the inertial system. On one graph, plot the components of this vector in basis B. (2) On one graph, plot the components of this vector in basis I. (3) Compute the position vector rP of point P with respect to point O. On one graph, plot the components of this vector in basis I. (4) Evaluate the inertial velocity vector of point P. On one graph, plot the components of this vector in basis B. (5) On one graph, plot the components of this vector in basis I. (6) Compute the inertial acceleration of point P. On one graph, plot the components of this vector in basis B. (7) On one graph, plot the components of this vector in basis I.
Problem 5.17. Swiveling plate Figure 5.14 depicts a homogeneous, rectangular plate of height a, width b and mass m connected to the ground by a rigid, massless link of length d. At point O, a bearing allows the link to rotate with respect to axis ¯ı3 , whereas at point B, the plate is free to rotate with respect to the link about axis a ¯1 . Three frames will be used in this problem: the inertial frame, F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )], a frame attached to the link, F A = [O, A = (¯ a1 , a ¯2 , a ¯3 )], and ſnally, a frame attached to the plate at its center of mass, F B = C, B = (¯b1 , ¯b2 , ¯b3 ) . A planar rotation of magnitude α about axis ¯ı3 brings basis I to basis A, whereas a planar rotation of magnitude β about axis a ¯1 brings basis A to basis B. (1) Find the components of the inertial position vector of point P in basis B. (2) Find the components of the inertial velocity vector of point P in basis B. (3) Find the components of the inertial acceleration vector of point P in basis B.
Problem 5.18. Robotic system with a sliding manipulator Figure 5.15 depicts a robotic system with a sliding manipulator. The shaft is allowed to rotate with respect to an inertial frame F I , about axis ¯ı3 ; the time-dependent angle of rotation is denoted α(t). A frame F S = [S, S = (¯ s1 , s¯2 , s¯3 )] is attached to the shaft at a distance h from the origin of the inertial frame, as indicated on the ſgure. An arm of length Lb , connected to the shaft at point S is allowed to pivot with respect to the shaft about axis s¯1 ; the time-dependent angle of rotation is denoted β(t). A frame F B = B, B = (¯b1 , ¯b2 , ¯b3 ) is attached to the arm. Finally, a rigid manipulator slides with respect to the arm along axis ¯b2 ; the displacement of the manipulator is denoted u(t). Angles α(t) and β(t) and displacement u(t) are known, prescribed functions of time. (1) Compute the angular velocity vector ω of the manipulator with respect to the inertial system. (2) Give the components of this vector in
178
5 Kinematics of rigid bodies
a
O
b3
a1
a/2
B
a/2
b2 i2
b
b2 a2
P
h
b(t) SB i3
b1 = s1
C b/2
P
b2
b3 B
i1
d a2
u(t)
s3 Lb
i3 = a3
O b/2 b1
Fig. 5.14. Conſguration of the swiveling plate.
i1
I a(t)
S
s2
i2
Fig. 5.15. Robotic arm with a sliding manipulator.
basis B. (3) Compute the position vector r P of point P with respect to point O. (4) Evaluate the inertial velocity vector of point P. (5) Compute the inertial acceleration of point P.
Problem 5.19. Robotic system with a sliding manipulator Figure 5.15 depicts a robotic system with a sliding manipulator. The shaft is allowed to rotate with respect to an inertial frame F I , about axis ¯ı3 ; the time-dependent angle of rotation is s1 , s¯2 , s¯3 )] is attached to the shaft at a distance h = 1 m denoted α(t). Frame F S = [S, S = (¯ from the origin of the inertial frame, as indicated on the ſgure. An arm of length Lb = 0.75 m, connected to the shaft at point S is allowed to pivot with respect to the shaft about axis s¯1 ; the time-dependent angle of rotation is denoted β(t). A frame F B = B, B = (¯b1 , ¯b2 , ¯b3 ) is attached to the arm. Finally, a rigid manipulator slides with respect to the arm along axis ¯b2 ; the displacement of the manipulator is denoted u(t). Angles α(t) and β(t) are prescribed as α(t) = π(1 − cos πt/T )/2 and β(t) = π(1 − cos πt/T )/6, where T = 2 s. The sliding motion is prescribed as u(t) = 0.5 (1 − exp(−5t/T )) m. (1) Compute the angular velocity vector ω of the manipulator with respect to the inertial system. On one graph, plot the components of this vector in basis B. (2) On one graph, plot the components of this vector in basis I. (3) Compute the position vector r P of point P with respect to point O. On one graph, plot the components of this vector in basis I. (4) Evaluate the inertial velocity vector of point P. On one graph, plot the components of this vector in basis B. (5) On one graph, plot the components of this vector in basis I. (6) Compute the inertial acceleration of point P. On one graph, plot the components of this vector in basis B. (7) On one graph, plot the components of this vector in basis I. For all plots, the time scale should cover t ∈ [0, 4T ] s.
Problem 5.20. Robotic system with a manipulator on screw joint Consider the robotic system with a manipulator mounted on a screw joint depicted in ſg. 5.16. The shaft is allowed to rotate with respect to an inertial frame F I , about axis ¯ı3 ; the times1 , s¯2 , s¯3 )] is attached to dependent angle of rotation is denoted α(t). Frame F S = [S, S = (¯ the shaft at a distance h = 0.5 m from the origin of the inertial frame, as indicated on the ſgure. An arm of length La = 0.6 m, extending along the direction of axis s¯2 , is attached to the shaft at point S. Finally, a rigid manipulator is connected to the arm by means of a screw joint. Frame F B = B, B = (¯b1 , ¯b2 , ¯b3 ) is attached to the manipulator. The manipulator slides and rotates with respect to the arm; the sliding distance is denoted u(t) and the rotation angle is β(t). The screw joint implies the following relationship between these two motions: u(t) = β(t)/(2π), where = 0.5 m is the pitch of the screw. Angles α(t) and β(t) are
5.4 Contact between rigid bodies
179
prescribed as α(t) = π/2 (1 − cos πt/T ) and β(t) = 5π t/T , where T = 2 s. (1) Compute the angular velocity vector ω of the manipulator with respect to the inertial system. On one graph, plot the components of this vector in basis B. (2) On one graph, plot the components of this vector in basis I. (3) Compute the position vector r P of point P with respect to point O; point P is located at a distance Lb = 0.25 m from the manipulator elbow. On one graph, plot the components of this vector in basis I. (4) Evaluate the inertial velocity vector of point P. On one graph, plot the components of this vector in basis B. (5) On one graph, plot the components of this vector in basis I. (6) Compute the inertial acceleration of point P. On one graph, plot the components of this vector in basis B. (7) On one graph, plot the components of this vector in basis I. For all plots, the time scale should cover t ∈ [0, 2T ] s.
s3
b3 b u
La
S S h s1 i3 i1 a I s1
s2
B P b1
B b2 Lb
O i2
Fig. 5.16. Robotic arm with manipulator.
i1 x
T q C
O I
R
B
A e1 e2
i2
Fig. 5.17. Wheel rolling between two plates.
Problem 5.21. Wheel rolling between two plates Figure 5.17 depicts a wheel of radius R rolling without sliding between two horizontal plates. The top plate moves horizontally and is at a distance x from axis ¯ı1 . Frame F = [C, E = (¯ e1 , e¯2 )] rotates with the wheel. A planar rotation of magnitude θ(t) about unit vector ¯ı3 brings basis I to basis E. (1) Find the velocity vector of material point A of the wheel. (2) Resolve this vector in basis I then in basis E. (3) Find the acceleration vector of material point A of the wheel. (2) Resolve this vector in basis I then in basis E .
5.4 Contact between rigid bodies Many commonly used mechanical systems involve contacting rigid bodies. Figure 5.18 shows two rigid bodies, denoted body k and body , with outer shapes deſned by two closed curves, denoted curves Ck and C , respectively. Point P is the instantaneous point of contact between the two rigid bodies. For the purpose of illustration, the two bodies are assumed to undergo planar motion and rotate about ſxed inertial points Ok and O , respectively. The mechanism shown in ſg. 5.18 is generally called a cam-follower pair. The angular motion of body k, called the cam, is typically prescribed to be a constant angular speed, say Ω. As the cam rotates, body , called the follower, is assumed to remain in contact with the cam at all times at a single point. The cam-follower pair transforms the constant angular motion of the cam into a rocking motion of the
180
5 Kinematics of rigid bodies
follower. By tailoring the shapes of curves Ck and C , desirable periodic schedules of the follower can be achieved. The contact point between the two rigid bodies is not a material point of either bodies. At the cam rotates, the location of the contact point coincides with a different material point of the cam at each instant: the contact point slides along curve Ck . Similarly, the contact point slides along curve C because the location of the contact point coincides with a different material point of the follower at each instant. At instant t, let K and L be i2 n the material points of the cam and follower, respectively, that Body l vPk vPl are instantaneously coincident t el l 2 C with the contact point, P(t). To k l P e2 e1 avoid confusion, points K and L k k rP rPl e1 are not shown in ſg. 5.18 bel k cause their location is identical O O i1 to that of point P. Let rK and r L k Body k C be the position vectors of material points K and L with respect Fig. 5.18. Body k and body in contact at point P. to point O, respectively. Furthermore, the position vector of the instantaneous point of contact with respect to point O is denoted r P (t). Given these deſnitions, r K = rL = rP (t). At instant t > t, K and L are the material points of the cam and follower, respectively, that are instantaneously coincident with the contact point, P(t ). If r K and rL are the position vectors of material points K and L with respect to point O, respectively, and if r P (t ) denotes the position vector of the instantaneous point of contact with respect to point O, it is still true that r K = r L = r P (t ). In general, however, r K = r K because points K and K are two different material points of the cam and r L = r L because points L and L are two different material points of the follower. Because the instantaneous point of contact slides over curves Ck and C , r P (t) = r P (t ) and furthermore, rK = r P (t ) and r L = rP (t ). Because point K is a material point of body k, its inertial velocity and acceleration vectors can be evaluated using eqs. (5.22) and (5.24), respectively. The relative position vectors of point P with respect to points Ok and O are denoted r kP and rP , respectively. Let point Ok be the reference point for body k; the velocity of material point K, denoted v kP , is given by eq. (5.22) as v kP = ω k rkP , where ωk is the angular k k velocity of body k. Vectors v P and r P are perpendicular to each other, as illustrated in ſg. 5.18. A similar expression holds for the velocity of point L, denoted v P . The components of the same velocity vectors in the body attached basis are given by eq. (5.23). Let F k = Ok , B k = (¯ ek1 , e¯k2 ) be a frame attached to the cam, as shown in ſg. 5.18, and notation (·)∗ indicates tensor components resolved in basis B k . The components of the inertial velocity vector of point K resolved in this basis ∗k are then v ∗k ∗k r ∗k P = ω P ; because array r P stores the components of the relative position vector of material point K of the cam resolved in a cam attached basis, this array is time-independent.
5.4 Contact between rigid bodies
181
It is assumed that the two bodies are in contact at a single point, and the unit tangent vectors to curves Ck and C at point P are coincident and denoted t¯. The unit vector perpendicular to this common tangent is the unit normal vector, denoted n ¯ . As discussed in section 2.2, the unit vector tangent to curve Ck is given by eq. (2.5) and its orientation depends on the curvilinear variable used to parameterize the curve. If curves Ck and C are both parameterized in the counterclockwise direction, and if the unit vectors tangent to the two curves are denoted t¯k and t¯ , respectively, and ſg. 5.18 shows that at the instantaneous contact point, t¯ = −t¯k = t¯ . For the conſguration illustrated in the ſgure, n ¯ = −¯ nk = −¯ n . If the two bodies of the cam-follower pair remain in contact at a single point, the normal projections of the velocity vectors of the material points that are instantaneously coincident with the contact point must be identical, n ¯ T v kP = n ¯ T v P .
(5.29)
If this condition were not satisſed, the two bodies would either separate or interpenetrate and contact at a single point would not be maintained. The relative velocity of the material points that are instantaneously coincident with the contact point, denoted v rP , is nT v P )¯ n + (t¯T v P )t¯ − (¯ nT v kP )¯ n − (t¯T v kP )t¯ v rP = v P − v kP = (¯ = (t¯T v P )t¯ − (t¯T v kP )t¯ = (t¯T v P ) − (t¯T v kP ) t¯.
(5.30)
where the third equality follows from the contact condition, eq. (5.29). As expected, the relative velocity of the material points that are instantaneously located at the contact point is oriented along to the common tangent vector at this point. Smooth operation of cam-follower systems generally require a single point contact between the two rigid bodies. For arbitrary shapes of the bounding curves, contact could occur at two or more points simultaneously, or even along a line if portions of the outer curves are straight, for instance. Such situations rarely occur in mechanical systems. To guarantee single point contact, the bounding curves must satisfy speciſc conditions at the contact point. For instance, a sufſcient condition for single point contact is for the cam and follower to be bounded by strictly convex curves. For the case illustrated in ſg. 5.18, the cam and follower are convex and concave, respectively, at the point of contact. For single point contact to occur, the cam’s radius of curvature must be smaller than that of the follower. The discussion has focused thus far on contacting rigid bodies undergoing planar motion. If the problem is fully three-dimensional, it becomes necessary to deſne the external surfaces of bodies k and , denoted Sk and S , respectively. If the contact between the two bodies occurs at a single point, the planes tangent to surfaces Sk and S at the instantaneous contact point must coincide and it is still possible to deſne a unit normal vector that is perpendicular to this common tangent plane. The contact condition expressed by eq. (5.29) still holds for this problem, but the relative velocity vector will have components along two directions within the common tangent plane.
182
5 Kinematics of rigid bodies
Example 5.4. Cam-follower pair Consider the planar cam-follower pair depicted in ſg. 5.19. The cam rotates at a constant angular velocity, Ω, about ſxed inertial point O. Frame F I = [O, I = (¯ı1 , ¯ı2 )] is inertial and frame F E = [O, E = (¯ e1 , e¯2 )] is attached to the cam. The external shape of the cam is deſned by curve C and the follower slides over this curve; the contact point between the cam and follower is denoted P. The motion of the follower is constrained to be along axis ¯ı2 and its displacement is denoted x. Find the velocity and acceleration of the follower.
i2 x h e2
P
er a e1 q e1
q
i1
g
r
eq O
a
C
Fig. 5.19. Conſguration of the cam-follower pair.
Let angle θ deſne the rotation of the cam; it follows that θ˙ = Ω. The geometry of curve C is deſned in polar coordinates, as discussed in example 2.3 on page 37, and angle α deſnes an arbitrary parameterization of the curve. The shape of the curve is then deſned by a given function, r = r(α), and notation (·) indicates a derivative with respect to angle α. Given the conſguration of the system, the tangent to curve C at the point of contact must remain horizontal at all times; this implies θ + α + γ = π.
(5.31)
It then follows that sin(θ + α) = Sγ = r/p1 , where Sγ = sin γ and the second equality results from eq. (2.28a). Similarly, cos(θ + α) = −Cγ = −r /p1 , where Cγ = cos γ and the second equality results from eq. (2.28b). Eliminating p1 from these two relationships leads to r(α) cos(θ + α) + r (α) sin(θ + α) = 0. For a given value of angle θ, this transcendental equation can be solved for angle α, which determines the location of the point of contact between the cam and the follower. It then becomes possible to evaluate r(α), r (α), and angle γ then follows from eqs. (2.28a) and (2.28b). A time derivative of eq. (5.31) yields Ω + α˙ + γ˙ = 0. Introducing eqs. (2.29) and (2.32) then yields r˙ = −ΩρCγ . The velocity of the material point of the cam located at the instantaneous point of contact between the cam and the follower is Ωr¯ eθ . The velocity of the material
5.4 Contact between rigid bodies
183
point of the follower at the same location is −x¯ ˙ ı2 . The contact condition for these two bodies, eq. (5.29), then implies ¯ıT2 Ωr¯ eθ = −¯ıT2 x¯ ˙ ı2 , or x˙ = ΩrCγ . The acceleration of the follower is obtained by taking a time derivative of this expression to ſnd x ¨ = Ω rC ˙ γ − ΩrγS ˙ γ . Introducing eq. (2.32) then yields x ¨ = (1 − Sγ r/ρ)Ω r/C ˙ γ and ſnally x¨ = Ω 2 (rSγ − ρ).
(5.32)
5.4.1 Problems Problem 5.22. Cam-follower pair questions Figure 5.20 shows the instantaneous point of contact, P, between to rigid bodies, denoted body k and body . Let t¯ be the unit vector tangent to the curves bounding the two bodies and n ¯ is perpendicular to this tangent vector. Vectors r kP and r P are the relative position vectors of point P with respect to points Ok and O , respectively. (1) Is θ˙ k the angular velocity of body k? (2) Let v kP and v P be the velocity vectors of the material points of body k and , respectively, that are coincident with the instantaneous point of contact, P. An analyst has evaluated these vectors, which are shown in ſg. 5.20. Are his predictions correct? (3) Is the relative velocity of body with respect to body k parallel to unit vector n ¯ ? (4) If v P = 4.5 k m/s in the upwards direction, determine v P . (5) Find the relative velocity vector of body with respect to body k. Justify all your answers; YES/NO answers are not sufſcient.
v
l
O
rPl
vPk
l P
P
15o
k P
r
q
q
Body l i1
Body k
l
g
i2
n
i2
k
t 50 O
o
k
Fig. 5.20. Conſguration of the cam-follower system.
P
eq e2 C r O
er a q
e1 e1 f
a
i1 f
q
A
d Fig. 5.21. Conſguration of the cam-follower system.
Problem 5.23. Cam-rocking bar pair Consider the planar cam-rocking bar pair depicted in ſg. 5.21. The cam rotates at a constant angular velocity, Ω, about ſxed inertial point O. Frame F I = [O, I = (¯ı1 , ¯ı2 )] is inertial and e1 , e¯2 )] is attached to the cam. The external shape of the cam is deſned frame F E = [O, E = (¯ by curve C and the rocking bar slides over this curve; the contact point between the cam and bar is denoted P. The bar is pivoted about point A and the distance between point O and A is denoted d. (1) Plot the curve deſning the outer shape of the cam. (2) Plot the curvature of curve C as a function of α ∈ [0, 360] deg. (3) Find the location of contact point P as a function of the cam rotation angle θ. This step requires the numerical solution of a transcendental equation
184
5 Kinematics of rigid bodies
for each value of angle θ. (4) On one graph, plot angles α, γ, and φ at the point of contact for θ ∈ [0, 360] deg. (5) On one graph, plot r, r , and r at the point of contact for θ ∈ [0, 360] ˙ deg. (6) Plot the non-dimensional angular velocity of the bar, φ/Ω, for θ ∈ [0, 360] deg. (7) 2 ¨ Plot the non-dimensional angular acceleration of the bar, φ/Ω , for θ ∈ [0, 360] deg. Use the following data: r(α) = r0 + r1c cos α + r2c cos 2α, r0 = 1, r1c = 0.54, and r2c = 0.18.
Problem 5.24. Cam-follower pair Consider the planar cam-follower pair depicted in ſg. 5.19. The cam rotates at a constant angular velocity, Ω, about ſxed inertial point O. Frame F I = [O, I = (¯ı1 , ¯ı2 )] is inertial e1 , e¯2 )] is attached to the cam. The external shape of the cam is and frame F E = [O, E = (¯ deſned by curve C and the follower slides over this curve; the contact point between the cam and follower is denoted P. The motion of the follower is constrained to be along axis ¯ı2 and its displacement is denoted x. (1) Plot the curve deſning the outer shape of the cam. (2) Plot the curvature of curve C as a function of α ∈ [0, 360] deg. (3) Find the location of contact point P as a function of the cam rotation angle θ. This step requires the numerical solution of a transcendental equation for each value of angle θ. (4) On one graph, plot angles α and γ at the point of contact versus θ ∈ [0, 360] deg. (5) On one graph, plot r, r , and r at the point of contact versus θ. (6) Plot the non-dimensional velocity of the follower, x/(Ωr ˙ 0 ), versus θ. (7) Plot the non-dimensional acceleration of the follower, x ¨/(Ω 2 r0 ), versus θ. Use the following data: r(α) = r0 + r1c cos α + r2c cos 2α, r0 = 1, r1c = 0.5, and r2c = 0.18.
Problem 5.25. Cam-push rod pair Figure 5.22 depicts a planar cam-push rod pair. The cam rotates at a constant angular velocity, Ω, about ſxed inertial point O. Frame F I = [O, I = (¯ı1 , ¯ı2 )] is inertial and frame e1 , e¯2 )] is attached to the cam. The external shape of the cam is deſned F E = [O, E = (¯ by curve C and the push rod slides over this curve; the contact point between the cam and push rod is denoted P. The push rod’s axis is at a distance d from axis ¯ı2 and its support at a distance h from axis ¯ı1 . (1) Plot the curve deſning the outer shape of the cam. (2) Plot the curvature of curve C as a function of α ∈ [0, 360] deg. (3) Find the location of contact point P as a function of the cam rotation angle θ. This step requires the numerical solution of a transcendental equation for each value of angle θ. (4) On one graph, plot angles α, β, and γ at the point of contact versus θ ∈ [0, 360] deg. (5) On one graph, plot r, r , and r at the point of contact versus θ. (6) Plot the non-dimensional position of the push rod, x/r0 , versus θ. (7) Plot the non-dimensional angular velocity of the push rod, x/(Ωr ˙ 0 ), versus θ. (8) Plot the non-dimensional angular acceleration of the push rod, x ¨/(Ω 2 r0 ), versus θ. Use the following data: r(α) = r0 + r1c cos α + r2c cos 2α, r0 = 1, r1c = 0.50, r2c = 0.18, d¯ = d/r0 = 0.5, ¯ = h/r0 = 1.8. h
Problem 5.26. Oscillating disk with sliding bar Figure 5.23 shows an oscillating disk (body ) pinned to the ground at its center point C, while a bar of length L, pinned at point O, slides in a radial track of the disk. The angular motion of the disk is prescribed as φ = φ0 sin ωt. The distance between points O and C is denoted b. The angular position of the bar (body k) is denoted θ and the point of contact between the bar and the track is at a distance r from the center of the disk. (1) On one graph, plot angles φ and θ versus τ . (2) Plot r¯ = r/L versus τ . (3) On one graph, plot angular velocities φ and θ versus τ . (4) Plot r¯ versus τ . (5) On one graph, plot angular accelerations φ and θ versus τ . (6) Plot r¯ versus τ . (7) Evaluate the velocities of the material points of body k and , denoted v kP and v P , respectively, that are instantaneously located at the point of contact of the two bodies. (8) Verify that eq. (5.29) is satisſed for your solution. (9) Evaluate the relative velocity of body with respect to body k, denoted v rP . (10) Let v P be the velocity vector of
5.4 Contact between rigid bodies
i2
185
i2
er d
x h e2
e1
P a
t r
eq
O
O
P f
i1
C
i1
q
r
L q
body k
b
body l
C
Fig. 5.22. Conſguration of the cam-push rod pair.
Fig. 5.23. Conſguration of the oscillating disk with sliding bar.
the material point at the tip of the bar. On one graph, plot the inertial components of v rP /(ωL) and v P /(ωL). Use the following data: ¯b = b/L = 0.75, φ0 = π/3. Use non-dimensional time τ = ωt and notation (·) indicates a derivative with respect to τ . Present all the results for τ ∈ [0, 2π].
Problem 5.27. Piston with track and pin Figure 5.24 depicts an oscillating piston with a track along which a vertical pin is sliding. The motion of the piston is prescribed as x = x0 (1 − cos ωt)/2. The shape of the track is deſned ¯ = x/L and u ¯ = u/L versus by an arbitrary parameterization, p0 (η). (1) On one graph, plot x τ . (2) Plot η versus τ . (3) On one graph, plot x ¯ and u ¯ versus τ . (4) Plot η versus τ . (5) On one graph, plot x ¯ and u ¯ versus τ . (6) Plot η versus τ . (7) Evaluate the relative velocity vector of body with respect to body k. (8) Evaluate the tangential and normal components of this relative velocity vector. (9) Plot the tangential components of the relative velocity vector versus τ . Use the following data: a ¯ = a/L = 1.5, ¯b = b/L = 3, x ¯0 = x0 /L = 1, ¯ = h/L = 1, and p (η) = aη¯ e + bη 2 e¯2 . Use non-dimensional time τ = ωt and notation h 1 0 (·) indicates a derivative with respect to τ . Present all the results for τ ∈ [0, 2π].
i2 e2 x
Body l
Body l u
L h
P p0(h) Body k
La
S
i2
Body k
w
h i1
e1 Fig. 5.24. Conſguration of sliding piston with track and pin.
A
q i1
d
f O
Fig. 5.25. Conſguration of the two-bar mechanism.
Problem 5.28. Two-bar mechanism Figure 5.25 shows a planar, two-bar mechanism. Crank OS is of length La and rotates at a constant angular velocity, Ω. Its tip slides along bar AS. The distance between points A and
186
5 Kinematics of rigid bodies
O is denoted d and that between point A and S is denoted w. (1) On one graph, plot angles φ and θ versus τ . (2) Plot w ¯ = w/d versus τ . (3) On one graph, plot angular velocities φ and θ versus τ . (4) Plot w ¯ versus τ . (5) On one graph, plot angular accelerations φ and θ versus τ . (6) Plot w ¯ versus τ . (7) Evaluate the relative velocity vector of body with respect to body k. (8) Evaluate the tangential and normal components of this relative velocity vector. (9) On one graph, plot the magnitude of the relative velocity vector and that of the slider. Use ¯ a = La /d = 0.8. Use non-dimensional time τ = Ωt and notation (·) the following data: L indicates a derivative with respect to τ . Present all the results for τ ∈ [0, 2π].
Problem 5.29. Disk-follower mechanism Figure 5.26 depicts a disk-follower mechanism. The disk of radius R rotates at a constant anguP C lar velocity, θ˙ = Ω, about point C. A pin is lo+ h B p cated at the rim of the disk. The slotted follower q b1 is hinged at point O and f the pin slides inside O the slot. Frame F F = O, B+ = (¯b1 , ¯b2 , ¯b3 ) C R i1 is attached to the follower and the distance from d point O to C is denoted d. The shape of the slot is deſned by curve C and the position vector of Fig. 5.26. The disk-follower mechanism. a point on this curve with respect to point O, resolved in basis B+ , is denoted p+ = (x0 +x1 η+x2 η 2 +x3 η 3 )¯b1 +(y0 +y1 η+y2 η 2 +y3 η 3 )¯b2 , where η is an arbitrary parameterization of the curve. (1) Plot the shape of curve C. (2) Plot angle φ versus τ . (3) Plot parameter η. (4) Plot angular velocity φ . (5) Plot η . (6) Plot angular acceleration φ . (7) Plot η . (8) Show that eq. (5.30) holds for your solution. (8) Plot the magnitude of the relative velocity vector. Use non-dimensional time τ = Ωt; notation (·) denotes a derivative with respect to τ . Use the following data: R = 1.2, d = 1.8, x0 = 0, x1 = 1, x2 = 0, x3 = 0.5, y0 = y1 = 0, y2 = −1.4, and y3 = 1 m.
b2
a1
i2
Problem 5.30. Geneva wheel mechanism Figure 5.27 depicts the Geneva wheel mechanism, which consists of a disk and slotted arm. The disk of radius R rotates about inertial point O at a constant angular velocity, θ˙ = Ω. A pin is located at the rim of the disk at point P. The slotted arm is hinged at point A and the pin slides inside the slot. The distance from point A to the pin is denoted w. (1) On one graph, plot angle φ versus θ for one revolution of the disk. (2) Plot distance w ¯ = w/R. (3) Plot angular velocity φ . (4) Plot w ¯ . (5) Plot angular acceleration φ . (6) Plot w ¯ . (7) Show that eq. (5.30) holds for your solution. Use non-dimensional time τ = Ωt; notation (·) denotes a derivative ¯ = L/R = 1.5. with respect to τ . Use the following data: L
A f
w P
q
i2 L
e2
r
e1 P
q
A
i1
O
O
d R
Fig. 5.27. Geneva wheel mechanism.
Fig. 5.28. Scotch yoke mechanism.
5.5 The motion tensor
187
Problem 5.31. Scotch yoke mechanism Figure 5.28 depicts the Scotch yoke mechanism, which consists of a disk and slotted yoke. The disk rotates about inertial point O at a constant angular velocity, θ˙ = Ω. A pin is located at a distance r from the center of the disk. The slotted yoke is allowed to move horizontally and the pin slides inside the slot. (1) Find the position of point A as a function of angle θ. (2) Find the velocity of point A. (3) Find the acceleration of point A. (4) Show that eq. (5.30) holds for your solution.
5.5 The motion tensor In this section, the motion tensor is introduced as the tensor that relates the Pl¨ucker coordinates of a line of a rigid body in its initial and ſnal conſgurations. 5.5.1 Transformation of a line of a rigid body Figure 5.29 shows a rigid body in its referFinal Configuration b3 ence conſguration deſned by frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. Two points of this rigid SQ b2 Q body, denoted points P and Q, are deſned by B* their position vectors with respect to point O b1 A given as sP and sQ , respectively. In the ſS i3 P P nal conſguration, the body is associated rigid P with frame F = A, B ∗ = (¯b1 , ¯b2 , ¯b3 ) . SusP u, R perscripts (·)∗ indicate tensor components reI ∗ solved in basis B . The position vectors of maQ i2 sQ O terial points P and Q with respect to point A are now S P and S Q , respectively. Because points Reference i1 configuration P and Q are material points of the rigid body, S P = R S ∗P and S Q = R S ∗Q . Fig. 5.29. A line of a rigid body in the Consider now the line passing through these reference and ſnal conſgurations. two points in the ſnal conſguration. Its orienta∗ ∗ ∗ ∗ ∗ tion, resolved in basis B , is ¯ = (S Q −S P )/(S Q −S ∗P ). The Pl¨ucker coordinates of this line, eq. (1.38), evaluated with respect to point A, are ∗ k SP∗ ¯∗ = ¯∗ . (5.33) Q∗ = ¯∗ The Pl¨ucker coordinates of the same line with respect to point O will now be evaluated and resolved in basis I. First, the orientation of the line is now ¯ =
(u + S Q ) − (u + S P ) SQ − SP S ∗Q − S ∗P = =R ∗ = R ¯∗ . (u + S Q ) − (u + S P ) S Q − S P S Q − S ∗P
Next, the Pl¨ucker coordinates of the same line become
188
5 Kinematics of rigid bodies
Q=
R SP∗ ¯∗ Ru u R¯∗ + RSP∗ RT R ¯∗ ( u + SP )¯ = = . 0 R ¯ R ¯∗ ¯∗
(5.34)
The motion tensor is deſned as C=
R Ru , 0 R
and eq. (5.34) can now be written in a compact form as ∗ k k Q = ¯ = C Q∗ = C ¯∗ .
(5.35)
(5.36)
Clearly, the motion tensor relates the Pl¨ucker coordinates of an arbitrary line of the rigid body resolved in two frames. This change of frame operation is more complex than the change in basis operation discussed in section 4.8: it involve both a change of basis and a change of reference point [1, 23, 22]. Equation (5.36) can be written in a more explicit manner as Q[F
I
]
I
= C [F ] Q[F ] .
In this form, the present change of frame operation mirrors the change of basis operation expressed by eq. (4.27). 5.5.2 Properties of the motion tensor The motion tensor can be factorized in the following manner R 0 Iu C= = T R, 0I 0 R
(5.37)
where R is the rotation tensor and T the translation tensor. The eigenvalues of the motion tensor are now easily computed. Indeed, det(C) = det(T ) det(R) = det(T ) det2 (R) and because det(T ) = 1, det(C) = det2 (R). Hence, the eigenvalues of the motion tensor are identical to those of the rotation tensor, but each with a multiplicity of two. The motion tensor, however, unlike the rotation tensor, is not an orthogonal tensor. Two linearly independent eigenvectors of the motion tensor associated with its unit eigenvalues are found to be ⎫ ⎧ T ⎨ G u ⎬ n ¯ N †1 = , and N †2 = 2 sin φ/2 . (5.38) 0 ⎭ ⎩ n ¯ The fact that N †1 is an eigenvector of the motion tensor stems from the corre¯ = n ¯ . It is readily veriſed that N †2 sponding property for the rotation tensor, R n
5.5 The motion tensor
189
is also an eigenvector of the motion tensor, indeed, R GT u/(2 sin φ/2) + u R n ¯ = (G − 2 n sin φ/2)u/(2 sin φ/2) = GT u/(2 sin φ/2). Any linear combination of eigenvectors N †1 and N †2 is still an eigenvector of the motion tensor. Consequently, the family of eigenvectors associated with the unit eigenvalue is expressed as follows (α − 1)d † m N = = N + N †2 , (5.39) n ¯ 2 sin φ/2 1 where α is an arbitrary scalar and d the intrinsic displacement of the rigid body. The displacement related part of the eigenvector is m=
GT u (α − 1)d + n ¯. 2 sin φ/2 2 sin φ/2
(5.40)
The scalar product of the two vectors forming the eigenvector is closely related to the intrinsic displacement of the rigid body λ=n ¯T m =
αd . 2 sin φ/2
(5.41)
5.5.3 Mozzi-Chasles’ axis In general, an arbitrary line of a rigid body is different in the reference and ſnal conſgurations. The following question can then be asked: is it possible to ſnd a line of the rigid body that is identical in the reference and ſnal conſgurations? If such line exists, its Pl¨ucker coordinates in the reference and ſnal conſgurations are identical, i.e., Q = Q∗ , or, using eq. (5.36), Q = C Q. This implies that the Pl¨ucker coordinates of this line must form an eigenvector of the motion tensor, as given by eq. (5.39). Because the ſrst three components of the Pl¨ucker coordinates of a line must be orthogonal to the last three, eq. (5.41) implies λ = α = 0, and hence, ⎫ ⎧ T n ⎬ ⎨ G n d u . QMC = N †2 − (5.42) N† = − 2 sin φ/2 1 ⎩ 2 sin φ/2 ⎭ n ¯ In summary, the Pl¨ucker coordinates of the line of the rigid body that is identical in the reference and ſnal conſgurations are given by eq. (5.42). These coordinates are those of Mozzi-Chasles’ axis, see eq. (5.9). Hence, Mozzi-Chasles’ axis is the line of the rigid body that is identical in the reference and ſnal conſgurations. This can be written as QMC = C QMC : Mozzi-Chasles’ axis is an eigenvector of the motion tensor corresponding to a unit eigenvalue.
190
5 Kinematics of rigid bodies
5.5.4 Intrinsic expression of the motion tensor The motion tensor was deſned by eq. (5.35), which is not an intrinsic expression because the displacement vector of the reference point of the rigid body, u, explicitly appears in this deſnition. In this section, an intrinsic expression of the motion tensor is sought, i.e., an expression in which vector u does not appear explicitly. Rodrigues’ rotation formula, eq. (4.15), provides an intrinsic equation for the rotation tensor in terms of n ¯ , the eigenvector of the rotation tensor associated with its unit eigenvalue, and φ, the magnitude of the rotation. A similar approach is followed here for the motion tensor, which should be expressed in terms of N , the eigenvector of the motion tensor associated with its unit eigenvalue, φ, the magnitude of the rotation, and d, the intrinsic displacement of the rigid body. The motion tensor, eq. (5.35), is composed of two sub-matrices: the rotation tensor, repeated twice along the diagonal, and tensor u R, appearing as an off-diagonal term. The intrinsic expression of the rotation tensor is provided by Rodrigues’ rotation formula, eq. (4.15). In contrast, the term u R is not intrinsic because the displacement vector of the reference point, u, appear explicitly. Using the deſnition of the intrinsic displacement of the rigid body, eq. (5.7), the displacement vector is related of the motion tensor, with the to the eigenvector help of eq. (5.40) to ſnd m = GT u + (α − 1)¯ nn ¯ T u /(2 sin φ/2). Introducing the expression for the half-angle rotation tensor, eq. (4.20), then yields m = E u, where second-order tensor E is deſned as α 1 α 1 I− n + − n n . E= 2 sin φ/2 2 2 sin φ/2 2 tan φ/2
(5.43)
(5.44)
It now becomes possible to express the displacement vector in terms of the ſrst part of the eigenvector of the motion tensor as u = J m,
(5.45)
where tensor J = E −1 is easily found as J=
2 sin φ/2 n+ I + (1 − cos φ) α
2 sin φ/2 − sin φ n n . α
(5.46)
Equation (5.45) now yields an explicit expression of the displacement of the body’s reference point φ m = sin φ m + d(1 − α cos ) u = J n + (1 − cos φ)( nm − m n). 2
(5.47)
Finally, tedious algebra reveals the following result, m R = sin φ m + dc1 n + (1 − cos φ) ( nm + m n) + dc2 n n , u R = J
(5.48)
5.5 The motion tensor
191
where coefſcients c1 and c2 are deſned as c1 = cos φ − α cos φ/2, c2 = sin φ − 2α sin φ/2.
(5.49a) (5.49b)
Combining Rodrigues’ rotation formula, eq. (4.15), and eq. (5.48), the motion tensor, eq. (5.35), becomes dc2 I sin φ I dc1 I (1 − cos φ) I n m n m n m C=I+ + . 0 n 0 sin φ I 0 n 0 (1 − cos φ) I 0 n (5.50) To simplify the writing of this seemingly complicated expression, the following notation is introduced. First, tensor Z, a function of two scalars, α and β, is introduced βI αI Z(α, β) = . (5.51) 0 βI Second, the generalized vector product tensor is deſned m = n . N 0 n
(5.52)
does not indicate a 6 × 6 skew-symmetric tensor, but rather the above Notation N 6 × 6 tensor formed by three skew-symmetric sub-tensors. Introducing these various notations into eq. (5.50) yields the desired intrinsic expression of the motion tensor and of its inverse + Z(dc2 , 1 − cos φ)N N , C(N ) = I + Z(dc1 , sin φ)N
(5.53a)
+ Z(dc2 , 1 − cos φ)N N . C −1 (N ) = I − Z(dc1 , sin φ)N
(5.53b)
The parallel between this intrinsic expression for the motion tensor and that for the rotation tensor given by Rodrigues’ rotation formula, eq. (4.15), is striking. Clearly, the skew-symmetric tensor, n , appearing in the expression for the rotation tensor , appearing in that for the is replaced by the generalized vector product tensor, N motion tensor. The two scalars, sin φ and (1 − cos φ), appearing in the expression for the rotation tensor becomes the second arguments of tensor Z appearing in that for the motion tensor. Rodrigues’ rotation formula, eq. (4.15), provides an intrinsic expression for the rotation tensor and is a direct consequence of Euler’s theorem on rotations, theorem 4.1. Similarly, the intrinsic expression for the motion tensor is a direct consequence of the Mozzi-Chasles theorem 5.1. The parallel between the rotation and motion tensors will be further explored in section 5.6.3. 5.5.5 Properties of the generalized vector product tensor The generalized vector product tensor deſned by eq. (5.52) enjoys remarkable properties that generalize those of the skew-symmetric tensor. First, the skew-symmetric
192
5 Kinematics of rigid bodies
operator, n , possesses a null eigenvalue, n n ¯ = 0¯ n. Similarly, the generalized vector N = 0N . product tensor also possesses a null eigenvalue, N The second property of the generalized vector product tensor generalizes the behavior of the skew-symmetric tensor under a change of basis operation, eq. (4.30). Consider the following triple matrix product T2 2 R2 RT2 RT2 u n 3 m n 1 m 3 1 R2 u = . T 0 n 3 0 n 0 R2 0 R2 1 This equality implies two conditions. The ſrst condition is n 3 = RT2 n 1 R2 , which, T in view of eq. (4.30), implies n ¯ 3 = R2 n ¯ 1 . The second condition is m 3 = RT2 (m 1 + T n 1 u 2 − u 2 n 1 )R2 , and tensor identities then lead to m3 = R2 (m1 + n 1 u2 ). These results can be summarized by the following equivalence, 3 = C −1 (N )N 1 C(N ) ⇐⇒ N = C −1 (N )N . N 2 2 3 2 1
(5.54)
The third property of the generalized vector product tensor generalizes identity (1.34b), which holds for unit vectors and is rewritten here as n n n +n = 0. N N + Z(2λ, 1)N = 0. N
(5.55)
The use of identities (1.34b) and (1.36) yields the above result, where λ = n ¯ T m. 5.5.6 Change of frame operation for linear and angular velocities Let the reference conſguration of the rigid body shown in ſg. 5.29 be the conſguration of the body at time t = 0, and its ſnal conſguration is time-dependent. Consider now two vectors associated with the rigid body: the velocity vector of point A, denoted v A , a bound vector, see section 1.2, and the angular velocity vector of the body, denoted ωA , a free vector, see section 1.1. The components of these two vectors resolved in basis B ∗ are denoted v ∗A and ∗ ω A , respectively, where the subscript on the latter symbol is, of course, superƀuous because the angular velocity is identical for all points of the body. The following velocity vector is now deſned ∗ vA V∗ = . (5.56) ω ∗A Strictly speaking, quantity V ∗ should not be called a vector: it is, in fact, an array composed of two individual vectors, the linear and angular velocity vectors. The rules of transformation of ſrst-order tensors, eq. (4.27), apply to these two vectors, but not to quantity V ∗ . It is convenient, however, to call quantity V ∗ a vector to underline the tensorial nature of the two vectors it is composed of. Symbols in calligraphic type, such as V ∗ , are used to denote quantities composed of two vectors. For simplicity, these quantities will be referred to as vectors in the following.
5.5 The motion tensor
193
In the previous section, the motion tensor was shown to transform the Pl¨ucker coordinates of a line from one frame to the other, and hence, it is interesting to consider the following transformation vO V= = C V ∗. (5.57) ωO To understand the physical meaning of this transformation, the physical interpretation of the velocity vectors v O and ωO must be identiſed ſrst. It is clear that v A = R v ∗A and ω A = R ω ∗A are the components of vectors v A and ω A , respectively, resolved in basis I. This corresponds to a change of basis operation, which establishes the relationship between the components of vectors in two bases. Next, because v O = R v ∗A + u R ω ∗A = v A − ω A u, velocity vector v O is that of the point of the rigid body which instantaneously coincides with the origin of the reference frame, point O. Of course, ωO can also be interpreted as the angular velocity vector of the same point, because the angular velocity vector is the same for all points of a rigid body. Hence, this second operation corresponds to a change of reference point operation, which establishes the relationship between the velocities of two different points of the rigid body. In summary, the operation described by eq. (5.57) corresponds to a change of frame operation, which combines a change of basis operation and a change of reference point operation. The factorized form of the motion tensor, eq. (5.37), clearly underlines the double effect of a frame change. It consists of two operations: ſrst a change of basis operation characterized by the rotation operator, R, then a change of reference point operation characterized by the translation operator, T . This change of frame operation can be inverted to yield V ∗ = C −1 V, where the inverse of the motion tensor is T T T T T Iu R R u R 0 C −1 = R−1 T −1 = RT T −1 = = . T T 0 I 0 R 0 R
(5.58)
(5.59)
An an intrinsic expression for this inverse is given by eq. (5.53b). 5.5.7 Change of frame operation for forces and moments A similar study can be made concerning two other vectors associated with the rigid body: the force vector acting on the rigid body, F A , and the moment acting on the rigid body, M A , evaluated with respect to point A. The components of these two vectors resolved in basis B ∗ are denoted F ∗A and ∗ M A , respectively, where the subscript on the former symbol is, of course, superƀuous because the force vector can be applied at any point of the rigid body. The following applied load vector is deſned ∗ FA A∗ = . (5.60) M ∗A
194
5 Kinematics of rigid bodies
Consider now the effect of the following transformation R 0 FO = C −T A∗ = A∗ . A= MO u R R
(5.61)
Here again, it is clear that F A = R F ∗A and M A = R M ∗A are the components of the force and moment vectors, respectively, resolved in the inertial basis. This corresponds to a change of basis operation, which establishes the relationship between the components of vectors in two orthonormal bases. Next, because M O = R M ∗A + u R F ∗A = M A + u F A , moment vector M O is the applied moment computed with respect to the point of the rigid body that instantaneously coincides with point O. Of course, F O can also be interpreted as the force applied on the rigid body at the same point, because this force is the same at all points of the rigid body. In summary, the operation described by eq. (5.61) is a change of frame operation that combines a change of basis and a change of reference point. This change of frame operation can be inverted to yield A∗ = C T A.
(5.62)
The components of velocity quantities and applied loads quantities transform differently under a frame change operation, as indicated in eqs. (5.57) and (5.61), respectively. Both transformations, however, are based on the motion tensor which appears to be a fundamental quantity associated with frame changes.
5.6 Derivatives of ſnite motion operations The derivatives of ſnite rotation operations were discussed in section 4.10 and led to the concept of angular velocity vector. The present section focuses on the study of time derivatives of the motion tensor, which leads to both velocity and angular velocity vectors. Differential changes in motion are also investigated. 5.6.1 The velocity vector The time-dependent motion of a rigid body is represented by the time-dependent motion of the body attached frame, F = A, B ∗ = (¯b1 , ¯b3 , ¯b3 ) , depicted in ſg. 5.29. Let C be the motion tensor that brings reference frame F I to frame F , and eq. (5.36) then implies Q(t) = C(t)Q∗ . Taking a time derivative of this equation leads to ˙ = C˙ Q∗ , and eliminating Q∗ then yields Q ˙ = C˙ C −1 Q. Q
(5.63)
Comparing this equation with eq. (4.56) reveals that expression C˙ C −1 , associated ˙ RT , associated with the rotation with the motion tensor, generalizes expression R tensor. The use of identity (1.33a) leads to
5.6 Derivatives of ſnite motion operations
C˙ C −1
˙ ˙ u T RT RT u ˙ R + u R ω v R + u ω) ω ( u ˙ = = = . ˙ 0ω 0 RT 0 R 0 ω
195
(5.64)
This expression gives rise to two quantities. First, the angular velocity of the rigid ˙ RT ); as body emerges from the time derivative of the rotation tensor, ω = axial(R expected, this quantity is identical to that which arose for the study of time derivatives of time-dependent rotations, see section 4.10. Second, the velocity vector of the rigid body, v = u˙ + u ω, also emerges from the time derivative of the motion tensor. This quantity can be interpreted as the linear velocity of the point of the rigid body that instantaneously coincides with the origin of the reference frame, point O, see section 5.5.6. The velocity vector of the rigid body resolved in frame F I is now deſned as v , (5.65) V= ω ˙ = VQ, where the generalized vector product tensor is and eq. (5.63) becomes Q given by eq. (5.52). It is also possible to resolve the components of the velocity vector in the moving frame, ˙ = C −1 C˙ Q∗ . C −1 Q (5.66) ˙ associated Comparing this equation with eq. (4.55) reveals that expression C −1 C, ˙ associated with the rotation with the motion tensor, generalizes expression RT R, tensor. It is readily found that T T T ˙ ˙ T ˙ u ∗ R R u R + u R R ω ∗ v∗ u ˙ R ω −1 ˙ C C= = . (5.67) = T ˙ 0 ω ∗ 0 R 0 R 0 ω This expression gives rise to two quantities. First, the components of the angular ˙ Second, velocity of the rigid body resolved in the rotating basis, ω ∗ = axial(RT R). the components of the velocity vector of the reference point of rigid body resolved in the rotating basis, v ∗ = RT u. ˙ The components of the velocity vector of the rigid body resolved in the material frame are now deſned as ∗ v ∗ V = = C −1 V, (5.68) ω∗ where the second equality follows from eq. (5.58). Equation (5.66) now becomes ∗ Q∗ , where the generalized vector product operator is given by eq. (5.52). ˙ =V C −1 Q The above developments are summarized in the following relationships C˙ C −1 = V, ∗, C −1 C˙ = V It is readily shown that
C C˙ C˙
−1
−1
= −V,
(5.69a)
∗. C = −V
(5.69b)
196
5 Kinematics of rigid bodies
∗ = C −1 VC, V
(5.70a)
∗ C −1 , =CV V
(5.70b)
as can be expected from the transformation formulae for the velocity vectors, eqs. (5.57) and (5.58). 5.6.2 The differential motion vector The concept of differential rotation vector was introduced in section 4.12.4 based on the rotation tensor, eq. (4.101). By analogy, the following expression is formed T T T du (du R+u dψ dψ dR R R u +u dψ) dR du −1 dC C = = = . 0 RT 0 dR 0 dψ 0 dψ This expression gives rise to two quantities. First, the differential rotation vector of the rigid body emerges from differential changes of the rotation tensor, dψ = axial(dR RT ); this quantity is identical to that deſned by eq. (4.101). As discussed in section 4.12.4, no vector ψ exists such that d(ψ) gives the differential rotation vector. Second, the differential displacement vector of the rigid body, du = du + u dψ, also emerges from the differential of the motion tensor. du is the differential displacement of point A and du = du + u dψ the differential displacement of the material point of the rigid body that instantaneously coincides with point O. Of course, there exist no displacement vector, say x, such d(x) = du + u dψ. Notations du and dψ will be used to denote the differential displacement and rotation vectors, respectively. By analogy to eqs. (5.69a) and (5.69b), the following compact notation is adopted , dC C −1 = dU
CdC −1 = −dU,
∗
∗
, C −1 dC = dU
, dC −1 C = −dU
where the components of the differential motion vector are deſned as du = C dU ∗ , dU = dψ ∗ du = C −1 dU, dU ∗ = dψ ∗
(5.71a) (5.71b)
(5.72a) (5.72b)
in the ſxed and moving frames, respectively. The components of the differential rotation and displacement vectors, both resolved in the moving frame, are dψ ∗ = axial(RT dR) and du∗ = RT du, respectively. It is readily shown that ∗
C, = C −1 dU dU ∗ −1
= C dU C dU
(5.73a) .
(5.73b)
5.6 Derivatives of ſnite motion operations
197
Taking a differential of eq. (5.69a) and a time derivative of eq. (5.71a) leads to −1 ˙ ˙ −1 and dU = dC˙ C −1 + dC C˙ , respectively. Subtracting these dV = dC˙ C −1 + CdC two equations and using eqs. (5.69a) and (5.71a) then yields ˙ + dU V. − dU dU dV = −V Expanding these expressions and using identity (1.33a) then leads to this important ˙ − VdU , which relates differentials in the velocity vector to the difresult dV = dU ferential motion vector and its time derivative. This equation generalizes eq. (4.102a) written for the sole angular velocity. The following results are obtained in a similar manner ˙ ∗, dV = C dU
˙ − VdU, dV = dU ∗
∗
∗
∗
˙ + V dU , dV = dU
∗
dV = C
−1
˙ dU.
(5.74a) (5.74b)
5.6.3 Change of frame operations Section 4.8.1 discussed change of basis operations. By deſnition 4.1, a vector, or ſrst-order tensor, is a mathematical entity whose components resolved in two bases are related by eqs. (4.27). This deſnition applies equally to kinematic quantities such as displacement and rotation vectors, and load quantities such as force or moment vectors. For instance, the components of the velocity vector resolved in inertial and material bases, denoted v and v ∗ , respectively, are such that v = R v ∗ , if R are the components of the rotation tensor that brings the inertial to the material basis, resolved in the inertial basis. The components of the angular velocity vector resolved in the same bases are such that ω = R ω ∗ . Similar relationships hold for the components of the force and moment vectors. In fact, according to deſnition 4.1, the components of all vectors follow the same transformation rule under a change of basis. Section 5.5.6 presented the change of frame operation for the linear and angular velocity vectors. For instance, eq. (5.57) provides the relationship between the components of the linear and angular velocity vectors resolved in the inertial and material frames, denoted V and V ∗ , respectively, as V = C V ∗ , if C are the components of the motion tensor that brings the inertial frame to the material frame, resolved in the inertial frame. The change of frame transformation operates on the linear
and angular velocity vectors simultaneously. The notational convention, V T = v T , ω T and the use of a 6 × 6 motion tensor enable the simultaneous manipulation of the two vectors. On the other hand, section 5.5.7 introduced the change of frame operation for forces and moments. For instance, eq. (5.61) provides the relationship between the components of the force and moment vectors resolved in the inertial and material frames, denoted A and A∗ , respectively, as A = C −T A∗ . The change of frame operation for kinematic quantities is based on the motion tensor, C, but the same change of frame operation for loads uses the transpose of its inverse, C −T .
198
5 Kinematics of rigid bodies
Energetically conjugate quantities To understand the crucial difference between change of basis and change of frame operations, consider the differential work done by the force and moment vectors, denoted F and M , respectively, applied at point A of the rigid body, dW = F T du + M T dψ,
(5.75)
where du and dψ are the differential displacement vector of the point of application of the force and the differential rotation of the rigid body, respectively, see ſg. 5.29. The force and differential displacement vectors are said to be energetically conjugate quantities because their scalar product yields the differential work. Similarly, the moment and differential rotation vectors are also energetically conjugate quantities. Because the scalar product is a tensor operation, the differential work can also be expressed as dW = F ∗T du∗ + M ∗T dψ ∗ , where F ∗ = RT F and M ∗ = RT M are the components of the force and moment vectors, resolved in the body attached basis, and du∗ = RT du and dψ ∗ = RT dψ the components of the differential displacement and rotation vectors resolved in the same basis. Energetically conjugate quantities, such as the moment and differential rotation vectors, follow the same rules of transformation under a change of basis. The following compact notation is introduced dW = F ∗T du∗ + M ∗T dψ ∗ = A∗T dU ∗ ,
(5.76)
where A∗T = F ∗T , M ∗T is the applied loading vector and dU ∗ the differential motion vector deſned by eq. (5.72b). These two quantities, A∗ and dU ∗ , are energetically conjugate because their scalar product yields the differential work. To explore the effect of a change of frame, the following transformation is performed, dW = A∗T dU ∗ = A∗T C −1 C dU ∗ = AT dU, (5.77) where A = C −T A∗ , as expected from eq. (5.61), and dU = C dU ∗ , as expected from eq. (5.72a). Under a change of frame, the rules of transformations for energetically conjugate quantities differ. This difference stems from the fact that the motion tensor is not an orthogonal tensor, C −1 = C T . In contrast, the rotation tensor is orthogonal, R−1 = RT and consequently, the rules of transformations for energetically conjugate quantities are identical for change of basis operations. Generalization of the concept of tensor analysis Section 4.8.4 introduced the concept of tensor analysis. The combined use of tensor quantities and tensor operations leads to a formulation of the laws of physics that guarantees their invariance with respect to change of basis operations. Intuitively, the laws of physics should be invariant under a change of basis operation because this operation simply corresponds to the selection of a different basis in which all
5.6 Derivatives of ſnite motion operations
199
tensor components are resolved, but does not change the physical behavior of the system. Intuitively, the laws of physics should also be invariant with respect to a change in the location of the origin of the coordinate system, as long as this new origin is still an inertial point. Combining these two intuitive observations, the invariance of the laws of physics with respect to both basis and origin selection, leads to the natural conclusion that the laws of physics must be invariant with respect to a change of frame. The generalization of the concept of tensor analysis to the invariance of the laws of physics to change of frame operations involves two parts, the use of generalized tensors quantities and of generalized tensor operations. These two concepts generalize the use of tensor quantities and tensor operations characteristics of tensor analysis, see sections 4.8.3 and 4.8.4. When dealing with change of frame operation, linear and angular quantities becomes coupled. For instance, linear and angular velocity vectors are paired to form the generalized velocity vector deſned by eq. (5.56), similarly, the force and moment vectors are paired to form the generalized loading vector deſned by eq. (5.60). The generalized velocity vector is composed of two vectors, or ſrst-order tensors Consider now the change of frame operation expressed by eq. (5.57) and repeated here in more explicit details v O = R v ∗A + u R ω∗A R ∗ Ru ⇐⇒ V V = C V ∗. = (5.78) ωO = R ω ∗A 0 R The two equations on the left-hand side are basis invariant because they only involve tensor quantities and tensor operations; they satisfy all the rules of tensor analysis. Taken together, they express a change of frame operation, which is repeated on the right-hand side with a more compact notation. The generalized motion tensor, C, of size 6 × 6, is composed of four sub-matrices, each of size 3 × 3, which are each second-order tensors. Clearly, the right-hand side of eq. (5.78) generalizes the change of basis operation, v = R v ∗ , to the change of frame operation, V = C V ∗ . In the following developments, quantities such as V or V ∗ will be called vectors or ſrst-order tensors, and quantities such as the motion tensor will be called second-order tensors. This terminology is more convenient to use in place of the more awkward “generalized velocity vector” and “generalized motion tensor.” Because generalized vectors and tensors are indicated in calligraphic type, their generalized nature is clearly implied. Example 5.5. First- and second-order tensors In the previous sections, ſrst- and second-order tensors have already been encountered. The second property of the generalized vector product operator given by eq. (5.54) is repeated here for convenience 3 = C −1 (N )N 1 C(N ) ⇐⇒ N = C −1 (N )N . N 2 2 3 2 1 The right-hand side of this equivalence expresses the rules of transformation for the ſrst-order tensor, N : its components in frames F1 and F3 are N 1 and N 3 , respec-
200
5 Kinematics of rigid bodies
tively, and C(N 2 ) are the components of the motion tensor that brings frame F1 to F3 , resolved in frame F1 . The left-hand side of this equivalence expresses the rules of transformation for : its components in frames F1 and F3 are N 1 and N 3 , the second-order tensor, N respectively. This gives a formal proof that the generalized vector product operator deſned by eq. (5.52) is in fact a second-order tensor. Clearly, these results generalize the corresponding results obtained for the skew-symmetric tensor in eq. (4.30). Example 5.6. The motion tensor The motion tensor was introduced in section 5.5.1 and was called a “tensor.” Prove that the motion tensor is indeed a second-order tensor. Consider the intrinsic expression of the motion tensor given by eq. (5.53a) as + Z(dc2 , 1 − cos φ)N N . The arguments of operator S(N ) = I + Z(dc1 , sin φ)N Z are functions of two variables, rotation angle φ and the intrinsic displacement of the rigid body, d. Both quantities are zeroth-order tensor because they are unaffected by a change of frame operation. Next, it is easily veriſed that operator Z(α, β), where α and β are zeroth-order tensors, is itself a zeroth-order tensor. Indeed, Z(α, β) = C −1 (N 2 )Z(α, β)C(N 2 ),
(5.79)
which implies the invariance of Z(α, β) under a change of frame operation, −1 −1 −1 S(N )C 2 = I + Z(dc1 , sin φ) C N + Z(dc2 , 1 − cos φ) C N C N C −1 2 2 2 2
= S(C −1 N ), 2
where the tensorial nature of the generalized vector product tensor, eq. (5.54), was taken into account. The tensorial nature of the motion tensor is now established. A more formal expression of the tensorial nature of the motion tensor is C(N 3 ) = C −1 (N 2 )C(N 1 )C(N 2 ) ⇐⇒ N 3 = C −1 (N 2 )N 1 .
(5.80)
This result for the motion tensor should be compared with the corresponding result for the rotation tensor, eq. (4.31).
6 Kinetics of rigid bodies
In section 3.4, the dynamic response of a system of particles subjected to both internally and externally applied loads is studied and leads to Euler’s ſrst and second laws [14, 15]. Rigid bodies can be viewed as systems of particles subjected to both internal and external forces. The former forces are those that maintain the shape of the rigid body. By deſnition, a rigid body is one for which the distance between any two of its particles remains constant at all times. The displacement ſeld of the rigid body must satisfy the kinematic constraints developed in section 5.1 and its velocity ſeld those described in section 5.2. The conſguration of the rigid body is deſned by six parameters: the three coordinates describing the location of one of its points and three parameters describing its orientation. Similarly, the velocity ſeld of the rigid body is determined by six parameters: the three components of the linear velocity vector of one of its points and the three components of its angular velocity vector. Clearly, all the results derived in section 3.4 concerning the Newtonian mechanics of systems of particles are readily applicable to rigid bodies. In particular, the motion of the center of mass of the rigid body is governed by the following equation: F = P˙ , where P = mv C is the linear momentum of the body, m its total mass, v C the velocity of its center of mass, and F the sum of all externally applied forces. An˙ , where H other vector equation that applies to systems of particles is M C = H C C is the angular momentum vector of the rigid body and M C the sum of the externally applied moments, both computed with respect to the center of mass of the rigid body. ˙ , i.e., both angular momentum and externally applied It is also true that M O = H O moments can be evaluated with respect to an inertial point O. These two differential vector equations in time provide the six scalar equations necessary to solve for the motion of the rigid body. The ſrst equation is very similar to Newton’s second law for a single particle, eq. (3.4). The mass of the entire rigid body multiplied by the acceleration of its center of mass equals the sum of all externally applied forces. The rigid body can be replaced by a ſctitious particle of mass m located at its center of mass and subjected to all the forces externally applied to the body.
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_6 © Springer Science+Business Media B.V. 2011
202
6 Kinetics of rigid bodies
The second equation describes the motion of the rigid body around its center of mass. This equation is more complex than the ſrst and does require the evaluation of the angular momentum vector of the rigid body, which will bring to light an additional inertial characteristics of the rigid body, the mass moment of inertia tensor. The evaluations of the angular velocity vector and of the kinetic energy of a rigid body are presented in sections 6.1 and 6.2, respectively. The evaluation of these quantities gives rise to the tensor of mass moments of inertia whose properties are reviewed in section 6.3. The equations of motion of a rigid body are derived in section 6.5 and the principle of work and energy in section 6.6. A special case of particular interest for many applications is the planar motion of rigid bodies, which is the focus of section 6.7.
6.1 The angular momentum vector The angular momentum vector of a system of particles, )Ncomputed with respect to an arbitrary point O, is deſned in section 3.4 as H O = i=1 ri mi v i , where mi is the mass of a particle of the system, v i its inertial velocity vector, r i its position vector with respect to point O, and N the total number of particles of the system. When dealing with a rigid body, this deſnition is not easy to handle: the number of particles is very large while the mass of each one is very small. Each atom the rigid body could be considered to be a particle of very small mass and the total number of particles would be extremely large. Consequently, the familiar concepts of continuum mechanics are introduced: each differential volume element of the body, dV, is considered to be a particle of mass mi = ρdV, where ρ is the mass density of the body. The sum over all particles is then replaced by an integral over the entire volume of the body. Figure 6.1 show the conſguration of the b3 rigid body; frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )] is v b2 an inertial frame. Point B is a reference point B of the body while point Q is a material point s B i3 of the body. The position vector of point Q with Q respect to point B is denoted s. The angular mob1 mentum vector of the body, computed with reO i2 spect to point B, is then i1 I % H = sv Q ρdV, Fig. 6.1. Conſguration of a rigid B V
body.
where v Q is the inertial velocity of point Q, and V the total volume of the body. Because the body is rigid, its velocity ſeld is described by eq. (5.10), i.e., v Q = vB + ω s and the angular momentum vector now becomes % s(v B + ω s) ρdV. HB = V
6.1 The angular momentum vector
203
Because v B is the velocity of reference point B and ω the angular velocity of the rigid body, this expression can be recast as % % HB = s ρdV v B + s sT ρdV ω. V
V
The ſrst bracketed term ' is related to the location of the center of mass of the rigid body, see eq. (3.56); V s ρdV = mr BC , where r BC is the position vector of the rigid body center of mass with respect to point B. The second bracketed term is the tensor of mass moments of inertia, evaluated with respect to point B. This second-order, symmetric tensor is deſned as % B I = ssT ρdV, (6.1) V
With these deſnitions, the angular momentum vector takes the following form rBC v B + I B ω. H B = m
(6.2)
The sole inertial characteristic of a particle is its mass, but the characterization of the inertial properties of a rigid body is more complex. Ten quantities are required: the total mass of the body, m (a single scalar quantity), the location of the center of mass, r BC (three components of this vector), and the mass moments of inertia tensor, I B (six independent components of this symmetric tensor). The units of the mass moments of inertia tensor are kg·m2 . An arbitrary orientation of the inertial basis, I, was selected. A different basis, say I , could have been selected. Let R be the components of the rotation tensor that brings basis I to basis I , resolved in basis I. If s and s denote the components of vector s in bases I and I , respectively, eq. (4.27) implies s = RT s. It then follows that % % (I B ) = s sT ρdV = RT sR RT sT R ρdV V V % (6.3) T T B T =R s s ρdV R = R I R. V
This expression relates the components of tensor of mass moments of inertia resolved in two bases, I and I , denoted I B and (I B ) , respectively. The fact that these components are related by the transformation rules for the components of second-order tensors, eq. (4.29), proves the tensorial nature of the mass moments of inertia tensor deſned in eq. (6.1). Often, it will be convenient to compute the angular momentum vector with respect to the center of mass of the rigid body. Indeed, selecting the center of mass as the reference point of the body implies r CC = 0, and eq. (6.2) reduces to H C = I C ω.
(6.4)
A similar simpliſcation is achieved if the reference point on the rigid body happens to be an inertial point, i.e., if v B = 0. In this case, eq. (6.2) reduces to
204
6 Kinetics of rigid bodies
H B = I B ω.
(6.5)
Note that the angular momentum vector and the tensor of mass moments of inertia are quantities computed with respect to a speciſc point. Notations I C and I B denote the tensors of mass moments of inertia computed with respect to points C and B, respectively. Similarly, H C and H B indicate the angular momentum vectors evaluated with respect to points C and B, respectively.
6.2 The kinetic energy The kinetic energy of a particle, eq. (3.10), is deſned as K = 1/2 mv T v, where m is the mass of the particle and v its inertial velocity vector. The kinetic energy of a differential element of the rigid body located at point Q is now K = 1/2 ρdV v TQ v Q , ' and that of the entire body becomes K = 1/2 V v TQ v Q ρdV. The velocity ſeld of a rigid body is described by eq. (5.10) as v Q = v B + ω s and the kinetic energy now becomes % 1 K= (v T v + 2v TB ω s + sT ω T ω s) ρdV. 2 V B B Because vector v B is the velocity of reference point B and ω the angular velocity vector of the rigid body, this expression is recast as % % % 1 K= ρdV v TB v B + 2vTB sT ρdV ω + ω T s sT ρdV ω . 2 V V V The ſrst bracketed term simply represents the total mass of the rigid body. The second bracketed' term is related to the location of the center of mass of the rigid body, eq. (3.56), V s ρdV = mr BC . Finally, the last bracketed term is the tensor of mass moments of inertia evaluated with respect to point B deſned by eq. (6.1). The kinetic energy expression now reduces to K=
" 1! T m v TB v B + 2mv TB rBC ω + ωT I B ω . 2
(6.6)
Here again, it is possible to simplify this expression by selecting the center of mass of the rigid body as the reference point; this implies r CC = 0, and hence, K=
1 1 m v TC v C + ω T I C ω. 2 2
(6.7)
The ſrst term represents the kinetic energy associated with the translational motion of the rigid body, and the second represents that associated with the rotation of the body. The expression for the translational kinetic energy of the rigid body, 1/2 m v TC v C , is identical to that of a particle of mass m moving at velocity v C . The rotational motion of the body about its center of mass is associated with an additional amount of kinetic energy called rotational kinetic energy, 1/2 ω T I C ω, that is a quadratic function of the angular velocity of the rigid body.
6.3 Properties of the mass moment of inertia tensor
205
Because the kinetic energy is a positive quantity, the translational energy must always be positive, i.e., 1/2 m v TC v C > 0 for any vector v C = 0; it follows that the mass of the body, m, must be positive number, a forgone conclusion. The same argument applied to the rotational kinetic energy yields 1/2 ω T I C ω > 0 for all angular velocity vectors ω = 0; this implies that the tensor of mass moments of inertia, I C , is a positive-deſnite tensor, eq. (1.53).
6.3 Properties of the mass moment of inertia tensor This section investigates the properties of the mass moment of inertia tensor. If the location of the reference point of the rigid body is changed, the components of the mass moment of inertia tensor change according to the parallel axis theorem studied in section 6.3.1. Furthermore, section 6.3.2 shows that the components of the mass moment of inertia tensor change according to the rules of transformation for secondorder tensors if the orientation of the body attached basis is modiſed. 6.3.1 The parallel axis theorem In the previous section, the mass moment of Q inertia tensor was evaluated with respect to an q s arbitrary reference point of the body and with i3 C respect to its center of mass. Figure 6.2 derPC B picts the conſguration of the rigid body: s is the position vector of a material point Q of the O i2 rigid body with respect to reference point B, i1 I and q is the position vector of the same point with respect to the center of mass C. Clearly, Fig. 6.2. Evaluating the mass moments s = rBC +q, where rBC is the position vector of inertia with respect to a reference of the center of mass with respect to point B. point B and the center of mass C. The tensor of mass moments of inertia evaluated with respect to point B is now % % T IB = s sT ρdV = ( rBC + q)( rBC + qT ) ρdV. V
V
Expanding the integrand and taking advantage of the fact that r BC can be factored out of the integral sign leads to % % % T T I B = m rBC rBC + rBC qT ρdV + q ρdV rBC + q qT ρdV. V
'
V
V
The two middle terms vanish because V q ρdV = m rCC = 0. The last term is the mass moment of inertia tensor, I C , evaluated with respect to the center of mass, and hence, T I B = I C + m rBC rBC . (6.8)
206
6 Kinetics of rigid bodies
B C Let Iij and Iij be the components of the mass moment of inertia tensors I B and I C , T
be the components of vector r BC , all resolved in respectively, and let x1 , x2 , x3 a given basis. The diagonal components of tensor I B now become B C = I11 + m(x22 + x23 ), I11
(6.9a)
B I22 B I33
(6.9b)
= =
C I22 C I33
+ +
m(x21 m(x21
+ +
x23 ), x22 ).
(6.9c)
B C and I11 of the mass moment of inertia tensor are evaluated with Components I11 respect to two different points, an arbitrary point B and the center of mass, respectively, but in the same basis, i.e., with respect to parallel axis systems; hence, the name of parallel axes theorem. The properties of the center of mass were used in the derivation of this theorem, B R hence, it is incorrect to write I11 = I11 + m(x22 + x23 ) if points B and R are two arbitrary points of the rigid body. Because the second term on the right-hand side of eqs. (6.9) is strictly positive, B C it follows I11 > I11 , that is, the moment of inertia always increases when moving away from the center of mass. In other words, the minimum value of I11 is obtained when it is computed with respect to the center of mass. The off-diagonal terms of tensor of moments of inertia are called products of inertia; in view of eq. (6.8), they become B C I23 = I23 − mx2 x3 ,
(6.10a)
− mx1 x3 ,
(6.10b)
− mx1 x2 .
(6.10c)
B I13 B I12
= =
C I13 C I12
In this case, the second term on the right-hand side could be positive or negative; consequently, products of inertia could increase of decrease when moving away from the center of mass. Theorem 6.1 (Parallel axis theorem). The components of the mass moment of inertia tensor of a rigid body computed with respect to an arbitrary point B are related to their counterparts resolved in the same basis but computed with respect to the body’s center of mass by eqs. (6.9) and (6.10).
6.3.2 Change of basis In the previous section, relationships were derived between the components of the tensor of mass moments of inertia evaluated with respect to two different points, but resolved in the same basis. In this section, relationships are sought between the components of this tensor resolved in two different bases, but evaluated with respect to the same point.
6.3 Properties of the mass moment of inertia tensor
207
Consider two bases, B and B , and let R be the components of the rotation tensor that brings basis B to basis B , resolved in basis B. Equation (6.3) then implies ⎡ ⎤ ⎡ ⎤ I11 I12 I13 I11 I12 I13 ⎦ ⎣I12 I22 I23 (6.11) = RT ⎣I12 I22 I23 ⎦ R, I13 I23 I33 I13 I23 I33 where I and I are the components of the mass moment of inertia tensor in bases B and B, respectively. It is instructive to look at the transformation laws for speciſc components. The rotation tensor will be represented by its direction cosines, ⎡ ⎤ 1 m1 n 1 R = ⎣2 m2 n2 ⎦ . (6.12) 3 m3 n 3 First, the diagonal terms of I are = 21 I11 + 22 I22 + 23 I33 + 22 3 I23 + 21 3 I13 + 21 2 I12 , I11 I22
=
m21 I11
+
m22 I22
+
m23 I33
(6.13a)
+ 2m2 m3 I23 + 2m1 m3 I13 + 2m1 m2 I12 , (6.13b)
I33 = n21 I11 + n22 I22 + n23 I33 + 2n2 n3 I23 + 2n1 n3 I13 + 2n1 n2 I12 ,
(6.13c)
Next, the off-diagonal terms of I are = n1 m1 I11 + n2 m2 I22 + n3 m3 I33 + (n2 m3 + n3 m2 )I23 I23
+ (n1 m3 + n3 m1 )I13 + (n1 m2 + n2 m1 )I12 . = n1 1 I11 + n2 2 I22 + n3 3 I33 + (n2 3 + n3 2 )I23
(6.14a)
I13
+ (n1 3 + n3 1 )I13 + (n1 2 + n2 1 )I12 . = 1 m1 I11 + 2 m2 I22 + 3 m3 I33 + (2 m3 + 3 m2 )I23
(6.14b)
I12
+ (1 m3 + 3 m1 )I13 + (1 m2 + 2 m1 )I12 .
(6.14c)
6.3.3 Principal axes of inertia The tensor of mass moments of inertia was shown to be a symmetric, positive-deſnite tensor. In view of section 1.4.2, its eigenvalues must be real and positive. Furthermore, it is always possible to construct a set the orthogonal eigenvectors that will diagonalize this tensor, see eq. (1.64). Let u1 , u2 , and u 2 be the eigenvector of the tensor of mass moments of inertia, and P = u1 , u2 , u3 . It the follows that ⎡ ∗ ⎤ I1 0 0 P T I P = diag(Ii∗ ) = ⎣ 0 I2∗ 0 ⎦ , 0 0 I3∗
(6.15)
208
6 Kinetics of rigid bodies
where I1∗ , I2∗ , and I3∗ are the eigenvalues of the tensor of mass moments of inertia. For symmetric, positive-deſnite tensors, the eigenvectors can be selected to form an orthogonal tensor, which itself, can be interpreted as a rotation tensor. Let B ∗ be the basis deſned by the eigenvectors; P is then the rotation tensor that brings basis B to basis B ∗ . In view of eq. (6.3), the statement diag(Ii∗ ) = P T I P is a change of basis operation: I and diag(Ii∗ ) are the components of the moment of inertia tensor in bases B and B ∗ , respectively. The transformation deſned by the principal axes of inertia brings the components of the mass moments of inertia tensor to a diagonal form. 6.3.4 Problems Problem 6.1. Kinetic energy for a rigid body undergoing rotational motion A rigid body is in rotational motion about ſxed inertial point O, which does not coincide with the center of mass of the rigid body. Starting from eq. (6.7), prove that the kinetic energy of the rigid body can be expressed in the following form K=
1 ∗T O∗ ∗ ω I ω , 2
(6.16)
where I O∗ is the mass moment of inertia tensor computed with respect to point O. Notation (·)∗ indicates the components of vectors and tensors resolved in a body attached basis.
Problem 6.2. Two interconnected particles in planar motion Figure 6.3 depicts a system of two rigidly interconnected particles undergoing planar motion and subjected to the acceleration of gravity, −g¯ı2 . Point C is the center of mass of the system and θ the angle the massless rigid bar connecting the particles makes with the horizontal. (1) Does the following relationship hold, m1 ¨ r 1 = −m1 g¯ı2 ? (2) If d1 and d2 are the distances from the two particles to the center of mass, prove that d1 m1 = d2 m2 . (3) Does the following relationship hold, ¨r C = −g¯ı2 ? (4) Evaluate the angular momentum vector of the system ˙ (5) Is this angular momentum of the system with respect to its center of mass in terms of θ. preserved? Justify all your answers. YES/NO answers are not valid.
i2
C
d1 m1 rc r1
m2
d2
q
r2
i2
v2 m2 q
g
g
i1 i1
Fig. 6.3. Two interconnected particles in planar motion.
v1
m1
Fig. 6.4. Two interconnected particles in planar motion.
6.3 Properties of the mass moment of inertia tensor
209
Problem 6.3. Two interconnected particles in planar motion Figure 6.4 show a system of two rigidly interconnected particles undergoing planar motion and subjected to the acceleration of gravity, −g¯ı2 . The angle the massless rigid bar connecting the particles makes with the horizontal is denoted θ(t). (1) Determine the initial location of the center of mass, rC0 , and its initial velocity, v C0 . (2) Find the initial angular velocity vector of the system, ω 0 . (3) Find the condition that must be satisſed by the initial velocity vectors of the two particles, v 10 and v 20 . (4) Is the angular momentum of the system preserved? (5) Find the time history of the position vectors of the two particles, r 1 (t) and r 2 (t). Use the following data: m1 = 1.3 kg, m2 = 5.2 kg, r1 (t = 0) = r 10 = 5 ¯ı1 m, r 2 (t = 0) = r 20 = 3 ¯ı2 m, v 1 (t = 0) = v 10 = −2.2 ¯ı1 − 3 ¯ı2 m/s, v 2 (t = 0) = v 20 = 2.12 ¯ı1 + 4.2 ¯ı2 m/s.
Problem 6.4. Moments of inertia of a rectangular plate with side bar A homogeneous rectangular plate of mass M , length a, and width b is connected to a homogeneous rod of mass m and length a/2, as depicted in ſg. 6.5. (1) Determine the mass moment of inertia tensor of the system evaluated with respect to point O, the plate’s geometric center, and resolved in a set of axes parallel to the edges of the plate. (2) Determine the orientation of the principal axes of inertia at point O and the corresponding principal mass moments of inertia. (3) Find the location of the center of mass of the system, point C. (4) Determine the orientation of the principal axes of inertia at point C and the corresponding principal mass moments of inertia. Use the following data: a = 0.48 m, b = 0.24 m, M = 0.5 kg, and m = 0.3 kg.
c
m
m C a
O a
O M
M
b b Fig. 6.5. Rectangular plate with side bar.
Fig. 6.6. Rectangular plate with corner normal bar.
Problem 6.5. Moments of inertia of a rectangular plate with corner normal bar A homogeneous rod of mass m and length c is connected at the corner of a homogeneous rectangular plate of mass M , length a, and width b, as depicted in ſg. 6.6. The rod is normal to the plate. (1) Determine the mass moment of inertia tensor of the system evaluated with respect to point O, the plate’s geometric center, and resolved in a set of axes parallel to the edges of the plate. (2) Determine the orientation of the principal axes of inertia at point O and the corresponding principal mass moments of inertia. (3) Find the location of the center of mass of the system, point C. (4) Determine the orientation of the principal axes of inertia at point C and the corresponding principal mass moments of inertia. Use the following data: a = 0.64 m, b = 0.36 m, c = 0.48 m, M = 0.5 kg, and m = 0.4 kg.
210
6 Kinetics of rigid bodies
Problem 6.6. Moments of inertia of the ƀywheel governor Figure 6.7 shows a simpliſed conſguration of the ƀywheel governor. Two particles of mass m are connected to four articulated bars of length L, which remain in a plane at all times. Angle θ is changing according to the following schedule: θ(t) = π/4 + π/6 sin 2πt. At time t = 0, the system has an angular velocity ω = Ω0 ¯ı3 . (1) Is the angular momentum of the system preserved? (2) Is the angular velocity of the system preserved? (3) Find the time history of the angular velocity vector of the system, ω(t). (4) Plot ω(t)/Ω0 for t ∈ [0, 1] s.
i3
L
L
m
m L
q
L
a2
i1
L/2 m
i2 Fig. 6.7. Flywheel governor.
a1 e2 a r/2 C r/2
M
e1
L/2
Fig. 6.8. Two rigidly connected bars.
Problem 6.7. Rigid bar connected to a shaft Figure 6.8 shows rigid shaft of length L and mass M . Basis E = (¯ e1 , e¯2 , e¯3 ) is attached to the shaft; unit vector e¯1 is aligned with the shaft. A rigid bar of length r and mass m is ¯2 , a ¯3 ) is attached to the bar, e¯3 = a ¯3 and rigidly connected to the shaft. Basis A = (¯ a1 , a α = (¯ e2 , a ¯1 ); unit vector a ¯1 is aligned with the bar. The shaft and bar are homogeneous slender rods, see ſg. 6.42, and their centers of mass coincide at point C. (1) Determine the tensor of mass moments of inertia of the assembly in basis E .
Problem 6.8. Rigid disk connected to a shaft Figure 6.8 shows rigid shaft of length L, radius R, and mass M . Basis E = (¯ e1 , e¯2 , e¯3 ) is attached to the shaft; unit vector e¯1 is aligned with the shaft. A rigid disk of radius r and mass ¯2 , a ¯3 ) is attached to the disk, e¯3 = a ¯3 m is rigidly connected to the shaft. Basis A = (¯ a1 , a and α = (¯ e2 , a ¯1 ); unit vector a ¯2 is normal the disk. The shaft is a homogeneous cylinder, see ſg. 6.40, and the disk a homogeneous thin disk, see ſg. 6.41. Their centers of mass coincide at point C. (1) Determine the tensor of mass moments of inertia of the assembly in basis E .
6.4 Derivatives of the angular momentum vector ˙ , apBecause a rigid body is a system of particles, Euler’s second law, M C = H C plies. Use of this equation calls for the evaluation of the time derivative of the angular momentum vector evaluated with respect to the center of mass of the rigid body, H C = I C ω, see eq. (6.4). The time derivative of this quantity is ˙ = I˙ C ω + I C ω. H ˙ C Evaluation of the mass moment of inertia tensor is a cumbersome task. If the basis in which the components of this tensor are computed changes its orientation
6.5 Equations of motion for a rigid body
211
with respect to the rigid body, I C = I C (t), and this evaluation must be repeated at C each instant in time; furthermore, this implies I˙ = 0. Consequently, it is convenient to select a body attached frame for the evaluation of the mass moments of inertia, which then become constant quantities in time, and their time derivatives vanish. Consider an inertial basis, I, and a body attached basis, B ∗ ; superscript (·)∗ indicates tensor components resolved in basis B ∗ . Let R be the components of the rotation tensor that brings basis I to basis B ∗ , resolved in I. Furthermore, let H C , I C , and ω be the components of the angular momentum vector, mass moment of inertia tensor, and angular velocity vector, respectively, all resolved in basis I. It then follows that RT H C = RT I C R RT ω = I C∗ ω ∗ , where I C∗ and ω ∗ are the components of the mass moment of inertia tensor and angular velocity vector, respectively, resolved in basis B ∗ . The orthogonality of the rotation tensor then implies H C = R I C∗ ω∗ . While the component of I C are time-dependent quantities, those of I C∗ are constants and their time derivatives vanish. The time derivative of the angular momentum vector now becomes ! " ˙ =R ˙ I C∗ ω ∗ + R I C∗ ω˙ ∗ = R I C∗ ω˙ ∗ + ω H (6.17) ∗ I C∗ ω∗ . C Euler’s second law holds true when expressed about the rigid body’s center of mass, but it is equally valid when expresses with respect to an inertial point O, M O = ˙ , see eq. (3.75). Here again, evaluation of the derivative of the angular momentum H O vector involves derivatives of the mass moment of inertia tensor. To ease the evaluation of these derivatives, the ſrst step is to work in a body attached basis B ∗ , and the components of the mass moment of inertia tensor resolved in this basis are denoted I O∗ . Unfortunately, this is not yet sufſcient to guarantee O∗ I˙ = 0. Indeed, point O is an inertial point that is not necessarily a material point of the body; consequently, the mass moments of inertia might still be time-dependent. If inertial point O is a ſxed point on the body, the mass moments of inertia O∗ resolved in the body attached basis become constant and I˙ vanishes. This happens O∗ only if the rigid body is undergoing pure rotation about inertial point O. If I˙ = 0, ˙ = R(I O∗ ω˙ ∗ + ω developments similar to those presented above lead to H ∗ I O∗ ω∗ ). O In summary, the time derivative of the angular momentum vector is ! " ˙ = R I A∗ ω˙ ∗ + ω H (6.18) ∗ I A∗ ω∗ , A when (1) point A is the center of mass of the rigid body, or (2) the rigid body is undergoing pure rotation about inertial point A.
6.5 Equations of motion for a rigid body The equations of motion for a rigid body are derived from the equations of motion for a general system of particles presented in section 3.4.4. The ſrst equation of motion governs the motion of the center of mass of the rigid body, eq. (3.70), and the
212
6 Kinetics of rigid bodies
second equation governs its angular motion, eq. (3.76). The general form of the ſrst equation is F = maC , (6.19) where aC is the inertial acceleration of the center of mass of the rigid body and F the sum of the externally applied forces. The general form of the second equation is ˙ , MC = H C
(6.20)
where H C is the angular momentum computed with respect to the center of mass and M C the sum of the externally applied moments computed with respect to the center of mass. This second equation of motion can be written in several different manners depending on the point with respect to which the externally applied moments are computed. The following four cases will be considered. 1. The sum of the externally applied moments is computed with respect to the center of mass of the rigid body. 2. The sum of the externally applied moments is computed with respect to a pivot point of the rigid body. A pivot point is a point of the body that happens to be an inertial point; clearly, such a point does not always exist. 3. The sum of the externally applied moments is computed with respect to a material point of the rigid body. 4. The sum of the externally applied moments is computed with respect to an arbitrary point. This arbitrary point is not necessarily inertial and is not a material point of the body. The choice among the various forms of the equations is purely a matter of convenience: for speciſc applications, one formulation might lead to simpler equations. The four approaches are detailed in the following sections. 6.5.1 Euler’s equations In view of eq. (6.17), the second equation of motion of the rigid body, eq. (6.20), becomes M C = R(I C∗ ω˙ ∗ + ω ∗ I C∗ ω ∗ ). Multiplying this equation by RT then leads to M ∗C = I C∗ ω˙ ∗ + ω ∗ I C∗ ω ∗ , (6.21) where M ∗C = RT M C is the sum of the externally applied moments computed with respect to the center of mass and resolved in a body attached basis. If this basis coincides with the principal axes of inertia, the mass moment of inertia tensor reduces to a diagonal form, see eq. (6.15), and the governing equations further simplify to " ! ∗ MC1 = I1C∗ ω˙ 1∗ − I2C∗ − I3C∗ ω2∗ ω3∗ , " ! ∗ (6.22) MC2 = I2C∗ ω˙ 2∗ − I3C∗ − I1C∗ ω3∗ ω1∗ , ! C∗ " ∗ ∗ ∗ C∗ ∗ C∗ MC3 = I3 ω˙ 3 − I1 − I2 ω1 ω2 ,
6.5 Equations of motion for a rigid body
213
∗ C∗ ∗ ∗ where M ∗T = diag(I1C∗ , I2C∗ , I3C∗ ). These equaC = MC1 , MC2 , MC3 and I tions are known as Euler’s equations for the angular motion of a rigid body. The sum of the externally applied moments is computed with respect to the center of mass of the rigid body. 6.5.2 The pivot equations The governing equations of a system of particles can also be written with respect ˙ , see eq. (3.75). Furthermore, if this inertial point is to an inertial point, M O = H O also a material point of the rigid body, the time derivative of the angular momentum vector is given by eq. (6.18) and the equation of motion becomes M O = R(I O∗ ω˙ ∗ + ω ∗ I O∗ ω ∗ ). Multiplying this equation by RT then leads to M ∗O = I O∗ ω˙ ∗ + ω ∗ I O∗ ω ∗ ,
(6.23)
where M ∗O = RT M O is the sum of the externally applied moments computed with respect to point O, resolved in a body attached basis. Equation (6.23) only holds if point O is an inertial point that is also a material point of the rigid body; this implies that the rigid body is undergoing pure rotational motion about inertial point O. Point O is then often called a pivot point of the rigid body, and hence, eqs. (6.23) are known as the pivot equations for the angular motion of a rigid body; the sum of the externally applied moments is computed with respect to this pivot point. 6.5.3 Equations of motion with respect to a material point of the rigid body Let point B be a material point of the rigid body. The angular momentum vector computed with respect to this point is given by eq. (6.2) as H B = rBC mv B + I B ω, and it can be related to the angular momentum computed with respect to the center of mass by eq. (3.67) as H B = H C + rBC mv C . Equating these two expressions and taking a time derivative leads to ˙ C + rBC maC + r˙ BC mv C = r˙ BC mv B + rBC maB + (I B ω)· . H In view of eqs. (6.19) and (6.20), the ſrst two terms on the left-hand side are ex˙ + rBC ma = M + rBC F = M , where the last equality pressed as H C C C B follows from eq. (3.61), and the above expression becomes M B = r˙ BC m(v B − v C ) + rBC maB + (I B ω)· . The ſrst term on the right-hand side vanishes because r˙ BC m(v B − v C ) = − vBC mv BC = 0, and ſnally "· ! M B = rBC maB + I B ω .
(6.24)
To evaluate the time derivative of the last term, it is convenient to express the moment of inertia tensor in the body attached basis: (I B ω)· = (R I B∗ ω∗ )· =
214
6 Kinetics of rigid bodies
R(I B∗ ω˙ ∗ + ω ∗ I B∗ ω ∗ ). Resolving all quantities in the body attached basis, eq. (6.24) becomes ∗ M ∗B = rBC RT maB + I B∗ ω˙ ∗ + ω ∗ I B∗ ω∗ , (6.25) where M ∗B is the sum of the externally applied moments computed with respect to material point B of the rigid body, r ∗BC is the position vector of the center of mass with respect to point B, and RT aB the components of the acceleration vector of point B, all resolved in the body attached frame. 6.5.4 Equations of motion with respect to an arbitrary point Let point P be an arbitrary point, i.e., point P is neither inertial, nor a material point of the rigid body. Using eq. (3.61), the moment computed with respect to point P is related to that computed with respect to the center of mass as M P = M C + rP C F . Introducing eqs. (6.19) and (6.20) then leads to ! " ˙ C + rP C maC = rP C maC + R I C∗ ω˙ ∗ + ω MP = H (6.26) ∗ I C∗ ω ∗ , where the last equality follows from eq. (6.17). Resolving all quantities in the body attached basis, eq. (6.26) becomes ∗ I C∗ ω∗ , M ∗P = rP∗ C RT maC + I C∗ ω˙ ∗ + ω
(6.27)
where M ∗P is the sum of the externally applied moments, computed with respect to an arbitrary point P, r∗P C are the components of the position vector of the center of mass with respect to point P, and RT aC the components of the acceleration vector of the center of mass, all resolved in the body attached basis.
6.6 The principle of work and energy In section 3.4.5, the principle of work and energy was derived for a system of particles, see eq. (3.80). For an arbitrary system of particles, the work done by the internal forces explicitly appears in the statement of the principle, which is, consequently, of little practical use. If the system of particles, however, is a rigid body, the work done by the internal forces can be eliminated from the statement of the principle of work and energy, making it a powerful, practical tool. The work done by all external and internal forces acting of the rigid body is found by summing up the work done by all external and internal forces acting on each particle of the body Wti →tf =
N % ( i=1
tf
ti
(F Ti +
N (
f Tij )dr i .
(6.28)
j=1, j=i
The sum of the externally applied forces acting on particle i is denoted F i , f ij denotes the internal forces resulting from the interaction of particles i and j, and dr i is the differential displacement of particle i.
6.6 The principle of work and energy
215
Because the body is rigid, the differential displacements of one of its points can be expressed in terms of the differential displacement of reference point B of the body, dr B , and the differential rotation vector of the body, dψ, as dr i = dr B + , where s is the position vector of particle i with respect to reference point B. dψs i i Introducing this expression for the differential displacement into eq. (6.28) leads to ⎛ ⎞ N % tf N + * ( ( T T ⎝F i + Wti →tf = f ij ⎠ dr B + dψs i . ti
i=1
j=1, j=i
Expanding the scalar products then yields % tf ( N N ( T F i dr B + F Ti sTi dψ Wti →tf = ti
i=1
+
i=1
N (
N (
f Tij drB +
i=1 j=1, j=i
N (
N (
f Tij sTi dψ
i=1 j=1, j=i
⎫ ⎬ ⎭
.
Because quantities dr B and dψ do not depend on the particle number, the various summations appearing in this expression can be regrouped in the following manner % Wti →tf = %
⎡ tf
+ ti
⎣
tf
ti
N ( i=1
N (
N (
T Fi
%
tf
drB +
N (
ti
⎤T f ij ⎦ drB +
i=1 j=1, j=i
%
T si F i
dψ
i=1
tf
ti
⎡ N ( ⎣
N (
⎤T
(6.29)
si f ij ⎦ dψ.
i=1 j=1, j=i
In the ſrst term, the bracketed expression )N represents the sum of all externally applied forces to the rigid body, F = i=1 F i . In the second term, the bracketed expression represents the sum of all moments externally applied to the rigid body, )N M B = i=1 si F i . The third term in this expression vanishes in view of eq. (3.59), and eq. (3.62) implies ' t the vanishing ' t of the last term. Equation (6.29) ſnally reduces to Wti →tf = tif F T dr B + tif M TB dψ. The principle of work and energy, eq. (3.80), applied to a rigid body now becomes % tf % tf F T dr B + M TB dψ = K(tf ) − K(ti ). ti
ti
This result is known as the principle of work and energy. Principle 7 (Principle of work and energy for a rigid body) The work done by the external forces and moments acting on a rigid body equals the change in the rigid body’s kinetic energy.
216
6 Kinetics of rigid bodies
Example 6.1. Rotating disk on a bent arm Figure 6.9 shows a rotating disk connected to a bent arm. Massless arm OAB features a bend of β rad at point A. At point O, a bearing allows the arm to rotate at a constant angular velocity, Ω, with respect to ground. A disk of mass m and radius r rotates at a constant angular velocity, ω, and is connected to the arm at point B by means of a massless shaft. The dimensions of the system are indicated on the ſgure. A planar rotation of magnitude Ωt about axis ¯ı3 brings inertial frame, F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )] to frame F A = [O, A = (¯ a1 , a ¯2 , a ¯3 )] that is attached to the arm; all tensor components resolved in basis A are denoted with superscripts (·) .
d
b2
b2
a
C i3 = a3
e3 b
A
B e2
b3
b3 = e3
r
b1 wt FB
b1
C
B MB
-MB -FB
e1 a2
h
i2
I O i1
Wt
a1
MO
FO
Fig. 6.9. Conſguration of the rotating disk.
A second planar rotation of magnitude β about axis a ¯2 brings basis A to basis E + E and frame F = [A, E = (¯ e1 , e¯2 , e¯3 )] is attached to the arm, with axis e¯3 pointing along the bent segment AB; all tensor components resolved in basis E + are denoted with superscripts (·)+ . Finally, a planar rotation of magnitude ωt about axis e¯3 brings basis E + to basis B ∗ and frame F B = C, B ∗ = (¯b1 , ¯b2 , ¯b3 ) is attached to the rotating disk; all tensor components resolved in basis B ∗ are denoted with superscripts (·)∗ . The components of the rotation tensor that brings inertial basis I to basis B ∗ , resolved in basis I, will be constructed as R = RΩ Rβ R+ , where RΩ , Rβ , and R+ ω ω are the components of the rotation tensors that bring basis I to basis A , A to E + , and E + to B ∗ , respectively, resolved in bases I, A , and E + , respectively. Compute the forces and moments acting in the shaft at point B, those acting in the arm at the same point, and ſnally, the forces and moments acting in the bearing at point O. The angular velocity of the disk is readily obtained from the addition theorem as ω = Ω¯ı3 + ω¯b3 . The components of the angular velocity and acceleration vectors then become +
6.6 The principle of work and energy
217
⎧ ⎧ ⎫ ⎫ ⎨−ΩSβ Cω ⎬ ⎨ ΩωSβ Sω ⎬ ω ∗ = Ω(RΩ Rβ Rω+ )T ¯ı3 + ωb∗3 = ΩSβ Sω , and ω˙ ∗ = ΩωSβ Cω , ⎩ ⎩ ⎭ ⎭ ΩCβ + ω 0 where CΩ = cos Ωt, Cβ = cos β, and Cω = cos ωt, with similar expressions for the sine functions of the corresponding angles. The inertial position of the center of mass of the disk is r C = d¯b3 ; its acceleration ˙ + ω ω )¯b3 . The components of this acceleration vector vector then becomes aC = d(ω ∗ ∗ ∗ ˙ in basis B then become aC = d(ω +ω ∗ω ∗ )¯b∗3 , or ⎧ ∗ ⎫ ⎧ ⎫ ⎨ ω˙ 2 + ω1∗ ω3∗ ⎬ ⎨−Ω 2 Sβ Cβ Cω ⎬ Ω 2 Sβ Cβ Sω . a∗C = d −ω˙ 1∗ + ω2∗ ω3∗ = d ⎩ ⎩ ⎭ ∗2 ∗2 ⎭ −Ω 2 Sβ2 −ω1 − ω2 Body attached frame F B is located at the center of mass of the disk and is aligned with its principal axes of inertia. Figure 6.41 gives the principal mass moments of C∗ C∗ C∗ inertia of the disk as I11 = I22 = mr2 /4 and I33 = mr2 /2. With the help of the free body diagram shown in ſg. 6.9, the equations of motion of the disk, see eqs. (6.19) and (6.21), then become ⎧ ⎫ ⎨−Sβ Cβ Cω ⎬ Sβ Cβ Sω , F ∗B = mdΩ 2 ⎩ ⎭ −Sβ2 and MB∗ − ab∗3 F ∗B
⎫ ⎧ ΩωSβ Sω + ΩSβ Sω (ΩCβ + ω) ⎬ ⎨ 1 = mr2 ΩωSβ Cω + ΩSβ Cω (ΩCβ + ω) ⎭ ⎩ 4 0 ⎧ ⎫ ⎨ Sω ⎬ mr2 Ω(2ω + ΩCβ )Sβ Cω , = ⎩ ⎭ 4 0
(6.30)
respectively. The resultant of the externally applied moments was computed with respect to the center of mass, as required by eq. (6.21). In these equations, F B and M B are the externally applied force and moment vectors acting on the disk at point B. Eliminating force F ∗B from the equations of motion yields an expression for moment M ∗B , ⎧ ⎫ 2 ⎨ Sω ⎬ mr Cω . Ω(2ω + ΩCβ )Sβ − madΩ 2 Sβ Cβ M ∗B = ⎩ ⎭ 4 0 ∗ , vanishes; this implies that no moment needs The third component of moment, MB3 to be applied to the disk about unit vector ¯b3 to maintain its constant angular velocity, ω. Due to the presence of the trigonometric functions Sω = sin ωt and Cω = cos ωt,
218
6 Kinetics of rigid bodies
the components of the moment vector, M B , acting on the disk are time-dependent when resolved in the disk attached basis B ∗ . This implies that the shaft will be subjected to fatigue loading and as the angular speed, ω, of the disk increases, it will accumulate an increasing amount of loading cycles per unit time. On the other hand, the components of the same moment resolved in the arm + ∗ attached basis, E + , denoted M + B = Rω M B , become M+ B =
2
mr Ω(2ω + ΩCβ )Sβ − madΩ 2 Sβ Cβ 4
⎧ ⎫ ⎨0⎬ 1 . ⎩ ⎭ 0
(6.31)
Unlike the shaft carrying the disk, the arm is subjected to a constant bending moment. It is easily veriſed that the components of force in the arm attached basis, F + = R+ F ∗ , are also constant in time. ω Figure 6.9 also shows a free body diagram of the arm; because this component is massless, the equations of statics apply: the sum of both forces and moments must vanish. This yields the following expressions for the externally applied force and moment vectors at point O, denoted F O and M O , respectively: F O = F B and M O = M B + rOB F B . The components of these vectors, expressed in basis A , + + are F O = Rβ F + OB F+ B and M O = Rβ (M B + r B ), respectively, and are, of course, constant in time. Finally, the forces and moments acting on the bearing at point O, resolved in the inertial frame, are F O = RΩ F O and M O = RΩ M O , respectively. The third component of moment, MO3 , vanishes: no moment needs to be applied to the arm about axis ¯ı3 to maintain the constant angular velocity, Ω, of the system. Because SΩ = sin Ωt and CΩ = cos Ωt, the other loading components are time-dependent: as expected, the bearing will be subjected to cyclic loading. The bearing is subjected to loads oscillating at a frequency Ω, in contrast with those acting in the shaft, which have a frequency ω. Point A is a ſxed point, or pivot point, of the disk: rotation about the shaft and rotation of the bent arm leave point A at an inertial location. Furthermore, point A is a material point of the disk, consequently, the pivot equation, eq. (6.23), could have been used instead of Euler’s equation, eq. (6.21). The mass moment of inertia tensor of the disk with respect to point A can be obtained from its counterpart about point C with the help of the parallel axis theorem, eq. (6.9), to ſnd I1A∗ = m(r2 /4 + d2 ), I2A∗ = m(r2 /4+d2 ), I3A∗ = mr2 /2. The pivot equation about point A now becomes ⎫ ⎧ 2 ⎨ (r /4 + d2 )ωSω + (r2 /4 − d2 )Sω (ΩCβ + ω) ⎬ M ∗B +(d−a)b∗3 F ∗B = mΩSβ (r2 /4 + d2 )ωCω + (r2 /4 − d2 )Cω (ΩCβ + ω) . ⎭ ⎩ 0 This equation is equivalent to that derived above. Indeed, introducing the expression for the force F ∗B from eq. (6.30), leads again to eq. (6.31) for the externally applied moment M ∗B .
6.6 The principle of work and energy
219
Example 6.2. Swiveling plate Figure 6.10 shows a homogeneous, rectangular plate of height a, width b, and mass m connected to the ground by a rigid, massless link of length d. At point O, a bearing allows the link to rotate with respect to axis ¯ı3 , and at point B, the plate is free to rotate with respect to the link about axis a ¯1 . Three frames are used in this problem: the inertial frame, F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )], a frame attached to the link, F A = [O, A+ = (¯ a1 , a ¯2 , a ¯3 )], and ſnally, a frame attached to the plate’s center of mass, F B = C, B ∗ = (¯b1 , ¯b2 , ¯b3 ) . Tensor components resolved in bases A+ and B ∗ are denoted with superscripts (·)+ and (·)∗ , respectively. A planar rotation of magnitude α about axis ¯ı3 brings basis I to basis A+ , a planar rotation of magnitude β about axis a ¯1 brings basis A+ to basis ∗ B , and rotation tensors Rα and Rβ are associated with these two planar rotations, respectively
i3 = a3
I
i1
a a1
+ A O
a2
B b3
d
B
b
B C
a/2 *
g
b2
b/2 i2
b3
MB
FB
a/2
MO
C
b2
FO
b1 mg i2
O b/2 b1 = a1
B
-FB
-MB Fig. 6.10. Conſguration of the swiveling plate.
The inertial angular velocity vector of the plate is readily found with the help of ˙ a1 . The components of the angular velocity and the addition theorem as ω = α¯ ˙ a3 + β¯ acceleration vectors resolved in basis B ∗ are then found to be ⎧ ⎧ ⎫ ⎫ β¨ ⎨ β˙ ⎬ ⎨ ⎬ ˙ + ˙ β , ˙ β+T a ¯+ ω∗ = αR αS ˙ β , and ω˙ ∗ = α ¨ Sβ + α˙ βC 3 + βa1 = ⎩ ⎩ ⎭ ˙ β⎭ αC ˙ β α ¨ Cβ − α˙ βS respectively. The inertial position of the center of mass of the plate is r C = (d + a/2)¯ a1 and the acceleration vector aC = (d + a/2)(¨ αa ¯2 − α˙ 2 a ¯1 ); the components ∗ of this vector resolved in basis B then become ⎧ ⎫ ⎨ −α˙ 2 ⎬ a α ¨ Cβ . a∗C = (d + ) ⎭ 2 ⎩ −α ¨ Sβ Body attached frame F B is located at the plate’s center of mass and is aligned with its principal axes of inertia, see ſg. 6.45. The principal mass moments of inertia
220
6 Kinetics of rigid bodies
C∗ C∗ C∗ are I11 = mb2 /12, I22 = ma2 /12, I33 = m(a2 + b2 )/12. Figure 6.10 shows a free body diagram of the plate; F B and M B are the externally applied force and moment vectors acting on the plate at point B. The equations of motion for the plate, see eqs. (6.19) and (6.21), then become ⎧ ⎧ ⎫ ⎫ ⎨ −α˙ 2 ⎬ ⎨ Sα ⎬ a Cα Cβ = m(d + ) α ¨ Cβ , F ∗B + mg (6.32) ⎩ ⎭ ⎭ 2 ⎩ −Cα Sβ −α ¨ Sβ
and M ∗B
⎫ ⎧ b2 β¨ + b2 α˙ 2 Sβ Cβ ⎬ a ∗ ∗ m⎨ ˙ β ) − a2 α˙ βC ˙ β − b1 F B = , a2 (¨ αSβ + α˙ βC 2 12 ⎩ 2 ˙ β ) − (b2 − a2 )α˙ βS ˙ β⎭ (a + b2 )(¨ αCβ − α˙ βS
respectively. After simpliſcation, this last equation becomes ⎫ ⎫ ⎧ ⎧ ∗ b2 (β¨ + α˙ 2 Sβ Cβ ) MB1 ⎬ ⎬ ⎨ ⎨ m ∗ ∗ a2 α MB2 + a/2 FB3 ¨ Sβ = = M ∗I , ⎭ ⎩ ∗ 12 ⎩ 2 ∗ ⎭ ˙ β MB3 − a/2 FB2 (a + b2 )¨ αCβ − 2b2 α˙ βS
(6.33)
where M ∗I represents the right-hand side of this equation. The ſrst component of ∗ moment, MB1 , must vanish because the plate is free to rotate with respect to the arm ¯ about axis b1 ; this reveals the ſrst equation of motion of the problem, β¨ + α˙ 2 Sβ Cβ = 0. Figure 6.10 also shows a free body diagram of the massless arm OB; F O and M O are the externally applied force and moment vectors acting on the arm at point O. The moment equilibrium equation about point O, expressed in basis A+ , is M + O− + + + + + d a+ a+ 1 F B −M B = 0. Introducing eq. (6.33) then yields M O = (d+a/2) 1 F B +M I . With the help of eqs. (6.32) and (6.33), this applied moment at point O becomes ⎧ ⎫ 0 ⎨ ⎬ 2 b ˙ − S (¨ α C − 2 α ˙ βS ) M+ = m . β β O 12 β ⎩ a2 ⎭ b2 2 ˙ ¨ + 12 Cβ (¨ αCβ − 2α˙ βSβ ) + (d + a/2) α ¨ − g(d + a/2)Cα 12 α Here again, the conſguration of the system implies the vanishing of the third component of this moment, M + O3 . The equations of motion of the system correspond ∗ to the vanishing of two components of moment, M + O3 = 0 and M B1 = 0, or ˙ β Cβ = (d + a/2)gCα , a2 /12 + b2 /12 Cβ2 + (d + a/2)2 α ¨ − b2 /6 α˙ βS β¨ + α˙ 2 Sβ Cβ = 0, respectively. The two conditions leading to the equations of motion of the problem can be expressed as scalar products: ¯bT1 M B = 0 and ¯ıT3 M O = 0, which are easily ∗ evaluated when the vectors are expressed in an appropriate basis: ¯b∗T 1 M B = 0 and +T + ¯ı3 M O = 0.
6.6 The principle of work and energy
221
Example 6.3. Rigid body connected to a spring and dashpot Figure 6.11 depicts a rigid body connected to the ground at point B by means of a spring of stiffness constant k and dashpot of constant c. The rigid body is of mass M and its moment of inertia tensor with respect to the center of mass is I C . Vector η deſnes the position of the center of mass with respect to point B. Frame F B = B, B ∗ = (¯b1 , ¯b2 , ¯b3 ) is attached to the rigid body; superscript (·)∗ indicates tensor components resolved in basis B ∗ . The components of the rotation tensor that brings basis I to basis B ∗ , resolved in basis I, are denoted R. Find the equations of motion of the system. i3
FB = - F u g
i2
O I
B
b3
h
c i1
k
C
b2 B h
b1
C
- Mg i3 Free body diagram
Fig. 6.11. Conſguration of the rigid body connected to a spring and dashpot.
Let xB and xC be the position vectors of points B and C with respect to point O; it follows that xB = xC − η. The force vector, F B , applied to the rigid body at point B then acts in the direction of unit vector u ¯ = xB /xB , or F B = −F u ¯, ˙ where F is the magnitude of the applied force. It then follows that F = kΔ + cΔ, where Δ = xB − 0 is the stretch of the spring and 0 its un-stretched length. The time rate of change of the stretch is easily found as Δ˙ = u ¯T x˙ B . The equation of motion for the center of mass is M x ¨ C = −F u¯ −M g ¯ı3 . Euler’s equation, eq. (6.21), implies I C∗ ω˙ ∗ + ω ∗ I C∗ ω ∗ = RT [− η (−F u ¯)], where the right-hand side represents the moment of the externally applied forces, resolved in basis B ∗ . Although it is convenient to work with the components of all vectors and tensors resolved in the body attached basis, it is also possible to use the corresponding components resolved in the inertial basis. Multiplying by R leads to I C ω˙ + ω I C ω = F ηu ¯, where all vectors and tensors are now resolved in the inertial basis. For numerical solution of the equations of motion, it is convenient to recast them as a set of ſrst-order equations by introducing the velocities of the center of mass, v C , and a set of parameters, q, that represent the rotation of the rigid body. These parameters could be selected as Euler angles with a speciſc sequence of planar rotations, see section 4.11. The angular velocity of the body then becomes ω = H(q)˙q . For Euler angles with the sequence 3-1-3 deſned in section 4.11.1, the tangent operator, H, is given by eq. (4.68). The complete set of ſrst-order equations now becomes
222
6 Kinetics of rigid bodies
⎫ ⎧ ⎫ ⎧ vC x˙ C ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎬ ⎨ ⎬ ⎨ H −1 ω q˙ . = −F/M u ¯ − g¯ı3 ⎪ ⎪v˙ C ⎪ ⎪ ⎪ ⎪ ⎪ ⎩ C −1 ⎭ ⎩ ⎭ ⎪ C ω˙ (I ) (F ηu ¯−ω I ω)
(6.34)
This formulation requires the tangent operator, H, to be singularity free; as discussed in section 4.11.1, such is not the case for Euler angles with the sequence 3-1-3, for which H −1 is singular then angle θ = 0. 6.6.1 Problems Problem 6.9. Kinetic energy of a rigid body Derive an expression for the rotational kinetic energy of a rigid body. Use a body attached frame with its origin at the center of mass and orientation that coincides with the principal axes of inertia. The orientation of the body attached frame with respect to an inertial frame will be determined by Euler angles with the 3-1-3 sequence, see section 4.11.
Problem 6.10. Rigid body connected to a ſxed point Point C is the center of mass of a rigid body of arbitrary shape. This point is connected to an inertial point O by means of a ball and socket joint. Point C and O are coincident. The only externally applied forces are the gravity forces and the reactions at point O. (1) Prove that the angular momentum vector H O of the body is of constant magnitude and direction. (2) Prove that the kinetic energy of the body remains a constant. (3) Show that the magnitude of the projection of the angular velocity vector along the direction of the angular momentum vector is a constant; ſnd this constant.
Problem 6.11. Rigid body moving along a curve A rigid body freely slides along a given curve C in three-dimensional space. A point of the curve has a position vector p0 (s). The position of the reference point of the rigid body is p0 (s) and its orientation is determined by Frenet’s triad R(s) = [t¯(s), n ¯ (s), ¯b(s)]. Find the equation of motion for the rigid body if it is subjected to externally applied forces and moments. Hint: the only degree of freedom of the problem is s, the position of the body along the curve.
Problem 6.12. Spinning rotor mounted on a rotating disk Figure 6.12 depicts a homogeneous disk of mass M and radius R rotating about inertial axis e1 , e¯2 , e¯3 ) is attached to the disk. The disk rotates about unit ¯ı3 . Frame F D = O, E + = (¯ vector ¯ı3 at a constant angular velocity, Ω. At the rim of the disk, a rigid massless shaft of length d extends in the radial direction and connects to a homogeneous disk of mass m and radius r spinning about unit vector e¯1 at a constant angular velocity, ω. (1) Find the three components of the reaction force in the bearing at point B, resolved in basis E + . (2) Find the three components of the reaction moment in the bearing at point B, resolved in basis E + .
Problem 6.13. Spinning rotor mounted on a rotating disk Figure 6.12 depicts a homogeneous disk of mass M and radius R rotating about inertial axis e1 , e¯2 , e¯3 ) is attached to the disk. Torque T is applied to the ¯ı3 . Frame F D = O, E + = (¯ disk and act about unit vector ¯ı3 . At the rim of the disk, a rigid massless shaft of length d extends in the radial direction and connects to a homogeneous disk of mass m and radius r spinning about unit vector e¯1 . Frame F R = R, B∗ = (¯b1 , ¯b2 , ¯b3 ) is attached to the rotor. Torque Q is applied to the rotor and act about unit vector e¯1 . (1) Develop the equations of
6.6 The principle of work and energy
223
motion of the system in terms of angles φ and θ, where θ is the rotation of the rotor about unit vector e¯1 . (2) Find the three components of the reaction moment in the bearing at point B, resolved in basis E + . (3) Find the three components of the reaction force in the bearing at point B, resolved in basis E + . (4) Find the three components of the reaction moment in the bearing at point O, resolved in basis E + and I. (5) Find the three components of the reaction force in the bearing at point O, resolved in basis E + and I. (6) After an initial start-up phase, the disk and rotor spin at constant angular velocities, φ˙ = Ω and θ˙ = ω, respectively. Determine the reaction forces and moments of questions (2) to (5).
e2
T M, R i1
i3 = e3
g
i3 = e3
e2
r O
g
B
I O B f
e1
i2 m, r R
d
Q
Fig. 6.12. Spinning rotor mounted on a rotating disk.
i1
i2
f b
q e1 w
Fig. 6.13. Plate hinged at the rim of a rotating disk.
Problem 6.14. Plate hinged at the rim of a rotating disk Figure 6.13 depicts a homogeneous disk of mass M and radius R rotating about inertial axis e1 , e¯2 , e¯3 ) is attached to the disk. At point B, a point on the ¯ı3 . Frame F D = O, E + = (¯ rim of the disk, a homogeneous plate of mass m, length b, and width w is hinged to the disk. The hinge’s axis is aligned with unit vector e¯2 ; a torsional spring of stiffness constant k and a torsional dashpot of constant c are located at the hinge. The torsional spring is un-stretched when θ = θ0 . The system is subjected to gravity acting in the direction indicated on the ſgure. (1) Develop the equations of motion of the system in terms of angles φ and θ indicated on the ſgure. (2) On one graph, plot angles φ and θ versus τ . (3) On one graph, plot angular speeds φ and θ . (4) On one graph, plot angular accelerations φ and θ . (5) Plot the cumulative energy, ¯ d = W d /k, dissipated in the dashpot. (6) On one graph, plot the kinetic, K ¯ = K/k and W potential, V¯ = V /k energies of the system. Check that the energy closure equation is satisſed. (7) On one graph, plot the three components of the moment in the bearing at point B, resolved in basis E + . (8) On one graph, plot the three components of the force in the bearing at point B, resolved in basis E + . Use the following data: μ = M/m = 1.5, w ¯ = w/b = 0.2, ¯ = R/b = 0.2, ζ = ωc/(2k) = 0.05, g¯ = g/(bω 2 ) = 2, θ0 = 0. A non-dimensional time R is deſned, τ = ωt, where ω 2 = 3k/(mb2 ); notation (·) indicates a derivative with respect to τ . Use the following initial conditions, φ(τ = 0) = 0, θ = 0, φ = 1, θ = −1. Present all your results for τ ∈ [0, 20].
Problem 6.15. Spinning Satellite
Frame F = B, B∗ = (¯b1 , ¯b2 , ¯b3 ) is attached to a satellite. Point B is the satellite’s center of mass and basis B∗ is aligned with its principal axes of inertia. Tensor components resolved in
224
6 Kinetics of rigid bodies
basis B∗ are denoted with a superscript (·)∗ . The components of the angular velocity vector of the satellite, resolved in B∗ , are denoted ω ∗ . The mass moments of inertia are I1∗ = 12, I2∗ = 16 and I3∗ = 20 kg.m2 . During a maneuver, thrusters apply a moment M (t) to the satellite. For t ≤ T , M ∗ (t) = Q∗ sin 2πt/T , and for t > T , M ∗ (t) = 0, where T = 5 s. The initial angular velocity of the satellite is ω ∗T (t = 0) = 0, 0.5, 0 rad/s. Moment vector Q∗ is deſned by its components in the body attached basis. Two cases will be considered here: for case 1, Q∗T = 0, 5, 0 N·m, for case 2, Q∗T = 5, 0, 0 N·m. (1) Solve Euler’s equation for the time history of the angular velocity of the satellite. (2) On one graph, plot the three components of the angular velocity vector in the body attached frame as a function of time for t ∈ [0, 30T ]. Present one graph for case 1 and one for case 2. (3) At the end of the maneuver, will the orientation of the satellite remain ſxed with respect to an inertial frame for case 1? What about case 2?
Problem 6.16. Double spatial pendulum Figure 6.14 depicts a double spatial pendulum consisting of two bodies subjected to gravity. The ſrst body, of mass ma and mass moment of inertia tensor I Ca , is connected to the ground at point O by means of a ball and socket joint. The position vectors of points O and B with respect to the center of mass, Ca , of the body are denoted η a and μa , respectively. The second body, of mass mb and mass moment of inertia tensor I Cb , is connected to the ſrst body at point B though a ball and socket joint. The position vector of point B with respect to the a1 , a ¯2 , a ¯3 )] center of mass, Cb , of the body is denoted η b . Two frames, F A = [Ca , A = (¯ and F B = Cb , B = (¯b1 , ¯b2 , ¯b3 ) , are attached to the ſrst and second body, respectively. Let Ra and Rb be the rotation tensors that bring basis I to A and basis I to B, respectively. Tensor Ra will be represented with Euler angles φa , θa , and ψa using the 3-1-3 sequence, and Euler angles φb , θb , and ψb , also using the 3-1-3 sequence, represent tensor Rb . (1) Draw free body diagrams for each of the bodies. (2) Derive the equations of motion of the system. Carefully deſne all terms appearing in the equations.
i2
g
i3 = a3
i1
O a ha 2 a1 Ca ma ma a 3 i3
B
Cb mb
B
i1
i2
a1 b 3
Wt q
hb
b2
a2
r O
g b1 b3
Fig. 6.14. Conſguration of the double spatial pendulum.
b2 C b1
w
Fig. 6.15. Conſguration of the plate hinged at the rim of a disk.
Problem 6.17. Plate hinged at the rim of a disk The system depicted in ſg. 6.15 features a disk of radius r rotating at a constant angular velocity Ω about inertial axis ¯ı3 . Frame F A = [O, A = (¯ a1 , a ¯2 , a ¯3 )] is attached to the disk. At point B, a point on the rim of the disk, a homogeneous plate of mass m, length b, and width
6.6 The principle of work and energy
225
w is hinged to the disk; the plate is free to rotate with respect to the disk about axis a ¯2 . Frame F B = C, B = (¯b1 , ¯b2 , ¯b3 ) is attached to the plate; point C is the center of mass of the plate. Line BC is at an angle θ with respect to the vertical. The system is subjected to gravity acting in the direction indicated on the ſgure. Determine the angular speed, Ω, required to maintain a given, constant angle θ.
Problem 6.18. Robotic arm Figure 5.5 shows a robotic system. The shaft is allowed to rotate with respect to an inertial frame F I , about axis ¯ı3 ; the time-dependent angle of rotation is denoted α(t). Frame F S = + S, S = (¯ s1 , s¯2 , s¯3 ) is attached to the shaft at a distance h = 0.5 m from the origin of the inertial frame, as indicated on the ſgure. An arm of length La = 1.2 m, extending along the direction of axis s¯2 , is attached to the shaft at point S. Finally, a rigid manipulator of length Lb = 0.5 m, radius rb = 0.02 m, and mass mb = 10 kg is connected to the arm at point S B. The manipulator is allowed to rotate with respect to frame F∗ , about axis s¯1 ; the timeB dependent angle of rotation is denoted β(t). Frame F = B, B = (¯b1 , ¯b2 , ¯b3 ) is attached to the manipulator. Superscripts (·)∗ and (·)+ denote tensor components resolved in bases B∗ and S + , respectively. Angles α(t) and β(t) are prescribed as α(t) = π/2 (1 − cos πt/T ), and β(t) = 2π(1 − cos πt/T ), respectively, where T = 2 s. The acceleration of gravity is g = 9.81 m/s2 . (1) Compute the components of the force vector F ∗ and moment vector M ∗ applied to the manipulator at point B. (2) On one graph, plot the components of the force vector F ∗ . (3) On one graph, plot the components of the moment vector M ∗ . (4) What is the moment required to rotate the manipulator?
Problem 6.19. Rotating disk on a bent arm Figure 6.9 shows a rotating disk connected to a bent arm. Massless arm OAB features a bend of β rad at point A. At point O, a bearing allows the arm to rotate with an angular velocity Ω with respect to ground. A disk of mass m and radius r rotates with an angular velocity ω and is connected to the arm at point B by means of a massless shaft. A planar rotation about axis a1 , a ¯2 , a ¯3 )] that ¯ı3 brings inertial frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )] to frame F A = [O, A = (¯ is attached to thearm. A second planarrotation of magnitude β about axis a ¯2 brings frame A to frame F E = A, E + = (¯ e1 , e¯2 , e¯3 ) that is also attached to the arm, with axis e¯3 pointing along the bent segment AB. Superscripts (·) and (·)+ denote tensor components resolved + a planar rotation about axis e¯3 brings frame E + to in basis A and E ∗, respectively. Finally, B frame F = C, B = (¯b1 , ¯b2 , ¯b3 ) that is attached to the rotating disk; all tensor components resolved in basis B∗ are denoted with superscripts (·)∗ . The components of the rotation tensor , that brings inertial basis I to basis B∗ , resolved in I, is constructed as R = RΩ Rβ R+ ω where RΩ , Rβ , and R+ are the components of the rotation tensors that bring basis I to basis ω
A , A to E + , and E + to B∗ , respectively, resolved in bases I, A , and E + , respectively. The angular velocities of the bent arm are prescribed to be Ω = Ωf (1 − cos 2πt/T )/2 for 0 ≤ t ≤ T /2 and Ω = Ωf for t > T /2. The angular velocities of the disk are prescribed to be ω = ωf (1 − cos 2πt/T )/2 for 0 ≤ t ≤ T /2, and ω = ωf for t > T /2. This represents the start-up sequence for the system from the rest condition to a nominal operating point where the angular velocities of the arm and disk are stabilized to their ſnal values, Ωf and ωf , respectively. These angular velocity proſles are achieved by applying to the bent arm a torque QO (t) about axis ¯ı3 at point O and to the shaft a torque QB (t) about axis ¯b3 at point B. (1) On one graph, plot the time history of angular velocities Ω and ω. (2) On one graph, plot angular accelerations Ω˙ and ω. ˙ (3) Plot the three components of the angular velocity vector of the disk, ω ∗ . (4) Plot the three components of the angular acceleration vector of the disk, ω˙ ∗ . (5) Plot the three components of the moment vector applied to the shaft at point
226
6 Kinetics of rigid bodies
B, M ∗B . (6) Plot the three components of the moment vector applied to the arm at point B, M+ B . (7) Plot the three components of the force vector applied to the arm at point O, F . (8) Plot the three components of the moment vector applied to the arm at point O, M + O . (9) Plot the three components of the moment vector applied to the arm at point O, M O . (10) Plot the cumulative work done by torques QO (t) and QB (t), and the total kinetic energy of the system. (11) Demonstrate by a graph that your predictions satisfy the principle of work and energy. (12) Plot the instantaneous power required by the servomotors located at points B and O. (13) If the servomotors can deliver a maximum power of 50 Watts each, ſnd the minimum time T required to bring the system to steady angular velocities. Use the following data: β = π/6 rad, ωf = 50 rad/s, Ωf = 10 rad/s, r = 0.2 m, m = 10 kg, h = 0.6 m, d = 0.3 m, a = 0.1 m, and T = 15 s.
Problem 6.20. Swiveling plate Figure 6.10 shows a homogeneous, rectangular plate of height a, width b, and mass m connected to the ground by a rigid, massless link of length d. At point O, a bearing allows the link to rotate with respect to axis ¯ı3 , and at point B, the plate is free to rotate with respect to the link about axis a ¯1 . Three frames will be used in this problem: inertial frame a1 , a ¯2 , a ¯3 ) , and F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )], a frame attached to the link, F A = O, A+ = (¯ ſnally, a frame attached to the plate at its center of mass, F P = C, B∗ = (¯b1 , ¯b2 , ¯b3 ) . A planar rotation of magnitude α about axis ¯ı3 brings basis I to basis A+ , and a planar rotation of magnitude β about axis a ¯1 brings basis A+ to basis B∗ . Rotation tensors Rα and Rβ repre-
sent these two planar rotations, respectively; tensor components resolved in basis A+ and B∗ are denoted with superscripts (·)+ and (·)∗ , respectively. (1) Derive the equations of motion ˙ of the problem. (2) On one graph, plot the time histories of angles α and β. (3) Plot α˙ and β. (4) On one graph, plot the components of the angular velocity of the plate in basis I. (5) Plot the components of the same vector in basis B∗ . (6) On one graph, plot the kinetic, potential, and total mechanical energies of the system. Comment on your results. (7) On one graph, plot the components of the force applied to the plate at point B resolved in basis B∗ . (8) Plot the components of the moment applied to the plate at point B in basis B∗ . (9) Plot the components of the moment applied to the link at point O in basis I. Use the following data: a = 0.2 m, b = 0.2 m, d = 0.5 m, acceleration of gravity g = 9.81 m/s2 , and m = 2 kg. Present the response on the system for a period of 15 s. At ſrst, use the following initial conditions: α = π/4, β = π/12, and α˙ = β˙ = 0. Next, consider a different set of initial conditions: α = 0, β = π/4, and α˙ = β˙ = 0. Comment on the response of the system for these two sets of initial conditions.
Problem 6.21. Rigid body connected to spring and dashpot Figure 6.11 depicts a rigid body connected to the ground at point B by means of a spring of stiffness constant k and dashpot of constant c. The rigid body is of mass M and its moment of inertia tensor with respect to the center of mass is I C . Vector η deſnes the position of the center of mass with respect to point B. Frame F B = B, B = (¯b1 , ¯b2 , ¯b3 ) is attached to the rigid body. The components of the rotation tensor that brings inertial basis I to basis B, resolved in basis I, are denoted R. (1) Derive the equations of motion of the problem; resolve the components of all vectors and tensors in the inertial frame. (2) On one graph, plot the time histories of the three components of vector x ¯ C . (3) On one graph, plot Euler angles ψ, θ, and φ as a function of τ . (4) On one graph, plot the time histories of the three components of the velocity vector, ¯ v C = v C /(Ω0 ). (5) On one graph, plot the time histories of the three components of the angular velocity vector, ω ¯ = ω/Ω. (6) On one graph, plot the time histories of the forces in the elastic spring and dashpot. (7) On one graph, plot the kinetic and
6.7 Planar motion of rigid bodies
227
potential energies of the system as well as the energy dissipated in the dashpot. Show that the energy closure equation is veriſed. Treat the problem using a non-dimensional scheme with ¯ C = xC /0 . Use the following data: g¯ = g/(0 Ω 2 ) = 0.4, τ = Ωt, Ω 2 = k/M and x C∗ ∗ = I C∗ /(M 20 ) = ζ = c/(2M Ω) = 0.1, η¯ = η ∗ /0 = [0.8, 1.25, −1.8]T , and I¯ T diag(1, 2.3, 1.5). Use the following initial conditions: x ¯ C (τ = 0) = [0, 1, 0], q T (τ = 0) = [0, 0, 0], and the system is at rest. Present the response on the system for τ ∈ [0, 100]. Hint: to avoid singularities, use Euler angles, ψ, θ, and φ, with the 3-2-1 sequence, as deſned in section 4.11.3, to represent the rotation of the rigid body.
6.7 Planar motion of rigid bodies The previous sections have focused on the three-dimensional motion of rigid bodies. In some cases, the motion of the body is restricted to a planar motion: the center of mass of the body moves in an inertial plane and its angular velocity vector is at all time normal to this plane. Let axes ¯ı1 and ¯ı2 deſnes the inertial plane in which the center of mass moves; the position vector of the center of mass then becomes rC = xC1¯ı1 + xC2¯ı2 vector is ω = ω¯ı3 . Next, a body attached frame, F = and the angular velocity C, B = (¯b1 , ¯b2 , ¯b3 ) , is deſned, where point C is the body’s center of mass. For convenience, axes ¯b1 and ¯b2 are selected to be in the plane of the motion whereas ¯b3 is normal to the same plane. It follows that the position vector of the center of mass becomes r C = x∗C1¯b1 + ∗ ¯ xC2 b2 and the angular velocity vector is ω = ω¯b3 . The components of the position vector of the mass center resolved in the inertial basis are xC1 and xC2 , and x∗C1 and x∗C2 are their counterparts resolved in the body attached basis. The only non-vanishing component, ω, of angular velocity vector is the same in both frames: indeed, ω = ω¯ı3 = ω ∗¯b3 implies ω = ω ∗ , since ¯b3 = ¯ı3 is an inertial direction. The acceleration vector of the center of mass is now aC = aC1¯ı1 + aC2¯ı2 = a∗C1¯b1 + a∗C2¯b2 , and the equations of motion for the center of mass, eq. (6.19), becomes F1 = maC1 , F2 = maC2 , and F3 = 0. This last equation implies that the sum of the externally applied forces acting in the direction normal to the plane of motion must vanish if the motion is to remain planar. The following two equations of motion are sufſcient to determine the motion of the center of mass F1 = maC1 ,
F2 = maC2 .
(6.35)
The second equation of motion can be written in several different manners depending on the point with respect to which the externally applied moments are computed, as discussed in section 6.5. The various options are detailed in the following sections. 6.7.1 Euler’s equations First, Euler’s equations, see eqs. (6.21), specialized to the planar motion case become
228
6 Kinetics of rigid bodies ∗ C∗ C∗ 2 MC1 = I13 ω˙ − I23 ω ,
(6.36a)
∗ MC2 ∗ MC3
(6.36b)
= =
C∗ C∗ 2 I23 ω˙ + I13 ω , C∗ I33 ω. ˙
(6.36c)
∗ ∗ Moment components MC1 and MC2 must be applied to sustain the planar motion; such moments are called gyroscopic moments. The sum of the externally applied moments is computed with respect to the center of mass of the rigid body. If axes ¯b1 , ¯b2 , and ¯b3 coincide with the principal axes of inertia of the body, the equations of motion further simplify since the cross products of inertia vanish, C∗ C∗ I13 = I23 = 0, leading to MC3 = I3C∗ ω. ˙ (6.37)
In this case, the two components of moment in the plane of motion must vanish, ∗ ∗ MC2 = MC3 = 0. The only non-vanishing force components are those in the plane of motion, F1 and F2 . A single component of moment remains, MC3 ; of course, ∗ ¯ ∗ M = MC3¯ı3 = MC3 b3 implies MC3 = MC3 , because ¯b3 = ¯ı3 . When a rigid body is in planar motion, its conſguration is deſned by three parameters only: two displacement components locate its center of mass, and a single rotation component determines its orientation. Equations of motion (6.35) and (6.37) provide the three equations necessary to solve the problem. 6.7.2 The pivot equations When the rigid body undergoes pure rotation about an inertial point O, eqs. (6.23) were shown to hold. Specializing these equations to the case of planar motion leads to ∗ O∗ O∗ 2 MO1 = I13 ω˙ − I23 ω ,
(6.38a)
∗ MO2
(6.38b)
=
O∗ O∗ 2 I23 ω˙ + I13 ω , O∗ I33 ω. ˙
MO3 = (6.38c) ∗ ∗ ∗ where M ∗T O = MO1 , MO2 , MO3 is the sum of the externally applied moments computed with respect to a pivot point O, resolved in the body attached basis. If axes ¯b1 , ¯b2 , and ¯b3 coincide with the principal axes of inertia, the equations of motion further simplify since the cross products of inertia vanish, leading to MO3 = I3O∗ ω. ˙
6.7.3 Equations of motion with respect to a material point of the body Let point B be a material point of the rigid body; eqs. (6.25) then govern the motion of the rigid body. Introducing the assumption of planar motion, these equations become ∗ B∗ B∗ 2 MB1 = I13 ω˙ − I23 ω ,
(6.39a)
∗ MB2
(6.39b)
=
B∗ I23 ω˙
+
B∗ 2 I13 ω ,
MB3 = [ rBC maB ]3 +
B∗ I33 ω, ˙
(6.39c)
6.7 Planar motion of rigid bodies
229
where notation [v]3 indicates the third component of vector v. Because vectors r ∗BC ∗ and RT maB both lie in the plane of the motion, the vector product rBC RT maB appearing in eq. to the plane of motion and it follows that ∗ (6.25) is normal ∗ rBC RT maB 1 = 0 and rBC RT maB 2 = 0. It is also easy to verify that ∗
∗ ∗ ∗ rBC RT maB 3 = [ rBC maB ]3 . Moment M ∗T B = MB1 , MB2 , MB3 is the sum of the externally applied moments computed with respect to material point B of the rigid body. If axes ¯b1 , ¯b2 , and ¯b3 coincide with the principal axes of inertia, the equations of motion further simplify because the cross products of inertia vanish. 6.7.4 Equations of motion with respect to an arbitrary point Let point P be an arbitrary point, i.e., point P is neither inertial, nor a material point of the rigid body; the motion of the body is then governed by eq. (6.27). Introducing the assumption of planar motion, these equations become C∗ C∗ 2 MP∗ 1 = I13 ω˙ − I23 ω ,
(6.40a)
MP∗ 2
(6.40b)
=
C∗ I23 ω˙
+
C∗ 2 I13 ω ,
MP 3 = [ rP C maC ]3 +
(6.40c)
C∗ I33 ω, ˙
where M ∗P is the sum of the externally applied moments, computed with respect to an arbitrary point P. Example 6.4. Rolling disk with bar A homogeneous cylinder of mass M and radius R rolls without sliding on a horizontal plane under the effect of gravity. A homogeneous bar of mass m and length is rigidly attached to the center of the cylinder, as shown in ſg. 6.16. Angle θ denotes the orientation of the bar with respect to the vertical axis. At the tip of the bar, denoted point T, a spring of stiffness constant k connects the bar to inertial point A; the un-stretched length of the spring vanishes. Derive the equations of motion of the system in terms of angle θ.
e1
i2 q
k
g
A
Fs T G mg
T C D P
d
R
D
l
Mg
P
f N
i1 e2 Fig. 6.16. Conſguration of the rolling cylinder.
The center of mass of the system is located on the line joining the centers of the disk and bar, at a distance d = m/ [2(M + m)] from the center of the disk. Let axes
230
6 Kinetics of rigid bodies
e¯1 and e¯2 be a system of body attached axes, as indicated in ſg. 6.16. The position of the center of mass of the system now becomes rC = −Rθ ¯ı1 + R ¯ı2 + d e¯1 and its ¨ θ + dθ˙2 Sθ )¯ı1 + (−dθS ¨ θ − dθ˙2 Cθ )¯ı2 , where acceleration is then aC = (−Rθ¨ − dθC Sθ = sin θ and Cθ = cos θ. Equation (6.35) governing the motion of the center of mass of the system now becomes f¯ı1 + N¯ı2 + F s − (M + m)g¯ı2 = (M + m) aC , where F s = k[(Rθ + Sθ )¯ı1 + (1 − Cθ )¯ı2 ] is the elastic force the spring applies at the tip of the bar, and N and f are the normal reaction and friction forces the plane applies to the disk, respectively. The following two scalar equations of motion are obtained m ¨ f + k(Rθ + Sθ ) = −(M + m)Rθ¨ − (θCθ − θ˙2 Sθ ), 2 m ¨ (θSθ + θ˙2 Cθ ). N − (M + m)g + k(1 − Cθ ) = − 2
(6.41a) (6.41b)
Equation (6.40) will be used to derive the third equation of motion governing the angular behavior of the rigid body. It is convenient to compute the sum of the externally applied moments with respect to point P, the instantaneous point of contact of the cylinder with the ground because the normal reaction and friction forces will be eliminated from the equation, as their lines of action pass through point P, C∗ ¨ [ rP T F s − rP G mg¯ı2 ]3 = [ rP C (m + M )aC ]3 + I33 θ,
e1 is the position vector of the center of mass with respect to where r P C = R¯ı2 + d¯ point P. The position vectors of points T and G with respect to point P are denoted r P T and rP G , respectively. The moment of inertia of the system with respect to the center of mass is found by adding the contributions of the cylinder and bar to ſnd 2 m M R2 C∗ I33 + M d2 + + m( − d)2 . = 2 12 2 Note the use of the parallel axis theorem: the moment of inertial of the cylinder with respect to its own center of mass is M R2 /2, see ſg. 6.40, and the transport term is M d2 . The rotational equation now becomes mg Sθ − k R2 θ + R(Sθ + θCθ ) + 2 Sθ 2 3 R 3M R2 + m(R2 + + RCθ ) θ¨ − m θ˙2 Sθ . = 2 3 2 Given initial conditions, this differential equation can be solved to ſnd the response of the system. Introducing θ into eqs. (6.41a) and (6.41b) then yields the friction and normal forces, respectively. The derivation presented here assumes that at all times, the cylinder is rolling without slipping. To make sure the analysis is consistent, it is then important to check that N > 0 and |f | ≤ μs N at all times, where μs is the static friction coefſcient between the cylinder and the ground.
6.7 Planar motion of rigid bodies
231
Example 6.5. The double pendulum with elastic joint Figure 6.17 depicts a double pendulum comprising bar 1, of mass m1 and length 1 , and bar 2, of mass m2 and length 2 . Let frame F A = [A, A = (¯ a1 , a ¯2 )] be attached to bar 1 and frame F E = [E, E = (¯ e1 , e¯2 )] be attached to bar 2. A massless tube allows bar 2 to slide in the direction of a ¯2 ; the slider is of mass M and is connected to bar 1 at point A by means of a spring of stiffness constant k. The position of the slider is determined by its distance, x, from point A, the tip of bar 1; the angular positions of the two bars with respect to the vertical are denoted θ1 and θ2 , respectively. The system is subjected to gravity along the inertial ¯ı1 direction. Find the equations of motion of the system. HO
i2
O
O
I m1, l1 q1
x E A
i1
i2
a2 M q2
x E
Fs R
a1 R e2
m2, l2 e1
Free body a2 diagram of bar 1
m1g A
i1
g
e2
k a1
VO q1
Free body diagram of bar 2 G e1
d
E q2
Fs Mg
m2g Fig. 6.17. Conſguration of the double pendulum with elastic joint.
First, the equations of motion of bar 2, including the concentrated mass of the slider, M , will be derived. The center of mass of the combined body is at a distance d from point E, where (M + m2 )d = m2 2 /2; for simplicity, the following notation is used, μ2 = M + m2 . Considering the free body diagram shown in ſg. 6.17, eq. (6.39) gives the sum of the moments computed with respect to point E as −
m2 22 ¨ m2 2 gS2 = e1 μ2 aE ]3 , θ2 + [d 2 3
where the following notation was introduced: S1 = sin θ1 and C1 = cos θ1 , with similar conventions for angle θ2 . Point E was selected as the point about which moments were computed because this choice automatically eliminates the reaction force, R, and spring force, Fs , from the resulting equation of motion. The acceleration, aE , of point E is readily computed as the second time derivative of the position vector of point E, r E = 1 a ¯1 + x¯ a2 . The ſrst equation of motion is now
232
6 Kinetics of rigid bodies
m2 22 ¨ m2 2 m2 2 θ2 + gS2 + (¨ x + 1 θ¨1 )C21 + (x˙ + 1 θ˙1 )θ˙1 S21 3 2 2 +(x˙ θ˙1 + xθ¨1 )S21 − xθ˙2 C21 = 0,
(6.42)
1
where the following notation was introduced: S21 = sin(θ2 − θ1 ) and C21 = cos(θ2 − θ1 ). The acceleration of the center of mass of bar 2 is found by taking two derivatives of its position vector, rG = 1 a ¯1 + x¯ a2 + d¯ e1 ; hence, the equations of motion for the center of mass becomes −R¯ a1 − Fs a ¯2 + μ2 g¯ı1 = μ2 [−(2x˙ θ˙1 + 1 θ˙12 + xθ¨1 )¯ a1 + (¨ x + 1 θ¨1 − xθ˙12 )¯ a2 + dθ¨2 − dθ˙22 e¯1 ]. Taking a scalar product of this relationship by a ¯1 and a ¯2 yields the reaction force m2 2 ¨ R = μ2 gC1 + μ2 (2x˙ θ˙1 + 1 θ˙12 + xθ¨1 ) + (θ2 S21 + θ˙22 C21 ), 2
(6.43)
and the spring force m2 2 ¨ (θ2 C21 − θ˙22 S21 ), x + 1 θ¨1 − xθ˙12 ) − Fs = −μ2 gS1 − μ2 (¨ 2
(6.44)
respectively. Next, the equations of motion for bar 1 are derived from the free body diagram shown in ſg. 6.17. The pivot equation, eq. (6.38), is applied about point O to ſnd m1 21 ¨ m1 1 gS1 + 1 Fs − xR. θ1 = − 3 2
(6.45)
The three equations of motion of the problem can now be summarized. The ſrst equation is eq. (6.45), where the reaction and spring forces are eliminated by means of eqs. (6.43) and (6.44), respectively; the second equation is eq. (6.42); ſnally, the last equation is the constitutive equation for the elastic spring, Fs = kx, where the elastic force is eliminated with the help of eq. (6.44). These three equations are recast in a matrix form, leading to ⎡ ⎤ m1 m2 2 ⎧ ⎫ ( + μ2 )21 + μ2 x2 (1 C21 + xS21 ) μ2 1 ⎢ 3 ⎥ ⎨θ¨1 ⎬ 2 2 ⎢ m2 2 ⎥ m2 2 m2 2 ⎢ ¨ C21 ⎥ ⎢ 2 (1 C21 + xS21 ) ⎥ ⎩ θ2 ⎭ 3 2 ⎣ ⎦ x ¨ m2 2 C21 μ2 1 μ2 2 ⎧ ⎫ m 2 2 2 ⎪ ⎪ ˙ ˙ ⎪ 2μ ( x x ˙ θ − S − xC ) θ + (m /2 + μ )g S + μ gxC 2 1 1 21 21 2 1 2 1 1 2 1⎪ ⎪ ⎪ ⎪ ⎪ 2 ⎨ ⎬ m m 2 2 2 2 2 = 0. + (1 S21 − xC21 )θ˙1 + gS2 m2 2 x˙ θ˙1 S21 + ⎪ ⎪ 2 2 ⎪ ⎪ ⎪ ⎪ m2 2 ˙ 2 ⎪ ⎪ ⎩ ⎭ θ S21 + kx + μ2 gS1 −μ2 xθ˙12 − 2 2 These equations form a set of coupled, nonlinear, second-order, ordinary differential equations in time for the three unknowns of the problem, θ1 , θ2 , and x.
6.7 Planar motion of rigid bodies
233
The equations of motion for the center of mass of bar 1 are readily found as m1 1 ¨ (−θ1 S1 − θ˙12 C1 ) = m1 g¯ı1 + R¯ a1 + Fs a2 + VO¯ı1 + HO¯ı2 . 2 Projection of this relationship along unit vectors ¯ı1 and ¯ı2 yields the components of the reaction force at point O in the vertical and horizontal directions as VO = Fs S1 −RC1 −m1 g−m1 1 (θ¨1 S1 +θ˙12 C1 )/2 and HO = −Fs C1 −RS1 +m1 1 (θ¨1 C1 − θ˙12 S1 )/2, respectively. Example 6.6. Pendulum with sliding mass Figure 6.18 shows a pendulum comprising a bar of mass m and length and a rigid body of mass M . Frame F E = [O, E = (¯ e1 , e¯2 )] is attached to the bar. The rigid body is connected at point B to the tip of the bar at point A by means of a spring of stiffness constant k and a dashpot of constant c. The stretch of the spring is denoted x and its un-stretched length vanishes. The center of mass of the rigid body is located at point C and vector η deſnes the position of the center of mass with respect to point B; the moment of inertia of the body with respect to center of mass is denoted I C . The angular position of the bar with respect to the vertical is deſned by angle θ. The system is subjected to gravity along unit vector ¯ı1 . Find the equations of motion of the system.
VO O q
i2
O
A
mg
A Q
Fsd
x
Free body diagram of the bar
FA
m g
c
M i1
HO
e2 l
k B
h
x
C Q
FA
e1 Fsd B
C
h
Free body diagram of the mass
Mg e1
Fig. 6.18. Conſguration of the pendulum with sliding mass.
First, the equations of motion of the bar will be derived based on the free body diagrams depicted in ſg. 6.18. Because point O is a pivot point, eq. (6.38) yields m2 ¨ m gSθ , θ = Q + FA − 3 2
(6.46)
234
6 Kinetics of rigid bodies
where the notation Sθ = sin θ and Cθ = cos θ was introduced. Because the angular orientations of the bar and rigid body must remain identical, equal and opposite moments of magnitude Q must be applied to the bar and rigid body, as shown in ſg. 6.18. Similarly, equal and opposite forces of magnitude FA must be applied to the tip of the bar at point A and to the rigid body, along a common line of action normal to the bar and passing through point A. The position of the center mass of the bar is r = /2 e¯1 , and hence, the equation ¨e2 )/2 = −VO ¯ı1 − HO ¯ı2 + of motion for the center of mass becomes m(−θ˙2 e¯1 + θ¯ mg ¯ı1 +Fsd e¯1 +FA e¯2 , where VO and HO are the vertical and horizontal components of the reaction force at point O, and Fsd the sum of the forces acting in the spring and dashpot. The equation of motion for the center of mass can be projected along axes ¯ı1 and ¯ı2 to obtain the reaction forces at point O m ˙2 θ )Cθ − (FA − 2 m ˙2 θ )Sθ + (FA − (Fsd + 2
VO = mg + (Fsd + HO =
m ¨ θ)Sθ , 2 m ¨ θ)Cθ , 2
(6.47a) (6.47b)
respectively. Next, the equations of motion of the rigid body will be derived. Because point B is a ſxed point of the body, eq. (6.39) gives the sum of the moments computed ¨ The moment of with respect to point B as −Q + xFA + η M g¯ı1 = η M aB + I B θ. inertia of the body with respect to point B is found with the help of the parallel axis theorem, eq. (6.8), as I B = I C +M (η1∗2 +η2∗2 ), where η1∗ and η2∗ are the components of vector η resolved in basis E. The position vector of point B is r B = ( + x)¯ e1 and its acceleration, aB , is then readily obtained. Expanding the various terms then leads to − Q + xFA − M gη1∗ Sθ − M gη2∗ Cθ (6.48) ¨ = M η1∗ 2x˙ θ˙ + ( + x)θ¨ − M η2∗ x¨ − ( + x)θ˙2 + I B θ. The acceleration of the center of mass of the rigid body is found by taking two time derivatives of its position vector, r = (+x+η1∗ )¯ e1 +η2∗ e¯2 ; hence, the equations of motion for the center of mass become − FA e¯2 − Fsd e¯1 + M g¯ı1 ¨ − ( + x + η ∗ )θ˙2 e¯1 + M (2x˙ θ˙ − η ∗ θ˙2 ) + ( + x + η ∗ )θ¨ e¯2 . = M (¨ x − η2∗ θ) 1 2 1 Projecting this equation along unit vectors e¯2 and e¯1 yields FA = M gSθ − M (2x˙ θ˙ − η2∗ θ˙2 ) + ( + x + η1∗ )θ¨ ,
(6.49a)
¨ − M ( + x + η ∗ )θ˙2 + kx + cx˙ − M gCθ = 0. M (¨ x − η2∗ θ) 1
(6.49b)
The ſrst equation gives the interaction force acting at point A. The constitutive law for the spring dashpot assembly is Fsd = kx + cx. ˙
6.7 Planar motion of rigid bodies
235
Equation (6.49b) is the ſrst equation of motion; the second equation of motion is found by introducing into eq. (6.46) the expression for the interaction moments from eq. (6.48), and interaction force from eq. (6.49a) to ſnd 2 m + I C + M η2∗2 + M ( + η1∗ + x)2 θ¨ − M η2∗ x¨ 3 (6.50) m + M ( + η1∗ + x) gSθ + M gη2∗ Cθ = 0. + 2M ( + η1∗ + x)x˙ θ˙ + 2 In summary, the equations of motion can be recast as a system of coupled, ordinary differential equations by combining eqs. (6.50) and (6.49b) to ſnd 2 m /3 + I C + M η2∗2 + M ( + η1∗ + x)2 −M η2∗ θ¨ −M η2∗ M x ¨ ∗ ∗ ˙ 2M ( + η1 + x)x˙ θ + [m/2 + M ( + η1 + x)] gSθ + M gη2∗ Cθ + = 0. −M ( + x + η ∗ )θ˙2 + kx + cx˙ − M gCθ 1
Once the solution of these equations has been obtained, the vertical and horizontal components of the reaction force at point O can be obtained by eqs. (6.47a) and (6.47b), respectively; next, the interaction moment Q is obtained from eq. (6.48) and the interaction force at point A by eq. (6.49a). Example 6.7. The unbalanced rotor Figure 6.19 shows a rigid rotor of length L and mass M supported by two end bearings at points B and D. A torque, T , is applied to the rotor at point D. Let frame F B = B, B ∗ = (¯b1 , ¯b2 , ¯b3 ) be attached to the body; superscript (·)∗ denotes components resolved in basis B ∗ . Point G is located at the intersection of the shaft’s axis with the plane passing through the center of mass of the rotor and normal to the shaft’s axis. The coordinates of the center of mass of the rotor, resolved in basis B ∗ , are denoted x∗1c , x∗2c , and x3c . At point B, three reaction forces, denoted B1∗ , B2∗ , and B3 are applied to the shaft; at point D, two reaction forces, D1∗ and D2∗ , are applied to the shaft together with the torque T . Find the reactions forces applied to the bearings. *
i2 b2
B
i1
B2
b1 x1c* g x C
q
* 2c
x3c
B1* B B3*
G
D2*
D
x3c
G L
D
i3 = b3
Fig. 6.19. Conſguration of a rotor with an imbalance.
D1*
T
236
6 Kinetics of rigid bodies
The components of tensor of moment of inertia of the rigid rotor with respect to point B, resolved in basis B ∗ , are denoted I B∗ . The position vector of the center of mass is r C = x∗1c¯b1 + x∗2c¯b2 + x3c¯b3 and the corresponding acceleration is readily found. Equation (6.35) then yields the ſrst three equations of motion of the rotor B1∗ + D1∗ = −M (ω 2 x∗1c + ωx ˙ ∗2c ), B2∗ + D2∗ = −M (ω 2 x∗2c − ωx ˙ ∗1c ), and B3 = 0. Because point B is a material point of the body and is inertial, it is a pivot point for the B∗ B∗ 2 B∗ B∗ 2 rigid body and eq. (6.39) leads to −LD2∗ = I13 ω˙ − I23 ω , LD1∗ = I23 ω˙ + I13 ω , B∗ and T = I33 ω. ˙ Given the applied torque, this last equation can be integrated to ſnd the angular velocity and acceleration of the system. The remaining equations then yield expressions for the reaction forces at the bearing B∗ B∗ 2 LB1∗ = −I23 ω˙ − I13 ω − M L(ω 2 x∗1c + ωx ˙ ∗2c ),
B∗ B∗ 2 ω˙ − I23 ω − M L(ω 2 x∗2c − ωx ˙ ∗1c ), LB2∗ = +I13 B∗ B∗ 2 ω˙ + I13 ω , LD1∗ = I23 B∗ B∗ 2 ω˙ + I23 ω . LD2∗ = −I13
If the rotor rotates at a constant angular speed, ω = Ω, ω˙ = 0, the reaction forces at the bearing will be constant when resolved in the body attached basis. Of course, in the inertial system, these reaction forces will be harmonic forces at frequency Ω, as expected. It is often desirable to minimize or eliminate the reaction forces at the bearing. To eliminate these reaction forces, two conditions must be satisſed: (1) the rotor center of mass must be located on the axis of the shaft, i.e., x∗1c = x∗2c = 0, and B∗ B∗ (2) axis ¯b3 must be a principal axis of inertia, i.e., I13 = I23 = 0. Example 6.8. The cam-valve system Figure 6.20 shows a planar cam-valve system. The cam rotates at a constant angular velocity, Ω, about ſxed inertial point O. Frame F I = [O, I = (¯ı1 , ¯ı2 )] is inertial and frame F E = [O, E = (¯ e1 , e¯2 )] is attached to the cam. The external shape of the cam is deſned by curve C and the valve of mass m slides over this curve; the contact point between the cam and valve is denoted P. The motion of the valve is constrained to be along axis ¯ı2 and its displacement is denoted x. A spring of stiffness constant k is connected to the valve and is pre-compressed by a distance d. The kinematics of this problem have been treated in example 5.4 on page 182. Find the contact force acting between the cam and the valve. Assuming that the cam and valve are in contact at all times, the motion of the valve is known once the shape of curve C is given: this problem has no degrees of freedom. The right portion of ſg 6.20 depicts a free body diagram of the valve and its equation of motion is m¨ x = k(d − x) − N , where k(d − x) is the force the spring applies on the valve and N the desired contact force. Solving for the contact force yields N = k(d − x) − m¨ x = k(d − h + rSγ ) − mΩ 2 (rSγ − ρ). To obtain the second equality, the valve’s position was evaluated using elementary trigonometry as x = h − r sin(θ + α), its acceleration was found using eq. (5.32), and the following notation was deſned, Sγ = sin(θ + α).
6.7 Planar motion of rigid bodies
237
i2 k
x
k(d - x)
h r
e2
P a q
O
e1
x
i1
N
P
C
Fig. 6.20. Conſguration of the cam-valve system.
In non-dimensional form, the contact force becomes ¯ = N
ω2 ω2 ¯ ¯ N = ρ ¯ + ( − 1)¯ r S + (d − h), γ mr0 Ω 2 Ω2 Ω2
¯ = h/r0 . The where r0 is a reference length, r¯ = r/r0 , ρ¯ = ρ/r0 , d¯ = d/r0 , and h natural frequency of the spring-valve system was deſned as ω 2 = k/m. 6.7.5 Problems Problem 6.22. Retraction of a landing gear Figure 5.8 shows a simple landing gear system. It consists of a uniform link of mass mL and the hydraulic actuator is length L, and a wheel of mass mW (a point mass). The length (t) of given as function of time: (t) = h + g[1 − cos(πt/T )], where g = [ (L2 /2 + hL + h2 ) − h]/2. (1) Compute the magnitude F of the force that the actuator must apply to generate the desired motion. Plot F versus time τ = t/T . (2) Compute the vertical and horizontal components of reaction at point O, denoted V and H respectively. Plot V and H versus τ . Use the following data: h = 0.6 m, L = 1.2 m, T = 1.5 s, mL = 120 kg, mW = 80 kg, and gravity = 9.81 m/s2 .
Problem 6.23. Locking mechanism Figure 5.11 shows a locking mechanism used in the deployment of large space structures. When the homogeneous disk of radius R and mass M rotates about its ſxed point O, bar PT of length L and mass mb slides at point A through a collar that is allowed to swivel about the pin at point A. The mechanism is spring loaded by connecting a spring of stiffness k between the tip of the bar at point T and the collar at point A. The spring is un-stretched when θ = 90 deg. The bar has a length L, and w(t) denotes the portion of the bar between points P and A. The time history of angle θ is prescribed as θ(t) = π/4 (1 + cos πt/T ). (1) Compute the reaction forces S and Q at point A. Force S is oriented in the direction parallel to the bar, and Q is perpendicular to the bar. On the same graph, plot S and Q as a function of time τ = t/T . (2) Compute the horizontal and vertical components of force, denoted H and V , respectively, at point P. Plot H and V . (3) Compute and plot the torque required to rotate the disk. Use the following data: R = 0.15 m, M = 1.2 kg, d = 0.2 m, k = 1.5 kN/m, L = 0.4 m, mb = 0.5 kg, T = 2 s, and g = 10 m/s2 .
238
6 Kinetics of rigid bodies
Problem 6.24. Deployment of a satellite The satellite depicted in ſg. 6.21 is powered by solar panels. Initially, the three articulated solar panels are in the stowed conſguration indicated on the ſgure. To become operational, these panels are deployed by means of motors located at points A, B, and C. These motors provide torques that will deploy the system in such a way that the time schedule of angle θ is θ(t) = π [1 − cos(πt/T )]/4, where T is the total time required for the deployment. Each panel of the solar array is uniform, has a mass mP = 120 kg and a length P = 5 m. The total time to complete the deployment is T = 5 s. Let MA , MB , and MC be the torques that the motors located at points A, B, and C, respectively, must apply to complete the desired schedule of deployment. Let HA , HB , and HC be the horizontal components of force at the joint located at points A, B, and C, respectively; VA , VB , and VC are the corresponding vertical force components. Finally, FA , FB , and FC are the magnitudes of the force at each ˙ and θ¨ versus time. (2) Draw free body diagrams for each of the three joint. (1) Plot θ, θ, panels and the corresponding dynamic equations of motion. (3) Plot MA , MB , and MC . Find the instant at which each torque is maximum. Which motor will have to produce the highest torque? Why? (4) On one graph, plot HA , HB , and HC versus time. (5) Plot VA , VB , and VC . (6) Plot FA , FB , and FC . Find the instant at which each force component is maximum. Which joint is the most heavily loaded? Why? (7) If the maximum torque the motors can produce is MMAX = 100 N·m, what is the minimum time in which the deployment can be completed?
Partially deployed configuration
Stowed configuration B i2 q A Satellite
q q i1
L q
q
Solar panel
i2
C
Fig. 6.21. Satellite in the stowed and partially deployed conſgurations.
F(t)
u(t)
k A
f i1
Fig. 6.22. Satellite release conſguration.
Problem 6.25. Satellite release A satellite is released from a launch vehicle, as depicted in ſg. 6.22. The satellite is composed of a rigid body and of two solar panels of length L = 5 m. During release, force F (t) imparts to the satellite the following vertical motion (rectilinear motion along ¯ı2 ): u(t) = Δ0 (1 − cos πt/T )/2 for t ≤ T /2 and u(t) = Δ0 [π(2t/T − 1)/2 + 1] /2 for t > T /2, where Δ0 = 0.5 m and T = 0.5 s is the characteristic release time. Due to the impulsive nature of the applied force, the solar panel will start to vibrate. Each panel is uniform and has a mass mp = 100 kg. The elasticity of the panels will be represented by torsional springs of stiffness k = 5 kN·m/rad at their root. In view of the symmetry of the problem, the motions of the two solar panels will be identical. Consequently, the sole right panel will be investigated here. (1) Draw a free body diagram for the right panel. (2) Derive the differential equation of motion ˙ and of the panel. (3) Solve this equation numerically. (4) On three separate graphs, plot φ, φ, ¨ φ as a function of time, for t ∈ [0, 5] s. (5) Plot the horizontal and vertical components of reaction at point A. (6) Plot the torque in the torsional spring as a function of time.
6.7 Planar motion of rigid bodies
239
Problem 6.26. Quick return mechanism The quick return mechanism shown in ſg. 5.9 consists of a uniform crank of length c = 0.30 m and mass mc = 12 kg, and of a uniform bar of length b = 1.6 m and mass mb = 60 kg. The crank is pinned at point R and the bar is pinned at point O. The distance between these two points is d = 0.35 m. At point P, a slider allows the tip of the crank to slide along the bar. Gravity acts in the vertical direction, g = 9.81 m/s2 . The system is driven by a torque M applied to the crank at point R. The time history of angle θ is: θ(t) = π(1 − cos πt/T )/2, where T = 5 s is the time required for the crank to rotate 180 degrees. (1) Draw free body diagrams of the bar and crank. Write the equations of motion of the system. (2) Plot the time history of the contact force at the slider. (3) On the same graph, plot the horizontal and vertical components of the reaction force at point O. (4) Plot the time history of the torque M required to drive the system. (5) On the same graph, plot the horizontal and vertical components of the reaction force at point R.
Problem 6.27. Bar hinged at rim of rotating disk Figure 6.23 shows a homogeneous disk of radius R m, l and mass M rotating in a vertical plane around iner i f D + 2 e1 tial point O. Frame F = O, E = (¯ e1 , e¯2 , e¯3 ) is e2 attached to the disk. At point B, a homogeneous bar B g of length and mass m is hinged to the disk. A torI q O sional spring of stiffness constant k and a torsional i1 dashpot of constant c are located at the hinge. The M, R spring is un-stretched when φ = φ0 . (1) Derive the equations of motion of the system in terms of angles θ and φ. (2) On one graph, plot angles θ and φ versus Fig. 6.23. Bar hinged at rim of rotatτ . (3) On one graph, plot angular velocities θ and ing disk. φ . (4) On one graph, plot angular accelerations θ and φ . (5) Plot the cumulative energy ¯ = K/k, potential, ¯ d = Wd /k. (6) Plot the system’s kinetic, K dissipated in the dashpot, W V¯ = V /k, energies. Check that the energy closure equation is satisſed. (7) Plot the compo¯ = F /(mω 2 ), in the hinge, resolved in basis E + . Use the following nents of the force, F B B ¯ = R/ = 1, g¯ = g/(ω 2 ) = 1.2, ζ = ωc/(2k) = 0.05, and data: μ = M/m = 3, R φ0 = 0. Use the following non-dimensional time τ = ωt, where ω 2 = 3k/(m2 ) and (·) indicates a derivative with respect to τ . Plot all results for τ ∈ [0, 30]. The initial conditions are θ = φ = 0, θ = 1, and φ = −1.
Problem 6.28. Robotic arm in space Consider a robotic arm in space depicted in ſg. 6.24. The ƀexibility of the arm will be represented in a crude manner by a mid-span torsional spring of stiffness kt = 1, 500 N·m/rad. The ſrst segment of the robotic arm is of length La = 2.4 m and its orientation is prescribed as θ(t) = π(1 − cos πt/T )/6 for t ≤ T and θ(t) = π/3 for t > T , where T = 25 s. The second segment of the robotic arm is of length Lb = 2.4 m and mass mb = 60 kg. The system is used to manipulate a payload of mass Mp = 1, 000 kg and moment of inertia Ip = 250 kg·m2 connected to the tip of the second segment of the robotic arm. (1) Derive the equation of motion for the orientation angle φ of the second segment of the robotic arm. (2) Solve this differential equation numerically assuming initial conditions at rest. (3) On the same graph, plot θ and φ as a function of time. What is the maximum overshoot, (φmax − θmax )/θmax , of φ with respect to the command signal θ? (4) Plot the angular velocity φ˙ as a function of time. (5) Plot the torque, Q, in the torsional spring as a function of time. (6) Find the components of force through the pin at point A. (7) On the same graph, plot the horizontal and vertical com-
240
6 Kinetics of rigid bodies
ponents of this force and its magnitude as a function of time. For all graphs, use t ∈ [0, 100] s.
Initial configuration
i2 A La O
Lb, mb T f Mp, Ip kt q i1
Fig. 6.24. Robotic arm conſguration.
L2
Satellite
L1
P
A
i2 O
f
q
i1
Fig. 6.25. Satellite capture conſguration.
Problem 6.29. Satellite capture A satellite is to be brought to the cargo bay of the space shuttle. Figure 6.25 shows the initial conſguration of the system, with the satellite connected to the end of the shuttle robotic system. The ſrst part of the robotic system, bar 1, is a uniform bar of length L1 = 4 m and mass m1 = 100 kg. The second part of the robotic system, bar 2, is a uniform bar of length L2 = 3 m, and mass m2 = 65 kg. The satellite has a mass mpl = 1, 500 kg and a moment of inertia Ipl = 1, 200 kg·m2 . Torques TO and TA are applied at the joints located at points O and A, respectively, in such a way that the time histories of angles θ and φ are as follows: θ(t) = π(1 + cos πt/T ) and φ(t) = π(1 − 3 cos πt/T )/4, respectively, where T = 10 s is the total time needed bring the satellite into the cargo bay. (1) On the same graph, plot the time history of angles θ and φ. (2) Plot the angular velocities of bars 1 and 2. (3) Plot the angular accelerations of bars 1 and 2. (4) Plot the trajectory of the satellite as it is brought into the cargo bay. (5) Draw free body diagrams for bars 1 and 2. (6) Plot the horizontal and vertical components of the reaction force at point A, denoted HA and VA , respectively. (7) Plot the horizontal and vertical components of the reaction force at point O, denoted HO and VO , respectively. (8) Plot the torques TO and TA applied at points O and A, respectively. (9) If the actuators at points O and A can generate a maximum torque of 1,000 N·m, ſnd the minimum maneuver time, Tmin .
Problem 6.30. Rolling cylinder with bar Figure 6.26 shows a homogeneous cylinder of mass M and radius r rolling without sliding on a horizontal plane under the effect of gravity. A homogeneous bar of mass m and length is rigidly attached to the center of the cylinder. Angle θ denotes the orientation of the bar with respect to the vertical axis. At the tip of the bar, denoted point T, a spring of stiffness constant k connects the bar to ſxed point A; the un-stretched length of the spring vanishes. (1) Derive the equations of motion of the system in terms of angle θ. (2) Plot θ as a function of time. (3) Plot θ˙ as a function of time. (4) On one graph, plot the kinetic, potential and total mechanical energies of the system. (5) On one graph, plot the normal reaction and friction forces acting on the disk. (5) If the friction coefſcient between the disk and the horizontal plane is μ = 0.3, will the disk start sliding? Use the following data: r = 0.25, = 1.25 m, M = 5, m = 1.25
6.7 Planar motion of rigid bodies
241
kg, k = 10 N/m, and g = 9.81m/s2 . At time t = 0, θ = 4π/5 rad, and θ˙ = 0. Present all your results for t ∈ [0, 5] s.
i2 q k
f
g
A
i2 k
A
T
T C
r
l
l
kf i1
Fig. 6.26. Conſguration of the rolling cylinder.
g
q C
r i1
Fig. 6.27. Conſguration of the rolling cylinder with an articulated bar.
Problem 6.31. Rolling cylinder with articulated bar Figure 6.27 shows a homogeneous cylinder of mass M and radius r rolling without sliding on a horizontal plane under the effect of gravity. A homogeneous bar of mass m and length is articulated to the rim of the cylinder. Angle θ denotes the rolling angle of the cylinder and angle φ the orientation of the bar with respect to the vertical axis. At the tip of the bar, denoted point T, a spring of stiffness constant k connects the bar to inertial point A; the un-stretched length of the spring vanishes. A torsional spring of constant kφ acts at the connection between the cylinder and the bar; the spring is un-stretched when the bar point radially outwards. (1) Derive the equations of motion of the system in terms of angles θ and φ. (2) On one graph, plot θ and φ as a function of time. (3) On one graph, plot θ˙ and φ˙ as a function of time. (4) On one graph, plot the kinetic, potential and total mechanical energies of the system. Use the following data: r = 0.25, = 1.25 m, M = 5, m = 1.25 kg, k = 10 N/m, kφ = 15 N·m/rad, and g = 9.81m/s2 . At time t = 0, θ = π/2, φ = 0 rad, and θ˙ = φ˙ = 0. Present all your results for t ∈ [0, 5] s.
Problem 6.32. Balancing a rotor Consider the rigid rotor of length L and mass M supported by two end bearings at points B and D, as depicted in ſg. 6.19 and discussed in example 6.7. Due to manufacturing imperfections, the rotor is not balanced, i.e., the coordinates of the center of mass do not vanish, x∗1c = 0 and B∗ B∗ = 0 and I23 = 0. To x∗2c = 0, and axis ¯b3 is not a principal axis of inertia of the rotor, I13 estimate these unknown parameters, the rotor is spun at a constant angular velocity, Ω, and the bearing reactions B1∗ , B2∗ , D1∗ , and D2∗ are measured. To balance the rotor, i.e., to eliminate the reaction forces at the bearings, it is proposed to add two point masses to the rotor. The ſrst point mass is located at point (x∗1a = Ra cos θa , x∗2a = Ra sin θa , x3a ) and is of mass ma ; similarly, the second point mass is located at point (x∗1b = Rb cos θb , x∗2b = Rb sin θb , x3b ) and is of mass mb . This implies that the balancing masses ma and mb are located on circles of radii Ra and Rb , respectively, at angular locations θa and θb , respectively. (1) Based on the B∗ B∗ B∗ B∗ measured reactions, evaluate I¯13 = I13 /(M L2 ) and I¯23 = I23 /(M L2 ). (2) Based on the ¯∗2c = x∗2c /L. (3) Find the magnitude of the measured reactions, evaluate x ¯∗1c = x∗1c /L and x balancing masses, ma and mb , and the angular locations, θa and θb . Use the following data: ¯1∗ = B1∗ /(M Ω 2 L) = −0.0005; B ¯2∗ = B2∗ /(M Ω 2 L) = .0008; D ¯ 1∗ = D1∗ /(M Ω 2 L) = B ∗ ∗ 2 ¯ 0.0005; D2 = D2 /(M Ω L) = −.0004. L = 0.5 m; M = 10 kg; x3a = 0.2 and x3b = 0.3 m; Ra = 0.2 and Rb = 0.1 m.
242
6 Kinetics of rigid bodies
Problem 6.33. Double pendulum with elastic joint Figure 6.17 depicts a double pendulum comprising bar 1, of mass m1 and length 1 , and bar a1 , a ¯2 )] be attached to bar 1 and 2, of mass m2 and length 2 . Let frame F A = [A, A = (¯ e1 , e¯2 )] be attached to bar 2. A massless tube allows bar 2 to slide in frame F E = [E, E = (¯ the direction of a ¯2 ; the slider has a mass M and is connected to bar 1 at point A by means of a spring of stiffness constant k. The position of the slider is determined by its distance, x, from point A, the tip of bar 1; the angular positions of the two bars with respect to the vertical are denoted θ1 and θ2 , respectively. The system is subjected to gravity along the inertial ¯ı1 direction. (1) Derive the equations of motion of the system in terms of θ1 , θ2 and x. (2) On one graph, plot θ1 and θ2 as a function of time. (3) Plot x as a function of time. (4) On one graph, plot the angular velocities of the two bars. (5) Plot x˙ as a function of time. (6) On one graph, plot the kinetic, potential and total mechanical energies of the system. (7) On one graph, plot the reaction and elastic forces at the joint. (8) On one graph, plot the vertical and horizontal components of the reaction force at point O. Use the following data: M = 1, m1 = 1 and m2 = 1 kg; 1 = 0.4 and 2 = 0.5 m; k = 400 N/m; g = 9.81 m/s2 . At the initial time t = 0, θ1 = θ2 = π/2 and x = 0. Present all the results of the simulation for t ∈ [0, 15] s.
Problem 6.34. Pendulum with sliding mass Figure 6.18 shows a pendulum comprising a bar of mass m and length and a rigid body e1 , e¯2 )] be attached of mass M , as discussed in example 6.6. Let frame F E = [(O, E = (¯ to the bar. The rigid body is connected at point B to the tip of the bar at point A by means of a spring of stiffness constant k and a dashpot of constant c. The stretch of the spring is denoted x and its un-stretched length vanishes. The center of mass of the rigid body is located at point C and vector η deſnes the position of the center of mass with respect to point B; the moment of inertia of the body with respect to center of mass is denoted I C . The angular position of the bar with respect to the vertical is denoted θ. The system is subjected to gravity along the inertial ¯ı1 direction. (1) Derive the equations of motion of the system in terms of θ and x. (2) Plot θ as a function of time. (3) Plot x as a function of time. (4) Plot the angular velocity of the bar. (5) Plot x˙ as a function of time. (6) On one graph, plot the kinetic and potential energies of the system as well as the energy dissipated in the dashpot. Verify the energy closure equation. (7) On one graph, plot the interaction force at point A and the total force in the spring and dashpot assembly. (8) On one graph, plot the vertical and horizontal components of the reaction force at point O. (9) Plot the interaction moment between the bar and the rigid body. Use the following data: m = 0.4, M = 2.5 kg, = 0.45 m, k = 10 N/m, c = 0.05 N·s/m, I C∗ = 0.75 kg·m2 , η1∗ = 0.2, and η2∗ = 0.3 m are the components of vector η in basis E, and g = 9.81 m/s2 . At the initial time t = 0, θ = π/2 and x = 0. Present all the results of the simulation for t ∈ [0, 50] s.
Problem 6.35. Milling machine Consider the simpliſed model of a milling machine as depicted in ſg. 6.28. The tool support is a rigid body of mass m and moment of inertia I O with respect to point O connected to the ground at point O. Its center of mass is located at point A, which is at a distance 1 from point O. A torsional spring of stiffness constant kθ , the un-stretched rotation of the spring is denoted θ0 , and a torsional dashpot of constant cθ act at point O. Let frame F E = (O, E ), E = (¯ e1 , e¯2 ), be attached to the tool support; the angle between axes ¯ı1 and e¯1 is denoted θ. A massless, rigid bar DB of length 2 is free to slide inside the tool support. A spring of stiffness constant kx , the un-stretched length of the spring is denoted x0 , and a dashpot of constant cx connect the tool support at point A to the bar at point D. At point B, the bar connects to
6.7 Planar motion of rigid bodies
243
the milling machine tool, which is free to rotate about point B. Let frame F B = (B, B), B = (¯b1 , ¯b2 ), be attached to the tool, whose center of mass C is located a distance d along axis ¯b1 . The tool rotates at a constant angular velocity, Ω, with respect to the bar, such that the angle between axes e¯1 and ¯b1 is φ = Ωt. The tool is of mass M and moment of inertia I B with respect to point B. (1) Derive the equations of motion of the system in terms of angle θ and x, the distance from point A to D. (2) Plot θ as a function of time. (3) Plot x as a function of time. (4) Plot the angular velocity of the tool support. (5) Plot x˙ as a function of time. (6) Plot the torque TB applied to the tool at point B. (7) On one graph, plot the cumulative work dissipated in the two dashpots and that done by torque TB . (8) On one graph, plot the kinetic and potential energies of the system. Verify the energy closure equation. Use the following data: 1 = 0.25, 2 = 0.3, d = 0.002 m, m = 2, M = 4 kg, I O = 0.2, I B = 0.0125 kg·m2 , Ω = 400 rad/s, kx = 10 kN/m, kθ = 15 kN·m/rad, cx = 10 N·s/m, cθ = 10 N·m·s/rad, θ0 = π/4, x0 = 0.1 m, and g = 9.81 m/s2 . At the initial time t = 0, θ = π/4 and x = 0.1 m. Present all the results of the simulation for t ∈ [0, 0.5] s.
k q , cq O
e2
l1 m
A
e2 i2
O I
x cx
kx
D b2
l2
q
D k
b2
l
g
s*1 B q
d i1
w
B M e1
C b1 f = Wt
Fig. 6.28. Conſguration of the milling machine.
i1
d
g s*2
A M e 1
i2 c
C b1 f
Fig. 6.29. Conſguration of the simpliſed suspension system.
Problem 6.36. Suspension system Figure 6.29 shows the conſguration of a simpliſed planar suspension system. A rigid body of mass M is connected to the ground at point A by means of a massless rigid bar of length and at point B by means of a spring of stiffness constant k and dashpot of constant c. Reference frame F B A, B = (¯b1 , ¯b2 ) is attached to the rigid body at point A; the center of mass of the rigid body is located at distance d from point A, along axis ¯b1 . The coordinates of point B, resolved in B, are (s∗1 , s∗2 ). Point D is located a distance w from point O. The conſguration of the system is represented by angles θ and φ, as indicated in the ſgure. (1) Draw a free body diagram of the system. (2) Derive the two equations of motion of the system. (3) Find the load in the bar.
Problem 6.37. Bar rocking on top of a curve A homogeneous bar a length L, thickness h, and mass M is rocking without sliding on top of a ſxed curve, as depicted in ſg. 6.30. At contact point P, a normal contact force, N , and a friction force, F , are acting on the bar. (1) Find the work done by the normal contact force, N . Under what condition will this force perform work? (2) Find the work done by the friction force, F . Under what condition will this force perform work? (3) By means of high-speed
244
6 Kinetics of rigid bodies
cameras, an experimentalist is monitoring the elevation, d, of the bar’s center of mass above the apex of the curve. At times t1 and t2 , the elevations of the center of mass were measured to be d1 and d2 , respectively. What can be said about the evolution of the bar’s kinetic energy during that time. (4) Is the system’s total mechanical energy preserved? (5) Does the bar’s angular momentum remain constant? Justify all your answers.
Problem 6.38. Bar rocking atop a curve Figure 6.30 depicts a homogeneous bar a length L, thickness h, and mass M rocking without sliding on top of a ſxed curve. The curve is deſned by its intrinsic parametrization, r(s), where s is the curvilinear variable measuring length along the curve. (1) Find the equation of motion of the system. (2) Evaluate the normal contact and friction force at point P. (3) If the curve is a circle of radius R, what is the form of the equation of motion?
h C g
s
s
k M
d r(s)
i2
x c
L
P n i1
g
A
a(s) t
Fig. 6.30. Homogeneous bar rocking without sliding atop a curve.
i2
m, L i1
q
e1
Fig. 6.31. Pendulum connected to a plunging mass.
Problem 6.39. Pendulum connected to a plunging mass A pendulum of mass m and length L is connected to a mass M that is allowed to slide vertically, as depicted in ſg. 6.31. Mass M is connected to the ground be means of a spring of stiffness constant k and dashpot of constant c. The spring is un-stretched when x = 0. (1) Find the equations of motion of the system. (2) Plot the time history of the plunging motion, x ¯(τ ). (3) Plot the time history of angle θ(τ ). (4) Plot the velocity of the plunging mass, x ¯ (τ ). (5) Plot the angular velocity of the pendulum, θ (τ ). (6) On one graph, plot the non-dimensional horizontal and vertical components of the force applied to the plunging mass at point A, de¯ A = HA /(kL) and V¯A = VA /(kL), respectively. (7) Plot the cumulative energy noted H ¯ d = W d /(kL2 ). (8) On one graph, plot the system’s kinetic endissipated in the damper, W 2 ¯ ergy, K = K/(kL ), potential energy, V¯ = V /kL2 , and the energy closure equation. Use the following data: non-dimensional time, τ = ωt, where ω 2 = k/(M + m), (·) indicates a derivative with respect to τ , x ¯ = x/L, μ = m/(M + m) = 0.5, ζ = cω/(2k) = 0.05, and g¯ = (M + m)g/(kL) = 1.5. At the initial time (τ = 0), x ¯ = 0.5, x ¯ = 0, θ = π/3, and θ = 0. Present all results for τ ∈ [0, 20].
Problem 6.40. Two-bar mechanism The two bar mechanism shown in ſg. 6.32 comprises bar OB of length L1 and mass m1 , and bar BAT of length L2 and mass m2 . Bar BAT passes through a slider located at ſxed point A but free to swivel about that point. A spring of stiffness constant k connects the tip of the bar at point T to the slider at point A and is of vanishing un-stretched length. A viscous
6.7 Planar motion of rigid bodies
245
˙ acts at the interface between the bar and the slider. (1) Derive friction force, F f = −cw, the equation of motion of the system using generalized coordinate θ1 . (2) On one graph, plot angles θ1 and θ2 as functions of the non-dimensional time τ . (3) On one graph, plot angular velocities θ1 and θ2 . (4) On one graph, plot angular accelerations θ1 and θ2 . (5) Plot the ¯ = Δ/L1 . (6) On one graph, plot the friction force, F¯ f = F f /(kL1 ), spring stretch, Δ and reaction force at the slider, S¯ = S/(kL1 ). (7) Plot the cumulative energy dissipated at ¯ = K/(kL21 ), ¯ d = W d /(kL21 ). (8) On one graph, plot the kinetic energy, K the slider, W 2 ¯ potential energy, V = V /(kL1 ), and the energy closure equation. Use the following data: ¯2 = μ1 = m1 /(m1 + m2 ) = 0.6, μ2 = m2 /(m1 + m2 ) = 1 − μ1 , d¯ = d/L1 = 3, L L2 /L1 = 5, and g¯ = (m1 + m2 )g/(kL1 ) = 0.2. Use the non-dimensional time τ = ωt, where ω 2 = k/(m1 + m2 ). The viscous friction coefſcient is written as c = 2(m1 + m2 )ωζ, where ζ = 0.02. At the initial time, θ1 = 0 and θ1 = 2.4, where (·) indicates a derivative with respect to τ . Present all your results for τ ∈ [0, 40].
i2 m1, L1
B q1
w
g
q2
O d
B
i1 A
a
e1
m2, L2
k T
Fig. 6.32. Two-bar mechanism.
a2 L/2 m
a1 r/2
C r/2
M D e3
L/2
Fig. 6.33. Rotor with skewed bar.
Problem 6.41. Rigid bar connected to a rotor Figure 6.33 shows rigid rotor of length L and mass M . Basis E = (¯ e1 , e¯2 , e¯3 ) is attached to the rotor; unit vector e¯3 is aligned with the shaft. A rigid bar of length r and mass m is ¯2 , a ¯3 ) is attached to the bar, e¯3 = a ¯3 and rigidly connected to the shaft. Basis A = (¯ a1 , a ¯1 ); unit vector a ¯1 is aligned with the bar. The shaft and bar are homogeneous α = (¯ e2 , a slender rods, see ſg. 6.42, and their centers of mass coincide at point C. The rotor rotates at a constant angular velocity, Ω, about axis e¯3 , and is supported by bearings at points B and D. (1) Compute the components of the reaction forces at points B and D resolved in basis E . (2) Compute the components of the reaction forces at points B and D resolved in the inertial basis.
Problem 6.42. Rigid disk connected to a rotor Figure 6.33 shows rigid rotor of length L, radius R, and mass M . Basis E = (¯ e1 , e¯2 , e¯3 ) is attached to the rotor; unit vector e¯3 is aligned with the shaft. A rigid disk of radius r and mass ¯2 , a ¯3 ) is attached to the disk, e¯3 = a ¯3 m is rigidly connected to the shaft. Basis A = (¯ a1 , a and α = (¯ e2 , a ¯1 ); unit vector a ¯2 is normal the disk. The shaft is a homogeneous cylinder, see ſg. 6.40, and the disk a homogeneous thin disk, see ſg. 6.41. Their centers of mass coincide at point C. The rotor rotates at a constant angular velocity, Ω, about axis e¯3 , and is supported by bearings at points B and D. (1) Compute the components of the reaction forces at points B and D resolved in basis E . (2) Compute the components of the reaction forces at points B and D resolved in the inertial basis.
246
6 Kinetics of rigid bodies
Problem 6.43. Particle sliding in a rolling wheel Figure 6.34 shows a homogeneous wheel of mass M and radius R rolling without sliding on a horizontal plane under the effect of gravity. A particle of mass m slides in a radial slot of the wheel and is connect to its center by means of a spring of stiffness constant k and dashpot of constant c. The un-stretched length of the spring is x0 . (1) Derive the system’s equations of motion using the generalized coordinates x and θ indicated on the ſgure. (2) Plot the position of mass m, x ¯ = x/R, versus τ . (3) Plot angle θ versus τ . (4) Plot the velocity of mass m, x ¯ . (5) Plot the wheel’s angular velocity, θ . (6) Plot the acceleration of mass m, x ¯ . (7) Plot the wheel’s angular acceleration, θ . (8) Plot the cumulative energy dissipated in the dashpot, ¯ = K/(mω 2 R2 ), and potential, ¯ d = Wd /(mω 2 R2 ). (9) On one graph, plot the kinetic, K W 2 2 ¯ V = V /(mω R ), energies of the system. Verify that the energy closure equation is satisſed. (10) On one graph, plot the force in the spring-dashpot system, F¯sd = Fsd /(mω 2 R), and the contact force between the particle and slot, F¯ c = F c /(mω 2 R). (11) On one graph, plot the normal and tangential force components at the point of contact of the wheel with the ¯ = N/(mω 2 R) and F¯ f = F f /(mω 2 R), respectively. (12) What is the minimum plane, N required friction coefſcient if the wheel is to roll without sliding. Use the following data: ¯0 = x0 /R = 0.5. Use μ = M/m = 5, ζ = c/(2mω) = 0.01, g¯ = g/(Rω 2 ) = 0.2, x the following non-dimensional time τ = ωt, where ω 2 = k/m and (·) indicates a derivative with respect to τ . Plot all results for τ ∈ [0, 200]. The initial conditions are x ¯=θ=x ¯ = 0, and θ = 0.1.
i2
g
M
R c x k m
i2
q
x g
O
i1
m k
f i1
Q
O Fig. 6.34. Particle sliding in a rolling wheel.
c
Fig. 6.35. Particle in a slot on a rotating disk.
Problem 6.44. Particle in a slot on a rotating disk Figure 6.35 depicts a homogeneous disk of mass M and radius R rotating in a vertical plane around inertial point O. Mass m is free to slide in a radial slot on the disk and is connected to the center of the disk by means of a spring of stiffness constant k and a dashpot of constant c. The system is subjected to gravity and a torque, Q, is applied to the disk. The spring’s un-stretched length is denoted x0 . (1) Derive the equations of motion of the system in terms of angle φ and distance x from point O to the particle. (2) Find the horizontal and vertical components of the reaction force at point O. (3) If the disk is to rotate at a constant angular velocity, φ˙ = Ω, ſnd the equation of motion for the particle. (4) Find the applied torque, Q, required to maintain this constant angular speed.
Problem 6.45. Pendulum connected to horizontal piston Figure 6.36 shows a pendulum of length with a tip mass m. A piston of mass M is rigidly connected to a horizontal rod sliding along the pendulum by means of a slider at point S. A spring of stiffness constant k and dashpot of constant c connect the piston to the ground. The spring is un-stretched when angle θ = 0. The distance from the vertical to point S is denoted x. (1) Derive the system’s equation of motion in terms of angle θ indicated on the
6.7 Planar motion of rigid bodies
247
ſgure. (2) Plot angle θ versus time τ . (3) Plot the rod’s angular velocity, θ . (4) Plot the rod’s angular acceleration, θ . (5) Plot distance x ¯ = x/. (6) Plot the piston’s speed x ¯ . (7) Plot the piston’s acceleration x ¯ . (8) Plot the cumulative energy dissipated in the dashpot, ¯ = K/(mω 2 2 ), and potential, ¯ d = Wd /(mω 2 2 ). (9) On one graph, plot the kinetic, K W 2 2 ¯ V = V /(mω ), energies of the system. Verify that the energy closure equation is satisſed. (10) Plot the normal slider force, S¯ = S/(mω 2 ). Use the following data: μ = M/m = 2, ¯ = h/ = 0.25, g¯ = g/(ω 2 ) = 0.1, ζ = c/(2mω) = 0.01. Use the following nonh dimensional time τ = ωt, where ω 2 = k/m and (·) indicates a derivative with respect to τ . Plot all results for τ ∈ [0, 50]. The initial conditions are θ = 0, and θ = 1.
I
i2
O x
h
q
c
l m
q
g
i2
M
S i1
m, l
k
x
g
e1 Fig. 6.36. Pendulum connected to horizontal piston.
O
I
k
A
M i1
d/2 d/2 Fig. 6.37. Inverted pendulum mounted on a cart.
Problem 6.46. Inverted pendulum mounted on a cart Figure 6.37 depicts an inverted homogeneous pendulum of mass m and length . The pendulum is mounted on a cart of mass M free to translate along a horizontal track. A torsional spring of stiffness constant k restrains the pendulum at its attachment point. The spring is un-stretched when angle θ = θ0 . (1) Derive the two equations of motion of the system. (2) Plot the cart’s position, x ¯ = x/ versus τ . (3) Plot angle θ. (4) Plot the cart’s velocity, x ¯ . (5) Plot θ . (6) Plot the cart’s acceleration, x ¯ . (7) Plot θ . (8) Plot the system’s kinetic, ¯ = K/m2 ω 2 , potential, V¯ = V /(m2 ω 2 ), and total mechanical energies. (9) Plot the K ¯ = H/(mω 2 ) horizontal and vertical components of the internal force at point A, denoted H 2 ¯ and V = V /(mω ), respectively. (10) Plot the vertical force components in the front and rear wheels, denoted F¯f = Ff /(mω 2 ) and F¯r = Fr /(mω 2 ), respectively. Use the following data: μ = M/m = 1.5, θ0 = 0, and d¯ = d/ = 1. Use non-dimensional time τ = ωt, where ω = k/(m2 ) and (·) denotes a derivative with respect to τ . At the initial time, x ¯ = 0, x ¯ = 1, θ = π/4, θ = 0. Present all your results for τ ∈ [0, 20]. Study two cases, g¯ = g/(ω 2 ) = 0.8 and g¯ = 4, and comment on the differences.
Problem 6.47. Geneva wheel mechanism Figure 6.38 depicts the Geneva wheel mechanism, which consists of a disk and slotted arm. The disk of radius R and mass M is free to rotate about inertial point O. A pin is located at the rim of the disk at point P. The slotted arm of length and mass m is hinged at point A and the pin slides inside the slot. The distance from point A to the pin is denoted w. At point A, the arm is restrained by a torsional spring of stiffness constant k and a torsional dashpot of constant ˙ acts at c1 . The spring is un-stretched when θ = 0. A viscous friction force, F f = −c2 w, the interface between the pin and the slot. (1) On one graph, plot angles φ and θ versus τ . (2) On one graph, plot angular velocities φ and θ . (3) On one graph, plot angular accelerations φ and θ . (4) Plot the cumulative energy dissipated in the dashpot and friction mechanism,
248
6 Kinetics of rigid bodies
¯ = K/k, potential, V¯ = V /k, energies. Check ¯ d = Wd /k. (5) Plot the system’s kinetic, K W that the energy closure equation is satisſed. (6) On one graph, plot the normal contact force, F¯ c = F c /(mRω 2 ), and viscous friction force, F¯ f = F f /(mRω 2 ), at the pin. Use the ¯ = L/R = 1.5, ¯ = /R = 2, g¯ = g/(ω 2 ) = 0.2, following data: μ = M/m = 2, L ζ1 = ωc1 /(2k) = 0.02, and ζ2 = ωc2 R2 /(2k) = 0.01. Use the following non-dimensional time τ = ωt, where ω 2 = 3k/(m2 ) and (·) indicates a derivative with respect to τ . Plot all results for τ ∈ [0, 5]. The initial conditions are θ = 0 and θ = 1.5.
w P m, l
k
A f
q
g L
O
i2 e2
r
e1 P
q
Ak
i1
O
c
d
M, R
Fig. 6.38. Geneva wheel mechanism.
Fig. 6.39. Scotch yoke mechanism.
Problem 6.48. Scotch yoke mechanism Figure 6.39 depicts the Scotch yoke mechanism, which consists of a disk and slotted yoke. The disk of radius R and mass M is free to rotate about inertial point O. A pin is located at a distance r from the center of the disk. The slotted yoke of length and mass m is allowed to move horizontally and the pin slides inside the slot. At point A, the yoke is restrained by a spring of stiffness constant k and a dashpot of constant c1 . The spring is un-stretched when θ = 0. A viscous friction force, F f = −c2 vr , acts at the interface between the pin and the slot; vr is the relative velocity of the pin with respect to the slot. (1) Plot angle θ versus τ . (2) Plot angular velocity θ . (3) Plot angular acceleration θ . (4) Plot the cumulative energy dissipated ¯ d = Wd /(mω 2 r 2 ). (5) Plot the system’s kinetic, in the dashpot and friction mechanism, W 2 2 ¯ ¯ K = K/(mω r ), potential, V = V /(mω 2 r 2 ), energies. Check that the energy closure equation is satisſed. (6) On one graph, plot the normal contact force, F¯ c = F c /(mrω 2 ), and viscous friction force, F¯ f = F f /(mrω 2 ), at the pin. Use the following data: μ = M/m = 2, ¯ = R/r = 0.8, ζ1 = ωc1 /(2k) = 0.01, and ζ2 = ωc2 /(2k) = 0.01. Use the following R non-dimensional time τ = ωt, where ω 2 = k/m and (·) indicates a derivative with respect to τ . Plot all results for τ ∈ [0, 30]. The initial conditions are θ = 0 and θ = 1.5.
6.8 Inertial characteristics The inertial characteristics of rigid bodies with simple shapes are presented below. For each rigid body, the volume, V, of the body and the principal mass moments of ∗ ∗ ∗ inertia I11 , I22 , and I33 are given. The ſgures also indicate the location of the center of mass and the orientation of the principal axes of inertia. •
Cylinder (Figure 6.40): volume, V = πR2 L; principal mass moments of inertia, ∗ ∗ ∗ I11 = I22 = mR2 /4 + mL2 /12, I33 = mR2 /2.
6.8 Inertial characteristics
•
Thin disk (Figure 6.41): volume, V = πR2 L; principal mass moments of iner∗ ∗ ∗ tia, I11 = I22 = mR2 /4, I33 = mR2 /2. These results are obtained from their counterparts for a cylinder when L/R 1.
L/2
i2
L
L/2
i2
R
C
R
i3
i3
C
i1
i1 Fig. 6.40. Cylinder.
•
•
249
Fig. 6.41. Thin disk.
Slender rod (Figure 6.42): volume, V = AL; principal mass moments of inertia, ∗ ∗ ∗ I11 = I22 = mL2 /12, I33 ≈ 0. These results are obtained from their counterparts for a cylinder when d/R 1, where d is a representative dimension of area A. Half cylinder (Figure 6.43): volume, V = πR2 L/2; principal mass moments ∗ ∗ ∗ of inertia, I11 = m(R2 /4 − d2 ) + mL2 /12, I22 = mR2 /4 + mL2 /12, I33 = 2 2 m(R /2 − d ); center of mass location, d = 4R/3π. i2 i2 L/2
L/2 C
C i1
i3
Area A
Fig. 6.42. Slender rod.
• • • •
L/2
d R
L/2
i3
i1
Fig. 6.43. Half cylinder.
Parallelepiped (Figure 6.44): volume, V = abc; principal mass moments of ∗ ∗ ∗ inertia, I11 = m(b2 + c2 )/12, I22 = m(a2 + c2 )/12, I33 = m(a2 + b2 )/12. Thin plate (Figure 6.45): volume, V = abc; principal mass moments of inertia, ∗ ∗ ∗ I11 = mb2 /12, I22 = ma2 /12, I33 = m(a2 + b2 )/12; center of mass location, d = 4R/3π. Sphere (Figure 6.46): volume, V = 4πR3 /3; principal mass moments of inertia, ∗ ∗ ∗ I11 = I22 = I33 = 2mR2 /5. Half sphere (Figure 6.47): volume, V = 2πR3 /3; principal mass moments of ∗ ∗ ∗ inertia, I11 = I22 = 83mR2 /320, I33 = 2mR2 /5.
250
6 Kinetics of rigid bodies
i2 a/2
c/2
i2
a/2
c/2
c
b/2
a
b/2
b
C
C i3
i1
i3
i1
Fig. 6.44. Parallelepiped.
Fig. 6.45. Thin plate.
i2 i3 R
3R/8
i1
i3
i1
i2
Fig. 6.46. Sphere.
• •
R
C
C
Fig. 6.47. Half sphere.
Ellipsoid (Figure 6.48): volume, V = 4πabc/3; principal mass moments of ∗ ∗ ∗ inertia, I11 = 1/5 m(b2 + c2 ), I22 = 1/5 m(a2 + c2 ), I33 = 1/5 m(a2 + b2 ). 2 Hollow cylinder (Figure 6.49): volume, V = π(Ro − Ri2 )L; principal mass ∗ ∗ ∗ moments of inertia, I11 = I22 = m(Ro2 − Ri2 )/4 + mL2 /12, I33 = m(Ro2 − 2 Ri )/2. i2
L/2
L/2 i2
c
a
Ro C
b C i3
Ri
i1
i1
i3 Fig. 6.48. Ellipsoid.
Fig. 6.49. Hollow cylinder.
Part III
Concepts of analytical dynamics
7 Basic concepts of analytical dynamics
Newtonian mechanics deals with the response of particles to externally applied loads and Euler generalized these concepts to systems of particles. For simple systems of particles, it is convenient to use Cartesian coordinates to represent the conſguration of the system, but more often than not, other types of coordinates are used as well. For instance, path or surface coordinates were introduced in chapter 2. The manipulation of ſnite rotation also plays an important role in dynamics and was studied in depth in chapter 4. In fact, the ability to use various types of coordinates considerably simpliſes the description of dynamical systems and the analysis of their response to externally applied loads. The concepts of generalized coordinates, kinematic constraints, and degrees of freedom are introduced in section 7.2. Next, the important concepts of virtual displacements and rotations presented in section 7.3 lead to the deſnition of a scalar quantity of fundamental importance to dynamics, the virtual work presented in section 7.4. The principle of virtual work for static problems is introduced in section 7.5 and is shown to be equivalent to Newton’s ſrst law. Examples of application of this important principle are presented using both arbitrary virtual displacements and kinematically admissible virtual displacements. Finally, in the presence of conservative forces, the statement of the principle of virtual work is shown to simplify remarkably. The ſrst section introduces the mathematical tools required for the comprehension of this chapter.
7.1 Mathematical preliminaries In this section, the stationarity conditions of a function of several variables are expressed as both differential and variational conditions. These concepts will play a fundamental role in the remainder of the chapter.
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_7 © Springer Science+Business Media B.V. 2011
254
7 Basic concepts of analytical dynamics
7.1.1 Stationary point of a function Consider a function of n variables, F = F (u1 , u2 , . . . , un ). By deſnition, the stationary points [2] of this function are deſned as those for which ∂F = 0, ∂ui
(7.1)
i = 1, 2, . . . , n.
For a function of a single variable, this condition corresponds to a horizontal tangent to the graph of the function, as illustrated in ſg. 7.1. At a stationary point, the function can present a minimum, a maximum, or a saddle point. F
F ¶F/¶u1 = 0 u1 Minimum
¶F/¶u1 = 0
F
¶F/¶u1 = 0 u1
u1 Maximum
Saddle point
Fig. 7.1. Stationary points of a function.
If a function is stationary at a point, conditions (7.1) hold and the following statement is then true ∂F ∂F ∂F w1 + w2 + . . . + wn = 0, ∂u1 ∂u2 ∂un where w1 , w2 , . . ., wn are arbitrary quantities. It is convenient to use a special notation for these arbitrary quantities, wi = δui , where δui are called virtual changes in ui . The above statement now becomes ∂F ∂F ∂F δu1 + δu2 + . . . + δun = 0. ∂u1 ∂u2 ∂un Comparison of this result with a similar expression for the differential, dF , of the same function expanded using the chain rule for derivatives implies that virtual changes, δui , are similar to differentials in the variables, dui . Consequently, the virtual change operator, denoted “δ,” behaves in a manner similar to the differential operator, denoted “d”. This relationship between the two operators will be further investigated in later sections. The variation in function F , noted δF , is deſned as δF =
∂F ∂F ∂F δu1 + δu2 + . . . + δun . ∂u1 ∂u2 ∂un
(7.2)
If follows that the stationarity conditions, eq. (7.1), now become δF = 0,
(7.3)
7.1 Mathematical preliminaries
255
for all arbitrary variations δu1 , δu2 , . . ., δun . The differential conditions, eq. (7.1), and the variational condition, eq. (7.3), both express the necessary and sufſcient conditions for the stationarity of function F at a point. From the above developments, it is clear that eq. (7.1) implies eq. (7.3) and since the above reasoning can be reversed, it is simple to prove that eq. (7.3) implies eq. (7.1). Hence, the two conditions are equivalent. The process deſned by a “variation of function F ” can be thought of as a “mathematical experiment,” or “what if?” scenario. The condition δF = 0 for all arbitrary variations δu1 , δu2 , . . ., δun at a stationary point means “the change in function F would vanish if I were to change the values of all variables at the stationary point.” Or, “if I were to experiment with changes in all variables about a stationary point, I would ſnd no corresponding change in function F .” Because the changes in variable values deſned by such a mathematical experiment are not actual changes, the words “virtual change” are used. The symbol “δ” is associated with such virtual changes as opposed to the symbol “d” that refers to actual, inſnitesimal changes. To determine whether a stationary point is a minimum, a maximum, or a saddle point it is necessary to consider the second derivatives [2] of the function. If n ( i,j=1
∂2F dui duj > 0 ∂ui ∂uj
(7.4)
at a stationary point for all differentials dui and duj , the function presents a minimum. If, on the other hand, the same quantity is negative for all dui and duj , the function presents a maximum. Finally, if the same quantity can be positive or negative depending on the choice of the differentials, the function presents a saddle point. From the deſnition of the variation of a function, eq. (7.2), it follows that δ2F =
n ( i,j=1
∂2F δui δuj . ∂ui ∂uj
It is now clear that a stationary point is a minimum if δ 2 F > 0,
(7.5)
for all arbitrary variations δui and δuj . It is a maximum if δ 2 F < 0 for all variations, and a saddle point occurs if the sign of the second variation depends on the choice of the variations of the independent variables. 7.1.2 Stationary point of a deſnite integral Next, consider the determination of the stationary point of the following deſnite integral % b I= F (y, y , x) dx, (7.6) a
256
7 Basic concepts of analytical dynamics
where notation (·) indicates a derivative with respect to variable x. The integrand involves an unknown function, y(x), which is subjected to boundary conditions y(a) = α and y(b) = β. This problem seems to be of a completely different nature from that treated in the previous section. Indeed, integral I is a “function of a function,” i.e., the value of the deſnite integral, I, depends on the choice of the unknown function, y(x). Because there are an inſnite number of values of function y(x) for x ∈ [a, b], deſnite integral, I, is equivalent to a function of an inſnite number of variables. This problem will be treated using the variaf(x) tional formalism introduced in the previous section. The concept of variation of a variable, δu, df f(x) is extended to the concept of variation of a function, denoted δf . Figure 7.2 shows two funcf(x) tions, f (x) and f¯(x), such that x a b δf = f¯(x) − f (x) = φ(x), (7.7) Fig. 7.2. The concept of variation of a function.
where φ(x) is a continuous and differentiable, but otherwise arbitrary function such that φ(a) = φ(b) = 0. In other words, δf is a virtual change that brings the function f (x) to a new, arbitrary function f¯(x). Note that δf (a) = δf (b) = 0. The stationarity of I requires %
b
δI = δ
%
b
F (y, y , x) dx = a
δF (y, y , x) dx = 0.
a
With the help of eq. (7.2), this becomes % b ∂F ∂F δy + δy dx = 0. δI = ∂y ∂y a Functions y(x) and y (x) are not independent of each other; hence, variations δy and δy are not independent, making it difſcult to draw any conclusion from this statement. To eliminate the variation δy , an integration by parts is performed on the second term in the square bracket % a
b
∂F dy δ( ) dx = ∂y dx
% a
b
∂F d (δy) dx = − ∂y dx
% a
b
d ∂F ∂F ( ) δy dx + δy dx ∂y ∂y
b
. a
The boundary terms vanish because δy(a) = δy(b) = 0, and the stationarity condition now becomes % b d ∂F ∂F − ( ) δy dx = 0. δI = ∂y dx ∂y a The bracketed term must vanish because the integral must vanish for all arbitrary variations δy. This yields
7.2 Generalized coordinates
∂F d ∂F − ( ) = 0. ∂y dx ∂y
257
(7.8)
Here again, the above reasoning can be reversed. Starting from eq. (7.8), and performing the integration by parts in the reverse order implies δI = 0. In summary, the necessary and sufſcient condition for the deſnite integral to be at a stationary point is that eq. (7.8) be satisſed. This differential equation is called the Euler-Lagrange equation of the problem. The variational formalism introduced in this section will be systematically applied to dynamics problem. It will be shown that the equations of motions of dynamics can be viewed as the Euler-Lagrange equations associated with the stationarity condition of deſnite integrals. Various forms of the equations of dynamics can be easily obtained by direct manipulations of these deſnite integrals. It is therefore important to understand the variational formalism and its implications. A crucial difference exists between a diff(x) f(x) ferential, df , of function f (x) and a variation, f(x) δf , of the same function, as depicted in ſg. 7.3. df A differential, df , is an inſnitesimal change in df f (x) resulting from an inſnitesimal change, dx, dx in the independent variable; df /dx represents x the tangent at the point. On the other hand, δf is an arbitrary virtual change that brings f (x) to b a f¯(x). The two quantities, df and δf , are clearly Fig. 7.3. The difference between a unrelated, the former is positive in ſg. 7.3, but differential, df , and a variation, δf . the latter is negative. Although the concepts associated with a differential of a function, df , and a variation of the same function, δf , are clearly distinct, manipulations of the two symbols are quite similar. For instance, the order of application of the two operations can be interchanged. Indeed, d d ¯ df¯ df df (δf ) = (f − f ) = − = δ( ). dx dx dx dx dx Similarly, the order of the integration and variation operations commutes 1% 2 % % b % b % b b b ¯ ¯ δ F dx = F dx = (f − F ) dx = δF dx. f dx − a
a
a
a
a
7.2 Generalized coordinates Consider a system consisting of N particles that are free to move in threedimensional space. The position vector of particle i will be expressed in terms of its Cartesian coordinates as ri = xi¯ı1 + yi¯ı2 + zi¯ı3 . The total number of parameters required to deſne the conſguration of the system is 3N , three parameters for each of the N particles. The solution of the problem involves the determination of the time
258
7 Basic concepts of analytical dynamics
history of these 3N Cartesian coordinates when the system is subjected to a set of time-dependent forces. Of course, Cartesian coordinates are not the only way to determine the position of a particle in space; for instance, the spherical coordinates introduced in section 2.7.2 could be used, and the Cartesian coordinates of particle i would then be expressed in terms of the spherical coordinates ri , φi , and θi as xi = ri sin φi cos θi , yi = ri sin φi sin θi , and zi = ri cos φi . As discussed in section 2.6, this coordinate transformation corresponds to a mapping of the three-dimensional space onto itself. Generalized coordinates In general, the Cartesian coordinates of particle i could be expressed in terms of n = 3N parameters, called generalized coordinates, as xi = xi (q1 , q2 , . . . qn ),
(7.9a)
yi = yi (q1 , q2 , . . . qn ), zi = zi (q1 , q2 , . . . qn ).
(7.9b) (7.9c)
The solution of the problem now involves the determination of the time history of the n generalized coordinates, qi , i = 1, 2, . . . n. Presumably, the choice of appropriate generalized coordinates will ease the solution of the problem. For instance, the solution of a problem involving spherical symmetry is often simpliſed by using spherical coordinates. It is assumed here that eqs. (7.9) deſne a one to one mapping between Cartesian and generalized coordinates; this implies that the Jacobian of the coordinate transformation, see eq. (2.72), has a non vanishing determinant at all points in space. Cartesian coordinates determine the position of a particle in space: the three parameters x, y, and z are the projections of the position vector of the particle along the axes of an orthonormal basis in three-dimensional space. Let the position of the particle be determined by spherical coordinates, q1 = r, q2 = φ, and q3 = θ. It now becomes possible to consider the three numbers, q1 , q2 , and q3 , to be the rectangular coordinates of a point in a three-dimensional space, called the conſguration space. Figure 7.4 depicts this concept: on the left, the particle is shown in the geometric space deſned by Cartesian coordinates x, y, and z; on the right, is it shown in the conſguration space deſned by generalized coordinates q1 , q2 , and q3 . The geometry of the problem is distorted in the conſguration space; if the particle is constrained to move on a spherical surface in the geometric space, it must remain in the shaded rectangular area shown on the right portion of ſg. 7.4. The concept of conſguration space can be generalized to higher-dimensional problems. If the system is deſned by n generalized coordinates, q1 , q2 , . . ., qn , these n numbers become the rectangular coordinates of a point in the n dimensional conſguration space. The trajectory of a particle is deſned by three functions, xi = xi (t), yi = yi (t), and zi = zi (t), a curve in three-dimensional space. In the conſguration space, the trajectories of all particles are deſned by a single curve in the n dimensional conſguration space, q1 = q1 (t), q2 = q2 (t), . . ., qn = qn (t).
7.2 Generalized coordinates
i3
q3 = q
f r
2p
P i2
q2 = f
P p
q i1
259
q1 = r Fig. 7.4. Geometric and conſguration spaces for n = 3.
Kinematic constraints The concept of generalized coordinates is intimately linked to that of kinematic constraints. For instance, ſg. 7.5 depicts a dumbbell consisting of two masses moving in two-dimensional space and linked by a rigid bar. At ſrst, the conſguration of the system will deſned by four generalized coordinates consisting of the Cartesian coordinates of the two particles: q1 = x1 and q2 = y1 for the ſrst particle, and q3 = x2 and q4 = y2 for the second. This representation, however, ignores the fact that the rigid bar imposes a kinematic constraint on the system: at all times, the two particles must remain at a distance from each other, and hence, (q3 − q1 )2 + (q4 − q2 )2 = 2 . Of the four generalized coordinates, three only are independent. Next, the conſguration of the system will be i2 m2 deſned by the Cartesian coordinates of its cenl xC ter of mass, point C, q1 = xC and q2 = yC , and q the orientation, q3 = θ, of the rigid bar with respect to axis ¯ı1 , as shown in ſg. 7.5. This secC yC m1 ond approach bypasses the need for kinematic i1 constraints. Clearly, the number of generalized coordinates is not a characteristic of the system: the dumbbell system can be represented alterna- Fig. 7.5. Dumbbell in twotively by three or four generalized coordinates. dimensional space. Degrees of freedom This discussion also leads to the concept of degree of freedom: the system depicted in ſg. 7.5 presents three degrees of freedom because three parameters are required to uniquely deſne its conſguration. Let n denote the number of generalized coordinates, m the number of kinematic constraints, and d the number of degrees of freedom; it then follows that d = n − m. (7.10)
260
7 Basic concepts of analytical dynamics
The ſrst approach discussed above involves 4 generalized coordinates - 1 kinematic constraint = 3 degrees of freedom. The second approach features 3 generalized coordinates - 0 kinematic constraint = 3 degrees of freedom. The number of degrees of freedom is an intrinsic characteristic of the system. On the other hand, the choice of the number of generalized coordinates is left to analyst. If the number of generalized coordinates exceeds that of degrees of freedom, m = n − d kinematic constraints must exist among the n generalized coordinates. If the number of generalized coordinates equals that of degrees of freedom, all generalized coordinates are independent and no kinematic constraints are involved. Finally, if the number of generalized coordinates is less than that of degrees of freedom, the conſguration of the system cannot be fully deſned. The number of degrees of freedom is an invariant characteristic of a given mechanical system; it is deſned as the minimum number of parameters necessary to determine the conſguration of the system. Here are a few sample mechanical systems involving various numbers of degrees of freedom. 1. One degree of freedom: a particle moving along a ſxed curve in space, a rigid body rotating about a ſxed axis in space while one of its points remains a ſxed inertial point. 2. Two degrees of freedom: a particle moving on a surface, a planar double pendulum. 3. Three degrees of freedom: a particle moving in three-dimensional space, the planar motion of a rigid body, the three-dimensional motion of a rigid body rotating about a ſxed inertial point. 4. Four degrees of freedom: a double pendulum moving in three-dimensional space. 5. Five degrees of freedom: two particles linked by a rigid bar and moving in threedimensional space. 6. Six degrees of freedom: the arbitrary motion of a rigid body in three-dimensional space. The time derivatives of the generalized coordinates are called the generalized velocities. The 2n dimensional space deſned by the generalized coordinates and velocities is called the state space. Example 7.1. The rigid body Consider a rigid body consisting of N particles, where N is a very large number. In the ſrst approach, the conſguration of the rigid body will be deſned by the 3N Cartesian coordinates of its N particles. This representation involves a large number of kinematic constraints that enforce the rigidity of the body: the distance between any two particles of the body must remain constant. To evaluate the number of kinematic constraints, consider four particles of the body located at the vertices of a tetrahedron. This simpliſed conſguration features 4×3 = 12 generalized coordinates, the positions of the four particles, linked by the six kinematic constraints enforcing to the constant length conditions for the six edges
7.2 Generalized coordinates
261
of the tetrahedron. The ſfth particle of the system adds three new generalized coordinates, the Cartesian coordinates of the particle, and three new kinematic constraints, three constant length constraints linking the particle to the previous four. The complete rigid body is then constructed by adding the particles one at a time; each new particle adds three new generalized coordinates and three new constraints. The complete system involves n = 3N generalized coordinates and m = 6 + 3(N − 4) = 3N − 6 kinematic constraints, for a total of d = 3N − (3N − 6) = 6 degrees of freedom. This reasoning establishes the fact that a rigid body involves six degrees of freedom only, a very intuitive fact. A second approach to the representation of a rigid body takes advantage of the of the fact that six parameters only are required to deſne the conſguration of the body. Such a representation could use the Cartesian coordinates of one arbitrary reference point of the body, and three rotation components to deſne the orientation of the body; Euler angles, for instance, could be used for this purpose. The last approach to be discussed here is one that involves 12 generalized coordinates, selected to be the 4×3 = 12 Cartesian coordinates of four points on the body forming a tetrahedron and 6 kinematic constraints, imposing the constant length constraint for the six edges of the tetrahedron. One advantage of this formulation is that it bypasses the need for the nonlinear kinematics associated with rotations: this is a rotationless formulation. Example 7.2. The slider-arm mechanism Figure 7.6 depicts a mechanism consisting of a slider free to move along unit vector ¯ı1 and connected to arm AP of length . The arm is free to rotate in the plane normal to ¯ı1 . This mechanical system features two degrees of freedom: the position of the slider q1 = x1 , and angle q2 = θ between the arm and the horizontal plane, for instance. Indeed, the conſguration of the system is unequivocally deſned once these generalized coordinates are known. Although the number of degrees of freedom, d, is an inherent property of the system, the choice of a speciſc set of generalized coordinates is far from being unique. Consider the following choice of generalized coordinates: q1 = x1 and q2 = x2 . In this case, the number of generalized coordinates still equals the number of degrees of freedom and there are no kinematic constraints. This simple choice, however, might not be the most appropriate: for a given value of q2 , two conſgurations of the system are possible, corresponding to arm positions above and below the horizontal plane, respectively. Alternatively, it is possible to select more generalized coordinates than strictly necessary. For instance, three generalized coordinates could be used to deſne this system, the Cartesian coordinates of point P: q1 = x1 , q2 = x2 , and q3 = x3 . Clearly, this choice does not increase the number of degrees of freedom to three; rather, it implies that a single relationship or kinematic constraint must exist between the three generalized coordinates. Indeed, q2 and q3 must be such that q22 + q32 = 2 . Hence, the system presents two degrees of freedom: 3 generalized coordinates - 1 constraint = 2 degrees of freedom.
262
7 Basic concepts of analytical dynamics
Example 7.3. The crank-slider mechanism Figure 7.7 depicts a crank slider mechanism. An experienced analyst will correctly identify this system as presenting a single degree of freedom; indeed, selecting a single generalized coordinate, q1 = θ1 , unequivocally deſnes the conſguration of the system. i3
x2 x3
O l
x1 i1
q
i2
i2
l1
P O
A
f
A q1
y x
Fig. 7.6. Slider with arm mechanism.
l2
i1
q2 B
Fig. 7.7. Crank slider mechanism.
A less experimented analyst might select four generalized coordinates, q1 = θ1 , q2 = θ2 , q3 = φ, and q4 = x. A second look at the system, however, reveals that these generalized coordinates are linked by a number of constraints q3 = q1 + q2 ,
1 cos q1 + 2 cos q2 = q4 ,
1 2 q4 = = . sin q1 sin q2 sin q3
(7.11)
The ſrst constraint is an angle equality in triangle OAB; the second stems from the projection of segments OA and AB along unit vector ¯ı1 ; ſnally, the last two constraints express the laws of sines in triangle OAB. Consequently, the number of degrees of freedom is: 4 generalized coordinates - 4 constraints = 0 degrees of freedom. This reasoning is erroneous because the four kinematic constraints are not independent. Indeed, the law of sine constraint implies 1 = q4 sin q1 / sin q3 and 2 = q4 sin q2 / sin q3 ; introducing these expressions in the second constraint leads to sin(q1 + q2 )/ sin q3 = 1, a result that is implied by the ſrst constraint. Consequently, three constraints only are independent, and hence, the system presents a single degree of freedom: 4 generalized coordinates - 3 independent constraints = 1 degree of freedom. Here again, the choice of a speciſc set of generalized coordinates is far from being unique. Clearly, each one of the three angles θ1 , θ2 , or φ would be a valid choice for the generalized coordinate. Position x of the piston could be another possible choice for the generalized coordinate, although not a very desirable choice. Indeed, two possible conſgurations of the system are associated with the same value of x: the two conſgurations are mirror images about unit vector ¯ı1 . Furthermore, when x reaches its maximum value, 1 + 2 , i.e., when the two linkages become collinear, the value of x does not accurately determine the position of the system. Let y be the position of point A above unit vector ¯ı1 , see ſg. 7.7; kinematic arguments yield the following relationship
7.3 The virtual displacement and rotation vectors
263
3 1 2 2 1+ 1+ y¯2 = 1 − c¯ x ¯ =1− y2 2 1 1 where x ¯ = x/(1 + 2 ) and y¯ = y/(1 + 2 ). It then follows that d¯ x/d¯ y = −2c¯ y. When y¯ → 0, d¯ x/d¯ y → 0; this means that when y¯ becomes small, generalized coordinate x¯ does not accurately deſne the conſguration of the system.
7.3 The virtual displacement and rotation vectors Consider a particle whose displacement vector is given as r(t) = x1 (t)¯ı1 +x2 (t)¯ı2 + x3 (t)¯ı3 in a Cartesian coordinate system. The variation of the position vector is then δr = δx1¯ı1 + δx2¯ı2 + δx3¯ı3 , where δxi (t), i = 1, 2, 3, are the variations of the corresponding Cartesian coordinates, as deſned by eq. (7.7) for an arbitrary function. The virtual displacement vector Next, the Cartesian coordinates of the particle are assumed to be expressed in terms of generalized coordinates, xi = xi (q1 , q2 , . . . , qn ), i = 1, 2, 3. The variation of this Cartesian coordinate then follows from the deſnition of the variation of a function, eq. (7.2), ∂xi ∂xi ∂xi δxi = δq1 + δq2 + . . . + δqn . ∂q1 ∂q2 ∂qn Applying the same treatment to each coordinate leads to the following expression for the variation of the position vector, δr =
∂r ∂r ∂r δq1 + δq2 + . . . + δqn . ∂q1 ∂q2 ∂qn
(7.12)
The terms variation of position vector, virtual change of the position vector or virtual displacement vector are used interchangeably, because a virtual change in position is, in fact, a virtual change in displacement. The study of constrained dynamical systems will be delayed up to chapter 10. For the remainder of this chapter, it is assumed that the number of generalized coordinates used to represent the system is equal to its number of degrees of freedom, hence, the systems is not subjected to any kinematic constraints. Under this restriction, the virtual displacements deſned by eq. (7.12) are called virtual displacements compatible with the constraints, or kinematically admissible virtual displacements. Comparing the differential and virtual displacement vectors It is interesting to compare eq. (7.12) with its counterpart for the differential position vector or differential displacement vector dr =
∂r ∂r ∂r ∂r dq1 + dq2 + . . . + dqn + dt. ∂q1 ∂q2 ∂qn ∂t
(7.13)
264
7 Basic concepts of analytical dynamics
If the position vector is an explicit function of time, the last term, involving the partial derivative with respect to time, appears in the expression for the differential displacement. This contrasts with the expression for the virtual displacement vector that does not involve partial derivatives with respect to time. This important difference stems from the fact that a virtual displacement is an arbitrary change in displacement at a given, ſxed instant. Consequently, when evaluating virtual displacements, time is held constant, and partial derivatives with respect to time vanish. Dividing eq. (7.13) by a time increment, dt, yields the expression for the velocity vector ∂r ∂r ∂r ∂r r˙ = . (7.14) q˙1 + q˙2 + . . . + q˙n + ∂q1 ∂q2 ∂qn ∂t A comparison between eqs. (7.12), (7.13) and (7.14) for the virtual displacement, differential displacement and velocity vectors, respectively, reveals close similarities, but also important differences among these three concepts. The velocity vector is simply the time derivative of the position vector, a familiar concept. The differential displacement vector is the inſnitesimal change in position resulting from inſnitesimal changes in the generalized coordinates and time. Finally, the virtual displacement vector corresponds to the change in displacement associated with virtual changes in the generalized coordinates at a ſxed instant in time. The differential displacement is the actual displacement resulting from actual inſnitesimal changes in generalized coordinates and time. In contrast, a virtual displacement is associated with an arbitrary virtual changes that bring the conſguration of the system described by generalized coordinates qi to a new conſguration described by generalized coordinates q¯i , at a given, ſxed instant in time. While it is important to keep in mind the fundamental differences between these concepts, the similarities between eqs. (7.12) and (7.14) can be used to expeditiously evaluate virtual displacement vectors. Consider, for instance, the velocity vector expressed in cylindrical coordinates, see eq. (2.91b), and the corresponding expression for virtual displacements v = r˙ e¯1 + rθ˙ e¯2 + z˙ e¯3 ⇐⇒ δr = δr e¯1 + rδθ e¯2 + δz e¯3 . The velocity vector, v, is replaced by the virtual displacement vector, δr, and the ˙ and z˙ are replaced by the corretime derivatives of the generalized coordinates, r, ˙ θ, sponding virtual changes in generalized coordinates, δr, δθ, and δz, respectively. Similar guidelines are used to obtain the expression for the the virtual displacement vector in spherical coordinates from the corresponding expression for the velocity vector, eq. (2.95b), v = r˙ e¯1 + rφ˙ e¯2 + rθ˙ sin φ e¯3 ⇐⇒ δr = δr e¯1 + rδφ e¯2 + rδθ sin φ e¯3 . The virtual rotation vector A striking example of the analogy between velocity and virtual displacement vectors is the concept of virtual rotation vector. The angular velocity vector was deſned by
7.3 The virtual displacement and rotation vectors
265
˙ RT ); the virtual rotation vector, δψ, is deſned in an eq. (4.56) as ω = axial(R analogous manner as δψ = axial(δR RT ). (7.15) In section 4.12.4, the differential rotation vector itself was introduced by analogy to the angular velocity vector, underlining the close connection between the three concepts. Here again, it is crucial to understand that there exist no vector ψ such that δ(ψ) is the virtual rotation vector; to emphasize is important fact, the notation δψ, rather than δψ, is used denote the virtual rotation vector. Note the parallel between the virtual rotation vector deſned here and the differential rotation vector deſned in section 4.12.4. Developments identical to those presented in section 4.12.4 lead to the following important relationship between virtual changes in angular velocity and the virtual rotation vector ˙ −ω δω = δψ δψ. (7.16) Virtual changes in the components of the angular velocity vector expressed in the rotating frame can also be obtained in a similar manner ˙ ∗, δω = R δψ
˙ −ω δω = δψ δψ, ∗
∗
∗
∗
˙ +ω δψ , δω = δψ
∗
˙ δω = R δψ. T
(7.17a) (7.17b)
Example 7.4. The two-bar linkage with slider system Figure 7.8 shows a single degree of freedom planar mechanism. The system is represented by a single generalized coordinate, θ. Determine the kinematically admissible virtual displacement vector at point T in terms of the virtual rotation component, δθ. The position vector of point T is r T = Lb e¯1 , where F A = [A, E = (¯ e1 , e¯2 )] is a frame attached to bar AT at point A. A virtual change is the position vector of point T then becomes δrT = Lb δ¯ e1 = Lb δφ e¯2 . This relationship should be compared with its counterpart for velocities, r˙ T = Lb e¯˙ 1 = Lb φ˙ e¯2 , where φ˙ is the angular velocity of bar AT. The problem now reduces to ſnding a relationship between virtual rotations δφ and δθ, or equivalently, between the angular velocities of bars AT and OB, denoted ˙ respectively. To that effect, the position vector of point B is written in two φ˙ and θ, alternative manners: r B = w e¯1 = Lc a ¯1 , where F O = [O, A = (¯ a1 , a ¯2 )] is a frame attached to bar OB at point O. A virtual change in the position vector of point B then becomes δr B = δw e¯1 + wδφ e¯2 = Lc δθ a ¯2 . Here again, it is interesting to compare ˙ e2 = Lc θ¯ ˙ a2 . this expression with its counterpart relating velocities: r˙ B = w¯ ˙ e1 + wφ¯ The desired result is then obtained by evaluating the scalar product of the virtual displacement by e¯2 to ſnd w δφ = Lc δθ e¯T2 a ¯2 = Lc δθ cos(θ + φ). The virtual displacement at point T then follows as Lb Lc cos(θ + φ)δθ e¯2 . w If so desired, the components of the virtual displacement vector could be evaluated in the ſxed basis I = (¯ı1 , ¯ı2 ) by projecting vector e¯2 along that basis. δr T =
266
7 Basic concepts of analytical dynamics
i3 = a3 e1
I O
T i2
Lb
a1 Lc
a2
B
i1
w
e3
e2
f
q
i1 A
O
i2
d Fig. 7.8. Two bar linkage with slider.
O’
h
g
b2 = e2 Universal joint a1 = b 1 Points O and e2 O’ are coincident. e1
C
Rigid body
Fig. 7.9. Conſguration of the rigid body connected to a universal joint.
Example 7.5. The rigid body/universal joint system Figure 7.9 depicts a rigid body attached to the ground by means of a universal joint. This common mechanical joint, shown in detail in ſg. 7.10, consists of a rigid cruciform articulated to two rigid components, denoted components k and . The cruciform consists of two orthogonal bars, and unit vectors ¯b1 and ¯b2 are aligned with those bars. Component k is articulated with respect to the cruciform and is allowed to rotate about unit vector ¯b1 . Similarly, component is also articulated to the cruciform and rotates about unit vector ¯b2 . Component k of the universal joint is connected to the ground at point O by means of a bearing allowing rotation about axis ¯ı3 . Component is connected to a rigid body at point O . A ſrst planar rotation about axis ¯ı3 , of magnitude φ, brings inertial basis I = (¯ı1 , ¯ı2 , ¯ı3 ) to basis A = (¯ a1 , a ¯2 , a ¯3 ), where a ¯1 is aligned with unit vector ¯b1 of the cruciform. A second planar rotation about axis a ¯1 , of magnitude θ, brings basis A to basis B = (¯b1 , ¯b2 , ¯b3 ), where ¯b2 is the second unit vector aligned with the cruciform. Finally, a third planar rotation about axis ¯b2 , of magnitude ψ, bring basis B to basis E = (¯ e1 , e¯2 , e¯3 ) that is attached to the rigid body. Points O and O are coincident. These three planar rotations describe the orientation of the rigid body using Euler angles with the 3-1-2 sequence, see eq. (4.78). The ſrst planar rotation is prescribed to be φ = Ωt. Compute the velocity of point C, the center of mass of the rigid body. The position vector of point C with respect to point O is denoted η. Because angle φ is a known function of time, the system features two degrees of freedom. Evaluate the kinematically admissible virtual displacement vector of the center of mass in terms of the virtual rotation components δθ and δψ. Let R denote the rotation tensor that brings basis I to basis E. The components of the inertial position of point C in basis I are r C = R η ∗ , where η ∗ are the components of the vector η in the body attached basis, E. The components of the inertial velocity of point C now become v C = R ω ∗ η ∗ , where ω ∗ are the components of
7.3 The virtual displacement and rotation vectors
267
the angular velocity vector of the rigid body resolved in basis E. The relationship between the components of the angular velocity vector and the time derivatives of the Euler angles is given by eq. (4.80), and hence, ⎤⎧ ⎫ ⎡ −Cθ Sψ Cψ 0 ⎨ φ˙ ⎬ v C = R η∗T ⎣ Sθ 0 1⎦ θ˙ , ⎩ ˙⎭ Cθ Cψ Sψ 0 ψ where φ˙ = Ω and the components of the rotation tensor expressed in terms of Euler angles are given by eq. (4.78).
b1
i2 Component k Cruciform
Lb
B
Lc
q
b2 Component l Fig. 7.10. Conſguration of the universal joint.
i1
O d
A
T
Fig. 7.11. Two bar linkage.
Next, the components of the virtual displacement vector of point C are evaluated ∗ η ∗ , where δψ ∗ are the components of the virtual rotation vector of as δr C = R δψ the rigid body resolved in basis E. Using eq. (4.80) once again leads to ⎤⎧ ⎫ ⎡ −Cθ Sψ Cψ 0 ⎨ δφ ⎬ 0 1⎦ δθ . δrC = R η∗T ⎣ Sθ ⎩ ⎭ δψ Cθ Cψ Sψ 0 Because virtual changes are taken at a given, ſxed instant in time, δφ = δ(Ωt) = 0. The virtual displacement vector now becomes ⎤ ⎡ Cψ 0 δθ δr C = R η∗T ⎣ 0 1⎦ . δψ Sψ 0 The components of the velocity or virtual displacement vectors can be evaluated in any basis; for instance, their components in the body attached basis E are RT v C and RT δr C , respectively. This example illustrates an important difference between the velocity and virtual displacement vectors. In contrast with the velocity vector that does depends on the prescribed angular velocity, φ˙ = Ω, the virtual displacement vector is independent of this quantity.
268
7 Basic concepts of analytical dynamics
7.3.1 Problems Problem 7.1. Virtual displacement of a two-bar linkage system The two bar linkage shown in ſg. 7.11 comprises bar OB of length Lb and bar BAT of length Lc . Bar BAT passes through ſxed point A but is free to swivel about that point. (1) Compute the virtual displacement vector of point T in terms of the virtual rotation component, δθ.
7.4 Virtual work and generalized forces The differential work done by a force was deſned as the scalar product of the force vector by the differential displacement vector of its point of application, see eq. (3.8). By analogy, the virtual work done by a force is deſned in this section as the scalar product of the force vector by the virtual displacement vector of its point of application. The concept of virtual work then gives rise to that of generalized forces. 7.4.1 Virtual work The virtual work done by the forces externally applied to a particle is deſned as δW = F T δr.
(7.18)
Note the parallel between the deſnition of the virtual work and that of the differential work, see eq. (3.8). The virtual work corresponds to the work that would be performed by the externally applied forces if the particle were to undergo virtual displacement δr. This contrasts with the differential work that corresponds to the work performed by the same forces when the particle undergoes an actual, inſnitesimal displacement dr. Notation δW denotes the virtual work, but this does not imply the existence of a work function, W , such that δ(W ) is the virtual work. In general, the virtual work is a nonholonomic quantity, i.e., a quantity that cannot be integrated. For a system of N particles, the virtual work is found by summing the contributions of all particles, each undergoing its own virtual displacement δri : δW = )N T i=1 Fi δr i . 7.4.2 Generalized forces As discussed in section 7.2, it is often convenient to
represent theconſguration of a system by a set of generalized coordinates, q T = q1 , q2 , . . . , qn . Let the position vector of a particle be a function of generalized coordinates: r = r(q). The virtual work done by the externally applied forces now becomes ∂r ∂r ∂r T T δW = F δr = F δq1 + δq2 + . . . + δqn ∂q1 ∂q2 ∂qn (7.19) = Q1 δq1 + Q2 δq2 + . . . + Qn δqn ,
7.4 Virtual work and generalized forces
269
where the quantities, Qi = F T
∂r , ∂qi
(7.20)
are called the generalized forces. In section 3.1.4, the forces applied to a particle were shown fall into two categories: conservative forces, i.e., those that can be derived from a potential, and non-conservative forces, i.e., those for which no potential function exists. Similarly, generalized forces that can be derived from a potential are called conservative generalized forces; in this case, a potential function, V , exists such that Qci = −
∂V . ∂qi
(7.21)
The virtual work done by a generalized conservative force, denoted δWc , can now be computed as δWc = −
∂V ∂V ∂V δq1 − δq2 − . . . − δqn = −δ(V ). ∂q1 ∂q2 ∂qn
(7.22)
The virtual work done by a generalized conservative force can be evaluated as the variation of a potential function, V , and becomes an integrable expression. 7.4.3 Virtual work done by internal forces It is of interest to compute the virtual work done by the internal forces of a system. Consider the single degree of freedom, planar mechanism shown in ſg. 7.12; the system is represented by a single generalized coordinate, θ. At ſrst, the virtual work done by the internal force at point B will be computed. To that effect, bar OB is separated from the slider at point B and the corresponding free body diagram is shown in ſg. 7.12a, revealing the internal force vector, F B . The virtual work done by this internal force is δW B = δr TB F B + δr TB (−F B ). In view of Newton’s third law, the internal forces acting at points B and B are of equal magnitudes, opposite directions, and share a common line of action; on the other hand, because the virtual displacements are kinematically admissible, they do not violate the kinematic constraints of the system, and hence, δr B = δr B . The virtual work done by the internal force at point B now becomes δW B = δr TB F B − δr TB F B = 0.
(7.23)
The virtual work done by internal forces vanishes. This important result will be used extensively in many methods of analytical dynamics. The evaluation of the work done by the internal force at point B will now be contrasted with that done by the friction force, F f , acting between the slider and bar AT. The virtual work done by the friction force is δW f = δr TB F f + δr TB (−F f ). Here again, in view of Newton’s third law, the friction forces acting at points B and B are of equal magnitudes, opposite directions, and share a common line of action;
270
7 Basic concepts of analytical dynamics
~
B
e1
a) Internal force at point B
T i2
Lb
a1
O
-F
B¢
B FB
q
f
A
B¢¢
w
f
A
f
-F
Lc
B
a2
w
e2
f
q
i1 A
O
b) Friction force at point B ~
~
B f
F
d Fig. 7.12. Two bar linkage with slider. a) Internal force at point B. b) Friction force in the slider.
the work done by the friction force becomes δW f = F f e¯T1 (−δr B + δr B ), where F f is the magnitude of the friction force. On bar AT, the point of application of the friction force is point B , the material point on bar AT that is at a distance w from point A. Point B is the material point of bar AT located at the instantaneous point of contact between the slider and bar AT, see section 5.4. The position vector of this point is rB = w e¯1 and the virtual displacement vector become δr B = w δ¯ e1 = wδφ e¯2 , because w remains a constant for material point B . With this result, the virtual work done by the friction force becomes δW f = F f e¯T1 (−δr B + wδφ e¯2 ) = −F f e¯T1 δrB . The position vector of point B is r B = w e¯1 , and the virtual displacement vector of this point is then δr B = δw e¯1 + w δφ e¯2 ; the virtual work now becomes δW f = −F f e¯T1 (δw e¯1 + w δφ e¯2 ) = −F f δw. This result is rather intuitive: the virtual work done by the friction force equals the product of the magnitude of the friction force by the virtual displacement of its point of application. Since the friction force is directed along e¯1 , any virtual displacement along the direction perpendicular to the bar, e¯2 , does not contribute to the virtual work. The law of cosines applied to triangle OBA reveals that w2 = d2 + L2c − 2dLc cos θ, and hence, wδw = dLc δθ sin θ. Finally, the virtual work done by the friction force becomes dLc sin θ δθ. (7.24) w The work done by the internal force at point B vanishes, see eq. (7.23), but the work done by the friction force does not, see eq. (7.24). The force at point B is a constraint force: it imposes the kinematic constraint that the displacement of the slider must equal that of the tip of bar AB at all times. The virtual work done by the constraint forces vanishes because the virtual displacements of the points of application of constraint forces, F B and −F B , are identical. The work done by the friction force does not vanish because the virtual displacements of the points of application of friction forces, F f and −F f , are different. δW B = −F f δw = −F f
7.5 The principle of virtual work for statics
271
Indeed, although the position vectors of points B and B are identical, rB = r B = w¯ e1 , the corresponding virtual displacement vectors are not, δr B = δw e¯1 + w δφ e¯2 and δr B = w δφ e¯2 . Point B is the material point of bar AT that is at the location of the point of contact of the slider with the bar. Because point B is a material point of bar AT, the value of w that deſnes its location remains constant, i.e., δw = 0 when computing the virtual displacement δr B . Here again, it is important to distinguish the contact point from the material points that instantaneously coincide with this contact point, see section 5.4. 7.4.4 Problems Problem 7.2. Virtual work done by friction force The two bar linkage shown in ſg. 7.11 comprises bar OB of length Lb and bar BAT of length Lc . Bar BAT passes through ſxed point A but is free to swivel about that point. (1) Assuming that a friction torque, M f , is acting in the joint at point B, compute the virtual work done by this torque. (2) Assuming that a friction force, F f , is acting in the sliding joint at point A, compute the virtual work done by this force. In both cases, express the virtual work in terms of the virtual rotation component, δθ.
Problem 7.3. Two rigid bodies connected by an actuator Figure 7.13 depicts two rigid bodk e1l e3 ies, denoted bodies k and , rel e3 F spectively, connected by an actuator. F L l k Frame F k = K, E k = (¯ ek1 , e¯k2 , e¯k3 ) l P d Pk d i is attached to body k and a similarly K 3 e2k l l deſned frame, F , is attached to body u,R e2l . The conſguration of frame F k is O determined by the position vector, uk , Body l e1k Body k uk, Rk I i of its reference point K and rotation 2 i1 tensor Rk that brings triad I to triad k E . The conſguration of frame F is Fig. 7.13. Two rigid bodies connected connected by deſned by corresponding quantities, an actuator. u and R . The actuator is connected at points Pk and P to bodies k and , respectively. Let dk and d be the position vectors of points Pk and P with respect to the reference points K and L, respectively. The actuator applies known forces F of equal magnitudes and opposite signs to bodies k and , respectively, as indicated on the ſgure. (1) Find the virtual work done by the actuator. (2) Find the generalized forces applied to body k and , respectively. (3) Discuss the physical interpretation of the various generalized force components.
7.5 The principle of virtual work for statics As discussed in section 3.1.2, the static equilibrium condition for a particle, as stated by Newton’s ſrst law, is written as a vector equation that imposes the vanishing of the externally applied forces. In the present section, an alternative formulation will be developed, which results in the principle of virtual work. Although expressed in
272
7 Basic concepts of analytical dynamics
terms of work rather than force vectors, the principle of virtual work will be shown to be equivalent to Newton’s ſrst law. In this section, the principle of virtual work is develop for static problems only; applications of this principle to dynamical system will be treated in chapter 8. The principle will be developed ſrst for a single particle; next, it will be generalized to systems of particles. The principle of virtual work introduces the fundamental concept of “arbitrary virtual displacements” sometimes called “arbitrary test displacements,” or also “arbitrary ſctitious displacements,” and all of these expressions will be used interchangeably. The word “arbitrary” is easily understood: it simply means that the displacements can be chosen in an arbitrary manner without any restriction imposed on their magnitudes or orientations. More difſcult to understand are the words “virtual,” “test,” or “ſctitious.” All three imply that these are not real, actual displacements. More importantly, these ſctitious displacements do not affect the forces acting on the particle. These important concepts will be explained in the following sections. 7.5.1 Principle of virtual work for a single particle Consider a particle in static equilibrium under a set a externally applied loads, as depicted in ſg. 7.14. According to Newton’s ſrst law, the sum of the exters nally applied load must vanish. Next, consider a ſctitious displacement of arbitrary magnitude and orienF2 tation, denoted s. Although the problem appears to be F1 two-dimensional in the ſgure, both forces and ſctitious Fig. 7.14. A particle with displacements are three-dimensional quantities. applied forces subjected to a The virtual work done by the externally applied ſctitious test displacement. forces is now evaluated by computing the scalar product of the externally applied load by the ſctitious displacement vector to ſnd ( W = sT (7.25) F = 0. F3
Because the particle is in static equilibrium, Newton’s ſrst law implies the vanishing of the bracketed term. It follows that the scalar product vanishes for any arbitrary ſctitious displacement. This result sheds some light on the special nature of the ſctitious, or virtual displacements. If the particle ) is in static equilibrium in a given conſguration, the sum of the forces vanishes, i.e., F = 0. Assume now that one of the externally applied forces, say F 1 , is the force acting in an elastic spring connected to the particle. If the particle undergoes a real, but arbitrary displacement, d, the force in the spring will change to become F 1 . All displacement-dependent forces applied) to the particle will change, and the sum of the externally applied loads becomes F . In the new conſguration resulting from the application of the real displacement, d, static ) equilibrium will not be satisſed, i.e., F = 0. Indeed, if the particle is in static equilibrium in the conſguration resulting from the application of an arbitrary displacement, it would be in static equilibrium in any conſguration, which makes little sense.
7.5 The principle of virtual work for statics
273
In contrast with real displacements, virtual or ſctitious displacements do not affect the forces applied to the particle. This means that even in the presence of displacement-dependent loads such as those arising within an elastic spring, if the particle is in static equilibrium, it remains in static equilibrium when virtual or ſctitious displacements are applied. This is the reason why eq. (7.25) remains true for all arbitrary virtual displacements. The discussion thus far has thus established that if the particle is in static equilibrium, eq. (7.25) holds for all arbitrary ſctitious displacements. Next, the following question is asked: if eq. (7.25) holds, is the particle in static equilibrium? Consider ſg. 7.14, and let the components of the applied forces be F 1 = F11¯ı1 + F12¯ı2 + F13¯ı3 , F 2 = F21¯ı1 + F22¯ı2 + F23¯ı3 , and F 3 = F31¯ı1 + F32¯ı2 + F33¯ı3 , while the components of the virtual displacement are s = s1¯ı1 + s2¯ı2 + s3¯ı3 , where I = (¯ı1 , ¯ı2 , ¯ı3 ) is an orthonormal basis. Equation (7.25) now states (F11 + F21 + F31 )s1 + (F12 + F22 + F32 )s2 + (F13 + F23 +) F33 )s3 = 0. At ſrst, assume that the particle is not in static equilibrium, i.e., F = 0. It is always possible to ſnd a particular virtual displacement for which eq. (7.25) will be satisſed. Indeed, for a given set of forces, select s1 and s2 in an arbitrary manner, then solve eq. (7.25) for s3 to ſnd s3 = −[(F11 + F21 + F31 )s1 + (F12 + F22 + F32 )s2 ]/(F13 + F23 + F33 ). Consequently, the fact that eq. (7.25) is satisſed for a particular virtual displacement does not imply that it is in static equilibrium. In fact, even if it is satisſed for many virtual displacements, static equilibrium is still not guaranteed. Indeed, for each new arbitrary choice of s1 and s2 , it is possible to compute an s3 for which eq. (7.25) is satisſed. Different conclusions are reached if eq. (7.25) is satisſed for all arbitrary virtual displacements. Indeed, if (F11 + F21 + F31 )s1 + (F12 + F22 + F32 )s2 + (F13 + F23 + F33 )s3 = 0 for all independently chosen quantities s1 , s2 , and s3 , it follows that F11 + F21 + F31 = 0, F12 + F22 + F32 = 0, and F13 + F23 + F33 = 0, is the only solution of eq. (7.25). In turn, this can be written as (F11) + F21 + F31 )¯ı1 + (F12 + F22 + F32 )¯ı2 + (F13 + F23 + F33 )¯ı3 = 0, and ſnally, )F = 0. Thus, if eq. (7.25) is satisſed for all arbitrary virtual displacements, then F = 0, and the particle is in static equilibrium. In conclusion, if a particle is in static equilibrium, the virtual work done by the externally applied forces vanishes for all arbitrary virtual displacements. Furthermore, it is also true that if the virtual work vanishes for all arbitrary ſctitious test displacements, the sum of the externally applied forces vanishes, and hence, the particle is in static equilibrium. These two facts can be combined into the statement of the principle of virtual work for a particle. Principle 8 (Principle of virtual work for a particle) A particle is in static equilibrium if and only if the virtual work done by the externally applied forces vanishes for all arbitrary virtual displacements. Because the condition for static equilibrium is nothing but Newton’s ſrst law, it follows that the principle of virtual work, which states the condition for static equilibrium, is equivalent to Newton’s ſrst law, and either statement provides a
274
7 Basic concepts of analytical dynamics
fundamental deſnition of static equilibrium. Simple examples will now be used to illustrate the principle of virtual work. Example 7.6. Equilibrium of a particle Consider the particle depicted in ſg. 7.15, which is subjected to two vertical forces F 1 = 1¯ı1 and F 2 = −3¯ı1 . The following question is asked: is the particle in static equilibrium? Rather than relying on Newton’s ſrst law, the principle of virtual work will used to answer the question. Consider the following arbitrary virtual displacement, s = s1¯ı1 + s2¯ı2 , and the associated virtual work W = (1¯ı1 − 3¯ı1 )T (s1¯ı1 + s2¯ı2 ) = −2¯ıT1 (s1¯ı1 + s2¯ı2 ) = −2s1 = 0. The fact that s is an arbitrary virtual displacement implies that s1 and s2 are arbitrary scalars, and hence, W = −2s1 = 0. Because the virtual work done by the externally applied forces does not vanish for all virtual displacements, the principle of virtual work, principle 8, implies that the particle is not in static equilibrium. i1
F1 = 1
i2 L
s1 s2
i2
m
u
i1
F2 = - 3 Fig. 7.15. A particle under the action of two forces.
k m ku s2
s1
mg
Fig. 7.16. A particle suspended to an elastic spring.
It is important to understand the implications of the last part of the principle of virtual work, “for all arbitrary virtual displacements.” Consider the following arbitrary virtual displacement, s = s2¯ı2 , and the associated virtual work W = (1¯ı1 − 3¯ı1 )T s2¯ı2 = −2¯ıT1 s2¯ı2 = 0. This result is due to the fact that the sum of the externally applied loads, −2¯ı1 , is orthogonal to the virtual displacement, s2¯ı2 , and hence, the virtual work vanishes. One might be tempted to conclude from the above result that the particle is in static equilibrium because the virtual work vanishes. To satisfy the principle of virtual work, however, the virtual work must vanish for all arbitrary virtual displacements. The above result shows that the virtual work may vanish for “a particular virtual displacement,” but this is not a sufſcient condition to guarantee static equilibrium. For the two-dimensional problem shown in ſg. 7.15, an arbitrary ſctitious displacement must span the plane of the problem, i.e., must be of the form s = s1¯ı1 + s2¯ı2 . For three-dimensional problems, a three-dimensional virtual displacement must be selected, s = s1¯ı1 + s2¯ı2 + s3¯ı3 , where s1 , s2 , and s3 are three arbitrary scalars, and I = (¯ı1 , ¯ı2 , ¯ı3 ) a basis that spans the three-dimensional space.
7.5 The principle of virtual work for statics
275
Example 7.7. Equilibrium of a particle connected to an elastic spring Consider next a particle in static equilibrium under the effect of gravity and the restoring force of an elastic spring of stiffness constant k, as depicted in ſg. 7.16. Find the displacement of the particle in its actual static equilibrium conſguration. For this two-dimensional problem, assume that the particle is at position u. An arbitrary ſctitious displacement is selected as s = s1¯ı1 + s2¯ı2 , where s2 and s2 are two arbitrary scalars. The virtual work done by the externally applied loads becomes W = (mg¯ı1 − ku¯ı1 )T (s1¯ı1 + s2¯ı2 ) = [mg − ku]s1 . The principle of virtual work now implies that the particle is in static equilibrium at position u if and only if the virtual work done by the externally applied loads vanishes for all arbitrary virtual displacements, i.e., if and only if [mg−ku]s1 = 0 for all values of s1 . Equation [mg − ku]s1 = 0 possesses two solutions, [mg − ku] = 0 or s1 = 0; the second solution, however, is not valid because, as implied by the principle of virtual work, s1 is arbitrary. In conclusion, the vanishing of the virtual work for all arbitrary virtual displacements implies that mg − ku = 0, and the equilibrium conſguration of the system is found as u = mg/k. Of course, the same conclusion can be drawn more expeditiously from a direct application of Newton’s ſrst law, which requires the sum of the externally applied forces to vanish, i.e., mg¯ı1 − ku ¯ı1 = 0, or (mg − ku)¯ı1 = 0, and ſnally, mg − ku = 0. This example involves the restoring force of an elastic spring, a displacementdependent force. Indeed, the elastic force in the spring is −ku¯ı1, and if the particle undergoes a real downward displacement of magnitude d, the restoring force becomes −k(u + d)¯ı1 . In contrast, if the particle undergoes a virtual downward displacement of magnitude s1 , the restoring force remains unchanged as −ku¯ı1 . This difference has profound implications on the computation of work. First, consider the work done by the elastic force, −ku¯ıT1 du ¯ı1 , under a virtual displacement, s1 , %
u+s1
W = u
% −ku du = −ku
u+s1
u+s1
du = −ku [u]u
= −kus1 .
(7.26)
u
It is possible to factor out the elastic force, −ku, from the integral because this force remains unchanged by the virtual displacement, and hence, it can be treated as a constant. In contrast, the work done by the same elastic force under a real displacement, d, is % u+d u+d 1 1 W = −ku du = − ku2 = −kud − kd2 . (7.27) 2 2 u u In this case, the real work includes an additional term that is quadratic in d and represents the work done by the change in force that develops due to the stretching of the spring. Even if the magnitude of the real displacement is equal to that of the virtual displacement, i.e., even if d = s1 , the two expressions for the work done by the elastic restoring force differ.
276
7 Basic concepts of analytical dynamics
These observations help explain the terminology used when dealing with the principle of virtual work. The concept of virtual displacement is key to the correct use of the principle of virtual work, which requires the virtual work done by displacement-dependent forces to be evaluated according to eq. (7.26) rather than eq. (7.27). Of course, the real work done by the elastic force as it undergoes a real displacement is correctly evaluated by eq. (7.27). Clearly, it is important to keep in mind the crucial difference between “real displacements” and “virtual” or “ſctitious displacements.” The words “virtual” or “ſctitious” are used to emphasize the fact the forces remain unaffected by these displacements. In practice, the term “real displacement” is rarely used; real displacements are simply called displacements. The terms “virtual,” “ſctitious,” or “test displacements” all imply that the forces acting on the system remain unaffected by the application of such displacements. The term “virtual displacement” is the most widely used. Example 7.8. Equilibrium of a particle sliding on a track Figure 7.17 shows a particle of mass m sliding on a track. The externally applied horizontal force is resited by friction between the particle and track. Newton’s ſrst law expresses the condition for static equilibrium as mg ¯ı1 − R ¯ı1 + P ¯ı2 − F ¯ı2 = 0, where −R ¯ı1 is the reaction force the track exerts on the particle, and −F ¯ı2 the friction force applies to the particle. The four forces applied to the particle are of different physical natures: P ¯ı2 is an externally applied force, mg¯ı1 the force of gravity, −R ¯ı1 a reaction force, and −F ¯ı2 a friction force. Yet, all forces play an equal role in Newton’s law, which states that the sum of all forces must vanish. The law simply states “all forces” without making any distinction among them. Newton’s ſrst law is readily solved to ſnd (mg−R) ¯ı1 + (P − F ) ¯ı2 = 0, and ſnally R = mg and F = P , as expected. Track
mg
P
m
i2 i1
s2 P
F s1
R
Fig. 7.17. A particle sliding on a track.
Next, the principle of virtual work will be used to solve the same problem. For this two dimensional problem, an arbitrary virtual displacement will be written as s = s1 ¯ı1 + s2 ¯ı2 , and the vanishing of the virtual work it performs implies W = (mg ¯ı1 − R ¯ı1 + P ¯ı2 − F ¯ı2 )T (s1 ¯ı1 + s2 ¯ı2 ) = [mg − R]s1 + [P − F ]s2 = 0.
(7.28)
Following a reasoning similar to that developed in the previous example, it is easy to show that the vanishing of the virtual work for all arbitrary scalars s1 and s2 implies the vanishing of the two bracketed terms in the above equation: mg − R = 0
7.5 The principle of virtual work for statics
277
and P − F = 0. This result is identical to that obtained from Newton’s ſrst law, as expected, since the principle of virtual work and Newton’s ſrst law are identical. This example illustrates a crucial relationship between Newton’s ſrst law and the principle of virtual work. The projection of Newton’s law along unit vectors ¯ı1 and ¯ı2 yields two scalar equilibrium equations, mg − R = 0 and P − F = 0, respectively. The same two equilibrium equations are obtained by imposing the vanishing of the factors multiplying the arbitrary virtual displacement components, s1 and s2 , resolved along the same unit vectors, ¯ı1 and ¯ı2 , respectively. The principle of virtual work yields scalar equilibrium equations which are the projections of Newton’s ſrst law along the directions associated with the virtual displacement components. Because it is based on a scalar quantity, the virtual work, the principle of virtual work yields scalar equations of equilibrium, rather than their vector counterparts inherent to the application of Newton’s ſrst law. 7.5.2 Kinematically admissible virtual displacements Example 7.8 illustrates an important feature of virtual displacements, which are selected to have components in the horizontal direction, s2¯ı2 , and the vertical direction, s1¯ı1 . This raises a basic question: how could the particle move in the vertical direction when it is constrained to remain on the track? The answer to this question lies in the nature of the virtual displacements that are not real, but rather are virtual or ſctitious displacements. Of course, the particle cannot possibly undergo real displacements in the vertical direction because it must remain on the track, but virtual or ſctitious displacements in that same direction are allowed. In the derivation of the principle of virtual work, it is necessary to use completely arbitrary virtual displacements to prove that the vanishing of the virtual work implies Newton’s ſrst law. The completely arbitrary nature of the virtual displacements is key to the successful use of the principle of virtual work. The expression, “arbitrary virtual displacements” means any virtual displacements, including those that violate the kinematic constraints of the problem. In ſg. 7.17, the particle is conſned to remain on the track; it can move along the track, but not in the direction perpendicular to it. The direction along the track is called the kinematically admissible direction, and the direction normal to it is called the kinematically inadmissible direction, or the infeasible direction. It is sometimes convenient to introduce the concept of kinematically admissible virtual displacements. These are virtual displacements that satisfy the kinematic constraints of the problem. For the problem depicted in ſg. 7.17, the kinematic constraint enforces the particle to remain on the track. Arbitrary virtual displacements are written as s = s1 ¯ı1 + s2 ¯ı2 , but since these include a component in the vertical direction, i.e., in a kinematically inadmissible direction, these are not kinematically admissible virtual displacements. On the other hand, virtual displacements of the form s = s2 ¯ı2 , are kinematically admissible because these are oriented along the track. At this point, the relationship between kinematic constraints and reaction forces should be clariſed. Reaction forces are those forces arising from the enforcement of
278
7 Basic concepts of analytical dynamics
kinematic constraints. The particle depicted in ſg. 7.17 is constrained to move along the track, and this kinematic constraint gives rise to a reaction force. Note that the reaction force acts along the kinematically inadmissible direction, i.e., the direction normal to the track. Consider now the virtual work done by the reaction force under arbitrary virtual displacements, W = (−R¯ı1 )T (s1¯ı1 + s2¯ı2 ) = −Rs1 = 0. Next, consider the virtual work done by the same reaction force under arbitrary kinematically admissible virtual displacements, W = (−R ¯ı1 )T (s2 ¯ı2 ) = 0. Because the reaction force acts along the infeasible direction and the kinematically admissible virtual displacement is along the admissible direction, these two vectors are normal to each other, and hence, the virtual work done by the reaction force vanishes. In contrast, the work done by the same reaction force under arbitrary virtual displacements does not. The vanishing of the virtual work done by reaction forces under kinematically admissible virtual displacements has profound implications for applications of the principle of virtual work. The principle is repeated here: “a particle is in static equilibrium if and only if the virtual work done by the externally applied forces vanishes for all arbitrary virtual displacements”. Because this principle calls for the use of arbitrary virtual displacements, it is of crucial importance to treat reaction forces as externally applied forces. For instance, in example 7.8, the virtual work done by the reaction force must be included in the statement of the principle, as is done in eq. (7.28), because completely arbitrary virtual displacements are used. Consider now a modiſed version of the principle of virtual work: “a particle is in static equilibrium if and only if the virtual work done by the externally applied forces vanishes for all arbitrary kinematically admissible virtual displacements”. Rather than considering completely arbitrary virtual displacements, only kinematically admissible virtual displacements are considered now. Because the virtual work done by the constraint forces vanishes for kinematically admissible virtual displacements, constraint forces are automatically eliminated from this statement of the principle of virtual work. This often simpliſes the statement of the principle because fewer terms are involved. On the other hand, because the constraint forces are eliminated from the formulation, this modiſed principle will not yield the equations required to evaluate the reaction forces, which are often quantities of great interest. As pointed out earlier, Newton’s ſrst law requires the sum of all forces to vanish for static equilibrium to be achieved. The “sum of all forces” involves all forces without distinction. While the principle of virtual work is shown to be identical to Newton’s ſrst law, this principle creates an important distinction between reaction forces stemming from kinematic constraints, and all other forces. Indeed, reaction forces, also called forces of constraint, can be completely eliminated from the formulation by using kinematically admissible virtual displacements.
7.5 The principle of virtual work for statics
279
All other forces, such as those generated by springs, gravity, friction, temperature, electric or magnetic ſelds, are of a physical origin. It is easy to recognize such forces because their description involves physical constants that can only be determined by experiment. For instance, the stiffness constant of a spring, the universal constant of gravitation appearing in gravity forces, or the friction coefſcient appearing in Coulomb’s friction law. All these forces are referred to as natural forces, which can be further differentiated into internal and external forces. Internal forces are natural forces arising from and reacted within the structural system under consideration, whereas external forces are natural forces that act on the system but stem from outside it; these forces are also called externally applied loads. Example 7.9. Equilibrium of a particle sliding on a track Consider once again the particle of mass m sliding on a track and shown in ſg. 7.17. For this simple problem, the kinematically admissible direction is along axis ¯ı2 , while the infeasible direction is along axis ¯ı1 . The free body diagram in the right part of ſg. 7.17 shows the forces acting on the particle. The reaction force, −R¯ı1 , acts in the infeasible direction, as expected. In contrast with example 7.8, which uses completely arbitrary virtual displacements, kinematically admissible virtual displacements will be used here, and hence, s = s2¯ı2 . The vanishing of the virtual work then implies W = (mg ¯ı1 − R ¯ı1 + P ¯ı2 − F ¯ı2 )T s2 ¯ı2 = [P − F ]s2 = 0. Because s2 is an arbitrary quantity, the bracketed term must vanish, leading to F = P. First, reaction force R is eliminated from the formulation: the statement of the principle of virtual work becomes simply (P − F )s2 = 0 for all values of s2 . The reaction force does not appear in this statement. It is also possible to apply external loads along the infeasible direction: for instance, in this problem, gravity loads act in the infeasible direction and are also eliminated from the formulation. Of course, if gravity acts along the kinematically admissible direction, i.e., along the track, this force will appear in the statement of the principle. In contrast, reaction forces always act along the infeasible direction and hence, are always eliminated from the formulation. Second, note that less information about the system is obtained. In example 7.8 that uses arbitrary virtual displacements, two equations are obtained: F = P and R = mg. In contrast, the use of kinematically admissible virtual displacements yields a single equation, F = P . On the other hand, the solution process is simpler and involves one single equation; however, no information about the reaction force is available. Finally, it is shown here that the modiſed version of the principle of virtual work stating “a particle is in static equilibrium if and only if the virtual work done by the externally applied forces vanishes for all arbitrary kinematically admissible virtual displacements,” is not entirely correct. The vanishing of the virtual work for all kinematically admissible virtual displacements is a necessary condition, but it is not sufſcient, because it does not guarantee equilibrium of the particle in the infeasible
280
7 Basic concepts of analytical dynamics
direction. Indeed, this latter condition, R = mg, is not recovered by the modiſed principle. Example 7.10. Equilibrium of a particle on a curved track Figure 7.18 depicts a particle of mass m constrained to move on a semi-circular track of radius R under the combined effects of gravity, friction, and elastic forces. Determine the equilibrium position of the particle and the forces acting on it in the equilibrium state. The spring of stiffness constant k is pinned at point C located at coordinates x1 = c1 R and x2 = c2 R and its un-stretched length vanishes. Force N is the reaction force acting on the particle due to its contact with the track and acts in direction n ¯, which is normal to the track. Force F is the force exerted by the track on the particle and acts in the tangential direction, t¯; this force arises from friction between the particle and track.
i2
k
t R
t
C (c1R, c2R)
n m
n q
i1
N
m
Fs = kD F
mg
mg Fig. 7.18. Particle constrained to slide with friction on a circular track.
The position of the particle on the track is conveniently represented by angle θ. The unit vector tangent to the circular track is given by t¯ = − sin θ ¯ı1 + cos θ ¯ı2 , and the normal to the track is n ¯ = − cos θ ¯ı1 −sin θ ¯ı2 . For this problem, the kinematically admissible direction is t¯, and n ¯ the infeasible direction. In contrast with the previous example, the admissible direction is not a ſxed direction in space, but instead, it depends on the position of the particle on the track, t¯ = t¯(θ). The reaction force of magnitude N acts along the infeasible direction, as expected. The friction force of magnitude F acts in the admissible direction. The force, F s , applied by the elastic spring to the particle is given by the spring stiffness constant times the distance between the particle and point C and is oriented in that same direction: F s = kR[(c1 − cos θ)¯ı1 + (c2 − sin θ)¯ı2 ]. This can be expressed in terms of admissible and infeasible directions, t¯ and n ¯ , respectively, as F s = kR[(−c1 sin θ + c2 cos θ)t¯ + (1 − c1 cos θ − c2 sin θ)¯ n] where use is made of the following relationships: ¯ı1 = − sin θ t¯ − cos θ n ¯ and ¯ı2 = cos θ t¯ − sin θ n ¯. An arbitrary virtual displacement of the form s = st t¯ + sn n ¯ is selected, where st and sn are arbitrary quantities. The virtual work done by the forces acting on the particle now becomes
7.5 The principle of virtual work for statics
281
W = {kR [(−c1 sin θ + c2 cos θ)t¯ + (1 − c1 cos θ − c2 sin θ)¯ n] + N n ¯ − F t¯ T + mg(− cos θt¯ + sin θ¯ n)} (st t¯ + sn n ¯) = [kR(−c1 sin θ + c2 cos θ) − F − mg cos θ] st
+ [kR(1 − c1 cos θ − c2 sin θ) + N + mg sin θ] sn . Because the virtual work must vanish for arbitrary st and sn , the two bracketed terms must vanish, leading to the two equilibrium equations of the problem, kR( − c1 sin θ + c2 cos θ) − mg cos θ,
(7.29a)
N = −kR(1 − c1 cos θ − c2 sin θ) − mg sin θ.
(7.29b)
F =
This forms a set of two equations for the three unknowns of the problem: the reaction force, N , the friction force, F , and the equilibrium position of the particle, θ. One additional equation is required to solve the problem. Coulomb’s law of static friction requires the friction force to be smaller than the normal contact force multiplied by the static friction coefſcient, μs , i.e., |F | ≤ μs |N |. Substituting the friction and normal forces from eqs. (7.29a) and (7.29b), respectively, leads to kR(−c1 sin θ + c2 cos θ) − mg cos θ ≤ ±μs [−kR(1 − c1 cos θ − c2 sin θ) − mg sin θ]. This equation can be solved to ſnd two solutions, θ and θu : the particle is in equilibrium for all conſgurations, θ, such that θ ≤ θ ≤ θu . Next, kinematically admissible virtual displacements of the form s = st t¯ will be selected, where st is an arbitrary quantity. The virtual work done by the forces acting on the particle then becomes n] + N n ¯ − F t¯ W = {kR [(−c1 sin θ + c2 cos θ)t¯ + (1 − c1 cos θ − c2 sin θ)¯ T
+ mg(− cos θt¯ + sin θ¯ n)} st t¯ = [kR(−c1 sin θ + c2 cos θ) − F − mg cos θ] st . Because the virtual work must vanish for all arbitrary st , the bracketed term must vanish, yielding a single equilibrium equation of the problem, which is the same as eq. (7.29a) above. As expected, the normal reaction force, N , is eliminated from the formulation. The problem still features three unknowns, N , F , and θ, and the addition of the static friction law provides a second equation for the problem. Clearly, the principle of virtual work with kinematically admissible virtual displacements does not provide enough equations to solve this problem. This is because the static friction law establishes a relationship between friction and normal forces. By eliminating the normal contact force from the formulation, the use of kinematically admissible virtual displacements yields too little information to solve the problem. If friction is neglected, the friction force will vanish, F = 0, and the single equation stemming from the use of kinematically admissible virtual displacements yields the solution of the problem, kR(−c1 sin θ + c2 cos θ) − mg cos θ = 0, or tan θ = (c2 − mg/kR)/c1. In summary, when using kinematically admissible virtual displacements, the principle of virtual work yields a reduced set of equilibrium equations from which the
282
7 Basic concepts of analytical dynamics
forces of constraints are eliminated. This often greatly simpliſes and streamlines the solution process. In some cases, however, too few equations are obtained, giving the false impression that the problem cannot be solved. Arbitrary virtual displacements, i.e., virtual displacements that violate the kinematic constraints must then be used to obtain the missing equations of equilibrium, which correspond to the projection of Newton’s ſrst law along the infeasible directions. 7.5.3 Use of inſnitesimal displacements as virtual displacements In the previous sections, three-dimensional virtual displacements are denoted s = s1¯ı1 + s2¯ı1 + s3¯ı3 , where s1 , s2 , and s3 are arbitrary quantities. In view of the fundamental role they play in energy and variational principles, a special notation is commonly used to denote virtual displacements, s = δu.
(7.30)
The symbol “δ” is placed in front of the displacement vector, u, to indicate that it should be understood as a virtual displacement. Similarly, the virtual work done by a force undergoing a virtual displacement will be denoted δW to distinguish it from the real work done by the same force undergoing real displacements. The new notation changes nothing to the special nature of virtual displacements, which are ſctitious displacements that do not alter the applied forces. In many applications of the principle of virtual work, it will also be convenient to use virtual displacements of inſnitesimal magnitude. Because virtual displacements are of arbitrary magnitude, virtual displacements of inſnitesimal magnitude qualify as valid virtual displacements. The inſnitesimal magnitude of virtual displacements is a convenience that often simpliſes algebraic developments, but is by no means a requirement. Displacement-dependent forces A key simpliſcation arising from the use of virtual displacements of inſnitesimal magnitude is that displacement-dependent forces automatically remain unaltered by their application, as illustrated in the following example. Example 7.11. Equilibrium of a particle connected to an elastic spring Consider a particle connected to an elastic spring, as illustrated in ſg. 7.19. This is the same problem treated in example 7.7. The principle of virtual work requires that δW = (mg ¯ıT1 − ku ¯ıT1 )(δu ¯ı1 + δv ¯ı2 ) = [mg − ku]δu = 0, for all virtual displacements, δu, where the virtual displacements must leave the forces applied to the particle unchanged. Consider now a virtual displacement of inſnitesimal magnitude, δu = du. The virtual work done by this virtual displacement of inſnitesimal magnitude is still given by eq (7.27) as
7.5 The principle of virtual work for statics
spring force
i2 L i1 u
283
k m ku
ku dW = ku du s2
s1
mg
du
u
Fig. 7.19. Use of a differential displacement as a virtual displacement.
%
u+du
u
1 −ku du = − ku2 2
u+du u
1 = −kudu − k(du)2 = −ku du, 2
where the last equality follows from neglecting the higher-order differential quantity. The virtual work is now equal to the real work done by an inſnitesimal displacement of magnitude du = δu. The right portion of ſg. 7.19 illustrates the differential work, dW , for a displacement of inſnitesimal magnitude. Rigid bodies Next, the close relationship between inſnitesimal displacements and virtual displacements of inſnitesimal magnitude will be explored further in the context of rigid bodies. Consider two material points, P and Q, of a rigid body. When the rigid body undergoes arbitrary motions, the velocities of these two points must satisfy eq. (5.22), v P = v Q + ω r QP , where v P and v Q are the velocities of points P and Q, respectively, ω is the angular velocity of the rigid body, and r QP the position vector of point P with respect to Q. This relationship is now written as duP /dt = duQ /dt + (dψ/dt)r QP , where duP and duQ are the inſnitesimal displacement vectors of points P and Q, respectively, and dψ is the differential rotation vector for the rigid body. After multiplication by dt, the differential displacements r . are found to satisfy the following equation, duP = duQ + dψ QP Because virtual displacements can be of inſnitesimal magnitude, it is possible to write r . δuP = δuQ + δψ (7.31) QP where δuP and δuQ are the virtual displacement vectors of arbitrary points P and Q, respectively, and δψ is the virtual rotation vector for the rigid body. Equation (7.31) describes the ſeld of kinematically admissible virtual displacements for a rigid body. Indeed, these virtual displacements satisfy the kinematic constraints for two points belonging to the same rigid body. The discussion of the previous paragraph underlines the close relationship between inſnitesimal quantities, denoted with symbol “d,” and virtual quantities, denoted with symbol “δ.” To obtain eq. (7.31) symbol “d” is replaced by “δ” in the last step of the reasoning. While this approach is correct, it must be emphasized that virtual displacements remain ſctitious displacements, whereas inſnitesimal displacements are real displacements. Furthermore, virtual displacements leave the forces
284
7 Basic concepts of analytical dynamics
unchanged, whereas no such requirement applies for real inſnitesimal displacements. Finally, virtual displacements are allowed to violate the kinematic constraints, whereas real displacement are not. Using virtual displacements of inſnitesimal magnitude greatly simpliſes the treatment of many problems. In the mathematical treatment of virtual quantities, a branch of mathematics called calculus of variations, virtual quantities are systematically assumed to be of inſnitesimal magnitude [24, 25]. 7.5.4 Principle of virtual work for a system of particles
r2
O
F2
2
Fi f f23 21 f 2i fi2 fi1 fi3 f1i i f r3 3i f32
ri r1
f12
F1 1 f13
f31
F3 3
Fig. 7.20. A system of particles.
Figure 7.20 depicts a system of N particles. This problem is treated in section 3.4 using the classical Newtonian approach. Particle i is subjected to an external force, F i , and to N − 1 interaction forces, f ij , j = 1, 2, . . . , N , j = i. For particle i, the virtual work, denoted δWi , done by all applied forces when subjected to a virtual displacement, δui , is δWi = (F Ti +
N (
f Tij )δui .
(7.32)
j=1,j=i
According to the principle of virtual work, this virtual work must vanish for all virtual displacements, δui . The principle can be applied to each particle independently, leading to δWi = 0, where δWi is given by eq. (7.32), for i = 1, 2, . . . N . Because the virtual work must vanish for each particle independently, the sum of the virtual work for all particles must also vanish, leading to the following statement of the principle of virtual work for a system of N particles: a system of particle is in static equilibrium if and only if the virtual work, ⎧⎡ ⎫ ⎤ N ⎨ N ⎬ ( ( T T ⎣F i + δW = f ij ⎦ δui , (7.33) ⎩ ⎭ i=1
j=1,j=i
vanishes for all virtual displacements, δui , i = 1, 2, . . . , N . Because the N virtual displacements are all arbitrary and independent, the bracketed term in eq. (7.33) must vanish for i = 1, 2, . . . , N , leading to equilibrium equations that are identical to those obtained from Newton’s ſrst law. Because each of the N virtual displacement vectors involves three scalar components, the principle of virtual work yields 3N scalar equations for a system of N particles; all must be satisſed for the system to be in static equilibrium. The system is said to present 3N degrees of freedom. For a two-dimensional, or planar system, the number of scalar equations would reduce to 2N , i.e., 2N degrees of freedom. The above developments have shown, once again, that the principle of virtual work is equivalent to Newton’s ſrst law, and gives the necessary and sufſcient
7.5 The principle of virtual work for statics
285
conditions for the static equilibrium of the system. Equilibrium is the most fundamental requirement in structural analysis, and must always be satisſed. This means that Newton’s ſrst law and the principle of virtual work, because they are both equivalent, always apply. The system of particles considered above is very general; it could represent a rigid body, a ƀexible body deforming elastically or plastically, a ƀuid, or a planetary system. Yet, the same equilibrium requirements apply equally to all systems. Internal and external virtual work Equation (7.33) also affords another important interpretation. The forces acting on the system are separated into two groups, the externally applied forces, F i , and the internal forces, f ij . The words “internal” and “external” should be understood with respect to the system of particles. Internal forces act and are reacted within the system, and external forces act on the system but are reacted outside the system. The virtual work done by the external and internal forces, denoted δWE and δWI , respectively, are deſned as δWE =
N ( i=1
δWI =
N ( i=1
F Ti δui , ⎡ ⎣
N (
(7.34a) ⎤ f Tij ⎦ δui ,
(7.34b)
j=1,j=i
respectively. With these deſnitions, eq. (7.33) is becomes δW = δWE + δWI = 0,
(7.35)
for all arbitrary virtual displacements. This leads to the principle of virtual work for a system of particles. Principle 9 (Principle of virtual work for a system of particles) A system of particles is in static equilibrium if and only if the sum of the virtual work done by the internal and external forces vanishes for all arbitrary virtual displacements. Finally, note that because the virtual displacements are arbitrary, it is possible to choose them to be the actual displacements, and eq. (7.35) then implies W = WE + WI = 0,
(7.36)
where WE and WI are the actual work done by the external and internal forces, respectively. Equation (7.36) states that if a system of particles is in static equilibrium, the sum of the work done by the internal and external forces vanishes.
286
7 Basic concepts of analytical dynamics
Euler’s laws The 3N scalar equations implied by the vanishing of the virtual work expressed in eq. (7.33) are often cumbersome to use because they all involve the interaction forces between the particles of the system. To obtain equations that are more convenient to use, a special set of virtual displacements will be selected. Inspired by eq. (7.31), the virtual displacement of particle i is written as r , δui = δuB + δψ i
(7.37)
where δuB is the virtual displacement of the reference point B of the rigid body, δψ the virtual rotation vector, and r i the relative position vector of particle i with respect to point B. The virtual displacements of all particles are now expressed in terms of a virtual translation of the rigid body, δuB , and its virtual rotation, δψ, both chosen to be of inſnitesimal magnitude. This corresponds to 6 independent virtual displacement components, far fewer than the original 3N . The virtual work done by all forces acting on the system under these virtual displacements is ⎫ ⎧⎡ ⎤T 2 1N ⎪ ⎪ N N ⎬ ⎨ ( ( ( T ⎣ ⎦ δW = f ij (δuB + δψ r i ) = F i δuB F + ⎪ ⎪ i ⎭ i=1 ⎩ i=1 j=1,j=i ⎛ ⎞ N N N N N ( ( ( ( ( T r + r . +⎝ f ij ⎠ δuB + F Ti δψ f Tij δψ i i i=1 j=1,j=i
i=1
i=1 j=1,j=i
The last two terms of this expression can be simpliſed using identity (1.33h), and the above equation now becomes ⎛ ⎞ 1N 2 N N ( ( ( δW = δuTB F i + δuTB ⎝ f ij ⎠ i=1
1 + δψ T
N (
i=1 j=1,j=i
⎛
2 ri F i
+ δψ T ⎝
i=1
N N ( (
⎞ ri f ij ⎠ .
i=1 j=1,j=i
In view of eqs. (3.59) and (3.62), the terms in the second and last sets of parenthesis now vanish, reducing the expression to N N ( ( T T F i + δψ ri F i = δuTB F + δψ T M B , δW = δuB i=1
i=1
where the last equality follows from eqs. (3.58) and (3.60). Because the virtual work must vanish for all virtual displacements and virtual rotations, the sum of the externally applied forces and moments must vanish, F = 0 and M B = 0. Clearly, these two equations are identical to Euler’s ſrst and second laws obtained directly
7.5 The principle of virtual work for statics
287
from Newtonian arguments, see eqs. (3.70) and (3.75). The present problem is a static problem, and hence, the time derivatives of the linear and angular momenta appearing in eqs. (3.70) and (3.75) are absent. These two vector equations are necessary but not sufſcient conditions to guarantee static equilibrium. Indeed, static equilibrium requires a total of N vector equations to be satisſed; eqs. (3.70) and (3.75) are two linear combinations of those N equations. Only two vector equations are obtained from the principle of virtual work because the virtual displacement ſeld, eq. (7.37), selected for the rigid body involves a single virtual displacement vector, δuB , and a single virtual rotation vector, δψ. 7.5.5 The use of generalized coordinates In the previous sections, the conſguration of the system was represented by the Cartesian coordinates of the various particles. As discussed in section 7.2, it is often convenient to represent the conſguration of the system by means of generalized coordinates, which give rise to the concept of the generalized forces deſned by eq. (7.20). When using generalized coordinates, the virtual work done by a force is expressed by eq. (7.19). This expression can be written for both internal and external forces, leading to N (
δWI =
QIi δqi ,
(7.38a)
QE i δqi ,
(7.38b)
i=1
δWE =
N ( i=1
where QIi and QE i are the generalized forces associated with the internal forces and externally applied loads, respectively. The principle of virtual work, expressed by eq. (7.35), now becomes δWI + δWE =
N ( i=1
QIi δqi +
N ( i=1
QE i δqi =
N ( I Qi + QE i δqi = 0, i=1
for all virtual generalized displacements, δqi . Because the virtual generalized displacements, δqi , are arbitrary, each of the N bracketed terms under the summation sign must vanish, leading to QIi + QE i = 0,
i = 1, 2, . . . , N.
(7.39)
This equation represents yet another statement of the principle of virtual work. As discussed in section 7.5.2, the principle of virtual work can be used with either arbitrary or kinematically admissible virtual displacements. Similarly, the present statement of the principle can be used with either arbitrary or kinematically admissible virtual changes in generalized coordinates. When using arbitrary virtual generalized coordinates, the virtual work done by the reaction forces must be included in
288
7 Basic concepts of analytical dynamics
the evaluation of the virtual work done by the external forces; this implies that the generalized forces associated with the reaction forces must be included in QE i . If the virtual generalized coordinates are kinematically admissible, the reaction forces are eliminated from the formulation. Example 7.12. Pendulum with torsional spring A rigid arm of length R connects mass m to a pinned support point where a torsional spring of stiffness constant k acts between ground and the rod. The torsional spring is un-stretched when the arm is horizontal. The mass is subjected to gravity loading. The conſguration of the system is conveniently represented by the angular position, φ, of the arm and is selected to be the single generalized coordinate for this single degree of freedom problem. Consider ſrst the virtual work done by the gravity load, δWE = −mg¯ıT2 δuT , where δuT is the virtual displacement at point T. Since uT = R(cos φ ¯ı1 + sin φ ¯ı2 ), an inſnitesimal virtual displacement of the same quantity is δuT = R(− sin φ ¯ı1 + cos φ ¯ı2 )δφ. It now follows that δWE = −mgR cos φ δφ, and by deſning the genE eralized force as QE φ = −mgR cos φ, the virtual work becomes δWE = Qφ δφ. The same result can be obtained in a more expeditious manner by using eq. (7.20) to ſnd QE ıT2 ∂uT /∂φ = −mg¯ıT2 R(− sin φ ¯ı1 + cos φ ¯ı2 ) = −mgR cos φ. φ = −mg¯ An even simpler interpretation is as follows. Because the virtual displacement is a rotation, δφ, it must be multiplied by a moment to yield a virtual work; hence, the generalized force is simply the moment of the gravity load, −mgR cos φ. For this problem, the virtual work done by the internal forces reduces to the virtual work done by the restoring moment of the elastic spring, δWI = −kφ δφ = QIφ δφ, where QIφ = −kφ is the generalized internal force of the system. The generalized force is, in this case, a moment, and hence, the expression “generalized force” must be interpreted carefully.
i2
T m R
k
f
R
mg i1
Fig. 7.21. Pendulum with torsional spring.
f i1
k T
i2
m mg
Fig. 7.22. Rotating mass with vertical spring.
The principle of virtual work, eq. (7.39), yields the equilibrium equation for the system as QIφ + QE φ = −mgR cos φ − kφ = 0. This is a transcendental equation, but if the angular displacement of the pendulum remains small, cos φ ≈ 1, and the equilibrium conſguration becomes φ = −mgR/k. Example 7.13. Pendulum with rectilinear spring Consider next the modiſed system shown in ſg. 7.22 where a rigid arm of length R connects mass m to a pinned support at the ground. A linear spring of stiffness
7.5 The principle of virtual work for statics
289
constant k supports the mass; this spring remains vertical because its support point is free to move horizontally on rollers. The spring is un-stretched when the arm is horizontal. As in the previous example, the virtual work done by the gravity load is easily found as δWE = mg¯ıT1 δuT , where δuT is the virtual displacement at point T. Because uT = R(cos φ ¯ı1 + sin φ ¯ı2 ), an inſnitesimal virtual displacement of the same quantity is δuT = R(− sin φ ¯ı1 + cos φ ¯ı2 )δφ. The virtual work done by the gravity load now becomes δWE = −mgR sin φ δφ, and the corresponding generalized force is QE φ = −mgR sin φ. Next, the virtual work done by the restoring force in the spring is δWI = −kR cos φ¯ıT1 δuT , which yields QIφ = kR2 cos φ sin φ. The principle of virtual work, as expressed by eq. (7.39), now implies 2 QIφ + QE φ = kR cos φ sin φ − mgR sin φ = R sin φ(kR cos φ − mg) = 0.
Two solutions are possible. First, sin φ = 0: this leads to φ = 0 or π, i.e., the arm is in the down or up vertical position, respectively. The second solution is cos φ = mg/(kR). For mg/(kR) > 1, however, this solution no longer exists, leaving the ſrst solution as the only valid solution of the problem. 7.5.6 The principle of virtual work and conservative forces The principle of virtual work was ſrst developed for a single particle, then extended to a system of particles. In this latter case, a distinction was made between internal and external forces acting on the system. On the other hand, section 3.2 introduced the concept of conservative forces. In this section, the internal and external forces applied to the system of particles will be divided into two groups, the conservative and the non-conservative forces. The principle of virtual work is now expressed as δW = δWc + δWnc = 0, where δWc and δWnc denote the virtual work done by the conservative and nonconservative forces, respectively. The virtual work done by the conservative forces can be evaluated with the help of eq. (7.22) to yield δW = −δ(V ) + δWnc = 0,
(7.40)
where V is the potential of the conservative forces. This leads to the following principle. Principle 10 A system of particles is in static equilibrium if and only if virtual changes in the potential of the conservative force equal the virtual work done by the non-conservative forces for all arbitrary virtual displacements. If all the forces applied to a system of particles are conservative, the system is called a conservative system. The virtual work done by the non-conservative forces is absent, and principle of virtual work, eq. (7.40), takes on a particularly simple form, δW = −δ(V ) = 0. The following principle follows.
(7.41)
290
7 Basic concepts of analytical dynamics
Principle 11 A conservative system of particles is in static equilibrium if and only if virtual changes in the potential is stationary for all arbitrary virtual displacements. Statements (7.35), (7.39), (7.40), or (7.41) all are statements of the principle of virtual work. In the ſrst two statements, a distinction is made between internal and external forces. In the last two statements, a distinction is made between conservative and non-conservative forces. For conservative forces, the virtual work can be expressed as the variation of a potential function, whereas this is not possible for non-conservative forces. It was shown that Newton’s ſrst law and the principle of virtual work are equivalent; indeed, the principle of virtual work was derived from Newton’s ſrst law. Newton’s law does not distinguish among the various types of forces: it simply states that “the sum of all forces must vanish.” On the other hand, the nature of the applied forces profoundly affects the statement of the principle of virtual work: conservative forces are derived from a potential, but non-conservative forces are not; this fundamental difference is reƀected in the principle. Example 7.14. Four particles on a single rigid bar Consider the system depicted in ſg. 7.23: four particles of masses ma , mb , mc , and md , respectively, are connected to the ground by four springs of identical stiffness k. The un-stretched length of the springs are a , b , c , and d , respectively. The four particles are also connected to a rigid bar, as indicated on the ſgure. The rotation of the rigid bar is assumed to remain small, and hence, the motion of the particles is purely vertical. This system could be represented by four generalized coordinates, the vertical motions of the four particles, subjected to two kinematic constraints imposed by the rigid bar, for a total of two degrees of freedom. Another approach is to select two generalized coordinates only, the vertical motion of the bar’s mid-span, u, and its rotation, θ. Using this latter approach, the potential of the forces associated with the elastic springs is k Ve = (u − Lθ/2 − a )2 + (u − Lθ/6 − b )2 2 2 2 + (u + Lθ/6 − c ) + (u + Lθ/2 − d ) , and the potential of the gravity forces is V m = g [ma (u − Lθ/2) + mb (u − Lθ/6) + mc (u + Lθ/6) + md (u + Lθ/2)] . Because all forces acting on the system are conservative, the statement of the principle of virtual work based on kinematically admissible virtual displacements reduces to δ(V e + V m ) = 0, see eq. (7.41), and leads to ! " ! " u ¯ − θ/2 − ¯a (δu − δθ/2) + u ¯ − θ/6 − ¯b (δu − δθ/6) kL2 ! " ! " + u¯ + θ/6 − ¯c (δu + δθ/6) + u ¯ + θ/2 − ¯d (δu + δθ/2) +gL [ ma (δu − δθ/2) + mb (δu − δθ/6) +mc (δu + δθ/6) + md (δu + δθ/2)] = 0,
7.5 The principle of virtual work for statics
291
where u ¯ = u/L, ¯a = a /L, and similar notations are used for the un-stretched length of the other springs. Because the kinematically admissible virtual displacements are arbitrary, two equations are obtained; these are readily solved to ſnd u ¯ = ¯ − m¯ g, ¯ 5θ mc − mb d − ¯a ¯c − ¯b md − ma = + − + g¯. 9 2 6 2 6
(7.42a) (7.42b)
where ¯ = (¯a + ¯b + ¯c + ¯d )/4 is the average non dimensional un-stretched length of the springs, m = (ma + mb + mc + md )/4 the average mass of the particles and g¯ = g/(kL). md mc
mb
ma
mc
md q
ma i1
i1 k
k k u
k
q2
mb q1 k k
u
k
k L/3 L/3 L/3
L/3 L/3 L/3 Fig. 7.23. Four spring supporting a rigid bar.
Fig. 7.24. Four spring supporting an articulated rigid bar.
The principle of virtual work, per se, does not provide any information about the loads acting in the rigid bar. To compute the bar mid-span bending moment, for instance, the basic methods of statics could be used: summing up the bending moments acting on a free body diagram of the right side of the beam yields 2 ¯ 1 ¯a + ¯d ¯b + ¯c m a + md mb + mc g¯ ¯ M = ( − m¯ g) − + + + , (7.43) 3 2 2 6 2 6 2 ¯ = M/(kL2 ) is the non dimensional mid-span bending moment. where M Example 7.15. Four particles on two rigid bars Consider now the system depicted in ſg. 7.24: the four springs support two rigid bars connected at mid-span by means of a hinge. The system now presents three degrees of freedom that are conveniently chosen as the vertical displacement of the hinge, u, and the orientations, θ1 and θ2 , of the left and right bars, respectively. The potential of the elastic forces in the springs becomes Ve =
k (u − Lθ1 /2 − a )2 + (u − Lθ1 /6 − b )2 2 2 2 + (u + Lθ2 /6 − c ) + (u + Lθ2 /2 − d ) ,
(7.44)
292
7 Basic concepts of analytical dynamics
and the potential of the gravity forces is Vm =
ma g (u − Lθ1 /2) + mb g (u − Lθ1 /6) + mc g (u + Lθ2 /6) + md g (u + Lθ2 /2) ,
where the rotations, θ1 and θ2 , are assumed to remain small. Here again, all forces acting on the system are conservative, the statement of the principle of virtual work based on kinematically admissible virtual displacements reduces to δ(V e +V m ) = 0, see eq. (7.41), and leads to the following set of equations ⎫ ⎧ ⎤⎧ ⎫ ⎡ ¯⎬ 4 −2/3 2/3 ⎨ u 24¯ − 24m¯ g ⎬ ⎨ 1 ⎣−2/3 5/18 0 ⎦ θ1 = −(3a + b ) + (3ma + mb )¯ g . (7.45) ⎩ ⎭ 6⎩ ⎭ 2/3 0 5/18 (3d + c ) − (3md + mc )¯ θ2 g This solution is of course different from that of the previous problem; indeed, the mid-span hinge relieves the bending moment at the middle of the bar. 7.5.7 Problems Problem 7.4. Rotating disk with spring restraint A mechanism consists of the rotating circular disk pinned at its center as shown in ſg. 7.25. A cable is wrapped around the outer edge and a force, P , is applied tangentially. The rotation is resisted by a spring of stiffness constant k attached to a pin on the disk’s outer radius and ſxed horizontally to a support that can move vertically, leaving the spring horizontal at all times. The spring is un-stretched when θ = 0. Use the principle of virtual work to determine the force, P , required to keep the disk in static equilibrium as a function of angle θ.
Q A R f O
k ?
R
B x
P Fig. 7.25. Rotating disk with spring restraint.
L k dx
Fig. 7.26. Crank-slider mechanism with a spring.
Problem 7.5. Crank-slider mechanism with a spring Consider the crank-slider mechanism depicted in ſg. 7.26. The crank of length R is actuated by a torque Q, and the link of length L transforms the rotary motion of the crank into a linear motion of the slider. A spring of stiffness constant k connects the slider to the ground and is un-stretched when x = 0. Use the principle of virtual work to ſnd the static equilibrium conſguration of the system.
Problem 7.6. Lever with sliding pivots Bar ABC is of length b + a and is constrained to move vertically at point A and horizontally at B, while a horizontal force, P , is applied at point C, as depicted in ſg. 7.27. Point A is restrained by a vertical spring of stiffness constant k, which is relaxed when angle θ = 0. Use the principle of virtual work to determine the static equilibrium conſguration of the system.
7.5 The principle of virtual work for statics
293
A b
k ?
L
q B
L
?
a
k
M
P C
Fig. 7.27. Lever with spring-restrained sliding pivots.
L
L
Fig. 7.28. Screw jack type of scissor lift.
Problem 7.7. Screw jack scissor lift Consider the scissor lift with a spring of stiffness constant k linking the opposite joints shown in ſg. 7.28. The conſguration of the system is represented by a single generalized coordinate, θ, the angle between the jack legs and the horizontal. Using the principle of virtual work, determine the crank moment, M , for which static equilibrium of the system is achieved. The threaded screw has a pitch of N threads per unit length. All bars of the jack are articulated.
Problem 7.8. Lever mechanism A bar of length 3b is pinned at its lower end, point O, and a spring of stiffness constant k connects its tip point T to the ground a point A, as shown in ſg. 7.29. A second bar, of length b, is pinned to the ſrst bar as shown and to a slider that is constrained to move vertically on a frictionless rod. A force of magnitude F is acting on the slider. Use the principle of virtual work to determine the static equilibrium conſguration of the system.
T
A
k
L A
2b
b
F 3b ?
b
u k W
O
Fig. 7.29. Lever mechanism.
Fig. 7.30. Mechanism with nonlinear geometry.
Problem 7.9. Spring-mass problem with nonlinear geometry A spring of stiffness constant k and un-stretched length L is fastened to a support at point A and is connected to a weight, W , as shown in ſg. 7.30. The weight slides on a frictionless vertical rod and the spring is un-stretched when horizontal. (1) Using the principle of virtual work, determine the static equilibrium conſguration of the system. (2) Plot the non¯ = W/(kL), as function of the non-dimensional displacement of the dimensional weight, W ¯ = u/L. slider, u
Problem 7.10. Linked bars with lateral springs and forces Figure 7.31 shows a mechanical system consisting of two articulated bars pinned together at point B and to the ground at point C. Two springs of stiffness constants k1 and k2 support the
294
7 Basic concepts of analytical dynamics
bars at their mid-span and two forces, P and Q, are applied at points B and A, respectively. Let qA and qB , the downward deƀection of points A and B, be the two generalized coordinates of the system. Use the principle of virtual work to determine the two static equilibrium equations of the system. Assume small displacements: |qA | L and |qB | L.
i2
Q
P
C L
k1 L
B L
k2 A L
Fig. 7.31. Two articulated bars supported by springs.
Lb
B
Lc
q O
i1
Q d
A
k T
Fig. 7.32. Two articulated bars supported by springs.
Problem 7.11. Two-bar linkage system The two bar linkage shown in ſg. 7.32 comprises bar OB of length Lb and bar BAT of length Lc . Bar BAT passes through a slider located at ſxed point A but free to swivel about that point. A spring of stiffness constant k connects the tip of the bar at point T to the slider at point A and is of vanishing un-stretched length. A torque of magnitude Q is applied to bar OB. Use the principle of virtual work to determine the static equilibrium conſguration of the system.
8 Variational and energy principles
This chapter investigates applications of the principles of analytical mechanics developed in chapter 7 to dynamical systems. First, the principle of virtual work presented in section 7.5 for static problems will be generalized to dynamic problems, leading to d’Alembert’s principle, see section 8.1. Next, Hamilton’s principle is presented in section 8.2 as an integral version of d’Alembert’s principle. Finally, Lagrange’s formulation is presented in section 8.3, leading to Lagrange’s equations of motion.
8.1 D’Alembert’s principle Newton’s second law, eq. (3.4), states that if external forces, F a , are acting on a particle, its acceleration is proportional to the sum of these forces, F a = ma. The product of the mass by the acceleration vector is a force vector, called the inertial force vector, F I , deſned as F I = −ma. (8.1) The minus sign in the deſnition of the inertial force indicates that such force always opposes motion. With this deſnition, Newton’s second law becomes F I + F a = 0.
(8.2)
Of course, this equation looks like a trivial manipulation of Newton’s law: inertial forces have been brought from the right- to the left-hand side of the equation. The importance of the above statement, however, is that it generalizes the concept of equilibrium, a concept of statics, to dynamics problems. As mentioned in section 3.1.2, Newton’s ſrst law is generally stated as “a particle is in static equilibrium if and only if the sum of the externally applied forces vanishes” within the context of statics problems. Equation (8.2) expresses the condition for dynamic equilibrium: the sum of the externally applied forces must vanish, provided that the inertial forces are treated as externally applied forces. Of course, the concept of dynamic equilibrium does not imply that the particle is at rest; indeed, the particle moves under the effect of
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_8 © Springer Science+Business Media B.V. 2011
296
8 Variational and energy principles
the externally applied forces. Rather, dynamic equilibrium implies the vanish of the resultant of the set of forces acting on a particle in motion; this set of forces includes all externally applied forces and the inertial forces. The importance of the concept of inertial force is that the same law, “the sum of the forces must vanish,” now applies to both statics and dynamics problems; dynamics is reduced to statics. D’Alembert’s principle can now be stated as follows. Principle 12 (D’Alembert’s principle) A system of particles is in dynamic equilibrium if and only if the sum of the externally applied forces and inertial forces vanishes. In section 7.5, the principle of virtual work for static problems was derived from Newton’s ſrst law and shown to imply that δ(V ) = δWnc , for all arbitrary virtual displacements, see eq. (7.40). In this expression, V is the potential of the conservative forces acting on the system of particles, and δWnc the virtual work done by the nonconservative forces. For dynamic equilibrium, D’Alembert’s principle requires the vanishing of the sum of the externally applied forces and inertial forces. Inertial forces are nonconservative force because they cannot be derived from a potential. It follows that the principle of virtual work, the condition for static equilibrium, can be generalized to becomes the condition for dynamic equilibrium, if the virtual work done by the inertial forces, denoted δW I , is added to the virtual work done by the other nonconservative forces. In summary, a system of particles is in dynamic equilibrium if and only if δ(V ) = δW nc + δW I , (8.3) for all arbitrary virtual displacements. D’Alembert’s principle can also be stated as follows. Principle 13 (D’Alembert’s principle) A system of particles is in dynamic equilibrium if and only if virtual changes in the potential of the conservative force equal the virtual work done by the non-conservative forces and inertial forces for all arbitrary virtual displacements. The principle of virtual work presented in section 7.5 is equivalent to Newton’s ſrst law. By treating inertial forces as “externally applied forces,” dynamic problems are reduced to static problems and d’Alembert’s principle becomes equivalent to Newton’s second law. The two alternative statements of d’Alembert’s principle given above are equivalent to Newton’s second law, and hence, provide an alternative basis for dynamics. For a system composed of N of particles, the virtual work done by the inertial forces is N N ( ( δW I = F IT δr = − mi aTi δr i , (8.4) i i i=1
i=1
where ai is the inertial acceleration vector of the ith particle, mi its mass, and δri an arbitrary virtual displacement of the same particle.
8.1 D’Alembert’s principle
297
Newton’s formulation relates the sum of all externally applied forces to the acceleration of the system, but d’Alembert’s principle only involves the virtual work done by the forces acting on the system. It follows that if the virtual work done by a speciſc force vanishes, this force will be automatically eliminated from the equations of motion obtained from d’Alembert’s principle. In section 7.4.3, it was shown that the virtual work done by the forces that impose a kinematic constraint does vanish; hence, such forces will not appear in a formulation based on d’Alembert’s principle but will explicitly appear when using Newton’s formulation. Example 8.1. Conservation of energy Consider a system acted upon by conservative forces only. D’Alembert’s principle then reduces to N ( δV + mi ¨r Ti δr i = 0. i=1
Because the virtual displacements are arbitrary, they can be selected to equal the actual, differential displacements of the system, i.e., δr i = dri . This selection, however, is only possible for speciſc systems; indeed, virtual displacements are arbitrary virtual changes that bring the conſguration of the system to a new conſguration, at a given, ſxed instant in time. Consequently, equating virtual displacements to differential displacements is only possible when dealing with time-independent potential functions. Under this restriction, d’Alembert’s principle now becomes 1 2 N N ( d 1 ( T T dV + mi ¨r i r˙ i dt = dV + mi r˙ i r˙ i dt = 0, dt 2 i=1 i=1 The term in parenthesis is the kinetic energy, K, of the system, and hence, dV /dt + dK/dt = dE/dt = 0, where E is the total mechanical energy of the system. This is the principle of conservation of energy, see eq. (3.25), previously derived directly from Newton’s second law. 8.1.1 Equations of motion for a rigid body Consider a rigid body with a body attached frame F B = B, B ∗ = (¯b1 , ¯b2 , ¯b3 ) , where point B is a reference point on the body and B ∗ a body attached basis, as depicted in ſg. 8.1. The conſguration of the body is described with respect to an inertial frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. The position of reference point B of the body is r B and its orientation is determined by rotation tensor R, which brings inertial basis I to basis B ∗ . The rigid body is composed of N particles each of mass mi and located at point Pi ; the position vector of the ith particle is denoted r i and its position with respect to reference point B is denoted si . Superscript (·)∗ indicates tensor components resolved in material basis B ∗ . The virtual work done by the inertial forces is
298
8 Variational and energy principles
δW I =
N (
F IT δr i =
i=1
N (
F IT R RT δr i =
i=1
N (
−mi aTi R RT δr i .
i=1
The components of the virtual displacement vector in the body attached basis, RT δr i , will Pi b2 si be evaluated using the corresponding expresB ri * sion for the velocity, see eq. (5.23), as RT δri = B ∗ i3 rB, R s∗ , where δr is the virtual disRT δr B + δψ B i placement of the reference point of the body and b1 O δψ ∗ are the components of its virtual rotation i2 i1 I vector resolved in the material basis. Similarly, the components of the inertial acceleration reT Fig. 8.1. Conſguration of a rigid solved in the same basis, R ai , are given by ∗ body. ˙ + ω ∗ω ∗ )s∗i , eq. (5.25) as RT ai = RT aB + (ω where aB is the inertial acceleration vector of point B and ω∗ the components of the angular velocity vector resolved in the material basis. The virtual work done by the inertial forces now becomes b3
δW I = −
N ( ∗ δr TB R + δψ ∗T s∗i mi RT aB + (ω ˙ + ω ∗ω ∗ )s∗i . i=1
Expanding the products then leads to N N ( ( ∗ I T T ∗ ∗ ∗ ˙ δW = − mi δr B aB − δr B R(ω +ω ω ) mi si i=1
− δψ ∗T
N (
mi s∗i RT aB − δψ ∗T
i=1
− δψ ∗T ω ∗
N (
N (
i=1
ω˙ ∗ mi s∗i s∗T i
i=1
ω∗. mi s∗i s∗T i
i=1
The ſrst bracketed term is simply the total mass of the rigid body, and the components of the tensor of mass moments of inertia, see eq. (6.1), resolved in the material basis, appear in the last two bracketed terms. The second and third bracketed term are related to the location of the center of mass of the body 1 ( mi s∗i , m i=1 N
η∗ =
(8.5)
where η ∗ are the components of the position vector of the center of mass of the rigid body with respect to its reference point B, resolved in the material basis. The virtual work done by the inertial forces now becomes
8.1 D’Alembert’s principle
∗ δW I = − δrTB R mRT aB + (ω ˙ + ω ∗ω ∗ )mη ∗ ∗ T η R aB + I B∗ ω˙ ∗ + ω ∗ I B∗ ω∗ . − δψ ∗T m
299
(8.6)
Let F ∗ and M ∗B be the components, resolved in the material basis, of the force and moment vectors, respectively, applied to the rigid body. The virtual work done by these externally applied loads is then δW a = δr TB R F ∗ + δψ ∗T M ∗B ; note that M ∗B are the components of the applied moment computed with respect to the reference point B. D’Alembert’s principle states that δW I + δW a = 0, for all kinematically admissible virtual displacements RT δr B and δψ ∗ . Of course, the constraint forces that keep the rigid body rigid vanish from the formulation because kinematically admissible virtual displacements are used here. The equations of motion of the rigid body then follow as ∗ mRT aB + (ω ˙ + ω ∗ω ∗ )mη ∗ = F ∗ , ∗
T
m η R aB + I
B∗
∗
∗ B∗ ∗
ω˙ + ω I
ω =
M ∗B .
(8.7a) (8.7b)
In this derivation, no assumptions were made concerning the location to the reference point B of the rigid body. Consequently, the two vector equations of motion become coupled: each equation involves both the acceleration of the reference point, aB , and the angular velocity, ω∗ , and acceleration, ω˙ ∗ , of the rigid body. The relative position of the center of mass with respect to the chosen reference point B, η, appears explicitly in the equations of motion. Clearly, the center of mass of the rigid body could be chosen as the reference point; in this case, η = 0, and the governing equations of motion simplify to mRT aC = F ∗ , and I C∗ ω˙ ∗ + ω ∗ I C∗ ω∗ = M ∗C . The ſrst equation describes the motion of the center of mass of the rigid body, and the second equation describes the motion of the body around this point. These equations are, of course, identical to those obtained earlier in section 6.5. If the orientation of the material frame is selected to coincide with that of the principal axes of inertia, the tensor of mass moments of inertia becomes diagonal, and the equations further simplify, see eqs. (6.21). 8.1.2 Equations of motion for the planar motion of a rigid body When dealing with the planar motion of a rigid body, the equations of motion derived in the previous section simplify considerably. Let the planar motion take place in the plane deſned by unit vectors ¯ı1 and ¯ı2 ; the angular velocity, angular acceleration, and virtual rotation vectors now become ω = θ˙ ¯ı3 , ω˙ = θ¨ ¯ı3 and δψ = δθ ¯ı3 , respectively, where θ is the rotation angle of the rigid body. It will be assumed here that unit vector ¯ı3 , the normal to the plane in which the motion is taking place, is a principal axis of inertia, and hence, the mass moment of inertia tensor becomes ⎡ B B ⎤ I11 I12 0 B B I22 0 ⎦ . I B = ⎣I12 B 0 0 I33
300
8 Variational and energy principles
Introducing these expressions into eq. (8.6), the virtual work done by the inertial forces becomes ¨ı3 η − mθ˙2 η − δθ m¯ıT ηa + I B θ¨ . δW I = −δrTB maB + mθ (8.8) 3 33 B D’Alembert’s principle then yields the equations of motion of the rigid body undergoing planar motion as ¨ı3 η − m θ˙2 η = F , m aB + m θ m ¯ıT ηa + I B θ¨ = MB , 3
B
(8.9a) (8.9b)
33
where F and MB are the components of the externally applied force vector and moment, respectively. These equations further simplify if the reference point is chosen to coincide with the center of mass of the rigid body. Example 8.2. The double pendulum Consider the double pendulum system depicted in ſg. 8.2. The ſrst bar is of length L1 , mass m1 , and is connected by hinges to the ground at point O and to the second bar at point A. The second bar is of length L2 and mass m2 . The bars have orientation angles θ1 and θ2 with respect to the vertical, respectively. I = (¯ı1 , ¯ı2 , ¯ı3 ) is an inertial basis, and bases E = (¯ e1 , e¯2 , e¯3 ) and A = (¯ a1 , a ¯2 , a ¯3 ) are material bases attached to the ſrst and second bars, respectively. Derive the equations of motion of the system. VO
i2 i1
O q1
L1
m1g
g
e2
Top bar
e1 VA HA
A
VA L2
q2
HO
HA m2g
a2
Bottom a1 bar
Fig. 8.2. Conſguration of the double pendulum system.
Newtonian formulation First, the equations of motion will be derived using the classical Newtonian approach. Figure 8.2 shows the free body diagrams for the two bars: VO and HO are the vertical and horizontal components of the reaction force at point O and VA and HA the components of the force vector transmitted through the hinge at point A. The equations of motion for the ſrst bar are found using eq. (6.19) and the pivot equation, eq. (6.38), about inertial point O, to ſnd
8.1 D’Alembert’s principle
m1 L 1 ¨ (θ1 e¯2 − θ˙12 e¯1 ), 2
(8.10a)
L1 θ¨1 = −m1 g S1 + VA L1 S1 + HA L1 C1 . 2
(8.10b)
(HO + HA )¯ı1 + (VO + VA )¯ı2 − m1 g¯ı2 = m1 L21 3
301
The short-hand notation sin θ1 = S1 and sin θ2 = S2 was used, with similar expressions for the cosine function. The equations of motion for the second bar are obtained in a similar manner as m2 L 2 ¨ (θ2 a ¯2 − θ˙22 a ¯1 ), (8.11a) 2 m2 L22 ¨ L2 L2 θ2 = VA S2 + HA C2 . (8.11b) 12 2 2
−HA¯ı1 − VA¯ı2 − m2 g¯ı2 = m2 L1 (θ¨1 e¯2 − θ˙12 e¯1 ) +
In this case, the second equation was written about the center of mass of the bar, see eq. (6.37). Newton’s approach gives a total of six scalar equations, involving six unknowns: angles θ1 and θ2 , two components of reaction force, HO and VO , and two components of internal force, HA and VA . Clearly, the system features two degrees of freedom only, and can be represented with two generalized coordinates, θ1 and θ2 , for instance. The two equations of motion of the system would be obtained by eliminating the four components of reaction force, resulting in two coupled differential equations for θ1 and θ2 . After tedious algebra, the following equations are obtained (
L1 L2 m1 + m2 )L21 θ¨1 + m2 C21 3 2 L1 L2 m2 L22 C21 θ¨1 + m2 2 3
L1 L2 ˙ 2 m1 + m2 )gL1 S1 + m2 θ S21 , θ¨2 = −( 2 2 2 L1 L2 ˙ 2 L2 θ¨2 = −m2 g S2 − m2 θ S21 2 2 1
where C21 = cos(θ2 − θ1 ) and S21 = sin(θ2 − θ1 ). D’Alembert’s formulation The same problem will now be approached with d’Alembert’s principle using kinematically admissible virtual displacements. For the ſrst bar, the potential of the gravity forces is V1 = −m1 gL1 C1 /2. The virtual work done by the inertial forces is obtained from eq. (8.8) as δW1I = −δθ1 I1O θ¨1 , where I1O is the moment of inertia of the ſrst bar with respect to point O. This point was chosen as the reference point on the body, and hence, aO = 0 and δr O = 0, because kinematically admissible virtual displacements are used. It follows that δV1 − δW1I = δθ1 (m1 g
L1 m1 L21 ¨ S1 + θ1 ) 2 3
(8.12)
The potential of the gravity forces acting on the second bar is V2 = −m2 g(C1 L1 + C2 L2 /2). The virtual work done by the inertial forces is once again obtained from eq. (8.8), using point A as the reference point
302
8 Variational and energy principles
δW2I
L2 L2 ¯1 − θ˙22 a ¯1 aA + θ¨2ı3 a 2 2 L2 a1 aA + I2A θ¨2 . − δθ2 m2¯ıT3 2
=−
δr TA m2
(8.13)
The position vector of point A is r A = L1 e¯1 ; its velocity vector is then v A = L1 θ˙1 e¯2 , and hence, δr A = L1 δθ1 e¯2 ; ſnally, its acceleration vector is aA = L1 (θ¨1 e¯2 − θ˙12 e¯1 ). Introducing all these results in the above equation leads to L2 δV2 − δW2I =m2 g(L1 S1 δθ1 + S2 δθ2 ) 2 L2 ¨ L2 ˙ 2 ¯2 − ¯1 + m2 L1 δθ1 e¯T2 L1 (θ¨1 e¯2 − θ˙12 e¯1 ) + θ2 a θ a 2 2 2 L1 L2 T T ¨ a ¯1 ı3 (θ1 e¯2 − θ˙12 e¯1 ) + I2A θ¨2 . + δθ2 m2 2 D’Alembert’s principle now implies δ(V1 + V2 ) − δ(W1I + W2I ) = 0 for all arbitrary variations, δθ1 and δθ2 ; this directly leads to the equations of motion given above. In contrast with Newton’s approach, d’Alembert’s principle yields the equations of motion of the system without reference to the reaction and internal forces that are eliminated from the onset of the formulation. Consequently, the equations of motion are obtained in a more direct and convenient manner, reducing the risk of errors. The reaction and internal forces do not appear in d’Alembert’s formulation because the virtual work done by such forces vanishes, as discussed in section 7.4.3. Of course, reaction and internal forces are quantities of primary interest that must often be evaluated as an integral part of the analysis. Newton’s equations could be used to introduce reaction forces into the formulation; for instance, eqs. (8.11) could be used to evaluate HA and VA , then eqs. (8.10) would yield the other two components HO and VO . Example 8.3. The rigid body/universal joint system Figure 7.9 depicts a rigid body attached to the ground by means of a universal joint. This problem was treated in example 7.5 on page 266, where the conſguration of the universal joint is described. Component k of the universal joint is connected to the ground at point O by means of a bearing allowing rotation about axis ¯ı3 . Component is connected to a rigid body at point O . The orientation of the rigid body will be deſned by Euler angles, using the 3-1-2 sequence. A ſrst planar rotation about axis ¯ı3 , of magnitude φ, brings inertial basis I = (¯ı1 , ¯ı2 , ¯ı3 ) to basis A = (¯ a1 , a ¯2 , a ¯3 ), where a ¯1 is aligned with unit vector ¯b1 of the cruciform. This rotation is associated with a constant angular speed φ˙ = Ω, implying a ¯1 (t) = cos(Ωt)¯ı1 + sin(Ωt)¯ı2 . A second planar rotation about axis a ¯1 , of magnitude θ, brings basis A to basis B = (¯b1 , ¯b2 , ¯b3 ), where ¯b2 is the second unit vector aligned with the cruciform. Finally, a third planar rotation about axis ¯b2 , of magnitude ψ, bring basis B to basis E ∗ = (¯ e1 , e¯2 , e¯3 ) that is attached to the rigid body. Tensor components resolved in basis E ∗ will be denoted
8.1 D’Alembert’s principle
303
with the superscript (·)∗ . Points O and O are coincident. Establish the equations of motion of the system using d’Alembert’s principle. The potential of the gravity forces acting at the center of mass of the rigid body is V = mg¯ıT3 η, where m is the total mass of the rigid body and η the position vector of the center of mass with respect to point O. The variation of this potential is then δV = δψ ∗T mg η ∗ RT ¯ı3 , where δψ ∗ are the components of the virtual rotation vector resolved in the body attached frame. The virtual work done by the inertial forces is given by eq. (8.6), and d’Alembert’s principle now implies ∗ δrTO R mRT aO + (ω ˙ + ω ∗ω ∗ )mη ∗ η ∗ RT ¯ı3 + m η ∗ RT aO + I O∗ ω˙ ∗ + ω ∗ I O∗ ω ∗ = 0, + δψ ∗T mg where inertial point O was used as the reference point on the rigid body; hence, aO = 0. The system presents two degrees of freedom and two generalized coordinates, the two Euler angles, θ and ψ, will be selected here. Kinematically admissible virtual displacements will be used for this problem, hence δr O = 0 and, in view of eq. (4.80), the virtual rotation vector becomes ⎧ ⎫ ⎧ ⎫ ⎡ ⎤ Cψ 0 ⎨ δφ ⎬ ⎨ −Cθ Sψ ⎬ δθ Sθ δψ ∗ = H ∗ δθ = δφ + ⎣ 0 1⎦ δψ ⎩ ⎭ ⎩ ⎭ δψ Sψ 0 Cθ Cψ (8.14) δθ , = h∗ δφ + G∗ δψ where h∗ stores the ſrst column of the tangent operator, H ∗ , and G∗ its last two. Because the ſrst Euler angle is prescribed to be φ = Ωt, the corresponding variation vanishes, δφ = 0 .
D’Alembert’s principle now reduces to δθ, δψ G∗T [mg η ∗ RT ¯ı3 + I O∗ ω˙ ∗ + O∗ ω ∗ I ω ∗ ] = 0. Because the virtual changes in the two generalized coordinates are arbitrary, the equations of motion of the rigid body are G∗T mg η ∗ RT ¯ı3 + I O∗ ω˙ ∗ + ω (8.15) ∗ I O∗ ω ∗ = 0. D’Alembert’s principle is a very powerful tool for the derivation of the equations of motion of the system. Two equations of motion are obtained for the two generalized coordinates of the problem. In contrast, Newton’s method would generate six equations involving six unknown: two Euler angles θ and ψ, three components of the reaction force at point O, and the torque Q required to impose the constant angular velocity φ˙ = Ω. These latter four unknowns would need to be eliminated from the set of equations to obtain two equations of motion for θ and ψ. Derivation of the equations of motion based on Newton’s approach is left to the reader as an exercise.
304
8 Variational and energy principles
8.1.3 Problems Problem 8.1. Euler’s ſrst and second laws Prove Euler’s ſrst and second laws for a system of particles based on d’Alembert’s principle. Hint: ſrst read sections 3.4.4 and 7.5.4.
Problem 8.2. Two bar linkage system The two bar linkage shown in ſg. 7.11 comprises bar OB of length Lb and mass mb , and bar BAT of length Lc and mass mc . Bar BAT passes through ſxed point A but is free to swivel about that point. (1) Derive the equation of motion of the system using d’Alembert’s principle. Use angle θ as the generalized coordinate.
Problem 8.3. Homogeneous bar sliding on guides Figure 8.3 depicts a homogeneous bar of length L and mass m sliding on two guides at its end points. At the left end, the bar is connected to a spring of stiffness constant k that is unstretched when the bar is horizontal. At the right end, the bar is connected to a point mass M . Gravity acts along unit vector ¯ı2 . (1) Use d’Alembert’s principle to derive the equation of motion of the system. Use a single generalized coordinate, θ.
T L k q
O i2
L2, m2
i2
g
L
i1 L1, m1
m
B f
q M
w i1 A
O d
Fig. 8.3. Homogeneous bar sliding on guides at both ends.
Fig. 8.4. The two-bar linkage with slider system.
Problem 8.4. The two-bar linkage with slider The two-bar linkage with slider system shown in ſg. 8.4 is a planar mechanism. It consists of a uniform crank of length L1 and mass m1 connected to the ground at point O; let θ be the angle from the horizontal to the crank. At point B, the crank slides over a uniform linkage of length L2 and mass m2 that is connected to the ground at point A. Let w denote the distance from point B to point A and φ the angle from the horizontal to link BA. (1) Derive the equation of motion of the system using d’Alembert’s principle. Use angle θ as the generalized coordinate.
Problem 8.5. Pendulum mounted on a cart Figure 8.5 shows a pendulum of length L and mass m mounted on a cart of mass M that is connected to the ground by means of a spring of stiffness constant k and of a dashpot of constant c. The displacement of the cart is denoted x which is also the stretch of the spring, and angle θ measures the deƀection of the pendulum with respect to the vertical. Gravity acts on
8.2 Hamilton’s principle
305
the system as indicated in the ſgure. (1) Based on d’Alembert’s principle, derive the equations of motion of the system using the generalized coordinates x and θ. (2) Plot the time history of the cart displacement, x. (3) Plot the history of angle θ. (4) Plot the trajectory of the point at the tip of the pendulum. (5) Plot the cart velocity, x. ˙ (6) Plot the angular velocity of the ˙ (7) Plot the system kinetic and potential energies and the energy dissipated in pendulum, θ. the damper. Check the energy closure equation. (8) Plot the components of the internal force at point A. Use the following data: M = 5 kg; m = 2 kg; L = 0.4 m; k = 10 N/m; acceleration of gravity g = 9.81 m/s2 ; c = 0.5 N·s/m. Present all your results for a period of 10 s. Initial condition are at rest with x(t = 0) = 0.2 m and θ(t = 0) = π.
Problem 8.6. Flexible pendulum on a slider Solve problem 3.34 using d’Alembert’s principle.
Problem 8.7. Pendulum with rotating mass Solve problem 4.38 using d’Alembert’s principle.
Problem 8.8. Plate hinged at the rim of a rotating disk Solve problem 6.14 using d’Alembert’s principle.
Problem 8.9. Pendulum with sliding mass Solve problem 6.34 using d’Alembert’s principle.
Problem 8.10. Bar rocking on top of a curve Solve problem 6.38 using d’Alembert’s principle.
Problem 8.11. Pendulum connected to a plunging mass Solve problem 6.39 using d’Alembert’s principle.
Problem 8.12. Particle sliding in a rolling wheel Solve problem 6.43 using d’Alembert’s principle.
Problem 8.13. Pendulum connected to horizontal piston Solve problem 6.45 using d’Alembert’s principle.
Problem 8.14. Inverted pendulum mounted on a cart Solve problem 6.46 using d’Alembert’s principle.
Problem 8.15. Bar hinged at rim of rotating disk Solve problem 6.27 using d’Alembert’s principle.
Problem 8.16. Geneva wheel mechanism Solve problem 6.47 using d’Alembert’s principle.
8.2 Hamilton’s principle When dealing d’Alembert’s principle, the virtual work done by the inertial force acting on the k th particle, −δrTk mk ak , cannot not be derived from a “potential of the inertial forces.” Consequently, accelerations appear explicitly in formulations based on d’Alembert’s principle. It is possible to remedy this situation by further manipulation of this principle.
306
8 Variational and energy principles
In general, dynamical systems must be studied over a period of interest, say from an initial time ti to a ſnal time tf . At those times, boundary conditions will be enforced; at time ti , the position of each particle could be known, r k (ti ) = ˆr k (ti ), or the velocity of each particle could be known, v k (ti ) = vˆk (ti ), or both position and velocity could be known. In fact, it is even possible to imagine situations were positions could be known for some particles, and velocities for others. At time tf , similar boundary conditions will exist. Initial boundary value problems form an important class of problems in dynamics: the positions and velocities of all particles are known at time ti ; the solution of the dynamical equations of motion will reveal the corresponding quantities at time tf . Another class of problems seeks to determine the conſguration of the system at time ti such that a desirable conſguration is achieved at time tf . In practice, more complex combinations of initial and ſnal boundary conditions could be encountered. To deal with all possible cases, the following notation will be used: at time ti , r k (ti ) = ˆr k (ti ), where ˆrk (ti ) denotes the known or prescribed position of the k th particle if this position is known, or the resulting position if this quantity will result from the solution of the equations of motion. Similarly, vˆk (ti ) denotes the prescribed or resulting velocity at time ti depending on the speciſc problem at hand. Of course, similar notations are used at the ſnal time tf . 8.2.1 Use of physical coordinates If a system of N particles is in dynamic equilibrium, d’Alembert’s principle, eq. (8.3), must be satisſed together with the boundary conditions at the initial and ſnal times. Hence, the following statement must hold for all arbitrary virtual displacements %
tf
!
δW I − δV + δW
" nc
dt +
ti
N (
k=1
+
N (
k=1
tf δr Tk mk (v k − vˆk ) ti
(8.16)
tf mk δv Tk (r k − ˆ rk )
= 0. ti
At each instant in time, the integrand must vanish because of d’Alembert’s principle, eq. (8.3), and consequently, the ſrst integral vanishes for all virtual displacements. The next two terms of the statement also vanish because they simply state the initial and ſnal boundary conditions of the problem. The sum of the three terms must hence vanish for all arbitrary virtual displacements. The expression for the integral of the virtual work done by the inertial forces acting on each particle, see eq. (8.4), is expressed in terms of velocity vectors, v k , and virtual displacement vectors, δr k . Integration by parts then yields
8.2 Hamilton’s principle
%
tf
% δW dt = − I
ti
tf
ti
% =
N ( k=1
N tf (
ti
δr Tk mk
dv k dt dt
δv Tk mk v k dt −
k=1
N (
(8.17)
tf δr Tk mk v k
k=1
307
. ti
The ſrst term of the last equality is directly related to the kinetic energy of the system; δv Tk mk v k = δ(1/2 mk v Tk v k ) = δKk , where Kk is the kinetic energy of the k th particle. The sum then yields the total kinetic energy, K, of the system. The linear momentum vector, pk , of the k th particle equals the derivative of the total kinetic energy with respect to the corresponding velocity vector p k = mk v k =
∂K . ∂v k
(8.18)
Equation (8.17) now simpliſes to %
%
tf
I
tf
δW dt = ti
δK dt −
ti
N (
tf δr Tk pk
k=1
(8.19)
. ti
Introducing these results into statement (8.16) now leads to %
tf
(δK − δV + δW
nc
) dt =
ti
N (
tf
−
δrTk ˆpk
k=1
N (
tf δpTk (r k
− ˆr k )
k=1
ti
. (8.20) ti
Hamilton’s principle can be stated as follows. Principle 14 (Hamilton’s principle) A system of particles is in dynamic equilibrium if and only if equation (8.20) holds for all arbitrary virtual displacements. If the system is subjected to conservative forces only, δW nc = 0, and Hamilton’s principle simpliſes to %
tf
(δK − δV ) dt =
ti
N (
tf
−
δr Tk pˆk
k=1
ti
N (
tf δpTk (r k
− ˆr k )
k=1
.
(8.21)
ti
It is convenient to introduce here a scalar function, L, called the Lagrangian of the system, L = K − V. (8.22) With the help of this function, Hamilton’s principle applied to conservative systems further reduces to tf N N tf % tf ( ( T T δL dt = δr k pˆk − δpk (r k − ˆr k ) . ti
k=1
ti
k=1
ti
308
8 Variational and energy principles
In this statement, the left-hand side takes care of the dynamics of the system, and the right-hand side deals with the boundary conditions at the initial and ſnal times. In some instances, the boundary conditions are rather simple, and Hamilton’s principle is used to derive the sole equations of motion of the system. In such case, the boundary terms on the right-hand side are simply dropped from the statement that now reduces to the principle of least action. % tf δL dt = 0. (8.23) ti
Principle 15 (Principle of least action) A system of particles is in dynamic equilibrium if and only if equation (8.23) holds for all arbitrary virtual displacements. A fundamental difference between d’Alembert’s and Hamilton’s principles is that in the latter principle, the virtual work done by the inertial forces is expressed in terms of the variation of the kinetic energy. As was the case for d’Alembert’s principle, if the virtual work done by a speciſc force vanishes, this force will be automatically eliminated from the equations of motion obtained from Hamilton’s principle. In section 7.4.3, it was shown that the virtual work done by forces that impose kinematic constraints does vanish; hence, such forces will not appear in formulations based on Hamilton’s principle. Of course, if non-conservative forces are applied to the system, the virtual work of these forces must also be taken into account, see eq. (8.20). 8.2.2 Use of generalized coordinates In the previous section, the system was represented by the Cartesian coordinates of each particles. As discussed in section 7.2, it is often convenient to use generalized, rather than Cartesian coordinates to describe dynamical systems. To that effect, eq. (8.16) is now recast as % tf tf tf ! " q ) t = 0. (8.24) δW I − δV + δW nc dt + δq T (p − ˆp) t + δpT (q − ˆ ti
i
i
In this statement, the boundary conditions are written in terms of the array of generalized coordinates, q, used to describe the system and the generalized momenta, p, deſned as ∂L , (8.25) p= ∂ q˙ where q˙ are the system’s generalized velocities. The boundary conditions are as follows: at time ti , q(ti ) = ˆq (ti ), where ˆ q (ti ) denotes the known, or prescribed generalized coordinates deſning the conſguration of the system if this conſguration is known, or the conſguration resulting from the solution of the equations of motion. Similarly, ˆp(ti ) denotes the prescribed or resulting generalized momentum at time ti depending on the speciſc problem at hand. Of course, similar notations are used at the ſnal time tf .
8.2 Hamilton’s principle
309
If the conſguration of the system is described in terms of generalized coordinates, the virtual work done by the inertial forces, eq. (8.19), is expressed as %
tf
% δW I dt =
ti
tf
δK dt −
ti
%
k=1
tf
=
N (
δK dt − δq T
ti
∂K δr Tk ∂v k N ( k=1
tf ti
∂rk ∂q
T
∂K ∂v k
(8.26)
tf . ti
In the last bracketed term, the virtual displacement vector of the k th particle was expressed in terms of the generalized coordinates using the chain rule for derivatives as δr k = (∂r k /∂q)δq. Similarly, the velocity vector of the k th particle can be evaluated in terms of the generalized velocities as v k = (∂r k /∂q)˙q ; it then follows that ∂v k /∂ q˙ = ∂rk /∂q. The boundary term of eq. (8.26) now simpliſes to δq
T
N ( ∂r k=1
T k
∂q
∂K ∂v k
tf
= δq
ti
T
= δq T
N ( ∂v k=1
∂K ∂ q˙
T k
∂ q˙ tf
∂K ∂v k
tf ti
tf = δq T p t , i
ti
where the deſnition of the generalized momenta, eq. (8.25), was introduced. The virtual work done by the inertial forces, eq. (8.26), now becomes % tf % tf tf I δW dt = δK dt − δq T p t , ti
i
ti
where it is understood that variations should be taken with respect to the generalized coordinates. Introducing all these results into eq. (8.24) now leads to % tf tf tf (δK − δV + δW nc ) dt = δq T ˆp t − δpT (q − ˆq ) t . (8.27) ti
i
i
Hamilton’s principle can be stated as follows. Principle 16 (Hamilton’s principle) A system of particles is in dynamic equilibrium if and only if equation (8.27) holds for all arbitrary virtual changes in the generalized coordinates. This principle should be compared with principle 14: both statement express the condition for dynamic equilibrium of the system, the former when the conſguration of the system in described in terms of Cartesian coordinates, the latter when the conſguration is expressed in terms of generalized coordinates. Of course, the statement of Hamilton’s principle can be further simpliſed by introducing the system’s Lagrangian as deſned by eq. (8.22).
310
8 Variational and energy principles
Example 8.4. Pendulum mounted on a cart Figure 8.5 depicts a pendulum of length L and mass m mounted on a cart of mass M that is connected to the ground by means of a spring of stiffness constant k and of a dashpot of constant c. This two degree of freedom problem will be represented by two generalized coordinates: the displacement of the cart, denoted x, which is also the stretch of the spring, and the angular deƀection of the pendulum with respect to the vertical, denoted θ. Gravity acts on the system as indicated in the ſgure. Derive the equations of motion of the system. x k O
c g
i1
M
A q
L m
i2
e2 e1
Fig. 8.5. Pendulum mounted on a cart.
The position of the center of mass of the pendulum is r C = x ¯ı2 + L/2 e¯1 and its velocity v C = x˙ ¯ı2 + L/2θ˙ e¯2 . The total kinetic energy of the system is the sum of that of the cart, 1/2 M x˙ 2 , and that of the pendulum, 1/2 mL2 /12 θ˙2 + 1/2 mv TC v C , i.e., 1 1 mL2 ˙2 1 ˙ θ, K = (M + m)x˙ 2 + θ + mLx˙ θC (8.28) 2 2 3 2 where the short-hand notation Cθ = cos θ and Sθ = sin θ was used. The potential energy of the system consists of the strain energy of the spring, 1/2 kx2 , and the potential energy of the gravity forces, mgL/2 (1 − Cθ ). The potential of the system is now 1 mgL V = kx2 + (1 − Cθ ). (8.29) 2 2 Variation in the Lagrangian of the system becomes mL2 ˙ ˙ θδ θ 3 mL ˙ ˙ θ δθ) − kxδx − mg L Sθ δθ. (θCθ δ x˙ + xC ˙ θ δ θ˙ − x˙ θS + 2 2
δL = δK − δV =(M + m)xδ ˙ x˙ +
Finally, the virtual work done the non-conservative forces is δW nc = −cxδx. ˙ Introducing these results into Hamilton’s principle leads to % tf · · mL ˙ mL2 ˙ mL θCθ δx − θ+ − (M + m)x˙ + xC ˙ θ δθ 2 3 2 ti L mL ˙ x˙ θSθ δθ − kxδx − mg Sθ δθ − cxδx ˙ dt = 0. − 2 2
8.2 Hamilton’s principle
311
To obtain this expression, integration by parts were performed for all terms involving ˙ Each time derivatives of virtual changes in the generalized coordinates, δ x˙ and δ θ. integration by parts generates boundary terms at times ti and tf , which were ignored in this example. Because the variations δx and δθ are arbitrary, the sum of their coefſcients in the integrand must vanish, leading to the two governing equations of the system mL ¨ mL ˙2 Cθ θ − Sθ θ + kx + cx˙ = 0, 2 2 2 mL mL ¨ L Cθ x ¨+ θ + mg Sθ = 0. 2 3 2
(M + m)¨ x+
(8.30)
Several distinguishing features of Hamilton’s principle are apparent in this example. First, there is no need to compute accelerations because the Lagrangian of the system only involves velocities appearing in the expression for the kinetic energy. Both Newtonian approach or application of d’Alembert’s principle would require the computation of accelerations, adding to the complexity of the kinematic analysis. Second, internal and reaction forces are eliminated from the formulation. If this problem were treated using Newton’s approach, the components of the internal force at point A would enter the formulation and derivation of the equations of motion would requires elimination of these forces. Example 8.5. Kinetic energy for a rigid body Figure 8.6 depicts a rigid body in its reference conſguration, as deſned by frame F E = [B, E = (¯ e1 , e¯2 , e¯3 )], where B is a reference point on the body. Position vector r B0 determines the location of a reference point B on the rigid body with respect to inertial frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. In the ſnal conſguration, the conſguration of the rigid body is deſned by frame F B = B, B ∗ = (¯b1 , ¯b2 , ¯b3 ) . The position vector of point B in the present conſguration is denoted r B = r B0 + u, where u is the displacement vector of point B. Let R0 and R be the rotation tensors that bring bases I to E and E to B ∗ , respectively, both resolved in the inertial basis, I. Superscripts (·)∗ denote tensor components resolved in the body attached basis. Find the kinetic energy of the rigid body. In section 6.2, the kinetic energy of a rigid body was found to be given by eq. (6.6). Expressing all tensor components in the body attached basis leads to 1 T K= mu˙ (R R0 )(R R0 )T u˙ + 2mu˙ T (R R0 ) η ∗T ω ∗ + ω∗T I B∗ ω∗ . 2 The components of angular velocity vector of the rigid body resolved in the inertial ˙ RT ) and ω∗ = (R R )T ω are the components of the same basis are ω = axial(R 0 vector resolved in the material basis; η and η ∗ = (R R0 )T η are the components of the position vector of the center of mass of the body with respect to reference point B resolved in the inertial and material bases, respectively; ſnally, I B∗ and I B = (R R0 )I B∗ (R R0 )T are the components of the mass moment of inertia tensor of the body with respect to reference point B, resolved in material and inertial bases, respectively.
312
8 Variational and energy principles
e3
Present Configuration
Reference configuration
b2
C
h
u, R B E
b3 b1 B
*
B e2
F
i3
C
rB I
e1 rB0, R0
O
B
M
i2
i1 Fig. 8.6. Conſguration of a rigid body.
The kinetic energy is now written in a compact form as K=
1 ∗T ∗ ∗ V MB V , 2
(8.31)
where the mass matrix of the rigid body, resolved in the body attached frame, is η ∗T mI m , (8.32) M∗B = ∗ m η I B∗ and the velocity vector, resolved in the same frame, is (R R0 )T u˙ . V∗ = ω∗
(8.33)
With this notation, the mass matrix is of size 6 × 6 and the velocity vector of size 6×1. The components of the linear and angular momentum vectors of the rigid body, resolved in the material basis, are denoted p∗ and h∗ , respectively, and deſned as ∗ p P∗ = = M∗B V ∗ . (8.34) h∗ As for the velocity vector, the momentum vector, P ∗ , is of size 6 × 1. Strictly speaking, quantities V ∗ and P ∗ are not vectors, but rather arrays, consisting of two separate vectors. To underline this fact, the script type is used to denote such quantities. The same remarks apply to the mass matrix, M∗B . Example 8.6. Equations of motion for a rigid body with respect to a material point Figure 8.6 depicts a rigid body in its reference and present conſgurations. The external loading applied to the rigid body consist of a force, F , and a moment, M B , computed with respect to reference point B. The kinematics of the problem are described in the previous example. Find the equations of motion of the rigid body using Hamilton’s principle.
8.2 Hamilton’s principle
313
The virtual work done by the externally applied loads is δW nc = δuT F + δψ T M B ,
(8.35)
where δu and δψ are the components of the virtual displacement and rotation vectors, respectively, resolved in the inertial basis. For this problem, Hamilton’s principle, eq. (8.20), reduces to % tf (δK + δW nc ) dt = 0, ti
for all arbitrary virtual displacements, where the terms associated with the temporal boundary conditions are neglected. Introducing the expression for the kinetic energy of the rigid body, eq. (8.31), and virtual work done by the non-conservative forces, eq. (8.35), this principle becomes %
tf
*
+ δV ∗T P ∗ + δuT F + δψ T M B dt = 0.
(8.36)
ti
With the help of eqs. (4.101) and (7.17b), virtual changes in the velocity vector are readily evaluated as , , T ˙ T R ) ( u δψ + δ u) ˙ (R R ) u ˙ δ (R 0 0 δV ∗ = = . ˙ (R R )T δψ δω ∗ 0
Introducing these results into Hamilton’s principle, eq. (8.36), then yields % tf * +T * + * + ˙ T R R h∗ + δuT F + δψ T M u ˙ δψ + δ u˙ R R0 p∗ + δψ B dt = 0. 0 ti
In this expression, p = (R R0 )p∗ and h = (R R0 )h∗ are the components of linear and angular momentum vectors of the body, both resolved in the inertial basis. After integration by parts, Hamilton’s principle becomes % tf 3 4 T δuT −p˙ + F + δψ T −h˙ + u dt = 0. ˙ p + M B ti
Because the virtual displacements are arbitrary, the bracketed terms must vanish, leading to the governing equations of the problem. In summary, the governing differential equations of motion for a rigid body are p˙ = F , ˙p = M , ˙h + u B p u˙ = MB . ω h
(8.37a) (8.37b) (8.37c)
314
8 Variational and energy principles
The mass matrix of the rigid body computed with respect to point B and resolved in the inertial frame is ηT mI m MB = R M∗B RT = , (8.38) m η IB where R=
0 (R R0 ) . 0 (R R0 )
(8.39)
The twelve ſrst-order differential equations in time, eqs. (8.37a) to (8.37c) should be solved to ſnd the twelve unknowns of the problem: the three components of the displacement vector, u, the three rotation components required to express the rotation tensor, R, and the three components for each linear and angular momentum vectors, p and h, respectively. Equations (8.37a) to (8.37c) present a high level of nonlinearity; indeed, the mass matrix involves a term, I B = (R R0 )I B∗ (R R0 )T , containing a product of the unknown rotation tensor, R. The equations of motion can be recast in a manner that decreases the level of nonlinearity (R R0 p∗ )· = F , ∗ ·
∗
˙ (R R0 p ) = M B , (R R0 h ) + u ∗ (R R0 )T u˙ p ∗ = M . B h∗ ω∗
(8.40a) (8.40b) (8.40c)
In this form, the mass matrix of the body is a constant, because it is expressed in the material frame. Of course, the equations of motion can be written in many different manners. Eliminating the momenta will lead to six, second-order differential equations for the six displacement components, three displacements of reference point B and three rotation components. The equations of motion could also be recast in terms of displacements and velocities, rather than momenta. In each case, the equations might be resolved in the inertial basis or in the material basis. The equations presented here are intrinsic equations of motion, i.e., they are independent of the variables used to represent rotations: the above equations could be used with any set of Euler angles, see section 4.11. In practice, however, a speciſc parametrization of ſnite rotations must be selected to solve the equations of motion. If the rigid body is tumbling in space, it is important to select a parametrization that is singularity free, as discussed in chapter 13. Example 8.7. Equations of motion for a rigid body with respect to an inertial point In the previous example, the applied moment was computed with respect to a material point, reference point B. Similarly, the linear and angular momenta deſned by eq. (8.34) are the product of the mass matrix resolved in the material frame by the velocity vector. The mass matrix, eq. (8.32), is evaluated with respect to reference
8.2 Hamilton’s principle
315
point B: indeed, η ∗ is the position vector of the body’s center of mass with respect to point B and I B∗ the body’s mass moment of inertia tensor computed with respect to the same point. Derive the equations of motion of the rigid body with respect to an inertial point. Referring to ſg. 8.6, the loading externally applied to the rigid body is FO = F, rB0 + u )F . M O = M B + (
(8.41a) (8.41b)
The externally applied force, F O , and moment, M O , are now computed with respect to inertial point O. Similarly, the linear and angular momenta of the rigid body are pO = p,
(8.42a)
hO = h + ( rB0 + u )p,
(8.42b)
where the linear momentum, pO , and angular momentum, hO , are now computed with respect to inertial point O. The governing equations of the problem derived in the previous example are given by eqs. (8.37). A linear combination of eqs. (8.37a) and (8.37b) yields h˙ + u ˙ p + ( rB0 + u )p˙ = M B + ( rB0 + u )F ; the left-hand side of this equation is an · exact derivative, leading to h + ( rB0 + u )p = M O , where eq. (8.41b) was used to simplify the right-hand side of the equation. The equations of motion of the problem, are now recast in a more compact manner as
O
p˙ O = F O ,
(8.43a)
h˙ O = M O .
(8.43b)
In the absence of externally applied forces, these equations reduce to p˙ O = 0 and
h˙ = 0; this implies the conservation of linear and angular momenta, a general result that was derived for a system of particles, see eqs. (3.81) and (3.82), respectively. To obtain a complete set of governing equations, rotation parameters must now be selected. Assuming that the rotation tensor is expressed in terms of three Euler angles stored in array q, the angular velocity vector becomes ω = H(q)˙q , where expressions for the tangent operator, H(q), are given in section 4.11 for various Euler angle sequences; see, for instance, eq. (4.68) for the 3-1-3 sequence. Equation (8.37c) now become · * +−1 pO 0 I u MB . (8.44) = q hO − ( rB0 + u )pO 0 H −1 (q) While the externally applied loads and momenta are now referred to inertial point O, reference to material point B has not been completely eliminated from the formulation; indeed, the mass matrix, MB , is still computed with respect to point B. Example 8.9 will eliminate this restriction by using a more general formalism based on the use of the motion tensor.
316
8 Variational and energy principles
Example 8.8. Rigid body connected to spring and dashpot Consider a rigid body connected to the ground at point B by means of a spring of stiffness constant k and dashpot of constant c, as depicted in ſg. 8.7 and discussed in example 6.3 on page 221. The spring is of un-stretched length 0 . The rigid body is of mass M and its moment of inertia tensor with respect to the center of mass is I C . Vector η deſnes the position of the center of mass with respect to point B. Frame F B = B, B ∗ = (¯b1 , ¯b2 , ¯b3 ) is attached to the rigid body. The components of the rotation tensor that brings inertial basis I to material basis B ∗ , resolved in basis I, is denoted R. Find the equations of motion of the system.
i3 O I
i2
g b3
c i1
k u, R
b2 B
*
B
h
C
b1 Fig. 8.7. Conſguration of the rigid body connected to a spring and dashpot.
The force, F B , applied to the rigid body at point B acts in the direction opposite to unit vector e¯ = u/u, or F B = −F e¯, where u is the position vector of point B with respect to point O and F the magnitude of the applied force. It then follows that ˙ where Δ = u − 0 is the stretch of the spring. The time rate of F = kΔ + cΔ, change of the stretch is easily found as Δ˙ = e¯T u. ˙ The formalism developed in example 8.6 will be used to solve this problem. In this case, eqs. (8.37a) and (8.37b) become p˙ = −F e¯ − M g ¯ı3 and h˙ + u ˙ p = −M g η ¯ı3 , respectively, and eq. (8.37c) is unchanged. The equations of motion are further simpliſed by expressing the linear and angular momenta with respect to inertial point O, as deſned in eqs. (8.42). The governing equations now become · pO −F e¯ − M g ¯ı3 , = −M g( η+u )¯ı3 hO and eqs. (8.44) remain unchanged. Example 8.9. Equations of motion for a rigid body using the motion formalism Consider the rigid body depicted in ſg. 8.6; the kinetic energy of the body was evaluated in example 8.5. Use the motion formalism and the motion tensor to derive the equations of motion of the rigid body. Let C 0 and C be the motion tensors that bring frame F I to F E and frame F E to F B , respectively, both resolved in frame F I . Frame F E is attached to the rigid body in its reference conſguration.
8.2 Hamilton’s principle
317
Figure 8.6 shows the force, F , and moment, M B , externally applied to the rigid body and computed with respect to reference point B. The virtual work done by these loads is δW nc = F T δu + M TB δψ, where δu is the virtual displacement vector of point B and δψ the virtual rotation vector of the rigid body. Resolving the loads in ∗ the body attached basis yields δW nc = F ∗T RT δu + M ∗T B δψ , and the following compact notation is then used, * +* +−1 δW nc = A∗T δU ∗ = AT C C 0 C C 0 δU = AT δU,
(8.45)
where A∗T = F ∗T , M ∗T is the applied load vector, and δU ∗ the virtual motion vector, both resolved in frame F B . The virtual motion vector is similar to the differential motion vector deſned in eq. (5.72b). The second equality follows from the frame change operations expressed by eqs. (5.72b) and (5.62). The velocity vector V ∗ deſned in eq. (8.33) combines the linear and angular velocity vectors of the rigid body resolved in the body attached frame. It follows that V = (C C 0 )V ∗ is the velocity vector resolved in the inertial frame. Virtual changes in the velocity vector become δV ∗ = (C C 0 )−1 δV + CδC −1 V (8.46) ˙ − VδU ˙ , − δUV = (C C 0 )−1 δU = (C C 0 )−1 δU where eqs. (5.74a) and (5.74b) were used. Because virtual changes in the kinetic energy of the system are expressed as δK = δV ∗T P ∗ , where the linear and angular momenta of the rigid body, P ∗ , are deſned in eq. (8.34), Hamilton’s principle, eq. (8.20), becomes % ˙ T (C C )−T P ∗ + δU T A dt = 0, δU 0 t
Introducing eqs. (8.34) and (5.58) then leads to % ˙ T (C C )−T M∗ (C C )−1 V + δU T (C C )−T A∗ dt = 0. δU B 0 0 0 t
The mass matrix of the rigid body in the inertial frame is deſned as MO = (C C 0 )−T M∗B (C C 0 )−1 .
(8.47)
This change of frame operation performs two tasks: a change of reference point from B to O and a change of basis from B ∗ to I. ' ˙ T P + δU T A) dt = 0, where the moHamilton’s principle now becomes t (δU mentum vector in the inertial frame is deſned as P = MO V. Integration by parts ' now yields t δU T [−P˙ + A] dt = 0. Because the virtual motions are arbitrary, the bracketed term must vanish. In summary, the governing differential equations of motion for a rigid body are
318
8 Variational and energy principles
P˙ = A,
(8.48a)
P = MO V.
(8.48b)
These twelve ſrst-order differential equations in time, should be solved to ſnd the twelve unknowns of the problem: six motion components to express the motion tensor, C, and six components of momentum, P. Note that eqs. (8.48a) and (8.48b) present a high level of nonlinearity; indeed, the mass matrix, eq. (8.47), involves the unknown motion tensor C and its inverse. Example 8.10. Numerical application This example illustrates the use of Hamilton’s principle for numerical applications. Consider a particle of mass m and position vector r(t) with respect to an inertial point, subjected to an arbitrary, time varying force, F (t). The kinetic energy of the system simply writes K = 1/2 m r˙ T r˙ and the virtual work done by the externally applied force becomes δW nc = δr T F (t). Hamilton’s principle could be used to derive the equations of motion of the system, which are m¨r = F for this simple problem; integration of these equations would then yield the trajectory of the particle. The traditional approach to solving dynamics problems consists of two steps; ſrst, one of the formulations of dynamics, such as Newton’s formulation or Hamilton’s principle, is used to derive the ordinary differential equations of motion of the system, and second, an approximate solution of these equations is obtained with the help of numerical methods. Because the equations of motion are, in general, nonlinear differential equations, numerical methods, such as Runge-Kutta integrators [5, 26], for instance, are often used to obtain numerical solutions. A different approach is taken here: algebraic or discretized equations of motion will be directly obtained from Hamilton’s principle, bypassing the derivation of the ordinary differential equations of motion. To that effect, the behavior of the particle is investigated during a small, but ſnite period of time between ti and tf , called a time step. The time step size is denoted Δt = tf − ti . For simplicity, a non-dimensional time, τ , is deſned, which is related to the dimensional time as t = 1/2 (ti + tf ) + τ Δt/2, i.e., τ = ±1 at times ti and tf , respectively. To obtain algebraic equations, the trajectory of the particle over the time step is discretized by assuming its motion to be the straight line joining its positions at times ti and tf , denoted r i and r f , respectively. It then follows that r(τ ) = ri (1 − τ )/2 + r f (1 + τ )/2. Within the time step, the velocity of the particle is constant: v = dr/dt = (dr/dτ )(dτ /dt) = (r f − ri )/Δt. Introducing these approximations into the expression for the kinetic energy then yields δK = pTm (δr f − δr i )/Δt, where pm = m(r f − ri )/Δt is the constant linear momentum of the particle within the time step. The statement of Hamilton’s principle governing the motion of the particle now becomes , % tf T δr f − δr Ti 1+τ T 1−τ T pm + δri + δr f F (t) dt Δt 2 2 ti tf tf = δr T ˆp t − δpT (r − ˆr ) t . i
i
8.2 Hamilton’s principle
319
Note that the boundary terms on the right-hand side of the principle are not ignored for this application and form an essential part of the solution. Because the time history of the particle’s trajectory was assumed, the left-hand side of the principle is readily integrated to yield (δr f − δri )T pm + δr Ti Gi + δr Tf Gf = δr Tf ˆpf − δr Ti ˆpi − δpTf (r f − ˆr f ) + δpTi (r i − ˆri ),
(8.49)
where the following two quantities were deſned % % Δt +1 1 − τ Δt +1 1 + τ F (τ ) dτ, Gf = F (τ ) dτ. Gi = 2 −1 2 2 −1 2 Equation (8.49) must be satisſed for all arbitrary variations δpi , δpf , δr i , and δr f , leading to the four equations of motion of the system ri = ˆri , −pm + Gi + ˆpi = 0,
r f = ˆrf , pm + G f − ˆ pf = 0.
(8.50a) (8.50b)
These four equations involve six unknowns, ˆpi , ˆpf , ˆr i , ˆr f , ri , and r f ; hence, two of these six quantities must be given if a solution is to be found. Initial value problems are a common class of problems for which the initial position and velocity of the particle are given. In this example, it is assumed that the position vector, ˆri , and velocity vector, vˆi = ˆpi /m, of the particle are given at the beginning of the time step. Summing up the two eqs. (8.50b) yields Gi +Gf +ˆ pi −ˆ pf = 0; clearly, % Δt +1 G = Gi + Gf = F (τ ) dτ, 2 −1 is the impulse of the externally applied force over the time step. It follows that G = ˆpf − ˆpi : this equation expresses the principle of impulse and momentum applied to the particle. Although the trajectory of the particle was approximated, this equation is exact, since the principle of impulse and momentum is a ſrst integral of Newton’s law. Because ˆpi = mˆ v i and ˆpf = mˆ v f , this equation yields the velocity of the particle at the end of the time step as vˆf = vˆi + G/m. (8.51) Next, subtracting the two eqs. (8.50b) yields 2pm + Gf − Gi − ˆ pf − ˆ pi = 0. With the help of eq. (8.51), this relationship can be used to evaluate the ſnal position of the particle as Δt ˆr f = ˆr i + Δt vˆi + G. (8.52) m i Finally, eqs. (8.52) and (8.51) can be recast as a matrix equation relating the initial position and velocity vectors of the particle to the corresponding quantities at the end of the time step
320
8 Variational and energy principles
ˆr f Δt ˆv f
11 = 01
ˆri Δt vˆi
Δt + m
Gi . G
A recursive application of this formula for a number of consecutive time steps will yield the trajectory of the particle over any period of time. Of course, more accurate solutions will be obtained when smaller time steps are used, at the expense of increased computational cost. More accurate, but also more complex, integration schemes can be obtained by assuming more complex trajectories of the particle within each time step. For instance, the particle’s trajectory could be assumed to be parabolic: r(τ ) = −τ (1 − τ ) r i /2+(1−τ 2 ) r m +τ (1+τ ) r f /2, where r m is the position vector of the particle at τ = 0, i.e., at the mid-point of the time step for t = (ti + tf )/2. Following the procedure described above, discrete equations of motion will be found corresponding to this new approximation.
8.2.3 Problems Problem 8.17. Equations of motion of rigid body Prove that the equations of motion for a rigid body obtained from Hamilton’s principle, eqs. (8.40), are identical to those obtained from d’Alembert’s principle, eqs. (8.7).
Problem 8.18. Equations of motion of a rigid body Consider the equations of motion of a rigid body, eqs. (8.37). (1) Show how these equations simplify if reference point B is chosen to be the center of mass of the rigid body, point C. Eliminate the momentum variables. Express the components of all tensors in the inertial basis. (2) Write the same equations with all tensors expressed in the body attached frame. (3) Let the orientation of the body attached frame coincide with the principal axes of inertia of the rigid body. In that case, the components the moment of inertia tensor in the material basis are denoted I C∗ = diag(I1∗ , I2∗ , I3∗ ). Show how the equations of motion expressed in the body attached frame further simplify to Euler’s equations.
Problem 8.19. Rigid body subjected to gravity forces Find the equations of motion of a rigid body subjected to gravity forces, as depicted in ſg. 8.8. Vector η deſnes the position of the body’s center of mass, point C, with respect to its reference point B. The potential of the gravity forces is V = −mg T r C , where m is the total mass of the rigid body, g the components of the gravity acceleration vector, and r C the components of the position vector of the center of mass in the present conſguration, both resolved in I.
Problem 8.20. Pendulum with sliding mass Solve problem 6.34 using Hamilton’s principle.
Problem 8.21. Pendulum with rotating mass Solve problem 4.38 using Hamilton’s principle.
Problem 8.22. Equations of motion of a rigid body in terms of Euler angles The equations of motion for a rigid body, eqs. (8.40), are intrinsic equations, i.e., no reference is made to a speciſc representation of ſnite rotations, the sole rotation tensor appears in the equations. Let the conſguration of a rigid body be described with the following six generalized
8.2 Hamilton’s principle
Reference configuration e30
Present Configuration u, R
e2
C
e20 B e10
h
C rC0 rB0, R0
rC
i3
321
e3
E B
e1
rB
O I
i2
i1 Fig. 8.8. Conſguration of a rigid body with center of mass. coordinates: the three components of the position vector, u, of reference point B, and the three Euler angles φ, θ, and ψ using the 3-1-3 sequence that express the orientation of the body. (1) Express the kinetic energy of the body in terms of these generalized coordinates. (2) Write Hamilton’s principle if the body is subjected to externally applied forces, F , and moments, M B . (3) Derive the equations of motion of the system from Hamilton’s principle. (4) Compare your results with eqs. (8.40); comment on the difference between the two sets of equations.
Problem 8.23. Rigid body connected to spring and dashpot Consider a rigid body connected to the ground at point B by means of a spring of stiffness constant k and dashpot of constant c, as depicted in ſg. 8.7 and discussed in example 8.8. The rigid body is of mass M and its moment of inertia tensor with respect to the center of mass is I C . Vector η deſnes the position of the center of mass with respect to point B. Frame F B = B, B = (¯b1 , ¯b2 , ¯b3 ) is attached to the rigid body. The components of the rotation tensor that brings the inertial basis I to basis B, resolved in basis I, is denoted R. Find the equations of motion of the system based on Hamilton’s principle. (1) Solve the equations of motion of the problem using the momenta computed with respect to inertial point O. (2) On one graph, plot the time histories of the three components of vector x ¯ C . (3) On one graph, plot the Euler angles ψ, θ, and φ as a function of τ . (4) On one graph, plot the time histories of the three components of the velocity vector, v¯C = v C /(Ω0 ). (5) On one graph, plot the time histories of the three components of the angular velocity vector, ω ¯ = ω/Ω. (6) On one graph, plot the time histories of the forces in the elastic spring and dashpot. (7) On one graph, plot the kinetic and potential energies of the system as well as the energy dissipated in the dashpot. Verify the energy closure equation. Treat the problem using a non-dimensional scheme with τ = Ωt, ¯ C = xC /0 , and 0 the un-stretched length of the spring. Use the following Ω 2 = k/M , x data: g¯ = g/(0 Ω 2 ) = 0.4; ζ = c/(2M Ω) = 0.1; η¯∗ = η ∗ /0 = [0.8, 1.25, −1.8]T ; C∗ I¯ = I C∗ /(M 20 ) = diag(1, 2.3, 1.5). Use the following initial conditions: x ¯ T (τ = 0) = B
[0, 1, 0]; q T (τ = 0) = [0, 0, 0] and the system is at rest. Present the response on the system for τ ∈ [0, 100]. Hint: to avoid singularities, use Euler angles, ψ, θ, and φ, with the 3-2-1 sequence, as deſned in section 4.11.3, to represent the rotation of the rigid body.
Problem 8.24. Change of frame for mass matrix
Consider two frames: an inertial frame F I and a material frame F. Let M∗ , as given by eq. (8.47), be the mass matrix of a rigid body expressed in the material frame. Show that mass matrix M deſned as M = C −T M∗ C −1 , is the mass matrix of the rigid body in the inertial
322
8 Variational and energy principles
frame, if C is the motion tensor that bring frame F I to frame F, resolved in F I . The above transformation performs two tasks: a change of reference point and a change of orthonormal basis. Note: it will be necessary to use the parallel axis theorem.
Problem 8.25. Higher order integration scheme Generalize the time integration scheme derived in example 8.10 by approximating the trajectory of the particle as r(τ ) = −τ (1 − τ ) r i /2 + (1 − τ 2 ) r m + τ (1 + τ ) r f /2, where r m is the position vector of the particle at τ = 0, i.e., at the mid-point of the time step for t = (ti + tf )/2.
8.3 Lagrange’s formulation As demonstrated in the previous sections, the equations of motions of dynamical systems can be derived from Hamilton’s principle. The procedure involves integrations by parts of all the terms featuring variations of time derivatives of the generalized coordinates. Such derivation can become cumbersome when a large number of generalized coordinates is present; hence, it is desirable obtain the equations of motion of dynamical systems in a more systematic manner, bypassing many of the steps required by Hamilton’s principle. Consider a system featuring n degrees of freedom and described by n generalized coordinates, q. In general, the Lagrangian of the system will be a function of all generalized coordinates and their time derivatives, as well as time, L = L(q, q˙ , t). Variation of the Lagrangian then becomes δL =
∂L T ∂L T δq + δ q. ˙ ∂q ∂ q˙
(8.53)
Introducing this expression into Hamilton’s principle, eq. (8.20), and ignoring the boundary terms leads to 2 % tf 1 ∂L T ∂L T nc dt = 0. (8.54) δq + δ q˙ + δW ∂q ∂ q˙ ti The second term in the integrand involves variations of the generalized velocities, δ q, ˙ that are clearly not independent of the variations in the generalized coordinates, δq. To remedy this situation, this second term is integrated by parts %
tf
ti
∂L T δ q˙ dt = − ∂ q˙
%
tf ti
d dt
1
∂L T ∂ q˙
2
∂L T δq dt + δq ∂ q˙
tf
With the help of this result, Hamilton’s principle becomes % tf d ∂L ∂L + δq T − + Qnc dt = 0, dt ∂ q˙ ∂q ti
.
(8.55)
ti
(8.56)
8.3 Lagrange’s formulation
323
for all arbitrary variations δq. Here again, boundary terms are ignored; the virtual work done by the non-conservative forces was written as δW nc = δq T Qnc , where Qnc are the generalized, non-conservative forces acting on the system. Because variations in generalized coordinates are arbitrary, the bracketed term must vanish, revealing Lagrange’s equations of motion d ∂L ∂L − = Qnc . (8.57) dt ∂ q˙ ∂q In the above developments, the partial derivatives of the Lagrangian with respect to the generalized velocities play a key role. These quantities, denoted p, are called generalized momenta and are deſned by eq. (8.25). With the help of these generalized momenta, Lagrange’s equations of motion take the following form ∂V ∂K p˙ − + = Qnc . ∂q ∂q The third term of this equation, ∂V /∂q, is associated with the conservative forces acting on the system: Qc = −∂V /∂q. Lagrange’s equations now become ∂K − p˙ − + Qc + Qnc = 0. ∂q The ſrst term represents the inertial forces acting on the system; these forces are generalized forces, because they act in the conſguration space. The following notation is introduced for the generalized inertial forces ∂K I Q = − p˙ − . (8.58) ∂q Lagrange’s equations are now cast in a simple form, QI + Qc + Qnc = 0, and imply that the sum of all forces acting on the system must vanish at each instant in time. Clearly, Lagrange’s equations are a statement of dynamic equilibrium; of course, inertial forces must be considered together with all externally applied forces for dynamic equilibrium conditions to be satisſed. Example 8.11. Pendulum mounted on a cart Figure 8.5 depicts a pendulum of length L and mass m mounted on a cart of mass M that is connected to the ground by means of a spring of stiffness constant k and of a dashpot of constant c. The displacement of the cart is denoted x, which is also the stretch of the spring, and angle θ measures the deƀection of the pendulum with respect to the vertical. Gravity acts on the system as indicated in the ſgure. This problem was treated in example 8.4 on page 310 using Hamilton’s principle. Derive the equations of motion of the system using Lagrange’s formulation.
324
8 Variational and energy principles
The kinetic and potential energies of the system are given by eqs. (8.28) and (8.29), respectively. The Lagrangian of the system then becomes L=
1 1 mL2 ˙2 mL ˙ 1 mgL (M + m)x˙ 2 + θ + x˙ θCθ − kx2 − (1 − Cθ ). 2 2 3 2 2 2
The system’s generalized momenta are found as px =
mL ˙ ∂L = (M + m)x˙ + θCθ , ∂ x˙ 2
pθ =
∂L mL2 ˙ mL xC ˙ θ. = θ+ ˙ 3 2 ∂θ
The derivatives of the Lagrangian with respect to the generalized coordinates are ∂L = −kx, ∂x
∂L mL ˙ mgL =− x˙ θSθ − Sθ . ∂θ 2 2
Finally, the virtual work done the non-conservative forces is δW nc = −cxδx, ˙ and hence, the generalized forces are Qx = −cx˙ and Qθ = 0. Lagrange’s formulation then yield the equations of motion of the system mL ¨ (θCθ − θ˙2 Sθ ) + kx = −cx; ˙ 2 mL2 ¨ mL ˙ θ ) + mL x˙ θS ˙ θ + mg L Sθ = 0. θ+ (¨ xCθ − x˙ θS 3 2 2 2 (M + m)¨ x+
These equations are, of course, identical to those obtained earlier, see eqs. (8.30), using Hamilton’s principle. Clearly, Lagrange’s formulation provides an effective procedure for deriving the system’s equations of motion. The integrations by parts associated with the application of Hamilton’s principle are completely bypassed in this approach. Example 8.12. Swiveling plate Figure 6.10 depicts a homogeneous, rectangular plate of height a, width b, and mass m connected to the ground by a rigid, massless link of length d, as discussed in example 6.2 on page 219. At point O, a bearing allows the link to rotate with respect to unit vector ¯ı3 , and at point B, the plate is free to rotate with respect to the link about axis a ¯1 . Three frames will be used in this problem: the inertial frame, F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )], a frame attached to the link, F A = [O, A+ = (¯ a1 , a ¯2 , a ¯3 )], and ſnally, a frame attached to the plate at its center of mass, F P = C, B ∗ = (¯b1 , ¯b2 , ¯b3 ) . A planar rotation of magnitude α about axis ¯ı3 brings basis I to A+ , and a planar rotation of magnitude β about axis a ¯1 brings basis A+ to B ∗ . Rotation tensors Rα and Rβ denotes these two planar rotations, respectively; tensor components resolved in bases A+ and B ∗ are denoted with superscripts (·)+ and (·)∗ , respectively. The inertial angular velocity vector of the plate is readily found with the help of the addition theorem as ¯3 +β˙ a ¯1 ; the components of this vector in basis B ∗
ω = α˙ a ˙ αS then become ω ∗T = β, ˙ β , αC ˙ β . The inertial position of the center of mass of the plate is rC = (d + a/2) a ¯1 and the velocity vector v C = (d + a/2)α˙ a ¯2 . Body
8.3 Lagrange’s formulation
325
attached frame F P is located at the center of mass of the plate and is aligned with its principal axes of inertia. The principal moments of inertia are given in ſg. 6.45 as ∗C ∗C ∗C I11 = mb2 /12, I22 = ma2 /12, I33 = m(a2 + b2 )/12. The kinetic energy of the system becomes 2 1 a b 1 a2 a2 + b 2 2 2 K = m(d + )2 α˙ 2 + m β˙ 2 + m α˙ 2 Sβ2 + m α˙ Cβ , 2 2 2 12 12 12 where Sβ = sin β and Cβ = cos β. The potential energy associated with the forces of gravity is easily found as V = −mg(d + a/2)Sα , and the Lagrangian of the system becomes a 1 m b 2 ˙ 2 a2 2 b 2 2 2 a L = m(d + )2 α˙ 2 + β + α˙ + α˙ Cβ + mg(d + )Sα . 2 2 2 12 12 12 2 The generalized momenta of the system are found by taking derivatives of the Lagrangian with respect to the generalized velocities α˙ and β˙ to ſnd ∂L a2 ∂L b2 ˙ a b2 pα = ˙ and pβ = = m β, = m (d + )2 + + Cβ2 α, ∂ α˙ 2 12 12 12 ∂ β˙ respectively. The partial derivatives of the Lagrangian with respect to the generalized coordinates α and β are ∂L a = mg(d + )Cα ∂α 2
and
b2 ∂L = −m αS ˙ β Cβ , ∂β 12
respectively. Lagrange’s formulation then yield the equations of motion of the system a b2 a d a2 m (d + )2 + + Cβ2 α˙ − mg(d + )Cα = 0, dt 2 12 12 2 2 2 d b b m β˙ + m αS ˙ β Cβ = 0. dt 12 12 It is readily veriſed that these equations of motion are identical to those obtained from Newton’s approach, eqs. (6.34). Example 8.13. The double pendulum with elastic joint Figure 6.17 depicts a double pendulum comprising bar 1 of mass m1 and length 1 , and bar 2 of mass m2 and length 2 , as treated in example 6.5 on page 231. Let frame F A = [A, A = (¯ a1 , a ¯2 )] be attached to bar 1 and frame F E = [E, E = (¯ e1 , e¯2 )] be attached to bar 2. A massless tube allows bar 2 to slide in the direction of a ¯2 ; the slider has a mass M and is connected to bar 1 at point A by means of a spring of stiffness constant k. The position of the slider is determined by its distance, x, from point A, the tip of bar 1; the angular positions of the two bars with respect to the vertical are denoted θ1 and θ2 , respectively. The system is subjected to gravity along the inertial direction ¯ı1 . Derive the equations of motion of the system.
326
8 Variational and energy principles
This problem features three degrees of freedom and will be described by means of three generalized coordinates: angles θ1 and θ2 giving the angular positions of the two bars and the sliding distance, x. The angular velocity of bar 1 is θ˙1 and hence, its kinetic energy is K1 = 1/2 m1 21 θ˙12 /3. The position vector of mass M is r M = 1 a ¯1 + x¯ a2 and its kinetic energy then follows as KM =
1 2 M (1 + x2 )θ˙12 + x˙ 2 + 21 x˙ θ˙1 . 2
Similarly, the position vector of the center of mass of bar 2 is r 2 = 1 a ¯1 + x¯ a2 + 2 /2 e¯1 and its kinetic energy becomes 1 2 K2 = m2 (21 + x2 )θ˙12 + x˙ 2 + 21 x˙ θ˙1 + 2 θ˙22 2 4 1 m 2 2 2 ˙2 + 2 xθ˙1 θ˙2 S21 + 2 (x˙ + 1 θ˙1 )θ˙2 S21 + θ , 2 12 2 where the following notation was introduced: S21 = sin(θ2 − θ1 ) and C21 = cos(θ2 − θ1 ). The last term of this expression represent the kinetic energy associated with the angular motion of the bar. The strain energy for the elastic spring is simply Vs = 1/2 kx2 . The potential energy of the gravity forces is Vg = −m1 g¯ıT1 (
1 2 a ¯1 ) − M g¯ıT1 (1 a ¯1 + x¯ a2 ) − m2 g¯ıT1 (1 a ¯1 + x¯ a2 + e¯1 ). 2 2
Combining kinetic, strain, and potential energies yields the Lagrangian of the system as 1 m1 m2 22 ˙2 L= ( + M2 )21 θ˙12 + M2 (x˙ 2 + x2 θ˙12 ) + θ 2 3 3 2 + 2M2 1 x˙ θ˙1 + m2 2 x˙ θ˙2 C21 + m2 2 θ˙1 θ˙2 (1 C21 + xS21 ) m1 m2 2 1 + M2 )g1 C1 − M2 gxS1 + gC2 , − kx2 + ( 2 2 2 where the following notation was introduced: S1 = sin θ1 and C1 = cos θ1 , with similar conventions for angle θ2 , and M2 = M + m2 . The generalized momenta of the system are found by taking derivatives of the Lagrangian with respect to the generalized velocities θ˙1 , θ˙2 , and x˙ to ſnd ∂L m1 m2 2 ˙ + M2 )21 θ˙1 + M2 x2 θ˙1 + M2 1 x˙ + θ2 (1 C21 + xS21 ), =( ˙ 3 2 ∂ θ1 ∂L m2 22 ˙ m2 2 m2 2 ˙ θ2 + xC ˙ 21 + θ1 (1 C21 + xS21 ), = p2 = ˙ 3 2 2 ∂ θ2 ∂L m2 2 ˙ = M2 (x˙ + 1 θ˙1 ) + θ2 C21 , p3 = ∂ x˙ 2
p1 =
8.3 Lagrange’s formulation
327
respectively. The partial derivatives of the Lagrangian with respect to the generalized coordinates θ1 , θ2 , and x are ∂L m2 2 ˙ m1 θ2 xS + M2 )g1 S1 + M2 gxC1 , ˙ 21 + θ˙1 (1 S21 − xC21 ) − ( = ∂θ1 2 2 ∂L m2 2 ˙ m2 2 ˙ ˙ m2 2 x˙ θ2 S21 − θ1 θ2 (1 S21 − xC21 ) − gS2 , =− ∂θ2 2 2 2 ∂L m2 2 ˙ ˙ = M2 xθ˙12 + θ1 θ2 S21 − kx − M2 gS1 , ∂x 2 respectively. Lagrange’s formulation then yields the equations of motion of the problem as ⎡ ⎤ m1 m2 2 ⎧ ⎫ ( + M2 )21 + M2 x2 (1 C21 + xS21 ) M2 1 ⎢ 3 ⎥ ⎨θ¨1 ⎬ 2 2 ⎢ m2 2 ⎥ m m 2 2 2 2 ⎢ ¨ (1 C21 + xS21 ) C21 ⎥ ⎢ ⎥ ⎩ θ2 ⎭ 2 3 2 ⎣ ⎦ x ¨ m2 2 C21 M2 1 M2 2 ⎧ ⎫ m 2 2 2 ⎪ ⎪ ˙ ˙ ⎪ 2M ( x x ˙ θ − S − xC ) θ 2 1 1 21 21 2 ⎪ ⎪ ⎪ ⎪ ⎪ 2 ⎨ ⎬ m 2 2 2 + m2 2 x˙ θ˙1 S21 + (1 S21 − xC21 )θ˙1 ⎪ ⎪ 2 ⎪ ⎪ ⎪ ⎪ m2 2 ˙ 2 ⎪ ⎪ ⎩ ⎭ −M2 xθ˙12 − θ2 S21 + kx 2 ⎧ m1 ⎫ ⎪( ⎪ + M2 )g1 S1 + M2 gxC1 ⎪ ⎪ ⎨ 2 ⎬ m 2 2 + = 0. gS2 ⎪ ⎪ ⎪ ⎪ 2 ⎩ ⎭ M2 gS1 This Lagrangian approach to the problem should be contrasted with Newton’s formulation discussed in example 6.5. As shown in ſg. 6.17, Newton’s formulation involves the internal forces at the joint, Fs and R, and the reaction forces at point O, VO and HO . With Lagrange’s formulation, the internal and reaction forces are eliminated because the work they perform vanishes and the elastic force in the spring is taken into account by the strain energy of the spring. Example 8.14. The milling machine Consider the simpliſed model of a milling machine depicted in ſg. 6.28. The tool support is a rigid body of mass m and moment of inertia I O with respect to point O connected to the ground at point O. Its center of mass is located at point A, which is at a distance 1 from point O. A torsional spring of stiffness constant kθ and un-stretched rotation θ0 , and a torsional dashpot of constant cθ act at point O. Let frame F E = [O, E = (¯ e1 , e¯2 )] be attached to the tool support; the angle between unit vectors ¯ı1 and e¯1 is denoted θ. A massless, rigid bar DB of length 2 is free to slide inside the tool support. A spring of stiffness constant kx and the un-stretched length x0 , and a dashpot of constant cx connect the tool support at point A to the bar at point D.
328
8 Variational and energy principles
At point B, the bar connects tothe milling machine tool, which is free to rotate about point B. Let frame F B = B, B = (¯b1 , ¯b2 ) be attached to the tool, which center of mass C is located a distance d along axis ¯b1 . The tool rotates at a constant angular velocity, Ω, with respect to the bar, such that the angle between unit vectors e¯1 and ¯b1 is φ = Ωt. The tool is of mass M and moment of inertia I B with respect to point B. Derive the equations of motion of the system using Lagrange’s formulation and the following generalized coordinates: θ and x, the distance from point A to D. The kinetic energy of the system consists of two parts: the kinetic energy of the tool support and that of the tool. Since point O is an inertial point, the kinetic energy of the tool support is simply I O θ˙2 /2. The kinetic energy of the tool is the sum of its translational and rotational components. The position vector of the center of mass of the tool is r C = (1 + 2 + x) e¯1 + d ¯b1 and its velocity then becomes v C = x˙ e¯1 + (1 + 2 + x)θ˙ e¯2 + d(θ˙ + Ω) ¯b2 . Because the angular velocity of the tool is (θ˙ + Ω), the kinetic energy of the system becomes 1 O ˙2 1 1 I θ + M vC 2 + I C (θ˙ + Ω)2 2 2 2 1 O 1 1 I + M (12 + x)2 + 2M d(12 + x)Cφ θ˙2 + M x˙ 2 + I B (θ˙ + Ω)2 = 2 2 2 ˙ ˙ − M dx( ˙ θ + Ω)Sφ + M d(12 + x)Ω θCφ ,
K=
where Sφ = sin φ, Cφ = cos φ, and 12 = 1 + 2 . The parallel axis theorem was used to deſne I B = I C + M d2 , where I B is the moment of inertia of the tool evaluated with respect to point B. The potential of the gravity forces is easily found as V = −mg1 Cθ − M g [(12 + x)Cθ + dCθ+φ ], where Sθ = sin θ, Cθ = cos θ, Sθ+φ = sin(θ + φ), and Cθ+φ = cos(θ + φ). The Lagrangian of the system now becomes L=
1 O 1 I + M (12 + x)2 + 2M d(12 + x)Cφ θ˙2 + M x˙ 2 2 2 1 B ˙ 2 ˙ φ + I (θ + Ω) − M dx( ˙ θ˙ + Ω)Sφ + M d(12 + x)Ω θC 2 + mg1 Cθ + M g [(12 + x)Cθ + dCθ+φ ] .
The generalized momenta of the system are found by taking the partial derivatives of the Lagrangian with respect to the generalized velocities to ſnd pθ =
∂L O = I + M (12 + x)2 + 2M d(12 + x)Cφ θ˙ ˙ ∂θ + I B (θ˙ + Ω) − M dxS ˙ φ + M d(12 + x)ΩCφ ,
and
∂L = M x˙ − M d(θ˙ + Ω)Sφ . ∂ x˙ Finally, the derivatives of the Lagrangian with respect to the generalized coordinates are found as px =
8.3 Lagrange’s formulation
329
∂L = −mg1 Sθ − M g [(12 + x)Sθ + dSθ+φ ] , ∂θ and
∂L ˙ φ + M gCθ . = M (12 + x)θ˙2 + M dθ˙2 Cφ + M dΩ θC ∂x The governing equations of motion of the system then follow from Lagrange’s equation as O I + I B + M (12 + x)2 + 2M d(12 + x)Cφ −M dSφ θ¨ −M dSφ M x ¨ ˙ ˙ 2M [(12 + x) + dCφ ] x˙ θ − M d(12 + x)ΩSφ (2θ + Ω) + −M (12 + x)θ˙2 − M d(θ˙ + Ω)2 Cφ M g [(12 + x)Sθ + dSθ+φ ] + mg1 Sθ − Qθ = 0. + −M gCθ − Qx To evaluate the generalized forces, Qθ and Qx , the virtual work done by the externally applied forces is computed. The linear spring/dashpot system applies forces of equal magnitude and opposite directions at point A and D; the virtual work is then δW = δr TA (Fsd e¯1 ) + δr TD (−Fsd e¯1 ). Since rA = 1 e¯1 and r D = (1 + x)¯ e1 , it follows that δW = 1 δθ¯ eT2 (Fsd e¯1 ) + δx¯ eT1 + (1 + x)δθ¯ eT2 (−Fsd e¯1 ) = −δxFsd . Hence, Qx = −Fsd , where Fsd = kx (x − x0 ) + cx x. ˙ Similarly, the virtual work done by the torsional spring/dashpot is simply δW = δθ(−Msd ), leading to the gen˙ The elastic springs eralized force Qθ = −Msd , where Msd = kθ (θ − θ0 ) + cθ θ. of stiffness constants kθ and kx could have been accounted for through strain energy expressions, V = 1/2 kθ (θ − θ0 )2 and 1/2 kx (x − x0 )2 , respectively. In this particular case, however, it is convenient to treat the spring/dashpot components as single units and compute the virtual work done by both elastic and viscous forces simultaneously. Example 8.15. The quick return mechanism The quick return mechanism shown in ſg. 8.9 consists of a uniform crank of length Lc and mass mc , and of a uniform arm of length La and mass ma . The crank is pinned at point R and the arm at point O; the distance between these two points is denoted d. At point S, a slider allows the tip of the crank to slide along the arm. A mass M is attached at point T, the tip of the arm. A spring of stiffness constant k connects point T to inertial point A; the spring is un-stretched when the arm is in the vertical position. Use Lagrange’s formulation to derive the equations of motion of the problem. This is a single degree of freedom problem, and angle θ will be selected as the single generalized coordinate. The kinematics of the problem are addressed ſrst. Considering triangle ORS, it is clear that β = φ − θ, and the law of sines then yields Lc sin(φ − θ) = d sin φ. Solving for angle φ leads to tan φ = −
¯ c Sθ L ¯ c Cθ , 1−L
¯ c Cθ 1−L Cφ = − , ¯ c Cθ ) − (1 − L ¯ 2c ) 2(1 − L
330
8 Variational and energy principles
A k
M T
La ma Lc
R q d
VR
mc S b
La
i2 I
O
f
HR
R mc
i1
S Fs
Fig. 8.9. Quick return mechanism.
¯ c = Lc /d, sin θ = Sθ , cos θ = Cθ , and cos φ = Cφ . A time derivative of the where L ſrst equation yields the relationship between the angular velocities of the two bar, ¯ c Cθ ) − (1 − L ¯ 2c ) (1 − L ˙ ˙ φ˙ = ¯ c Cθ ) − (1 − L ¯ 2c ) θ = h(θ)θ. 2(1 − L The system’s kinetic energy is K = [mc L2c θ˙2 /3 + (M + ma /3)L2a φ˙ 2 ]/2, where the ſrst term represents the kinetic energy of the crank and the second that of the arm. The following non-dimensional time is introduced: τ = ωt, where ω 2 = k/M , and notation (·) indicates a derivative with respect to τ . The system’s non-dimensional kinetic energy now becomes ¯2 ¯ = K = 1 μc Lc + (1 + μa )L ¯ 2 h2 θ2 , K kd2 2 3 3 a ¯ a = La /d, μa = ma /M , and μc = mc /M . where L The potential energy of the spring is V = 1/2 kΔ2 , where Δ is the stretch of the spring. The law of cosines applied to triangle OMA yields Δ2 = L2a + L2a − 2L2a cos(π − φ) = 2L2a (1 + Cφ ). The non-dimensional potential of the spring then becomes V ¯ 2 (1 + Cφ ). V¯ = 2 = L a kd ¯ = L/(kd2 ) = K ¯ − V¯ , and The non-dimensional Lagrangian of the system is L the generalized momentum is ¯2 ¯ μa ¯ 2 2 ∂L μc L c pθ = = + (1 + )L h θ . ∂θ 3 3 a The derivative of the Lagrangian with respect to the generalized coordinate θ is
8.3 Lagrange’s formulation
331
¯ ∂L μa ¯ 2 2 ∂h ¯ 2 ∂Cφ = (1 + )L θ h − La , ∂θ 3 a ∂θ ∂θ where ¯ c Sθ ¯ 2 )L ¯ c Sθ ∂h (1 − L hL ∂Cφ c = = − . 2 , ¯ c Cθ ) − (1 − L ¯ 2c ) ∂θ ∂θ ¯ c Cθ ) − (1 − L ¯ 2c ) 2(1 − L 2(1 − L Finally, the single equation of motion of the problem is found from Lagrange’s formulation as ¯2 μc L c μa 2 μa 2 ∂h ∂Cφ (8.61) ¯ 2a + (1 + 3 )h θ + (1 + 3 )θ h ∂θ + ∂θ = 0. 3L Although the quick return mechanism is a simple mechanical system that features a single degree of freedom, the system’s equation of motion is complex and extensive algebraic manipulations are required for its derivation. 16
7
14
6
12 5
4
q¢ AND f¢
q AND f
10
3
8 6 4
2 2 1
0
0
0
0.2
0.4
0.6
t
0.8
1
1.2
Fig. 8.10. Angular position of the two bars, θ, solid line, and φ, dashed line.
-2
0
0.2
0.4
0.6
t
0.8
1
1.2
Fig. 8.11. Angular velocities of the two bar, θ , solid line, and φ , dashed line.
The system was simulated for the following values of the non-dimensional pa¯ a = 3, L ¯ c = 0.5, μa = 1, μc = 2.4. At the initial time, rameters of the system: L the crank is in the vertical position, θ = 0 and θ = 2. The system was simulated for τ ∈ [0, 1.2], which corresponds approximately to one complete revolution of the crank. Figure 8.10 shows the time histories of the angular positions of the two bars, θ and φ, and their angular velocities are depicted in ſg. 8.11. At times τ = 0.2762 and 0.8806, the angular velocity of the crank change very rapidly and that of the arm quickly changes sign. This corresponds to the conſguration of the system when the crank is perpendicular to the arm, forcing a quick reversal of the arm’s direction of motion. Figure 8.12 shows the evolution of the system’s kinetic and potential energies. Because the system is conservative, the total mechanical energy of the system remains constant, as also depicted in the ſgure.
332
8 Variational and energy principles 4000
NORMAL FORCE AT POINT S
SYSTEM ENERGIES
25
20
15
10
5
3000 2000 1000 0 -1000 -2000 -3000
0
0
0.2
0.4
0.6
t
0.8
1
1.2
Fig. 8.12. System energies: kinetic energy, solid line; potential energy, dashed line; total mechanical energy, dashed-dotted line.
-4000
0
0.2
0.4
0.6
t
0.8
1
1.2
Fig. 8.13. Non-dimensional magnitude of the force at point S.
The reaction and internal forces of the system are automatically eliminated when using Lagrange’s formulation. It is often important, however, to evaluate both reaction and internal force. Of particular interest in this case is the internal force in the slider. Because this force was eliminated from the formulation, Lagrange’s equations provide no information concerning these forces. Figure 8.9 shows a free body diagram of the crank; at its tip, the force exerted by the slider on the crank, denoted Fs , is acting in the direction normal to the arm. For this planar system, the pivot equation written with respect to point R yields ¨ = −Fs cos(π − φ + θ). In non-dimensional form, this becomes mc L2c θ/3 ¯ c Cθ ) − (1 − L ¯ 2) ¯ c 2(1 − L L F μ s c c θ . = F¯s = 2 ¯ ¯ kLc 3 (1 − Lc Cθ ) − (1 − Lc ) Figure 8.13 shows the time history of the force the slider applies on the crank. At times τ = 0.2762 and 0.8806, large magnitudes of this force is observed. At these two instants, the crank is normal to the arm, which reverses the direction of its motion, leading to large accelerations and hence, large forces. Example 8.16. Equations of motion of a rigid body Figure 8.14 depicts a rigid body deſned by frame F B = B, B ∗ = (¯b1 , ¯b2 , ¯b3 ) , where point B is a reference point on the body and B ∗ a body attached basis. The displacement vector, u, deſnes the location of the reference point B of the rigid body with respect to inertial frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. Let R be the rotation tensor that bring basis I to basis B ∗ , resolved in basis I. The body is subjected to external forces, F , and external moments, M B , computed with respect to the reference point of the body. Find the equations of motion of the rigid body using Lagrange’s formulation and contrast the resulting equations with those obtained in example 8.6 using Hamilton’s principle. In example 8.6, the kinetic energy of the rigid body was derived as eq. (8.31), where the 6 × 6 mass matrix, M∗B , of the rigid body is given by eq. (8.32) and the velocity array, V ∗ , by eq. (8.33). Finally, the components of the linear and angular
8.3 Lagrange’s formulation
333
b3 b2 u, R i3 O i1
B B* b1
I
i2
Fig. 8.14. Conſguration of a rigid body.
momentum vectors of the body were deſned by eq. (8.34). The generalized coordinates of the system are the three displacements, u, of the reference point of the body and three parameters, q, deſning rotation tensor R. These parameters could be a set of Euler angles with a speciſc sequence, as deſned in section 4.11. For this problem, the Lagrangian of the system simply equals its kinetic energy, L = K. The generalized momenta of the system are computed with the help of the chain rule for derivatives ∗ T
∗ ∂L ∂K ∂V ∗ = ∗ = R 0 P = R p = p, ∂ u˙ ∂ u˙ ∂V ∗ T ∗
∂L ∂K ∂V ∗ T T T = ∗ = 0 H R P = H R h = H h, ∂ q˙ ∂ q˙ ∂V where p = R p∗ and h = R h∗ are the components of linear and angular momentum vectors of the body, both resolved in basis I. As expected, the generalized momentum associated with the generalized velocities u˙ are the components of the linear momentum vector of the rigid body in the inertial frame. To compute the generalized momenta associated with the generalized velocities q˙ it is important to recognize that the angular velocity vector is related to the time derivative of the rotation parameters, q. ˙ Indeed, ω ∗ = RT ω = RT H q˙ , where the tangent operator, H, is deſned in section 4.11 if Euler angles with various sequences are used to represent the orientation of the rigid body. It then follows that ∂ω ∗ /∂ q˙ = RT H. The generalized momentum associated with the generalized velocities q˙ is not the angular momentum, h, of the body, but rather H T h. The next step is to evaluate the derivatives of the Lagrangian with respect to he generalized coordinates. Clearly, ∂L/∂u = 0 since the displacement vector u does not appear in the Lagrangian. The derivative of the Lagrangian with respect to the rotation parameters now becomes , ∗ T ∗ T ∂L ∂K ∂V T ∂ω = = HT u ˙ R, RT R P ∗ ∂q ∂q ∂V ∗ ∂q ∗ T ∂ω T ˙T p+ =H u RT h. ∂q
334
8 Variational and energy principles
˙ H. FurtherTo derive these results, eq. (4.82) was used to obtain ∂(RT u)/∂q ˙ = RT u T T ∗ ∗ more, ∂ R ω /∂q = ∂ω /∂q = R R ∂ω /∂q. Finally, in view of identity (4.85), this simpliſes to T ∂L ˙ T h. ˙ p + H = HT u ∂q The equations of motion of the system now follow from Lagrange’s formulation as p˙ = F , where F are the generalized forces associated with generalized coordinates u because the corresponding virtual work is δW = δuT F , and T · T H h − HT u ˙ p − H˙ T h = H T M B , where H T M B are the generalized forces associated with generalized coordinates q because the corresponding virtual work is δW = δψ T M B = δq T H T M B . The ſrst equation is the familiar equation of motion for the center of mass and is identical to that obtained from Hamilton’s principle, see eq. (8.37a). The second equation is not identical to its counterpart, eq. (8.37c), obtained from Hamilton’s principle. Expanding the time derivative of the ſrst term leads to H T h˙ + u ˙ p = H T M B . In this form, it is apparent that the equations of motion obtained from Lagrange’s formulation are a linear combination of those resulting from Hamilton’s principle. This example, which involves the three-dimensional rotation of a rigid body, calls for the following remarks concerning the application of Lagrange’s method. From the onset, this approach describes the system in terms of a speciſc set of generalized coordinates. The formulation requires the choice of a speciſc set of parameters, q, to represent the three-dimensional rotation because the derivation of the equations of motion will require the computation of the following derivatives: ∂L/∂ q˙ and ∂L/∂q. Consequently, the angular velocity vector, which the kinetic energy explicitly depends on, must be expressed in terms of these parameters as ω = H(q)˙q . The equations of motion depend on the tangent operator, H, which is speciſc to the parametrization selected to represent ſnite rotations. This contrasts with d’Alembert’s and Hamilton’s formulations that do not require the selection of speciſc rotation parameters, see eqs. (8.7a) and (8.7b) for d’Alembert’s principle and eqs. (8.37a) to (8.37c) for Hamilton’s principle. These two approaches are based on the concept of virtual rotation and lead to intrinsic equations of motion, i.e., equations that are independent of the speciſc choice of rotation parameters. The use of Lagrange’s formulation also complicates the algebra because derivatives such as ∂ω/∂ q˙ and ∂ω/∂q are required obtain the equations of motion. Furthermore, identity (4.85) is required to simplify the equations of motion.
8.3 Lagrange’s formulation
8.3.1 Problems Problem 8.26. Particle sliding along a circular ring Solve problem 3.12 using Lagrange’s formulation.
Problem 8.27. Flexible pendulum on a slider Solve problem 3.34 using Lagrange’s formulation.
Problem 8.28. Pendulum with rotating mass Solve problem 4.38 using Lagrange’s formulation.
Problem 8.29. Plate hinged at the rim of a rotating disk Solve problem 6.14 using Lagrange’s formulation.
Problem 8.30. Bar hinged at rim of rotating disk Solve problem 6.27 using Lagrange’s formulation.
Problem 8.31. Rolling cylinder with bar Solve problem 6.30 using Lagrange’s formulation.
Problem 8.32. Rolling cylinder with articulated bar Solve problem 6.31 using Lagrange’s formulation.
Problem 8.33. Double pendulum with elastic joint Solve problem 6.33 using Lagrange’s formulation.
Problem 8.34. Pendulum with sliding mass Solve problem 6.34 using Lagrange’s formulation.
Problem 8.35. Suspension system Solve problem 6.36 using Lagrange’s formulation.
Problem 8.36. Bar rocking on top of a curve Solve problem 6.38 using Lagrange’s formulation.
Problem 8.37. Pendulum connected to a plunging mass Solve problem 6.39 using Lagrange’s formulation.
Problem 8.38. Two-bar mechanism Solve problem 6.40 using Lagrange’s formulation.
Problem 8.39. Particle sliding in a rolling wheel Solve problem 6.43 using Lagrange’s formulation.
Problem 8.40. Particle in a slot on a rotating disk Solve problem 6.44 using Lagrange’s formulation.
Problem 8.41. Pendulum connected to horizontal piston Solve problem 6.45 using Lagrange’s formulation.
Problem 8.42. Inverted pendulum mounted on a cart Solve problem 6.46 using Lagrange’s formulation.
335
336
8 Variational and energy principles
Problem 8.43. Geneva wheel mechanism Solve problem 6.47 using Lagrange’s formulation.
Problem 8.44. Scotch yoke mechanism Solve problem 6.48 using Lagrange’s formulation.
Problem 8.45. Particle in a circular slot with guiding arm Figure 8.15 shows a particle of mass M sliding along a circular slot of radius R. The particle also slides in a rectilinear slot in an arm of mass m and length L. The arm is pivoted to the ground at point O and is restrained by a torsional spring of stiffness constant k and a dashpot of constant c. The spring is un-stretched when angle φ = 0. A viscous friction force, F f = −μw˙ is acting at the interface between the particle and the arm. (1) Derive the equation of motion of the system based on Lagrange’s formulation using angle φ as generalized coordinate. (2) Find an expression for the normal contact force that the circular slot applies on the particle. (3) Find an expression for the normal contact force that the arm applies on the particle.
i3 La ma
i2
m, L M
w O k, c
I
P
f
i1
q
h
q Q
S
Lb mb k
O d
i2
R C
Fig. 8.15. Particle in a circular slot with guiding arm.
c x(t)
M
i1
Fig. 8.16. Conſguration of the spatial mechanism.
Problem 8.46. Spatial mechanism subjected to a torque The spatial mechanism depicted in ſg. 8.16 consists of an arm of length La and mass ma attached to the ground at point S and rotating about the unit vector ¯ı1 ; the angle of rotation is denoted θ(t). A slender rigid link connects point P, at the tip of the arm, to point M that is free to slide along unit vector ¯ı1 . The link is of length Lb and mass mb and the distance from point O to point M is denoted x. A spring of stiffness constant k and a dashpot of constant c connect the slider to point O, the origin of the inertial system; the spring is un-stretched when x = 0. Torque Q(t) is applied to the arm at point S, acting about an axis parallel to axis ¯ı1 . (1) Use Lagrange’s formulation to derive the equation of motion of the system.
8.4 Analysis of the motion The many examples presented in this chapter demonstrate that the equations of motion of even the simplest mechanical systems are often highly nonlinear. Consequently, analytical solutions can rarely be developed and numerical procedures for
8.4 Analysis of the motion
337
the solution of ordinary differential equations are typically used to obtained approximate, although highly accurate solutions. It is often required to obtain the dynamic response of a system subjected to externally applied, time-dependent loads. Such information can only be obtained through analytical or numerical integration of the equations of motion. On the other hand, information about the nature of the motion is sometimes equally important. Assume that for a given set of initial conditions, a dynamical system is at an equilibrium point in the conſguration space. The following question then arises: if perturbed, does the system remain in the neighborhood of this equilibrium point for an extended period of time? Let q e be an equilibrium point of the system in the conſguration space and consider a hyper-sphere of radius ε around this point, S = (q e , ε). The question raised in the previous paragraph is now rephrased in more precise terms: if the system is at the equilibrium point and a small perturbation is applied, will the response of the system remain in S for all subsequent times? The equilibrium point is said to be stable if the response of the system, q(t), is such that q(t) ∈ S for all times t ∈ [0, ∞]. If this condition is not met, the equilibrium point is said to be unstable. In most practical cases, it is desirable for dynamical systems to be stable about their nominal operating conditions. Consequently, assessing the stability characteristics of the system about its equilibrium points is an important task and a procedure for determining the linearized stability characteristics of dynamical systems is described below. 8.4.1 General procedure for the analysis of motion Given the nonlinear equations of motion of a dynamical system, the determination of the linearized stability characteristics proceeds in the following steps. 1. Determine the equilibrium points of the system. An equilibrium point, q e , is a steady solution, q e = q e (t), of the nonlinear governing equations of the system. Consequently, equilibrium points are the solutions of the nonlinear algebraic equations obtained by imposing the vanishing of all time derivatives appearing in the governing differential equations. Because these algebraic equations are nonlinear, multiple equilibrium points could exist. 2. Linearize the governing equations of motion about the equilibrium points. The dynamic response of the system, q(t), is assumed to take the following form, (8.62)
q(t) = q e + ˆq (t),
where ˆq (t) are small perturbations in the generalized coordinates about an the equilibrium point, q e . Each term in the governing equations of motion is then expanded using Taylor series about the equilibrium point. For instance, if f (q) is a term appearing in the equations, 5 ∂f 55 f (q(t)) = f (q e + ˆq (t)) = f (qe ) + q (t) + h.o.t. ˆ (8.63) ∂q 5q e
338
8 Variational and energy principles
Because the perturbation, ˆq (t), is very small, the higher-order terms can be neglected and f (q) now becomes a linear function of the perturbation. A similar procedure is applied to all terms in the governing equations that now become linear, ordinary differential equations with constant coefſcients. 3. Study the stability of the motion. The equations governing the behavior of small perturbations about an equilibrium point are now in the form of linear, secondorder ordinary differential equations with constant coefſcients. The solution of this type of equations is of the following form [27], ˆq (t) =
2n (
Ai exp(pi t),
(8.64)
i=1
where n is the number of degrees of freedom of the system, Ai the integration constants, and pi the characteristic exponents. Both integration constants and characteristic exponents are, in general, complex numbers. It now becomes possible to asses the behavior of small perturbations about an equilibrium √ point. Each characteristic exponent is written as pi = αi + jσi , where j = −1, and consequently, if the real part of any characteristic exponent is positive, the magnitude of the small perturbation grows exponentially, eventually leaving sphere S. The conditions for stability of perturbations about an equilibrium point become (pi ) < 0, i = 1, 2, . . . , 2n.
(8.65)
This procedure is illustrated in the following examples. Example 8.17. Particle sliding along a circular ring Figure 8.17 depicts a particle of mass m sliding along a circular ring of radius under the effect of gravity. The ring rotates on two bearing about an unit vector ¯ı3 ; a torque Q(t), acting about unit vector ¯ı3 , is applied to the ring. Find the equilibrium points of the system and study the stability of the system at those equilibrium points.
g Q(t) O i2
I
f
l q
P m
i1
e1
i3 = e3 Fig. 8.17. Particle sliding along a circular ring.
8.4 Analysis of the motion
339
Derivation of the equations of motion The ſrst step of the procedure is to develop the equations of motion of the system. ˙ ı3 + θ¯ ˙e2 . The The addition theorem gives the angular velocity of rod OP as ω = φ¯ ˙ θ e¯1 + φS ˙ θ e¯2 − θS ˙ θ e¯3 ), where Sθ = velocity of the particle then becomes v P = (θC sin θ and Cθ = cos θ. The kinetic energy of the system is now K = 1/2 m2 (θ˙2 + φ˙ 2 Sθ2 ). The potential of the gravity forces is V = mg(1 − Cθ ) and the Lagrangian of the system becomes 1 2 ˙2 m (θ + φ˙ 2 Sθ2 ) − mg(1 − Cθ ). 2 Using Lagrange’s formulation, the equations of motion of the system are then obtained L=
θ¨ + (ω 2 − φ˙ 2 Cθ )Sθ = 0, ˙ 2 )· = Q, m2 (φS θ
(8.66a) (8.66b)
where ω 2 = g/. Next, it is assumed that the circular ring rotates at a constant angular velocity, φ˙ = Ω. The system now features a single degree of freedom represented by generalized coordinate θ, and the associated governing equation, eq. (8.66a), now becomes θ¨ + (ω 2 − Ω 2 Cθ )Sθ = 0, and eq. (8.66b) yields the torque required to keep the ring rotating at a constant angular velocity, Q = m2 Ω(Sθ2 )· . It will be convenient to recast the governing equation of the problem in a nondimensional form by using the non-dimensional time τ = Ωt to ſnd θ + (¯ ω 2 − Cθ )Sθ = 0,
(8.67)
where ω ¯ = ω/Ω and notation (·) indicates a derivative with respect to τ . Determination of the equilibrium points Equilibrium points, denoted θe , are steady solutions of the governing differential equation of motion and are obtained by imposing the vanishing of all time derivatives appearing in eq. (8.67), leading to (¯ ω 2 − Cθe )Sθe = 0.
(8.68)
The vanishing of the second factor yields two equilibrium points, θe = 0 or π. The other solutions, θe = nπ for n = 2, 3, . . . , ∞ are indistinguishable from the ſrst two and hence, need not be considered. The vanishing of the ſrst factor yields additional equilibrium points, θe = arccos ω ¯ 2 . Solutions ±θe are symmetric with respect to the vertical axis of the ring and are physically indistinguishable. Of course, this solution only exists when ω ¯ 2 ≤ 1. In summary, the system presents three distinct equilibrium points: θe = 0, θe = π, and θe = arccos ω ¯ 2 if ω ¯ 2 ≤ 1. At the equilibrium points, the particle is not necessarily at rest in space. For θe = arccos ω ¯ 2 , the particle moves on a circular path of radius Sθe in a plane normal to unit vector ¯ı3 , but this trajectory corresponds to a constant value of generalized coordinate θ(t) = θe .
340
8 Variational and energy principles
Linearization of the equations of motion The next step of the procedure is to linearize the equations of motion about an equilibrium point. The solution is written as ˆ ), θ(τ ) = θe + θ(τ
(8.69)
where θe is one of the equilibrium points determined in the previous section and ˆ ) a time-dependent perturbation of inſnitesimal magnitude about the equilibrium θ(τ point. The governing equation of the problem, eq. (8.67), is nonlinear due to the presence of transcendental functions, cos θ and sin θ. Introducing the assumed form of the solution, eq. (8.69), the cosine function is approximated using a Taylor series expansion, eq. (8.63), to ſnd 5 5 ˆ = cos θe + ∂ cos θ 5 θ(t) ˆ + h.o.t. ≈ cos θe − θˆ sin θe . cos θ = cos(θe + θ) ∂θ 5θe ˆ A similar treatment This expression is now a linear function of the perturbation, θ. of the sine function yields 5 ∂ sin θ 55 ˆ ˆ sin θ = sin(θe + θ) = sin θe + θ(t) + h.o.t. ≈ sin θe + θˆ cos θe . ∂θ 5θe Introducing these expansions into the nonlinear governing equation of the problem, eq. (8.67), yields θˆ +(¯ ω 2 −cos θe + θˆ sin θe )(sin θe + θˆ cos θe ) = 0. Taking into account the deſnition of the equilibrium points, eq. (8.68), and neglecting higherorder terms leads to the desired linearized equation of motion for small perturbations about an equilibrium point, θˆ + (¯ ω 2 cos θe − cos 2θe )θˆ = 0.
(8.70)
Analysis of motion Because eq. (8.70) is a linear differential equation with constant coefſcients, its solution is of the form θˆ = A exp(¯ pτ ). Introducing this solution into eq. (8.70) yields the characteristic exponent of the system as p¯2 = cos 2θe − ω ¯ 2 cos θe .
(8.71)
It now becomes possible to discuss the stability of the system about its three distinct equilibrium points. 1. Equilibrium point θe = 0. For this point, cos θe = 1, cos 2θe = 1, and the characteristic exponent becomes p¯2 = 1− ω ¯ 2 . The system is stable if 1− ω ¯ 2 < 0, i.e. when ω ¯ > 1. In this case, the non-dimensional frequency of the motion is √ σ ¯ = σ/Ω = ω ¯ 2 − 1.
8.4 Analysis of the motion
341
2. Equilibrium point, θe = π. For this point, cos θe = −1, cos 2θe = 1, and the characteristic exponent becomes p¯2 = 1 + ω ¯ 2 . Because 1 + ω ¯ 2 > 0, the system is always unstable about this equilibrium point. 3. Equilibrium point θe = arccos ω ¯ 2 . For this point, cos θe = ω ¯ 2 , cos 2θe = 2¯ ω4 − 1, and the characteristic exponent becomes p¯2 = ω ¯ 4 − 1. The system is stable if ω ¯ 4 − 1 < 0, i.e.√when ω ¯ < 1. In this case, the non-dimensional frequency of the motion is σ ¯ = 1−ω ¯ 4. In the discussion thus far, parameter ω ¯ was used as a variable. To better understand the physical behavior of the system, it is easier to take ω = g/ to be constant, and evaluate the stability of the system as the non-dimensional angular speed of the ¯ = Ω/ω = 1/¯ ring, Ω ω, increases. For equilibrium point θe = π, the system is always unstable for any value of the ring’s angular speed. 0.1 0.05 60
q
qe [deg]
80
40
0 0 3
-0.1 0.5
1
1.5
2
2.5
0
2
4
6
8
10
12
14
16
18
20
0
2
4
6
8
10
12
14
16
18
20
3 2
2.5
1.5
q
2
s/w
0
-0.05
20
1.5
1
1 0.5
0.5 0
0
0.5
1
1.5
W/w
2
2.5
3
Fig. 8.18. Top ſgure: location of the stable equilibrium point. Bottom ſgure: nondimensional frequency of the motion, σ ¯ = σ/ω. Horizontal axis is the non-dimensional ¯ = Ω/ω. angular speed of the ring, Ω
0
t
¯ = 0.5 Fig. 8.19. System response for Ω ¯ = 2 (bottom ſgure). (top ſgure) and Ω Solid line: nonlinear solution; dashed line: linearized solution. The dashed-dotted line is the nonlinear solution with damping.
¯ < 1, the only stable equilibrium point is θe = 0. When the ring rotates slowly, Ω ¯ = 1, is reached. As the angular speed of the ring increases, the critical speed, Ω Above that speed, equilibrium point θe = 0 becomes unstable, but a new equilibrium position arises, θe = arccos ω ¯ 2 , which is stable. The top portion of ſg. 8.18 shows the stable equilibrium point as function of the ring’s angular speed. ¯ = σ/ω = is σ √ At low speed, the non-dimensional frequency of the motion ¯ 2 , while above the critical speed, the frequency is σ ¯ 2 . The ¯ 2 − 1/Ω 1−Ω ¯ = Ω bottom portion of ſg. 8.18 shows this frequency as function of the ring’s angular ¯ speed. For high angular speeds of the ring, the frequency of the motion is σ ¯ ≈ Ω, i.e., the frequency of oscillation of the point mass is nearly identical to the ring’s angular speed, as indicated by the asymptote shown in the ſgure. To verify the predictions of the stability analysis presented above, the nonlinear equation of motion of the problem, eq. (8.67), was integrated numerically for two ¯ = 0.5 and Ω ¯ = 2. In both cases, the initial conditions are operating conditions, Ω
342
8 Variational and energy principles
θ(τ = 0) = 0.1 and θ (τ = 0) = 0, which put the particle in the neighborhood of equilibrium point θe = 0. ¯ = 0.5, the system is stable about equilibrium point For the low speed case, Ω θe = 0. The top portion of ſg. 8.19 shows the predicted small amplitude oscillations of the mass about the equilibrium point for both nonlinear and linearized equations of motion, eqs. (8.67) and (8.70), respectively. As expected, both equations predict nearly identical responses. Indeed, because the system is stable, the amplitude of the perturbation remains small, and the assumptions inherent to the linearization process are valid. ¯ = 2, the system is unstable about equiIn contrast, for the high speed case, Ω librium point θe = 0. The ſgure also shows the exponential growth of the response predicted by the linearized equation. As the magnitude of angle θ increases, the linearized and nonlinear solutions diverge. The linearized solution, however, correctly predicts that the particle does not remain in the neighborhood of θe = 0. If a small amount of damping is added to the system, the solution of the nonlinear equation, shown in dashed-dotted line in the bottom portion of ſg. 8.19, quickly settles to a new equilibrium point, which is correctly predicted by the linearized analysis as θe = arccos ω ¯ 2 = arccos(1/2)2 = 1.32 rad. Example 8.18. Bar pivoted to a rigid frame Figure 8.20 shows a homogeneous bar of length L and mass m connected at its mid point M to a rotating frame ABCD by means of two revolute joints at points R1 and R2 . A torsional spring of stiffness k and un-stretched rotation angle β0 is present in one of the revolute joints; torque Q(t), acting about unit vector ¯ı3 , is applied to the frame. Basis A = (¯ a1 , a ¯2 , a ¯3 ) is attached to the rotating frame. A planar rotation about unit vector ¯ı3 of magnitude α bring the inertial basis I to A. A planar rotation about axis a ¯1 of magnitude β brings basis A to B = (¯b1 , ¯b2 , ¯b3 ), which is attached to the bar. The conſguration of the system will be represented by two generalized coordinates, α and β. Find the equilibrium points of the system and study the stability characteristics of the system at those equilibrium points. Derivation of the equations of motion The ſrst step of the procedure is to develop the equations of motion of the system. The addition theorem gives the angular velocity of the bar as ω = β˙ ¯b1 + αS ˙ β ¯b2 + αC ˙ β ¯b3 , where Sβ = sin β and Cβ = cos β, with similar notational conventions for the trigonometric functions of angle α. The mass moment of inertia tensor of the bar is I ∗ = m2 diag(1, 0, 1)/12, see ſg. 6.42. The kinetic energy of the system is now K = 1/2 m2 (β˙ 2 + α˙ 2 Cβ2 )/12. The potential of the torsional spring is V = 1/2 k(β − β0 )2 and the Lagrangian of the system becomes L=
1 m2 ˙ 2 1 (β + α˙ 2 Cβ2 ) − k(β − β0 )2 . 2 12 2
Using Lagrange’s formulation, the equations of motion of the system are then obtained
8.4 Analysis of the motion
D
a2 O
i1
Q a2 b
A m
1 0.8 0.6
a1 = b 1
0.4
b2
R2
C
M k
f1(b) and f2(b)
a1 a
i2
343
0.2 0
b0
-0.2 -0.4 -0.6
w = 0.2
-0.8
R1
-1
T B
i3 = a3
0
Fig. 8.20. Homogeneous bar connected to a rotating frame.
w = 0.8 90
w = 0.6 180
w = 0.4 270
w = 0.3 360
b
450
540
720
Fig. 8.21. Graphical solution of the transcendental equation for determining the equilibrium points. Solid line: function f1 (β); dashed line: straight line f2 (β) for β0 = π/4 and ω ¯ = 0.2, 0.3, 0.4, 0.6, and 0.8.
β¨ + α˙ 2 Sβ Cβ + ω 2 (β − β0 ) = 0, m
630
2
(αC ˙ β2 )· /12
= Q,
(8.72a) (8.72b)
where ω 2 = 12k/(m2 ). Next, it is assumed that the rigid frame rotates at a constant angular velocity, α˙ = Ω. The system now features single degree of freedom represented by generalized coordinate, β, and the associated governing equation, eq. (8.72a), now becomes β¨ + Ω 2 Sβ Cβ + ω 2 (β − β0 ) = 0, and eq. (8.72b) yields the torque required to keep the frame rotating at a constant angular velocity, Q = m2 Ω(Cβ2 )· /12. It will be convenient to recast the governing equation of the problem in a nondimensional form by using the non-dimensional time τ = Ωt to ſnd β + Sβ Cβ + ω ¯ 2 (β − β0 ) = 0,
(8.73)
where ω ¯ = ω/Ω and notation (·) indicates a derivative with respect to τ . Determination of the equilibrium points Equilibrium points, denoted βe , are steady solutions of the governing differential equation of motion and are obtained by imposing the vanishing of all time derivatives appearing in eq. (8.73), leading to sin 2βe + 2¯ ω 2 (βe − β0 ) = 0.
(8.74)
This transcendental equations could have a single solution, but for speciſc values of the two parameters, ω ¯ and β0 , it could feature an inſnite number of solutions. A graphical solution of eq. (8.74) can be obtained by deſning two functions, a trigonometric function, f1 (βe ) = sin 2βe , and a straight line with a negative slope,
344
8 Variational and energy principles
f2 (βe ) = −2¯ ω 2 (βe − β0 ). The solutions of eq. (8.74) are at the intersections of these two curves. Figure 8.21 illustrates the graphical solution process for the following values of the parameters, β0 = π/4 and ω ¯ = 0.2, 0.3, 0.4, 0.6, and 0.8. Function f1 (βe ) is independent of the parameters, while the straight line, f2 (βe ), pivots about point β = β0 for different values of ω ¯ . A single solution, βe ≈ 0.32 rad, is found for ω ¯ = 0.8. In contrast, for ω ¯ = 0.2, nine distinct solutions exist. In a dimensional form, eq. (8.74) can be recast as 1/2 (m2 /12)Ω 2 sin 2βe + k(βe − β0 ) = 0. The ſrst term represents the moment of the inertial forces applied to the bar, and the second term is the restoring force due to the elastic spring. Hence, the equation for the equilibrium point is a static moment equilibrium equation. As the bar rotates multiple turns around its axis, the restoring moment in the spring increases and can be equilibrated by inertial forces for different magnitudes of the frame’s angular speed. Linearization of the equations of motion The linearization procedure described in section 8.4.1 yields the following linearized equations of motion for small perturbations about an equilibrium point, βˆ + (¯ ω 2 + cos θe )βˆ = 0.
(8.75)
Analysis of motion The solution of eq. (8.75) is of the form βˆ = A exp(¯ pτ ), and the characteristic exponent of the system becomes p¯2 = −(¯ ω 2 + cos 2βe ).
(8.76)
The stability condition becomes ω ¯ 2 + cos 2βe > 0, where βe = βe (¯ ω , β0 ) because the equilibrium point is a solution of eq. (8.74). As a ſrst example, consider the following parameter values, β0 = π/4 and ω ¯= 0.8. Figure 8.21 shows that a single solution is possible and a numerical solution of eq. (8.74) yields βe = 0.319. This equilibrium point is stable and the frequency of the motion is σ ¯ = σ/Ω = 1.2. Next, consider the case where β0 = π/4 and ω ¯ = 0.2. Figure 8.21 shows that nine solutions are possible and a numerical solution of eq. (8.74) yield βe = 0.030, 1.60, 3.05, 4.88, 6.07, 8.17, 9.06, 11.51, and 12.01 rad. These solutions are alternatively stable and unstable. The frequencies of the stable motion are σ ¯ = 1.02, 1.01, 0.973, 0.888, and 0.693, for equilibrium points βe = 0.030, 3.05, 6.07, 9.06, and 12.01 rad, respectively. For βe = 1.60, 4.88, 8.17, and 11.51 rad, the motion is unstable. It is left to the reader to verify that other equilibrium points exist for negative values of angle β.
8.4 Analysis of the motion
345
8.4.2 Problems Problem 8.47. Rotor blade with ƀap and lag motions Figure 8.22 depicts a very simpliſed model of a helicopter blade of length L and mass ¯3 . At point O, basis m rotating at a constant angular velocity Ω about unit vector ¯ı3 = a ¯2 , a ¯3 ) is attached to the hub represented by a massless bar of length e. Angular A = (¯ a1 , a motion out of the plane of rotation, i.e., a planar rotation about axis a ¯2 of magnitude φ, called the ƀapping angle, is shown on the left part of the ſgure; this motion is resisted by a torsional spring of stiffness kφ . Angular motion in the plane of rotation, i.e., a planar rotation about axis a ¯3 of magnitude θ, called the lead-lag angle, is shown on the right part of the ſgure; this motion is resisted by a torsional spring of stiffness kθ . When both in- and out-of-plane motions are considered simultaneously, the conſguration of the blade can be described by three successive planar rotations: ſrst, a rotation of magnitude Ωt about axis ¯ı3 that brings basis I e1 , e¯2 , e¯3 ), to A, next, a rotation of magnitude θ about axis a ¯3 that brings basis A to E = (¯ and ſnally, a rotation of magnitude φ about axis e¯2 that brings basis E to B = (¯b1 , ¯b2 , ¯b3 ), a blade attached basis. Use Lagrange’s formulation to derive the equations of motion of the system in the following cases. (1) At ſrst, assume that the sole ƀapping motion is allowed; derive the equation of motion. (2) Linearize the equations of motion. Find the natural frequency. (3) Next, assume that the sole lead-lag motion is allowed; derive the equation of motion. (4) Linearize the equation of motion. Find the natural frequency. (5) Finally, assume that both ƀap and lead-lag motions are allowed; derive the equations of motion. (6) Linearize the equations of motions. Find the natural frequencies.
i3 = a3
a2
W
L m
O e
f
kf
L
W a1
O e
m kq
q a1
Fig. 8.22. Out-of-plane and in-plane motions of a uniform rotor blade.
Problem 8.48. Bar on two guides A homogeneous bar of length L and mass m slides on two guides at its end points, as shown in ſg. 8.3. At the left end, the bar is connected to a spring of stiffness constant k that is unstretched when the bar is horizontal. At the right end, the bar is connected to a point mass M . Gravity acts along axis ¯ı2 . (1) Use Lagrange’s formulation to derive the equation of motion of the system. Use a single generalized coordinate, θ. (2) Find the equilibrium conſguration of the system. (3) Find the natural frequency of the system. (4) Is the system stable at the equilibrium point?
Problem 8.49. Spinning disk
The circular disk of mass m1 and radius R spins at a constant angular velocity Ω¯b1 about arm OD, as depicted in ſg. 8.23. This arm is of length L2 , mass m2 , and is connected to the ground at point O by means of two hinges. The orientation of this arm is determined by two planar rotations: ſrst, a rotation of magnitude ψ about axis ¯ı3 that brings inertial basis a1 , a ¯2 , a ¯3 ), and second, a rotation of magnitude θ about axis ¯b2 that I = (¯ı1 , ¯ı2 , ¯ı3 ) to A = (¯
346
8 Variational and energy principles
brings basis A to B = (¯b1 , ¯b2 , ¯b3 ). In the analysis, it will be more convenient to use the angle θ = π/2 − θ . The conſguration of the system is represented by the angles ψ and θ, shown in ſg. 8.23. (1) Use Lagrange’s formulation to derive the equations of motion of the system. (2) Linearize these equations of motion. (3) Show that the linearized equations are uncoupled and discuss the nature of the motion.
a3
y
i3 = a3
q
a2 = b2
g
b2 b3
k
O m2 i1
g
d B
A
O
a2
i2
y b1
b(t)
L m
h
a1
q
L2
i3
a1
Q
m1 R
i2
D W
i1
Fig. 8.23. Circular disk spinning at a constant angular velocity Ω.
O a(t)
b1
a1
Fig. 8.24. Bar of length L connected at point B to a torsional spring of stiffness constant k.
Problem 8.50. Spherical elastic pendulum O i2
q g
r i3
i1
f
m
Fig. 8.25. Elastic spherical pendulum.
Figure 8.25 depicts an elastic spherical pendulum that consists of a bob of mass m connected to an inertial point by means of a spring of stiffness constant k and un-stretched length 0 . The conſguration of the system will be represented by the spherical coordinates r, φ, and θ. (1) Use Lagrange’s approach to derive the equations of motion of the system. (2) Linearize the equations of motion. (3) Study the stability of the equilibrium points.
Problem 8.51. Spinning arm A shaft of height h is ſxed at point O and free to rotate about axis ¯ı3 , as shown in ſg. 8.24. An arm of length d, rigidly attached to the shaft at point A, rotates in the horizontal plane. A homogeneous bar of length L and mass m is connected to the arm at point B with a torsional A spring of stiffness a1 , a ¯2 , a ¯3 )] is attached to the arm and constant k. Frame F = [A, A = (¯ B ¯ ¯ ¯ frame F = B, B = (b1 , b2 , b3 ) is attached to the bar. A planar rotation of magnitude α about axis ¯ı3 brings basis I to A. A planar rotation of magnitude β about axis a ¯2 brings basis A to B; the torsional spring is un-stretched when β = β0 . A torque Q is applied to the shaft at point O. (1) Use Lagrange’s formulation to derive the equations of motion of the system. Use two generalized coordinates, α and β. (2) If the ſrst planar rotation is constrained such that α˙ = Ω, i.e., the shaft is rotating at a constant angular velocity Ω, ſnd the applied torque Q. (3) In this latter case, ſnd the equilibrium conſguration of the system. (4) Find the natural
8.4 Analysis of the motion
347
frequency of the system for small amplitude oscillations about the equilibrium conſguration. (5) If β0 = 0, is the system stable at the equilibrium point? (6) If β0 = π/2, is the system stable at the equilibrium point?
Problem 8.52. Inverted pendulum mounted on a track Figure 6.37 depicts an inverted homogeneous pendulum of mass m and length . The pendulum is mounted on a cart of mass M free to translate along a horizontal track. A torsional spring of stiffness constant k restrains the pendulum at its attachment point. The spring is un-stretched when angle θ = θ0 . (1) Derive the two equations of motion of the system. (2) Linearize the equations of motions about an equilibrium point. (3) Let g¯ = 0.1. Find the equilibrium point(s) and study the characteristics of the motion about that point. (4) Let g¯ = 20. Find the equilibrium point(s) and study the characteristics of the motion about that point. Use the following data: μ = M/m = 1.5 and g¯ = g/(ω 2 ). Use non-dimensional time τ = ωt, where ω 2 = k/(m2 ) and (·) denotes a derivative with respect to τ .
Part IV
Constrained dynamical systems
9 Constrained systems: preliminaries
In the previous chapter, variational and energy principles were derived for dynamical system. Various formulations were addressed including d’Alembert’s principle, Hamilton’s principle and Lagrange’s formulation. In all cases, developments were limited to unconstrained dynamical systems. This means that the number of generalized coordinates used to represent the conſguration of the system was equal the number of degrees of freedom of the system. The idea of using more generalized coordinates than is strictly necessary to represent the conſguration of a mechanical system is not appealing at ſrst, because it seems to increase needlessly formulation complexity. It turns out, however, that increasing the number of generalized coordinates often simpliſes the derivation of the governing equations of mechanical systems. This chapter starts with an introductory problem dealing with a simple, single degree of freedom two-bar mechanism. In example 9.1, the single equation of motion of this problem is derived from Newton’s formulation. An alternative formulation using a large number of generalized coordinates is then presented in a cursory manner in example 9.2. The advantages and drawbacks of the two approaches are contrasted. The remainder of the chapter presents basic concepts associated with constrained dynamical systems. Section 9.1 introduces Lagrange’s multiplier method, which is a theoretical underpinning for the analysis of constrained systems. Holonomic and nonholonomic constraints are presented and contrasted in section 9.2, which also introduces the constraint matrix. The chapter concludes with the generalization of the principle of virtual work for constrained static problems. Arbitrary and kinematically admissible virtual displacements are contrasted. Finally, the combined use of the principle of virtual work and Lagrange’s multipliers is investigated. Example 9.1. Two-bar mechanism, Newtonian formulation The two bar mechanism shown in ſg. 9.1 comprises bar OB of length L1 and mass m1 , and bar BAT of length L2 and mass m2 . Bar BAT passes through a slider located at ſxed point A but free to swivel about that point. A spring of stiffness constant k connects the tip of the bar at point T to the slider at point A and is of vanishing un-
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_9 © Springer Science+Business Media B.V. 2011
352
9 Constrained systems: preliminaries
stretched length. A viscous friction force, Fv = −cw, ˙ acts at the interface between the bar and the slider. Using Newton’s formulation, derive the equation of motion of this single degree of freedom system represented by the single generalized coordinate θ1 . B
HB VB
HO i2
a1
m1, L1
e1
B
VO HB
m2, L2
B q1
e2
m1g
O
w
g
q2
O d
A
w VB
S m2g
i1
Fv
k T
a2
A Fs T
Fig. 9.1. Conſguration of the two-bar mechanism.
The law of cosines applied to triangle OAB yields w2 = L21 + d2 − 2dL1 C1 , where C1 = cos θ1 . This leads to w ¯ 1, = 1 + d¯2 − 2dC (9.1) w ¯= L1 where d¯ = d/L1 . The law of sines applied to the same triangle gives L1 S1 = wS2 , where S1 = sin θ1 , and similar notations are used for the trigonometric functions of angle θ2 . Projections of segments OB and BAT along the horizontal yield L1 C1 − wC2 = d. In non-dimensional form, S2 =
S1 , w¯
C2 =
C1 − d¯ . w ¯
(9.2)
Let unit vectors e¯1 and a ¯1 be aligned with bars OA and BAT, respectively, as shown in ſg. 9.1. It then follows that L1 e¯1 − w¯ a1 = d¯ı1 , and a time derivative of this expression yields L1 θ˙1 e¯2 − w¯ ˙ a1 − wθ˙2 a ¯2 = 0. Projecting this equation along unit vectors a ¯2 and a ¯1 then leads to w ¯ θ˙2 = θ˙1 C21 ,
w ¯˙ = θ˙1 S21 ,
(9.3)
where C21 = cos(θ2 − θ1 ) and S21 = sin(θ2 − θ1 ). The ſrst equation expresses the angular velocity of bar BAT in terms of that of bar OA, and the second the velocity of bar BAT with respect to the slider. Finally, the angular acceleration of bar BAT is found as C21 S21 C21 θ¨2 = θ¨1 + θ˙12 1−2 . (9.4) w ¯ w ¯ w ¯
9 Constrained systems: preliminaries
353
Figure 9.1 also shows a free body diagram of bar OB. Euler’s laws applied to this planar rigid body problem yield m1 L 1 (−θ˙12 e¯1 + θ¨1 e¯2 ), 2 m1 L 1 m1 L21 ¨ gC1 = θ1 , L1 C1 VB − L1 S1 HB − 2 3
(HO + HB )¯ı1 + (VO + VB )¯ı2 − m1 g¯ı2 =
(9.5a) (9.5b)
where the second equation corresponds to the pivot equation, eq. (6.38), written about point O. The components of the reaction force at point O are denoted HO and VO , in the horizontal and vertical directions, respectively. Similarly, the components of the internal force acting at point B are denoted HB and VB . Figure 9.1 shows a free body diagram of bar BAT and Euler’s laws yield −HB¯ı1 − VB¯ı2 − m2 g¯ı2 + (Fv + Fs )¯ a1 + S¯ a2 L2 = m2 L1 (−θ˙12 e¯1 + θ¨1 e¯2 ) + (−θ˙12 e¯1 + θ¨1 e¯2 ) , 2 m2 L 2 L2 m2 L22 ¨ gC2 − wS = −m2 a1 L1 (−θ˙12 e¯1 + θ¨1 e¯2 ) + θ2 . 2 2 3 3
(9.6a) (9.6b)
In this case, the second equation is written with respect to material point B of the bar, see eq. (6.39). The reaction force acting in the direction normal to the sliding direction at point A is denoted S. The elastic spring applies force Fs = k(L2 − w) to the tip of the bar and the viscous friction force acts at point A. At this point, the six equations of dynamics have been written for this two-body planar problem. It involves six unknowns, ſve components of internal force, HO , VO , HB , VB , and S, and a single generalized coordinate, θ1 . The additional kinematic variables appearing in these equations, θ2 , w, and their time derivatives, should be expressed in terms of the generalized coordinate, θ1 , and its time derivatives, using the kinematic equations (9.1) to (9.4). To obtain the single equation of motion of this problem, the ſve internal force components must be eliminated from the six equations of dynamics through careful algebra. The ſrst step of the process is to use eq. (9.6b) to express the normal contact force at the slider wS =
m2 L 2 m2 L 2 m2 L22 ¨ L1 (θ˙12 S21 + θ¨1 C21 ) + gC2 − θ2 . 2 2 3
(9.7)
Next, eq. (9.6a) is projected along unit vectors ¯ı1 and ¯ı2 to ſnd the reaction force components, HB and VB , respectively. The following linear combination of these two components is then evaluated L2 ˙ 2 L1 (C1 VB − S1 HB ) =m2 L1 −L1 θ¨1 − (θ S21 − θ¨2 C21 ) + SC21 L1 2 2 (9.8) + (Fv + Fs )L1 S21 − m2 L1 gC1 .
354
9 Constrained systems: preliminaries
The ſnal step of the procedure is to introduce eq. (9.7) into eq. (9.8) to eliminate the internal force, S. The resulting expression is introduced into eq. (9.5b) to yield the desired equation of motion 2 2 m1 L21 C21 m2 L22 C21 + m2 L21 + θ¨1 − m L L 2 1 2 3 3 w ¯2 w ¯ C21 C21 m2 L 1 L 2 C21 ˙2 (9.9) m2 L22 C21 S21 2 (1 − 2 )− S21 (2 − 3 ) θ1 + 3 w ¯ w ¯ 2 w ¯ w ¯ C21 m1 m2 L 2 2 ˙ + m2 )L1 gC1 − gC2 − k(L2 − w)S21 + cL21 S21 +( θ1 = 0. 2 2 w ¯ In this equation, the following quantities must be expressed in terms of the generalized coordinate, θ1 , w ¯=
¯ 1 , C2 = 1 + d¯2 − 2dC
¯ 1 ¯ 1 C1 − d¯ 1 − dC dS , C21 = , S21 = . w ¯ w ¯ w ¯
This example illustrates one of the main problems associated with the derivation of the equations of motion of mechanical systems. While the system depicted in ſg. 9.1 is a rather simple mechanical system that features a single degree of freedom, the procedure to derive the single equation of motion of the system is a complex analytical task. The use of Lagrange’s formulation will streamline the process by eliminating the reaction and internal forces from the onset of the formulation. It is left to the reader to verify that the same equation of motion will be obtained, as expected. From a mathematical viewpoint, the equation of motion is a second-order, ordinary differential equation in time. It is, however, a highly nonlinear differential equation, which cannot be solved in closed form. Fortunately, numerical procedures for the solution of this class of equations are widely available and hence, approximate solutions of eq. (9.9) are easily obtained. Although numerical procedures ease the solution of the differential equation, its derivation remains an arduous, error-prone task. Symbolic manipulation software can be used to ease this task, but the equation of motion remains complex. Example 9.2. Two-bar mechanism, alternative formulation The two bar mechanism shown in ſg. 9.2 comprises bar OB, of length L1 and mass m1 , and bar BAT, of length L2 and mass m2 . Bar BAT passes through a slider located at ſxed point A but free to swivel about that point. A spring of stiffness constant k connects the tip of the bar at point T to the slider at point A and is of vanishing unstretched length. A viscous friction force, Fv = −cw, ˙ acts at the interface between the bar and the slider. Derive the equation of motion of this single degree of freedom system using a highly redundant set of seven generalized coordinates: the components of the position vector of the center of mass of bar OB along axes ¯ı1 and ¯ı2 , denoted x1 and y1 , respectively, and its orientation, denoted θ1 , the components of the position vector of the center of mass of bar BAT along axes ¯ı1 and ¯ı2 , denoted x2 and y2 , respectively, and its orientation, denoted θ2 , and ſnally, the relative distance between points B and A, denoted w. The array of generalized coordinates is
9 Constrained systems: preliminaries
355
q T = x1 , y1 , θ1 , x2 , y2 , θ2 , w . Figure 9.2 shows a free body diagram of bar OB. Euler’s laws applied to this planar rigid body problem yield (HO + HB )¯ı1 + (VO + VB )¯ı2 − m1 g¯ı2 = m1 (¨ x1¯ı1 + y¨1¯ı2 ), L1 L1 C1 (VB − VO ) − S1 (HB − HO ) = 2 2
m1 L21 12
(9.10a)
θ¨1 .
(9.10b)
where the second equation is written with respect to the center of mass of bar 1. The reaction force at point O, denoted RO , has components in the horizontal and vertical directions denoted HO and VO , respectively. Similarly, the reaction force at point B, denoted RB , has components components denoted HB and VB . VB
RB
B
HB
x1, y1 HO i2 m1, L1
a1
e1 B
RO
q1 m 1g O VO
HB
m2, L2
w
g
B q2 V A VB
O d
A a2
HA
x2, y2 A
i1
e2
RA
m2g
k
Fv
T
Fs T
Fig. 9.2. Conſguration of the two-bar mechanism.
Figure 9.2 also shows a free body diagram of bar BAT and Euler’s laws yield (HA − HB )¯ı1 + (VA − VB )¯ı2 − m2 g¯ı2 + (Fv + Fs )¯ a1 = m2 (¨ x2¯ı1 + y¨2¯ı2 ), (9.11a) L2 L2 L2 m2 L22 ¨ − C2 VB + S2 HB + (w − )(S2 HA − C2 VA ) = θ2 . 2 2 2 12
(9.11b)
Here again, the second equation is written with respect to the center of mass of bar 2. The reaction force acting at point A, denoted RA , has components components denoted HA and VA . The elastic spring applies a force Fs = k(L2 − w) to the tip of the bar and the viscous friction force acts at point A. Because independent generalized coordinates have been used for bars OB (x1 , y1 , and θ1 ) and BAT (x2 , y2 , and θ2 ), the two bars are free to move independently. In particular, point O will not remain a ſxed inertial point, the two bars will not remain connected at point B, nor will point A remain a ſxed inertial point. All these conditions must be expressed as constraints on the generalized coordinates.
356
9 Constrained systems: preliminaries
Consider point O at the tip of bar OB. Its horizontal and vertical position components are x1 − L1 C1 /2 and y1 − L1 S1 /2, respectively. Because point O must remain a ſxed inertial point, the following two constraints must link the three generalized coordinates determining the conſguration of bar OB, x1 − L1 C1 /2 CO = = 0. (9.12) y1 − L1 S1 /2 Similarly, the horizontal and vertical position components of point B are x1 + L1 C1 /2 and y1 + L1 S1 /2, respectively, when computed based on the generalized coordinates deſning the conſguration of bar OB. The same horizontal and vertical position components of point B, evaluated based on the generalized coordinates deſning the conſguration of bar BAT, are x2 + L2 C2 /2 and y2 + L2 S2 /2, respectively. If the two bars are to remain connected at point B at all times, the following constraints must link the generalized coordinates of bars OB and BAT, x1 + L1 C1 /2 − x2 − L2 C2 /2 CB = = 0. (9.13) y1 + L1 S1 /2 − y2 − L2 S2 /2 Finally, imposing the condition that point A must remain a ſxed inertial point leads to the following constraints, x2 + (L2 /2 − w)C2 − d CA = = 0. (9.14) y2 + (L2 /2 − w)S2 The formulation of the problem involves the seven generalized coordinates stored in array q, and the six components of internal and reaction forces stored in arrays RO , RB , and RA , for a total of thirteen unknowns. Euler’s laws applied to the two bars provide a total of six equations, eqs. (9.10) and (9.11). Six constraint equations must also be satisſed, eqs. (9.12), (9.13), and (9.14). Clearly, one additional equation is required to solve the problem. This equation is the static equilibrium equation at the slider: the reaction force at the slider should be normal to bar BAT, leading to CS = C2 HA + S2 VA = 0.
(9.15)
Note that the friction force at the slider is taken into account by vector F v , independently of the reaction force, RA . At this point, the formulation of the problem is complete. It involves thirteen equations, six equations of dynamics, eqs. (9.10) and (9.11), one slider static equilibrium equation, eq. (9.15), and six constraint equations, eqs. (9.12) to (9.14), for a total of thirteen unknowns, seven generalized coordinates and six components of internal force. It is possible to eliminate the six components of internal force and six of the seven generalized coordinates to obtain a single equation of motion written in terms of a single generalized coordinate, say θ1 . As illustrated in example 9.1, this process is lengthy and error-prone. This elimination process will be even more arduous in the presence case, because a highly redundant set of generalized coordinates was selected from the onset of the formulation.
9 Constrained systems: preliminaries
357
In this example, an alternative approach is followed. Instead of eliminating the redundant generalized coordinates and internal forces, all these unknowns are kept in the formulation. First, it is interesting to take a time derivative of the constraints; for instance, the ſrst constraint expressed by eq. (9.12) is x1 − L1 C1 /2 = 0 and its time derivative is x˙ 1 + L1 S1 θ˙1 /2 = 0. Combining this expression with a similar treatment of the second constraint and recasting the results in a matrix form yields 1 0 L1 S1 /2 0 0 0 0 q˙ = B O q˙ = 0, C˙ O = (9.16) 0 1 −L1 C1 /2 0 0 0 0 where q˙ is the array of generalized velocities. Matrix B O (q) is called the constraint matrix. Proceeding in the same manner with the constraints deſned by eqs. (9.13) and (9.14), the following results are obtained 1 0 −L1 S1 /2 −1 0 L2 S2 /2 0 C˙ B = q˙ = B B q˙ = 0, (9.17) 0 1 L1 C1 /2 0 −1 −L2 C2 /2 0 ˙C = 0 0 0 1 0 −(L2 /2 − w)S2 −C2 q˙ = B q˙ = 0, (9.18) A A 0 0 0 0 1 (L2 /2 − w)C2 −S2 where B B and B A are the constraint matrices associated with constraints C B = 0 and C A = 0, respectively. It now becomes possible to write the dynamical equations of the problem in a compact manner as M ¨q = B TO RO + B TB RB + B TA RA + F a ,
(9.19)
where M = diag(m1 , m1 , m1 L21 /12, m2, m2 , m2 L22 /12, 0) is the mass matrix of the system. It is left to the reader to verify that the ſrst three governing equations of system (9.19) are the dynamical equations for bar OB, eqs. (9.10). The contributions of the reaction forces acting on bar OB are written in terms of the constraint matrices. The next three equations of system (9.19) are the dynamical equations for bar BAT, eqs. (9.11). The last equation of system (9.19) is the equilibrium equation for the slider, eq. (9.15). The last term on the right-hand side of system (9.19), F Ta = 0, −m1 g, 0, (Fv + Fs )C2 , −m2 g + (Fv + Fs )S2 , 0, 0 , is the array of externally applied forces, where Fv = −cw˙ and Fs = k(L2 − w). Combining system (9.19) with the six constraint equations, eqs. (9.12) to (9.14), yields a total of thirteen equations for the thirteen unknowns of the problem. It is important to compare the characteristics of the present formulation to those of that presented in example 9.1. Instead of the single equation of motion in a single unknown obtained earlier, eq. (9.9), the present formulation leads to thirteen equations in thirteen unknowns. It must be noted, however, that the thirteen equations of the present formulation are far easier to derive and far less complex than the single equation of motion, eq. (9.9). The present formulation requires writing the constraints equations, eqs. (9.12) to (9.14), and their time derivatives, eqs. (9.16) to (9.18). These kinematic tasks are
358
9 Constrained systems: preliminaries
far simpler than those associated with Newton’s formulation, see eqs. (9.1) to (9.4), which requires the evaluation of the linear and angular accelerations of all bodies in terms of the generalized velocities and accelerations. Newton’s formulation leads to ordinary differential equations in time. Despite their complexity and high level of nonlinearity, numerical solution procedures for this type of equations are well developed and robust. In contrast, the present formulation leads to differential-algebraic equations; because this type of equation is less common than its ordinary differential counterpart, solution techniques for differential-algebraic equations are not as well developed or robust. This does not imply that the solution of differential-algebraic equations is computationally less efſcient; indeed, although the number of equations and unknowns is typically higher, the simplicity and sparsity of the equations enables efſcient solution procedures.
9.1 Lagrange’s multiplier method A fundamental tool used for the analysis of constrained dynamical systems is Lagrange’s multiplier technique; a formal description of this method is presented here. Consider the problem of determining a stationary point of a function of several variables, F = F (u1 , u2 , . . . , un ), as was discussed in section 7.1.1. In this case, however, the variables are not independent, rather, they are subjected to a constraint of the form C(u1 , u2 , . . . , un ) = 0. (9.20) Conceptually, this constraint could be used to express one variable, say un , in term of the others. Next, un would be eliminated from F to obtain a function of n−1 independent variables F = F (u1 , u2 , . . . , un−1 ), a problem identical to that treated in section 7.1.1. In many practical situations, it might be cumbersome, undesirable, or even impossible, to completely eliminate one variable of the problem. This elimination process can be avoided altogether by using an alternative approach. At a stationary point, the variation of function F vanishes δF =
∂F ∂F ∂F δu1 + δu2 + . . . + δun = 0. ∂u1 ∂u2 ∂un
(9.21)
This statement, however, does not imply ∂F/∂ui = 0 for i = 1, 2, . . . , n, because variations δui cannot be chosen arbitrarily. Indeed, they must satisfy the constraint expressed by eq. (9.20). The relationship among the variations δui can be explicitly written by taking a variation of the constraint to ſnd δC =
∂C ∂C ∂C δu1 + δu2 + . . . + δun = 0. ∂u1 ∂u2 ∂un
(9.22)
This expression shows in an explicit manner that variations δu1 , δu2 , . . . , δun , are not independent because a linear combination of these quantities must vanish. A
9.1 Lagrange’s multiplier method
359
linear combination of eqs. (9.21) and (9.22) is formed by multiplying eq. (9.22) by λ and summing the results with eq. (9.21) to ſnd ∂F ∂F ∂C ∂C δu1 + . . . + δun + λ δu1 + . . . + δun = 0. ∂u1 ∂un ∂u1 ∂un Coefſcient λ is an arbitrary function of the variables u1 , u2 , . . . , un , called Lagrange multiplier. Regrouping the various terms then leads to n ( ∂F ∂C δui = 0. +λ (9.23) ∂u ∂u i i i=1 Conceptually, variation δun could be expressed in terms of the other variations, δui , i = 1, 2, . . . , n − 1, using eq. (9.22), leaving n − 1 independent, arbitrary variations. To avoid this cumbersome step, the arbitrary Lagrange multiplier is chosen such that ∂F ∂C +λ = 0. ∂un ∂un With this choice, the last term of the sum in eq. (9.23) vanishes for all δun . Hence, there is no need to express this variation in terms of the n − 1 others, which can now be treated as independent, arbitrary quantities, implying ∂F ∂C +λ = 0, ∂ui ∂ui
i = 1, 2, . . . , n − 1.
Combining the last two equations then leads to the condition that δF + λδC = 0, where all variations, δui , i = 1, 2, . . . , n, are considered to be independent. Because the constraint expressed by eq. (9.20) must be satisſed, Cδλ = 0 for any arbitrary δλ, and the stationarity condition becomes δF + λδC = δF + λδC + Cδλ = δ(F + λC) = 0.
(9.24)
An augmented function, F + = F + λC, is now introduced; the above statement implies the vanishing of the variation in F + for all arbitrary variations δui , i = 1, 2, . . . , n, and δλ. In summary, the initial, constrained problem can be replaced by an unconstrained problem δF + = 0, F + = F + λC. (9.25) The augmented function, F + , involves n + 1 variables, ui , i = 1, 2, . . . , n and λ. The vanishing of variations of the augmented function implies n ( ∂F ∂C δui + C δλ = 0. +λ (9.26) ∂u ∂u i i i=1
360
9 Constrained systems: preliminaries
Because δui , i = 1, 2, . . . , n, and δλ are all independent, arbitrary variations, it follows that ∂F ∂C +λ = 0, ∂ui ∂ui
i = 1, 2, . . . , n,
and C = 0.
(9.27)
These form n + 1 equations to be solved for the n + 1 unknowns. Lagrange’s multiplier method results in an unconstrained problem, but increases the number of unknowns from n to n + 1; the additional unknown is Lagrange’s multiplier. If the constraint is used to eliminate one of the unknowns, the resulting problem will feature n − 1 unconstrained unknowns. Lagrange’s multiplier method can be readily generalized to problems involving multiple constraints, Ci = 0, i = 1, 2, . . . , m. In the presence of m constraints, m Lagrange multipliers, λi , i)= 1, 2, . . . , m, are introduced. The augmented function m then becomes F + = F + i=1 λi Ci . Example 9.3. Minimum distance between a circle and a plane ¯ ρ), both shown in ſg. 9.3. Find Consider plane P = (xP , n ¯ ) and circle C = (xC , k, the shortest algebraic distance, d, between the disk and the plane.
n
k
aC r
C
R
P
d Q
P
Fig. 9.3. The distance between a disk and a plane.
Let xQ and xR be the position vectors of points Q and R, respectively; Q is a point on the plane and R a point on the circle. Finding shortest distance can be cast as a minimization problem d = min (xR − xQ ), xR ,xQ
(9.28)
for all point R and Q. This minimization problem, however, is subjected to several constraints: point Q must be on plane P; in view of eq. (1.40), this implies n ¯ T (xQ − xP ) = 0,
(9.29)
and point R must be on circle C, k¯T (xR − xC ) = 0, as required by eq. (1.42).
xR − xC = ρ,
(9.30)
9.1 Lagrange’s multiplier method
361
The constrained minimization problem deſned by eq. (9.28) is now transformed into a an unconstrained minimization with the help of Lagrange’s multiplier technique d= min xR − xQ + λ¯ nT (xQ − xP ) + μk¯ T (xR − xC ) xR ,xQ ,λ,μ,ν
+ ν(xR − xC − ρ)] , where λ, μ, and ν are three Lagrange multipliers used to enforce the three constraints deſned by eqs. (9.29) to (9.30). Minimization of the augmented function with respect to xR yields xR − xQ x − xC + μk¯ + ν R = 0, (9.31) xR − xQ xR − xC and minimization with respect to xQ leads to −
xR − xQ + λ¯ n = 0. xR − xQ
(9.32)
Of course, minimization with respect to λ, μ, and ν will yield the constraint equations, eqs. (9.29) to (9.30). Equations (9.29), (9.30), (9.31) and (9.32) form a set of nine equations for the nine unknowns of the problem, xR , xQ , λ, μ, and ν. The solution of this set of nonlinear algebraic equations will yield the desired minimum distance. The scalar product of eq. (9.32) by n ¯ T yields the ſrst Lagrange multiplier as T λ=n ¯ (xR − xQ )/xR − xQ and introducing this result back into eq. (9.32) then ¯n ¯ T )(xR − xQ ) = 0. This implies the vanishing of the projection of leads to (I − n vector xR − xQ onto plane P, and hence xR − xQ = d¯ n. This conſrms the very intuitive fact that the minimum distance between the circle and the plane is achieved when vector xR − xQ is perpendicular to plane P. Equation (9.31) now becomes n ¯ + μk¯ + ν(xR − xC )/ρ = 0 and a scalar product ¯ of this equation by k¯ T then yields the second Lagrange multiplier as μ = −¯ nT k, where constraint eq. (9.32) was used. Introducing this result back into the equation yields (1 − k¯k¯ T )¯ n = −ν(xR − xC )/ρ and taking the norm of the relationship yields ν = k¯ n. It then follows that xR − xC = −(ρ/ν)(I − k¯k¯ T )¯ n and ſnally, xR − xC = (ρ/ k¯ n) k k¯ n, where identity (1.33b) was used. The minimum distance between the plane and the circle is now d = n ¯ T (xR − xQ ) = n ¯ T (xR − xP + T ¯ (xR − xP ), where constraint eq. (9.29) was used; it follows that xP − xQ ) = n T ¯ T (xC − xP ) + ρ¯ nT k k¯ n/ k¯ n and ſnally d=n ¯ (xC − xP + xR − xC ) = n k¯ n. d=n ¯ T (xC − xP ) − ρ Of course, the same result can be obtained in a simpler manner by using simple geometric arguments.
362
9 Constrained systems: preliminaries
9.1.1 Problems Problem 9.1. Minimum distance from point to line Use Lagrange’s multiplier technique to ſnd the minimum distance between an arbitrary point l). P of coordinates xP and a line L = (xQ , ¯
Problem 9.2. Minimum distance from point to circle Use Lagrange’s multiplier technique to ſnd the minimum distance between an arbitrary point ¯ ρ). P of coordinates xP and a circle C = (xC , k,
9.2 Constraints The examples discussed in section 7.2 show the importance of constraints: the freedom of using a number of generalized coordinates that exceeds the number of degrees of freedom comes at the expense of adding kinematic constraints. Using more generalized coordinates than is necessary to represent the conſguration of the system seems, at ſrst, to be a poor idea because this increases the number of unknowns. The alternative, however, i.e., the elimination of the redundant generalized coordinates, can lead to equations that are unduly complex and cumbersome to manipulate. Kinematic constraints, also called holonomic constraints, are not the only type of constraints that are encountered in practice. Nonholonomic constraints are constraints that involve the generalized velocities of the system and cannot be integrated. Both types of constraints will be discussed in this section. 9.2.1 Holonomic constraints Typical kinematic constraints take the form of nonlinear relationships among the generalized coordinates; in general, m such constraints might be imposed on the system Ci (q1 , q2 , . . . qn ) = 0, i = 1, 2, . . . m. (9.33) To simplify the notation, an array of generalized coordinates is is deſned, which
stores the n generalized coordinates of the system, q T = q1 , q2 , . . . , qn . Next, an array of kinematic constraint is introduced that stores the m constraints applied on the system, C T = C1 , C2 , . . . , Cm . The m constraints acting on the system are now expressed in a compact form as C(q) = 0. (9.34) Constraints of this form are called kinematic constraints, conſguration constraints, or more generally, holonomic constraints. Because the constraints do not depend on time explicitly, they are said to be scleronomic constraints. Such constraints reduce the number of degrees of freedom of the system because each constraint could be used to eliminate one generalized coordinate of the system: a system described by n generalized coordinates and subjected to m holonomic constraints presents d = n − m degrees of freedom and is called a holonomic system. The differential of the ith constraint is written as
9.2 Constraints
dCi =
∂Ci ∂Ci ∂Ci dq1 + dq2 + . . . + dqn = 0. ∂q1 ∂q2 ∂qn
363
(9.35)
The differential of each constraint can be computed in a similar manner and expressed in a compact form as (9.36) dC(q) = B(q)dq = 0, where dq T = dq1 , dq2 , . . . , dqn is the array of generalized coordinate differentials. Matrix B is called the constraint matrix or Jacobian matrix of the constraints. Each line of this matrix stores the partial derivatives of one constraint with respect to the generalized coordinates, ⎤ ⎡ ∂C1 ∂C1 ∂C1 ... ⎢ ∂q1 ∂q2 ∂qn ⎥ ⎢ ∂C ∂C ∂C2 ⎥ 2 ⎥ ⎢ 2 ... ⎥ ⎢ ∂q1 ∂q2 ∂qn ⎥ . (9.37) B(q) = ⎢ ⎢ . .. .. .. ⎥ ⎥ ⎢ .. . . . ⎥ ⎢ ⎣ ∂Cm ∂Cm ∂Cm ⎦ ... ∂q1 ∂q2 ∂qn
The kinematic constraints discussed thus far do not depend on time explicitly; such constraints are called scleronomic constraints. It is not uncommon for constraints to be explicit functions of time, C(q, t) = 0;
(9.38)
such constraints are called rheonomic constraints. The differential of rheonomic constraints involves partial derivatives with respect to time, together with the partial derivatives with respect to the generalized coordinates. The array of partial derivatives with respect to time is denoted ∂Cm ∂C1 ∂C2 T b (q, t) = . (9.39) , ,..., ∂t ∂t ∂t With the help of this notation, the differential of rheonomic constraints becomes dC(q, t) = B(q, t)dq + b(q, t)dt = 0.
(9.40)
In the presence of a mixture of scleronomic and rheonomic constraints, the entries in array b corresponding to scleronomic constraints will vanish. Kinematic constraints express relationships among the generalized coordinates that must hold at each instant in time. Consequently, time derivatives of the constraint must also vanish. Considering ſrst a scleronomic constraint, see eq. (9.34), the time derivative is ˙ C(q) = B(q)˙q = 0. (9.41) The time derivative of rheonomic constraints, see eq. (9.38), is found in a similar manner as
364
9 Constrained systems: preliminaries
˙ t) = B(q, t)˙q + b(q, t) = 0. C(q,
(9.42)
These expressions are called velocity level constraints. Of course, higher-order derivatives could be computed: acceleration level constraints, obtained by taking second derivatives of the constraints, are commonly used in many computational schemes for constrained multibody systems. Expressions for the velocity level constraints are linear functions of the generalized velocities because they were obtained by taking time derivatives of the corresponding kinematic constraints. Similarly, acceleration level constraints are linear functions of the generalized accelerations. 9.2.2 Nonholonomic constraints The constraints considered thus far are kinematic or conſguration constraints of the form of eqs. (9.34) or (9.38). In some cases, however, the constraints imposed on a mechanical system are of a different nature from those discussed thus far; consider a differential relationship of the form of eq. (9.40), B(q, t) dq + b(q, t) dt = 0,
(9.43)
such constraints are said to be in Pfafſan form. For holonomic constraints, constraint matrix B and array b store the partial derivatives of the constraints with respect to the generalized coordinates and time, respectively, see eqs. (9.37) and (9.39), respectively. For nonholonomic constraints, constraint matrix B stores a set of arbitrary functions of the generalized coordinates and time ⎤ ⎡ b11 (q, t) b12 (q, t) . . . b1n (q, t) ⎢ b21 (q, t) b22 (q, t) . . . b2n (q, t) ⎥ ⎥ ⎢ (9.44) B(q, t) = ⎢ ⎥, .. .. .. .. ⎦ ⎣ . . . . bm1 (q, t) bm2 (q, t) . . . bmn (q, t) and array b stores arbitrary functions
bT (q, t) = b1 (q, t), b2 (q, t), . . . , bm (q, t) .
(9.45)
The fact that coefſcient bij (q, t), j = 1, 2, . . . , n, and bi (q, t) are arbitrary implies that, in general, there exists no function, Ci (q, t), such that bij (q, t) = ∂Ci /∂qj , j = 1, 2, . . . , n and bi = ∂Ci /∂t. If function Cj (q, t) does not exist, the constraint is not integrable, i.e., it is nonholonomic. When faced with a constraint written in the Pfafſan form, it is important to determine whether such constraint is holonomic or not, i.e., whether it is integrable or not. Conſguration constraints must be continuous, and hence, ∂ 2 Ci /∂qk ∂qj = ∂ 2 Ci /∂qj ∂qk , or ∂bik /∂qj = ∂bij /∂qk . Hence, a differential constraint can be integrated if and only if the following integrability conditions are met ∂ (gi bik ) = ∂qj ∂ (gi bi ) = ∂qj
∂ (gi bij ), j, k = 1, 2, . . . , n, j = k, ∂qk ∂ (gi bij ), j = 1, 2, . . . , n, ∂t
(9.46a) (9.46b)
9.2 Constraints
365
where gi (q, t) are integrating functions. If a constraint is not integrable, it can only be expressed in the Pfafſan form presented in eq. (9.43), or in the velocity form obtained by dividing this equation by dt to ſnd D(q, q, ˙ t) = B(q, t)˙q + b(q, t) = 0,
(9.47)
These expressions are not completely general; indeed, the constraints are assumed to be linear functions of the generalized velocities. Constraints with arbitrary mathematical structures could be imagined, but it turns out that the nonholonomic constraints encountered in common mechanical system appear to all be linear functions of the generalized velocities. In the expressions above, time appears explicitly in the nonholonomic relationships; clearly, time-independent nonholonomic constraints could also occur. If a constraint cannot be integrated, it is a nonholonomic constraint and hence, it cannot be used to eliminate a generalized coordinate: nonholonomic constraints do not decrease the number of degrees of freedom of the system. A system described by n generalized coordinates and subjected to a single nonholonomic constraint features n degrees of freedom. Example 9.4. Two bar linkage tracking a curve Figure 9.4 depicts a planar two bar linkage tracking a curve C. The two bars are of length L1 and L2 , respectively, and make angles θ1 and θ2 with the horizontal, respectively. The end point of the second bar tracks a ſxed planar curve described by position vector p0 (s), where s deſnes the intrinsic parametrization of the curve.
i2
L2 A
L1 O
q1
P q2 h
d
C
i2
p0(h)
L2 q2
A q
L1 O
B i 1
Fig. 9.4. Two bar linkage tracking a curve.
q1
P C
p0(q) q R
d
i1 B
Fig. 9.5. Two bar linkage tracking a semi-circular curve.
This system clearly feature a single degree of freedom. The position vector of the end point P of the linkage can be expressed in two different manners (L1 cos θ1 + L2 cos θ2 ) ¯ı1 + (L1 sin θ1 + L2 sin θ2 ) ¯ı2 = d ¯ı1 + p0 (s), where d is the distance from the origin at point O to point B. The left-hand side of this equation corresponds to path OAP, while the right-hand side corresponds to path OBP. This vector equation expresses two kinematic constraints among three parameters, θ1 , θ2 , and s, conſrming the fact that the system possess a single degree of freedom.
366
9 Constrained systems: preliminaries
At ſrst, curve C will be selected to be a circle of radius R and center B, as shown in ſg. 9.5. The position vector of a point on the curve is then simply p0 (θ) = R cos θ ¯ı1 + R sin θ ¯ı2 . The above kinematic constraints now become L1 cos θ1 + L2 cos θ2 − d = R cos θ,
L1 sin θ1 + L2 sin θ2 = R sin θ.
(9.48)
Two generalized coordinates, q1 = θ1 and q2 = θ2 , will be selected to represent the system. Kinematic constraints (9.48) are now used to eliminate θ, leading to a single constraint for the two generalized coordinates, 2L1 L2 cos(q1 − q2 ) − 2dL1 cos q1 − 2dL2 cos q2 = R2 − L21 − L22 − d2 . This holonomic constraint could be used to eliminate either q1 or q2 , but the trigonometric functions involved in the constraint will lead to complex expressions. The constraint matrix is B(q) = L1 d sin q1 − L1 L2 sin(q1 − q2 ), L2 d sin q2 + L2 L1 sin(q1 − q2 ) . This is not the only way to proceed. It might be preferable to keep θ, the location along the circle, as a generalized coordinate. Hence, the generalized coordinates would be selected as q1 = θ1 and q2 = θ. Kinematic constraints (9.48) are then used to eliminate θ2 , leading to a single constraint, 2RL1 cos(q1 − q2 ) + 2dL1 cos q1 − 2dR cos q2 = L21 − L22 + R2 + d2 . The constraint matrix is B(q) = −L1 d sin q1 − L1 R sin(q1 − q2 ), Rd sin q2 + RL1 sin(q1 − q2 ) . Finally, curve C is assumed to be arbitrary, as depicted in ſg. 9.4. In this case, it might be very difſcult to use the constraint conditions to eliminate any parameter. Consequently, it is convenient to use three generalized coordinates, q1 = θ1 , q2 = θ2 , and q3 = s, linked by two kinematic constraints expressed as (L1 cos q1 + L2 cos q2 ) ¯ı1 + (L1 sin q1 + L2 sin q2 ) ¯ı2 = d ¯ı1 + p0 (q3 ). The constraint matrix becomes −L1 sin q1 −L2 sin q2 −¯ıT1 t¯(q3 ) , B(q) = L1 cos q1 L2 cos q2 −¯ıT2 t¯(q3 ) where t¯(q3 ) = dp0 /dq3 is the unit tangent to the curve at location s. Example 9.5. The rigid body/universal joint system This example deals with a rigid body attached to the ground by means of a universal joint, as depicted in ſg. 7.9 and discussed in example 7.5 on page 266. Component k of the universal joint, see ſg. 7.10, is connected to the ground at point O by means of a bearing allowing rotation about axis ¯ı3 . Component is connected to a rigid body at point O . The orientation of the rigid body will be deſned by Euler angles, using the 31-2 sequence. A ſrst planar rotation about axis ¯ı3 , of magnitude φ, brings inertial basis I = (¯ı1 , ¯ı2 , ¯ı3 ) to A = (¯ a1 , a ¯2 , a ¯3 ), where a ¯1 is aligned with unit vector ¯b1 of the cruciform. This rotation is associated with a constant angular speed Ω, implying a ¯1 (t) = cos(Ωt) ¯ı1 + sin(Ωt) ¯ı2 . A second planar rotation about axis a ¯1 ,
9.2 Constraints
367
of magnitude θ, brings basis A to B = (¯b1 , ¯b2 , ¯b3 ), where ¯b2 is the second unit vector aligned with the cruciform. Finally, a third planar rotation about axis ¯b2 , of magnitude ψ, bring basis B to E = (¯ e1 , e¯2 , e¯3 ) that is attached to the rigid body. Points O and O are coincident. The system present two degrees of freedom. Rather than describing the conſguration of the system with the sequence of three planar rotations discussed in the previous paragraph, it might be simpler to argue that the rigid body is free to rotate about ſxed point O and hence, its orientation is determined by three generalized coordinates selected, for instance, as the three Euler angles associated with the rotation tensor R that brings basis I to E. In this scenario, the kinematic constraint associated with the universal joint imposes the following normality condition: C = e¯T2 a ¯1 (t) = 0, a rheonomic kinematic constraint. If the orientation of the rigid body is deſned by Euler angles using the 3-13 sequence, the components of R, resolved in I, are given by eq. (4.11), and the rheonomic kinematic constraint becomes C(q, t) = sin q3 cos(q1 − Ωt) + cos q2 cos q3 sin(q1 − Ωt) = 0, where q1 = φ, q2 = θ, and q3 = ψ are the three generalized coordinates of the problem. These angles are associated with the 3-1-3 sequence, rather than the 31-2 sequence that would more naturally describe the sequence of planar rotations inherent to the present mechanical system. The constraint matrix becomes ⎡ ⎤ cos q2 cos q3 cos(q1 − Ωt) − sin q3 sin(q1 − Ωt) T ⎦. − sin q2 cos q3 sin(q1 − Ωt) B (q, t) = ⎣ cos q3 cos(q1 − Ωt) − cos q2 sin q3 sin(q1 − Ωt) In the presence of a single constraint, the constraint matrix reduces to a single line. For this rheonomic constraint, the partial derivative of the constraint with respect to time is b(q, t) = Ω [sin q3 sin(q1 − Ωt) − cos q2 cos q3 cos(q1 − Ωt)]. Array b features a single entry because the problem involves a single constraint. Example 9.6. The skateboard Figure 9.6 depicts the simpliſed conſguration of a skateboard of mass m and moment of inertia I about its center of mass G. The skateboard rolls without sliding on the horizontal plane by means of a wheel aligned with unit vector e¯1 of the skateboard and located at point C, a distance from the center of mass. The position vector of the center of mass is written as r G = x ¯ı1 + y ¯ı2 and the axis of the skateboard makes an angle θ with the horizontal. The equations of motion of this planar problem are readily obtained from Newton’s second law as F C sin θ = m¨ x,
−F C cos θ = m¨ y,
¨ −F C = I θ,
(9.49)
where F C = −F C e¯2 is the contact force vector between the wheel and the ground. The system is subjected to one constraint: because the wheel does not slip, the velocity vector of the contact point must be along unit vector e¯1 . The velocity of ˙e2 , and hence, the constraint becomes e¯T v = 0, or point C is v C = x¯ ˙ ı1 + y¯ ˙ ı2 + θ¯ 2 C
368
9 Constrained systems: preliminaries
C i2
e2
G l
C e1
vC c
F q
i1
i1 Fig. 9.6. Conſguration of the skateboard.
D = θ˙ − x˙ sin θ + y˙ cos θ = 0.
(9.50)
Let the generalized coordinates of the problem be q1 = x, q2 = y, and q3 = θ. The constraint equations now takes the form of eq. (9.47), where B(q) = − sin θ, cos θ, , b(q) = 0. (9.51) Clearly, the integrability conditions, eqs. (9.46a), are not satisſed, and hence, the constraint is nonholonomic. Time does not appear explicitly and the constraint is linear in terms of the generalized velocities. Example 9.7. Contact between two rigid bodies Consider a rigid body, denoted body k, whose outer shape is described by a surface, denoted Sk , as indicated on ſg. 9.7. The surface coordinates for this surare denoted η1k and η2k , see section 2.4. A body attached frame, F k = face k B , B k = (¯bk1 , ¯bk2 , ¯bk3 ) , is deſned by the position vector uk of its origin with respect to point O and by the rotation tensor Rk that brings basis I to B k . Finally, the position vector of an arbitrary point on the surface is denoted pk0 (η1k , η2k ). Tensor components resolved in basis B k are denoted with superscript (·)∗ . Consider now a second body, denoted body , whose conſguration is described in a manner identical to that used for body k, replacing the superscript (·)k with (·) , see ſg. 9.7. Position vectors xk and x of arbitrary points on bodies k and , respectively, with respect to point O, now become xk = uk + Rk p∗k (η1k , η2k ), 0
x = u + R p∗ (η1 , η2 ). 0
The conſguration of body k is represented by eight generalized coordinates: three displacement components for vector uk , three rotation components for the rotation tensor Rk and two surface coordinates η1k and η2k . To assess the effect of the contact force on the body, it is necessary to know the location the contact point, and hence, the last two generalized coordinates are an inherent part of the formulation. At ſrst, imagine that the bodies are at a short distance from each other: the points that are about to come in contact with each other, called candidate contact points, must satisfy a number of kinematic constraints. First, the tangent planes to bodies k and at the candidate contact points must be parallel. Second, the vector joining the two candidate contact points must be parallel to the common normal. The tangent planes are those spanned by the surface base vector ak1 (η1k , η2k ) and ak2 (η1k , η2k ) for
9.2 Constraints l
b2
h2l
l
S
l
S
x C a1 l n a2l k
p0
i3
n k x C a1k
k
k
u,R
h1k
k
i2
I i1
b3l B
l
b1l
C
h1l
a2k h2k
S
B
l
l
l
l
369
O
B
S
k
p0k
b3k k
Bk
b2k l
u,R
b1k
l
Fig. 9.7. Two rigid bodies contacting at point C.
surface Sk , and a1 (η1 , η2 ) and a2 (η1 , η2 ) for surface S ; the normals to the same planes are denoted n ¯ k (η1k , η2k ) and n ¯ (η1 , η2 ), respectively. The kinematic conditions deſning the location of the candidate contact points can now be expressed in the following manner, k k akT n (η1 , η2 ) = 0, 1 (η1 , η2 )¯ k k akT n (η1 , η2 ) 2 (η1 , η2 )¯ k k k k k akT 1 (η1 , η2 ) x (η1 , η2 ) − x (η1 , η2 ) k k k k k akT 2 (η1 , η2 ) x (η1 , η2 ) − x (η1 , η2 )
(9.52a)
= 0,
(9.52b)
= 0,
(9.52c)
= 0.
(9.52d)
The ſrst two constraints express the parallelism between the two tangent planes, and the last two the parallelism between the normal and the vector joining the candidate contact points. These four holonomic constraints each involve the generalized coordinates η1k , η2k , η1 , and η2 , and could be used to solve for these four generalized coordinates, eliminating them from the formulation. For complex surfaces, however, the position, base, and normal vectors at a point are complex, nonlinear functions of two of the generalized coordinates that deſne the surface. Consequently, the elimination process will be arduous, if not outright impossible. This demonstrates the advantage working with redundant generalized coordinates. Next, the two bodies are assumed to be in rolling contact with each other and point C is the instantaneous point of contact. Surface coordinates (¯ η1k , η¯2k ) and k (¯ η1 , η¯2 ) denote the location of the contact point on surfaces S and S , respectively, which satisfy the holonomic constraints expressed by eqs. (9.52). The velocities of this point of contact, computed using the conſgurations of bodies k and , now become v k = u˙ k + ω k Rk p∗k (¯ η1k , η¯2k ), and v = u˙ + ω R p∗ (¯ η1 , η¯2 ), respectively. 0 0 When taking a time derivative of the inertial position vectors to obtain velocities, the surface coordinates (¯ η1k , η¯2k ) and (¯ η1 , η¯2 ) were held constant because they are the ſxed parameters that deſned the material points on bodies k and l that are located at
370
9 Constrained systems: preliminaries
the instantaneous point of contact. The difference between the instantaneous point of contact between the two rigid bodies and the material points that are instantaneously located at this point of contact is discussed in section 5.4. If the two bodies are in rolling contact with respect to each other, the tangential components of the instantaneous relative velocity of the points of contact on body k and must vanish, leading to the following two nonholonomic constraints, k k ∗k k k D1 = akT η1k , η¯2k ) u˙ − u˙ k + ω R p∗ (¯ η , η ¯ ) − ω R p (¯ η , η ¯ ) = 0, 1 (¯ 1 2 1 2 0 0 D2 = akT η1k , η¯2k ) u˙ − u˙ k + ω R p∗ (¯ η1 , η¯2 ) − ω k Rk p∗k (¯ η1k , η¯2k ) = 0. 2 (¯ 0 0 These velocity level constraints are not integrable. Indeed, the surface coordinates of the instantaneous points of contact, (¯ η1k , η¯2k ) and (¯ η1 , η¯2 ), for body k and , respectively, are time varying functions that depend on the solution of the problem and must satisfy the holonomic constraints expressed by eqs. (9.52). As the bodies roll on each other, the material points of bodies k and that are instantaneously located at the point of contact describe complex curves embedded in surfaces Sk and S , respectively. These nonholonomic constraints do not reduce the number of degrees of freedom of the problem. The two rigid bodies are rolling against each other, but the trajectories of the instantaneous points of contact describe arbitrary curves on the two surfaces. The two bodies could also be rotating with respect to each other about an axis passing through the point of contact and normal to the tangent plane at the contact point, in a manner such that the contact point coincides with ſxed material points on either bodies. 9.2.3 Problems Problem 9.3. Integrability conditions Show that the integrability conditions are not satisſed for the nonholonomic constraint, eq. (9.50), associated with the skateboard system.
Problem 9.4. Spatial mechanism The spatial mechanism depicted in ſg. 5.6 consists of an arm of length La attached to the ground at point S and rotating about axis ¯ı1 of the inertial frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]; the time-dependent rotation angle of unit vector s¯2 with respect to axis ¯ı2 is denoted θ(t). A rigid link connects point P, at the tip of the arm, to point Q that is free to slide along axis ¯ı1 . The link is of length Lb and the distance from point O to point Q is denoted x. (1) How many degrees of freedom does this mechanism present? (2) If θ and x are used as generalized coordinates, write the appropriate constraint (or constraints) applied on the system. (3) Derive the constraint matrix.
Problem 9.5. Crank-slider mechanism The crank-slider mechanism depicted in ſg. 9.8 consists of a uniform crank of length L1 and mass m1 connected to the ground at point O; let θ be the angle from the horizontal to the crank. At point B, the crank connects to a uniform linkage of length L2 and mass m2 that slides along point P, a ſxed point in space, located at a distance d from point O. Let w
9.3 The principle of virtual work for constrained static problems
371
denote the distance from point B to point P and φ the angle from the horizontal to link BP. (1) How many degrees of freedom does this mechanism present? (2) If θ, φ, and w are used as generalized coordinates, write the appropriate constraint (or constraints) applied to the system. (3) Derive the constraint matrix.
b
i2 L1, m1 O
q
w
B
I O
L2, m2
d(t)
f P d
T
i2
k
x1, x2
i1
i1 T
Fig. 9.8. Crank-slider mechanism rotating at a constant angular velocity.
A
q
B
L
Fig. 9.9. Conſguration of the bar with an actuator.
Problem 9.6. Bar with actuator Rigid homogeneous bar AB of length L is connected to the ground at point T by means of a spring of stiffness constant k as depicted in ſg. 9.9. The other end of the bar is connected to the ground at point O through an actuator of prescribed length, d(t). The conſguration of bar AB is deſned by three generalized coordinates, the coordinates of its center of mass, x1 and x2 , and its orientation with respect to the horizontal, θ. (1) How many degrees of freedom does this mechanism present? (2) If x1 , x2 , and θ are used as generalized coordinates, write the appropriate constraint(s) applied to the system. (3) Derive the constraint matrix.
Problem 9.7. The two-bar linkage with slider system The two-bar linkage with slider system shown in ſg. 8.4 is a planar mechanism. It consists of a uniform crank of length L1 and mass m1 connected to the ground at point O; let θ be the angle from the horizontal to the crank. At point B, the crank slides over a uniform linkage of length L2 and mass m2 that is connected to the ground at point A. Let w denote the distance from point B to point A and φ the angle from the horizontal to link BA. (1) How many degrees of freedom does this mechanism present? (2) If θ, φ, and w are used as generalized coordinates, write the appropriate constraint (or constraints) applied to the system. (3) Derive the constraint matrix.
Problem 9.8. Particle in a circular slot with guiding arm A particle of mass M slides along a circular slot of radius R, as shown in ſg. 8.15. The particle also slides in a rectilinear slot in an arm of mass m and length L. The arm is pivoted to the ground at point O and is restrained by a torsional spring of stiffness constant k and a dashpot of constant c. The spring is un-stretched when angle φ = 0. (1) How many degrees of freedom does this mechanism present? (2) If θ, φ, and w are used as generalized coordinates, write the appropriate constraint(s) applied to the system. (3) Derive the constraint matrix.
9.3 The principle of virtual work for constrained static problems In section 7.5, the principle of virtual work was presented for a single particle and for systems of particles. It was pointed out that Newton’s ſrst law does not distin-
372
9 Constrained systems: preliminaries
guish among various types of forces: “the sum of all forces must vanish.” On the other hand, the nature of the applied forces profoundly affects the statement of the principle of virtual work: conservative forces can be derived from a potential, but non-conservative forces cannot, see section 3.2; this fundamental difference is reƀected in statements of the principle. Constraint forces form another important category of forces. In Newtonian mechanics, such forces are treated like any other applied force but in the principle of virtual, such forces are the object of special treatment, as explained in section 7.5.2. Indeed, it is possible to eliminate constraint forces from the formulation by choosing virtual displacements in a speciſc manner, as presented in the next sections. 9.3.1 The principle of virtual work for a constrained particle Application of the principle of virtual work to constrained system will require a close scrutiny of the forces associated with the constraints because the virtual work done by these forces presents special properties. Consider the case of a particle constrained to move in a slot inclined at an angle ψ with respect to the horizontal, as depicted in ſg. 9.10. The particle is connected to two springs of stiffness k1 and k2 , respectively, that remain at all times horizontal and vertical, respectively. The particle is subjected to an externally applied force, F a . This simple problem will now be treated using various approaches. Newtonian approach a
F
i2 k1
y k2
O
a
n
c
F
Fn
k1 x1
i1 t
y
Free body diagram k2 x2 Fig. 9.10. Particle in a slot.
First, the problem depicted in ſg. 9.10 will be solved using Newton’s approach. If the particle is in static equilibrium, Newton’s ſrst law implies F e + F c + F a = 0, where F e = −k1 x1¯ı1 − k2 x2¯ı2 is the elastic force the two springs apply on the particle and F c = F c n ¯ the constraint force applied by the slot. The tangent and normal to the slot, denoted t¯ = cos ψ ¯ı1 + sin ψ ¯ı2 and n ¯ = − sin ψ ¯ı1 + cos ψ ¯ı2 , respectively, are deſned, and the position vector of the particle is r = x1 ¯ı2 +x2 ¯ı2 . The constraint force solely acts in the direction of the normal because the particle is free to move along the slot. The vector equilibrium equation, projected along unit vectors ¯ı1 and ¯ı2 , yields two scalar equations,
−k1 x1 − sin ψFnc + F1a = 0,
and
− k2 x2 + cos ψFnc + F2a = 0,
(9.53)
respectively. This problem involves three unknowns: the coordinates of the particle, x1 and x2 , and the constraint force, Fnc . The solution of the problem will require the above two equations of equilibrium complemented by the constraint equation, C = −x1 sin ψ +
9.3 The principle of virtual work for constrained static problems
373
x2 cos ψ = 0. These three equations are easily solved by noticing that a solution of the form x1 = r cos ψ, x2 = r sin ψ, where r is the distance from point O to the particle, automatically satisſes the constraint equation. The equilibrium equations then become −k1 r cos ψ − Fnc sin ψ + F1a = 0 and −k2 r sin ψ + Fnc cos ψ + F2a = 0, respectively. The solution is readily found to be r = Fta /k, where Fta = F1a cos ψ + F2a sin ψ is the component of the applied force along tangent t¯, and k = k1 cos2 ψ + k2 sin2 ψ is the effective spring constant. The constraint force is Fnc = (k2 − k1 )r sin ψ − Fna , where Fna = −F1a sin ψ + F2a cos ψ is the component of the applied force acting along normal n ¯ . The complete solution is then x1 =
Fta cos ψ, k
x2 =
Fta sin ψ, k
Fnc =
k2 − k1 a Ft sin ψ cos ψ − Fna . k
In conclusion, when treating this problem using Newtonian mechanics, constraint forces are an integral part of the problem; the equations of equilibrium, eqs. (9.53), cannot be written without explicitly taking these forces into account. The complete solution of the problem involves the determination of both displacements and constraint forces. The principle of virtual work Next, the same problem will be analyzed with the help of the principle of virtual work developed in section 7.5.1. The particle is in static equilibrium if and only if δV e = (F c + F a )T δr,
(9.54)
for all arbitrary virtual displacements, δr = δx1 ¯ı1 +δx2 ¯ı2 . The potential of the elastic forces in the springs is V e = k1 x21 /2 + k2 x22 /2. Expanding the statement of the principle leads to [k1 x1 + sin ψFnc − F1a ] δx1 + [k2 x2 − cos ψFnc − F2a ] δx2 = 0; because the virtual displacement components are arbitrary, the bracketed terms must vanish, and equations of equilibrium (9.53) are recovered: as expected, the principle of virtual work is equivalent to Newton’s ſrst law. Here again, the force of constraint is an integral part of the formulation, and is treated like any other externally applied forces. Virtual work done by the constraint force The virtual work done by the constraint force is δW c = F cT δr = Fnc n ¯ T δr. Because virtual displacements are completely arbitrary, this virtual work does not necessarily vanish; indeed, virtual displacements are not required to satisfy the kinematic constraints of the problem. The particle is conſned to remain in the slot, it can move along the tangent vector to the slot only; this direction is called the kinematically admissible direction. The direction normal to the slot is called the kinematically inadmissible direction, or the infeasible direction, because the particle is not allowed to move in that direction.
374
9 Constrained systems: preliminaries
It is interesting to contrast the virtual work done by the constraint forces with the corresponding differential work, dW c = Fnc n ¯ T dr. The differential displacement, dr, is the true, inſnitesimal displacement of the particle along its path. Because it must satisfy the constraint, this differential displacement is along the slot tangential direction, dr = t¯dr. It follows that the differential work done by the force of constraint does vanish, dW c = Fnc n ¯ T dr = Fnc n ¯ T t¯ dr = 0. Next, the constraint equation is written in a compact manner as C = n ¯ T r and a variation of this constraint is δC = Bδr = n ¯ T δr: for this simple problem, the constraint matrix coincides with the normal to the slot. The virtual work done by the constraint force now becomes δW c = Fnc δC.
(9.55)
If the virtual displacements satisfy the constraint condition, i.e., if they are along the tangent to the slot, δC = n ¯ T δr = 0, and the virtual work done by the constraint forces vanishes. Principle of virtual work with kinematically admissible virtual displacements The concept of kinematically admissible virtual displacements was introduced in section 7.5.2; such virtual displacements are not completely arbitrary but are required to satisfy the kinematic constraints. For the case at hand, kinematically admissible virtual displacements are such that δC = n ¯ T δr = − sin ψ δx1 + cos ψ δx2 = 0. This constraint will be automatically satisſed by selecting δx1 = cos ψ δr and δx2 = sin ψ δr, which implies δr = (cos ψ ¯ı1 + sin ψ ¯ı2 )δr = t¯δr. As expected, kinematically admissible virtual displacements are virtual displacements along the sole direction compatible with the constraint condition, the direction tangent to the slot. Consequently, kinematically admissible virtual displacements are also called virtual displacements compatible with the constraints. For kinematically admissible virtual displacements, δC = 0, and eq. (9.55) implies the vanishing of the virtual work done by the constraint forces. The principle of virtual work now becomes: the particle is in equilibrium if and only if δV e = F aT δr,
(9.56)
for all kinematically admissible virtual displacements, δr = t¯δr. This principle implies k1 x1 δx1 +k2 x2 δx2 = F aT t¯δr and ſnally (k1 cos2 ψ+k2 sin2 ψ)rδr = Fta δr. Because the kinematically admissible virtual displacement, δr, is arbitrary, it follows that r = Fta /k, the same solution as found above. In summary, the principle of virtual work can be stated as follows. Principle 17 (Principle of virtual work for a particle) A particle is in static equilibrium if and only if the virtual work done by the externally applied forces vanishes for all kinematically admissible virtual displacements.
9.3 The principle of virtual work for constrained static problems
375
Discussion It is interesting to contrast the two statements of the principle of virtual work given by principle 8 on page 273 and principle 17 above. These two principles are nearly identical. When the principle of virtual work is used with arbitrary virtual displacements, as principle 8, the virtual work done by the reactions forces must be included in the statement of the external virtual work because it does not vanish, see eq. (9.54). The principle of virtual work is only another form of Newton’s ſrst law stating that the sum of all externally applied forces must vanish for static equilibrium to occur. Newton’s ſrst law does not distinguish between various types of forces; “all externally applied forces” means all forces, including the constraint forces. In contrast, when the principle of virtual work is used with kinematically admissible virtual displacements, as principle 17, the virtual work done by the externally applied forces does not include the reaction forces. Indeed, the virtual work they perform automatically vanishes because kinematically admissible virtual displacements are orthogonal to the reaction forces, see eq. (9.56). These two principles are derived from Newton’s law to which they are equivalent. When arbitrary virtual displacements are used, all equilibrium equations of the problem are recovered. If the virtual displacements are limited to those that are kinematically admissible, a subset of the equilibrium equations is recovered. More generally, consider a system featuring n generalized coordinates and m kinematic constraints for a total of d = n − m degrees of freedom. Application of the principle of virtual work with arbitrary virtual displacements, as principle 8, leads to n equations of equilibrium, identical to those obtained from Newton’s ſrst law. These equations will involve the n generalized coordinates as well as the m constraint forces associated with the m kinematic constraints. The n equations of equilibrium and m constraint equations are then solved to yield the n unknown generalized coordinates and m constraint forces. In contrast, application of the principle of virtual with kinematically admissible virtual displacements, as principle 17, leads to d equations of equilibrium. These equations will involve the sole d degrees of freedom of the problem; the constraint forces vanish from the formulation. It is often more convenient to use a formulation based on kinematically admissible virtual displacements; indeed, fewer equations are obtained, involving a smaller number of unknowns. This simpliſcation, however, comes at the expense of eliminating the constraint forces from the formulation, thereby loosing all information about these important forces acting on the system. The use of virtual displacements that violate the kinematic constraints is by no means incorrect. As mentioned earlier, virtual displacements can be interpreted as “mathematical experiments” or “what if?” scenarios. In contrast with real displacements that must indeed satisfy all kinematic constraints, virtual displacements are not constrained to satisfy these same conditions. Of course, if the analyst chooses to work with virtual displacements that violate kinematic constraints, the virtual work associated with the corresponding constraint forces must be taken into account.
376
9 Constrained systems: preliminaries
Example 9.8. Particle sliding along a curve The concepts discussed in the previous section will be illustrated by investigating the problem depicted in ſg. 9.11. A particle of mass m is sliding along a track whose shape is deſned by an arbitrary curve C described by its intrinsic parametrization; curvilinear variable s measures length along the curve. The particle is connected to point O by means of an elastic spring of stiffness constant k and vanishing unstretched length; it is subjected to an externally applied force F a .
i2
Arbitrary curve C m k
O
C
i2
s
R
q m
Fa O i1
a
F
h
k i1 d
Fig. 9.11. Particle connected to a spring and sliding along a curve.
Fig. 9.12. Particle connected to a spring and sliding along a circle.
Newton’s ſrst law is used to derive the equilibrium equations of the system: F e + F c + F a = 0. The elastic force the spring applies on the particle is F e = −kx1¯ı1 − kx2¯ı2 and the constraint force the track applies on the particle is F c = Fnc n ¯ , where n ¯ is the unit vector normal to the track. Projecting Newton’s ſrst law along unit vectors ¯ı1 and ¯ı2 then leads to −kx1 + (¯ıT1 n ¯ )Fnc + F1a = 0,
and
− kx2 + (¯ıT2 n ¯ )Fnc + F2a = 0,
(9.57)
respectively. These equations involve three unknowns: the coordinates of the particle, x1 and x2 , and the magnitude of the normal contact force, Fnc . A third equation is required to solve the problem: the deſnition of the curve that can be viewed as a kinematic constraint linking generalized coordinates, x1 and x2 . In general, the shape of the curve will be deſned through its intrinsic parametrization; in this case, the position of the particle is deſned in terms of s, x1 = x1 (s) and x2 = x2 (s). The two equilibrium equations are then sufſcient to solve for s and Fnc . Next, the principle of virtual work based on arbitrary virtual displacements is used to solve the problem. The position vector of the particle is r = x1¯ı1 + x2¯ı2 , and the virtual displacement vector is δr = δx1¯ı1 + δx2¯ı2 . The potential of the elastic forces in the spring is V e = k(x21 + x22 )/2. The principle of virtual work then leads to [kx1 − (¯ıT1 n ¯ )Fnc − F1a ]δx1 + [kx2 − (¯ıT2 n ¯ )Fnc − F2a ]δx2 = 0. Because the virtual displacement components are arbitrary, this statement involves the reaction force, Fnc , and is identical to the equilibrium equations obtained from Newton’s ſrst law, eqs. (9.57). Of course, it is also possible to use the principle of virtual work based on kinematically admissible virtual displacements. Kinematically admissible virtual displace-
9.3 The principle of virtual work for constrained static problems
377
ments are readily obtained by considering the position of the particle to be an implicit function of the curvilinear coordinate, s, to ſnd δr = (dr/ds)δs = t¯δs, where t¯ is the unit tangent vector to the curve. Arbitrary virtual changes in the curvilinear coordinate, δs, then generate virtual displacements, δr = t¯δs, that are compatible with the constraints; as expected, kinematically admissible virtual displacements are along the tangent to the curve. As discussed in the previous section, the virtual work done by the constraint forces vanishes when using virtual displacements compatible with the constraints. Indeed, the constraint force acts along the normal to the curve and kinematically admissible displacements are along the tangent to the curve. Because the normal and tangent vectors are orthogonal to each other, the virtual work done by the constraint force vanishes, although the constraint force itself does not. The potential of the elastic forces in the spring can be expressed in terms of curvilinear variable s as V e = V e (s). The principle of virtual work now reduces to (dV e /ds) δs = F aT t¯δs. Because δs is arbitrary, it follows that dV e = Fta , ds
(9.58)
where Fta = t¯T F a is the tangential component of the externally applied force. This single equation can be solved to ſnd the equilibrium position of the particle along the curve. Example 9.9. Particle sliding along a circle To illustrate the process described in example 9.8, let the curve be a circle of radius R, as shown in ſg. 9.12. The coordinates of the particle then become x1 = d − R cos θ and x2 = h − R sin θ, where d, h, and θ are deſned in the ſgure. The unit tangent vector to the circle is easily found to be t¯ = sin θ ¯ı1 − cos θ ¯ı2 , and the unit normal vector is n ¯ = − cos θ ¯ı1 − sin θ ¯ı2 . For a circle, it is more convenient to use an arbitrary parameterization of the curve, angle θ, rather than its intrinsic parameterization, s = Rθ. The equilibrium equations obtained from Newton’s ſrst law, eqs. (9.57), are −k(d − R cos θ) + cos θFnc + F1a = 0 and −k(h − R sin θ) + sin θFnc + F2a = 0. These equations are expressed in terms of the angular position of the particle, angle θ, and the normal contact force, Fnc . Eliminating the normal contact force yields the angular position of the particle as tan θ =
kh − F2a . kd − F1a
Finally, the magnitude of the normal contact force becomes Fnc = (kd − F1a )2 + (kh − F2a )2 − kR.
(9.59)
(9.60)
To apply the principle of virtual work with kinematically admissible virtual displacements, the potential of the elastic spring is expressed in terms of the particle’s angular position as V e = 1/2 k[(d − R cos θ)2 + (h − R sin θ)2 ]. It then follows
378
9 Constrained systems: preliminaries
that dV e /ds = (dV e /dθ) (dθ/ds) = k(d sin θ − h cos θ). The principle of virtual work, expressed by eq. (9.58), then yields k(d sin θ −h cos θ) = F1a sin θ −F2a cos θ. Solving this single equation gives the angular position of the particle, eq. (9.59). The use of the principle of virtual work based on the kinematically admissible virtual displacements is expeditious: it yields a single equation for the single degree of freedom of the problem. The constraint force does not appear in this equation, in contrast with the case of Newton’s formulation. 9.3.2 The principle of virtual work and Lagrange multipliers The problem discussed in the previous example and depicted in ſg. 9.11 involves two scalar kinematic constraints: at all times, the particle must remain on curve C. These constraints will be expressed as C = r(x1 , x2 )− p0 (s) = 0. The conſguration of this single degree of freedom system is now represented by three generalized coordinates, the two Cartesian coordinates of the particle, x1 and x2 , and the curvilinear variable, s, linked by two kinematic constraints. Variation of these constraint can be written as δC = δr − t¯δs, where t¯ is the unit tangent vector to the curve. In all previous examples, the forces of constraint were introduced at the onset of the problem to represent the effect of kinematic constraints. In this section, however, kinematic constraints will be enforced using Lagrange’s multiplier method presented in section 9.1. In this approach, the constrained problem is transformed into an unconstrained problem based on an augmented potential, see eq. (9.25), 1 V + = V e + λT C = k(x21 + x22 ) + λT r(x1 , x2 ) − p0 (s) , 2 where the generalized coordinates, x1 , x2 , and s, and Lagrange’s multipliers, λ, are all unconstrained variables. The principle of virtual work now implies −F eT δr + λT (δr − t¯δs) + δλT C = aT F δr. Since all variations δr, δs, and δλ are arbitrary, the following equations are obtained −F e + λ = F a , λT t¯ = 0, r − p0 (s) = 0.
(9.61a) (9.61b) (9.61c)
Equation (9.61a) is the equation of equilibrium for the particle, stating that the sum of the externally applied forces must vanish. Lagrange’s multipliers can be interpreted as the constraint forces, λ = −F c , and the equation then becomes −F e − F c = F a . Equation (9.61b) implies that Lagrange’s multipliers are normal to the tangent vector, i.e., are oriented along the normal to the curve. If Lagrange’s multiplier are written as λ = λt t¯+ λn n ¯ , eq. (9.61b) implies λt = 0: the constraint force consists of a sole component acting along the normal direction. Finally, eq. (9.61c) is just the kinematic constraint. The principle of virtual work yields ſve equations to be solved for the three generalized coordinates, x1 , x2 , and s, and two Lagrange multipliers, λt and λn .
9.3 The principle of virtual work for constrained static problems
379
This simple example reveals the close relationship that exists between Lagrange’s multipliers and constraint forces. In section 9.1, the multipliers were introduced as auxiliary mathematical variables devoid of any physical meaning. Within the framework of the principle of virtual work, Lagrange’s multipliers are closely related to constraint forces. It would be a mistake, however, to simply equate Lagrange multipliers and constraint forces. The term that augments the potential of the problem, λT C, could also be written as (λT /a)(aC), where a is an arbitrary constant. Lagrange’s multipliers would then become proportional to the constraint forces, where a is the constant of proportionality. In the above example, λ = −F c ; had the constraint been written as C = −r + p0 (s), the corresponding result would have been λ = +F c . Finally, the term that augments the potential could be written as λ1 C12 + λ2 C22 , where C1 and C2 are the components of constraint C resolved in basis I. In this case, although related to the constraint forces, Lagrange’s multipliers would not be proportional the constraint forces. Clearly, it is important to determine the precise physical meaning of Lagrange’s multipliers to help explain the signiſcance of the equations derived from the principle of virtual work. Example 9.10. Particle sliding along a curve with friction In this example, the problem of a particle sliding along a track whose shape is deſned by an arbitrary curve C, as depicted in ſg. 9.11, will be investigated once again. This time, a friction force, F f , acts between the particle and the track. The friction force will be assumed to obey Coulomb’s law of static friction, i.e., |F f | ≤ μs |F n |, where μs is the coefſcient of static friction, and F n the normal force at the frictional interface. The virtual work done by the friction force is δW f = F f δs, because the friction force acts in the direction tangent to the curve. The principle of virtual work based on kinematically admissible virtual displacements is used ſrst. This principle implies (dV e /ds) δs = F aT t¯δs + F f δs, and the equilibrium equation becomes dV e /ds = Fta + F f . This equations includes the effect of the friction force and should be compared with its counterpart, eq. (9.58), that ignores this effect. Unfortunately, this approach does not yield enough information to solve the problem: the single equation of equilibrium involves two unknowns, the curvilinear variable, s, and the friction force that depends on the unknown normal contact force, F n . The main advantage of the principle of virtual work based on kinematically admissible virtual displacements is to eliminate the constraint forces from the formulation. This advantage turns out to be a drawback in the present situation: the constraint force is, in fact, the normal force at the frictional interface and is required to evaluate the friction force. In contrast, the principle of virtual work in combination with Lagrange’s multiplier technique yields an elegant solution to the problem. Using the notation deſned in section 9.3.2, the principle of virtual work now implies −F eT δr+λT (δr − t¯δs)+ δλT C = F aT δr + F f δs, and the governing equations of the problem become
380
9 Constrained systems: preliminaries
−F e + λ = F a ,
(9.62a)
−λ t = F ,
(9.62b)
T¯
f
r − p0 (s) = 0.
(9.62c)
Here again, these equations should be compared with their counterpart, eq. (9.61), that ignore the effect of friction. The second equation only has been modiſed by the addition of friction: within a sign, the tangential component of Lagrange’s multiplier can be interpreted as the friction force itself. Solving these equation with the condition |F f | ≤ μs |F n | or |λt | ≤ μs |λn | will yield a range of angular positions of the particle for which equilibrium is possible. Of course, the principle of virtual work based on arbitrary virtual displacements would also provide enough information to solve the problem because it would bring to light the normal contact force required to quantify the friction force. Example 9.11. Constrained system of particles, Lagrange multiplier approach The two systems depicted in ſg. 9.13, denoted system 1 and system 2, respectively, are clearly different and feature different equilibrium conſgurations. Examples 7.14 and 7.15 have treated these two problems using the principle of virtual work, and the equations of equilibrium were found to be given by eq. (7.42a) and (7.42b), for the ſrst and eq. (7.45) for the second.
mb
ma
md
md
mc
q ma
i1 k
k
L/3 L/3 L/3 System 1
k
q2
mb q1
i1
u k
mc
u k
k
k
k
L/3 L/3 L/3 System 2
Fig. 9.13. Particle connected to a spring and sliding along a curve.
Imagine that moments of equal magnitudes and opposite signs are now applied at the mid-span hinge of system 2. Intuitively, if the magnitude of these moments is “just right,” the two parts of the articulated bar will align, and θ1 = θ2 . In this case, systems 1 and 2 become equivalent, and the moment applied at the mid-span articulation of system 2 is equal to the mid-span internal moment of system 1, given by eq. (7.43). Rather than ſnding the mid-span moment that is “just needed” to align the two articulated bar of system 2, it is possible to enforce the kinematic constraint, θ1 = θ2 , by means of Lagrange’s multiplier technique. In this case, Lagrange’s multiplier associated with the constraint will be closely related to the moment applied at the hinge, and hence, to the mid-span internal moment of system 1.
9.3 The principle of virtual work for constrained static problems
381
Following this latter line of thought, the kinematic constraint C = θ1 − θ2 = 0 is applied to system 2 using Lagrange’s multiplier technique. The potential of the problem, given by eq. (7.44), is augmented with the following term: λC = λ(θ1 − θ2 ). All variables are considered to be unconstrained, leading to the following set of equations ⎤⎧ ⎫ ⎡ ⎡ ⎤ u ¯⎪ 4¯ − 4m¯ g 4 −2/3 2/3 0 ⎪ ⎪ ⎪ ⎨ ⎬ ⎢ ¯ ¯ ⎢−2/3 5/18 0 1⎥ g⎥ ⎢−(a /2 + b /6) + (ma /2 + mb /6)¯ ⎥ θ1 ⎢ ⎥, ⎣ 2/3 0 5/18 −1⎦ ⎪θ2 ⎪ = ⎣ (¯d /2 + ¯c /6) − (md /2 + mc /6)¯ g ⎦ ⎪ ⎪ ⎩¯⎭ 0 1 −1 0 0 λ ¯ = λ/(kL2 ) is the non-dimensional multiplier. Note that the ſrst 3 × 3 where λ system of equations is identical to that derived for the unconstrained system, see eq. (7.45). As expected, the solutions for u ¯ and θ1 = θ2 are identical to those earlier, see eqs. (7.42a) and (7.42b), respectively. Lagrange’s multiplier λ is the “force” that imposes the kinematic condition θ1 = θ2 , i.e., it is the mid-span bending moment in the now rigid bar. This can be veriſed by solving for Lagrange multiplier and observing that it is indeed equal to the mid-span bending moment given in eq. (7.43). Example 9.12. System of particles with constraints, penalty method approach Additional insight into Lagrange’s multiplier technique can be gained by comparing it with the penalty method for enforcing constraints. Imagine that the two articulated bars shown in ſg. 7.24 are connected by a torsional spring of stiffness constant p. This problem can be solved by adding to the potential of the elastic forces acting in the linear springs, see eq. (7.44), a term for the mid-span torsional spring, 1/2 p(θ1 − θ2 )2 , called the penalty term. The principle of virtual work based on kinematically admissible virtual displacements then yields the following equations ⎤⎧ ⎫ ⎡ ⎤ ⎡ ¯⎬ 4 −2/3 2/3 24¯ − 24m¯ g ⎨u 1 ⎣−2/3 5/18 + p¯ −¯ p ⎦ θ1 = ⎣−(3a + b ) + (3ma + mb )¯ g⎦ , ⎩ ⎭ 6 2/3 −¯ p 5/18 + p¯ (3d + c ) − (3md + mc )¯ θ2 g (9.63) where p¯ = p/(kL2 ) is the non-dimensional stiffness of the mid-span torsional spring. If the mid-span torsional spring is made increasingly stiffer, i.e., as p increases, the relative rotation of the two rigid bars will become increasingly smaller. Indeed, as p increases, an increasing “penalty,” p¯(θ1 − θ2 )2 /2, is payed for any violation of the constraint, θ1 = θ2 . In fact, as p → ∞, the relative rotation vanishes, θ1 → θ2 . This technique is known as the “penalty method” for enforcing constraints. Figure 9.14 shows the solution of eqs. (9.63) as a function of the penalty factor, p¯; note the logarithmic scale on the horizontal axis. The top ſgure shows the convergence of the mid-span displacement; as p¯ increases, the prediction of the penalty method (in dashed lines) converges to the corresponding result for Lagrange’s multiplier technique (in solid line). The middle ſgure shows the rotations of the two bars; both θ1 and θ2 converge to the same value, angle θ1 = θ2 , predicted using the multiplier technique. Finally, the bottom ſgure compares the mid-span bending moments; for Lagrange’s multiplier technique, the bending moment is simply Lagrange’s multi¯ = λ, ¯ whereas for the penalty method, M ¯ = p¯(θ1 − θ2 ). plier, M
382
9 Constrained systems: preliminaries
DISPL.
0.4 0.38 0.36
BEND. MOM. ROTATION
0.34 0.2 0.1 0 -0.1 -0.2 0.04 0.035 0.03 0.025 -1 10
10
0
10
1
2
10
PENALTY FACTOR
Fig. 9.14. Convergence of the mid-span displacement (top ſgure), rotations (middle ſgure), and mid-span bending moment (bottom ſgure) as the penalty factor, p¯, increases. Lagrange multiplier method: solid line; penalty method: dashed line.
The penalty method is easy to use: it does not introduce additional variables, as is the case for Lagrange’s multiplier technique, nor does it eliminate any variable. Unfortunately, it is not a robust method: the exact solution of the problem is only recovered when p → ∞, but in practice, a ſnite value of p¯ must be used to avoid the ill conditioning of the system equations, eqs. (9.63). Lagrange’s multiplier technique is a rigorous approach to the enforcement of constraints; the penalty method is a convenient approach to obtaining approximate predictions. The parallel between the two approaches underlines their common physical interpretation. In the penalty method, the penalty term, V c = 1/2 p¯(θ1 − θ2 )2 , can be interpreted as the “potential of the constraint forces.” Indeed, the constraint mo¯ , can be derived from this potential: M ¯ = dV c /dθ, where θ = θ1 − θ2 is ment, M the relative mid-span rotation. The constraint is exactly enforced in the limiting case ¯ . The where p¯ → ∞, in such a manner that p¯(θ1 − θ2 ) converges to a ſnite value, M potential of the constraint forces now becomes V c = 1/2 p¯(θ1 − θ2 )(θ1 − θ2 ) = ¯ This means that the term λC ¯ introduced in Lagrange’s mul¯ (θ1 − θ2 ) = λC. 1/2 M tiplier technique is, in fact, the potential of the constraint forces. Because the constraint forces associated with kinematic constraints can be derived from a potential, such forces are conservative. Indeed, the work done by such constraint forces vanishes, leaving the total mechanical energy unchanged, the hallmark of conservative forces.
9.3.3 Problems Problem 9.9. Particle on a circular track Consider a particle connected to a ſxed point O by an spring of stiffness constant k and sliding along a circular track, as depicted in ſg. 9.15. Friction acts between the particle and the track. The friction force F f is assumed to obey Coulomb’s law of static friction, i.e., |F f | ≤ μs |F n |, where μs is the coefſcient of static friction, and F n the normal force at the frictional interface. (1) Derive the governing equations of the problem using the principle of virtual work and Lagrange’s multiplier technique. (2) Find the equilibrium position, θ0 , of
9.3 The principle of virtual work for constrained static problems
383
the particle in the absence of friction, i.e., when μs = 0. (3) Find the range of equilibrium positions, θ < θe < θu , as a function of μs ; θ and θu are the lower and upper bounds, respectively, of the angular position of the particle for which equilibrium is possible. (4) Plot these bounds as a function of the static coefſcient of friction μs , i.e., plot θ = θ (μs ) and θu = θu (μs ). (5) Plot the normal contact forces, F¯n (μs ) and F¯un (μs ), acting on the particle when it is located at θ (μs ) and θu (μs ), respectively. (6) Plot the friction forces, F¯f (μs ) and F¯uf (μs ), acting on the particle when it is located at θ (μs ) and θu (μs ), respectively. (7) Plot the total contact forces, F¯c (μs ) and F¯uc (μs ), acting on the particle when it is located at ¯ = h/R = 2. θ (μs ) and θu (μs ), respectively. Use the following data: d¯ = d/R = 1.5; h F¯ n = F n /kR.
md
C i2
q
mc
R i1
h k
i1
q2
mb
ma
q1 u 1 k
k
u2 k
k
O L/3 L/3 L/3
d Fig. 9.15. Particle sliding along a curve with friction.
Fig. 9.16. Two bars supported by four springs.
Problem 9.10. Two bars supported by four springs Figure 9.16 depicts a system consisting of two rigid bars connected to the ground by means of elastic springs of stiffness k. The un-stretched lengths of the four springs are a , b , c , and d , respectively, and the four masses are ma , mb , mc , and md , respectively. The conſguration of the left bar is represented by the displacement, u1 , of its right end and its orientation, θ1 . The conſguration of the right bar is represented by the displacement, u2 , of its left end and its orientation, θ2 . (1) Use the principle of virtual work to ſnd the solution of the problem. (2) Use Lagrange’s multiplier technique to enforce two kinematic constraints: C1 = u1 − u2 = 0 and C2 = θ1 −θ2 = 0. (3) Solve the constrained problem and show that its solution is identical to that found in example 7.14. (4) What is the physical meaning of Lagrange’s multipliers? (5) Validate your solution by comparing the value of Lagrange’s multipliers with predictions based on statics arguments. (6) Enforce the constraints C1 and C2 using the penalty method. (7) Demonstrate the convergence of the solution of the penalty method to that of Lagrange’s multiplier technique as the penalty factor increases. Plot the displacements, rotations, and loads as a function of the penalty factor for both solutions. Use the following data: ¯a = a /L = 0.5; ¯b = b /L = 0.75; ¯c = c /L = 0.60; ¯d = d /L = 0.30; ma = 1.2 kg; mb = 1.50 kg; mc = 0.60 kg; md = 0.45 kg; g¯ = g/kL = 0.2.
Problem 9.11. Four springs supporting a rigid bar Figure 7.23 depicts a system consisting of a rigid bar connected to the ground by means of elastic springs of stiffness k. The un-stretched lengths of the four springs are a , b , c , and d , respectively. The conſguration of system is represented by the displacements, ua , ub , uc , and ud of the four masses, ma , mb , mc , and md , respectively. (1) Use the principle of virtual work to ſnd the solution of the unconstrained problem, i.e., in the absence of the
384
9 Constrained systems: preliminaries
rigid bar. (2) Use Lagrange’s multiplier technique to enforce the four kinematic constraints imposed by the rigid bar: C1 = xa − (x − Lθ/2) = 0, C2 = xb − (x − Lθ/6) = 0, C3 = xc −(x+Lθ/6) = 0, C4 = xd −(x+Lθ/2) = 0, where x is the mid-span displacement of the bar and θ its rotation. (3) Solve the constrained problem and show that its solution is identical to that found in example 7.14. (4) What is the physical meaning of Lagrange’s multipliers? (5) Validate your solution by comparing the value of Lagrange’s multipliers with predictions based on statics arguments. (6) Enforce the constraints C1 , C2 , C3 , and C4 using the penalty method. (7) Demonstrate the convergence of the solution of the penalty method to that of Lagrange’s multiplier technique as the penalty factor increases. Plot the displacements, rotations, and forces as a function of the penalty factor for both solutions. Use the following data: ¯a = a /L = 0.5; ¯b = b /L = 0.75; ¯c = c /L = 0.60; ¯d = d /L = 0.30; ma = 1.2 kg; mb = 1.50 kg; mc = 0.60 kg; md = 0.45 kg; g¯ = g/kL = 0.2.
Problem 9.12. A rigid bar suspended by two spring Figure 9.17 depicts a massless rigid bar of length with A2(d2, h2) end masses, m1 and m2 , suspended by two springs of A1(d1, h1) stiffness constants k1 and k2 . The springs, of un-stretched k2 lengths 1 and 2 , respectively, are connected to the m2 k1 i2 ground at points A1 and A2 , of coordinates (d1 , h1 ) g q and (d2 , h2 ), respectively. Gravity acts on the system in m1 C(xC, yC) the direction indicated on the ſgure. Two different sets i1 of generalized coordinates will be used to represent the system. A three generalized coordinate representation: xc Fig. 9.17. A rigid bar suspended and yc , the position of the center of mass of the system, by two spring. and θ, the orientation of the rigid bar. The second is a four generalized coordinate representation using x1 and y1 , the coordinates of mass m1 , and x2 and y2 , the coordinates of mass m2 . (1) Using the ſrst set of generalized coordinates, ſnd the equilibrium conſguration of the system based on the principle of virtual work. (2) Find the internal force in the rigid bar. (3) For the second set of generalized coordinates, determine the kinematic constraint that links the four generalized coordinates. (4) Find the equilibrium conſguration of the system based on the principle of virtual work and Lagrange’s multiplier technique. (5) Provide a physical interpretation of Lagrange’s multiplier. (6) Show that the solutions obtained with the two sets of generalized coordinates are identical. (7) Using the second set of generalized coordinates, ſnd the equilibrium conſguration of the system based on the principle of virtual work and the penalty method. (8) Study the convergence of this latter solution as the penalty coefſcient increases. Use the following data: d¯1 = d1 / = 0.5, ¯ 1 = h1 / = 1.5, d¯2 = d2 / = 2, ¯ h2 = h2 / = 0.8; 1 = 2 = 0; m ¯ 1 = m1 g/(k) = 5, h m ¯ 2 = m2 g/(k) = 8; k = k1 = k2 .
10 Constrained systems: classical formulations
Chapter 8 presented variational and energy principles for unconstrained dynamical system. This chapter generalizes these formulations to enable the treatment of constrained systems. D’Alembert’s principle is treated in section 10.1. The generalization of Hamilton’s principle and Lagrange’s formulation to systems with holonomic constraints is presented in section 10.2 and section 10.3 generalizes the same formulations to systems with nonholonomic constraints. The second part of the chapter deals with constraints in multibody systems. While mechanical systems employ many types of joints, the lower pair joints are the most commonly used and section 10.4 describes their kinematic characteristics. Section 10.5 develops generic constraints that will be used for all lower pair joints, and the speciſc constraints associated with each of the six lower pair joints are detailed in section 10.6. The chapter concludes with a cursory look at a few additional joints.
10.1 D’Alembert’s principle for constrained systems D’Alembert’s principle was derived in section 8.1 and is expressed by eq. (8.3). This principle involves the virtual work done by the various forces acting on the system: the inertial forces, the conservative forces, and the non-conservative forces. When using kinematically admissible virtual displacements, the virtual work done by the forces that impose kinematic constraints does vanish, as discussed in section 9.3.1; consequently, such forces do not appear in d’Alembert’s formulation. If redundant generalized coordinates are used, the system will be subjected to kinematic constraints, and virtual changes in these generalized coordinates will no longer be kinematically admissible. As discussed in section 9.3.1, it will then be required to take into account the virtual work done by the constraint forces, because this virtual work does not vanish for virtual displacements that are not kinematically admissible. The virtual work done by the constraint forces is elegantly introduced into the formulation by means of Lagrange’s multipliers; as presented in section 9.3.2, the
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_10 © Springer Science+Business Media B.V. 2011
386
10 Constrained systems: classical formulations
potential of the conservative forces is augmented by the potential of the constraint forces expressed in terms of Lagrange’s multipliers. Example 10.1. The double pendulum Consider the double pendulum system depicted in ſg. 10.1. The ſrst bar is of length L1 , mass m1 , and is connected by hinges to the ground at point O and to the second bar at point A. The second bar is of length L2 and mass m2 . The bars have orientation angles θ1 and θ2 with respect to the vertical, respectively. I = (¯ı1 , ¯ı2 , ¯ı3 ) is an inertial basis; bases E = (¯ e1 , e¯2 , e¯3 ) and A = (¯ a1 , a ¯2 , a ¯3 ) are attached to the ſrst and second bars, respectively. This problem was treated in example 8.2 on page 300 with d’Alembert’s principle using kinematically admissible virtual displacements. Derive the equations of motion of the system using d’Alembert’s principle with a redundant set of generalized coordinates. It is assumed that the internal forces at point A must be evaluated as part of the solution process. The following generalized coordinates will be used: angles θ1 and θ2 and the position vector of point A with respect to point O, denoted r A . Since the system features two degrees of freedom only, two constraints must exist between these four generalized coordinates: C = r A − L1 e¯1 = 0. These two constraints will be enforced using Lagrange’s multiplier technique. The conſguration of the ſrst bar is deſned by the sole generalized coordinate θ1 , and hence, the difference between the variation of the potential energy and virtual work done by the inertial forces is identical to that given by eq. (8.12). The conſguration of the second bar now involves three generalized coordinates, the position of point A, r A , and the orientation of the second bar, θ2 . The potential of the gravity forces acting of the bar is V2 = m2 g(¯ıT2 r A − L2 C2 /2). The virtual work done by the inertial forces is obtained from eq. (8.8), using point A as the reference point now becomes L2 L2 I T 2 L2 ¨ ˙ δW2 = −δrA m2 aA + θ2ı3 a ¯ 1 − θ2 a ¯1 − δθ2 m2¯ıT3 a1 aA + I2A θ¨2 . 2 2 2 Although this expression is identical to that obtained earlier, see eq. (8.13), r A is now an independent, unconstrained generalized coordinate, which implies that δr A = L1 δθ1 e¯2 . According to Lagrange’s multiplier technique, the potential of the constraint forces is V c = μT (r A − L1 e¯1 ), where μ is the array of Lagrange’s multipliers used to enforce the two constraints. Variation of this potential is δV c = δμT (r A − L1 e¯1 ) + δr TA μ − δθ1 L1 μT e¯2 . D’Alembert’s principle now implies δ(V1 + V2 + V c ) − δ(W1I + W2I ) = 0 for all arbitrary variations δθ1 , δθ2 , and δr A , leading to the following equations of motion
10.1 D’Alembert’s principle for constrained systems
m1 L21 ¨ L1 θ1 + m1 g S1 − L1 e¯T2 μ = 0, 3 2 2 L1 L2 L L1 L2 ˙ 2 m L 2 2 2 C21 θ¨1 + θ¨2 + m2 g S2 + m2 θ S21 = 0, m2 2 3 2 2 1 m2 L 2 ¨ (θ2 a μ + m2 g¯ı2 + m2 L1 (θ¨1 e¯2 − θ˙12 e¯1 ) + ¯2 − θ˙22 a ¯1 ) = 0, 2
387
(10.1a) (10.1b) (10.1c)
respectively. Equation (10.1b), the coefſcient of the arbitrary variation δθ2 , is identical to the corresponding equation obtained by using kinematically admissible virtual displacements; it provides the ſrst equation of motion for the problem. Equation (10.1a) is identical to that obtained from Newton’s approach, eq. (8.10b), provided that Lagrange’s multipliers are interpreted as the internal force at the joint, μ = HA ¯ı1 + VA ¯ı2 , where HA and VA are the horizontal and vertical components of the internal force transmitted at point A, respectively. Finally, eq. (10.1c) is identical to that obtained from the Newtonian approach, see eq. (8.11a), and provides an expression for Lagrange’s multipliers, μ, which are the desired internal forces. Elimination of Lagrange’s multipliers from the ſrst and third equations yields the second equation of motion for the problem. This example demonstrates the versatility of d’Alembert’s principle coupled with Lagrange’s multiplier technique. Through a judicious choice of generalized coordinates and constraints, a set of equations involving the desired unknowns of the problem is generated. i3 = a3 i2 i1
O q1
L1
I O
i2 b2 = e2 Universal joint a1 = b 1
i1
g
g
A
e3 L2
q2 Fig. 10.1. Conſguration of the double pendulum system.
O’
e2 h e1
C
Points O and O’ are coincident. Rigid body
Fig. 10.2. Conſguration of the rigid body connected to a universal joint.
Example 10.2. The rigid body/universal joint system This example deal with a rigid body attached to the ground by means of a universal joint, see ſg. 10.2 and examples 7.5 and 8.3, on pages 266 and 302, respectively. Component k of the universal joint, see ſg. 7.10, is connected to the ground at point O by means of a bearing allowing rotation about axis ¯ı3 . Component is connected to a rigid body at point O . The orientation of the rigid body will be deſned by Euler
388
10 Constrained systems: classical formulations
angles, using the 3-1-2 sequence. A ſrst planar rotation about axis ¯ı3 , of magnitude φ, brings inertial basis I = (¯ı1 , ¯ı2 , ¯ı3 ) to A = (¯ a1 , a ¯2 , a ¯3 ), where a ¯1 is aligned with unit vector ¯b1 of the cruciform. This rotation is associated with a constant angular speed φ˙ = Ω, implying a ¯1 (t) = cos(Ωt)¯ı1 + sin(Ωt)¯ı2 . A second planar rotation about axis a ¯1 , of magnitude θ, brings basis A to B = (¯b1 , ¯b2 , ¯b3 ), where ¯b2 is the second unit vector aligned with the cruciform. Finally, a third planar rotation of magnitude ψ about axis ¯b2 bring basis B to E = (¯ e1 , e¯2 , e¯3 ) that is attached to the rigid body. The components of tensors in basis E will be denoted with the superscript (·)∗ . Points O and O are coincident. Find the equations of motion of the system using d’Alembert’s principle and the torque, Q(t), required to drive the system at a constant angular velocity Ω. This system features two degrees of freedom, but the conſguration of the system will be represented by three generalized coordinates, the three Euler angles, φ, θ, and ψ, deſned above. A rheonomic constraint, C = φ − Ωt, will be enforced using Lagrange’s multiplier technique by adding to the potential of the system the potential of the constraint force, V c = λ (φ − Ωt), where λ is Lagrange’s multiplier. Variation of this potential can be written as
δV c = δλ (φ − Ωt) + δφλ = δλ (φ − Ωt) + δφ, δθ, δψ λ¯ı1 ,
where ¯ıT1 = 1, 0, 0 . The term δφλ corresponds to the virtual work done by a torque, λ, undergoing a virtual rotation, δφ; hence, Lagrange’s multiplier λ can be interpreted as the torque required to enforce the constraint φ − Ωt = 0, i.e., λ is the driving torque. D’Alembert’s principle now states
δφ, δθ, δψ H ∗T mg η ∗ RT ¯ı3 + I O∗ ω˙ ∗ + ω ∗ I O∗ ω ∗
= δλ (φ − Ωt) + δφ, δθ, δψ λ¯ı1 , where the tangent operator, H ∗ , is deſned by eq. (4.80). Because Lagrange’s multiplier technique is used, φ, θ, ψ, and λ all are unconstrained variables and their variations arbitrary, leading to the following equations of motion: φ − Ωt = 0, the constraint to be enforced, and H ∗T mg η ∗ RT ¯ı3 + I O∗ ω˙ ∗ + ω ∗ I O∗ ω ∗ = λ¯ı1 . This matrix equation represents three independent scalar equations; the last two are identical to the equations of motion obtained above when using kinematically admissible virtual displacements, see eq. (8.15). The ſrst equation yields the desired torque as λ = h∗T mg η ∗ RT ¯ı3 + I O∗ ω˙ ∗ + ω ∗ I O∗ ω ∗ , where h∗ stores the ſrst column of H ∗ . Through the proper selection of the constraint and associated Lagrange’s multiplier, the system’s equations of motion are obtained together with the additional equation deſning the driving torque.
10.1 D’Alembert’s principle for constrained systems
389
Example 10.3. The rigid body/universal joint system The previous example focused on a rigid body attached to the ground by means of a universal joint, as depicted in ſg. 10.2. The equations of motion of the system were obtained, and one additional equation determining the driving torque was derived based on Lagrange’s formulation. For speciſc applications, the reaction forces acting at point O might also be important. Derive the equations of motion of the system and determine the reaction forces at point O. The reactions forces at point O do not appear in the formation presented in example 10.2 because the virtual displacement components at point O were selected to be kinematically admissible, i.e., virtual displacements vanish at point O, r O = 0, implying δr O = 0. Here again, Lagrange’s multiplier technique will be used to generate the equations required to evaluate these forces: the displacement components at point O, rO , are now considered to be generalized coordinates, and constraint C = r O = 0 is imposed. The potential of the constraints now becomes V c = λ(φ − Ωt) + μT rO , where μ is a set of Lagrange’s multipliers used to enforce the constraint rO = 0. Variation of this potential can be written as
δV c = δλ (φ − Ωt) + δφ, δθ, δψ λ¯ı1 + δμT rO + δr TO μ. The term δr TO μ corresponds to the virtual work done by a force, μ, undergoing a virtual displacement, δrO ; hence, the array of Lagrange’s multipliers, μ, can be interpreted as the force required to enforce the constraint r O = 0, i.e., μ is the reaction force vector at point O. The potential of the gravity forces must be updated to accommodate the new displacement ſeld, V = mg¯ıT3 (r O + η), and variation of this new potential is δV = δr TO mg ¯ı3 + δψ ∗T mg η ∗ RT ¯ı3 . D’Alembert’s principle now states ∗ ˙ + ω ∗ω ∗ )mη ∗ + δψ ∗T mg ∗ I O∗ ω ∗ η ∗ RT ¯ı3 + I O∗ ω˙ ∗ + ω δr TO R mgRT ¯ı3 + (ω
= δλ (φ − Ωt) + δφ, δθ, δψ λ¯ı1 + δμT r O + δrTO μ. Because Lagrange’s multiplier technique is used, the Euler angles, displacement rO , and multipliers λ and μ all are unconstrained variables and their variations arbitrary, leading to the following equations of motion: φ−Ωt = 0 and r O = 0, the constraints to be enforced, and ∗ μ = R mgRT ¯ı3 + (ω ˙ + ω ∗ω ∗ )mη ∗ , H ∗T mg η ∗ RT ¯ı3 + I O∗ ω˙ ∗ + ω ∗ I O∗ ω ∗ = λ¯ı1 . As expected, the last three equations are identical to those obtained earlier and the ſrst three equations yield the reaction force at point O. Clearly, the combination of d’Alembert’s principle and Lagrange’s multiplier technique provides a powerful approach to the analysis of constrained dynamical systems. Selecting various sets of generalized coordinates gives equations of motion involving the variables of interest. In particular, if the number of generalized
390
10 Constrained systems: classical formulations
coordinates equals the number of degrees of freedom of the system, a minimum set of equations is obtained from which reaction forces are completely eliminated. The equations required for the evaluation of the reaction forces can then be obtained by using Lagrange’s multiplier technique. 10.1.1 Problems Problem 10.1. The double pendulum Consider the double pendulum system depicted in ſg. 10.1. The ſrst bar is of length L1 , mass m1 , and is connected by hinges to the ground at point O and to the second bar at point A. The second bar is of length L2 and mass m2 . The bars have orientation angles θ1 and θ2 with respect to the vertical, respectively. I = (¯ı1 , ¯ı2 , ¯ı3 ) is an inertial basis, and bases a1 , a ¯2 , a ¯3 ) are attached to the ſrst and second bars, respectively. E = (¯ e1 , e¯2 , e¯3 ) and A = (¯ Use four generalized coordinates to represent the conſguration of the system: the angles θ1 and θ2 , and the position of point O, denoted r O . Enforce the two constraints C = rO = 0 using Lagrange’s multiplier technique. (1) Derive the equations of motion of the system using d’Alembert’s principle. (2) Prove that your equations are correct by comparing them to those obtained in example 10.1. (3) Give the physical interpretation of Lagrange’s multipliers. (4) On one graph, plot the time history of the angles θ1 and θ2 . (5) Plot the trajectories of the points at the tip of the ſrst and second bars. (6) Plot the angular velocities of the two bars. (7) Plot the horizontal and vertical components of the internal force at point A. (8) Plot the horizontal and vertical components of the reaction force at point O. (9) Plot the kinetic, potential, and total mechanical energies of the system. Comment on your results. Use the following data: m1 = 1.2 kg; m2 = 5 kg; L1 = 0.4 m; L2 = 0.6 m; acceleration of gravity g = 9.81 m/s2 . Present all results for a period of 10 s. The initial conditions are: θ1 (t = 0) = θ2 (t = 0) = π/2; θ˙1 (t = 0) = θ˙2 (t = 0) = 0.
Problem 10.2. Crank-slider mechanism The crank-slider mechanism depicted in ſg. 9.8 consists of a uniform crank of length L1 and mass m1 connected to the ground at point O; let θ be the angle from the horizontal to the crank. At point B, the crank connects to a uniform linkage of length L2 and mass m2 that slides along point P, a ſxed point in space, located at a distance d from point O. Let w denote the distance from point B to point P and φ the angle from the horizontal to link BP. The system is represented by three generalized coordinates: θ, φ, and w. (1) Derive the equations of motion of the system using d’Alembert’s principle. (2) Give the physical interpretation of Lagrange’s multipliers. (3) Find the single equation of motion of the system expressed in terms of a single degree of freedom, θ.
Problem 10.3. The two-bar linkage with slider system The two-bar linkage with slider system shown in ſg. 8.4 is a planar mechanism. It consists of a uniform crank of length L1 and mass m1 connected to the ground at point O; let θ be the angle from the horizontal to the crank. At point B, the crank slides over a uniform linkage of length L2 and mass m2 that is connected to the ground at point A. Let w denote the distance from point B to point A and φ the angle from the horizontal to link BA. (1) Using three generalized coordinates, θ, φ, and w, derive the equations of motion of the system using d’Alembert’s principle. (2) Give the physical interpretation of Lagrange’s multipliers. (3) Find the single equation of motion of the system expressed in terms of a generalized coordinate, θ.
10.1 D’Alembert’s principle for constrained systems
391
Problem 10.4. Rigid body attached to universal joint Figure 10.2 depicts a rigid body attached to the ground by means of a universal joint. Component k of the universal joint, see ſg. 7.10, is connected to the ground at point O by means of a bearing allowing rotation about axis ¯ı3 . Component is connected to a rigid body at point O . The orientation of the rigid body will be deſned by Euler angles, using the 3-1-2 sequence. A ſrst planar rotation about axis ¯ı3 , of magnitude φ, brings inertial basis I = (¯ı1 , ¯ı2 , ¯ı3 ) to ¯2 , a ¯3 ), where a ¯1 is aligned with unit vector ¯b1 of the cruciform. This rotation is A = (¯ a1 , a associated with a constant angular speed φ˙ = Ω, implying a ¯1 (t) = cos(Ωt)¯ı1 − sin(Ωt)¯ı3 . A second planar rotation about axis a ¯1 , of magnitude θ, brings basis A to B = (¯b1 , ¯b2 , ¯b3 ), where ¯b2 is the second unit vector aligned with the cruciform. Finally, a third planar rotation of magnitude ψ about axis ¯b2 bring basis B to E = (¯ e1 , e¯2 , e¯3 ) that is attached to the rigid body. The components of tensors in basis E will be denoted with the superscript (·)∗ . Points O and O are coincident. For all questions, use d’Alembert’s principle and Lagrange’s multiplier technique, when necessary. (1) Find the equations of motion of the system. (2) On one graph, ˙ (4) On one graph, plot plot the time history of angles θ and ψ. (3) On one graph, plot θ˙ and ψ. the three components of the unit vector e¯1 in basis I. (5) Same question for unit vectors e¯2 and e¯3 . (6) Plot the trajectory of the center of mass of the rigid body in three-dimensional space. (7) On one graph, plot the three components of the angular velocity vector of the rigid body in the body attached basis E . (8) Plot the components of the same vector in the inertial basis I. (9) Plot the history of the driving torque required to maintain the constant angular velocity Ω. (10) Compute the cumulative work W done by the driving torque. (11) On one graph, plot the kinetic energy of the system, its potential energy and the cumulative work W . Will a combination of these quantities remain constant? (12) Plot the components of the reaction force at point O in basis I. Use the following data: mass of the body m = 2.8 kg; principal mass moments of inertia about the center of mass I1∗C = 1.1; I2∗C = 0.6; I3∗C = 0.9 of therelative position vector of the center of mass with respect to point kg·m2 ; components O, η ∗T = 0.1, −0.4, 0.3 m; acceleration of gravity g = 9.81 m/s2 ; angular velocity Ω = 2 rad/s. Present all results in a non-dimensional manner; use the reference mass mr = m, reference length r = η ∗ and reference time tr = 1/Ω. At the initial time, the principal axes of inertia are aligned with the inertial system and the body is at rest. Present the response of the system over a non-dimensional period of 8π.
Problem 10.5. Particle in a circular slot with guiding arm A particle of mass M slides along a circular slot of radius R, as shown in ſg. 8.15. The particle also slides in a rectilinear slot in an arm of mass m and length L. The arm is pivoted to the ground at point O and is restrained by a torsional spring of stiffness constant k and a dashpot of constant c. The spring is un-stretched when the arm is horizontal. A viscous friction force, F f = −μw˙ is acting at the interface between the particle and the arm. (1) Using three generalized coordinates, x, y, the coordinates of mass M , and w, the position of the particle along the arm, derive the equations of motion of the system using d’Alembert’s principle. (2) Give the physical interpretation of Lagrange’s multipliers.
Problem 10.6. The crank piston mechanism The crank slider mechanism depicted in ſg. 10.4 comprises a bar of length L1 and mass m1 connected to the ground at point O by means of a hinge. At point A, a hinge connects the ſrst bar to a second bar of length L2 and mass m2 . A slider of mass M , that is constrained to move in the horizontal direction, is connected to this second bar. A spring of stiffness constant k connects the slider to the ground and is un-stretched when the two bars are aligned. This system will be represented with three generalized coordinates: x and y, the coordinates of point A and z, the horizontal position of point B. (1) Write the constraint equations for this
392
10 Constrained systems: classical formulations
i2 (x, y) w
i2
m, L
O k, c
M i1
I
L1 m1
O d
R
L2 A(x,y) m2 i1 B(z,0)
M
k
C Fig. 10.3. Particle in a circular slot with guiding arm.
Fig. 10.4. Crank piston mechanism with spring.
problem. (2) Use d’Alembert’s principle to derive the equation of motion of the system. (3) Give the physical interpretation of Lagrange’s multipliers.
10.2 Hamilton’s principle and Lagrange’s formulation with holonomic constraints As discussed in section 9.2, mechanical systems are often subjected to constraints which fall into two broad categories: holonomic and nonholonomic constraints. Furthermore, constraints can be scleronomic, if they are not an explicit function of time, or rheonomic, in the opposite case. Systems subjected to holonomic constraints are treated in this section, and those subjected to nonholonomic constraints are treated in section 10.3. The most general type of constraints to be considered in this section are in the form of eq. (9.38), C(q, t) = 0, i.e., rheonomic constraints. If time does not appear explicitly in the constraint, it is scleronomic, C(q) = 0. Constraints limit the allowable virtual displacement in such a way that δC = B(q, t)δq = 0.
(10.2)
Because a virtual displacement is an arbitrary change in displacement at a given, ſxed instant, this expression does not involve the term b(q, t) that appears in the differential or Pfafſan form of the constraints. Hamilton’s principle, eq. (8.20), will be written in the following form, % tf (δL + δW nc ) dt = 0, (10.3) ti
for all arbitrary virtual displacements. The boundary terms at the initial and ſnal times have been ignored. For constrained systems, virtual displacements are not arbitrary because they must satisfy the constraints as expressed by eq. (10.2). A linear combination of eqs. (10.3) and (10.2) now yields
10.2 Hamilton’s principle and Lagrange’s formulation with holonomic constraints
%
tf
* + δL + λT δC + δλT C + δW nc dt = 0,
393
(10.4)
ti
where λ is an array of arbitrary Lagrange’s multipliers. The term δλT C was added to the equation. Indeed, because the constraints must be satisſed, this term vanishes, and hence, can be added to the statement of Hamilton’s principle. A reasoning similar to that developed in section 9.1 leads to the conclusion that the constrained problem expressed by eq. (10.3) is now replaced by an unconstrained problem, eq. (10.4), in which variations of the generalized coordinates and Lagrange’s multipliers are unconstrained. The second term in the integrand of eq. (10.4) affords an important physical interpretation. Indeed, in view of eq. (10.2), λT δC = δq T B T (q, t)λ = δq T F c , where F c = B T (q, t)λ,
(10.5)
are the generalized forces of constraint, i.e., the forces that must be applied on the system in the conſguration space to guarantee the satisfaction of the constraints. It then follows that λT δC = δq T F c can be interpreted as the virtual work done by the forces of constraint. The differential work done by these forces is expressed as dW c = dq T F c = dq T B T (q, t)λ = −λT b(q, t)dt, where the last equality was obtained with the help of eq. (9.40). When dealing with a scleronomic constraint, b(q, t) = 0 and the differential work done by the forces constraint vanishes. This contrasts with rheonomic constraints: in this case, the differential work does not necessarily vanish. 10.2.1 Hamilton’s principle with holonomic constraints Holonomic constraints are considered here. In this case, the constraints are integrable and can be written C(q, t) = 0. It then follows that λT δC + δλT C = δ(λT C) and δL + λT δC + δλT C = δ(K − V ) + δ(λT C) = δK − δ(V − λT C). The potential of the constraint forces is now deſned as V c = −λT C.
(10.6)
This quantity is indeed the “potential of the constraint forces” because constraint forces are derived from this potential, see eq. (7.21), ∂V c ∂C T =− λ = −B T (q, t)λ = −F c . ∂q ∂q
(10.7)
It is convenient to introduce the augmented potential of the system, deſned as the sum of the potential of the conservative forces acting on the system and of the potential of the constraint forces V + = V + V c.
(10.8)
394
10 Constrained systems: classical formulations
Finally, the augmented Lagrangian of the system is deſned as L+ = L − V c ,
(10.9)
and clearly, δL + λT δC + δλT C = δ(L − V c ) = δL+ . In the presence of holonomic constraints, Hamilton’s principle, eq. (10.4), now becomes % tf ! + " δL + δW nc dt = 0, (10.10) ti
for all arbitrary variations in generalized coordinates and Lagrange’s multipliers. Both generalized coordinates and Lagrange’s multipliers are unconstrained variables in this principle. This principle is identical to that derived for unconstrained system, except that the Lagrangian has been replaced by the augmented Lagrangian and Lagrange’s multipliers are additional, unconstrained variables. 10.2.2 Lagrange’s formulation with holonomic constraints As was done in section 8.3 for unconstrained systems, Lagrange’s formulation for systems with holonomic constraints will be derived from Hamilton’s principle. After the integration by parts expressed by eq. (8.55), Hamilton’s principle, eq. (10.10), becomes % tf % tf d ∂L ∂L + δq T − + F c + Qnc dt + δλT [C] dt = 0, (10.11) dt ∂ q˙ ∂q ti ti for all arbitrary variations δq and δλ. Consequently, the bracketed terms must vanish, revealing Lagrange’s equations of motion for systems subjected to holonomic constraints d ∂L ∂L − = B T (q, t)λ + Qnc , and C(q, t) = 0. (10.12) dt ∂ q˙ ∂q Here again, it is convenient to introduce the generalized momenta, see eq. (8.25), to simplify the writing of the equations of motion that become ∂V ∂K p˙ − + = B T (q, t)λ + Qnc , and C(q, t) = 0. (10.13) ∂q ∂q The physical interpretation of Lagrange’s equations for systems subjected to holonomic constraints is revealed by recasting eq. (10.13) as QI + F c + Qc + Qnc = 0, where the generalized inertial forces, QI , are given by eq. (8.58). Here again, Lagrange’s equations are a statement of dynamic equilibrium: the sum of the inertial forces and all external forces applied on the system, including the forces of constraint, must vanish for dynamic equilibrium conditions to be satisſed. Although Lagrange’s equations of motion have the same physical meaning when dealing with both unconstrained and constrained systems, a marked difference is
10.2 Hamilton’s principle and Lagrange’s formulation with holonomic constraints
395
observed in the mathematical nature of the equations in these two cases. For unconstrained systems, Lagrange’s equations of motions, eqs. (8.57), form a set of n second-order, ordinary differential-equations in time, or second-order ODE’s. Indeed, second-order derivatives of the generalized coordinates are implied by the structure of Lagrange’s equations. If the system is subjected to holonomic constraints, second-order derivatives of the generalized coordinates will be present as well, but Lagrange’s multipliers appear in the equations as algebraic variables, i.e., as undifferentiated variables. Equation that feature this mixed differential-algebraic nature are called differential-algebraic equations or DAE’s. Lagrange’s equations form a set of n + m DAE’s. DAE’s are typically more difſcult to solve than ODE’s; while methods for the numerical solution of ODE’s are well developed, the solution of DAE’s is still a challenging task. Example 10.4. The simple pendulum Figure 10.5 depicts a simple pendulum of length L featuring a bob of mass m. This single degree of freedom system will be represented using two generalized coordinates, the Cartesian coordinates of the bob, r = q1¯ı1 + q2¯ı2 . A single holonomic constraint must be enforced, C = 1/2 (r T r − L2 ) = 0. This constraint enforces the constant length condition for the pendulum; the constraint matrix is B T (q) = r. The Lagrangian of the system is easily evaluated as L = m r˙ T r/2−mg ˙ ¯ıT1 r. The potential of the constraint forces, eq. (10.6), is V c = Cλ = 1/2 (r T r−L2 )λ; a single Lagrange multiplier is used here to enforce the single constraint. The generalized momenta of the system and partial derivatives of the Lagrangian with respect to the generalized coordinates are p=
∂L = m˙r , ∂ r˙
and
∂L = −mg ¯ı1 , ∂r
respectively. Application of Lagrange’s formulation for systems with holonomic constraints, eqs. (10.13), leads to m¨r + mg¯ı1 − λr = 0, and 1/2 (r T r − L2 ) = 0. These three equations can be used to solve for the three unknowns of the problem: q1 , q2 , and λ. Note the differential-algebraic nature of the equations: terms in q¨1 and q¨2 appear, but λ is not differentiated. Of course, these equations are equivalent to the single equation of motion that would have been obtained had the single generalized coordinate θ been used to represent the conſguration of the system. Indeed, the position vector can be written in polar coordinates as r T = L Sθ , Cθ , where Sθ = sin θ and Cθ = cos θ; the constraint is then automatically satisſed, as expected. The remaining equation of motion then yields two scalar equations: θ¨ + g/L Sθ = 0 and λL = −mgCθ − mLθ˙ 2 . Application of Newton’s law to the free body diagram sketched on ſg. 10.5 yields two equations θ¨ + g/L Sθ = 0 and T = mgCθ + mLθ˙2 , where T is the tension in the string that can be interpreted as the constraint force, i.e., the force that maintains the constant length of the pendulum. The equations of motion are identical, and T = −λL. As expected, Lagrange’s multiplier is closely related to, although not identical, to the force of constraint; indeed, λ = −T /L. In the constrained formulation, the force of constraint is given by
396
10 Constrained systems: classical formulations
eq. (10.5) as F c = B T (q)λ = λr = λL e¯1 . This development explains the difference in sign: T , the tension in the string was chosen to act along axis −¯ e1 , but the constraint force F c acts along axis +¯ e1 . This development also explains the presence of factor L: the tension in the string, T , has units of force, but Lagrange’s multiplier has units of force over length because F c = |λL|. The physical meaning of Lagrange’s multipliers must be clearly identiſed to ease the interpretation of the equations of motion generated by Lagrange’s formulation. The deſnition of the constraint force, F c = B T (q, t)λ, provides the physical interpretation of the multiplier. The constraint force, F c , is a generalized force of constraint, i.e., a force acting in the conſguration space. The meaning of the multiplier depends on the speciſc manner in which the constraint was written. For instance, writing the constraint as C = (L2 − rT r) = 0 results in λ = T /(2L).
A M T
k i2
O q
La
L g
e2
i1
q
mc S
T e2 mg q e1
Fig. 10.5. Conſguration of the simple pendulum.
d a2
ma
Lc
R
m
a1
b
La
i2 I O
f
e1 i1
Fig. 10.6. Conſguration of the quick return mechanism.
Example 10.5. The quick return mechanism The quick return mechanism shown in ſg. 10.6 consists of a uniform crank of length Lc and mass mc , and of a uniform arm of length La and mass ma . The crank is pinned at point R and the arm at point O; the distance between these two points is denoted d. At point S, a slider allows the tip of the crank to slide along the arm. A mass M is attached at point T, the tip of the arm. A spring of stiffness constant k connects the tip of the arm, point T, to ſxed point A; the spring is un-stretched when the arm is in the vertical position. This problem was treated using Lagrange’s formulation in example 8.15 on page 329. The generalized coordinates of the problem are selected to be the angular positions of the two bars, denoted θ and φ. This problem was treated in example 8.15 on page 329 using Lagrange’s formulation with a single generalized coordinate, θ. Use
10.2 Hamilton’s principle and Lagrange’s formulation with holonomic constraints
397
Lagrange’s formulation for constrained systems to derive the equations of motion of the system. Considering triangle ORS, it is clear that β = φ − θ, and the law of sines then yields Lc sin(φ − θ) = d sin φ. This equation expresses the kinematic constraint between the two generalized coordinates of this single degree of freedom problem, C = d sin φ − Lc sin(φ − θ) = 0. The constraint matrix now becomes B = Lc Cφ−θ dCφ − Lc Cφ−θ , (10.14) where the following notation was used: Cφ−θ = cos(φ − θ), and Cφ = cos φ. The system’s kinetic energy is K = [mc L2c θ˙2 /3 + (M + ma /3)L2a φ˙ 2 ]/2, where the ſrst term represents the kinetic energy of the crank and the second that of the arm. The potential energy of the spring is V = 1/2 kΔ2 , where Δ is the stretch of the spring. The law of cosines applied to triangle OMA yields Δ2 = L2a + L2a − 2L2a cos(π − φ) = 2L2a (1 + Cφ ), and the potential of the elastic spring is V = kL2a (1 + Cφ ). Finally, the potential of the single constraint of this problem is V c = [dSφ − Lc Sφ−θ ] λ, where λ is Lagrange’s multiplier used to enforce the constraint, Sφ−θ = sin(φ − θ), and Sφ = sin φ. The Lagrangian is L = K − V , and the system’s generalized momenta become pθ˙ =
∂L mc L2c ˙ θ, = ˙ 3 ∂θ
pφ˙ =
∂L ma 2 ˙ )La φ. = (M + ˙ 3 ∂φ
The derivatives of the Lagrangian with respect to the generalized coordinates are ∂L = 0, ∂θ
∂L = kL2a Sφ . ∂φ
Lagrange’s formulation for constrained systems then yields the equations of motion of the system, mc L2c /3 θ¨ Lc Cφ−θ 0 0 − λ = . (10.15) kL2a Sφ 0 (M + ma /3)L2a φ¨ dCφ − Lc Cφ−θ The equations of motion for this single degree of freedom problem now take the form of three differential-algebraic equations. In the two equations given above, Lagrange’s multiplier, λ, is an algebraic variable, but second time derivatives of the generalized coordinates, θ and φ, appear. The third equation is the holonomic constraint equation, d sin φ − Lc sin(φ − θ) = 0, which is an algebraic equation. It is interesting to compare the equation of motion obtained from Lagrange’s formulation using a single generalized coordinate, eq. (8.61), to those obtained in the present development. Equation (8.61) is a single, ordinary differential equation for the single generalized coordinate, θ. In contrast, when using two generalized coordinates, the equations of motion take the form of three coupled differential-algebraic equations, because an additional variable, Lagrange’s multiplier, was added to enforce the constraint.
398
10 Constrained systems: classical formulations
In the absence constraints, Lagrange’s formulation leads to a single equation of motion that is far more complex than those obtained in the constrained formulation, and involves a much higher level of nonlinearity. On the other hand, the constrained formulation leads to a higher number of equations, but these equations are easier to derive and present a lower level of nonlinearity. This ease of derivation of the equations of motion for constrained system is one of the major attractions of constrained formulations. It is always instructive to provide a physical interpretation of Lagrange’s mul¨ = Lc Cφ−θ λ, which tipliers. The ſrst equation of system (10.15) reads mc L2c θ/3 corresponds to the pivot equation written for the crank about point R. The term on the right-hand side of the equation corresponds to the moment of the normal force, λ, the crank applies on the arm. It is left to the reader to verify that the second equation of system (10.15) can be interpreted as the pivot equation written for the arm about point O, and leads to an identical interpretation of the physical meaning of Lagrange’s multiplier as the normal interaction force between the crank and the arm. 10.2.3 Problems Problem 10.7. The 12 generalized coordinates rigid body The conſguration of a rigid body can be deſned by 12 generalized coordinates: the position vector, u (3 coordinates), of its reference point O, and the three vectors, e1 , e2 , and e3 (3 coordinates each), deſning its orientation. Clearly, this set of coordinates is 6 times redundant and hence, 6 constraints must be added to the problem: three normality constraints eT1 e1 = eT2 e2 = eT3 e3 = 1, and three orthogonality constraints eT2e3 = eT1 e3 = eT1 e2 = 0. Let array q store the generalized coordinates of the problem, q T = uT , eT1 , eT2 , eT3 . (1) Show that the ˙ where M ∗ is a 12 × kinetic energy of the rigid body can be written as K = 1/2 q˙ T M ∗ q, 12 mass matrix. (2) If f and m are the force and moment vectors applied to the rigid body at point O, show that the virtual work done by these forces is δW = δqT F , where F is a 12 × 1 loading array. (3) Write the governing equations of motion for the rigid body.
Problem 10.8. The 9 generalized coordinates rigid body Read the paper by Garc´Æa de Jal´on et al. [28] describing the concept of natural coordinates. Consider a rigid body described by 9 generalized coordinates: the position vector, u (3 coordinates), of its reference point O, and two unit vectors, e1 and e2 (3 coordinates each), deſning its orientation. Let array q store the generalized coordinates of the problem, q T = uT , eT1 , eT2 . (1) Deſne the constraints associated with this representation. (2) Evaluate the kinetic energy of the rigid body based on these generalized coordinates. (3) If f and m are the force and moment vectors applied to the rigid body at point O, ſnd the associated generalized forces, F , such that δW = δq T F . (4) Write the governing equations of motion for the rigid body.
Problem 10.9. The crank piston mechanism The crank slider mechanism depicted in ſg. 10.7 comprises a bar of length L1 and mass m1 connected to the ground at point O by means of a hinge. The orientation of is bar with respect to the horizontal is denoted φ. At point A, a hinge connects the ſrst bar to a second bar of length L2 and mass m2 . A slider of mass M , that is constrained to move in the horizontal direction, is connected to this second bar. A spring of stiffness constant k connects the slider
10.2 Hamilton’s principle and Lagrange’s formulation with holonomic constraints
399
to the ground and is un-stretched when φ = 0. Clearly, this system features a single degree of freedom. Two sets of generalized coordinates will be used to represent the system. For representation 1, a single generalized coordinate, φ, is used. For representation 2, seven generalized coordinates are used: u1 , v1 and φ1 , respectively the horizontal and vertical position of the center of mass of the ſrst bar, and its orientation with respect to the horizontal; u2 , v2 and φ2 , the corresponding quantities for the second bar; and x, the displacement of the slider in the horizontal direction. The spring is un-stretched when x = 0. (1) With the help of Lagrange’s formulation, derive the equation of motion of the system using representation 1. (2) Using representation 2, derive the equations of motion for the constrained system. (3) Discuss the relative merits of the two representations.
i2
L1 m1
O
L2 f
A
i2 A
m2 i1
L2, m2
B
M
k
Fig. 10.7. Crank piston mechanism with spring.
O
q
L1, m1
M f
C P(x(h), y(h))
k h R
i1
Fig. 10.8. Two bar linkage tracking a curve.
Problem 10.10. Two bar linkage tracking a curve Figure 10.8 depicts a planar two bar linkage tracking curve C. The ſrst bar, of length L1 and mass m1 , is connected to the ground at point O. The second bar, of length L2 and mass m2 , connects to the ſrst bar at point A and tracks curve C at point P. A concentrated mass, M , is located at point P and an elastic spring of stiffness constant constant k connects this mass to point R. Curve C is described by its coordinates x(η) and y(η), where η deſnes an arbitrary parametrization of the curve. This system will be represented by three generalized coordinates: angles θ and φ, as deſned on the ſgure, and η, the parameter along curve C. (1) Derive the constraints among the three generalized coordinates and the constraint matrix. (2) Use Lagrange’s formulation for constrained systems to derive the equations of motion of the system. (3) Discuss the physical nature of Lagrange’s multipliers.
Problem 10.11. Crank-slider mechanism The crank-slider mechanism depicted in ſg. 9.8 consists of a uniform crank of length L1 and mass m1 connected to the ground at point O; let θ be the angle from the horizontal to the crank. At point B, the crank connects to a uniform linkage of length L2 and mass m2 that slides along point P, a ſxed point in space, located at a distance d from point O. Let w denote the distance from point B to point P and φ the angle from the horizontal to link BP. The system is represented by three generalized coordinates: θ, φ, and w. (1) Derive the constraints among the three generalized coordinates and the constraint matrix. (2) Use Lagrange’s formulation for constrained systems to derive the equations of motion of the system. (3) Discuss the physical nature of Lagrange’s multipliers.
Problem 10.12. The spatial mechanism The spatial mechanism depicted in ſg. 10.9 consists of a crank of length Lc and mass mc attached to the ground at point A and rotating about axis ¯ı1 ; the crank moves in plane (¯ı2 , ¯ı3 ). A rigid arm of length La and mass ma connects point P, at the tip of the crank, to point Q that
400
10 Constrained systems: classical formulations
is free to slide along axis ¯ı1 . The slider at point Q is of mass M . The generalized coordinates of the problem are y and z, deſning the position of point P and x, deſning the position of point Q. (1) Derive the constraints among the three generalized coordinates and the constraint matrix. (2) Use Lagrange’s formulation for constrained systems to derive the equations of motion of the system. (3) Discuss the physical nature of Lagrange’s multipliers.
i3
P(0, y, z)
Lc, mc k
i2 A h
x
L q
O
La, ma
O
i1
g
L
y
m
i2 x(t)
M
Q (x, 0, 0) M
i1
Fig. 10.9. Conſguration of the spatial mechanism.
Fig. 10.10. Homogeneous bar sliding on guides at both ends.
Problem 10.13. Bar sliding on guides Figure 10.10 depicts a homogeneous bar of length L and mass m sliding on two guides at its end points. At the left end, the bar is connected to a spring of stiffness constant k that is un-stretched when the bar is horizontal. At the right end, the bar is connected to a point mass M . Gravity acts along axis ¯ı2 . This single degree of freedom system will be represented using three generalized coordinates: x, y, and θ. (1) Derive the constraints among the three generalized coordinates and the constraint matrix. (2) Use Lagrange’s formulation for constrained systems to derive the equations of motion of the system. (3) Discuss the physical nature of Lagrange’s multipliers.
Problem 10.14. Bar sliding on guides Repeat the previous problem but use two generalized coordinates only, x and y.
Problem 10.15. The spatial mechanism Read the paper by Garc´Æa de Jal´on et al. [28] describing the concept of natural coordinates. The spatial mechanism depicted in ſg. 10.9 consists of a crank of length Lc and mass mc attached to the ground at point A and rotating about axis ¯ı1 ; the crank moves in plane (¯ı2 , ¯ı3 ). A rigid arm of length La and mass ma connects point P, at the tip of the crank, to point Q that is free to slide along axis ¯ı1 . The slider at point Q is of mass M . This mechanism will be described by the following 12 generalized coordinates: unit vector n ¯ 1 along segment AP, ¯2 the position vector, r P , of point P, the position vector, r Q , of point Q, and unit vector n normal to segment PQ. When bar PQ is in plane (¯ı1 , ¯ı3 ), unit vector n ¯ 2 lies in the same plane. (1) Derive the constraints among the 12 generalized coordinates and the constraint matrix. (2) Use Lagrange’s formulation for constrained systems to derive the equations of motion of the system. (3) Discuss the physical nature of Lagrange’s multipliers.
10.2 Hamilton’s principle and Lagrange’s formulation with holonomic constraints
401
Problem 10.16. The two-bar linkage with slider system The two-bar linkage with slider system shown in ſg. 8.4 is a planar mechanism. It consists of a uniform crank of length L1 and mass m1 connected to the ground at point O; let θ be the angle from the horizontal to the crank. At point B, the crank slides over a uniform linkage of length L2 and mass m2 that is connected to the ground at point A. Let w denote the distance from point B to point A and φ the angle from the horizontal to link BA. (1) Derive the constraints among the three generalized coordinates and the constraint matrix. (2) Use Lagrange’s formulation for constrained systems to derive the equations of motion of the system. (3) Discuss the physical nature of Lagrange’s multipliers.
Problem 10.17. Pendulum mounted on a cart Figure 8.5 shows a pendulum of length L and mass m mounted on a cart of mass M that is connected to the ground by means of a spring of stiffness constant k and of a dashpot of constant c. The displacement of the cart is denoted x which is also the stretch of the spring, and θ measures the angular deƀection of the pendulum with respect to the vertical. Gravity acts on the system as indicated in ſg. 8.5. (1) Based on Hamilton’s principle, derive the equations of motion of the system using the following generalized coordinates: x, θ, and r A , the position of point A. Use Lagrange’s multiplier technique to enforce the kinematic constraint C = r A − x¯ı2 = 0. (2) Interpret Lagrange’s multipliers in physical terms. (3) Plot the time history of the cart displacement, x. (4) Plot the history of angle θ. (5) Plot the trajectory of the point at the tip of the pendulum. (6) Plot the cart velocity, x. ˙ (7) Plot the angular velocity of the ˙ (8) Plot the system kinetic and potential energies and the energy dissipated in pendulum, θ. the damper. Check the energy closure equation. (9) Plot the components of the internal force at point A. Use the following data: M = 5 kg; m = 2 kg; L = 0.4 m; k = 10 N/m; acceleration of gravity g = 9.81 m/s2 ; c = 0.5 N.s/m. Present all your results for a period of 10 s. Initial condition are at rest with x(t = 0) = 0.2 m and θ(t = 0) = π.
Problem 10.18. Particle in a circular slot with guiding arm A particle of mass M slides along a circular slot of radius R, as shown in ſg. 8.15. The particle also slides in a rectilinear slot in an arm of mass m and length L. The arm is pivoted to the ground at point O and is restrained by a torsional spring of stiffness constant k and a dashpot of constant c. The spring is un-stretched when the arm is horizontal. A viscous friction force, F f = −μw˙ is acting at the interface between the particle and the arm. (1) Using three generalized coordinates, x, y, the coordinates of mass M , and w, the position of the particle along the arm, derive the equations of motion of the system using Lagrange’s formulation for constrained systems. (2) Give the physical interpretation of Lagrange’s multipliers.
Problem 10.19. The crank piston mechanism The crank slider mechanism depicted in ſg. 10.4 comprises a bar of length L1 and mass m1 connected to the ground at point O by means of a hinge. At point A, a hinge connects the ſrst bar to a second bar of length L2 and mass m2 . A slider of mass M , that is constrained to move in the horizontal direction, is connected to this second bar. A spring of stiffness constant k connects the slider to the ground and is un-stretched when the two bars are aligned. This system will be represented with three generalized coordinates: x and y, the coordinates of point A and z, the horizontal position of point B. (1) Write the constraint equations for this problem. (2) Use Lagrange’s formulation for constrained systems to derive the equations of motion of the system. (3) Give the physical interpretation of Lagrange’s multipliers.
Problem 10.20. Spinning arm Figure 8.24 depicts a shaft of height h ſxed at point O and free to rotate about axis ¯ı3 . An arm of length d, rigidly attached to the shaft at point A, rotates in the horizontal plane.
402
10 Constrained systems: classical formulations
A homogeneous bar of length L and mass m is connected to the arm at point B with a torsional spring of stiffness constant k. Gravity acts as indicated on the ſgure and the apa1 , a ¯2 , a ¯3 )] is attached to the arm and frame plied torque Q = 0. FrameF A = [A, A = (¯ B F = B, B = (¯b1 , ¯b2 , ¯b3 ) is attached to the bar. A planar rotation of magnitude α about axis ¯ı3 brings basis I to A. A planar rotation of magnitude β about axis a ¯2 brings basis A to B; the torsional spring is un-stretched when β = β0 . (1) Use Lagrange’s formulation to derive the equations of motion of the system. Use two generalized coordinates, α and β. (2) Assume that the shaft is rotating at a constant angular velocity, α˙ = Ω and impose this condition through a rheonomic constraint. (3) Use Lagrange’s formulation for constrained systems to derive the equations of motion of the problem. (4) What is the physical meaning of Lagrange’s multiplier. (5) Assume now instead that the bar is rotating at a constant angular velocity, β˙ = ω and impose this condition through a rheonomic constraint. (6) Use Lagrange’s formulation for constrained systems to derive the equations of motion of the problem. (7) What is the physical meaning of Lagrange’s multiplier.
10.3 Hamilton’s principle and Lagrange’s formulation with nonholonomic constraints Systems subjected to holonomic constraints were studied in section 10.2. In the present section, attention turns to systems subjected to nonholonomic constraints. The most general type of constraint to be considered here are linear functions of the generalized velocities, as expressed by eq. (9.47), D = B(q, t)˙q + b(q, t) = 0. If time appears explicitly in the expression of the constraint, it is rheonomic, otherwise it is scleronomic. Nonholonomic constraints limit the allowable virtual displacement as expressed by eq. (10.2). Because a virtual displacement is an arbitrary change in displacement at a given, ſxed instant, this expression does not involve the term b(q, t) that appears in the differential or Pfafſan form of the constraints. Hamilton’s principle, eq. (8.20), will be written in the following form, % tf (δL + δW nc ) dt = 0, (10.16) ti
for all arbitrary virtual displacements. The boundary terms at the initial and ſnal times have been ignored. For constrained systems, virtual displacements are not arbitrary because they must satisfy the constraints as expressed by eq. (10.2). A linear combination of eqs. (10.16) and (10.2) now yields %
tf
* + δL + λT B(q, t)δq + δλT D + δW nc dt = 0,
(10.17)
ti
where λ is an array of arbitrary Lagrange’s multipliers. The term δλT D was added to the equation. Indeed, because the constraints must be satisſed, this term vanishes, and hence, can be added to the statement of Hamilton’s principle. A reasoning similar to that developed in section 9.1 will lead to the conclusion that the constrained
10.3 Hamilton’s principle and Lagrange’s formulation with nonholonomic constraints
403
problem expressed by eq. (10.16) is now replaced by an unconstrained problem, eq. (10.17), in which variations of the generalized coordinates and Lagrange’s multipliers are unconstrained. 10.3.1 Hamilton’s principle with nonholonomic constraints In the presence of nonholonomic constraints, Hamilton’s principle takes the form of eq. (10.17). The second term in the integrand of this equation affords an important physical interpretation: δq T B T (q, t)λ = δq T F c , where the generalized forces of constraint, F c , are deſned by eq. (10.5). Clearly, this term can be interpreted as the virtual work done by the constraint forces, δW c = δq T F c = δq T B(q, t)λ. This expression underlines the fundamental difference between holonomic and nonholonomic constraints: for holonomic constraints, the constraint forces can be derived from the potential of the constraint forces, see eq. (10.7), whereas for nonholonomic constraints the virtual work done by the forces of constraint is non integrable, i.e., there exist no potential of the constraint forces. 10.3.2 Lagrange’s formulation with nonholonomic constraints Because both generalized coordinates and Lagrange’s multipliers are unconstrained variables in eq. (10.17), Lagrange’s equations of motion for systems subjected to nonholonomic constraints are obtained from this principle in a manner similar to that presented in section 8.3. After the integration by parts expressed by eq. (8.55), Hamilton’s principle, eq. (10.17), becomes % tf % tf d ∂L ∂L + δq T − + F c + Qnc dt + δλT [D] dt = 0, (10.18) dt ∂ q˙ ∂q ti ti for all arbitrary variations in generalized coordinates and Lagrange’s multipliers. Consequently, the bracketed terms must vanish, revealing Lagrange’s equations of motion for systems subjected to nonholonomic constraints d ∂L ∂L − = B T (q, t)λ + Qnc , and D(q, q˙ , t) = 0. (10.19) dt ∂ q˙ ∂q Here again, it is convenient to introduce the generalized momenta, see eq. (8.25), to simplify the writing of the equations of motion that become ∂V ∂K p˙ − + = B T (q, t)λ + Qnc , and D(q, q˙ , t) = 0. (10.20) ∂q ∂q These equations are identical to those obtained for systems subjected to holonomic constraints, eq. (10.12); the only difference is the form of the constraint to be enforced. The derivation of Lagrange’s equations of motion for constrained systems clearly underlines the differences between Newtonian and Lagrangian dynamics. In Newtonian mechanics, the focus is on forces and acceleration vectors for each particle of
404
10 Constrained systems: classical formulations
the system; both quantities are of a vectorial nature. No distinctions exist between various types of forces: the vector sum of all externally applied forces is parallel to the acceleration vector. In Lagrangian dynamics, the focus is on two scalar quantities, the Lagrangian of the system and the virtual work done by the applied forces. These quantities are characteristics of the complete system, not of individual particles. The efſciency and elegance of Lagrange’s approach stems from the fact that the equations of motion of complex mechanical systems can be derived from these two scalar quantities. Various types of forces are treated differently in Lagrange’s formulation. First, the virtual work done by the inertial forces is directly related to the kinetic energy of the system. Next, externally applied forces are divided into conservative and nonconservative forces. The virtual work done by the conservative forces equals the variation of the potential of these forces; the Lagrangian of the system is the difference between the kinetic and potential energies of the system. The virtual work done by the non-conservative forces gives rise to the generalized, non-conservative forces. This distinction is also present for constrained systems: for holonomic constraints, the constraint forces can be derived from a potential, whereas no such potential exists for nonholonomic constraints. Example 10.6. The skateboard Figure 10.11 depicts the simpliſed conſguration of a skateboard of mass m and moment of inertia I about its center of mass G. The skateboard rolls without sliding on the horizontal plane by means of a wheel aligned with axis e¯1 of the skateboard and located at point C, a distance from the center of mass. The position vector of the center of mass is written as rG = x ¯ı1 +y ¯ı2 , and the axis of the skateboard makes an angle θ with the horizontal. This system is subjected to a constraint: because the wheel does not slip, the velocity vector of the contact point must be along axis e¯1 . This nonholonomic constraint and the corresponding constraint matrix are given by eqs. (9.50) and (9.51), respectively. C i2
e2
G l
C e1
vC c
F q
i1
i1 Fig. 10.11. Skateboard with front wheel.
The kinetic energy of the system is K = m(x˙ 2 + y˙ 2 )/2 + I θ˙2 /2. The potential energy of the system vanishes, and hence, the Lagrangian of the system equals the kinetic energy. The generalized coordinates of the system are q1 = x, q2 = y, and q3 = θ. The generalized momenta of the system are px˙ = mx, ˙ py˙ = my, ˙ and ˙ Lagrange’s formulation then yields the equations of motion of the system p = I θ. θ˙
m¨ x = −λ sin θ,
m¨ y = λ cos θ,
I θ¨ = λ,
10.4 The lower pair joints
405
where λ is Lagrange’s multiplier associated with the nonholonomic constraint. These equations are identical to those obtained from the Newtonian approach, eqs. (9.49). Lagrange’s multiplier, λ, is the contact force of the wheel on the ground, λ = −F C , see ſg. 10.11. 10.3.3 Problems Problem 10.21. The skateboard Figure 10.11 depicts the simpliſed conſguration of a skateboard of mass m and moment of inertia I about axis ¯ı3 , computed with respect to its center of mass G. The skateboard solely moves in the horizontal plane: the position vector of the center of mass is written as r G = x¯ı1 + y¯ı2 , and the axis of the skateboard makes an angle θ with respect to ¯ı1 . A wheel of mass M and radius R is connected to the front of the skateboard, at a distance from its center of mass. The wheel rolls on the ground without slipping. (1) Determine the number of degrees of freedom for this system. (2) Discuss the nature of the constraints. (3) Find the equations of motion of the system using Lagrange’s approach. Use the following generalized coordinates: x, y, θ, and ψ, the rotation of the wheel.
10.4 The lower pair joints A distinguishing feature of multibody systems is the presence of joints that impose constraints on the relative motion of the various bodies of the system. Most joints used in practical applications can be modeled in terms of the so called lower pairs [29]: the revolute, prismatic, screw, cylindrical, planar and spherical joints, depicted in ſg. 10.12. In some cases, however, joints with specialized kinematic conditions must also be developed.
Cylindrical
Prismatic
Revolute
Spherical Fig. 10.12. The six lower pairs.
Screw
Planar
406
10 Constrained systems: classical formulations
10.4.1 Kinematics of a typical lower pair joint Consider two bodies, denoted “body k” and “body ,” as shown in ſg. 10.13. Quantities pertaining to body k and body will be indicated with superscripts (·)k and (·) , respectively. A lower pair joint, i.e., anyone of the joints depicted in ſg. 10.12, connects the two bodies at points K and L, which are material points of , respectively. In the reference conſguration, bodies k and two frames, F0k = K, B0k = (¯ ek01 , e¯k02 , e¯k03 ) and F0 = L, B0 = (¯ e01 , e¯02 , e¯03 ) , are attached to bodies k and , respectively. In the deformed are deſned conſguration, the bodies by two frames, F k = K, B k = (¯ ek1 , e¯k2 , e¯k3 ) and F = L, B = (¯ e1 , e¯2 , e¯3 ) , respectively. k
u,R
k
e3
k k
l
e03
e01
k
Body l
e2 l
e1
K
K
L
l
e02
k
k
e02
e1 L l
l
e2
u,R
l
u0l, Rl0
i3
l
e03
l
e3
O
Deformed configuration
i1
i2 I
k
k
k Body k e01 Reference configuration
u 0, R 0 Lower pair joint
Fig. 10.13. Typical lower pair joint in the reference and deformed conſgurations.
10.4.2 Notational conventions Unless otherwise indicated, the components of all tensors will be resolved in the inertial frame, denoted F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. In the reference conſguration, the position vectors of points K and L are denoted uk0 and u0 , respectively, and rotation tensors Rk0 and R0 describe the rotations from basis I to B0k and I to B0 , respectively. The displacement vectors of these two points are denoted uk and u , respectively; tensors Rk and R describe the rotations from basis B0k to B k , and B0 to B , respectively. The virtual rotations vectors are deſned as δψ k = axial(δRk RkT ) and δψ = axial(δR RT ). Finally, the relative displacement vector at the joint is deſned as u = u − uk . (10.21) With these notations, unit vectors of triads B k and B can be expressed as e¯kα = Rk Rk0 ¯ıα
and e¯β = R R0¯ıβ ,
(10.22)
10.4 The lower pair joints
407
respectively, where α = 1, 2, or 3 and β = 1, 2, or 3. Variations of these unit vectors are readily found as k δ¯ ekα = ekT α δψ
and δ¯ eβ = eT β δψ ,
(10.23)
respectively. To simplify the expressions of the constraints associated with the various joints, the following notation is adopted for the scalar and vector products of two unit vectors of bases B k and B , (10.24a)
gαβ = e¯kT ¯β , α e hαβ =
ekα e¯β .
(10.24b)
Using eqs. (10.23), variations of these two quantities are easily obtained as δgαβ = (δψ k − δψ )T hαβ , δhαβ = δψ
kT
Dk αβ
− δψ
T
Dk . βα
(10.25a) (10.25b)
where the following matrices were deſned Dk = ekα eβ , αβ
Dk = eβ ekα . βα
(10.26)
During the solution process, linearization of the forces of constraint will be required. In turn, this calls for the evaluation of increments in the kinematic quantities expressing the constraints. For instance, an increment in unit vector e¯kα will be k k k k written as Δ¯ ekα = ekT kT α Δψ = e α H Δr , where H is the tangent operator. Rek k lationship Δψ = H Δr k expresses the differential rotation vector, Δψ k , in terms of the increments of the rotation parameters, Δr k . If Euler angles with the 3-1-3 sequence are basis B0k to B k ,
usedto represent the ſnite rotation of body k that brings k kT r = φ, θ, ψ and eqs. (4.68) deſne the tangent operator, H . Similar notation are used for the rotation of body ; in summary, Δψ k = H k Δr k ,
Δψ = H Δr .
(10.27)
10.4.3 Relative motions In the reference conſguration, the relative motions at the joint are assumed to vanish, i.e., uk0 = u0 and Rk0 = R0 . In the deformed conſguration, Δγ denote the relative displacement between the two bodies along unit vector e¯kγ , and φγ the relative rotation about the same unit vector. More formally, the relative displacements and rotations of the two bodies are deſned as e¯kT γ u − Δγ = 0,
(10.28a)
gαα sin φγ + gαβ cos φγ = 0,
(10.28b)
408
10 Constrained systems: classical formulations
respectively. In the second equation, indices α, β, and γ, are such that αβγ = +1, where the permutation symbol, αβγ , is deſned by eq. (1.25). For a planar rotation of magnitude θ about unit vector e¯kγ , gαα = cos θ, gαβ = − sin θ, and eq. (10.28b) becomes sin(φγ − θ) = 0 or φγ = θ, as expected. Table 10.1 now formally deſnes the six lower pairs depicted in ſg. 10.12 in terms of the relative displacement and/or rotation components that each joint allows or inhibits. If the two bodies are rigidly connected to each other, their six relative motions, three displacements and three rotations, must vanish at the connection point. Table 10.1. Deſnition of the six lower pair joints. Symbols “” or “X” indicate that relative motion is allowed or inhibited, respectively. For the screw joint, p is the pitch of the screw. Joint type Revolute Prismatic Screw Cylindrical Planar Spherical
Relative displacements Relative rotations Δ1 Δ2 Δ3 φ1 φ2 φ3 X X X X X X X X X X X X pφ3 X X X X X X X X X X X X
Setting Δγ = 0 in eq. (10.28a) yields the constraint equation expressing the vanishing of the relative displacement along unit vector e¯kγ . Similarly, setting φγ = 0 in eq. (10.28b) expresses the vanishing of the relative rotation about unit vector e¯kγ . On the other hand, if relative displacement along unit vector e¯kγ is allowed, eq. (10.28a) deſnes the magnitude of the relative displacement, Δγ , along that direction. Similarly, if relative rotation about unit vector e¯kγ is allowed, eq. (10.28b) deſnes the magnitude of the relative rotation, φγ , about that vector. The explicit deſnition of the relative displacement and rotation components in lower pair joints as additional variables represents an important detail of the implementation. First, it allows the introduction of spring and/or damper elements in the joints, as usually required for modeling realistic conſgurations. Second, the time histories of joint relative motions can be driven according to suitably speciſed time functions or by actuators presenting their own physical characteristics.
10.5 Generic constraints for lower pair joints Although the six lower pair joints depicted in ſg. 10.12 are kinematically very different from each other, the constraints they impose on the bodies they are connected to are of two distinct types only. Lower pair joints inhibit one or more relative rotation components, and/or one or more relative displacement components. These two generic constraints are examined in details and the associated forces of constraint are derived from Lagrange’s multiplier method in sections 10.5.1 and 10.5.2. In addition,
10.5 Generic constraints for lower pair joints
409
the relative rotation and displacement components at a joint will be deſned by means of constraint of two types, which are examined in sections 10.5.3 and 10.5.4. Once the formulation of these generic constraints has been developed, the formulations of the six lower pair joints will be presented in section 10.6. 10.5.1 First constraint: vanishing relative rotation The constraint associated with the vanishing of the relative rotation at a lower pair joint connecting two bodies is readily obtained by imposing φγ = 0 in eq. (10.28b) to ſnd gαβ = 0. In view of eq. (10.24a), this constraint imposes the orthogonality of two unit vectors, e¯kT ¯β = 0, and is written in a generic manner as α e ¯β = 0. CI = gαβ = e¯kT α e
(10.29)
For various types of lower pair joints, indices α and β will take different values. This holonomic constraint is enforced using Lagrange’s multiplier method developed in section 10.2.1. The potential of the constraint forces is VIc = λI CI , where λI is Lagrange’s multiplier used to enforce this constraint; variation of this potential yields δVIc = δλI CI + λI δCI . The second term represents the virtual work done by the constraint force, δW c = λI δCI = δq T F cI , where array δq stores variations of the generalized coordinates associated with this constraint and F cI the corresponding constraint forces. Variation of the constraint is expressed as δCI = B I δq, and it follows that F cI = λI B TI . Equation (10.25a) now yields k δψ hαβ T δq = , B , F cI = λI B TI . = (10.30) I −hαβ δψ Because the orthogonality constraint expressed by eq. (10.29) is nonlinear, numerical processes for the solution of constrained multibody systems will rely on successive linearizations of this constraint and associated forces. An increment in the constraint is expressed as ΔCI =
∂CI Δq = Z TI Δq, ∂q
where array Z I is easily found as k Δr Δq = , Δr
ZI =
H kT hαβ −H T hαβ
(10.31)
.
(10.32)
Arrays r k and r store the rotation parameters representing the rotations of bodies k and , respectively, and H k and H are the corresponding tangent operators deſned by eqs. (10.27). An increment in the forces of constraint is expressed as ΔF cI =
∂F cI Δq = X I Δq, ∂q
(10.33)
410
10 Constrained systems: classical formulations
where X I is the equivalent stiffness matrix for the constraint. Partial derivatives of the constraint forces yield the following expression for this matrix Dk H k −Dk H βα αβ , (10.34) X I = λI −Dk H k Dk H βα αβ where matrices Dk and Dk are deſned by eqs. (10.26). αβ βα 10.5.2 Second constraint: vanishing relative displacement The constraint associated with the vanishing of the relative displacement at a lower pair joint connecting two bodies is readily obtained by imposing Δα = 0 in eq. (10.28a) to ſnd e¯kT α u = 0. This constraint imposes the orthogonality of the relative displacement vector deſned by eq. (10.21) to unit vector e¯kα . This orthogonality constraint is written in a generic manner as CII = e¯kT α u = 0.
(10.35)
For various types of lower pair joints, index α will take different values. This holonomic constraint is enforced using Lagrange’s multiplier method, as c discussed in section 10.5.1. The potential of the constraint forces is VII = λII CII , c T c and the virtual work done by the constraint force is δW = δq F II . The variation of the constraint, δCII , is evaluated using eq. (10.23) to ſnd ⎧ k⎫ ⎧ ⎫ ekα ⎬ ⎨ δu ⎬ ⎨ −¯ ek u , F cII = λII B TII . (10.36) δq = δψ k , B TII = ⎩ ⎭ ⎩ αk ⎭ e¯α δu An increment in the constraint is expressed as ΔCII = Z TII Δq, where array Z II is easily found as ⎧ k⎫ ⎧ ⎫ ekα ⎬ ⎨Δu ⎬ ⎨ −¯ Δq = Δr k , Z II = (10.37) H kT ekα u . ⎩ ⎭ ⎩ ⎭ Δu e¯kα Array r k stores the rotation parameters representing the rotations of body k, and H k is the corresponding tangent operator deſned by eqs. (10.27). The equivalent stiffness matrix for this constraint is ⎡
X II
⎤ ekα H k 0 0 = λII ⎣− ekα u ekα H k ekα ⎦ . 0 − ekα H k 0
(10.38)
10.5 Generic constraints for lower pair joints
411
10.5.3 Third constraint: deſnition of relative rotation The constraint associated with the deſnition of the relative rotation at a lower pair joint connecting two bodies is given by eq. (10.28b). This constraint is written in a generic manner as CIII = gαα sin φγ + gαβ cos φγ = 0. (10.39) Indices α, β, and γ, are such that αβγ = +1, where the permutation symbol, αβγ , is deſned by eq. (1.25). For various types of lower pair joints, index γ will take different values. This holonomic constraint is enforced using Lagrange’s multiplier method, as c discussed in section 10.5.1. The potential of the constraint forces is VIII = λIII CIII , c T c and the virtual work done by the constraint force is δW = δq F III . The variation of the constraint, δCIII , is evaluated using eq. (10.25a) to ſnd ⎧ k⎫ ⎧ ⎫ ⎨δψ ⎬ ⎨ w⎬ (10.40) δq = δψ , B TIII = −w , F cIII = λIII B TIII , ⎩ ⎩ ⎭ ⎭ τ δφγ where w = hαα sin φγ + hαβ cos φγ and τ = gαα cos φγ − gαβ sin φγ . An increment in the constraint is expressed as ΔCIII = Z TIII Δq, where array Z III is easily found as ⎫ ⎧ k⎫ ⎧ kT ⎨ Δr ⎬ ⎨ H w⎬ Δq = Δr , Z III = −H T w . (10.41) ⎭ ⎩ ⎩ ⎭ Δφγ τ Arrays r k and r store the rotation parameters representing the rotations of bodies k and , respectively, and H k and H are the corresponding tangent operators deſned by eqs. (10.27). The equivalent stiffness matrix for this constraint is ⎡
X III
⎤ z ET H k E H = λIII ⎣−E T H k E H −z ⎦ , z T H k −z T H −CIII
(10.42)
sin φγ + Dk cos φγ . where z = hαα cos φγ − hαβ sin φγ and E = Dk αα αβ 10.5.4 Fourth constraint: deſnition of relative displacement The constraint associated with the deſnition of the relative displacement at a lower pair joint connecting two bodies is given by eq. (10.28a). This constraint is written in a generic manner as CIV = e¯kT (10.43) γ u − Δγ = 0. For various types of lower pair joints, index γ will take different values.
412
10 Constrained systems: classical formulations
This holonomic constraint is enforced using Lagrange’s multiplier method, as c discussed in section 10.5.1. The potential of the constraint forces is VIV = λIV CIV , c T c and the virtual work done by the constraint force is δW = δq F IV . The variation of the constraint, δCIV , is evaluated using eq. (10.23) to ſnd ⎫ ⎧ k⎫ ⎧ δu ⎪ −¯ ekα ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎬ ⎨ k⎬ ⎨ k ⎪ eα u δψ T , F cIV = λIV B TIV . , B δq = = (10.44) k IV e ¯ ⎪ ⎪ ⎪ ⎪ δu α ⎪ ⎪ ⎪ ⎪ ⎭ ⎩ ⎩ ⎭ −1 δΔγ An increment in the constraint is expressed as ΔCIV = Z TIV Δq, where array Z IV is easily found as ⎫ ⎧ ⎧ ⎫ −¯ ekα ⎪ Δuk ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎬ ⎨ k⎬ ⎨ Δr H kT ekα u . (10.45) , Z Δq = = IV Δu ⎪ ⎪ ⎪ e¯kα ⎪ ⎪ ⎪ ⎪ ⎪ ⎭ ⎩ ⎩ ⎭ ΔΔγ −1 Array r k stores the rotation parameters representing the rotations of body k, and H k is the corresponding tangent operator deſned by eqs. (10.27). The equivalent stiffness matrix for this constraint is ⎡ X IV = λIV
0 ekα H k k ⎢− ekα H k ⎢ eα u ⎣ 0 − ekα H k 0 0
0 ekα 0 0
⎤ 0 0⎥ ⎥. 0⎦
(10.46)
0
10.6 Constraints for the lower pair joints In this section, the constraints associated with the six lower pair joints depicted in ſg. 10.12 are detailed. The corresponding constraint forces are derived, and their physical nature is discussed. 10.6.1 Revolute joints Figure 10.13 depicts two bodies linked together by a lower pair joint. The kinematics of the problem and the corresponding notational conventions are presented in sections 10.4.1 and 10.4.2, respectively. This section focuses on a speciſc type of joint, the revolute joint, depicted in ſg. 10.14. For this joint, points K and L are coincident in both reference and deformed conſgurations. The revolute joint allows the two bodies it connects to rotate with respect to each other about a material axis, selected, by convention, to be e¯k3 = e¯3 . This condition implies the orthogonality of e¯k3 to both e¯1 and e¯2 .
10.6 Constraints for the lower pair joints k
l
e3 = e3
R
l
e2
413
k
l
l
e2
k
e1
k
f K=L Deformed configuration
R k
k
K=L
l
u =u I k
l
e03 = e03
k
i3
e1
i1
l
e02 = e02
i2 l
l
k
e01 = e01
uk0 = u0l
Reference configuration
R0 = R0
Fig. 10.14. Revolute joint in the reference and deformed conſgurations.
The revolute joint is characterized by the following kinematic constraints C 1 = u − uk = 0,
(10.47a)
C2 =
= g31 = 0,
(10.47b)
= g32 = 0,
(10.47c)
C3 =
e¯kT ¯1 3 e e¯kT ¯2 3 e
C4 = g11 sin φ + g12 cos φ = 0.
(10.47d)
Constraint (10.47a) expresses the vanishing of the relative displacement at the joint; it is readily enforced by Boolean identiſcation of the corresponding degrees of freedom. The second and third constraints are of type I, see section (10.5.1), with α = 3, β = 1, and α = 3, β = 2, for eqs. (10.47b) and (10.47c), respectively. Finally, the last constraint is of type III, see section 10.5.3, and deſnes the relative rotation about unit vector e¯k3 = e¯3 , denoted φ in ſg. 10.14. The combination of eqs. (10.30) and (10.40) yields the forces associated with the revolute joint constraints, ⎧ ⎧ ⎧ ⎫ ⎫ ⎫ ⎨ h31 ⎬ ⎨ h32 ⎬ ⎨ (h11 sin φ + h12 cos φ)⎬ F c = −h31 λ1 + −h32 λ2 + −(h11 sin φ + h12 cos φ) λ3 , (10.48) ⎩ ⎩ ⎩ ⎭ ⎭ ⎭ (g11 cos φ − g12 sin φ) 0 0 where Lagrange’s multipliers λ1 , λ2 , and λ3 are associated with constraints (10.47b), (10.47c), and (10.47d), respectively. When the constraints are satisſed, h31 = e¯2 and h32 = −¯ e1 . The forces of constraint associated with the ſrst constraint correspond to two moments acting about unit vector e¯2 and of magnitudes +λ1 and −λ1 , respectively, applied to bodies k and , respectively. The forces of constraint associated with the second constraint are readily interpreted in a similar manner. The moments associated with these ſrst two constraints enforce the parallelism of unit vectors e¯k3 and e¯3 . When the constraints are satisſed, h11 = e¯k3 sin φ and h12 = e¯k3 cos φ, implying that h11 sin φ + h12 cos φ = e¯k3 ; furthermore, g11 cos φ − g12 sin φ = cos φ cos φ − (− sin φ) sin φ = 1. To interpret the forces associated with the third constraint, it is
414
10 Constrained systems: classical formulations
assumed that a motor applies a torque Q at the revolute joint; the virtual work done by this torque is then δW = Qδφ. Because Lagrange’s multiplier technique is used to enforce the constraint, the relative rotation, φ, is now an unconstrained variable, and the corresponding equation of motion will be λ3 +Q = 0: Lagrange’s multiplier is of equal magnitude and opposite sign to the applied torque. The remaining components of the constraint forces correspond to two moments acting about unit vector e¯k3 and of magnitude −Q and +Q, respectively, transmitting the applied torque to bodies k and , respectively. If no torque is applied at the joint, Lagrange’s multiplier vanishes, λ3 = 0, and no forces are associated with this constraint, which simply deſnes variable φ but applies no forces to the system. 10.6.2 Prismatic joints Figure 10.13 depicts two bodies linked together by a lower pair joint. The kinematics of the problem and the corresponding notational conventions are presented in sections 10.4.1 and 10.4.2, respectively. This section focuses on the prismatic joint, depicted in ſg. 10.15. For this joint, the two bases coincide in the reference conſguration, B0k = B0 , and in the deformed conſguration, B k = B . The prismatic joint allows the two bodies it connects to translate with respect to each other along a material axis, selected, by convention, to be e¯k3 = e¯3 . This condition implies the orthogonality of unit vectors e¯k1 and e¯k2 to the relative displacement vector, u = u −uk . Reference configuration
u
k k
e2
l
k
e02 = e02 l
k
e01 = e01 K=L
i3 I
u0k = u0l
u
l
l
e2
D k
k
l
e03 = e03
e1
i2 l
e3 = e3
i1
k
K
L
l
e1 Deformed configuration
Fig. 10.15. Prismatic joint in the reference and deformed conſgurations.
The prismatic joint is characterized by the following kinematic constraints C 1 = R − Rk = 0,
(10.49a)
C2 =
= 0,
(10.49b)
= 0,
(10.49c)
− Δ = 0.
(10.49d)
C3 = C4 =
e¯kT 1 u e¯kT 2 u e¯kT 3 u
Constraint (10.49a) expresses the vanishing of the relative rotation at the joint; it is readily enforced by Boolean identiſcation of the corresponding degrees of freedom.
10.6 Constraints for the lower pair joints
415
The second and third constraints are of type II, see section (10.5.2), with α = 1 and 2, for eqs. (10.49b) and (10.49c), respectively. Finally, the last constraint is of type IV , see section 10.5.4, and deſnes the relative displacement along unit vector e¯k3 = e¯3 , denoted Δ in ſg. 10.15. The combination of eqs. (10.36) and (10.44) yields the forces associated with the prismatic joint constraints, ⎧ k⎫ ⎧ k⎫ ⎧ k⎫ −¯ e ⎪ −¯ e ⎪ −¯ e ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎬ ⎬ ⎬ ⎨ k 1⎪ ⎨ k 2⎪ ⎨ k 3⎪ e1 u e2 u e3 u c λ λ λ . F = + + (10.50) e¯k ⎪ 1 ⎪ e¯k ⎪ 2 ⎪ e¯k ⎪ 3 ⎪ ⎪ ⎪ ⎪ ⎭ ⎭ ⎭ ⎩ 1⎪ ⎩ 2⎪ ⎩ 3⎪ 0 0 −1 where Lagrange’s multipliers λ1 , λ2 , and λ3 are associated with constraints (10.49b), (10.49c), and (10.49d), respectively. When the constraints are satisſed, ek1 u = −u¯ ek2 , ek2 u = u¯ ek1 , and ek3 u = 0. The forces of constraint associated with the ſrst constraint correspond to two forces acting along unit vector e¯k1 and of magnitude −λ1 and +λ1 , respectively, applied to bodies k and , respectively, and one moment acting about unit vector e¯k2 and of magnitude −uλ1 , applied to both bodies k and that share a common orientation. The forces of constraint associated with the second constraint are readily interpreted in a similar manner. The forces associated with these ſrst two constraints enforce the collinearity of unit vectors e¯k3 and e¯3 ; the moments account for the fact that these aligning forces form couples with a moment arm u. To interpret the forces associated with the third constraint, it is assumed that an actuator applies a force F at the prismatic joint; the virtual work done by this force is then δW = F δΔ. Because Lagrange’s multiplier technique was used to enforce the constraint, the relative displacement, Δ, is now an unconstrained variable, and the corresponding equation of motion will be λ3 − F = 0: Lagrange’s multiplier equals the applied force. The remaining components of the constraint forces correspond to two forces along unit vector e¯k3 and of magnitude −F and +F , respectively, transmitting the applied force to bodies k and , respectively. If no force is applied at the joint, Lagrange’s multiplier vanishes, λ3 = 0, and no forces are associated with this constraint. 10.6.3 Cylindrical joints Figure 10.16 depicts the cylindrical joint, which is one of the lower pair joints discussed in a generic manner in section 10.4.1. The cylindrical joint allows the two bodies it connects to rotate and translate with respect to each other about a material axis, implying the orthogonality of e¯k3 to both e¯1 and e¯2 and the orthogonality of unit vectors e¯k1 and e¯k2 to the relative displacement vector, u = u − uk . The cylindrical joint is characterized by the following kinematic constraints: constraints (10.47b) and (10.47c) expressing the orthogonality of unit vectors e¯1 and e¯2 to unit vector e¯k3 , and constraints (10.49b) and (10.49c) expressing the orthogonality of unit vectors e¯k1 and e¯k2 to the relative displacement vector, u. The relative rotation
416
10 Constrained systems: classical formulations k
k
l
l
e02 = e02
u,R k
K=L
l
e01 = e01
uk0 = u0l Reference configuration k l e03 = e03 i1
l
i3 I
e2
u k , Rk
l
e1 L
l
e2
K f
k
e1 D
i2 k
l
e3 = e3
Deformed configuration
Fig. 10.16. Cylindrical joint in the reference and deformed conſgurations.
about unit vector e¯k3 = e¯3 , denoted φ, and relative displacement along the same axis, denoted Δ, between the two bodies are deſned by adding to the formulation constraints (10.47d) and (10.49d), respectively. Clearly, the cylindrical joint combines the constraints of the revolute and prismatic joints. The associated forces of constraints are identical to those developed in section 10.6.1 and 10.6.2 and will not be repeated here. 10.6.4 Screw joints The kinematic constraints associated with the screw joint are identical to those of the cylindrical joint. An additional constraint imposes a linear relationship between the relative rotation, φ, and relative displacement, Δ, C =Δ−
p φ = 0, 2π
(10.51)
where p is the pitch of the screw. 10.6.5 Planar joints Figure 10.13 depicts two bodies linked together by a lower pair joint. The kinematics of the problem and the corresponding notational conventions are presented in sections 10.4.1 and 10.4.2, respectively. This section focuses on the planar joint, depicted in ſg. 10.17. The planar joint allows the two bodies it connects to translate with respect to each other within a material plane, selected, by convention, to be normal to unit vector e¯k3 = e¯3 . This condition implies the orthogonality of unit vector e¯k3 to the relative displacement vector, u = u − uk . The planar joint further allows the two bodies to rotate with respect to each other about the axis perpendicular to the material plane, e¯k3 = e¯3 . This condition implies the orthogonality of unit vector e¯k3 to unit vectors e¯1 and e¯2 . The planar joint is characterized by the following kinematic constraints
10.6 Constraints for the lower pair joints l
k
e03 = e03
l
Reference configuration
l
k
k
u,R
l
k
l
e1 f
l k
k
k
e2
l
e3 = e3
l
e02 = e02 e01 = e01
k
e3 = e3
l
u,R
k
417
u l - uk
e2
D1
i3
D2
l
R0 = R0 u0k = u0l
I
k
e1
i2
Deformed configuration
i1 Fig. 10.17. Planar joint in the reference and deformed conſgurations.
C1 = e¯kT 3 u = 0,
(10.52a)
C2 =
(10.52b)
C3 = C4 =
e¯kT ¯1 = g31 = 0, 3 e e¯kT ¯2 = g32 = 0, 3 e e¯kT 1 u − Δ1 = 0. e¯kT 2 u − Δ2 = 0.
C5 = C6 = g11 sin φ + g12 cos φ = 0.
(10.52c) (10.52d) (10.52e) (10.52f)
Constraint (10.52a) is of type II, see section 10.5.2, with α = 3. The second and third constraints are of type I, see section (10.5.1), with α = 3, β = 1, and α = 3, β = 2, for eqs. (10.52b) and (10.52c), respectively. The next two constraints are of type IV , see section 10.5.4, with α = 1 and α = 2 for eqs. (10.52d) and (10.52e), respectively. They deſne the relative displacements of the bodies along unit vectors e¯k1 and e¯k2 , respectively, denoted Δ1 and Δ2 , respectively. Finally, the last constraint is of type III, see section 10.5.3, and deſnes the relative rotation about unit vector e¯k3 = e¯3 , denoted φ in ſg. 10.17. The planar joint combines the constraints of the revolute and prismatic joints. The associated forces of constraints are identical to those developed in sections 10.6.1 and 10.6.2. 10.6.6 Spherical joints Figure 10.18 depicts the spherical joint, which is one of the lower pair joints discussed in a generic manner in section 10.4.1. The spherical joint allows the two bodies it connects to freely rotate with respect to each other about a material point, K = L, while preventing any relative displacement at this point, i.e., uk = u . The spherical joint is characterized by constraints (10.47a), which prevents relative displacement between the bodies. This constraint is readily enforced by Boolean identiſcation of the corresponding degrees of freedom.
418
10 Constrained systems: classical formulations l
e2
k
e2
k
e1
k
Rl
K=L Deformed configuration
I
l
k
e02 = e02 k
i3 l
l
e1
e3
e3
k
e03 = e03
Rk
l
l
u =u k
l
R0 = R0 u0k = u0l i2
i1
K=L k
l
e01 = e01 Reference configuration
Fig. 10.18. Spherical joint in the reference and deformed conſgurations.
10.6.7 Problems Problem 10.22. Relative rotation of a revolute joint Consider the revolute joint discussed in section 10.6.1. Express the rotation of body , R , as a function of rotation of body k, Rk , and of the relative rotation angle φ.
Problem 10.23. Relative motion of a prismatic joint Consider the prismatic joint discussed in section 10.6.2. Express the displacement of body , u , as a function of displacement of body k, uk , and of the relative displacement Δ.
10.7 Other joints Multibody systems often involve a variety of joints that impose constraints on the relative motion of the bodies of the system. The lower pairs described in the previous sections can be used to synthesize more complex joints: for instance, the universal joint depicted in ſg. 10.19 can be viewed two revolute joints sharing a common axis of rotation along unit vector e¯k3 and two more revolute joints sharing a common axis of rotation along unit vector e¯3 . In many cases, however, joints with specialized kinematic conditions must be developed. 10.7.1 Universal joints Although the universal joint depicted in ſg. 10.19 is not a lower pair joint, the kinematic description and notational conventions presented in sections 10.4.1 and 10.4.2, respectively, will be used here again. At the heart of the universal joint is a cruciform, consisting of two rigidly connected bars assembled together at a 90 degree angle. Body k is allowed to rotate about unit vector e¯k3 , which is aligned with the ſrst bar of the cruciform; body is allowed to rotate about unit vector e¯3 , which is aligned with the second bar of the cruciform. It follows that unit vectors e¯k3 and e¯3 are material axes of bodies k and , respectively, and point K = L is a material point of both bodies.
10.7 Other joints
R
419
l
u = ul k
k
e3
Cruciform
i3
l
e3
l
k
e03
R0
K=L Deformed R configuration
l
R0
I k
i1
i2 k 0
u =u
K=L l 0
k
Reference e03 configuration
Fig. 10.19. Universal joint in the reference and deformed conſgurations.
The universal joint is characterized by the following kinematic constraints: constraints (10.47a), which prevents relative displacement between the bodies, and a single type I constraint, see eq. (10.29), with α = β = 3. The universal joint uses a subset of the constraint developed for the revolute joint. 10.7.2 Curve sliding joints The kinematic conditions associated with the sliding of a body along a ƀexible track have been presented by Li and Likins [30] within the framework of Kane’s method. Cardona [31] derived a ſnite element based formulation for the sliding of a body along a prescribed curve. Finally, Bauchau [32] presented the formulation of a sliding joint that enforces the sliding of a body along a ƀexible beam. This formulation was later reſned [33] to include constraints on the relative rotation between the sliding bodies. This section describes the curve sliding joint that enforces the sliding of a body on a rigid curve connected to another body. Figure 10.20 depicts two bodies linked together by a curve sliding joint. Here again, the kinematic description and notational conventions presented in sections 10.4.1 and 10.4.2, respectively, will be used. Spatial curve C, see section 2.2, is rigidly connected to body k. A curve sliding joint involves displacement constraints requiring point L, a material point of body , to slide along curve C, which is rigidly connected to body k. Let p∗ (η) and p(η) = Rk Rk0 p∗ (η) be the components of the position vector of a point on curve C with respect to point K, resolved in bases B k and I, respectively. Moreover, let P k be the components of the position vector of an arbitrary point on curve C with respect to point O, resolved in basis I. It then follows that P k = uk0 + uk + p(η). Similarly, the components of the position vector of point L with respect to point O, resolved in basis I, are denoted P = u0 + u . Because point L must be along curve C, the following vector constraint must be satisſed C = P k − P = u0 + u + p(η) = 0, (10.53)
420
10 Constrained systems: classical formulations l
Body l
e02
l
e03
l
e01
L
h
l
u,R
p*(h0)
Body k
k
e3
p*(h)
k
K
e02
u0l, Rl0
k
e01 Reference Configuration
e2
e3
k
k
u,R
k
e03
l
l
h
l
i3 I O
k
u0k, R0
L
l
e1
K k
e1
i2
i1
k
e2
Deformed Configuration
Fig. 10.20. Conſguration of the curve sliding joint.
where u0 = uk0 − u0 and u = uk − u are the relative displacement vectors of the two bodies, in the reference and deformed conſgurations, respectively. This holonomic constraint is enforced using Lagrange’s multiplier method. The potential of the constraint forces is V c = λT C, and the virtual work done by the constraint force is δW c = δq T F c . The variation of the constraint, δC, is evaluated as ⎫ ⎧ k⎫ ⎧ ⎪ ⎪ ⎪ ⎪ ⎪ δu k ⎪ ⎪ I ⎪ ⎬ ⎨ ⎨ − p(η)⎬ δψ T , F c = B T λ. , B δq = = (10.54) ⎪ −I ⎪ ⎪ ⎪ δu ⎪ ⎪ ⎪ ⎪ ⎭ ⎭ ⎩ ⎩ T p1 (η) δη where p∗1 = p∗ (η), p1 (η) = Rk Rk0 p∗1 (η), and notation (·) indicates a derivative with respect to η. It is often necessary to know the curvilinear coordinate, s, along the curves. For instance, if a friction force of magnitude F f is present between body and curve C at point L, the formulation would require the evaluation of the virtual work done by this force, δW = F f δs. It is often convenient to use the very versatile NURBS representation of curves [34, 35], but this approach is based on an arbitrary parameterization, η ∈ [0, 1], as discussed in section 2.2.2. The intrinsic parameterization of the curve presented in section 2.2.1 directly uses curvilinear coordinate s, but is often very difſcult to obtain. To remedy the situation, an additional scalar constraint relating these two variables is necessary. Expressing the relationship between variables s and η is arduous, and more often that not, impossible. Equation (2.14), however, recast as s˙ = p1 η, ˙ provides a relationship between the corresponding generalized velocities, C = s˙ − p∗1 (η)η˙ = 0.
(10.55)
In general, this constraint is not integrable and hence, must be treated as a nonlinear, nonholonomic constraint.
10.7 Other joints
421
10.7.3 Sliding joints The formulation of prismatic joints was presented in section 10.6.2. The prismatic joint is characterized by the following kinematic constraints: constraint (10.49a) that prevents relative rotation between the bodies, and constraints (10.49b) and (10.49c) that express the orthogonality of unit vectors e¯k1 and e¯k2 to the relative displacement vector u. Note that although these constraints are expressed in terms of the kinematic variables at points K and L, they imply the sliding of body on body k at point K, when body is rigid. k
e2 k
e2
k
k
l
K
e2 k
D
l
e3 = e3
l
e1
e2
e3
e1
l
l
e3
k
K
L
e1
l
L
e1
Fig. 10.21. Prismatic joint with ƀexible body.
Fig. 10.22. Sliding joint with ƀexible body.
The situation is sharply different when body is ƀexible, as shown in ſg. 10.21. If conditions (10.49b), (10.49c) and (10.49a) are enforced, body is no longer sliding on body k at point K, i.e., contact between the bodies is no longer enforced. In actual systems, the piece of hardware corresponding to the prismatic joint implies the sliding of body on body k with contact at point K at all times, as depicted in ſg. 10.22. In fact, in the presence of ƀexible bodies, such joint is more accurately described as a sliding joint [32, 33, 36]. Due to the ƀexibility of body , the kinematic variables at material points K and L are no longer related by conditions (10.49b), (10.49c) and (10.49a). Rather, constraint conditions must be enforced between the kinematic variables at point K of body k, and the kinematic variables at the material point of body which is in contact with body k at an instant. Clearly, kinematic constraints (10.49b), (10.49c) and (10.49a) associated with the classical formulation of prismatic joints, and the kinematic constraint associated with sliding in the presence of ƀexible bodies are fundamentally different and will lead to sharply different dynamic responses of the system. Although the above discussion has focused on prismatic joints, it is clear that identical remarks can be made concerning the classical formulation of cylindrical joints, and about their inadequacy to model sliding behavior in the presence of ƀexible bodies. Figure 10.23 depicts two bodies linked together by a sliding joint. Body k is a ƀexible beam element whose displacement ſeld is interpolated from nodal quantities, see sections 16.3 and 17.7. In the reference conſguration, the coordinates of a point
422
10 Constrained systems: classical formulations
on the beam are uk0 (η) = N (η)ˆ uk0 ,
(10.56)
where u ˆk0 are the nodal positions in the reference conſguration, N (η) the displacement interpolation matrix deſned by eq. (17.6), and η ∈ [0, 1] a non-dimensional parameter indicating the location of a material particle along the beam axis in the reference conſguration. Body can be a rigid or ƀexible element of the system. The position vector of a node point of this body is denoted u0 in the reference conſguration. After deformation, the position h vector of a point on the beam becomes Reference P k (η) = N (η) (ˆ uk0 +ˆ uk ), where u ˆ k are configuration the nodal displacement vectors. SimiBody l h larly, the position vector of the node on l u body is P = u ˆ 0 + u ˆ , where u ˆ is the nodal displacement vector. k The kinematic constraint associu (h) u0l ated with the condition of body freely i3 sliding over the ƀexible beam is C = k u0(h) P k (η) − P = 0. Deformed Parameter η which determines the I i1 i2 configuration location of contact between bodies k Body k and is, of course, a time varying unFig. 10.23. Sliding joint in the reference and known of the problem. This constraint will be enforced using the Lagrange deformed conſgurations. multiplier method, see section 10.2.1. The virtual work done by the constraint force deſned by eq. (10.5) becomes ⎧ k ⎫T ⎫ ⎧ k ⎫T ⎧ NT u ⎬ u ⎬ ⎨ ⎨δˆ ⎬ ⎨δˆ F c = δη δη (ˆ uk0 + u ˆ k )T N T λ, ⎩ ⎭ ⎭ ⎩ ⎭ ⎩ δu δu −1
(10.57)
where (·) denotes a derivative with respect to η. 10.7.4 Problems Problem 10.24. Two rigid bodies connected by a rigid link
Figure 10.24 shows two rigid bodies connected at point K and L by a rigid link. The kinematics of the two bodies is represented using the conventions described in section 10.4.1 and ſg. 10.13. Points K and L are material points of bodies k and , respectively. In the reference conſguration, the position of point K with respect to point K is given by vector sk , with a similar deſnition for vector s . In the deformed conſguration, the corresponding position vectors are denoted S k and S , respectively. Due to the presence of the rigid link, the distance between points K and L must remain constant, leading to the following nonlinear holonomic constraint, C = (d2 − 2 )/2 = 0, where 2 = d0 2 is the constant length of the link. (1) Identify the array of generalized coordinates for this problem. (2) Determine the constraint
10.7 Other joints
423
matrix. (3) If the constraint is enforced via Lagrange’s multiplier method, derive the constraint forces. (4) Describe the physical interpretation of these forces of constraint. (5) Evaluate the equivalent stiffness matrix for the constraint.
Deformed configuration l u,l R
L L¢
s
l
L
i3
l
d0 s
k
K¢
ul0, R0
I
k
u0k, R0
K
S
l
L¢
f
i2
d
I
K¢
i1 Reference u,R configuration k
Universal joint
i2 Shaft k
S
i1
Shaft l
k
k
K
Fig. 10.24. Conſguration of the rigid link.
Fig. 10.25. Two rigid shafts connected by a universal joint.
Problem 10.25. Angular velocities of a universal joint Consider two rigid shafts connected by a universal joint as depicted in ſg. 10.25. The two shafts remain in the ſxed plane deſned by vectors ¯ı1 and ¯ı2 , and the constant angle between the shafts is φ. Let ω k denote the constant angular velocity of shaft k. (1) Find the angular velocity of shaft , denoted ω . (2) Plot the angular velocity ratio, ω /ω k , over one period of rotation of shaft k. (3) Find the maximum value of this ratio as a function of the relative shaft angle φ.
Problem 10.26. Point associated with a rigid body Figure 10.26 shows a rigid body in its reference and ſnal conſgurations. The kinematics of the rigid body is represented using the conventions described in section 10.4.1 and ſg. 10.13. Point A is a material point of the rigid body and its position vector with respect to reference point B of the rigid body is denoted sA and S A in the reference and ſnal conſgurations, respectively. Let uA denote the displacement vector of point A. Because u = uA , it is often desirable to use uA as an additional set of generalized coordinates, which are deſned by the following constraints, C = (u + S A ) − (uA + sA ) = 0. (1) Identify the array of generalized coordinates for this problem. (2) Determine the constraint matrix. (3) If the constraint is enforced via Lagrange’s multiplier method, derive the constraint forces. (4) Describe the physical interpretation of these forces of constraint. (5) Evaluate the equivalent stiffness matrix for the constraint.
Problem 10.27. The curve sliding joint Consider the curve sliding joint. (1) Compute the velocity of a material point on curve C. This can be obtained by taking a time derivative of the position vector of an arbitrary point on curve C, resolved in I considering η to be constant. (2) Compute the velocity of the material point on body . (3) Compute the relative velocity of body with respect to body k at the point of contact. (4) Use the constraint condition, eq. (10.53), to show that the component of relative velocity in the direction tangent to the curve is simply s. ˙ (5) What are the components of relative velocity in the other directions?
424
10 Constrained systems: classical formulations
uA
e03
e3 e2
SA i3
A
sA
Initial configuration
u 0, R 0
E
u, R
u0A e01
A
e02
B E0
e1
B
I O
i2
Final Configuration
i1
Fig. 10.26. Conſguration of the rigid body with an associated point.
Problem 10.28. Relative rotation for a universal joint The formulation of the universal joint was presented in section 10.7.1. In practice, a universal joint consists of a cruciform component connected to body k and by means of two pairs of revolute joints, as depicted in ſg. 10.27. In the reference conſguration, the followk k k k = (¯ g01 = e¯03 , g¯02 = ek03 e¯03 , g¯03 = e¯k03 ) ing orthonormal bases are constructed: B0g k k k and B0g = (¯ g01 = e¯03 , g¯02 = e03 e¯03 , g¯03 = e¯03 ). Let G0 and G0 be the components k of the rotation tensors deſning the rotations from I to B0g and I to B0g , respectively, resolved in I. In the deformed conſguration, the following orthonormal bases are constructed: Bgk = (¯ g1k = e¯3 , g¯2k = ek3 e¯3 , g¯3k = e¯k3 ) and Bg = (¯ g1 = e¯k3 , g¯2 = e3 e¯k3 , g¯3 = e¯3 ). Let Gk and G be the components of the rotation tensors deſning the rotations from I to Bgk and I to Bg , k k k k = (¯ gR1 , g¯R2 , g¯R3 ) respectively, resolved in I. Finally, two additional orthonormal bases, BR k = (¯ gR1 , g¯R2 , g¯R3 ) are deſned as g¯Rα = Rk Gk0 ¯ıα and g¯Rα = R G0¯ıα , respecand BRg tively. The rotation φk of the revolute joint between body k and the cruciform is deſned as k k ¯R1 ) sin φk − (¯ eT ¯R2 ) cos φk = 0; and rotation φ of the revolute joint between body (¯ eT 3 g 3 g kT ekT ¯R2 ) cos φ = 0. (1) Show that Gk0 = Rk0 R∗ (θk ) and the cruciform is (¯ e3 g¯R1 ) sin φ −(¯ 3 g ∗ k and G0 = R0 R (θ ), where θ and θ are the angles deſning the planar rotations from B0k to k B0g and B0 to B0g , respectively. (2) What are the values of angles φk and φ in the reference conſguration? (3) Find the relationship between the rotations of body , R , as a function of the rotation of body k, Rk and of the rotations φk and φ of the revolute joints.
Deformed configuration
R u = ul
l
k
e3 = g3 = g1
l
l
k
k
l
Reference configuration
l
Rl0
k
e03 = g03= g01
l
g02
g2
i3
k
k
l
e03 = g03 = g01 I k
k
l
e3 = g3 = g1
k
g2
R
k
i1
i2 uk0 = u0l
k
g02
Fig. 10.27. Conſguration of the universal joint.
k
R0
11 Constrained systems: advanced formulations
Multibody systems are characterized by two distinguishing features: system components undergo ſnite relative rotations and these components are connected by mechanical joints that impose restrictions on their relative motion. Finite rotations introduce geometric nonlinearities, hence, multibody systems are inherently nonlinear. Mechanical joints, such as the lower pair joints presented in section 10.4, result in algebraic constraints leading to a set of governing equations that combines differential and algebraic equations. Several textbooks are devoted to the description of the many formulations that have been developed to deal with these complex systems: see, for instance, Roberson and Schwertassek [37], Nikravesh [38], Amirouche [39], Schiehlen [40], Garc´Æa de Jal´on and Bayo [41], or Shabana [42]. Computer implementations of a number of the proposed methods and a comparison of their salient features is given by Schiehlen [43]. Bauchau and Laulusa [44, 45] have presented a comprehensive review of the many formulations and numerical techniques that have been used to enforce constraints in multibody systems. A survey paper by Schiehlen [46] summarizes different approaches to the derivation of the equations of motion for multibody systems. The choice of various frames of reference, system variables and mechanics principles are reviewed. While the dynamic behavior of the system is, of course, independent of the formalism used to describe it, the form of the equations of motion, the effort required to derive them, and the computational burden associated with their numerical solution are all affected by the choice of formalism. The same remarks apply to the methods used to enforce constraints: the effort involved in the derivation of the complete system of governing equations and associated constraints, the computational cost required for their solution, and the resulting accuracy all critically depend on the theoretical formalism and numerical methods used to solve the problem. Chapter 10 generalizes the basic formulations of dynamics to constrained systems. Lagrange’s multiplier technique, the key to this generalization, is shown to be both effective and elegant. Furthermore, because Lagrange’s multipliers are closely related to the forces of constraint, these new variables are often physically meaningful. Unfortunately, the use of Lagrange’s multipliers changes the mathematical
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_11 © Springer Science+Business Media B.V. 2011
426
11 Constrained systems: advanced formulations
nature of the equations of motion, which now become differential-algebraic equations rather than the ordinary differential equations that characterize unconstrained systems. The systematic use of Lagrange’s multipliers considerably simpliſes the development of the equations of motion of complex mechanical systems, but results in large systems of nonlinear differential-algebraic equations. Consequently, appropriate numerical techniques must be developed to deal with this type of problems. In fact, the availability of computationally efſcient and accurate numerical tools for the solution of systems of differential-algebraic equations enable the use of Lagrange’s multiplier approach. This chapter surveys the numerical tools developed for this task and their theoretical underpinnings. Typically, the equations of motion of constrained dynamical systems are cast in the form of Lagrange’s equation of the ſrst kind presented in section 11.1. Several approaches that eliminate these multipliers are presented in section 11.2: Maggi’s, the index-1, the null space, and Udwadia and Kalaba’s formulations are summarized in sections 11.2.1 to 11.2.7. A comparison of these various approaches appears in section 11.2.8. The geometric interpretation of the problem presented in section 11.3 presents valuable insight into the behavior of constrained systems. If projection operations are deſned in a space endowed with a metric deſned by the inverse of the mass matrix, the governing equations of motion of constrained system can be projected in the feasible and infeasible directions. Projections in the feasible direction yield the equations of motion of the system from which Lagrange’s multipliers have been eliminated, and projections in the infeasible direction yield an expression for the forces of constraint. Section 11.4 presents Gauss’ principle that has also been used for the solution of constrained dynamical systems. Additional formulations of a more theoretical nature are summarized in section 11.5.
11.1 Lagrange’s equations of the ſrst kind Consider a system represented by n generalized coordinates and subjected to m holonomic or nonholonomic constraints. Lagrange’s formulation yields the equations of motion of the system in the form of eqs. (10.12) or (10.19), for holonomic or nonholonomic systems, respectively. These equations, often called Lagrange’s equations of the ſrst kind, take the following form M (n×n) ¨q (n) + B T(n×m) λ(m) = F (n) ,
(11.1)
where M = M (q, t) is the symmetric, positive-deſnite mass matrix, F = F (q, q, ˙ t) are the dynamic and externally applied forces, λ the array of m Lagrange’s multipliers, and the subscripts indicate the sizes of the corresponding arrays. In the literature, Lagrange’s equations of the ſrst kind typically appear as in eqs. (11.1) instead (10.12): the constraint force term, B T λ, appears on the left- rather than right-hand side. This difference is unimportant because it corresponds to a change of sign of Lagrange’s multipliers.
11.2 Algebraic elimination of Lagrange’s multipliers
427
The constraints applied to the system are written as D (m) = B (m×n) (q, t) q˙ (n) + b(m) (q, t) = 0,
(11.2)
where B(q, t) is the constraint matrix. Constraints could be nonholonomic, in which case eq. (11.2) expresses relationships among the generalized velocities; it is assumed that these relationships depend on the generalized velocities in a linear manner. On the other hand, some of the constraints could be holonomic; this means that they can be integrated to the form C(q, t) = 0, see section 9.2. If all the constraint are holonomic, the n generalized coordinates are linked by m algebraic constraints. If these latter are independent, it is conceptually possible to partition the generalized coordinate array into independent, q I(n−m) , and depen
, such that q T = q IT , q DT and q D = f (q I ), leading to the dent coordinates, q D (m) elimination of the dependent variables. Unfortunately, this approach is fraught with difſculties: it is not clear which generalized coordinates should be selected to be independent. Furthermore, a poor selection of the independent set of coordinates might render f (q I ) singular, a suitable set of independent coordinates might become unsuitable for different conſgurations of the system, and ſnally, function f (q I ) might be so complex as to preclude any practical computations. Lagrange’s equations of the ſrst kind form a set of (n+m) Differential-Algebraic Equations (DAEs) for the (n + m) unknowns, q and λ; indeed, Lagrange’s multipliers are algebraic variables, i.e., no time derivatives of these variables appear in the equations, whereas ſrst- and second-order derivatives of the generalized coordinates are present, as implied by Newton’s second law. Gear, Petzold and coworkers [47, 48, 49], as well as Brennan [50], have given a formal deſnition of the index of a system of DAEs. The governing equations for mechanical systems with holonomic constraints are index-3 DAEs; typically, higher indices result in more arduous solution processes. Constraints written in the form of eq. (11.2) are sometimes called velocity level constraints. A time derivative of these constraints then yields ˙ t) − B(q, ˙ B(q, t) q¨ = −b(q, t) q˙ = c(m) (q, q˙ , t),
(11.3)
the acceleration level constraints. The linear dependency of eq. (11.2) on the generalized velocities implies the linear dependency of eq. (11.3) on the generalized accelerations.
11.2 Algebraic elimination of Lagrange’s multipliers In this section, algebraic procedures for the solution of Lagrange’s equations of the ſrst kind are presented. In all cases, the approach eliminates Lagrange’s multipliers to obtain a set of ordinary differential equations (ODEs). Maggi’s formulation is presented in section 11.2.1 and introduces the important concepts of null space
428
11 Constrained systems: advanced formulations
and orthogonal complements. Next, the index-1 and null space formulations are introduced in sections 11.2.3 and 11.2.5, respectively. Finally, Udwadia and Kalaba’s formulation is presented in section 11.2.7. A comparison of the features of these various approaches is the focus of section 11.2.8. 11.2.1 Maggi’s formulation While the original derivation of Maggi’s formulation appeared in 1896 [51], then again in 1901 [52], it is only in 1972 that it is presented in the English literature by Neimark and Fufaev [53]. More recently, Kurdila et al. [54] and Papastavridis [55] have shown the relevance of this formulation to computational methods for constrained multibody systems. The formulation begins with the deſnition of a set of (n − m) kinematic characteristics, denoted e, that satisfy the following relationships B (m×n) (q, t) b (q, t) 0(m) q˙ + ˇ (m) . (11.4) = ˇ e(n−m) b(n−m) (q, t) B ((n−m)×n) (q, t) The ſrst m equations of this system represent the constraints, eq. (11.2), and the last (n−m) equations deſne independent kinematic characteristics, which are sometimes called kinematic parameters, generalized speeds or independent quasi-velocities. In general, these quantities cannot be integrated, i.e., no p exist such that p˙ = e. The choice of the kinematic parameters, a crucial aspect of the procedure, is left to the analyst. The number of degrees of freedom of the system is n − m. The matrix ˇ deſnes a linear transformation that is assumed to be invertible; formed by B and B this implies a full rank constraint matrix and a judicious choice of the kinematic parameters. The following notation is used B m×n (q, t) B n×n = ˇ . (11.5) B ((n−m)×n) (q, t) The inverse of this matrix is denoted B−1 = Γˇ (n×m) (q, t) Γ (n×(n−m)) (q, t) . n×n
(11.6)
The generalized velocities are now readily expressed in terms of the kinematic characteristics as b 0 q˙ = Γˇ (q, t) Γ (q, t) − ˇ = Γ e − (Γˇ b + Γ ˇb) = Γ e − d(q, t). (11.7) e b Because eqs. (11.4) and (11.7) are inverse of each other, the following relationships must be satisſed B ˇ I0 B Γˇ B Γ Γ = , (11.8) = Γ ˇ ˇ ˇ ˇ 0I B BΓ BΓ
11.2 Algebraic elimination of Lagrange’s multipliers
429
and ˇ = I. Γˇ B + Γ B
(11.9)
Matrix Γ plays a central role in Maggi’s formulation. The property B Γ = 0 implies that Γ spans the null space of the constraint matrix; B and Γ are orthogonal complements. Next, the generalized accelerations are expressed in terms of the kinematic characteristics by taking a time derivative of eq. (11.7), leading to ˙ ¨q = Γ e˙ + Γ˙ e − d.
(11.10)
The governing equations of the system can be expressed in terms of the kinematic characteristics by introducing eq. (11.10) into eq. (11.1), and pre-multiplying by Γ T to ſnd ˙ (Γ T M Γ ) e˙ + (Γ T M Γ˙ ) e = Γ T F + Γ T M d. (11.11) Lagrange’s multipliers have been eliminated from Maggi’s equations: in view of eq. (11.8), the term Γ T B T λ vanishes. The choice of kinematic characteristics is not unique. Ignoring, for simplicity, arrays b and ˇb, eq. (11.7) becomes q˙ = Γ e: the generalized velocities are uniquely deſned, any set of linearly independent vectors can be selected to span the null space, Γ , each leading to a new set of kinematic characteristics. Maggi’s formulation consists of eqs. (11.11) and (11.7), which form a system of (2n − m), ſrst-order ODEs for the (2n − m) unknown, e and q. Maggi’s formulation eliminates Lagrange’s multipliers from Lagrange’s equations of the ſrst kind. It is possible, however, to compute these multipliers once Maggi’s equations have been solved. Properties (11.8) imply that B Γˇ = I, hence, T T multiplying eq. (11.1) by Γˇ yields λ = −Γˇ (M ¨ q − F ). Introducing eq. (11.10) then leads to T λ = −Γˇ M Γ e˙ + M Γ˙ e − M d˙ − F . (11.12) Example 11.1. The simple pendulum Figure 10.5 depicts a point of mass m and coordinates x and y, constrained to remain at a distance from an inertial point O, and discussed in section 10.4. The generalized coordinates of the pendulum are q T = x, y . The constraint condition is C = (q T q − 2 )/2 and the constraint matrix is now B = [x/, y/] .
(11.13)
A ſrst form of the equations of motion for this constrained problem is Lagrange’s
equations of the ſrst kind, eq. (11.1), or M ¨q − B T λ = mg, where gT = g 1, 0 , g the acceleration of gravity and M = diag(m, m) the mass matrix of the system. To derive Maggi’s equations for the pendulum, the kinematics of the problem are recast in the form of eq. (11.4), as 1 0 xy = q, ˙ e −y x
430
11 Constrained systems: advanced formulations
ˇ where the second row of the matrix on the right-hand side deſned matrix B. The single kinematic characteristic of the system was deſned as e = (−y x˙ + xy)/. ˙ To interpret this quantity, the coordinates of the point mass are expressed in ˙ terms of angle θ, such that x = cos θ and y = sin θ. It then follows that e = θ, i.e., the kinematic characteristic is related to the angular velocity of the pendulum. The relationship between the generalized velocities and the kinematic characteristics give rise to the matrices Γˇ and Γ , deſned in eq. (11.7), 1 x 1 −y , Γ = . Γˇ = x y It is easily veriſed that the following relationship holds, as implied by eq. (11.8). B ˇ 1 10 x y 1 x −y ˇ Γ Γ = −y x y x = 0 1 . B Simple algebraic manipulations show that Γ T M Γ = m and Γ T M Γ˙ = 0. The equations of motion of the system then follow from eq. (11.11) as me˙ = Γ T mg.
(11.14)
It is easily veriſed that this equation is indeed the equation of motion of the pendulum: θ¨ + g/ sin θ = 0. Note that the right-hand side of Maggi’s equation involves Γ (q), a function of the generalized coordinates q. Example 11.2. Quick return mechanism The quick return mechanism shown in ſg. 10.6 consists of a uniform crank of length Lc and mass mc , and of a uniform arm of length La and mass ma . The crank is pinned at point R and the arm at point O; the distance between these two points is denoted d. At point S, a slider allows the tip of the crank to slide along the arm. Mass M is attached at point T, the tip of the arm. A spring of stiffness constant k connects the tip of the arm, point T, to ſxed point A; the spring is un-stretched when the arm is in the vertical position. This problem was treated in examples 8.15 and 10.5 on pages 329 and 396, respectively, using Lagrange’s formulation with one and two generalized coordinates, respectively. Use Maggi’s formulation to derive the equations of motion of the system with two generalized coordinates, angles θ and φ. Considering triangle ORS, it is clear that β = φ − θ, and the law of sines then yields Lc sin(φ − θ) = d sin φ. This equation expresses the kinematic constraint between the two generalized coordinates of this single degree of freedom problem, C = d sin φ − Lc sin(φ − θ) = 0, and the constraint matrix is given by eq. (10.14). To start Maggi’s formulation, the following linear transformation is constructed dCφ − Lc Cφ−θ L C 0 θ˙ θ˙ = B ˙ = c φ−θ . (11.15) e 1 0 φ˙ φ The ſrst equation represents the velocity level constraint, C˙ = 0, and the ſrst line of matrix B is the constraint matrix deſned by eq. (10.14). The second line of matrix
11.2 Algebraic elimination of Lagrange’s multipliers
431
˙ chosen to be the B deſnes the single kinematic characteristic of the problem, e = θ, ˙ Of course, the choice of the kinematic characteristic angular velocity of the crank, θ. is not unique. For instance, the kinematic characteristic could be chosen to be the ˙ angular velocity of the arm, e = φ. Inverting eq. (11.15) yields 1 θ˙ 0 Lc Cφ−θ − dCφ 0 −1 0 = B = . e e Lc Cφ−θ φ˙ Lc Cφ−θ − dCφ −1 The ſrst column of matrix B −1 deſnes matrix Γˇ , and the second column deſnes the null space of the constraint matrix, Γ . Maggi’s formulation calls for the evaluation of the following terms ΓTM Γ =
ma 2 ¯ 2 mc L2c + (M + )La h , 3 3
Γ T M Γ˙ = −(M +
ma 2 ¯ )La h¯ g e, 3
¯ = (L ¯ c Cφ−θ )/(L ¯ c Cφ−θ − Cφ ), and g¯ = L ¯ c (Cφ Sφ−θ + ¯ c = Lc /d, h where L 2 ¯ ¯ Sθ h)/(Lc Cφ−θ − Cφ ) . In non-dimensional form, Maggi’s equations become ⎧ ⎫ ⎧ ⎫ e2 )/γ ⎬ ⎨ e¯⎬ ⎨(α + β¯ θ e¯ . (11.16) = ⎩ ⎭ ⎭ ⎩ ¯h¯ φ e ¯ φ , μa = ma /M , μc = ¯ a = La /d, α = hS The following notations were deſned: L mc /M , and β = (1 +
μa ¯ )h¯ g, 3
γ=
¯2 μc L μa ¯ 2 c )h , + (1 + 2 ¯ 3 3La
e¯ = e/ω.
The non-dimensional time τ = ωt, where ω 2 = k/M , was introduced and notation (·) indicates a derivative with respect to τ . Maggi’s equations, eqs. (11.16), take the form of three coupled differential equations for the kinematic characteristic and the two generalized coordinates of the problem. Because Lagrange’s multiplier was eliminated, the equations are ordinary differential equations instead of the algebraic-differential equations that characterize Lagrange’s equations of the ſrst kind, eqs. (10.15). Maggi’s equations can be integrated using classical numerical tools developed for the solution of ordinary differential equations. It is left to the reader to verify that the dynamic response predicted by Maggi’s formulation is identical to that obtained using Lagrange’s formulation, see ſgs 8.10 to 8.12. Of course, slight discrepancies between the two predictions are expected due to the approximate nature of numerical solution techniques. Maggi’s formulation enforces velocity level constraints, C˙ = 0, at each instant in time. Indeed, the velocity level constraint is the third equation of system (11.16). An exact solution of these governing equations would then imply C = 0, provided
432
11 Constrained systems: advanced formulations
10-3
10
4000
-4
3000
-5
LAGRANGE MULTIPLIER
CONSTRAINT VIOLATION
10
10-6 10
-7
10
-8
10
10
-9
-10
10
10
-11
1000 0 -1000 -2000 -3000
-12
10
2000
-13
0
0.2
0.4
0.6
t
0.8
1
1.2
Fig. 11.1. Constraint violation versus nondimensional time. Solid line, Ns = 100; dashed line, Ns = 200; dashed-dotted line, Ns = 400; dotted line, Ns = 800.
-4000
0
0.2
0.4
0.6
t
0.8
1
1.2
Fig. 11.2. Time history of Lagrange’s multiplier.
that the constraint is satisſed at time τ = 0. An approximate solution of the same equations results in an approximate satisfaction of the holonomic constraint equation, C ≈ 0. Figure 11.1 shows the time history of the constraint violation, i.e., C as a function of non-dimensional time. Maggi’s equations were integrated for τ ∈ [0, 1.2] using a fourth-order Runge-Kutta algorithm [5] with Ns =100, 200, 400, and 800 time steps of equal size. As the time step size is reduced, the constraint violation decreases; note the logarithmic scale along the vertical axis. The same data is also presented in table 11.1 on page 441: the constraint violations at time τ = 1 are listed in the second column of the table for the four time step sizes. For small time step sizes, accurate solutions are obtained that closely satisfy the constraint conditions. For long term simulations, the constraint violation will keep increasing as the simulation proceeds: this is known as the drift phenomenon. For short time simulations, the drift phenomenon can be overcome by using small time step sizes. For longer time simulations, the use of small time step might become unpractical, and constraint violation stabilization techniques should be used, see section 12.3. Although Lagrange’s multipliers have been eliminated from Maggi’s formulation, they can be evaluated with the help of eq. (11.12). The following quantities are computed ſrst, ¯2 ma L2a h T Γˇ M Γ = −(M + ) , 3 Lc Cφ−θ
¯ ma L2a h T ) Γˇ M Γ˙ = (M + g¯e. 3 Lc Cφ−θ
Equation (11.12) then yields the desired multipliers, expressed here in nondimensional form as ¯2 μa ¯ 2 L a 2 ¯= λ = ) h −α + (1 + . e ¯ − β¯ e λ ¯ 2 Cφ−θ kLc 3 L c
Figure 11.2 shows the time history of the non-dimensional multiplier.
11.2 Algebraic elimination of Lagrange’s multipliers
433
Example 11.3. The 9 degree of freedom rigid body Consider a rigid body moving in space while one of its points, denoted O, remains ſxed, as depicted in ſg. 11.3. The orientation of the body is determined by an orthonormal basis E = (¯ e1 , e¯2 , e¯3 ); nine generalized coordinates will be used to represent the conſguration of the body, the nine components of unit vectors e¯1 , e¯2 , and e¯3 . The inertial position vector of an arbitrary point P of the body is r P = s∗1 e¯1 + s∗2 e¯2 + s∗3 e¯3 , where s∗ are the components of the position vector of point P with respect to point O, resolved in basis E. The kinetic energy of the rigid body is readily found as K = 1/2 q˙ T M ∗ q˙ , where array q stores the generalized coordinates of the system q T = e¯T1 , e¯T2 , e¯T3 , and the mass matrix is ⎡ ⎤ M11 I M12 I M13 I M ∗ = ⎣M12 I M22 I M23 I ⎦ . M13 I M23 I M33 I ' where the quantities Mij = V ρs∗i s∗j dV are closely related to the components of the tensor of moments of inertia, ρ is the material density, and V the volume of the body.
e3 O
e2
s e1
i3
P R
i1
I
i2
Fig. 11.3. Conſguration of the rigid body.
The kinematics of the rigid body are deſned by the nine generalized coordinates stored in array q. Clearly, six constraints must be imposed: three conditions on the normality of vectors e¯1 , e¯2 , and e¯3 , and three additional constraints enforcing their orthogonality. The array of holonomic constraints and the constraint matrix then become ⎫ ⎧ T ⎡ T T T⎤ e¯1 0 0 (¯ e1 e¯1 − 1)/2⎪ ⎪ ⎪ ⎪ ⎪ ⎪ T T T⎥ T ⎪ ⎪(¯ ⎢ 0 e e ¯ − 1)/2 ⎪ ⎪ 2 ⎪ ⎪ ⎢ T e¯2T 0T ⎥ ⎬ ⎨ 2T ⎢ (¯ e3 e¯3 − 1)/2 0 0 e¯3 ⎥ ⎥, , and B(q) = ⎢ (11.17) C= T ⎢ e¯2 e¯3 0T e¯T3 e¯T2 ⎥ ⎪ ⎪ ⎪ ⎥ ⎪ ⎢ ⎪ ⎪ T T T T ⎪ ⎪ ⎦ ⎣ e¯1 e¯3 ⎪ ⎪ e¯3 0 e¯1 ⎪ ⎪ ⎭ ⎩ e¯T1 e¯2 e¯T2 e¯T1 0T respectively.
434
11 Constrained systems: advanced formulations
The ſrst form of the equations of motion for this constrained problem are Lagrange’s equations of the ſrst kind, eq. (11.1), M ∗ ¨q − B T λ = F a , where F a are the externally applied forces. To apply Maggi’s formulation, the kinematic characteristics of the problem are selected as the components of the angular velocity resolved in the material frame. This means that e = ω ∗ = 1/2 e¯T3 e¯˙ 2 − e¯T2 e¯˙ 3 , e¯T1 e¯˙ 3 − e¯T3 e¯˙ 1 , e¯T2 e¯˙ 1 − e¯T1 e¯˙ 2 , see ˇ eq. (4.53), leading to the following expression for matrix B, ⎤ ⎡ eT2 0 e¯T3 −¯ 1 ˇ t) = ⎣−¯ eT3 0 e¯T1 ⎦ , B(q, 2 T T e¯2 −¯ e1 0 and b = 0, ˇb = 0. The relationship between the generalized velocities and the kinematic characteristics give rise to matrices Γˇ and Γ , deſned in eq. (11.7), ⎡ ⎡ ⎤ ⎤ e¯1 0 0 0 e¯3 /2 e¯2 /2 0 −¯ e3 e¯2 e1 ⎦ . Γˇ = ⎣ 0 e¯2 0 e¯3 /2 0 e¯1 /2⎦ , Γ = ⎣ e¯3 0 −¯ (11.18) 0 0 e¯3 e¯2 /2 e¯1 /2 0 −¯ e2 e¯1 0 It is readily veriſed that the properties deſned by eqs. (11.8) and (11.9) are satisſed; in particular, B Γ = 0: Γ is the null space of the constraint matrix. The equations of motion of the system now follow from eq. (11.11); simple algebraic manipulations show that Γ T M ∗ Γ = I ∗ , where I ∗ are the components of the mass moment of inertia tensor of the rigid body resolved in the material frame. where d = 1/2 tr(I ∗ )I − I ∗ e, leading to Furthermore, Γ T M ∗ Γ˙ = d, I ∗ e˙ + eI ∗ e = M ∗O , where M ∗O are the components of the externally applied moment computed with respect to point O, expressed in the material frame. As expected, these equations are Euler’s equations for the rigid body. The choice of the kinematic characteristics is not unique. Indeed, instead of the components of the angular velocity vector resolved in the material frame, the components of the same vector in the inertial frame could have been selected as kinematic characteristics, e = ω = ( e1 e¯˙ 1 + e2 e¯˙ 2 + e3 e¯˙ 3 )/2. This choice is associated with the following matrices ⎡ T⎤ e1 1 ˇ t) = e1 e2 e3 , Γ = ⎣eT2 ⎦ . B(q, (11.19) 2 eT3 The derivation of Maggi’s equations based on this set of kinematic characteristics is left to the reader as an exercise. Clearly, the null space of the constraint matrix is unique; the choice of a set of linearly independent vectors spanning this null space, however, is not unique. Two speciſc choices were pointed out here: they correspond to the columns of matrix Γ as deſned in eqs. (11.18) and (11.19), respectively.
11.2 Algebraic elimination of Lagrange’s multipliers
435
Example 11.4. The skateboard Figure 10.11 depicts the simpliſed conſguration of a skateboard of mass m and moment of inertia I about its center of mass G, as discussed in examples 9.6 and 10.6. The skateboard rolls without sliding on the horizontal plane by means of a wheel aligned with axis e¯1 of the skateboard and located at point C, a distance from the center of mass. The position vector of the center of mass is written as r G = x ¯ı1 +y ¯ı2 , and the axis of the skateboard makes an angle θ with the horizontal.
Let the generalized coordinates of the problem be q T = x, y, θ . The system is subjected to a constraint: because the wheel does not slip, the velocity vector of the contact point must be along axis e¯1 . The velocity of point C is v C = x˙ ¯ı1 + y˙ ¯ı2 + θ˙ e¯2 , and hence, the constraint is e¯T2 v C = 0, leading to the constraint matrix B = [− sin θ, cos θ, ] .
(11.20)
This constraint is nonholonomic. The ſrst form of the equations of motion for this constrained problem are Lagrange’s equations of the ſrst kind, eq. (11.1), or M ¨q − B T λ = 0, where the mass matrix is a diagonal matrix: M = diag(m, m, I). To derive Maggi’s equations for the skateboard, the kinematics of the problem are recast in the form of eq. (11.4), as ⎡ ⎤ ⎡ ⎤ 0 − sin θ cos θ ⎣e1 ⎦ = ⎣ cos θ sin θ 0⎦ q, ˙ (11.21) 0 0 1 e2 ˇ is deſned by the last two line of the matrix appearing on the where matrix B(q) right-hand side of this equation. The kinematic characteristics of the problem were selected as e1 = x˙ cos θ + y˙ sin θ, the velocity of the wheel in the driving direction, ˙ the angular velocity of the skateboard. and e2 = θ, The relationship between the generalized velocities and the kinematic characteristics give rise to the matrices Γˇ and Γ , deſned by eq. (11.7), ⎤ ⎤ ⎡ ⎡ − sin θ cos θ sin θ (11.22) Γˇ = ⎣ cos θ⎦ , Γ = ⎣ sin θ − cos θ⎦ . 0 0 1 Here again, matrix Γ deſnes the null space of the constraint matrix B since B Γ = 0. The equations of motion of the system now follow from eq. (11.11); simple algebraic manipulations yield Maggi’s equations as m 0 0 me2 e = 0. e˙ + (11.23) 0 I + m2 −me2 0 11.2.2 Problems Problem 11.1. Bar sliding on guides A homogeneous bar of length L and mass m slides on two guides at its end points, as shown in ſg. 10.10. At the left end, the bar is connected to a spring of stiffness constant k that is unstretched when the bar is horizontal. At the right end, the bar is connected to a point mass M .
436
11 Constrained systems: advanced formulations
Gravity acts along axis ¯ı2 . This single degree of freedom problem will be represented using three generalized coordinates: x, y, and θ. (1) Derive the constraints among the three generalized coordinates and the constraint matrix. (2) Use Lagrange’s formulation for constrained systems to derive the equations of motion of the system. Interpret the physical meaning of the multipliers. (3) Derive and solve Maggi’s equations. For this problem, the angular velocity of the bar is a good choice for the kinematic characteristic. (4) On one graph, plot the time history of x and y. (5) Plot the time history of angle θ. (6) On one graph, plot the time histories of x˙ and y. ˙ (7) Plot the time history of angular velocity of the bar. (8) On one graph, plot the time history of the constraint violations. Comment on your results. (10) Plot the time history of the total mechanical energy. (11) On one graph, plot the time history of Lagrange’s multipliers. Use the following data: L = 0.45 m; m = 5 and M = 5 kg; k = 150 N/m; acceleration of gravity g = 9.81 m/s2 . At the initial time, x = 0, θ = 0 and the system is at rest. Plot the results for t ∈ [0, 2] s.
Problem 11.2. Bar sliding on guides Repeat the previous problem using two generalized coordinates only, x and y.
Problem 11.3. Crank-slider mechanism The crank-slider mechanism depicted in ſg. 9.8 consists of a uniform crank of length L1 and mass m1 connected to the ground at point O; let θ be the angle from the horizontal to the crank. At point B, the crank connects to a uniform linkage of length L2 and mass m2 that slides along point P, a ſxed point in space, located at a distance d from point O. Let w denote the distance from point B to point P and φ the angle from the horizontal to link BP. A linear spring, not shown on the ſgure, connects point B to the support at point P; the strain energy of this spring is V = 1/2 kw2 . The system is represented by three generalized coordinates: θ, φ, and w. (1) Derive the constraints among the three generalized coordinates and the constraint matrix. (2) Use Lagrange’s formulation to derive the equations of motion of the constrained system. Interpret the physical meaning of the multipliers. (3) Derive and solve Maggi’s equations. For this problem, the angular velocity of the crank is a good choice for the kinematic characteristic. (4) Plot the time history of the kinematic characteristic. (5) On one graph, plot the time histories of angles θ and φ. (6) On one graph, plot the time histories of angular velocities of the two bars. (7) Plot the time history of w. (8) Plot the time histories of w. ˙ (9) On one graph, plot the time history of the constraint violations. Comment on your results. (10) Plot the time history of the total mechanical energy. (11) On one graph, plot the time history of Lagrange’s multipliers. Use the following data: L1 = 0.25, L2 = 0.75 and d = 0.35 m; m1 = 1.5 and m2 = 4 kg; k = 10 kN/m. At the initial time, x = 0, θ = 0 and the angular velocity of the ſrst bar is ω1 = 100 rad/s. Plot the results for t ∈ [0, 0.10] s.
Problem 11.4. The crank piston mechanism The crank slider mechanism depicted in ſg. 10.4 comprises a bar of length L1 and mass m1 connected to the ground at point O by means of a hinge. At point A, a hinge connects the ſrst bar to a second bar of length L2 and mass m2 . A slider of mass M , that is constrained to move in the horizontal direction, is connected to this second bar. A spring of stiffness constant k connects the slider to the ground and is un-stretched when the two bars are aligned. This system will be represented with three generalized coordinates: x and y, the coordinates of point A and z, the horizontal position of point B. (1) Write the constraint equations for this problem. (2) Use Lagrange’s formulation to derive the equation of motion of the system. Interpret the physical meaning of the multipliers. (3) Derive and solve Maggi’s equations for this problem. It will be convenient to use the angular velocity of the ſrst bar as the kinematic characteristic. (4) Plot the kinematic characteristic as a function of time. (5) On one graph,
11.2 Algebraic elimination of Lagrange’s multipliers
437
plot the three generalized coordinates as a function of time. (6) On one graph, plot the time history of the constraint violations. Comment on your results. (7) Plot the time history of the total mechanical energy. (8) On one graph, plot the time history of Lagrange’s multipliers. Use the following data: L1 = 0.25 and L2 = 0.45 m; m1 = 1.5, m2 = 2.5, M = 10 kg; k = 100 N/m. At the initial time, x = 0, y = L1 and the angular velocity of the ſrst bar is ω1 = 100 rad/s. Plot the results for t ∈ [0, 0.25] s.
Problem 11.5. Slider-arm mechanism Consider the mechanism depicted in ſg. 11.4 that coni3 x2 sists of a slider of mass M free to move along axis ¯ı1 and connected to arm AP of length La and mass ma . x3 O The arm is free to rotate in the plane normal to ¯ı1 . A k x1 i2 P spring of stiffness constant k is attached to the slider La, ma g and gravity acts in the direction indicated on the ſgure. A This system will be represented with three generalized i1 M coordinates, the Cartesian coordinates of point P, x1 , x2 , and x3 . (1) Derive the constraints among the three Fig. 11.4. Slider with arm mechageneralized coordinates and the constraint matrix. (2) nism. Use Lagrange’s formulation to derive the equation of motion of the system. Interpret the physical meaning of the multipliers. (3) Derive Maggi’s equations for this problem.
Problem 11.6. The spatial mechanism The spatial mechanism depicted in ſg. 10.9 consists of a crank of length Lc and mass mc attached to the ground at point A and rotating about axis ¯ı1 . The crank moves in plane (¯ı2 , ¯ı3 ). A rigid arm of length La and mass ma connects point P, at the tip of the crank, to point Q that is free to slide along axis ¯ı1 . The slider at point Q has a mass M . The generalized coordinates of the problem are y and z, deſning the position of point P and x, deſning the position of point Q. (1) Write the constraint equations for this problem. (2) Use Lagrange’s formulation to derive the equation of motion of the system. Interpret the physical meaning of the multipliers. (3) Derive Maggi’s equations for this problem. It will be convenient to use the angular velocity of the ſrst bar as the kinematic characteristic. (4) Plot the kinematic characteristic as a function of time. (5) On one graph, plot the three generalized coordinates as a function of time. (6) On one graph, plot the three generalized velocities as a function of time. (7) On one graph, plot the time history of the constraint violations. Comment on your results. (8) Plot the time history of the total mechanical energy. Use the following data: h = 0.15, Lc = 0.50 and La = 0.75 m; mc = 1.4, ma = 5, M = 125 kg; k = 500 N/m. At the initial time, y = 0, y = Lc + h and the angular velocity of the crank is ω1 = 100 rad/s. Plot the results for t ∈ [0, 0.20] s.
Problem 11.7. The quick return mechanism The quick return mechanism shown in ſg. 10.6 consists of a uniform crank of length Lc and mass mc , and of a uniform arm of length La and mass ma . The crank is pinned at point R and the arm at point O; the distance between these two points is denoted d. At point S, a slider allows the tip of the crank to slide along the arm. A mass M is attached at point T, the tip of the arm. A spring of stiffness constant k connects the tip of the arm, point T, to ſxed point A; the spring is un-stretched when the arm is in the vertical position. The generalized coordinates of the problem are the angles θ and φ as deſned on the ſgure. (1) Derive the constraints among the two generalized coordinates and the constraint matrix. (2) Use Lagrange’s formulation to derive the equation of motion of the system. Interpret the physical meaning of the multiplier.
438
11 Constrained systems: advanced formulations
(3) Derive Maggi’s equations for this problem. Use the angular velocity of the arm as the kinematic characteristic. (4) Plot the kinematic characteristic as a function of time. (5) On one graph, plot the angles θ and φ as a function of time. (6) On one graph, plot the angular velocities of the two bars as a function of time. (7) On one graph, plot the time history of the constraint violations. Comment on your results. (8) Plot the time history of the normal force at point S. (9) Plot the time history of the total mechanical energy. Use the following data: ¯ a = La /d = 3 m; μc = mc /M = 2.4, μa = ma /M = 1. ¯ c = Lc /d = 0.50 and L L At the initial time, θ = 0 and θ = 2, where (·) indicates a derivative with respect to the non-dimensional time τ = ωt, ω 2 = k/M . Plot the results for τ ∈ [0, 1.2].
Problem 11.8. Two bar linkage tracking a curve Figure 10.8 shows a planar two bar linkage tracking curve C. The ſrst bar, of length L1 and mass m1 , is connected to the ground at point O. The second bar, of length L2 and mass m2 , connects to the ſrst bar at point A and tracks curve C at point P. A concentrated mass, M , is located at point P and an elastic spring of stiffness constant constant k connects this mass to point R. Curve C is described by its coordinates x(η) and y(η), where η deſnes an arbitrary parametrization of the curve. This system will be represented by three generalized coordinates: angles θ and φ, as deſned on the ſgure, and η, the parameter along curve C. (1) Write the constraint equations for this problem. (2) Use Lagrange’s formulation to derive the equation of motion of the system. Interpret the physical meaning of these multipliers. (3) Derive and solve Maggi’s equations for this problem. It will be convenient to use the angular velocity of the ſrst bar as the kinematic characteristic. (4) Plot the kinematic characteristic as a function of time. (5) On one graph, plot angles θ and φ as a function of time. (6) Plot η as a function of time. (7) On one graph, plot the angular velocities of the two bars. (8) Plot η˙ as a function of time. (9) On one graph, plot the time history of the constraint violations. Comment on your results. (10) Plot the time history of the total mechanical energy. (11) On one graph, plot the time history of Lagrange’s multipliers. Use the following data: d = 1, L1 = 0.20 and L2 = 1.50 m; m1 = 1.2, m2 = 1.5, M = 25 kg; k = 500 N/m. At the initial time, θ = 0, η > 0 and the angular velocity of the ſrst bar is ω1 = 25 rad/s. The curve is deſned as x(η) = d + aη, y(η) = bη 2 , where a = 1 and b = 2; note that η is not the intrinsic parametrization of the curve. Plot the results for t ∈ [0, 1] s.
11.2.3 Index-1 formulation Consider the system of equations consisting of the Lagrange’s equations of the ſrst kind, eqs. (11.1), and the acceleration level constraints, eq. (11.3), written in a matrix form as M B T ¨q F = . (11.24) c λ B 0 This system is now an index-1 set of DAEs, which can be formally solved for the accelerations and Lagrange’s multipliers. This system is equivalent to Lagrange’s equations of the ſrst kind, eq. (11.1), if and only if the initial conditions of the problem satisfy the constraint conditions, i.e., C(q 0 , t0 ) = 0 and D(q 0 , q˙ 0 , t0 ) = 0 for holonomic and nonholonomic constraints, respectively, where q 0 = q(t0 ) and q˙ 0 = q˙ (t0 ) are the initial conditions of the problem. It is easily shown that system (11.24) has a unique solution if and only if matrix B is of full rank and the mass matrix is invertible, see e.g. Nikravesh [56]. To start,
11.2 Algebraic elimination of Lagrange’s multipliers
439
the ſrst equations are pre-multiplied by B M −1 and accelerations are eliminated with the help of the second equations, to ſnd Lagrange’s multipliers as λ = −(B M −1 B T )−1 (c − B M −1 F ).
(11.25)
Next, the multipliers are introduced in the ſrst equations to yield the accelerations of the system as ! "−1 ! " M ¨q = F + B T B M −1 B T c − B M −1 F ,
(11.26)
and the forces of constraint, F c = B T λ, become Fc =
! "−1 ! " c − B M −1 F . −B T B M −1 B T
(11.27)
System (11.26) is now a system of second-order ODEs, which can be solved to predict the dynamic behavior of the system. These equations have appeared in Hemami and Weimer [57], L¨otstedt et al. [58, 48] and Gear et al. [59] Example 11.5. Quick return mechanism The quick return mechanism shown in ſg. 10.6 consists of a uniform crank of length Lc and mass mc , and of a uniform arm of length La and mass ma . The crank is pinned at point R and the arm at point O; the distance between these two points is denoted d. At point S, a slider allows the tip of the crank to slide along the arm. A mass M is attached at point T, the tip of the arm. A spring of stiffness constant k connects the tip of the arm, point T, to ſxed point A; the spring is un-stretched when the arm is in the vertical position. This problem was treated in examples 8.15 and 10.5 on pages 329 and 396, respectively, using Lagrange’s formulation with one and two generalized coordinates, respectively. The same problem was treated with Maggi’s formulation in example 11.2 on page 430. Use the index-1 formulation to derive the equations of motion of the system with two generalized coordinates, angles θ and φ. The kinematics of the problem were presented in the previous examples dealing with this system and will not be repeated here. In the index-1 formulation, constraints are enforced at the acceleration level. For the quick return mechanism, the kinematic constraint is C = d sin φ − Lc sin(φ − θ) = 0; the velocity level constraint is the C˙ = B q˙ = 0 where the constraint matrix is deſned in eq. (10.14). Finally, using the notation of eq. (11.3), the acceleration level constraint is C¨ = B ¨ q − c = 0, where 2 2 ˙ ˙ ˙ ˙ c = −B q˙ = Lc Sφ−θ (φ − θ) − dSφ φ . The equations of motion written in index-1 form, eq. (11.24), are now ⎫ ⎡ ⎤⎧ ⎫ ⎧ mc L2c /3 0 −Lc Cφ−θ ⎨ θ¨⎬ ⎨ 0 ⎬ ⎣ 0 (M + ma /3)L2a Lc Cφ−θ − dCφ ⎦ φ¨ = kL2a Sφ . (11.28) ⎭ ⎩ ⎭ ⎩ c −LcCφ−θ Lc Cφ−θ − dCφ 0 λ The ſrst step of the index-1 formulation is to solve for Lagrange’s multiplier using eq. (11.25) to ſnd
440
11 Constrained systems: advanced formulations
¯2 μc L 1 + μa /3 c ¯ ¯ ¯ Sφ − ¯ c¯ Lc (Lc Cφ−θ − Cφ )λ = 3γ Lc Cφ−θ − Cφ
(11.29)
where the various non-dimensional quantities appearing in this expression were deſned in example 11.2 and c¯ =
Mc ¯ c Sφ−θ (φ − θ )2 − Sφ φ2 . =L kd
(11.30)
The governing equations of the problem are given by eqs. (11.26), which become ⎤ ¯2 μc L c ¯ 2c ¯ 0 c μc L (1 + μa /3)¯ 0 h ⎣ 3 ⎦ θ = . ¯ 2a Sφ + 3γ Sφ − L μa ¯ 2 ¯ c Cφ−θ − Cφ L −1 φ )La 0 (1 + 3 ⎡
These equations form a set of ordinary differential equations for the two generalized coordinates of this problem; Lagrange’s multiplier is eliminated from the formulation. The index-1 equations can be integrated using classical numerical tools developed for the solution of ordinary differential equations. The dynamic response predicted by the index-1 formulation is identical to that obtained by Lagrange’s or Maggi’s formulations, see ſgs 8.10 to 8.12. Of course, due to the approximations inherent to numerical solution techniques, slight discrepancies between the various predictions should be expected. 4000 10
-2
10
-4
-6
10
10
LAGRANGE MULTIPLIER
CONSTRAINT VIOLATION
3000 10
-8
-10
10
2000 1000 0 -1000 -2000 -3000
-12
0
0.2
0.4
0.6
t
0.8
1
1.2
Fig. 11.5. Constraint violation versus nondimensional time. Solid line, Ns = 100; dashed line, Ns = 200; dashed-dotted line, Ns = 400; dotted line, Ns = 800.
-4000 0
0.2
0.4
0.6
t
0.8
1
1.2
Fig. 11.6. Time history of Lagrange’s multiplier.
The index-1 formulation enforces acceleration level constraints, C¨ = 0, at each instant in time. Indeed, the acceleration level constraint is the third equation of system (11.28). An exact solution of these governing equations would then imply C = 0, provided that the displacement and velocity level constraints are satisſed at time τ = 0. An approximate solution of the same equations results in an approximate satisfaction of the holonomic constraint equation, C ≈ 0.
11.2 Algebraic elimination of Lagrange’s multipliers
441
Figure 11.5 shows the time history of the constraint violation, i.e., C as a function of the non-dimensional time. The index-1 equations were integrated using a fourthorder Runge-Kutta algorithm [5] with Ns =100, 200, 400, and 800 time steps of equal size. As the time step size is reduced, the constraint violation decreases; note the logarithmic scale along the vertical axis. The same data is also presented in table 11.1: the constraint violations at time τ = 1 are listed in the third column of the table for the four time step sizes. Table 11.1. Constraint violations for Maggi’s, index-1, and null space formulations. Number of time steps 100 200 400 800
Maggi’s formulation 3.5 × 10−05 1.7 × 10−06 1.0 × 10−07 6.0 × 10−09
Index-1 formulation 4.0 × 10−03 3.1 × 10−04 2.0 × 10−05 1.2 × 10−06
Null space formulation 4.0 × 10−03 3.1 × 10−04 2.0 × 10−05 1.2 × 10−06
It is important to compare columns two and three of table 11.1. These two columns list the constraint violation for Maggi’s and index-1 formulations when the same integration technique and identical time step sizes are used. Much larger constraint violations are observed for the index-1 formulation, which enforces constraints at the acceleration level, than for Maggi’s formulation, which enforces constraints at the velocity level. Clearly, larger constraint violation are expected for the formulations that enforce constraints at the acceleration level, resulting in a more pronounced drift phenomenon. Although Lagrange’s multipliers have been eliminated from the index-1 formulation, they can be evaluated with the help of eq. (11.29). Figure 11.6 shows the time history of the non-dimensional multiplier. 11.2.4 Problems Problem 11.9. Bar sliding on guides Treat problem 11.1 using the index-1 formulation.
Problem 11.10. Bar sliding on guides Treat problem 11.2 using the index-1 formulation.
Problem 11.11. Crank-slider mechanism Treat problem 11.3 using the index-1 formulation.
Problem 11.12. The crank piston mechanism Treat problem 11.4 using the index-1 formulation.
Problem 11.13. Slider-arm mechanism Treat problem 11.5 using the index-1 formulation.
442
11 Constrained systems: advanced formulations
Problem 11.14. The spatial mechanism Treat problem 11.6 using the index-1 formulation.
Problem 11.15. The quick return mechanism Treat problem 11.7 using the index-1 formulation.
Problem 11.16. Two bar linkage tracking a curve Treat problem 11.8 using the index-1 formulation.
11.2.5 The null space formulation It is also possible to solve system (11.24) in an expeditious manner with the help of the null space of the constraint matrix introduced in section 11.2.1. Pre-multiplying the ſrst equations by Γ T eliminates Lagrange’s multipliers and yields system accelerations as T T Γ M Γ F ¨q = . (11.31) B c Here again, this system forms a set of second-order ODEs, the solution of which requires the constraint matrix to be of full rank and the mass matrix to be invertible. A number of authors developed this formulation independently. Hemami and Weimer [57] introduced the orthogonal complement, Γ , of the constraint matrix to obtain eqs. (11.31) for small-scale systems, although no systematic procedure was proposed to determine the orthogonal complement. They also demonstrated the equivalence of this approach to Kane’s equations [60, 61]. Garc´Æa de Jal´on et al. [62, 63] also derived eqs. (11.31). Borri et al. [64] proposed the acceleration projection method, which decomposes the generalized acceleration as ¨q = Γ ζ + B T η. Substituting this decomposition into eqs. (11.31) express ζ and η in terms of M , B, Γ T , F and c, leading to second-order ODEs. Example 11.6. Quick return mechanism The quick return mechanism shown in ſg. 10.6 consists of a uniform crank of length Lc and mass mc , and of a uniform arm of length La and mass ma . The crank is pinned at point R and the arm at point O; the distance between these two points is denoted d. At point S, a slider allows the tip of the crank to slide along the arm. A mass M is attached at point T, the tip of the arm. A spring of stiffness constant k connects the tip of the arm, point T, to ſxed point A; the spring is un-stretched when the arm is in the vertical position. This problem was treated in examples 8.15 and 10.5 on pages 329 and 396, respectively, using Lagrange’s formulation with one and two generalized coordinates, respectively. The same problem was treated with Maggi’s and the index-1 formulations in example 11.2 and 11.5, respectively. Use the null space formulation to derive the equations of motion of the system with two generalized coordinates, angles θ and φ. The kinematics of the problem were presented in the previous examples dealing with this system and will not be repeated here. In the null space formulation, constraints are enforced at the acceleration level. For the quick return mechanism, the
11.2 Algebraic elimination of Lagrange’s multipliers
443
kinematic constraint is C = d sin φ − Lc sin(φ − θ) = 0; the velocity level constraint is the C˙ = B q˙ = 0 where the constraint matrix is deſned in eq. (10.14). Finally, using the notation of eq. (11.3), the acceleration level constraint is C¨ = B ¨q − c = 0, ˙ 2 − dSφ φ˙ 2 . where c = −B˙ q˙ = Lc Sφ−θ (φ˙ − θ) The governing equations of the problem are given by eqs. (11.31), which become 2 ¯L ¯ ¯2h ¯ Sφ ¯ 2 /3 (1 + μa /3)L μc L θ h c a a , (11.32) = ¯ ¯ −Lc Cφ−θ Lc Cφ−θ − Cφ φ c¯ where c¯ is deſned by eq. (11.30), and the various non-dimensional quantities appearing in this expression were deſned in example 11.2. These equations form a set of ordinary differential equations for the two generalized coordinates of this problem; Lagrange’s multiplier is eliminated from the formulation. The null space equations can be integrated using classical numerical tools developed for the solution of ordinary differential equations. The dynamic response predicted by the index-1 formulation is identical to that obtained by Lagrange’s, Maggi’s, or index-1 formulations, see ſgs 8.10 to 8.12. Of course, due to the approximations inherent to numerical solution techniques, slight discrepancies between the various predictions should be expected. 4000 10
-2
LAGRANGE MULTIPLIER
CONSTRAINT VIOLATION
3000 10
-4
-6
10
10
10
-8
-10
10
2000 1000 0 -1000 -2000 -3000
-12
0
0.2
0.4
0.6
t
0.8
1
1.2
Fig. 11.7. Constraint violation versus nondimensional time. Solid line, Ns = 100; dashed line, Ns = 200; dashed-dotted line, Ns = 400; dotted line, Ns = 800.
-4000
0
0.2
0.4
0.6
t
0.8
1
1.2
Fig. 11.8. Time history of Lagrange’s multiplier.
The null space formulation enforces acceleration level constraints, C¨ = 0, at each instant in time. Indeed, the acceleration level constraint is the third equation of system (11.32). An exact solution of these governing equations would then imply C = 0, provided that the displacement and velocity level constraints are satisſed at time τ = 0. An approximate solution of the same equations results in an approximate satisfaction of the holonomic constraint equation, C ≈ 0. Figure 11.7 shows the time history of the constraint violation, i.e., C as a function of the non-dimensional time. The null space equations were integrated using a fourth-order Runge-Kutta algorithm [5] with Ns =100, 200, 400, and 800 time steps
444
11 Constrained systems: advanced formulations
of equal size. As the time step size is reduced, the constraint violation decreases; note the logarithmic scale along the vertical axis. The same data is also presented in table 11.1: the constraint violations at time τ = 1 are listed in the fourth column of the table for the four time step sizes. The behavior of the constraint violations are very similar for the index-1 and null space formulations, see ſgs. 11.5 and 11.7, respectively, or the third and fourth columns of table 11.1, respectively. This observation is consistent with the fact that both index-1 and null space formulations enforce acceleration level constraints. 11.2.6 Problems The null space formulation requires the determination of the null space of the constraints. While the null space is uniquely deſned, different sets linearly independent vectors spanning the null space can be selected, leading to different equations of motion. In the problems below, use Maggi’s formulation to determine the null space; Problems 11.1 to 11.8 provide hints on how to select the kinematic parameters, thereby leading to a unique deſnition of the null space. Problem 11.17. Bar sliding on guides Treat problem 11.1 using the null space formulation.
Problem 11.18. Bar sliding on guides Treat problem 11.2 using the null space formulation.
Problem 11.19. Crank-slider mechanism Treat problem 11.3 using the null space formulation.
Problem 11.20. The crank piston mechanism Treat problem 11.4 using the null space formulation.
Problem 11.21. Slider-arm mechanism Treat problem 11.5 using the null space formulation.
Problem 11.22. The spatial mechanism Treat problem 11.6 using the null space formulation.
Problem 11.23. The quick return mechanism Treat problem 11.7 using the null space formulation.
Problem 11.24. Two bar linkage tracking a curve Treat problem 11.8 using the null space formulation.
11.2.7 Udwadia and Kalaba’s formulation The results presented in the previous section can also be recast in a more compact and general form in terms of Moore-Penrose generalized inverses. The term featuring the matrix inverse in eq. (11.26) can be written as B T (B M −1 B T )−1 =
11.2 Algebraic elimination of Lagrange’s multipliers
445
M 1/2 (B M −1/2 )T [(B M −1/2 )(B M −1/2 )T ]−1 = M 1/2 (B M −1/2 )+ , where the last equality follows from eq. (18.16). Equations (11.26) and (11.27) now become ! " M ¨q = F + M 1/2 (B M −1/2 )+ c − B M −1 F , (11.33) and Fc =
! " −M 1/2 (B M −1/2 )+ c − B M −1 F ,
(11.34)
respectively. A process similar to that developed by Udwadia et al. [65] can be used to solve the system formed by eqs. (11.31) and Γ T F c = 0, to recover eqs. (11.33) and (11.34). This underlines the close relationship between the present formulation and the null space formulation of section 11.2.5. Equations (11.33) and (11.34) were ſrst presented by Udwadia and Kalaba [66, 67, 68, 69], based on Gauss’ principle: the explicit equations of motion were expressed as the solution of a quadratic minimization problem subjected to constraint conditions at the acceleration level. In a later paper [70], the same equations were derived from d’Alembert’s principle. This formulation is more general than those presented in the previous sections, which require the constraint matrix to be of full rank, whereas the Moore-Penrose generalized inverse is unique and always exists. The same authors [71] later presented a simpler derivation of their formulation that bypasses the concepts of generalized inverses. When rank(B) = m, they proved the existence of Lagrange’s multipliers and expressed them in terms of the constraint forces; when rank(B) < m, Lagrange’s multipliers are not unique, although the constraint forces are unique. Udwadia et al. [65] presented an extended form of d’Alembert’s principle that is able to deal with nonholonomic constraints, which might be nonlinear expressions of the generalized velocities. Furthermore, they showed that the previous formulation could be derived without invoking Moore-Penrose pseudo inverses. The geometric interpretation of the results in terms of projection operators, as presented in section 11.3, appeared in ref. [72] and the relationship of this formulation to GibbsAppell’s equations was explored in ref. [73]. Finally, the same authors [74, 75] generalized the formulation to deal with mechanical system involving non-ideal constraints, i.e., constraints associated with constraint forces whose virtual work might not vanish. The textbook authored by Udwadia and Kalaba [76] gives ample details on all aspects of the formulation. 11.2.8 Comparison of the ODE formulations The formulations presented above all transform the (2n + m) ſrst-order DAEs associated with Lagrange’s equations of the ſrst kind into ODEs by eliminating Lagrange’s multipliers. Maggi’s formulation, eqs. (11.11) and (11.7), yields (2n − m) ſrst-order ODEs for the (n − m) kinematic characteristics and n generalized coordinates. The index-1, eqs. (11.26), null space, eqs. (11.31), or Udwadia and Kalaba’s, eqs. (11.33), formulations form sets of n second-order ODEs for the n generalized
446
11 Constrained systems: advanced formulations
coordinates, which could alternatively be recast as sets of 2n ſrst-order ODEs for the n generalized coordinates and n generalized velocities. The ſrst observation is that these methods decrease the size of the problem from (2n + m) for Lagrange’s equations of the ſrst kind to 2n for the index-1, null space, and Udwadia and Kalaba’s formulations, and (2n−m) for Maggi’s formulation. This dimensional reduction, however, comes at a price: the evaluation of the null space of the constraint matrix in Maggi’s and null space formulations, or the evaluation of generalized inverses in Udwadia and Kalaba’s formulation. Lagrange’s equations of the ſrst kind are typically formulated in terms of generalized coordinates that will render system matrices highly sparse, leading to efſcient solution techniques, as discussed by Orlandea et al. [77, 78]. Hence, the main advantage of the above techniques is not so much the reduction in the number of equations, but rather the change in their mathematical nature, from DAEs to ODEs. The second observation is that Maggi’s formulation enforces velocity level constraints for holonomic constraints, whereas acceleration level constraints are enforced in the other formulations. This fact has important implications for numerical implementations of these approaches: the constraint drift phenomenon will be signiſcantly more pronounced when using the latter formulations than when using Maggi’s formulation. Typically, the constraint violation stabilization techniques described in section 12.3 will be required to compensate for this drift. Although the enforcement of the constraints at the acceleration level is a widely used practice in multibody dynamics, it has the potential to adversely affect the numerical solution procedure. In fact, Campbell and Leimkuhler [79] studied the effects of differentiation of the constraints in DAEs; they concluded: “Thus, the differentiated system may be less well behaved numerically for a given method than either the original DAEs or an equivalent state-space form for that method. These numerical difſculties can take the form of increased stiffness, extraneous positive eigenvalues, and more stringent step-size restrictions.” Maggi’s formulation requires the deſnition of a set of m kinematic parameters. Since the set of linearly independent vectors spanning the null space is not uniquely deſned, the choice of kinematic parameters is not unique and the reduced equations can take a variety of forms. The same remark applies to the null space formulation. While the null space is uniquely deſned, different sets linearly independent vectors spanning the null space can be selected, leading to different equations of motion. In Maggi’s formulation, the selected set of kinematic parameters appears explicitly in the equations of motion. In contrast, these additional variables do not appear in the index-1 and null space formulations, whose equations of motion are expressed in terms of the sole generalized coordinates originally used to describe the system. Udwadia and Kalaba’s formulation presents a number of advantages over the other formulations. The Moore-Penrose generalized inverse appearing in eq. (11.33) always exists, whereas the other formulations require a full rank constraint matrix. Hence, Udwadia and Kalaba’s formulation is capable of dealing with problems featuring rank deſcient constraint matrix, such as those involving redundant constraints. Furthermore, problems with variable number of degrees of freedom, such as intermittent contact problems, or problems involving rolling and slipping, are readily
11.3 The geometric interpretation of constraints
447
treated. In contrast, such situations will be problematic for minimum set approaches, since the number of kinematic parameters must change as the constraints change or become redundant. 11.2.9 Problems Problem 11.25. Transformation of DAEs to ODEs Consider a constrained dynamical system represented by n generalized coordinates and subjected to m holonomic constraints. The following questions deal with the solution of such problem using Maggi’s, index-1, and null space formulations, which all three, transforms the equations of the problem from DAEs to ODEs. (1) Which of the three formulations is least sensitive to the drift phenomenon? Why? (2) For each of the three methods, what is the nature of the variables appearing in the ſnal set of ODEs? (3) For each of the three methods, how many variables appear in the ſnal set of ODEs? (4) For each of the three methods, in which form are the constraints enforced? (5) Are the variable appearing in Maggi’s formulation uniquely deſned? Why? 6) Is the system of equations characterizing the null space formulation uniquely deſned? Why?
11.3 The geometric interpretation of constraints Many features of the dynamic response of constrained i2 systems can be interpreted in a purely geometric manO ner. Consider the problem of a simple pendulum of mass L g m and length L, as depicted in ſg. 11.9. This single q degree of freedom system could be described by single m generalized coordinate θ. Alternatively, two generalized i1 coordinates, the Cartesian coordinates, q1 and q2 , deſne2 ing the position vector of the bob, r = q1 ¯ı1 + q2 ¯ı2 , -gSq e2 could be used, subjected to a single holonomic conc -F /m e1 straint, C = (r T r/L − L)/2 = 0. This constraint ena forces the constant length condition for the pendulum; g gCq e1 the constraint matrix is B T (q) = r/L = Cθ¯ı1 + Sθ¯ı2 , where Cθ = cos θ and Sθ = sin θ. e1 Figure 11.9 also shows the unit vectors, e¯1 and e¯2 , Fig. 11.9. Geometric inof a polar coordinate system that can be used to conterpretation of accelerations veniently compute the accelerations of the particle as for a simple pendulum. a = −Lθ˙2 e¯1 + Lθ¨ e¯2 . Since the path of the particle is a circle of radius L, motion is allowed in the tangential direction, e¯2 , but prohibited in the radial direction, e¯1 . For this particular problem, the constraint matrix is a unit vector along e¯1 and deſnes the direction in which the motion is constrained. In more general terms, the conſguration space, deſned here by the plane (¯ı1 , ¯ı2 ), is divided into two mutually orthogonal subspaces: the subspace deſned by unit vector e¯2 in which motion is allowed, and the subspace deſned by unit vector e¯1 , i.e., the subspace deſned by the constraint matrix, along which motion is prohibited. This
448
11 Constrained systems: advanced formulations
discussion provides a geometric interpretation of the constraint matrix: it deſnes a subspace of the conſguration space along which motion is not allowed. Next, Newton’s second law will be written for the particle. A free body diagram of the particle reveals that gCθ − F c /m = ar and −gSθ = aθ , where ar = −Lθ˙ 2 , and aθ = Lθ¨ are the radial and tangential components of acceleration, respectively. The geometric interpretation of these results is depicted in the right portion of ſg. 11.9. First, the constraint force is entirely contained in the subspace deſned by the constraint matrix. Next, the externally applied load has a component, −mgSθ , in the feasible direction, e¯2 ; this component actually drives the motion of the particle. The externally applied load also has a component, mgCθ , in the direction of prohibited motion, e¯1 . Finally, due to this applied load, the particle experiences an acceleration featuring components in both feasible and infeasible directions. The projection of this acceleration along the feasible direction equals the component of load applied in this direction divided by the mass of the particle, aθ = −gSθ . The projection of the acceleration along the infeasible direction equals the component of load applied in this direction divided by the mass of the particle and corrected by the acceleration associated with the constraint force, F c /m, ar = gCθ − F c /m. This discussion outlines the geometric interpretation of all the quantities involved in this simple constrained dynamical problem. Of course, at this point, these observations are limited to the simple pendulum problem presented above. In the sections below, the above results will be shown to apply to all constrained dynamical systems. This geometric interpretation of the problem has been investigated by a number of authors using similar concepts: Brauchli et al. [80, 81] and Udwadia and Kalaba [72]. To generalize the above observations, however, the concept of projection must ſrst be generalized. Instead of the orthogonal projections used in ſg. 11.9, a more general type of projection, deſned with respect to a certain metric of the conſguration space, must be deſned ſrst. 11.3.1 The orthogonal projection operator Consider a plane in a three-dimensional space; the plane is deſned by its unit normal vector n ¯ . This unit vector divides the three-dimensional space into two subspaces: the subspace spanned by n ¯ , and the subspace orthogonal to n ¯ , i.e., the subspace spanned by two mutually orthogonal vectors, u ¯ and v¯, spanning the plane normal to n ¯ , as shown in ſg. 11.10. Intuitively, the projection operator, denoted P , projects an arbitrary vector in the direction parallel to n ¯ . It is readily veriſed that in the simple case considered here, the projection operator is P = n ¯n ¯ T . Indeed ¯n ¯ T a = a cos α n ¯ = a . P a = n
(11.35)
¯ . Note that this is an orthogAs shown in ſg. 11.10, a is the projection of a along n onal projection of a along unit vector n ¯. The following terminology is now deſned: n ¯ is the image of the projector, whereas [¯ u, v¯] is the kernel or null space of the projector. Note that while the kernel
11.3 The geometric interpretation of constraints
449
of the projector is uniquely deſned as the subspace orthogonal to n ¯ , vectors [¯ u, v¯] are not uniquely deſned. Indeed, any two mutually orthogonal vectors normal to n ¯ could be selected. The meaning of this terminology is easily understood in view of the following results P n ¯=n ¯,
(11.36a)
u, v¯] = 0. P [¯
(11.36b)
Equation (11.36a) implies that the projection of the image is the image itself, and eq. (11.36b) implies that the projection of the null space vanishes. Another interpretation of these results is that the image of the projector spans its eigenaP vectors associated with unit eigenvalues, and the kernel of the projector spans its eigenvectors a n a associated zero eigenvalues, hence, the kernel is v also called the null space. Finally, it is easily veriſed that u a^ P P = P .
(11.37)
Geometrically, this corresponds to the fact that Fig. 11.10. Projection in threeonce an arbitrary vector has been projected dimensional space by a unit vector n ¯. along the image to ſnd a , any further projection of a will leave this vector unchanged. The complementary projector, denoted P ⊥ , projects an arbitrary vector in the direction perpendicular to n ¯ : a⊥ = P ⊥ a. Figure 11.10 reveals that a = a⊥ + a , and hence, P ⊥ a = (I − P )a. Since this result must hold true for any arbitrary vector a, it follows that P⊥ = I − P = I − n ¯n ¯T . (11.38) The vectors n ¯, u ¯, and v¯ form the column of an orthogonal matrix, and the following identity is readily veriſed n ¯n ¯T + u ¯u ¯T + v¯v¯T = I. Hence, P ⊥ = I − n ¯n ¯T = T T u ¯u ¯ + v¯v¯ . This last result then implies P ⊥ P = 0.
(11.39)
Finally, it is easily shown that the projections of an arbitrary vector in the two orthogonal subspaces are orthogonal to each other, as expected, aT⊥ a = aT P ⊥ P a = 0.
(11.40)
The developments presented in this section could have been started by considering an image subspace formed by the plane deſned by units vector u ¯ and v¯, the kernel or null space would then be unit vector n ¯ . All results obtained above would be recovered, except for the fact that subscripts (·) and (·)⊥ would be interchanged. The two projection operators are complementary of each other.
450
11 Constrained systems: advanced formulations
In this section, all projections were orthogonal projections: orthogonal vectors, a and b, satisfy the condition aT b = 0. It is possible to extend the concept of orthogonality: two vectors are said to be orthogonal with respect to a metric, M, of the space, if the following condition is satisſed aT M b = 0.
(11.41)
The metric, M, of the space is a symmetric, positive-deſnite matrix. In the present section, orthogonality was deſned with respect to a metric equal to the identity matrix M = I. In the next section, the projection operator concept will be extended to a space where orthogonality is deſned with respect to a non-identity metric.
11.3.2 The projection operator Instead of following the rather intuitive, geometric development used in last section, a formal deſnition of the projection operator is the starting point of this section. A projection operator, P , is an n × n linear transformation that acts like the identity on its image, P E = E, (11.42) where the image of the projection, E, is an n×m matrix such that rank(E) = m ≤ n. The projection operation is assumed to act in a space where orthogonality is deſned with respect to a metric, M, see eq. (11.41). At ſrst, the metric of the space is factorized as M = S T S, using the Cholesky factorization [82], for instance. Equation (11.42), is rewritten as (S P S −1 )(S E) = ˆ Eˆ = E, ˆ where the following scaled quantities were deſned: Eˆ = S E (S E), or P
ˆ indicates scaled quantities. The singular value deˆ = S P S −1 . Notation (·) and P composition presented in section 18.1 will be used extensively in this development. ˆ is factorized with the help of the singular value decomposition, The scaled image, E, eq. (18.10), to ſnd ˇ Σ V T. Eˆ = U (11.43) The null space or kernel, Γ , of the projection is deſned by eq. (18.8) as T Eˆ Γ = 0.
(11.44)
Introducing the singular value decomposition, eq. (11.43), into the deſnition of the ˆ U ˇ Σ V T . Since V and Σ are non singular ˇ ΣVT = U projection operator yields P matrices, it follows that ˆ =U ˇU ˇT . P (11.45) ˆ is a symmetric matrix, and furthermore, in view of This result implies that P eq. (18.6b),
11.3 The geometric interpretation of constraints
ˆ , ˆ P ˆ =P P
451
(11.46)
a characteristic property of projection operations. An explicit expression of the projector can be obtained that does not rely on the singular value decomposition. In view of the orthogonality of matrix V , eq. (11.45) is recast as T ˆ =U ˇ Σ(V T V )Σ −2 (V T V )Σ U ˇ Σ V T )(V Σ −2 V T )(V Σ U ˇ T ). ˇ = (U P (11.47) T 2 T ˆ ˆ Since E E = V Σ V , it follows that −1 T ˆ = Eˆ Eˆ T Eˆ Eˆ . P
(11.48)
The complementary projection operator, P ⊥ , is deſned as P ⊥ = I − P,
(11.49)
ˆ = I −P ˆ , where the scaled complewhere I is the identity matrix; this implies P ⊥ ˆ = S P S −1 . It is readily veriſed that mentary projection operator is deſned as P ⊥ ⊥ ˆ is a symmetric matrix and that P ˆ P ˆ )(I − P ˆ )=P ˆ , ˆ = (I − P ˆ = I −P P ⊥
⊥
⊥
⊥
two characteristic properties of projection operators. In view of property (18.6a) of ˇU ˇ T S + S −1 Γ Γ T S = I, or S −1 Γ Γ T S = the singular value decomposition, S −1 U I −P , and hence, the explicit expression for the complementary projection becomes ˆ = Γ ΓT. P ⊥
(11.50)
This shows that the image of the complementary projection is Γ , the null space or ˆ P ˆ P ˆ = P ˆ = 0, where properkernel of the projection. It is also clear that P ⊥ ⊥ ties (18.7) were used. Consider now an arbitrary vector, a, and its scaled counterpart, a ˆ = S a; the components, c, of this vector along the orthogonal basis U deſned by the singular value decomposition of matrix Eˆ are such that a ˆ = U c. In view of the partition (18.3) of U , this becomes ˇ Γ c=U ˇc +Γ c =a ˆ + a ˆ⊥ . a ˆ= U ⊥ The projections of vector a ˆ are found to be ˇc +Γ c =U ˇc =a ˇU ˇT U ˆ a ˆ = U ˆ . P ⊥ ˆ a ˇc +Γ c =Γ c =a P ˆ = Γ ΓT U ˆ⊥ . ⊥ ⊥ ⊥
(11.51)
(11.52a) (11.52b)
Clearly, the projection and the complementary projection operators project an arbitrary vector onto the image of the projector and its null space, respectively.
452
11 Constrained systems: advanced formulations
These two complementary subspaces are orthogonal in metric M; indeed, a ˆT a ˆ⊥ = T T ˆ ˆ a ˆ P P a ˆ = 0, and hence,
⊥
aT⊥ M a = 0.
(11.53)
Projection operators are closely related to the Moore-Penrose generalized inverse deſned in 18.2. Indeed, the Moore-Penrose inverse of the scaled image of the pro+ + ˆ Comparing this result to the jector, Eˆ , satisſes condition (18.11), i.e., Eˆ Eˆ Eˆ = E. + ˆ = Eˆ Eˆ , which, with the help of eq. (18.16), scaled version of eq. (11.42) yields P
ˆ = E[ ˆ Eˆ T E] ˆ −1 Eˆ T . This simple manipulation re-establishes eq. (11.48) in becomes P an expeditious manner. The fact that the projector is a symmetric operator mirrors ˆ = Eˆ Eˆ + = Eˆ +T Eˆ T . Transposing property (18.13) of the generalized inverse: P property (18.12) of the generalized inverse applied to the scaled image of the projec+T T +T +T = Eˆ , or tor leads to Eˆ Eˆ Eˆ ˆ Eˆ +T = Eˆ +T . P
(11.54)
+T This result implies that Eˆ is entirely contained in the subspace deſned by the ˆ image, E, of the projector.
Example 11.7. Orthogonal projection in three-dimensional space Consider a simple projection in three-dimensional space deſned by the image E = n ¯, where n ¯ is a unit vector, and a metric M = I, where I is the 3 × 3 identity, as depicted in ſg. 11.10. It follows that the scaled quantities are identical to their unscaled counterparts. This problem was treated in an intuitive manner in section 11.3.1; in this example, the projection operator will be obtained from the more formal derivation presented above. In view eq. (11.48), the projection operator becomes P = n ¯ (¯ nT n ¯ )−1 n ¯T = n ¯n ¯T . u v¯], where u ¯ and v¯ are two mutually orthogonal, The kernel of the projector is Γ = [¯ unit vectors contained in the plane normal to n ¯ . It satisſes eq. (11.44), n ¯ T [¯ u v¯] = 0. Equation (11.50) now yields the complementary projection operator P ⊥ = Γ Γ T = u ¯u ¯T + v¯v¯T . It is easily veriſed that P + P ⊥ = n ¯n ¯T + u ¯u ¯T + v¯v¯T = I, where the last equality holds because n ¯, u ¯, and v¯ form an orthogonal basis. Consider now an arbitrary vector a; application of the projection operators yields ¯n ¯ T a = (¯ nT a)¯ n = a cos α n ¯ = a . P a = n and P ⊥ a = (¯ uT a)¯ u + (¯ v T a)¯ v = a⊥ . These results reproduce those obtained in a more intuitive manner in section 11.3.1.
11.3 The geometric interpretation of constraints
453
Example 11.8. Orthogonal projection in three-dimensional space Consider now a second example in three-dimensional space deſned by the image E = n, where n is a non-unit vector, and a metric M = S T S. The scaled image is n ˆ = S n and, in view eq. (11.48), the scaled projection operator becomes ˆT n ˆ n T −1 T ˆ =n , ˆ (ˆ n n ˆ ) n ˆ = P ˆ n ˆ n
(11.55)
where n ˆ T /ˆ n represents the image of the projection normalized in the space of metric M. Consider now an arbitrary vector a, scaled as a ˆ = S a; application of the projection operator, eq. (11.52a), yields ˆT n ˆ n n ˆ ˆ a a ˆ = ˆ =a ˆ , P ˆ = a cos α ˆ n ˆ n ˆ n
(11.56)
ˆ , i.e., a ˆT n ˆ = ˆ aˆ n cos α. where α is the angle between a ˆ and n 11.3.3 Projection of the equations of motion The equations of motion of constrained dynamical systems have been cast in the form of eq. (11.1). Maggi’s formulation, as presented in section 11.2.1, is a purely algebraic approach to the problem; this section focuses on a more geometric interpretation of constrained dynamical problems, which has been investigated by a number of authors: Brauchli et al. [80, 81], Udwadia and Kalaba [72], Blajer and his coworkers [83, 84, 85]. Because the mass matrix is a symmetric positive-deſnite matrix, it can be factorized as M = S S T using the Cholesky factorization [82], for instance. Multiplying the governing equations of motion by S −1 then leads to ¨ˆq − Fˆ c = Fˆ ,
(11.57)
where the scaled accelerations were deſned as ¨ˆq = S T ¨q , the scaled constraint forces c as Fˆ = S −1 F c and scaled forces as Fˆ = S −1 F . Next, the scaled constraint matrix ˆ = S −1 B T . Finally, the acceleration level constraint, eq. (11.3), is is deſned as E ˆT ¨ written as B S −T S T ¨q = c, leading to the following scaled expression, E ˆq = c. A scaled projection operator is now introduced T −1 T ˆ E ˆ E ˆ ˆ . Pˆ = E E
(11.58)
ˆ In view of eq. (11.48), the image of this projector is the scaled constraint matrix, E, −1 and it operates in a space of metric M = M , i.e., the inverse of the mass matrix. By construction, the projection operator, Pˆ , projects an arbitrary vector into the subspace parallel to the constraint matrix; of course, the projection operation is
454
11 Constrained systems: advanced formulations
performed in a space endowed with a metric deſned by the inverse of the mass matrix. This basic property of the projector can be veriſed by projecting the constraint c ˆ λ, the constraint forces are entirely contained in forces: since Fˆ = S −1 B T λ = E the subspace deſned by the constraint matrix, and hence, the projection operation has no effect c ˆ c. Pˆ Fˆ = F (11.59) ˆE ˆ T E] ˆ −1 E ˆT ¨ The projection of the scaled accelerations yields Pˆ ¨ qˆ = E[ ˆq = ˆE ˆ T E] ˆ −1 c, and ſnally, in view of eq. (18.16), E[ +T +T Pˆ ¨ˆq = Eˆ c = Pˆ Eˆ c,
(11.60)
where the last equality follows from property (11.54) of the projection operator. The scaled equations of motion, Image of the ^ eqs. (11.57), are multiplied by the projector projector = F constraint to ſnd m ^ ^ Fc PP F gradient c ^ .. F ˆ c = Fˆ c = Pˆ ¨ Pˆ F ˆq − Pˆ Fˆ ^ q +T ^ * + (11.61) E c m^ +T P^ F = Pˆ Eˆ c − Fˆ , Constraint manifold where the last equality was obtained with Fig. 11.11. Geometric representation of the help of eq. (11.60). This important reconstraint dynamics with holonomic con- lationship shows, once again, that the constraints. Although appearing as orthogonal straint forces are entirely contained within ˆ of the projector, i.e., the conprojections in this illustration, projections the image, E, are, in fact, operating in the metric of the straint forces belong to the space deſned by inverse of the mass matrix. the scaled constraint matrix. Next, the accelerations of the system are computed from eq. (11.57) as * + ¨ˆq = F ˆ + Fˆ c = Fˆ + Pˆ Eˆ +T c − Fˆ = Pˆ Fˆ + E ˆ +T c. (11.62) ⊥ ˆq = Clearly, system accelerations have a component in the image of the projector, Pˆ ¨ ˆ +T c and a component in the orthogonal subspace, Pˆ ¨ E ˆq = Pˆ ⊥ Fˆ . The geometric ⊥ interpretation of these results is illustrated in ſg. 11.11. ˆ , are the sum of the According to eq. (11.57), the scaled unconstrained forces, F c ˆ scaled constraint forces, F , and the scaled accelerations of the system. The scaled ˆ +T c, both contained constraint forces are the difference between vectors Pˆ Fˆ and E in the image of the projector, as implied by eq. (11.61). On the other hand, system ˆ +T c, contained in the image of the projector, accelerations are the sum of vector E and vector Pˆ ⊥ Fˆ , contained in the orthogonal subspace. The various projection involved in this geometric interpretation of the equations of motion of constraint dynamical systems are not orthogonal projection. Indeed,
11.3 The geometric interpretation of constraints
455
the projection operator is deſned in a space with a metric deſned by the inverse of the mass matrix. Although appearing as orthogonal projections on this illustration, projections are, in fact, operating in the metric of the inverse of the mass matrix. 11.3.4 Elimination of Lagrange’s multipliers In the last section, geometric aspect of the problem have been underlined by using the geometric concept of projection operator. It is, however, easy to recast the main results of the last section in a form that does not make use of these geometric concepts. Indeed, eqs. (11.62) and (11.61) are rewritten as ! "−1 ! " M ¨q = F + B T B M −1 B T c − B M −1 F , (11.63a) ! " ! " c T −1 T −1 −1 F = c−BM F , (11.63b) B BM B respectively. These results are identical to those obtained by Udwadia and Kalaba [66, 69, 71, 65, 73]. Equations (11.63a) form a set of n second-order, ordinary differential for the n generalized coordinates q. Clearly, Lagrange’s multipliers present in the original equations, eqs. (11.1), have been eliminated; the governing equations of motion are now ordinary differential equations rather that differential-algebraic equations. Clearly, eqs. (11.63a) could be recast as a set of 2n ſrst-order, ordinary differential for the n generalized coordinates q and n generalized velocities q˙ . This contrasts with Maggi’s equations, eqs. (11.11) and (11.7), that consist of 2n−m ſrst-order, ordinary differential for the n − m kinematic characteristics and n generalized coordinates. Although Lagrange’s multipliers have been eliminated, an explicit expression for the forces of constraint is available as eq. (11.63b). Example 11.9. The simple pendulum Here again, the simple pendulum problem is considered as a ſrst example. The con! "−1 straint matrix of the problem is given by eq. (11.13), and hence B M −1 B T = m, the mass of the particle. The acceleration level constraint, eq. (11.3), yields c = −(x˙ 2 + y˙ 2 )/. The system accelerations then follow from eq. (11.63a) as x˙ 2 + y˙ 2 1 x y1 y −m . m¨q = mg −x y The constraint forces are found with the help of eq. (11.63b) x˙ 2 + y˙ 2 1 x x1 x c −m . F = −mg y y The geometric interpretation of this result, as discussed in section 11.3.4, is particularly striking for this simple pendulum example. For reference, the scaled projectors are easily found to be 2 1 x2 xy 1 y −xy ˆ Pˆ = 2 , and P . = 2 2 ⊥ xy y −xy x2
456
11 Constrained systems: advanced formulations
Example 11.10. The rigid body Figure 11.3 depicts a rigid body moving in space while one of its points, denoted O, remains ſxed. The orientation of the body is determined by an orthonormal basis B = (¯ e1 , e¯2 , e¯3 ). The kinematics of the
rigid bodyare deſned by the nine generalized coordinates stored in array q T = e¯T1 , e¯T2 , e¯T3 . Clearly, six constraints must be imposed: three conditions on the normality of vectors e¯1 , e¯2 , and e¯3 , and three additional constraints enforcing their orthogonality. The corresponding constraint matrix is given by eq. (11.17) and the projector deſned by eq. (11.58) is found to be ⎡ ⎤ √ √ M e¯2 e¯T M1 e¯3 e ¯T M1 M2 M1 M3 T T 2 3 I − M11 +M − e ¯ e ¯ e ¯ e ¯ 2 3 1 1 M +M M +M M +M 2 1 3 1 2 3 √1 √ ⎢ ⎥ M e¯1 e¯T M e¯3 e¯T M1 M2 M2 M3 ⎥, 1 3 Pˆ = ⎢ ¯1 e¯T2 I − M2 +M − M2 +M ¯3 e¯T2 ⎣ ⎦ M +M e M +M e 2 √1 M1 M3 ¯1 e¯T3 M1 +M3 e
2 2 √1 M2 M3 T e ¯ e ¯ M2 +M3 2 3
3
2
I−
M3 e¯1 e¯T 1 M3 +M1
and the pseudo inverse of the image of the projector becomes √ ⎡√ M3 M1 M1 e¯1 0 0 0 ¯3 M1 +M3 e √ √ M3 M2 ˆ +T = ⎢ E M2 e¯2 0 e ¯ 0 ⎣ 0 M2 √ +M3 3 √ √ 2 M3 1 M3 0 0 M3 e¯3 M ¯2 M ¯1 M2 +M3 e M1 +M3 e
3
−
M3 e ¯2 e ¯T 2 M3 +M2
√ ⎤ M2 M1 ¯2 M1 √ +M2 e ⎥ M1 M2 ¯1 ⎦ , M1 +M2 e
0
where, for simplicity, the mass matrix was assumed to be diagonal, M ∗ = diag(M1 I, M2 I, M3 I). The scaled constraint forces and accelerations are then given by eqs. (11.61) and (11.62), respectively. Example 11.11. The skateboard Figure 9.6 depicts the simpliſed conſguration of a skateboard of mass m and moment of inertia I about its center of mass G. The skateboard rolls without sliding on the horizontal plane by means of a wheel aligned with the axis e¯1 of the skateboard and located at point C, a distance from the center of mass. The position vector of the center of mass is written as rG = x ¯ı1 + y ¯ı2 , and the axis of the skateboard makes an angle θwith the horizontal. Let the generalized coordinates of the problem be q T = x, y, θ . Clearly, the system is subjected to a constraint: because the wheel does not slip, the velocity vector of the contact point must be along axis e¯1 . The velocity of point C is v C = x˙ ¯ı1 + y˙ ¯ı2 + θ˙ e¯2 , and hence, the constraint is e¯T2 v C = 0, leading to the constraint matrix given by eq. (11.20). The procedure described in the previous section leads to the elimination of Lagrange’s multiplier by constructing the projection operator. The accelerations of the system, given by eq. (11.63a), then become ⎫ ⎧ − sin θ ⎬ ⎨ mI ˙ M ¨q = cos θ , θ(x˙ cos θ + y˙ sin θ) ⎭ ⎩ I + m2 where the mass matrix is a diagonal matrix, M = diag(m, m, I). Since there are no externally applied forces, it follows that F c = M ¨q .
11.4 Gauss’ principle
457
11.4 Gauss’ principle Consider a dynamical system characterized by generalized coordinates denoted q. By ) deſnition (8.1), the inertial forces acting on the system are F I = mi ai , where mi are the masses of the particles and ai their acceleration vectors. When the position vectors of all particles are expressed in terms of generalized coordinates, the inertial forces become F I = −M ¨q − f I , where M = M (q, t) is the symmetric, positivedeſnite mass matrix, and f I = f I (q, q˙ , t) the dynamical forces. Note that inertial forces are linear functions of the generalized accelerations. D’Alembert’s principle, eq. (8.3), now implies that δq T [−M ¨q − f I + f a ] = 0, for all kinematically admissible virtual changes in the generalized coordinates. In this statement of the principle, δq T f a represents the virtual work done by all externally applied conservative and non-conservative forces. D’Alembert’s principle is now recast in a compact manner as δq T M ¨q − F = 0, (11.64) where F (q, q, ˙ t) = f a (q, q˙ , t) − f I (q, q˙ , t) is the sum of all dynamical and externally applied forces. The generalized coordinates are functions of time and a Taylor series expansion yields q(t + dt) = q(t) + q˙ (t)dt + ¨q (t)dt2 /2 + h.o.t. The position and velocity vectors of all particles of the system are now assumed to be given, ſxed quantities at time t, implying that δq = 0 and δ q˙ = 0. Neglecting higher-order terms, variation of the series expansion now yields δq(t + dt) =
1 δ¨q (t)dt2 . 2
Introducing this result into d’Alembert’s principle, eq. (11.64), leads to δ¨q T M ¨q − F = 0, (11.65) for all kinematically admissible virtual changes in the generalized accelerations. Within the framework of the present development, variations of all quantities that are sole functions of the generalized coordinates and velocities vanish; for instance, δM (q, t) = 0 or δF (q, q˙ , t) = 0. This implies that δ¨q = δ[M −1 (M ¨q − F )] and eq. (11.65) becomes
M ¨q − F
T −1 ! " δ M M ¨q − F = 0,
and ſnally, δG = 0, where the Gaussian of the system is deſned as G=
1 M ¨q − F M −1 M ¨q − F , 2
which is a quadratic function of the generalized accelerations.
(11.66)
458
11 Constrained systems: advanced formulations
Thus far, d’Alembert’s principle has been used to prove that among all kinematically admissible generalized accelerations, the actual accelerations of the constrained system are a stationary point of its Gaussian, δG = 0. If the system is constrained by a combination of holonomic and nonholonomic constraints, the accelerations level constraints are expressed by eqs. (11.3), which are linear functions of the generalized accelerations. Conceptually, generalized accelerations could be divided into two sets, the independent and dependent accelerations, denoted ¨q I and ¨q D , respectively. The accelerations level constraints, eq. (11.3), would then yield a linear relationship between the two sets, ¨q D = T (q, t)¨q I + d(q, q˙ , t). Introducing this expression into eq. (11.66), shows that the Gaussian now becomes a quadratic expression of the independent generalized accelerations, and hence, its stationary point corresponds to its absolute minimum. This discussion establishes Gauss’ principle [86, 87]. Principle 18 (Gauss’ principle) Among all kinematically admissible generalized accelerations, the actual acceleration of a constrained system minimizes its Gaussian. Because the Gaussian is a quadratic function of the generalized acceleration and because the acceleration level constraints are linear functions of the same variables, the stationary point of the Gaussian corresponds to it absolute minimum. Consequently, Gauss’ principle is a true minimum condition, rather than the stationarity condition that characterizes d’Alembert’s principle. The use of Gauss’ principle for the solution of constrained multibody systems was proposed by Lilov and Lorer [88] in 1982; their approach involves the MoorePenrose inverse of the constraint matrix. The importance of Gauss’ principle and its relationship to d’Alembert’s principle was studied in a mathematical manner by Cardin and Zanzotto [89], within the framework of differential geometry. Possibly non-Riemannian mechanical systems with holonomic constraints were considered, generalizing Gauss’ principle. Example 11.12. The simple pendulum Derive the equations of motion of the simple pendulum problem depicted in ſg. 11.9 using Gauss’
principle. Two generalized coordinates, the Cartesian coordinates, q T = x, y , deſning the position vector of the bob are used, subjected to a single holonomic constraint: C = (q T q − L2 )/2 = 0. This constraint enforces the constant length condition for the pendulum; the constraint matrix of the problem is B(q) = x, y . The acceleration level constraint is C¨ = B ¨q + x˙ 2 + y˙ 2 = 0. Within the framework of Gauss’ principle, variations of all quantities that are sole functions of the generalized coordinates and velocities vanish: δB = 0, δ x˙ = 0, and δ y˙ = 0. A variation of the acceleration level constraint then yields xδ¨ x + yδ y¨ = 0, which demonstrates that the variations in the generalized accelerations, δ¨ x and δ y¨, are not independent of each other. If y¨ is selected to be the independent acceleration component, δ¨ x = −yδ y¨/x. For this simple problem, the Gaussian is G = m[(¨ x − g)2 + y¨2 ]/2, and the stationarity condition becomes δG = m[(¨ x − g)δ¨ x + y¨δ y¨] = 0. Because δ¨ x and δ y¨
11.4 Gauss’ principle
459
are not independent quantities, the stationarity condition does not yield the equation of motion. Eliminating δ¨ x leads to δG = m[−y(¨ x − g)/x + y¨]δ y¨ = 0, and the equation of motion of the problem becomes −y¨ x + x¨ y + gy = 0.
(11.67)
To verify that this result is correct, the position of the particle is written as x = LCθ , y = LSθ , where L is the length of the pendulum. Equation (11.67) can then be recast as θ¨ + Sθ g/L = 0, as expected. This example call for several remarks. First, the elimination process is fraught with difſculties. Indeed, equation δ¨ x = −yδ y¨/x becomes singular when x = 0. Selecting δ¨ x to be the independent acceleration component would not circumvent the problem because equation δ y¨ = −xδ¨ x/y would now becomes singular when y = 0. Second, equation (11.67) is still expressed in terms of the two generalized coordinates selected to represent the conſguration of the problem, x and y. Eliminating one of the two generalized coordinates leads to very complicated expressions for the single equation of motion and singularities will appear, whether x or y is selected as the independent variable. Rather than eliminating one of the generalized coordinates, it is simple to append to equation (11.67) the acceleration level constraint, thereby creating a set of two ordinary differential equations for the two generalized coordinates. It is left to the reader to verify that this set of equations is identical to that generated by application of the null space formulation presented in section 11.2.5. Example 11.13. Derivation of the index-1 formulation from Gauss’ principle Derive the index-1 formulation for constrained dynamical systems from Gauss’ principle. As shown in the previous example, the elimination of the dependent accelerations is a perilous exercise. Furthermore, it is difſcult, in general, to express dependent acceleration components in terms of their independent counterparts. To avoid this potentially difſcult step, the constraints will be enforced using Lagrange’s multiplier technique described in section 9.1. The following augmented Gaussian, G+ , is introduced, G+ = G + λT B(q, t) q¨ − c(q, q˙ , t) , where the Gaussian of the system, G, is deſned by eq. (11.66), and λ is the array of Lagrange multipliers used to enforce the constraints. The augmented Gaussian is now an unconstrained function of two sets of variables, the generalized accelerations, ¨q , and Lagrange’s multipliers, λ. Variation of the augmented Gaussian leads to δG+ = δ¨q T M ¨q − F + B T λ + δλT B ¨q − c = 0, and because variations δ¨q and δλ are arbitrary, the two bracketed terms must vanish, leading to the equations of motion of the problem. When recast in a matrix form, these two sets of equations are identical to those characterizing the index-1 formulation developed in section 11.2.3, see eq. (11.24).
460
11 Constrained systems: advanced formulations
Example 11.14. Independent quasi-accelerations In general, it is difſcult to express dependent velocity components in terms of their independent counterparts. To overcome this difſculty, kinematic parameters or independent quasi-velocities were introduced at the onset of the development of Maggi’s formulation presented in section 11.2.1. Develop the corresponding concept of independent quasi-accelerations for Gauss’ principle. The relationship between generalized velocities and kinematic parameters expressed by eqs. (11.7) is at the heart of Maggi’s formulation and introduces the null space, Γ . A time derivative of this expression yields eqs. (11.10), repeated here for ˙ convenience, ¨q = Γ e˙ + Γ˙ e − d. Within the framework of Gauss’ principle, variations of all quantities that are sole functions of the generalized coordinates and velocities vanish: δΓ (q, t) = 0, δe = 0, δ Γ˙ = 0, and δ d˙ = 0. Taking a variation of eqs. (11.10) then yields δ¨q = Γ δ e˙ .
(11.68)
This important relationship expresses variations of the system’s generalized accelerations in terms of variations of a set of independent quasi-accelerations, e˙ , which are the time derivatives of the kinematic parameters introduced in Maggi’s formulation. As mentioned earlier, the choice of the kinematic parameters is not unique and the selection of a speciſc set is left to the analyst. Similarly, the choice of speciſc quasi-accelerations is not unique Example 11.15. Derivation Maggi’s and null space formulations from Gauss’ principle Derive Maggi’s and null space formulations principle. The condition of from Gauss’ stationarity of the Gaussian implies δ¨q T M ¨q − F = 0. Because the generalized acceleration are not independent variables for a constrained dynamical system, this stationarity condition does not yield the equations of motion of the system. To remedy this problem, the generalized accelerations are expressed in terms of independent quasi-accelerations using eq. (11.68), leading to δ e˙ T Γ T M ¨q − F = 0. Because the quasi-accelerations are independent variables, the stationarity condition now yields the equations of motion of the problem as Γ T M ¨q − F = 0. (11.69) Introducing the generalized accelerations from eqs. (11.10) into eqs. (11.69) then yields the governing equations, eqs. (11.11), of Maggi’s formulation. On the other hand, appending the acceleration level constraints to eqs. (11.69) leads to the governing equations, eqs. (11.31), of the null space formulation. The developments summarized in the last two examples show that the index-1, null space, and Maggi’s formulations can all be derived from Gauss’ principle. This should be expected because Gauss’ principle is a fundamental principle of dynamics. It is indeed derived from d’Alembert’s principle, which itself, was shown to be equivalent to Newton’s second law.
11.5 Additional formulations
461
11.5 Additional formulations The theoretical developments presented in the above sections are well known, but additional formulations have also been presented in research papers, often of a more theoretical nature. In fact, many papers focus on explaining relationships, and often establishing equivalence, between various formulations rather than proposing practical numerical methods for the enforcement of constraints. For instance, Borri et al. [90] have shown the equivalence of Maggi’s and Kane’s equations [60, 61]. Angeles and Lee [91] independently derived Maggi’s formulation for mechanical systems composed of rigid bodies coupled by holonomic constraints. The formalism of Riemannian geometry was used by Maißer [92] to study holonomic multibody systems. This work focuses on rigid multibody systems with a tree topology and emphasizes the generation of the equations of motion within the Riemannian formalism with the help of Christoffel symbols. Coordinate partitioning was suggested as a solution method for the resulting equations. Jungnickel [93] further investigated the equations of motion for combined holonomic and general nonholonomic constraints, i.e., constraints that might be nonlinear in the generalized velocities, within the framework of a Riemannian space endowed with a metric depending on the generalized mass matrix of the system and the constraints. The equations of motion were projected onto the tangent space resulting in index-1 DAEs from which Lagrange’s multipliers were eliminated. Ess´en [94] considered systems of particles and derived a minimal set of equations of motion for holonomic systems by projecting Newton’s equations onto the space tangent to the constraint manifold. These projected Newton equations were then shown to be equivalent to Lagrange’s equations. Generalizing to nonholonomic constraints in Pfafſan form, Ess´en obtained equations of motion in terms of quasivelocities by projection of Newton’s equations onto the null space of the constraint matrix. The resulting equations were shown to be general Boltzmann-Hamel equations. The relationship of this approach to Kane’s method was also underlined. Blajer [85, 95] summarized much of the work done within the framework of differential Riemannian geometry: index-1 formulations, null space formulations, and Maggi’s formulation for combined holonomic and nonholonomic constraints have all been presented in this framework. For holonomic systems, the equivalence of Maggi’s formulation and Boltzmann-Hamel equations was shown, as was the equivalence of the projective formulation and of the matrix setting of Gibbs-Appell equations. The author underlined the need to develop efſcient methods for computing the time derivative of the null space, an indispensable ingredient for the application of Maggi’s and projective formulations. He also proposed a technique for the elimination of constraint violations that affect the index-1, null space and Maggi’s formulations. The Boltzmann-Hamel equations are immune from these violations because independent generalized coordinates are introduced.
12 Constrained systems: numerical methods
The classical and advanced formulations presented in chapter 10 and 11, respectively, provide the theoretical background for the analysis of constrained dynamical systems. In this chapter, practical numerical algorithm are described and compared. Lagrange’s equations of the ſrst kind have been derived and form a set of index3 differential-algebraic equations (DAEs). Gear [96] clearly underlined the difſculties associated with the solution of this type of equations. The same author and his coworkers [97, 47] have studied DAEs extensively and concluded in 1984: “If the index does not exceed 1, automatic codes [...] can solve the problem with no trouble.” Furthermore, “If [...] the index is greater than one, the user should be encouraged to reduce it.” These observations prompted the multibody community to engage along two distinct avenues of research. First, the development of the ordinary differential equation techniques described in section 12.1, which eliminate Lagrange’s multipliers all together, reducing the DAEs to ODEs. Methods developed for the solution of ordinary differential equations (ODEs) are then applicable to the reduced system of equations. Second, the index reduction techniques presented in section 12.2, which reduce the governing equations of motion to index-1 equations. A survey paper by Haug [98] describes in a conceptual manner these two approaches to computational methods in constrained dynamics. Nikravesh [56] investigated two algorithms representative of those two approaches: the ſrst algorithm reduces the problem to an index-1 system by enforcing the constraints at the acceleration level, the second used a coordinate partitioning method based on ref. [99]. Many of the methods proposed for the solution of constrained dynamical systems do not enforce constraints exactly, rather, small constraint violations are allowed that could grow over time. This phenomenon, called the drift phenomenon, was illustrated in examples 11.2, 11.5, and 11.6, when using Maggi’s, the index-1, and the null space formulations, respectively. Figures 11.1, 11.5, and 11.7 show the time histories of the constraint violations and results are summarized in table 11.1. It was noted that the drift phenomenon is more pronounced for the index-1 and null space formulations than for Maggi’s formulation because the two former approaches en-
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_12 © Springer Science+Business Media B.V. 2011
464
12 Constrained systems: numerical methods
force the constraint at the acceleration level, but the latter enforces constraints at the velocity level. Section 12.3 presents several constraint violation stabilization techniques that were developed to alleviate the drift phenomenon. Constraint violation elimination techniques that completely circumvent this problem are discussed in section 12.4. Finally, in recent years, the ſnite element method has played an increasingly important role in multibody dynamics formulations and the tools and techniques used within this framework are the subject of section 12.5. This chapter concludes with a detailed discussion of scaling methods presented in section 12.6. It is shown that with the proper scaling of the equations of motion, the index-3 DAEs stemming from the modeling of constrained dynamical systems are not more difſcult to integrate than the ODEs characteristic of unconstrained systems.
12.1 Ordinary differential equation techniques The challenges posed by the differential-algebraic nature of Lagrange’s equations of the ſrst kind can be dealt with by means of alternative formulations of the equations of motion. This section deals with methods that recast the governing equations of motion in terms of ODEs. A logical approach is to eliminate the redundant generalized coordinates to obtain a minimum set of equations, bypassing the need for constraints; this is the approach followed in Maggi’s formulation presented in section 11.2.1. It is also possible, however, to obtain ODEs for all the generalized coordinates selected by the user to describe the system; this is the approach followed in the null space and Udwadia and Kalaba’s formulations discussed in sections 11.2.5 and 11.2.7, respectively. 12.1.1 “Maggi-like” formulations The essence of Maggi’s formulation developed in section 11.2.1 is the construction of the null space, which enable the elimination of Lagrange’s multipliers through the use of orthogonal complements, B Γ = 0. Because the constraint matrix is a function of time, the null space is itself a function of time, and in numerical implementations, it must be recomputed at each time step, a considerable computational burden. Hence, the vectors spanning the null space at two different steps could be different, resulting in a new set of kinematic characteristics at each time step. To overcome these problems, many researchers have evaluated the null space at the beginning of the simulation, Γ 0 , and kept it constant for the subsequent time steps of the analysis. When using this approach, Lagrange’s multipliers are no longer eliminated, because B(t)Γ 0 = 0. At regular intervals, the null space is recomputed. Typically, a criterion is developed that identiſes the appropriate time step when this expensive operation is to be performed; various criteria have been used by various researchers. It should be noted that these methods no longer represent a numerical implementation of Maggi’s formulation; they might be better characterized as “Maggi-like” methods.
12.1 Ordinary differential equation techniques
465
Kurdila et al. [54] and Papastavridis [55] ſrst pointed out the unifying role of Maggi’s formulation, which forms the basis for many of these coordinate reduction techniques that are equally applicable to holonomic and nonholonomic constraints. They point out that various methods only differ by the choice of the basis selected to span the null space of the constraint matrix, which in turns, determines the kinematic characteristics; the equations of motion are then projected onto this subspace. Clearly, matrix B and its inverse, as deſned by eqs. (11.5) and (11.6), respectively, fully characterize Maggi’s formulation. A number of “Maggi-like” formulations only differ by the computational tool used to evaluate the null space of the constraint matrix. The following approaches have been used: the zero eigenvalue method [100], the coordinate partitioning method based on the LU factorization [99, 101], and the singular value decomposition method [102, 103]. Because these approaches were reviewed by Kurdila et al. [54], details are not repeated here. The following sections discuss methods that were developed after their review paper appeared. The recursive Householder transformation method Amirouche et al. [104] applied the Householder transformation technique to the transpose of the constraint matrix, assumed to be of full rank, to obtain a full rank, upper triangular matrix B ut(n×m) = H B T , where H (n×n) is the product of successive Householder transformations. The Gram-Schmidt orthonormalization process was then employed to ſnd an orthonormal basis, D, which was partitioned as D(n×n) = D1(n×m) D2(n×(n−m)) . D1 and B ut span the same subspace, B Tut D2 = B H T D2 = 0, becuse DT1 D2 = 0. D2 precisely spans the null space of B H T , while H T D2 spans the null space of B. The fundamental matrices of Maggi’s formulation are easily identiſed as B T = B T H T D2 and B −1 = H T D1 (B H T D1 )−1 H T D2 . The authors pointed out that this approach is equivalent to the zero eigenvalue [100] and singular value decomposition methods [102, 103], while achieving higher computational efſciency. The tangent coordinate method Agrawal and Saigal [105] also used the Gram-Schmidt orthogonalization process to generate a basis of the null space of constraint matrix. For holonomic constraints, this null space is tangent to the constraint manifold, hence the name of the method. This approach is very similar to that presented by Liang and Lance [106], except that matrix E is also constructed using the Gram-Schmidt process, a method that is faster and requires less computer memory. The process generates an orthogonal matrix T , T T which is partitioned as T T(n×n) = T 1(n×m) T 2(n×(n−m)) . T T1 and B T span the same subspace and B T T2 = 0, because T 1 T T2 = 0, while T T2 precisely spans the
466
12 Constrained systems: numerical methods
tangent space. Thefundamental matrices of Maggi’sformulation are easily identiſed as B T = B T T T2 and B −1 = T T1 (B T T1 )−1 T T2 . 12.1.2 Maggi’s formulations Wampler et al. [107] devised a simple approach in which Maggi’s kinematic characteristics are selected to be a subset of the generalized speed within the framework of Kane’s method. They only presented analytical examples of their procedure. A few authors have developed approaches that update the null space at each time step. Kim and Vanderploeg [108] proposed an updating scheme which maintains the directional continuity of the null space. However, matrix Q of the underlying QR decomposition does not remain orthogonal, and hence, the full QR decomposition must be repeated at regular intervals, based on a criterion reƀecting the condition number of a matrix involved in the null space update. This approach was reviewed by Kurdila et al. [54], details are not repeated here. The following sections discuss methods that were developed after their review paper appeared. The Gram-Schmidt method Liang and Lance [106] have used the Gram-Schmidt orthonormalization process to generate independent continuous and differentiable. At ſrst, ma coordinates that are T T trix P (n×n) = B (n×m) E (n×(n−m)) is constructed, where E T is an arbitrary matrix such that P is nonsingular. Typically, E is determined by singular value decomposition or by LU factorization. Matrix P is then transformed into an or thogonal matrix V = V D V I through the Gram-Schmidt process, where V D and V I are of the same dimensions as B T and E T , respectively. V D and B T span the same subspace, hence, B V I = 0 because V TD V I = 0. V I precisely spans the null space of B. The fundamental matrices of Maggi’s formulation are easily identiſed as B T = B T V I and B −1 = V D (B V D )−1 V I . The extraction procedure approach Constraints equations are intimately related to the choice of coordinates used to represent mechanical systems. Garc´Æa de Jal´on et al. developed the concept of “basic coordinates” for systems composed rigid bodies; the approach was developed for the kinematic analysis of planar lower-pair mechanisms [109, 110] and later expanded to deal with spatial mechanisms [111, 112]. Serna et al. [113] used this framework to analyze the dynamic response of planar mechanisms. Maggi’s and the null space formulations were both presented, together with an original approach to the determination of the null space. The authors note that each column of the null space can be determined by means of the solution of an elementary velocity problem; this is a more physical approach
12.1 Ordinary differential equation techniques
467
that contrasts with the purely numerical procedures described in the previous sections. Similarly, the term Γ˙ e that appears in Maggi’s equations (11.11) can be evaluated as the solution of elementary acceleration problems. The same approach was used by Garc´Æa de Jal´on et al. [114] who presented a formulation for both open- and closed-loop systems based on natural, or fully Cartesian coordinates. These coordinates have the advantages of leading to a constant mass matrix, and to relatively simple expression of the constraint matrix. Garc´Æa de Jal´on et al. [28] later showed how “natural coordinates” evolved from the earlier basic coordinates, and used this new concept to describe multibody systems. The null space is determined from eq. (11.7), written as q˙ = Γ e, because all constraints are assumed to be holonomic and scleronomic. It is then possible to determine the null space corresponding to kinematic characteristics that are an “extraction” of components of the generalized velocity array. In view of eq. (11.4), this ˇ has a single nonzero entry. A good choice of this implies that each row of matrix B extraction is initially determined by performing a Gaussian triangulation of the constraint matrix with full pivoting: the pivot locations indicate the generalized velocities to be extracted. This choice might become unsuitable during the simulation, when a previously selected pivot becomes very small; a new extraction is then selected. In a subsequent paper, Garc´Æa de Jal´on et al. [62] also investigated the use of the singular value decomposition to identify the kinematic parameters. They concluded that while this approach might yield a set of kinematic parameters that are suitable over a longer period of the motion, it is also more expensive than the extraction approach. Avello et al. [115] further elaborated on the extraction procedure by showing that it leads to a highly parallelizable algorithm. The columns of the null space, Γ , are each computed in parallel as the solution of an elementary velocity problem, and furthermore, the triple product Γ T M Γ can also be evaluated in parallel. The terms of array Γ˙ e appearing in Maggi’s equations are also computed in parallel and correspond to solutions of elementary acceleration problems. For computational efſciency, the overall approach uses recursive techniques for open loop mechanisms; in the presence of closed loops, the augmented Lagrangian formulation is used, see section 12.3.2. 12.1.3 Discussion of the methods based on Maggi’s formulation While the null space of the constraint matrix is unique, individual vectors that span this subspace are not. The methods presented above all deſne the null space by different sets of vectors that are obtained by means of different computational processes. Two fundamental criteria can be used to assess the various approaches. First, is the subspace deſned by a set of linearly independent vectors? Second, how robust and efſcient is the numerical process used to generate the subspace? The ſrst criterion is a necessary condition for the viability of the approach: if the vectors are not linearly independent, the null space is not properly deſned. Kurdila et al. [54] pointed out that the approaches of Kane [60] and Wehage and Haug [99] are not robust because they sometimes lead to a poorly conditioned or even singular representations of the
468
12 Constrained systems: numerical methods
null space. To overcome this problem, most other approaches generate an orthogonal basis spanning the null space. The second criterion deals with computational robustness and efſciency. Based on operation count, the computational cost of the singular value decomposition is known to be two to ten times higher than that of the QR algorithm, depending on the size of the constraint matrix. In turns, the QR algorithm is about two times more costly than the LU factorization. On the other hand, the singular value decomposition is many times more expensive than the Gram-Schmidt orthonormalization process. The singular value decomposition, however, is probably the most robust algorithm since it can be safely used even when the constraint matrix is not of full rank [102], as is the case in the presence of redundant constraints. Clearly, the singular value decomposition is the most robust and stable algorithm, but is also the most expensive. An important feature of Maggi’s formulation is that constraints are enforced at the velocity level. Hence, nonholonomic constraints will be satisſed to numerical accuracy, whereas holonomic constraints will drift due to the inherent errors associated with the integration process. This drift, however, is minimal, because the kinematic characteristics lie in the hyperplane tangent to the constraint manifold. In fact, Liang and Lance [106] mention that with their approach, “the numerical solution will be satisfactory without any positive constraint violation control or constraint violation stabilization.” This is an important beneſt of a rigorous application of Maggi’s formulation. The situation, however, is different with the Maggi-like methods that do not update the null space, because the kinematic characteristics no longer exactly reside in the tangent hyperplane. To obtain accurate solutions, a Newton-Raphson iteration procedure that enforces the constraint is often added to the time integration process. 12.1.4 Null space formulations This section discusses the approaches based on the null space formulation presented in section 11.2.5. Kamman and Huston [116, 117] developed an approach where the zero eigenvalue theorem was used to determine the null space of the constraint matrix. System dynamic response was then obtained based on the null space formulation. Borri et al. [64] pointed out that this approach is not much more computationally expensive than other null space methods because the most costly task is, by far, the determination of the null space. Section 12.1.2 described the extraction procedure used by Garc´Æa de Jal´on and his coworkers [62] to determine the null space of the constraint matrix. In these papers, the authors introduced the index-1, Maggi’s and null space formulations for the modeling of rigid multibody systems within the framework of reference point and natural coordinates. Of particular interest is the second paper [63], which compares different approaches to the modeling of constrained mechanical systems. The salient conclusions of the work are as follows. First, the relative efſciency of all formulations depends on the number of generalized coordinates and degrees of freedom of the system. Second, the null space formulation tended to be more efſcient than the index-1 approach. Finally, Maggi’s formulation tended to outperform the null
12.1 Ordinary differential equation techniques
469
space formulation. Note that this study provides qualitative information for the speciſc framework described by the authors. For instance, it is unclear whether such conclusions would still hold when dealing with elastic multibody systems. Chiou et al. [118] presented a numerical approach to the solution of the equations of motion expressed in terms of independent velocities. Based on a partitioning scheme that makes use of the velocity transformation relations, the null space of the constraint matrix was constructed. Maggi’s equations and a system of ODEs were obtained for open- and closed-loop systems, respectively. The explicit-implicit staggered procedure devised by Park et al. [119] was employed to integrate the system of ODEs. A parallel implementation of the proposed approach was proposed but the authors underlined the need to increase the efſciency of the algorithm. 12.1.5 Udwadia and Kalaba’s formulations This section discusses the approaches based on Udwadia and Kalaba’s formulation described in section 11.2.7. Arabyan and Wu [120] extended Udwadia and Kalaba’s formulation, which was originally developed for systems of particles, to constrained rigid body problems. The main challenge to the use of this approach is that it calls for the computation of a generalized inverse at each time step, see eq. (11.33). The singular value decomposition is one tool to compute the generalized inverse, but it is very costly [82]. The authors proposed to use the Gram-Schmidt orthogonalization process [82] to this end; depending on the size of the constraint matrix, this process can be considerably cheaper than the singular value decomposition. Furthermore, the Gram-Schmidt algorithm is able to identify inconsistencies in the speciſcation of constraints. These claims were substantiated by a number of examples comparing the performance of the index-1 approach to that based on the generalized inverse computed by both singular value decomposition and Gram-Schmidt algorithms. 12.1.6 The projective formulation Blajer [83, 121] proposed a projection method for the analysis of constrained dynamical problems. Instead of introducing the concept of projectors, as discussed in section 11.3, Blajer uses differential Riemannian geometry: linear metric spaces in which vectors are resolved into their covariant and contravariant components, and the metric of the space is deſned by the mass matrix. The effect of this metric is akin to the scaling of all quantities, as performed in section 11.3, and adds consistency to the formalism. The term “geometric projection” is used because the proposed method projects the index-1 equations onto the subspaces tangent and orthogonal to the admissible subspaces. Maggi equations (11.11) were then obtained by substitution of the independent variables into the equations projected on the tangent subspace. The independent variables form a set of independent quasi-velocities, in ref. [121], or independent quasi-accelerations, in ref. [83]. When applied to holonomic systems, the projective formulation is equivalent to Kane’s form of Appell’s equations [61]; for nonholonomic systems, it is equivalent to Maggi’s formulation. Analytical examples
470
12 Constrained systems: numerical methods
were presented in these papers but numerical implementation and computational efſciency issues for complex multibody systems were not addressed. In a subsequent paper, Blajer et al. [84] used the projective formulation to devise a criterion for the optimal selection of independent coordinates, to be used in the coordinate partitioning method proposed by Wehage and Haug [99]. Blajer [122] also addressed the numerical implementation of the projective formulation. The Gram-Schmidt orthogonalization process was used to obtain a tangent subspace, as earlier suggested by other researchers [106, 105]. In this approach, however, orthogonality of the tangent and constraint subspaces is not achieved in a Cartesian space, as was the case for earlier methods, but rather in a space endowed with a metric deſned by the mass matrix. The projective formulation requires the computation of the inverse of the mass matrix and of its time derivative, operations that are, in general, computationally expensive. Hence, Blajer recommends the use of this method in conjunction with absolute coordinates that lead to constant mass matrices; in such case, the inverse must be computed once only and its time derivative vanishes. 12.1.7 Modiſed phase space formulation Borri et al. [123] derived governing DAEs that feature the following unknowns: the generalized coordinates, q, the modiſed momenta, p∗ , which are related to the actual momenta, p∗ = p − B T μ, and the multipliers, μ, which are related to Lagrange’s multipliers, μ˙ = −λ. Unlike the momenta, the modiſed momenta are unconstrained, i.e., the state vector (q, p∗ ) evolves in a modiſed, unconstrained phase space, and hence, signiſcant reduction of the constraint violations can be expected. While the constraint forces, driven by Lagrange’s multipliers, sometimes exhibit large amplitude oscillations, thus affecting the accuracy of the solution and imposing smaller time step sizes, the multipliers, μ, have a smoother behavior because they are integrals of Lagrange’s multipliers, easing the integration process. The approach is robust in the presence of singular conſgurations. The DAEs are transformed into ſrst-order ODEs in q and p∗ for integration. Good numerical results were shown, particularly in terms of satisfaction of the constraint conditions. A penalty formulation of the approach, similar to that employed in Park and Chiou [124], was derived to render the method even more insensitive to singular conſgurations.
12.2 Index reduction techniques Index reduction techniques are typically presented as mathematical processes that reduce the index of a set of DAEs. Numerical analysis techniques are then used to prove that the application of speciſc types of time integrators to the reduced order DAEs provides a reliable solution of the problem. Gear et al. [59] proposed a method, called the stabilized index-2 or GGL method, that reduces the index from 3 to 2 and showed that variable-order, variable-step backward difference methods converge for the resulting index-2 problem. Later, Gear [125] developed an approach to further
12.2 Index reduction techniques
471
reduce the problem to index-1 DAEs. Of course, these approaches imply additional computational cost in the form of additional Lagrange multipliers to be solved for. Gear [126] transformed ſrst-order ODEs with equality and inequality invariants into index-2 DAEs, and provided, for equality invariants, a convergence analysis for variable-order, variable-step size, multi-step methods applied to the resulting DAEs. The proposed approach generalizes the one-step integrators proposed by Shampine [127], who minimally perturbed the solution of ODEs after each step to satisfy the invariants. This is equivalent to projecting the solution onto the invariant manifolds. Convergence was proved for one-step integration methods. L¨otstedt [58] studied rigid multibody systems subjected to unilateral holonomic and nonholonomic constraints. The equations of motion consisted of second-order ODEs and inequalities deſning a linear complementarity problem. The occurrence of discontinuities in the displacement, velocity, and acceleration ſelds when constraint are activated or released was studied and bounds on the velocity vector were derived. Existence of solutions was discussed; particularly, displacements and constraint forces were proved to be unique in all conſgurations, whereas Lagrange multipliers are unique only when the constraint matrix has full rank. Because solutions of linear complementarity problems are also solutions of quadratic programming problems, Gauss’ principle was shown to generalize to rigid body problems with unilateral constraints. L¨otstedt and Petzold [48] proved that k th -order, constant step size, backward difference methods converge when applied to index-1, -2, or -3 DAEs; the numerical solution is accurate to order O(hk ), where h is the time step size. The same authors [49] further investigated the practical difſculties of implementing variables step size integration methods for the same types of problems. The difſculties associated with the solution of index-3 DAEs were underlined: the condition number of the Newton iteration matrix, i.e., the tangent matrix used to solve the discretized nonlinear algebraic equations is O(h−3 ), resulting in increasingly ill conditioned problems for decreasing time step sizes. This conditioning problem can be completely eliminated for index-3 DAEs by using the scaling techniques presented in section 12.6. Eich [128] provided a convergence analysis for a coordinate projection approach combined with backward difference methods to integrate index-1 DAEs. The approach projects the numerical solution of the underlying ODEs onto the position and velocity invariants to reduce constraint violations. The accuracy of the projected solution was shown to be identical to that of backward difference methods applied to the ODEs. For linear systems, it was shown that only the errors lying in the invariants were propagated, rendering the solution more accurate. For holonomic systems, Yen et al. [129, 130] reduced the index-1 DAEs to ODEs by means of local parametrizations. The ODEs, which are similar to BoltzmannHamel equations, feature local parameters that implicitly deſne independent generalized coordinates and speeds. Then, using the local parametrization mapping and the constraint equations, the original generalized coordinates and velocities are recovered. A convergence analysis is presented that demonstrate an O(hk ) accuracy when k th -order linear multistep methods are used. In numerical applications, local parameterizations were obtained using the generalized coordinate partitioning and
472
12 Constrained systems: numerical methods
tangent space methods. Local parameterizations using the tangent space determined by QR decomposition were also used by Potra and Yen [131]. A similar approach was used by Haug and Yen [132] who determined local parameters using the generalized coordinate partitioning technique. Based on backwards difference methods, the discretized Boltzmann-Hamel equations were shown to be equivalent to a set of discretized DAEs, involving the constraints at the displacement, velocity, and acceleration levels. For practical applications, these DAEs were used. Yen et al. [133, 134] introduced the “coordinate-split formulation” for the solution of the index-2 DAEs characteristic of ƀexible multibody systems. A family of second-order α-methods with user controllable numerical dissipation was proposed, which extend the corresponding methods used for ODEs in structural dynamics, the HHT algorithm of Hilber et al. [135] and the generalized-α algorithm of Chung and Hulbert [136]. The coordinate-split method is a numerical implementation of the null space formulation applied to the stabilized index-2 approach of Gear et al. [59]; it eliminates the two sets of Lagrange multipliers associated with this approach. Projections in the space of the mass matrix were used to impose the constraints at both position and velocity levels. To deal with the highly oscillatory nature of the response of ƀexible multibody systems, the authors introduced a modiſcation of the Newton iteration process, denoted “modiſed coordinate-split iteration.” Improved convergence was proved mathematically and demonstrated by means of examples. For holonomic systems, Tseng et al. [137] devised an algorithm, called “Maggi’s equations with perturbation iteration,” which further develops the modiſed coordinate-split iteration by perturbing the solution that is projected onto the constraint manifold to eliminate constraint violations. In this approach, the determination of the generalized accelerations, velocities, and displacements is separated from that of the Lagrange multipliers, which are recovered in a post-processing operation. Good numerical results were obtained, although the authors stressed the need for further validation of the approach, especially in the presence of ƀexible bodies. The authors considered the coordinate-split formulation to be a numerical implementation of Maggi’s equations, but in the classiſcation introduced herein, this approach belongs to null space formulations. Another approach to index reduction is the embedded projection method developed by Borri et al. [138], which can be used to systematically reduce the index of the original DAEs system from 3 to 1. Furthermore, the method splits the original problem into its algebraic and differential parts, which can then be solved sequentially. While the accuracy and robustness of the procedure were demonstrated, its complexity is also apparent. Parczewski and Blajer [139, 140] investigated systems subjected to program constraints, i.e., systems forced to follow a prescribed path. The control forces that impose the prescribed motion might have components in directions both tangential and orthogonal to the constraint manifold. This feature of control problems contrasts with the classical theory of constrained dynamics, for which constraint forces are acting in the direction normal to the constraint manifold, see ſg. 11.11. A classiſcation of program constraint realizations was developed, which includes both orthogonal and tangent realizations, involving normal and tangent control forces, respectively. The
12.3 Constraint violation stabilization techniques
473
authors provided several examples of program constraint realizations and determined the associated control forces; the difſculties inherent to non orthogonal realizations were underlined. Constraint forces that have components in directions both tangential and orthogonal to the constraint manifold are said to be “non-ideal” constraint forces. Udwadia and Kalaba [74] also studied such systems and gave explicit expressions for both ideal and non-ideal constraint forces in terms of Moore-Penrose generalized inverses.
12.3 Constraint violation stabilization techniques A number of techniques impose the constraints at the acceleration level, as is the case for the index-1, null space, or Udwadia and Kalaba’s formulations, see sections 11.2.3 and 11.2.5, respectively. Considering holonomic constraints, let C = 0, C˙ = 0, and C¨ = 0 represent the displacement, velocity, and acceleration level constraints, respectively. The system consisting of the equations of motion and the acceleration level constraints then forms a set of index-1 DAEs with invariants. Indeed, for the exact solution, C = 0 and C˙ = 0 represent two invariants of the system. Unfortunately, due to numerical approximations and round-off errors, numerical solutions will not evolve along the invariant manifolds, resulting in C = 0 and C˙ = 0. This phenomenon, called the drift phenomenon, was illustrated in examples 11.2, 11.5, and 11.6, when using Maggi’s, the index-1, and the null space formulations, respectively. Figures 11.1, 11.5, and 11.7 show the time histories of the constraint violations and results are summarized in table 11.1. From a mathematical standpoint, equation C¨ = 0 is not stable because its poles are located at the origin of the splane, where s is the variable of Laplace’s transform; consequently, C and C˙ will not converge to zero if any deviation occurs. The constraint violation stabilization techniques presented in this section attempt to minimize or eliminate this drift; they are not, per se, solution methods for constrained dynamical problems, but rather, are used in conjunction with various solution techniques that are sensitive to the drift phenomenon. If the mechanical system is conservative, the total mechanical energy, E, is an additional invariant of the system, E˙ = 0. If nonconservative forces are externally applied, the work they perform can be added to the total mechanical energy to form an invariant of the system. As was the case for the holonomic constraints considered above, due to numerical approximations and round-off errors, the solution will drift away from this manifold, i.e., the total mechanical energy will not be preserved. In fact, the energy preservation constraint is a particular case of a nonholonomic constraint D(q, q˙ ) = 0. 12.3.1 Control theory based stabilization techniques The most popular stabilization technique is probably Baumgarte’s method, which can be interpreted within the framework of control theory. Several researchers improved Baumgarte’s original method and these efforts are described below.
474
12 Constrained systems: numerical methods
Baumgarte’s constraint violation stabilization To compensate the observed drift of the solution, Baumgarte [141] introduced a stabilization method in which the original acceleration level constraint, C¨ = 0, is replaced by C¨ + 2αC˙ + β 2 C = 0, (12.1) where α and β are user deſned, positive parameters. In practical implementation, the choice α = β is often appropriate because critical damping is achieved. In the case of nonholonomic constraints, see eq. (11.2), the velocity level constraint, D(˙q , q, t) = B(q, t) q˙ + b(q, t), is replaced by ˙ + γD = 0, D
(12.2)
where γ is a user deſned, positive parameter. The total energy constraint was treated in a similar manner. Baumgarte’s stabilization method has been very widely used in multibody dynamics because it is easily implemented in conjunction with a variety of formulations of the equations of motion and time integration procedures. Ostermeyer [142] explained the effects of Baumgarte’s stabilization method within the framework of control theory. Unfortunately, parameters α and β are problem dependent, and no general procedure exists for their determination; hence, the approach tends to be unreliable and cannot be recommended for general purpose use in multibody dynamics because the constraints are never exactly satisſed. Eich and Hanke [143] mention that: “Choosing α and β too large results in stiff ODEs and a great amount of computing time.” Nevertheless, some authors reported successful computations with Baumgarte’s method. For instance, Nikravesh et al. [144] found an index-1 formulation in conjunction with Baumgarte’s stabilization to be signiſcantly more efſcient computationally than the coordinate partitioning approach. They mention that: “Experience has shown that for most practical problems, positive values less than 5 for α and β are adequate. When α = β, critical damping is achieved, which usually provides the fastest error reduction.” Park and Haug [145] have combined Baumgarte’s stabilization method with the generalized coordinate partitioning method and shown that this hybrid approach outperforms both methods applied individually. They mention that: “Thus, the constraint stabilization method alone cannot handle every situation accurately and efſciently.” Their rational for this conclusion is that the choice of α and β at each integration step is difſcult and expensive, and erroneous solutions can appear when the constraint matrix is nearly singular. Improvements of Baumgarte’s stabilization method Chang and Nikravesh [146] proposed an approach to adaptively determine the damping coefſcient as the simulation proceeds. They assumed α = β and used adaptive control concepts to estimate optimal damping coefſcients that are different for each
12.3 Constraint violation stabilization techniques
475
constraint. Numerical examples demonstrate that better control of constraint violations is achieved with the adaptive approach. Another improvement of Baumgarte’s method was proposed by Ostermeyer [142] who added to eq. (12.1) a term involving the time integral of the constraint violation, based on optimum control theory. Bae and Yang [147] also proposed an approach to the evaluation of the stabilization parameters. First, they replaced eq. (12.1) by C¨ + αC˙ + αC = 0, where α represents the magnitude of the penalty factor for both position and velocity constraint violations, arguing that both violations are equally undesirable. Larger values of α will yield smaller constraint violations. If α is too large, however, the system becomes unstable. Hence, the value of α is limited by the stability characteristics of the numerical procedure used to integrate the equations of motion; the Adams-Bashforth integrator was used in their work. This condition yields a closed form expression for α as a function of the time step size and stability boundaries of the integrator. A similar study was undertaken by Yoon et al. [148] who showed that under suitable assumptions, the constraint equation, written as C¨ + αC˙ + βC = d, where d represent the disturbances due to truncation errors, is indirectly integrated with the same numerical scheme as that used for the dynamics equations. This enables a rigorous study of the accuracy and stability characteristics of Baumgarte’s method to be performed. In view of the complexity of the analysis, however, results were only shown for one case, the simple pendulum. The authors also pointed out the importance of stabilizing the energy preservation constraint. Based on the input-output feedback linearization technique, Chiou and Wu [149] transformed the nonlinear governing DAEs into a set of linear equations. Next, they showed that a pole placement technique leads to Baumgarte’s method and proposed a new approach to stabilization based on the variable structure control technique. While they demonstrated the superiority of their approach over Baumgarte’s method by means of examples, no guidelines were provided on how to select the constants appearing in either approach. Control theory concepts are also the basis for Lin and Hong’s [150] stability analysis of Baumgarte’s method using digital control theory. They notice that selecting α and β to be positive numbers is not sufſcient to guarantee convergence of C and C˙ to zero as implied by stability analysis applied to eq. (12.1). Hence, they performed a stability analysis of the discretized equations using the Z-transform concept. They deſned two parameters, α ˆ = α/h and βˆ = β/h2 , and concluded that while α ˆ and ˆ β are independent of the problem and time step size, they do depend on the time integration scheme used for the simulation. Desirable values of α ˆ and βˆ were given for the Adams-Bashforth and Adams-Moulton predictor-corrector integrators. 12.3.2 Penalty based stabilization techniques In penalty formulations, constraints are enforced by means of a penalty term added to the Lagrangian of the system, 1 T C P C, 2
(12.3)
476
12 Constrained systems: numerical methods
where P = diag(p2i ), and pi are the penalty factors. It is common practice to use the same penalty factor, p = pi , for all constraints and hence, the penalty term is often written as 1/2 p2 C T C. The idea behind this formulation is to choose large penalty factors so as to drive the constraints to zero, i.e., p → ∞ and C → 0. Taking a variation of the penalty term yields δq T B T p2 C, revealing the equivalent externally applied generalized force, B T (p2 C), which at the limit, become B T λ, where λ are Lagrange’s multipliers. Of course, in practical applications, a ſnite value of the penalty factor must be selected to avoid numerical ill conditioning and hence, the constraints are never exactly enforced and the quantities λ = p2 C approximate Lagrange’s multipliers. The staggered stabilization technique Park and Chiou [124] presented a stabilization technique based on a penalty formulation. The Lagrange multipliers associated with holonomic constraints were written as λ = C(q, t)/ , where = 1/p2 is the penalty factor; time differentiation of this expression then leads to 1 ∂C λ˙ = (B q˙ + ). (12.4) ∂t Taken together with eq. (11.1), these equations form a set of coupled ODEs. For nonholonomic constraints, a similar procedure can be followed by selecting Lagrange’s multipliers as λ˙ = 1/ (B ¨q + ∂D/∂t). Introducing governing eq. (11.1) leads to λ˙ + (B M −1 B T )λ = B M −1 F + ∂D/∂t. If Lagrange’s multipliers ¯ exp(σt), the homogeneous part of this equations becomes are written as λ = λ ¯ = 0. This implies that the constraint decay rates, σi , are the (σ + B M −1 B T / )λ eigenvalues matrix B M −1 B T / ; in other words, the decay rates are a function of the physical characteristics of the system, in contrast with Baumgarte’s method that depends on abstract coefſcients unrelated to system properties. A single derivative of the constraints was taken, and hence, this approach will be less sensitive to the drift phenomenon than methods requiring two time derivatives. Furthermore, it depends on a single coefſcient, the penalty factor. Examples treated by Park and Chiou [124] showed improved accuracy for displacement level constraint invariants as compared to the results of Baumgarte’s method. This stabilized technique is robust as it can accommodate nearly rank deſcient constraint matrices, while Baumgarte’s technique cannot. Park et al. [119] presented an explicit-implicit, staggered procedure to implement the stabilization procedure described in the previous paragraph. The approach calls for the developments of two distinct modules, one integrates the generalized coordinates knowing the constraint forces, the other integrates the Lagrange multipliers knowing the generalized coordinates. Calls to the two modules alternate, hence, the approach is called a “staggered procedure.” Several application examples were given, demonstrating the accuracy and effectiveness of the procedure, which is robust but conditionally stable.
12.3 Constraint violation stabilization techniques
477
Augmented Lagrangian formulation The augmented Lagrangian formulation developed by Bayo et al. [151] starts as a penalty formulation of the problem. Corresponding to the k th holonomic constraint, the following terms are added to the Lagrangian of the system: a penalty term, 1/2 αk ωk Ck2 , a Rayleigh dissipative forces terms, −2 αk ωk μk C˙k , and a ſctitious kinetic energy term, 1/2 αk C˙k2 . The governing equations of the system now become index-1 equations M ¨q + B T α(C¨ + 2Ω μ C˙ + Ω 2 C) = F ,
(12.5)
where α = diag(αk ), Ω = diag(ωk ) and μ = diag(μk ). This penalty formulation will only yield accurate predictions for large penalty factors, αk → ∞; the coefſcients ωk and μk play a stabilizing role similar to that of the corresponding coefſcients of Baumgarte’s method. In the augmented Lagrangian formulation, a set of Lagrange multipliers is introduced together with the penalty terms, leading to M ¨q + B T α(C¨ + 2Ω μ C˙ + Ω 2 C) = F − B T λ∗ .
(12.6)
Had the sole Lagrange multipliers been introduced, the governing equations would have been M ¨q = F − B T λ, and hence, λ = λ∗ + α(C¨ + 2Ω μ C˙ + Ω 2 C). Because the Lagrange multipliers are sufſcient, per se, to impose the constraints, the penalty coefſcient is no longer required to be large; the formulation, however, now involves m additional unknowns. In the proposed approach, the Lagrange multipliers, λ∗ , are not treated as unknowns; rather, they are computed through an iterative process λ∗(i+1) = λ∗(i) + α(C¨ + 2Ω μ C˙ + Ω 2 C)(i+1) ,
(12.7)
where the superscript (·)(i) indicates the iteration number and λ∗(0) = 0. Combining this iterative scheme with eq. (12.6) leads to (M + B T α B)¨q (i+1) = M ¨q (i) − B T α(B˙ q˙ + 2Ω μ C˙ + Ω 2 C),
(12.8)
where M ¨q (0) = F . The augmented Lagrangian formulation reduces the problem to a set of ODEs with no additional unknowns. The iterative solution of the Lagrange multipliers is inexpensive since iterations are already required for the solution of the nonlinear equations of motion. Numerical experimentation shows that accurate solutions can be obtained for a wide range of penalty factors, αk ∈ [103 , 109 ]. The formulation can be generalized to accommodate nonholonomic constraints. Bayo et al. [152] further elaborated the augmented Lagrangian formulation. The ˙ penalty term was simpliſed to keep two terms only, resulting in λ = λ∗ +α(C +μC). They observed that the velocity level constraint in the penalty factor was necessary
478
12 Constrained systems: numerical methods
to prevent the appearance of high frequency numerical oscillations during the simulation. To integrate the equations of motion, the trapezoidal rule was used with accelerations or displacements as primary variables: the latter were shown to provide superior performance. Using penalized potential and kinetic energies, and Raleigh dissipation functions, Kurdila et al. [153] formulated penalized equations of motion for holonomic systems. For conservative systems, convergence to the original, non penalized equations was shown, stability analysis was performed, and sufſcient conditions for Lyapunov and asymptotic stability were given. Good numerical results were shown for singular conſgurations, due to the dissipative function. Systems with relatively large number of degrees of freedom and constraints could also be accurately simulated. The equations were developed for holonomic systems, and are thus somewhat limited in their applications.
12.4 Constraint violation elimination techniques In contrast to the constraint violation stabilization techniques presented in the previous section, constraint violation elimination techniques are method which result in the exact satisfaction of the constraint, or at least to satisfaction of the constraints within machine accuracy. 12.4.1 Geometric projection approach to stabilization Yoon et al. [154] developed an approach to constraint violation stabilization. Let ¯ qn and ¯v n be the generalized coordinates and velocities, respectively, predicted by the integration of the equations of motion at the end of time step n. Due to numerical approximations, both holonomic and nonholonomic constraints will not be exactly satisſed, i.e., C(¯q n , tn ) = 0 and D(¯q n , v¯n , tn ) = 0, respectively. The approach consists in correcting or perturbing the generalized coordinates and velocities, q n = ¯q n + qˆn , and v n = v¯n + vˆn , respectively, where ˆq n and vˆn are the unknown coordinate and velocity corrections, respectively, both assumed to be small. The updated coordinates, q n , and velocities, v n , satisfy the holonomic and nonholonomic constraints, i.e., C(q n , tn ) = 0 and D(q n , v n , tn ) = 0, respectively. At ſrst, the generalized coordinate corrections are evaluated by linearizing the holonomic constraints to ſnd B(¯q n , tn ) ˆq n ≈ −C(¯q n , tn ). Since these equations are overdetermined, it is assumed that the corrections lie in the subspace deſned by the constraint matrix, i.e., ˆq n = B T n , where n is an unknown array. It then follows that n = −(B B T )−1 C(q n , tn ), and ſnally ˆq n = −B T (B B T )−1 C(¯q n , tn ).
(12.9)
Next, the generalized velocity corrections are evaluated by linearizing the nonholonomic constraints to ſnd B(q n , v¯n , tn ) vˆn ≈ −D(q n , v¯n , tn ). In this second phase, the generalized coordinates are kept constant, since they were corrected in the ſrst
12.4 Constraint violation elimination techniques
479
phase of the procedure. Here again, these equations are overdetermined and it is assumed that vˆn = B T γ n . It then follows that γ n = −(B B T )−1 D(q n , v¯n , tn ), and ſnally vˆn = −B T (B B T )−1 D(q n , v¯n , tn ). (12.10) The procedure alleviates constraint violations without modifying the equations of motion, in contrast with Baumgarte’s method. The approach is geometric in nature: Γ T (q n − ¯q n ) = Γ T B T n = 0, the corrected solution is a projection of the approximate solution onto the constraint manifold. Clearly, the geometric procedure alleviates the constraint violations without eliminating them; complete elimination would require an iterative solution of the constraint equations. Yoon et al. [154] demonstrated the effectiveness of the procedure for holonomic, nonholonomic and energy constraints. Blajer [85, 155] developed a similar approach to constraint elimination based on the geometric interpretation of constrained dynamics he presented with his coworkers in ref. [84]. Based on geometric arguments, the following correction schemes are found for the generalized coordinates ˆq n = −M −1 B T (B M −1 B T )−1 C(¯q n , tn ),
(12.11)
vˆn = −M −1 B T (B M −1 B T )−1 D(q n , ¯ v n , tn ),
(12.12)
and velocities,
respectively. Blajer’s corrections, eqs. (12.11) and (12.12), are more physically consistent than Yoon’s, eqs. (12.9) and (12.10), respectively, because matrix (B M −1 B T ) involves terms that are of consistent units, in contrast with matrix (B B T ) that does not. Indeed, when generalized coordinates have different units, such as displacements and rotations, matrix (B B T ) weighs all components equally; in contrast, matrix (B M −1 B T ) weighs each component by an appropriate inertial term. In numerical applications, the position corrections, eq. (12.11), are used ſrst in an iterative manner until constraint violations are completely eliminated, i.e., until the constraint equations are satisſed to machine accuracy. If nonholonomic constraints are present, the same process is applied to correct the velocities. Baumgarte [156] developed a new stabilization method, which is derived from a modiſed statement of Hamilton’s principle. The resulting equations of motion feature non classical Lagrangian multipliers and the holonomic constraints need to be differentiated only once with respect to time. Unfortunately, no applications were presented, making the assessment of the approach rather difſcult. Terze et al. [157] formulated a constraint elimination method within the framework of the null space approach. Using the projective criterion deſned by Blajer et al. [84], they identiſed a set of independent variables. Displacement constraint violations were then iteratively eliminated by adjusting the sole dependent variables to satisfy the displacement level constraint equations. In a second step, the velocity constraint violations were eliminated using the velocity level constraint equations. During both correction steps, the independent displacements and velocities were kept unchanged.
480
12 Constrained systems: numerical methods
12.4.2 The mass-orthogonal projection formulation Bayo and Avello [158] proposed an augmented Lagrangian formulation based on the canonical equations of Hamilton. Compared to the index-1 based formulation derived earlier by Bayo et al. [151], the new formulation exhibits better accuracy and robustness in the presence of singular conſgurations. The improved performance was credited to the fact that a single differentiation of the holonomic constraints is required in canonical formulations, rather than the double differentiation associated with index-1 formulations. Effectiveness of the new approach was illustrated by numerical examples. A similar formulation was derived for nonholonomic constraints although no example was given. While the augmented Lagrangian formulation presented in section 12.3.2 satisſes the weighted constraint, C¨ + 2Ω μ C˙ + Ω 2 C, see eq. (12.5), to machine accuracy, individual constraints at the position, C = 0, velocity, C˙ = 0, and acceleration levels, C¨ = 0, are not necessarily satisſed to the same level of accuracy. To improve this situation, Bayo and Ledesma [159] combined the augmented Lagrangian formulation with a mass orthogonal projection technique. To impose the position level constraint, they propose to minimize V = 1/2 (q − q∗ )M (q − q ∗ ) subject to the constraint C(q, t) = 0, where q ∗ is the solution obtained at the end of a time step using the augmented Lagrangian formulation. This minimization problem is itself solved using an augmented Lagrangian formulation, transforming V into 1 1 (q − q ∗ )T M (q − q ∗ ) + C T α C + C T λ. (12.13) 2 2 Simple algebraic manipulations lead to the following iterative scheme to impose the position constraint V∗ =
(M + B T α B)Δ(i+1) = −M (q (i) − q ∗ ) − B T λ(i)
(12.14)
where Δ(i+1) = q (i+1) − q (i) and λ(i+1) = λ(i) + α C (i+1) . From a computational view point, this iterative procedure is not expensive because the system matrix, (M + B T α B), is identical to that of eq. (12.8). Hence, this matrix is factorized once only and the additional computational cost consists of the evaluation of the right-hand side of eq. (12.14) followed by forward reductions and backward substitutions. This contrasts with the approaches presented in section 12.4.1 that typically involve more computational effort. The constraints at the velocity and acceleration levels can be treated in a similar manner and are formulated in such a way that the resulting system matrix is identical to that of eq. (12.14), minimizing computational cost. Bayo and Ledesma [159] illustrated their approach with several numerical examples. Application of the mass-orthogonal projection at each time step eliminates constraint violations to machine accuracy and dramatically increases the accuracy of the simulation. A mechanism presenting singular conſgurations was successfully simulated to demonstrate the robustness of the augmented Lagrangian formulation. The trapezoidal rule was used to integrate the equations of motion with accelerations or displacements as primary variables: the latter were shown to provide superior performance.
12.5 Finite element based techniques
481
Comparative studies For holonomic systems, Schiehlen [160] derived governing DAEs and ODEs, the Boltzmann-Hamel equations, as well as equations of motion based on a recursive approach. The recursive approach, suitable for chain topologies, can be much more competitive than the ODE formulation, although for small numbers of degrees of freedom, the latter is still competitive as the former is rather complex. In general, recursive approaches require O(n) operations, in contrast with ODE formulations that may need up to O(n3 ) arithmetic operations. A comparative study showed that the ODE formulations are more efſcient than their DAEs counterpart, although this conclusion was based on very simple, rigid multibody systems examples involving very few degrees of freedom. Cuadrado et al. [161] compared four methods that are used to simulate multibody dynamics with constraints. These methods are: the augmented Lagrangian formulation index-1 and index-3 with projections, a modiſed state-space formulation (equations of motion in independent coordinates) and a fully recursive formulation. Modiſcations were performed to the classical state-space formulation to improve its performance in the presence of stiff systems. The augmented Lagrangian index-1 and index-3 formulations used natural or fully Cartesian coordinates, as described in ref. [41]. These coordinates have the advantage of leading to a constant mass matrix. A number of rigid multibody problems were solved with all four methods to compare their performance; none was found to be fully satisfactory. The index-3 formulation with projections failed to converge when using time step sizes smaller than 10−5 sec, while for time step sizes larger than 10−2 sec, the index-1 formulation failed to converge. The space-state and the fully recursive formulation lacked robustness as they failed to handle singular conſgurations. In addition, the fully recursive formulation behaved poorly in the presence of stiff systems or systems presenting redundant constraints. Nevertheless, for non-stiff problems of large size, this method became competitive. Of all the methods tested, the index-3 formulation with projections was the most efſcient, while the index-1 formulation with projections was the most robust. The authors suggested that a combined index-1 and index-3 formulation would constitute a very good tool for solving multibody dynamics with constraints. Further evaluation of the methods was recommended, however, especially for large scale industrial problems. It is not clear how the various methods presented in this study would perform for elastic multibody systems.
12.5 Finite element based techniques Multibody dynamics analysis was originally developed as a tool for modeling mechanisms with simple tree-like topologies composed of rigid bodies, but has considerably evolved to the point where it can handle nonlinear ƀexible systems with arbitrary topologies. The modeling of the elastic bodies is one of the most difſcult aspects of multibody systems dynamics, and many different formulations have been presented
482
12 Constrained systems: numerical methods
in the literature. Comprehensive reviews of the state of the art in the ſeld are given by Shabana [162], or Wasfy and Noor [163]. Traditionally, elasticity in multibody systems has been taken into account using the ƀoating frame of reference approach [162], which is discussed in section 12.5.1. The displacement ſeld of a ƀexible body is then decomposed into two additive parts, a rigid body and an elastic displacement ſeld. The rigid body displacement ſeld is represented by the arbitrarily large motion of a suitably selected frame of reference, which could be rigidly connected to a material point of the ƀexible body, or could be in motion with respect to the ƀexible body, hence the name “ƀoating frame of reference.” On the contrary, the elastic displacement ſeld, resolved in the ƀoating frame of reference, is assumed to remain small, and hence, is adequately represented using modal expansion techniques. If this assumption is satisſed, the elastic behavior of ƀexible bodies can be accurately captured using a small number of modal degrees of freedom. Component mode synthesis techniques, initially developed for ſnite element analysis, are now routinely used in ƀexible multibody dynamics and section 12.5.2 summarizes several commonly used approaches. In ſnite element based multibody dynamics approaches, a given mechanism is modeled by an idealization process that represents each component of the ƀexible mechanism by an “element” chosen from an extensive library of elements implemented in the code. In fact, this approach is at the heart of the ſnite element method, which has enjoyed, for this very reason, an explosive growth in the last few decades. Each element provides a basic functional building block, for example a rigid or ƀexible member, a hinge, a motor, etc. Assembling the various elements, the construction of a mathematical description of the mechanism with the required level of accuracy becomes possible. In addition to the classical beam, plate, shell, and solid brick elements found in all ſnite element codes, kinematic constraints are also formulated as “ſnite elements,” such as revolute joint or universal joint elements, to name but a few. A detailed description of the formulation is given by G´eradin and Cardona [164]. 12.5.1 Floating frame of reference approach One of the most common approaches to the modeling of ƀexible multibody systems is based on the concept of ƀoating frames [162]. The total motion of the ƀexible body is broken into two parts: rigid body motions represented by the motion of the ƀoating frame, and superimposed “elastic motions.” This decomposition allows the introduction of simplifying assumptions: although the total motion is always ſnite, the elastic motions may, in some cases, be assumed to give rise to inſnitesimal deformations. Many problems of great practical importance fall into this category. Consider, for instance, road or rail vehicles: the body of the vehicle undergoes large rigid body motions but the elastic deformations remain small. Of course, this assumption is no longer valid during a crash: in that case, large plastic deformations will be encountered. Other components of the vehicle, however, such as the suspension, wheels, and tires are inherently of a nonlinear nature. Another example is rotorcraft. Under normal operation, the fuselage undergoes large rigid body motions but small elastic deformations. During maneuvering ƀight, large rotations will be encountered. Here
12.5 Finite element based techniques
483
again, the other components of the rotorcraft, the main and tail rotor, and the landing gear, are inherently nonlinear. When deformations remain small, it seems natural to use modal reduction techniques to represent the small elastic motions in an efſcient manner. The system size will be considerably reduced, together with the resulting computational cost. In addition, since high frequency modes are eliminated, larger time step sizes can be used in the simulation. Although the concept of ƀoating frame seems rather intuitive, the implementation of a computational procedure based on this idea must deal with several thorny issues. First, the accuracy of the analysis will critically depend on the selection of a suitable modal basis. Second, a speciſc ƀoating frame must be selected: it could be attached to a point of the elastic body or moving with respect to it. Third, the modal based elements should be easy to couple with the other components of the system modeled with multibody formulations. This points towards the use of component mode synthesis techniques that are well developed for structural dynamics problems. Fourth, in the absence of elastic deformations, the formulation should recover the exact equations of motion for a rigid body. Finally, the formulation should be independent of the ſnite element analysis package used to compute the modes of the elastic components. These various issues will be discussed in more detail in the following paragraphs. The ſrst critical step is the selection of a suitable modal basis, which is closely related to the choice of a speciſc ƀoating frame of reference [165, 166, 167, 168]. Ideally, the selected modes should capture as accurately as possible the deformation patterns encountered during operation. Consequently, the analyst should be given the greatest possible freedom to select the type of modes he sees ſt. The formulation should not put any restriction on the choice of the modal basis. Several authors have addressed the mode selection process [169, 170, 171, 172, 173, 174, 175, 176, 177]. Next, a speciſc ƀoating frame must be selected. Since there exits no unique manner of deſning the “rigid” and “elastic motions,” the ƀoating frame can be selected in a number of different ways and speciſc conditions must be selected to remove this indeterminacy. Several authors make use of body-attached frames, i.e., the ƀoating frame is attached to an arbitrary point in the body [178, 172, 174]. Other authors rely on ƀoating frames moving with respect to the elastic body [167, 179, 177]. Cavanin and Likins [168] studied different options including frames attached to a material point of the ƀexible body, frames oriented along the principal axes of inertia, the Tisserand frame, the Buckens frame and the rigid body mode frame. They concluded that the Tisserand frame was the most advantageous choice and showed the equivalence of several of these choices when the body undergoes small deformations. The moving frame approach seems to be more desirable than the body-attached approach because it eliminates the need to arbitrarily select a material point where to attach the ƀoating frame. On the other hand, the moving frame approach also involves the analyst’s insight since a speciſc condition must be selected to determine its location. Furthermore, this latter approach comes at the expense of additional computational complexity.
484
12 Constrained systems: numerical methods
In some formulations, the choice of the ƀoating frame is intimately linked to that of the modes used in the reduction technique [180, 181, 175, 176, 163, 182]. This connection hinders the selection of the most appropriate modes because the boundary conditions used to compute them do not necessarily match those of the ƀexible component once it is part of a multibody system. In the classical application of modal analysis [183], the displacement ſeld is represented as a linear combination of modes shapes. This type of representation has been used by some authors [184] in the context of multibody dynamics analysis, but it requires special techniques for coupling the modal based element with the other components of the system. Typically, this is done by formulating a constraint condition that equates the modal superposition to the physical displacement at a node of the model [185]. 12.5.2 Component mode synthesis methods These approaches do not take advantage of the component mode synthesis techniques that have been developed for structural dynamics problems over the past forty years. These techniques are aimed at computing the eigenmodes of very large structures in an efſcient manner. The complete structure is broken into a number of substructures whose eigenmodes are easily computed. The substructures are then connected together to yield a lower-order model of the complete structure. Each substructure involves two types of degrees of freedom: physical degrees of freedom at a limited number of connection points (called “boundary nodes”), and modal degrees of freedom representing its internal ƀexibility. Clearly, the need to interconnect the substructures is an integral part of the reduction technique and seems therefore ideally suited to the present problem. Among the most widely used component mode synthesis techniques are those of Craig and Bampton [186], MacNeal [187], Rubin [188]. Other efforts include those of Herting [189], Hintz [190], and reſnements of the Craig-Bampton method [191]. Component mode synthesis methods have been used in the context of multibody dynamics by Shabana [178], Haug and coworkers [170, 171, 192, 193], and later by Cardona and coworkers [172, 174, 177] who used the Craig-Bampton method. Unfortunately, this method requires the use of modes associated with clamped conditions at the boundary nodes, thereby limiting the analyst’s freedom to select the most appropriate modal basis. Consequently, the modal basis might poorly approximate the elastic behavior of the component. This fundamental limitation of the approach was recognized by Craig and Bampton who suggested the use of “static correction modes” to alleviate the problem; Schwertassek and coworkers [175, 176] used this concept for ƀexible multibody systems. It also prompted the development of the MacNeal-Rubin method. In this case, however, free conditions must be used at all boundary nodes, limiting again the analyst’s freedom. Furthermore, this method is more cumbersome to implement than the Craig-Bampton method. Finally, Herting’s method [189] offers a more general approach that enables the analyst to choose any type of modes. In fact, predictions based on the Craig-Bampton
12.5 Finite element based techniques
485
and MacNeal-Rubin methods were found to be in good agreement with those obtained with Herting’s method [189]. Herting’s method seems to be the most appropriate choice as it provides the analyst maximum ƀexibility in the choice of the modal basis. Furthermore, it allows independent choices to be made for the selection of the ƀoating frame and of the modal basis [194, 195]. The formulation of modal based elements should be independent of the ſnite element analysis package used to compute the modes of the elastic components [170, 171, 172, 174, 177]. This means that the computation of the mass and stiffness coefſcients used for the formulation of a modal based element should be solely based on the information readily provided by the ſnite element package. Some formulations have been proposed in which the ſnite element analysis tool is embedded in the multibody formulation [196]. Although higher accuracy can be achieved in that manner, this is clearly not a practical option if a large dimensional ſnite element model is required for the representation of the elastic components. Yoo and Haug [170, 171] showed that by assuming a lumped mass representation of the elastic body, the modal based formulation could be fully decoupled from the ſnite element package. Unfortunately, the lumped mass approximation is rarely used in today’s ſnite element models of complex structures. Cardona and G´eradin [172, 174, 177] used corotational techniques to achieve the same decoupling without resorting to the lumped mass approximation. Herting’s transformation leads to an approximation that is fully independent of the ſnite element analysis package. The mass and stiffness coefſcients of the modal based element are computed on the sole basis of the unconstrained mass and stiffness matrices of the elastic component and Herting’s transformation, which also applies to the inertial velocities required to compute the kinetic energy of the elastic component, under the sole assumption of small displacements. In summary, Herting’s transformation is an attractive approach for the implementation of component mode synthesis techniques in ƀexible multibody systems. First, it allows the use of any modal basis the analyst sees ſt. This contrasts with other approaches, such as those based on Craig-Bampton or Rubin-MacNeal transformations that require speciſc boundary conditions for the selected modes. Second, it can be used with both body-attached or moving frames of reference. Third, the modal based element is readily coupled to other components of the multibody system through the boundary nodes that retain physical degrees of freedom for this purpose. Fourth, the formulation recovers the exact equations of motion for a rigid body in the absence of elastic deformations. Finally, it is completely independent of the ſnite element package used to compute the modes of the elastic components. 12.5.3 Basic solution techniques for ſnite element models Application of ſnite element concepts to multibody dynamics analysis has been the focus increased research in recent years. The textbook by G´eradin and Cardona [164] describes such a procedure and presents numerical examples obtained with a commercial implementation the approach. As compared with rigid multibody dynamics or even ƀexible multibody dynamics using a modal approximation, a distinguishing
486
12 Constrained systems: numerical methods
feature of ſnite element methods is the much larger number of degrees of freedom used to model the system. While multibody systems involve tens or at the most a few hundreds of degrees of freedom, ſnite element element models often involve tens of even hundreds of thousands of degrees of freedom. Because solution costs grow as a power of the number of degrees of freedom, efſcient solution techniques are an enabling technology for ſnite element formulations. Because of the large number of degrees of freedom involved in ſnite element formulations and the likely presence of high frequencies associated with the discretization process, time integration relies almost exclusively on implicit schemes. For linear systems, the HHT integrator [135], the workhorse used in most commercial codes, is second-order accurate, unconditionally stable, and presents high frequency numerical damping; these three features are considered indispensable for the successful integration of large ſnite element systems, as discussed in textbooks such as Hughes [197] or Bathe [198]. This contrasts with multibody formulations that tend to use explicit, predictor multi-corrector algorithms such as the Adams-Bashforth integrator [199], for instance. Although of much higher-order accuracy, this integrator is conditionally stable. Implicit integrators require the solution of a linear system at each time step. Typical solution procedures rely on the trifactorization of the sparse, banded dynamic stiffness matrix, K, as K = L D LT , where L is a lower triangular, D a diagonal, and LT an upper triangular matrix, followed by a back-substitution phase. More details concerning this approach called the “skyline solver” or the “active column solver” are found in many textbooks, such as Bathe [198], for instance. The cost, Cb , of the trifactorization of a sparse, banded matrix can be roughly estimated as Cb ∝ nw2 , where n is the number of freedom and w the average bandwidth of the dynamic stiffness matrix. If this matrix were to be fully populated, the factorization cost, Cfp , would become Cfp ∝ n3 . For a ſnite element problem of modest size where n = 10, 000 and w = 100, Cb /Cfp = (m/n)2 = 10−4 ; clearly, the advantage of the sparse solver is overwhelming and is an enabling technique of the ſnite element method. An important implication of these observations is that any formulation that destroys the sparsity of the system matrix generated by the ſnite element method is unlikely to be effective. For instance, applications of Maggi’s formulation presented in section 11.2.1 requires the computation of the null space of the constraint matrix. The various algorithms used to compute the null space, whether the LU factorization with pivoting, Gram-Schmidt orthogonalization algorithm, or singular value decomposition, all alter the band structure of the system matrix. The index-1 formulation requires the inverse of the mass matrix, another band destroying operation; of course, the null space formulation requires the evaluation of the null space; ſnally, the computation of the pseudo-inverse called for by Udwadia and Kalaba’s formulation is once more an operation that does not preserve sparsity. Clearly, far fewer methods are available for the effective enforcement of constraints when bandedness of the system matrix must be preserved. Hence, it should not come as a surprise that the sparsity based, index-3 DAEs formulation of Orlan-
12.5 Finite element based techniques
487
dea et al. [77, 78] discussed in section 12.6 has been used within the framework of ſnite element formulations. The penalty based stabilization techniques presented in section 12.3.2 have also been used in this framework. Finally, the staggered stabilization technique of Park and Chiou [124] and the augmented Lagrangian formulation of Bayo et al. [151] were originally developed for ſnite element formulations. 12.5.4 Numerically dissipative schemes In view of the difſculties associated with the solution of index-3 DAEs, considerable effort was devoted to the development of time integration techniques suitable for large ſnite element systems. Cardona and G´eradin [200, 31] showed that the classical Newmark [201] trapezoidal rule is unconditionally unstable for linear systems in the presence of constraints. The use of dissipative algorithms such as HHT [135] scheme, however, resulted stable behavior, even for nonlinear systems. Further work by Farhat et al. shows that both HHT and generalized-α [136] methods achieve stability for a class of constrained hybrid formulations. In these approaches, stabilization of the integration process is inherently associated with the dissipative nature of the algorithms. While stability is mathematically proven for linear systems, there is no guarantee when it comes to nonlinear systems [202]. A more detailed description of the generalized-α scheme is given in section 17.4. 12.5.5 Nonlinear unconditionally stable schemes To remedy this situation, considerable work has been done in recent years with energy preserving schemes. In these schemes, unconditional nonlinear stability is achieved by proving a discrete energy preserving statement, Ef = Ei , where E denotes the total mechanical energy of the system, and the subscripts (·)i and (·)f denote the value of the corresponding quantity at the initial and ſnal times of the time step, respectively, denoted ti and tf , respectively. This algorithmic preservation property is a direct consequence of the speciſc discretization used for the inertial and elastic forces acting on the system. In view of the positive-deſnite nature of the total mechanical energy, this discrete conservation law guarantees the stability of the computational scheme for nonlinear problems. It is important to understand that while the exact solution of the equations of motion implies the exact preservation of the total mechanical energy, a numerical, i.e., an inherently approximate, solution of the problem does not, in general, guarantee the preservation of energy at the discrete level. When using energy preserving schemes, the computed, approximate solution exactly satisſes the energy preservation condition. A number of researchers have developed energy preserving schemes for rigid bodies [203, 204, 205], beams [206, 207], and plates and shells [208, 209]. Section 17.5 describes energy preserving and decaying schemes in more details, but an exhaustive review of these schemes is beyond the scope of this book. While nonlinear unconditional stability is the ſrst step towards the development of robust algorithms, energy preserving schemes are not well suited for large ſnite element problems because high frequency oscillations, especially in the velocity and
488
12 Constrained systems: numerical methods
stress ſelds, can corrupt the computed system response, as observed by Bauchau et al. [210, 211]. Consequently, the presence of high frequency numerical dissipation is an indispensable feature of robust time integrators for multibody systems, a fact that was already observed for linear systems and prompted the development of numerically dissipative algorithms such as the HHT [135] or generalized-α [136] methods. Numerically dissipative schemes that feature nonlinear unconditional stability can be developed by proving a discrete energy decay statement, Ei+1 = Ei − Ed , where Ed > 0 is the energy dissipated within the time step. This approach was followed by a number of researchers who developed energy decaying schemes for beams [212], shells [213] and multibody systems [214, 215]. An exhaustive survey of energy decaying algorithms is beyond the scope of this book; Bottasso and Trainelli [216] attempted a classiſcation of a number of such algorithms. 12.5.6 Enforcement of the constraints The development of energy preserving and decaying algorithm has considerably increased the robustness of time integration schemes for multibody systems. The main idea behind these techniques is to develop discretizations of the equations of motion that imply algorithmic preservation of a known ſrst integral of the motion, the total mechanical energy. When it comes to enforcement of the constraints, a similar path has been followed: the well known fact that the work done by the constraint forces must vanish is implemented at the algorithmic level [210, ' 217]. ' The work done by the constraint forces is W c = F cT q˙ dt = λT B q˙ dt = ' T λ C˙ dt and hence, the vanishing of this work is intimately linked to the vanishing of the constraint derivatives. This observation helps understand why it is important to enforce constraints at both displacement and velocity levels. Here again, it must be noted that an approximate solution of the constrained equations of motion will not necessarily imply the vanishing of the work done by the constraint forces at the algorithmic level. This provides a potential source of “numerical energy,” which could destabilize the integration scheme. Focusing on holonomic constraints, the following relationship is used to deſne the algorithmic constraint matrix, B m , as C f − C i = B m (q f − q i ),
(12.15)
where the subscript (·)m indicates quantities evaluated at the midpoint of the time step. Note that the mean value theorem guarantees the existence of B m . The discretized forces of constraint now become F cm = B Tm λm , where λm are midpoint Lagrange’s multipliers, and the work done by these discretized forces of constraint follows as W c = (q f − q i )T F cm = λTm B m (q f − q i ) = λTm (C f − C i ). Clearly, the vanishing of the work done by the algorithmic forces of constraints implies Δt C˙ m = C f − C i = 0, which echoes, at the algorithmic level, the condition required for the exact solution, C˙ = 0. Rather than imposing the condition C f − C i = 0, it is preferable to enforce C f = 0 at each time step, to avoid the drift phenomenon.
12.5 Finite element based techniques
489
Discretizations of numerous constraints that satisfy eq. (12.15) can be found in the following references [218, 219, 220, 221]. Additional details about this approach are found in section 17.5.4. Gonzalez [222] formulated an integration scheme for solving the equations of motion of Hamiltonian system expressed in the form of DAEs. Holonomic constraints were considered. The numerical scheme was based on the notion of discrete derivative, which satisſed properties such as directionality, consistency and orthogonality; eq. (12.15) is an example of discrete derivative. The proposed scheme satisſes the constraints, and leads to the conservation of the Hamiltonian and linear and angular momenta, but constraints are not satisſed at the velocity level. Bauchau [223] showed that the approach to modeling constraints characterized by eq. (12.15) is closely related to the stabilized index-2 method of Gear et al. [59], although no additional unknowns are required. The approach summarized in the last two sections combines two algorithmic features: preservation/dissipation of energy and vanishing of the work done by the constraint forces. This provides a formal proof of numerical stability for the integration of nonlinear, ƀexible multibody systems, and constraints are enforced to machine accuracy, both at the displacement and velocity levels. The price to pay for these desirable features is that the discretization of inertial, elastic and constraint forces must be carefully crafted for each element type so that the preservation characteristics of the algorithms can be proved. This stands in sharp contrasts with the more traditional approach to multibody simulations that use a variety of formulations of the equations of motion, but rely on “black box” integration routine, which are designed for the solution of DAEs, but are otherwise unaware of the speciſc features and characteristics of the equations being solved. 12.5.7 The discrete null space approach Betsch et al. [224, 225, 226] have recently proposed an original method for the time integration of constrained dynamical systems, based on Maggi’s formulation. In this approach, the index-3 DAEs are ſrst discretized with an energy/momentum preserving scheme based on the algorithms of Gonzalez [222], and Betsch and Steimann [227]. Next, the discrete Lagrange multipliers are eliminated using a discrete null space: using the notation of eq. (12.15), the discrete null space, Γ m , is the orthogonal complement of the discrete constraint matrix, B m , such that Γ Tm B Tm = 0. As discussed in section 12.5.6, analytical expressions of the discrete constraint matrix can be derived for a wide range of constraints; the originality of proposed approach is to show that analytical expressions of the discrete null space can also be obtained for numerous constraints. This approach bypasses the need for the numerical evaluation of the null space using the many techniques described in sections 12.1.1 and 12.1.2, and the associated numerical cost. Furthermore, the discrete null space is computed for each element of the system independently, and hence, can be used within the framework of ſnite element methods without harming the bandwidth of the system.
490
12 Constrained systems: numerical methods
12.6 Scaling of Lagrange’s equation of the ſrst kind Orlandea et al. [77, 78] have presented an approach to the dynamic analysis of mechanical systems based on the solution of Lagrange’s equation of the ſrst kind, a system of index-3 DAEs in the presence of holonomic constraints. While the number of generalized coordinates used in this approach is larger than the minimum set, they argue that numerical solutions of the resulting equations can be efſciently obtained by taking advantage of their sparsity through the use of appropriate algorithms. To overcome the numerical problems associated with the solution of DAEs, numerically dissipative time integrators were used that are speciſcally designed for stiff problems, see Gear [228]. This early approach proposes a purely numerical solution to the challenges posed by Lagrange’s equations of the ſrst kind: stiff integrators are used to deal with DAEs. Petzold and L¨otstedt [49] have shown that index-3 DAEs are severely ill conditioned for small time step sizes when using backwards difference formulas: unless corrective actions are taken, the condition number of the iteration matrix is of O(h−3 ), where h denotes the integration time step size. Furthermore, errors in the displacement, velocity, and multiplier ſelds are shown to propagate at rates of O(h−1 ), O(h−2 ), and O(h−3 ), respectively. A perturbation analysis by Arnold [229] indicates that errors and constraint violations grow very rapidly as the time step size is reduced, preventing the practical use of time reſnement procedures, and imposing tight tolerances on the solution of the nonlinear discrete equations. Petzold and L¨otstedt [49] presented a simple scaling transformation of the DAEs that yields a condition number of O(h−2 ) and an improvement of one order in the errors for all solution ſelds. Although the sensitivity to perturbations is reduced, numerical problems are still observed in practice. Their scaling, termed “left preconditioning,” consists of dividing the constraint equations by the time step size, while the dynamic equilibrium equations are multiplied by the same quantity. While the mathematical rational for preconditioning is recent, the technique has been used for a number of years by Cardona [31] or Bauchau et al. [230]. Clearly, scaling can and should be used in conjunction with other techniques for the solution of DAEs: it is easily implemented, does not require a reformulation of the equations of motion, and does not introduce additional unknowns. In recent years the direct solution of index-3 DAEs has regained popularity, specially when ſnite element formulations are used to model ƀexible multibody systems, see section 12.5. Because of the large number of degrees of freedom involved in these formulations and the likely presence of high frequencies associated with the spatial discretization process, time integration relies almost exclusively on implicit schemes such as the HHT integrator [135], or more recently, the generalized-α scheme [136]. While dissipative time integration schemes seem to be indispensable to the successful integration of constrained dynamical systems modeled with index-3 DAEs, scaling of the governing equations and constraints seems to be an equally important technique, which is, in fact, hardly new.
12.6 Scaling of Lagrange’s equation of the ſrst kind
491
In the framework of engineering optimization, scaling of constraint equations is a well-known practice that is recommended in numerous textbooks, such as Fox [231], 1971, or Reklaitis et al. [232], 1983. In his 1984 textbook, Vanderplaats [233] speciſcally mentions: “Often, numerical difſculties are encountered because one constraint function is of different magnitude or changes more rapidly than the others and therefore dominates the optimization process. [...,] we have normalized the constraints so they become of order of unity. This improves the conditioning of the optimization problem considerably, and should always be done when formulating the problem.” Although engineering optimization and multibody dynamics simulation are numerically similar problems that must both deal with constraints, it is disturbing to note that scaling of the constraint equations is rarely mentioned in multibody dynamics papers or textbooks. Cardona and G´eradin [234] showed that the condition number of the iteration matrix obtained from the HHT integrator is of O(h−4 ) and stated that “If we try to solve this problem without scaling, the Newton algorithm will not converge since round-off errors would become of the same order as the Newton correction itself.” To remedy this problem, they proposed a symmetric scaling of the equations of motion that render the condition number of the system matrix independent of the time step size and of the mean value of the mass matrix. A more systematic analysis of the scaling procedure was discussed by Bottasso et al. [235] who proposed a simple scaling transformation for the index-3 DAEs describing constrained multibody dynamical systems. The approach amounts to a left and right preconditioning of the iteration matrix, in an effort to decrease solution sensitivity to perturbation propagation. A remarkable result was obtained: both error propagation and iteration matrix conditioning are of O(h0 ), and hence, the behavior of the numerical solution of index-3 DAEs is identical to that of regular ODEs. Bottasso et al. [236] later extended the same ideas to the Newmark family of integration schemes and provided a better theoretical foundation to explain how perturbations affect the solution process. In section 12.6.1, physical arguments are used to derive a simple scaling procedure that is directly applied to the governing equations of motion, before the time discretization is performed, and an augmented Lagrangian term is added to the formulation, see section 12.6.2. Application of any time discretization scheme, such as that described in section 12.6.3, followed by a linearization of the resulting nonlinear algebraic equations then lead to a Jacobian matrix that is independent of the time step size; hence, the condition number of the Jacobian and error propagation are both of O(h0 ): the numerical solution of index-3 DAEs behaves as in the case of regular ODEs. Since the scaling factor depends on the physical properties of the system, the proposed scaling decreases the dependency of this Jacobian on physical properties, further improving the numerical conditioning of the resulting linearized equations. Finally, the additional beneſts stemming from the augmented Lagrangian term are discussed in section 12.6.5. Speciſcally, this term enables the use of sparse solvers that do not rely on pivoting for the stable and accurate solution of the linearized equations of motion.
492
12 Constrained systems: numerical methods
12.6.1 Scaling of the equations of motion In this section, simple physical arguments are used to scale Lagrange’s equation of the ſrst kind, eqs. (11.1), which form a set of index-3 DAEs, M
d2 q + BT λ = F , dt2 C(q, t) = 0,
(12.16a) (12.16b)
where M = M (q, t) is the symmetric, semi positive-deſnite mass matrix, and F = F (q, q, ˙ t) the array of dynamic and externally applied forces. For simplicity of the exposition, the constraints are all assumed to be holonomic, but the derivation presented here equally applies to nonholonomic constraints, or a mixture thereof. To ease the discussion, the damping and stiffness matrices will be explicitly shown in the equations of motion, and eqs. (12.16a) and (12.16b) are restated as M
d2 q dq + K q + B T λ = G, +D 2 dt dt C(q, t) = 0,
(12.17a) (12.17b)
where D = D(q) is the damping matrix, K = K(q) the stiffness matrix, and G = G(q, q˙ , t) the array of remaining dynamic and externally applied forces. Following the advice of Vanderplaats [233] for optimization problems, constraints are normalized so as to become of the order of unity. This can be readily achieved by introducing normalized generalized coordinates, ˆq , such that q = r ˆq , where r is a reference or characteristic length of the system. For dynamical systems, it is also important to introduce a normalized time variable, τ , such that t = hτ , where h is the time step size. The equations of motion, eqs. (12.17a) and (12.17b), have not yet been discretized in time, but the time step size is anticipated to become an important characteristic time of the problem from a numerical standpoint. The equations of motion now become M ¨ˆq + hD ˆq˙ + h2 K ˆq + B T h2 λ = h2 G,
(12.18a)
C(ˆq , τ ) = 0.
(12.18b)
Matrices M , D, K, and B as well as arrays G and C are now expressed in terms of the normalized generalized coordinates. Matrices M , D, and K have been multiplied ˙ is used to by r ; for simplicity, the same notation is used from here on. Notation (·) denote a derivative with respect to the non-dimensional time, τ . The equations of motion, eqs. (12.18a), were multiplied by h2 to avoid division by a potentially small number, h2 . A cursory examination of the normalized equations of motion, eqs. (12.18a) and (12.18b), reveals two obvious numerical problems. First, if the mass and/or damping and/or stiffness of the system become large, one or more of the ſrst three terms of the equations of motion will become large, whereas the constraint equations
12.6 Scaling of Lagrange’s equation of the ſrst kind
493
remain unchanged. In other words, for systems with large mass, damping, or stiffness, the constraint equations become “invisible” to the numerical process. Second, the unknowns of the problem are of different orders of magnitude: displacements are typically very small quantities, but Lagrange’s multipliers are force quantities, and hence, typically much larger, potentially by many orders of magnitude. The ſrst problem is easily solved by multiplying the constraint equations, eqs. (12.18b), by a scalar factor, called the scaling factor, s, to render the constraint equations and the equations of motion, eqs. (12.18a), of comparable magnitudes. Clearly, selecting s = mr + dr h + kr h2 accomplishes this goal. In this expression, mr , dr , and kr represent characteristic mass, damping and stiffness coefſcients of the system, which can be selected as mr = M ∞ , dr = D∞ and kr = K∞; another convenient choice is to select mr , dr , and kr as the average of the diagonal terms of the mass, damping and stiffness matrices, respectively. The second problem can be solved by scaling Lagrange’s multipliers by writing ˆ Clearly, in view of Newton’s law, selecting s = mr + dr h + kr h2 , h2 λ = sλ. ˆ makes λ a quantity of magnitude comparable to that of displacement quantities. The equations of motion of the problem, eqs. (12.18a) and (12.18b), now become ˆ = h2 G, M ¨ˆq + hD ˆq˙ + h2 K ˆq + B T sλ
(12.19a)
sC = 0.
(12.19b)
The techniques used here are well-known scaling techniques for systems of equations, as discussed in textbooks on matrix computations. For instance, Golub and Van Loan [82] state: “The basic recommendation is that the scaling of the equations and unknowns must proceed on a problem-by-problem basis. General scaling strategies are unreliable. It is best to scale (if at all) on the basis of what the source problem proclaims about the signiſcance of each aij [i.e., each matrix entry].” In the proposed scaling strategy, the scaling factor was selected on the basis of physical arguments about the nature and order of magnitude of each term appearing in the equations of motion. At this point, it is convenient to simplify the notation and write the scaled governing equations of index-3 multibody systems as ˆ = h2 F , M ¨ˆq + B T sλ
(12.20a)
sC = 0,
(12.20b)
where the scaling factor is deſned as, s = mr + dr h + kr h2 .
(12.21)
˙ indicates a derivative with respect to the non-dimensional time, τ = t/h, Notation (·) and all generalized coordinates have been normalized by the reference length, r . 12.6.2 The augmented Lagrangian term An augmented Lagrangian term is now added to the scaled formulation of the equations of motion, as proposed by Bayo et al. [151, 152],
494
12 Constrained systems: numerical methods
ˆ + B T ρsC = h2 F , M ¨ˆq + B T sλ
(12.22a)
sC = 0.
(12.22b)
The penalty factor, ρs, was deſned as the product of the scaling factor deſned in eq. (12.21) by ρ; for ρ = 1, the penalty factor is equal to the scaling factor. A set of modiſed Lagrange’s multipliers, ˆ + ρC, μ ˆ=λ
(12.23)
is introduced to simplify the above equations, leading to M ¨ˆq + B T sˆ μ = h2 F ,
(12.24a)
sC = 0.
(12.24b)
The equations of motion were scaled ſrst, then the augmented Lagrangian term was added. Had this latter term be added from the onset of the formulation, the penalty factor would become h2 p, i.e., the penalty factor would vanish for small time step sizes, negating any advantage this term could have. It is possible to include the augmented Lagrangian term from the onset of the formulation by using a penalty factor written as ρ¯s = ρs/h2 , which yields results identical to those presented here. 12.6.3 Time discretization of the equations To understand the implications of the scaling factor and augmented Lagrangian term presented above, the equations of motion will now be discretized in the time domain. A simple mid-point scheme is used for this task M (ˆ v f − vˆi ) + B Tm sˆ μ = h2 F m ,
(12.25a)
ˆq f − ˆq i = (ˆ v i + vˆf )/2,
(12.25b)
sC m = 0.
(12.25c)
Subscripts (·)i and (·)f indicate quantities at the beginning and end times of the time step, denoted ti and tf , respectively, B m = (B i + B f )/2, C m = (C i + C f )/2, F m = (F i +F f )/2, and μ ˆ m are the mid-point, modiſed Lagrange multipliers. Equation (12.25b) is the discretized velocity-displacement relationship obtained from the mid-point rule; with the present notation, vˆ = ˆq˙ = dˆq /dτ = h dˆq /dt. In view of the scaling of the time dimension performed in the previous section, the formulæ associated with time discretization are independent of the time step size, which is, in fact, taken to be unity; see eq. (12.25b), for example. This means that the time step size dependency of the various terms of the equations of motion indicated in eqs. (12.24a) and (12.24b) will not be affected by the time discretization, no matter what time integration scheme is used. The unknown velocity, vˆf , is easily eliminated from the discretized equations, leading to
12.6 Scaling of Lagrange’s equation of the ſrst kind
495
2M (ˆq f − ˆq i − vˆi ) + B Tm sˆ μ = h2 F m ,
(12.26a)
sC m = 0.
(12.26b)
Next, these nonlinear algebraic equations will be solved using a NewtonRalphson iterative process based on the following set of linear algebraic equations ˆ x = −ˆb. JΔˆ
(12.27)
ˆ is The Jacobian of the system, J,
2M + s(B T μ ˆ ),ˆq − h2 F ,ˆq sB T sC ,ˆq 0 Jˆ Jˆ = ˆ11 12 , J 21 0
Jˆ =
(12.28)
, m
(12.29)
where the notation (·),ˆq was used to indicate a derivative with respect to the generalized coordinates, and subscript [·]m indicates that the Jacobian matrix is evaluated The corrections to the unknowns of the problem are Δˆ xT = 4 3 at the Tmid-point. ˆ , and the residual array is Δˆq Tf , Δλ m ˆb =
μ − h2 F 2M (ˆq f − ˆq i − vˆi ) + B T sˆ sC
, .
(12.30)
m
The asymptotic behavior of the Newton corrections, Δˆ x, as the time step size tends to ˆ and the right-hand zero depends on the asymptotic behavior of both the Jacobian, J, side, ˆb. In fact, ˆ x) = lim (J) ˆ lim (Δˆ lim (JΔˆ x) = − lim ˆb, (12.31) h→0
h→0
h→0
h→0
ˆ = O(h0 ) and limh→0 (ˆb) = O(h0 ), limh→0 (Δˆ and therefore, if limh→0 (J) x) = 0 O(h ). The following results are easily obtained from examination of eqs. (12.28) and (12.30), O(h0 ) O(h0 ) O(h0 ) ˆ . (12.32) Jˆ = , and b = O(h0 ) O(h0 ) 0 Furthermore, it is readily veriſed that the inverse Jacobian matrix is −1 O(h0 ) O(h0 ) ˆ . J = O(h0 ) O(h0 )
(12.33)
ˆ = It then follows that the condition number of the Jacobian matrix, κ(J) −1 0 ˆ ∞ Jˆ ∞ , is clearly independent of the time step size, κ(Jˆ) = O(h ). In view J of eqs. (12.27) and (12.31), it follows that
496
12 Constrained systems: numerical methods
Δˆq f = O(h0 ),
ˆ = O(h0 ). Δλ m
(12.34)
Of course, scaling of the variables has to be considered when the criterion for convergence of Newton iterations is evaluated. This behavior is markedly different from what happens when scaling of the equations is not performed. Indeed, applying the mid-point time discretization to the unscaled, augmented equations of motion, eqs. (12.16a) and (12.16b), leads to dq 2M (q f − q i − h i ) + B Tm μm = F m , 2 h dt C f = 0,
(12.35a) (12.35b)
where the unscaled modiſed Lagrange multiplier is deſned as μ = λ + ρC. A Newton-Ralphson approach is taken again to solve this set of nonlinear algebraic equations; linearization leads to J Δx = −b, where the Jacobian of the system, J, is 2M /h2 + (B T μ),q − F ,q B T , (12.36) J= C ,q 0 m
and the residual array is ⎧ ⎫ dq i ⎨ 2M ⎬ T (q f − q i − h )+B μ−F 2 b= . h dt ⎩ ⎭ C
(12.37)
m
The following results are easily obtained from examination of eqs. (12.36) and (12.37), O(h−2 ) O(h0 ) O(h−2 ) . (12.38) J= , and b = O(h0 ) 0 O(h0 ) The inverse Jacobian matrix is J −1 =
O(h2 ) O(h0 ) . O(h0 ) O(h−2 )
(12.39)
It then follows that the condition number of the Jacobian matrix, κ(J), exhibits a strong dependency on the time step size, κ(J) = O(h−4 ), and Δq f = O(h0 ),
Δλm = O(h−2 ).
(12.40)
Example 12.1. The simple pendulum Figure 12.1 depicts a simple pendulum of length and bob of mass m. In this example, the root torsional spring is ignored. This problem will be treated with two generalized coordinates: the bob’s horizontal
and vertical Cartesian coordinates, denoted x1 and x2 , respectively, q T = x1 , x2 . Because the system features a single degree of freedom, a single constraint must be enforced: the pendulum arm must remain of constant length, . The governing equations of the problem are
12.6 Scaling of Lagrange’s equation of the ſrst kind
M
d2 q + B T λ = 0, dt2 C = 0,
497
(12.41a) (12.41b)
where M = diag(m, m), B = 2q T , C = q T q − 2 , and λ = λ1 . The Jacobian of the unscaled system is readily obtained from eqs. (12.41a) and (12.41b) as 2M /h2 + (B T λ),q B T J= . (12.42) C ,q 0 m
i2
k f
l
i1
m
Fig. 12.1. Conſguration of the pendulum with a root torsional spring.
These equations of motion can be scaled then augmented using the proposed approach, and with the help of the mid-point time discretization method, the Jacobian of the linearized system then becomes ˆ ),ˆq sB T 2M + s(B T μ ˆ J= . (12.43) sC ˆq 0 m
It is readily veriſed that all blocks of this Jacobian and of the corresponding righthand side are O(h0 ). For this simple problem, this is true even without the augmented Lagrangian term, i.e., even if ρ = 0. Example 12.2. Simple pendulum with torsional spring The problem treated in example 12.1 will now be repeated with the addition of the root torsional spring of stiffness constant k, as depicted in ſg. 12.1. Three generalized coordinates will be used here: the bob’s horizontal and vertical Cartesian coordinates, and the root rotation angle, φ. Since the system features a single degree of freedom, two constraints must be enforced, the pendulum arm must remain of constant length, , and angle φ can be obtained from elementary trigonometric considerations. The governing equations of this problem are d2 q + B T λ = 0, dt2 kφ + C2,φ λ2 = 0,
(12.44b)
C = 0,
(12.44c)
M
(12.44a)
498
12 Constrained systems: numerical methods
where Cφ = cos φ, Sφ = sin φ, λT = λ1 , λ2 , C T = C1 , C2 , C1 = q T q − 2 , C2 = q1 Cφ + q2 Sφ , and 2q1 Cφ . (12.45) B= 2q2 Sφ The relative rotation angle, φ, is an algebraic variable, which, in contrast with the Lagrange multipliers λ, appears explicitly in the constraint equations, eq. (12.44b). This equation simply represents the static equilibrium of the spring and hence, involves no time derivative of this angle. The explicit deſnition of the relative rotations at the root readily allows for the introduction of root spring. The Jacobian of the unscaled system is readily obtained from eqs. (12.44a) to (12.44c) as ⎡ ⎤ 2M /h2 + (B T λ),q (B T λ),φ BT ⎢ ⎥ (C2,φ λ2 ),q k + (C2,φ λ2 ),φ C T,φ ⎦ . J =⎣ (12.46) C ,q C ,φ 0 m
These equations of motion can be scaled and augmented using the proposed approach, and with the help of the mid-point time discretization method, the Jacobian of the linearized system then becomes ⎡ ⎤ 2M + s(B T μ ˆ ),ˆq s(B T μ ˆ ),φ sB T ⎢ ⎥ ˆ2 ),ˆq h2 k + s(C2,φ μ ˆ2 ),φ sC T,φ ⎦ . Jˆ = ⎣ s(C2,φ μ (12.47) sC ˆq sC φ 0 m
Here again, it is readily veriſed that all blocks of this Jacobian and of the corresponding right-hand side are of O(h0 ). The key to this proof is in the fact that ˆ + sρC = h2 λ + sρC = O(h0 ). In contrast with example 12.1, the augsˆ μ = sλ mented Lagrangian term is indispensable to achieving this result; indeed, if ρ = 0, ˆ = h2 λ = O(h2 ). sˆ μ = sλ Clearly, the proposed scaling of the unknowns and equations is sufſcient to achieve time step size independent Jacobians when the sole algebraic variables of the problem are Lagrange’s multipliers. When the problem involves additional algebraic variables, such as the relative rotation of this example, the scaling of the unknowns and of the equations must be used in conjunction with the augmented Lagrangian term to achieve time step size independent formulations. 12.6.4 Relationship to the preconditioning approach A preconditioning approach for index-3 DAEs was proposed by Bottasso et al. [235, 236]. The starting point of their development is the Jacobian matrix resulting from the linearization of the governing equations (12.16a) and (12.16b). The Jacobian is multiplied by left and right preconditioning matrices, denoted L and R, respectively, such that J¯ = L J R, where L = diag(hαi ) and R = diag(hβi ). The powers of the time step size, i.e., coefſcients αi and βi , are selected to render the preconditioned
12.6 Scaling of Lagrange’s equation of the ſrst kind
499
¯ independent of h. To prevent confusion, it must be noted the scaling Jacobian, J, factor deſned here, s, and that deſned by Bottasso et al., s (but noted s in refs. [235, 236]), are different: s = s/h2 . For the problem presented in example 12.1, the preconditioning and scaling approaches yield identical Jacobian matrices if the preconditioning matrices are selected to be L = diag(h2 , s) and R = diag(1, s/h2 ). For the problem presented in example 12.2, identical Jacobians are obtained by selecting L = diag(h2 , h2 , s) and R = diag(1, 1, s/h2 ). Clearly, left and right preconditioning matrices can be found that will yield identical Jacobians for the two approaches. For the problem presented in example 12.2, a time step size independent Jacobian is only obtained with the addition of an augmented Lagrangian term; indeed, without these terms, the Jacobian becomes ⎡ ⎤ 2M + (B T h2 λ),q (B T h2 λ),φ BT ⎢ ⎥ J¯ = ⎣ (C2,φ h2 λ2 ),q h2 k + (C2,φ h2 λ2 ),φ C T,φ ⎦ . (12.48) C ,q C ,φ 0 m
Clearly, not all blocks of this Jacobian are O(h0 ). The reasons why this feature is desirable is discussed in the next section. While the use of the augmented Lagrangian term was not addressed in refs. [235, 236], it is clear that if such term is added to the equations of motion from the onset of the formulation, the two methods become equivalent. 12.6.5 Beneſts of the augmented Lagrangian formulation In practical implementations of the ſnite element method, the linearized set of governing equations is solved in two steps [198, 82]: ſrst, the system Jacobian is factorized as J = L D LT , where L is a lower triangular matrix and D a diagonal matrix, and second, the solution is found by back-substitution. The advantage of this approach is that it preserves the banded structure of the Jacobian, if its factorization is performed without pivoting. In general, factorization of the Jacobian without pivoting is numerically unstable, unless the Jacobian is symmetric and positive-deſnite [82]. This is always the case for the stiffness and mass matrices of structures because they can be derived from the minimization of quadratic energy functionals. Consequently, factorizations without pivoting, also called “skyline solvers,” are used systematically in ſnite element codes. The Jacobian matrices of constrained multibody systems, however, are not identical to the mass and stiffness matrices of structures. Consider the Jacobian obtained without the augmented Lagrangian term given by eq. (12.48), and note the presence of the factor h2 along some columns of the matrix. Consider next the very simple linear system, J x = b, where ⎧ ⎫ ⎤ ⎡ 1 0 0 ⎨1⎬ (12.49) J = ⎣0 h2 1⎦ , and b = 1 , ⎩ ⎭ 1 0 1 0
500
12 Constrained systems: numerical methods
which shares the characteristics of eq. (12.48); although symmetric, the Jacobian is not positive-deſnite. It is easy to show that the condition number of this Jacobian is unity, and for h = 0.001, the exact solution is x1 = x2 = 1, and x3 = 0.999999. Using ſnite precision arithmetic with ſve signiſcant digits, the solution of the system with full pivoting yields x1 = x2 = 1, and x3 = 0.99999, whereas solution without pivoting leads to an incorrect answer, x1 = 1, x2 = 10, and x3 = 0.99999. Clearly, when using a skyline solver, i.e., when factorization of the Jacobian is performed without pivoting, the condition number of the system matrix is not a good indicator of the accuracy of the solution. While a low condition number is a necessary condition for obtaining accurate solutions of linear problems, it is not a sufſcient condition when skyline solvers are used. Consider, for instance, the Jacobian matrices of example 12.2 deſned in eqs. (12.47) and (12.48), obtained with and without the augmented Lagrangian term, respectively. Because of the presence of the multiplicative factor, h2 , across entire columns of the Jacobian in eq. (12.48), pivoting will be required to ensure accurate solutions. On the other hand, all the sub-matrices of the Jacobian obtained with the present scaling approach, see eq. (12.47), are independent of the time step size, enabling the safe use of skyline solvers. The augmented Lagrangian term of the proposed formulation was shown above to be key to achieving time step size independent Jacobians, see eq. (12.28). The ˆ ,ˆq − Hessian of the system, see eq. (12.29), can be expressed as Jˆ11 = 2M + s(B T λ) T h2 F ,ˆq + sρB B, where the last term represents the contribution of the penalty term and provide two further beneſts. First, consider the problem described in example 12.2 and assume that the system is at rest at t = 0. Because the ſrst Lagrange multiplier represents the tension in the rod and the second the moment in the spring, it is clear that λ = 0 at t = 0. In the absence of penalty term, i.e., for ρ = 0, the Jacobian of the linearized system at that instant becomes ⎡ ⎤ 2M 0 sB T ⎢ ⎥ Jˆ = ⎣ 0 0 sC T,φ ⎦ . (12.50) sC ˆq sC φ 0 m
Although this Jacobian is not singular, a skyline solver will obviously fail if pivoting is not used. Clearly, if a skyline solver is used, the augmented Lagrangian term is indispensable to the success of the simulation’s ſrst time step. Second, Gill et al. [237] showed that there always exists a ρ∗ such that the Hessian of the augmented Lagrangian, Jˆ11 , is positive-deſnite for all ρ > ρ∗ . As mentioned earlier, positive-deſniteness is key to the reliable use of skyline solvers: this ∗ ∗ implies that the sub-system Jˆ11 Δˆ x∗ = −ˆb , where x ˆ ∗ and ˆb are vectors of appropriate dimensions, can be solved without pivoting. Experience shows that ρ = 1 is a good choice; this implies that the penalty factor is taken equal to the scaling factor. Finally, now that it has been proved that the Hessian of the augmented Lagrangian, Jˆ11 , can be factorized without pivoting, it must also be proved that the complete solution can be obtained without pivoting. At ſrst, consider a system with a single constraint: Jˆ12 and Jˆ21 are then column and row vectors, respectively. Since
12.6 Scaling of Lagrange’s equation of the ſrst kind
501
the constraint matrix is assumed to be of full rank, its single column, Jˆ12 , must contain a least one non-zero element, and hence, factorization without pivoting can proceed safely. Mathematical induction then implies that factorization without pivoting can proceed for systems with an arbitrary number of constraints, for as long as columns and rows of Jˆ12 and Jˆ21 , respectively, are linearly independent, a property that is guaranteed by the fact that the constraint matrix is of full rank. As a last note of interest, the proof presented above assumes that the degrees of freedom of the system are segregated: ſrst, all the generalized coordinates of the system, then, Lagrange’s multipliers. In practice, this ordering is not desirable because it does not minimize the bandwidth of the system of equations. It can be easily shown that generalized coordinates and Lagrange’s multipliers can be interspersed, as desired for minimization of the bandwidth, while still using a skyline solver. The only requirement is that Lagrange’s multipliers must be placed after the generalized coordinates that participate in the corresponding constraint equation, as was already observed by Cardona [31]. 12.6.6 Using other time integration schemes While the proposed scaling method has been presented so far within the framework of the mid-point time integration scheme, it is easily extended to the more advanced integration methods that are used for the analysis of realistic mechanical systems. Consider, for example, the generalized-α method [136] applied to the scaled general equations of motion of a multibody system, see eqs. (12.24a) and (12.24b). The resulting discretization is Ma ˇ + sB T ˇ = h2 F ,
(12.51a)
sC = 0.
(12.51b)
Here, the mass matrix, constraints, constraint matrix, and forces are evaluated using the algorithmic variables deſned by eqs. (17.38) and (17.50). The corresponding variables at the end of the time step are related to their values at the beginning of the time step through eqs. (17.41). Linearization of eqs. (12.51a) and (12.51b) with respect to these increments yields a system of algebraic equations identical to eq. (12.27) with a Jacobian matrix presenting the same structure as in eq. (12.29), where the sub-matrices are Jˆ11 = (1 − αM )/β M + h2 (1 − αF )γ/β F ,ˆv + h2 (1 − αF )F ,ˆq + s(B T μ ˆ ),ˆq , T Jˆ = s(1 − αF )B , and Jˆ = s(1 − αF )C , respectively, and their asymp12
21
,ˆ q
totic behavior is independent of the time step size as was observed for the simple mid-point scheme. The developments presented above can be repeated for other integration schemes such as the well-known HHT scheme [135], implicit Runge-Kutta methods including the class of RADAU schemes [26], or backward difference formulæ (BDF) [96]. In all cases, the application of the time integration scheme to the proposed scaled equations, see eqs. (12.24a) and (12.24b), leads to a Jacobian matrix that is independent of the time step size.
502
12 Constrained systems: numerical methods
Example 12.3. Scaling of a simple pendulum problem Consider the simple pendulum problem described in example 12.2 and depicted in ſg. 12.1, with m = 1 kg, k = 10 N·m/rad, and = 1 m, simulated within the time range t ∈ [0, 1] sec. Table 12.1 lists the condition numbers of iteration matrix, κ(J ), at convergence of the last time step, for time step size h ∈ [10−1 , 10−5 ] s. Table 12.1. Condition numbers of the iteration matrix at convergence of the last time step. h 1 10−1 5 10−2 1 10−2 5 10−3 1 10−3 5 10−4 1 10−4 5 10−5 1 10−5
No s = 1 in s from scaling eq. (12.21) eq. (12.21) 4 104 10. 12. 6 105 8.9 13. 3 108 9.2 14. 5 109 9.2 14. 3 1012 9.2 14. 5 1013 9.2 14. 3 1016 9.2 14. 5 1017 9.2 14. 3 1020 9.2 14.
Table 12.2. Condition numbers of the iteration matrix at convergence of the last time step. Mass 10−2 10−1 100 101 102 103 104
No s = 1 in s from scaling eq. (12.21) eq. (12.21) 3 106 2 101 13. 3 108 9 100 14. 3 1010 4 102 14. 3 1012 3 104 14. 3 1014 3 106 14. 3 1016 3 108 14. 3 1018 3 1010 14.
The second column of table 12.1 lists the condition numbers in the absence of scaling. As predicted, κ(J) = O(h−4 ), clearly demonstrating the need for scaling. The next two columns list the condition numbers with scaling factors s = 1 and s selected according to eq. (12.21), for the third and fourth columns, respectively. Example 12.4. Scaling of a simple pendulum problem Next, the same problem is solved with a ſxed time step size, h = 0.01 s, and ſxed spring stiffness constant, k = 10 N·m/rad, but for a range of mass values, m ∈ [10−2 , 104 ] kg. Table 12.2 lists the condition numbers of iteration matrix, κ(J ), at convergence of the last time step. The second column of table 12.2 lists the condition numbers in the absence of scaling. As the mass of the system increases, the condition number of the Jacobian matrix increases, demonstrating here again the need for scaling. The next two columns list the condition numbers with scaling factors s = 1 and s selected according to eq. (12.21), for the third and fourth columns, respectively. These results highlight the importance of scaling the problem with respect to its dependency on physical properties. Selecting the scaling factor according to eq. (12.21) renders the condition number of the Jacobian independent of the value of the mass. Of course, varying the spring stiffness constant would yield similar results. Example 12.5. Flexible beam actuated by a crank Figure 12.2 depicts a cantilevered beam of length L = 1 m actuated by a crank mechanism. The beam has a rectangular cross-section of depth h = 0.1 m and width w = 2.5 mm; it is made of aluminum of Young’s modulus E = 73 GPa and Poisson’s
12.6 Scaling of Lagrange’s equation of the ſrst kind
503
ratio ν = 0.3. This beam is modeled by eight cubic beam elements; the geometrically exact beam element formulation is described in section 16.3. The tip of the beam is connected to a spherical joint at point C by means of a short connector modeled by two cubic elements and featuring physical properties identical to those of the beam. In turn, the spherical joint is connected to a ƀexible steel (E = 210 GPa and ν = 0.3) link of length L = 0.5 m with a hollow circular cross-section of outer radius Ro = 15 mm and thickness t = 8 mm. i3
L = 1m Beam
T C
+
+
R Spherical joint Revolute joint
+
Cross-section d = 5 mm Connector h= 0.1 m T C w = 2.5 mm ~ Link
Link Crank
i1 Ll = 0.5m
G L Lc = 30 mm
Fig. 12.2. Beam actuated by a tip crank.
Next, the link connects to a crank of length Lc = 30 mm by means of a revolute joint at point L; the crank’s cross-section is identical to that of the link. Finally, a revolute joint connects the crank to the ground at point G. Points G, L, and C deſne the plane of the crank-link mechanism, which is offset by a distance d = 5 mm from plane (¯ı1 , ¯ı3 ) of the cantilevered beam. The relative rotation of the revolute joint at point G is prescribed as φ = 1.6(1 − cos 2πt/T ) rad, where T = 1.6 s. As the crank rotates up, the vertical transverse shear force in the beam increases, and the beam buckles laterally. Figure 12.3 shows the three displacement components at the beam’s mid-point: at about 0.05 s in the simulation, the lateral displacement component, u2 , suddenly increases. Lateral buckling is accompanied by rotation of the beam’s mid-section. The following observations will be made concerning this simulation. First, in the absence of augmented Lagrangian terms, the simulation failed at the ſrst iteration of the ſrst time step. Indeed, at the ſrst time step, the Jacobian of the system presents a structure similar to that presented in eq. (12.50), and the skyline solver fails to factorize the Jacobian. Next, augmented Lagrangian terms were included in the simulation, but no scaling was used, i.e., s = 1 was selected. In this case, the skyline solver was able to factorize the Jacobian at the ſrst time step, but iterations failed to converge because of the poor conditioning of the system. Finally, when using the proposed scaling, the simulation ran smoothly to completion, as shown in ſg. 12.3.
504
12 Constrained systems: numerical methods 12
x 10-3
10-1
ERROR IN DISPLACEMENTS
10
DISPLACEMENTS [m]
8 6 4 2 0
10
10 10 10
10
10
10
0.15
0.2
0.25
0.3
0.35
0.4
TIME [s]
Fig. 12.3. Displacement components at the beam’s mid-span. Component u1 : solid line; u2 : dashed-dotted line; and u3 : dashed line.
-5
-7
-2
0.1
-3
-4
10-6
-4 0
0.05
-2
-8
-9
10
2
3
10
4
1/h [1/s]
10
5
10
Fig. 12.4. Three integration schemes. Radau IIA: solid line; energy decaying scheme: dashed-dot line; HHT: dashed line.
The applicability of the proposed scaling to various time integration schemes will also be demonstrated here. Simulations were run with three integration schemes: the Radau IIA scheme [26], the energy decaying scheme [218], and the HHT scheme [135]. Figure 12.4 demonstrates the convergence characteristics of the three schemes by plotting the solution error as a function of the inverse of the time step size. Errors were computed with respect to a reference solution obtained by using the Radau IIA scheme with a very small time step size, h = 5 μsec. Note the good convergence of all three schemes, even for very small time step sizes.
12.7 Conclusions This chapter has presented a comprehensive review of the numerical tools used for the enforcement of constraints in multibody systems. The classical formulation of Lagrange’s equations of the ſrst kind yields index-3, differential-algebraic equations. In view of the difſculties associated with the solution of these equations, several approaches have been used to algebraically eliminate Lagrange’s multipliers. The ſrst approach is to use Maggi’s formulation, which plays a pivotal role in constrained dynamics, although its importance was not initially recognized. The selection of a set kinematic characteristics is required, and for both holonomic and nonholonomic systems, constraints are enforced at the velocity level. The central ingredient of the approach is the null space of the constraint matrix. The next two approaches are the index-1 formulation that is widely used in multibody codes, and the null space formulation that has also received considerable attention. Both methods enforce constraints at the acceleration level, and hence, signiſcant drift of numerical solutions should be expected. Finally, Udwadia and Kalaba’s formulation provides new insight to the behavior of constrained dynamical system. Here again, constraints are enforced at the acceleration level, but the use of MoorePenrose inverses provides increased generality and robustness. The salient features of these four approaches were reviewed and compared.
12.7 Conclusions
505
Extensive mathematical studies of DAEs concluded that the best approach for the solution of DAEs is the reduce their index. Consequently, two distinct avenues of research were pursued: ſrst, coordinate reduction techniques were developed to reduce the DAEs to ODEs, and second, index reduction techniques were proposed that bring the DAEs index from 3 to 2 or 1. Maggi’s method has been extensively used as a coordinate reduction technique that transform DAEs into ODEs. A distinction was made between Maggi-like methods that do not eliminate Lagrange’s multipliers and true Maggi formulations for which these multipliers are completely eliminated. Many of these methods only differ by the numerical process used to compute the null space of the constraint matrix. This contrasts with the extraction procedure that evaluates the null space based on kinematic considerations. The null space formulation has also been used to obtain ODEs. Numerical implementations of Udwadia and Kalaba’s formulation inherits the advantages of this powerful technique. Finally, more geometric arguments form the basis of the projective formulation, which uses the concepts of tangent and orthogonal subspaces to obtain ODEs. Index reduction techniques are formal mathematical procedures that reduce the index of DAEs from 3 to 2 or 1. In many cases, properties of the proposed schemes are proved mathematically. While the order of accuracy of the solution is often provided, the problem of violation of the constraints was rarely addressed. Due to approximations and round-off errors, many numerical solutions do not satisfy the constraints exactly, a phenomenon known as “drift.” Numerous constraint violation stabilization techniques have been developed to remedy this problem, but Baumgarte’s method is probably the most widely used. It presents two shortcomings: ſrst, constraints are never exactly satisſed, and second, no general procedure exists to determine the problem dependent parameters appearing in the formulation. Consequently, this approach cannot be recommended for general use in multibody systems. Penalty based formulations have also been used to control the drift phenomenon. The augmented Lagrangian formulation is probably the most robust and efſcient method of that type. Next, constraint violation elimination techniques have been developed to enforce the exact satisfaction of the constraints, at least to machine accuracy. In those approaches, the solution obtained through the approximate time integration process is corrected or perturbed to satisfy the constraints. This correction is typically based on geometric concepts: the approximate solution is projected onto the constraint manifold and an iterative process is required to completely eliminate constraint violations. Next, new algorithms have been developed for the enforcement of constraints within the framework of the ſnite element method, which has gained popularity for multibody dynamics applications. Based on the physical concepts of energy preservation and vanishing of the work done by the constraint forces, robust algorithms have been developed that present mathematical proofs of stability, but at the expense of more complex discretization schemes, moving away from the traditional “black box” approach to time integration schemes. Typically, constraints are satisſed to machine accuracy at both displacement and velocity levels.
506
12 Constrained systems: numerical methods
For the several past decades, the numerical solution of DAEs has been known to be fraught with difſculties, mainly due to their undesirable behavior for vanishingly small time step sizes. The importance of scaling of both equations of motion and solution ſelds has been underlined and the following facts have been established. 1. Scaling can be performed at the level of the equations of motion, prior to time discretization. By curing problems a priori, beneſts are reaped for all time integration schemes. 2. The proposed scaling factor depends on both time step size and system physical properties, further improving the numerical conditioning of the problem. 3. In multibody formulations, algebraic variables stem from the presence of Lagrange’s multipliers, but also from the deſnition of additional algebraic variables such as relative motions. In such cases, scaling in conjunction with an augmented Lagrangian term was shown to yield time step size independent Jacobians. 4. The combined use of scaling and augmented Lagrangian term also enables the safe use of sparse linear equation solvers that do not rely on pivoting to ensure stable, accurate solutions. While ſnite element codes routinely rely on such skyline solvers, their safe use for DAEs has been justiſed and considerably improves the efſciency of the solution process. Although further theoretical work is needed before more general conclusions can be drawn, the following facts are emerging from the discussion presented in section 12.6 and in refs. [234, 235, 236, 238]. 1. High index DAEs, once properly scaled, are not more difſcult to integrate than ODEs. Unless leading to computational savings, there is no reason to avoid Lagrange’s multipliers, the main source of algebraic variables. 2. While numerous researchers have advocated the use of speciſc time integration schemes to overcome the ill-conditioning of the linearized index-3 equations, section 12.6 shows that these problems can be resolved a priori, for all stable integration schemes. Furthermore, scaling does not alter the basic properties of time integration schemes. If an integration scheme is energy preserving, its application to scaled equations of motion will still preserve energy. 3. Promoting index reduction techniques to avoid the perceived numerical problems associated with DAEs might be ill advised: section 12.6 results indicate that these techniques might not required. Furthermore, they might create difſculties that were not present in the original formulation based on DAEs; for instance, index reduction techniques often enforce constraints through their higher-order derivatives, leading to the drift phenomenon, which does not affect the direct solution of high-index DAEs. While the drift problem may be alleviated or completely eliminated by the use of projections onto the constraint manifold, the index-3 approach is conceptually simpler and possibly more efſcient since it does not incur the extra costs of computing and applying projection operators. 4. The results presented in section 12.6 do not provide a general approach to the solution of DAEs. Indeed, the approach presented in that section only deals with the index-3 DAEs stemming from the modeling of mechanical systems, which
12.7 Conclusions
507
present a linear dependency on generalized accelerations, on Lagrange’s multipliers, and on the generalized velocities appearing in nonholonomic constraints. The results presented in section 12.6 are limited to DAEs presenting these three characteristics.
Part V
Parameterization of rotation and motion
13 Parameterization of rotation
The effective description of rotations has led to the development of numerous parameterization techniques presenting various properties and advantages, as described in the following review papers [239, 240, 241, 242, 243, 244, 245]. Whether originating from geometric, algebraic, or matrix approaches, parameterization of rotation is most naturally categorized into two classes: vectorial and non-vectorial parameterizations. The former refers to parameterization in which a set of parameters, sometimes called rotational “quasi-coordinates,” deſne a geometric vector, whereas the latter cannot be cast in the form of a vector. These two types of parameterizations are sometimes denoted as invariant and non-invariant parameterization, respectively. The Cartesian rotation vector, the Euler-Rodrigues parameters, or the WienerMilenkovi´c parameters all are examples of vectorial parameterizations. These are all characterized by a minimal set of three parameters, which behave as the Cartesian components of a vector in three-dimensional space. Non-vectorial parameterizations, on the other hand, may be either minimal, as in the case of Euler angles, or “redundant,” as for Euler parameters, Cayley-Klein parameters, and the matrix of direction cosines. Redundancy arises when more than three parameters are employed: four in the case of Euler and Cayley-Klein parameters, nine in the case of direction cosines. In fact, rotation may be described as the motion of a point on a three-dimensional nonlinear manifold, the Lie group of special orthogonal transformations of the threedimensional space. The various parameterizations of rotation are, in differential geometry terminology, different charts available for this particular manifold. Stuelpnagel [246] provides a concise analysis of different parameterizations of rotations. He shows that the six parameter representation consisting of the ſrst two columns of the rotation tensor yields a set of linear differential equations for the motion of a rigid body. He further proves that a minimum of ſve parameters is required to obtain a bijective mapping of the rotation group. This parameterization yields a set of nonlinear equations of motion for a rigid body and is not recommended for practical applications. Four parameter representations, such as the quaternion representation [247, 248, 249], are singularity free, in contrast with minimal set parameterizations, which he proves to always involve singularities.
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_13 © Springer Science+Business Media B.V. 2011
512
13 Parameterization of rotation
The various parameterization techniques detailed in the literature present distinct advantages and drawbacks. Advantages can be of a theoretical nature, such as ease of geometric interpretation, or convenience in algebraic manipulations, for instance, or of a computational nature, low cost function evaluations, wide range of singularityfree behavior, etc. These features provide guidelines for selecting parameterizations that are best suited for speciſc applications. A survey of the literature reveals, however, that for both theoretical and numerical applications, the choice of parameterization is often based on personal taste and traditions rather than cost/beneſt considerations. Section 13.1 presents an algebraic description of rotation that contrasts with the geometric approach developed in chapter 4. Cayley’s elegant formulation is introduced based on the fundamental property of the rotation operation: preservation the length of the rotated vector. Next, section 13.3 introduces the well-known Euler parameters [247, 246, 250, 249] that provide an elegant, purely algebraic representation of rotation. When using the quaternion algebra presented in section 13.2, all rotation operations become bi-linear expressions of quaternions. These advantages, however, come at a high cost: four parameters must be used instead of three, i.e., Euler parameters do not form a minimum set. Euler’s theorem on rotations, see section 4.5, states that an arbitrary motion of a rigid body that leaves one of its point ſxed can be represented by a single rotation of magnitude φ about unit vector n ¯ . It is readily shown that the associated rotation tensor, R, possesses a positive unit eigenvalue and the corresponding eigenvector is n ¯ , see section 4.7. The vectorial parameterization of rotation is introduced in section 13.4 and consists of minimal set of parameters deſning the components of a rotation parameter vector, p = p(φ)¯ n, where p(φ) is the generating function. The vectorial parameterization of rotation presents two fundamental properties. First, it is tensorial in nature: the tensorial nature of the second-order rotation tensor implies and is implied by the tensorial nature of the rotation parameter vector, a ſrst-order tensor. Second, rotation parameter vectors are parallel to the eigenvector of the rotation tensor corresponding to its unit eigenvalue. Because these two properties imply each other, either can be taken as the deſnition of the vectorial parameterization of rotation. A parameterization of rotation is tensorial if and only if the rotation parameter vector is parallel to the eigenvector of the rotation tensor associated with its unit eigenvalue. The Cartesian rotation vector, the Cayley-Gibbs-Rodrigues parameters, or the Wiener-Milenkovi´c parameters all are special cases of the vectorial parameterization of rotation corresponding to speciſc choices of the generating function. Furthermore, these parameterizations are recovered as members of two different families: the sine and the tangent family. The occurrence of singularities in the proposed vectorial parameterization is the focus of section 13.6. Finally, section 13.7 details a number of useful parameterizations: the Cartesian rotation vector, the Euler-Rodrigues parameters, the Cayley-Gibbs-Rodrigues parameters, and the Wiener-Milenkovi´c parameters. Euler parameters are closely related to the vectorial parameterization. On the other hand, minimal non-vectorial parameterizations such as Euler and Euler-type
13.1 Cayley’s rotation parameters
513
angles are not easily related to vectorial techniques. Rather, they may be investigated in terms of exponential coordinates of the second kind, in contrast with the exponential parameterization, which is an application of exponential coordinates of the ſrst kind [251].
13.1 Cayley’s rotation parameters In chapter 4, rotation operations were described in geometric terms, based on the visualization of an orthonormal basis B rotating to a new basis E. It is possible, however, to describe rotations without resorting to geometric concepts. Indeed, the fundamental property of rotation operations is to preserve length: the length of a vector is the same when computed from its components resolved in two arbitrary orthonormal bases. Consider an arbitrary vector b and its components, b[B] and b[E] , resolved in bases B and E, respectively. Let R be the rotation tensor that brings basis B to E. The relationship between the components of b in these two bases is given by eq. (4.27) as b[E] = RT b[B] . The basic property of rotation is to preserve the length, , of vector b, i.e., 2 = b[E]T b[E] = b[B]T b[B] , which implies +* + * b[B] − b[E] = 0. (13.1) b[B]T + b[E]T
b
[B]
Vectors f and g are now deſned as f = b[B] − b[E] = (R − I) b[E] , and g = + b[E] = (R + I) b[E] , and the length preservation condition now simply states g T f = 0.
(13.2)
Eliminating b[E] from vectors f and g leads to f = C g, where C = (R − I)(R + I)−1 = (R + I)−1 (R − I).
(13.3)
Tensor R + I is only singular when φ = ±π, because the rotation tensor then possesses an eigenvalue λ = −1. For all other rotations, R + I can be inverted. The second equality of eq. (13.3) is readily obtained by noting that (R − I)(R + I)−1 = (R + I)−1 (R + I)(R − I)(R + I)−1 , where the second and third factors commute. The length preservation condition, eq. (13.2), now becomes gT C g = 0. Rotation operations preserve the length of any arbitrary vector. Consequently, scalar g T C g must vanish for any arbitrary vector g, which implies that C must be a skewsymmetric tensor, i.e., C = a, where a are Cayley’s rotation parameters. The rotation tensor can be expressed in terms of this skew-symmetric tensor by solving eq. (13.3) for R to ſnd R = (I − C)−1 (I + C) = (I + C)(I − C)−1 .
(13.4)
514
13 Parameterization of rotation
The determinant of I − C is 1 + aT a; hence, this matrix is always invertible. In summary, there exists a one to one relationship between an orthogonal tensor R and a skew-symmetric tensor C. Equation (13.4) is known as Cayley’s formula. The structure of the rotation tensor is obtained from eq. (13.4) R=
1 (1 + aT a)I + 2 a + 2 a a . T 1+a a
(13.5)
In expanded form, this becomes ⎡ ⎤ 1 + a21 − a22 − a23 2(a1 a2 − a3 ) 2(a1 a3 + a2 ) 1 ⎣ 2(a1 a2 + a3 ) 1 − a21 + a22 − a23 2(a2 a3 − a1 ) ⎦ . (13.6) R= 1 + aT a 2(a1 a3 − a2 ) 2(a2 a3 + a1 ) 1 − a21 − a22 + a23 The algebraic description of rotation is based on the length preservation property, eq. (13.1). This description implies the speciſc structure of the rotation tensor given by eq. (13.5), which explicitly shows the dependency of the rotation tensor on three parameters only. The geometric nature of these parameters, however, is not evident in this purely algebraic approach.
13.2 Quaternion algebra In section 13.3, it will be shown that rotation operations are conveniently expressed in terms of quaternions. The present preparatory section focuses on the deſnition of quaternions, the derivation of a number of their properties, and the deſnition of the operators that ease quaternion algebra. A quaternion [247] is deſned as an array of four numbers e eˆ = 0 , (13.7) e where e0 is the scalar part of the quaternion and e the vector part of the quaternion. This four component array is not a vector, as it does not transform like a vector, see section 4.8.1. The norm of quaternion pˆ is deſned as ˆ p = pˆT pˆ = p20 + pT p. (13.8) Quaternions operators Quaternion operations are conveniently performed using the following matrices or operators of size 4 × 4
13.2 Quaternion algebra
e A(ˆ e) = 0 e B(ˆ e) =
C(ˆ e) =
e0 e
e0 e
⎡ e0 ⎢e1 −eT =⎢ ⎣e2 e0 I + e e3 ⎡ e0 ⎢e1 −eT =⎢ ⎣e2 e0 I − e e3 ⎡ e0 ⎢e1 eT =⎢ ⎣e2 −e0 I − e e3
515
⎤
−e1 −e2 −e3 e0 −e3 e2 ⎥ ⎥, e3 e0 −e1 ⎦ −e2 e1 e0 ⎤ −e1 −e2 −e3 e0 e3 −e2 ⎥ ⎥, −e3 e0 e1 ⎦ e2 −e1 e0 ⎤ e1 e2 e3 −e0 e3 −e2 ⎥ ⎥. −e3 −e0 e1 ⎦ e2 −e1 −e0
(13.9a)
(13.9b)
(13.9c)
If pˆ is an arbitrary quaternion, these operators enjoy the following properties A(ˆ p)AT (ˆ p) = B(ˆ p)B T (ˆ p) = C(ˆ p)C T (ˆ p) = ˆ p 2 I 4 ,
(13.10)
where I 4 is the 4 × 4 identity matrix. If pˆ and qˆ are arbitrary quaternions, the following matrix products commute A(ˆ p)B T (ˆ q ) = B T (ˆ q )A(ˆ p), A(ˆ p)B(ˆ q ) = B(ˆ q )A(ˆ p), T
C(ˆ p)A (ˆ q ) = B(ˆ q )C(ˆ p),
AT (ˆ p)B(ˆ q ) = B(ˆ q )AT (ˆ p),
(13.11a)
T
T
T
T
(13.11b)
T
T
(13.11c)
A (ˆ p)B (ˆ q ) = B (ˆ q )A (ˆ p), T
C (ˆ p)B(ˆ q ) = A (ˆ q )C (ˆ p).
These identities then imply the following results p)ˆ q = B(ˆ q )ˆ p, A(ˆ
AT (ˆ p)ˆ q = C T (ˆ q )ˆ p,
B T (ˆ p)ˆ q = C(ˆ q )ˆ p.
(13.12)
Next, the following results are easily checked p)A(ˆ q ) = A(ˆ r ) ⇐⇒ rˆ = A(ˆ p)ˆ q = B(ˆ q )ˆ p, A(ˆ
(13.13a)
A(ˆ p)A (ˆ q ) = A(ˆ r ) ⇐⇒ rˆ = C(ˆ p)ˆ q = B (ˆ q )ˆ p,
(13.13b)
A (ˆ p)A(ˆ q ) = A(ˆ r ) ⇐⇒ rˆ = A (ˆ p)ˆ q = C (ˆ q )ˆ p,
(13.13c)
T
T
T
T
T
where the double-headed arrows indicate that the two equalities imply each other. Similarly p)B(ˆ q ) = B(ˆ r ) ⇐⇒ rˆ = B(ˆ p)ˆ q = A(ˆ q )ˆ p, B(ˆ
(13.14a)
B(ˆ p)B T (ˆ q ) = B(ˆ r ) ⇐⇒ rˆ = C T (ˆ p)ˆ q = AT (ˆ q )ˆ p,
(13.14b)
B (ˆ p)B(ˆ q ) = B(ˆ r ) ⇐⇒ rˆ = B (ˆ p)ˆ q = C(ˆ q )ˆ p.
(13.14c)
T
T
p) is deſned as Finally, the skew-symmetric operator S(ˆ T 1 00 = [A(ˆ p) = p) − B(ˆ p)]. S(ˆ 0 p 2
(13.15)
516
13 Parameterization of rotation
Unit quaternions Quaternion eˆ is said to be a unit quaternion if its norm, eq. (13.8), is unity, i.e., ˆ e = 1. In view of identity (13.10), operators A(ˆ e), B(ˆ e), and C(ˆ e) now become orthogonal matrices. A bi-linear operator is now deſned D(ˆ e) = A(ˆ e)B T (ˆ e) = B T (ˆ e)A(ˆ e) = C(ˆ e) C(ˆ e). It is now readily veriſed that for a unit quaternion eˆ, 1 0T , e) = D(ˆ 0 R(ˆ e)
(13.16)
(13.17)
where R(ˆ e) = I + 2e0 e + 2 ee. Orthogonal quaternions Two quaternions, pˆ and qˆ, are said to be orthogonal if pˆT qˆ = 0. For such pair of quaternions, the following identities hold AT (ˆ p)A(ˆ q ) + AT (ˆ q )A(ˆ p) = 0, T
T
B (ˆ p)B(ˆ q ) + B (ˆ q )B(ˆ p) = 0,
A(ˆ p)AT (ˆ q ) + A(ˆ q )AT (ˆ p) = 0. T
T
B(ˆ p)B (ˆ q ) + B(ˆ q )B (ˆ p) = 0.
(13.18a) (13.18b)
13.3 Euler parameters Euler parameters [248, 247, 246, 250, 249, 252] are presented in this section. These parameters lead to a very simple, purely algebraic representation of rotation. Four parameters, however, instead of three, are used in this representation. These four parameters are related by a normality condition, and are thus not independent. The four Euler parameters are deſned as follows e0 = cos
φ , 2
φ e = sin n ¯, 2
(13.19)
where n ¯ is the unit vector about which the rotation of magnitude φ is taking place, according to Rodrigues’ rotation formula, eq. (4.15). Consequently, Euler parameters deſne a rotation operation. Note the redundancy in this representation: four parameters are used instead of three. Of course, these four parameters are linked by the following constraint e20 + e21 + e22 + e23 = 1, (13.20)
where eT = e1 e2 e3 . Euler parameters
are conveniently interpreted as the comT e = 1. ponents of a unit quaternion, eˆ = e0 , e , because eq. (13.20) implies ˆ
13.3 Euler parameters
517
13.3.1 The rotation tensor The rotation tensor can be expressed in terms of the Euler parameters by introducing eqs. (13.19) into Rodrigues’ rotation formula, eq. (4.15), to ſnd R(e) = I + 2e0 e + 2 ee.
(13.21)
Rotation operations using Euler parameters are most easily expressed in terms of quaternions. Instead of working with the 3 × 3 rotation tensor, R, it is easier to work with the 4 × 4 operator D deſned by eq. (13.17), which is closely related to the rotation tensor. Identity (13.16) then gives various expressions for operator D. The rotation tensor is now a purely algebraic function of Euler parameters. 13.3.2 The angular velocity vector The components of the angular velocity vector resolved in the inertial basis, see section 4.10, are obtained from their deſnition, eq. (4.56). Here again, it is easier to work with the 4 × 4 operators deſned in the previous section; indeed, T 1 0T 0 0T 00 ˙ e)D T (ˆ D(ˆ = S(ˆ = e) = ω ). ˙ e) 0 RT (ˆ 0 ω 0 R(ˆ e) The following algebraic manipulations now relate the components of the angular velocity vector to Euler parameters · ˙ e)D T (ˆ D(ˆ e) = A(ˆ e)B T (ˆ e) B(ˆ e)AT (ˆ e) = A(eˆ˙ )AT (ˆ e) + B T (eˆ˙ )B(ˆ e), (13.22)
where identities (13.10) and (13.11a) were used. Note that eˆ˙ T = e˙ 0 , e˙ does not form a unit quaternion. Because B(ˆ e) is an orthogonal operator, B T (eˆ˙ )B(ˆ e) = T −B (ˆ e)B(eˆ˙ ), and eq. (13.22) becomes ω ˆ ω ˆ ˙ e)DT (ˆ e) = A(eˆ˙ )AT (ˆ e) − B T (ˆ e)B(eˆ˙ ) = A( ) − B( ) = S(ˆ ω ), D(ˆ 2 2 where identities (13.13) and (13.14) were used to ſnd ω ˆ = 2B T (ˆ e)eˆ˙ = 2C(eˆ˙ )ˆ e.
(13.23)
The vector part of quaternion ω ˆ is the angular velocity vector. Its scalar part, ω0 , follows from the deſnition of operator B, eq. (13.9b), as ω0 = 2(e0 e˙ 0 + eT e˙ ) = 2ˆ eT eˆ˙ = 0, because eˆ is a unit quaternion. The components of the angular velocity vector resolved in the rotating frame, see section 4.10, are obtained in a similar manner ω ˆ ∗ = 2AT (ˆ e)eˆ˙ = 2C T (eˆ˙ )ˆ e.
(13.24)
Time derivatives of Euler parameters can be related to the angular velocities by inverting eqs. (13.23) and (13.24) to ſnd eˆ˙ = 1/2 B(ˆ e)ˆ ω and eˆ˙ = 1/2 A(ˆ e)ˆ ω∗, respectively. Because operators A(ˆ e) and B(ˆ e) are orthogonal operators for unit quaternions, these relationships are free of singularities.
518
13 Parameterization of rotation
13.3.3 Composition of rotations The concept of composition of rotations was discussed in section 4.9. Consider three unit quaternions pˆ, qˆ, and rˆ such that R(ˆ r ) = R(ˆ p)R(ˆ q ).
(13.25)
The problem at hand is to determine unit quaternion rˆ as a function of quaternions pˆ and qˆ. Here again, eq. (13.25) is expressed by means of 4 × 4 operators to ease the algebraic manipulations as D(ˆ r ) = D(ˆ p)D(ˆ q ). With the help of identity (13.16), this expands to A(ˆ r )B T (ˆ r ) = A(ˆ p)B T (ˆ p)A(ˆ q )B T (ˆ q ) = A(ˆ p)A(ˆ q )B T (ˆ p)B T (ˆ q ), where identity (13.11a) was used. Identities (13.13) and (13.14) then imply rˆ = A(ˆ p)ˆ q.
(13.26)
p)A(ˆ p)ˆ q= It is readily shown that rˆ is also a unit quaternion; indeed rˆT rˆ = qˆT AT (ˆ qˆT qˆ = 1, since pˆ and qˆ both are unit quaternions. 13.3.4 Determination of Euler parameters Equation (13.21) expresses the rotation tensor in terms of Euler parameters. In this section, the inverse operation is developed, but unfortunately, it cannot be written in a simple manner. Indeed, any such expression will involve a division by a term that can vanish for certain speciſc rotation tensors. To overcome this problem, Klumpp [253] and Shepperd [254] introduced the procedure described in this section. Consider the following symmetric matrix constructed from the components of the rotation tensor T 2 axial (R) 1 + tr(R) T = . 2 axial(R) 1 − tr(R) I + 2 symm(R) Introducing eq. (13.21) then yields ⎡ 2 e0 ⎢e0 e1 T = 4⎢ ⎣e0 e2 e0 e3
e0 e1 e21 e1 e2 e1 e3
e0 e2 e1 e2 e22 e2 e3
⎤ e0 e3 e1 e3 ⎥ ⎥ = 4 eˆeˆT . e2 e3 ⎦ e23
(13.27)
Euler parameters can be computed from √ any column of this matrix, for instance, ei = Tik /Δk , i = 0, 1, 2, 3, where Δk = 2 Tkk . This expression shows the problem associated with the desired inverse relationships: the results become inaccurate when the denominator Δk becomes very small, or vanishes.
13.3 Euler parameters
519
The most accurate results will be obtained by selecting the denominator of maximum magnitude. In other words, the best results will be obtained by extracting Euler parameters from the column of T which presents the largest diagonal term. It can be readily shown that " ! max (T00 , T11 , T22 , T33 ) = max tr(R), R11 , R22 , R33 . (13.28) If m is the index corresponding to the column with the maximum diagonal term, Euler parameters write ei = Tim /Δm ,
i = 0, 1, 2, 3.
(13.29)
The combination of eqs. (13.28) and (13.29) provides a singularity free algorithm for extracting Euler parameters from a given rotation tensor. In contrast, it is not possible to extract Euler angles from a given rotation tensor without encountering singularities, see eq. (4.13). Example 13.1. Kinetic energy of a rigid body The kinetic energy of a rigid body undergoing an arbitrary motion was developed in section 6.2. To illustrate the use of Euler parameters, the kinetic energy of a rigid body undergoing rotational motion about a ſxed inertial point will be evaluated in this example. The kinetic energy of the rigid body is given by eq. (6.16). To express this quantity in terms of Euler parameters, it is convenient to use quaternion ω ˆ ∗T =
∗T ∗ ω0 ω , where ω are the components of the angular velocity vector resolved in the body attached basis. The scalar part, ω0 = 2ˆ eT eˆ˙ = 0, because eˆ is a unit quaternion. Next, the following 4 × 4 matrix is introduced ∗ T m 0 B∗ M = , (13.30) 0 I B∗ where I B∗ is the mass moment of inertia tensor of the rigid body computed with respect to the ſxed inertial point and m∗ a representative mass moment of inertia component. Notation (·)∗ indicates tensor components resolved in a body attached basis. The kinetic energy now becomes 1 ∗T B∗ ∗ ω ˆ M ω ˆ . (13.31) 2 Because the scalar part of quaternion ω ˆ ∗ vanishes, the speciſc value of coefſcient ∗ m does not affect the value of the kinetic energy; hence, m∗ is simply deſned as a “representative mass moment of inertia component.” Quaternion ω ˆ ∗ is now readily expressed in terms of Euler parameters using eq. (13.24), to ſnd K=
K = 2 eˆ˙ T A(ˆ e)M B∗ AT (ˆ e)eˆ˙ = 2 eˆT C(eˆ˙ )M B∗ C T (eˆ˙ )ˆ e,
(13.32)
where the last equality follows from identity (13.12). Clearly, it is expeditious to express all quantities in the quaternion formalism before introducing Euler parameters.
520
13 Parameterization of rotation
Example 13.2. Lagrange’s equations of the ſrst kind for a rigid body Express the equations of motion for a rigid body rotating about a ſxed inertial point in terms of Euler parameters. This representation involves four generalized coordinates, the four Euler parameters, which are linked by the kinematic constraint expressed by eq. (13.20). Lagrange’s equations of the ſrst kind will be derived for this constrained system. 2 2 2 2 If the constraint equation is written as C = (e0 + e1 + e2 + e3 − 1)/2 = 0, the
constraint matrix becomes B = e0 e1 e2 e3 = eˆT . The Lagrangian of the system is simply L = K − V = K. The system’s generalized momenta and the derivatives of the Lagrangian with respect to the generalized coordinates are now p=
∂L e)M B∗ AT (ˆ e)eˆ˙ , = 4 A(ˆ ∂ eˆ˙
∂L = 4 C(eˆ˙ )M B∗ C T (eˆ˙ )ˆ e. ∂ˆ e
Let the rigid body be acted upon by an external moment, which components resolved in the material basis are denoted Q∗ . The virtual work done by this moment is δW nc = Q∗T δψ ∗ , where δψ ∗ are the components of the virtual rotation vector resolved in the material basis. Here again, it is convenient to express the vir
ˆ ∗ , where Q ˆ ∗T δψ ˆ ∗T = 0 Q∗T tual work in terms of quaternions as δW nc = Q
ˆ ∗T = 0 δψ ∗T . Both quaternions feature a vanishing scalar part. By analand δψ
ˆ ∗ = 2 AT (ˆ e)δˆ e and the virtual work becomes δW nc = ogy with eq. (13.24), δψ T ∗T ˆ 2 A (ˆ Q e)δˆ e. Lagrange’s equations of the ſrst kind are readily found as
1 λ ˆ∗, A(ˆ e)M B∗ AT (ˆ e)e¨ˆ + AT (eˆ˙ )eˆ˙ + A(eˆ˙ ) − C(eˆ˙ ) M B∗ AT (ˆ e)eˆ˙ + eˆ = A(ˆ e)Q 4 2 (13.33) where λ is Lagrange’s multiplier used to enforce the normality of the Euler parameters. The equations of motion for the rigid body do not involve transcendental functions, only products of the generalized coordinates, eˆ. Example 13.3. Maggi’s formulation for a rigid body Lagrange’s equation of the ſrst kind developed in the previous example are differential-algebraic equations. Use Maggi’s formulation presented in section 11.2.1 to derive ordinary equations of motion expressed in terms of Euler parameters. Equation (13.24) is recast in the following form B 0 T ˙ = 2A (ˆ e)eˆ = ˇ eˆ˙ = Beˆ˙ . ω∗ B The ſrst row of matrix B deſnes the constraint matrix, and the next three row deſne the kinematic characteristics of the problem, selected to be the components of the angular velocity vector resolved in the material basis. This linear transformation is at the heart of Maggi’s formulation, see eq. (11.4). To eliminate Lagrange’s multiplier from the formulation, Lagrange’s equations of the ſrst kind, eqs. (13.33), are multiplied by AT (ˆ e) to yield
13.3 Euler parameters
λ 1 B∗ ˙ ∗ T B∗ ∗ ˙ ˙ ˆ ∗. =Q M ω e) A(eˆ) − C(eˆ) M ω ˆ + ˆ + A (ˆ 2 0
521
(13.34)
The following identity is readily veriſed 0 rT ˙ ˙ e) A(eˆ) − C(eˆ) = 2 , A (ˆ 0 r0 I − r T
where
1 1 e)eˆ˙ = C T (ˆ e)A(ˆ e)ˆ ω∗ = rˆ = C T (ˆ 2 2
0 . −ω∗
These results indicate that expression AT (ˆ e)[A(eˆ˙ ) − C(eˆ˙ )] is closely related to the angular velocity vector. Introducing these results into Maggi’s equation, eqs. (13.34), leads to ∗ T λ 1 m 0 0 0 0 −ω∗T m∗ 0T 0 + + = ∗ . (13.35) Q 0 0 ω ∗ 0 I B∗ ω˙ ∗ 0 I B∗ ω ∗ 2 The ſrst equation yields Lagrange’s multiplier as λ = 2 ω ∗T I B∗ ω ∗ = 4K: Lagrange’s multiplier is a moment that enforces the normality condition for the Euler parameters and its magnitude equals four times the kinetic energy of the rigid body. The second equation is I B∗ ω˙ ∗ + ω ∗ I B∗ ω ∗ = Q∗ , which is the pivot equation, eq. (6.23), for the angular motion of a rigid body about a ſxed inertial point. Of course, this equation could have been used from the onset of this development, bypassing the formal derivation of Maggi’s formulation. The complete formulation of the problem consists of seven ordinary differential equations combining Maggi’s equations, eqs. (13.35), and the deſnition of the kinematic characteristics, eqs. (13.24), ∗ · B∗ −1 ∗ ω ∗ I B∗ ω ∗ ) (I ) (Q − ω , (13.36) = eˆ e)ˆ ω ∗ /2 A(ˆ where the scalar part of quaternion ω ˆ ∗ vanishes. Example 13.4. Hamilton’s principle for a rigid body In example 8.16 on page 332, the equations of motion of a rigid body were derived from Lagrange’s formulation. Because this formulation requires the explicit evaluation of the derivatives of the Lagrangian with respect to the generalized coordinates and generalized velocities, a speciſc parameterization of rotation must be speciſed at the onset of the formulation: for instance, eq. (13.32) shows the explicit expression of the kinetic energy in terms of Euler parameters and their time derivatives. Examples 8.16 and 13.2 both demonstrate the ensuing complexity of the analytical developments. Application of Hamilton’s principle to the rigid body problem leads to a compact form of the equations of motion, as was shown in example 8.6 on page 312. Derive the governing equations for the rotational motion of a rigid body in terms of Euler parameters based on Hamilton’s principle.
522
13 Parameterization of rotation
The rigid body is acted upon by an external moment and the virtual work done by this moment is δW nc = QT δψ, where δψ are the components of the virtual rotation vector resolved in the inertial basis. It is convenient to express the virtual
ˆT = ˆ where Q ˆ T δψ, ˆ = 0 Q and δψ work in terms of quaternions as δW nc = Q
δψ0 δψ T ; δψ0 is the scalar part of the virtual rotation quaternion. Similarly, the kinetic energy of the system can be expressed in terms of the angular velocity quaternion, see eq. (13.31), and variation of this quantity becomes ˆ ∗ , where the vector part of quaternion h ˆ ∗T = δK = δ ω ˆ ∗T M B∗ ω ˆ ∗ = δω ˆ ∗T h
∗T stores the components of the angular momentum vector resolved in the ma0, h terial basis. Because this quaternion has a vanishing scalar part, eq. (4.102b) yields ˙ T R h∗ = δψ ˙ T h, where array h stores the components of the δK = δω ∗T h∗ = δψ ˆ˙ T h, ˆ where angular momentum vector resolved in the inertial basis. δK = δψ
Finally, ˆ the angular momentum quaternion is deſned as h = 0, h . The normality constraint to be imposed on Euler parameters is expressed as C = eˆT eˆ − 1 = 0, and the potential of the constraint, eq. (10.6), becomes V c = −λC, where λ is Lagrange’s multiplier used to enforce the constraint. Variation of this potential is δV c = −δλ C − 2λδˆ eT eˆ. By analogy with eq. (13.23), the relationship between the virtual rotation quaternion and virtual changes in Euler parameters is ˆ = 2B T (ˆ δψ e)δˆ e. Variation in the constraint potential becomes δV c = −δλ C −
T ˆ λˆ1, where ˆ1 = B T (ˆ δψ e)ˆ e = 1, 0T is a quaternion with a unit scalar part and a vanishing vector part. For this problem, L+ = K −V c and Hamilton’s principle for constrained system, eq. (10.10), now implies % tf * + ˆ˙ h ˆ + δλ C + δψ ˆ T λˆ1 + δψ ˆ TQ ˆ dt = 0. δψ ti
All boundary terms are ignored here. After integration by parts of the ſrst term, the equations of motion of the system are found to be C = 0, the constraint equation, ˆ˙ − λ1ˆ = Q. ˆ Lagrange’s multiplier is readily eliminated from the last equation and h because quaternion ˆ1 has a vanishing vector part, leading to h˙ = Q. The last step of the procedure is to evaluate the time derivatives of Euler parameters with respect to time. First, eq. (13.24) yields eˆ˙ = A(ˆ e)ˆ ω ∗ /2, and ſnally, ˆ eˆ˙ = A(ˆ e)(M B∗ )−1 DT (ˆ e)h/2, which leads to the following system of ordinary differential equations, , · Q h "−1 T ! , (13.37) = ˆ eˆ A(ˆ e) M B∗ D (ˆ e)h/2 ˆ vanishes. where the scalar part of quaternion h It is left to the reader to show that eqs. (13.37) are identical eqs. (13.36) obtained from Lagrange’s formulation from which Lagrange’s multipliers have been eliminated using Maggi’s formulation. The present procedure, based on Hamilton’s principle, is far more expeditious.
13.3 Euler parameters
523
13.3.5 Problems Problem 13.1. Angular velocity with Euler parameters Starting from eq. (4.55), prove that the components of the angular velocity vector expressed in the rotating system are given by eq. (13.24).
Problem 13.2. Euler parameters in terms of Euler angles Consider the Euler angles with the 3-1-3 sequence described in section 4.11.1. Show that the Euler parameters deſning this rotation are given in terms of Euler angles as ⎫ ⎧ ⎫ ⎧ cos(φ + ψ)/2 cos θ/2⎪ e0 ⎪ ⎪ ⎪ ⎪ ⎪ ⎬ ⎪ ⎨ ⎬ ⎨ ⎪ cos(φ − ψ)/2 sin θ/2 e1 = . sin(φ − ψ)/2 sin θ/2 ⎪ e2 ⎪ ⎪ ⎪ ⎪ ⎪ ⎭ ⎪ ⎩ ⎭ ⎩ ⎪ sin(φ + ψ)/2 cos θ/2 e3 Hint. Express the rotation as a succession of three planar rotations, see example 4.4. Find the Euler parameters of each planar rotation. Use the composition of rotation formula, eq. (13.26), to ſnd the desired result.
Problem 13.3. Time dependent quaternions Consider the following time dependent quaternion, ⎧ ⎫ cos φ/2 ⎪ ⎪ ⎪ ⎪ ⎨ ⎬ sin φ/2 sin θ cos ψ eˆ = , sin φ/2 sin θ sin ψ ⎪ ⎪ ⎪ ⎪ ⎩ ⎭ sin φ/2 cos θ where φ(t) = 3t + 5t2 , θ(t) = 2t and ψ(t) = 7t − 3t3 . (1) Show that eˆ is a unit quaternion. (2) Compute the quaternion eˆ˙ (t) and its norm. Is it a unit quaternion? (3) Plot the components of angular velocity vector in the ſxed system for t ∈ [0, 2] s. (4) Plot the components of angular velocity vector in the rotating system for t ∈ [0, 2] s. (5) Consider the following approximation for the components of the angular velocity vector in the ſxed system ω (t + Δt/2) ≈
R(t + Δt) − R(t) RT (t + Δt) + RT (t) . Δt 2
Prove that this can be written as ω (t + Δt/2) ≈
[R(t + Δt)RT (t)] − [R(t + Δt)RT (t)]T . 2Δt
(6) On one graph, plot the exact and approximate components of angular velocity vector in the ſxed system for t ∈ [0, 2] s. (7) Find the corresponding approximation for the components of the angular velocity vector in the rotating system. (8) On one graph, plot the exact and approximate components of angular velocity vector in the rotating system for t ∈ [0, 2] s.
Problem 13.4. Composition of rotations with quaternions Consider components of three rotation tensors R1 , R2 , and R = R1 R2 , all resolved in a single basis B. The components of R1 and R2 are ⎡
⎤ 0.5996 0.7336 0.3199 R1 = ⎣−0.4732 0.6473 −0.5976⎦ , −0.6455 0.2069 0.7352
⎡
⎤ −0.0282 0.2133 0.9766 R2 = ⎣−0.4423 0.8735 −0.2035⎦ . −0.8964 −0.4377 0.0697
524
13 Parameterization of rotation
(1) Extract Euler parameters eˆ1 and eˆ2 corresponding to rotation tensors R1 and R2 , respectively, using the procedure described in section 13.3.4. (2) Compute Euler parameters eˆ of R using the composition formula, eq. (13.26). (3) Compute the components of the rotation tensor R = R1 R2 . Extract Euler parameters eˆ corresponding to rotation tensor R. Check that your answer is identical to that of question (2).
Problem 13.5. Satellite dynamics with quaternions
Consider a satellite with a body attached frame B = (¯b1 , ¯b2 , ¯b3 ) that is aligned with the principal axes of inertia of the system. Let R denote the components of the rotation tensor that brings the inertial frame I to the body attached frame B. The components of the angular velocity vector of the satellite, resolved in B, are denoted ω ∗ . The mass moments of inertia are I1∗ = 12, I2∗ = 16 and I3∗ = 20 kg·m2 . During a maneuver, thrusters apply a moment M (t) to the satellite M ∗ (t) = Q∗ sin 2πt/T for t ≤ T and M ∗ (t) = 0 for t > T , where T = 5 s. The initial angular velocity of the satellite is ω∗T (t = 0) = 0, 0.5, 0 rad/s. The components of the moment vector Q∗ in the body attached frame are Q∗T = 5, 0, 0 N·m. (1) Solve Euler’s equation for the time history of the angular velocity of the satellite. (2) Simultaneously solve for the Euler parameters eˆ parameterizing R. (3) On one graph, plot the three components of the angular velocity vector in the body attached frame as a function of time for t ∈ [0, 30T ]. (4) Plot the Euler parameters e1 , e2 , and e3 . (5) The rotation R(t) can be represented by a rotation of magnitude φ(t) about axis n ¯ (t). Plot the angle φ(t). (6) Plot the components of axis n ¯ (t) in frame I. (7) Plot the components of axis n ¯ (t) in frame B. Comment on your results. (8) Plot the direction cosines of axis ¯b1 with respect to basis I. (9) Same question for axis ¯b2 . (10) Same question for axis ¯b3 . (11) Plot the quantity eˆT eˆ − 1. Comment on your results.
13.4 The vectorial parameterization of rotation The vectorial parameterization of rotation [255] consists of a minimum set of three parameters deſning the components of a rotation parameter vector. The tensorial nature of this class of parameterization of rotation sets it apart from the parameterizations investigated in previous sections. 13.4.1 Fundamental properties Consider three rotations of magnitudes φ1 , φ2 , and φ3 , about unit vectors n ¯1, n ¯2, and n ¯ 3 , respectively. The three rotations, denoted (φ1 , n ¯ 1 ), (φ2 , n ¯ 2 ), and (φ3 , n ¯ 3 ), respectively, are associated with three rotation tensors, denoted R1 , R2 , and R3 , respectively, through Rodrigues’ rotation formula, eq. (4.15). Assume that the following triple product of rotation tensors relates these three quantities, R3 = RT2 R1 R2 . (13.38) As discussed in section 4.8.2, this operation corresponds to a change of basis for second-order tensors: R1 and R3 are the components of the same rotation tensor expressed in two bases related by rotation tensor R2 . Using Rodrigues’ rotation formula, eq. (4.15), eq. (13.38) becomes
13.4 The vectorial parameterization of rotation
525
T T T R3 = I + sin φ1 R n ¯ + (1 − cos φ1 )R n ¯ R n ¯ , 2 1 2 1 2 1 where eq. (4.30) was used. Comparing this result with Rodrigues’ rotation formula implies that φ3 = φ1 , n ¯3 =
RT2 n ¯1.
(13.39a) (13.39b)
These equations express the two conditions required for the proper transformation of rotation tensors components under a change of basis. Let p(φ) be an odd scalar function of angle φ; eq. (13.39a) then implies p(φ3 ) = p(φ1 ). Multiplication of eq. (13.39b) by p(φ3 ) on the left-hand side and p(φ1 ) = p(φ3 ) on the right-hand side then yields n3 = RT2 p(φ1 )¯ n1 . p(φ3 )¯
(13.40)
This equation is equivalent to eqs. (13.39). Indeed, taking the norm of eq. (13.40) yields p(φ3 ) = p(φ1 ), or φ3 = φ1 , because n ¯ 1 and n ¯ 1 are unit vectors and R2 an orthogonal tensor. Dividing eq. (13.40) by p(φ3 ) then yields eq. (13.39b) because p(φ3 ) = p(φ1 ). The vectorial parameterization of rotation is deſned as p = p(φ)¯ n,
(13.41)
where p is the rotation parameter vector. Equation (13.40) can now be recast in a more compact manner as p3 = RT (p2 ) p1 . (13.42) The discussion presented above establishes that the tensorial nature of the rotation tensor expressed by the transformation rule of its components, eq. (13.38), implies the tensorial nature of the rotation parameter vector expressed by the transformation rule of its components, eq. (13.42). It is easily shown that the process can be reversed, i.e., tensorial nature of the rotation parameter vector implies that of the rotation tensor. In summary, the vectorial parameterization of rotation presents two fundamental properties. 1. The vectorial parameterization of rotation is tensorial in nature, as expressed by the following equivalence, R(p3 ) = RT (p2 )R(p1 )R(p2 ) ⇐⇒ p3 = RT (p2 )p1 .
(13.43)
The tensorial nature of the second-order rotation tensor implies and is implied by the tensorial nature of the rotation parameter vector, a ſrst-order tensor. 2. Rotation parameter vectors are parallel to the eigenvector of the rotation tensor corresponding to its unit eigenvalue. Because unit vector n ¯ is the eigenvector of the rotation tensor associated with its unit eigenvalue, eq. (4.24), the deſnition of the rotation parameter vector, eq. (13.41), implies its parallelism to n ¯.
526
13 Parameterization of rotation
Because these two properties imply each other, either can be taken as the deſnition of the vectorial parameterization of rotation. A parameterization of rotation is tensorial if and only if the rotation parameter vector is parallel to the eigenvector of the rotation tensor associated with its unit eigenvalue. The rotation parameter vector is not yet fully deſned because function p(φ), called the generating function, is still arbitrary. Generating functions must be odd functions of the rotation angle, φ, and present the following limit behavior (13.44)
lim p(φ) = φ,
φ→0
i.e., all rotation parameter vectors must approach the inſnitesimal rotation vector when φ → 0. It will be shown that many widely used rotation parameterization belong to this class of vectorial parameterization. 13.4.2 The rotation tensor The explicit expression of the rotation tensor in term of the vectorial parameterization is easily obtained from Rodrigues’ rotation formula, eq. (4.15), R = I + ζ1 (φ) p + ζ2 (φ) pp,
(13.45)
where ζ1 (φ) and ζ2 (φ) are even functions of the rotation angle, φ, deſned as φ ν2 sin φ = ν cos = , p 2 ε εζ1 1 − cos φ ν2 ζ2 (φ) = = . = p2 2 2
ζ1 (φ) =
(13.46a) (13.46b)
The following two even functions of the rotation angle play an important role in the vectorial parameterization of rotation, 2 sin φ/2 , p 2 tan φ/2 ν ε= = . p cos φ/2
ν=
(13.47a) (13.47b)
In view of eq. (13.44), limφ→0 ν = 1, limφ→0 ε = 1, limφ→0 ζ1 = 1, limφ→0 ζ2 = 1/2, and limφ→0 R = I, as expected. Functions p(φ), ζ1 (φ), and ζ2 (φ) solely depend on the magnitude, φ, of the rotation. Because this angle is invariant under a change of basis, these functions are also invariant under a change of basis, and hence, are zeroth order tensors. Since p is a second-order tensor, see eq. (4.30), eq. (13.45) proves that the rotation tensor is a second-order tensor because it is obtained through tensor operations from zeroth and second-order tensors. This proof of the tensorial nature of the rotation tensor mirrors that provided in section 13.4.1.
13.4 The vectorial parameterization of rotation
527
The components of the rotation tensor resolved in the canonical basis, E, deſned by eq. (4.32), are ⎤ ⎡ ⎤ ⎡ 1 0 0 1 0 0 R[E] = ⎣0 1 − p2 ζ2 −pζ1 ⎦ = ⎣0 cos φ − sin φ ⎦ . (13.48) 0 sin φ cos φ 0 pζ1 1 − p2 ζ2 Clearly, functions ζ1 and ζ2 are not independent because (1 − p2 ζ2 )2 + (pζ1 )2 = 1 and hence, ζ12 + p2 ζ22 = ν 2 . The two multiplicative decompositions of the rotation tensor, eqs. (4.20) and (4.22), are easily expressed in terms of the vectorial parameterization as G=I+
1 − cos φ/2 ν p + pp, 2 p2
(13.49)
leading to R = G G, and ε ε ε ε R = (I + p)(I − p)−1 = (I − p)−1 (I + p). 2 2 2 2
(13.50)
R+I ε . (I − p)−1 = 2 2
(13.51)
Here again,
This decomposition implies ε ε R − I = ζ1 p(I + p) = (I + p)ζ1 p, 2 2 ε ε R − I = ε p(I − p)−1 = (I − p)−1 ε p. 2 2
(13.52a) (13.52b)
Tensors R and G are related by the following identities R+I R+I ε p = ε p = νG p = pνG, 2 2 T T G+G G+G ε p( )=( ) ε p = ν p = G − GT , 2 2 2−1 1 G + GT p T p T . (I + ε ) G = (I + ε )G = 2 2 2 R−I =
(13.53a) (13.53b) (13.53c)
13.4.3 The angular velocity vector ˙ n +pn ¯˙ , where Taking a time derivative of the rotation parameter vector yields p˙ = p φ¯ ˙ n − p, p = dp/dφ. Use of identity (1.33b) leads to n n p˙ = p nn n ¯˙ = −pn ¯˙ = p φ¯ ˙ T ˙ because n ¯ is a unit vector and hence, n ¯ n ¯ = 0. Introducing these results into the expression for the angular velocity, eq. (4.58), then leads to ˙ ω = H(p)p.
(13.54)
528
13 Parameterization of rotation
Operator H(p) is given by H(p) = σ0 (φ) I + σ1 (φ) p + σ2 (φ) pp,
(13.55)
where σ0 (φ), σ1 (φ), and σ2 (φ) are even functions of the rotation angle, φ, deſned as 1 , p 1 − cos φ = ζ2 , σ1 (φ) = p2 σ0 − ζ1 . σ2 (φ) = p2
σ0 (φ) =
(13.56a) (13.56b) (13.56c)
These three functions are zeroth order tensors because they are functions of angle φ, which is invariant under a change of basis. Using eq. (13.44), limφ→0 σ0 = 1, limφ→0 σ1 = 1/2, limφ→0 σ2 = 0 and limφ→0 H = I. Since H n ¯ = σ0 n ¯ , σ0 is the eigenvalue of H corresponding to the eigenvector n ¯. Properties of the tangent tensor The components of the tangent tensor, resolved in the canonical basis, E, deſned by eq. (4.32), are ⎤ ⎡ ⎤ ⎡ 1/p 0 0 0 0 1/(νp ) cos φ/2 − sin φ/2 ⎦ . (13.57) H [E] = ⎣ 0 ζ1 −pσ1 ⎦ = ν ⎣ 0 0 sin φ/2 cos φ/2 0 pσ1 ζ1 The eigenvalues, μk , of H are μ1 = σ0 and μ2,3 = ζ1 ± ipζ2 = ν(cos φ/2 ± i sin φ/2). The determinant of H is readily obtained as det(H) =
ν2 . p
(13.58)
Time derivatives of the rotation parameter vector can be expressed in terms of angular velocity vector as p˙ = H −1 (p)ω, (13.59) where
1 H −1 (p) = χ0 I − p + χ2 pp, (13.60) 2 where χ0 (φ) and χ2 (φ) are even functions of the rotation angle, φ, deſned as χ0 (φ) = p , 1 χ2 (φ) = 2 p
1 . p − ε
(13.61a) (13.61b)
13.4 The vectorial parameterization of rotation
529
These two functions are zeroth order tensors because they are functions of angle φ, which is invariant under a change of basis. The components of tensor H −1 resolved in the canonical base, E, deſned by eq. (4.32) are ⎤ ⎡ 0 0 νχ0 1 cos φ/2 sin φ/2 ⎦ . (13.62) H −1[E] = ⎣ 0 ν 0 − sin φ/2 cos φ/2 As expected, the eigenvalues, μ ¯k , of H −1 are μ ¯ 1 = χ0 and μ ¯ 2,3 = (cos φ/2 ± i sin φ/2)/ν. Operator H enjoys the following remarkable properties, R = H H −T = H −T H, R − I = pH = H p, I − R = ν pH T
2
p = H
−T
−1
(13.63b) 2
=ν H
−H
−1
(13.63a)
−1
p,
.
(13.63c) (13.63d)
Finally, the components of the angular velocity vector resolved in the rotating system are given by eq. (4.55) as ω∗ = RT ω. In view of eq. (13.63a), ω∗ = H T (p)p˙ and the inverse relationship is p˙ = H −T (p)ω ∗ . As discussed in section 7.3, the virtual rotation vector, δψ, can be deſned by = δR RT . Hence, the relationship beanalogy to the angular velocity vector as δψ tween the virtual rotation vector and virtual changes in the vectorial parameters is readily found to be δψ = H(p)δp and similarly, δψ ∗ = H T (p)δp. Tensorial nature of the tangent operator Operator H is speciſc to a particular vectorial parameterization, i.e., its expression depends on the choice of the generating function. It is, however, a second-order tensor. In equation (13.55), p is a second-order tensor, and scalars σ0 , σ1 , and σ2 are zeroth order tensors. Consequently, H must be a second-order tensor because it is obtained through tensor operations from zeroth and second-order tensors, H(p3 ) = RT (p2 )H(p1 )R(p2 ) ⇐⇒ p3 = RT (p2 )p1 .
(13.64)
Although tensor H is not an intrinsic tensor because it depends on the choice of a speciſc generating function, it is a second-order tensor for all vectorial parameterizations of rotation. 13.4.4 Determination of the rotation parameter vector The determination of the components of the vectorial parameterization from the rotation tensor is best accomplished through a two step procedure: ſrst, extract Euler
530
13 Parameterization of rotation
parameters from the rotation tensor using eq. (13.29), and second, express the vectorial parameterization in terms of Euler parameters. This second operation simply states that 2 p = e. (13.65) ν 13.4.5 Composition of rotations The concept of composition of rotations was discussed in section 4.9 and is depicted in ſg. 4.7. Let p1 , p2 , and p with rotation angles φ1 , φ2 , and φ, respectively, be the rotation parameter vectors of three rotation tensors such that R(p) = R(p1 )R(p2 ), the relationship between the various parameters then follows from eq. (13.26) φ 1 1 T cos = ν1 ν2 (13.66a) − p1 p2 , 2 ε1 ε2 4 1 1 1 (13.66b) p1 + p2 + p1 p2 . νp = ν1 ν2 ε2 ε1 2 The ſrst equation is used to compute φ and hence, ν. The second equation then yields the components of the rotation parameter vector. 13.4.6 Linearization of the tangent tensor In many numerical procedures, linearization of the tangent tensor will be required. For example, an increment in the angular velocity vector deſned by eq. (13.54) is Δω = Δ[H(p)p] ˙ = H(p)Δp˙ + M (p, p)Δp. ˙ More generally, operator M is deſned as M (p, a) = ∂(H(p)a)/∂p, where a is an arbitrary vector. Tedious algebra shows that operator M is σ0 + σ ˆ1 p + σ ˆ2 pp)a pT − σ1 a − σ2 (2 p a− ap), M (p, a) = (ˆ
(13.67)
where σ ˆ0 = σ0 /(pp ), σ ˆ1 = σ1 /(pp ), and σ ˆ2 = σ2 /(pp ). Notation (·) indicates a derivative with respect to angle φ, and coefſcients σ0 , σ1 , and σ2 are given by eq. (13.56). 13.4.7 Problems Problem 13.6. Angular velocity with vectorial parameterization Prove relationships (13.55) and (13.60).
Problem 13.7. Properties of the vectorial parameterization Prove relationships (13.63a) and (13.63b).
Problem 13.8. Angular velocity with vectorial parameterization
Prove that ω ∗ = H T (p)p˙ starting from the deſnition of the angular velocity vector expressed ˙ in the rotating system ω ∗ = RT R.
13.5 Speciſc choices of generating function
531
Problem 13.9. Tensorial nature of the tangent tensor Prove that the tangent tensor is a tensor starting directly from its deſnition, eq. (13.55).
Problem 13.10. Composition of collinear rotations Let p1 , p2 , and p be the rotation parameter vectors of three rotations with rotation angles φ1 , φ2 , and φ, respectively, such that R(p) = R(p1 )R(p2 ). If p1 and p2 are two parallel vectors, prove that p is also parallel to them and that φ = φ1 + φ2 .
Problem 13.11. Relationship between tensors G and H Prove the following relationship nn ¯ T = σ0 I + (σ0 − ν) nn . GT H = νI + (σ0 − ν)¯
(13.68)
Problem 13.12. Linearization of the tangent tensor Equation (13.67) deſnes operator M involved in the linearization of the tangent operator. (1) Determine operator M ∗ (p, a) = ∂(H T (p)a)/∂p.
13.5 Speciſc choices of generating function The formulation presented in the previous section is very general, but, in practice, a speciſc choice of the generating function, p(φ), must be selected. It seems natural to select a generating function that will simplify some of the operators involved in rotation manipulations. Speciſc parameterizations The simplest choice is to select the generating function as the rotation angle itself p(φ) = φ.
(13.69)
This parameterization is called the Cartesian rotation vector, or the exponential map of rotation; details concerning this parameterization are given in section 13.7.1. A second approach is to simplify the expression for the rotation tensor, R, given by eq. (13.45), by requiring ζ1 (φ) = 1. This yields p(φ) = sin φ.
(13.70)
This choice is called the linear parameterization. Note that requiring ζ2 (φ) = 1 √ yields p(φ) = 2 sin φ/2. Although this is a valid parameterization, it does not satisfy the limit condition, eq. (13.44). An alternative approach is to require the last term of tensor H, eq. (13.55), to vanish, i.e., σ2 (φ) = 0. This leads to the nonlinear differential equation p sin φ = p, the solution of which is p(φ) = c tan(φ/2), where c is an integration constant. The limit condition, eq. (13.44), implies c = 2, and hence, φ p(φ) = 2 tan . 2
(13.71)
532
13 Parameterization of rotation
This parameterization is variously called after Cayley, Gibbs, Rodrigues, or some combinations of these names. It shall be referred to here as the Cayley-GibbsRodrigues parameterization. Details concerning this parameterization are given in section 13.7.3. Another approach is to require the last term of tensor H −1 , eq. (13.60), to vanish, i.e., p − 1/ε = 0. This leads to the nonlinear differential equation 2p tan φ/2 = p, the solution of which is p(φ) = c sin φ/2, where c is an integration constant. Here again, the limit condition yields the solution as p(φ) = 2 sin
φ . 2
(13.72)
This parameterization is usually termed the reduced Euler-Rodrigues parameterization. It is closely connected to the parameterization technique employing unit quaternions: the parameter vector, p, coincides with the vector part of the unit quaternion of the rotation. It shall be referred to here as the Euler-Rodrigues parameterization. Details concerning this parameterization are given in section 13.7.2. In yet another approach, operators H and G are required to be multiples of each other, i.e., H = α(φ)G. This implies two conditions: p = 1/α(φ) and pα(φ) = 2 sin φ/2. Hence, the following differential equation must hold: 2p sin φ/2 = p. With the help of the limit condition, the solution becomes p(φ) = 4 tan
φ . 4
(13.73)
This parameterization also bears various names in the literature: Wiener [256], Milenkovi´c [257], or modiſed Rodrigues parameterization [243, 258]. It is also known as the conformal rotation vector (CRV) parameterization. It shall be referred to here as the Wiener-Milenkovi´c parameterization. Details concerning this parameterization are given in section 13.7.4. To avoid the appearance of singularities when manipulating tensor H, eq. (13.55), it might be desirable to work with a parameterization for which det(H) = c, where c is a constant. From eq. (13.58), this requirement implies cp = ν 2 . The solution of this nonlinear differential equation leads to (13.74) p(φ) = 3 6(φ − sin φ). Constant c was evaluated with the help of the limit condition and found to be c = 1. Hence, this particular parameterization is such that det(H) = 1 for all values of φ. Clearly, the complex expression for this parameterization makes it quite unpractical to use. The sine and tangent families This discussion indicates that two subclasses of vectorial parameterization enjoy interesting properties
13.6 The extended vectorial parameterization
p(φ) = m sin
φ , m
and p(φ) = m tan
φ . m
533
(13.75)
To ease the manipulation of the trigonometric functions, m is typically selected to be an integer, but real values of m are equally valid. As m increases, p(φ) → φ for |φ| < π. This feature is illustrated in ſgs. 13.1 and 13.2 for the sine and tangent families, respectively. Note the convergence by lower and upper bound for the sine and tangent families, respectively. 4
3
3.5 2.5
m tan f / m
m sin f / m
3 2
1.5
2.5 2 1.5
1 1 0.5 0.5 0
0
0.5
1
1.5
f
2
2.5
3
Fig. 13.1. Generating function versus angle φ for the sine family. m = 1 (+), 2 (), 3 ( ), 4 (◦), 5 (). The dotted line represents the generating function for the rotation vector, p(φ) = φ.
0 0
0.5
1
1.5
f
2
2.5
3
Fig. 13.2. Generating function versus angle φ for the tangent family. m = 1 (+), 2 (), 3 ( ), 4 (◦), 5 (). The dotted line represents the generating function for the rotation vector, p(φ) = φ.
13.6 The extended vectorial parameterization The vectorial parameterization presented in the previous section exhibits desirable features, but also suffers serious drawbacks. In particular, for all generating function choices, singularities will occur for speciſc values of the rotation angle, as proved by Stuelpnagel [246]. 13.6.1 Singularities of the vectorial parameterization More speciſcally, singularities can ſrst occur in the deſnition of the generating function when p → ∞. For instance, the Cayley-Gibbs-Rodrigues parameterization is singular when φ = ±π. Because the representation of all arbitrary rotations requires a well deſned parameterization for all |φ| ≤ π, the Cayley-Gibbs-Rodrigues parameterization can not be used when dealing with rotations of arbitrary magnitude. Next, problems can occur when determining the component of the vectorial parameterization from the rotation tensor. In view of eq. (13.65), singularities are encountered when ν → 0 or ∞. Linear parameters, for instance, experience such singularity when ν → ∞, i.e., when φ = ±π.
534
13 Parameterization of rotation
Singularities also arise from the manipulation of the tangent tensor and of its inverse. Inspection of eqs. (13.55), (13.58), and (13.60), reveals that singularities will appear when p → 0 or ∞ and ν → 0 or ∞. In summary, singularities will appear when p → ∞, ν → 0 or ∞, and p → 0 or ∞. Table 13.1 lists the range of validity of various parameterizations. Figures 13.3 and 13.4 show the relevant function, ν(φ), p (φ), and det(H) for the sine and tangent families, respectively. Clearly, the parameterizations with larger values of m have an extended range of validity, although for m = 4 the range settles to |φ| < 2π for both the sine and tangent families, and does not increase with further increases in m. Table 13.1. Various choices of the generating function. Name
p(φ)
p
Cartesian rota-
φ
1
ν (sin
tion vector Cayley-Gibbs- 2 tan Rodrigues Wiener-
4 tan
Milenkovi´c Linear
φ φ )/( ) 2 2
φ φ 1/ cos2 2 2
cos
φ 2
φ φ 1/ cos2 4 4
cos2
φ 4
sin φ
cos φ
1/(cos
Parameters Euler
2 sin
φ 2
cos
φ 2
1
4 sin
φ 4
cos
φ 4
cos
Rodrigues
Validity range
ε
φ ) 2
φ 4
(tan
φ φ )/( ) 2 2
|φ| < 2π |φ| < π
1 1/(1 − tan2
φ ) |φ| < 2π 4
1/(cos2
φ ) 2
|φ| < π
1/(cos
φ ) 2
|φ| < π
(cos
φ φ )/(cos ) |φ| < 2π 4 2
All parameterizations with a validity range of |φ| > π are able to handle all rotations. Such parameterizations, however, are not necessarily “worry free.” Indeed, rotation are often used in incremental procedures where a small incremental rotation is added to a rotation at each time step, for instance. In this case, rotation angles of arbitrary magnitude are routinely encountered; consider, for instance, a rotating shaft, or a satellite tumbling in space. In such cases, singularities will always appear as φ increases to large values. For the sine and tangent families, problems will be encountered when |φ| reaches 2π, for m ≥ 4. 13.6.2 The rescaling operation The range of validity of the sine and tangent parameterizations for m = 4 can be extended by using a rescaling operation. This operation is based on the observation that rotations of magnitudes φ and φ† = φ ± 2π about the same axis n ¯ correspond to the same ſnal conſguration.
13.6 The extended vectorial parameterization 1
n
n
1
0.5
0.5
0
0
1
p (f)
p (f)
1 0.5
0.5 0
1
det (H)
det (H)
0
0.5 0
535
0
1
2
3
f
4
5
6
Fig. 13.3. Functions ν (top ſgure), p (middle ſgure), and det(H) (bottom ſgure), versus φ for the sine family. m = 1 (+), 2 (), 3 ( ), 4 (◦), 5 (). The dotted line gives the corresponding quantities for the rotation vector, p(φ) = φ.
1 0.5 0 0
1
2
3
f
4
5
6
Fig. 13.4. Functions ν (top ſgure), p (middle ſgure), and det(H) (bottom ſgure), versus φ for the tangent family. m = 1 (+), 2 (), 3 ( ), 4 (◦), 5 (). The dotted line gives the corresponding quantities for the rotation vector, p(φ) = φ.
Rescaling the Wiener-Milenkovi´c parameterization The Wiener-Milenkovi´c parameterization characterized by the generating function p = 4¯ n tan φ/4 is considered ſrst. The norm of the rotation parameter vector, p = p, is such that p ≤ 4 when |φ| ≤ π. Let p and p† be associated with the rotations φ and φ† , respectively. The relationship between these two sets of parameters is p φ π 1 φ† † p = 4¯ = 4¯ n tan ± = −4¯ n =− , (13.76) n tan 4 4 2 tan φ/4 tan2 φ/4 which implies p† = −
ν p. 1−ν
(13.77)
Taking the norm of eq. (13.76) yields p† = p/ tan2 φ/4, and hence, pp† = p2 /(tan2 φ/4), or pp† = 16. If π < |φ| < 2π, p > 4, and hence p† < 4; in other words, the rescaling operation decreases the norm of the rotation parameter vector. Let p1 , p2 , and p be the parameters of three rotation tensors such that R(p) = R(p1 )R(p2 ). Equation (13.66b) then provides the following rotation composition formula ν 1 ν2 1 1 1 p= (13.78) p1 + p2 + p1 p2 , ν ε2 ε1 2 where, in view of eq. (13.66a), 2ν − 1 = cos φ/2 = ν1 ν2 (1/ε1 ε2 − pT1 p2 /4). As incremental rotations are added to the initial orientation, p increases and when |φ| becomes larger than π, p > 4 and a rescaling operation, eq. (13.77), becomes necessary. The two operations, composition and rescaling, are conveniently combined into a single operation
536
13 Parameterization of rotation
⎧ ⎪ ⎪ ⎨
ν1 ν2 1 1 1 1 p + p + p1 p2 , ν ≥ , ν ε2 1 ε1 2 2 2 p= ν 1 ν 1 1 1 ⎪ 1 2 ⎪ p + p + p1 p2 , ν ≤ . ⎩− 1 − ν ε2 1 ε1 2 2 2
(13.79)
Rescaling the sine parameterization Similar developments hold for the sine parameterization characterized by the generating function p = 4¯ n sin φ/4. The norm of the rotation parameter vector, p = p, is such that p2 ≤ 8 when |φ| ≤ π. The rescaling operation now becomes ν p† = √ p. 1 − ν2
(13.80)
and implies p2 + p†2 = 16. Here again, the rescaling operation decreases the norm of the rotation parameter vector. Using the same approach as for the Wiener-Milenkovi´c parameterization, the update and rescaling operations are conveniently combined into a single operation ⎧ ν1 ν2 1 1 1 1 ⎪ ⎪ p + p + p1 p2 , ν ≥ √ , ⎨ ν ε2 1 ε1 2 2 2 p= (13.81) ν1 ν2 1 1 1 1 ⎪ ⎪ √ √ , p1 + p2 + p1 p2 , ν ≤ ⎩ ε1 2 2 1 − ν 2 ε2 where, in view of eq. (13.66a), 2ν 2 − 1 = cos φ/2 = ν1 ν2 (1/ε1 ε2 − pT1 p2 /4). The two parameterizations, p = 4¯ n sin φ/4 and p = 4¯ n tan φ/4, are now able to handle rotations of truly arbitrary magnitude provided that any update operation is combined with a possible rescale, as indicated in eqs. (13.79) and (13.81), respectively. Compact notation It is often convenient to indicate the composition of rotations combined with an optional rescaling by the following notation R(p) = R(p1 )R(p2 ) ⇔ p = p1 ⊕ p2 ,
(13.82)
which implies that p is computed with the help of eq. (13.79) or (13.81) for the Wiener-Milenkovi´c or sine parameterization, respectively. Composition operations such as R(p) = RT (p1 )R(p2 ) are also commonly encountered. In view of eq. (13.45), RT (p1 ) = R(−p1 ) and hence, the following notion is used R(p) = RT (p1 )R(p2 ) ⇔ p = p− ⊕ p2 , (13.83) 1 where notation p− indicates that the sign of the rotation parameter vector should be 1 changed before using eqs. (13.79) or (13.81).
13.7 Speciſc parameterizations of rotation
537
Note the simplicity of eqs. (13.79) or (13.81) as compared to the direct application of the composition equation. Indeed, given p1 and p2 , equation R(p) = R(p1 )R(p2 ) requires a four step procedure for the evaluation of p: (1) evaluate R(p1 ) using eq. (13.45), (2) evaluate R(p2 ) using the same equation, (3) evaluate the matrix product R(p) = R(p1 )R(p2 ), (4) extract parameters p from R(p) using the procedure described in section 13.3.4 and eq (13.65).
13.7 Speciſc parameterizations of rotation In this section, several parameterizations of rotation will be discussed that correspond to speciſc choices of the generating function. The Cartesian rotation vector, the Euler-Rodrigues parameters, Cayley-Gibbs-Rodrigues parameters, and the WienerMilenkovi´c parameters are discussed in the sections below. 13.7.1 The Cartesian rotation vector The rotation vector is associated with the generating function p(φ) = φ, i.e., r=φn ¯.
(13.84)
Important quantities associated with the rotation vector are p = 1, ν = (sin φ/2)/(φ/2), and ε = (tan φ/2)/(φ/2). Equation (13.45) yields the rotation tensor, using the following parameters: ζ1 = (sin φ)/φ and ζ2 = (1 − cos φ)/φ2 . Tensor H then follows from eq. (13.55) with σ0 = 1, σ1 = ζ2 , and σ2 = (1−ζ1 )/φ2 . Finally, tensor H −1 is obtained from eq. (13.60) with χ0 = 1, χ1 = −1/2, and χ2 = (1 − 1/ε)/φ2 . An interesting expression can be found by expanding the trigonometric functions in inſnite series and using identity (1.34b) 1 1 rr + rrr + · · · = exp( r) (13.85) 2! 3! This form of the rotation tensor is called the exponential map of rotation. This expansion of the rotation tensor provides a natural way of approximating rotations. For very small rotations, only the ſrst-order term is kept, to yield ⎡ ⎤ 1 −r3 r2 R(r) ≈ I + r = ⎣ r3 1 −r1 ⎦ . (13.86) −r2 r1 1 R(r) = I + r +
For moderate rotations, two terms of the expansion are kept, leading to ⎤ ⎡ r1 r3 r22 + r32 r1 r2 − r3 + r2 ⎥ ⎢1 − 2 2 2 2 ⎥ ⎢ 2 1 ⎥ ⎢ r1 r2 r1 + r3 r2 r3 . R(r) ≈ I + r + rr = ⎢ + r − r 1 − 3 1 ⎥ ⎥ ⎢ 2 2 2 2 ⎣ r 2r 2⎦ r + r2 r2 r3 1 3 − r2 + r1 1 − 1 2 2 2
(13.87)
538
13 Parameterization of rotation
Composition of rotations It is difſcult to write the composition of rotation formulæ in terms of the rotation vector. The simplest way to proceed is to ſrst transform the corresponding rotation vectors to an Euler parameter representation, compose the rotations with the help of eq. (13.26), then transform the result back to the rotation vector representation. 13.7.2 The Euler-Rodrigues parameters The Euler-Rodrigues parameters are associated with the generating function p(φ) = 2 sin φ/2, i.e., φ v = 2¯ (13.88) n sin . 2 As its name suggests, this parameterization is closely related to Euler parameters. Indeed, v = 2e, and e = v/2. The following parameter plays an important role in this parameterization, 6 φ vT v v0 = 1 − = cos . (13.89) 4 2 This representation is limited to rotation angles |φ| < π, so that 0 ≤ v0 ≤ 1. For |φ| > π, this parameterization cannot distinguish between the distinct rotations of magnitudes φ and π − φ. Important quantities associated with the Euler-Rodrigues parameters are p = v0 , ν = 1, and ε = 1/v0 . Equation (13.45) yields the rotation tensor, using the following parameters: ζ1 = v0 and ζ2 = 1/2. Tensor H then follows from eq. (13.55) with σ0 = 1/v0 , σ1 = 1/2, and σ2 = 1/(4v0 ). Finally, tensor H −1 is obtained from eq. (13.60) with χ0 = v0 , χ1 = −1/2, and χ2 = 0. Composition of rotations Let p and q be the Euler-Rodrigues parameters of two successive rotations, and r the Euler-Rodrigues parameter of the composed rotation, such that R(r) = R(p)R(q). The composition formulæ, eqs. (13.66a) and (13.66b), then yield 1 T (p q), 4 1 r = q0 p + p0 q + pq. 2
r0 = p0 q0 −
(13.90a) (13.90b)
13.7.3 The Cayley-Gibbs-Rodrigues parameters The Cayley-Gibbs-Rodrigues parameters are associated with the generating function p(φ) = 2 tan φ/2, i.e.,
13.7 Speciſc parameterizations of rotation
φ r = 2¯ n tan . 2 The following parameter plays an important role in this parameterization, r0 =
φ 1 = cos2 . 1 + (r T r)/4 2
539
(13.91)
(13.92)
This representation is limited to rotation angles of magnitude |φ| < π, because it presents a singularity, r → ∞, when |φ| → π. This parameterization is closely related to Euler parameters, r = 2e/e0 and e = √ r0 r/2. It is also closely related to Cayley’s parameters presented in section 13.1, r = 2a. Important quantities associated with the Cayley-Gibbs-Rodrigues parameters are √ p = 1/r0 , ν = r0 , and ε = 1. Equation (13.45) yields the rotation tensor, using the following parameters: ζ1 = r0 and ζ2 = r0 /2. Tensor H then follows from eq. (13.55) with σ0 = r0 , σ1 = r0 /2, and σ2 = 0. Finally, tensor H −1 is obtained from eq. (13.60) with χ0 = 1/r0 , χ1 = −1/2, and χ2 = 1/4. Composition of rotations Let p and q be the Cayley-Gibbs-Rodrigues parameters of two successive rotations, and r the Cayley-Gibbs-Rodrigues parameters of the composed rotation, such that R(r) = R(p)R(q). The composition formulæ, eqs. (13.66a) and (13.66b), then yield r0 = p0 q0 Δ21 , 1 1 p + q + pq . r= Δ1 2
(13.93a) (13.93b)
where Δ1 = 1 − (pT q)/4. 13.7.4 The Wiener-Milenkovi´c parameters The Wiener-Milenkovi´c parameters are associated with the generating function p(φ) = 4 tan φ/4, i.e., φ c = 4¯ (13.94) n tan . 4 The following parameter plays an important role in this parameterization, c0 = 2(1 − tan2
φ cT c )=2− . 4 8
(13.95)
This representation is limited to rotation angles of magnitude |φ| < 2π, because it presents a singularity, c → ∞, when |φ| → 2π. This parameterization is closely related to Euler parameters. Indeed, c = 4e/(1+ e0 ) and c0 = 4e0 /(1 + e0 ). The inverse relationship is e = c/(4 − c0 ) and e0 =
540
13 Parameterization of rotation
c0 /(4 − c0 ). Because the Wiener-Milenkovi´c parameters can be obtained from this conformal transformation, they are sometimes referred to as the conformal rotation vector. The following relationship ease the manipulations of this parameterization: cos φ/2 = c0 /(4 − c0 ). Important quantities associated with the Wiener-Milenkovi´c parameters are p = 1/ν, ν = 2/(4 − c0 ), and ε = 2/c0 . Equation (13.45) yields the rotation tensor, using the following parameters: ζ1 = ν 2 c0 /2 and ζ2 = ν 2 /2. Tensor H then follows from eq. (13.55) with σ0 = ν, σ1 = ν 2 /2, and σ2 = ν 2 /8. Finally, tensor H −1 is obtained from eq. (13.60) with χ0 = 1/ν, χ1 = −1/2, and χ2 = 1/8. For the Wiener-Milenkovi´c parameterization, tensors H and G are closely related, H(c) = ν G and H −1 (c) = GT /ν. Composition of rotations Let p and q be the Wiener-Milenkovi´c parameters of two successive rotations, and r the conformal rotation parameters of the composed rotation, such that R(r) = R(p)R(q). The composition formulæ, eq. (13.66a) and (13.66b), then yield " ! (13.96a) r0 = 4 p0 q0 − pT q /(Δ1 + Δ2 ), " ! r = 4 q0 p + p0 q + pq /(Δ1 + Δ2 ), (13.96b) where Δ1 = (4 − p0 )(4 − q0 ) and Δ2 = p0 q0 − pT q. In most applications, it will be necessary to rescale the Wiener-Milenkovi´c parameters, as discussed in section 13.6.2. The two operations, composition and rescaling, are conveniently combined into a single operation, eq. (13.79), which, for Wiener-Milenkovi´c parameters, takes on a particularly simple form " ! 4 !q0 p + p0 q + pq " /(Δ1 + Δ2 ) if Δ2 ≥ 0, r= (13.97) −4 q0 p + p0 q + pq /(Δ1 − Δ2 ) if Δ2 < 0. The rescaling condition automatically selects the largest denominator, also guaranteeing the most accurate numerical evaluation of the composed rotation. Time derivatives of the tangent tensor When linearizing equations involving rotations expressed in terms of the WienerMilenkovi´c parameters, time derivatives of the tangent tensor are often required. First, this tensor is recast as aH = c0 + c + c cT /4, where a = (4 − c0 )2 /2. Simple algebra then implies c0 c c cT + + , a a 4 a c˙ c˙ cT c c˙ T a˙ ˙ = c˙0 + H + + − H, a a a 4 4 a a T T ¨ c ¨ c c c ˙ c ˙ c ¨cT 2a˙ ˙ c ¨ a ¨ 0 ¨ = + + + + − H H − H. a a a 4 2 a 4 a a a H=
(13.98a) (13.98b) (13.98c)
13.7 Speciſc parameterizations of rotation
541
In these expressions, the following notation was used cT c˙ c¨0 a c˙ T c˙ cT ¨c c2 c˙0 ), =− , =− − , (13.99a) 16 a 4 a a 4 a a 4 a c2 c˙0 a ¨ c˙0 c2 a˙ c2 c¨0 a = 2(1 + )2 , = −2(1 + ) , = a( )2 − 2(1 + ) . (13.99b) 16 a 16 a a a 16 a
c0 = 2(1 −
13.7.5 Problems Problem 13.13. The exponential map of rotation Prove that the rotation tensor expressed in terms of the rotation vector can be written as the exponential map, eq. (13.85).
Problem 13.14. Algebraic representations of rotation
(1) Show that for all vectorial parameterizations of rotation, tensors R, H and H −1 , and the composition of rotation formulæ are expressed in terms of three parameters only, ν, ε, and p . It then follows that if those three parameters can be expressed as algebraic functions of the rotation parameter vector, the corresponding vectorial parameterization of rotation enables the manipulation of rotation without using any trigonometric functions. (2) Show that the Wiener-Milenkovi´c parameters described in section 13.7.4 enables an algebraic representa¯ , also leads an tions of rotation. (3) Show that the following parameterization, s = 4 sin φ/4 n algebraic representations of rotation.
Problem 13.15. Short questions (1) Are the Euler angles using the 3-1-3 sequence a particular case of the vectorial parameterization of rotation? (2) Describe the singularities associated with the Euler-Rodrigues parameters. (3) Describe the procedure used to extract the Cayley-Gibbs-Rodrigues parameters from a given rotation tensor. (4) What is the main problem associated with the use of the Euler parameters? (5) Is it possible to ſnd a vectorial parameterization of rotation for which the tangent tensor is orthogonal?
Problem 13.16. Relationship between tensors G and H Based on eq. (13.68), ſnd the vectorial parameterization for which H = νG.
Problem 13.17. Study of the limit behavior Prove the following results related to the limit behavior of the angular velocity and acceleration vectors as the rotation parameter vector vanishes. (1) limp→0 H = I. (2) limp→0 M = − a/2, ˙ /2. (4) limp→0 ω = p. ˙ = p ˙ (5) limp→0 ω˙ = ¨ p. see eq. (13.67). (3) limp→0 H
Problem 13.18. Composing rotations with the Wiener-Milenkovi´c parameters Consider a sequence of rotation tensors Rk = R(tk ), where tk = kΔt is a sequence of equally spaced discrete instants in time; Δt = 0.01 s is the time step size. Each tensor in the ˆ such that sequence is obtained from the previous one through an incremental rotation tensor R ˆ ˆ Rk+1 = R Rk . Let the Wiener-Milenkovi´c parameters ck and ˆc be associated with Rk and R, T respectively. The initial tensor is such that c0 = 0, 0, 0 . The parameters of the incremental rotation are given as a function of time
542
13 Parameterization of rotation ⎡
⎤ sin θk cos ψk ˆc = 0.15 ⎣ sin θk sin ψk ⎦ , cos θk where θk = 3tk and ψk = 6tk − 2t3k . (1) Find the sequence of parameters ck , k = 1, . . . 200. Use the formula for composition of rotation with rescaling. (2) On one graph, plot the three Wiener-Milenkovi´c parameters ck . (3) On one graph, plot the direction cosines R11 , R21 , R31 as a function of time. Comment on your results. (4) Show that the components of the angular velocity vector in the ſxed system can be approximated in the following manner ω (tk + Δt/2) ≈
[Rk+1 RTk ] − [Rk+1 RTk ]T 2Δt
.
Relate the components of the angular velocity vector, ω, to the incremental rotation parameters ˆc. (5) On one graph, plot the three components of the angular velocity vector in the ſxed system as a function of time. (6) Derive a similar formula the the components of the angular velocity vector in the rotating system. (7) On one graph, plot the three components of the angular velocity vector in the rotating system as a function of time.
Problem 13.19. Rigid body tumbling in space In example 8.7, the equations of motion for a rigid body with respect to an inertial point were derived. The dynamic equilibrium equations, eqs. (8.43), are p˙ O = F O and h˙ O = M O . The relationship between the displacements and the momenta are in the form of eq. (8.44), recast here as · −1 pO 0 I u MB , = −1 c 0 H (c) hO − ( rB0 + u )pO where c is the Wiener-Milenkovi´c rotation parameter vector for rotation tensor R and mass matrix MB is deſned by eq. (8.38). The mass properties of the body are m = 6, 900 kg, η ∗T = 1.5, 0.5, 0.7 m, and I B∗ = diag(110, 9, 000, 15, 000) kg·m2 . At the initial time, the position and rotation parameter vectors are u(t = 0) = 0and c(t = 0) = 0, respec tively. The initial linear and angular momenta are pTO (t = 0) = 45, 000, 18, 000, 25, 000 kg·m/s, and hTO (t = 0) = 5, 000, 4, 000, 8, 000 kg·m2 ·rad/s. The externally applied loads are F O = 0 and M O = 0. (1) Integrate the governing equations of motion of the rigid body to ſnd its response for t ∈ [0, 300] s. Plot the components of the position vector of the reference point, u, and the orientation parameters of the rigid body, c, as a function of time. (2) Plot the inertial components of the velocity vector, u, ˙ and those of the inertial angular velocity, ω, of the rigid body. (3) Plot the components of the linear and angular momenta, and the kinetic energy of the body as a function of time. Comment on your results. (4) Plot the motion and velocity of the center of mass of the rigid body.
14 Parameterization of motion
While the parameterization of rotation discussed in chapter 13 has received wide attention, much less emphasis has been placed on that of motion. This is probably due to the fact that the analysis of motion is often described in terms of rather abstract mathematical formulations. For instance, Ball [259] and Angeles [260] used the concepts of twists and wrenches. A systematic, coordinate-free exposition of the different algebraic operations in the set of inſnitesimal displacements (screws) and their relation with ſnite displacement was developed by Chevallier [261]. Euler motion parameters are closely related to the dual-number quaternion algebra techniques used in kinematics [262, 263, 264]. Dual numbers, vectors, and matrices are described in the textbook by Fisher [265] and have received considerable attention in kinematics [1, 23, 22], dynamics [266, 267], and elastodynamics [268]. Many of these studies have shown that the most efſcient and elegant implementations of dual-number techniques are based on general screw theory with the screw expressed by means of Pl¨ucker coordinates, see section 5.1. Borri et al.[269] addressed the problem of parameterization of motion by focusing on two representations, the exponential map of motion and Cayley’s parameterization. Borri and Bottasso [270, 271] used these concepts to analyze curved beam in three-dimensional space, leading to the helicoidal approximation. The present chapter focuses on parameterization techniques for motion. In contrast with other approaches, the exposition presented here is expressed in terms of linear algebra concepts, which are easily understood and implemented in computer software. Cayley’s formulation that led to an algebraic representation of rotation is generalized in section 14.1 to the problem of motion. Euler motion parameters are presented in section 14.3 and provide a purely algebraic representation of motion. When using the bi-quaternion algebra introduced in section 14.2, all motion operations become bilinear expressions of bi-quaternions. These advantages, however, come at a high cost: eight parameters must be used instead of six, i.e., Euler motion parameters do not form a minimum set.
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_14 © Springer Science+Business Media B.V. 2011
544
14 Parameterization of motion
Mozzi-Chasles’ theorem, presented in section 5.1, states that an arbitrary motion of a rigid body can be represented by a screw motion. The axis of the screw is called the Mozzi-Chasles axis, denoted M, and its Pl¨ucker coordinates have been evaluated in section 5.1. The Pl¨ucker coordinates of an arbitrary material line of a rigid body subjected to a screw motion are known to transform by the action of the motion tensor, and M is an eigenvalue of this tensor associated with its positive unit eigenvalue. The vectorial parameterization of motion is introduced in section 14.4 and consists of minimal set of parameters deſning the components of a motion parameter vector. The vectorial parameterization of motion presents two fundamental properties. First, it is tensorial in nature: the tensorial nature of the second-order motion tensor implies and is implied by the tensorial nature of the motion parameter vector, a ſrst-order tensor. Second, rotation parameter vectors are parallel to the eigenvector of the motion tensor corresponding to its unit eigenvalue. Because these two properties imply each other, either can be taken as the deſnition of the vectorial parameterization of motion. A parameterization of motion is vectorial if and only if the motion parameter vector is parallel an eigenvector of the motion tensor associated with its unit eigenvalue. A complete description of motion is presented for a generic motion parameter vector. Relevant formulæ for speciſc parameterizations of this class are then easily obtained. Speciſc expressions are given for three parameterizations that present desirable properties: the exponential map of motion, the Cayley-Gibbs-Rodrigues motion parameters, and Wiener-Milenkovi´c motion parameters.
14.1 Cayley’s motion parameters In section 13.1, a purely algebraic description of rotation was obtained from the simple argument of length preservation. These developments lead to a multiplicative decomposition of the rotation tensor: R = (I − a)−1 (I + a), see eq. (13.4). Cayley’s parameters, a, are a by-product of this decomposition. This multiplicative decomposition exists for all vectorial parameterizations, see eq. (13.50). A similar decomposition is sought for the motion tensor, C, deſned by eq. (5.35). Consider the following relationship (I − ε Ru p/2) −ε q/2 R p/2) ε q/2 (I + ε , (14.1) = p/2) 0 R p/2) 0 (I − ε 0 (I + ε where q is an as yet unknown quantity. In view of eq. (13.50), three of the above submatrix equalities are readily satisſed. Using eq. (13.51), the last equality implies ε q = (I − ε p/2) u(I + ε p/2) = u + εu p/2 + ε2 (pT u) p/4. Clearly, q is related to p/2 + ε2 p pT /4 u. Finally, simple vector the displacement vector, u, εq = I − ε identities and the deſnition of ε, eq. (13.47), lead to q=
RT + I u. 2ζ1
(14.2)
14.1 Cayley’s motion parameters
545
The following notation is introduced P=
q . p
(14.3)
The two vectors, q and p, form the motion parameter vector, P. The multiplicative decomposition, eq. (14.1), now becomes * ε + * ε +−1 * ε +−1 * ε + C= I+ P I+ P = I− P I− P , (14.4) 2 2 2 2 is deſned by eq. (5.52). It is readily where the generalized vector product tensor, P, veriſed that eq. (13.51) generalizes to * " ε +−1 1! I− P C+I . = 2 2 The motion tensor is now written in terms of the vectors q and p as ⎡ ⎤ R(p) + I R(p) + I ε q ⎦. C(P) = ⎣R(p) 2 2 0 R(p)
(14.5)
In summary, the motion tensor can be expressed in a purely algebraic form in terms of the six parameters, P T = q T , pT . The rotation parameter vector, p, determines the rotation tensor. Vector q is related to the displacement vector of the reference point through eq. (14.2) and determines the remaining entries of the motion tensor. Similar developments lead to the following additional results * ε + * ε +−1 * ε +−1 * ε + C −1 = I − P I− P = I+ P I+ P , 2 2 2 2 * " ε +−1 1 ! −1 C +I , = I+ P 2 2 and ⎡ ⎤ T RT + I T R +I T −1 R ε q C (P) = ⎣ 2 2 ⎦. 0 RT In section 13.1, Cayley’s rotation parameters were shown to provide a purely algebraic description of rotation. Based on the length preservation property of rotation, Cayley’s formula, eq. (13.4), was derived, which takes the form of a multiplicative decomposition of the rotation tensor. In this section, a multiplicative decomposition of the motion tensor was derived based on purely algebraic arguments. The resulting decomposition, eq. (14.4), mirrors Cayley’s formula, eq. (13.4). The motion parameters deſned by eq. (14.3) are a byproduct of the multiplicative decomposition and provide a purely algebraic description of the motion tensor, eq. (14.5), which explicitly shows the dependency of the motion tensor on six parameters only.
546
14 Parameterization of motion
14.2 Bi-quaternion algebra In section 14.3, it will be shown that motion operations are conveniently expressed in terms of bi-quaternions. The present preparatory section focuses on the deſnition of bi-quaternions, the derivation of a number of their properties, and the deſnition of the operators that ease bi-quaternion algebra. A bi-quaternion is deſned as an array of two quaternions qˆ . (14.6) gˇ = eˆ Bi-quaternion operators Bi-quaternion operations are performed by using a number of matrices of size 8 × 8 deſned as follows T e) A(ˆ q) e) AT (ˆ q) A(ˆ A (ˆ ¯ A(ˇ , A(ˇ , (14.7a) g) = g) = T e) 0 A(ˆ e) 0 A (ˆ T e) B(ˆ q) e) B T (ˆ q) B(ˆ ¯ g ) = B (ˆ B(ˇ , B(ˇ , (14.7b) g) = e) 0 B(ˆ e) 0 B T (ˆ T e) C(ˆ q) e) C T (ˆ q) C(ˆ C (ˆ ¯ C(ˇ , C(ˇ . (14.7c) g) = g) = e) 0 C(ˆ e) 0 C T (ˆ Each operator is composed of four, 4 × 4 sub-matrices consisting of the quaternion operators deſned by eqs. (13.9). ˇ are two arbitrary bi-quaternions, the following matrix products comIf gˇ and h mute ¯ h) ˇ = B( ¯ h)A(ˇ ˇ A(ˇ g )B( g ), ˇ ˇ A(ˇ g )B(h) = B(h)A(ˇ g ),
¯ g )B(h) ˇ = B(h) ˇ A(ˇ ¯ g), A(ˇ ¯ g )B( ¯ h) ˇ = B( ¯ h) ˇ A(ˇ ¯ g), A(ˇ
(14.8b)
¯ h) ˇ = B(h)C(ˇ ˇ C(ˇ g )A( g ),
¯ g )B(h) ˇ = A( ¯ h) ˇ C(ˇ ¯ g ). C(ˇ
(14.8c)
(14.8a)
These commutativity properties mirror the corresponding properties of quaternion operators expressed by eqs. (13.11). These identities then imply the following results ˇ = B(h)ˇ ˇ g, A(ˇ g )h
¯ g )h ˇ = C( ¯ h)ˇ ˇ g, A(ˇ
¯ g )h ˇ = C(h)ˇ ˇ g. B(ˇ
(14.9)
Here again, these properties mirror the corresponding properties of quaternion operators expressed by eqs. (13.12). Next, the following results are easily checked ˇ = A(fˇ) ⇐⇒ fˇ = A(ˇ ˇ = B(h)ˇ ˇ g, A(ˇ g )A(h) g )h ¯ h) ˇ = A(fˇ) ⇐⇒ fˇ = C(ˇ ˇ = B( ¯ h)ˇ ˇ g, A(ˇ g )A( g )h
(14.10b)
ˇ = A(fˇ) ⇐⇒ fˇ = A(ˇ ˇ = C( ˇ g, ¯ g )A(h) ¯ g )h ¯ h)ˇ A(ˇ
(14.10c)
(14.10a)
14.3 Euler motion parameters
547
where the double-headed arrows indicate that the two equalities imply each other. These bi-quaternion identities are inherited from the quaternion counterparts, eqs. (13.13). Similarly ˇ = B(fˇ) ⇐⇒ fˇ = B(ˇ ˇ = A(h)ˇ ˇ g, B(ˇ g )B(h) g )h ¯ h) ˇ = B(fˇ) ⇐⇒ fˇ = C(ˇ ¯ g )h ˇ = A( ¯ h)ˇ ˇ g, B(ˇ g )B(
(14.11b)
ˇ = B(fˇ) ⇐⇒ fˇ = B(ˇ ˇ = C(h)ˇ ˇ g. ¯ g )B(h) ¯ g )h B(ˇ
(14.11c)
(14.11a)
Finally, operator W is deſned as W(ˇ g) =
1 e) S(ˆ q) S(ˆ = [A(ˇ g ) − B(ˇ g )], e) 0 S(ˆ 2
(14.12)
where quaternion operator S is deſned by eq. (13.15) which also implies the last equality. Bi-quaternions composed of orthogonal quaternions Two quaternions, qˆ and eˆ, are said to be orthogonal ifqˆT eˆ = 0. For bi-quaternions
composed of such pair of quaternions, gˇT = qˆT , eˆT , the following identities can be shown to hold ¯ g )A(ˇ ¯ g ) = B(ˇ ¯ g )B(ˇ ¯ g ) = |ˆ A(ˇ g) = A(ˇ g )A(ˇ g ) = B(ˇ g )B(ˇ e|2 I,
(14.13)
where I is the 6 × 6 identity matrix. Identities (13.18) were used to prove the above relationships.
14.3 Euler motion parameters The deſnition of the motion tensor, eq. (5.35), requires six parameters, three parameters to deſne the displacement vector, u, and three parameters to deſne the rotation tensor, R. The goal of this section is to develop a parameterization of motion, i.e., to ſnd a set of parameters that ease the manipulation of motion operations. In section 13.3, Euler parameters were introduced and quaternion algebra was shown to provide an effective tool to manipulate rotation operations in a purely algebraic manner. The vector part of Euler parameters is oriented along the eigenvector of the rotation tensor corresponding to a unit eigenvalue, see eq. (13.19). To generalize Euler parameters to the problem of motion, a set of parameters are derived that are parallel to the eigenvector of the motion tensor corresponding to a unit eigenvalue. To ease the algebra, the motion tensor, eq. (5.35), is expanded to an 8 × 8 operator using the quaternion algebra operators deſned in section 13.2, e) S(ˆ u)D(ˆ e) D(ˆ C= , (14.14) D(ˆ e) 0
548
14 Parameterization of motion
where eˆ is the unit quaternion representing the rotation tensor R, and u ˆT = 0, uT a non-unit quaternion with a vanishing scalar part. An eigenvector of this expanded motion tensor associated with the unit eigenvalue is , 1 qˆ e)ˆ u B(ˆ . (14.15) = N= 2 eˆ eˆ As discussed in section 5.5.2, several expressions for the eigenvector can be found, due to the multiplicity of two of the unit eigenvalue. The above expression, however, is convenient because a one to one correspondence exists between quaternions u ˆ and qˆ; indeed, qˆ =
1 B(ˆ e)ˆ u 2
⇐⇒ u ˆ = 2B T (ˆ e)ˆ q.
(14.16)
Because B(ˆ e) is an orthogonal operator, this mapping presents no singularities. The scalar part of qˆ is q0 = −1/2 eT u = −d/2 sin φ/2, where d is the intrinsic displacement of the rigid body deſned by eq. (5.7). On the other hand, the scalar part of u ˆ is u0 = 0 = 2 eˆT qˆ. This implies that quaternions eˆ and qˆ are orthogonal to each other eˆT qˆ = 0. (14.17) Finally, it is readily veriſed that the norms of quaternions u ˆ and qˆ are closely related, u ˆT u ˆ = 4 qˆT qˆ. The Euler motion parameters form a bi-quaternion and provide a convenient parameterization of the motion tensor, qˆ . (14.18) gˇ = eˆ Note the redundancy in this representation that requires eight parameters instead of the six forming a minimum set. The eight Euler motion parameters are subjected to two constraints: the normality condition for quaternion eˆ, see eq. (13.20), and the orthogonality of quaternions eˆ and qˆ, see eq. (14.17). 14.3.1 The motion tensor Equation (14.14) gives the expression for the motion tensor. The only term not expressed in terms of the Euler motion parameters is S(ˆ u)D(ˆ e). With the help of identity (13.15), this term becomes S(ˆ u)D(ˆ e) =
1 1 u)A(ˆ e)B T (ˆ e) − A(ˆ e)B(ˆ u)B T (ˆ e), A(ˆ 2 2
(14.19)
where operator D(ˆ e) was expressed by eq. (13.16), and the commutativity property (13.11b) was used in the second term. Using eq. (13.13a), the ſrst term of eq. (14.19) becomes A(ˆ q )B T (ˆ e), where qˆ is given by eq. (14.16). Next, note that
14.3 Euler motion parameters
549
because quaternion u ˆ has a vanishing scalar part, −B(ˆ u) = B T (ˆ u), and the second T term of eq. (14.19) becomes A(ˆ e)B (ˆ q ), where qˆ is given by eq. (14.16) once again. The motion tensor written in terms of the Euler motion parameters becomes e) A(ˆ q )B T (ˆ e) + A(ˆ e)B T (ˆ q) D(ˆ . C(ˇ g) = 0 D(ˆ e) ¯ deſned by eqs. (14.7a) and (14.7b), respectively, then Introducing operators A and B leads to ¯ g ) = B(ˇ ¯ g )A(ˇ C(ˇ g ) = A(ˇ g )B(ˇ g ). (14.20) The motion tensor becomes a bilinear function of Euler motion parameters. Note the parallel between this expression and that for the rotation tensor, eq. (13.16). The inverse of the motion tensor is found using similar developments, T e) B(ˆ q )AT (ˆ e) + B(ˆ e)AT (ˆ q) D (ˆ −1 , C (ˇ g) = T e) 0 D (ˆ and ſnally,
¯ g )B(ˇ ¯ g). C−1 (ˇ g ) = A(ˇ g ) = B(ˇ g )A(ˇ
(14.21)
14.3.2 The velocity vector The components of the velocity vector resolved in the ſxed frame are obtained from their deſnition, eq. (5.69a). This deſnition is expressed by means of 8 × 8 operators to ease algebraic manipulations ˙ g )C−1 (ˇ ¯ g ) · B(ˇ ¯ g ) = A(gˇ˙ )A(ˇ ¯ g ) + B( ¯ gˇ˙ )B(ˇ C(ˇ g ) = A(ˇ g )B(ˇ g )A(ˇ g ), where the commutativity relationships (14.8) and normality conditions (14.13) were ¯ B = I, and a time used. Because eˆ is a unit quaternion, eq. (14.13) implies B ¯ B. ˙ g )C−1 (ˇ ˙ The above expression then becomes C(ˇ ¯˙ B = −B g) = derivative yields B ¯ g ) − B(ˇ ¯ g )B(gˇ˙ ) = A(ˇ A(gˇ˙ )A(ˇ v /2) − B(ˇ v /2), where, according to eq. (14.10b) ¯ g )gˇ˙ . and (14.11c), vˇ = 2B(ˇ In summary, ˙ g )C−1 (ˇ C(ˇ g ) = W(ˇ v ),
T T T ˆ where vˇ = vˆ , ω is the velocity bi-quaternion in the ſxed frame given by ¯ g )gˇ˙ , vˇ = 2B(ˇ
(14.22)
The velocity bi-quaternion becomes a bilinear expression of the Euler motion parameters and their derivatives; note the parallel between eq. (14.22) for motion and its counterpart, eq. (13.23), for rotation. The vector parts of quaternions vˆ and ω ˆ are the velocity and angular velocity vectors, respectively. The scalar part of the velocity quaternion is v0 = 2[ˆ eT qˆ˙ + qˆT eˆ˙ ] =
550
14 Parameterization of motion
0, because qˆ and eˆ are orthogonal quaternions, and because eˆ is a unit quaternion, the scalar part of the angular velocity quaternion vanishes, ω0 = 2ˆ eT eˆ˙ = 0. The components of the velocity vector resolved in the material frame are obtained in a similar manner ˙ g ) = W(ˇ C−1 (ˇ g )C(ˇ v ∗ ),
ˆ ∗T is the velocity bi-quaternion in the material frame. These where vˇ∗T = vˆ∗T , ω results are written in a compact manner as ¯ g )gˇ˙ . vˇ∗ = 2A(ˇ
(14.23)
14.3.3 Composition of ſnite motions
Let gˇT = qˆT , eˆT , gˇ1T = qˆ1T , eˆT1 , and gˇ2T = qˆ2T , eˆT2 be the bi-quaternions of three motion tensors such that C(ˇ g ) = C(ˇ g1 )C(ˇ g2 ). The problem at hand is to determine bi-quaternion gˇ as a function of the other two. With the help of eq. (14.20), this ¯ g1 )A(ˇ ¯ g2 ) and the commutativity property (14.8a) expands to C(ˇ g ) = A(ˇ g1 )B(ˇ g2 )B(ˇ ¯ g1 )B(ˇ ¯ g2 ). Equations (14.10a) and (14.11a) then g ) = A(ˇ g1 )A(ˇ g2 )B(ˇ then implies C(ˇ ¯ g ) = A(ˇ g)B(ˇ g ), where gˇ = A(ˇ g1 )ˇ g2 . yield C(ˇ In summary, composition of motions expressed in terms of Euler motion parameters reduces to C(ˇ g ) = C(ˇ g1 )C(ˇ g2 ) ⇐⇒ gˇ = A(ˇ g1 )ˇ g2 = B(ˇ g2 )ˇ g1 .
(14.24)
This operation is bilinear in terms of the Euler motion parameters of the two motions; note the parallel between eq. (14.24) for motion and its counterpart, eq. (13.26), for rotation.
If bi-quaternions gˇ1T = eˆT1 , qˆ1T and gˇ2T = eˆT2 , qˆ2T are such that
Teˆ1 and T T T eˆ2 are unit quaternions and eˆ1 qˆ1 = eˆ2 qˆ2 = 0, bi-quaternion gˇ = eˆ , qˆT e1 )A(ˆ e1 )ˆ e2 = eˆT2 eˆ2 = 1, beenjoys the same properties. Indeed, eˆT eˆ = eˆT2 AT (ˆ cause eˆ1 and eˆ2 both are unit quaternions. Furthermore, eˆT qˆ = eˆT2 AT (ˆ e1 )A(ˆ e1 )ˆ q2 + eˆT2 AT (ˆ e1 )A(ˆ q1 )ˆ e2 = eˆT2 qˆ2 + eˆT1 B T (ˆ e2 )B(ˆ e2 )ˆ q1 = eˆT2 qˆ2 + eˆT1 qˆ1 = 0. 14.3.4 Determination of Euler motion parameters The last task is to determine Euler motion parameters given the components of the motion tensor. Unfortunately, this inverse relationship cannot be expressed in a simple manner. In view of eq. (14.20), the motion tensor written in the following form e) Z(ˆ q , eˆ) R(ˆ , C(ˇ g) = 0 R(ˆ e) where R(ˆ e) is given by eq. (13.21) and Z(ˆ q , eˆ) = 2 [e0 q + q0 e + eq + qe] .
(14.25)
14.3 Euler motion parameters
551
The determination of the bi-quaternion gˇT = qˆT , eˆT proceeds in two steps. First, quaternion eˆ is determined from the rotation tensor, R(ˆ e), by following the procedure described in section 13.3.4. The second step is to determine quaternion qˆ from operator Z(ˆ q , eˆ). Consider the following symmetric matrix constructed from the components of Z(ˆ q , eˆ), ⎡
⎤ tr(Z) Z32 − Z23 Z13 − Z31 Z21 − Z12 ⎢Z32 − Z21 ⎥ ⎥. T =⎢ ⎣Z13 − Z31 ⎦ Z + Z T − tr(Z)I Z21 − Z12 Introducing the deſnition of matrix Z, eq. (14.25), then yields ⎡ e0 q0 + q0 e0 ⎢e0 q1 + q0 e1 ⎢ T = 4⎣ e0 q2 + q0 e2 e0 q3 + q0 e3
e1 q0 + q1 e0 e1 q1 + q1 e1 e1 q2 + q1 e2 e1 q3 + q1 e3
e2 q0 + q2 e0 e2 q1 + q2 e1 e2 q2 + q2 e2 e2 q3 + q2 e3
⎤ e3 q0 + q3 e0 e3 q1 + q3 e1 ⎥ ⎥ = 4(ˆ eqˆT + qˆeˆT ). e3 q2 + q3 e2 ⎦ e3 q3 + q3 e3
Quaternion qˆ can readily be computed from any column of this matrix. This determination, however, will involve a division by components of quaternion eˆ; hence, inaccurate results will be obtained when dividing by small, or zero values. The most accurate results will be obtained by selecting index m such that |em | > |ei |, i = m. The components of qˆ are then 1 Tmm 1 Tmi qm = , qi = (14.26) − ei qm , i = m. em 8 em 4 Of course, the integrity of the data should be checked by verifying that qˆ and eˆ are orthogonal quaternions. Example 14.1. Kinetic energy of a rigid body The kinetic energy of a rigid body undergoing an arbitrary motion was developed in example 8.5, on page 311. Find the expression for the kinetic energy of a rigid body depicted in ſg. 8.6 expressed in terms of Euler motion parameters. Equation (8.31) gives the kinetic energy of the rigid body. To express this quantity in terms of Euler motion parameters, it is convenient to use the two quaternions, vˆ∗ and ω ˆ ∗ , introduced in section 14.3.2. These two quaternions with vanishing scalar ˆ ∗T . parts are combined into a bi-quaternion, vˇ∗T = vˆ∗T , ω Next, the following 8 × 8 mass matrix is introduced η ∗T ) mI 4×4 mS(ˆ B∗ M = , (14.27) mS(ˆ η ∗ ) M B∗
where ηˆ∗ = 0, η ∗ is a quaternion with a vanishing scalar part and whose vector part stores the components of the relative position vector of the body’s center of mass with respect to the reference point, resolved in the body attached basis. Operator S
552
14 Parameterization of motion
is deſned by eq. (13.15), and matrix M B∗ by eq. (13.30). Notation (·)∗ indicates tensor components resolved in the body attached basis. The kinetic energy now becomes 1 ∗T B∗ ∗ vˇ M vˇ . (14.28) 2 Bi-quaternion vˇ∗ is now readily expressed in terms of Euler motion parameters using eq. (14.23), to ſnd K=
¯ T (ˇ ¯ g )gˇ˙ = 2 gˇT C ¯ T (gˇ˙ )MB∗ C( ¯ gˇ˙ )ˇ K = 2 gˇ˙ T A g )MB∗ A(ˇ g,
(14.29)
where the last equality follows from identity (14.9). Clearly, it is expeditious to express all quantities in the bi-quaternion formalism before introducing Euler motion parameters. Example 14.2. Hamilton’s principle for a rigid body Application of Hamilton’s principle to the rigid body problem leads to a compact form of the equations of motion, as was shown in example 8.6 on page 312. Based on Hamilton’s principle, derive the equations of motion of the rigid body depicted in ſg. 8.6 in terms of Euler motion parameters. As observed in the previous example, it is expeditious to express all quantities in terms of bi-quaternion before introducing Euler motion parameters. The virtual
ˆ ∗T = δu∗0 , R δuT and displacement and rotation quaternions are deſned as δu
ˆ ∗T = δψ ∗ , δψ ∗T , respectively. These two quaternions are combined into the δψ 0 4 3 ˇ ∗T = δu ˆ ∗T . ˆ ∗T , δψ virtual motion bi-quaternion resolved in the material frame, δu force and moment are deſned in a similar manner as Fˆ ∗T =
The
quaternions ∗T ∗T ∗T ˆ and Q , respectively. Both quaternions have a vanishing = 0, Q 0, F scalar part and are combined into the load bi-quaternion resolved in the material ˆ ∗T . frame, a ˇ∗T = Fˆ ∗T , Q
ˆ ∗T = 0, h∗T are the linear and angular moFinally, pˆ∗T = 0, p∗T and h mentum quaternions, respectively. Both quaternions have a vanishing scalar part and are combinedinto the momentum bi-quaternion resolved in the material frame, ˆ ∗T . pˇ∗T = pˆ∗T , h With these notations at hand, the virtual work done by the applied loads deˇ∗ = a ˇ The load bi-quaternion, ſned by eq. (8.45) becomes δW nc = a ˇ∗T δu ˇT δu. −1 ∗ a ˇ=C a ˇ , consists of two quaternions with vanishing scalar components, Fˆ O and O ˆ , whose vector parts store the components of the applied force and moment, reQ spectively, computed with respect to inertial point O. Virtual changes in the kinetic energy of the rigid body follow from eq. (14.28) ˇ˙ and as δK = δˇ v ∗T MB∗ vˇ∗ = δˇ v ∗T pˇ∗ . By analogy with eq. (8.46), δˇ v ∗ = C−1 δu, ˇ˙ T pˇ. The momentum bivirtual changes in the kinetic energy then becomes δK = δu quaternion, pˇ = C−T pˇ∗ , consists of two quaternions with vanishing scalar compoˆ O , whose vector parts store the components of the linear and angular nents, pˆO and h momenta, respectively, computed with respect to inertial point O.
14.3 Euler motion parameters
553
Euler motion parameters are subject to two constraints, the orthogonality constraint, C1 = 2ˆ eT qˆ = 0, and the normality constraint, C2 = eˆT eˆ − 1 = 0. The po tential of these constraints, eq. (10.6), becomes V c = −λT C, where C T = C1 , C2 is the array of constraints and λ the array of Lagrange’s multiplier used to enforce these constraints. Variation of the potential of the constraints yields T T T T ˆ qˆ ˆ qˆ T T e T T e c ˇ g = −δλ C − λ ˆT T B(ˇ δV = −δλ C − 2λ ˆT T δˇ g )δu 0 eˆ 0 eˆ ˆˆ ˇ T 1 0 λ, = −δλT C − δu ˆ0 ˆ1
where ˆ0 denotes a vanishing quaternion and ˆ1T = 1, 0T a quaternion with a unit scalar part and a vanishing vector part. By analogy with eq. (14.22), variations in ˇ Euler motion parameters are related to the virtual motion vector as δˇ g = 1/2 B(ˇ g )δu. + c For this problem, L = K −V and Hamilton’s principle for constrained system, eq. (10.10), now implies % tf ˆˆ ˇ T 1 0 λ + δu ˇ Ta ˇ˙ T pˇ + δλT C + δu ˇ dt = 0. δu ˆ0 ˆ1 ti All boundary terms are ignored here. After integration by parts of the ſrst term, the equations of motion of the system are found to be C = 0, the constraint equations, and · pˆO FˆO λ1 ˆ1 ˆ O − λ2 1ˆ = Q ˆO . h Because bi-quaternions pˇ and a ˇ are resolved in the inertial frame, their quaternion components are evaluated with respect to the origin of the inertial frame, point O. Lagrange’s multipliers are readily eliminated by only keeping the vector parts of the quaternion equations, leading to the following equations of motion for the rigid body, p˙ O = F O and h˙ O = QO . These equations are identical to those obtained earlier, see eqs. (8.48a). The last step of the procedure is to evaluate the time derivatives of Euler motion parameters with respect to time. First, eq. (14.23) yields gˇ˙ = A(ˇ g )ˇ v ∗ /2, and ſnally, g )(MB∗ )−1 CT (ˇ g )ˇ p/2, which leads to the following system of ordinary difgˇ˙ = A(ˇ ferential equations, ⎧ ⎫· ⎧ ⎫ FO ⎨ pO ⎬ ⎨ ⎬ QO = , (14.30) hO ⎩ ⎭ ⎩ ⎭ gˇ A(ˇ g )(MB∗ )−1 CT (ˇ g )ˇ p/2 ˆ O vanish. where the scalar parts of quaternions pˆO and h 14.3.5 Problems Problem 14.1. Inverse of the motion tensor Prove eq. (14.21) starting from the deſnition of the motion tensor, eq. (5.59).
554
14 Parameterization of motion
Problem 14.2. Angular velocity with Euler motion parameters
˙ g ) = W(ˇ Prove eq. (14.23) by evaluating C−1 (ˇ g )C(ˇ v ∗ ). Find the relationship between vˇ, ∗ deſned in eq. (14.22) and vˇ , deſned in eq. (14.23).
Problem 14.3. Eigenvectors of the motion tensor Prove that two linearly independent eigenvectors tensor associated with its unit T ofT the motion e)/2, eˆT . ˆ B (ˆ eigenvalue are NT1 = eˆT , ˆ 0T and NT2 = u
14.4 The vectorial parameterization of motion In the previous section, the Euler motion parameters have been shown to provide an elegant, purely algebraic representation of ſnite motion. In fact, when using biquaternions, all motion operations become bi-linear expressions of bi-quaternions. These advantages, however, come at a high cost: eight parameters must be used instead of six, i.e., the Euler motion parameters do not form a minimum set. Furthermore, the normality and orthogonality conditions inherent to the representation must be enforced as constraints. The vectorial parameterization of motion [272] consists of a minimal set of parameters deſning the components of two vectors. The vectorial nature of this class of parameterization of motion sets it apart from the other parameterizations investigated earlier. 14.4.1 Fundamental properties Consider three motions characterized by displacement vectors, u1 , u2 , and u3 , and rotation tensors, R1 , R2 , and R3 , respectively. The three motions, denoted (u1 , R1 ), (u2 , R2 ), and (u3 , R3 ), respectively, are associated with three motion tensors, C 1 , C 2 , and C 3 , respectively, through the intrinsic expression of the motion tensor, eq. (5.53a). Assume that the following triple product of motion tensors relates these three quantities, C 3 = C −1 C1C 2. (14.31) 2 As discussed in section 5.6.3, this operation corresponds to a change of frame operation for motion tensors: C 1 and C 3 are the components of the same motion tensor expressed in two frames related by motion tensor C 2 . Using the intrinsic expression for the motion tensor, eq. (5.53a), eq. (14.31) now becomes −1 −1 −1 C 3 = I + Z(d1 c1 , sin φ1 ) C N 1 + Z(d1 c2 , 1 − cos φ1 ) C N 1 C N 1, 2 2 2
where eq. (5.54) was used. Comparing this result with the intrinsic expression for the motion tensor implies
14.4 The vectorial parameterization of motion
φ3 = φ1 , , 1 u2 ) RT2 (m1 + n m3 −1 . = N 3 = C2 N 1 = n ¯3 RT n ¯1
555
(14.32a) (14.32b)
2
These equations express the two conditions required for the proper transformation of motion tensors components under a change of frame. Note that an additional condition is required, d3 = d1 , but is implied by eqs. (14.32). Indeed, eq. (14.32b) yields n ¯ T3 m3 = n ¯ T1 R2 RT2 (m1 + n 1 u2 ) = n ¯ T1 m1 , or λ3 = λ1 , where λ is deſned by eq. (5.41). In view of eq. (14.32a) and (5.41), λ3 = λ1 then yields d3 = d1 . Let p(φ) be an arbitrary scalar function of angle φ; eq. (14.32a) then implies p(φ3 ) = p(φ1 ). Multiplication of eq. (14.32b) by p3 = p(φ3 ) on the left-hand side and p1 = p(φ1 ) = p(φ3 ) on the right-hand side then yields , T p (m + n u ) R p 3 m3 1 1 1 2 2 . (14.33) = C −1 p(φ1 )N 1 = p(φ3 )N 3 = 2 p3 n ¯3 RT2 p1 n ¯1 This equation is equivalent to eqs. (14.32). Indeed, taking the norm of the last three of eqs. (14.33) yields p3 = p1 , or φ3 = φ1 , because n ¯ 1 and n ¯ 3 are unit vectors and R2 an orthogonal tensor. Dividing eq. (14.33) by p(φ3 ) then yields eq. (14.32b) because p3 = p1 . The vectorial parameterization of motion is deſned as P = p(φ)N ,
(14.34)
where P is the motion parameter vector. Equation (14.33) can now be recast in a more compact manner as P 3 = C −1 P 1. (14.35) 2 The discussion presented above establishes that the tensorial nature of the motion tensor expressed by the transformation rule of its components, eq. (14.31), implies the tensorial nature of the rotation parameter vector expressed by the transformation rule of its components, eq. (14.35). It is easily shown that the process can be reversed, i.e., tensorial nature of the rotation parameter vector implies that of the rotation tensor. In summary, the vectorial parameterization of motion presents two fundamental properties. 1. The vectorial parameterization of motion is tensorial in nature, as expressed by the following equivalence, C(P 3 ) = C −1 (P 2 )C(P 1 )C(P 2 ) ⇐⇒ P 3 = C −1 (P 2 )P 1 .
(14.36)
The tensorial nature of the second-order motion tensor implies and is implied by the tensorial nature of the motion parameter vector, a ſrst-order tensor. 2. Motion parameter vectors are parallel to an eigenvector of the motion tensor associated with its unit eigenvalue. Equation (5.39) shows that vector N is a linear
556
14 Parameterization of motion
combination of two linearly independent eigenvectors of the motion tensor, both associated with its unit eigenvalue; equation (14.34) then implies that the motion parameter vector shares this property. Because these two properties imply each other, either can be taken as the deſnition of the vectorial parameterization of motion. A parameterization of motion is vectorial if and only if the motion parameter vector is parallel an eigenvector of the motion tensor associated with its unit eigenvalue. 14.4.2 The motion parameter vector A more explicit expression of the motion parameter vector is as follows D(p)u q pE(φ)u pm P= = = = pN = , p p¯ n p p¯ n
(14.37)
where p is the vectorial parameterization of rotation, see section 13.4, and tensor E is deſned by eq. (5.44). Using the notation developed for the vectorial parameterization of rotation, tensor D becomes 1 D(p) = δ0 − p + δ2 pp, 2
(14.38)
where functions δ0 (φ) and δ2 (φ) are even functions of the rotation angle given by α , ν 1 1 . δ2 = 2 δ0 − p ε
δ0 =
(14.39a) (14.39b)
Tensor D can be resolved in the canonical basis deſned by eq. (4.32) to ſnd ⎤ ⎡ α 0 0 1⎣ [E] 0 cos φ/2 sin φ/2 ⎦ . (14.40) D = ν 0 − sin φ/2 cos φ/2 The determinant of this tensor is now simply evaluated as det(D) = α/ν 3 . The inverse of this tensor is readily found as F (p) = D−1 (p) = ϕ0 + ϕ1 p + ϕ2 pp,
(14.41)
where functions ϕ0 (φ), ϕ1 (φ), and ϕ2 (φ) are even functions of the rotation angle given by ν , α ϕ1 = ζ2 , 1 ϕ2 = 2 (ϕ0 − ζ1 ) . p ϕ0 =
(14.42a) (14.42b) (14.42c)
14.4 The vectorial parameterization of motion
557
where coefſcients ζ1 and ζ2 are given by eqs. (13.46). Tensor F enjoys the following remarkable properties R = F F −T = F −T F ,
(14.43a)
R − I = F p = pF , p = F
−T
−F
−1
(14.43b) (14.43c)
,
which are similar to those of the tangent tensor, eqs. (13.63). The motion parameter vector is not fully deſned yet because it depends on the choice of the generating function, p(φ), of the vectorial parameterization of rotation and furthermore, parameter α can be selected arbitrarily. Generating functions must be odd functions of the rotation angle and present the limit behavior expressed by eq. (13.44), i.e., all rotation parameter vectors must approach the inſnitesimal rotation vector when φ → 0. Similarly, the displacement related part of the motion parameter vector, q, should approach the inſnitesimal displacement vector for vanishing motions. In view of eq. (14.37), this requirement implies limφ→0,d→0 D(p) = I, or limφ→0,d→0 α/ν = 1, and ſnally lim α = 1. (14.44) φ→0,d→0
Time derivative of the displacement In the manipulation of the time derivatives of the motion tensor, it will be necessary to evaluate u˙ = (F q)· = F˙ q + F q˙ , which can be written as u˙ = L(q, V p)p˙ + F q˙ , where operator L is implicitly deſned as follows ˙ F˙ (p)q = L(q, p)p.
(14.45)
Using eq. (14.41), operator L is easily found as ϕ ϕ 1 ϕ0 + 1 p + 2 p p q pT − ϕ1 q − ϕ2 (2 L(q, p) = p q − q p) , p p p p
(14.46)
where the notation (·) indicates a derivative with respect to angle φ. Operator L enjoys the following properties, RT1 L(q, p)R1 = L(RT1 q, RT1 p),
(14.47a)
L(q 1 + q 2 , p) = L(q 1 , p) + L(q 2 , p),
(14.47b)
L p = Fq − F q,
(14.47c)
p L = Fq − R q H T ,
(14.47d)
L( p q, p) = p L(q, p) − L(q, p) p = F q − R q H . T
(14.47e)
The ſrst property, eq. (14.47a), expresses the transformation of the components of operator L under a change of basis of both of its arguments. The second property,
558
14 Parameterization of motion
eq. (14.47b), expresses the linearity of operator L with respect to its ſrst argument. Property (14.47d) stems from the deſnition of operator L, pF˙ q = pL p, ˙ and noting ˙ ˙ ˙ that eq. (14.43b) implies pF = R − pF . 14.4.3 The generalized vector product tensor Skew-symmetric tensor p plays an important role in the vectorial parameterization of rotation as it appears in the explicit expression of all rotation related tensors, see plays an important role in the section 13.4. The generalized vector product tensor, P, vectorial parameterization of motion. The tensorial nature of the generalized vector product operator directly follows from eq. (5.54), leading to 3 = C −1 (P )P 1 C(P ) ⇐⇒ P = C −1 (P )P . P 2 2 3 2 1
(14.48)
This statement generalizes eq. (4.30), which expresses the tensorial nature of the skew-symmetric operator, p. Identity (5.55) generalizes as P + Z(2, p2 )P = 0, PP
(14.49)
where tensor Z is deſned by eq. (5.51) and scalar is closely related to the intrinsic displacement of the rigid body deſned by eq. (5.7), = pT q =
pd . ϕ0
(14.50)
14.4.4 The motion tensor The motion tensor and its inverse are obtained from eqs. (5.53) as
C
+ Z(ζ¯2 , ζ2 )P P, C(P) = I + Z(ζ¯1 , ζ1 )P
(14.51a)
+ Z(ζ¯2 , ζ2 )P P. (P) = I − Z(ζ¯1 , ζ1 )P
(14.51b)
−1
The parallel between the expressions for the rotation and motion tensors, eqs. (13.45) and (14.51), is now evident. Coefſcients ζ1 and ζ2 are given by eqs. (13.46), and ζ¯1 = (ϕ2 − ϕ0 ζ2 ), ζ¯2 = (ϕ2 ζ1 − ζ22 ).
(14.52a) (14.52b)
The following multiplicative decomposition of the motion tensor generalizes the corresponding expression for the rotation tensor, eq. (13.50), −1 1 1 C(P) = I + Z(¯ ε, ε)P I − Z(¯ ε, ε)P 2 2 (14.53) −1 1 1 . I + Z(¯ = I − Z(¯ ε, ε)P ε, ε)P 2 2
14.4 The vectorial parameterization of motion
559
Furthermore, eq. (13.51) also generalizes as
1 I − Z(¯ ε, ε)P 2
−1
=
C+I , 2
1 I + Z(¯ ε, ε)P 2
C −1 + I . 2
−1
=
(14.54)
In these last two equations, coefſcient ε is given by eq. (13.47b) and ζ1 ε¯ = 2ζ¯2 − εζ¯1 .
(14.55)
These equations also generalize Cayley’s multiplicative decomposition presented in section 14.1. 14.4.5 The velocity vector The velocity vector is obtained from a time derivative of the motion tensor, as indicated in eq. (5.64). For the vectorial parameterization of motion, this becomes , ˙ u˙ + u ω v V= = = F (p)q + F (p)˙q + F (p)q H(p)p˙ . ω ω H(p)p˙ The velocity vector is now related to the time derivative of the motion parameter vec˙ where tangent tensor H is deſned by eq. (14.57a). Using eq. (5.67), tor, V = H P, similar developments for the components of the velocity vector resolved in the ma˙ where tensor H∗ is deſned by eq. (14.57b). terial frame lead to V ∗ = H∗ P, In summary, the components of the velocity vector resolved in the inertial and material frames, denoted V and V ∗ , respectively, are related to the time derivatives of the motion parameter vectors through the following relationships, ˙ V = H P, ∗ ∗ ˙ V = H P.
(14.56a) (14.56b)
Explicit expressions for tensors H, H∗ , and their inverses are H= H∗ = H−1 = H∗−1 =
F L + Fq H , 0 H
F T RT L , 0 HT * + F −1 −F −1 L H −1 + Fq
0
H −1
F −T −F −1 L H −T , 0 H −T
where operator L(q, p) is deſned by eq. (14.45).
(14.57a) (14.57b) ,
(14.57c) (14.57d)
560
14 Parameterization of motion
Properties of the tangent tensor Tangent tensors H, H∗ , and their inverses enjoys the following remarkable properties, C = H H∗−1 , C
−1
C
−1
=H H
C−I = −1
∗
(14.58a) (14.58b)
,
H = P
H P,
H∗ = −H∗ P, − I = −P =H P
∗−1
−H
−1
.
(14.58c) (14.58d) (14.58e)
These properties are established directly from the deſnition of the tangent tensor, eqs. (14.57), taking into account the properties of the tangent tensor for the vectorial parameterization of rotation, eqs. (13.63), and those of tensor F , eqs. (14.43). The properties of operator L, eqs. (14.47), must also be used. Tangent tensor H is speciſc to a particular vectorial parameterization of motion, i.e., its expression depends on the choice of the generating function. It is, however, a second-order tensor because the following equivalence holds H(P 3 ) = C −1 (P 2 )H(P 1 )C(P 2 ) ⇐⇒ P 3 = C −1 (P 2 )P 1 .
(14.59)
Although tensor H is not an intrinsic tensor because it depends on the choice of a speciſc generating function, it is a second-order tensor for all vectorial parameterizations of motion. Equation (14.59) is established directly from the deſnition of tensor H, eq. (14.57a), by using eqs. (13.63), (14.43), and eqs. (14.47). Alternative expression of tangent tensor Tangent tensors H, H∗ , and their inverses are given by eqs. (14.57), which are valid for any choice of parameter α. If this parameter is selected to be α = νp , alternative expressions of tangent tensor can be obtained, + Z(¯ P, H(P) = Z(¯ σ0 , σ0 ) + Z(ζ¯2 , ζ2 )P σ2 , σ2 )P
(14.60a)
+ Z(¯ P, σ0 , σ0 ) − Z(ζ¯2 , ζ2 )P σ2 , σ2 )P H (P) = Z(¯
(14.60b)
∗
+ Z(χ ¯0 , χ0 ) − P/2 ¯2 , χ2 )PP, H−1 (P) = Z(χ
(14.60c)
+ Z(χ (P) = Z(χ ¯0 , χ0 ) + P/2 ¯2 , χ2 )PP.
(14.60d)
H
∗−1
The parallel between these expressions and those for the vectorial parameterization of rotation, eqs. (13.55) and (13.60), is evident. Coefſcients σ0 , σ1 , and σ2 are given by eqs. (13.56), and σ ¯0 = ϕ0 /(pp ), p σ ¯2 = σ ¯0 − 2ϕ2 − ζ¯1 . 2
(14.61a) (14.61b)
14.5 Speciſc parameterizations of motion
561
Coefſcient χ0 and χ2 are given by eqs. (13.61), and χ ¯0 = −χ0 δ0 σ ¯0 , ¯ ¯ ζ1 ζ2 − +χ ¯0 ζ2 − 2ζ2 δ2 . ¯2 = p2 ζ2 χ ε 2
(14.62a) (14.62b)
14.4.6 Determination of the motion parameter vector The motion tensor can be written as C(P) =
R(p) Z(p, q) , R(p) 0
where Z(p, q) = FqR. To determine the components of the motion parameter vector from this motion tensor, the rotation parameter vector is ſrst extracted from R(p) using the procedure described in section 13.4.4. The displacement related vector, q, is then extracted from tensor Z q = F −1 (p)axial Z(p, q)RT (p) .
(14.63)
14.4.7 Composition of ſnite motions
Let P, P T1 = q T1 , pT1 , and P T2 = q T2 , pT2 correspond to motion tensors C(P), C(P 1 ), and C(P 2 ), respectively. If C(P) = C(P 1 )C(P 2 ), the problem is to relate P to P 1 and P 2 . The ſrst step of the process is to note that R = R1 R2 , and hence, eqs. (13.66a) and (13.66b) yield p as a function of p1 and p2 . Next, the relationship between q and q 1 , q 2 is obtained as F (p)q = F (p1 )q 1 + R(p1 )F (p2 )q 2 , and ſnally q = F −1 (p)F (p1 )q 1 + F −T (p)F T (p2 )q 2 .
(14.64)
14.5 Speciſc parameterizations of motion The vectorial parameterization of motion presented in the previous sections consists of a set of displacement related parameters, q = D u, and of the rotation parameter vector, p. The motion parameter vector is parallel to an eigenvector of the motion tensor associated with its unit eigenvalue. This leads to families of parameterizations that depend on two choices: the choice of the generating function, p(φ), and that of parameter α appearing in tensor D. As discussed in section 13.5, the generating function can be selected to simplify some of the operators involved in manipulating rotations. But more importantly, judicious choices of this function can eliminate the singularities that occur in the various rotation operators.
562
14 Parameterization of motion
The occurrence of singularities is also a major concern when dealing with the vectorial parameterization of motion. Two criteria guide the selection of function α(φ). First, a one to one, singularity free relationship must exist between the displacement related part of motion parameter vector, q, and the physical displacement vector, u. Second, the limit behavior expressed by eq. (14.44) must be satisſed. 14.5.1 Alternative choices of the motion parameter vector Arbitrary parameter α was introduced in section 5.5.2 to reƀect the non-uniqueness of the eigenvector of the motion tensor associated with its unit eigenvalue. An additional constraint is required to evaluate this parameter. For instance, imposing the orthogonality condition, q T p = 0, leads to α = 0, and the resulting motion parameter vector then corresponds to the Pl¨ucker coordinates of the Mozzi-Chasles axis, as discussed in section 5.5.1. This choice, however, does satisfy the limit behavior condition, eq. (14.44), and furthermore, because det(D) = α/ν 3 = 0, a one-to-one mapping between q and u ceases to exist for this choice. Comparing eqs. (14.40) and (4.34), the close relationship between tensors D[E] and G[E] is apparent. Indeed, for α = 1, tensors D and F become D=
1 T G , ν
(14.65)
F = νG,
respectively. The determinant of tensor D is det(D) = 1/ν 3 . This choice satisſes the limit behavior condition, eq. (14.44). The close connection between tensors D and H −1 is evident when comparing eqs. (14.40) and (13.62). For α = νp , tensors D and F are identical to the inverse of the tangent tensor and to the tangent tensor, respectively, i.e., D = H −1 ,
(14.66)
F = H,
respectively. The determinant of tensor D is det(D) = p /ν 2 . Here again the limit behavior condition is satisſed. It is possible to eliminate the quadratic term in p of tensor D by choosing α = ν/ε. This leads to the following expressions for tensors D and F D(p) =
1* ε + I − p , ε 2
F =ε
R+I , 2
(14.67)
respectively. The determinant of tensor D is det(D) = 1/(εν 2 ). One ſnal alternative is to select α = ε/ν, and comparing eqs. (14.40) and (13.48) then leads to * I + RT ε + D(p) = , F = ζ1 I + p , (14.68) 2ζ1 2 and det(D) = 1/(ν 2 ζ1 ). This choice leads to Cayley’s motion parameters presented in section 14.1, see eq. (14.2).
14.5 Speciſc parameterizations of motion
563
Of all the choices presented in this section, α = νp seems to the most desirable because it leads to D = H −1 . This choice satisſes the limit behavior expressed by eq. (14.44), and because the tangent tensor plays a critical role in manipulating rotations, eliminating singularities from this tensor is already a criterion for the selection of appropriate generating functions. A singularity free tangent tensor in the vectorial parameterization of rotation will then automatically lead to a one-to-one mapping between q and u, avoiding the occurrence of singularities in the vectorial parameterization of motion. 14.5.2 The exponential map of motion One of the simplest choices of the generating function is p(φ) = φ, which leads to the exponential map of motion [269]. This parameterization involves the evaluation of numerous trigonometric functions and will not be discussed here, although all the relevant formulae can be obtained by introducing the generating function, p(φ) = φ, into the expression given in the previous sections. 14.5.3 The Euler-Rodrigues motion parameters The Euler-Rodrigues rotation parameters, v, are associated with the generating function p(φ) = 2 sin φ/2 and the properties of this parameterization are detailed in section 13.7.2. The corresponding Euler-Rodrigues motion parameters are deſned as Du P= . (14.69) v The relevant coefſcients for this parameterization, ζ1 , ζ2 , σ0 , σ1 , σ2 , χ0 , χ1 , and χ2 , are given in section 13.7.2. Parameters δ0 and δ2 , and ϕ0 , ϕ1 , and ϕ2 , are given by eqs. (14.39) and (14.42), respectively. Selecting, for instance, α = νp , eqs. (14.52) yield ζ¯1 = −/(4v0 ) and ζ¯2 = 0, where coefſcient is given by eq. (14.50). Equations (14.51) then gives the motion tensor and its inverse. Next, eqs. (14.61) and (14.62) yield σ ¯0 = /(4v03 ), σ ¯2 = ∗ 3 −/(16v0 ), χ ¯0 = /(4v0 ), and χ ¯2 = 0; tensors H, H and their inverses then follow from eqs. (14.60). Finally, the multiplicative decomposition of the motion tensor given by eq. (14.53) is applicable to this parameterization using ε¯ = /(4v03 ). 14.5.4 The Cayley-Gibbs-Rodrigues motion parameters The Cayley-Gibbs-Rodrigues rotation parameters, r, are associated with the generating function p(φ) = 2 tan φ/2 and the properties of this parameterization are detailed in section 13.7.3. The corresponding Cayley-Gibbs-Rodrigues motion parameters are deſned as Du P= . (14.70) r
564
14 Parameterization of motion
The relevant coefſcients for this parameterization, ζ1 , ζ2 , σ0 , σ1 , σ2 , χ0 , χ1 , and χ2 , are given in section 13.7.3. Parameters δ0 and δ2 , and ϕ0 , ϕ1 , and ϕ2 , are given by eqs. (14.39) and (14.42), respectively. Selecting, for instance, α = νp , eqs. (14.52) yield ζ¯1 = −r02 /2 and ζ¯2 = −r02 /4, where coefſcient is given by eq. (14.50). Equations (14.51) then gives the motion tensor and its inverse. Next, eqs. (14.61) and (14.62) yield σ ¯0 = −r02 /2, σ ¯2 = 0, χ ¯0 = /2, and χ ¯2 = 0; tensors H, H∗ and their inverses then follow from eqs. (14.60). Finally, the multiplicative decomposition of the motion tensor given by eq. (14.53) is applicable to this parameterization using ε¯ = 0. 14.5.5 The Wiener-Milenkovi´c motion parameters The Wiener-Milenkovi´c rotation parameters, c, are associated with the generating function p(φ) = 4 tan φ/4 and the properties of this parameterization are detailed in section 13.7.4. The corresponding Cayley-Gibbs-Rodrigues motion parameters are deſned as Du P= , (14.71) c The Wiener-Milenkovi´c parameterization of rotation is singularity free for rotations of arbitrary magnitude when using the rescaling technique. Consequently, the Wiener-Milenkovi´c motion parameterization is singularity free for displacements and rotations of arbitrary magnitude provided that the rescaling operation is applied to the rotation parameter vector. The relevant coefſcients for this parameterization, ζ1 , ζ2 , σ0 , σ1 , σ2 , χ0 , χ1 , and χ2 , are given in section 13.7.4. Parameters δ0 and δ2 , and ϕ0 , ϕ1 , and ϕ2 , are given by eqs. (14.39) and (14.42), respectively. Selecting, for instance, α = νp , eqs. (14.52) yield ζ¯1 = ν 2 (1 − 4ν)/8 and ζ¯2 = −ν 3 /8, where coefſcient is given by eq. (14.50). Equations (14.51) then gives the motion tensor and its inverse. Next, eqs. (14.61) and (14.62) yield σ ¯0 = −ν 2 /8, ∗ 3 σ ¯2 = −ν /32, χ ¯0 = /8, and χ ¯2 = 0; tensors H, H and their inverses then follow from eqs. (14.60). Finally, the multiplicative decomposition of the motion tensor given by eq. (14.53) is applicable to this parameterization using ε¯ = ε2 /8. Using eq. (5.35), the motion tensor can also be expressed as G 0 I ν q G 0 R Gν qG = . (14.72) C= 0 R 0 G 0 I 0 G where the second equality follows from eq. (4.19). This factorization affords the following geometric interpretation: the motion is decomposed into the half-angle rotation characterized by the rotation tensor G, followed by a translation of magnitude νq, and ſnally a half-angle rotation. 14.5.6 Problems Problem 14.4. Motion tensor for the Cayley-Gibbs-Rodrigues parameters Show that the motion tensor and its inverse, expressed in terms of Cayley-Gibbs-Rodrigues motion parameters, can be written as
14.5 Speciſc parameterizations of motion ⎡ ⎤ R+I R+I RT + I T RT + I T q C = ⎣R 2 q 2 ⎦ , C −1 = ⎣R 2 2 ⎦. 0 R 0 RT ⎡
565
⎤
Problem 14.5. Prove relationship Prove the following relationship 1 = (C + I)−1 (C − I) = (C − I)(C + I)−1 . ε, ε)P Z(¯ 2
Problem 14.6. Prove the properties of the tangent tensor Prove properties (14.58c) of the tangent tensor.
Problem 14.7. Prove the properties of the tangent tensor Prove properties (14.58d) of the tangent tensor.
Problem 14.8. Prove the tensorial nature of the tangent tensor Prove property (14.59) of the tangent tensor.
Problem 14.9. The half-motion tensor The half-rotation tensor G deſned by eq. (13.49), play an important role in the vectorial parameterization of rotation. Find the half-motion tensor, G, such that C = G G.
Part VI
Flexible multibody dynamics
15 Flexible multibody systems: preliminaries
Multibody systems are characterized by two distinguishing features: system components undergo ſnite relative rotations and these components are connected by mechanical joints that impose restrictions on their relative motion. Broadly speaking, multibody systems can be divided into three categories, rigid multibody systems, linearly elastic multibody systems, and nonlinearly elastic multibody systems. This classiſcation and its implication on modeling techniques for multibody systems are discussed in section 15.1. Section 15.2 presents a review of the basic equations of three-dimensional, linear elastodynamics. Geometrically nonlinear problems are characterized by nonlinear strain-displacement relationships, which are the subject of section 15.3. In section 15.5, special attention is devoted to the formulation of problems where structures undergo arbitrarily large displacements and rotations although strain components are assumed to remain small at all points of the structure.
15.1 Classiſcation of multibody systems Multibody systems can be divided into three categories, rigid multibody systems, linearly elastic multibody systems, and nonlinearly elastic multibody systems. Systems of the ſrst category involves rigid bodies only, but those of the latter two categories comprise both rigid and ƀexible bodies. Section 12.5.1 introduced the concept of ƀoating frame of reference in which the total motion of ƀexible bodies is broken into two parts: rigid body motions represented by the motion of the ƀoating frame of reference and superimposed elastic motions. By deſnition, rigid body motions generate no strains. The elastic motions typically consist of displacement and rotation ſelds, which generate an associated strain ſeld, denoted . For rigid multibody system, the strain ſeld vanishes in all bodies, i.e., = 0 in each body. The distinction between linearly and nonlinearly elastic multibody systems stems from the characteristics of the strain ſeld. The characteristics of the three types of multibody systems are as follows.
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_15 © Springer Science+Business Media B.V. 2011
570
15 Flexible multibody systems: preliminaries
1. Rigid multibody systems consist of an assemblage of rigid bodies connected together through mechanical joints and in arbitrary motion with respect to each other. Although all bodies are rigid, i.e., = 0 in each body, lumped elastic components, also called ƀexible joints, bushing elements or force elements, could be placed between two components of the system to represent localized elasticity. These ƀexible joints exhibit arbitrary constitutive behavior. 2. Linearly elastic multibody systems consist of an assemblage of both elastic and rigid bodies connected together through mechanical joints and in arbitrary motion with respect to each other. For linearly elastic multibody systems, it is assumed that the strain-displacement relationships remain linear and that strains components remain very small at all times, i.e., 1 for all elastic bodies. Efſcient analysis techniques for this type of problems typically rely on modal expansions of the elastic displacement ſeld. 3. Nonlinearly elastic multibody systems consist of an assemblage of both elastic and rigid bodies connected together through mechanical joints and in arbitrary motion with respect to each other. For the elastic bodies, the strain-displacement relationships become nonlinear, or the strain components become large, or both. Nonlinear strain-displacement relationships characterize geometrically nonlinear problems, i.e., problems involving large elastic displacements, or rotations, or both. When strain components become large, nonlinear material constitutive laws must be used, a characteristic of materially nonlinear problems. For nonlinearly elastic multibody systems the accuracy and reliability of modal expansion of the elastic displacement ſeld become questionable. Because the overall motions of all bodies of a multibody system are large and because the relative motions between the system’s various components are also large, multibody system dynamics is an inherently nonlinear problem. The qualiſers “linearly” and “nonlinearly elastic” used in the classiſcation above speciſcally refer to the elastic behavior of the bodies. The modeling of linearly elastic multibody systems leads to nonlinear dynamical equations of motion, although the representation of the elastic behavior of the bodies could be largely linearized. 15.1.1 Linearly and nonlinearly elastic multibody systems The demarcation between linearly and nonlinearly elastic multibody systems is sometimes blurry. Consider, for instance, the problem of a helicopter rotor blade. As the blade rotates, elastic displacements and rotations remain very small, and the blade is designed to undergo small strains at all time to ensure safety of ƀight and guarantee structural fatigue life. This problem seems to fall into the category of linearly elastic multibody systems. Due to the high angular speed of the rotor, however, large centrifugal forces appear in the blade, leading to considerable centrifugal stiffening of the blade and nonlinear coupling between its two bending and torsional deformations. To accurately capture these effects, nonlinear strain-displacement relationships must be used, although linear constitutive laws adequately represent material behavior. These geo-
15.1 Classiſcation of multibody systems
571
metric nonlinearities squarely put the helicopter rotor blade problem into the category of nonlinearly elastic multibody systems. Because wind turbine blades rotate at a much lower angular speed and are far stiffer than helicopter rotor blades, assuming wind turbines to be linearly elastic multibody systems often leads to reliable predictions. On the other hand, efforts to design ever increasingly efſcient turbines call for ever increasing rotor diameters. It is possible that future generation turbines will become large enough to operate in the geometrically nonlinear regime, requiring the use of formulations capable of dealing with nonlinearly elastic multibody systems to predict accurately the dynamic response of such highly ƀexible machines. The distinction between linearly and nonlinearly elastic systems is further complicated by the fact that both linearly and nonlinearly elastic components could appear simultaneously in a given multibody system. For instance, the modeling of a complete helicopter in ƀight calls for the coupled simulation of the rotor and fuselage. As explained above, the rotor problem is inherently nonlinear, but it is reasonable to assume that the fuselage behaves in a linearly elastic manner, even during large angle maneuvers. For such problems, different formulations could be used to model the rotor and fuselage that reƀect the distinctly different behavior of these two components. Similar remarks could be made concerning wind turbines. Whereas the rotor blades could be treated as linearly or nonlinearly elastic bodies depending on the magnitude of the elastic motions, it seems reasonable to assume the supporting tower to behave in a linearly elastic manner. The remaining chapters of this book focus on nonlinearly elastic multibody systems. As discussed in section 12.5.1, one of the most common approaches to the modeling of linearly elastic multibody systems is based on the concept of ƀoating frames of reference [162] and the component mode synthesis based methods described in section 12.5.2 are then used to approximate the elastic displacement ſeld. For nonlinearly elastic multibody systems the accuracy and reliability of these approaches become questionable for the reasons detailed in the next section. 15.1.2 Shortcomings of modal analysis applied to nonlinear systems The natural vibration modes of a structure are the eigenfunctions of the system’s equations of motion linearized about one of its equilibrium conſgurations. Therefore, these modes characterize the dynamic behavior of small perturbations about an equilibrium conſguration of the system. Clearly, natural vibration modes are inherently linearized quantities that provide no information about the nonlinear behavior of the system. Consider a simple cantilevered beam under transverse, time-dependent loading. The beam’s mode shapes computed with respect to its unloaded conſguration include the familiar bending, axial, and torsional modes of vibration, which are derived in numerous structural vibration textbooks such as those of Timoshenko and Young [273] or Meirovitch [183]. If the beam is subjected to a tip transverse load that generates small transverse deƀections of the beam, a modal expansion in terms
572
15 Flexible multibody systems: preliminaries
of transverse bending modes will accurately predict the dynamic behavior of the system, even when using a small number of modes. Of course, resonance conditions should be avoided as large deƀections would result. Furthermore, all the modes associated with natural frequencies of magnitude comparable to that of the excitation frequency should be included in the modal expansion. Modal analysis is a natural approach to reducing the number of degrees of freedom involved in structural dynamics problems, and has the added advantage of involving degrees of freedom that have a direct physical meaning. Modal analysis can be viewed as a two step process. In the ſrst step of the process, a change of variables is performed by projecting the equations of motion expressed in term of physical variables to the modal domain. For a simple cantilevered beam problem, the physical variables would typically consist of the displacement and rotation components at discrete nodes along the beam. The modal variables are the amplitudes of excitation of the beam’s eigenmodes. If all the modes of the structure are used, this change of variable is a purely mathematical operation that involves no approximations. In the second step of the process, modal truncation is performed. Based on physical arguments, a small subset of all the eigenmodes of the structure is retained. For linear systems, modal truncation is a simple operation: modes associated with frequencies far higher than the excitation frequencies are simply eliminated from the modal basis because their contribution to structural response is negligible. This assumption yields the major advantages of modal analysis. First, a dramatic reduction in the number of degrees of freedom is achieved, leading to considerable computational savings. For complex structures, thousands or even hundreds of thousands of degrees of freedom might be involved in a detailed ſnite element model, whereas ten or ſfteen modes only could be sufſcient to capture accurately the overall dynamic behavior of the structure. Second, because high frequency modes have been eliminated, larger time step sizes can be used to integrate the system’s equations of motion, leading to additional computational savings. Finally, the modal degrees of freedom are easily interpreted in a physical manner. For instance, if a structure responds “mainly in its second bending mode,” or “primarily in its ſrst torsion mode,” it is easy to visualize the overall deformation of even very complex structure. Projection of the equations of motion into the modal domain is a purely mathematical step that does not involve any approximation. Modal truncation is an assumption. Indeed, when applied to linear systems, the projection of the equations of motion to the modal space decouples the governing equations. In the modal space, even the most complex structures can be viewed as a superposition of linear, single degree of freedom oscillators. The elimination of speciſc modes to obtain a reduced modal basis is based on the well known properties of these oscillators. When applied to nonlinear structures, the ſrst step of the modal analysis process is also the projection of the equations of motion onto the modal domain. As was the case for linear structures, no assumption is involved here. Modal truncation, however is fraught with difſculties. For nonlinear problems, the equations of motion projected in the modal space are still nonlinear and in general, do not decouple. The
15.1 Classiſcation of multibody systems
573
nonlinear system cannot be viewed as the superposition of linear, single degree of freedom oscillators, nor can it be viewed as the superposition of nonlinear oscillators. Consequently, the physical arguments invoked to eliminate speciſc modes from the modal basis no longer apply, or become more tenuous. To illustrate the problems encountered p2(x1, t) i2 by modal analysis applied to nonlinear T i1 structures, consider the simply supported O Case (a) beam depicted in ſg. 15.1. The beam is subp2(x1, t) i2 jected to time-dependent loading, p2 (x1 , t), T i1 and features a large axial stiffness. For case O Case (b) (a), the end points of the beam cannot move axially. In contrast, axial displacements are Fig. 15.1. Simply supported beam with allowed at point T for case (b). For case (a), different end conditions. large transverse displacements of the beam generate axial deformations, which in turn, cause large axial forces to appear due to the beam’s high axial stiffness. As the magnitude of the transverse displacements increases, a considerable stiffening of the system is observed, leading to pronounced nonlinear behavior. This phenomenon is much less severe for case (b) because the beam is free to move axially at point T. To simplify the discussion, the problems depicted in ſg. 15.1 are limited to the planar case where all displacements take place in plane (¯ı1 , ¯ı2 ). The beam’s natural vibration modes about its unloaded conſguration are easily obtained and consist of transverse bending and axial modes. Because they are linearized quantities, the bending modes involve transverse displacement components only. Note that the beam’s bending modes are identical for cases (a) and (b), although its axial modes differ. If small, time-dependent transverse loads are applied to the beam, modal analysis using a modal basis consisting of a few bending modes yields accurate predictions of the dynamic response of the system. In the linear range, axial modes are not excited and need not be present in the modal basis. The solutions for cases (a) and (b) are identical in the linear range, a feature that is correctly reproduced by the modal solution because the axial displacement boundary condition is not reƀected in the modal basis. The modal basis does not “feel” the difference between cases (a) and (b). Next, larger transverse loads are applied to the beam, which now responds in the nonlinear range. If bending modes only are used in the modal approximation, the beam’s dynamic response is no longer predicted accurately. The situation is somewhat improved by adding axial vibration modes, but a large number of these modes is required to obtained a good solution. The reason for this behavior is twofold. First, because the beam’s axial displacement ſeld is not captured accurately by the modal approximation, errors are to be expect in the estimation of the axial strain ſeld. Due to the beam’s large axial stiffness, small errors in the axial strain ſeld lead to a grossly erroneous axial force ſeld and the nonlinear stiffening effects it induces are poorly captured by the modal analysis. Clearly, the foreshortening of the blade, an inherent part of its nonlinear response, must be modeled precisely to predict accurately the beam’s nonlinear response.
574
15 Flexible multibody systems: preliminaries
Second, the blade’s axial displacement ſeld is primarily due to foreshortening (a purely kinematic, nonlinear phenomenon), whereas axial vibration modes characterize true axial vibration (a purely vibratory, linear phenomenon). In other words, modal analysis attempts to “synthesize” a nonlinear kinematic mode shape as a superposition of linear vibratory modes. Because these two phenomena are not physically related, accurate predictions should hardly be expect from this superposition. Thus far, the discussion has focussed on nonlinear foreshortening effects in a simple planar problem. The above arguments, however, also apply to other kinematic nonlinearities found in three-dimensional problems. For instance, transverse deƀections due to transverse loads applied to the beam in two orthogonal directions create a torsional moment, thereby coupling bending and torsion responses. When applied to nonlinear problems, convergence and accuracy of modal analysis are not guaranteed. To improve the situation, it seems natural to investigate the selection of alternative mode shapes that contain information about the nonlinear behavior of the structure. Several concepts have been proposed to improve the quality of the modal basis when dealing with nonlinear problems. The conceptually simplest method it to recalculate a new set of natural vibration modes once the deformations of the blade become signiſcant [274]. Due to the large relative motions between the components of multibody systems, the equilibrium conſguration of the system is time-dependent, and hence, the modal basis is itself time-dependent. Although this approach might give good results, it does so at a tremendous computational costs, because the modal basis must now be updated during the response calculation, and the modal reduction scheme must be repeated at each update. Another approach is to include in the modal basis natural vibration modes about different equilibrium conſgurations of the structure. This method is attractive because the evaluation of the various equilibrium conſgurations and associated modal bases only require a modest increase in computational costs. In some cases, this method appears to give accurate results, as documented by Nickell [274]. The concept of perturbation modes was introduced by Thompson and Walker [275] to study the nonlinear behavior of beam structures. The authors demonstrated improved accuracy compared with classical modal analysis based on natural vibration modes. The same concept was later reſned by Noor et al. [276, 277] for the nonlinear static analysis of beam and shell structures in conjunction with the ſnite element method. Bauchau and Guernsey [278] also used these concepts for the analysis of helicopter rotors. Mixed ſnite element formulations have also been developed to improve the accuracy and reliability of modal methods. Ruzicka and Hodges [279, 280] have demonstrated applications of this method to rotorcraft problems. While the various approaches described in the previous paragraphs are capable of extending the range of validity of modal methods, they also present many drawbacks. First, they require extensive numerical developments and more often that not, the simplicity of modal analysis is lost. Second, they are based of formulations that are not widely available. For instance, commercial ſnite element codes are rarely based on mixed formulations. Finally, different approaches are required to deal with
15.1 Classiſcation of multibody systems
575
different problem, i.e., these methods are not general purpose methods that can be used reliably for general multibody systems. 15.1.3 Finite element based modeling of ƀexible multibody systems Multibody dynamics analysis was originally developed as a tool for modeling rigid multibody systems with simple tree-like topologies, but has considerably evolved to the point where it can handle linearly and nonlinearly elastic multibody systems with arbitrary topologies. It is now used widely as a fundamental design tool in many areas of engineering. In the automotive industry, for instance, multibody dynamics analysis is used routinely for optimizing vehicle ride qualities, a complex multidisciplinary task that involves the simulation of many different sub-components. Modern multibody codes can deal with complex mechanisms of arbitrary topologies including sensors, actuators, and controls, are interfaced with CAD solid modeling programs that allow to import directly the problem geometry, and have sophisticated graphics, animation, and post-processing features [38, 39]. The success of multibody dynamics analysis tools stems from their versatility: a given mechanism can be modeled by an idealization process that identiſes the mechanical components from within a large library of elements implemented in the code. Each element provides a basic functional building block, for example, a rigid or ƀexible member, a revolute joint, or a motor. Assembling the various elements, it is then possible to construct a mathematical description of the mechanism with the required level of accuracy. The modeling of linearly elastic multibody systems relies predominantly on the ƀoating frame of reference approach discussed in section 12.5.1, in which the elastic displacement ſeld is approximated using modal expansion techniques. In the last two decades, new approaches have emerged that bypass the need for the ƀoating frame of reference and eliminate modal expansions. These approaches are closely related to the ſnite element method which they effectively generalize to enable the treatment of multibody systems. In the ſnite element method, the solution domain is ſrst divided into a ſnite number of sub-domains called ſnite elements. Within each element, the solution is then approximated by a small number of continuous functions, based on the value of these functions at discrete points, often called nodes, associated with the element. The main advantage of this two-step approximation process is that many aspects of the solution procedure can be carried out at the element level, i.e., by considering one single element at a time, independently of all others. The continuity of the solution across elements is guaranteed by the fact that neighboring elements share common nodes, i.e., share common degrees of freedom. This aspect of the formulation is key to ensuring the continuity of the displacement ſeld over the entire system, an indispensable requirement for displacement based ſnite element formulations. Consider a beam connected at one of its end points to a revolute joint. Complete deſnition of the beam requires geometric data: typically, a local frame is used to
576
15 Flexible multibody systems: preliminaries
deſne the beam’s cross-sectional plane and its physical mass and stiffness properties are given with respect to this local frame. Similarly, deſnition of the revolute joint also requires geometric data: typically, a local frame is used to deſne the plane of the joint and the unit vector about which relative rotation is allowed, see ſg. 10.14. These local frames are independent of each other and are used solely by the elements for which they are deſned. In contrast, the components of the displacement and rotation vectors at the connection point between the beam and revolute joint must be uniquely deſned. If the displacement components are resolved in the local frame of the beam element, the revolute joint will not be able to interpret these components properly because they are resolved in a local frame whose orientation it does not know. Vice versa, if the displacement components are resolved in the local frame of the revolute joint, the beam will not be able to interpret these components adequately because they are resolved in a local frame whose orientation it does not know. To resolve this conƀict, all nodal displacement and rotation components must be deſned in a common frame, which is conveniently selected to be the inertial frame of reference. Because the components of multibody systems typically undergo large displacements and rotations, these inertial, or absolute, displacement and rotation components must be treated rigorously as large displacement and rotation components. Consequently, all the elements of the multibody system must be able to handle arbitrarily large displacements and rotations exactly. This is why these elements are sometimes called “geometrically exact elements,” although the term “kinematically exact elements” would be more appropriate. Finite element based modeling of ƀexible multibody systems makes use of a library of elements consisting of structural elements and joint elements. Structural elements, such as cables, membranes, beams, plates, shells, or three-dimensional elements are similar to the corresponding elements found in all ſnite element packages. The geometrically exact formulations of cables, beams, and plates and shells are presented in sections 16.2, 16.3, and 16.4, respectively. The dynamic equations of equilibrium of these elements are written in an absolute Cartesian frame. Joint elements characterize multibody systems and are absent from most ſnite element codes. In typical multibody formulations, joints are modeled as idealized components, i.e., joints are not modeled per se. Rather, the effects of joints are represented by the kinematic constraints they impose on the components they are connected to. For instance, section 10.6 details the constraints associated with the lower pair joints, which are enforced via Lagrange’s multiplier technique. The assembly of the equations of motion of both structural and joint elements leads to systems of equations that are highly sparse, although not of minimal size, a characteristic of the approach pioneered by Orlandea et al. [77]. Because it is an extension of the ſnite element method to multibody systems, algorithms such as sparse solvers, assembly procedures, and data structures developed for ſnite element analysis are directly applicable to ſnite element based modeling of ƀexible multibody systems. This approach can readily treat conſgurations of arbitrarily complex topologies through the assembly of basic components chosen from an extensive library of struc-
15.1 Classiſcation of multibody systems
577
tural and joint elements. In fact, this concept is at the heart of the ſnite element method which has enjoyed, for this very reason, an explosive growth in the last few decades. This analysis approach leads to new comprehensive simulation tools that are modular and expandable. Modularity implies that all the basic building blocks can be validated independently, easing the more challenging task of validating complete simulation procedures. Because they are applicable to conſgurations with arbitrary topologies, including those not yet foreseen, such simulation tools will enjoy a longer life span, a critical requirement for any complex software tool. Example 15.1. Modeling helicopter rotors with ƀexible multibody dynamics Historically, the classical approach to rotor dynamics modeling has relied on modal reduction approaches, as pioneered by Houbolt and Brooks [281]. Typical models were limited to single articulated blades connected to an inertial point and the control chain was ignored. The blade’s equations of motion were written in the rotating system, and ordering schemes were used to decrease the number of nonlinear terms appearing in the modal expansion [282]. In time, more detailed rotor models were developed to improve accuracy and to account for various design complexities such as gimbal mounts, swashplates, or bearingless root retention beams, among many others. The relevant equations of motion were derived for the speciſc conſgurations at hand. In fact, the various codes developed in-house by rotorcraft manufacturers are geared towards the modeling of the speciſc conſguration they produce. This approach severely limits the generality and ƀexibility of the resulting codes. In recent years, a number of new rotorcraft conſgurations have been proposed: bearingless rotors with redundant load paths, tilt rotors, co-axial rotors, or variable diameter tilt rotors, to name just a few. Developing a new simulation tool for each novel conſguration is a daunting task, and software validation is an even more difſcult issue. Furthermore, the requirement for ever more accurate predictions calls for increasingly detailed and comprehensive models. For instance, modeling the interaction of the rotor with a ƀexible fuselage or with the control chain must be considered to capture speciſc phenomena or instabilities. The ſnite element based ƀexible multibody dynamics formulation outlined above appears to be readily applicable to the rotorcraft dynamics analysis, because a rotorcraft system can be viewed as a complex ƀexible mechanism. It is now becoming the industry norm for this complex, nonlinear problem. Figure 15.2 depicts the conceptual representation of a rotorcraft system as a ƀexible multibody system. The various mechanical components of the system are associated with elements found in the library of typical multibody analysis tools. The ſgure shows a classical conſguration for the control chain, consisting of a swashplate with rotating and non-rotating components. The lower swashplate motion is controlled by actuators that provide the vertical and angular control inputs. The upper swashplate is connected to the rotor shaft through a scissors-like mechanism and controls the blade pitching motions through pitch-links. This control linkage conſguration can be modeled using the following elements: rigid bodies, used to model the non-rotating and rotating swashplate components and
578
15 Flexible multibody systems: preliminaries
Bearingless blade
} }
Articulated blade
Flexbeam Snubber
Flap,lag, and pitch hinges
Blade Torsion cuff Pitch-link
Hub Scissors
Actuators
Blade
Pitch-horn Pitch-link Swash plate: rotating non-rotating
Structural elements Rigid body Beam Flexible joint Ground clamp
Constraint elements Spherical joint Universal joint Revolute joint Sliding joint
Shaft
Fig. 15.2. Detailed multibody representation of a rotor system.
scissors links, and beams for modeling the ƀexible shaft and pitch-link. These bodies are connected through mechanical joints. For instance, a revolute joint, described in section 10.6.1, connects the rotating and non-rotating swashplates, allowing the former to rotate at the shaft angular velocity while the latter is non-rotating. Revolute joints also connect the scissors links to each other and to the upper swashplate, thereby synchronizing the angular speeds of the shaft and upper swashplate. Other types of joints are required for the model. For instance, the non-rotating swashplate is allowed to tilt with respect to an element that slides along the shaft, but does not rotate about the shaft direction. The universal joint, described in section 10.7.1, serves this purpose. Similarly, the pitch-link is connected to the pitchhorn by means of a spherical joint, see section 10.6.6, that allows the connected components to be at an arbitrary orientation with respect to each other. The other end of the pitch-link is attached to the swashplate by means of a universal joint. Figure 15.2 also shows two different rotor conſgurations: a classical, fully articulated design on the right portion of the ſgure and a bearingless design on the left. The articulated blade is connected to the hub through three consecutive revolute joints, that allow out-of-plane, in-plane, and torsional motions of the blade. For rotorcraft, these joints are called the ƀap, lag, and pitch hinges, respectively. In some designs, these joints are collocated, while other designs call for offset distances between these joints. In the latter case, rigid or ƀexible bodies of ſnite dimensions would be used to connect the three joints. For bearingless designs, the blade connects to the hub through a ƀexible component, called the ƀexbeam. The control input coming from the pitch-link is transmitted to the blade via the torsion cuff, represented by a ƀexible beam. To ensure a purely
15.2 The elastodynamics problem
579
rotational motion of the torsion cuff, it is also connected to the ƀexbeam through a snubber, which is typically modeled as a ƀexible joint, see section 16.1. The bearingless design is a multiple load path conſguration: the ƀexbeam and torsion cuff are assembled in parallel and connected by a snubber. Because they eliminates the ƀap, lag, and pitch hinges characteristic of fully articulated designs, bearingless designs are mechanically simpler and more robust. On the other hand, the blade’s control motions are accommodated through ƀexible elements, which could be subjected to higher stresses than those observed in fully articulated design. When using the ſnite element based ƀexible multibody dynamics formulation outlined above, the two designs, fully articulated and bearingless, can be modeled by assembling different sets of elements from the multibody library of elements. There is no need to derive and validate two different sets of equations for the two conſgurations. The blade itself is modeled by an appropriate beam element that should account for shearing deformations and for all elastic couplings that arise from the use of composite materials [283]. Furthermore, the center of mass, center of tension, and shear center of the blade are at distinct geometric locations of the blade’s crosssection, further complicating the modeling task. Of course, the level of detail presented in ſg. 15.2 is not always needed: some or all of the control chain components could be omitted, and the blade could be represented by a rigid body rather than beam elements, if a crude model is desired.
15.2 The elastodynamics problem Figure 15.3 depicts an elastic body of arbitrary shape subjected to time-dependent surface tractions and body forces. Geometric boundary conditions consist of timedependent prescribed displacements at a point or over a portion of the body’s outer surface. The volume of the body is denoted V and its outer surface S. Unit vector n ¯ is the normal to its outer surface. The dynamic response of the system is studied between initial and a ſnal times, denoted ti and tf , respectively. The displacement ſeld at a point of the body is denoted u(x1 , x2 , x3 , t), where t denotes time and x1 , x2 , and x3 the Cartesian coordinates of a point of the body resolved in inertial frame F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. Over the outer surface of the body, displacements and surface tractions are denoted u ˆ (t) and ˆt(t), respectively. Over portion S1 of the body’s outer surface, the surface tractions are given, prescribed quantities; this includes the portion of the outer surface that is traction free, because vanishing surface tractions, ˆt(t) = 0, are prescribed over that portion of the outer surface. Over portion S2 of the body’s outer surface, the displacements are given, prescribed quantities. Surfaces S1 and S2 share no common point because displacements and tractions cannot be prescribed simultaneously at the same point, and hence, S = S1 +S2 . Over S1 , ˆt(t) represents the prescribed surface tractions, and u ˆ (t) the resulting displacements. Over S2 , u ˆ (t) represents the prescribed displacements, and ˆt(t) the resulting
580
15 Flexible multibody systems: preliminaries
traction, also called reaction forces. Reaction forces are those forces arising from the enforcement of the prescribed displacements. At the initial and ſnal times, the momenta and displacements are denoted pˆ and u ˆ , respectively. At these times, displacements could be given, prescribed quantities, and simultaneously, momenta could also be given, prescribed quantities. Note that at the initial and ſnal times, both displacements and momenta can be prescribed simultaneously. For instance, in an initial value problem, both initial displacements and momenta are prescribed at the initial time, and the values of both quantities at the ſnal time will result from the analysis. Time-dependent body forces, b(t), Prescribed might also be applied over the entire tractions volume of the body. Gravity forces are a ^ t(t) S1 typical example of body forces, but such n . forces can also arise from electric or magb(t) - p netic ſelds. In dynamic problems, inertial V ^ u(t) forces can be considered to be externally i2 applied body forces, in accordance with d’Alembert’s principle. The momentum S2 I Prescribed vector for a differential element of the O displacements i3 body is p = ρv, where ρ is the material i1 mass density and v the element’s inertial Fig. 15.3. General elastodynamics prob- velocity vector. The inertial forces are then −p. ˙ lem. The basic equations of elastodynamics form a set of ſrst-order partial differential equations in space and time that can be solved to ſnd the displacement, velocity, strain, stress, and momentum ſelds at all points of the body and all instants in time. These equations will be reviewed in section 15.2.1 where several important deſnitions are also introduced. In the subsequent sections, a number of variational and energy principles are presented that provide an alternative formalism for the solution of elasticity problems. This formalism is the basis for powerful numerical techniques, such as the ſnite element method, that are routinely used to obtain approximate solutions to complex elastodynamics problems. 15.2.1 Review of the equations of linear elastodynamics As discussed in section 8.1, d’Alembert’s principle reduces dynamics problems to statics problems, provided that inertial forces are treated as externally applied forces. This implies that elastodynamics problems reduce to elasticity problems, provided that inertial forces are treated as externally applied forces. The equations presented in this section are, in fact, the general equations of elasticity [284, 285], in which the inertial forces taken into account as externally applied body forces. The equations of elastodynamics can be broken into three groups: (1) the equations of dynamic equilibrium, (2) the strain-displacement and velocity-displacement equations, and (3) the constitutive laws.
15.2 The elastodynamics problem
581
The equations of dynamic equilibrium The equations of dynamic equilibrium are the most fundamental equations of elasticity. They are derived from Newton’s ſrst law, see section 3.1.2, which states the conditions for static equilibrium of a differential element of the body. D’Alembert’s principle 12 then states the conditions for dynamic equilibrium of the differential element of the body, ∂σ1 ∂τ21 ∂τ31 + + + b1 − p˙ 1 = 0, ∂x1 ∂x2 ∂x3 ∂σ2 ∂τ32 ∂τ12 + + + b2 − p˙ 2 = 0, ∂x1 ∂x2 ∂x3 ∂τ13 ∂τ23 ∂σ3 + + + b3 − p˙ 3 = 0, ∂x1 ∂x2 ∂x3
(15.1a) (15.1b) (15.1c)
where σ1 , σ2 , and σ3 are the direct stress components and τ23 , τ13 , and τ12 the shear stress components. These are the components of the stress tensor [284, 285] resolved in basis I. The components of vectors were resolved
the bodyforce and momentum
in the inertial basis as bT = b1 , b2 , b3 and pT = p1 , p2 , p3 , respectively. Equations (15.1) are ſrst-order, partial differential equations in space and time must be satisſed at all points of the body and all instants in time. The surface equilibrium equations state that at all points on S and all instants in time, (15.2a)
t1 = n1 σ1 + n2 τ21 + n3 τ31 = tˆ1 , t2 = n1 τ12 + n2 σ2 + n3 τ32 = tˆ2 ,
(15.2b)
t3 = n1 τ13 + n2 τ23 + n3 σ3 = tˆ3 ,
(15.2c)
where the components of the unit vector normal to the outer surface of the body, the surface traction vector, and the prescribed surface traction vector, all resolved in
T basis I, are denoted nT = n1 , n2 , n3 , tT = t1 , t2 , t3 , and ˆt = tˆ1 , tˆ2 , tˆ3 , respectively. Over S1 , these conditions are also called the force, or natural boundary conditions. The stress array
σ T (t) = σ1 , σ2 , σ3 , τ23 , τ13 , τ12 , (15.3) will be used whenever that notation is convenient to represent the stress ſeld. Finally, additional conditions are imposed on the momentum vectors at the initial and ſnal times p(ti ) = ˆpi , p(tf ) = ˆ pf . (15.4) These conditions are called the boundary conditions in time. Deſnition 15.1 (Admissible stress and momentum ſelds). Stress ſelds, σ(x1 , x2 , x3 , t), and momentum ſelds, p(x1 , x2 , x3 , t), are said to be admissible if, at all times, they satisſes the dynamic equilibrium equations, eqs. (15.1), at all points in V, the surface equilibrium equations, eqs. (15.2), at all points on S, and the time boundary conditions, eqs. (15.4), at times ti and tf .
582
15 Flexible multibody systems: preliminaries
The strain-displacement and velocity-displacement relationships The strain-displacement equations merely deſne the strain components that are used for the characterization of the deformation of the body at a point. The straindisplacement relationships are derived from purely geometric considerations. Similarly, the velocity-displacement equations simply deſne the velocity components at a point of the body. When the displacements are small, deformations at a point of the body are conveniently measured by the engineering strain components [284, 285], deſned as, ∂u1 , ∂x1 ∂u2 ∂u3 = + , ∂x3 ∂x2
ε1 = γ23
∂u2 , ∂x2 ∂u1 ∂u3 = + , ∂x3 ∂x1
ε2 = γ13
∂u3 , ∂x3 ∂u1 ∂u2 = + , ∂x2 ∂x1
ε3 = γ12
(15.5a) (15.5b)
where ε1 , ε2 , and ε3 are the relative elongations or direct strain components of a material line and γ23 , γ13 , and γ12 the angular distortions or shear strain components of two material lines. To compute strain components, the displacement ſeld must be continuous and differentiable. Furthermore, over S and at the initial and ſnal times, the following displacement boundary conditions must be met u(t) = u ˆ (t).
(15.6)
Over S2 , these conditions are called the geometric boundary conditions. The strain array
εT (t) = ε1 , ε2 , ε3 , γ23 , γ13 , γ12 , (15.7) will be used whenever that notation is convenient to represent the strain ſeld. The components of the velocity vector v(t) are the time derivatives of the displacement vector, v(t) = u. ˙ (15.8) Deſnition 15.2 (Kinematically admissible displacement ſeld). A displacement ſeld, u(x1 , x2 , x3 , t), is said to be kinematically admissible if, at all time, it is continuous and differentiable at all points in V and satisſes the displacement boundary conditions, eqs. (15.6), at all points on S and the initial and ſnal times. Deſnition 15.3 (Compatible strain ſeld). A strain ſeld, ε(x1 , x2 , x3 , t), is said to be compatible if, at all times, it is derived from a kinematically admissible displacement ſeld through the strain-displacement relationships, eqs. (15.5). Deſnition 15.4 (Compatible velocity ſeld). A velocity ſeld, v(x1 , x2 , x3 , t), is said to be compatible if, at all times, it is derived from a kinematically admissible displacement ſeld through the velocity-displacement relationships, eqs. (15.8).
15.2 The elastodynamics problem
583
The constitutive laws The constitutive laws relate the stress and strain ſelds. They consist of a mathematical idealization of the experimentally observed behavior of materials. Hooke’s law is commonly used to model the behavior of homogeneous, isotropic, linearly elastic materials operating in the small strain regime. Many materials may present one or more of the following features: anisotropy, plasticity, visco-elasticity, or creep, to name just a few commonly observed behaviors of engineering materials. A second set of constitutive laws relates the momentum vector to the velocity vector. The stress and strain ſelds must satisfy the constitutive laws at all points in V. For small strains, Hooke’s law [284, 285] represents the behavior of homogeneous, isotropic, linearly elastic materials in a approximate manner by the following linear relationship σ = C ε, ε = S σ, (15.9) where C is a symmetric, positive-deſnite material stiffness matrix, and S = C −1 a symmetric, positive-deſnite material compliance matrix. A constitutive law is also required for the momentum ſeld. This law is, in fact, the deſnition of the momentum vector p = ρ v,
(15.10)
where ρ is the material mass density. Summary Complete solutions of elastodynamics problems involves the following ſelds. 1. Admissible stress and momentum ſelds, see deſnition 15.1. 2. Kinematically admissible displacement ſelds, see deſnition 15.2, and associated compatible strain and velocity ſelds, see deſnition 15.3 and 15.4, respectively. 3. Stress and momentum ſelds that satisfy the constitutive laws, eqs. (15.9) and (15.10), respectively, at all points in V. All these equations must be satisſed at all instants in time. 15.2.2 The principle of virtual work Consider an elastic body that is in dynamic equilibrium under applied body forces and surface tractions. This implies that the stress and momentum ſelds are admissible, see deſnition 15.1. The following statement is now constructed
584
15 Flexible multibody systems: preliminaries
%
tf
%
∂σ1 ∂τ21 ∂τ31 + + + b1 − p˙ 1 ) δu1 ∂x1 ∂x2 ∂x3 V ∂τ12 ∂σ2 ∂τ32 +( + + + b2 − p˙2 ) δu2 ∂x1 ∂x2 ∂x3 ∂τ13 ∂τ23 ∂σ3 +( + + + b3 − p˙ 3 ) δu3 dVdt ∂x1 ∂x2 ∂x3 % % tf T ˆ (t − t) δu dSdt + (p − ˆp)T δu dV = 0. (
ti
% −
tf
S
ti
V
(15.11)
ti
This statement was constructed in the following manner. Each of the three dynamic equilibrium equations, eqs. (15.1), was multiplied by an arbitrary, virtual change in displacement, then integrated over the range of validity of the equation. Similarly, each of the three surface equilibrium equations, eqs. (15.2), was multiplied by an arbitrary, virtual change in displacement, then integrated over the range of validity of the equation. Finally, each of the three boundary conditions in time, eqs. (15.4), was multiplied by an arbitrary, virtual change in displacement, then integrated over the range of validity of the equation. Because the stress and momentum ſelds are admissible, each term in parenthesis is zero, and multiplication by an arbitrary quantity results in a zero product. Each of the three integrals then vanishes, as does their sum. Next, integration by parts is performed. Using Green’s theorem [2], the ſrst term of the volume integral becomes % % % ∂σ1 ∂δu1 δu1 dV = − σ1 dV + n1 σ1 δu1 dS, ∂x1 V ∂x1 V S where n1 is the component of the outward unit normal along ¯ı1 , see ſg. 15.3. A similar operation is performed on each stress derivative terms appearing in eq. (15.11). For the momentum terms, integration by parts yields % tf % tf % tf tf T tf T T − p˙ δu dt = p δ u˙ dt − p δu t = pT δv dt − pT δu t . ti
i
ti
i
ti
Introducing the results of these integrations by parts into eq. (15.11) then yields % % % tf % T T T T ˆ b δu dV + σ δε + p δv dV + t δu dS dt − V
ti
% −
T
V
tf
ˆp δu dV V
S
(15.12)
= 0, ti
where the virtual, compatible strain ſeld was deſned as ∂δu1 ∂δu2 ∂δu3 , δε2 = , δε3 = , (15.13a) ∂x1 ∂x2 ∂x3 ∂δu2 ∂δu3 ∂δu1 ∂δu3 ∂δu1 ∂δu2 = + , δγ13 = + , δγ12 = + , (15.13b) ∂x3 ∂x2 ∂x3 ∂x1 ∂x2 ∂x1
δε1 = δγ23
15.2 The elastodynamics problem
585
and the virtual, compatible velocity ſeld as (15.14)
δv = δ u. ˙ Equation (15.12) can be restated as % tf % T σ δε − pT δv dVdt − %
ti tf
V
%
% T
+
b δu dV + ti
V
S
T
ˆt δu dS dt −
% T
p δu dV ˆ V
(15.15)
tf
= 0. ti
The ſrst term represents the virtual work done by the internal stresses and momenta and the remaining correspond to the virtual work done by the externally applied loads and momenta. It has been shown thus far that if the stress and momenta ſelds are admissible, eq. (15.15) must hold. It can also be shown that if this equation holds, the stress and momenta ſelds must be admissible. Indeed, eq. (15.15) implies eq. (15.12), which in turns implies eq. (15.11) by reversing the integration by parts process. Finally, the volume and surface equilibrium equations are recovered because eq. (15.11) must hold for all arbitrary, kinematically admissible virtual displacements ſelds. Statement (15.15) can thus be interpreted as follows. Principle 19 (Principle of virtual work) A body is in dynamic equilibrium if the sum of the internal and external virtual work vanishes for all arbitrary kinematically admissible virtual displacement ſelds and corresponding compatible strain and velocity ſelds. This principle is illustrated in ^t Space ſg. 15.4. The surface tractions act at the spatial boundaries of the problem Virtual work of internal and play a role similar to that of the mo^ ^ pi pf stresses and momenta menta at the temporal boundaries of the problem. In summary, the equations of dyTime ^t namic equilibrium, eqs. (15.1), (15.2), ti tf and (15.4), and the principle of virtual work are two equivalent statements. Fig. 15.4. Illustration of the principle of virtual Furthermore, because the equations of work. dynamic equilibrium are a statement of Newton’s second law, the principle of virtual work and Newton’s second law are two equivalent statements. Because the principle of virtual work is solely a statement of equilibrium, it is always true. However, for the solution of speciſc elastodynamics problems, it must be complemented with stress-strain and momentum-velocity relationships, and constitutive laws. More details about the principle of virtual work and its application to structures can be found in numerous textbooks such as [284, 286, 287, 285].
586
15 Flexible multibody systems: preliminaries
15.2.3 Hamilton’s principle If the internal forces in the solid are assumed to be conservative, they can be derived from a potential, as discussed in section 3.2. In this case, the internal forces are the components of stress, and the potential is the strain energy density function. If the stresses in the solid can be derived from a strain energy density function, a( ), σ=
∂a(ε) , ∂ε
(15.16)
the material is said to be an elastic material. Assuming the material to be elastic or assuming the existence of a strain energy density function are two equivalent assumptions. Linearly elastic materials are elastic materials for which the stress-strain relationship is linear. If the material is elastic, the work done by the internal stresses when the system is brought from one state of deformation to another depends only on the two states of deformations, but not on the speciſc path that the system followed from one deformation state to the other. This restricts the types of material constitutive laws that can be expressed in terms of a strain energy density function. For instance, if a material is deformed in the plastic range, the work of deformation will depend on the speciſc deformation history; hence, there exists no strain energy density function that describes material behavior when plastic deformations are involved. For instance, the strain energy density function of a linearly elastic material is a(ε) =
1 T ε C ε. 2
(15.17)
Introducing this function into eq. (15.16), yields σ = C ε, the constitutive law for a linearly elastic material. Consider a general elastic body that is in equilibrium under applied body forces and surface tractions, and therefore, the principle of virtual work, eq. (15.15), must apply. It is now assumed that the constitutive law for the material can be expressed in terms of a strain energy density function, eq. (15.16). The virtual work done by the internal stresses appears in the ſrst term of eq. (15.15), and it is readily evaluated as % % % % ∂a(ε) δεT σ dV = δεT dV = δa(u) dV = δ a(u) dV = δA(u), ∂ε V V V V where the chain rule for derivatives is used to obtain the second equality. ' The strain energy density and the total strain energy of the body, A = V a dV, must be expressed in terms of the displacement ſeld using the strain-displacement relationships because the principle of virtual work requires a compatible strain ſeld. In a similar manner, the use of the dynamic constitutive law, eq. (15.10), leads to % % % pT δv dV = ρv T δv dV = δk(v) dV = δK(v), V
V
V
where k(v) = 1/2 ρv v is the kinetic energy density function, and K(v) the total kinetic energy of the body. T
15.2 The elastodynamics problem
587
The principle of virtual work, eq. (15.15), now becomes % tf −δ (A − K) dt %
ti tf
%
%
+
b δu dV + V
ti
T
T
S
%
ˆt δu dS dt −
(15.18)
tf
T
p δu dV ˆ V
= 0. ti
The ſrst term on the second line of this statement represents the virtual work done by the externally applied loads, % % T δWext = bT δu dV + ˆt δu dS. (15.19) V
S
With this deſnition, the principle of virtual work, eq. (15.18), becomes % % tf % tf tf (A − K) dt + δWext dt − pT δu dV ˆ = 0. −δ ti
V
ti
(15.20)
ti
Next, the body forces and surface tractions are assumed to be conservative, i.e., they can be derived from a potential, b=−
∂φ ; ∂u
ˆt = − ∂ψ , ∂u
where φ is the potential of the body forces, and ψ the potential of the surface tracT tions. For instance, the potential of ſxed surface tractions is simply ψ = −ˆt u, or the potential of the body forces associated with a gravity ſeld, g, is φ = −ρ g T u. The two terms of the virtual work done by the external forces, eq. (15.19), now become % % % ∂φ T T b δu dV = − δu dV = −δ φ(u) dV, V V ∂u V % % % ∂ψ T ˆtT δu dS = − δu dS = −δ ψ(u) dS. S S ∂u S Combining these two loading terms then yields % % δWext = −δ φ(u) dV − δ ψ(u) dS = −δΦ(u), V
S
where Φ(u) is the total potential the externally applied loads. Introducing this result into eq. (15.20) leads to % % tf tf δ (A − K + Φ) dt + ˆpT δu dV = 0. (15.21) ti
V
ti
The Lagrangian of the system is now deſned as L = K(v) − A(u) − Φ(u),
(15.22)
588
15 Flexible multibody systems: preliminaries
and it follows that
%
%
tf
ˆT δu dV p
δL dt = ti
V
tf
(15.23) ti
Hamilton’s principle can be stated as Principle 20 (Hamilton’s principle) An elastic system is in dynamic equilibrium if and only if equation (15.23) holds for all arbitrary virtual displacements. Clearly, this statement generalizes the version of Hamilton’s principle derived in section 8.2 for systems of particles. If the momenta vanish at the initial and ſnal times, it simply becomes δL = 0. Many other variational principles exist in elasticity. Particularly noteworthy are the principle of complementary virtual work and the principle of minimum complementary energy [286, 287, 285]. Two or three ſeld principles can also be developed, such as Hellinger-Reissner’s and Hu-Washizu’s principles, respectively. All these principle can be extended to elastodynamics problem by invoking d’Alembert’s principle, inertial forces are included as externally applied forces.
15.3 Finite displacements kinematics for ƀexible bodies While ƀexible multibody systems are characterized by large relative motions at the joints, it is often the case that individual ƀexible bodies undergo small deformations. Conceptually, the displacement ſeld at a point of the ƀexible body can be decomposed into rigid body and elastic displacement ſeld [167], where the latter ſeld is responsible for the straining of the body whereas the former, by deſnition, generates no deformation. It is not uncommon for structures such as slender beams or thin plates and shells, to undergo large rigid body displacements and rotations while the strains remain small at all points of the structure. This behavior is often the consequence of careful planning: to avoid premature failure, the structure is designed to operate in the small strain regime. When a structure operates in the small strain regime, the three groups of equations of elastodynamics described in section 15.2.1 still apply, but must be updated to account for the large displacements. Of course, Newton’s law still applies, but in this case, the equilibrium conditions must be enforced on the deformed conſguration of the structure. The strain-displacement relationships now become nonlinear equations, rather than the linear relationships that characterize small displacement problems, eqs. (15.5). Finally, the constitutive laws remain unchanged, although the stress and strain components are now those resolved in the convected or material basis. This section presents a brief discussion of the state of deformation in the neighborhood of a material point in a ƀexible body. Two conſguration of this body will be deſned: a reference conſguration, and a deformed conſguration. The following notational convention will be used: lower-case symbols refer to quantities deſned
15.3 Finite displacements kinematics for ƀexible bodies
589
in the reference conſguration, and upper-case symbols refer to the corresponding quantities in the deformed conſguration. Material coordinates Figure 15.5 depicts a body in its reference and deformed conſgurations and an inertial frame, F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. This section focuses on the relationships between the deformed and reference conſguration of the solid without any consideration for the loads that create the deformation. Let point P be a material point of the Deformed Reference body, and the position vectors of this maconfiguration configuration terial point are denoted x and X, in the reference and deformed conſguration, reP P spectively. Each material particle of the body will be identiſed by a label consisti3 ing of a triplet of real numbers. This label will remain attached to the material particle throughout the deformation process. x(a , a , a ) X(a1, a2, a3) 1 2 3 I This label is called the material coordii2 O nates of material point P, and is denoted i1 (α1 , α2 , α3 ). The position vectors of point P in the Fig. 15.5. The reference and deformed reference and deformed conſgurations are conſgurations of a body.
x = x(α1 , α2 , α3 ), X = X(α1 , α2 , α3 ),
(15.24a) (15.24b)
respectively. Because the material coordinates are an identifying label for a material particle, they can be chosen arbitrarily. A convenient choice for the material coordinates consists of the components of the position vector resolved in basis I x(α1 , α2 , α3 ) = α1 ¯ı1 + α2 ¯ı2 + α3 ¯ı3 .
(15.25)
This particular choice of the material coordinates is called the Lagrangian representation. A material line is an ensemble of material particles forming a straight line in the reference conſguration of the body. For instance, ſg. 15.6 shows segments PR, PS, and PT of the reference conſguration, which are are material lines intersecting at point P. Due to the deformation of the body, all the material particles forming material line PR will move to segment PR in the deformed conſguration. Because segment PR is of differential length, it can be assumed to remain straight in the deformed conſguration. Base vectors and metric tensor The base vectors are vectors tangent to these material line
590
15 Flexible multibody systems: preliminaries
g¯i =
∂x = ¯ıi . ∂αi
(15.26)
In the reference conſguration, the base vectors are mutually orthogonal unit vectors.
T g3 P
Material line g2
g1 R x(a1, a2, a3) i1
Convected material line G3 T
S i3
P G1 R
G2
S
X(a1, a2, a3)
T T g3da3 G3da3 Q Q P S P g2da2 R R g1da1 G da 1
i2
I Material particle
i1
Fig. 15.6. The base vectors in the reference and deformed conſgurations.
1
i3
x
I
S G2da2
X
i2
Fig. 15.7. Volume elements in the reference and deformed conſgurations.
As the deformation takes place, the material lines are convected with the body. The convected material lines now describe curves in space intersecting at the new location of the particle. The base vectors in the deformed conſguration are deſned in a manner similar to those of the reference conſguration Gi =
∂X . ∂αi
(15.27)
These base vectors are not mutually orthogonal, nor are they unit vectors. To visualize this deformation, ſg. 15.7 shows the small rectangular parallelepiped PQRST of differential size dα1 by dα2 by dα3 cut in the neighborhood of point P. The reference conſguration is the conſguration of the solid in its undeformed state, and rectangular parallelepiped PQRST is spanned by vectors g¯1 dα1 , g¯2 dα2 , and g¯3 dα3 . Under the action of applied loads, the body deforms and assumes a new conſguration, called the deformed conſguration. All the material particles that formed the rectangular parallelepiped PQRST in the reference conſguration now form parallelepiped PQRST, which is spanned by vectors G1 dα1 , G2 dα2 , and G3 dα3 in the deformed conſguration. The state of strain at a point characterizes the deformation of the parallelepiped without any consideration for the loads that created the deformation. Increments in position vector are denoted dx and dX in the reference and deformed conſgurations, respectively, and are expressed as ∂x dαi = g¯i dαi , ∂αi ∂X dαi = Gi dαi , dX = ∂αi dx =
15.3 Finite displacements kinematics for ƀexible bodies
591
where summation is implied by the repeated indices. The lengths of these increments, denoted ds and dS in the reference and deformed conſgurations, respectively, are readily found as ds2 = dxT dx = g¯iT g¯j dαi dαj = gij dαi dαj , 2
T
dS = dX dX =
GTi Gj
dαi dαj = Gij dαi dαj .
(15.28a) (15.28b)
These relationships deſne the components of the metric tensors in the reference and deformed conſgurations, denoted gij and Gij , respectively, as gij = g¯iT g¯j , Gij =
GTi Gj .
(15.29a) (15.29b)
The symmetry of both tensors is apparent from these deſnitions. Displacement ſeld The difference between the positions of a material point in the deformed and reference conſgurations deſnes the displacement vector as u (α1 , α2 , α3 ) = X − x.
(15.30)
The displacement and position vectors in the deformed conſguration are now resolved along the base vectors of the reference conſguration as u = ui g¯i ,
(15.31a)
X = Xi g¯i .
(15.31b)
With these deſnitions, eq. (15.30) now becomes Xi = αi + ui .
(15.32)
The base vector in the deformed conſguration is related to the displacement ſeld Gi =
∂X = (δij + uj,i ) g¯j , ∂αi
(15.33)
where δij is Kronecker’s symbol deſned by eq. (1.14) and notation (·),i indicates a derivative with respect to material coordinate αi . Many different measures can be used to characterize the state of deformation at a point. Some measures are directly related to the physical concept of strain, i.e., a relative change in length, but are not necessarily of a tensorial nature. Some other measures, clearly related to the physical concept of strain can be shown to be tensors.
592
15 Flexible multibody systems: preliminaries
15.3.1 The engineering strain components The motion of segment PR from its reference to deformed conſguration depicted in ſg. 15.7 consists of two parts: a change in orientation and a change in length. Clearly, the change in length is a deformation or stretching of the material line. Similarly, segments PR and PS form a rectangle in the reference conſguration, but form a parallelogram in the deformed conſguration. The angular distortion of the rectangle into a parallelogram represents a deformation of the body. Stretching of a material line and angular distortion between two material lines will be selected as measures of the state of strain at a point. The stretching or relative elongations of material lines PR, PS, and PT will be denoted as ε1 , ε2 , and ε3 , respectively. The angular distortions between segments PS and PT, PR and PT, and PR and PS will be denoted γ23 , γ13 , and γ12 , respectively. The relative elongation, ε1 , of material line PR, see ſg. 15.7, is deſned as ε1 =
PRdef − PRref , PRref
(15.34)
where the subscripts (·)ref and (·)def are used to indicate the reference and deformed conſgurations, respectively. The relative elongation is a non-dimensional quantity. Similar deſnitions hold for ε2 and ε3 , the relative elongation of material lines PQ and PT. The angular distortion, γ23 , between two material lines PT and PS is deſned as the change of the initially right angle, γ23 = π/2 − ∠TPSdef , where notation ∠TPS is used to indicate the angle between segments PT and PS. This can also written as γ23 = arcsin
GT2 G3 . G2 G3
(15.35)
Angular distortion are non-dimensional quantities. Similar deſnitions hold for the angular distortion γ13 and γ12 of the angles between material lines PR and PT, and PS and PT, respectively. The engineering strain components do not form a secondorder tensor. They are often called physical strain components. 15.3.2 The deformation gradient tensor A widely used strain measure is the deformation gradient tensor deſned as Fij =
∂Xi . ∂αj
(15.36)
In the following sections, the index notation will be used to represent second-order tensors. For instance, the deformation gradient tensor is denoted Fij rather than the less explicit F = ∂X/∂α. Resolving the base vector in the deformed conſguration, eq. (15.27), along the reference frame yields
15.3 Finite displacements kinematics for ƀexible bodies
593
∂X = Fji g¯j . (15.37) ∂αi A scalar product of this relationship by g¯l yields an alternative deſnition of the deformation gradient tensor Fij = g¯iT Gj . (15.38) Gi =
With the help of the chain rule for derivatives, an explicit expression of the inverse of the deformation gradient tensor can be obtained Fij−1 =
∂αi . ∂Xj
(15.39)
Introducing the displacement ſeld, eq. (15.32) into eq. (15.36) yields the deformation gradient tensor in terms of the displacement vector components Fij = δij + ui,j .
(15.40)
15.3.3 The metric tensor Relationship (15.28b) shows that the metric tensor, Gij , is, in fact, a measure of the deformation. When used as strain measure, the metric tensor is also called the Green deformation tensor, or the Cauchy-Green deformation tensor. The metric tensor is clearly related to the engineering strain components. Indeed, eq. (15.34) implies ε1 =
g1 dα1 G1 dα1 − ¯ = G11 − 1. ¯ g1 dα1
(15.41)
Similar relations hold for ε2 and ε3 . The angular distortion, eq. (15.35) becomes γ23 = arcsin √
G23 . G22 G33
(15.42)
The inverse relationships are readily obtained as G11 = (1 + ε1 )2 ,
(15.43a)
G23 = (1 + ε2 )(1 + ε3 ) sin γ23 .
(15.43b)
The metric tensor in the deformed conſguration is closely related to the deformation gradient tensor. Introducing eq. (15.37) into the deſnition of the metric tensor, eq. (15.29b), yields Gij = Fki Fkj . (15.44) 15.3.4 The Green-Lagrange strain tensor A widely used strain measure is the Green-Lagrange strain tensor, deſned as eij =
1 (Gij − gij ). 2
(15.45)
594
15 Flexible multibody systems: preliminaries
It is also called the Lagrangian strain tensor, or the Green-Saint Venant strain tensor. The Green-Lagrange strain tensor is closely related to the metric tensor, and eqs. (15.43) reveal its connection to the engineering strain components, 1 1 e11 = (G11 − 1) = ε1 + ε21 , (15.46a) 2 2 1 1 e23 = (G23 − 0) = (1 + ε2 )(1 + ε3 ) sin γ23 . (15.46b) 2 2 If the deformation of the body is such that the strain components remain far smaller than unity, the above relations simplify to 1 e11 ≈ ε1 , e23 ≈ γ23 . (15.47) 2 The Green-Lagrange strain tensor is closely related to the deformation gradient tensor. Indeed, introducing eq. (15.44) into eq. (15.45) yields 1 eij = (Fki Fkj − gij ). (15.48) 2 The Green-Lagrange strain tensor is also closely related to the change in length of the increment of the position vector. Indeed, eqs. (15.28a) and (15.28b) yield 1 1 (dS 2 − ds2 ) = (Gij − gij ) dαi dαj = eij dαi dαj . (15.49) 2 2 Finally, the Green-Lagrange strain tensor is readily expressed in terms of the displacement components by introducing eq. (15.40) into eq. (15.48) to ſnd 1 eij = (ui,j + uj,i + uk,i uk,j ). (15.50) 2
15.4 Strain measures for various differential elements The previous section has focused on the state of strain at a point of a threedimensional solid. It is often useful, however, to characterize the straining of a differential line, surface, or volume element of the body. These issues are addressed in the following sections. 15.4.1 Stretch of a material line In the reference conſguration, the orientation of material line PQ is deſned by a unit vector, denoted n ¯ , deſned as PQ dαi n ¯= = g¯i = ni g¯i . (15.51) PQ ds The stretch, λ, of this material line is deſned as the ratio of the length of the differential elements in the reference and deformed conſgurations, given by eqs. (15.28a) and (15.28b), respectively, to ſnd λ2 = dS 2 /ds2 = Gij (dαi /ds)(dαj /ds) = Gij ni nj . The stretch of the line element is now (15.52) λ = Gij ni nj = Fki Fkj ni nj .
15.4 Strain measures for various differential elements
595
15.4.2 Angle between two material lines Consider two material lines deſned by unit vectors n ¯ and n ¯ with stretches λ and λ , respectively. The scalar product of the position increments corresponding to these material lines is dX T dX = dX dX cos θ, where θ is the angle between the two material lines in the deformed conſguration. Solving for this angle yields cos θ = (dX/dS)T (dX /dS ). Using the chain rule for derivatives, and introducing the deſnition, eq. (15.36), of the deformation gradient tensor yields cos θ = Fik Fil and ſnally cos θ =
dαk ds dαl ds , ds dS ds dS
Gij ni nj Fik Fil nk nl = . λλ λλ
(15.53)
15.4.3 Surface dilatation Consider now the area of the rectangle deſned by vectors g¯2 α2 and g¯3 α3 . The material particles forming that surface before deformation are located in the surface deſned by vectors G2 dα2 and G3 dα3 after deformation. The initial area da1 , is found from ſg. 15.8 as da1 = g2 g¯3 dα2 dα3 = dα2 dα3 .
(15.54)
The area in the deformed conſguration, dA1 , is similarly found 2 G dα2 dα3 = GT G T G 2 G dα2 dα3 = G22 G33 − G2 dα2 dα3 . dA1 = G 3 3 3 2 23 Clearly, quantity G22 G33 − G223 = −1 GG−1 11 , where Gij is the inverse of the metric tensor and G its determinant. Hence, dA1 = GG−1 (15.55) 11 da1 , where G = det(Gij ).
T g3da3 P R g1da1
(15.56)
T G3da3
da1 S g2da2
P i3
dA1
S
G2da2
G1da1 X R
x i1
I
i2
Similar developments yield expres- Fig. 15.8. Dilatation of a differential element sions for areas da2 and da3 , and dA2 of area. and dA3 for the reference and deformed conſgurations, respectively. Combining all results then yields dAi =
GG−1 ii dai ,
no sum on i.
(15.57)
The surface dilatation, Σi , is deſned as the relative change in area of a differential element in the deformed and reference conſgurations and is readily found from eq. (15.57) as dAi − dai Σi = = GG−1 no sum on i. (15.58) ii − 1, dai
596
15 Flexible multibody systems: preliminaries
15.4.4 Volume dilatation Figure 15.7 depicts the volume spanned by vectors g¯1 dα1 , g¯2 dα2 , and g¯3 dα3 . The particles contained in that volume before deformation are located in the volume deſned by vectors G1 dα1 , G2 dα2 , and G3 dα3 after deformation. The volume, dv, in the reference conſguration is found from eq. (1.32) as dv = g¯1T g2 g¯3 dα1 dα2 dα3 = dα1 dα2 dα3 .
(15.59)
The volume, dV , in the deformed conſguration is 2 G dα1 dα2 dα3 = det(Fij ) dα1 dα2 dα3 . dV = GT1 G 3
(15.60)
In view of eq. 15.44, det(Gij ) = det(Fki Fkj ), and hence det(Fij ) = det(Gij ) = √ G. The volumetric strain, or relative change in volume is now deſned as Δ=
dV − dv √ = G − 1. dv
(15.61)
15.4.5 Problems Problem 15.1. Deformed elastic body Figure 15.9 depicts an elastic body in its reference and deformed conſgurations. The displacement ſeld is given as u1 = α1 α2 /4 and u2 = −α1 α2 /8. (1) Evaluate the base vectors in the reference and deformed conſgurations. (2) Find the deformation gradient tensor. (3) Determine the metric tensors in the reference and deformed conſgurations. (4) Evaluate the Green-Lagrange strain tensor, and (5) the physical strain components. Consider point A (located at α1 = α2 = 1) and two material lines n ¯ 1 and n ¯ 2 parallel to ¯ı1 and ¯ı2 , respectively. ¯ 2 . (2) Determine the angle between the two (1) Find the stretch of the material lines n ¯ 1 and n material lines in the deformed conſguration, (3) the surface dilatation, Σ3 , at point A, and (4) the volumetric strain, Δ, at point A.
Reference configuration 1.0
i2
0.5
n2 A
2
n1
I
a
Deformed configuration i1
2 Fig. 15.9. Conſguration of the elastic body.
a i2
G2 X(a1,a2,a3)
G1 P g2 i1
I x(a1,a2,a3)
P
g1
Fig. 15.10. Conſguration of the cantilevered beam.
15.5 The formulation of small strain problems
597
Problem 15.2. Deformed cantilevered beam Figure 15.10 shows a cantilevered beam of length a and depth h in its reference and deformed conſgurations. The position vector in the reference conſguration is x = α1 ¯ı1 +α2 ¯ı2 +α3 ¯ı3 , and in the deformed conſguration α1 α1 X = (a − α2 ) sin (1 + Δ) ¯ı1 + a − (a − α2 ) cos (1 + Δ) ¯ı2 + α3 ¯ı3 , a a where Δ is a strain measure. (1) Find the base vectors in the reference and deformed conſgurations. (2) Determine the deformation gradient tensor, (3) the metric tensor, (4) the GreenLagrange strain tensor, (5) the physical strain components, and (6) the volume dilatation at points A (located at α1 = a/2, α2 = h/2, α3 = 0) and B (located at α1 = a/2, α2 = −h/2, α3 = 0). (7) Evaluate the surface dilatations Σ1 , Σ2 , and Σ3 at points A and B.
15.5 The formulation of small strain problems At the heart of the formulation of problems involving small strain is the decomposition of the deformation gradient tensor into rigid body motion and deformation. In section 15.5.1, the decomposition of the deformation gradient tensor is described. This leads to a modiſed principle of virtual work. The implications of the small strain assumption are then discussed in detail in section 15.5.2. 15.5.1 Decomposition of the deformation gradient tensor Fig. 15.11 depicts the base vectors at a material point of a deformable body in the reference and deformed conſgurations. The analysis of the metric tensor presented in section 15.3.3 demonstrates that the base vectors in the deformed conſguration do not form an orthogonal basis. Indeed, eqs. (15.43) show that these base vectors are not unit vectors, nor are they mutually orthogonal. Consider an orthonormal basis of arbitrary orientation denoted J = (¯ j1 , j¯2 , j¯3 ). The position vector of a material point in the deformed conſguration is now resolved in this basis as X = Xi∗ j¯i . (15.62) The base vectors in the deformed conſguration then become Gi =
∂Xj∗ ∂X = j¯j = Fˆji j¯j , ∂αi ∂αi
(15.63)
where the following modiſed deformation gradient tensor was deſned ∂Xi∗ . Fˆij = ∂αj
(15.64)
A scalar product of eq. (15.63) by j¯l yields an alternative deſnition of this tensor Fˆij = j¯Ti Gj .
(15.65)
598
15 Flexible multibody systems: preliminaries
The two deformation gradient tensors deſned in eqs. (15.38) and (15.65) can be related by equating the two expressions for the base vectors in the deformed conſguration, eqs. (15.37) and (15.63), to ſnd Gi = Fˆji j¯j = Fji g¯j . A scalar product of giT j¯k ) Fˆkj . Because the base this result by g¯l yields the desired relationship, Fij = (¯ vectors in the reference conſguration form the orthonormal basis I, see eq. (15.26), rotation tensor R brings basis I to basis J , and Rij = g¯iT j¯j . The relationship between the two deformation gradient tensors now simply becomes Fij = Rik Fˆkj .
(15.66)
This decomposition expresses the deformation gradient tensor as the product of rotation tensor R, deſning a rigid body rotation, by deformation gradient tensor Fˆ , deſning the deformation of the body at that point. g3da3
j3
G3da3
P
g2da2 i3
g1da1
j1
I i1
P
P j2 G1da1
G2da2 » j2
g2da2 P
i3
g1da1
i2
i1
Fig. 15.11. The base vectors in the reference and deformed conſgurations.
G3da3 » j3
g3da3
G2da2
i2 I
G1da1 » j1
Fig. 15.12. Deformation of a differential element.
15.5.2 The small strain assumption Consider the deformation of a differential element of the body depicted in ſg. 15.12. The norm of base vector G1 in the deformed conſguration was found in eq. (15.43a) to be closely related to the engineering strain component, ε1 , G1 2 = G11 = (1 + ε1 )2 = 1 + 2ε1 + ε21 .
(15.67)
Similarly, the angular distortion between G2 and G3 is closely related to the engineering strain component, γ23 , see eq. (15.43b), GT2 G3 = sin γ23 . G2 G3
(15.68)
In numerous applications, thin structures such as cables, membranes, beam, plates, and shells undergo ſnite displacements and rotations while strain components remain very small. The small strain assumption states that relative elongations and angular distortions are negligible compared to unity, i.e., |ε1 |, |ε2 |, |ε3 | 1,
|γ23 |, |γ13 |, |γ12 | 1.
(15.69)
15.5 The formulation of small strain problems
599
Introducing this approximation in eq. (15.67) and (15.68) yields GT2 G3 ≈ 0. G2 G3
G1 2 = G11 ≈ 1,
In other words, the base vectors in the deformed conſguration approximately deſne an orthonormal basis because each base vector is approximately of unit length, and they are nearly orthogonal to each other. For small strain problems, orthonormal basis J deſned in section 15.5.1 will be selected to be nearly coincident with the base vectors in the deformed conſguration, i.e., Gi ≈ j¯i .
(15.70)
Consequently, basis J is called the convected or material frame. Introducing this approximation in eq. (15.63) leads to Gi = Fˆji j¯j ≈ j¯i . A scalar product of this result by j¯p then yields Fˆij ≈ δij . Finally, using eq. (15.66) Fij ≈ Rij .
(15.71)
In other words, when the strain components are very small, the deformation gradient tensor is approximately equal to the ſnite rotation tensor that brings basis I to J . Under the assumption of small strains, it can be shown that the principle of virtual work becomes % tf % % tf % % ∗ij T T T ˆ τ δγij − p δv dVdt = b δu dV + t δu dS dt ti
V
ti
% −
V
pˆT δu dV V
S
tf
. ti
(15.72)
In this principle, the strain measures are deſned as γij =
1 ˆ (Fij + Fˆji ) − δij , 2
(15.73)
and the stress measures, τ ∗ij , form the convected Cauchy stress tensor, i.e., the components of the true stress tensor in basis J . This statement of the principle of virtual work will be used in subsequent sections to derive the governing equations of structures undergoing large displacements and rotations but small strains.
16 Formulation of ƀexible elements
This chapter deals with the formulations of ƀexible elements such as ƀexible joints, cables, beams, and plates and shells, which are presented in sections 16.1, 16.2, 16.3, and 16.4, respectively. In all cases, geometrically exact formulations are derived, i.e., the displacements and rotations of the elements are arbitrarily large, although strain components are assumed remain small, a feature that signiſcantly simpliſes the governing equations of motion of these structural components.
16.1 Formulation of ƀexible joints Flexible joints, sometimes called bushing elements or force elements, are found in all multibody dynamics codes. In their simplest form, ƀexible joints consist of sets of three linear and three torsional springs placed between two nodes of a multibody system. For inſnitesimal deformations, the selection of the lumped spring constants is an easy task, which can be based on a numerical simulation of the joint or on experimental measurements. If the joint undergoes ſnite deformations, identiſcation of its stiffness characteristics is not so simple, specially if the joint is itself a complex system. When ſnite deformations occur, the deſnition of deformation measures becomes a critical issue. Indeed, for ſnite deformation, the observed nonlinear behavior of materials is partly due to material characteristics, and partly due to kinematics. This section focuses on the determination of the proper ſnite deformation measures for elastic bodies of ſnite dimension. In contrast, classical strain measures, such as the Green-Lagrange strains presented in section 15.3.4, among many others, characterize ſnite deformations of inſnitesimal elements of a body. It is argued that proper ſnite deformation measures must be of a tensorial nature, i.e., must present speciſc invariance characteristics. This requirement is satisſed if and only if deformation measures are parallel to the eigenvector of the motion tensor. Anand [288, 289] has shown that the classical strain energy function for inſnitesimal isotropic elasticity is in good agreement with experiment for a wide class of materials for moderately large deformations, provided the inſnitesimal strain measure
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_16 © Springer Science+Business Media B.V. 2011
602
16 Formulation of ƀexible elements
used in the strain energy function is replaced by the Hencky or logarithmic measure of ſnite strain. This means that the behavior of materials for moderate deformations can be captured accurately using linear constitutive laws, but replacing the inſnitesimal strain measures by ſnite deformation measures that are nonlinear functions of the displacements. These nonlinear deformation measures capture the observed nonlinear behavior associated with the nonlinear kinematics of the problem. Degener et al. [290] also reported similar ſndings for the torsional behavior of beams subjected to large axial elongation. Much attention has been devoted to the problem of synthesizing accurate constitutive properties for the modeling of ƀexible bushings presenting complex, timedependent rheological behavior [291, 292]. It is worth stressing, however, that the literature seldom addresses three-dimensional joint deformations. Much like multibody codes, most ſnite element codes also support the modeling of lumped structural elements. While linear analysis is easily implemented, problems are encountered when dealing with ſnite displacements and rotations, as pointed out by Masarati and Morandini [293]. Structural analysis codes, either speciſcally intended for multibody dynamics analysis, like MSC/ADAMS, or for nonlinear ſnite element codes with multibody capabilities, like Abaqus/Standard, allow arbitrarily large absolute displacements and rotations of the nodes and correctly describe their rigid-body motion. When lumped deformable joints are used, relative displacements and rotations are often required to remain moderate, although not necessarily inſnitesimal. Such restrictions occur when using the FIELD element of MSC/ADAMS, a linear element that implements an orthotropic torsional spring based on a constant, orthotropic constitutive matrix [294]. Similarly, the JOINTC element implemented in Abaqus/Standard, describes the interaction between two nodes when the second node can “displace and rotate slightly with respect to the ſrst node [295],” because its formulation is based on an approximate relative rotation measure. The formulations and implementations of ƀexible joints available in research and commercial codes do not appear to allow arbitrarily large relative displacements and rotations. Moreover, in many cases, the ordering sequence of the nodes connected to the joint matters, because the behavior of the ƀexible joint is biased towards one of the nodes. This problem is known to experienced analysts using these codes. To the authors’ knowledge, these facts are rarely acknowledged in the literature. It appears that little effort has been devoted to the elimination of these shortcomings from the formulations found in research and commercially available codes, although the predictions of these codes might be unexpected. This section presents families of ſnite deformation measures that can be used to characterize the deformation of ƀexible joints. These deformation measures are closely related to the tensorial parameterization motion developed in chapter 14. Because they are of a tensorial nature, these deformation measures are intrinsic and invariant. Furthermore, it will be shown that using these strain measures in combination with the linear constitutive laws of the joint enable the accurate prediction of joint behavior under moderate deformation.
16.1 Formulation of ƀexible joints
603
16.1.1 Flexible joint conſguration Figure 16.1 shows a ƀexible joint in its reference and deformed conſgurations. It consists of a three-dimensional elastic body of ſnite dimension and of two rigid bodies, called handle k and handle , that are rigidly connected to the elastic body. In the reference conſguration, the conſguration of the handles is deſned by frame F0 = K = L, B0 = (¯b01 , ¯b02 , ¯b03 ) , where B0 forms an orthonormal basis. Points K and L are material points of handles k and , respectively, with coincident geometric locations. In the deformed conſguration, the two handles Handle l b03 B 0 move to new positions and the elastic body deforms. Points K and L are now at distinct locab02 b01 Elastic tions; the relative displacement vector of point L body with respect to point K is denoted u. The conſgK=L urations of the two handles are now distinct and two distinctframes, F k = K,B k = (¯bk1 , ¯bk2 , ¯bk3 ) and b03 F = L, B = (¯b1 , ¯b2 , ¯b3 ) , deſne the conſguraHandle k tions of handle k and , respectively. The relative b01 B0 b02 Reference configuration rotation tensor of basis B with respect to basis B k l b2 is denoted R. l Bl b3 The deformation of the ƀexible joint stems from ML applied forces and moments. At point K, the applied l b1 L FL force and moment vectors are denoted F k and M k , respectively; the corresponding quantities applied at u, R point L are denoted F and M , respectively. The MK loading applied to the ƀexible joint is deſned in the following manner k FK b3 K Fk F k Ak = , A = , (16.1) b1 k Mk M B k b2 where Ak and A denote the loads applied at points Deformed configuration K and L, respectively. According to Newton’s third Fig. 16.1. Conſguration of the law, these loads must be in equilibrium, i.e., ƀexible joint. I 0 Ak = − A . (16.2) u I The joint is assumed to be massless, i.e., inertial forces associated with its motion are neglected. The state of deformation of the elastic body depends on the relative displacement and rotation of the two handles and is unaffected by rigid body motions. Consequently, it is possible to assume that handle k does not move, and the relative displacement and rotation of handle with respect to handle k then simply becomes its absolute motion, as illustrated in ſg. 16.2. This conſguration is denoted scenario . Of course, scenario k could also be deſned in a similar manner if the location of handle is assumed to remain ſxed in space.
604
16 Formulation of ƀexible elements
Consider the differential displacement of point L shown in ſg 16.2. The components of this differential displacement vector in bases B k and B are du+ and RT du+ , respectively. The components of the differential rotation vector of handle are denoted dψ + = axial(dR RT ) and dψ ∗ = axial(RT dR) = RT dψ + when resolved in the same bases, respectively. The differential motion vector of point L is now deſned as T + R du dU ∗ = . (16.3) RT dψ + Superscripts (·)+ and (·)∗ indicate tensor components resolved in basis B k and B , respectively. The differential motion of the point Deformed l b3 of handle that instantaneously coinconfiguration l b2 cides with the origin of reference frame l of handle l B F0 , denoted dU + FL , is found from the ML l following frame change operation b1 + Reference L + dψ + du + u dU + = configuration + dψ (16.4) u, R b03 of handle l + ∗ + = C(u , R )dU , b03 B0 B0 K=L where R+ = R. b02 b 02 b 01 The load externally applied at b01 point L, denoted A , was deſned in eq. (16.1). These applied force and mo- Fig. 16.2. Conſguration of the ƀexible joint for ment vectors are now in basis scenario . For clarity of the ſgure, the elastic
resolved ∗T ∗T ∗T B to form A = F , M . The body is not shown. following change of frame operation is now considered F+ A+ = C −T (u+ , R+ )A∗ . = (16.5) + + M+ + u F Note the parallel between vector A+ and the second Piola-Kirchhoff stress tensor [296]. Indeed, A+ represents the true loads applied to handle in its deformed conſguration, but transferred to the original location of their application point in the ∗ reference conſguration. Loads A+ and A form a set of equipollent loads applied to handle . The change of frame operation described by eq. (16.5), expresses, in fact, a condition of equipollence. 16.1.2 Flexible joint differential work The differential work, dW , done by the forces applied to the joint is + ∗ + +T ∗T +T + dW = F +T du + M dψ = A dU = A dU ,
(16.6)
16.1 Formulation of ƀexible joints
605
where the last two equalities follow from eqs. (5.57) and (5.61), respectively. Because handle k does not move, the forces and moments applied at point K do not work. Let E + be a set of six generalized coordinates that uniquely deſne the conſguration of handle , i.e., a one-to-one mapping is assumed to exist between these generalized coordinates and the conſguration of handle . It then follows that a oneto-one mapping must exist between the handle’s differential motion and differentials of the generalized coordinates + dU ∗ = H∗ (E + )dE ,
+ + dU + = H(E )dE .
(16.7)
Matrix H(E + ) is the Jacobian matrix or tangent operator of the coordinate transformation. The differential work done by the forces applied to the joint, eq. (16.6), now becomes ∗ + + +T + + +T + dW = A∗T H (E )dE = A H(E )dE = L dE ,
(16.8)
where the generalized forces associated with the generalized coordinates are deſned as ∗T ∗ T + + L+ (E + (16.9) =H )A = H (E )A . It is now assumed that the ƀexible joint is made of an elastic material [285], which implies that the generalized forces can be derived from a potential, the strain energy of the joint, denoted A, L+ =
∂A(E + ) + . ∂E
(16.10)
The differential work now becomes dW = dE +T
∂A(E + ) = d(A), + ∂E
(16.11)
and can be expressed as the differential of a scalar function, the strain energy. The reasoning presented in this section could be repeated for scenario k. Because scenarios k and only differ by a rigid body motion, identical results should be obtained. In particular, the differential work for the two scenarios should be identical, + + +T + + leading to dW = A+T dU = Ak dU k . Loading A and Ak are referred to the same point, the origin of frame F0 , and expressed in the same basis, B0 ; Newton’s + ſrst law then implies A+ + Ak = 0, leading to the intuitive result that + dU + k = −dU .
(16.12)
16.1.3 The deformation measures In the previous section, quantities E + were deſned as “a set of generalized coordinates that uniquely deſne the conſguration of handle ,” but were otherwise left
606
16 Formulation of ƀexible elements
undeſned. For scenario , the conſguration of handle deſnes the deformation of the elastic body, and hence, these generalized coordinates are, in fact, deformation measures for the ƀexible joint. The following notation is introduced + E+ . (16.13) = κ+ The ſrst three components of this array form the stretch vector, denoted , and the last three the wryness vector, denoted κ. Both quantities are assumed to form ſrst-order tensors. Because the deformation measures uniquely deſne the conſguration of handle relative to handle k, the motion tensor, C(u, R), can be expressed as C = C(E + ). It follows that the deformation measures form a parameterization of the motion tensor. In general, the deformation measures are nonlinear functions of six quantities, the three components of the relative displacement vector, u, and the three parameters that deſne the relative rotation tensor, R. For instance, the stretch vector could be selected as the position vector of point L, + = u+ ; note that ∗ = RT + , as expected from the tensorial nature of the stretch vector. The Euler angles associated with rotation tensor R form a valid set of generalized coordinates to characterize the angular motion of handle , but cannot be the components of the wryness vector because Euler angles do not form the components of a vector. Any vectorial parameterization of rotation, see section 13.4, is a suitable choice for the wryness vector. 16.1.4 Change of reference frame The results derived in section 16.1.2 were based on the selection of an arbitrary reference frame, F0 , deſned by coincident reference points, K = L, which are material points of handles k and , respectively, and an orthonormal basis, B0 . Another reference frame could have been selected, F0 = K = L , B0 = (¯b01 , ¯b02 , ¯b03 ) ; ſgure 16.3 shows the new reference b03 K=L points, K = L , which are material B0 B0 s points of handles k and , respectively. b02 b02 K¢ = L¢ b01 For clarity, the new basis, B0 , is not b01 shown on the ſgure. The position vecFig. 16.3. Conſguration of the ƀexible joint for tor of point K with respect to point K scenario . For clarity of the ſgure, the elastic is denoted s and the relative rotation body is not shown. tensor of basis B0 with respect to basis B0 is denoted S. The motion tensor that brings frame to F0 to F0 is denoted C (s, S). l Deformed b3 l configuration b2 l of handle l B FL ML¢ l ML b1 L L¢ FL¢ Reference u¢, R configuration u, R of handle l b03
16.1 Formulation of ƀexible joints
607
The development presented in section 16.1.2 could now be repeated for this new choice of basis and reference points, leading to a new set of applied loads, A+ , + deformation measures, E + , tangent operator, H(E ), and associated generalized forces, L+ . If the same problem is treated with scenario in the two different frames, the sets of loads applied at points L and L must be equipollent, which implies −T A+ (s, S)A+ = C .
(16.14)
This equation expresses the relationship between the components of the loading vector in the two frames, F0 to F0 , i.e., the equipollence condition implies that the loading vector is a ſrst-order tensor, see eq. (16.5). To be physically meaningful, the corresponding deformation measures must also be invariant with respect to a change of frame, i.e., they must also be ſrst-order + tensors, and their components in two frames, F0 and F0 , denoted E + and E , respectively, must transform according to the rules of transformation for kinematic quantities given by eq. (16.4), i.e., + E+ = C (s, S)E .
(16.15)
This equation expresses the desired invariance of the deformation measure. The motion tensor is a second-order tensor and the deformation measure is a parameterization of this motion tensor. According to eq. (16.15), this parameterization must be a ſrst-order tensor. This implies that the deformation measure must be a tensorial parameterization of motion. 16.1.5 Deformation measure invariance Equation (16.6) expresses the invariance of the differential work with respect to a change of frame. The equipollence condition of the applied load is expressed by eq. (16.14) and introducing this condition into eq. (16.6) yields + dU + = C (s, S)dU .
(16.16)
The equipollence of the applied load and invariance of the differential work imply that the components of the loading vector transform according to the ſrst-order tensor transformation rule expressed by eq. (16.5) for loading quantities and the components of the differential displacement vector according to that expressed by eq. (16.4) for kinematic quantities. Introducing eq. (16.16) into eq. (16.7) and pre-multiplying by C −1 yields + −1 −1 dU = C −1 H(E + dE + = I. Introducing eq. (14.59) then )C C , where C C + + −1 + yields dU = H(E )C dE , which leads to the expected transformation rule for the components of the differential deformation measure + dE + = C dE .
(16.17)
The invariance of the differential work written in the form of eq. (16.8) requires + +T dW = L+T dE + dE = L . Introducing eq. (16.17) then yields
608
16 Formulation of ƀexible elements −T + L+ L . = C
(16.18)
In summary, the formulation developed in section 16.1.2, is frame invariant. Under a change of frame, the components of the applied and generalized loads transform according to eqs. (16.14) and (16.18), respectively. The components of the deformation measure, differential displacement, and differential deformation measure transform according to eqs. (16.15), (16.16), and (16.17), respectively. These energetically conjugate ſrst-order tensors present different transformation rules under a change of frame to guarantee the required invariance of the differential work. The invariance of the various quantities involved in the formulation stems from the tensorial nature of the deformation measure. Because this measure is selected to be the tensorial parameterization of motion, it must be an eigenvector of the mo+ tion tensor, i.e., E + = C E . Since the deformation measure is a kinematic quantity, + ∗ ∗ E = C E , and it follows that E + = E , i.e., the components of the deformation measure are identical in frames F0 and F . This implies that the deformation measure is identical when viewed by observers in frames F0 or F . Consequently, the deformation measure is not biased towards one of the nodes of the joint, a shortcoming of many of the formulation presently implemented in research and commercial codes. + Equation (16.12) implies E + k = −E , which simply corresponds to a sign con+ vention. Henceforth, notation E = E = E ∗ is used, which emphasizes the intrinsic nature of the deformation measure; of course, a change of sign is required for sce∗ nario k. Finally, eq. (16.10) implies L = L+ = L , which shows the intrinsic nature of the generalized forces; here again, a change of sign is required for scenario k. The proposed deformation measures are parallel to the eigenvector of the motion tensor associated with its unit eigenvalue. Because this eigenvalue has a multiplicity of two, two linearly independent eigenvectors exist, and the deformation measure is a linear combination of these two eigenvectors. An explicit expression of the deformation measure, see eq. (14.37), is D(κ)u E= = , (16.19) κ κ where the stretch vector, , is related to the displacement vector, u, of the handle, the wryness vector, κ, is the vectorial parameterization of rotation, and tensor D is deſned by eq. (14.38). 16.1.6 Flexible joint constitutive laws The strain energy of the ƀexible joint is assumed to be a quadratic function of the deformation measures, A = 1/2 E T K E, where K is the joint’s stiffness matrix for inſnitesimal deformations. The generalized forces now become L = K E, and eq. (16.9) then yields −T A+ (E)K E, = H
(16.20a)
A∗
(16.20b)
=H
∗−T
(E)K E.
16.1 Formulation of ƀexible joints
609
Due to the presence of the tangent tensor, the load-deformation relationships are nonlinear, and the deformation-displacement relationships, eqs. (16.19), are also nonlinear. The loads applied to handle resolved in basis B0 , denoted A, are A = T T A+ = R A∗ . The joint’s constitutive laws now become A=
0 F −T (κ) K E. −T T −T −T −H (κ)L ( , κ)F (κ) H (κ)
(16.21)
where tensors F , H, and L are deſned by eqs. (14.41), (13.55), and (14.46), respectively. Finally, inversion of this equation gives the constitutive laws in compliance form as T 0 F (κ) E=S T A, (16.22) T L ( , κ) H (κ) where S = K −1 is the compliance matrix for inſnitesimal deformations. Given the externally applied loads, A, this nonlinear equation yield the joints deformations, in terms of the stretch vector, , and the wryness vector, κ. This section has focused on the deſnition of appropriate deformation measures for elastic bodies of ſnite dimension, in contrast with classical strain measures that are deſned for inſnitesimal elements of an elastic body. It was ſrst argued that to be physically meaningful, these deformation measures must be of a tensorial nature. Next, it was proved that this requirement is satisſed if and only if the deformation measures are parallel to the eigenvector of the motion tensor associated with its unit eigenvalue. Equipped with these deformation measures, constitutive laws for the ƀexible joint were derived by assuming the existence of a strain energy function that is a quadratic form of these deformation measures. Because all the quantities involved in the formulation are of a tensorial nature, the behavior of the joint presents the required invariance with respect to changes of basis or reference point. Furthermore, the proposed strain measures are unbiased. Flexible joint formulations described in the literature up to date do not appear to present these desirable characteristics. Example 16.1. Simple beam treated as a ƀexible joint The load-deformation and deformation-conſguration relationships developed above will be tested on a number of simple examples involving a ƀexible beam. Figure 16.4 shows the beam of length L along unit vector ¯b01 , width b along ¯b02 , and height h along ¯b03 . The beam is made of a homogeneous material of Young’s modulus E and shear modulus G. The examples presented below use the following data: L = 0.6 m, b = 5 mm, h = 15 mm, E = 73 GPa, and G = E/(2(1 + ν)), where ν = 0.3. Handles k and are rigidly attached to the root and tip of the beam, respectively. Elementary structural analysis [285] yields the compliance matrix of the joint
610
16 Formulation of ƀexible elements
⎤ ⎡ L/S 0 0 0 0 0 ⎢ 0 L3 /3H33 0 0 0 L2 /2H33 ⎥ ⎥ ⎢ 3 2 ⎥ ⎢ 0 0 L /3H22 0 −L /2H22 0 ⎥, ⎢ S=⎢ ⎥ 0 0 0 L/H 0 0 11 ⎥ ⎢ ⎦ ⎣ 0 0 −L2 /2H22 0 L/H22 0 0 L2 /2H33 0 0 0 L/H33
(16.23)
where S = Ebh, H22 = Ebh3 /12, H33 = Ehb3 /12, and H11 = Ghb3 /3 are the beam’s axial stiffness, bending stiffness with respect the unit vector ¯ı2 , bending stiffness with respect the unit vector ¯ı3 , and torsional stiffness, respectively. Various combinations of forces and moments are applied to handle k, and the resulting displacements and rotations are then evaluated using the joint’s constitutive laws, eqs. (16.22). These predictions are compared with those of a ſnite element solution for the geometrically exact beam model presented in section 16.3, which provide an exact treatment of the kinematics of the system, but assume the strains to remain small at all time. This latter assumption is equivalent to assuming a constant compliance matrix, as done here. All the numerical solutions shown below are obtained by modeling the beam with 12 cubic elements, corresponding to a total 216 degrees of freedom. In the ſrst example, the joint is subjected to a single bending moment about unit vector ¯ı3 , denoted M3 . For this simple case, eqs. (16.22) can be solved analytically ¯ 3 , where φ is the rotation angle of handle about unit to yield κ3 (φ) = σ0 (φ)M ¯ vector ¯ı3 and M3 = LM3 /H33 . The displacement components of handle along unit vectors ¯ı1 and ¯ı2 are then u ¯1 = u1 /L = −(1−cos φ)/2 and u ¯2 = u2 /L = 1/2 sin φ.
u1 and u2 [m]
0.4 0.2 0 -0.2 -0.4
B0
b01 Flexible beam K = L
b03 Handle k
b
F1
Handle l
Fig. 16.4. Reference conſguration of the ƀexible beam.
r3 [rad]
b02
1.5
F2
L
1
0.5
0
0
5
10
15
20
25
TIP MOMENT M3 [N.m]
Fig. 16.5. Joint deformation under a single moment. Top ſgure: displacement components u1 (◦) and u2 ( ); bottom ſgure: rotation r3 ( ). Exact solution: symbols. Present solution: κ(φ) = φ, dashed line; κ(φ) = 4 sin φ/4, dotted line; κ(φ) = 4 tan φ/4, dashed-dotted line.
16.1 Formulation of ƀexible joints
611
The exact solution of this problem is easily found because the beam deforms ¯ 3, u into an arc of circle under the single applied moment, leading to φ = M ¯1 = −(1 − sin φ/φ)/2, and u ¯2 = (1 − cos φ)/φ, see ſg. 16.5. Three approximate solutions obtained from the proposed approach for three different generating functions, κ(φ) = φ, κ(φ) = 4 sin φ/4, and κ(φ) = 4 tan φ/4, are also depicted in this ſgure. For κ(φ) = φ, corresponding to the exponential map of rotation, the proposed approach gives the exact solution of the joint’s relative rotation. The transverse displacement of the joint is well captured up to very large displacement magnitudes, u2 ≈ 0.3 m, for a beam of length L = 0.6 m. The beam’s foreshortening, a higherorder nonlinear effect, is also well predicted up to large transverse displacements. If the joint were made of a nonlinear material, the curvature-relative rotation relationship would become nonlinear, and the generating function could be selected to approximate this numerically or experimentally observed behavior as closely as possible. This will enable the present approach to deal with nonlinear elastic manner in an approximate manner. This effect is apparent in ſg 16.5 that depicts the curvature-relative rotation relationship for generating functions κ(φ) = 4 sin φ/4 and κ(φ) = 4 tan φ/4, which give rise to softening or stiffening material behaviors, respectively. The second example involves the same ƀexible joint now subjected to two moment components, M2 = 3λ N·m and M3 = λ N·m, acting about unit vectors ¯ı2 and ¯ı3 , respectively, where λ ∈ [0, 12] is the loading factor.
0.15
u1 and u2 [m]
0.1
0
0.1 0.05 0
-0.1
-0.05
0.5
0.4
0.4
r3 [rad]
r1, r2, and r3 [rad]
u1, u2, and u3 [m]
0.2
0.3 0.2
0.1
0.1 0
0.3 0.2
0 0
2
4
6
8
10
12
LOADING FACTOR
Fig. 16.6. Joint deformation under two moments. Top ſgure: displacement components u1 (◦), u2 ( ), and u3 (2); bottom ſgure: exponential map components r1 (), r2 (), and r3 ( ). Finite element solution: symbols. Present solution: κ(φ) = φ, dashed line; κ(φ) = 2 sin φ/2, dotted line; κ(φ) = 2 tan φ/2, dashed-dotted line.
0
50
100
150
200
250
300
TENSILE FORCE F2
Fig. 16.7. Joint deformation under two forces. Top ſgure: displacement components u1 (◦) and u2 ( ); bottom ſgure: exponential map component r3 ( ). Finite element solution: symbols. Present solution: κ(φ) = φ, dashed line; κ(φ) = 2 sin φ/2, dotted line; κ(φ) = 2 tan φ/2, dashed-dotted line.
Figure 16.6 illustrate the ability of the proposed approach to capture the coupled, three-dimensional response of the joint up to large relative displacements and rotations.
612
16 Formulation of ƀexible elements
In the next example, the joint is subjected to two forces: a constant force F2 = 20 N and a linearly increasing tensile force F1 ∈ [0, 300] N, acting along unit vectors ¯ı2 and ¯ı1 , respectively. Under the effect of the tensile force, the joint stiffens and the displacement component u2 resulting from the constant force component F2 decreases, as shown in ſg. 16.7. Here again, the predictions of the proposed approach are found to be in qualitative agreement with the ſnite element solution. The stiffening of the joint under a tensile force is a nonlinear effect that is captured by the proposed approach because the equilibrium equations of the joint are expressed in the deformed conſguration of the system. This prompts the following question: is the proposed formulation able to predict the instability of the joint under compressive load?
0.5
u1 and u2 [m]
u1 and u2 [m]
0.5
0
-0.5
-0.5 2
2
1.5
1.5
r3 [rad]
r3 [rad]
0
1
0
1 0.5
0.5
0 0
10
20
30
40
50
60
70
80
90
100
AXIAL COMPRESSIVE LOAD F1 [N]
Fig. 16.8. Joint deformation under compressive force. Top ſgure: displacement components u1 (◦) and u2 ( ); bottom ſgure: exponential map component r3 ( ). Finite element solution: symbols. Present solution: κ(φ) = φ, dashed line; κ(φ) = 4 sin φ/4, dotted line; κ(φ) = 4 tan φ/4, dasheddotted line.
0
10
20
30
40
50
60
70
80
90
100
AXIAL COMPRESSIVE LOAD F1 [N]
Fig. 16.9. Joint deformation under two forces. Top ſgure: displacement components u1 (◦) and u2 ( ); bottom ſgure: exponential map component r3 ( ). Finite element solution: symbols. Present solution: κ(φ) = φ, dashed line; κ(φ) = 2 sin φ/2, dotted line; κ(φ) = 2 tan φ/2, dashed-dotted line.
Figure 16.8 shows the response of the system subjected to a small, constant load F2 = 0.1 N and a linearly increasing compressive load, F1 ∈ [0, 100] N. The Euler buckling load of the beam [285] is PEuler = π 2 H33 /(4L2 ) = 78 N, which is accurately predicted by the ſnite element model. The present model also predicts the buckling phenomenon, although for a lower compressive load of about 60 N. The inaccurate prediction of the present model is due to the fact that it uses 6 degrees of freedom only, in contrast with the 216 degrees of freedom used in the reference solution. Modeling the problem with a single two-node beam element also results in an inaccurate prediction of the buckling load, which is over-predicted by about 50%. It is also possible to trace the post-buckling path of the system. If a constant load F2 = 10 N and a compressive load F1 ∈ [0, 100] N are applied to the joint, it quickly enters the post-buckling regime, as depicted in ſg. 16.9. The proposed model
16.2 Formulation of cable equations
613
traces the post-buckling path for up to very large displacements and rotations: for a compressive load of 100 N, the relative rotation of the joint is of about 180 degrees. All the predictions presented in this example are in good qualitative agreement with exact solutions for geometrically exact beams obtained from nonlinear ſnite element simulations, up to very large relative displacements and rotations of the ƀexible joint. For small to moderate displacements and rotations, the agreement between the predictions of the proposed formulation and exact solutions is accurate. It must be emphasized that the present formulation only “knows” the linearized compliance matrix of the joint. The nonlinear governing equations of geometrically exact beams are not derived. Yet, the proposed deformation measures used in conjunction with the linearized compliance matrix provide constitutive laws for the ƀexible joint that qualitatively describe its behavior up to large relative displacements and rotations. Instabilities, such as buckling under large compressive load or lateral buckling under large transverse loads (not shown here for brevity sake) are also predicted by the proposed formulation. For small displacements and rotations, accurate predictions are obtained. While the proposed deformation measures remain tensorial for deformations of arbitrary magnitude, nonlinear constitutive laws should be used if the joint undergoes large deformations. The numerical examples presented in this example use linear constitutive laws to model a joint consisting of a simple ƀexible beam. The behavior joint is accurately predicted for small and moderate deformations and the correct qualitative behavior for up to very large displacements and rotations is observed.
16.2 Formulation of cable equations Cables are one-dimensional, ƀexible structures that can only carry axial forces, i.e., forces acting in the direction tangent to the cable. In contrast with beams, described in section 16.3, cables present no bending, torsional, or shearing stiffness. The kinematic description of cable structures in presented in section 16.2.1 and leads to the deſnition of the strain components in section 16.2.2. The governing equations for the static behavior of elastic cables are derived in section 16.2.3 and section 16.2.4 extends the formulation to dynamics problems. 16.2.1 The kinematics of the problem Figure 16.10 shows a ƀexible cable idealized as a curve in space. The reference and deformed conſgurations of the cable will be described with respect to an inertial reference frame, F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )]. Material point P of the cable is deſned by its curvilinear coordinate, α1 , which measures length along the reference conſguration of the cable. The position vector of point P is x = x(α1 ).
(16.24)
614
16 Formulation of ƀexible elements
Using eq. (15.26), base vector g¯1 becomes g¯1 =
∂x . ∂α1
(16.25)
The base vector is the unit tangent to the curve that deſnes the geometry of the cable in its reference conſguration; indeed, as discussed in section 2.2.1, curvilinear variable α1 represents an intrinsic parameterization of the curve. In the deformed conſguration, Deformed P G 1 the position vector of point P, deconfiguration noted X(α1 ), becomes X(a1) i3 u X(α1 ) = x(α1 ) + u(α1 ), a1 i2 I g1 (16.26) O P where u is the displacement vector Reference i1 x(a1) configuration of point P. The base vector in the deformed conſguration becomes Fig. 16.10. Cable in the reference and deformed conſgurations.
G1 =
∂X = g¯1 + u ∂α1
(16.27)
where notation (·) indicates a derivative with respect to α1 . Because the cable undergoes axial deformations, material coordinate α1 no longer measures length along the deformed conſguration of the cable; hence, as discussed in section 2.2.2, it represents an arbitrary parameterization of the curve deſning the geometry of the cable in its deformed conſguration. Base vector G1 is tangent to the deformed conſguration of the cable, but it is not a unit vector. Let unit vector j¯1 be parallel to base vector G1 , G1 = (1 + e¯11 ) j¯1 ,
(16.28)
where e¯11 is a strain related parameter which can be expressed in terms of displacements with the help of eqs. (16.27) and (16.28) (1 + e¯11 )2 = (¯ g1 + u )T (¯ g1 + u ).
(16.29)
Because the cable is a one dimensional structure, the metric tensor reduces to a single component, G11 = (1 + e¯11 )2 . The only non vanishing component of the GreenLagrange strain tensor, eq. (15.45), is e11 =
1 1 1 (1 + e¯11 )2 − 1 = e¯11 + e¯211 = g¯1T u + uT u , 2 2 2
(16.30)
where the strain parameter was expressed in terms of displacement using eq. (16.29). 16.2.2 The small strain assumption The strain-displacement relation, eq. (16.30), is valid for arbitrarily large displacements and strains. If the strain component can be assumed to remain much smaller
16.2 Formulation of cable equations
615
that unity, a simpliſed strain-displacement relationship can be obtained. The modiſed deformation gradient tensor deſned in section 15.5.1 reduces to a single component, Fˆ11 , obtained from eqs. (15.65) and (16.28) as Fˆ11 = 1 + e¯11 .
(16.31)
The small strain measure then follows from eq. (15.73) 1 γ11 = e¯11 ≈ g¯1T u + uT u . 2
(16.32)
The small strain assumption was used to approximate eq. (16.30), e11 = e¯11 + e¯211 /2 ≈ e¯11 , leading to the second equality of eq. (16.32). When the strains are small, it is clear that the strain parameter, e¯11 , is equal to the axial strain in the cable, γ11 . Variation of the small strain measure is δγ11 = δuT (¯ g1 + u ) = δuT G1 .
(16.33)
16.2.3 Governing equations The governing equations of the static problem are readily obtained from the principle of virtual work, eq. (15.72), which states % L% τ ∗11 δγ11 dAdα1 = δWext , (16.34) 0
A
for all arbitrary virtual displacements. The length of the cable in the reference conſguration is denoted L, A is its cross-section area, and δWext the virtual work done by the externally applied loads. Integrating the left-hand side over the cross-sectional area of the cable yields % L F ∗ δγ11 dα1 = δWext , (16.35) ∗
'
0 ∗11
dA is the total axial force in the cable along material axis j¯1 . where F = A τ The virtual work done by the forces externally applied to the cable is expressed 'L as δWext = 0 δuT f dα1 , where f is the externally applied load per unit length of the cable’s reference conſguration. Introducing the strain variation, eq. (16.33), into eq. (16.35) then leads to % L % L δuT F ∗ G1 dα1 = δuT f dα1 . (16.36) 0
0
Integration by parts then yields the governing equations of the problem,
[F ∗ G1 ] = −f .
(16.37)
If the cable is assumed to present a linear elastic behavior, the constitutive law simply states the proportionality of the axial force to the axial strain,
616
16 Formulation of ƀexible elements
F ∗ = Sγ11 ,
(16.38)
where S is the axial stiffness of the cable. Introducing the constitutive law into the governing equation, eq. (16.37), T Su (¯ g1 + u /2)G1 = −f . (16.39) When written in this form, the high level of nonlinearity of the equation governing the cable’s displacement ſeld is apparent. 16.2.4 Extension to dynamic problems The formulation presented thus far has focused on static problems. If the cable’s conſguration changes in time, the inertial velocity of a material point of the cable is v = u. ˙ The cable’s total kinetic energy is then % % % 1 L 1 L ρu˙ T u˙ dAdα1 = mu˙ T u˙ dα1 , (16.40) K= 2 0 A 2 0 ' where ρ is the cable’s mass density, and m = A ρ dA its mass per unit span in the reference conſguration. Variation of the kinetic energy is % L % L δK = δ u˙ T mu˙ dα1 = δ u˙ T p dα1 , (16.41) 0
0
where p = mu˙ is the momentum vector. Hamilton’s principle now yields the equations of motion of the problem m¨ u − [F ∗ G1 ] = f .
(16.42)
These equations of motion are valid for arbitrarily large displacements of the cable when the strain components are assumed to remain small. 16.2.5 Problems Problem 16.1. Linear elastic cable
Consider a cable with a linear elastic constitutive law: F ∗ = S γ11 , where S is the axial stiffness of the cable. The cable is unloaded. Prove: (1) the preservation the total linear momentum of the cable; (2) the preservation the total angular momentum of the cable; (3) the preservation the total mechanical energy of the cable. If the cable is subjected to distributed external loads f (α1 , t) and end forces F1 (t) and F2 (t) at α1 = 0 and L, respectively, what happens to the above three preservation laws?
Problem 16.2. Pre-stretched cable
Consider a straight, pre-stretched cable of length L. The constitutive law for the cable is F ∗ = S (¯ e +γ11 ), where e¯ is the pre-stretch, S the axial stiffness, and hence, T = S e¯ the pre-tension in the cable. Linearize the governing equations by assuming displacement ſeld to remain small. Find the equilibrium conſguration of the cable under a uniform transverse loading f0 . For the unloaded cable under pre-tension ſnd the natural frequencies and mode shapes of the system.
16.3 Formulation of beam equations
617
16.3 Formulation of beam equations A beam is deſned as a structure having one of its dimensions much larger than the other two. The axis of the beam is deſned along that longer dimension and its crosssection is normal to this axis. The cross-section’s geometric and physical properties are assumed to vary smoothly along the beam’s span. Civil engineering structures often consist of assemblies or grids of beams with cross-sections having shapes such as T’s or I’s. A large number of machine parts also are beam-like structures: linkages, transmission shafts, robotic arms, etc. Aeronautical structures such as aircraft wings or helicopter rotor blades are often treated as thin-walled beams. Finally, both tower and rotor blades of wind turbines also fall within the category of beams structures. The solid mechanics theory of beams, more commonly referred to simply as “beam theory,” plays an important role in structural analysis because it provides designers with simple tools to analyze numerous structures [285]. Within the framework of multibody dynamics, the governing equations for beam structures are nonlinear partial differential equations, and the ſnite element method is often used to obtain approximate numerical solutions of these equations. Of course, the same ſnite element approach could also be used to model the same structures based on plate and shell, or even three-dimensional elasticity models, but at a much higher computation cost. Beam models are often used at a pre-design stage because they provide valuable insight into the behavior of structures. Several beam theories have been developed based on various assumptions, and lead to different levels of accuracy. One of the simplest and most useful of these theories is due to Euler who analyzed the elastic deformation of a slender beam, a problem known as Euler’s Elastica [297]. Euler-Bernoulli beam theory [285] is now commonly used in many civil, mechanical and aerospace applications, although shear deformable beam theories [298, 299], often called “Timoshenko beams,” have also found wide acceptance. Reissner investigated beam theory for large strains [300] and large displacements of spatially curved members [301, 302]. In this section, the geometrically exact beam theory will be presented. The kinematic description of the problem developed in section 16.3.1 accounts for arbitrarily large displacements and rotation, hence the term “geometrically exact,” although the strain components are assumed to remain small, see section 15.5.2. The kinematics of geometrically beams was ſrst presented by Simo et al. [303, 304], but similar developments were proposed by Borri and Merlini [305] or Danielson and Hodges [306, 307]. In many applications, however, beams are, in fact, complex build-up structures with solid or thin-walled cross-sections. In aeronautical constructions, for instance, the increasing use of laminated composite materials leads to heterogeneous, highly anisotropic structures. The analysis of complex cross-sections featuring composite materials and the determination of the associated sectional properties was ſrst presented by Giavotto et al. [308, 309]. Their approach, based on linear elasticity theory, leads to a two-dimensional analysis of the beam’s cross-section using ſnite elements, which yields the sectional stiffness characteristics in the form of a 6×6 stiffness matrix relating the six sectional deformations, three strains and three curvatures, to the
618
16 Formulation of ƀexible elements
sectional loads, three forces and three moments. Furthermore, the three-dimensional strain ſeld at all points of the cross-section can be recovered once the sectional strains are known. For nonlinear problems, the decomposition of the beam problem into a linear, two-dimensional analysis over the cross-section, and a nonlinear, one-dimensional analysis along its span was ſrst proposed by Berdichevsky [310]. Hodges [311] has reviewed many approaches to beam modeling; he points out that although the twodimensional ſnite element analysis of the cross-section seems to be computationally expensive, it is, in fact, a preprocessing step that is performed once only. A uniſed theory presenting both Deformed linear, two-dimensional analysis over configuration the cross-section, and a nonlinear, oneB2 B3 i3 dimensional analysis along the beam’s P span was further reſned by Hodges and I O B B i2 his co-workers [312, 313]. The noni1 linear, one-dimensional analysis along B1 x0(a1) the beam’s span corresponds the geometrically exact beam theory develb u(a1) b3 2 oped earlier based on simpliſed kineP b1 matic assumptions. More sophisticated A B B0 beam theories have been developed that Reference account for Vlasov effects [314] or the configuration trapeze effect [315]. Detailed develFig. 16.11. Curved beam in the reference and opments of nonlinear composite beam deformed conſgurations. theory developed by Hodges and his coworkers are found in his textbook [316] and applications to multibody systems in ref. [283]. 16.3.1 Kinematics of the problem Figure 16.11 depicts an initially curved and twisted beam of length L, with a crosssection of arbitrary shape and area A. The volume of the beam is generated by sliding the cross-section along the reference line of the beam, which is deſned by an arbitrary curve in space. Curvilinear coordinate α1 deſnes the intrinsic parameterization of this curve, section 2.2.1, i.e., it measures length along the beam’s reference line. Point B is located at the intersection of the reference line with the plane of the crosssection. In the reference conſguration, an orthonormal basis, B0 (α1 ) = (¯b1 , ¯b2 , ¯b3 ), is deſned at point B. Vector ¯b1 is the unit tangent vector to the reference curve at that point, and unit vectors ¯b2 and ¯b3 deſne the plane to the cross-section. An inertial reference frame, F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )], is deſned, and the components of the rotation tensor that brings basis I to B0 , resolved in basis I, are denoted R0 (α1 ). The position vector of point B along the beam’s reference line is denoted x0 (α1 ). The position vector of material point P of the beam then becomes x(α1 , α2 , α3 ) = x0 (α1 ) + α2 ¯b2 + α3 ¯b3 , where α2 and α3 are the material
16.3 Formulation of beam equations
619
coordinates along unit vectors ¯b2 and ¯b3 , respectively. Coordinates α1 , α2 , and α3 form a natural choice of coordinates to represent the conſguration of the beam.
The displacement ſeld In the deformed conſguration, all the material points located on a cross-section of the beam move to new positions. This motion is decomposed into two parts, a rigid body motion and a warping displacement ſeld. The rigid body motion consists of a translation of the cross-section, characterized by displacement vector u(α1 ) of reference point B, and of a rotation of the cross-section, which brings basis B0 to ¯1 , B ¯2 , B ¯3 ), see ſg. 16.11. The components of the rotation tensor that B(α1 ) = (B brings basis B0 to B, resolved in basis I, are denoted R(α1 ). ¯1 + w2 B ¯2 + The warping displacement ſeld is deſned as w(α1 , α2 , α3 ) = w1 B ¯3 . This displacement ſeld represents a warping that includes both in-plane and w3 B out-of-plane deformations of the cross-section. To be uniquely deſned, the warping ſeld should be orthogonal to the rigid body motion [308, 316]. Consequently, unit ¯ 2 and B ¯3 deſne the average plane of the cross-section and vector B ¯1 is vectors B orthogonal to that plane. The position vector of point P in the deformed conſguration now becomes ¯2 + (w3 + α3 )B ¯3 . ¯1 + (w2 + α2 )B X(α1 , α2 , α3 ) = X 0 + w1 B
(16.43)
¯ i = R ¯bi = The position of point B is expressed as X 0 (α1 ) = x0 + u. Because B (R R0 ) ¯ıi , eq. (16.43) becomes X(α1 , α2 , α3 ) = x0 + u + (R R0 ) (w + α2 ¯ı2 + α3 ¯ı3 ) .
(16.44)
The warping displacement ſeld is computed from the geometric and stiffness properties of the cross-section, typically by solving a two-dimensional ſnite element problem over the cross-section, as described in refs. [308, 316]. The sectional strain measures The sectional strain measures for beams with shallow curvature are deſned as x0 + u − (R R0 ) ¯ı1 , e= = , (16.45) κ k + R ki where k = axial(R RT ) are the components of the sectional curvature vector resolved in the inertial basis and ki the components of the corresponding curvature vector in the reference conſguration. Notation (·) indicates a derivative with respect to α1 . The strain components resolved in the convected material basis, B, are denoted ∗ = (R R0 )T and consist of the sectional axial and shear strains. The curvature components resolved in the same material basis are denoted κ∗ = (R R0 )T κ
620
16 Formulation of ƀexible elements
and consist of the sectional twisting and bending curvatures. Notation (·)∗ indicates the components of vectors and tensors resolved in the material basis. By deſnition, a rigid body motion is a motion that generates no strains. This implies that the following rigid body motion, u(α1 ) = uR + (RR − I)x0 (α1 ), R(α1 ) = RR , consisting of a translation, uR , and a rotation about the origin characterized by a rotation matrix, RR , should generate no straining of the beam. It can be readily veriſed with the help of eqs. (16.45) that such rigid body motion results in = 0 and κ = 0, as expected. 16.3.2 Governing equations For the problem at hand, the principle of virtual work states % L (δ ∗T N ∗ + δκ∗T M ∗ ) dα1 = δWext ,
(16.46)
0
where N ∗ and M ∗ are the beam’s sectional forces and moments, respectively. The sectional constitutive law relates the sectional strain measures to the sectional loads, ∗ ∗ N ∗ =C , (16.47) M∗ κ∗ where C ∗ is the beam’s 6 × 6 sectional stiffness matrix. This matrix is a byproduct of a two-dimensional ſnite element analysis over the beam’s cross-section, as discussed in refs. [308, 316]. For homogeneous sections of simple geometry, exact or approximate analytical expressions are available for the stiffness matrix. Variations in strain components are expressed using eq. (16.45) to ſnd δ ∗ = (R R0 )T δu + ( (16.48a) x0 + u )δψ , δκ∗ = (R R0 )T δψ .
(16.48b)
where δψ = axial(δR RT ) is the virtual rotation vector. The principle of virtual work, eq. (16.46), now becomes % L 3 4 δuT + δψ T ( x0 + u )T N + δψ T M dα1 = δWext , (16.49) 0
where N = (R R0 )N ∗ and M = (R R0 )M ∗ are the beam’s internal forces and moments, respectively, resolved in the inertial basis. The virtual work done by the externally applied forces is expressed as δWext = 'L T [δu f + δψ T m] dα1 , where f and m denote the externally applied forces and 0 moments per unit span of the beam, respectively. The governing equations of the static problem then follow as
M +
( x0
N = −f ,
(16.50a)
+u )N = −m.
(16.50b)
16.3 Formulation of beam equations
621
Example 16.2. The cantilevered beam under tip loading Consider a cantilevered beam of length L with a rectangular cross-section of width b and height h. The beam is made of a homogeneous material of Young’s modulus E and shear modulus G and is subjected to a tip axial load, NT , tip transverse load, PT , and tip moment, MT . The beam is of bending stiffness H33 = Ebh3 /12, axial stiffness S = Ebh, and shearing stiffness K22 = 5Gbh/6. The loading is acting in plane (¯ı1 , ¯ı2 ), and due to the symmetry of the problem, the beam deforms in that plane only. The rotation tensor, R, then corresponds to a planar rotation, eq. (4.6), and the displacement vector, u, is two-dimensional Cθ −Sθ u1 , u= , R= Sθ Cθ u2 where Cθ = cos θ, Sθ = sin θ, angle θ is the average rotation of the cross-section, and u1 and u2 the displacement components along unit vectors ¯ı1 and ¯ı2 , respectively. Because the beam is not subjected to distributed transverse loads, the ſrst equa tion of equilibrium, eq. (16.50a), reduces to N = 0. Consequently, the sectional
T force, N = N1 , V2 , remain constant, where N1 and V2 are the sectional forces along unit vectors ¯ı1 and ¯ı2 , respectively. Since equilibrium must be satisſed at the tip of the beam, N1 (α1 ) = NT and V2 (α1 ) = PT . The second equation of equilibrium, eq. (16.50b), now becomes M = −( x0 + u )N , and since the sectional forces are constant, this equation integrates to M3 (α1 ) = u2 NT − (α1 + u1 )PT + c, where c is an integration constant. Because the problem is two-dimensional, the other two moment components, M1 and M2 , vanish. Imposing the moment equilibrium condition at the tip of the beam yields the integration constant and ſnally, M3 (α1 ) = MT + (L − α1 + uT1 − u1 )PT − (uT2 − u2 )NT , where uT1 and uT2 are the beam’s tip displacements along unit vectors ¯ı1 and ¯ı2 , respectively. The constitutive law for the bending moment is simply M3 = H33 θ ; indeed, for this two-dimensional problem, the curvature vector reduces to a single non vanishing component, κ3 = θ . The constitutive laws for the sectional forces becomes ∗ NT N1 N1 1 + u1 Cθ ∗ T = =R = R − , C R PT V2 V2∗ u2 Sθ where N1∗ and V2∗ are the sectional axial and shear forces, respectively, resolved in the material system, and C ∗ = diag(S, K22 ). Combining all the relationships obtained above yields the governing equations of the problem, ⎫ ⎧ PT NT 2 NT 2 PT ⎪ ⎪ ⎪ ⎪ ⎧ ⎫ ⎪ ⎪ ⎪ Cθ − 1 + S Cθ + K22 Sθ + ( S − K22 )Sθ Cθ ⎪ ⎪ ⎪ ⎨u 1 ⎬ ⎨ NT NT PT 2 PT 2 ⎬ u 2 = Sθ . − S + +( )Sθ Cθ + C ⎩ ⎭ ⎪ S K22 S θ K22 θ ⎪ ⎪ ⎪ θ ⎪ ⎪ M P N ⎪ ⎪ ⎪ ⎭ ⎩ T + (L − α1 + uT1 − u1 ) T − (uT2 − u2 ) T ⎪ H33 H33 H33
622
16 Formulation of ƀexible elements
To better understand these equations, it is convenient to normalize all quantities. First, the material coordinate, α1 , is normalized by the length of the beam, η = α1 /L, and notation (·)+ denotes a derivative with respect to η. The displacement components are also normalized by the beam’s length, u ¯1 = u1 /L and u ¯2 = u2 /L. ¯ = NT L2 /H33 , P¯ = PT L2 /H33 , The non-dimensional loading parameters are N ¯ = MT L/H33 , and the governing equations now become and M ⎧ ⎫+ ⎧ ⎫ ¯ (¯ a2 Cθ2 + s¯2 Sθ2 ) − P¯ (¯ s2 − a ¯2 )Sθ Cθ ⎬ ⎨u¯1 ⎬ ⎨ Cθ − 1 + N ¯ (¯ u¯2 + P¯ (¯ a2 Sθ2 + s¯2 Cθ2 ) − N s2 − a ¯2 )Sθ Cθ , = Sθ (16.51) ⎩ ⎭ ⎩ ⎭ T T ¯ (¯ ¯ + P¯ (1 − η + u θ ¯1 ) − N u2 − u ¯2 ) M ¯1 − u where the non-dimensional stiffness properties of the beam are deſned as 2 2 h H33 1 h H33 1 E 2 = , s ¯ = = . a ¯2 = 2 2 SL 12 L K22 L 10 G L The axial stiffness coefſcient, a ¯2 , is the ratio of the bending to the axial stiffness of the beam, and the shear stiffness coefſcient, s¯2 , is the ratio of the bending to the shear stiffness of the beam. For long, slender beams, both coefſcients are very small as (h/L)2 → 0 and can be assumed to vanish without noticeably affecting the predictions. The governing equations of the problem, eqs. (16.51), take the form of three coupled ſrst-order differential equations for the three variables of the problem, u ¯1 , u ¯2 , and θ. These equations are nonlinear due to the presence of trigonometric functions, but also because the beam’s unknown tip deƀections, u ¯T1 and u ¯T2 , appear on the right-hand side of the equations. A convenient solution technique is to assume u ¯T1 = u ¯T2 = 0 and integrate eqs. (16.51) numerically. The solution yields an estimate of the beam’s tip deƀections, which are then used to obtain a reſned solution by integrating eqs. (16.51) once again. An iterative procedure then yields the desired solution. This crude solution process will become unstable for large deƀections of the beam. Using a relaxation factor when updating the tip deƀections is often sufſcient to stabilize the computation. The deƀected shape of the beam under a tip loads NT and PT was computed using the procedure described above. The following parameters were used: E/G = ¯ = 0 and P¯ = 0.5, 1, 2.6 and L/h = 10. Simulations were performed ſrst for N 2, and 4. Figure 16.12 shows the predictions of the simulations. The tip deƀection of the beam is not proportional to the applied load, as expected for this nonlinear problem. ¯ = 0, 4, 8, and 12. As A second set of simulation was performed for P¯ = 4 and N the axial tip force, NT , increases, the effective stiffness of the beam increases and the tip deƀection under the constant tip transverse force decreases. 16.3.3 Extension to dynamic problems The developments presented thus far have focused on static problems. The inertial velocity vector, v, of a material point is found by taking a time derivative of its inertial position vector, eq. (16.44), to ſnd
16.3 Formulation of beam equations 0.6
0.5
0.5
0.4
0.4
u2
u2
0.6
623
0.3
0.3
0.2
0.2
0.1
0.1
0
0 0
0.1
0.2
0.3
0.4
0.5
h
0.6
0.7
0.8
0.9
Fig. 16.12. Deƀected shape of the beam un¯ = 0. P¯ = 0.5, der a tip transverse load for N 1, 2, and 4, indicated with symbols ◦, , , and , respectively.
0
0.1
0.2
0.3
0.4
0.5
h
0.6
0.7
0.8
0.9
Fig. 16.13. Deƀected shape of the beam un¯ = 0, der a tip transverse load for P¯ = 4. N 4, 8, and 12, indicated with symbols ◦, , , and , respectively.
˙ R s∗ = u˙ + (R R ) v = u˙ + R ω ∗ s∗ = u˙ + (R R0 ) s∗T ω ∗ , 0 0
(16.52)
where contributions of warping of the cross-section have been ignored and s∗T =
˙ indicates a derivative with respect to time and ω ∗ are the 0, α2 , α3 . Notation (·) components of the angular velocity vector in the material system, ω∗ = (R R0 )T ω, ˙ R). where ω = axial(R The components of the inertial velocity vector of a material point resolved in the material frame now become v ∗ = (R R0 )T v = (R R0 )T u˙ + s∗T ω ∗ .
(16.53)
The total inertial velocity of a material point has two components: the ſrst term, (R R0 )T u, ˙ due to the translation of the cross-section, and the second term, s∗T ω ∗ , due to its rotation. The kinetic energy The kinetic energy, K, of the beam is % % 1 L ρ v ∗T v ∗ dAdα1 , K= 2 0 A
(16.54)
where ρ is the mass density of the material per unit volume of the reference conſguration. Introducing eq. (16.53) for the inertial velocity yields % % 1 L ρ u˙ T (R R0 ) + ω ∗T s∗ (R R0 )T u˙ + s∗T ω∗ dAdα1 . (16.55) K= 2 0 A The following sectional mass constants are deſned % % % 1 ρ dA, η ∗ = ρ s∗ dA, ∗ = ρ s∗ s∗T dA, m= m A A A
(16.56)
624
16 Formulation of ƀexible elements
where m is the mass of the beam per unit span, η ∗ the components of the position vector of the sectional center of mass with respect to point B, see ſg. 16.11, and ∗ the components of the sectional tensor of inertia per unit span, all resolved in the material basis. After integration over the beam’s cross-section, the kinetic energy, eq. (16.55), becomes % 1 L T K= mu˙ u˙ + 2mu˙ T (R R0 ) η∗T ω∗ + ω ∗T ∗ ω ∗ dα1 2 0 % 1 L ∗T ∗ ∗ = V M V dα1 . 2 0 To obtain the compact form expressed by the second equality, the sectional mass matrix of the cross-section, resolved in the material basis, is deſned as η ∗T mI m ∗ , (16.57) M = m η ∗ ∗ and the sectional velocities, also resolved in the material basis, are given by 0 (R R0 )T (R R0 )T u˙ u˙ V∗ = = = (R R0 )T V. (16.58) 0 (R R0 )T ω ω∗ In this expression, the sectional velocities resolved in the inertial system were deſned
as V T = u˙ T , ω T and the following notation was introduced R R0 =
0 (R R0 ) . 0 (R R0 )
(16.59)
The components of the sectional linear and angular momenta resolved in the material system, denoted h∗ and g ∗ , respectively, are ∗ h ∗ = M∗ V ∗ . (16.60) P = g∗ The governing equations 'L
δV ∗T M∗ V ∗ dα1 , where the variations T ˙ T (R R ). in velocities are δ[u˙ T (R R0 )] = (δ u˙ T + δψ T u ˙ )(R R0 ) and δω∗T = δψ 0 Introducing these variations in the expression for the kinetic energy yields % L T ˙ T (R R ) g ∗ dα1 , δK = (δ u˙ T + δψ T u ˙ )(R R0 ) h∗ + δψ 0 Variation of the kinetic energy is δK =
0
0
The components of the sectional linear and angular momenta, denoted h and g, respectively, resolved in the inertial system are
16.3 Formulation of beam equations
h = (R R0 )P ∗ , P= g
625
(16.61)
where P ∗ are the corresponding quantities resolved in the material frame, see eq. (16.60). The variation in kinetic energy ſnally can be written as % δK =
L
T ˙ T g) dα1 . ˙ h + δψ (δ u˙ T h + δψ T u
(16.62)
0
With the help of eqs. (16.49) and (16.62), the governing equations of motion of the problem are obtained from Hamilton’s principle, which states that %
tf
ti
% 0
L
3 T ˙ g − (δuT + δψ T E 1T )N ˙ )h + δψ (δ u˙ T + δψ T u 4 −δψ T M + δuT f + δψ T m dα1 dt = 0.
Integration by parts yields the equations of motion of the problem h˙ − N = f ,
(16.63a)
˙ h − M − ( x0 + u )N = m. g˙ + u
(16.63b)
Example 16.3. The four-bar mechanism Figure 16.14 depicts a ƀexible four bar mechanism. Bar 1 is of length 0.12 m and is connected to the ground at point A by means of a revolute joint. Bar 2 is of length 0.24 m and is connected to bar 1 at point B with a revolute joint. Finally, bar 3 is of length 0.12 m and is connected to bar 2 and the ground at points C and D, respectively, by means of two revolute joints. -3
2.5
A W = 5 rad/s
D
Revolute joints Beams
2 1.5
DISPLACEMENT u3 [m]
0.24 m Bar 2 C Misaligned axis of rotation Bar 3
0.12 m Bar 1
B
x 10
1 0.5 0
-0.5 -1
-1.5 -2 0
0.5
1
1.5
2
2.5
TIME [s]
Fig. 16.14. Conſguration of the four bar mechanism.
Fig. 16.15. Out-of-plane displacement u3 at point C. (ρ∞ = 0)
In the reference conſguration, the bars of this planar mechanism intersect each other at 90 degree angles and the axes of rotation of the revolute joints at points A, B,
626
16 Formulation of ƀexible elements
and D are normal to the plane of the mechanism. The axis of rotation of the revolute joint at point C is at a 5 degree angle with respect to this normal to simulate an initial defect in the mechanism. The angular velocity at point A of bar 1 is prescribed to be Ω = 5 rad/s. If the bars were inſnitely rigid, no motion would be possible because the mechanism locks. For elastic bars, motion becomes possible, but generates large, rapidly varying internal forces. Bar 1 has the following physical characteristics: axial stiffness, EA = 40 MN, bending stiffnesses, EI22 = EI33 = 2.4 MN·m2 , torsional stiffness, GJ = 0.28 MN·m2 , shearing stiffnesses, K22 = K33 = 2 MN, mass per unit span, m = 3.2 kg/m, and mass moments of inertia, m22 = m33 = 0.012 kg·m. Bars 2 and 3 have the following physical characteristics: axial stiffness, EA = 4 MN, bending stiffnesses, EI22 = EI33 = 0.24 MN·m2 , torsional stiffness, GJ = 0.028 MN·m2 , shearing stiffnesses, K22 = K33 = 0.2 MN, mass per unit span, m = 1.6 kg/m, and mass moments of inertia, m22 = m33 = 0.06 kg·m. 7 6
ANGULAR VELOCITY [rad/s]
0.8 0.6
VELOCITY [m/s]
0.4 0.2 0 -0.2 -0.4 -0.6 -0.8
5 4 3 2 1 0 -1 -2
-1 0
0.5
1
1.5
2
2.5
TIME [s]
Fig. 16.16. Velocity components at point C. Solid line: v1 ; dashed line: v2 ; dashed-dotted line: v3 . (ρ∞ = 0)
-3
0
0.5
1
1.5
2
2.5
TIME [s]
Fig. 16.17. Angular velocity components at point C. Solid line: ω1 ; dashed line: ω2 ; dashed-dotted line: ω3 . (ρ∞ = 0)
This problem was simulated for a total of 2.5 s using the generalized-α scheme described in section 17.4 with ρ∞ = 0; a time step of constant size Δt = 2 ms was used. If the four revolute joints had their axes of rotation orthogonal to the plane of the mechanism, the response of the system would be purely planar, and bars 1 and 3 would rotate at constant angular velocities around points A and D, respectively. The initial defect in the mechanism causes a markedly different response. Bar 1 rotates at the constant prescribed angular velocity, but bar 3 now oscillates back and forth, never completing an entire turn. When the direction of rotation of bar 3 reverses, bar 2 undergoes large rotations, instead of near translation, and sharp increases in velocities are observed, as depicted in ſgs. 16.16 and 16.17, which show the three components of velocity and angular velocity at point C, respectively. Furthermore, ſg. 16.15 depicts the time history of out-of-plane displacements at point C; clearly, the response of the system is three-
16.3 Formulation of beam equations
627
dimensional: the out-of-plane displacement at point C has a magnitude of up to about 2 mm. 4
1
x 10
x 10
1
4
0.8 0.5 0.6
MOMENTS [N.m]
FORCES [N]
0
-0.5
-1
-1.5
0.4 0.2 0 -0.2 -0.4 -0.6
-2 -0.8 -2.5 0
0.5
1
1.5
2
-1
2.5
0
0.5
1
1.5
2
2.5
TIME [s]
TIME [s]
Fig. 16.18. Bar 1 force components at point A. Solid line: F1 ; dashed line: F2 ; dasheddotted line: F3 . (ρ∞ = 0)
Fig. 16.19. Bar 1 moment components at point A. Solid line: M1 ; dashed line: M2 ; dashed-dotted line: M3 . (ρ∞ = 0)
The time history of the three components of internal forces and bending moments in bar 1 at point A are shown in ſg. 16.18 and 16.19, respectively. These large internal forces and moments are all caused by the initial imperfection of the mechanism. 10
2.5
8
ANGULAR VELOCITY [rad/s]
2
VELOCITY [m/s]
1.5 1 0.5 0 -0.5 -1 -1.5 0
6 4 2 0 -2 -4 -6 -8
0.5
1
1.5
2
2.5
TIME [s]
Fig. 16.20. Bar 1 force components at point A. Solid line: F1 ; dashed line: F2 ; dasheddotted line: F3 . (ρ∞ = 0)
0
0.5
1
1.5
2
2.5
TIME [s]
Fig. 16.21. Bar 1 moment components at point A. Solid line: M1 ; dashed line: M2 ; dashed-dotted line: M3 . (ρ∞ = 0)
Next, the same simulation was run using the generalized-α scheme with ρ∞ = 0.85, see eq. (17.39). In the previous simulation, the spectral radius at inſnity ρ∞ = 0 achieves asymptotic annihilation, see ſg. 17.19; in contrast, the present simulation uses ρ∞ = 0.85, which generates very little algorithmic damping, even at high frequencies. Figures 16.20 and 16.21 show the three components of velocity and angular velocity at point C, respectively, for ρ∞ = 0.85, and should be compared with their counterparts, ſgs. 16.16 and 16.17, respectively, obtained for ρ∞ = 0.
628
16 Formulation of ƀexible elements
Using an initial time step size of Δt = 2 ms, the simulation with ρ∞ = 0.85 failed to converge at time steps 10 and 14. In both cases, the time step size was halved to allow the simulation to continue. Note that very high frequency oscillations of a purely numerical origin are predicted. The asymptotic annihilation achieved for ρ∞ = 0 effectively eliminates this undesirable numerical noise. 16.3.4 Problems Problem 16.3. Conservation properties for beams Consider an unloaded beam with linearly elastic constitutive laws. (1) Prove the preservation the total linear momentum of the beam. (2) Prove the preservation the total angular momentum of the beam. (3) Prove the preservation the total mechanical energy of the beam. If the beam is subjected to distributed external loads and concentrated end forces what happens to the above three preservation laws?
16.4 Formulation of plate and shell equations Section 16.3 presents the formulation of beams, which are structures possessing one dimension that is much larger than the other two. The present section focuses on another type of structural components, plates, for which one dimension is far smaller than the other two. The mid-plane of the plate lies along the two long dimensions of the plate, and the normal to the plate extends along the shorter dimension. The term “plate” is usually reserved for ƀat structures, while the term “shell” refers to a curved plate. Solid mechanics theories describing plates, more commonly referred to as plate theories, play an important role in structural analysis because they provide tools for the analysis of these commonly used structural components. Although more sophisticated formulations, such as three-dimensional elasticity theory, could be used for the analysis of plates and shells, the associated computational burden is often too heavy, and furthermore, plate and shell models provide valuable insight into the behavior of these structures at a much reduced computational cost. It is beyond the scope of this text to review the numerous formulations that have been developed for the analysis of plate and shell structures; comprehensive reviews of the topic are given by Noor et al. [317, 318]. Beam theories reduce the analysis of complex, three-dimensional structures to one-dimensional problems. Indeed, the governing equations for geometrically exact beams, eqs. (16.50), are ordinary differential equations expressed in terms of a single variable along the axis of the beam. In contrast, plate theories reduce the analysis of three-dimensional structures to two-dimensional problems. The equations of plate theory are partial differential equations in the two dimensions deſning the mid-plane of the plate.
16.4 Formulation of plate and shell equations
629
16.4.1 Kinematics of the shell problem Figure 16.22 depicts a shell of thickness h and mid-plane surface Sm . Let x0 (α1 , α2 ) be the position vector of an arbitrary point B on the shell’s mid-surface and let α1 and α2 be two coordinates that parameterize the mid-surface, see section 2.4. If the mid-surface of the shell is represented by an arbitrary set of coordinates, the expressions for the ſrst and second metric tensors of the surface, given by eqs. (2.37) and (2.47), respectively, will be complex. Consequently, it is natural to use the concept of lines of curvature introduced in section 2.4.5. In fact, shell theories are developed almost exclusively with the help of lines of curvature. In the reference conſguration frame F = B, B0 (α1 , α27) = (7¯b1 , ¯b2 , ¯b3 ) is de0 7 7 ſned at point B. Vector ¯b1 = x0,1 / 7x0,1 7 and ¯b2 = x0,2 / 7x0,2 7 are unit vectors deſning the plane tangent to the shell’s mid-surface and unit vector ¯b3 is the unit normal to this tangent plane. Notations (·),1 and (·),2 indicate derivatives with respect to α1 and α2 , respectively. An inertial reference frame, F I = [O, I = (¯ı1 , ¯ı2 , ¯ı3 )], is deſned and the components of the rotation tensor that brings basis I to B0 , resolved in basis I, are denoted R0 (α1 ). The position vector of point B B3 on the shell’s mid-surface is denoted u, R P B b3 x0 (α1 , α2 ). The position vector of maB2 B terial point P of the shell then becomes P b B 2 0 x(α1 , α2 , ζ) = x0 + ζ ¯b3 , where ζ is the h B B1 i3 Deformed material coordinate measuring length configuration b along the normal to the mid-surface. 1 x0, R0 X0 Reference Unit vector ¯b3 deſnes a material line, i2 configuration I i.e., a set of material particles that are i1 normal to the shell’s mid-surface in the reference conſguration. Coordinates α1 , α2 , and ζ form a set of curvilinear coordinates that is a natural choice of coordinates to represent the shell.
Fig. 16.22. Shell in the reference and deformed conſgurations.
The displacement ſeld In the deformed conſguration, all the material points located on a normal material line of the shell move to new positions. This motion is decomposed into two parts, a rigid body motion and a warping displacement ſeld. The rigid body motion consists of a translation of the normal material line, characterized by displacement vector u(α1 , α2 ) of reference point B, and a rotation of the material line, which brings basis ¯1 , B ¯2 , B ¯3 ), see ſg. 16.22. Unit vectors B ¯ 1 and B ¯2 deſne the plane B0 to B(α1 ) = (B ¯ tangent to the deformed mid-surface of the shell and unit vector B3 is normal to this plane. The components of the rotation tensor that brings basis B0 to B, resolved in basis I, are denoted R(α1 , α2 ).
630
16 Formulation of ƀexible elements
¯1 + w2 B ¯2 + The warping displacement ſeld is deſned as w(α1 , α2 , ζ) = w1 B ¯ w3 B3 . This displacement ſeld represents a warping that includes all possible deformations of the normal material line. To be uniquely deſned, the warping ſeld should be orthogonal to the rigid body motion [319, 320]. The position vector of point P in the deformed conſguration now becomes ¯3 . ¯1 + w2 B ¯2 + (w3 + ζ) B X(α1 , α2 , ζ) = X 0 + w1 B
(16.64)
The position of point B is expressed as X 0 (α1 , α2 ) = x0 + u. To uniquely de¯1 and B ¯ 2 , the following condition is imposed, ſne the orientations of unit vectors B ¯1T X 0,2 = B ¯2T X 0,1 . Because B ¯i = R ¯bi = (R R ) ¯ıi , eq. (16.64) becomes B 0 X(α1 , α2 , ζ) = x0 + u + (R R0 ) (w + ζ ¯ı3 ) ,
(16.65)
The warping displacement ſeld is computed from the geometric and stiffness properties of the normal material line, typically by solving a one-dimensional ſnite element problem over the material line, as described in refs. [319, 320]. The sectional strain measures The two-dimensional generalized strain measures for shallow shells are now deſned. They are conveniently divided into three groups, the mid-surface in-plane strain components, the transverse shear strain components, and the curvature components. The mid-surface in-plane strain components are * T + ˆ E ˆ e11 = E (16.66a) 1 1 − 1 /2, * T + ˆ E ˆ e22 = E (16.66b) 2 2 − 1 /2, T
ˆ E ˆ 2e12 = E 1 2.
(16.66c)
The transverse shearing strain components are ˆTE ˆ 2e13 = E 1 3,
(16.67a)
ˆTE ˆ E 2 3.
(16.67b)
2e23 = Finally, the curvature components are
ˆ E ˆ T1 √ 3,1 + 1 , κ11 = E a11 R1 ˆ E ˆ T √ 3,2 + 1 , κ22 = E 2 a22 R2 ˆ ˆ E E ˆ T √ 3,2 + E ˆ T √ 3,1 , κ12 = E 1 2 a22 a11
(16.68a) (16.68b) (16.68c)
16.4 Formulation of plate and shell equations
631
where R1 and R2 are the principal radii of curvature of the shell’s reference conſguration as deſned by eqs. (2.54). The shell’s deformation measures are deſned in terms three vectors, ˆ = ¯b1 + u /√a11 , E (16.69a) 1 ,1 √ ¯ ˆ = b2 + u / a22 , (16.69b) E 2
¯3 , ˆ =B E 3
,2
(16.69c)
72 72 7 7 where a11 = 7x0,1 7 and a22 = 7x0,2 7 are the diagonal terms of the shell’s ſrst metric tensor in its reference conſguration, see section 2.4.1 and eq. (2.37). Vector ˆ is the unit vector normal to the deformed mid-surface of the shell. E 3 The generalized strain measures are expressed in terms of ſve parameters: the three components of the displacement vector, u, appearing in the deſnition of vectors ˆ 1 and E ˆ 2 , eqs. (16.69a) and (16.69b), respectively, and the two parameters deſning E ˆ . the orientation of the unit normal vector, E 3 16.4.2 Governing equations The governing equations of the problem are obtained from the principle of virtual work, which states that δWint + δWext = 0, where δWint and δWext are the virtual works done by the internal forces and externally applied loads, respectively. Virtual work done by internal forces For simplicity, the shell’s two-dimensional generalized strain measures are collected into a single array, e∗ , deſned as
e∗T = e11 , e22 , e12 , e13 , e23 , κ11 , κ22 , κ12 . The ſrst three entries are the mid-surface in-plane strain components deſned by eqs. (16.66), the next two entries the transverse shearing strain components deſned by eqs. (16.67), and the last three entries the curvature components deſned by eqs. (16.68). The corresponding stress resultants are also collected in a single array, F ∗ , deſned as
∗ ∗ ∗ ∗ ∗ ∗ ∗ ∗ , N22 , N12 , N13 , N23 , M11 , M22 , M12 F ∗T = N11 . ∗ ∗ and N22 are the stress resultants The ſrst three entries are the in-plane forces; N11 ∗ ¯ ¯ along unit vectors B1 and B2 , respectively, and N12 is the in-plane shear force. The ∗ ∗ next two entries are the transverse shear forces; N13 and N23 act on faces normal to ¯1 and B ¯2 , respectively. Finally, the last three entries are the bending unit vectors B and twisting moments. Both forces and moments are measured per unit length of the shell, and resolved in material basis B. Evaluating the variation of the strain components given in eqs. (16.66), (16.67), and (16.68), the virtual work done by the internal forces becomes
632
16 Formulation of ƀexible elements
%
%
T δu,1 N 1 + δuT,2 N 2 Sm Sm 4 ˆ T M + δE ˆ T M + δE ˆ T N dSm . +δ E 3,1 3,2 3 1 2 3
δWint = −
δe
∗T
∗
F dSm = −
(16.70)
To simplify this expression, the following quantities were introduced ˆ ˆ E E 1 3,1 3,2 ∗ ˆ ∗ ˆ ∗ ˆ ∗ ∗ N11 E 1 + N12 E 2 + N13 E 3 + M11 √ , (16.71a) + M12 √ N1 = √ a11 a11 a22 ˆ 3,1 ˆ 3,2 E E 1 ∗ ˆ ∗ ˆ ∗ ˆ ∗ ∗ N12 E 1 + N22 E 2 + N23 E 3 + M12 √ , (16.71b) N2 = √ + M22 √ a22 a11 a22 ∗ ˆ ∗ ˆ N 3 = N13 E 1 + N23 E2, 1 ∗ ˆ ∗ ˆ M11 E 1 + M12 E2 , M1 = √ a11 1 ∗ ˆ ∗ ˆ M2 = √ M12 E 1 + M22 E2 . a22
(16.71c) (16.71d) (16.71e)
Constitutive laws The stress resultants are related to the strain measures through the constitutive law F ∗ = C ∗ e∗ .
(16.72)
where C ∗ is the shells’s 8 × 8 sectional stiffness matrix. This matrix is a byproduct of a one-dimensional ſnite element analysis through the shell’s thickness, as discussed in refs. [319, 320]. Virtual work done by externally applied loads Let f and m denote the force and moment vectors applied to the shell’s mid-surface per unit area, respectively. The virtual work done by these externally applied loads is expressed as % * + δWext = δuT f + δψ T m dSm , (16.73) Sm
where δu is the virtual displacement vector of the point of application of the force and δψ the virtual rotation vector of the same point. ˆ is a director, as deſned in section 4.15, and can be expressed as Unit vector E 3 ˆ ˆ3 = E 3 = (R R0 ) ¯ı3 . A virtual change in this director’s orientation then becomes δ E ∗ ∗ T (R R0 ) ı3 b δα , see eq. (4.113), where δα is a two-parameter virtual rotation vector resolved in material basis B and matrix b is deſned by eq. (4.112). The virtual work done by the applied moment becomes δψ T m = δψ ∗T m∗ = δα∗T bT m∗ , where m∗ denotes the components of the applied moment vector, resolved in material basis B, m∗ = (R R0 )T m. Because the last row of matrix b stores
16.4 Formulation of plate and shell equations
633
two vanishing entries, see eq. (4.112), the product bT m∗ , ignores the last component of vector m∗ . This last component, called the drilling moment, is the component of the externally applied moment acting about the normal to the shell’s mid-surface. Because the shell presents no stiffness about this axis, it cannot carry a drilling moment. The virtual work done by the externally applied moment, δα∗T bT m∗ , automatically ſlters out the contribution of the drilling moment. Equations of motion Now that the virtual work done by both internal force and externally applied loads have been evaluated, the principle of virtual work states that % 3 δuT f + δα∗T bT m∗ − δuT −N 1,1 − N 2,2 Sm (16.74) 4 ∗T T T dSm = 0. −δα b ı3 (R R0 ) N 3 − M 1,1 − M 2,2 The governing equations ſnally become
b ı3 (R R0 )T T
N 1,1 + N 2,2 = −f , " M 1,1 + M 2,2 − N 3 = −bT m∗ .
!
(16.75a) (16.75b)
16.4.3 Extension to dynamic problems The velocity of a material of the shell is computed as a time derivative of the position ˙ = u+ζ vector, eq. (16.65), to ſnd X ˙ B˙ 3 , where velocity components associated with the warping ſeld have been ignored. The kinetic energy of the shell then becomes % % 1 T K= ρ (u˙ T + ζ E˙ 3 )(u˙ + ζ E˙ 3 ) dζdSm , 2 Sm h where ρ is the material density. Integration through the shell thickness then yields % 1 K= V ∗T M∗ V ∗ dSm , 2 Sm where V ∗T = deſned as
u, ˙ B˙ 3 is the velocity vector and the 6 × 6 mass matrix, M∗ , is mI m∗ I M∗ = . m∗ I M ∗ I
The following mass coefſcients were deſned % % m= ρ dζ, m∗ = ρζ dζ, h
h
M∗ =
% ρζ 2 dζ, h
634
16 Formulation of ƀexible elements
where m is the mass of the shell per unit mid-surface area, m∗ /m the location of the center mass, and M ∗ /m the square of the radius of gyration. Virtual changes in the kinetic energy become % T δK = (δ u˙ T h + δ E˙ 3 g¯) dSm , (16.76) Sm
∗ ˙ ∗ ˙ where h = mu+m ˙ ˙ E 3 and g = m∗ u+M E 3 are the linear and angular momentum vectors, respectively. The governing equations of motion are then obtained from Hamilton’s principle that becomes % % 3 δuT −h˙ + N 1,1 + N 2,2 + δα∗T bT ı3 (R R0 )T t Sm 4 −g¯˙ − N 3 + M 1,1 + M 2,2 + δuT f + δα∗T bT m∗ dSm dt = 0.
The governing equations of motion ſnally become
bT ı3 (R R0 )T
h˙ − N 1,1 − N 2,2 = f , g¯˙ + N 3 − M 1,1 − M 2,2 = bT m∗ .
(16.77a) (16.77b)
16.4.4 Mixed interpolation of tensorial components Several recently developed shell elements have distinguished themselves from other shell formulations because of their versatility, accuracy and robustness. One of these is the mixed interpolation of tensorial components (MITC) element developed by Bathe and his co-workers [321, 322, 323]. The MITC approach is based on the interpolation of strains at chosen sampling points (so-called “tying points”). The key issue of this approach is the selection of the tying points and corresponding interpolation functions. In case of the nine-noded MITC9 element, the interpolated strain components are deſned as ( ( ( α α α α α α e11 = grr e11 , e22 = gss e22 , e12 = grs e12 ; (16.78a) α
α
e13 =
( α
α α grr e13 ,
α
e23 =
(
α α gss e23 .
(16.78b)
α
α α α , gss , and grs are the strain interpolation functions and eij the strain comwhere grr ponents at the α tying point, which are obtained by direct interpolation using the ſnite element displacement assumptions. The location of the tying points and corresponding strain interpolation functions can be found, for example, in [322, 323] for each strain component. For the MITC9 element, the strain components e11 and α e13 are interpolated based on six tying points, using the shape functions grr . The strain components e22 and e23 are interpolated based on six tying points, using the
16.4 Formulation of plate and shell equations
635
α shape functions gss . Finally, the in-plane shearing strain component e12 is interpoα lated based on four tying points, using the shape functions grs . This approach takes care of both membrane and transverse shearing strain locking problems. The stiffness matrix of the element is then formed based on these interpolated strain components and full integration is used. The element does not present any spurious mechanism. In view of the more complicated strain interpolation and full integration scheme, the MITC9 element is a more computationally expensive element, but it is accurate and fairly insensitive to element deformations.
Example 16.4. Lateral buckling of a thin plate Figure 16.23 depicts a thin cantilevered plate acted upon by a crank and link mechanism. The plate is of length L = 1m, height h = 80 mm, thickness t = 2 mm, and is made of steel with the following properties: Young’s modulus E = 210 GPa, Poisson ratio ν = 0.25 and density ρ = 7870 kg/m3 . It is clamped along edge AB and a reinforcing beam is located along edge CD. i3
L = 1m Thin plate
B
D
Beam
M
i1
+
C
C
Link
~
Link Gf
t = 2 mm
+
h= 80 mm
+
Spherical joint Revolute joint Cross-section
Ll = 0.5m
A
Crank
L
Lc = 10 mm
Fig. 16.23. Thin plate actuated by a crank.
At point C, the reinforcing beam connects to a crank and link mechanism through a spherical joint. The crank of length Lc = 10 mm is attached to the ground at point G and the link is of length L = 0.5 m. The ground, crank, and link are connected together by means of revolute joints. The crank is modeled as a rigid body and its rotation is prescribed as φ = π(1 − cos 2πt/T )/4 for t ≤ T /2 s and φ = π/2 for t > T /2, where T = 1.6 s. The reinforcing beam has the following physical characteristics: axial stiffness, EA = 3.36 MN, bending stiffnesses, EI22 = EI33 = 4.48 N·m2 , torsional stiffness, GJ = 3.02 N·m2 , shearing stiffnesses, K22 = K33 = 1.12 MN, mass per unit span, m = 0.126 kg/m, and mass moments of inertia, m22 = m33 = 0.168 mg·m. The link has the following physical characteristics: axial stiffness, EA = 44 MN, bending stiffnesses, EI22 = EI33 = 0.3 MN·m2 , torsional stiffness, GJ = 28 kN·m2 , shearing stiffnesses, K22 = K33 = 2.4 MN, mass per unit span, m = 1.6 kg/m, and mass moments of inertia, m22 = m33 = 0.011 kg·m.
636
16 Formulation of ƀexible elements
The link is modeled as a geometrically exact beam, see section 16.3. The thin plate is modeled with a 2 × 6 mesh of quadratic elements. The system is simulated for 1.4 s using a constant time step Δt = 0.5 ms using the generalized-α scheme with ρ∞ = 0. As the crank rotates, the plate deƀects downwards then snaps laterally when its buckling load is reached. In the post-buckling regime, the plate becomes signiſcantly softer in bending due to its large twisting allowed by the spherical joint. 0.1
0
0
-0.02
-0.1
ROTATIONS [rad]
DISPLACEMENTS [m]
0.02
-0.04
-0.06
-0.08
-0.3
-0.4
-0.5
-0.1
-0.12
-0.2
0
0.2
0.4
0.6
0.8
1
TIME [s]
1.2
-0.6
1.4
Fig. 16.24. Displacement components at point M. Solid line: u1 ; dashed line: u2 ; dashed-dotted line: u3 .
0
0.2
0.4
0.6
0.8
TIME [s]
1
1.2
1.4
Fig. 16.25. Rotation components at point M. Solid line: r1 ; dashed line: r2 ; dashed-dotted line: r3 .
The plate’s displacement components at point M are shown in ſg. 16.24. At time t = 0.145 s, the plate buckles laterally and the transverse displacement, which was vanishingly small up to that time, suddenly becomes very large. For time t > 0.8 s, the crank angle remains constant at φ = π/2, but the plate continues to vibrate because no dissipative mechanism is present in the system. The components of the Wiener-Milenkovi´c vectorial parameterization of rotation at point M are shown in ſg. 16.25. 5
x 10
2
350 300 250
1 200
MOMENTS [N]
IN-PLANE FORCES [N/m]
1.5
0.5 0 -0.5
150 100 50 0
-1 -50 -1.5 -2
-100 -150 0
0.2
0.4
0.6
0.8
1
1.2
1.4
TIME [s]
Fig. 16.26. In-plane force components at ∗ ∗ ; dashed line: N22 ; point M. Solid line: N11 ∗ dashed-dotted line: N12 .
0
0.2
0.4
0.6
0.8
1
1.2
1.4
TIME [s]
Fig. 16.27. Moment components at point M. ∗ ∗ ; dashed line: M22 ; dashedSolid line: M11 ∗ dotted line: M12 .
16.4 Formulation of plate and shell equations
637
The force and moments components in the plate at point M are depicted in ſgs. 16.26 and 16.27, respectively. Prior to buckling, the plate resists the bending loads applied by the driving mechanism with very little deformations. The in-plane ∗ shear force component, N12 , reƀects the tip shear force applied by the crank and link mechanism, but all other force and moment components vanish. Once buckling has occurred, twisting of the plate renders it much softer in the vertical direction, offering little resistance to crank motion. Because the lateral buckling occurs so suddenly, high frequency vibrations are observed. 4
x 10
30
1
20
0.5
10
DRIVING MOMENT [N.m]
SHEAR FORCES [N/m]
1.5
0 -0.5 -1 -1.5 -2 -2.5 -3 -3.5
0 -10 -20 -30 -40 -50 -60
0
0.2
0.4
0.6
0.8
1
1.2
1.4
TIME [s]
Fig. 16.28. Transverse shear force compo∗ ; dashed nents at point M. Solid line: N13 ∗ . line: N23
-70
0
0.2
0.4
0.6
0.8
1
1.2
1.4
TIME [s]
Fig. 16.29. Rotation components at midpoint. Solid line: r1 ; dashed line: r2 ; dasheddotted line: r3 .
Figure 16.28 shows the corresponding transverse shear force components. Finally, the driving torque, i.e., to torque applied to the crank at point G to achieve the prescribed schedule of angle φ is depicted in ſg. 16.29. This quantity is, in fact, Lagrange’s multiplier used to enforce the prescribed rotation holonomic constraint. As soon as the plate buckles, the magnitude if this moment decreases suddenly because of the plate’s apparent softening when it buckles laterally. Finally, the same problem was simulated using the generalized-α scheme with ρ∞ = 1. In this case, due to the lack of numerical dissipation, high frequency oscillations with amplitudes an order of magnitude larger than those predicted for ρ∞ = 0 are observed. This numerical noise completely obscures the results of the simulation demonstrating here again the need for integration schemes presenting numerical dissipation.
17 Finite element tools
Numerous textbooks [324, 197, 198] present detailed development of the theoretical and numerical concepts underpinning the ſnite element method. Similar developments are clearly beyond the scope of this text. The present chapter focuses on speciſc details of the ſnite element method that are relevant to its application to ƀexible multibody systems. Techniques for interpolation of displacement and specially rotation ſelds are presented in sections 17.1 and 17.2, respectively. Next, general processes for the linearization of the governing equations are presented in section 17.3. Both statics and dynamics problems are addressed, with special emphasis of the equations characterizing ƀexible multibody systems subjected to both holonomic and nonholonomic constraints. Time integrations schemes are a crucial part of the solution process for multibody dynamics codes. Because ſnite element based formulations of multibody dynamics inherit many of the characteristics of the ſnite element methods, it should not come as a surprise if the time integration schemes used in ſnite element implementations are also used for multibody dynamics. In particular, the HHT scheme and its generalization, the generalized-α scheme, both workhorses used in most commercial codes, are reviewed in section 17.4. Section 17.5 discusses energy preserving and decaying schemes that have been developed in recent years for application to ƀexible multibody systems. The chapter closes with a detailed presentation of the implementation of two elements: the cable and the beam element are presented in sections 17.6 and 17.7, respectively.
17.1 Interpolation of displacement ſelds The present section discusses simple numerical tools used to interpolate displacement ſelds within one-dimensional ſnite elements, such as the cable or beam elements discussed in sections 16.2 or 16.3, respectively. Interpolation is a linear operation that has been used for decades to interpolate displacement ſelds, which form a linear space. The interpolation of rotation ſelds is addressed in section 17.2.
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_17 © Springer Science+Business Media B.V. 2011
640
17 Finite element tools
In the ſnite element method, the solution domain is ſrst divided into a ſnite number of sub-domains called ſnite elements. Within each element, the solution is then approximated by a ſnite number of continuous functions, based on the value of these functions at discrete points, called nodes, associated with the element. The main advantage of this two-step approximation process is that many aspects of the solution procedure can be carried out at the element level, i.e., by considering one single element at a time, independently of all others. The continuity of the solution across element boundaries can be guaranteed by the fact that neighboring elements share common nodes. Consider an element of length , described by material coordinate α1 . To illustrate the process, the geometry and displacement ſeld of the element are assumed to be deſned at three nodes along the element. The ſrst two nodes, denoted nodes 1 and 2, are located at the end points of the element, and one additional node, denoted node 3, is inside the element. Let r 1 , r 2 , and r 3 be the position vectors of nodes 1, 2, and 3, respectively; similarly, let u1 , u2 , and u3 be the displacement vectors of nodes 1, 2, and 3, respectively. The geometry and displacement ſeld of the element are now interpolated based on the values of the position and displacement vectors at the nodes using shape functions denoted h1 (s), h2 (s), and h3 (s), r(s) = h1 (s)r 1 + h2 (s)r 2 + h3 (s)r3 ,
(17.1a)
u(s) = h1 (s)u1 + h2 (s)u2 + h3 (s)u3 ,
(17.1b)
where variable s is a non-dimensional quantity deſned along the span of the element. Node 1, 2, and 3 are located at s = −1, +1, and 0, respectively. The shape functions are as yet undetermined, but at s = −1, +1, and 0, the approximation must recover nodal values exactly. For instance, at s = −1, eq. (17.1a) yields r(−1) = h1 (−1)r1 + h2 (−1)r2 + h3 (−1)r3 = r 1 , which implies h1 (−1) = 1, h2 (−1) = h3 (−1) = 0. Proceeding similarly at s = +1 and 0 leads to the following conditions that must be satisſed by the shape functions, h1 (−1) = 1,
h1 (+1) = 0,
h1 (0) = 0,
h2 (−1) = 0, h3 (−1) = 0,
h2 (+1) = 1, h3 (+1) = 0,
h2 (0) = 0, h3 (0) = 1.
(17.2)
Conditions (17.2) alone do not uniquely deſne the shape functions. It is convenient, however, to select the shape functions in the form of quadratic polynomials because each shape function is then uniquely deſned by conditions (17.2). It is easily veriſed that the desired shape functions are 1 h1 (s) = − s(1 − s), 2
h2 (s) =
1 s(1 + s), 2
h3 (s) = 1 − s2 .
(17.3)
The reasoning developed in the previous paragraphs can be repeated for elements presenting two, three, or four nodes, leading to linear, quadratic, or cubic polynomial shape functions, respectively. For elements featuring two nodes located at their end points, the two linear shape functions are
17.1 Interpolation of displacement ſelds
h1 (s) =
1 (1 − s), 2
h2 (s) =
1 (1 + s). 2
641
(17.4)
For elements with four nodes, two at their end points and two internal nodes located at s = ∓1/3, the four cubic shape functions are 9 2 (s − 16 9 2 h2 (s) = (s − 16 h1 (s) =
1 )(1 − s), 9 1 )(1 + s), 9
h3 (s) = − h4 (s) =
27 (1 − s2 )(s − 16 27 (1 − s2 )(s + 16
1 ), 3 1 ). 3
(17.5)
1
1
LINEAR
LINEAR
The shape functions deſned by eqs. (17.4), (17.3), and (17.5) are depicted in the top, middle, and bottom portions of ſg. 17.1, respectively. Derivatives of the shape functions with respect to variable s will also be necessary and are readily computed from eqs. (17.4), (17.3), and (17.5). Figure 17.2 depicts these derivatives.
0.5
1
2
0
0.5
QUADRATIC
0.5
1
2
3
0
2 1 0 -1 -2 5
1 0.5
1
3
CUBIC
CUBIC
QUADRATIC
0 1
2
4
0 -1
-0.8
-0.6
-0.4
-0.2
0
s
0.2
0.4
0.6
0.8
-5 -1
1
Fig. 17.1. Shape function. Linear: top ſgure. Quadratic: middle ſgure. Cubic: bottom ſgure. Red circles indicate the node locations.
0
-0.8
-0.6
-0.4
-0.2
0
s
0.2
0.4
0.6
0.8
1
Fig. 17.2. Derivatives of the shape function. Linear: top ſgure. Quadratic: middle ſgure. Cubic: bottom ſgure.
It will be convenient to introduce a compact notation for the interpolation operation expressed by eq. (17.1b). First, the displacement interpolation matrix, N (s), is deſned as N (s) = h1 (s)I, h2 (s)I, h3 (s)I . (17.6) Next, the nodal displacements are stored in a single array, denoted u ˆ , and deſned as
(17.7) u ˆ T = uT1 , uT2 , uT3 . With this notation, eq. (17.1b) simply becomes u. u(s) = N (s)ˆ
(17.8)
The sizes of the displacement interpolation matrix and nodal displacement array will vary according to the number of nodes used for the interpolation. In many applications, the derivatives of the displacement ſeld will also be required and are easily found as
642
17 Finite element tools
u+ (s) = N + (s)ˆ u,
(17.9)
where notation (·)+ indicates a derivative with respect to s, and the displacement derivative interpolation matrix, N + (s), simply stores the derivatives of the shape functions, + + N + (s) = h+ (17.10) 1 (s)I, h2 (s)I, h3 (s)I . Example 17.1. Interpolation of a displacement ſeld Consider a displacement ſeld with the following nodal values, u1 = 0.9, u2 = 0.1, u3 = 0.3, and u4 = 0.5. For this simple example, a single component of the displacement vector is considered. Find the interpolated displacement ſeld over the element. Consider ſrst the case of linear interpolation. In this case, the two end nodes only will be used, and the displacement interpolation matrix deſned by eq. (17.6) reduces to N (s) = (1 − s)/2, (1 + s)/2 . The single line corresponds to the single displacement component and the two columns correspond to the two nodes of the element. The linear shape functions are those of eq. (17.4). Equation (17.8) now yields the interpolated displacement ſeld as u(s) = 1/2 (1 − s)u1 + 1/2 (1 + s)u2 = 1/2 (1 − s) 0.9 + 1/2 (1 + s) 0.1. This corresponds to the straight line interpolation depicted in the top portion of ſg. 17.3. Quadratic interpolation is considered next. The two end nodes are used, together with the third node, where the displacement is u3 = 0.3. The displacement interpolation matrix is now of size 1 × 3, and the quadratic shape functions deſned by eq. (17.3) appear along the single line of this matrix. Equation (17.8) now yields the interpolated displacement ſeld as u(s) = −1/2 s(1 − s) 0.9 + 1/2 s(1 + s) 0.1 + (1 − s2 ) 0.3. This corresponds to the parabolic interpolation depicted in the middle portion of ſg. 17.3. Finally, cubic interpolation is used. The cubic shape functions deſned in eq. (17.5) are now stored in the displacement interpolation matrix of size 1 × 4. Equation (17.8) now yields the interpolated displacement ſeld as u(s) = 9/16 (s2 − 1/9)(1 − s) 0.9 + 9/16 (s2 − 1/9)(1 + s) 0.1 − 27/16 (1 − s2 )(s − 1/3) 0.3 + 27/16 (1 − s2 )(s + 1/3) 0.5, which corresponds to the cubic interpolation depicted in the bottom portion of ſg. 17.3. The derivative of the displacement ſeld is readily obtained from eq. (17.9) and is depicted in ſg. 17.4. Example 17.2. Evaluation of the strain ſeld The shape functions developed in section 17.1 are deſned in terms of the nondimensional variable, s, and the derivative of the displacement ſeld with respect to this non-dimensional variable is easily evaluated. The strain ſeld, however, is deſned as the derivative of the displacement ſeld with respect to material coordinate α1 . Find the strain ſeld. Consider the axial displacement ſeld, u(α1 ), of a beam, for instance. The axial strain ſeld is
17.1 Interpolation of displacement ſelds 1
LINEAR
LINEAR
1
1 0.5
2 0
0.5 0 -0.5 -1
QUADRATIC
QUADRATIC
643
1
1 0.5
3 2
0
1 0.5 0 -0.5 -1
0.5
1
4
3
2 0 -1
-0.8
-0.6
-0.4
-0.2
0
s
0.2
0.4
0.6
0.8
1
Fig. 17.3. Shape function. Linear: top ſgure. Quadratic: middle ſgure. Cubic: bottom ſgure. Red circles indicate the node locations.
CUBIC
CUBIC
1 0 -1 -2 -1
-0.8
-0.6
-0.4
-0.2
0
s
0.2
0.4
0.6
0.8
1
Fig. 17.4. Derivatives of the shape function. Linear: top ſgure. Quadratic: middle ſgure. Cubic: bottom ſgure.
du du ds 1 du = u = = , (17.11) dα1 ds dα1 J ds where the second equality follows from the chain rule for derivatives and J = dα1 /ds is the Jacobian of the coordinate transformation. Notation (·) indicates a derivative with respect to material coordinate α1 . To evaluate the Jacobian, an increment of the position vector is evaluated based on eq. (17.1a), dr = N + ˆr ds, where N (s) is the displacement interpolation matrix deſned by eq. (17.6) and ˆr the array of nodal position vectors. It then follows that dα21 = drT dr = ˆr T N +T N + ˆr ds2 , and the Jacobian becomes dα1 = ˆr T N +T N + ˆr . (17.12) J= ds =
The strain ſeld, eq. (17.11), is now = u =
1 + 1 u = N + (s)ˆ u, J J
Example 17.3. Evaluation of the strain energy The axial strain energy stored in a beam element is % 1 A= S(α1 ) 2 (α1 ) dα1 , 2 0
(17.13)
(17.14)
where is the length of the beam element, S its axial stiffness, and (α1 ) the axial strain ſeld. Express the strain energy in terms of the elements nodal displacements. Equation (17.13) gives the axial strain ſeld in terms of the nodal displacements. Introducing this expression into eq. (17.14) leads to % 1 dα1 1 T +T A= S(α1 ) 2 u ˆ N (s)N + (s)ˆ u ds 2 0 J ds % 1 1 T 1 T ˆ ˆ ku S(α1 ) 2 N +T (s)N + (s) Jds u ˆ= u ˆ. = u 2 J 2 0
644
17 Finite element tools
In the second equality, the nodal displacement array was placed outside of the integral because nodal quantities are independent of the integration variable, s. The bracketed quantity is the stiffness matrix of the element. The above equation gives the desired result, the strain energy expressed in terms of the nodal displacements. The element’s stiffness matrix is deſned as % S(α1 ) +T k= N (s)N + (s) ds. 0 J(α1 ) Typically, numerical integration is used to evaluate this integral, leading to the following expression for the stiffness matrix k=
NG ( wi S(si ) i=1
J(si )
N +T (si )N + (si ).
where si and wi are Gauss’ points and weights, respectively, as discussed in section 18.3.
17.2 Interpolation of rotation ſelds The exact treatment of ſnite rotations is particularly important in multibody dynamics because ſnite rotations associated with the ſnite relative motions of the system’s components are combined with the ſnite elastic motions of the ƀexible components. Consider, for instance, the motion of a helicopter rotor blade in which elastic deformation of the blade are superimposed onto the rigid body rotation of the entire rotor. Interpolation of displacement ſelds is at the heart of the ſnite element method and basic interpolation techniques based on the deſnition of appropriate shape functions are reviewed in section 17.1. Application of the same, linear interpolation technique to ſnite rotation ſelds has been the subject of controversy, because ſnite rotation ſelds do not form a linear space. Crisſeld and Jeleni´c [325] were the ſrst to point out a major deſciency of the classical interpolation techniques applied to rotation ſelds: its lack of objectivity. By deſnition, a rigid body motion generates no strains; hence, the strain ſeld associated with a given displacement ſeld must remain unaffected by the addition of a rigid body motion to the displacement ſeld. If a computational scheme satisſes this condition, it is said to be “objective.” Crisſeld and Jeleni´c [325, 326] showed that classical interpolation formulæ applied to ſnite rotation ſelds violate this objectivity criterion. They prove the non-objectivity of the direct interpolation of total rotations [327], incremental rotations [328] and iterative rotations [304]. Crisſeld and Jeleni´c argue that “all of these formulations can be regarded as stemming from the same family, for which the following is valid: the interpolation is applied to the rotation between a particular reference conſguration and the current conſguration. With hindsight, the nature of this interpolation is bound to make all
17.2 Interpolation of rotation ſelds
645
of these formulations non-objective. The rotations interpolated in this way in general include rigid body rotations, so that the error, introduced by the interpolation, makes the resulting strain measures dependent on the rigid body rotation.” They also point out, however, that while the errors in the computed strain ſeld are small and decrease with mesh p- or h-reſnement, lack of objectivity persists if rotation increments or Newton-Raphson updates are interpolated. Crisſeld and Jeleni´c proposed a novel interpolation technique that guarantees objectivity by splitting rotations into rigid and elastic components: the sole elastic component is interpolated. This approach is akin to the co-rotational formulation [329], but retains the fully nonlinear strain-conſguration equations, rather than their linearized counterparts. Betsch and Steinmann [330] proposed an alternative approach to achieving objectivity: instead of interpolating ſnite rotation parameters, they interpolate the unit vectors forming the columns of the ſnite rotation tensor and proved that this approach also satisſes the objectivity criterion. Linear interpolation of unit vectors, however, does not yield unit vectors, nor does it preserve their orthogonality. Special procedures were developed to guarantee that the interpolated results lead to orthogonal rotation tensors. Numerical examples were shown that demonstrate the accuracy of numerical predictions. Romero et al. [331, 332] presented a comparison of different interpolation methods including the direct interpolation of ſnite rotations, the interpolation method proposed by Crisſeld and Jeleni´c [325], and two new approaches, based on 1) the non-orthogonal interpolation of rotations with modiſcation of geometrically exact beam theory and 2) the isoparametric interpolation of rotations followed by orthogonalization using polar decomposition. Numerical tests of all four methods showed that with the exception of the direct interpolation of ſnite rotations, the other methods are objective, path-independent and preserve the orthogonality of the rotation tensor. The proposed interpolation approaches, however, were shown to soften structural response, and could converge to erroneous solutions. They recommend the use of the interpolation approach of Crisſeld and Jeleni´c. Finally, Ibrahimbegovi´c and Taylor [333] also proposed interpolation techniques that satisfy the objectivity criterion for geometrically exact structural models. Update formulæ are based on an incremental approach and rely on the representation of ſnite rotations based on quaternion quantities, which must be stored at each node of the model. Special attention was paid to the implementation details for applied support rotations and the corresponding modiſcations of the residual vector and tangent matrix introduced by the follower forces and moments. Because of the percieved difſculties associated with the treatment of ſnite rotations, “rotationless formulations” have appeared in recent years. For instance, in the absolute nodal coordinate formulation [334], absolute displacements and global slopes are used as nodal coordinates, bypassing the need for ſnite rotations. Betsch and Steinmann [227] have advocated the use of the direction cosine matrix to represent ſnite rotations. It should be noted, however, that these rotationless formulations use more coordinates than the minimum set required to represent ſnite rotations, and hence, typically require more computational resources than their counterparts based on minimum set representations.
646
17 Finite element tools
In this section, the problem of interpolation of ſnite rotations within the framework of geometrically exact structural elements is examined. For computational efſciency, it is desirable to use a minimal set representation of ſnite rotations, i.e., three parameters only. While quaternions have been used in multibody dynamics simulations [250, 252], the computational costs of dealing with four parameters and the enforcement of the associated normality condition, eq. (13.20), have limited their use. The rescaling operation presented in section 13.6.2 is used to systematically eliminate singularities associated with such minimal set representations. The rescaling operation is based on the observation that addition of a rotation of magnitude φ = ±2π to a ſnite rotation leaves the associated rotation tensor unchanged. While the concept of objectivity is based on the invariance of the strain ſeld with respect to the addition of a rigid body motion to the rotation ſeld, the concept of rescaling is based on the invariance of the rotation tensor with respect to the addition of a rotation of magnitude φ = ±2π, i.e., R(φ, n ¯ ) = R(φ ± 2π, n ¯ ). In turn, this raises the question of invariance on the interpolation of ſnite rotation with respect to rescaling. It is shown that the basic interpolation algorithm proposed by Crisſeld and Jeleni´c [325] to achieve objectivity, is also invariant with respect to rescaling operations. The simple numerical tools used to interpolate displacement ſelds within onedimensional ſnite elements, such as the cable or beam elements were presented in section 17.1. The challenges associated with the interpolation of rotation ſelds are addressed in section 17.2.1, with special attention devoted to the impact of rescaling operations. Rescaling also impacts the choice of unknowns, as discussed in section 17.2.2, and a new algorithm is proposed for the interpolation of incremental quantities. Finally, numerical examples are discussed that demonstrated the simplicity and efſciency of the proposed approach when applied to complex, ƀexible multibody systems. 17.2.1 Finite element discretization Interpolation of the displacement ſeld within an element is at the heart of the ſnite element discretization procedure and is summarized in section 17.1. Interpolation is a linear operation, acting on the displacement ſeld which forms a linear space. Interpolation of the displacement ſeld Let arrays u ˆi , u ˆ and u ˆ f store the nodal displacements of the element at the beginning of a time step, the incremental nodal displacements, and the displacements at the end of a time step, respectively. Furthermore, let the displacement update at the nodes be written as u ˆf = u ˆi + u ˆ. It then follows that ui (s) + u(s) = N (s)(ˆ ui + u ˆ) = N (s)ˆ uf = uf (s),
(17.15)
where N (s) is the displacement interpolation matrix deſned by eq. (17.6) and variable s a non-dimensional quantity deſned along the span of the element. This important relationship implies that initial, ſnal, and incremental ſelds can all three be
17.2 Interpolation of rotation ſelds
647
interpolated with the same shape functions, and a simple update of the nodal values then guarantees compatibility of the interpolated displacement ſeld over the entire element. Interpolation of the rotation ſeld When formulating beam and shell elements, the kinematic description of the problem also requires interpolation of the rotation ſeld and its derivative, written as and c (s) = N (s)ˆc =
c(s) = N (s)ˆc,
1 + N (s)ˆc, J
(17.16)
1.5
1.5
1
1
0.5
0.5
0
c1
c1
respectively, where ˆc is the array that stores the rotation parameter vectors at the nodes of the element. This interpolation simply provides an approximation to the rotation ſeld within the element.
0
-0.5
-0.5
-1
-1
Rescaled node -1.5 -1
-0.8
-0.6
-0.4
-0.2
0
s
0.2
0.4
0.6
0.8
1
Fig. 17.5. Wiener-Milenkovi´c parameter, c1 , for the given rotation ſeld; no rescaling is used at node 4. Nodal rotations: (◦). Interpolation using eq. (17.16): solid line, corresponding Gauss point values: ( ). Relative nodal rotations: (♦). Interpolation of relative rotations: dashed line. Interpolation computed by algorithm 1: dashed-dotted line, corresponding Gauss point values: ().
-1.5 -1
-0.8
-0.6
-0.4
-0.2
0
s
0.2
0.4
0.6
0.8
1
Fig. 17.6. Wiener-Milenkovi´c parameter, c1 , for the given rotation ſeld; node 4 has been rescaled; for reference, the unscaled note 4 is indicated by (). Nodal rotations: (◦). Interpolation using eq. (17.16): solid line, corresponding Gauss point values: ( ). Relative nodal rotations: (♦). Interpolation of relative rotations: dashed line. Interpolation computed by algorithm 1: dashed-dotted line, corresponding Gauss point values: ().
Figure 17.5 shows the interpolated rotation ſeld for a four-noded beam element using the cubic interpolation functions given by eq. (17.5). The rotations at the four nodes are deſned by four rotation angles, φ1 = 145◦ , φ2 = 160◦ , φ3 = 170◦ , φ4 = 181◦ , and associated unit vectors, ⎤ ⎡ 0.3049 0.3262 0.3193 0.3105 ¯2, n ¯3, n ¯ 4 ] = ⎣0.6097 0.6461 0.6095 0.5485⎦ . [¯ n1 , n 0.7316 0.6900 0.7256 0.7763
648
17 Finite element tools
The interpolated rotation ſeld was computed using eq. (17.16), and the ſrst component, c1 , of the Wiener-Milenkovi´c rotation parameter vector is shown in ſg. 17.5; similar results are obtained for the other two components, c2 and c3 . The curvature can be computed in a similar manner as κ(s) = H c (s), where the tangent tensor, H(c), is deſned by eq. (13.55) and c(s) and c (s) by eq. (17.16). Figure 17.7 shows the ſrst component, κ ˆ 1 , of the curvature vector. Although the interpolation procedure of eq. (17.16) looks reasonable considering the results shown in ſg. 17.5, it suffers several serious drawbacks. First, let ˆci , ˆc and ˆcf be the nodal rotation parameter vectors at the beginning of a time step, for the incremental rotation, and at the end of a time step, respectively. Proceeding as was done above for the displacement ſeld implies that cf (s) = ci (s) + c(s) if the nodal updates are selected as ˆcf = ˆci + ˆc. Unfortunately, as discussed in section 4.9, rotations do not form a linear space; they must be composed, not added. At the nodes, rotations should be updated using the following composition formula, ˆcf = ˆc ⊕ˆci , where the notation ⊕ is used to indicate the composition operation, as deſned in eq. (13.82). Furthermore, the nonlinear character of composition operations and the linear character of the interpolation operation imply that cf (s) = c(s) ⊕ ci (s) if ˆcf = ˆc ⊕ ˆci . Consequently, if the nodal rotations are updated using the composition formula, the interpolated rotation ſeld does not satisfy the same composition formula at all points along the element. 0.5
1
0.4
0
-1
0.2
-2
k1
k1
0.3
0.1
-3
0
-4
-0.1
-5
-0.2 -1
-0.8
-0.6
-0.4
-0.2
0
s
0.2
0.4
0.6
0.8
1
Fig. 17.7. First component of the curvature vector, κ1 , based on interpolation using eq. (17.16): solid line, corresponding Gauss point values: (). Curvatures computed by algorithm 1: dashed-dotted line, corresponding Gauss point values: ().
-6 -1
-0.8
-0.6
-0.4
-0.2
0
s
0.2
0.4
0.6
0.8
1
Fig. 17.8. First component of the curvature vector, κ1 , based on interpolation using eq. (17.16): solid line, corresponding Gauss point values: (). Curvatures computed by algorithm 1: dashed-dotted line, corresponding Gauss point values: ().
The second drawback becomes obvious once the rescaling operation presented in section 13.6.2 is taken into account. The rotation at the fourth node of the element is of magnitude φ4 = 181◦ > 180◦, and hence, should be rescaled to avoid singularities. The Wiener-Milenkovi´ c rotation parameter vector at this node
is cT4 = 1.253, 2.214, 3.132 , c4 = 4.035 > 4, and the rescaled parameter vec-
17.2 Interpolation of rotation ſelds
649
† tor is c†T 4 = −1.231, −2.175, −3.078 , c4 = 3.965 < 4, as expected from the results presented in section 13.6.2. Figure 17.6 shows the rotation ſeld interpolated using eq. (17.16) in the presence of rescaling. Note that the results presented in this ſgure should be identical to those shown in ſg. 17.5 because they correspond to the interpolation of identical conſgurations: indeed, the rotation tensor at node 4 is uniquely deſned, but represented by different rotation parameters, c4 and c†4 , due to rescaling. Clearly, the linear interpolation operation of eq. 17.16 is not invariant under the rescaling operation. The curvature ſeld is shown in ſg. 17.8 and clearly, in the presence of rescaling, the results are erroneous: without rescaling, the three Gauss point values of the ſrst curvature component are κ1 = -0.048, 0.230, and 0.322, respectively, as compared to κ1 = -0.208, 0.308, and -5.0521, respectively, in the presence of rescaling. The Crisſeld and Jeleni´c interpolation algorithm Clearly, a more robust interpolation approach is necessary to deal with rotations in the presence of rescaling; the following algorithm was proposed by Crisſeld and Jeleni´c [325]. Algorithm 1 (Rotation interpolation) Interpolation of a rotation ſeld deſned by its rotation parameter vectors, ˆc, at the nodes of a ſnite element. Step 1. Compute the nodal relative rotations, ˆr , by removing the rigid body rotation, ˆc1 , from the rotation at each node, ˆr = ˆc1− ⊕ ˆc. Step 2. Interpolate the relative rotation ſeld, r(s) = N (s)ˆr, and its derivative, r (s) = N (s)ˆr . Find the curvature ſeld, κ = R(c1 )H(r) r . Step 3. Restore the rigid body rotation removed in step 1, c(s) = c1 ⊕ r(s). Algorithm 1 removes the possible effects of rescaling from the interpolation procedure. In step 1, the relative rotations of the nodes with respect to node 1 are computed using the composition formula; note that the relative rotation ſeld could be computed with respect to any of the nodes of the element, node 1 is simply a convenient choice. It is assumed here that the relative rotations within one single element are small enough that no rescaling is needed within the element, i.e., within the element, |φr | < π. If this condition were not to be satisſed, a ſner mesh would be required to limit the relative rotation within each element. Next, these relative rotations are interpolated using standard procedures. Finally, the interpolated relative rotation is composed with the rotation of node 1 to ſnd the interpolated rotation ſeld. The interpolated rotation ſeld computed by algorithm 1 is also shown in ſgs. 17.5 and 17.6. Because the nodal rotations presented in these ſgures only differ by the rescaling of node 4, the relative rotation ſelds are identical, the corresponding curvature ſelds are identical, as are the interpolated rotation ſelds. The interpolated rotation ſeld seems to present a discontinuity at s = 0.973 in both ſgures: this is due to the rescaling operation in step 3 of algorithm 1, but does not affect the quality of the interpolation. In fact, the interpolation procedure of algorithm 1 is able to deal with the discontinuities inherent to the required rescaling operations. The presence of
650
17 Finite element tools
these discontinuities, however, has implications on the linearization of the equations of motion, as discussed in section 17.2.2. The third drawback of interpolation based on eq. (17.16) is its lack of objectivity when computing strain components. As shown in section 16.3.1, the strain measures of geometrically exact beam theory are invariant with respect to the addition of a rigid body motion. Because algorithm 1 is based on the interpolation of relative rotation, the addition of a rigid body motion is automatically ſltered out from the interpolation step, ensuring the objectivity of the process. Jeleni´c and Crisſeld [326] studied the lack of objectivity of interpolation schemes based on eq. (17.16) and concluded that “The non-invariance and path-dependence in these formulations decrease with both p-reſnement and h-reſnement and in practical applications cannot always be easily spotted.” These conclusions are supported by the data presented here: in ſg. 17.7, the curvatures computed based on eq. (17.16) (non-objective) are nearly identical to those computed with algorithm 1 (objective). In fact, at the Gauss points, the curvature component, κ1 , computed by the two approaches only differ by 0.16, -0.085 and 0.16%, respectively. These discrepancies are minute compared to the gross disparities observed in ſg. 17.8 in the presence of rescaling. The objectivity of the strain interpolation resulting from the use of algorithm 1 typically provides modest improvements in the quality of the interpolated strain ſeld, but is indispensable when dealing with rotation ſelds involving potential rescaling. 17.2.2 Total versus incremental unknowns Multibody simulations typically proceed in discrete time steps. Figure 17.9 shows the inerti :Initial tial frame of reference, the reference, i.e., unconfiguration stressed, conſguration of the beam at time t = 0, and its conſgurations at the beginning and u, R u 0, R 0 ui, Ri end times of a typical time step, denoted ti and tf , respectively. Each frame is related to its parent frame by a ſnite motion characterized by a u f, R f displacement vector and a rotation tensor, all retf :Final t0 :Reference configuration solved in the inertial frame. It is assumed that configuration the dynamic simulation has successfully proceeded up to time ti , i.e., the corresponding disFig. 17.9. Conſguration of the sysplacement and rotation ſelds, denoted ui and tem at various instants in time. Ri , respectively, are known. Let ci be a vectorial parameterization of the rotation tensor Ri . To advance the solution from the initial to the ſnal time of the time step, two sets of unknowns can be selected: the incremental displacements and rotations, denoted u and R, respectively, or the total displacements and rotations, denoted uf and Rf , respectively, see ſg. 17.9. Let c and cf be parameterizations of the rotation tensors R and Rf , respectively. From a kinematic viewpoint, both sets of unknowns are equivalent. Inertial Frame
17.2 Interpolation of rotation ſelds
651
In typical dynamic simulations, however, small time steps must be selected to achieve convergence and guarantee the accuracy of the solution. Consequently, it can be assumed that incremental rotations will be of magnitude |φ| < π; in fact, for most practical cases, |φ| π; indeed, |φ| = π implies that a component of the system rotates by 180◦ within a single time step. It cannot be assumed, however, that |φf |, the rotation associated with rotation tensor Rf , is small, in fact, |φf | > π is likely to occur. The implication of these observations is clear: if total rotations are used as unknowns, some of the rotation parameters, cf , will be rescaled, as required, whereas if incremental rotations are used as unknowns, none of the unknown parameters, c, will be rescaled. Interpolation algorithm 1 was shown to seamlessly handle rescaling, however, when dealing with dynamic simulations, additional considerations must be taken into account. Spatial and time discretization algorithms typically transform the governing partial differential equations of complex multibody systems into a set of nonlinear algebraic equations, which are solved in an iterative manner using the Newton-Raphson method. Inherent to this approach is a linearization process that transforms the nonlinear algebraic equations into their linearized counterparts. Consider, for instance, the linearization of the curvature vector, κ = H(c)c , which will appear in the expression for the elastic forces of a beam element. Application of the linearization procedure leads to Δκ = H(c)Δc +M (c, c )Δc, where operator M (c, c ) is deſned by eq. (13.67), and hence, operators H(c) and M (c, c ) appear in the expression of the tangent stiffness matrix of the element. Let c and c† denote a rotation parameter vector and its rescaled counterpart, respectively, as discussed in section 13.6.2. By construction of the rescaling operation, R(c) = R(c† ), but it is easily veriſed that H(c) = H(c† ) and M (c, c ) = M (c† , c† ). Clearly, while intrinsic quantities such as the rotation tensor, the curvature vector, or elemental elastic forces are invariant to rescaling, and while the interpolation operation can be made invariant to the same rescaling through the use of algorithm 1, the tangent stiffness matrix is not invariant to rescaling. The implications of this lack of invariance are easily understood by considering the situation depicted in ſg. 17.6. Evaluation of the tangent stiffness matrix at the Gauss points uses the interpolated rotation ſeld and its derivative, but ignores the fact node 4 was rescaled. The tangent stiffness matrix will be evaluated as if the rescaling of node 4 never took place, i.e., the equations are linearized about the wrong point. Hence, the search direction in the Newton-Raphson iteration process will be erroneous, which can ultimately cause failure of the iteration process. 17.2.3 Interpolation of incremental rotations In view of the above discussion, it is desirable to work with incremental rotations because they remain small and do not require rescaling. The tangent stiffness matrix then always corresponds to the correct linearization of the problem. This contrasts with the choice of total rotations as unknowns for which these desirable features cannot be guaranteed.
652
17 Finite element tools
The choice of incremental nodal rotations as unknowns requires interpolation of the incremental rotation ſeld to compute the elemental elastic forces and tangent stiffness matrix. This task cannot be performed with the help of eq. (17.16): as already pointed out in section 13.6.2, the nonlinear nature of the composition operation is incompatible with the linear interpolation operation. An alternative approach is proposed for this operation. Algorithm 2 (Incremental rotation interpolation) Interpolation of the incremental rotation ſeld between two conſgurations deſned by nodal rotation parameter vectors, ˆci and ˆcf , of a ſnite element. Step 1. Use algorithm 1 to compute the interpolated rotation ſeld, ci (s), based on nodal values ˆci . Step 2. Use algorithm 1 to compute the interpolated rotation ſeld, cf (s), based on nodal values ˆcf = ˆc ⊕ ˆci . Step 3. Compute the incremental rotation ſeld by composition: c(s) = cf (s) ⊕ c− i (s). This approach is different from that proposed by Cardona and G´eradin, who directly interpolated incremental rotations using eq. 17.16. It is also different from the algorithm proposed by Crisſeld and Jeleni´c [325]. Example 17.4. The use of total versus incremental unknowns In this example, the use of total versus incremental unknowns will be contrasted, to underline the difſculties associated with the use of total rotations in the formulation of dynamic problems. Consider a free-free beam featuring the following physical properties: axial stiffness S = 9.28 kN, shearing stiffness K22 = K33 = 3.57 kN, torsional stiffness J = 65.2 N·m2 , bending stiffness I22 = I33 = 32.6 N·m2 , and mass per unit length m = 0.35 kg/m. The beam is modeled using a single cubic element and is subjected to two mutually orthogonal end bending moments Q2 and Q3 , both acting in directions normal to the axis of the beam. Both bending moments have a triangular time history: starting from zero value at time t = 0, growing linearly to a maximum value of 0.3 N·m at t = 0.5 s, linearly decreasing to a zero value at time t = 1 s, and remaining zero at all subsequent times. The dynamic response of the beam was computed using time step sizes Δt = 1 and 0.1 ms, with formulations using both total and incremental unknowns. Algorithms 1 and 2 were used to interpolate the total and incremental rotations, respectively. Figure 17.10 shows the third component of the Wiener-Milenkovi´c rotation parameter vector at the beam’s end opposite to the applied bending moments, for Δt = 1 ms; the formulations using total and incremental unknowns lead to nearly identical predictions. The rescaling operation that occurs at time t = 0.929 s is evident in ſg. 17.10. All four nodes of the element, however, are rescaled simultaneously and the rotation interpolation procedure performs well with both total and incremental unknowns. Next, the time step size was reduced to Δt = 0.1 ms. Due to this smaller time step size, the node at the unloaded end of the beam was rescaled at time t = 0.9284
17.2 Interpolation of rotation ſelds
653
3
2
c3
1
i3
0
i2
F i1
-1
W
-2
-3 0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Time [s]
Fig. 17.10. Time histories of third component of the conformal rotation vector at the end node: incremental formulation: solid line; total formulation: dashed line.
Fig. 17.11. Rotating cantilevered beam subjected to transverse tip force.
s, while the other three nodes of the element were not. As expected, the formulation using total unknowns failed to converge at that time step, in contrast with that using incremental unknowns that proceeded uneventfully. This example call for the following observations. If the rotation ſeld is interpolated with eq. (17.16) without ever rescaling the rotation parameters, the computation will proceed smoothly at ſrst; although the interpolated strain ſeld is not objective, errors remain small, particularly if higher-order elements are used with ſne meshes. During the simulation, rotation magnitudes grow; no matter what parameterization is used to represent rotations, a singularity will eventually be reached and the simulation will fail at that point. On the other hand, if the rotation ſeld is interpolated with eq. (17.16) with rescaling of the rotation parameters, the computation will proceed smoothly at ſrst, although the interpolated strain ſeld is not objective. When the ſrst node of the model is rescaled, the strain ſeld computed in the elements connected to this node will be grossly erroneous, see ſg. 17.8, and typically, convergence will not be reached for that time step at which rescaling occurs. Finally, if algorithm 1 is used for the interpolation of the strain ſeld, the simulation is not affected by rescaling of the rotation parameters that takes place whenever required, and the computed strain ſeld is objective. The rescaling operation becomes transparent to the computation process. However, evaluations of the tangent stiffness matrix based on interpolations of total unknowns computed with algorithm 1 can yield erroneous search directions in the Newton-Raphson process used to solve the nonlinear equations, which are inherent to time-stepping procedures. This can destabilize simulations. Therefore, the use incremental unknowns in conjunction with algorithm 2 is recommended. This method preserves the objectivity of geometrically exact formulations, yields tangent stiffness matrices and residual vectors that are invariant to the rescaling of rotations, and enables the use of geometrically exact structural models in multibody simulations.
654
17 Finite element tools
10
6
10
5
10
4
10
3
10
2
10
ERROR IN SHEAR FORCE [%]
ERROR IN SHEAR FORCE [%]
Example 17.5. Assessing the accuracy of algorithm 2 Figure 17.11 depicts a cantilevered beam rotating about an axis normal to its axis and passing through its root. The beam’s physical properties are identical to those used in example 17.4 and it is subjected to a transverse tip load, F , linearly increasing from 0 to 50 N in one second. The beam rotates at an angular speed, Ω, linearly increasing from 0 to 4 rad/s in the same time. The system was simulated for 1.5 s with a time step size h = 0.01 s. Given the results of example 17.4, the simulations presented here only use incremental unknowns. The direct interpolation of rotation increments through eq. (17.16) will be contrasted with the interpolation technique described in algorithm 2. Figure 17.12 shows the error in the beam’s root shear force as a function of the number of linear elements used to mesh the beam. Figure 17.13 shows the corresponding results for quadratic elements. The reference solution for the error analysis was obtained using a 250 cubic element mesh for which convergence was established.
1
10
0
10
-1
0
2
4
6
8
10
12
14
NUMBER OF ELEMENTS
16
18
20
Fig. 17.12. Beam root shear force error versus number of linear elements. Interpolation using algorithm 2: solid line; direct interpolation using eq. 17.16: dashed line.
10
4
10
3
10
2
10
1
10
0
10
-1
10
-2
10
-3
10
-4
10
-5
0
2
4
6
8
10
12
14
16
18
20
NUMBER OF ELEMENTS
Fig. 17.13. Beam root shear force error versus number of quadratic elements. Interpolation using algorithm 2: solid line; direct interpolation using eq. 17.16: dashed line.
For both linear and quadratic elements, direct rotation interpolation using eq. (17.16) leads to large errors when coarse meshes are used, but these errors decrease rapidly for both h- and p-reſnements. Indeed, the errors observed for the quadratic element mesh are far smaller than those for the linear element mesh. When algorithm 2 is used to interpolate rotation increments, errors are further reduced, although this reduction is less pronounced for ſner meshes. Since the computational cost associated with the use of algorithm 2 is nearly identical to that of using eq. (17.16), the use of the former is advisable. Indeed, achieving a 0.01% error in root shear force with quadratic elements requires 5 elements with algorithm 2, but 16 elements for eq. (17.16); this will result in a nearly threefold gain in computational cost when using algorithm 2.
17.2 Interpolation of rotation ſelds
655
Example 17.6. Rotorcraft tail rotor transmission This example presents the modeling of a helicopter supercritical tail rotor transmission. Figure 17.14 shows the conſguration of the system. The aft part of the helicopter is modeled and consists of a 6 m fuselage section that connects at a 45 degree angle to a 1.2 m projected length tail section. This structure supports the transmission to which it is connected at points M and T by means of 0.25 m support brackets. The transmission is broken into two shafts, each connected to ƀexible couplings at either end. The ƀexible couplings are represented by ƀexible joints, consisting of concentrated springs.
i2 S R
Rotor Q
Gear box 1 Shaft 1 Fuselage Rigid body Beam Flexible joint Revolute joint Ground clamp
Shaft 2 T G Tail M
Gear box 2
i1 Q
T~ Side view of tail rotor
H Plane of rotor
Fig. 17.14. Conſguration of a tail rotor transmission.
Shaft 1 is connected to a revolute joint at point S, and gear box 1 at point G. Shaft 2 is connected to gear box 1 and gear box 2 which in turn, transmits power to the tail rotor. The plane of the tail rotor is at a 0.3 m offset with respect to the plane deſned by the fuselage and tail, and its hub is connected to gear box 2 by means of a short shaft. Each tail rotor blade has a length of 0.8 m and is connected to the rotor hub at point H through a rigid root attachment of length 0.2 m. The gear ratios for gear boxes 1 and 2 are 1:1 and 2:1, respectively. The fuselage has the following physical characteristics: axial stiffness S = 687 MN, bending stiffnesses I22 = 19.2 and I33 = 26.9 MN·m2 , torsional stiffness J = 8.77 MN·m2 , and mass per unit span m = 15.65 kg/m. The properties of the tail are one third of those of the fuselage. Shafts 1 and 2 have the following physical characteristics: axial stiffness S = 22.9 MN, bending stiffnesses I22 = 26.7 and I33 = 27.7 kN·m2 , torsional stiffness J = 22.1 kN·m2 , and mass per unit span m = 0.848 kg/m. The center of mass of the shaft has a 1 mm offset with respect to the shaft reference line. The small difference in bending stiffnesses together with the center of mass offset are meant to represent an initial manufacturing imperfection or an unbalance in the shaft.
656
17 Finite element tools
The stiffness properties of the ƀexible couplings are as follows: axial stiffness 5 kN/m and damping 0.5 N·sec/m, transverse stiffnesses 1 MN/m, torsional stiffness 0.1 MN·m/rad, and bending stiffnesses 0.1 kN·m/rad. Finally, gear boxes 1 and 2 have a concentrated mass of 5 kg each, and the tail rotor a 15 kg mass with a polar moment of inertia of 3 kg·m2 . At ſrst, a static analysis of the system was performed for various constant angular velocities of the drive train. The natural frequencies of the system were computed about each equilibrium conſguration. When shaft 1 does not rotate, its two lowest natural frequencies of shaft 1 were found to be ω1 = 46.9 and ω2 = 49.1 rad/s. According to linear theory, the system is stable when the shaft angular velocity is below ω1 or above ω2 , but unstable between theses two speeds. The system was loaded by a torque acting at the root of shaft 1, featuring the following time history: Q(t) = 50 (1 − cos 2πt) for 0 < t < 1 s, Q(t) = 0 for 1 < t < 2 s, Q(t) = 6 (1 − cos 2πt) for 2 < t < 3 s, and Q(t) = 0 for 3 < t < 6 s. After 1 s, the angular velocity of shaft 1 stabilizes at about 45 rad/s, below the critical speed. The torque applied for 2 < t < 3 s then accelerates the transmission through the critical zone to reach an angular velocity of 50.5 rad/s. A constant time step size h = 0.5 ms was used for the entire simulation.
M1 [N.m]
4
c1
2 0
0.4 0.2 0
-0.2
-2
400
M2 [N.m]
-0.4
e2
-4 1 0.5 0
0
-200
-0.5 -1
-400
52
400
M3 [N.m]
w1 [rad/s]
200
50 48 46 44
200 0
-200 -400
2
2.1
2.2
2.3
2.4
2.5
2.6
2.7
2.8
2.9
3
TIME [s]
Fig. 17.15. Top ſgure: ſrst component of the Wiener-Milenkovi´c rotation parameter vector. Middle ſgure: components of unit vector e¯2 . Bottom ſgure: shaft 1 mid-span angular speed.
4
4.1
4.2
4.3
4.4
4.5
4.6
4.7
4.8
4.9
5
TIME [s]
Fig. 17.16. Moments at the mid-span location of shaft 1. Top ſgure: torque, M1 . Middle ſgure: bending moments M2 . Bottom ſgure: bending moments M3 .
Figure 17.15 shows the dynamic response at shaft 1 mid-span position for 2 < t < 3 s. The top portion of the ſgure shows the ſrst component of the WienerMilenkovi´c rotation parameter vector: a rescaling operation occurs for each complete revolution of the shaft. The middle portion of the ſgure shows the components of the unit vector e¯2 , i.e., the second column of the rotation tensor. As expected, these quantities are continuous, as they do not “see” the rescaling operations. Finally, the bottom portion of the ſgure shows the angular velocity of the shaft. The horizontal dashed line indicate the unstable zone for the shaft.
17.3 Governing equations and linearization process
657
The angular velocity of shaft 1 passes through this critical zone fast enough to avoid the build up of lateral vibrations. Here again, the angular velocity is continuous, unaffected by the rotation rescaling operations. Figure 17.16 shows the torque, M1 , and the two bending moments, M2 and M3 , at shaft 1’s mid-span, for 4 < t < 5 s. Since the shaft has just passed through the critical zone, fairly large bending moments are observed. Here again, all quantities are continuous, despite the multiple rescaling operations. This example demonstrates the ability of the algorithm 2 to handle rotations of arbitrary magnitudes in complex, ƀexible multibody systems. The rescaling operations are applied at those nodes where they are required to avoid singularities in rotation representations. All other quantities, such as the rotation tensor, angular velocities, or bending moments are continuous and unaffected by the rescaling operations.
17.3 Governing equations and linearization process The governing equations for multibody systems can take many different forms. Prior to performing a dynamic analysis, it is often informative to carry out a static analysis, for which all inertial forces are assumed to vanish. Both linear and nonlinear static problems arise both with and without kinematic constraints, as discussed in section 17.3.1. Next, the equations governing linear structural dynamics problems will be reviewed in section 17.3.3, leading to the nonlinear problems discussed in section 17.3.4. Finally, the governing equations for typical multibody systems are presented, for holonomic and nonholonomic systems in sections 17.3.5 and 17.3.6, respectively. 17.3.1 Statics problems Consider ſrst a simple linear, unconstrained static problem characterized by a system of linear equations, K q = f, (17.17) where array q stores the n generalized coordinates of the system, K is the constant stiffness matrix, and f the externally applied forces. For complex elastic structures, static problems are typically formulated using ſnite element techniques [324, 197, 198]. A large number generalized coordinates, corresponding to the displacement components at all nodes, will be present and a two step procedure is generally used to solve linear system (17.17). First, the symmetric stiffness matrix is factorized as K = L D LT , where L is a lower triangular matrix and D a diagonal matrix. Next, the solution is found through backsubstitution. This algorithm, often referred to as the active column solver, is well documented numerous textbooks [324, 197, 198].
658
17 Finite element tools
Nonlinear static problems More often than not, the elastic forces are nonlinear functions of the generalized coordinates, and the governing equations express the equilibrium of the system as f E (q) = f , where f E are the elastic forces. This nonlinear static problem will be solved through successive linearization of the governing equations. Given an approximate solution, q, a more accurate solution, q + Δq, is sought, where Δq are the unknown increments in generalized coordinates. The new solution is assumed to satisfy the governing equations of the problem, i.e., f E (q + Δq) = f . A Taylor series expansion is performed about the known, approximate equilibrium solution, to ſnd ∂f E f E (q) + Δq + h.o.t. = f . (17.18) ∂q The externally applied load array, f , is assumed here to be known and independent of the generalized coordinates. If this is not the case, this array would also be approximated using a Taylor series expansion and treated in a manner similar to the elastic forces. Next, the solution increments, Δq, are assumed to be small quantities and the higher-order terms in the Taylor series expansion are neglected, leading to a linearized problem for these increments, K Δq = f − f E (q),
(17.19)
where array f − f E (q) is called the out-of-balance force array. The stiffness matrix, K(q), corresponds to the derivatives of the elastic forces with respected to the generalized coordinates, ∂f E (q) K(q) = . (17.20) ∂q This solution procedure, known as the Newton-Raphson method, is of an iterative nature. Starting from an approximate solution, q (1) , the stiffness matrix, K(q (1) ), is evaluated ſrst and increments are obtained from the solution of system (17.19) as Δq (1) = K −1 (q (1) )[f − f E (q (1) )]. The new solution is then q (2) = q (1) + Δq (1) . At the kth iteration, the approximate solution is denoted q (k) and the next approximate solution, denoted q (k+1) , is obtained as q (k+1) = q (k) + K −1 (q (k) )[f − f E (q (k) )]. At convergence, the norm of the out-of-balance force array becomes small, f − f E (q (k) ) < , where is a small positive number, which implies that q (k) is a good approximation to the exact solution of the nonlinear problem. Because the most computationally expensive step of the procedure is the factorization of the stiffness matrix, it often efſcient to keep the stiffness matrix unchanged for several iterations. For instance, at the second iteration, the new approximation is obtained as q (3) = q (2) + K −1 (q (1) ){f − f E (q (2) )}. The sole elastic forces, f E (q (2) ), are evaluated at the second iteration; the stiffness matrix and its factorization are kept unchanged, resulting in considerable computational savings. This approach is known as the modiſed Newton-Raphson method.
17.3 Governing equations and linearization process
659
Constrained statics problems Consider now a static problem subjected to m holonomic constraints, denoted C(q) = 0. The nonlinear equilibrium equations of the problem are stated as f E (q) + B T (q)λ = f ,
(17.21a)
C(q) = 0.
(17.21b)
The constraint forces, B T λ, appear in the equilibrium equation of the system, eq. (17.21a), and eq. (17.21b) states the constraints imposed on the system. The constraints were enforced via the Lagrange multiplier technique; λ denotes Lagrange’s multipliers and B(q) the constraint matrix deſned by eq. (9.37). Here again, system (17.21) will be solved using a linearization technique. The two sets of equations are expanded using Taylor series about a known, approximate solution, (q, λ), leading to f E (q) + K Δq + B T (q)λ + B T (q)Δλ + K b (q, λ)Δq + h.o.t. = f ,
(17.22a)
C(q) + K (q)Δq + h.o.t. = 0.
(17.22b)
c
The linearization of the elastic forces involves the stiffness matrix deſned by eq. (17.20). The linearization of the constraint forces involves two matrices, the constraint matrix B deſned by eq. (9.37) and the constraint related stiffness matrix, K b (q, λ) =
∂(B T λ) . ∂q
(17.23)
Finally, the linearization of the constraint gives rise to matrix K c deſned as K c (q, t) =
∂C . ∂q
(17.24)
Here again, solution increments are assumed to be small quantities and higherorder terms in the Taylor series expansion are neglected, leading to a linearized problem for these increments, Δq K + K b BT f − f E (q) . (17.25) = Δλ Kc 0 −C(q) An iterative approach based on the Newton-Raphson or modiſed Newton-Raphson method then yields the solution of the nonlinear constrained system. 17.3.2 Problems Problem 17.1. Rigid body with a root spring Consider the rigid bar of length L with a root spring, as depicted in ſg. 17.17. The bar is subjected to a tip vertical force P . The root spring is nonlinear such that M = k1 θ + k3 θ3 ,
660
17 Finite element tools
P k1 q + k3 q
C
P
3
L
q
q
A
D B
L Fig. 17.17. Rigid body with a root spring.
Fig. 17.18. Shallow arch under center load.
where M is the root moment and θ the root rotation. Let k1 /L = 50 N/rad and k3 /L = 15 N/rad3 . (1) Write the nonlinear governing equations of the system. (2) Linearize the governing equations. (3) Use an iterative technique to ſnd the response of the system θ as a function of the applied load P . Plot θ as a function of P ∈ [0, 600] N.
Problem 17.2. Snap-through behavior of a shallow arch The shallow arch depicted in ſg. 17.18 is subjected to a center vertical load, P . The shallow arch is modeled by two articulated bars of length L, pinned at points A, B, and C. At point C, a vertical load P is applied and the deƀection of point C under the load is denoted Δ. The constitutive law for the two identical bars is F = ke, where k is the axial stiffness of the bar, F the applied axial load, and e = (L − L)/L the resulting axial strain. The initial length of the bar is L and its length under load is denoted L . (1) Find the strain in the bar in ¯ = Δ/L. (2) Find the non-dimensional terms of the non-dimensional vertical displacement, Δ ¯ (3) On one graph, plot P¯ versus Δ ¯ for θ = 10, 20, applied load, P¯ = P/(2k) versus Δ. and 30 degrees. (4) The shallow arch snaps through when dP/dΔ = 0. Find the strain at snap-through, es and plot es versus angle θ. (5) Find the vertical deƀection at snap-through, ¯s versus θ. (6) Find the applied load at snap-through, P¯s and plot P¯s versus θ. ¯s and plot Δ Δ
17.3.3 Linear structural dynamics problems Consider next a linear structural dynamics problem characterized by the following equations of motion M ¨q + C q˙ + K q = f (t), (17.26) where array q stores the n generalized coordinates, M , C, and K are the constant mass, damping, and stiffness matrices of the system, respectively, and f (t) the externally applied, time-dependent force array. These equations of motion form a set of linear, second-order, coupled ordinary differential equations. For complex elastic structures, dynamics problems are typically formulated using the ſnite element method. Procedures for formulating the mass and stiffness matrices of such problems are well documented in textbooks [324, 197, 198]. Because energy dissipating mechanisms are difſcult to model rigorously, the damping matrix is often approximated. The Rayleigh damping assumption is often used and corresponds to the choice of a damping matrix written as a linear combination of the mass and stiffness matrices, C = αM + βK, where parameters α and β are selected based on experimental observations.
17.3 Governing equations and linearization process
661
The initial conditions of the problem are the initial displacements and velocities of the system, q(ti ) = q i , and q˙ (ti ) = v i , where ti is the initial time of the simulation. The initial accelerations can be obtained by expressing the dynamic equilibrium conditions, eq. (17.26), at time ti , to ſnd ¨q (ti ) = M −1 f (ti ) − C v i − K q i . 17.3.4 Nonlinear structural dynamics problems Many practical engineering problems involve dynamical systems presenting large displacements and rotations, i.e., geometric nonlinearities, or large deformations resulting in nonlinear material behavior, i.e., material nonlinearities. Such nonlinear structural dynamics problems are described by the following dynamic equilibrium equations M (q, t)a + f (q, v, t) = 0, (17.27) where arrays q, v = q˙ , and a = ¨q store the n generalized displacement, velocity, and acceleration components of the system, respectively. The mass matrix is symmetric, positive-deſnite, and array f stores the dynamic and externally applied forces. These equations of motion form a set of second-order, coupled, nonlinear, ordinary differential equations, and exhibit a linear dependency on generalized accelerations because they are derived from Newton’s second law. For multibody systems, the generalized coordinates are likely to involve both displacements and rotations. For instance, six generalized coordinates, three displacements and three rotations, are used to represent the conſguration of rigid bodies and the formulation of geometrically exact beams presented in section 16.3.1 calls for both displacements and rotation ſelds. Rotations could be represented using the vectorial parameterization of rotation discussed in section 13.4. Because the problem is nonlinear, it is necessary to linearize the equations of motion following a procedure similar to that developed in section 17.3.1 for nonlinear static problems. Given an approximate solution characterized by generalized displacement, velocity, and acceleration arrays, denoted q, v, and a, respectively, a more accurate solution characterized by arrays q+Δq, v+Δv, and a+Δa, is sought, where Δq, Δv, and Δa are the unknown increments in generalized displacements, velocities, and accelerations, respectively. The new solution is assumed to satisfy the governing equations of the problem, i.e., M (q + Δq, t)(a + Δa) + f (q + Δq, v + Δv, t) = 0. A Taylor series expansion is performed about the known, approximate solution, to ſnd M (q, t)a + f (q, v, t) + K(q, v, a, t)Δq + G(q, v, t)Δv + M (q, t)Δa + h.o.t = 0. The stiffness, gyroscopic, and mass matrices, denoted K, G, and M , respectively, are deſned as the derivative of all forces with respect to the displacement, velocity, and acceleration components, respectively, i.e.,
662
17 Finite element tools
K(q, v, a, t) =
∂(M a + f ) , ∂q
∂(M a + f ) , ∂v ∂(M a + f ) M (q, t) = , ∂a
G(q, v, t) =
(17.28a) (17.28b) (17.28c)
Next, the solution increments, Δq, Δv, and Δa, are assumed to be small quantities and the higher-order terms in the Taylor series expansion are neglected, leading to a linearized problem for these increments, K(q, v, a, t)Δq + G(q, v, t)Δv + M (q, t)Δa = −M (q, t)a − f (q, v, t). (17.29) In their linearized form, the governing equations of the system now resemble their counterparts for linear systems, eqs. (17.26). The mass, gyroscopic, and stiffness matrices, however, are now functions of the states of the system. As discussed in section 12.6.1, it is desirable to scale the equations of motion equations to ſnd " ! h2 KΔq + hGhΔv + M h2 Δa = − M h2 a + h2 f , (17.30) where h is the time step size that will be use in the time integration procedure. 17.3.5 Multibody dynamics problems with holonomic constraints Multibody systems are typically subjected to constraints; the present section deals with nonlinear multibody systems featuring n generalized coordinates and m holonomic. Problems involving nonholonomic constraints will be addressed in section 17.3.6. Nonlinear multibody systems subjected to holonomic constraints are described by Lagrange’s equations of the ſrst kind developed in section 11.1 and repeated here for convenience, M (q, t)a + f (q, v, t) + B T (q, t)λ = 0,
(17.31a)
C(q, t) = 0.
(17.31b)
Array f stores the dynamic and externally applied forces. The constraint forces associated with the holonomic constraints are given by the term B T λ, where B is the constraint matrix and λ the array of Lagrange’s multipliers used to enforce the constraint. Although equations (17.31) describe fully nonlinear multibody systems, their dependency on Lagrange’s multipliers is linear. Because the governing equations are nonlinear, their solution calls once more for a linearization process. The linearization of the dynamical terms, M (q, t)a + f (q, v, t), gives rise to the stiffness, gyroscopic, and mass matrices deſned in eqs. (17.28). Linearization of the constraint forces gives rise to matrix K b deſned
17.3 Governing equations and linearization process
663
by eq. (17.23), and similarly, linearization of the constraint gives rise to matrix K c deſned by eq. (17.24). Following a procedure identical to that developed in section 17.3.4 for the equations of nonlinear structural dynamics, the linearized equations for the present problem are obtained. As discussed in section 12.6.1, it is desirable to scale the equations of motion equations to ſnd (h2 K + sK b )Δq + hGhΔv + M h2 Δa + sB T Δˆ μ = − M h2 a − h2 f − sB T μ ˆ, (17.32a) sK c Δq = − sC(q, t).
(17.32b)
where μ ˆ is the array of modiſed Lagrange’s multipliers deſned by eq. (12.23). 17.3.6 Multibody dynamics problems with nonholonomic constraints If the multibody system is subjected to nonholonomic constraints of the form given by eq. (11.2), Lagrange’s equations of the ſrst kind developed in section 11.1 are still applicable, M (q, t)a + f (q, v, t) + B T (q, t)λ = 0,
(17.33a)
D = 0.
(17.33b)
The nonholonomic constraints are expressed by eq. (11.2) as D = B(q, t)v + b(q, t) = 0 and are assumed to present a linear dependency on the generalized velocities. The constraint forces associated with the nonholonomic constraints are given by the term B T λ, where B is the constraint matrix and λ the array of Lagrange’s multipliers used to enforce the constraint. Although equations (17.33) describe fully nonlinear multibody systems, their dependency on Lagrange’s multipliers is linear. Because the governing equations are nonlinear, their solution calls once more for a linearization process. The linearization of the dynamical terms, M (q, t)a + f (q, v, t), gives rise to the stiffness, gyroscopic, and mass matrices deſned in eqs. (17.28). Linearization of the constraint forces gives rise to matrix K b deſned by eq. (17.23). Finally, linearization of the nonholonomic constraints introduces the following matrix ∂D K d (q, v, t) = . (17.34) ∂q Following, once again, a procedure identical to that developed in section 17.3.4 for the equations of nonlinear structural dynamics, the linearized equations of present problem are obtained. As discussed in section 12.6.1, it is desirable to scale the equations of motion equations to ſnd (h2 K + sK b )Δq + hGhΔv + M h2 Δa + sB T Δˆ μ = −M h2 a − h2 f − sB T μ ˆ, (17.35a) shK d Δq + sBhΔv = −sB(q, t)¯ v − shb(q, t), (17.35b) where μ ˆ is the array of modiſed Lagrange’s multipliers deſned by eq. (12.23).
664
17 Finite element tools
17.4 The generalized-α time integration scheme Typical equations for static, structural dynamic, and multibody dynamic problems have been presented in the previous sections. For nonlinear statics problems, the Newton-Raphson procedure outlined in section 17.3.1 is used and transforms the solution of nonlinear algebraic problems into the solution of a sequence of linear, algebraic problems. The situation is different for structural and multibody dynamic problems. Linear structural dynamics problems were presented in section 17.3.3 and are characterized by eq. (17.26), which forms a set of ordinary differential equations in time. Time integration schemes transform these ordinary differential equations into a set of linear algebraic equations. For nonlinear structural and multibody dynamic problems, a similar path is followed. The Newton-Raphson procedure outlined in section 17.3.4 for structural dynamics or in section 17.3.5 or 17.3.6 for multibody dynamics problems with holonomic or nonholonomic constraints, respectively, is ſrst used to transform the nonlinear, ordinary differential equations of motion into a sequence of linear, ordinary differential equations. Time integration schemes then ſnally lead to sets of linear algebraic equations. Numerous time integration schemes have been used in multibody dynamics. For systems presenting a small number of degrees of freedom, explicit, predictor multicorrector algorithms such as the Adams-Bashforth integrator [199] are often used. Hairer and Wanner [26] present an exhaustive review of this ſeld. Because numerous degrees of freedom are generated by the discretization process inherent to ſnite element formulations, the resulting equations of motion typically involve many high frequency modes that are an artifact of the discretization process. Consequently, implicit schemes are used almost exclusively when dealing with ſnite element discretizations. The Hilber-Hughes-Taylor (HHT) integrator [135], the workhorse used in most commercial codes, is described in textbooks such as Hughes [197] or Bathe [198]. The HHT scheme was originally developed for linear structural dynamics problems [135]. Chung and Hulbert [136] later generalized this scheme as the generalized-α scheme. Because the HHT scheme is a particular case of the latter, the presentation focuses on the generalized-α scheme. Applications of this scheme to nonlinear structural and multibody problems are presented next. The presentation of the generalized-α scheme given in the sections below is limited to a description of the scheme and of its basic properties. Chung and Hulbert [136] proved that for linear structural dynamics problems, the scheme is secondorder accurate, unconditionally stable, and presents high frequency numerical damping; these three features are considered indispensable for the successful integration of large ſnite element systems [197, 198]. The need for numerical dissipation in time integration of large systems of linear equations was identiſed very early. Indeed, the average acceleration scheme proposed by Newmark [201] in 1959 is an energy preserving scheme for linear systems; the strict preservation of energy at each time step of the integration process precludes the presence of any numerical dissipation. Undesirable characteristics of this
17.4 The generalized-α time integration scheme
665
scheme were reported by Hughes [335]: in large systems, numerical round-off errors are sufſcient to provide excitation of the high frequency modes of the system. This energy does not dissipate, due to the strict energy preservation characteristic of the algorithm. This prompted the development of algorithms presenting high frequency numerical dissipation in linear systems, such as the HHT and generalized-α schemes. Because the generalized-α scheme was shown to be a powerful tool for the time integration of large structural dynamics problems, extending its use to the simulation of constrained dynamical systems seems natural. Cardona and G´eradin [200] have shown that numerical damping is critical to avoid numerical oscillations in Lagrange’s multipliers. Although their analysis is restricted to linear problems, stable predictions are presented for nonlinear test cases. A more formal study of the spectral behavior of the HHT schemes in constrained linear system is given by Farhat et al. [336]. The generalized-α scheme has also been applied to the solution of the equations of constrained dynamical systems after index reduction, see section 12.2. Lunk and Simeon [337] and Jay and Negrut [338] have proved second-order accuracy when applied to the stabilized index-2 or GGL method [59]. Formal results concerning the application of the generalized-α scheme to index-3 constrained dynamic systems are presented by Negrut et al. [339] and Arnold and Br¨uls [340]. Of course, in all cases, the equations of motion should be properly scaled, as discussed in section 12.6. 17.4.1 Linear structural dynamics problems The generalized-α scheme [136] was introduced for linear structural dynamics problems of the form described in section 17.3.3. The equations of motion are in the form given by eq. (17.26). A typical time step starts and ends at times ti and tf , respectively, and h = tf − ti is the time step size. Subscripts (·)i and (·)f are used to identify quantities evaluated at times ti and tf , respectively. The generalized displacement, velocity, and acceleration arrays at time ti are denoted q i , v i , and ai , respectively. Similar notations are deſned at the end of the time step using subscript (·)f . In the generalized-α scheme, the solution at the end of the time step is written as " − β h2 ai + βh2 af , hv i + (1 − γ) h2 ai + γh2 af ,
q f = q i + hv i + hv f =
! 1 2
(17.36a) (17.36b)
where β and γ are two parameters that will be selected later to optimize the performance of the scheme. Instead of being satisſed at each instant in time, the equations of motion, eqs. (17.26), are satisſed at discrete instants only. The discrete statement of dynamic equilibrium is stated as M h2 a ˇ + hC hˇ v + h2 K qˇ = h2 f (tˇ), where the displacement, velocity, and acceleration stages are deſned as
(17.37)
666
17 Finite element tools
(17.38a)
qˇ = α ˆ F q f + αF q i , ˆ F hv f + αF hv i , hˇ v=α
(17.38b)
2
(17.38c)
2
2
h a ˇ=α ˆ M h af + αM h ai , tˇ = α ˆ F tf + αF ti .
(17.38d)
Coefſcients αM and αF are two additional quantities that characterize the generalized-α family of integration schemes. Coefſcients β, γ, αM , and αF will be selected to optimize the accuracy and stability characteristics of the algorithm. The following simplifying notation was adopted, α ˆ F = 1 − αF and α ˆM = 1 − αM . For the generalized-α scheme [136], the four coefſcients are expressed in terms of the spectral radius at inſnity, denoted ρ∞ . At ſrst, αM and αF are chosen as αM =
2ρ∞ − 1 , ρ∞ + 1
αF =
ρ∞ , ρ∞ + 1
(17.39)
with ρ∞ ∈ [0, 1]. The two remaining coefſcients are then computed as γ=
1 − αM + αF , 2
β=
1 (1 − αM + αF )2 . 4
(17.40)
The HHT-α scheme [135] is a subset of the generalized-α scheme for which the ſrst two coefſcients are selected as αM = 0 and αF = −α, with α ∈ [−0.3, 0]. The two remaining coefſcients are then computed using eq. (17.40) To facilitate the solution process, the solution at the end of the time step given by eqs. (17.36) is recast as h2 h2 ai + βh2 (af − ai ) = q i + hv i + ai + Δq, (17.41a) 2 2 γ 2 2 hv i + h ai + γh (af − ai ) = hv i + h2 ai + Δq, (17.41b) β 1 2 2 2 h ai + h (af − ai ) = h ai + Δq. (17.41c) β
q f = q i + hv i + hv f = h 2 af =
Introducing these expressions into eq. (17.37) then leads to γα ˆF α ˆM M+ hC + α ˆF h2 K Δq = h2 f (tˇ) β β α ˆF 2 h ai + α ˆF h2 ai + hv i − h2 K ˆ F hv i + q i . − M h2 ai − hC α 2
(17.42)
This linear set of algebraic equations is solved for the increments in the generalized coordinates, Δq. Equations (17.41) then yield the generalized displacements, velocities, and accelerations at the end of the time step. Example 17.7. Single degree of freedom problem Consider a simple, single degree of freedom spring, mass, dashpot system. For this system, it is now easily shown that
17.4 The generalized-α time integration scheme
667
γα ˆF γα ˆF α ˆM α ˆM M+ hC + α +2 ζμ + α ˆ F μ2 = mG, ˆ F h2 K = m β β β β where μ = ωh = 2πh/T and ζ is the damping of the system, expressed as a fraction of the critical damping rate. Equation (17.42) is now restated as 2 ! " α ˆF 2 h GΔq = f − h2 ai − 2ζμ α h ai + α ˆ F h2 ai + hvi − μ2 ˆ F hvi + qi . m 2 Finally, the displacements, velocities, and accelerations at the end of the time step are now expressed in terms of their counterparts at the beginning of the time step with the help of eqs. (17.41) as ⎫ ⎧ ⎧ ⎫ ⎧ ⎫ 1 ⎬ qi ⎬ ⎨ qf ⎬ ⎨ ⎨ 2 h hvf f γ/β + A hvi . = (17.43) ⎭ ⎩ 2 ⎭ mG ⎩ ⎩ 2 ⎭ 1/β h af h ai In the absence of external excitation, the ſrst term on the right-hand side vanishes, and matrix A, called the ampliſcation matrix, then relates the displacements, velocities, and accelerations of the system at the beginning of the time step to the corresponding quantities at the end of the time step. The ampliſcation matrix is deſned as A = A1 − A2 AT3 /G, where ⎫ ⎧ ⎧ ⎫ ⎤ 1 1 1/2 μ2 ⎨ 1 ⎬ ⎨ ⎬ 2ζμ + μ2 α ˆF A1 = ⎣0 1 1 ⎦ , A2 = γ/β , A3 = . ⎭ ⎩ ⎩ ⎭ 00 1 1/β 1 + 2ζμˆ αF + μ2 α ˆ F /2 ⎡
The largest eigenvalue of the ampliſcation matrix is called its spectral radius. Figure 17.19 shows this spectral radius as a function of h/T for ζ = 0 and several values of ρ∞ . The corresponding results for the HHT scheme are shown in ſg. 17.20, when ζ = 0 and coefſcient α takes different values. For this single degree of freedom linear oscillator, the generalized-α scheme can be viewed as a low-pass ſlter. For small time step sizes, h/T 1, the integrator yields accurate predictions. For large time step sizes, h/T > 1, the response of the system is dramatically attenuated, and for ρ∞ = 0, asymptotic annihilation is achieved, i.e., the numerical prediction of the system’s response vanishes after a single time step. Example 16.3 shows the implications of the choice of ρ∞ on the performance of the generalized-α scheme. 17.4.2 Nonlinear structural dynamics problems Nonlinear structural dynamics problems were investigated in section 17.3.4, with equations of motion cast in the form of eq. (17.27). The linearization process described in this section leads to the linearized equations of motion given by eq. (17.30). Since the generalized-α scheme was introduced for linear structural dynamics problems, it seems logical to extend its application to nonlinear structural
668
17 Finite element tools 1
1
0.9
0.9 0.8
SPECTRAL RADIUS
SPECTRAL RADIUS
0.8 0.7 0.6 0.5 0.4 0.3 0.2
0.6 0.5 0.4 0.3 0.2
0.1 0 -3 10
0.7
0.1 -2
10
-1
10
0
10
1
10
2
10
3
10
0 -3 10
-2
10
-1
10
10
0
10
1
10
2
3
10
h/T
h/T
Fig. 17.19. Spectral radius of the generalized-α scheme versus h/T for ρ∞ = 1: solid line; ρ∞ = 0.5: dasheddotted line; ρ∞ = 0.2: dashed line; ρ∞ = 0: dotted line.
Fig. 17.20. Spectral radius of the HHT scheme versus h/T for α = 0: solid line; α = −0.1: dashed-dotted line; α = −0.2: dashed line; α = −0.3: dotted line.
dynamics problems by applying the scheme to the linearized equations of motion. The scaled, linearized equations are recast here as " ! h2 KΔˇ (17.44) q + hGhΔˇ v + Mh2 Δˇ a = − Mh2 a ˇ + h2 F , where qˇ, vˇ, and a ˇ are the stages deſned in eqs. (17.38), and the following notations were deſned for the mass, gyroscopic and stiffness matrices, M = M (ˇ q , tˇ),
G = G(ˇ q , vˇ, tˇ),
K = K(ˇ q , vˇ, a ˇ, tˇ),
(17.45)
respectively, and the dynamic load vector, q , vˇ, tˇ). F = f (ˇ
(17.46)
Increments in the stages are readily obtained from eq. (17.38) as ˆF h2 Δaf = Δˇ q = βα
Δˇ q=
α ˆF Δq, γ γα ˆF Δˇ q= Δq, hΔˇ v = γα ˆ F h2 Δaf = β β α ˆM α ˆM Δq, a= α ˆM h2 Δaf = Δˇ q= h2 Δˇ βα ˆF β
(17.47a) (17.47b) (17.47c)
where the second set of equalities were obtained from eq. (17.41). Introducing these results into eq. (17.44) and multiplying by β leads to " ! α ˆM M + γ α (17.48) ˆ F hG + β α ˆF h2 K Δq = −β Mh2 a ˇ + h2 F . These linearized equations are solved sequentially as part of an iterative procedure up to convergence. Increments in the displacement, velocity, and acceleration stages are then obtained from eq. (17.47).
17.4 The generalized-α time integration scheme
669
17.4.3 Multibody dynamics problems with holonomic constraints Multibody dynamics problems with holonomic constraints were investigated in section 17.3.5, with equations of motion cast in the form of eqs. (17.31). The linearization process described in that section leads to the linearized equations of motion given by eqs. (17.32). Since the generalized-α scheme was introduced for linear structural dynamics problems, it seems logical to extend its application to multibody dynamics problems with holonomic constraints by applying the scheme to the linearized equations of motion. The scaled, linearized equations are recast here as (h2 K + sKb )Δˇ q + hGhΔˇ v + Mh2 Δˇ a + sB T Δˇ " ! ˇ + h2 F + sB T ˇ , = − Mh2 a
(17.49a)
sKc Δˇ q = −sC,
(17.49b)
where the stiffness, gyroscopic, and mass matrices were deſned in eq. (17.45), the dynamic load vector by eq. (17.46), and ˇ = μ ˆ
(17.50)
are the Lagrange multiplier stages. Additionally, the following notations were introduced ˇ tˇ), Kc = K c (ˇ B = B(ˇ q , tˇ), Kb = K b (ˇ q , , q , tˇ). (17.51) Introducing the increments in the stages as deſned in eqs. (17.47) into eqs. (17.49) yields the following discrete equations ˆ F hG + β α ˆF (h2 K + sKb ) β α ˆ F sB T α ˆM M + γ α Δq Δˆ μ α ˆ F sKc 0 (17.52) ! " Tˇ 2 2 −β Mh a ˇ + h F + sB = . −sC These linearized equations are solved sequentially as part of an iterative procedure up to convergence. 17.4.4 Multibody dynamics problems with nonholonomic constraints Multibody dynamics problems with nonholonomic constraints were investigated in section 17.3.6, with equations of motion cast in the form of eqs. (17.33). The linearization process described in that section leads to the linearized equations of motion given by eqs. (17.35). The scaled, linearized equations are recast here as (h2 K + sKb )Δˇ q + hGhΔˇ v + Mh2 Δˇ a + sB T Δˇ
(17.53a)
ˇ ˇ + h F + sB ), = −(Mh a 2
2
T
shKd Δˇ q + sBhΔˇ v = −(sB hˇ v + shb).
(17.53b)
670
17 Finite element tools
where the stiffness, gyroscopic, and mass matrices were deſned in eq. (17.45), the dynamic load vector by eq. (17.46), the constraint related matrices by eq. (17.51), ˇ = μ ˆ are the Lagrange multiplier stages, and the following notation was used b = b(ˇ q , tˇ), and Kd = K d (ˇ q , vˇ, tˇ). Introducing the increments in the stages as deſned in eqs. (17.47) into eqs. (17.53) yields the following discrete equations α ˆM M + γ α ˆ F hG + β α ˆF (h2 K + sKb ) β α ˆF sB T Δq Δˆ μ βα ˆF shKd + γ α ˆ F sB 0 (17.54) " ! ˇ + h2 F + sB T ˇ −β Mh2 a . = v + shb) −β(sB hˇ These linearized equations are solved sequentially as part of an iterative procedure up to convergence.
17.5 Energy preserving and decaying schemes The equations of motion resulting from ſnite element based modeling of nonlinearly elastic multibody systems present distinguishing features: they are stiff, nonlinear, differential-algebraic equations. The stiffness of the system stems from the presence of high frequencies in the elastic members, but also from the inſnite frequencies associated with the kinematic constraints. The main focus of this section is the derivation of algorithms presenting high frequency numerical dissipation and for which unconditional stability can be proven in the nonlinear case. An energy decay argument will be used to establish stability [202]. The Newmark algorithm [201] is widely used in structural dynamics. In particular, the average acceleration scheme, also known as the trapezoidal rule, is an unconditionally stable, second-order accurate scheme when applied to linear problems. The classical stability analysis of this scheme is readily found in textbooks [197] and shows that the spectral radius of the ampliſcation matrix equals unity at all frequencies. An alternative way of proving stability is based on an energy argument. Indeed, it is easily shown that the average acceleration scheme exactly preserves the total energy of the system [202]. Finite element discretizations of complex structures involve numerous degrees of freedom. Consequently, high frequency modes are present in the models and high frequency numerical dissipation is a desirable, if not indispensable feature for robust time integration schemes. Numerical dissipation cannot be introduced in the Newmark method without degrading its accuracy. The HHT and generalized-α schemes presented in section 17.4 remedy this situation by achieving high frequency dissipation while minimizing unwanted low frequency dissipation. Both methods have been successfully used for both linear and nonlinear problems, although unconditional stability is proved for linear systems only.
17.5 Energy preserving and decaying schemes
671
Simo and his coworkers presented energy preserving schemes for rigid body dynamics [203], elastodynamics [204], beams [206], and plates and shells [208]. These schemes were presented as second-order accurate, ſnite difference schemes based on a mid-point approximation. Finite rotations were parameterized with the rotation vector, then using Cayley’s algebraic form of ſnite rotations. The unconditional stability of these schemes stems from a proof of preservation of the total energy of the system. It is important to understand that while the exact solution of the equations of motion implies the exact preservation of the total mechanical energy, a numerical, i.e., an inherently approximate solution of the problem does not, in general, guarantee the preservation of energy at the discrete level. When using energy preserving schemes, the computed, approximate solution exactly satisſes the energy preservation condition. An energy preserving scheme for nonlinear elastic multibody systems was proposed by Bauchau [210]. In this scheme, the discretization of the equations of motion implies the conservation of the total energy for the elastic components of the system, and that of the forces of constraint associated with the kinematic constraints implies the vanishing of work they perform. The combination of these two features of the discretization guarantees the unconditional stability of the numerical integration process for nonlinearly elastic multibody systems. When rotationless formulations of dynamics are used, see example 11.3, the governing equations of motion for rigid bodies, beams, and plates and shells involve algebraic nonlinearities only, of the second degree at most. It is remarkable, that for these problems, the mid-point time integration scheme naturally leads to discrete equations that satisfy energy and momentum conservation conditions. Betsch and his coworkers used this approach to develop energy preserving schemes for rigid body dynamics [205], beams [207], and plates and shells [209]. Although energy preserving schemes perform well for simple problems, their lack of high frequency numerical dissipation can cause problems [210]. First, the time histories of internal forces and velocities often present considerable high frequency content of a purely numerical origin. Second, these high frequency oscillations can hinder the convergence process for the solution of the nonlinear equations of motion. The selection of a smaller time step size does not necessarily help the convergence process, because smaller time step sizes allow even higher frequency oscillations to be present in the response. Finally, it seems that the presence of high frequency oscillations also renders strict energy preservation difſcult to obtain. This could prove to be a real limitation of energy preserving schemes when applied to more and more complex problems, for which the use of integration schemes presenting high frequency numerical dissipation become desirable, if not indispensable. It appears that the development of “energy decaying schemes,” i.e., schemes eliminating the energy associated with vibratory motions at high frequency, is desirable. This is particularly important when dealing with problems presenting a complex dynamic response such as nonlinearly elastic multibody problems. The key to the development of an energy decaying scheme is the derivation of an energy decay inequality [202] rather that the discrete energy conservation law which is central to energy preserving schemes. A methodology that can systematically lead
672
17 Finite element tools
to an energy decay inequality is the time discontinuous Galerkin method [341, 342, 343] which was initially developed for hyperbolic equations. Hughes and Hulbert [344, 345] have investigated the use of the time discontinuous Galerkin methodology for linear elastodynamics. They point out that “classical elastodynamics can be converted to ſrst-order symmetric hyperbolic form, which has proved useful in theoretical studies. Finite element methods for ſrst-order symmetric hyperbolic system are thus immediately applicable. However, there seems to be several disadvantages: in symmetric hyperbolic form the state vector consists of displacements, velocities, and stresses which is computationally uneconomical; and the generalization to nonlinear elastodynamics seems possible only in special circumstances.” Indeed, writing the nonlinear equations of motion of geometrically exact beams in this symmetric hyperbolic form does not appear to be possible. In this section, an alternative route is taken. Practical time integration schemes that do not rely on the symmetric hyperbolic form of the equations of motion are developed. These schemes are of a ſnite difference nature, and imply an energy balance condition that is obtained by a direct computation of the work done by the discretized inertial and elastic forces over a time step. The mean value theorem guarantees the existence of discretizations leading to these energy preservation, or energy decay statements, leading to a rigorous proof of unconditional stability for the scheme. Energy decaying schemes were presented by Bauchau and his co-workers for beams [346], elastodynamics [347], and multibody systems [217, 218, 211], and plates and shells [348, 213]. These schemes originate from Galerkin and time discontinuous Galerkin approximations of the equations of motion written in the symmetric hyperbolic form. Finite rotations were parameterized using the conformal rotation parameters. Bottasso and Borri proposed both energy preserving and decaying schemes for beams [212, 349] and multibody systems [214, 215]. Their schemes were cast within the framework of ſnite elements in time at ſrst, then as 2-stage FSAL Runge-Kutta methods. The rotation vector was used to represent ſnite rotations. Some of the proposed schemes also imply the conservation of momenta, or are geometrically invariant [350, 214, 215, 219]. These additional features are easily obtained by recasting the ſeld equations in ſxed pole form [269], see example 8.9. 17.5.1 The symmetric hyperbolic form Consider a dynamical system described by a kinetic energy K = K(˙q , q), and a strain energy V = V (q), where q are the system’s generalized coordinates. Classical forms of the equations of motion The Lagrangian of the system is deſned as L(˙q , q) = K − V , and the equations of motion of the system in Lagrangian form are then d * + L,q˙ − L,q = 0. dt
(17.55)
17.5 Energy preserving and decaying schemes
673
Notation (·),q is used here to indicate a derivative with respect to q. Hamilton’s formulation is obtained with the help of Legendre’s transformation [87]. First, the momenta are deſned as p(˙q , q) = L,q˙ , and these relationships can be inverted to yield q˙ = q˙ (q, p). The Hamiltonian of the system is now deſned H(q, p) = ˙ p) − L(q, p). The equations of motion of the system in Hamiltonian form are pT q(q, then H,p ,
(17.56a)
p˙ = −H,q .
(17.56b)
q˙ =
The symmetric hyperbolic form The symmetric hyperbolic form stems from a second Legendre transformation. The following variables are deſned ſrst f (q, p) = H,q and v(q, p) = H,p . These relations can be inverted to yield q = q(f , v) and p = p(f , v). A new function is now deſned G(f , v) = f T q(f , v) + v T p(f , v) − H(f , v), (17.57) implying q = G,f and p = G,v . It can be readily shown that the Hessians of H and G are the inverse of each other. Hence, if H is a positive-deſnite function, so is G. Hamilton’s equations, eqs. (17.56), can be expressed in terms of the new variables, f and v, to ſnd the symmetric hyperbolic form of the equations of motion G,f f f˙ + G,f v v˙ − v = 0 and G,vf f˙ + G,v v v˙ + f = 0. To simplify the notation, an implicit form of the equations is preferred q˙ (f , v) − v = 0,
(17.58a)
p(f ˙ , v) + f = 0.
(17.58b)
The Galerkin approximation In the Galerkin approximation, the equations of motion are enforced in a weak, integral manner. Fig. 17.21 shows a time interval from ti to tf , and an approximate solution over that interval. Subscripts (·)i and (·)f will be used to indicate the value of a quantity at times ti and tf , respectively. The Galerkin approximation of the equations of motion in implicit symmetric hyperbolic form (17.58) writes % tf
T w 1 q˙ (f , v) − v + wT2 p(f ˙ , v) + f dt = 0, (17.59) ti
where w1 and w 2 are arbitrary test functions. Integration by parts yields % tf −w˙ T1 q − w˙ T2 p − w T1 v + wT2 f dt + w T1f q f + w T2f pf − wT1i q i − w T2i pi = 0. ti
This approximation of the equations of motion enjoys remarkable properties. Indeed, 't T selecting the test functions as w 1 = f and w2 = v yields tif [−f˙ G,f − v˙ T G,v −
674
17 Finite element tools
f T v + v T f ] dt + f Tf q f + v Tf pf − f Ti q i − v Ti pi = 0. The time integral clearly has a closed form solution, leading to Gi −Gf +f Tf q f +vTf pf −f Ti q i −v Ti pi = 0. Finally, function G is expressed in terms of the Hamiltonian, H, with the help of eq. (17.57) to ſnd H f = Hi . (17.60) In summary, the Galerkin approximation, eq. (17.59), of the equations of motion written in symmetric hyperbolic form implies a discrete Hamiltonian preservation statement (17.60). If the Hamiltonian is a positive-deſnite function, this statement implies the unconditional stability of integration schemes based on eq. (17.59).
time
time ti
tm
ti tj
tf
Fig. 17.21. The time continuous Galerkin approximation.
tf
Fig. 17.22. The time discontinuous Galerkin approximation.
The time discontinuous Galerkin approximation In the time discontinuous Galerkin approximation, the solution is allowed to present discontinuities in the displacement and velocity ſelds at discrete times. Figure 17.22 shows a time interval from ti to tf and the approximate solution over that interval. At the initial instant, the solution presents a jump. Subscripts (·)i will be used to denote the value of a discontinuous quantity to the left side of the jump, whereas a subscript (.)j indicates the value of that quantity to the right side of the jump. The equations of motion and initial conditions are enforced in a weak, integral manner. The time discontinuous Galerkin approximation of the equations of motion in implicit symmetric hyperbolic form (17.58) writes % tf
T w 1 q˙ (f , v) − v + w T2 p(f ˙ , v) + f dt + w T1j q + wT2j p = 0, (17.61) tj
where notation · indicates the jump in a quantity at the initial time, i.e., q = q j − q i and p = pj − pi . This approximation of the equations of motion also enjoys remarkable properties. Indeed, integrating by parts and selecting the test functions as w 1 = f and w 2 = v 't yields tjf [−f˙ G,f − vG ˙ ,v − f T v + v T f ] dt + f Tf q f + v Tf pf − f Tj q i − v Tj pi = 0. The time integral clearly has a closed form solution, leading to Gj − Gf + f Tf q f +
17.5 Energy preserving and decaying schemes
675
v Tf pf − f Tj q i − vTj pi = 0. Finally, we express G in terms of the Hamiltonian H with the help of eq. (17.57) to ſnd Hf − Hj + f Tj q + v Tj p = 0.
(17.62)
Because the Hamiltonian is a continuous function of q and p, the mean value theorem implies Hj = Hi + f Tj q + v Tj p −
1 T q H,qq q + qT H,q p p+ 2
pT H,p q q + pT H,p p p
h
= Hi + f Tj q + v Tj p − c2 ,
(17.63)
where the last equality holds if the Hamiltonian is a positive-deſnite function. Combining eqs. (17.62) and (17.63) then yields Hf = Hi − c2 , ⇒ Hf ≤ Hi .
(17.64)
In summary, the time discontinuous Galerkin approximation (17.61) of the equations of motion written in symmetric hyperbolic form implies a Hamiltonian decay inequality, eq. (17.64), if the Hamiltonian is a positive-deſnite quantity. This inequality implies the unconditional stability of time integration schemes based on eq. (17.61). Example 17.8. Linear spring-mass system To illustrate the procedures described in the previous sections, a very simple example will be treated here. Consider a linear spring-mass system with a kinetic energy K = 1/2 m u˙ 2 , a strain energy V = 1/2 ku2 , and subjected to an external force F a (t). In this simple case, f = k u and v = p/m, and the symmetric hyperbolic form of the equations of motion becomes: p˙ + ku = F a ; u˙ − p/m = 0. The Galerkin approximation (17.59) for this problem writes %
tf ti
3 4 p w1 [u˙ − ] + w2 [p˙ + ku − F a ] dt = 0. m
Using a linear in time approximation for the displacement and momentum, and a constant in time approximation for the test functions, the following discrete equations are obtained I E a Fm + Fm = Fm , (17.65) where subscript (·)m denotes a quantity at the mid-point tm , see ſg. 17.21. The I discretized inertial forces are Fm = m(u˙ f − u˙ i )/Δt, and the following velocitydisplacement and force-displacement relationships are used uf − ui u˙ f + u˙ i = , Δt 2
E Fm =k
uf + ui , 2
(17.66)
where Δt indicates the time step size. Finally, the discretized applied forces are
676
17 Finite element tools a Fm =
1 2
%
1
(17.67)
F a (τ ) dτ,
−1
where τ is a non-dimensional time variable such that τ = −1 or +1 at times ti and tf , respectively. The properties of this integration scheme can be investigated using the classical techniques for the analysis of linear schemes. The spectral radius of the ampliſcation matrix is always equal to unity, implying unconditional stability. This scheme is identical to the Newmark scheme [201] with γ = 1/2 and β = 1/4. It can be readily shown that the discrete equations of motion (17.65) imply a discrete energy preservation statement Ef = Ei , where E = K + V is the total mechanical energy, as expected from the theoretical developments presented above. The same problem can be treated with the time discontinuous Galerkin approximation, eq. (17.61), which writes % tf 3 4 p w1 [u˙ − ] + w2 [p˙ + ku − F a ] dt + w1j u + w2j p = 0. (17.68) m tj Using a linear in time approximation for the displacement, momentum, and test functions, the following discrete equations are obtained I Fm + FgE = Fga ,
and FhI − [FgE − fj ]/3 = Fha .
(17.69)
Subscript (·)g denotes a quantity at the midpoint between times tj and tf , and (·)h denotes a quantity at the midpoint between times ti and tj , see ſg. 17.22. The disI cretized inertial forces are Fm = m(u˙ f − u˙ i )/Δt and FhI = m(u˙ j − u˙ i )/Δt, and the following velocity-displacement and force-displacement relationships are used uf − ui u˙ f + u˙ j = , Δt 2
3
uj − ui u˙ f − u˙ j =− , Δt 2
FgE = k
uf + uj . 2
Finally, the discretized applied forces are % % 1 1 a 1 1 a Fga = F dτ, and Fha = − F τ dτ. 2 −1 2 −1
(17.70)
(17.71)
It can be readily shown that the discrete equations of motion (17.69) imply a discrete energy decay inequality Ef ≤ Ei . This is a direct consequence of (17.64), since the Hamiltonian is equal to the total energy of the system for this simple problem. This can be conſrmed by a conventional analysis of the scheme based on the characteristics of the ampliſcation matrix. The period elongation is ΔT /T = ω 4 Δt4 /270 + O(ω 6 Δt6 ), while the algorithmic damping is ζ = ω 3 Δt3 /72 + O(ω 5 Δt5 ), where ω 2 = k/m. Hence, the scheme is third-order accurate. The spectral radius, shown in ſg. 17.23 as function of Δt/T = ωΔt/(2π), is compared with that of the generalized-α scheme for three different values of spectral radius at inſnity, ρ∞ = 0.9, 0.5, and 0. Asymptotic annihilation is obtained with the time discontinuous Galerkin scheme. The scheme is unconditionally stable since the spectral radius is always smaller than unity.
17.5 Energy preserving and decaying schemes
677
1
SPECTRAL RADIUS r
0.8
0.6
0.4
0.2
0 -2 10
10
-1
10
0
Dt/T
10
1
10
2
10
3
Fig. 17.23. Comparison of spectral radii of various time integration schemes. Solid line: time discontinuous Galerkin scheme. Dotted, dash-doted, and dash-double dotted lines: generalized-α scheme with ρ∞ = 0.9, 0.5, and 0, respectively.
17.5.2 Discussion Both Galerkin (17.59) and time discontinuous Galerkin (17.61) approximations applied to the equations of motion written in the symmetric hyperbolic form (17.58) have been shown to provide a systematic way of deriving unconditionally stable time integration schemes, provided the Hamiltonian is a positive-deſnite function. The energy decay inequality associated with the time discontinuous Galerkin approximation implies the presence of numerical dissipation in the resulting time integration schemes, whereas such dissipation is ruled out by the strict energy preservation associated with the Galerkin approximation. Since the presence of numerical dissipation is highly desirable, the time discontinuous Galerkin approach appears to be the most promising method. However, both of these approaches present a major drawback: it is not always possible to recast the equations of motion of general systems into the symmetric hyperbolic form. In particular, it does not seem possible to cast the governing equations of constrained multibody systems in the symmetric hyperbolic form. Furthermore, the time discontinuous Galerkin approach require two level of unknowns (at tj and tf ). In elastodynamics, three ſelds are required for the symmetric hyperbolic form: displacements, stresses and momenta. Hence, the ſnal discrete equations will involve 6N unknowns, resulting in unacceptably high computational cost [344]. 17.5.3 Practical time integration schemes In this section, time integration schemes applicable to nonlinear elastic multibody systems will be developed, without resorting to the symmetric hyperbolic form of the equations of motion. The investigation will focus on dynamical system deſned by a kinetic energy, K = 1/2 v T M v, and a strain energy, V = 1/2 εT C ε. The mass matrix M and stiffness matrix C are symmetric and positive-deſnite; the velocities and strains are given as v = T (q)˙q , and ε = ε(q), respectively. Velocities are assumed to be linear functions of the generalized velocities, resulting in a kinetic
678
17 Finite element tools
energy that is a quadratic form of the same quantities. Under these conditions the total mechanical energy of the system is preserved [87]. The equations of motion of such systems simply write F I + F E = F a (t), where a F (t) are the time dependent external forces. The inertial and elastic forces, F I and F e , respectively, are d F I = (T T p) − v T,q p, (17.72) dt F E = εT,q f ,
(17.73)
where p = M v, f = C ε and notation (·),q indicates a derivative with respect to q. The energy preservation statement can be obtained by evaluating the work done by the inertial, elastic, and applied forces. The work done by the inertial forces is 't computed ſrst W I = tif q˙ T F I dt = Kf − Ki . Next, the work done by the elastic 't forces is evaluated W E = tif q˙ T F E dt = Vf − Vi . Finally, the work done by the 't applied forces is W a = tif q˙ T F a dt. Hence, the equations of motion imply the following work balance equation Kf − Ki + Vf − Vi = W a ,
⇒ Ef − Ei = W a ,
(17.74)
where the total mechanical energy E = K + V . In the absence of externally applied forces W a = 0 and the total energy is preserved. The goal is to obtain discretized equations of motion that will imply an exact energy preservation condition (17.74), or an energy decay inequality. At ſrst, discretizations of the inertial and elastic forces will proposed, then energy preserving and energy decaying schemes will be derived. Discretization of inertial and elastic forces Consider a time interval ti tf , and an approximate solution over this interval, as shown in ſg. 17.21. The following discretizations of the inertial (17.72) and elastic (17.73) forces are proposed: F Im
=
T Tf pf − T Ti pi Δt
* + − v T,q
* + FE = εT,q f m, m m
pf + pi m
2
;
(17.75) (17.76)
where the quantities (v ,q )m , (ε,q )m and f m are as yet undetermined. The work done by the discretized inertial forces is W I = (q f − q i )F Im , and regrouping the term yields: I
W =
q Tf − q Ti Δt
T Tf
Δt * T + Δt * T + T v ,q v ,q pf − T i + pi . (17.77) − 2 2 m m
The following condition is now imposed
17.5 Energy preserving and decaying schemes
qf − qi qf − qi Δt * + Δt * + v ,q = Ti + v ,q . vm = T f − 2 Δt 2 Δt m m
679
(17.78)
These relationships deſne both (v ,q )m and v m . Note that the existence of (v ,q )m satisfying eq. (17.78) is guaranteed by the mean value theorem which states that v f = v i + (v ,q )m (q f − q i ). The work done by the discretized inertial forces now becomes W I = (v Tf − v Ti )M v m . Next, the work done by the discretized elastic forces is evaluated W E = (q Tf − q Ti )(ε,q )m f m . The following condition is now imposed * + εf − εi = εT,q
m
(q f − q i ).
(17.79)
Here again, the existence of (ε,q )m satisfying this condition is guaranteed by the mean value theorem. The work done by the discretized elastic forces now becomes W E = (εTf − εTi )f m . Energy preserving scheme The discretized equations of motion for the energy preserving scheme mimic those obtained in example 17.8 for the Galerkin approximation of the linear spring-mass problem (17.65) a F Im + F E (17.80) m = F m, a where F Im and F E m are now given by (17.75) and (17.76), respectively; and F m = '1 a 1/2 −1 F (τ ) dτ , as in eq. (17.67). The work done by these discretized forces can be evaluated, as was done earlier. With the help of conditions (17.78) and (17.79), equations of motion (17.80) imply a work balance statement (v Tf − v Ti )M v m + a (εTf − εTi )f m = Wm . The following algorithmic velocity-displacement and forcestrain relationship are now selected, see eq. (17.66),
vm =
q˙ f + q˙ i 2
,
fm = C
εf + εi . 2
(17.81)
The work balance equation then becomes Kf − Ki + Vf − Vi = W a , and ſnally, the discrete energy preservation statement is obtained, Ef − Ei = W a . In summary, discretization (17.80) implies the discrete energy preservation statement provided that relationships (17.78) and (17.79) are satisſed, and that the algorithmic velocity-displacement and force-strain relationships (17.81) are used. Energy Decaying Scheme The discretized equations of motion for the energy decaying scheme mimic those obtained in example 17.8 for the time discontinuous Galerkin approximation of a linear spring-mass system (17.69)
680
17 Finite element tools a F Im + F E g = F g,
and F Ih −
1 E * + F g − vq f j = F ah 3 h
(17.82)
where F Im , and F Ih are given by (17.75) using subscripts (·)f , (·)i and (·)j , (·)i , is given by eq. (17.76) using subscripts (·)f , (·)j ; and F ag = respectively; F E '1 '1 a g 1/2 −1 F dτ and F ah = 1/2 −1 F a τ dτ , as in eq. 17.71. The work done by the discretized inertial forces is W I = (q Tf − q Ti )F Im + 3qT F Ih . With the help of condition (17.78) this becomes W I = v Tm M (v f − v i ) + 3vTh M v.
(17.83)
E T The work done by the discretized elastic forces is W E = (q Tf −qTi )F E g −q [F g −
(ε,q )h f j ]. With the help of condition (17.79) this becomes W E = (εTf − εTj ) C f g + εT C f j . The following velocity-displacement and force-strain relationship are now selected, see eq. (17.70), vm =
q˙ f + q˙ j 2
,
3vh = −
q˙ f − q˙ j 2
,
fg = C
εf + εj . 2
(17.84)
The work balance equation now writes Ef − Ej + v Tj M v + f Tj C ε = W a ,
(17.85)
which mirrors eq. (17.62). Since the total mechanical energy is a positive-deſnite function of the velocities and strains, the mean value theorem implies 1 T h h v E,vv v + vT E,vε Ej = Ei + v Tj M v + f Tj Cε − ε 2 h h +εT E,εv v + εT E,εε ε = Ei + v Tj M v + f Tj C ε − c2 ,
(17.86)
which is equivalent to eq. (17.63). Combining eqs. (17.85) and (17.86) yields Ef = Ei − c2 + W a and ſnally, the energy decay statement Ef ≤ Ei + W a . In summary, discretization (17.82) implies the energy decay statement provided that relationships (17.78) and (17.79) are satisſed, and that the algorithmic velocitydisplacement and force-strain relationships (17.84) are used. Example 17.9. Nonlinear spring-mass system Consider a nonlinear spring mass oscillator deſned by kinetic energy K = 1/2 mu˙ 2 , strain energy V = 1/2 kε2 , and strain ε = u2 . For this example m = k = 1. It is clear that condition (17.79) implies (ε,u )m = uf + ui in this case, and fm = k(εf + εi )/2. For the trapezoidal rule scheme, the discretized equation of motion is (mu˙ f − mu˙ i )/Δt+2 k[(uf +ui )/2]3 = 0. Although this scheme is unconditionally stable for linear system, there is no guarantee of stability when applied to nonlinear systems. Figure 17.24 shows the response of the system for initial conditions u0 = 1 and u˙ 0 =
17.5 Energy preserving and decaying schemes 6
2.5
DISPLACEMENT AND VELOCITY
681
2 5
SYSTEM ENERGIES [N]
1.5 1 0.5 0 -0.5 -1
4
3
2
1 -1.5 -2
0
5
10
0
15
0
5
2
8
1.5
7
1 0.5 0
-0.5 -1
-1.5
6 5 4 3 2
-2
1
-2.5
0
5
15
Fig. 17.25. Energy response for the trapezoidal rule. Solid line: kinetic energy; dashed line: strain energy; dashed-dotted line: total mechanical energy.
SYSTEM ENERGIES [N]
DISPLACEMENT AND VELOCITY
Fig. 17.24. Displacement response for the trapezoidal rule (Initial conditions: u0 = 1 and u˙ 0 = 0). Solid line: displacement; dashed line: velocity.
0
10
TIME
TIME
10
15
TIME
Fig. 17.26. System response for the trapezoidal rule (Initial conditions: u0 = 2 and u˙ 0 = 0). Solid line: displacement; dashed line: velocity.
0
5
10
15
TIME
Fig. 17.27. Energy response for the trapezoidal rule. Solid line: kinetic energy; dashed line: strain energy; dashed-dotted line: total mechanical energy.
0. The total mechanical energy rapidly increases as shown by ſg. 17.25, although not monotonously. For initial conditions u0 = 2 and u˙ 0 = 0, the corresponding results are shown in ſgs. 17.26 and 17.27, which now show a rapid decrease in the total mechanical energy, although here again, not monotonously. For the energy preserving scheme, the discretized equation of motion is (mu˙ f − mu˙ i )/Δt + (ε,u )m fm = 0. Figure 17.28 shows the response of the system for initial conditions u0 = 1 and u˙ 0 = 0. The total mechanical energy of the system, shown in ſg. 17.29, is preserved exactly, as expected from the energy preservation condition that characterizes this scheme. Finally, for the energy decaying scheme, the discretized equations of motion are (mu˙ f −mu˙ i )/Δt+(ε,u )g fg = 0 and (mu˙ j −mu˙ i )/Δt−[(ε,u )g fg −(ε,u )h kεj ]/3 = 0. Figure 17.30 shows the response of the system for initial conditions u0 = 1 and u˙ 0 = 0. The total mechanical energy of the system, shown in ſg. 17.31, decays
682
17 Finite element tools 0.7
SYSTEM ENERGIES [N]
DISPLACEMENT AND VELOCITY
1 0.8 0.6 0.4 0.2 0 -0.2 -0.4
0.6 0.5 0.4
0.3
0.2
-0.6 0.1
-0.8 -1 0
5
10
0
15
0
5
TIME
15
Fig. 17.29. Energy response for the energy preserving scheme. Solid line: kinetic energy; dashed line: strain energy; dasheddotted line: total mechanical energy.
1
0.5
0.8
0.45
SYSTEM ENERGIES [N]
DISPLACEMENT AND VELOCITY
Fig. 17.28. System response for the energy preserving scheme (Initial conditions: u0 = 1 and u˙ 0 = 0). Solid line: displacement; dashed line: velocity.
0.6 0.4 0.2 0 -0.2 -0.4
0.4 0.35 0.3 0.25 0.2 0.15 0.1
-0.6 -0.8 0
10
TIME
0.05 5
10
15
TIME
Fig. 17.30. System response for the energy decaying scheme (Initial conditions: u0 = 1 and u˙ 0 = 0). Solid line: displacement; dashed line: velocity.
0
0
5
10
15
TIME
Fig. 17.31. Energy response for the energy decaying scheme. Solid line: kinetic energy; dashed line: strain energy; dashed-dotted line: total mechanical energy.
monotonously, as expected from the energy decay inequality that characterize this scheme. 17.5.4 Enforcement of the constraints Consider a multibody system subjected to holonomic constraints C(q) = 0. If the Lagrange multiplier method is used to enforce these constraints, a constraint potential V c = λT C is added to the strain energy of the system, where λ are the Lagrange multipliers. The corresponding forces of constraint are F c = B T λ, where B is the con't 't straint matrix. The work done by these forces is W c = tif q˙ T F c dt = tif λT C˙ dt. Since C must vanish at all times, C˙ = 0, and W c = 0, i.e., the work done by the forces of constraint vanishes exactly.
17.5 Energy preserving and decaying schemes
683
Energy preserving scheme Lagrange’s multiplier approach will be used to enforce the constraint. The discretized forces of constraint are selected in the following manner * + F cm = C T,q λm , (17.87) m
where Lagrange’s multipliers, λm , are additional unknowns of the problem and the mean value theorem, C f − C i = (C ,q )m (q f − q i ), deſnes quantity (C ,q )m . The work done by these forces of constraint then becomes W c = (C f − C i ) λm . The additional equations required to solve the problem are obtained by enforcing the exact vanishing for the work done by the discretized forces of constraint. Since λm = 0, this implies C f − C i = 0. To avoid the drift phenomenon, it is preferable to enforce the condition C f = 0 at each time step. In summary, the discretization of the forces of constraint, eq. (17.87) together with the deſnition of (C ,q )m and the discrete constraint C f = 0 imply the exact vanishing of the work done by the discretized forces of constraint. Energy decaying scheme The Lagrange multiplier method for the energy decaying scheme is obtained in a similar manner. The discretized forces of constraint are selected as follows * + * + 1 F cg = C T,q (17.88) λg , F ch = − F cg − C T,q λj . 3 g h The work done by these forces become W c = (C f − C j )λg + (C j − C i )λg , and vanishes only if C f − C j = 0 and C j − C i = 0. Here again, it is preferable to enforce the constraints as C f = C j = 0, to avoid the drift phenomenon. In summary, the discretization of the forces of constraint (17.88) together with the discrete constraints C f = C j = 0 imply the exact vanishing of the work done by the discretized forces of constraint. Example 17.10. The pendulum problem Consider a pendulum problem deſned by kinetic energy K = 1/2 m˙q T q, ˙ potential energy V = −m gq2 , and constraint C = (q T q −2 )/2 = 0, where is the length T
of the pendulum. The generalized coordinates of the problem, q T = q1 , q2 , correspond to the horizonal and vertical displacements of the bob, respectively. For this example, m = 1 kg, = 0.5 m, v0 = 1.695 m/s, and g = 9.81 m/s2 . It is clear that (C,q )m = q m , where q m = (q f + q i )/2. The governing equations for the trapezoidal rule and energy preserving schemes are (m˙q f − m˙q i )/Δt + q m λm = mg. For the trapezoidal rule the constraint condition is Cm = (q Tm q m − 2 )/2 = 0, whereas for the energy preserving it is Cf = (q Tf q f − 2 )/2 = 0.
684
17 Finite element tools
Finally, the governing equations for the energy decaying scheme are (m˙q f − m˙q i )/Δt + q g λg = mg and (m˙q j − m˙q i )/Δt − (q g λg − q h λj )/3 = 0, subjected to
two constraint conditions Cf = 0 and Cj = (q Tj q j − 2 )/2 = 0. Figures 17.32, 17.33 and 17.34 show the time history of the pendulum displacements, velocities, and Lagrange multiplier, respectively, for the trapezoidal rule. Although the displacement history is accurately predicted, the velocities and Lagrange multipliers present violent oscillations of a purely numerical origin. The sharp rise in total energy shown in ſg. 17.35 clearly indicates the unstable nature of this scheme. 6
0.4
4
BOB VELOCITIES [m/s]
BOB DISPLACEMENT [m]
0.3 0.2 0.1
0 -0.1 -0.2
2
0
-2
-4 -0.3 -0.4
-6 0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0
TIME
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
TIME
Fig. 17.32. Response for the trapezoidal rule. Solid line: q1 ; dashed line: q2 .
Fig. 17.33. Response for the trapezoidal rule. Solid line: q˙1 ; dashed line: q˙2 .
0.05
25
20
0.03
SYSTEM ENERGIES [N]
LAGRANGE MULTIPLIER
0.04
0.02 0.01 0 -0.01 -0.02 -0.03
15
10
5
0 -0.04 -0.05 0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
TIME
Fig. 17.34. Response for the trapezoidal rule. Solid line: Lagrange multiplier.
-5 0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
TIME
Fig. 17.35. Energy response for the trapezoidal rule. Solid line: kinetic energy; dashed line: strain energy; dashed-dotted line: total mechanical energy.
Figures 17.36 to 17.39 show the corresponding results for the energy preserving and decaying schemes which are in very close agreement. All predicted histories are smooth. The total energy is exactly preserved for the energy preserving scheme, and
17.6 Implementation of cable elements
685
0.4
2
0.3
1.5
BOB VELOCITIES [m/s]
BOB DISPLACEMENT [m]
for the energy decaying scheme, the amount of dissipated energy is very small for this simple problem.
0.2 0.1
0 -0.1 -0.2
0 -0.5 -1 -1.5
-0.3 -0.4
1 0.5
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
-2
1
0
0.1
0.2
0.3
0.4
Fig. 17.36. Response for the energy preserving and decaying schemes. Solid line: q1 ; dashed line: q2 .
x 10
2.5
0.5
0.6
0.7
0.8
0.9
1
TIME
TIME
Fig. 17.37. Response for the energy preserving and decaying schemes. Solid line: q˙1 ; dashed line: q˙2 .
-4
1.8
2
SYSTEM ENERGIES [N]
LAGRANGE MULTIPLIER
1.6
1.5
1
0.5
1.4 1.2 1 0.8 0.6 0.4 0.2
0 0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
TIME
Fig. 17.38. Response for the energy preserving and decaying schemes. Solid line: Lagrange multiplier.
0 0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
TIME
Fig. 17.39. Energy response for the energy preserving and decaying schemes. Solid line: kinetic energy; dashed line: strain energy; dashed-dotted line: total mechanical energy.
17.6 Implementation of cable elements The formulation of the governing equations for cables is presented in section 16.2. Due to the high level of nonlinearity of these equations, analytical solutions cannot be obtained for all but the simplest problems. This section develops the implementation of the cable governing equations within the ſnite element framework.
686
17 Finite element tools
Sections 17.6.1 and 17.6.2 detail the deſnition and linearization of the inertial and elastic forces, respectively. It is sometimes necessary to include dissipative forces in the formulation, as explained in section 17.6.3. The last section presents the discretization of all forces, leading to the discretized force array and associated mass, gyroscopic, and stiffness matrices. 17.6.1 Inertial forces The inertial forces acting on the cable are obtained from the governing equations, eqs. (16.42), F I = m¨ u. (17.89) Linearization of inertial forces Since the expression for the inertial forces is already linear, the increment of inertial forces is simply ΔF I = MΔ¨ u, (17.90) where M is the mass matrix associated with the inertial forces, deſned as M = mI.
(17.91)
17.6.2 Elastic forces The elastic forces acting on the cable are also obtained from eqs. (16.42), F C = F ∗ G1 ,
(17.92)
where G1 = g¯1 + u , and the axial elastic force, F ∗ , is related to the axial strain through the constitutive law, eq. (16.38). Linearization of elastic forces Since the expression for the elastic forces is nonlinear, the computational process will require a linearization. At ſrst, the increment in strain is evaluated to ſnd Δγ11 = GT1 Δu . Next, the increment in the elastic force is computed ΔF ∗ = SGT1 Δu ,
(17.93)
where S is the axial stiffness of the cable. Taking variations of eq. (17.92) yields the following expression for increments in the elastic forces ΔF C = SΔu , (17.94) where the stiffness matrix, S, is deſned as S = F ∗ I + SG1 GT1 .
(17.95)
17.6 Implementation of cable elements
687
17.6.3 Dissipative forces The cable model discussed in the previous section is a purely conservative model, because the elastic forces are proportional the strain measures. It is often desirable to also introduce dissipative forces in the cable model. By an analogy to eq. (16.38), the dissipative force will be written as Fd = μS γ˙ 11 .
(17.96)
where μ is the damping coefſcient of units 1/s, and γ˙ 11 the time rate of change of the strain. Because the dissipative mechanisms in the cable are not well understood, it is postulated that the damping coefſcient is proportional to the stiffness coefſcient. The time rate of change of the sectional strain is readily obtained from eq. (16.32) as γ˙ 11 = GT1 u˙ . The dissipative forces, F dC , associated with the cable element then become F dC = Fd G1 . (17.97) Linearization of dissipative forces Because the expression for the dissipative forces is nonlinear, linearization is required here again. At ſrst, the increment in the time rate of change of the strain is evaluated to ſnd Δγ˙ 11 = GT1 Δu˙ + u˙ T Δu . (17.98) Next, the increment in the dissipative forces is computed " ! ΔFd = μS u˙ T Δu + GT1 Δu˙ .
(17.99)
Taking variations of eq. (17.97) yields the following expression for increments in the dissipative forces ΔF dC = S d Δu + E d Δu˙ , (17.100) where the dissipative matrices, S d and E d , are deſned as S d = Fd I + μSG1 u˙ T and E d = μSG1 GT1 , respectively. 17.6.4 Gravity forces for cables Gravity forces will be applied on cable due to their mass distribution. The potential of the gravity forces is written as V = −mg T (x+u), where g is the gravity vector. A variation of this potential is δV = −mgT δu. The gravity forces acting on a material point of cable then becomes F G = mg. (17.101)
688
17 Finite element tools
17.6.5 Finite element formulation of cables With the notation deſned in eqs. (17.89), (17.92), (17.97), and (17.101), the equations of motion of cable, eqs. (16.42), can be recast in the following form + * F I − F C + F dC = F G + F ext , where F ext are the external forces applied to the cable. A weighted residual formulation will be used here to enforce these dynamic equilibrium equations % + * N T F I − F C + F dC − F G − F ext dα1 = 0, (17.102) 0
where is the length of the cable element and N (α1 ) a matrix storing the selected test functions, see eq. (17.6). An integration by parts is performed on the second term of this equation, leading to % % * + N T F I + N T F C + F dC dα1 = N T (F G + F ext ) dα1 . 0
0
Because this set of algebraic equations is nonlinear, a linearization process is required to solve it. Equations (17.90), (17.93), and (17.100) are introduced to ſnd % * + ! " N T F I + MΔ¨ dα1 u + N T F C + SΔu + F dC + S d Δu + E d Δu˙ 0
%
N T (F G + F ext ) dα1 .
= 0
Next, the displacement, velocity, and acceleration ſelds of the element are expressed in terms of their nodal values using the assumed shape functions to ſnd ¨ ¨ u(α1 ) = N (α1 )ˆ u, u(α ˙ 1 ) = N (α1 )u ˆ˙ , and u ¨ (α1 ) = N (α1 )u ˆ , where u ˆ, u ˆ˙ , and u ˆ are the nodal displacements, velocities, and accelerations, respectively. With the help of these interpolations, the weak statement of dynamic equilibrium, eq. (17.102), becomes ¨ˆ + GΔ ˆ Δu ˆ u ˆ u = Fˆ G + Fˆ ext − Fˆ . M ˆ˙ + KΔˆ The mass, gyroscopic, and stiffness matrices of the cable element are % ˆ = N T M N dα1 , M
(17.103a)
0
%
ˆ= G 0
% ˆ = K
N T E d N dα1 ,
(17.103b)
* + N T S + S d N dα1 ,
(17.103c)
0
respectively, and the internal, gravity, and externally applied loads are
17.7 Finite element implementation of beam elements
ˆ = F
% * + N T F I + N T F C + F dC dα1 , 0
G Fˆ =
%
N T F G dα1 , 0
ext Fˆ =
%
689
(17.104a)
N T F ext dα1 ,
(17.104b)
0
respectively.
17.7 Finite element implementation of beam elements The formulation of the governing equations for beams is presented in section 16.3. This section develops the implementation of the beam governing equations within the ſnite element framework. Because the expressions for the various forces present in the beam are far more complex and nonlinear than those characterizing cables, the linearization process is more arduous, although the ſnal discretized equations are formally identical. Inertial, elastic and dissipative forces and their linearizations are presented in sections 17.7.1, 17.7.2, and 17.7.3, respectively. The last section presents the discretization of all forces, leading to the discretized force array and associated mass, gyroscopic, and stiffness matrices. 17.7.1 Inertial forces The inertial forces actin in the beam are obtained from the governing equations of motion, eqs. (16.63), 00 F I = P˙ + ˙ P, (17.105) u 0 where P is the momentum array resolved in the inertial system, deſned by eq. (16.61). In view of eq. (16.60), this momentum array can be expressed as P = M V, where the sectional mass matrix resolved in the inertial system is ηT mI m ∗ T . M = (R R0 )M (R R0 ) = m η
(17.106)
(17.107)
The location of the sectional center of mass and its moment of inertia tensor, both resolved in the inertial frame, are deſned as η = (R R0 )η ∗ , = (R R0 )∗ (R R0 )T , respectively. Expanding eq. (17.106) now leads to ηT ω mu˙ + m P = m (17.108) η u˙ + ω .
690
17 Finite element tools
The time derivatives of the location of the sectional mass center and its moment of inertia tensor are mη˙ = ω mη, and ˙ = ω + ω T , respectively. The time derivative of this momentum array, eq. (17.106), then becomes , ˙ + ω ω )mη m¨ u + (ω ˙ . (17.109) P= T m ηu ¨+u ˙ ω mη + ω ω + ω˙ Finally, the inertial forces, eq. (17.105), can be written in a compact form as , ˙ + ω ω )mη m¨ u + (ω I F = . (17.110) m ηu ¨ + ω˙ + ω ω The inertial forces are expressed in terms of physical quantities, the angular velocity and acceleration. In practical implementations of the ſnite element method, nodal rotations must parameterized using any of the techniques described in chapter 13. When using the vectorial parameterization of rotation, the angular velocity ˙ p˙ + H ¨ and acceleration vectors are expressed as ω = H p, ˙ eq. (13.54), and ω˙ = H p, respectively, where p is rotation parameter vector and H(p) the tangent tensor, eq. (13.55). Of course, the interpolation the rotation ſeld must be performed carefully, according to the algorithm presented in section 17.2. Linearization of inertial forces The expression for the inertial forces given above is nonlinear, and the solution process will require linearization of these forces. First, it will be necessary to compute increments of the sectional mass center location and of the sectional moment of in ertia tensor, which are found to be mΔη = m η T Δψ and (Δ)b = ( b − b)Δψ, respectively, where b is an arbitrary vector. Linearization of the inertial forces then yields Δu Δu˙ Δ¨ u +G +M , (17.111) ΔF I = K Δψ Δω Δω˙ where K, G, and M are the stiffness, gyroscopic, and mass matrices associated with the inertial forces, respectively. Simple algebra yields 0 (ω ˙ + ω ω )m ηT K= , (17.112a) ¨m η + ( ω ˙ − 0u ω) ˙ +ω ( ω − ω) T 0ω mη + ω m ηT G= , (17.112b) 0 ω − ω ηT mI m . (17.112c) M = m η Here again, these matrices are expressed in terms of physical quantities, the angular velocity and acceleration. Increments of these quantities are now related to increments in the rotation parameters using the chain rule for derivatives
17.7 Finite element implementation of beam elements
Δω =
∂ω ∂ω ˙ −ω Δp + Δp˙ = (H H)Δp + HΔp, ˙ ∂p ∂ p˙
691
(17.113)
˙ −ω H, where p is the rotation parameter vector and eq. (4.84), recast as ∂ω/∂p = H was used. Next, increments in the angular acceleration vector are evaluated by taking a time derivative of the above equation ¨ −ω ˙ −ω ˙ −ω Δω˙ = (H H ˙ H)Δp + (2H H)Δp˙ + HΔ¨ p.
(17.114)
Introducing these results into eq. (17.111) yields the following expression for increments in the inertial forces ΔF I = KI Δq + G I Δ˙q + MΔ¨q ,
(17.115)
where KI , G I , and M are the stiffness, gyroscopic, and mass matrices associated with the inertial forces, respectively. The incremental arrays of displacement, velocity, and acceleration arrays are deſned as Δq T = ΔuT , ΔpT , Δ˙q T =
T
T Δu˙ , Δp˙ T , and Δ¨q T = Δ¨ u , Δ¨pT , respectively. In summary, the inertial forces can be written in the following form m¨ u+α , (17.116) FI = m ηu ¨+γ and the stiffness, gyroscopic, and mass matrices are ˙ −α ¨ − 2βH ηT H H 0 m I , K = ¨m ¨ +εH ˙ + (u 0 H η−γ )H ˙ − 2βH ηT H 0 2m I , G = ˙ +εH 0 2 H ηT H mI m . M = m η H
(17.117a) (17.117b) (17.117c)
The following notations were introduced to simply the writing of the above expressions α = (ω ˙ + ω ω )mη, β = ω mη, γ = ω˙ + ω ω, and ε = ω + ( ω )T − ω. 17.7.2 Elastic forces The elastic forces acting in the beam element are obtained from eqs. (16.63) and will be treated in two separate components, denoted F C and F D , deſned as 0 0 N 0 , and F D = f , (17.118) = FC = f = M ( u0 + u )T 0 ( u0 + u )T N respectively. The components of the beam’s sectional force and moment vectors resolved in the inertial basis are denoted N and M , respectively.
692
17 Finite element tools
The sectional strains and curvatures deſned in eq. (16.45) are recast in the following compact notation u0 + u − (R R0 )¯ı1 e= , (17.119) k + R ki where k = axial(R RT ) are the components of the sectional curvature vector resolved in the inertial basis and ki the components of the corresponding curvature vector in the beam’s reference conſguration. The corresponding strain components resolved in the material basis are (R R0 )T E 1 − ¯ı1 e∗ = . (17.120) (R R0 )T (k + R k i ) The elastic forces in the beam are then f = C e, where C = (R R0 )C ∗ (R R0 )T is the sectional stiffness matrix resolved in the inertial basis, and the corresponding stiffness matrix resolved in the material basis, C ∗ , is deſned by eq. (16.47). The elastic forces are expressed in terms of physical quantities, the sectional strain and curvatures. In practical implementations of the ſnite element method, nodal rotations must parameterized using any of the techniques described in chapter 13. When using the vectorial parameterization of rotation, the curvature vector will be expressed as k = H p , eq. (13.54), where p is rotation parameter vector and H(p) the tangent tensor, eq. (13.55). Linearization of elastic forces The expressions for the elastic forces given above are nonlinear, and the ſnite element process will require a linearization of these forces. First, increments in the curvature vector are computed using the chain rule for partial derivatives Δk =
∂k ∂k Δp + Δp = (H − kH)Δp + HΔp , ∂p ∂p
(17.121)
where p is the rotation parameter vector and eq. (4.84), recast as ∂k/∂p = H − kH, was used. Increments in the strain components, eq. (17.119), are now easily evaluated to ſnd , R0 )¯ı1 HΔp Δu + (R Δe = . HΔp + H Δp − ( k+R k i )HΔp This leads to the following expression for increments in the elastic forces , T HΔp N )HΔp + Δu ( u0 + u +C . Δf = T HΔp + H Δp M HΔp Taking variations of eq. (17.118) yields the following expression for increments in the elastic forces
17.7 Finite element implementation of beam elements
ΔF C = SΔq + OΔq,
ΔF D = PΔq + QΔq.
693
(17.122)
The incremental arrays of displacement and displacement rates are deſned as Δq T =
T
Δu , ΔpT , and Δq T = ΔuT , ΔpT , respectively. In summary, the elastic forces can be written in the following form F C = C e,
FD = Υ FC,
and the stiffness matrices are T H 0N u0 + u )H I 0 0 ( + C , O= , S=C T H 0H 0 H 0M 0 0 + Υ S, Q = Υ O. P= N 0 The following notation was introduced 0 0 . Υ = ( u0 + u )T 0
(17.123)
(17.124a) (17.124b)
(17.125)
17.7.3 Dissipative forces The beam model discussed in the previous section is a purely conservative model, because the elastic force are proportional the strain measures. It is often desirable to also introduce dissipative forces in the beam model. By analogy to eq. (16.47), the dissipative forces in the material frame, f d∗ , will be written as f d∗ =
N d∗ M d∗
= μC ∗ e˙ ∗ ,
(17.126)
where μ is the damping coefſcient of units 1/s, and e˙ ∗ the time rate of change of the strains measured in the material frame. Since the dissipative mechanisms in the beam are not well understood, it is postulated that the damping matrix is proportional to the stiffness matrix. The time rate of change of the sectional strains in the material frame are readily obtained from eq. (17.120) as , 1 ω) (R R0 )T (u˙ + E ∗ e˙ = . (17.127) (R R0 )T ω The dissipative forces in the inertial frame now become f d = μC e˙ ,
(17.128)
where f dT = N dT , M dT , and N d = (R R0 )N d∗ and M d = (R R0 )M d∗ are the are the sectional dissipative force and moment vector components in the inertial
694
17 Finite element tools
frame, respectively, C = (R R0 )C ∗ (R R0 )T is the sectional stiffness matrix resolved in the inertial basis, and e˙ is deſned as 1 ω u˙ + E . (17.129) e˙ = ω The dissipative forces will be treated in two separate components, denoted F dC and F dD , d 0 0 d 0 N dD , F f (17.130) = = F dC = f d = 1T 0 T N d . E Md E 1 Linearization of dissipative forces Because the expression for the dissipative forces is nonlinear, the solution process will require a linearization. Increments in the strain array are evaluated to ſnd , 1 (H ˙ −ω 1 HΔp˙ + ω H)Δp + E T Δu Δu˙ + E . (17.131) Δ˙e = ˙ −ω ˙ −ω (H H − ω H )Δp + (H H)Δp + H Δp˙ + HΔp˙ Next, increments in the dissipative forces are computed , dT HΔp 1 ω 1 )HΔp N (u ˙ + E −ω E d Δf = dT + μC + μCΔ˙e. ω HΔp M HΔp
(17.132)
Taking variations of eq. (17.130) yields the following expression for increments in the dissipative forces ΔF dC = S d Δq + Od Δq + G d Δ˙q + E d Δ˙q ,
(17.133a)
(17.133b)
ΔF
dD
= P Δq + Q Δq + X Δ˙q + Y Δ˙q . d
d
d
d
In summary, the dissipative forces can be written in the following form F dC = μC e˙ ,
F dD = Υ F dC ,
(17.134)
where the gyroscopic and stiffness matrices are T 1 )H + E 1 H ˙ dT H 0 ˙ − ω E 0 (u ω 0N d d S = μC , O = , dT H + μC 0 ˙ −ω 0 (H˙ − ω H) 0M H H (17.135) 1 H 0 0 IE d d , E = μC , (17.136) G = μC 0H 0 H and
P = d
0 0 + Υ S d, Nd 0
Qd = Υ O d ,
Matrix Υ is deſned in eq. (17.125).
X d = Υ Gd,
Y d = Υ E d.
(17.137)
17.7 Finite element implementation of beam elements
695
17.7.4 Gravity forces for beams For many applications, the gravity forces associated with the beam’s distributed mass must be taken into account. The potential of these gravity forces is V = mg T (u0 + u + η), where vector η deſnes the location of the sectional mass center, see eq. (16.56). A variation of this potential is easily found to be δV = g T (mδu + m η T δψ) and the gravity forces acting on the cross-section are readily found as mg FG = . (17.138) m ηg 17.7.5 Finite element formulation of beams With the notation deſned in eqs. (17.105) and (17.118), the equations of motion of curved beams, eqs. (16.63), can be recast in the following compact form, F I −F C + F D = F G + F ext , where F ext are the external forces applied to the beam. A weighted residual formulation will be used here to enforce these dynamic equilibrium conditions % ! " N T F I − F C + F D − F G − F ext dα1 = 0, 0
where is the length of the beam element and N a matrix storing the selected test functions, see eq. (17.6). An integration by parts is performed on the second term of this equation, leading to % % ! T I " N F + N T F C + N T F D dα1 = N T (F G + F ext ) dx1 . 0
0
Since this set of algebraic equations is nonlinear, a linearization process is required to solve it. Equations (17.115) and (17.122) are introduced to ſnd % * + N T F I + KI Δq + G I Δv + MΔa + F D + PΔq + QΔq 0
! " + N T F C + SΔq + OΔq dα1 =
%
N T (F G + F ext ) dα1 . 0
Next, the elemental displacement, velocity, and acceleration ſelds are expressed in terms of their nodal values using the assumed shape functions, q(x1 ) = N ˆq , ˆ, where ˆq , vˆ, and a ˆ are the nodal valq (x1 ) = N ˆq , v(x1 ) = N ˆv , a(x1 ) = N a ues of the displacements, velocities, and accelerations, respectively. With the help of these interpolations of elemental ſelds, the weak statement of dynamic equilibrium becomes ˆ Δˆ ˆ v + KΔˆ ˆ q = Fˆ G + Fˆ ext − Fˆ . M a + GΔˆ (17.139) The mass, gyroscopic, and stiffness matrices of the beam element are
696
17 Finite element tools
%
ˆ = M
N T MN dα1 ,
(17.140a)
N T G I N dα1 ,
(17.140b)
0
% ˆ= G
0
ˆ = K
% N T (KI + Q)N + N T P N + N T S N + N T O N dα1 , (17.140c) 0
respectively, whereas the elemental forces, gravity loads, and externally applied loads are % ! T I " ˆ F = (17.141a) N F + N T F D + N T F C dα1 , 0
Fˆ
G
%
0
respectively.
N T F G dα1 ,
=
ext Fˆ =
%
N T F ext dα1 , 0
(17.141b)
18 Mathematical tools
18.1 The singular value decomposition The singular value decomposition theorem [82] states that an arbitrary, n × m matrix A (n > m), of rank r, r ≤ m can be decomposed into the following matrix product Σ (r×r) 0(r×(m−r)) V T(m×m) , A(n×m) = U (n×n) (18.1) 0((n−r)×r) 0((n−r)×(m−r)) where n > m, r ≤ m, U and V are orthogonal matrices, and Σ = diag(σi ) a unique diagonal matrix with real, non-negative elements. The other matrices in eq. (18.1) are zero matrices with the corresponding size indicated by their subscript. The elements of Σ are arranged in descending order as σ1 ≥ σ2 ≥ σ3 ≥ . . . ≥ σr > σr+1 = . . . = σm = 0,
(18.2)
where the σi are called the singular values of A, and, again, r = rank(A); if A has full rank, r = m. Matrices U and V can be partitioned as ˇ V 1 (m×r) V 2 (m×(m−r)) , Γ (n×(n−r)) , and V U (n×n) = U = (n×r) (m×m) (18.3) respectively, and hence, eq. (18.1) can be recast as Σ V T1 (r×m) 0(r×(m−r)) (r×r) ˇ A(n×m) = U (n×r) Γ (n×(n−r)) , 0((n−r)×r) 0((n−r)×(m−r)) V T 2 ((m−r)×m) (18.4) ˇ , Γ , V T and V T are indicated by their subscript. where the size of the matrices U 1 2 Thus, eq. (18.4) can be simpliſed to be ˇ Σ V T, A=U 1
(18.5)
ˇ and V are the left and right singular vectors of A, respectively. The orthogi.e., U 1 onality of U implies the following relationships.
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3_18 © Springer Science+Business Media B.V. 2011
698
18 Mathematical tools
ˇU ˇ T + Γ Γ T = I, U T
(18.6a)
ˇ U ˇ = I, U
(18.6b)
T
(18.6c)
Γ Γ
T
= I,
and ſnally ˇ T Γ = 0, U
ˇ = 0. ΓTU
(18.7)
Transposing eq. (18.5) and post multiplying by Γ leads to AT Γ = 0,
(18.8)
where property (18.7) was used; clearly, Γ forms the null space of AT . When matrix A has full rank, i.e. r = m, eq. (18.1) reduces to A=U
Σ V T, 0
(18.9)
i.e. the partition of V is itself, and eq. (18.9) simpliſes to ˇ ΣVT A=U
(18.10)
18.2 The Moore-Penrose generalized inverse The Moore-Penrose generalized inverse of matrix A(n×m) with n ≥ m is the unique matrix, denoted A+ , that features the following properties A A+ A = A,
(18.11)
A+ A A+ = A+ ,
(18.12)
(A A+ ) = (A A+ )T ,
(18.13)
(A+ A) = (A+ A)T .
(18.14)
The Moore-Penrose inverse is most elegantly computed using the singular value ˇ Σ V T , and its Mooredecomposition. If A is of full rank, eq. (18.10) implies A = U Penrose inverse is then ˇT. A+ = V Σ −1 U (18.15) It is readily veriſed that this expression satisſes the four conditions for a MoorePenrose inverse, eqs. (18.11) to (18.14). It is also possible to express the MoorePenrose inverse without resorting to the singular value decomposition; indeed, it is readily veriſed that −1 T A+ = AT A A , (18.16) veriſes, once again, the four conditions (18.11) to (18.14).
18.3 Gauss-Legendre quadrature
699
18.2.1 Problems Problem 18.1. Properties of Moore-Penrose inverse Verify that the Moore-Penrose inverse given by eq. (18.15) satisſes the four conditions (18.11) to (18.14).
Problem 18.2. Properties of Moore-Penrose inverse Verify that the Moore-Penrose inverse given by eq. (18.16) satisſes the four conditions (18.11) to (18.14).
Problem 18.3. Properties of Moore-Penrose inverse Consider matrix A(m×n) = B (m×m) C (m×n) , m > n, where rank(A) = rank(C) = n
and rank(B) = m. Is the following statement true: A+ = C + B −1 ?
Problem 18.4. Properties of Moore-Penrose inverse Consider matrix A(m×n) = B (m×r) C (r×n) , m > n, where rank(A) = rank(B) =
rank(C) = r. Is the following statement true: A+ = C T (C C T )−1 (B T B)1 B T ?
18.3 Gauss-Legendre quadrature When applying energy methods, the computation of the stiffness matrix and load array involves integrations of the product of the shape functions by the stiffness properties of the structure. As the number of assumed shape functions increases, it becomes increasingly cumbersome to perform all these integrations in closed form, specially when the expression for the shape functions becomes complex. To circumvent this problem, numerical integration can be used. A very powerful tool for numerical integration is the Gauss-Legendre quadrature scheme. In its simplest form [5], this scheme approximately evaluates an integral by the following sum % +1 NG ( f (s) ds ≈ wi f (si ), (18.17) −1
i=1
where si , i = 1, 2, . . . NG are the Gauss-Legendre quadrature points, and wi the associated weights. The Gauss-Legendre quadrature points are often called sampling points, because the integral is evaluated by sampling the value of the integrand at these points. Table 18.1 lists the Gauss-Legendre quadrature points and associated weights for NG = 2, 3, and 4. The fundamental property of the NG point GaussLegendre quadrature scheme is that it exactly integrates a polynomial of degree 2NG − 1. To illustrate the application of the Gauss-Legendre quadrature scheme, consider the following integral % +1 4 I= x − 5x3 + 3x2 + 5x dx = 2.4. −1
At ſrst, the 2-point quadrature formula is used to ſnd
700
18 Mathematical tools Table 18.1. Gauss points and associated weights for NG = 2, 3, and 4. NG
si wi NG 2 ± 1/3 1 4 ± 3 ±
0 8/9 3/5 5/9
±
si wi √ (3 − 2 6/5)/7 (18 + 30)/36 √ (3 + 2 6/5)/7 (18 − 30)/36
1/2 3/2 2 1 1 1 1 I≈ +5 +3 −5 3 3 3 3 1/2 2 3/2 1 20 1 1 1 = 2.22. = + −5 +3 +5 3 3 3 3 9 This 2-point formula exactly integrates a polynomial of degree 2 × 2 − 1 = 3; hence, an approximate answer is expected for this integral involving a polynomial of degree four. The approximate answer only incurs a 7.4% error. Next, the 3-point quadrature formula is used, leading to 1/2 3/2 2 5 3 3 3 3 I≈ +5 +3 −5 9 5 5 5 5 1/2 2 3/2 3 3 60 5 3 3 = 2.4. = + −5 +3 +5 9 5 5 5 5 25 This 3-point formula exactly integrates a polynomial of degree 3 × 2 − 1 = 5; hence, the exact solution is recovered. Next, consider the following integral involving transcendental function %
5
I= 1
1 5 dx = [ln x]1 = ln 5 = 1.609. x
To recast the problem in the standard form, a change of variable, x = 2s + 3, is ſrst performed. The Jacobian of the coordinate transformation is readily evaluated, dx/ds = 2. The 2-point quadrature formula then yields a ſrst approximation of the integral % +1 36 1 1 dx 1 I= = ds ≈ 2 = 1.565, + 2s + 3 ds 23 −2 1/3 + 3 2 1/3 + 3 −1 which only involves a 2.75% error. To improve the approximation, the 3-point quadrature formula is used, leading to 2 8 476 5 5 I≈ + + = = 1.603. 9 −2 3/5 + 3 3 2 3/5 + 3 297
18.3 Gauss-Legendre quadrature
701
The error is now reduced to about 0.42%. Higher order Gauss-Legendre quadrature scheme can be derived that involve an increasing number of sampling points and associated weights. This data have been tabulated, see Abramowitz and Stegun [351], or can be readily calculated [5]. For integration over a rectangular domain, the basic Gauss-Legendre quadrature scheme of eq. (18.17) is generalized as %
+1
−1
%
+1
−1
f (s, t) dsdt ≈
MG NG ( (
wi wj f (si , tj ),
(18.18)
i=1 j=1
where the sampling points, si and tj , and associated weights, wi and wj , respectively, are those listed in table 18.1.
References
1. O. Bottema and B. Roth. Theoretical Kinematics. Dover Publications, Inc., New York, 1979. 2. F.B. Hildebrand. Advanced Calculus for Applications. Prentice Hall, Inc., Englewood Cliffs, New Jersey, second edition, 1976. 3. W. Fl¨ugge. Tensor Analysis and Continuum Mechanics. Springer-Verlag, New York, Heidelberg, Berlin, 1972. Ň Bj¨orck. Numerical Methods. Prentice Hall, Inc., Englewood Cliffs, 4. G. Dahlquist and A. New Jersey, 1974. 5. W.H. Press, B.P. Flannery, S.A. Teutolsky, and W.T. Vetterling. Numerical Recipes. The Art of Scientiſc Computing. Cambridge University Press, Cambridge, 1990. 6. F. Pfeiffer and C. Glocker. Multi-Body Dynamics with Unilateral Contacts. John Wiley & Sons, Inc, New York, 1996. 7. P.R. Dahl. Solid friction damping of mechanical vibrations. AIAA Journal, 14:1675– 1682, 1976. 8. T. Baumeister, E.A. Avallone, and T. Baumeister III (eds.). Marks’ Mechanical Engineers Handbook. McGraw-Hill Book Company, New-York, 1978. 9. J.E. Shigley and C.R. Mischke. Mechanical Engineering Design. McGraw-Hill Book Company, New York, 1989. 10. A.K. Banerjee and T.R. Kane. Modeling and simulation of rotor bearing friction. Journal of Guidance, Control and Dynamics, 17:1137–1151, 1994. 11. J. Srnik and F. Pfeiffer. Dynamics of CVT chain drives: Mechanical model and veriſcation. In Proceedings of the 16th Biennial Conference on Mechanical Vibration and Noise, Sacramento, CA, Sept. 14-17, 1997. 12. E. Rabinowicz. Friction and Wear of Materials. John Wiley & Sons, New York, second edition, 1995. 13. J.C. Oden and J.A.C. Martins. Models and computational methods for dynamic friction phenomena. Computer Methods in Applied Mechanics and Engineering, 52:527–634, 1985. 14. L. Euler. D´ecouverte d’un nouveau principe de m´ecanique. M´emoires de l’Acad´emie des Sciences de Berlin, 6(1752):185–217, 1750. 15. L. Euler. Nova methodus motum corporum rigidorum determinandi. Novi Commentari Academiae Scientiarum Imperialis Petropolitanae, 20:208–238, 1775. 16. L. Euler. De motu corporum circa pumctum ſxum mobilium. Opera Mechanica et Astronomica, 9(Series Secunda):413–441, 1776. Leonhardi Euleri Opera Omnia.
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3 © Springer Science+Business Media B.V. 2011
704
References
17. L. Euler. Formulae generales pro translatione quacunque corporum rigidorum. Novi Commentari Academiae Scientiarum Imperialis Petropolitanae, 20:189–207, 1775. 18. O. Rodrigues. Des lois g´eometriques qui r´egissent les d´eplacements d’un syst`eme solide dans l’espace, et de la variation des coordonn´ees provenant de ces d´eplacements consid´er´es ind´ependamment des causes qui peuvent les produire. Journal de Math´ematiques Pures et Appliqu´ees, 5:380–440, 1840. 19. J. Angeles. Fundamentals of Robotic Mechanical Systems. Theory, Methods, and Algorithms. Springer-Verlag, New York, 1997. 20. G. Mozzi. Discorso Matematico Sopra il Rotamento Momentaneo dei Corpi. Stamperia di Donato Campo, Napoli, Italy, 1763. 21. M. Chasles. Note sur les propri´et´es g´en´erales du syst`eme de deux corps semblables entre eux et plac´es d’une mani`ere quelconque dans l’espace; et sur le d´eplacement ſni, ou inſniment petit d’un corps solide libre. Bulletin des Sciences Math´ematiques de F´erussac, 14:321–326, 1830. 22. J. Angeles. The application of dual algebra to kinematic analysis. In J. Angeles and E. Zakhariev, editors, Computational Methods in Mechanical Systems, volume 161, pages 3–31. Springer-Verlag, Heidelberg, 1998. 23. A.K. Pradeep, P.J. Yoder, and R. Mukundan. On the use of dual matrix exponentials in robot kinematics. International Journal of Robotics Research, 8(5):57–66, 1989. 24. D.J. Ewins. Modal testing: theory and practice. Wiley, New York, 1984. 25. R. Weinstock. Calculus of Variations with Applications to Physics and Engineering. Dover Publications, Inc., New York, 1974. 26. E. Hairer and G. Wanner. Solving Ordinary Differential Equations II : Stiff and Differential-Algebraic Problems. Springer, Berlin, 1996. 27. H. Hochstadt. Differential Equations. Dover Publications, Inc., New York, 1964. 28. J. Garc´Æa de Jal´on, J. Unda, and A. Avello. Natural coordinates for the computer analysis of multibody systems. Computer Methods in Applied Mechanics and Engineering, 56:309–327, 1986. 29. J. Angeles. Spatial Kinematic Chains. Springer-Verlag, Berlin, 1982. 30. D. Li and P.W. Likins. Dynamics of a multibody system with relative translation on curved, ƀexible tracks. Journal of Guidance, Control and Dynamics, 10:299–306, 1987. 31. A. Cardona. An Integrated Approach to Mechanism Analysis. PhD thesis, Universit´e de Li`ege, Belgium, 1989. 32. O.A. Bauchau. On the modeling of prismatic joints in ƀexible multi-body systems. Computer Methods in Applied Mechanics and Engineering, 181:87–105, 2000. 33. O.A. Bauchau and C.L. Bottasso. Contact conditions for cylindrical, prismatic, and screw joints in ƀexible multi-body systems. Multibody System Dynamics, 5:251–278, 2001. 34. L. Piegl and W. Tiller. The Nurbs Book. Springer-Verlag, Berlin, New Jersey, second edition, 1997. 35. G.E. Farin. Curves and Surfaces for Computer Aided Geometric Design. Academic Press, Inc., Boston, third edition, 1992. 36. A. Cardona and M. G´eradin. Finite element modeling of ƀexible tracks. In C. L. Kirk and J.L. Junkins, editors, Dynamics of Flexible Structures in Space, pages 411–424. Springer-Verlag, 1990. 37. R.E. Roberson and R. Schwertassek. Dynamics of Multibody Systems. Springer-Verlag, Berlin, 1988. 38. P.E. Nikravesh. Computer-Aided Analysis of Mechanical Systems. Prentice-Hall, Englewood Cliffs, New Jersey, 1988.
References
705
39. F.M.L. Amirouche. Computational Methods in Multibody Dynamics. Prentice-Hall, Englewood Cliffs, New Jersey, 1992. 40. W.O. Schiehlen. Advanced Multibody System Dynamics. Kluwer Academic Publishers, Doordrecht, The Netherlands, 1993. 41. J. Garc´Æa de Jal´on and E. Bayo. Kinematic and Dynamic Simulation of Multibody Systems. The Real-Time Challenge. Springer-Verlag, New York, 1994. 42. A.A. Shabana. Dynamics of Multibody Systems. Cambridge University Press, third edition, 2005. 43. W.O. Schiehlen, editor. Multibody System Handbook. Springer-Verlag, Berlin, 1990. 44. A. Laulusa and O.A. Bauchau. Review of classical approaches for constraint enforcement in multibody systems. Journal of Computational and Nonlinear Dynamics, 3(1):011004 1–8, January 2008. 45. O.A. Bauchau and A. Laulusa. Review of contemporary approaches for constraint enforcement in multibody systems. Journal of Computational and Nonlinear Dynamics, 3(1):011005 1–8, January 2008. 46. W.O. Schiehlen. Dynamics of complex multibody systems. SM Archives, 9:159–195, 1984. 47. C.W. Gear and L.R. Petzold. ODE methods for the solution of differential/algebraic systems. SIAM Journal on Numerical Analysis, 21(4):716–728, 1984. 48. P. L¨otstedt and L.R. Petzold. Numerical solution of nonlinear differential equations with algebraic constraints I: Convergence results for backward differentiation formulas. Mathematics of Computation, 46(174):491–516, April 1986. 49. L.R. Petzold and P. L¨otstedt. Numerical solution of nonlinear differential equations with algebraic constraints. II: Practical implications. SIAM Journal on Scientiſc and Statistical Computing, 7(3):720–733, July 1986. 50. K.E. Brenan, S.L. Campbell, and L.R. Petzold. Numerical Solution of Initial-Value Problems in Differential-Algebraic Problems. North-Holland, New York, 1989. 51. G.A. Maggi. Principii della Teoria Matematica del Movimento dei Corpi: Corso di Meccanica Razionale. Ulrico Hoepli, Milano, 1896. 52. G.A. Maggi. Di alcune nuove forme delle equazioni della dinamica applicabili ai systemi anolonomi. Rendiconti della Regia Accademia dei Lincei, Serie V, X:287–291, 1901. 53. J.I. Neimark and N.A. Fufaev. Dynamics of Nonholonomic Systems. American Mathematical Society, Providence, Rhode Island, 1972. 54. A. Kurdila, J.G. Papastavridis, and M.P. Kamat. Role of Maggi’s equations in computational methods for constrained multibody systems. Journal of Guidance, Control, and Dynamics, 13(1):113–120, 1990. 55. J.G. Papastavridis. Maggi’s equations of motion and the determination of constraint reactions. Journal of Guidance, Control, and Dynamics, 13(2):213–220, 1990. 56. P.E. Nikravesh. Some methods for dynamic analysis of constrained mechanical systems: A survey. In E.J. Haug, editor, Computer Aided Analysis and Optimization of Mechanical Systems Dynamics, pages 351–367. Springer-Verlag, Berlin, Heidelberg, 1984. 57. H. Hemami and F.C. Weimer. Modeling of nonholonomic dynamic systems with applications. Journal of Applied Mechanics, 48:177–182, March 1981. 58. P. L¨otstedt. Mechanical systems of rigid bodies subjected to unilateral constraints. SIAM Journal of Applied Mathematics, 42(2):281–296, April 1982. 59. C.W Gear, B. Leimkuhler, and G.K. Gupta. Automatic integration of Euler-Lagrange equations with constraints. Journal of Computational and Applied Mathematics, 12 & 13:77–90, 1985. 60. T.R. Kane and C.F. Wang. On the derivation of equations of motion. Journal of the Society for Industrial and Applied Mathematics, 13(2):487–492, June 1965.
706
References
61. T.R. Kane and D.A. Levinson. Dynamics: Theory and Applications. McGraw-Hill Book Company, New York, 1985. 62. J. Garc´Æa de Jal´on, J. Unda, A. Avello, and J.M. Jim´enez. Dynamic analysis of threedimensional mechanisms in “natural” coordinates. Journal of Mechanisms, Transmissions, and Automation in Design, 109:460–465, December 1987. 63. J. Unda, J. Garc´Æa de Jal´on, F. Losantos, and R. Enparantza. A comparative study on some different formulations of the dynamic equations of constrained mechanical systems. Journal of Mechanisms, Transmissions, and Automation in Design, 109:466–474, December 1987. 64. M. Borri, C.L. Bottasso, and P. Mantegazza. Acceleration projection method in multibody dynamics. European Journal of Mechanics, A/Solids, 11(3):403–418, 1992. 65. F.E. Udwadia, R.E. Kalaba, and H.C. Eun. Equations of motion for constrained mechanical systems and the extended d’Alembert’s principle. Quarterly of Applied Mathematics, LV(2):321–331, 1997. 66. F.E. Udwadia and R.E. Kalaba. A new perspective on constrained motion. Proceedings of the Royal Society London, Series A, 439:407–410, 1992. 67. R.E. Kalaba and F.E. Udwadia. On constrained motion. Applied Mathematics and Computation, 51:85–86, 1992. 68. R.E. Kalaba and F.E. Udwadia. Equations of motion for nonholonomic, constrained dynamical systems via Gauss’s principle. ASME Journal of Applied Mechanics, 60:662– 668, 1993. 69. R.E. Kalaba and F.E. Udwadia. Lagrangian mechanics, Gauss’s principle, quadratic programming, and generalized inverses: New equations for nonholonomically constrained discrete mechanical systems. Quarterly of Applied Mathematics, LII(2):229–241, 1994. 70. F.E. Udwadia and R.E. Kalaba. On motion. Journal of the Franklin Institute, 330(3):571–577, 1993. 71. F.E. Udwadia and R.E. Kalaba. Equations of motion for mechnical systems. Journal of Aerospace Engineering, 9(3):64–69, July 1996. 72. F.E. Udwadia and R.E. Kalaba. The geometry of constrained motion. Zeitschrift f¨ur angewandte Mathematik und Mechanik, 75(8):637–640, 1995. 73. F.E. Udwadia and R.E. Kalaba. The explicit Gibbs-Appell equation and generalized inverse forms. Quarterly of Applied Mathematics, LVI(2):277–288, 1998. 74. F.E. Udwadia and R.E. Kalaba. What is the general form of the explicit equations of motion for constrained mechanical system. Journal of Applied Mechanics, 69:335–339, May 2002. 75. F.E. Udwadia and R.E. Kalaba. On the foundations of analytical dynamics. International Journal of Non-Linear Mechanics, 37:1079–1090, 2002. 76. F.E. Udwadia and R.E. Kalaba. Analytical Dynamics: A New Approach. Cambridge University Press, Cambridge, 1996. 77. N. Orlandea, M.A. Chace, and D.A. Calahan. A sparsity-oriented approach to the dynamic analysis and design of mechanical systems. Part I. ASME Journal of Engineering for Industry, 99(3):773–779, 1977. 78. N. Orlandea, D.A. Calahan, and M.A. Chace. A sparsity-oriented approach to the dynamic analysis and design of mechanical systems. Part II. ASME Journal of Engineering for Industry, 99(3):780–784, 1977. 79. S.L. Campbell and B. Leimkuhler. Differentiation of constraints in differential-algebraic equations. Mechanics of Structures and Machines, 19(1):19–39, 1991. 80. H. Brauchli. Mass-orthogonal formulation of equations of motion for multibody systems. Journal of Applied Mathematics and Physics (ZAMP), 42:169–182, 1991.
References
707
81. H. Brauchli and R. Weber. Dynamical equations in natural coordinates. Computer Methods in Applied Mechanics and Engineering, 91:1403–1414, 1991. 82. G.H. Golub and C.F. van Loan. Matrix Computations. The Johns Hopkins University Press, Baltimore, second edition, 1989. 83. W. Blajer. A projection method approach to constrained dynamic analysis. Journal of Applied Mechanics, 59:643–649, September 1992. 84. W. Blajer, W. Schiehlen, and W. Schirm. A projective criterion to the coordinate partitioning method for multibody dynamics. Archive of Applied Mechanics, 64:86–98, 1994. 85. W. Blajer. A geometric uniſcation of constrained system dynamics. Multibody System Dynamics, 1:3–21, 1997. ¨ 86. C.F. Gauss. Uber ein neues algemeines Grundgesetz der Mechanik. Zeitschrift f¨ur die reine und angewandte Mathematik, 4:232–235, 1829. 87. C. Lanczos. The Variational Principles of Mechanics. Dover Publications, Inc., New York, 1970. 88. L. Lilov and M. Lorer. Dynamic analysis of multirigid-body systems based on Gauss principle. Zeitschrift f¨ur angewandte Mathematik und Mechanik, 62:539–545, 1982. 89. F. Cardin and G. Zanzotto. On constrained mechnical systems: D’Alembert’s and Gauss’ principles. Journal of Mathematical Physics, 30(7):1473–1479, July 1989. 90. M. Borri, C.L. Bottasso, and P. Mantegazza. Equivalence of Kane’s and Maggi’s equations. Meccanica, 25:272–274, 1990. 91. J. Angeles and S.K. Lee. The formulation of dynamical equations of holonomic mechanical systems using a natural orthogonal complement. ASME Journal of Applied Mechanics, 55:243–244, 1988. 92. P. Maißer. Analytical dynamics of multibody systems. Computer Methods in Applied Mechanics and Engineering, 91:1391–1396, 1991. 93. U. Jungnickel. Differential-algebraic equations in Riemannian spaces and applications to multibody system dynamics. Zeitschrift f¨ur angewandte Mathematik und Mechanik, 74(9):409–415, 1994. 94. H. Ess´en. On the geometry of nonholonomic dynamics. Journal of Applied Mechanics, 61:689–694, September 1994. 95. W. Blajer. A geometrical interpretation and uniform matrix formulation of multibody system dynamics. Zeitschrift f¨ur angewandte Mathematik und Mechanik, 81(4):247– 259, 2001. 96. C.W. Gear. Simultaneous numerical solution of differential-algebraic equations. IEEE Transactions on Circuit Theory, CT-18(1):89–95, January 1971. 97. C.W. Gear. Differential-algebraic equations. In E.J. Haug, editor, Computer Aided Analysis and Optimization of Mechanical Systems Dynamics, pages 323–334. SpringerVerlag, Berlin, Heidelberg, 1984. 98. E.J. Haug. Elements and methods of computational dynamics. In E.J. Haug, editor, Computer Aided Analysis and Optimization of Mechanical Systems Dynamics, pages 3–38. Springer-Verlag, Berlin, Heidelberg, 1984. 99. R.A. Wehage and E.J. Haug. Generalized coordinate partitioning for dimension reduction in analysis of constrained dynamic systems. ASME Journal of Mechanical Design, 104(1):247–255, January 1982. 100. W.C. Walton and E.C. Steeves. A new matrix theorem and its application for establishing independent coordinates for complex dynamical systems with constraints. Technical Report NASA TR R-326, NASA, 1969.
708
References
101. P.E. Nikravesh and I.S. Chung. Application of Euler parameters to the dynamic analysis of three-dimensional constrained mechanical systems. Journal of Mechanical Design, 104:785–791, October 1982. 102. R.P. Singh and P.W. Likins. Singular value decomposition for constrained dynamical systems. Journal of Applied Mechanics, 52:943–948, December 1985. 103. N.K. Mani, E.J. Haug, and K.E. Atkinson. Application of singular value decomposition for analysis of mechanical system dynamics. Journal of Mechanisms, Transmissions, and Automation in Design, 107:82–87, March 1985. 104. F.M.L. Amirouche, T. Jia, and S.K. Ider. A recursive Householder transformation for complex dynamical systems with constraints. Journal of Applied Mechanics, 55:729– 734, September 1988. 105. O.P. Agrawal and S. Saigal. Dynamic analysis of multi-body systems using tangent coordinates. Computers & Structures, 31(3):349–355, 1989. 106. C.G. Liang and G.M. Lance. A differentiable null space method for constrained dynamic analysis. Journal of Mechanisms, Transmissions and Automation in Design, 109:405– 411, September 1987. 107. C. Wampler, K. Bufſnton, and J. Shu-hui. Formulation of equations of motion for systems subject to constraints. Journal of Applied Mechanics, 52:465–470, June 1985. 108. S.S. Kim and M.J. Vanderploeg. QR decomposition for state space representation of constrained mechanical dynamic systems. Journal of Mechanisms, Transmissions and Automation in Design, 108:183–188, June 1986. 109. J. Garc´Æa de Jal´on, M.A. Serna, and R. Avil´es. Computer method for kinematic analysis of lower-pair mechanisms - I Velocities and accelerations. Mechanism and Machine Theory, 16(5):543–556, 1981. 110. J. Garc´Æa de Jal´on, M.A. Serna, and R. Avil´es. Computer method for kinematic analysis of lower-pair mechanisms - II Position problems. Mechanism and Machine Theory, 16(5):557–566, 1981. 111. J. Garc´Æa de Jal´on, M.A. Serna, F. Viadero, and J. Flaquer. A simple numerical method for the kinematic analysis of spatial mechanisms. Journal of Mechanical Design, 104:78–82, January 1982. 112. J.A. T´arrago, M.A. Serna, C. Bastero, and J. Garc´Æa de Jal´on. A computer method for the ſnite displacement problem in spatial mechanisms. Journal of Mechanical Design, 104:869–874, October 1982. 113. M.A. Serna, R. Avil´es, and J. Garc´Æa de Jal´on. Dynamic analysis of plane mechanisms with lower pairs in basic coordinates. Mechanism and Machine Theory, 17(6):397–403, 1982. 114. J. Garc´Æa de Jal´on, J.M. Jim´enez, A. Avello, F. Mart´Æn, and J. Cuadrado. Real time simulation of complex 3-D multibody systems with realistic graphics. In E.J. Haug and R.C. Deyo, editors, Real-Time Integration Methods for Mechanical System Simulation, pages 265–292. Springer-Verlag, Berlin, Heidelberg, 1990. 115. A. Avello, J.M. Jim´enez, E. Bayo, and J. Garc´Æa de Jal´on. A simple and highly parallelizable method for real-time dynamic simulation based on velocity transformations. Computer Methods in Applied Mechanics and Engineering, 107:313–339, 1993. 116. J.W. Kamman and R.L. Huston. Constrained multibody system dynamics - An automated approach. Computers & Structures, 18(6):999–1003, 1984. 117. J.W. Kamman and R.L. Huston. Dynamics of constrained multibody systems. ASME Journal of Applied Mechanics, 51:899–903, December 1984. 118. J.C. Chiou, K.C. Park, and C. Farhat. A natural partitioning scheme for parallel simulation of multibody systems. International Journal for Numerical Methods in Engineering, 36:945–967, 1993.
References
709
119. K.C. Park, J.C. Chiou, and Downer J.D. Explicit-implicit staggered procedure for multibody dynamics analysis. Journal of Guidance, Control, and Dynamics, 13(3):562–570, May-June 1990. 120. A. Arabyan and F. Wu. An improved formulation for constrained mechanical systems. Multibody System Dynamics, 2:49–69, 1998. 121. W. Blajer. Projective formulation of Maggi’s method for nonholonomic system analysis. Journal of Guidance, Control, and Dynamics, 15(2):522–525, 1992. 122. W. Blajer. An orthonormal tangent space method for constrained multibody systems. Computer Methods in Applied Mechanics and Engineering, 121:45–57, 1995. 123. M. Borri, C.L. Bottasso, and P. Mantegazza. A modiſed phase space formulation for constrained mechanical systems - Differential approach. European Journal of Mechanics, A/Solids, 11(5):701–727, 1992. 124. K.C. Park and J.C. Chiou. Stabilization of computational procedures for constrained dynamical systems. Journal of Guidance, Control, and Dynamics, 11(4):365–370, JulyAugust 1988. 125. C.W. Gear. Differential-algebraic equation index transformations. SIAM Journal on Scientiſc and Statistical Computing, 9(1):40–47, January 1988. 126. C.W. Gear. Maintaining solution invariants in the numerical solution of ODEs. SIAM Journal on Scientiſc and Statistical Computing, 7(3):734–743, July 1986. 127. L.F. Shampine. Conservation laws and the numerical solution of ODEs. Computers and Mathematics with Applications, 12B(5/6):1287–1296, 1986. 128. E. Eich. Convergence results for a coordinate projection method applied to mechanical systems with algebraic constraints. SIAM Journal on Numerical Analysis, 30(5):1467– 1482, October 1993. 129. J. Yen, E.J. Haug, and T.O. Tak. Numerical methods for constrained equations of motion in mechanical system dynamics. Mechanics of Structures and Machines, 19(1):41–76, 1991. 130. J. Yen. Constrained equations of motion in multibody dynamics as ODEs on manifolds. SIAM Journal on Numerical Analysis, 30(2):553–568, 1993. 131. F.A. Potra and J. Yen. Implicit numerical integration for euler-lagrange equations via tangent space parameterization. Mechanics of Structures and Machines, 19(1):77–98, 1991. 132. E.J. Haug and J. Yen. Implicit numerical integration of constrained equations of motion via generalized coordinate partitioning. Journal of Mechanical Design, 114:296–304, June 1992. 133. J. Yen and L.R. Petzold. An efſcient Newton-type iteration for the numerical solution of highly oscillatory constrained multibody dynamic systems. SIAM Journal on Scientiſc Computing, 19(5):1513–1534, September 1998. 134. J. Yen, L.R. Petzold, and S. Raha. A time integration algorithm for ƀexible mechanism dynamics: The DAE α-method. Computer Methods in Applied Mechanics and Engineering, 158:341–355, 1998. 135. H.M. Hilber, T.J.R. Hughes, and R.L. Taylor. Improved numerical dissipation for time integration algorithms in structural dynamics. Earthquake Engineering and Structural Dynamics, 5:283–292, 1977. 136. J. Chung and G.M. Hulbert. A time integration algorithm for structural dynamics with improved numerical dissipation: The generalized-α method. Journal of Applied Mechanics, 60:371–375, 1993. 137. F.C. Tseng, Z.D. Ma, and G.M. Hulbert. Efſcient numerical solution of constrained multibody dynamics systems. Computer Methods in Applied Mechanics and Engineering, 192:439–472, 2003.
710
References
138. M. Borri, L. Trainelli, and A. Croce. The embedded projection method: A general index reduction procedure for constrained system dynamics. Computer Methods in Applied Mechanics and Engineering, 195(50-51):6974–6992, 2006. 139. J. Parczewski and W. Blajer. On realization of program constraints: Part I - Theory. Journal of Applied Mechanics, 56:676–679, September 1989. 140. W. Blajer and J. Parczewski. On realization of program constraints. Part II - Practical implications. Journal of Applied Mechanics, 56:680–684, September 1989. 141. J.W. Baumgarte. Stabilization of constraints and integrals of motion in dynamic systems. Computer Methods in Applied Mechanics and Engineering, 1:1–16, 1972. 142. G.P. Ostermeyer. On Baumgarte stabilization for differential algebraic equations. In E.J. Haug and R.C. Deyo, editors, Real-Time Integration Methods for Mechanical System Simulation, pages 193–207. Springer-Verlag, Berlin, Heidelberg, 1990. 143. E. Eich and M. Hanke. Regularization methods for constrained mechanical multibody systems. Zeitschrift f¨ur angewandte Mathematik und Mechanik, 75(10):761–773, 1995. 144. P.E. Nikravesh, R.A. Wehage, and O.K. Kwon. Euler parameters in computational dynamics and kinematics. Part I and Part II. Journal of Mechanisms, Transmissions, and Automation in Design, 107(3):358–369, September 1985. 145. T.W. Park and E.J. Haug. A hybrid numerical integration method for machine dynamic simulation. Journal of Mechanisms, Transmissions, and Automation in Design, 108:211–216, June 1986. 146. C.O. Chang and P.E. Nikravesh. An adaptive constraint violation stabilization method for dynamic analysis of mechanical systems. Journal of Mechanisms, Transmissions, and Automation in Design, 107:488–492, December 1985. 147. D.S. Bae and S.M. Yang. A stabilization method for kinematic and kinetic constraint equations. In E.J. Haug and R.C. Deyo, editors, Real-Time Integration Methods for Mechanical System Simulation, pages 209–232. Springer-Verlag, Berlin, Heidelberg, 1990. 148. S. Yoon, R.M. Howe, and D.T. Greenwood. Stability and accuracy analysis of Baumgarte’s constraint violation stabilization method. Journal of Mechanical Design, 117:446–453, September 1995. 149. J.C. Chiou and S.D. Wu. Constraint violation stabilization using input-output feedback linearization in multibody dynamic analysis. Journal of Guidance, Control, and Dynamics, 21(2):222–228, March-April 1998. 150. S.T. Lin and M.C. Hong. Stabilization method for numerical integration of multibody mechanical systems. Journal of Mechanical Design, 120:565–572, December 1998. 151. E. Bayo, J. Garc´Æa de Jal´on, and M.A. Serna. A modiſed Lagrangian formulation for the dynamic analysis of constrained mechanical systems. Computer Methods in Applied Mechanics and Engineering, 71:183–195, November 1988. 152. E. Bayo, J. Garc´Æa de Jal´on, A. Avello, and J. Cuadrado. An efſcient computational method for real time multibody dynamic simulation in fully Cartesian coordinates. Computer Methods in Applied Mechanics and Engineering, 92:377–395, 1991. 153. A.J. Kurdila, J.L. Junkins, and S. Hsu. Lyapunov stable penalty methods for imposing nonholonomic constraints in multibody system dynamics. Nonlinear Dynamics, 4:51– 82, 1993. 154. S. Yoon, R.M. Howe, and D.T. Greenwood. Geometric elimination of constraint violations in numerical simulation of lagrangian equations. Journal of Mechanical Design, 116:1058–1064, December 1994. 155. W. Blajer. Elimination of constraint violation and accuracy aspects in numerical simulation of multibody systems. Multibody System Dynamics, 7:265–284, 2002. 156. J.W. Baumgarte. A new method of stabilization for holonomic constraints. ASME Journal of Applied Mechanics, 50:869–870, December 1983.
References
711
157. Z. Terze, D. Lefeber, and O. Mufti´c. Null space integration method for constrained multibody system simulation with no constraint violation. Multibody System Dynamics, 6:229–243, 2001. 158. E. Bayo and A. Avello. Singularity-free augmented Lagrangian algorithms for constrained multibody dynamics. Nonlinear Dynamics, 5:209–231, 1994. 159. E. Bayo and R. Ledesma. Augmented Lagrangian and mass-orthogonal projection methods for constrained multibody dynamics. Nonlinear Dynamics, 9:113–130, 1996. 160. W.O. Schiehlen. Computational aspects in multibody system dynamics. Computer Methods in Applied Mechanics and Engineering, 90:569–582, 1991. 161. J. Cuadrado, J. Cardenal, and Bayo. E. Modeling and solution methods for efſcient real-time simulation of multibody dynamics. Multibody System Dynamics, 1:259–280, 1997. 162. A.A. Shabana. Flexible multibody dynamics: Review of past and recent developments. Multibody System Dynamics, 1(2):189–222, June 1997. 163. T.M. Wasfy and A.K. Noor. Computational strategies for ƀexible multibody systems. ASME Applied Mechanics Reviews, 56(2):553–613, 2003. 164. M. G´eradin and A. Cardona. Flexible Multibody System: A Finite Element Approach. John Wiley & Sons, New York, 2001. 165. P.W. Likins. Modal method for analysis of free rotations spacecraft. AIAA Journal, 5(7):1304–1308, July 1967. 166. R.D. Milne. Some remarks on the dynamics of deformable bodies. AIAA Journal, 6(3):556–558, March 1968. 167. B. Fraeijs de Veubeke. The dynamics of ƀexible bodies. International Journal of Engineering Science, 14(10):895–913, 1976. 168. J.R. Cavanin and P.W. Likins. Floating reference frames for ƀexible spacecraft. Journal of Spacecraft, 14(12):724–732, 1977. 169. O.P. Agrawal and A.A. Shabana. Dynamic analysis of multibody systems using component modes. Computers & Structures, 21(6):1303–1312, 1985. 170. W.S. Yoo and E.J. Haug. Dynamics of articulated structures. Part I. Theory. Journal of Structural Mechanics, 14:105–126, 1986. 171. W.S. Yoo and E.J. Haug. Dynamics of articulated structures. Part II. Computer implementation and applications. Journal of Structural Mechanics, 14:177–189, 1986. 172. A. Cardona and M. G´eradin. Modelling of superelements in mechanism analysis. International Journal for Numerical Methods in Engineering, 32:1565–1593, 1991. 173. O. Friberg. A method for selecting deformation modes in ƀexible multibody dynamics. International Journal for Numerical Methods in Engineering, 32:1637–1655, 1991. 174. A. Cardona and M. G´eradin. A superelement formulation for mechanism analysis. Computer Methods in Applied Mechanics and Engineering, 100:1–29, 1992. 175. R. Schwertassek, O. Wallrapp, and A.A. Shabana. Flexible multibody simulation and choice of shape functions. Nonlinear Dynamics, 20(4):361–380, 1999. 176. R. Schwertassek, S.V. Dombrowski, and O. Wallrapp. Modal representation of stress in ƀexible multibody simulation. Nonlinear Dynamics, 20(4):381–399, 1999. 177. A. Cardona. Superelements modelling in ƀexible multibody dynamics. Multibody System Dynamics, 4:245–266, 2000. 178. A.A. Shabana. Substructure synthesis methods for dynamic analysis of multi-body systems. Computers & Structures, 20:737–744, 1985. 179. O.P. Agrawal and A.A. Shabana. Application of deformable-body mean axis to ƀexible multibody system dynamics. Computer Methods in Applied Mechanics and Engineering, 56(2):217–245, 1986.
712
References
180. A.A. Shabana. Resonance conditions and deformable body co-ordinate systems. Journal of Sound and Vibration, 192:389–398, 1996. 181. P. Ravn. Analysis and Synthesis of Planar Mechanical Systems Including Flexibility, Contact and Joint Clearance. PhD thesis, Technical University of Denmark, 1998. 182. C.B. Drab, J.R. Haslinger, R.U. Pfau, and G. Offner. Comparison of the classical formulation with the reference conditions formulation for dynamic ƀexible multibody systems. Journal of Computational and Nonlinear Dynamics, 2(4):337–344, 2007. 183. L. Meirovitch. Elements of Vibration Analysis. McGraw-Hill Book Company, New York, 1975. 184. J.A.C. Ambr´osio and J.P.C. Gonc¸alves. Complex ƀexible multibody systems with application to vehicle dynamics. In Jorge A.C. Ambr´osio and Werner O. Schiehlen, editors, Advances in Computational Multibody Dynamics, IDMEC/IST, Lisbon, Portugal, Sept. 20-23, 1999, pages 241–258, 1999. 185. J.A.C. Ambr´osio. Geometric and material nonlinear deformations in ƀexible multibody systems. In Jorge Ambr´osio and Michal Kleiber, editors, Proceedings of Computational Aspects of Nonlinear Structural Systems with Large Rigid Body Motion, NATO Advanced Research Workshop, Pultusk, Poland, July 2-7, pages 91–115, 2000. 186. R.R. Craig and M.C. Bampton. Coupling of substructures for dynamic analyses. AIAA Journal, 6:1313–1319, 1968. 187. R.H. MacNeal. A hybrid method of component mode synthesis. Computers & Structures, 1(4):581–601, 1971. 188. S. Rubin. Improved component-mode representation for structural dynamic analysis. AIAA Journal, 13:995–1006, 1975. 189. D.N. Herting. A general purpose, multi-stage, component modal synthesis method. Finite Elements in Analysis and Design, 1:153–164, 1985. 190. R.M. Hintz. Analytical methods in component modal synthesis. AIAA Journal, 13:1007– 1016, 1975. 191. R.R. Craig and C. Chang. Free-interface methods of substructure coupling for dynamic analysis. AIAA Journal, 14:1633–1635, 1976. 192. W.S. Yoo and E.J. Haug. Dynamics of ƀexible mechanical systems using vibration and static correction modes. Journal of Mechanisms, Transmissions, and Automation in Design, 108:315–322, 1986. 193. E.J. Wu, S. Haug. Geometric non-linear substructuring for dynamics of ƀexible mechanical elements. International Journal for Numerical Methods in Engineering, 26:2211– 2226, 1988. 194. O.A. Bauchau and J. Rodriguez. Formulation of modal based elements in nonlinear, ƀexible multibody dynamics. Journal of Multiscale Computational Engineering, 1(2):161– 180, 2003. 195. O.A. Bauchau, J. Rodriguez, and S.Y. Chen. Coupled rotor-fuselage analysis with ſnite motions using component mode synthesis. Journal of the American Helicopter Society, 49(2):201–211, 2004. 196. A.A. Shabana and R.A. Wehage. A coordinate reduction technique for dynamic analysis of spatial substructures with large angular rotations. Journal of Structural Mechanics, 11(3):401–431, March 1983. 197. T.J.R. Hughes. The Finite Element Method. Prentice Hall, Inc., Englewood Cliffs, New Jersey, 1992. 198. K.J. Bathe. Finite Element Procedures. Prentice Hall, Inc., Englewood Cliffs, New Jersey, 1996. 199. L.F. Shampine and M.K. Gordon. Computer Solution of Ordinary Differential Equations: The Initial Value Problem. W.J. Freeman, San Francisco, CA, 1975.
References
713
200. A. Cardona and M. G´eradin. Time integration of the equations of motion in mechanism analysis. Computers & Structures, 33(3):801–820, 1989. 201. N.M. Newmark. A method of computation for structural dynamics. Journal of the Engineering Mechanics Division, 85:67–94, 1959. 202. T.J.R. Hughes. Analysis of transient algorithms with particular reference to stability behavior. In T. Belytschko and T.J.R. Hughes, editors, Computational Methods for Transient Analysis, pages 67–155. North-Holland, Amsterdam, 1983. 203. J.C. Simo and K. Wong. Unconditionally stable algorithms for rigid body dynamics that exactly preserve energy and momentum. International Journal for Numerical Methods in Engineering, 31:19–52, 1991. 204. J.C. Simo and N. Tarnow. The discrete energy-momentum method. Conserving algorithms for nonlinear dynamics. ZAMP, 43:757–792, 1992. 205. P. Betsch and P. Steinmann. Constrained integration of rigid body dynamics. Computer Methods in Applied Mechanics and Engineering, 191:467–488, 2001. 206. J.C. Simo, N. Tarnow, and M. Doblare. Non-linear dynamics of three-dimensional rods: Exact energy and momentum conserving algorithms. International Journal for Numerical Methods in Engineering, 38:1431–1473, 1995. 207. S. Leyendecker, P. Betsch, and P. Steinmann. Objective energy-momentum conserving integration for the constrained dynamics of geometrically exact beams. Computer Methods in Applied Mechanics and Engineering, 195:2313–2333, 2006. 208. J.C. Simo and N. Tarnow. A new energy and momentum conserving algorithm for the nonlinear dynamics of shells. International Journal for Numerical Methods in Engineering, 37:2527–2549, 1994. 209. P. Betsch and N. S¨anger. On the use of geometrically exact shells in a conserving framework for ƀexible multibody dynamics. Computer Methods in Applied Mechanics and Engineering, 198:1609–1630, 2009. 210. O.A. Bauchau, G. Damilano, and N.J. Theron. Numerical integration of nonlinear elastic multi-body systems. International Journal for Numerical Methods in Engineering, 38:2727–2751, 1995. 211. O.A. Bauchau, C.L. Bottasso, and L. Trainelli. Robust integration schemes for ƀexible multibody systems. Computer Methods in Applied Mechanics and Engineering, 192(34):395–420, 2003. 212. C.L. Bottasso and M. Borri. Energy preserving/decaying schemes for non-linear beam dynamics using the helicoidal approximation. Computer Methods in Applied Mechanics and Engineering, 143:393–415, 1997. 213. O.A. Bauchau, J.Y. Choi, and C.L. Bottasso. Time integrators for shells in multibody dynamics. International Journal of Computers and Structures, 80:871–889, 2002. 214. C.L. Bottasso, M. Borri, and L. Trainelli. Integration of elastic multibody systems by invariant conserving/dissipating algorithms. Part I: formulation. Computer Methods in Applied Mechanics and Engineering, 190:3669–3699, 2001. 215. C.L. Bottasso, M. Borri, and L. Trainelli. Integration of elastic multibody systems by invariant conserving/dissipating algorithms. Part II: numerical schemes and applications. Computer Methods in Applied Mechanics and Engineering, 190:3701–3733, 2001. 216. C.L. Bottasso and L. Trainelli. An attempt at the classiſcation of energy decaying schemes for structural and multibody dynamics. Multibody System Dynamics, 12:173– 185, 2004. 217. O.A. Bauchau and N.J. Theron. Energy decaying schemes for nonlinear elastic multibody systems. Computers & Structures, 59(2):317–331, 1996. 218. O.A. Bauchau. Computational schemes for ƀexible, nonlinear multi-body systems. Multibody System Dynamics, 2(2):169–225, 1998.
714
References
219. O.A. Bauchau and C.L. Bottasso. On the design of energy preserving and decaying schemes for ƀexible, nonlinear multi-body systems. Computer Methods in Applied Mechanics and Engineering, 169(1-2):61–79, 1999. 220. M. Borri, C.L. Bottasso, and L. Trainelli. Integration of elastic multibody systems by invariant conserving/dissipating algorithms. Part I: Formulation. Computer Methods in Applied Mechanics and Engineering, 190:3669–3699, 2001. 221. M. Borri, C.L. Bottasso, and L. Trainelli. Integration of elastic multibody systems by invariant conserving/dissipating algorithms. Part II: Numerical schemes and applications. Computer Methods in Applied Mechanics and Engineering, 190:3701–3733, 2001. 222. O. Gonzalez. Mechanical systems subject to holonomic constraints: Differentialalgebraic formulations and conservative integration. Physica D, 132:165–174, 1999. 223. O.A. Bauchau. A self-stabilized algorithm for enforcing constraints in multibody systems. International Journal of Solids and Structures, 40(13-14):3253–3271, 2003. 224. P. Betsch. The discrete null space method for the energy consistent integration of constrained mechanical systems. Part I: Holonomic constraints. Computer Methods in Applied Mechanics and Engineering, 194(50-52):5159–5190, 2005. 225. P. Betsch and S. Leyendecker. The discrete null space method for the energy consistent integration of constrained mechanical systems. Part II: Multibody dynamics. International Journal for Numerical Methods in Engineering, 67:499–552, 2006. 226. S. Leyendecker, P. Betsch, and P. Steinmann. The discrete null space method for the energy consistent integration of constrained mechanical systems. Part III: Flexible multibody dynamics. Multibody System Dynamics, 19(1-2):45–72, 2008. 227. P. Betsch and P. Steinmann. A DAE approach to ƀexible multibody dynamics. Multibody System Dynamics, 8:367–391, 2002. 228. C.W. Gear. Numerical Initial Value Problems in Ordinary Differential Equations. Prentice-Hall, Englewood Cliff, N.J., 1971. 229. M. Arnold. A perturbation analysis for the dynamical simulation of mechanical multibody systems. Applied Numerical Mathematics, 18:37–56, 1995. 230. O.A. Bauchau, C.L. Bottasso, and Y.G. Nikishkov. Modeling rotorcraft dynamics with ſnite element multibody procedures. Mathematical and Computer Modeling, 33(1011):1113–1137, 2001. 231. R.L. Fox. Optimization Methods for Engineering Design. Addison-Wesley Publishing Company, Reading, Massachusetts, 1971. 232. G.V. Reklaitis, A. Ravindran, and K.M. Ragsdell. Engineering Optimization. Methods and Applications. John Wiley & Sons, New York, 1983. 233. G.N. Vanderplaats. Numerical Optimization Techniques for Engineering: With Applications. McGraw-Hill Book Company, New-York, 1984. 234. A. Cardona and M. G´eradin. Numerical integration of second order differentialalgebraic systems in ƀexible mechanism dynamics. In J. Ambrosio and M. Seabra Pereira, editors, Computer-Aided Analysis Of Rigid And Flexible Mechanical Systems, pages 501–529. NATO ASI Series, Kluwer Academic Publishers, 1993. 235. C.L. Bottasso, O.A. Bauchau, and A. Cardona. Time-step-size-independent conditioning and sensitivity to perturbations in the numerical solution of index three differential algebraic equations. SIAM Journal on Scientiſc Computing, 29(1):397–414, 2007. 236. C.L. Bottasso, D. Dopico, and L. Trainelli. On the optimal scaling of index-three DAEs in multibody dynamics. Multibody System Dynamics, 19:3–20, 2008. 237. P.E. Gill, W. Murray, M.A. Saunders, and M.H. Wright. Sequential quadratic programming methods for nonlinear programming. In E.J. Haug, editor, Computer-Aided Analysis and Optimization of Mechanical System Dynamics, pages 679–697. Springer-Verlag, Berlin, Heidelberg, 1984.
References
715
238. O.A. Bauchau, A. Epple, and C.L. Bottasso. Scaling of constraints and augmented Lagrangian formulations in multibody dynamics simulations. Journal of Computational and Nonlinear Dynamics, 4(2):021007 1–9, April 2009. 239. T.R. Kane. Dynamics. Holt, Rinehart and Winston, Inc, New York, 1968. 240. J. Argyris. An excursion into large rotations. Computer Methods in Applied Mechanics and Engineering, 32(1–3):85–155, 1982. 241. P.C. Hughes. Spacecraft Attitude Dynamics. John Wiley & Sons, New York, 1986. 242. H. Cheng and K.C. Gupta. A historical note on ſnite rotations. Journal of Applied Mechanics, 56:139–145, 1989. 243. M.D. Shuster. A survey of attitude representations. Journal of the Astronautical Sciences, 41(4):439–517, 1993. 244. A. Ibrahimbegovi´c. On the choice of ſnite rotation parameters. Computer Methods in Applied Mechanics and Engineering, 149:49–71, 1997. 245. P. Betsch, A. Menzel, and Stein. E. On the parameterization of ſnite rotations in computational mechanics. A classiſcation of concepts with application to smooth shells. Computer Methods in Applied Mechanics and Engineering, 155(3-4):273–305, 1998. 246. J. Stuelpnagel. On the parameterization of the three-dimensional rotation group. SIAM Review, 6(4):422–430, 1964. 247. W.R. Hamilton. Elements of Quaternions. Cambridge University Press, Cambridge, 1899. 248. A. Cayley. On certain results relating to quaternions. Philosophical Magazine, 26:141– 145, 1845. 249. K.W. Spring. Euler parameters and the use of quaternion algebra in the manipulation of ſnite rotations: A review. Mechanism and Machine Theory, 21:365–373, 1986. 250. R.A. Wehage. Quaternions and Euler parameters. A brief exposition. In E.J. Haug, editor, Computer Aided Analysis and Optimization of Mechanical Systems Dynamics, pages 147–180. Springer-Verlag, Berlin, Heidelberg, 1984. 251. R.M. Murray, Z. Li, and S.S. Sastry. A Mathematical Introduction to Robotic Manipulation. CRC Press, 1994. 252. M. G´eradin and A. Cardona. Kinematics and dynamics of rigid and ƀexible mechanisms using ſnite elements and quaternion algebra. Computational Mechanics, 4:115–135, 1989. 253. A.R. Klumpp. Singularity-free extraction of a quaternion from a direction-cosine matrix. Journal of Spacecraft and Rockets, 13:754–755, December 1976. 254. S.W. Shepperd. Quaternion from rotation matrix. Journal of Guidance and Control, 1:223–224, May-June 1978. 255. O.A. Bauchau and L. Trainelli. The vectorial parameterization of rotation. Nonlinear Dynamics, 32(1):71–92, 2003. 256. T.F. Wiener. Theoretical Analysis of Gimballess Inertial Reference Equipment Using Delta-Modulated Instruments. PhD thesis, Massachusetts Institute of Technology, Cambridge, Massachusetts, 1962. Department of Aeronautical and Astronautical Engineering. 257. V. Milenkovi´c. Coordinates suitable for angular motion synthesis in robots. In Proceedings of the Robot VI Conference, Detroit MI, March 2-4, 1982, 1982. Paper MS82-217. 258. S.R. Marandi and V.J. Modi. A preferred coordinate system and the associated orientation representation in attitude dynamics. Acta Astronautica, 15:833–843, 1987. 259. R.S. Ball. A Treatise on the Theory of Screws. Cambridge University Press, Cambridge, 1998.
716
References
260. J. Angeles. On twist and wrench generators and annihilators. In M.F.O. Seabra Pereira and J.A.C. Ambrosio, editors, Computer-Aided Analysis Of Rigid And Flexible Mechanical Systems, pages 379–411, Dordrecht, 1993. NATO ASI Series, Kluwer Academic Publishers. 261. D.P. Chevallier. Lie algebra, modules, dual quaternions and algebraic methods in kinematics. Mechanism and Machine Theory, 26(6):613–627, 1991. 262. A.T. Yang and F. Freudenstein. Application of dual-number quaternion algebra to the analysis of spatial mechanims. ASME Journal of Applied Mechanics, 86:300–308, 1964. 263. O.P. Agrawal. Hamilton operators and dual-number quaternions in spatial kinematics. Mechanism and Machine Theory, 22(6):569–575, 1987. 264. J.M. McCarthy. An Introduction to Theoretical Kinematics. The MIT Press, Cambridge, MA, 1990. 265. I.S. Fischer. Dual Number Methods in Kinematics, Statics and Dynamics. CRC Press, Boca Raton, 1999. 266. V. Brodsky and M. Shoham. The dual inertia operator and its application to robot dynamics. Journal of Mechanical Design, 116:1089–1095, 1994. 267. E. Pennestr`Æ and R. Stefanelli. Linear algebra and numerical algorithms using dual numbers. Multibody System Dynamics, 18:323–344, 2007. 268. T. Merlini and M. Morandini. The helicoidal modeling in computational ſnite elasticity. Part I: Variational formulation. International Journal of Solids and Structures, 41(1819):5351–5381, 2004. 269. M. Borri, L. Trainelli, and C.L. Bottasso. On representations and parameterizations of motion. Multibody Systems Dynamics, 4:129–193, 2000. 270. M. Borri and C.L. Bottasso. An intrinsic beam model based on a helicoidal approximation. Part I: Formulation. International Journal for Numerical Methods in Engineering, 37:2267–2289, 1994. 271. M. Borri and C.L. Bottasso. An intrinsic beam model based on a helicoidal approximation. Part II: Linearization and ſnite element implementation. International Journal for Numerical Methods in Engineering, 37:2291–2309, 1994. 272. O.A. Bauchau and J.Y. Choi. The vector parameterization of motion. Nonlinear Dynamics, 33(2):165–188, 2003. 273. S.P. Timoshenko and D.H. Young. Vibration Problems in Engineering. John Wiley & Sons, New York, 1974. 274. R. E. Nickell. Nonlinear dynamics by mode superposition. Computer Methods in Applied Mechanics and Engineering, 7(1):107–129, 1976. 275. J.M.T. Thompson and A.C. Walker. The non-linear perturbation analysis of discrete structural systems. International Journal of Solids and Structures, 4(8):281–288, 1968. 276. A.K. Noor and J.M. Peters. Reduced basis technique for nonlinear analysis of structures. AIAA Journal, 18(4):455–462, 1980. 277. A.K. Noor. Recent advances in reduction methods for nonlinear problems. Computers and Structures, 13(1-3):31–44, 1981. 278. O.A. Bauchau and D. Guernsey. On the choice of appropriate bases for nonlinear dynamic modal analysis. Journal of the American Helicopter Society, 38(4):28–36, 1993. 279. G.C. Ruzicka and D.H. Hodges. Application of the mixed ſnite element method to rotor blade modal reduction. Mathematical and Computer Modelling, 33(10-11):1177–1202, 2001. 280. G.C. Ruzicka and D.H. Hodges. A mixed ſnite element treatment of rotor blade elongation. Journal of the American Helicopter Society, 48(3):167–175, 2003.
References
717
281. J.C. Houbolt and G.W. Brooks. Differential equations of motion for combined ƀapwise bending, chordwise bending, and torsion of twisted nonuniform rotor blades. Technical Report 1348, NACA Report, 1958. 282. D.H. Hodges and E.H. Dowell. Nonlinear equations of motion for the elastic bending and torsion of twisted nonuniform rotor blades. Technical report, NASA TN D-7818, 1974. 283. O.A. Bauchau and D.H. Hodges. Analysis of nonlinear multi-body systems with elastic couplings. Multibody System Dynamics, 3(2):168–188, May 1999. 284. S.P. Timoshenko and Goodier J.N. Theory of Elasticity. McGraw-Hill Book Company, New York, third edition, 1970. 285. O.A. Bauchau and J.I. Craig. Structural Analysis with Application to Aerospace Structures. Springer, Dordrecht, Heidelberg, London, New-York, 2009. 286. K. Washizu. Variational Methods in Elasticity and Plasticity. Pergamon Press, Oxford, U.K., 1975. 287. I.H. Shames and C.L. Dym. Energy and Finite Element Methods in Structural Mechanics. Hemisphere Publishing Corp., 1985. 288. L. Anand. On H. Hencky’s approximate strain-energy function for moderate deformations. Journal of Applied Mechanics, 46:78–82, March 1979. 289. L. Anand. Moderate deformations in extension-torsion of incompressible isotropic elastic materials. Journal of the Mechanics and Physics of Solids, 34(3):293–304, 1986. 290. M. Degener, D.H. Hodges, and D. Petersen. Analytical and experimental study of beam torsional stiffness with large axial elongation. Journal of Applied Mechanics, 55:171– 178, March 1988. 291. R. Ledesma, Z.-D. Ma, G. Hulbert, and A. Wineman. A nonlinear viscoelastic bushing element in multibody dynamics. Computational Mechanics, 17(5):287–296, September 1996. 292. J. Kadlowec, A. Wineman, and G. Hulbert. Elastomer bushing response: Experiments and ſnite element modeling. Acta Mechanica, 163(5):25–38, 2003. 293. P. Masarati and M. Morandini. Intrinsic deformable joints. Multibody System Dynamics, 23(4):361–386, 2010. 294. MSC/ADAMS User’s Manual, 2007. 295. Abaqus Theory Manual, Abaqus Version 6.7 edition. 296. L.E. Malvern. Introduction to the Mechanics of a Continuous Medium. Prentice Hall, Inc., Englewood Cliffs, New Jersey, 1969. 297. L. Euler. Methodus Inveniendi Lineas Curvas Maximi Minimive Proprietate Gaudentes. Bousquet, Lausanne and Geneva, 1744. Appendix I: De Curvis Elasticis. 298. S.P. Timoshenko. On the correction factor for shear of the differential equation for transverse vibrations of bars of uniform cross-section. Philosophical Magazine, 41:744– 746, 1921. 299. S.P. Timoshenko. On the transverse vibrations of bars of uniform cross-section. Philosophical Magazine, 43:125–131, 1921. 300. E. Reissner. On one-dimensional ſnite-strain beam theory: the plane problem. Zeitschrift f¨ur angewandte Mathematik und Physik, 23:795–804, 1972. 301. E. Reissner. On one-dimensional large-displacement ſnite-strain beam theory. Studies in Applied Mathematics, 52:87–95, 1973. 302. E. Reissner. On ſnite deformations of space-curved beams. Zeitschrift f¨ur angewandte Mathematik und Physik, 32:734–744, 1981. 303. J.C. Simo. A ſnite strain beam formulation. the three-dimensional dynamic problem. Part I. Computer Methods in Applied Mechanics and Engineering, 49:55–70, 1985.
718
References
304. J.C. Simo and L. Vu-Quoc. A three dimensional ſnite strain rod model. Part II: Computational aspects. Computer Methods in Applied Mechanics and Engineering, 58:79–116, 1986. 305. M. Borri and T. Merlini. A large displacement formulation for anisotropic beam analysis. Meccanica, 21:30–37, 1986. 306. D.A Danielson and D.H. Hodges. Nonlinear beam kinematics by decomposition of the rotation tensor. Journal of Applied Mechanics, 54(2):258–262, 1987. 307. D.A Danielson and D.H. Hodges. A beam theory for large global rotation, moderate local rotation, and small strain. Journal of Applied Mechanics, 55(1):179–184, 1988. 308. V. Giavotto, M. Borri, P. Mantegazza, G. Ghiringhelli, V. Carmaschi, G.C. Mafſoli, and F. Mussi. Anisotropic beam theory and applications. Computers & Structures, 16(14):403–413, 1983. 309. M. Borri, G.L. Ghiringhelli, and T. Merlini. Composite beam analysis: Linear analysis of naturally curved and twisted anisotropic beams. Technical report, Politecnico di Milano, 1992. 310. V.L. Berdichevsky. On the energy of an elastic rod. Prikladnaya Matematika y Mekanika, 45(4):518–529, 1982. 311. D.H. Hodges. A review of composite rotor blade modeling. AIAA Journal, 28(3):561– 565, March 1990. 312. A.R. Atilgan, D.H. Hodges, and M.V. Fulton. Nonlinear deformation of composite beams: Uniſcation of cross-sectional and elastica analyses. Applied Mechanics Reviews, 44(11):S9–S15, November 1991. 313. A.R. Atilgan and D.H. Hodges. Uniſed nonlinear analysis for nonhomogeneous anisotropic beams with closed cross sections. AIAA Journal, 29(11):1990 – 1999, November 1991. 314. W.B. Yu, D.H. Volovoi, V.V. Hodges, and X. Hong. Validation of the variational asymptotic beam sectional (VABS) analysis. AIAA Journal, 40(10):2105–2112, October 2002. 315. B. Popescu and Hodges D.H. Asymptotic treatment of the trapeze effect in ſnite element cross-sectional analysis of composite beams. International Journal of Non-Linear Mechanics, 34(4):709–721, 1999. 316. D.H. Hodges. Nonlinear Composite Beam Theory. AIAA, Reston, Virginia, 2006. 317. A.K. Noor, T. Belytschko, and J.C. Simo, editors. Analytical and Computational Models of Shells, volume 3. ASME, CED, 1989. 318. A.K. Noor. Bibliography of monographs and surveys on shells. Applied Mechanics Review, 43(9):223–234, 1990. 319. W.B. Yu, D.H. Hodges, and V.V. Volovoi. Asymptotic generalization of ReissnerMindlin theory: Accurate three-dimensional recovery for composite shells. Computer Methods in Applied Mechanics and Engineering, 191(44):5087–5109, 2002. 320. W.B. Yu and D.H. Hodges. A geometrically nonlinear shear deformation theory for composite shells. Journal of Applied Mechanics, 71(1):1–9, 2004. 321. K.J. Bathe and E.N. Dvorkin. A four-node plate bending element based on Mindlin/ Reissner plate theory and a mixed interpolation. International Journal for Numerical Methods in Engineering, 21:367–383, 1985. 322. K.J. Bathe and E.N. Dvorkin. A formulation of general shell elements - The use mixed interpolation of tensorial components. International Journal for Numerical Methods in Engineering, 22:697–722, 1986. 323. M.L. Bucalem and K.J. Bathe. Higher-order MITC general shell elements. International Journal for Numerical Methods in Engineering, 36:3729–3754, 1993. 324. R.D. Cook, Malkus D.S., and M.E. Plesha. Concept and Applications of the Finite Elements Method. John Wiley & Sons, New York, 1989.
References
719
325. M.A. Crisſeld and G. Jeleni´c. Objectivity of strain measures in the geometrically exact three-dimensional beam theory and its ſnite-element implementation. Proceedings of the Royal Society, London: Mathematical, Physical and Engineering Sciences, 455(1983):1125–1147, 1999. 326. G. Jeleni´c and M.A. Crisſeld. Geometrically exact 3D beam theory: Implementation of a strain-invariant ſnite element for static and dynamics. Computer Methods in Applied Mechanics and Engineering, 171:141–171, 1999. 327. A. Ibrahimbegovi´c, F. Frey, and I. Kozar. Computational aspects of vector-like parameterization of three-dimensional ſnite rotations. International Journal for Numerical Methods in Engineering, 38(21):3653–3673, 1995. 328. A. Cardona and M. G´eradin. A beam ſnite element non-linear theory with ſnite rotation. International Journal for Numerical Methods in Engineering, 26:2403–2438, 1988. 329. M.A. Crisſeld. A consistent co-rotational formulation for non-linear, three-dimensional beam-elements. Computer Methods in Applied Mechanics and Engineering, 81:131– 150, 1990. 330. P. Betsch and P. Steinmann. Frame-indifferent beam element based upon the geometrically exact beam theory. International Journal for Numerical Methods in Engineering, 54:1775–1788, 2002. 331. I. Romero and F. Armero. An objective ſnite element approximation of the kinematics of geometrically exact rods and its use in the formulation of an energymomentum conserving scheme in dynamics. International Journal for Numerical Methods in Engineering, 54:1683–1716, 2002. 332. I. Romero. The interpolation of rotations and its application to ſnite element models of geometrically exact rods. Computational Mechanics, 34(2):121–133, 2004. 333. A. Ibrahimbegovi´c and R.L. Taylor. On the role of frame-invariance in structural mechanics models at ſnite rotations. Computer Methods in Applied Mechanics and Engineering, 191:5159–5176, 2002. 334. A.A. Shabana. A computer implementation of the absolute nodal coordinate formulation for ƀexible multibody dynamics. Nonlinear Dynamics, 16(3):293–306, 1998. 335. T.J.R. Hughes. Stability, convergence, and growth and decay of energy of the average acceleration method in nonlinear structural dynamics. Computers & Structures, 6:313– 324, 1976. 336. C. Farhat, L. Crivelli, and M. G´eradin. Implicit time integration of a class of constrained hybrid formulations - Part I: Spectral stability theory. Computer Methods in Applied Mechanics and Engineering, 125:71–107, 1995. 337. C. Lunk and B. Simeon. Solving constrained mechanical systems by the family of Newmark and α-methods. Zeitschrift f¨ur angewandte Mathematik und Mechanik, 86(10):772–784, 2006. 338. L.O. Jay and D. Negrut. Extensions of the HHT-α method to differential algebraic equations in mechanics. Electronic Transactions on Numerical Analysis, 26(1):190– 208, 2007. 339. D. Negrut, R. Rampalli, G. Ottarsson, and A. Sajdak. On an implementation of the HHT method in the context of index 3 differential-algebraic equations of multibody dynamics. Journal of Computational and Nonlinear Dynamics, 2(1):73–85, January 2007. 340. M. Arnold and O. Br¨uls. Convergence of the generalized-α scheme for constrained mechanical systems. Multibody System Dynamics, 18(2):185–202, 2007. 341. C. Johnson. Numerical Solutions of Partial Differential Equations by the Finite Element Method. Cambridge University Press, Cambridge, 1987.
720
References
342. C. Johnson, U. N¨avert, and V. Pitk¨aranta. Finite element methods for linear hyperbolic problems. Computer Methods in Applied Mechanics and Engineering, 45:285– 312, 1984. 343. C. Johnson and V. Szepessy. Convergence of a ſnite element method for a nonlinear hyperbolic conservation law. Technical Report 1985-25, Mathematics Department, Chalmers University of Technology, G¨oteborg, Sweden, 1985. 344. T.R.J. Hughes and M. Hulbert. Space-time ſnite element formulations for elastodynamics: Formulation and error estimates. Computer Methods in Applied Mechanics and Engineering, 66:339–363, 1988. 345. M. Hulbert. Space-Time Finite Element Methods for Second-Order Hyperbolic Equations. PhD thesis, Stanford University, 1989. 346. O.A. Bauchau and N.J. Theron. Energy decaying scheme for non-linear beam models. Computer Methods in Applied Mechanics and Engineering, 134:37–56, 1996. 347. O.A. Bauchau and T. Joo. Computational schemes for nonlinear elasto-dynamics. International Journal for Numerical Methods in Engineering, 45:693–719, 1999. 348. O.A. Bauchau, J.Y. Choi, and C.L. Bottasso. On the modeling of shells in multibody dynamics. Multibody System Dynamics, 8(4):459–489, 2002. 349. C.L. Bottasso and M. Borri. Integrating ſnite rotations. Computer Methods in Applied Mechanics and Engineering, 164:307–331, 1998. 350. M. Borri, C.L. Bottasso, and L. Trainelli. A novel momentum-preserving energydecaying algorithm for ſnite element multibody procedures. In Jorge Ambr´osio and Michal Kleiber, editors, Proceedings of Computational Aspects of Nonlinear Structural Systems with Large Rigid Body Motion, NATO Advanced Research Workshop, Pultusk, Poland, July 2-7, pages 549–568, 2000. 351. M. Abramowitz and I.A. Stegun. Handbook of Mathematical Functions. Dover Publications, Inc., New York, 1964.
Index
Absolute acceleration, 58 Absolute system of units, 60 Absolute velocity, 58 Acceleration absolute, 58 Coriolis, 168 inertial, 58 relative, 168 Acceleration level constraint, 364, 427, 438, 442, 445, 446 Acceleration projection method, 442 Active column solver, 486, 657 Addition theorem, 135–136 Admissible momentum ſeld, 581 stress ſeld, 581 Algebraic variable, 498 Ampliſcation matrix, 667 Angular acceleration vector, 137 Angular distortion, 582, 592 Angular momentum, 75, 97, 202, 312 Angular velocity vector, 129–133 Augmented Lagrangian formulation, 477, 499 Augmented Lagrangian term, 493 Axial vector, 23 Base vector, 589 3D space, 50 derivatives, 44, 52 of a surface, 41 Baumgarte’s method, 474–476 Bi-quaternion, 546
algebra, 546–547 Bilateral contact, 87 Bound vector, 13 Boundary conditions displacement, 582 force, 581 geometric, 582 natural, 581 Bushing element, 570 Calculus of variations, 284 Cam-follower pair, 179 Candidate contact point, 368 Canonical basis, 118 Cartesian basis, 6 Cartesian coordinates, 31 Cartesian rotation vector, 531, 537–538 Cauchy-Green deformation tensor, 593 Cayley’s formula, 514 motion parameters, 544–545 rotation parameters, 513–514 Cayley-Gibbs-Rodrigues motion parameters, 563–564 rotation parameters, 532, 538–539 Center of mass, 95 Central force, 66 Change of basis, 116, 193, 194 frame, 155, 193, 194 Characteristic equation, 24 Characteristic exponent, 338 Compatible
O. A. Bauchau, Flexible Multibody Dynamics, DOI 10.1007/978-94-007-0335-3 © Springer Science+Business Media B.V. 2011
722
Index
strain ſeld, 582 velocity ſeld, 582 virtual strain ſeld, 585 virtual velocity ſeld, 585 Component mode synthesis, 484–485 Components of a vector, 6 Composition of rotations, 124–127, 518, 523, 530 Conſguration constraint, 362 Conſguration space, 258 Conformal rotation vector, 539 Conservative force, 62–65 Constrained problem, 359 Constraint acceleration level, 364, 427, 438, 442, 445, 446 conſguration, 362 holonomic, 362–364, 392–394, 427 kinematic, 362 matrix, 357, 363, 426 nonholonomic, 364–365, 402, 403, 427 rheonomic, 363, 392, 402 scleronomic, 362, 363, 392, 402 velocity level, 364, 427, 428, 446 violation, 432, 441, 444, 446 Continuous friction law, 89 Contravariant component, 13 Convected basis, 588 frame, 599 Coordinates Cartesian, 31 curvilinear, 32 cylindrical, 54 orthogonal curvilinear, 53 path, 39 spherical, 55 surface, 49, 369 Coriolis acceleration, 168 Coulomb’s friction law, 88 Covariant component, 13 Cross product, 7 Curvature geodesic, 42 normal, 42 tensor, 143–145 vector, 143 Curve binormal vector, 33
curvature, 33, 143 Frenet’s triad, 33 intrinsic parameterization, 32 natural parameterization, 32 normal vector, 33 osculating plane, 33 planar, 34 radius of curvature, 33 radius of twist, 33 tangent vector, 32 twist, 33, 143 Curve sliding joint, 419 Curvilinear coordinates, 32 Cylindrical coordinates, 54 Cylindrical joint, 415 D’Alembert’s principle, 295–303, 385 Dashpot constant, 70 rectilinear, 70 torsional, 70 Deformation gradient tensor, 592 Deformed conſguration, 589 Degree of freedom, 259, 284, 362 Determinant of tensor, 22 Determination of Euler angles, 110 Euler parameters, 518 the rotation parameter vector, 529 Differential displacement vector, 263 motion vector, 196 position vector, 263 rotation vector, 148 work, 61 Differential-algebraic equation, 358, 395, 427 Direction cosine matrix, 107, 108 Director, 157, 632 Displacement boundary conditions, 582 inſnitesimal, 282 interpolation matrix, 647 virtual, 282 Displacement interpolation matrix, 641 Dissipative force, 70 Dot product, 5, 6 Drift phenomenon, 432, 441, 444, 446, 463, 473
Index Dynamic equilibrium, 295, 323 Eigenpair, 24 Eigenvalue, 23 Eigenvector, 23 Elastic material, 586 Elastodynamics, 579–583 constitutive laws, 583 dynamic equilibrium equations, 581 strain-displacement relationships, 582 velocity-displacement relationships, 582 Energetically conjugate, 198 Energy kinetic, 62, 204 strain, 64, 68, 69 Energy closure equation, 71 Equilibrium dynamic, 295 static, 59, 295 Euler motion parameters, 547–551 Parameters, 516–519 Euler angle, 109, 111, 112, 125, 136–141 attitude, 112, 139 bank, 112, 139 heading, 112, 139 nutation, 109, 111, 138, 139 precession, 109, 111, 138, 139 spin, 109, 111, 138, 139 Euler’s equations, 213, 227 ſrst law, 98, 287 second law, 99, 287 theorem on rotations, 112 Euler-Lagrange equation, 257 Euler-Rodrigues motion parameters, 563 rotation parameters, 532, 538 Exponential map of rotation, 531, 537–538 Extended vectorial parameterization, 533–537 External force, 94, 279, 285, 287 Externally applied force, 58, 61 Finite element method, 639 First-order tensor, 117 Flexible joint, 570, 601–613 Floating frame of reference, 482–484, 569 Force
central, 66 conservative, 62–65 dissipative, 70 external, 94, 279, 285, 287 externally applied, 58, 61 impressed, 61, 94 impulse of, 75, 100 inertial, 295 internal, 94, 279, 285, 287 natural, 279 non-conservative, 65, 70 normal contact, 86, 87 tangential contact, 86, 87 vector, 13 viscous, 70 Force boundary conditions, 581 Frame inertial, 57 Free vector, 3 Frenet’s triad, 143, 148 Friction coefſcient kinetic, 88 static, 88 viscous, 89 Gauss’ formula, 46 Gauss’ principle, 457 Gauss-Codazzi conditions, 145, 146 Gauss-Legendre quadrature, 699–701 points, 699 weights, 699 Gaussian, 457 Generalized constraint force, 393, 403 coordinate, 258 coordinates, 287, 308 force, 268, 287 inertial force, 323 momentum, 308, 323 speed, 428 velocities, 260 Generating function, 526, 531 Geometric boundary conditions, 582 Geometric nonlinearity, 661 Geometric notation, 29 Geometrically exact beam theory, 617 Gravitational constant, 61 Green deformation tensor, 593 Green-Lagrange strain tensor, 593
723
724
Index
Green-Saint Venant strain tensor, 593 Gyroscopic moments, 227 Hamilton’s principle, 305–320, 392–394, 402, 403, 586–588 Holonomic constraint, 362–364, 392–394, 427 system, 362 Hooke’s law, 583 Identity tensor, 23 Impressed force, 61, 94 Impulse of a force, 75, 100 Independent quasi-acceleration, 460 Independent quasi-velocity, 428 Index notation, 29 Index-1 formulation, 438 Inertial acceleration, 58 force, 295 frame, 14, 57 velocity, 58 Infeasible direction, 277, 373 Inſnitesimal displacements, 282 rotation vector, 526 Initial boundary value problem, 306 Integrability conditions, 364 Internal force, 94, 279, 285, 287 Intrinsic displacement of a rigid body, 163 equations of motion, 314, 334 Invariant of a tensor, 24 parameterization, 511 Jacobian, 50 Jacobian matrix of the constraints, 357, 363 Joint curve sliding, 419 cylindrical, 415 planar, 416 prismatic, 414 revolute, 412 screw, 416 sliding, 421 spherical, 417 universal, 418 Kinematic
characteristic, 428 constraint, 362 constraints, 259 parameter, 428 Kinematically admissible direction, 277, 373 admissible displacement ſeld, 582 admissible virtual displacements, 263, 277, 374 inadmissible direction, 277, 373 Kinetic energy, 62, 204 energy density function, 586 rotational energy, 204 translational energy, 204 Kronecker’s symbol, 6 Lagrange’s equations of the ſrst kind, 426 formulation, 322–334, 394, 403, 426 multiplier, 359 multiplier method, 358–361, 385 Lagrangian, 307, 587 representation, 589 strain tensor, 593 Levi-Civita symbol, 8 Linear momentum, 74, 97, 312 Linear parameterization, 531 Lower pair joint, 405–418 constraints, 408–412 kinematics, 406–408 Maggi’s formulation, 428–435 Mass matrix, 311, 426 Material basis, 588 compliance matrix, 583 coordinates, 589 elastic, 586 frame, 599 line, 589 stiffness matrix, 583 Material nonlinearity, 661 Matrix notation, 29 Metric of a space, 450 Metric tensor, 591 Mixed product, 10 Modal analysis, 572
Index expansion, 572 Momentum angular, 75, 97, 202, 312 linear, 74, 97, 312 Moore-Penrose generalized inverse, 698 Motion parameters Cayley’s, 544–545 Cayley-Gibbs-Rodrigues, 563–564 Euler, 547–551 Euler-Rodrigues, 563 vector, 544, 545, 555 Wiener-Milenkovi´c, 564 Motion tensor, 187–192 Mozzi-Chasles’ axis, 163, 189 theorem, 163 Natural boundary conditions, 581 Natural force, 279 Newton’s ſrst law, 58 second law, 59 third law, 59 NewtonRaphson method, 658 Non-conservative force, 65, 70 Non-invariant parameterization, 511 Non-vectorial parameterization, 511 Nonholonomic constraint, 364–365, 402, 403, 427 quantity, 268 vector, 130 Normal contact force, 86, 87 Notation geometric, 29 index, 29 matrix, 29 Null space, 429 formulation, 442 Ordinary differential equation, 395 Oriented line segment, 3 Orthogonal complement, 429 curvilinear coordinates, 53 parameterization, 51 projection, 448 vectors, 5 Orthonormal basis, 6 Out-of-balance force array, 658
725
Parallel axis theorem, 205–206 Parameterization invariant, 511 non-invariant, 511 non-vectorial, 511 vectorial, 511 Particle, 57 path, 57 speed, 39, 58 Path coordinates, 39 Penalty method, 381 Permutation symbol, 8 Pfafſan form, 364 Physical strain component, 592 Pitch of a screw, 163 Pivot equation, 213, 228 Pivoting, 499, 500 Pl¨ucker coordinates, 15, 187 Planar joint, 416 motion, 227–237 rotation, 108 Positive-deſnite tensor, 23, 24 Potential deſnition, 63 function, 63, 66, 269 of a conservative force, 63, 269 of gravity force, 67 of the body forces, 587 of the constraint forces, 393 of the surface tractions, 587 Principal axes of inertia, 207 Principle of angular impulse and momentum, 76, 101 conservation of energy, 66 D’Alembert’s, 295–303, 385 Gauss, 458 Hamilton, 305–320, 392–394, 402, 403, 586–588 least action, 308 linear impulse and momentum, 75, 101 minimum total potential energy, 588 virtual work, 271–287, 371–382, 583–585 virtual work for a particle, 272 virtual work for a particle system, 284 work and energy, 62, 100, 214, 215 Prismatic joint, 414 Product cross, 7
726
Index
dot, 5, 6 mixed, 10 scalar, 5, 6 tensor, 9 vector, 7 Product of inertia, 206 Projection, 5 Projection tensor, 12 Projector image, 448, 450 kernel, 448, 450 null space, 448, 450 Quaternion, 514 algebra, 514–516 operators, 514 orthogonal, 516 scalar part, 514 unit, 516 vector part, 514 Radius of twist, 46 Rayleigh damping, 660 Reaction force, 580 Reciprocal vector, 13 Rectilinear dashpot, 70 spring, 68 Reference conſguration, 589 frame, 14 Reƀection tensor, 13, 27 Relative acceleration, 168 elongation, 592 velocity, 167 Relative elongation, 582 Rescaling operation, 534–537 Revolute joint, 412 Rheonomic constraint, 363, 392, 402 Right-hand basis, 8 Rodrigues’ rotation formula, 114 Rotation parameters Cayley’s, 513–514 Cayley-Gibbs-Rodrigues, 532, 538–539 Euler-Rodrigues, 532, 538 linear, 531 vector, 512, 524 Wiener-Milenkovi´c, 532, 539–541
Rotation tensor, 114 properties, 115 Rotational kinetic energy, 204 Rotationless formulation, 261 Scalar product, 5, 6 Scaling equations of motion, 490–504 factor, 493 Scleronomic constraint, 297, 362, 363, 392, 402 Screw axis, 165 joint, 416 motion, 163, 165 Second-order tensor, 22 Semi positive-deſnite tensor, 23 Shape function, 640 Similarity transformation, 25 Skew-symmetric part of tensor, 22 Skew-symmetric tensor, 22 Skyline solver, 499 Sliding joint, 421 Spectral radius, 667 Spherical coordinates, 55 joint, 417 Spring rectilinear, 68 stiffness constant, 68 stretch, 68 torsional, 69 un-stretched length, 68 Stability analysis, 336–344 Staggered stabilization technique, 476 State space, 260 Static equilibrium, 59, 295 Stationary point of a deſnite integral, 255 of a function, 254 Strain direct, 582 energy, 69 shear, 582 Strain energy density function, 586 for spring, 68 function, 68 Stress
Index direct, 581 shear, 581 Surface base vectors, 41 coordinates, 49, 369 dilatation, 595 equilibrium equations, 581 ſrst metric tensor, 41 Gaussian curvature, 44 line of curvature, 44, 629 mean curvature, 44 principal curvature, 44 principal radii of curvature, 44, 631 radius of twist, 45 second metric tensor, 42 traction, 580 Symbol Kronecker’s, 6 Levi-Civita, 8 permutation, 8 Symmetric part of tensor, 22 Symmetric tensor, 22 System of particles, 286 Tangent operator, 134, 138 Tangential contact force, 86, 87 Tensor Cauchy-Green deformation, 593 characteristic equation, 24 curvature, 143–145 deformation gradient, 592 determinant, 22 ſrst-order, 117 Green deformation, 593 Green-Lagrange strain, 593 Green-Saint Venant strain, 593 identity, 23 invariant, 24 Lagrangian strain, 593 metric, 591 of mass moments of inertia, 203 operation, 119 positive-deſnite, 23, 24 product, 9 projection, 12 reƀection, 13, 27 rotation, 114 second-order, 22 semi positive-deſnite, 23
727
skew-symmetric, 22 skew-symmetric part, 22 symmetric, 22 symmetric part, 22 trace, 22 Time integration schemes, 664–685 Torsional dashpot, 70 spring, 69 Total kinetic energy, 586 Total mechanical energy, 65 Total strain energy, 586 Trace of tensor, 22 Translational kinetic energy, 204 Unconstrained problem, 359 Unilateral contact, 87 Unit vector, 4 Universal constant of gravitation, 60 Universal joint, 418 Variation of the position vector, 263 Vector angular acceleration, 137 angular velocity, 129–133 axial, 23 bound, 13 component, 6 curvature, 143 differential rotation, 148 force, 13 free, 3 nonholonomic, 130 norm, 4 null, 4 orthogonal, 5 product, 7 reciprocal, 13 unit, 4 Vectorial parameterization, 511 of motion, 554–564 of rotation, 524–537 Velocity absolute, 58 inertial, 58 relative, 167 Velocity level constraint, 364, 427, 428, 446 Virtual
728
Index
displacements, 272, 282 rotation vector, 264 Virtual displacement dependent forces, 282 rigid bodies, 283 vector, 263 Virtual work, 268 external, 285 internal, 285 Viscous force, 70
Viscous friction law, 89 Volumetric strain, 596 Warping, 619, 629 Weingarten’s formula, 46 Wiener-Milenkovi´c motion parameters, 564 rotation parameters, 532, 539–541 Work differential, 61