From Waves in Complex Systems Dynamics of Generalized Continua 0
Tributes to Professor Yih-Hsing Pao on His 80th Birthday
Y|jWorld Scientific
From Waves in Complex Systems to Dynamics of Generalized Continua
Tributes to Professor Yih-Hsing Pao on His 80th Birthday
8090.9789814340717-tp.indd 1
5/4/11 9:29 AM
From Waves in Complex Systems to Dynamics of Generalized Continua Tributes to Professor Yih-Hsing Pao on His 80th Birthday
Editors
Kolumban Hutter (Federal Institute of Technology, Zurich, Switzerland) Tsung-Tsong Wu (National Taiwan University, Taiwan) Yi-Chung Shu (National Taiwan University, Taiwan) World Scientific NEW JERSEY
•
8090.9789814340717-tp.indd 2
LONDON
•
SINGAPORE
•
BEIJING
•
SHANGHAI
•
HONG KONG
•
TA I P E I
•
CHENNAI
5/4/11 9:29 AM
This page is intentionally left blank
Published by World Scientific Publishing Co. Pte. Ltd. 5 Toh Tuck Link, Singapore 596224 USA office: 27 Warren Street, Suite 401-402, Hackensack, NJ 07601 UK office: 57 Shelton Street, Covent Garden, London WC2H 9HE
British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library.
FROM WAVES IN COMPLEX SYSTEMS TO DYNAMICS OF GENERALIZED CONTINUA Tributes to Professor Yih-Hsing Pao on His 80th Birthday Copyright © 2011 by World Scientific Publishing Co. Pte. Ltd. All rights reserved. This book, or parts thereof, may not be reproduced in any form or by any means, electronic or mechanical, including photocopying, recording or any information storage and retrieval system now known or to be invented, without written permission from the Publisher.
For photocopying of material in this volume, please pay a copying fee through the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to photocopy is not required from the publisher.
ISBN-13 978-981-4340-71-7 ISBN-10 981-4340-71-5
Printed in Singapore.
Steven - From Waves in Complex.pmd
1
2/25/2011, 9:50 AM
PREFACE
This book has been assembled from contributions dedicated to Professor Yih-Hsing Pao on the occasion of His 80th birthday, held at the International Symposium on Engineering Mechanics on 21/22 May 2010 and organized by the Institute of Applied Mechanics, National Taiwan University, Taipei. It concentrates on reviews and new developments in the fields of waves in complex systems and dynamics of generalized continua, the two classes of subjects, embracing the more than 50 years of active research to which Professor Pao dedicated his scientific efforts in the field of Applied Mechanics. His scientific achievements and his leading role played as a University teacher and internationally as an educator and researcher are reviewed in the preliminary items of this book in the Laudatio. The fifteen contributions, which are collected in this book are divided into four groups typified as -
Waves, Numerical Methods and Time Series Analyses, Continuum Mechanical Theories, and Wind Energy Techniques.
Waves are analyzed in the context of Lamb waves in phononic band gap structures and as early time responses in trusses and frames, using the reverberation-ray matrix technique. Wave theory is also applied to surface wave nonlinearities for fatigued steel and as acousto-elastic Lamb waves in implications to structural health monitoring. Time series analyses are treated by an adaptive data analysis method via a novel so-called Hilbert-Huang Transformation (HHT), which is particularly apt in the characterization of non-stationary statistical processes in a great variety of data through measured digital quantities. v
vi
Preface
In the subject of numerical mathematics, computational fluid dynamics is illustrated with the so-called Unified Coordinates (UC); these are generalizations of the Euler-Lagrange Algorithms (ELA), by which initial boundary value problems can be more effectively solved than with the classical ELAs, since the UCs are optimally and automatically generating boundary adjusted meshes. Moreover, in the fields of seismology and exploration geophysics, imperfectly partitioned ambient wave fields allow retrieval of the Green’s function and, thus, yield exact determination of travel times, attenuations, specific intensities and scattering strengths in complex wave fields. A contribution, which applies optimization techniques, shows their use in structural elements, which minimize their size on the basis of periodic micro-structural elements, and inverse techniques are applied to problems in optical sciences. Methods of advanced continuum-thermo-mechanics are applied to such complex systems as lava flows, in which the aggregation states arise as hot fluids, temperate solid-fluid mixtures at the solidification temperature and as cold solids separated by phase change surfaces and nourished by mass transports across these surfaces, as ternary bio-fluid mixtures and as acoustic and electromagnetic meta-materials, and in electro-magneto-mechanical coupled fields with various postulations of electro-magnetic interaction forces, couples and stress tensors. Wind energy is, finally, tackled by studying, how nonlinear vibrowind energy generation competes with its rotational counterpart and how off-shore wind energy can be produced below the costs of coal electricity. The articles either review a topic or present new developments; but both appeal to experts as well as researchers and practitioners alike. The editors express their heartfelt thanks to the distinguished international team of contributors whose scientific efforts unite to form this book. They also express thanks to Professor C. C Chang, chairperson of the Institute of Applied Mechanics at National Taiwan University and his staff, in particular Chi-Wei Liu for organizing the conference and to Fong Kiaw Wong at World Scientific Publishing Company for the assistance in editing this book. Zurich, Switzerland and Taipei, Taiwan, 22. May 2010 K. Hutter, T. T. Wu and Y. C. Shu
CONTENTS
Preface
v
Contributors
ix
Group Photo
xi
Laudatio for Professor Yih-Hsing Pao on the Occasion of the International Symposium on Engineering Mechanics 2010, on the Occasion of His 80th Birthday (21/22 May 2010)
xiii
Waves 1. Lamb Waves in Phononic Band Gap Structures (by T. T. Wu)
1
2. On Generalization of the Phase Relations in the Method of Reverberation-Ray Matrix (by W. Q. Chen)
49
3. Surface-Wave Nonlinearity Measured with EMAT for Fatigued Steels (by M. Hirao)
75
4. Acoustoelastic Lamb Waves and Implications for Structural Health Monitoring (by J. E. Michaels)
91
5. Source Synthesis for Inverse Problems in Wave Propagation (by W. W. Symes)
119
Numerical Mathematics / Time Series Analysis 6. An Introduction to an Adaptive Data Analysis Method (by N. E. Huang)
137
7. Computational Fluid Dynamics Based on the Unified Coordinates — A Brief Review (by W. H. Hui)
159
vii
viii
Contents
8. Towards Green’s Function Retrieval from Imperfectly Partitioned Ambient Wave Fields: Travel Times, Attenuations, Specific Intensities, and Scattering Strengths (by R. L. Weaver)
183
9. Study on Two Scale Design Optimization for Structure and Material with Periodic Microstructure (by G. D. Cheng)
195
Continuum Mechanical Theories 10. A Continuum Formulation of Lava Flows - From Fluid Ejection to Solid Deposition (by K. Hutter)
219
11. Rigorous Mechanics and Elegant Mathematics on the Formulation of Constitutive Laws for Complex Materials: An Example from Biomechanics (by V. Mow)
285
12. Professor Pao’s Influence on Research in Coupled Field Problems, Chirality and Acoustic and Electromagnetic Metamaterials and their Applications (by V. V. Varadan)
307
13. Transient Response of an Elastic Half Space by a Moving Concentrated Torque (by C.-S. Yeh)
333
14. Magnetic Force Model for Magnetizable Elastic Body in the Magnetic Field (by Z. J. Zheng)
353
Wind Energy 15. Principles of Nonlinear Vibro-Wind Energy Conversion (by F. Moon)
385
CONTRIBUTORS
W. Q. Chen Zhejiang University, China
[email protected]
V. C. Mow Columbia University, U.S.A
[email protected]
G. D. Cheng Dalian University of Technology, China
[email protected]
W. W. Symes Rice University, U.S.A
[email protected]
M. Hirao The University of Tokyo, Japan
[email protected]
V. V. Varadan University of Arkansas, U.S.A
[email protected]
N. E. Huang National Central University, Taiwan
[email protected]
T. T. Wu National Taiwan University, Taiwan
[email protected]
K. Hutter Laboratory of Hydraulics, Hydrology and Glaciology, Federal Institute of Technology, Zurich, Switzerland
[email protected]
R. L. Weaver University of Illinois at Urbana-Champaign U.S.A
[email protected] C. S. Yeh National Taiwan University, Taiwan
[email protected]
W. H. Hui Hong Kong University of Science and Technology, Hong Kong
[email protected]
X. J. Zheng Lanzhou University, China
[email protected]
J. E. Michaels Georgia Institute of Technology, U.S.A.
[email protected] F. Moon Cornell University, U.S.A
[email protected] ix
This page is intentionally left blank
Conference participants at the entrance of the Institute of Applied Mechanics, National Taiwan University, Taipei, with the jubilee, Professor Y.-H. Pao (seated in the middle, front row)
This page is intentionally left blank
LAUDATIO FOR PROFESSOR YIH-HSING PAO ON THE OCCASION OF THE INTERNATIONAL SYMPOSIUM ON ENGINEERING MECHANICS 2010, ON THE OCCASION OF HIS 80TH BIRTHDAY (21/22 May 2010)
Kolumban Hutter Retired Professor of Mechanics, Darmstadt University of Technology, Germany NOW: c/o Laboratory of Hydraulics, Hydrology and Glaciology Swiss Federal Institute of Technology, Zurich, Switzerland E-mail:
[email protected]
Dear Professor Pao, dear Mrs Pao, Respected representatives of the National Taiwan University (NTU) and other Universities, of governmental and scientific agencies of Taiwan and abroad, Dear Academicians, dear scientists and Professors from Academies and Universities worldwide, Dear guests and friends of Professor Pao We have come together here to celebrate Professor Pao on the occasion of his 80th birthday – it was on 19th January 2010 – and to look back and contemplate about him as a personality, a scientist and simply as a human being. We have also come together to pay tribute to a superior, a colleague and a friend with whom some of us have had extensive work interactions, scientific discussions, disagreements, long working sessions in seminar rooms, intellectually, always as equals, but hard and subject oriented and not person oriented, focused on every detail until xiii
xiv
Laudatio for Professor Y.-H. Pao
intellectual agreement was reached. When experiencing this for the first time in 1969 in seminars together with the late Steve Thau, this was for me, the young Swiss student a unique experience for which I first had to take breath to understand that objective scientific arguments could be separated from professional seniority or superiority. At that time, this was not at all so in Swiss academic circles, but I learned quickly! This, however, does not mean that a clear superior-subordinate relationship would not have been respected. I addressed him as ‘Professor Pao’ and he responded as ‘Yes Kolumban!’, and this is essentially still so now. However, the creation of absolute unquestioned equality of scientific arguing was my first experience of an attitude of Professor Pao, which I had never experienced before and am still sometimes missing in Europe today. There is a further early incidence in my academic education which demonstrated Professor Pao’s generosity. During the first summer break as a student in the Department of Theoretical and Applied Mechanics at Cornell University, I had a research assistantship with the duty to develop finite difference software on elastic waves in a layered half plane as a subject of my Master of Science thesis. This work dragged on in the following semester with no success. In that fall semester 1969 Professors Pao and Rand had organized a reading seminar on Cole’s new book on ‘Perturbation Theory’. One chapter dealt with pre-stressed cables with small bending rigidity. We all know the result: Bending effects are strong in the vicinity of built-in supports due to the small flexural stiffness associated with the highest derivative arising in the differential equation, a problem of matched asymptotics. This problem hit me immediately with high heart beating. A year or so before in Switzerland, I was involved with the construction of a concrete bridge with box cross section and skew supports. We had computed the torsional response by use of the Saint Venant theory, but measurements of the stress distribution in the supports of the realized bridge revealed large deviations from the computed twisting moment along the bridge axis, whilst agreement distant from the supports was good. My heart beating in the seminar room at Cornell’s Thurston Hall was caused by my immediate recognition that warping torsion due to the cantilever pedestrian paths on each side of the box cross section of the bridge was
Laudatio for Professor Y.-H. Pao
xv
ignored; it corresponded to the bending effects of the pre-stressed cable in Cole’s book, whilst Saint Venant torsion was the analogue of the tension-string effects in the cable. I went home that Friday in fall 1969, worked day and night through the weekend and presented Professor Pao a drafted manuscript on Monday, asking – but being very scared – whether he would accept this as a basis for a Master’s Thesis. He did; the Master’s Thesis is from 1970 and the work became my first paper in the Int. J. Solids and Structures in 1971. I was scared because I had failed with the layered media problem and weigh Professor Pao’s decision at that time as generosity, because he measured the success and forgot the failing precursor. My experience in Switzerland years before was the opposite. I have above illustrated objective professional judgment and generosity as two outstanding features which Professor Pao employs in reaching optimal scientific advancement. Endurance in pursuing a chosen approach in reaching intended goals and vision for the longer range development into the future of the field of Applied Mechanics are yet other ones. He also believes that first rank achievements, such as the foundation of the Institute of Applied Mechanics at National Taiwan University (NTU), which he founded single-handed and which is based on his long range vision and firm belief: Namely, that Applied Mechanics has important developments to offer to future technology both in education and industrial development of Taiwan on a global scale. He did this contrary to today’s army of deans and presidents of Engineering Schools and Universities who closed Departments of Theoretical and Applied Mechanics or Departments of General Engineering Sciences. Such educational units disappeared by and large on the entire Globe. In Germany, the Department of Mechanics at Darmstadt University and the Department of Physical Engineering Sciences at the Technical University in Berlin have been closed. The professors were distributed to more professional engineering departments, and at their retirement were not or have been differently replaced by representatives of professional specialties. In the United Kingdom the Department of Theoretical and Applied Mechanics at Nottingham has been closed, as have the Departments of Theoretical and Applied Mechanics at Cornell University and the University of Illinois at
xvi
Laudatio for Professor Y.-H. Pao
Urbana-Champaign in the US. In Darmstadt, four Professors now teach statics, dynamics and strength of materials to about 3000 students. (When I was there, we were first 12 and then 8 professors.) There is hardly any time left for specialized courses to form accordingly specialized pupils. Professor Pao’s ‘endurance and vision’ during the foundation period of the Institute of Applied Mechanics at NTU have been very efficient supports for the creation of the institute, and the Institute of Applied Mechanics has already and may further prove to be very valuable for forefront activity not only in the industrial development of Taiwan. What are Professor Pao’s research achievements and how did he pursue research to get where he is now? There seems to be one main theme. This is ‘Waves in complex continuous systems’. This theme is embellished with variations on ‘Dynamics of generalized continua’. Waves became already the main focus when our jubilee started as a Ph. D. student under the supervision of Professor Raymond D. Mindlin, the American born Russian professor of civil engineering at Columbia University, who is most famous for his work on the derivation of the dynamical plate equations by employing a variational formulation (the Principle of Weighted Residuals) and using Cauchy series expansion of the displacement field in the direction perpendicular to the plate reference plane. He is not only known for his applied work on vibrations of elastic plates, but has also done fundamental work in early Cosserat theories, in vibrations of piezoelectric plates, in which polarization gradients were introduced (for the first time?) to describe deformation-induced birefringence. The graduate student Yih-Hsing Pao was exposed to this environment of fundamental applied physics, in which plates were two-dimensional continua with Cosserat structure, rather than just elements of structural engineering, which could also be loaded by electromagnetic forces, which were subjected to electrical and magnetic dipoles, and when transparent and deformable, could change the polarization of the light. The point I want to make is, that the young engineer Y.-H. Pao wrote his first paper on ‘Dispersion of Flexural Waves in an Elastic, Circular
Laudatio for Professor Y.-H. Pao
xvii
Cylinder’ with his advisor, Professor Mindlin, on a classical subject of Applied Dynamics, being immersed in a non-classical work environment. He was exposed to a wealth of other subjects of Applied Physics, where expert knowledge of engineers could be used with tremendous advantage. It is quite natural to look at Professor Pao’s early work in the field of scattering of elastic waves from this point of view. Flexural waves in circular cylinders are the beam or shaft analogue of plates, and circular cylinders are bounded by the simplest quadratic surface. Scattering of elastic waves by spherical and elliptical obstacles and parabolic cylinders involves more complex quadratic surfaces and makes a fantastic playground to study the mathematics of spherical harmonics, Mathieu and parabolic cylinder functions. This was work done with C. C. Mow (1963) and S. A. Thau (1966/67). Also in the sixties, work with K. F. Graff on the influence of couple-stresses on wave reflection and scattering by a spherical cavity (1967) shows Mindlin’s tangible influence on dynamics of generalized continua. However, elastodynamics with waves, vibrations, focusing on scattering, diffraction, acoustoelasticity, residual stresses, layered media, generalized ray theory, hyperbolic heat conduction equation, ultrasonic waves and non-destructive testing of materials continued to be Professor Pao’s main research activities. Collaborators through the years on these subjects were, among others, R. Gayewski, W. Sachse, V. K. and V. Varadan, F. Ziegler, U. Gamer, T. T. Wu, M. Hirao and Y. S. Wang. Most of these subjects were initiated and formulated in their elements already in the late sixties and early seventies. I participated in 1970-1972 with Professors W. Sachse and J. T. Jenkins in long work-seminars on ultrasonic waves and nondestructive testing. In the summer of 1971 Professors Jenkins, Pao and Sachse and two graduate students, D. K. Banerjee and myself also wrestled with the formulation of a thermodynamic theory of solids close to absolute zero. A hyperbolic heat equation was a necessity, and it turned out that a linearized theory for the coldness – the inverse of the absolute temperature (not the temperature itself) – was the optimal variable for a linearized hyperbolic heat conduction equation. The work merged into D. K. Banerjee’s Ph. D dissertation and two papers on thermoelastic waves in dielectric and anisotropic solids. (As a side
xviii
Laudatio for Professor Y.-H. Pao
remark: Dr. I. Luca from Bucharest, Romania who was 2.5 years at Academia Sinica from 2007 to 2009 and worked there on debris flow, continued that work in the 80s as her Ph. D. dissertation under the supervision of Prof Soos in the Mathematics Department of the University of Bucharest). Waves in plates became the center of activity again in the late seventies and eighties in work with R. L. Weaver and W. J. Parzygnat as waves emerging from point sources and resonating in non-linear vibrations, as propagations of elastic pulses and as inverse source problems with F. Santosa, W. W. Symes, A. N. Ceranoglu and J. E. Michaels. In my opinion, Professor Pao’s activity truly deserving the qualification ‘waves in complex systems’ is work on waves in trusses and frames, done in the late nineties to present. The problem of steady vibration of trusses and frames as linear elastic systems is today a standard linear eigenvalue problem. Much more difficult is the analysis of wave propagation in the transient regime (the early time behavior). This research led to the method of reverberation-ray matrices, an alternative to the method of transfer matrices, in which a wave arriving at a nodal point of an element of the truss or frame, is distributed among the elements at the nodal point and then propagated in each element to the neighboring nodal points. Work on this subject has been done with D. C. Keh, S. M. Howard, X. Y. Su, J. Y. Tian and W.Q. Chen and is presently on an advanced level to reach commercial exploitation. Back to the sixties and early seventies, when a brilliant student, F. C. Moon, performed experiments on buckling of magnetoelastic cantilever plates! The early work of F. C. Moon and Professor Pao was also on waves, but the paper ‘Magnetoelastic Buckling of a Thin Plate’ (1968) was the trigger of a whole surge of Ph. D. dissertations and research articles, and this work is still under way. A fascination of the problem was that electromagnetic forces could generate a quasistatic mechanical instability; the disappointment was that the buckling load predicted by the theory was far off from its measured counterpart in the experiment. The second paper by F. C. Moon and Professor Pao on the subject ‘Vibration and Dynamic Instability of a Beam-Plate in a Transverse Magnetic Field’ did not fully rectify the discrepancy. A thorough study of the foundations of electro-magneto-mechanics was needed, a theory of
Laudatio for Professor Y.-H. Pao
xix
Electrodynamics of Deformable Continua with the flexibility that an arbitrary class of constitutive behavior could be postulated and accordingly reduced to be in conformity with the second law of thermodynamics. A beautiful book by the MIT Professors P. Penfield and H. A. Haus ‘Electrodynamics of Moving Media’ (1967) led us to the complexities of the electric and magnetic force postulates. They were not the same, depending upon, whether magnetization was constructed by a dipole of magnetic monopoles or an electric current in a closed loop. Another beautiful book by W.F. Brown Jr. ‘Magnetolelastic Interactions’ (1966) showed us that the separation of electromagnetic short and long range effects could not be established in a unique way. This result says that the electromagnetic body forces (and couples) are not unique and therefore neither are the stresses. This also means that the definition of a magnetoelastic material is subtle. Moreover, since in an elastic material the stress tensor is given by the derivative of the Helmholtz free energy with respect to the strain tensor, two different formulations of magnetoelasticity have only a chance of furnishing the same solution for observables (e.g., the displacement distribution), if the free energies in the two formulations are related to one another in the correct manner. This discussion shows that the answer to the question whether a certain electromagnetic force postulate is correct, is at last a problem of thermodynamics. Moreover, its discussion in isolation without corresponding stress postulates is futile. The papers by Pao and Yeh ‘A Linear Theory of Soft Ferromagnetic Elastic Solids’ (1973) and of Pao and Hutter ‘Electrodynamics for Moving Elastic Solids and Viscous Fluids’ (1974) are concerned with a proper embedding of the thermomechanical and electrodynamic equations in an entropy principle plus a systematic derivation of magnetic body forces and body couples from a dipole model. This latter topic was subsequently deepened by Professor Pao in his famous 1978-article ‘Electromagnetic Forces in Deformable Continua’, which appeared in Mechanics Today. This work has been continued in the last 32 years by many researchers; the final answer is still not given. The problem with the magnetic force models is reviewed in this book by X. Zheng and K. Jin, however, without touching the question of thermodynamic consistency. It is my belief that disagreements between theoretical results and experimental findings
xx
Laudatio for Professor Y.-H. Pao
which still exist (see X. Zheng & K. Jin, this volume) need a thermodynamic setting to be resolved. The book by K. Hutter, A.A.F. van de Ven ‘Field Matter Interaction in Thermoelastic Solids’ (1978), in the second edition by K. Hutter, A.A.F. van de Ven and A. Ursescu as ‘Field Matter Interaction in Thermoelastic Solids and Viscous Fluids’, Springer Verlag (2006) provides a hint. Let us finish this review with yet another hobby horse of Professor Pao’s scientific activities: Variational Principles of Thermomechanics of Continuous Systems. [Professor Pao has long fought to get his work with L. S. Wang published, now, as I am told, in press in the Int. J. of Engr. Sci.] Publishing novel ideas in this topic is almost certainly bound to be condemned to fail. Why is this so? In my opinion, it probably goes back to several centuries of competing relations between the French and the Britons. On an intellectual level the Britons (and today also the Americans) take forces as the fundamental entities to rein the physical world. Geometry is simply the space in which forces live. The French take work or better the power of working as the fundamental entity. For them, the two elements ‘force’ and ‘trajectory’, which I now identify with ‘geometry’, are equal partners. Geometry reins our physical world as much as do forces. This is d’Alembert’s Principle: force and geometry are combined as equal partners to define the work which reins the world. Note, after all, the French beheaded the king and founded their democracy. The Britons still have a Queen, and a further King certainly in less than 20 years. Professor Pao might want to be a king, but his teacher, Professor Mindlin, showed him, how two-dimensional plate equations are obtained from three-dimensional theory. The vehicle to do this is the Principle of Weighted Residuals, essentially nothing else than the power of working, equality between forces (the momentum equations) and geometry (variations of the dual geometric field). It is a fairly trivial step to extend this to other fields, e.g. the energy equation (~force) and variation of temperature (~geometry) and eventually any other quantity with dual deformation field (angular momentum, angular velocity, see the article of P. Germain in SIAM J. Appl. Math., 1973). This approach is simple and so natural that many scientists treat it as intellectually almost empty. There is no room to make a big fuss about
Laudatio for Professor Y.-H. Pao
xxi
such a natural thing, and this is taken by referees to claim that nothing is new in such a formulation and to reject such papers. The mechanics people in the western world (modulo the French) are “Britons” and will likely reject papers on such variational principles, claiming there is nothing to them. Perhaps you should formulate your variational principles in a setting of general relativity. (Beam theories in a covariant setting of the theory of general relativity do exist!) I am joking now, but if you do so, you might have a chance to get a theoretical physicist as a referee. Physicists know very well since Einstein that the curvature of the space is made by the mass distribution which determines the forces. Here, geometry and forces are true brother and sister. I nevertheless believe – and I now return to Professor Pao’s activities on the Principle of Virtual Power – that this is useful as an alternative to the classical balance law approach. However, success is more likely, if it is applied in the context of generalized continua. The role of the ‘forces’ is then played by the evolution equations of the subgrid field variables and the ‘geometry’ is described by dual fields, which need to be invented together with the mathematical properties of the operations which generate a scalar field, representing the ‘power of working’. It is obvious that from a viewpoint of inventing field equations including constitutive relations and boundary (jump) condition, it may be easier to guess scalars, rather than tensors. The apparent triviality of the method is then hidden behind the complexity of the physics. Mention should also be made about Professor Pao’s handling of his blindness, due to retinitis pigmentosa, the tunnel vision which started in the eighties and progressed to complete blindness in the late nineties. Apart from his negligence to learn Braillie, it is absolutely amazing and deserves our highest respect and admiration, how he keeps his spirit and intellectual level. This allows him not only to follow forefront research but also to inspire and take part in actual research activity. I have never seen him presenting a talk at a conference since complete blindness, but I am told that it is a well organized event to see how he guides the audience in a GPS manner through his densely filled transparencies, made by one of his aids. Of course, all of this is only possible, because the faculty and staff members of the Institute of Applied Mechanics at the National Taiwan University provide admirable support to an extent
xxii
Laudatio for Professor Y.-H. Pao
impossible in the USA and in Europe. They deserve my admiration and thank for the continued service which they offer. I suppose that similar support is also provided at Zhejiang University in Hangzhu, China. This is a laudatio on Professor Pao, which is rather personal. I do not apologize for it as my relation to him is personal. [I have, however, attached a formal curriculum vitae below]. I shall offer apologies to all whose work has not adequately been mentioned and who feel not to have received the proper credit. I had to be selective. The most important thing is to state that Professor Pao is a very unusual personality, generous, with usually objective judgment; his relations to others and to his and his coauthors work show endurance; he does not give up a topic readily but sticks with it almost indefinitely. He has given me a lot for which I am tremendously thankful and without which I would not be where I am now. I am sure, many of you who experienced support from him think alike. Let us wish him good health and many more years of active research in the years to come. Please join me in a standing ovation for him. Biography (From homepage, Department of Mechanical Engineering, Hong Kong University of Science and Technology, with changes and extensions) Professor Yih-Hsing Pao was born in 1930 at Nanjing, China, and grew up during the Second World War in the Chongqing area; he went to the National Second High School for five years on a government scholarship. He attended the National Chao-Tung University (1947-49) and National Taiwan University (1950-52), receiving the Bachelor degree in Civil Engineering. In 1953, he was awarded a teaching assistantship to do graduate studies at Renssalear Polytechnic Institute, Troy, New York, USA, receiving the degree M. S. in Mechanics (1955). He continued the graduate study at Columbia University in the city of New York (1955-58), receiving the Ph. D. degree in Applied Mechanics (1959). Since 1958, he has been a faculty member at Cornell University, rising from the rank of Assistant Professor, Associate Professor, Professor, Joseph C. Ford Professor of Theoretical and Applied Mechanics, to J. C. Ford Professor Emeritus (2000-present). He served also as
Laudatio for Professor Y.-H. Pao
xxiii
Chairman of the Department of Theoretical and Applied Mechanics, Cornell University, from 1974 to 1980. By an Intergovernmental Loan Agreement, he visited the National Taiwan University, on leave of absence from Cornell, and founded the Institute of Applied Mechanics at NTU, and served as Professor and Director twice (1984-86, 1989-94). By then, his longtime illness of Retinitis had deteriorated to almost complete blindness. He resigned from the directorship in 1994 and retired from NTU in 1998 as Professor Emeritus. While at Cornell, he had also held visiting assignments at Princeton University and Stanford University in the USA; Technische Hochschule Darmstadt (now Darmstadt University of Technology); Hong Kong University of Science and Technology in China between one and nine months each. In 2003, he was appointed by Zhejiang University, Hangzhou, China, as a professor of the College of Civil Engineering and Architecture with tenure. At Cornell he had taught undergraduate engineering courses in Mechanics and Applied Mathematics in large classes and graduate courses in Theoretical and Applied Mechanics (TAM) every year, advised several hundreds of graduate students majoring in TAM, and specially supervised 17 Ph.D. students and 8 post doctoral fellows. Altogether there were 25 Ph.D. Fellows, after leaving Cornell, 16 of them remain teaching at well-known universities in America, Germany, Australia, Turkey, Austria, Japan, Mainland of China, and Taiwan and so on. From 1984 to1995 his work at NTU was mainly administration. After 1995, he resumed teaching and research in blindness through administrative assistants and graduate students doing reading and dictation in Chinese and English. In addition to teaching and research he was active in several professional societies, and was elected a fellow of the American Society of Mechanical Engineers, and the American Academy of Mechanics. He was the General Chairman and Scientific Committee Chairman of the ninth United States National Congress of Applied Mechanics held at Cornell University in 1982. At Taiwan, he was elected President of the Chinese Society of Theoretical and Applied Mechanics (1992-1995). In 1985, he was elected a member of the US National Academy of Engineering, the highest honor that can be bestowed on an engineer by the US government. In 1986, he was elected an Academician by
xxiv
Laudatio for Professor Y.-H. Pao
Academia Sinica (founded in 1928 at Nanjing, China), a lifetime honor in China. He received the senior US Scientist Award from the Humboldt Foundation, Germany, and visiting Darmstadt University of Technology in the spring 1989. In 1995, he was awarded an Honorary Doctoral Degree by the National Chiao-Tung University (Hsin-Chu). In 2003, he was one of the three recipients of the Presidential Science Prize of Taiwan. From 1960 to present he has authored and co-authored 118 technique papers as listed in the appendix. The first publication no. 6 is a monograph co-authored with Dr. Henry C. C. Mow which summaries research work on “Scattering-Diffraction of Elastic Waves and Dynamic Stress Concentration” done by Pao with his graduate students at Cornell and by Mow with his associates. The remaining papers numbered Nos. 1-118 may be grouped in five categories: 1. Elastodynamics (Dynamic theory of elasticity) and Mechanical Vibrations of Elastic Solid; 2. Interactions of Elastodynamics with Physical Acoustics, Heat Pulse, Geophysics (Seismic waves and Under-Water Acoustics), Ultrasonic Waves and Non-destructive Testing of Materials; 3. Field-Theory of MagnetoElectro Mechanics and Magneto Elastic Stability; 4. Elastodynamic Theory of Structures; 5. Variational Principles of Thermomechanics of a Continuous Medium, Unifying Theory of Continuum Mechanics with Thermodynamically Irreversible Constitutive Laws for the Medium. In category 4 research works were mainly done at NTU and ZJU after 1999, in category 5 research works on variational principles were done after 2000 with Profs. L. S. Wang and G. Q. Chen at NTU. The list of publications include more than ten comprehensive review articles (marked with *). The one paper ‘Magnetic forces in deformable continua’ in Mechanics Today, (Pergamon Press, 1978), was translated into a Chinese monograph Scientific Press (Beijing, 1996). Another article, ‘Generalized Ray Theory and Transient Responses of Layered Elastic Solids’ in Physical Acoustics (Academic Press, 1977) was chosen by IUTAM as one of the landmark papers in Mechanics of the 20th century. Acknowledgements: I thank Profs. T. T. Wu and C. C. Chang for reading an earlier version of this text and suggesting improvements.
Laudatio for Professor Y.-H. Pao
xxv
List of Publications – Professor Y.-H. Pao 1. Y.-H. Pao and R. D. Mindlin, 1960, “Dispersion of Flexural Waves in an Elastic, Circular Cylinder”, Journal of Applied Mechanics, v.27, pp.513-520. 2. M. Onoe and Y.-H. Pao, 1961, “Edge Mode of Thin Rectangular Plate of Barium Titanate”, Journal of the Acoustical Society of America, v.33, no.11, pp.1628. 3. Y.-H. Pao, 1962, “The Dispersion of Flexural Waves in an Elastic, Circular Cylinder — Part 2”, Journal of Applied Mechanics, v.29, pp.61-64. 4. Y.-H. Pao, 1962, “Dynamical Stress Concentration in an Elastic Plate”, Journal of Applied Mechanics, v.29, pp.299-305. 5. Y.-H. Pao and C. C. Mow, 1962, “Dynamic Stress Concentration in an Elastic Plate with Rigid Circular Inclusion”, Proceedings of the Fourth U.S. National Congress of Applied Mechanics, pp.335-345. 6. Y.-H. Pao and C. C. Mow, 1963, “Scattering of Plane Compressional Waves by a Spherical Obstacle”, Journal of Applied Physics, v.34, no.3, pp.493-499. 7. C. C. Chao and Y.-H. Pao, 1964, “On the Flexural Motions of Plates at the Cut-Off Frequency”, Journal of Applied Mechanics, v.31, pp.22-24. 8. Y.-H. Pao and C. C. Chao, 1964, “Diffractions of Flexural Waves by a Cavity in an Elastic Plate”, AIAA Journal, v.2, no.11, pp.2004-2010. 9. M. A. Medick and Y.-H. Pao, 1965, “Extensional Vibrations of Thin Rectangular Plates”, Journal of the Acoustical Society of America, v.37, no.1, pp.59-65. 10. Y.-H. Pao and C. Kowal, 1965, “A Laboratory for Teaching Mechanical Vibration”, Journal of Engineering Education, v.56, no.3, pp.96-100. 11. S. A. Thau and Y.-H. Pao, 1966, “Diffractions of Horizontal Shear Waves by a Parabolic Cylinder and Dynamic Stress Concentrations”, Journal of Applied Mechanics, v.33, pp.785-792. 12. F. C. Moon and Y.-H. Pao, 1967, “Interactions of Point Defects and Elastic Inclusions”, Journal of Applied Physics, v.38, no.2, pp.595-601. 13. S. A. Thau and Y.-H. Pao, 1967, “Stress-Intensification Near a Semi-Infinite Rigid-Smooth Strip Due to Diffraction of Elastic Waves”, Journal of Applied Mechanics, v.34, pp.119-126. 14. F. C. Moon and Y.-H. Pao, 1967, “The Influence of the Curvature of Spherical Waves on Dynamic Stress Concentration”, Journal of Applied Mechanics, v.34, pp.373-379. 15. S. A. Thau and Y.-H. Pao, 1967, “A Perturbation Method for Boundary Value Problems in Dynamic Elasticity”, Quarterly of Applied Mathematics, v.25, no.3, pp.243-260. 16. K. F. Graff and Y.-H. Pao, 1967, “The Effects of Couple-Stresses on the Propagation and Reflection of Plane Waves in an Elastic Half-Space”, Journal of Sound and Vibration, v.6, no.2, pp.217-229. 17. S. A. Thau and Y.-H. Pao, 1967, “Wave Function Expansions and Perturbation Method for the Diffraction of Elastic Waves by a Parabolic Cylinder”, Journal of Applied Mechanics, v.34, pp.915-920.
xxvi
Laudatio for Professor Y.-H. Pao
18. K. F. Graff and Y.-H. Pao, 1967, “The Effects of Couple-Stresses on the Scattering of Plane Waves by a Spherical Cavity”, Proceedings of the 10th Mid-Western Conference of Mechanics, pp.721-739, Fort Collins, Colorado, 1967. 19. F. C. Moon and Y.-H. Pao, 1968, “Magnetoelastic Buckling of a Thin Plate”, Journal of Applied Mechanics, v.35, pp.53-58. 20. F. C. Moon and Y.-H. Pao, 1969, “Vibration and Dynamic Instability of a BeamPlate in a Transverse Magnetic Field”, Journal of Applied Mechanics, v.36, pp.92-100. 21. S. A. Thau and Y.-H. Pao, 1970, “On the Derivation of Point Source Responses from Line Source Solutions”, International Journal of Engineering Science, v.8, pp.207-218. a. S. A. Thau and Y.-H. Pao, 1973, “Comments on Two- and Three-Dimensional Wave Propagation”, Bulletin of the Seismological Society of America, v.63, no.1, pp.329. 22. Y.-H. Pao and S. A. Thau, 1970, “A Perturbation Method for Boundary-Value Problems in Dynamic Elasticity, Part II”, Quarterly of Applied Mathematics, v.28, no.2, pp.191-204. 23. K. Hutter and Y.-H. Pao, 1971, “Regular and Singular Perturbation Solutions for Bending and Torsion of Beams”, International Journal of Solids and Structures, v.7, pp.1523-1537. 24. Y.-H. Pao, 1972, “Some Recent Developments in Elastic Waves in Solids”, Experimental Mechanics, v.12, pp.83-89. (Invited Review) 25. G. C. Kung and Y.-H. Pao, 1972, “Nonlinear Flexural Vibrations of a Clamped Circular Plate”, Journal of Applied Mechanics, v.39, pp.1050-1054. 26. Y.-H. Pao and C. S. Yeh, 1973, “A Linear Theory for Soft Ferromagnetic Elastic Solids”, International Journal of Engineering Science, v.11, pp.415-436. 27. Y.-H. Pao and D. K. Banerjee, 1973, “Thermal Pulses in Dielectric Crystals”, Letters in Applied and Engineering Sciences, v.1, no.1, pp.33-41. 28. M. A. Oien and Y.-H. Pao, 1973, “Scattering of Compressional Waves by a Rigid Spheroidal Inclusion”, Journal of Applied Mechanics, v.40, pp.1073-1077. 29. Y.-H. Pao and K. Hutter, 1973, “Magnetoelastic Waves in Soft Ferromagnetic Solids”, (Proceedings of the 10th Annual Conference of the Society of Engineering Science, North Carolina State University, Raleigh), Recent Advances in Engineering Science, v.6, pp.83-91. (Invited Lecture) 30. K. Hutter and Y.-H. Pao, 1974, “A Dynamic Theory for Magnetizable Elastic Solids with Thermal and Electrical Conduction”, Journal of Elasticity, v.4, no.2, pp.89-114. 31. *Y.-H. Pao and R. K. Kaul, 1974, “Waves and Vibrations in Isotropic and Anisotropic Plates”, in R. D. Mindlin and Applied Mechanics, pp.149-195, edited by G. Herrmann, Pergamon Press, New York, 1974. 32. Y.-H. Pao and W. Sachse, 1974, “Interpretation of Time Records and Power Spectra of Scattered Ultrasonic Pulses in Solids”, Journal of the Acoustical Society of America, v.56, no.5, pp.1478-1486.
Laudatio for Professor Y.-H. Pao
xxvii
33. D. K. Banerjee and Y.-H. Pao, 1974, “Thermoelastic Waves in Anisotropic Solids”, Journal of the Acoustical Society of America, v.56, no.5, pp.1444-1454. 34. W. J. Parzygnat and Y.-H. Pao, 1975, “Resonance Phenomena in Non-Linear Systems Governed by the Duffing Equation”, Letters in Applied and Engineering Sciences, v.3, no.2, pp.109-118. 35. Y.-H. Pao and K. Hutter, 1975, “Electrodynamics for Moving Elastic Solids and Viscous Fluids”, Proceedings of the Institute of Electrical and Electronics Engineers, v.63, no.7, pp.1011-1021. 36. U. Gamer and Y.-H. Pao, 1975, “Diffraction of a Plane Harmonic SH Wave by Semi-Cylindrical Layers”, Archives of Mechanics (Archiwum Mechaniki Stosowanej), v.27, no.1, pp.133-139. 37. Y.-H. Pao, 1976, “Electromagnetic Forces in Deformable Moving Media”, Letters in Applied and Engineering Sciences, v.4, no.1, pp.75-83. (Invited Lecture) 38. Y.-H. Pao and C. C. Mow, 1976, “Theory of Normal Modes and Ultrasonic Spectral Analysis of the Scattering of Waves in Solids”, Journal of the Acoustical Society of America, v.59, no.5, pp.1046-1056. 39. Y.-H. Pao and V. Varatharajulu, 1976, “Huygens’ Principle, Radiation Conditions, and Integral Formulas for the Scattering of Elastic Waves”, Journal of the Acoustical Society of America, v.59, no.6, pp.1361-1371. 40. V. Varatharajulu and Y.-H. Pao, 1976, “Scattering Matrix for Elastic Waves, I. Theory”, Journal of the Acoustical Society of America, v.60, no.3, pp.556-566. 41. Y.-H. Pao, 1976, “Spectral Analysis of Diffracted Elastic Pulses and GeneralizedRay Theory for Mechanical Wave Guides”, Elastic Waves, pp.398-412. (Report of the Workshop on “Application of Elastic Waves in Electrical Devices, NonDestructive Testing and Seismology”, to National Science Foundation, edited by J. D. Achenbach, Y. H. Pao and H. F. Tiersten). 42. U. Gamer and Y.-H. Pao, 1977, “Interaction of Shear Wall with Elastic Foundation under the Excitation of SH Waves”, Engineering Transactions, Polish Academy of Science, v.25, no.3, pp.447-461. 43. *Y.-H. Pao and R. R. Gajewski, 1977, “The Generalized Ray Theory and Transient Responses of Layered Elastic Solids”, in Physics Acoustics, v.13, pp.183-265, edited by W. P. Mason and R. N. Thurston, Academic Press, New York, 1977. 44. P. Chen and Y.-H. Pao, 1977, “The Diffraction of Sound Pulses by a Circular Cylinder”, Journal of Mathematical Physics, v.18, no.12, pp.2397-2406. 45. *Y.-H. Pao, 1978, “Electromagnetic Forces in Deformable Continua”, in Mechanics Today, v.4, pp.209-305, edited by S. Nemat-Nasser, Pergamon Press, Oxford, 1978. (Invited Review) 46. Y.-H. Pao and A. N. Ceranoglu, 1978, “Determination of Transient Response of a Thick-Walled Spherical Shell by the Ray Theory”, Journal of Applied Mechanics, v.45, pp.114-122. 47. V. K. Varadan, V. V. Varadan and Y.-H. Pao, 1978, “Multiple Scattering of Elastic Waves by Cylinders of Arbitrary Cross Section, I. SH Waves”, Journal of the Acoustical Society of America, v.63, no.5, pp.1310-1319.
xxviii
Laudatio for Professor Y.-H. Pao
48. Y.-H. Pao, 1978, “Theory of Acoustic Emission”, Elastic Waves and NonDestructive Testing of Materials, AMD, v.29, pp.107-129, edited by Y. H. Pao, ASME, New York. 49. Y.-H. Pao, 1978, “The Transition Matrix for the Scattering of Acoustic Waves and for Elastic Waves”, Modern Problems in Elastic Wave Propagation, pp.123-144, edited by J. Miklowitz and J. D. Achenbach, John Wiley & Sons, 1978. 50. Y.-H. Pao, 1978, “Betti’s Identity and Transition Matrix for Elastic Waves”, Journal of the Acoustical Society of America, v.64, no.1, pp.302-310. 51. Y.-H. Pao and D. K. Banerjee, 1978, “A Theory of Anisotropic Thermoelasticity at Low Reference Temperature”, Journal of Thermal Stresses, v.1, pp.99-112. 52. W. J. Parzygnat and Y.-H. Pao, 1978, “Resonance Phenomena in the Nonlinear Vibration of Plates Governed by Duffing’s Equation”, International Journal of Engineering Science, v.16, pp.999-1017. 53. W. Sachse and Y.-H. Pao, 1978, “On the Determination of Phase and Group Velocities of Dispersive Waves in Solids”, Journal of Applied Physics, v.49, no.8, pp.4320-4327. 54. Y.-H. Pao, R. R. Gajewski and A. N. Ceranoglu, 1979, “Acoustic Emission and Transient Waves in an Elastic Plate”, Journal of the Acoustical Society of America, v.65, no.1, pp.96-105. 55. R. L. Weaver and Y.-H. Pao, 1979, “Application of the Transition Matrix to a Ribbon-Shaped Scatterer”, Journal of the Acoustical Society of America, v.66, no.4, pp.1199-1206. 56. R. L. Weaver and Y.-H. Pao, 1980, “Multiple Scattering of Waves in Irregularly Laminated Composites”, Journal of Applied Mechanics, v.47, pp.833-840. 57. Y.-H. Pao, 1980, “The Sound of Stress — Acoustic Emission and Elastic Waves”, Cornell Engineering Quarterly, v.14, pp.2-11. 58. Y.-H. Pao, 1980, “Mathematical Theories of the Diffraction of Elastic Waves”, Ultrasonic Materials Characterization, pp.457-473, (National Bureau of Standards Special Publication No. 596), U.S. Government Printing Office, Washington D. C., 1980. 59. Y.-H. Pao, 1980, “Theory of Acoustic Emission”, Transactions of the Twenty-Sixth Conference of Army Mathematicians, (ARO Report 81-1), pp.389-396, U.S. Army Research Office, Research Triangle Park, NC. 60. A. N. Ceranoglu and Y.-H. Pao, 1981, “Propagation of Elastic Pulses and Acoustic Emission in a Plate, Part I: Theory; Part II: Epicentral Responses; Part III: General Responses”, Journal of Applied Mechanics, v.48, pp.125-132, 133-138 & 139-147. 61. S. Sancar and Y.-H. Pao, 1981, “Spectral Analysis of Elastic Pulses Backscattered from Two Cylindrical Cavities in a Solid, Part I”, Journal of the Acoustical Society of America, v.69, no.6, pp.1591-1596 (Part II by S. Sancar and W. Sachse, Journal of the Acoustical Society of America, vol.69, no.6, pp.1597-1609, 1981). 62. R. L. Weaver and Y.-H. Pao, 1981, “Dispersion Relations for Linear Wave Propagation in Homogeneous and Inhomogeneous Media”, Journal of Mathematical Physics, v.22, no.9, pp.1909-1918.
Laudatio for Professor Y.-H. Pao
xxix
63. W. Sachse and Y.-H. Pao, 1981, “Signals in The Far-Field of an Acoustic Emission Source”, Ultrasonics International 1981 Conference Proceedings, pp.116-122, IPC Science and Technology Press, Ltd., Guildford, Surrey, UK. 64. Y.-H. Pao, 1982, “Applied Mechanics and Physics”, (in Chinese), Transactions, Society of Theoretical and Applied Mechanics of China, no.18, pp.1-8. 65. Y.-H. Pao and F. Ziegler, 1982, “Transient SH-Waves in a Wedge-Shaped Layer”, Geophysical Journal of the Royal Astronomical Society, v.71, pp.57-77. 66. R. L. Weaver and Y.-H. Pao, 1982, “Axisymmetric Elastic Waves Excited by a Point Source in a Plate”, Journal of Applied Mechanics, v.49, pp.821-836. 67. R. L. Weaver and Y.-H. Pao, 1982, “Spectra of Transient Waves in Elastic Plates”, Journal of the Acoustical Society of America, v.72, no.6, pp.1933-1941. 68. W. Sachse and Y.-H. Pao, 1982, “Locating and Characterizing Sources of Acoustic Emission”, Proceedings of the Fifth International Conference on Nondestructive Evaluation in The Nuclear Industry, pp.326-331, San Diego, California, 1982. 69. Y.-H. Pao, 1982, “Transient AE Waves in Elastic Plates”, Progress in Acoustic Emission, pp.181-197, Proceedings of the 6th International AE Emission Symposium, October 31-November 3, 1982, edited by M. Onoe, K. Yamaguchi, and T. Kishi, Japanese Society for Non-Destructive Inspection, Susono, Japan. 70. *Y.-H. Pao, 1983, “Elastic Waves in Solids”, Journal of Applied Mechanics (50th Anniversary Volume), v.50, pp.1152-1164, Transactions of the ASME. 71. *Y.-H. Pao, W. Sachse and H. Fukuoka, 1983, “Acoustoelasticity and Ultrasonic Measurements of Residual Stresses”, Physical Acoustics, v.17, pp.61-143, edited by W. P. Mason and R. Thurston, Academic Press, New York, 1983. 72. Y.-H. Pao, G. C. C. Ku and F. Ziegler, 1983, “Application of the Theory of Generalized Rays to Diffractions of Transient Waves by a Cylinder”, Wave Motion, v.5, pp.385-398. 73. T. M. Proctor, F. R. Breckenridge and Y.-H. Pao, 1983, “Transient Waves in an Elastic Plate: Theory and Experiment Compared”, Journal of the Acoustical Society of America, v.74, no.6, pp.1905-1907. 74. J. E. Michaels and Y.-H. Pao, 1984, “Deconvolution of Source Time Functions of the Moment Density Tensor”, in Review of Progress in Quantitative Nondestructive Evaluation, v.3, pp.707-715, edited by D. O. Thompson and D. E. Chimenti, Plenum Press, New York, 1984. 75. Von U. Gamer and Y.-H. Pao, 1984, “Grundlagen der akustoelastischen Spannungsmessung”, Osterreichische Ingenieur und Architekten-Zeitschrift, pp.51-54. 76. Y.-H. Pao, F. Ziegler and P. L. Chen, 1984, “Analysis of Transient Waves in Layered Media with Dipping Structure”, Proceedings of the Eighth World Conference on Earthquake Engineering, pp.703-710, San Francisco, California, 1984. 77. Y.-H. Pao, F. Ziegler and P. L. Chen, 1984, “Transient Waves in Layered Media with Dipping Structure”, presented at the 1984 Pressure Vessels and Piping
xxx
78.
79.
80. 81. 82. 83.
84.
85.
86. 87.
88. 89.
90.
91.
Laudatio for Professor Y.-H. Pao Conference and Exhibition, pp.49-59, San Antonio, Texas, June 17-21, 1984, edited by S. K. Datta. F. Ziegler and Y.-H. Pao, 1984, “A Note on Transient Elastic Cylindrical Waves in a Dipping Layer on Top of a Half-Space”, Osterreichishce Akademie der Wissenschaften, B. 193, Heft 8-10, pp.501-512. J. E. Michaels and Y.-H. Pao, 1984, “Characterization of AE Sources from Experiments with Oblique Forces”, in Progress in Acoustic Emission II, pp.189-195, edited by M. Onoe, K. Yamaguchi and H. Takahashi, Japanese Society for Non-destructive Inspection, 1984. F. Ziegler and Y.-H. Pao, 1984, “Transient Elastic Waves in a Wedge-Shaped Layer”, Acta Mechanica, v.52, pp.133-163. F. Ziegler and Y.-H. Pao, 1985, “Theory of Generalized Rays for SH-Waves in Dipping Layers”, Wave Motion, v.7, no.1, pp.1-24. Y.-H. Pao and U. Gamer, 1985, “Acoustoelastic Waves in Orthotropic Media”, Journal of the Acoustical Society of America, v.77, no.3, pp.806-812. M. Hirao and Y.-H. Pao, 1985, “Dependence of Acoustoelastic Birefringence on Plastic Strains in a Beam”, Journal of the Acoustical Society of America, v.77, no.5, pp.1659-1664. J. E. Michaels and Y.-H. Pao, 1985, “The Inverse Source Problem for an Oblique Force on an Elastic Plate”, Journal of the Acoustical Society of America, v.77, no.6, pp.2005-2011. F. Ziegler, Y.-H. Pao and Y. S. Wang, 1985, “Generalized Ray-Integral Representation of Transient SH-Waves in a Multiple Layered Half-Space with Dipping Structure”, Acta Mechanica, v.56, pp.1-15. F. Ziegler, Y.-H. Pao and Y. S. Wang, 1985, “Transient SH Waves in Dipping Layers: The Buried Line-Source Problem”, Journal of Geophysics, v.57, pp.23-32. Y.-H. Pao, C. S. Yeh and Francis C. Moon, 1985, “Dynamic Stresses in a Buried Cylinder due to Ground Shock”, Proceedings of the Trilateral Seminar-Workshop on Lifeline Earthquake Engineering, pp.297-308, Taipei, Taiwan, November, 1985. J. E. Michaels and Y.-H. Pao, 1986, “Determination of Dynamic Forces from Wave Motion Measurements”, Journal of Applied Mechanics, v.53, pp.61-68. T. Ohira and Y.-H. Pao, 1986, “Microcrack Initiation and Acoustic Emission during Fracture Toughness Tests of A533B Steel”, Metallurgical Transactions A, v.17A, pp.843-852. F. Santosa and Y.-H. Pao, 1986, “Accuracy of a Lax-Wendroff Scheme for the Wave Equation”, Journal of the Acoustical Society of America, v.80, no.5, pp.1429-1437. Y. K. Yeh and Y.-H. Pao, 1987, “On the Transition Matrix for Acoustic Waves Scattered by a Multilayered Inclusion”, Journal of the Acoustical Society of America, v.81, no.6, pp.1683-1687.
Laudatio for Professor Y.-H. Pao
xxxi
92. Y.-H. Pao, 1987, “Theory of Acoustoelasticity and Acoustoplasticity”, in Solid Mechanics Research for Quantitative Non-destructive Evaluation, pp.257-273, edited by J. D. Achenbach and Y. Rajapakse, Martinus Nijhoff Publishers, Dordrecht, the Netherlands, 1987. 93. F. Santosa and Y.-H. Pao, 1989, “Transient Axially Asymmetric Response of an Elastic Plate”, Wave Motion, v.11, pp.271-295. 94. T. Ohira and Y.-H. Pao, 1989, “Quantitative Characterization of Microcracking in A5033B Steel by Acoustic Emission”, Metallurgical Transactions A, v.20A, pp.1105-1114. 95. Y.-H. Pao, F. Ziegler and Y. S. Wang, 1989, “Acoustic Waves Generarated by a Point Source in a Sloping Fluid Layer”, Journal of the Acoustical Society of America, v.85, no.4, pp.1414-1426. 96. K.-C. Weng and Y.-H. Pao, 1989, “ ”, Doctoral Dissertation, National Taiwan University. 97. Y.-H. Pao, T. T. Wu and U. Gamer, 1991, “Acoustoelastic Birefringences in Plastically Deformed Solids: Part I – Theory”, Journal of Applied Mechanics, v.58, pp.11-17., T. T. Wu, M. Hirao and Y.-H. Pao, 1991, “Acoustoelastic Birefringences in Plastically Deformed Solids: Part II-Experiment”, Journal of Applied Mechanics, v.58, pp.18-23. 98. P. Haupt, Y.-H. Pao and K. Hutter, 1992, “Theory of Incremental Motion in a Body with Initial Elasto-Plastic Deformation”, Journal of Elasticity, v.28, pp.193-221. 99. X. Y. Su and Y.-H. Pao, 1992, “Ray-Normal Mode and Hybrid Analysis of Transient Waves in a Finite Beam”, Journal of Sound and Vibration, v.151, no.2, pp.351-368. 100. M. Muller, Y.-H. Pao and W. Hauger, 1993, “A Dynamic Model for a Timoshenko Beam in an Elastic-Plastic State”, Archive of Applied Mechanics, v.63, pp.301-312. 101. Y.-H. Pao and D. C. Keh, 1996, “Moment-Distribution Method and ReDistribution Matrix Analysis of Frame Structures”, The Chinese Journal of Mechanics, v.12, no.1, pp.157-165. 102. *Y.-H. Pao, 1998, “Applied Mechanics in Science and Engineering”, Applied Mechanics Reviews, v.51, no.2, pp.141-153. 103. S. M. Howard and Y.-H. Pao, 1998, “Analysis and Experiments on Stress Waves in Planar Trusses”, Journal of Engineering Mechanics, ASCE, v.37, pp.884-891. 104. Y.-H. Pao and G. S. Lee, 1999, “Analyses of Acoustic Waves in Shallow Water Over A Sloping Bottom — A Review”, Theoretical and Computational Acoustics’97, p.323-328, edited by Y.-C. Teng, E.-C. Shang, Y.-H Pao, M. H. Schultz and A. D. Pierce, World Scientific, Singapore (presented in the 3rd International Conference on Theoretical and Computational Acoustics, 1997). 105. Y.-H. Pao, D. C. Keh and S. M. Howard, 1999, “Dynamic Response and Wave Propagation in Plane Trusses and Frames”, AIAA Journal, v.37, no.5, pp.594-603.
以超音波測定二維殘留應力
xxxii
Laudatio for Professor Y.-H. Pao
106. Y.-H. Pao, X. Y. Su and J. Y. Tian, 2000, “Reverberation Matrix Method for Propagation of Sound in a Multilayered Liquid”, Journal of Sound and Vibration, v.230, no.4, pp.743-760. 107. P. Borejko, C. F Chen and Y.-H. Pao, 2001, “Application of the Method of Generalized Rays to Acoustic Waves in a Liquid Wedge over Elastic Bottom”, Journal of Computational Acoustics, v.9, no.1, pp.41-68. 108. X. Y. Su, J. Y. Tian and Y.-H. Pao, 2002, “Application of the Reverberation-Ray Matrix to the Propagation of Elastic Waves in a Layered Solid”, International Journal of Solids and Structures, v.39, pp.5447-5463. 109. L. S. Wang and Y.-H. Pao, 2003, “Jourdain’s Variational Equation and Appell’s Equation of Motion for Nonholonomic Dynamical Systems”, American Journal of Physics, v.71, no.1, pp.72-82. 110. Y.-H. Pao and G. Sun, 2003, “Dynamic Bending Strains in Planar Trusses with Pinned or Rigid Joints”, Journal of Engineering Mechanics, ASCE, v.129, no.3, pp.324-332. 111. J. F. Chen and Y.-H. Pao, 2003, “Effects of Causality and Joint Conditions on Method of Reverberation-Ray Matrix”, AIAA Journal, v.41, no.6, pp.1138-1142. 112. Y.-H. Pao et. al, “Forward and Inverse Problems of Elastic Waves in Environmental Engineering, Part I: A General Review” by Y.-H. Pao, G. S. Lee and Y. H. Liu, 2003; “Part II: Inverse Medium Problems of Layered Media” by Y.-H. Pao, Y. H. Liu and T. T. Wu, 2003, Enviromental Vibration: Prediction, Monitoring and Evaluation, ed. Chen Yumin and Takemiya Hirokazu, pp.3-23 & pp.24-37, Proceedings of the International Seminar on Environmental Vibration, Zhejiang University, Hangzhou, China, 16-18 October 2003. 113. Y.-H. Pao, 2003, “A Unified Variational Principle of Thermo-mechanics”; K. C. Chen, L. S. Wang and Y.-H. Pao, 2003, “A Variational Principle for the Dynamics of Rigid Bodies”, Proceedings of the Cross-Strait Workshop on Dynamics, Control, and Variational Principles in Mechanics, Beijing, China, October 2003, pp.1-5 & pp.70-74. 114. Y.-H. Pao, W.-Q. Chen and X.-Y. Su, 2007, “The Reverberation-ray Matrix and Transfer Matrix Analyses of Unidirectional Wave Motion”, Wave Motion, Vol.44, Iss.6, pp.419-438. (Special Issue Guest Ed. Y.H. Pao, Selected papers presented at the International Symposium on Mechanical Waves in Solids, Zhejiang University, Hangzhou, China, May, 2006) 115. Y.-H. Pao and W.-Q. Chen, 2009, “Elastodynamic theory of framed structures and reverberation-ray matrix analysis”, Acta Mechanica, 204/1,2 pp.61-79. 116. F. X. Miao, Guojun Sun and Y. H. Pao, 2009, “Vibration Mode Analysis of Frames by the Method of Reverberation Ray Matrix”, ASME Journal of Vibration and Acoustics, Vol.131. 117. Y.-H. Pao, G.-H. Nie and D.-C. Keh, “Dynamic Response and Wave Propagation in Three-dimensional Framed Structures”. (submitted for publication, 2010)
CHAPTER 1 LAMB WAVES IN PHONONIC BAND GAP STRUCTURES
Tsung-Tsong Wu1, Jin-Chen Hsu2 and Jia-Hong Sun1 1
Institute of Applied Mechanics, National Taiwan University 1, Sec. 4, Roosevelt Road, Taipei 10617, Taiwan E-mail:
[email protected] 2
Department of Mechanical Engineering National Yunlin University of Science and Technology 123, Sec. 3, University Road, Douliou, Yunlin 64002, Taiwan Phononic crystals (PCs) which consist of periodically arranged media have, in the past two decades, attracted intensive studies due to their renewed properties of band gaps and maneuverable band structures. Recently, Lamb waves in thin plates with PC structures started to receive increasing attention for their potential applications in filters, resonators, and waveguides. This paper presents a review of recent works conducted by the authors and co-workers on this topic. Both theoretical and experimental studies of Lamb waves in twodimensional (2D) PC plate structures are covered. On the theoretical side, analyses of Lamb waves in 2D phononic plates using the plane wave expansion (PWE) method, finite-difference time-domain (FDTD) method, and finite-element (FE) method are addressed. These methods were applied to determine the frequency ranges of band gaps of Lamb waves and characteristics of the propagating and localized eigenmodes that can exist in the PC plate structures. The theoretical analyses demonstrated the effects of PC based waveguides and resonant cavities. We discuss the influences of geometrical parameters on the guiding and resonance efficiency and the frequencies of waveguide and cavity modes. On the experimental side, we discuss band gaps of Lamb waves in a surface stubbed phononic plate made of aluminum which were demonstrated using laser ultrasonics. Furthermore, we present design and fabrication of a silicon based two-port Lamb wave resonator which utilizes PC reflective gratings to form the cavity. The measured results 1
2
T.-T. Wu, J.-C. Hsu and J.-H. Sun
showed significant improvement of the insertion losses and quality factors of the resonators when the PCs were applied.
1. Introduction Over the past two decades, propagation of acoustic waves in periodic structures comprised of multi-components has received much attention because of renewed physical properties and potential applications of the periodic structures in a variety of fields, such as noise and vibration isolation, frequency filters in wireless communication, super lens design, etc. These periodic structures, called phononic crystals (PCs),32,55 give rise to forbidden gaps of acoustic waves which are analogous to the band gaps of electromagnetic waves in photonic crystals. Major mechanisms leading to the forbidden gaps are known as Bragg scattering and localized resonances.37 The former opens up the Bragg gap at the Brillouin-zone (BZ) boundaries, and the band-gap frequency corresponds to the wavelength in the order of the structural period, i.e. the lattice constant, and relates to the lattice symmetry. On the other hand, localized resonance creates the resonant gap dictated by the frequency of resonance associated with scattering units and depends less on the lattice symmetry, orderliness, and periodicity of the structure. In the literature, theoretical calculations of bulk acoustic waves (BAWs) in two-dimensional (2D) periodic composites were reported using the plane wave expansion (PWE) method,13,30---33,35,37,55,56,67,70---73 multiple scattering theory,26,36,38,46 and finite-difference time-domain method (FDTD).12,28,61 On the other hand, experimental evidence for the existence of complete acoustic band gaps (BAW modes) have also been reported.12,16,42,50,62,64,66 Among these, existence of band gap, transmission properties, and local resonance of 2D phononic structures in the low frequency range were demonstrated. A comprehensive review of bulk waves in PCs can be found in a review article by Sigalas et al.54 Investigations into the surface acoustic wave (SAW) properties of PCs in which periodic modulation occurs on the traction-free surface did not take place until the late 90’s.20,34,59,60,63,69,75,76 Theoretical studies of surface waves on a square and hexagonal superlattice consisting of cubic (AlAs/GaAs) and isotropic (Al/polymer) materials were reported59,60 and,
Lamb Waves in Phononic Band Gap Structures
3
a couple of years later, extended to the case of general anisotropy by Wu et al.76 Subsequently, the case of SAW modes in the 2D piezoelectric PC was reported.20,34,75 In a recent article,69 the concept of the localization factor was introduced to study the Rayleigh wave propagation and localization in disordered piezoelectric PCs and the authors found that the larger the randomness degree the stronger the degree of wave localization. Since the late 90’s, experimental studies of SAW in millimeter-scale 2D PCs have demonstrated the existence of band gaps, attenuation, localization, and anisotropic transmission of surface states for sonic propagation in finite periodic systems with frequency in the range of a few MHz or lower.4,39,63,68,81 Nevertheless, study in the micrometerscale PCs which may find applications in radio frequency (RF) communications or MEMS devices had not been started until the middle of this decade. In an effort toward the integration of PC and SAW frequency filters or oscillators, Wu et al.79 utilized the Silicon micromachining to fabricate an air/silicon square lattice PC with layered slanted finger interdigital transducers (SFIT) attached. The transmission of surface acoustic waves through 6 layers of a PC has more than 30 dB attenuation, as was observed in the band gap between 183 to 215 MHz. In a subsequent paper, Benchabane et al.1 demonstrated experimentally the existence of a complete SAW band gap in a 2D square lattice piezoelectric PC etched in lithium niobate. The surface acoustic wave was generated by a normal interdigital transducer (IDT) and a complete band gap extending from 203 to 226 MHz was demonstrated experimentally. Scattering and propagation of surface acoustic waves in PCs revealed by optical methods were given by Kokkonen et al.29 and Profunser et al.49 Recently, a design that combines two-port SAW devices and PCs acting as reflective gratings was demonstrated.78 The design consists of a layered ZnO/Si SAW device and a square lattice PC composed of cylindrical holes on silicon. By only using 15-layer PC cylinders, experimental insertion loss shows a 7 dB improvement at the central frequency of 212 MHz. As compared with the conventional metallic reflective gratings, the major advantage of using PC is that the size of the CMOS compatible layered SAW filter can be reduced significantly.
4
T.-T. Wu, J.-C. Hsu and J.-H. Sun
Apart from the BAW and SAW modes, propagation of Lamb waves in phononic plate structures has rapidly received increasing interests. It has been known that excitation of high-frequency Lamb waves is achievable by using interdigital transducers (IDTs) deposited on a thin piezoelectric plate25 as that of SAWs on a piezoelectric half space and has been important in a variety of resonators and sensing applications. Theoretical studies on the propagation of Lamb waves in plates with periodic inclusions which are arranged transversely to the propagation direction have not been investigated extensively until recently. For onedimensional (1D) periodic8,17 or quasi-periodic11,53 composite thin plates, the stop bands and transmission properties were investigated and discussed. Dependences of the widths and starting frequencies of band gaps on the plate thickness, filling fraction, and lattice spacing were discussed. As a result, the difference between the band gaps of the bulk and Lamb waves in quasi-periodic composites was revealed. For the 2D case, propagation of Lamb waves in phononic plates with either non-piezoelectric or piezoelectric constituents have also been investigated5---7,21–23,27 utilizing the PWE or modified PWE methods. It was found that, similar to the cases of the bulk and surface modes, directional as well as complete band gaps also existed in phononic plate structures. In order to reduce the computation time of the band structure, a formulation which is based on Mindlin’s plate theory and the PWE method was proposed.21 In addition to the theoretical prediction, the existence of band gaps in phononic plates was also demonstrated through the laser ultrasonics technique.5 In recent years, band structures of phononic plates with three-dimensional (3D) inclusions and/or defects were studied and discussed in the literature using the multiple scattering method,10,51 FDTD method or modified PWE methods.57,58,65,80 Results of these studies showed that with the introduction of defects in phononic plates, wave disturbances can be localized or propagated along the defects and worked as a point resonance cavity or waveguides. In addition to flat phononic plates which have two flat surfaces, complete band gaps and wave guiding in PC plates with periodic stubbed surfaces have been demonstrated both numerically47,48,74,77 and experimentally.74,77 In particular, the results showed that by introducing the periodic stubbed cylinders on the surface
Lamb Waves in Phononic Band Gap Structures
5
of a homogeneous thin plate, a low-frequency gap which is similar to the case of locally resonant structures can be formed.48 On the experimental side, there have been studies, even though not many, on the demonstrations of the band gap properties of phononic plates with lattice constants on the millimeter and micron scales. For the millimeter case (with operating frequencies at a couple of MHz or lower), the propagation of acoustic waves in phononic flat plates or plates with periodic stubbed surfaces was examined experimentally using laser ultrasonics; very good agreements were found with corresponding numerical predictions.3,18,45,77 In addition, demonstration of the wave guiding and frequency selection capability was also conducted.74 Similar to the integration of SAW devices with phononic MEMS structures,1,78,79 the physical realization of Lamb wave phononic MEMS devices had not been started until recently. The first demonstration of the complete band gap of Lamb waves in a phononic slab was given by Olsson et al.44 The phononic slab was realized by including tungsten scatters in a SiO2 matrix. Wide band aluminum nitride piezoelectric couplers were adopted for interrogating the devices with center frequency around 67 MHz. Results showed a 30 dB acoustic rejection with bandwidths exceeding 25% of the midgap. In their work, single and multimode acoustic waveguides have also been investigated by defecting the acoustic crystals through removal of a subset of the tungsten scatters. The other demonstration of a complete band gap of a 2D phononic slab formed by embedding cylindrical air holes in a silicon plate was given by Mohammadi et al.41 The PC structure was made by etching a hexagonal array of air holes through a free standing plate of silicon. A pair of IDT was utilized to measure the transmission of elastic waves through eight layers of the hexagonal lattice PC in the ΓK direction. Their results showed more than 30 dB attenuation was observed at 134 MHz with a band gap to midgap ratio of 23%. In a subsequent paper,40 by etching a hexagonal array of holes in a 15 µm thick slab of silicon, high-Q PC resonators were fabricated using a CMOS compatible process. Results showed that the complete phononic band gap of the PC structure supports resonant modes with quality factors of more than 6000 at 126 MHz. Recent developments in the area of micro-fabricated PCs can be found in a recent review article.43
6
T.-T. Wu, J.-C. Hsu and J.-H. Sun
In this article, our recent investigations of Lamb waves in 2D phononic band gaps structures are reviewed and presented, including theoretical methods, numerical calculations, and experimental results, with a demonstrated application of PCs to Lamb wave resonators. This paper is organized as follows. The formulations of the PWE formalism and numerical methods, including the FDTD and finite-element (FE) methods, for Lamb waves in the PC plate structures are summarized in Sec. 2. Section 3 reviews the calculated results of Lamb wave propagation and related phenomena, including the band gaps, in several PC plate structures, and the utilization of the properties of phononic band gaps to construct phononic waveguides and resonant cavities for efficient guiding and confinement of acoustic energy. In Sec. 4, experimental demonstrations of phononic band gaps and a potential application to Lamb wave resonators, based on the simulated results shown in Sec. 3 are given. Finally, Sec. 5 summarizes this presentation. 2. Formulation and Numerical Methods In this section, the theoretical formulations and numerical methods to analyze the propagation of Lamb waves in 2D PC plate structures are summarized. In Sec. 2.1, the typical PWE methods based on Fourier series expansions of the full 3D wave equations,75,76 as well as the Mindlin plate theory based PWE method are introduced. These can effectively reduce the computation time when analyzing the lower-order Lamb modes.21 In Sec. 2.2, the FDTD method is briefly explained.19,57 In the method, in order to tackle wave propagation in PC plate structures, Bloch periodic boundary conditions and absorbing layers with the FDTD algorithm were developed. Moreover, the construction of parallel computation with a personal computer cluster (PC Cluster) that can accelerate the numerical computing of the FDTD method was introduced. Finally, FE analysis employing the solver COMSOL MULTIPHYSICS with Bloch periodic boundary conditions applied to the FE models is introduced in Sec. 2.3.
Lamb Waves in Phononic Band Gap Structures
7
2.1. Plane Wave Expansion Method 21,75,76 2.1.1. Wave Equation and Eigen-Solutions Consider a periodic elastic structure composed of a 2D periodic array of material A in the x1-x2 plane embedded in a background material B. The governing field equations of wave propagation in solids can be expressed as
ρ (r ′)uɺɺj (r , t ) = ∂ i cijkl (r ′)∂ l uk (r, t ) ,
(2.1.1)
where r=(x1, x2, x3)=(r , x3) is the position vector, uj (r, t) is the displacement vector, and ρ (r ′) and cijkl (r ′) are position-dependent mass density and elastic stiffness, respectively. Due to the spatial periodicity in the x1-x2 plane, the material properties can be expanded in Fourier series with respect to the 2D reciprocal lattice vectors, G = (G1 , G2 ) , as '
cijkl (r′) = ∑ eiG⋅r ′ cGijkl ,
(2.1.2)
G
ρ (r ′) = ∑ eiG ⋅r′ ρG ,
(2.1.3)
G
where cGijkl and ρG are the associated Fourier coefficients which can be obtained from
cGijkl = Ac−1 ∫ cijkl (r ′)e − iG ⋅r ′ d 2 r ′ ,
(2.1.4)
ρG = Ac−1 ∫ ρ (r ′)e −iG⋅r ′ d 2 r ′ .
(2.1.5)
Ac
Ac
In the above integrals, Ac is the enclosed area of a primitive unit cell of the 2D periodic structure on the x1-x2 plane. On utilizing the Bloch theorem and expanding the displacement vector uj in Fourier series, we have u j (r , t ) = ∑ AGj eik3 x3 ei (G +k )⋅r ′−iωt , ( j = 1,2,3)
(2.1.6)
G
where k=(k1, k2) is the Bloch wave vector, ω is the circular frequency, k3 is the wave number along the x3-direction, and A jG, ( j=1,2,3), are the amplitudes of the displacement vector. Inserting Eqs. (2.1.2), (2.1.3), and (2.1.6) in Eq. (2.1.1), the following expression can be obtain
8 PG11,G ′ 2 21 k3 PG ,G ′ P 31 G ,G ′
T.-T. Wu, J.-C. Hsu and J.-H. Sun PG12,G ′ PG22,G ′ PG32,G′
Q11 PG13,G ′ G ,G ′ PG23,G ′ + k3 QG21,G ′ 31 PG33,G′ Q G ,G ′
Q12 G ,G ′ QG22,G ′ QG32,G′
R11 Q13 G ,G ′ G ,G ′ QG23,G ′ + R G21,G ′ QG33,G′ R 31 G ,G ′
R12 G ,G ′ R G22,G ′ R 32 G ,G ′
A1G ,G′ R13 G ,G ′ R G23,G ′ A G2 ,G′ = 0 3 R 33 G ,G′ A G ,G′
(2.1.7) ij G ,G ′
ij G ,G ′
ij G ,G ′
, Q , and R , (i, j=1, 2, 3), are where the submatrices P ijkl functions of k, G, ω, cG and ρG . When the summations in Eqs. (2.1.2), (2.1.3), and (2.1.6) are truncated up to include n reciprocal-lattice vectors G, the matrices PGij ,G′ , QijG ,G′ , and R ijG ,G′ are reduced to n × n matrices, and the expressions of the matrix elements are given by ij 2 α ij β PG,G ′ = ω ρ G − G ′δ ij − (Gα + kα )(Gβ′ + k β )cG − G ′ ,
(2.1.8)
ij α ij 3 3ij β QG,G ′ = − (Gα + kα )cG − G ′ − (Gβ′ + k β )cG − G ′ ,
(2.1.9)
ij 3ij 3 RG,G ′ = −cG − G ′ ,
(α , β = 1, 2).
(2.1.10)
Eq. (2.1.7) can be rewritten in the form of a generalized eigenvalue problem with respect to k3 as
( Pk
2 3
+ Qk3 + R ) U = 0 .
(2.1.11)
Eq. (2.1.11) can be transformed into a standard eigenvalue problem to solve for the eigenvalues k3 and suitably normalized eigenvectors U. By introducing V=k3 U, Eq. (2.1.11) can be rewritten as
I U 0 U −1 = k3 . −1 − P R − P Q V V
(2.1.12)
2.1.2. Lamb Wave in Phononic Crystals Consider a PC of finite thickness to be a phononic plate with thickness denoted by h. The x1-x2 plane of the coordinate system rests in the middle plane of the plate, and the x3-axis is normal to it and directed upward. When the matrices P, Q, and R in Eq. (2.1.11) are reduced to 3n×3n matrices by choosing n reciprocal lattice vectors, solving Eq. (2.1.12) with an arbitrarily guessed frequency ω yields 6n eigenvalues k3(m) ( m = 1 − 6n) . The eigenvalues and the associated eigenvectors can be used to build the displacement field of Lamb waves in the plate. The general solution can be written as
9
Lamb Waves in Phononic Band Gap Structures 6n
( m)
u j ( r, t ) = ∑∑ X m aGj ( m) ⋅ eik3
x3
⋅ ei ( G + k )⋅r ′ − iω t ,
(2.1.13)
G m =1
where Xm, ( m = 1 − 6n) , are undetermined weighting coefficients. The two parallel surfaces of the plate at x3 = ± h / 2 are assumed to be stress free; so the boundary conditions give
τ i3
x3 =± h 2
= ci 3kl ∂ l uk
x3 =± h 2
= 0.
(2.1.14)
Substituting Eq. (2.1.13) in Eq. (2.1.14) leads to a system of 6n homogeneous linear equations for Xm, L(1,1)G 1 L(2,)G ⋮ L(1) 6,G
L(1,G) 2
L(2,)G ⋮ 2
L(6,)G 2
⋯ L(1,G) X 1 6n ⋯ L(2,G) X 2 ⋅ = L⋅X = 0, ⋯ ⋮ ⋮ 6n ⋯ L(6,G) X 6 n 6n
(2.1.15)
where L is a 6n × 6n matrix. L(bm,G) , ( b = 1 − 6 ) , are n × 1 column matrices and their elements L(im,G) are (m)
m L(i ,mG) = ∑ ( Gα′ + kα ) ( cGi 3−kαG ′ aGk (′ m ) ) + k3( ) ( cGi 3−kG3 ′ aGk (′m ) ) eik3
h 2
,
(2.1.16)
G′
(m)
m L(i +m3,) G = ∑ ( Gα′ + kα ) ( cGi 3−kαG ′ aGk (′ m ) ) + k3( ) ( cGi 3−kG3 ′ aGk (′ m ) ) e− ik3
h 2
.
(2.1.17)
G′
Accordingly, the eigenfrequencies of the Lamb waves can be correctly chosen from those guessed frequencies if following condition
det ( L ) = 0 ,
(2.1.18)
is satisfied. Eq. (2.1.18) is the dispersion relation for Lamb waves propagating in 2D periodic composite plates. 2.1.3. Mindlin Theory Based Method Other than the full 3D wave equations, the Mindlin plate theory can be utilized as an alternative.21 According to the assumptions of the Mindlin plate theory, the components of the displacement field uj, ( j=1, 2, 3), for waves propagating in a plate are expanded in power series of the coordinate variable x3:
10
T.-T. Wu, J.-C. Hsu and J.-H. Sun ∞
u j ( r ′, x3 , t ) = ∑ x3n u (j
n)
( r′, t ) = u (j0 ) ( r′, t ) + x3u (j1) ( r′, t ) + …
(2.1.19)
n =0
where uj(n) is called the nth order component of the displacement field. Retaining the zeroth - and first-order terms in Eq. (2.1.19), the equations of motion can eventually be expanded as
ρ uɺɺ1( 0) =
∂ 1 (0) (0) (0) g11u1,1 + g12 u2,2 + κ 2 g14 u3,2 + u2( ) ∂x1
(
)
(2.1.20a)
∂ 1 ( 0) (0) (0) , κ1c56 u3,1 + + u1( ) + c66 u2,1 + u1,2 ∂x2
(
ρ uɺɺ2( 0) =
)
(
)
∂ 1 (0) (0) (0) κ1c56 u3,1 + u1( ) + c66 u2,1 + u1,2 ∂x1
(
)
(
)
(2.1.20b)
∂ 1 ( 0) ( 0) ( 0) g12 u1,1 + + g 22 u2,2 + κ 2 g 24 u3,2 + u2( ) , ∂x2
(
ρ uɺɺ3( 0) =
)
∂ 2 1 ( 0) ( 0) ( 0) κ1 c55 u3,1 + u1( ) + κ1c56 u2,1 + u1,2 ∂x1
(
)
(
)
(2.1.20c)
∂ 1 (0) (0) ( 0) + + κ 2 g 24u2,2 + κ 22 g 44 u3,2 + u2( ) , κ 2 g14u1,1 ∂x2
(
ρ h3 12
1 uɺɺ1( ) =
3 h3 ∂ (1) (1) h ∂ (1) (1) u1,1 γ γ 66 u2,1 + γ 12u2,2 + + u1,2 11 12 ∂x1 12 ∂x2
(
− h κ c
2 1 55
ρ h3 12
1 uɺɺ2( ) =
)
(u
( 0) 3,1
(1)
+ u1
) + κ c (u 1 66
)
(0)
2,1
)
3 h3 ∂ (1) (1) h ∂ (1) γ 66 u2,1 γ u (1) + γ 22u2,2 + u1,2 + 12 ∂x2 12 1,1 12 ∂x1
(
(2.1.20d)
+ u1,2 ,
(0)
)
(
(2.1.20e)
)
− h κ 2 g14u1,1 + κ 2 g 24u2,2 + κ g 44 u3,2 + u2 . (0)
(0)
2 2
(0)
(1)
In the above expressions, Voigt’s notation has been used to replace the fourth order cijkl (i,j,k,l = 1,2,3) by the second order cIJ , ( I , J = 1 − 6) . gIJ and γIJ are the modified elastic stiffnesses given by
g IJ = cIJ − cI 3c3 J c33 ,
(2.1.21)
Lamb Waves in Phononic Band Gap Structures
γ IJ =
cof sIJ sIJ
,
sIJ
s11 = s21 2 s16
s12 s22 2 s26
2s16 2 s26 , 4 s66
11
(2.1.22)
sIJ being the elastic compliances, and κ1 and κ 3 being the correction factors given by
κ1 =
κ 3 = κ1
c33 + c44 −
π 12
,
( c33 − c44 ) 2 g 44
(2.1.23) 2
2 + 4c44
.
(2.1.24)
By Bloch’s theorem, the zeroth- and first-order components of the displacement field have the forms ⌣ u (j0) ( r ′, t ) = ∑ AGj ⋅ ei ( G + k )⋅r ′ − iω t , j = 1, 2, 3, G (2.1.25) ⌣ α i (G + k )⋅r ′ − iω t (1) , α = 1, 2. uα ( r ′, t ) = ∑ BG ⋅ e G
Expansion of those material properties in Eq. (2.1.20) in Fourier series as Eq. (2.1.3) and then substitution of Eq. (2.1.25) into Eq. (2.1.20) lead to a system of equations in the matrix form, ⌣ ⌣ A1G ′ A1G ′ ⌣ ⌣2 M11 A G2 ′ ⋯ M15 G ,G ′ G ,G ′ A ⌣ G′ ⌣3 A3 = 0 . ⋅ = ⋅ ⋮ ⋱ ⋮ (2.1.26) M A ⌣ G′ ⌣ 1G ′ 1 M 51 ′ ⋯ M 55 ′ B BG ′ G ,G G ,G ⌣ G2 ′ B ⌣2 G′ BG ′ While the summations of Eq. (2.1.25) are truncated to include just n reciprocal lattice vectors in practice, the matrix M in Eq. (2.1.26) is reduced to a 5n × 5n matrix. The explicit expressions of the matrix components can be found in Ref. [21]. The eigenfrequencies of the plate modes in the 2D phononic plate can be correctly chosen by examining the following condition:
det ( M ) = 0 .
(2.1.27)
12
T.-T. Wu, J.-C. Hsu and J.-H. Sun
2.2. Finite-Difference Time-Domain Method 2.2.1. Three-Dimensional Difference Equations In a linear elastic material, the equations of motion and constitutive law can be expressed as
ρ uɺɺi = ∂ jτ ij + ρ fi ,
(2.2.1)
τ ij = cijkl ε kl ,
(2.2.2)
where τij is the stress tensor, fi is the body force, and εkl is the strain tensor. Equations (2.2.1) and (2.2.2) describe the motion of an infinitesimal element of an anisotropic material in general. These equations are applicable to the inhomogeneous structure of phononic crystals by arranging the density and elastic constants periodically in space. Further, with staggered grids, the differential equations (2.2.1) and (2.2.2) are transferred into difference equations based on Taylor series expansion to develop the 3D heterogeneous finite-difference formulation.19,57 2.2.2. Boundary Conditions and Perfect Matching Layers The Bloch theorem was introduced to treat the periodic boundary conditions (PBCs) for a unit cell of the PC. According to the theorem, the displacement and stress fields of the PCs can be expressed by multiplying of a plane wave and a periodic function as
ui (r, t ) = eik ⋅rU i (r, t )
(2.2.3)
τ ij (r, t ) = eik ⋅rTij (r, t )
(2.2.4)
where Ui(r, t) and Tij(r, t) are the periodic functions satisfying
U i (r + a, t ) = U i (r, t )
(2.2.5)
Tij (r + a, t ) = Tij (r, t )
(2.2.6)
Lamb Waves in Phononic Band Gap Structures
13
with a lattice translation vector a. Combining Eqs. (2.2.3) and (2.2.4) with Eqs. (2.2.5) and (2.2.6), respectively, the PBCs of displacement ui and stress τij can be written as
ui (r + a, t ) = eik ⋅a ui (r, t )
(2.2.7)
τ ij (r + a, t ) = eik ⋅aτ ij (r, t )
(2.2.8)
With the PBCs, 2D PCs can be analyzed by calculating the unit cell. In analyses of the dispersions, a small disturbance at a random position in the unit cell is set as the initial condition. Thus, all possible wave modes can be transported inside the considered unit cell, and the displacement is recorded and Fourier transformed. Then, the eigenfrequencies with a given wave vector k are indicated by selecting the resonance peaks of the Fourier transformed spectrum, and this allows finding the information about possible types of the waves. On the other hand, Berenger2 introduced the concept of perfect matching layer (PML) to reduce the electromagnetic wave reflection from a boundary, and then, PML was also developed for the elastic wave propagation.9 The PML program was adopted to serve as the nonreflecting boundary condition in analyzing PCs. A stretched coordinate is defined with a complex variable, ei = ai + i
Ωi
ω
(2.2.9)
to derive the code for the PML region. In Eq. (2.2.9), the real part ai is a scale factor, and Ωi ω is the imaginary part with attenuation factor Ωi and circular frequency ω . Then, the differential operation in the stretched coordinate is defined and applied to the equations of motion and constitutive law. After introducing the plane wave solutions into the equations, the numerical attenuation is achieved via the factor Ωi . In addition, the non-reflecting condition at the interface between the PML region and the inner space is obtained by setting corresponding material constants and the unity scale factor. In the elastodynamic equations for the stretched coordinate, displacement and stress components are taken with their spatial partial differential operations in all directions; thus the variables are split into three components to realize the difference
14
T.-T. Wu, J.-C. Hsu and J.-H. Sun
Fig. 1. The hardware architecture of the PC Cluster system.19
equations. Then, the actual values are obtained from summation of split components. Finally, the PML equations can be shown to have the form, ρui j + ρ Ω j uɺi j = τ ij , j , (2.2.10)
τɺij m + Ω mτ ij m = cijkl
uɺk ,lδ ml + uɺl , k δ mk
. (2.2.11) 2 In the above equations, ui/j and τij/m are the split displacement and split stress components, which satisfy ui=ui/1+ui/2 +ui/3 and τij=τij/1+τij/2+τij/3. After transforming Eqs. (2.2.10) and (2.2.11) into difference formulae, the PML is arranged outside the space boundaries as a buffer zone with a matched acoustic impedance to suppress the reflection. A numerical attenuation occurs as waves decay rapidly inside the region. With the absorbing layers, the reflection could be reduced to less than 1%, and PML was used in calculations of the dispersion relations and transmission. 2.2.3. Parallel Computing Because of the high-speed computation and low cost of the PC Cluster system, the complex research of PCs can be achieved efficiently by saving CPU computation time. As shown in Fig. 1, the PC Cluster
Lamb Waves in Phononic Band Gap Structures
15
Fig. 2. Parallel model-working queue.19
consists of several personal computers with different operating systems and different hardware architectures. The computers are connected with each other by external network devices, such as network cards based on Ethernet. Thus, the computers in the system can use the messages passing around to communicate with each other. Therefore, the whole computer system can be regarded as a complete parallel machine. The parallel model of the PC Cluster is SPMD (single program multiple data). In SPMD, the parallelism is controlled by the programs, not the CPU instructions. The controller of the system sends the program to every computational node involved in the calculation, and commands every computer to do the task. In the parallel computing, MPICH,14 a highperformance portable implementation of the standard MPI, was adopted to parallel the FDTD program. MPICH offers many kinds of parallel ways in the library so that the programmer can easily pass the message from one of the computers to the others by simply calling the function offered from the library. The FDTD calculation speed was accelerated by paralleling the program. There were several parallel methods in the calculation, such as the method of dividing the time or space and the method of dividing the wave vectors. When calculating the dispersion relations of acoustic
16
T.-T. Wu, J.-C. Hsu and J.-H. Sun
waves in PCs, the method of dividing the wave vectors was used. Dividing the calculation by time or space requires a large number of data exchanges. On the other hand, when calculating different wave vectors around the irreducible part of the first Brillouin Zone (BZ), there are much fewer messages passing in the PC Cluster system so that the computation can achieve higher performance. There is no message passing, data exchanging, and synchronization between the calculations with different wave vectors. In this parallel method, as Fig. 2 shows, the working queue is created to store all the wave vectors of the whole computation in the server before calculation. Then, the client sends the request to the server for the jobs asked for in the calculation. If the working queue in the server is not empty, the new job will be sent to the client. After receiving the reply from the server, the client can do the calculation with the specific wave vector. Then, the client can iterate the job requesting process until the working queue is empty. This completes the whole computation. Using the parallel method of wave vector dividing, the whole computation can save more calculation time by increasing numbers of CPUs. Because the numbers of CPUs in calculations are increased, jobs dispatched to each node are also reduced. Moreover, in the method, there is no message passing between the clients and fewer data exchanging between the client and the server. Thus, the total waiting time is spent mostly in the calculation and just a little in the waiting for message passing. Therefore, the parallel method of wave vector dividing is very suitable for use in the FDTD study of the dispersion relations in PCs. 2.3. Finite Element Analysis
In the FE analysis, the commercial FE package and solver COMSOL MULTIPHYSICS was used to carry out the numerical calculations. For the eigenfrequency and eigenmode calculations of Lamb waves, a FE structural model is constructed for the unit cell of the considered phononic structure and then meshed. Assuming a time-harmonic solution, the wave equation results in an eigenvalue problem, which yields the eigenvalues and the eigenfrequencies. According to the Bloch’s theorem,
Lamb Waves in Phononic Band Gap Structures
17
the displacement fields obey the following periodic condition on the boundaries of the unit cell, u j ( x1 + ma1 , x2 + na2 , x3 , t ) = u j ( x1 , x2 , x3 , t ) exp ( ik1ma1 + ik2 na2 ) ,
(2.3.1)
where a1 and a2 are the pitches along the x1- and x2-direction of the periodic array, respectively. m and n are integers. The above conditions were applied on the perimeter of the unit cell, and the top and bottom surfaces of the unit cell plate in the model were set to be free. In the calculations, the COMSOL connected to MATLAB Script can be used to iteratively vary the structural dimensions and/or boundary conditions. 3. Phononic Band Gaps, Waveguides, and Cavities
In this section, numerical results conducted by using the methods described in Sec. 2 are reviewed to explain the behavior of Lamb waves in 2D phononic plates. To start with, Sec. 3.1 focuses mainly on discussions of the phenomenon of phononic band gaps. Their existence was determined by the dispersion relations in a frequency range where no eigenmode is allowed and by the transmitting spectra where the power transmission is extremely low. Apart from finding the phononic band gaps, the eigenmodes in the dispersion curves are also discussed. The eigenmodes of phononic crystals are useful for understanding the band-gap formation, wave behavior in the structures, and designing the phononic band gap materials for further applications. In Sec. 3.2 and Sec. 3.3, we discuss the utilization of phononic plates with complete band gaps (i.e., omni-directional band gaps that can forbid wave propagation in any direction) to construct the waveguides and cavities for Lamb waves, respectively. 3.1. Phononic Band Gaps of Lamb Waves
As the 2D phononic structure has a finite thickness in the x3 direction, the structure becomes a plate, and the wave energy is totally confined between the two traction free surfaces of the plate and developed into the Lamb waves. The following two examples illustrate the calculated results
18
T.-T. Wu, J.-C. Hsu and J.-H. Sun Table 1. Elastic Stiffness (1010 N/m2) and Mass Density (kg/m3) Material
Class
c11
c11
c11
c11
c11
ρ
ZnO Si Au Steel Al Epoxy
6mm m3m m3m isotropic isotropic isotropic
20.97 16.56 19.25 28.02 11.1 0.76
12.11 6.39 16.30
10.51
21.09
4.25 7.95 4.24 8.29 2.5 0.159
5676 2329 19300 7900 2695 1180
of the band structures of Lamb waves in 2D Au/epoxy and steel/epoxy phononic plates using the PWE and FDTD methods, respectively. In the first case, the result of the full 3D PWE method was compared with that of the Mindlin-theory-based PWE (abbreviated as MPWE in the following) method.21 They are in good agreement. The MPWE method also exhibits an improvement in reducing the computation time. In the second case, the FDTD method calculated not only the band structure to show the complete band gaps but also the transmission spectra to obtain a close approach to the experiment. The eigenmodes in the periodic plate structure are also discussed. In the third case, another form of periodic plate structure is discussed, where the plate has periodic stubs on one of the plate surface instead of the periodic fillers. The idea to create such a structure was to produce a periodic variation in geometries rather than in the material properties. Calculated result showed that the structure can generate not only complete band gaps but also some highly resonant modes. The FE method was applied to analyze the structures. The material properties used in the following calculations are listed in Table 1. 3.1.1. Au/Epoxy Phononic Plate In the system, crystalline gold (Au) serves as the filling material, and epoxy as the host material. The band structure of the Au/epoxy plate with thickness h=0.25a, where a is the lattice constant, was first calculated using the full 3D PWE method, and then, compared with that using the MPWE method. The Au circular cylinders of radius r are arranged as a square lattice embedded in the epoxy host as depicted in Fig. 3(a). The
19
Lamb Waves in Phononic Band Gap Structures 5
x1
x2
Normalized Frequency (ωa/ Ct )
k2 M Γ
X
k1
Unit cell
a r
x1
(a)
x3
Full 3D Mindlin
4 3 2 1
h
(b)
0
−
Γ
−X
−
M
−
Γ
Fig. 3. (a) Geometries of the phononic plate. (b) Comparison of the band structures using the full 3D PWE method and the MPWE method.21
filling fraction is F=πr2/a2=0.283. Figure 3(b) displays the calculated dispersion curves along the boundary of the irreducible first BZ, ΓXMΓ, in the wave vector space. The solid lines represent the results obtained by using the full 3D PWE method, and the dots denote the results calculated by using the MPWE method. Due to the large computation time required for the full 3D PWE method, the number of reciprocal lattice vectors G is restricted to 81. The CPU times to calculate the band structure along the boundary of the irreducible part of the first BZ (40500 grid points are included) are about 250 h for the full 3D PWE method and only 2 h for the MPWE with 81 reciprocal lattice vectors on a personal computer equipped with Intel Pentium4 CPU of 2.80 GHz and 512 MB memory. In the figure, it can be observed that the two methods result in good agreement of numerical results. The maximum difference is about 1.6% at the eighth band in the calculated interval of normalized frequency. The above analysis shows that the MPWE method exhibits satisfactory accuracy and affords much less computation time. The results suggest that the MPWE method can serve as a quick and good predicting tool in the design of a phononic plate in the lower frequency bands. Moreover, good convergence of the numerical results should be obtained by implementing the calculations with a large number of reciprocal lattice vectors when a phononic plate consists of materials with a large contrast. 81 reciprocal lattice vectors used in the calculations of Fig. 3(b) do not provide satisfactory convergence. The full 3D PWE
20
T.-T. Wu, J.-C. Hsu and J.-H. Sun
(a)
4 Normalized Frequency (ωa/ Ct)
Normalized Frequency ( ωa/ C t)
5 4 3 2 1 0
− Γ
− X
−
M
−
Γ
(b)
3
2
1
0
−
Γ
− X
−
M
−
Γ
Fig. 4. Band structures of Lamb waves in Au/epoxy phononic plates of plate thickness h=0.25a (a) and 0.175a (b), calculated by using the MPWE method with 441 reciprocal lattice vectors.21
method applied to the phononic plate problem, however, is not practical for computation time consideration when the number of the reciprocal lattice vectors is large. Therefore, with the MPWE method, 441 reciprocal lattice vectors were used to re-calculate the dispersion relations of the Au/epoxy example, and the result is shown in Fig. 4(a). In the figure, the complete frequency band gap exists between the sixth and seventh frequency bands extending in the normalized frequency from 2.62 to 3.03. The ratio of gap width to midgap frequency, therefore, is 14.6%. In the phononic plate, the band structure could be quite different from that of an infinite phononic crystal for bulk waves because the waves confined in the finite thickness plate result in an acute dispersion effect in the low frequency region by supporting the flexural and thicknessshear vibrations. Therefore, the ratio h/a can be another influential parameter on opening the complete band gap and band shifting in the band structure. Figure 4(b) shows the band structure corresponding to the phononic plate with a smaller thickness h=0.175a; other parameters employed in the calculations were remained unchanged. In Fig. 4(b) another complete band gap in the thinner phononic plate is found, and the ratio of gap width to midgap frequency is 9.3%. Comparing Fig. 4(b) with Fig. 4(a), the complete band gap between the sixth and seventh frequency bands is closed, and another complete band gap is opened between the fifth and sixth frequency bands, and in the lower frequency
Lamb Waves in Phononic Band Gap Structures
21
50
∆ω/ωm (%)
40 30 20 10 0
0
0.1
0.2
0.3
0.4
0.5
0.6
h/ a
Fig. 5. Variation of the band gap width in the Au/epoxy plate as a function of the plate thickness.21
range, by tuning down the value of the plate thickness. In others words, for Au/epoxy phononic plates, wider and higher complete frequency band gap is obtained in a thicker phononic plate, and lower complete band gap can be created in a thin phononic plate. Figure 5 displays the thickness dependence of the complete band gap width in the Au/epoxy phononic plate. In the thin plate region ( h a ≤ 0.20 ), a complete band gap exists between the fifth and sixth frequency bands sustains a short thickness/lattice spacing range, h / a ≅ 0.075 , and closes down when the thickness h ≤ 0.125a or h ≥ 0.20a . The local maximum band gap width takes place at h=0.175a. In the thicker plate region h a ≥ 0.20 , the complete band gaps appear between the sixth and seventh frequency bands and lie in the relatively higher frequency regions; the band gap width increases progressively with the increase of the thickness when h ≥ 0.20a . It is known that the formation of wide band gaps for BAWs propagating in infinite PCs originates from the interaction between rigid-body resonances of individual fillers and waves propagating in an 52,82 effective homogeneous medium, and the coalescence with Bragg gaps. Correspondingly, in the phononic plate, hybridization of the rigidbody resonance of the individual circular Au plates with the propagation in the effective homogeneous medium corresponding to the periodic
22
T.-T. Wu, J.-C. Hsu and J.-H. Sun
plate results in a complete band gap of Lamb waves to exist. Change in the thickness of the plate dramatically changes the scattering properties of the individual circular plates and thus their resonances; therefore, the band structure of the phononic plate is sensitive to the variation of the thickness. Eventually, the eigenfrequencies of the resonance states can be shifted by tuning the plate thickness to create complete band gaps between different frequency bands as shown in Fig. 4. 3.1.2. Steel/Epoxy Phononic Plate57 In this example, a square lattice phononic plate consisting of steel cylinders embedded in an epoxy matrix is analyzed; its usage to construct the phononic waveguides will be shown in Sec. 3.2.1. To calculate the eigenmodes of acoustic waves in the phononic plate, a unit cell was defined as displayed in Fig. 6(a). The cross section on the x1-x2 plane is 1a×1a. a is divided into 48 grids in the calculations. The lattice constant a=8 mm, and the radius r of the steel cylinder is 3 mm. A time step interval of 10 ns in the FDTD calculation was chosen, sufficiently small to satisfy the numerical stability condition. In this case, F=0.442, and h=0.25a. The dispersion relations were calculated and are shown in Fig. 6(b). In the figure, two noticeable complete band gaps appear from 89 to 101 kHz and 125 to 162 kHz, respectively. In the phononic plate,
250
Phononic-Crystal Plate, Cylinder:Steel / Base:Epoxy, r:a:h=18:48:12, sq. k2
M
π a
π a
200
x1
Frequency (kHz)
x2 x3 h
Γ
H
150
k1
The Second complete band gap
G
100 F
a
The First complete band gap
E
50
(a)
X
(b)
D
C B A
0
-40
-20
0 Γ
Transmission(dB)
X
M
Γ
Reduced Wave Vector
Fig. 6. (a) Setup of the unit cell of the phononic plate in using the FDTD method. (b) Band structure and transmission of Lamb waves in a steel/epoxy phononic plate.57
Lamb Waves in Phononic Band Gap Structures
23
the eigenmodes are coupled, and they are identified as flexural (antisymmetric), longitudinal (symmetric), and transverse (shear horizontal) waves such as Lamb waves in classical plates. In order to investigate the wave modes in the dispersion relation, the transmission of the waves and the polarization of the specific modes were analyzed. First, to launch a wide frequency wave packet along the ΓX direction, x2 and x3 polarized line sources were defined on the plate surface. The acoustic waves propagating through a ten-layered phononic plate and a homogeneous plate without steel cylinders were recorded and compared with each other to calculate the transmission. The results are shown in the left part of Fig. 6(b). When the wave is generated from the x3 polarized source, the gaps appear at 72---116, 125---172, and 176---183 kHz, shown by the solid line. The dotted line presents that, when the wave is generated from the x2-polarized source, the pass bands are between 73---87 kHz and the range below 43 kHz. Compared to the dispersion curves along the ΓX direction, the two transmission distributions show a consistent result with the band structure, except a deaf band that can be observed on the top of the second complete band gap. The displacement distribution inside the unit cell was calculated to display the eigenmodes. The calculation setup was the same as in Fig. 6(a) but the initial condition has been replaced by a monochromatic wave source. The displacement distributions of the eigenmodes which belong to the first seven bands along the ΓX direction are shown in Fig. 7. The displacement distributions of the modes A and B are shown in Figs. 7(a) and 7(b). These two modes belong to the lowest band and its folded band with the same wave vector. In these two cases, the displacement fields are almost invariant along the x2 axis. The 3D vector plots show the displacement distribution inside a unit cell. Additional 2D figures are also plotted for clearer understanding. The 2D figures show the polarizations along the plane which is parallel to the x1-x3 plane passing through the center of the steel cylinder. Obviously, the displacement components u1 and u3 dominate the behavior of these modes, which are basically corresponding to the lowest flexural mode in a plate. The next eigenmode, mode C, belongs to the second band in the band structure. Its displacement field on the plate at x3=h/2 is shown in Fig. 7(c). The mode has a primary polarization in the x2 direction. This is the lowest mode of
24
T.-T. Wu, J.-C. Hsu and J.-H. Sun
horizontally polarized shear wave of which the distribution does not vary appreciably along the thickness. Mode D has a higher phase velocity than the previous two modes. The distribution of the displacements and the polarization on the plane x3=h/2 of the plate is shown in Fig. 7(d). The polarization in this case is mainly along the x1 direction, and the field does not change much along the plate thickness. This means it is the lowest longitudinal mode. For mode E, the plotted slice of the displacement field is parallel to the x2-x3 plane and passes through the center of the steel cylinder, as shown in Fig. 7(e). The polarization remains at the x2-x3 plane but the magnitude also varies with the x1 direction. Basically, this mode contains u3 components, but they are antisymmetric with respect to the x1-x3 plane passing through the center of the unit cell. Thus, this band cannot be excited by the x3-polarization line source, as shown in the transmission of Fig. 6(b). The result of mode F is shown in Fig. 7(f), and this band determines the bottom edge of the first complete band gap. The displacement distribution shows a circular polarization pattern on the plate at x3=h/2, and this band is excitable by the x2-polarized source. The next mode, mode G, is shown in Fig. 7(g) and exists between the two complete band gaps. The displacement fields of two slices a-a' and b-b' are plotted, respectively, in order to show the polarization clearly. Basically, the displacement field shows the property as a flexural mode in a plate, similar to the case shown in Fig. 7(b), but the field varies significantly along the x2 direction. Finally, an example of the deaf band observed in the transmission is mode H shown in Fig. 7(h). The displacement field is antisymmetric with respect to the x1-x3 plane passing through the central line of the unit cell. The x3 component is antisymmetric and the x2 component shows two opposite directions across the unit cell. Thus, neither the x2 polarization nor the x3 polarization line sources on the plate surface can excite this wave, and this band was identified as a deaf band. Dispersion relations of the phononic plates with the same filling fraction but different thicknesses varied from 0.125a to 1.5a and showed that the bands of longitudinal waves (mode D) and transverse waves (modes C and F) appear at similar ranges. That is, the plate thickness does not alter the gaps of these two wave types. However, the flexural modes (modes A, B, E, G, and H) in the plates are affected by the thicknesses. In a thinner plate, the lowest
25
Lamb Waves in Phononic Band Gap Structures
flexural mode shifts toward the lower frequency range. Thus, the folded bands above the gaps of longitudinal and transverse waves shift downward and narrow the gap width. For a thicker plate, the cutoff frequency of higher order flexural modes is lowered, and then, the higher order modes appear at the range of the band gap. Therefore, the phononic plates usually have narrower complete band gaps than those of the 2D phononic crystal of the same filling fraction. With the complete band gaps, the steel/epoxy phononic plate based waveguide is illustrated in the next subsection.
x1 x2
x1 x2
x3
x3 x2
x1
x3
x3
(e)
(a) x1 x2
x1
x1 x2
x3
x3
x2
x1 x3
(f)
(b) x1
x1 x2
x1
x1
x2
x3
x2
x3
x3
a b
a-a'
a' b'
(c)
(g)
b-b'
x1
x1
x1 x2
x3
x2
a'
x2
x3 b'
x2
x3
a a-a' b
(d)
(h) b-b'
Fig. 7. Vibrations of the eigenmodes in the band structure of the steel/epoxy phononic plate labeled on Fig. 6(b).57
26
T.-T. Wu, J.-C. Hsu and J.-H. Sun
3.1.3. Surface Stubbed Phononic Plate77 Different from the two previous cases, a thin plate with a periodic stubbed surface is now considered instead of periodic inclusions. The geometry detail is described as follows. On one side of a thin aluminum plate (plate thickness h1=1 mm), periodic stubbed aluminum cylinders of heights h2 are arranged in a square lattice with lattice constant a=10 mm. The diameter of the cylindrical stubs is 7 mm (i.e., F=0.385, h1/a=0.1). Numerical results of the FE analysis are shown in Fig. 8 to observe the influence of the stub height. Figure 8 shows the dispersion relations of the thin plate with stubs of different heights for waves propagating along the ΓX direction. Upon comparison with the dispersion relation of a homogeneous thin plate [Fig. 8(a)], formation of the special coupling modes can be identified. For example, in Fig. 8(b) of the stub height h2=0.25h1, the purple line marked with S0+A0 means that the mode is transformed from the S0 mode and A0 modes shown in Fig. 8(a). Similarly, the yellow line marked with A0+S0+A1 in Fig. 8(b) indicates that the mode is transformed from the A0, S0, and A1 modes. The evolution of the coupling modes can be identified by comparing Figs. 8(a)–8(c).
(a)
(b)
(c)
250
Frequency (kHz)
200 S0
A1+A0+S0+A0
A1+A0+S0+A0
150
A1+S0+A1
A1+S0+A1
100
A1 A0+S0+A1
A0+S0+A1
T0
50
Γ
0
S0+A0 A0 X
Γ
S0+A0
X
Γ
X
Fig. 8. Dispersion relations along the ΓX direction of stubbed phononic plate with different stub heights h2. (a) Flat plate, (b) h2=0.25h1, and (c) h2=0.5h1. The base plate 77 thickness h1=1 mm, the radius of the stubs is 7 mm, and the lattice constant a=10 mm.
Lamb Waves in Phononic Band Gap Structures
27
250
Frequency (kHz)
200 150
114 – 143 kHz 100 50 0
Reduced Wave Vector (ka/π )
Fig. 9. Band structure of the stubbed phononic plate with h2=10h1.77
If the stub height is gradually increased, some resonances are formed, which result in slower wave velocity and flatter bands near the boundary of the first BZ. It was found that, as the stub height approaches about triple the plate thickness, i.e., h2=3h1, a narrow complete band gap ranging from 167.5 to 171 kHz forms. Moreover, when h2=9h1, the complete band gap is largest, with a range from 119 to 157.5 kHz and a relative band gap width equal to 27.8%. When the stub height is ten times the plate thickness, i.e., h2=10h1, the complete band gap ranges from 119 to 143 kHz (blue region), and three partial band gaps (green region) also appeared. The full band structure of the case h2=10h1 is shown in Fig. 9. 3.2. Phononic Waveguides
Because of the existence of complete band gaps, Lamb wave propagation is forbidden in the frequency ranges with any polarization and propagation direction in the phononic plate structures. The effect allows one to build highly efficient waveguides by utilizing the band gap materials. This subsection describes energy confinement effects for Lamb waves propagating in some defect-contained (line defects) phononic structures with complete band gaps, i.e., phononic waveguides.
28
T.-T. Wu, J.-C. Hsu and J.-H. Sun
To understand clearly the wave propagation and energy distributions in phononic waveguides, the dispersion relations and displacement fields of defect modes in the waveguides, and their characteristics related to the structures are discussed. 3.2.1. Waveguides of Lamb Waves in Steel/Epoxy Phononic Plate Guided Lamb waves in the phononic plates were analyzed with the FDTD method.57 Phononic waveguides were formed based on the complete band gaps of Lamb waves. In the steel/epoxy phononic plate presented in Sec. 3.1.2, there exist complete band gaps in the 89---101 and 125---162 kHz windows. Thus, such plates can be used to create waveguides. In the FDTD method, the supercell used to analyze the dispersion of Lamb waves in the waveguide is shown in Fig. 10(a). The supercell contains a defect area and extra ten unit cells. Using the supercell and the PBCs, the eigenmodes (defect modes) can be excited by incident wideband wave sources. The frequencies of the defect modes can be obtained by selecting the local maximum peaks from the Fourier transformed spectra, following the procedure for a unit cell in Sec. 3.1.2.
Phononic-Crystal Plate Waveguide, Cylinder:Steel / Base:Epoxy, w=10 mm
190 180 170
Frequency (kHz)
160
w
150
140 kHz
140 130 120 110 100
95 kHz
90
Supercell
80 70
(a)
(b)
60 _
Γ
Reduced Wave Vector
_ X
Fig. 10. (a) Top view of the phononic waveguide for Lamb waves and the supercell. (b) Band structure of the defect modes in the waveguide of width w=10 mm.57
29
Lamb Waves in Phononic Band Gap Structures
First, consider the waveguide with a width w=10 mm [see Fig. 10(a)]. The dispersion of the waveguide is shown in Fig. 10(b). The figure is focused on the frequency range of 60---190 kHz to observe defect modes in the complete band gaps. The boundaries of complete band gaps are marked with horizontal solid lines. In the waveguide, there are numerous extended modes outside the range of the complete band gaps but these modes are not concerned and the regions are presented in gray. In Fig. 10(b), 13 recognizable defect bands appear in the complete band gaps. The defect bands could be active or deaf. In general, numerous defect bands in the same frequency range support the acoustic waves to propagate inside the waveguide. The x3-polarized monochromatic line source of frequencies equal to 95 and 140 kHz injected from the lefthand inlet of the waveguide showed that wave propagation can be well confined within the scatter-free area. However, observing Fig. 10(b), the monochromatic wave sources of 95 and 140 kHz can excite multiple defect modes inside the waveguide. Thus, the amplitude distribution of the guided waves will be complicated. A waveguide with single mode or only a few defect modes is much preferred for some further applications. The 2D phononic waveguide of narrower width can allow fewer defect modes. The dispersion relations of the defect modes in a narrow waveguide with w=6 mm is shown in
Phononic-Crystal Plate Waveguide, Cylinder:Steel / Base:Epoxy, w=6 mm
190 180 170
Frequency (kHz)
160 150 140
(b)
130 120 110
x1
100
A
x2
90
x3
x1
80 70
x3
60 _
(a)
Γ
Reduced Wave Vector
_ X
(c)
Fig. 11. (a) Band structure of Lamb waves in the steel/epoxy phononic waveguides of width w=6 mm. (b) Amplitude distribution of the 95 kHz Lamb wave [point A in (a)]. (c) Displacement field of eigenmode A in (a).57
30
T.-T. Wu, J.-C. Hsu and J.-H. Sun Phononic-Crystal Plate Waveguide, Cylinder:Steel / Base:Epoxy, w=4 mm
190 180 170
Frequency (kHz)
160 150 140 130 120 110 100 90 80 70 60 _
Γ
Reduced Wave Vector
_ X
Fig. 12. Band structure of Lamb waves in the steel/epoxy phononic waveguide of width w=4 mm.57
Fig. 11(a). Compared to Fig. 10(b), the number of defect bands in the complete band gaps is reduced from 13 to 10. In the first complete band gap, 89---101 kHz, there are only two defect bands. Figure 11(b) shows the amplitude distribution on the surface at x3=0 of a 95 kHz wave source injected into the left inlet of the waveguide. In the plot, darker color means larger amplitude. The acoustic wave propagation inside the waveguide has a simple pattern. From the dispersion, the defect mode is marked as point A in Fig. 11(a). The characteristic displacement field is plotted in Fig. 11(c). The 3D vector plot shows a full view of the waveguide, and the 2D figure shows the cross section at the center of the waveguide. The figures identify the wave as a flexural mode. Though fewer bands exist in the waveguides of 6 mm width in the first complete band gap, there are still numerous defect modes in the second complete band gap of 125---162 kHz. The band structure of an even narrower waveguide of width w=4 mm is shown in Fig. 12. The waveguide results from an additional space of one quarter of a lattice constant between adjacent unit cells. Because of the extremely narrowed width, there are only six bands inside the waveguide. Both the two complete band gaps have simple defect bands, and in addition, the range of 93---101 kHz in the first complete band gap allows no defect modes. For instance, 95 kHz waves cannot propagate
31
Lamb Waves in Phononic Band Gap Structures
into the waveguide. In short, the number of the defect bands can be depressed by constructing a very narrow waveguide; however, the width of the waveguide should not be too small in practice for acoustic wave propagation. Although waveguides with a narrow width have fewer defect bands, these waveguides may no longer consists of a multiplicity of unit cells. This is inconvenient for the construction of an acoustic circuit with PCs of single periodicity, especially when creating a bent waveguide. This can be circumvented by inserting scatters in the center of waveguides, as shown in Fig. 13(a). The waveguide has a width of 10 mm (the same with the lattice constant) and the scatters in the center of the waveguide have a smaller diameter of 4 mm. The band structure of the waveguide is shown in Fig. 13(b). In the dispersion diagram, there are 11 defect bands in the two complete band gaps. However, a single-mode band ranges from 130 to 142 kHz in the second complete band gap. The propagation of the 136 kHz acoustic wave is shown in Fig. 13(c), and the corresponding mode is marked as point A in the dispersion curve. The characteristic displacement field of this mode is shown in Fig. 13(d). From the displacement patterns, the wave is also identified as a flexural mode and the polarization varies along the x2 direction. Since the polarization plane shows symmetry with respect to the central plane (the
Phononic-Crystal Plate Waveguide with scatters of d=4 mm, Cylinder:Steel / Base:Epoxy, w=10 mm
190 180 170
w
d
Frequency (kHz)
160 150 140
A
130
(c)
120 110
x1 x1
100
x2
90
Supercell
a
x3
b
80
a'
70
b'
60 _
(a)
x3
(b)
Γ
Reduced Wave Vector
_ X
a-a'
(d) b-b'
Fig. 13. (a) Top view of the phononic waveguide with scatters and the supercell used in the FDTD calculation. (b) Band structure of Lamb waves in the steel/epoxy phononic waveguide of width w=10 mm with the scatters of diameter d=4 mm. (c) Amplitude distribution of the 136 kHz acoustic wave [point A in (b)]. (d) Displacement fields of the eigenmode labeled by A in (b).57
32
T.-T. Wu, J.-C. Hsu and J.-H. Sun
x1-x3 plane) of the waveguide, the mode can be excited by the line source of the x3 polarization. 3.2.2. Stubbed PC Plate Waveguide The plate with a periodic stubbed surface can result in complete band gaps of Lamb waves, as described in Sec. 3.1.3. Therefore, the structure can be further applied to the guided propagation of Lamb waves. Consider a line-defect waveguide produced by removing one row of the stubs from the perfect periodic structure. The geometrical parameters of the defect-contained structure are the same as the scales described in Sec. 3.1.3 (i.e., a=10 mm, h1=1 mm, d=7 mm, and h2=10 mm) but one row of the stubs has been removed. The waveguide structure [see Fig. 14(b)] was analyzed using the FE method,74 and the band structure is shown in Fig. 14(a). In the figure, it can be clearly observed that some additional frequency bands, which correspond to the existence of defect modes localized in the waveguide, appear inside the complete band gap. Specifically, the intensity of the displacement field the defect mode with frequency in the complete band gap marked by the red circle in Fig. 14(a) with the reduced wave number ka/π=0.25 and frequency f=126 kHz is
Max
(a)
180
(b)
Frequency (kHz)
160 Min Max
140
(c)
120 Min Max
100
(d)
80 Min
Fig. 14. (a) Band structures along the ΓX direction of Lamb waves in the stubbed phononic plate containing a waveguide produced by removing one row of the stubs. (b)–(d) are the eigenmode shapes corresponding to the red circle marked in (a) with different viewpoints, respectively.74
Lamb Waves in Phononic Band Gap Structures
33
shown in Figs. 14(b)---14(d) from three different views, respectively. In the figures, it is obvious that most of the elastic-wave energy is well confined within the area of the line-defect waveguide. 3.3. Cavity
Another approach utilizing PCs to confine acoustic energy by band gaps is to create a cavity in the structure. The confined energy can result in resonance with frequencies associated with the dimensions of the cavity. As a result, this leads to possible applications of phononic cavities to acoustic resonator devices. To be compatible with micro fabricating processes for the device, silicon based structures could be adopted.24 The phononic plate can be constructed by etching periodic circular holes in a silicon plate. Thus, the resonance conditions of the cavity should be considered. To design a resonant cavity utilizing the phononic structure, the band gap property of the PC needs to be considered. Figure 15(a) shows the dispersion curves calculated by the FE method for a 2D air/silicon PC plate of square lattice. The lattice constant a is 20 µm, the radius of the air cylindrical hole 8.9 µm, and the plate thickness h=12 µm. With the filling fraction F=0.622, there exist (i) a partial band gap from 148.8 to 174.1 MHz forbidden the Lamb wave propagation along the ΓX direction and (ii) a complete band gap from 148.8 to 160 MHz. Designing a Lamb wave resonator device could begin by calculating the proper position of the wave sources. The interference of Lamb waves encountering the PC structure was calculated by the FE simulations. The frequency of the Lamb wave was chosen to be 160 MHz and the wavelength λ of the corresponding lowest anti-symmetric mode A0 in a 12µm plate is equal to 25 µm. In the simulation, 10 layers of air holes were assumed. Two pairs of normal line forces with interlaced positive and negative polarization were used as external excitation. The interval between the adjacent line forces was one half of the wavelength. The distance between the PC structure and the wave source is defined as D. Two examples of different distances D are shown in Fig. 15(b). They present constructive and destructive interferences, respectively. The color shows
34
T.-T. Wu, J.-C. Hsu and J.-H. Sun
(a)
Dispersion of Air/Si PC Plate
D=1.178λ
y
A0 S0
D=1.43λ
x y
150 SH1 SH0
100 M Γ
X
S0
50
A0 SH0
0
x
(c)
A0
Normalized Amplitude
Frequency (MHz)
SH0
SH1
200
(b)
6
2 0 -2 -8
Γ
X
Constructive Destructive
4
-7
-6
-5
-4
-3
-2
-1
0
1
2
3
4
Arc length (a)
Fig. 15. (a) Dispersion of Lamb waves along the ΓX direction in the air/silicon PC plate with h=12 µm, a=20 µm, and F=0.622. (b) Constructive and destructive interference situations. (c) u3 component along the dashed lines in (b) of the 160 MHz Lamb wave.24
the amplitude of the displacement component u3. In the first case of Fig. 15(b), the amplitude is enhanced significantly as the reflective distance D is equal to 1.178λ. The amplitude of the u3 component is almost twice the case without the PC structure. Alternatively, when the reflective distance D is equal to 1.43λ, the normalized amplitude of the region of interest is almost vanishes, and results in a destructive interference [the second case of Fig. 15(b)]. Comparison of the amplitudes of the constructive and destructive interferences is shown in Fig. 15(c). Consequently, the distance D between the line force and the PC structure accordingly should be designed. 4. Demonstrations of Band Gaps and Potential Applications
This section describes measurement of band gaps and resonance characteristics found in the surface stubbed phononic plate. In this presentation, the demonstration of micro cavities based on silicon
Lamb Waves in Phononic Band Gap Structures
35
phononic structures for potential applications to high frequency acoustic wave devices is also included. Band gaps of Lamb waves and resonance peaks in the surface-stubbed phononic plate were detected and characterized using the laser ultrasonic technique and piezoelectric transducers. Details are described in Sec. 4.1. In Sec. 4.2, the design and fabrication of a micro cavity for Lamb waves are explained. In the designs, the cavities can serve as acoustic wave resonator devices where the phononic structures act as high efficient reflective gratings. The performance of the devices in frequency responses was experimentally determined, and the results showed significant improvements of the performance. 4.1. Measurements of Band Gaps in a Stubbed PC Plate77
The experimental scheme is shown in Fig. 16(a). The stubbed phononic plate was fabricated with aluminum using mechanical machining. The geometrical parameters of the structure are the same as the scales considered for Fig. 9, and there are 16×10 stubs on one side of the plate surfaces. In the experiments, an Nd: YAG (yttrium aluminum garnet) pulsed laser was utilized to generate broadband elastic waves in the specimen, and an optical interferometer (with He-Ne laser) was employed to measure the out-of-plane component of the displacement of the waves. The signals were received on the base plate at positions with four stubs away from the wave source [see Fig. 16(a)]. Other than the optical interferometer which might be insensitive to the in-plane vibrations, point piezoelectric transducers (the contact spots are about 1.5 mm in diameter) with different polarizations were, respectively, used to detect the out-of-plane and in-plane vibrations of the waves. The measured point was located at the center of the top surface of the stub six rows apart from where the pulsed laser source was applied. Then the measured time-domain signals were Fourier transformed. The digitized sampling rate for the signal from the detectors was 50 MHz; so the transformed spectra were obtained in a good resolution.
36
T.-T. Wu, J.-C. Hsu and J.-H. Sun
The left-hand side of Figs. 16(b) and 16(c) show the measured reference spectra by the optical interferometer for waves propagating along the ΓX and ΓM directions, respectively, and the corresponding band structures are attached on their right-hand side. The reference spectrum is defined as the ratio of the values in the spectrum measured on the stubbed phononic plate to that measured on the uniform thin plate of thickness 1 mm. In the figures, the blue regions indicate the range of the complete band gap, which is from 114 to 143 kHz. Both the measured reference spectra in the ΓX and ΓM directions exhibit very low intensities in that range which agree well with the prediction by the band structure. In Fig. 16(b), three apparent dips in the spectral ranges which correspond to the three partial band gaps (the green regions) also can be clearly observed, indicating much conformity to the locations and width of the partial band gaps appearing in the band structure. In addition to the observations on the band gaps, several peaks can be obviously found in the spectra. These peaks are resonances corresponding to the band-edge frequencies, such as at 19 kHz (the bottom edge of the first partial band gap), 100 kHz, and 109 kHz (the bottom and upper edges of the second partial band gap, respectively), where the slopes of the frequency bands approach zero and associate with high density of states of Lamb modes. Moreover, the peak frequency at 109 kHz has a value of 1.46, which is
250
Frequency (kHz)
200
250
ΓX
(b)
150
100
(b)
150
100
50
50
(a)
ΓM
200
Frequency (kHz)
(a)
0 0 0.3 0.6 0.9 1.2 1.5
Γ
X
(c)
0
0 0.2 0.4 0.6 0.8 1
Γ
M
Fig. 16. (a) Schematic of the experimental setup for measuring the Lamb waves in the stubbed phononic plate along the ΓX and ΓM directions. (b) and (c) are the measured reference spectra (normal displacement) for waves propagating along the ΓX and ΓM directions, respectively.77
37
Lamb Waves in Phononic Band Gap Structures
larger than unity, which means an amplified out-of-plane vibration relative to that in the uniform thin plate, induced by the resonance. In Figs. 16(b) and 16(c), in the frequency range between 143 and 193 kHz, the reference spectra have values about 0.15, which means that elastic waves with frequency in this range may be propagating but with perceptible attenuation. Two nearly flat bands exist in the ΓX direction around 190 kHz. However, these bands are mainly governed by the T0 plate mode (with polarization perpendicular to the sagittal plane), and thus, result in no peak in the spectrum. Figure 17(a) shows the reference spectra measured on the top surface of the stub by using the longitudinal and transverse-polarization piezoelectric transducers. For the measurement of the in-plane vibration, the polarization of the transverse transducer was set to be parallel to the wave vector corresponding to the ΓX direction. In the figure, both the reference spectra measured by longitudinal and transverse transducers show, again, very low intensities within the complete band gap, i.e., the range from 114 to 143 kHz. As a result, the experiment confirmed the existence of the complete band gap in such a periodic stubbed plate. Moreover, the in-plane measurement (shown by the red dashed line)
250 On the plate
Frequency (kHz)
200
150
(g)
Transverse signal on the top of stub Longitudinal signal on the top of stub
(f)
(b)
(c)
(d)
(e)
(f)
(g)
(e) (d)
100
(c)
50
(b)
0
0 1 2 3 4 5 6
Γ
X
Fig. 17. (a) Reference spectra measured on the top of the stub (normal and transverse displacements by using point piezoelectric transducers). (b)–(g) are the eigenmodes labeled on the right panel of (a) by the FE method.77
38
T.-T. Wu, J.-C. Hsu and J.-H. Sun
exhibits more resonant peaks with values larger than one. To give an insight into the resonances in the structure, several calculated eigenmode shapes associated with the peak frequencies are, respectively, shown in Figs. 17(b)–17(g). Figure 17(b) displays the eigenmode associated with the lowest outstanding peak around 19 kHz. The measured spectra in Fig. 17(a) simultaneously demonstrate notably enhanced in-plane and out-of-plane vibrations at this frequency. From the mode shape it can be observed that the resonance forms by concentrating energy in the stubs, and thus, results in a very slow group velocity. The second peak appearing at 25 kHz corresponding to the joint cutoff frequency of the mode in the bands in red and green colors is a band-edge state. Next, the eigenmode shape with frequency 60 kHz where the third apparent peak appears is shown in Fig. 17(c). This mode is mainly governed by the antisymmetric mode but with significant in-plane vibration in the stub so the spectrum measured by the transverse transducer exhibits a peak there. The fourth peak appearing at 100 kHz is a mode in the flat part of the band (in green color) evolved from the S0 and A1 plate modes. This mode is also a band-edge state, and its mode shape is shown in Fig. 17(d) where notable transverse vibrations can also be found in the stub. Adjacent to the peak at 100 kHz, the next peak appears at 109 kHz. The associated mode shape is shown in Fig. 17(e). The vibration of this mode is very similar to the mode in Fig. 17(d). For the bottom edge state of the complete band gap, the mode shape is calculated and plotted in Fig. 17(f). It is found that this mode generates no transverse vibration on the top surface of the stub. For frequencies close to 205 kHz, there exists a small peak and the corresponding mode shape is shown in Fig. 17(g). The vibration shows that strong dilatational vibration exists in the stub and out-of-plane vibration exists in the base plate. However, this mode in the band in purple color is mainly governed by the T0 plate mode and the excitation by the pulsed laser source is not that effective as are the antisymmetric modes. Therefore, relatively low peak intensity appears around 205 kHz when detecting by the optical interferometer and longitudinal transducer [Figs. 16(b) and 17(a)], and no significant intensity is observed when detecting by the transverse transducer applied on the stub.
Lamb Waves in Phononic Band Gap Structures
39
4.2. Micro Phononic Cavity for Lamb Wave Resonator
This section focuses on a study of Lamb wave resonators using phononic structures as reflective gratings. Note that in traditional acoustic wave resonators, metal strips are usually adopted as reflective gratings to improve insertion losses of IDTs. To achieve a high reflection, gratings may consist of hundreds of metal strips. Moreover, the distance between the IDT and the grating has to be optimized to result in coherent 15 reflective waves in the resonator. If PCs are applied to replace the metal gratings and formed a phononic cavity, the reflection efficiency can be improved and the size of the resonator can be considerably reduced. A schematic of the Lamb wave resonator using the PC gratings is shown in Fig. 18(a). The Lamb wave device was constructed on a silicon plate. The PC reflector on the plate was introduced that air cylinders form a square lattice with band gap of 148.8–174.1 MHz [refer to the band structure shown in Fig. 15(a)]. As can be seen, the Lamb wave device also included a pair of IDTs and a piezoelectric film on top of the silicon plate to generate elastic waves.
-20
IDT
Insertion Loss (dB)
PC reflective grating
ZnO silicon plate
(a)
(b)
-30 -40 -50 -60 150
155 160 Frequency (MHz)
165
Fig. 18. (a) Schematic of the Lamb wave resonator combined with PC gratings. (b) The measured transmission of the two-port Lamb wave resonator with the PC gratings.24
40
T.-T. Wu, J.-C. Hsu and J.-H. Sun Table 2. Parameters of the Two-Port Lamb Wave Resonator IDT line width (µm) IDT pair
6.2 40
IDT aperture
80λ
IDT thickness (nm)
150
PC grating layers
15
IDT-IDT interval (µm)
756.4
IDT-PC distance (D)
2.178λ
PC-PC distance (µm)
2886.5
Based on the simulation, two-port Lamb wave resonators which utilize the PC reflective gratings were designed and fabricated. The design parameters are listed in Table 2. The fabrication processes include the deposition of metals for developing IDTs and bottom electrodes, deposition of piezoelectric ZnO film for generating elastic waves, lithography for patterning the IDTs and PCs, dry etching for air PC holes, and wet back etching for obtaining the thin plate structure. A silicon-oninsulator (SOI) wafer with a 12 µm thick Si device layer was chosen to control the plate thickness of the Lamb wave device precisely. In addition to the device layer, the thicknesses of the handle layer and the oxide layer of the SOI wafer are 300 µm and 0.5 µm, respectively. The oxide layer was used as blocking material during the dry etching process. At the beginning, a gold layer of thickness 100 nm was first evaporated on the device side of the SOI wafer. Then a 0.9 µm thick piezoelectric ZnO layer was deposited and patterned using RF sputtering and wet etching. The IDTs structures were pattered by lithography before the metallization. Aluminum film of 150 nm thickness was evaporated on the photoresist structure and then IDTs were formed using a lift-off process. The width of the IDTs is 6.2 µm. Then, 15 layers of air holes were obtained by patterning and dry etching using plasma etching. As shown in the figure, the PC reflectors are placed on the outer sides of the two IDT ports. Finally, the lower handle layer and oxide layer were removed by KOH and BOE wet etching, respectively. The frequency response of the Lamb wave resonator with phononic reflective gratings was
Lamb Waves in Phononic Band Gap Structures cavity length
(a)
Frequency (MHz)
(b)
41
170
165
160
155
150
3
3.5
4 20
20.5
21
cavity length (a)
Fig. 19. (a) Schematic of cavities in PCs. (b) Resonant frequencies of anti-symmetric modes inside the phononic cavities.24
measured by a vector network analyzer and the result is shown in Fig. 18(b). The frequency response shows a main lobe of around 2.5 MHz width and the central frequency is around 157 MHz. The shift of central frequency may be caused by the difference of the actual plate thickness to the simulated case. Furthermore, the signal shows three sharp peaks at the main peak frequency region. The frequencies of the three peaks are 157.2, 158.2 and 159.1 MHz and the largest quality factor is 2259 at the third peak. The difference between these frequencies is about 1 MHz. The appearance of three peaks is due to a large size of the resonant cavity consisting of two PC gratings. The resonance of Lamb waves inside the cavity could be understood by the Fabry-Perot resonant condition. It is worth noting that the multiwavelength of different frequencies may satisfy the resonant condition in one resonant cavity. Thus, theoretically there can be infinite resonant modes in a cavity. Figure 19 shows the resonant frequency of antisymmetric Lamb waves inside the PC cavities of different cavity length. As show in Fig. 19(a), the cavity length is defined as the additional size inserted into the PC plate. Fig. 19(b) shows the variation of the resonant frequency as cavity length changes from 3a---4a and 20a---21a. The
42
T.-T. Wu, J.-C. Hsu and J.-H. Sun
concerned frequency range is 150---170 MHz inside the band gap. In Fig. 19(b), the cavities with length 3.58 a and 20.49 a have the 160 MHz resonant frequency. Further, the cavity with 20.49a allows another two resonant modes at 153.98 and 165.78 MHz inside the 150-170 MHz range. This shows the difference of two neighboring resonant frequencies depends on the cavity size. On the experimental case, the calculated frequency difference narrows down to about 0.95 MHz. Obviously, a cavity of larger size allows a smaller frequency difference. Therefore, to design a single resonant mode resonator, it is necessary to estimate the interval frequency range of two resonant modes and the width of the main peak controlled by the IDT pair number. 5. Conclusions
Theoretical and experimental studies of Lamb waves in 2D phononic plates are reviewed. The theoretical formulation and methods to analyze the phononic structures are summarized, including the PWE methods, FDTD method, and FE method. Analyzing the phononic plate structures using these methods shows that band gaps of Lamb waves exist in the structures. Further analysis of the phononic waveguides and resonant cavity based on phononic plates with complete band gaps shows that efficient confinement and localization of the acoustic energy can be achieved, and the characteristics of the waveguides and cavity are described. The transport efficiency of the wave energy in straight and bent waveguides was evaluated where an improved design was proposed. The conditions of constructive interference of the resonant modes in long phononic cavities were determined as well. Experimental result show that the detected signals of Lamb waves generated by a pulsed laser in a centimeter-scale stubbed phononic plate exhibited complete band gap between 143 and 193 kHz. Moreover, phononic cavities had been designed and applied to serve as the reflective gratings of Lamb wave resonators. Experimental result showed that, with proper designed band gap frequencies and cavity lengths, the insertion losses and quality factors of the Lamb wave resonators can be significantly improved.
Lamb Waves in Phononic Band Gap Structures
43
References 1. Benchabane, S., Khelif, A., Rauch, J.-Y., Robert, L., and Laude, V. (2006). Evidence for complete surface wave band gap in a piezoelectric phononic crystal, Phys. Rev. E 73, 065601. 2. Berenger, J. (1994). A perfectly matched layer for the absorption of electromagnetic waves, J. Comput. Phys. 144, 185---200. 3. Bonello, B., Charles, C., and Ganot, F. (2007). Lamb waves in plates covered by a two-dimensional phononic film, Appl. Phys. Lett. 90, 021909. 4. Bonello, B., Charles, C., and Ganot, F. (2006). Velocity of a SAW propagating in a 2D phononic crystal, Ultrasonics 44, 1259---1263. 5. Brunet, T., Vasseur, J., Bonello B., Djafari-Rouhani, B., and Hladky-Hennion, A.-C. (2008). Lamb waves in phononic crystal slabs with square or rectangular symmetries, J. Appl. Phys. 104, 043506. 6. Charles, C., Bonello, B., and Ganot, F. (2006). Propagation of guided elastic waves in 2D phononic crystals, Ultrasonics 44 e1209---e1213. 7. Chen, J.-J., Bonello, B., and Hou, Z.-L. (2008). Plate-mode waves in phononic crystal thin slabs: Mode conversion, Phys. Rev. E 78, 036609. 8. Chen, J.-J., Zhang, K.-W., Gao, J., and Cheng J.-C. (2006). Stopbands for lowerorder Lamb waves in one-dimensional composite thin plates, Phys. Rev. B 73, 094307. 9. Chew, W. C. and Liu, Q. H. (1996). Perfectly matched layers for elastodynamics: A new absorbing boundary condition, J. Comput. Acoust. 4, 341---359. 10. Fung, K. H., Liu, Z., and Chan, C. T. (2005). Transmission properties of locally resonant sonic materials with finite slab thickness, Z. Kristallogr. 220, 871---876. 11. Gao, J. and Cheng, J.-C. (2007). Propagation of Lamb waves in one-dimensional quasiperiodic composite thin plates: A split of phonon band gap, Appl. Phys. Lett. 90, 111908. 12. Garica-Pablos, D., Sigalas, M., Montero de Espinosa, F. R., Torres, M., Kafesaki, M., and Garcia, N. (2000). Theory and Experiments on Elastic Band gaps, Phys. Rev. Lett. 84, 4349---4352. 13. Goffaux, C. and Vigneron, J. P. (2001). Theoretical study of a tunable phononic band gap system, Phys. Rev. B 64, 075118. 14. Gropp, W., Lusk, E., Doss, N., and Skjellum, A. (1996). A high performance, portable implementation of the MPI message passing interface standard, Parallel Comput. 22, 789---828. 15. Hashimoto, K., Surface Wave Devices in Telecommunications: Modeling and Simulation. (Springer, New York, 2000). 16. Ho, K. M., Cheng, C. K., Yang, Z., Zhang, X. X., and Sheng, P. (2003). Broadband locally resonant sonic shields, Appl. Phys. Lett. 83, 5566---5569.
44
T.-T. Wu, J.-C. Hsu and J.-H. Sun
17. Hou, Z. and Assouar, B. M. (2008). Transmission property of the one-dimensional phononic crystal thin plate by the eigenmode matching theory, J. Phys. D: Appl. Phys. 41, 095103. 18. Hsiao, F.-L., Khelif, A., Moubchir, H., Choujaa, A., Chen, C.-C., and Laude, V. (2007). Waveguiding inside the complete band gap of a phononic crystal slab, Phys. Rev. B 76, 056601. 19. Hsieh, P.-F. Wu, T.-T., and Sun, J.-H. (2006). Three-dimensional phononic band gap calculations using the FDTD method and a PC cluster system, IEEE Trans. Ultrason. Ferroelectr. Freq. Contr. 53, 148---158. 20. Hsu, J.-C. and Wu, T.-T. (2006). Bleustein-Gulyaev-Shimizu surface acoustic waves in two-dimensional piezoelectric phononic crystals, IEEE Trans. Ultrason. Ferroelectr. Freq. Contr. 53, 1169---1176. 21. Hsu, J.-C., and Wu, T.-T. (2006). Efficient formulation for band-structure calculations of two-dimensional phononic-crystal plates, Phys. Rev. B 74, 144303. 22. Hsu, J.-C. and Wu, T.-T. (2008). Calculations of Lamb wave band gaps and dispersions for piezoelectric phononic plates using Mindlin’s theory-based plane wave expansion method, IEEE Trans. Ultrason. Ferroelectr. Freq. Contr. 55, 431---441. 23. Hsu, J.-C. and Wu, T.-T. (2007). Lamb waves in binary locally resonant phononic plates with two-dimensional lattices, Appl. Phys. Lett. 90, 201904. 24. Huang, C.-Y., Sun, J.-H., and Wu, T.-T. (2010). A two-port ZnO/Silicon Lamb wave resonator using phononic crystals, submitted to Appl. Phys. Lett. 25. Joshi, S. G. and Jin, Y. (1991). Excitation of ultrasonic waves in piezoelectric plates, J. Appl. Phys. 69, 8018---8024. 26. Kafesaki, M. and Economou, E. N. (1999). Multiple-scattering theory for threedimensional periodic acoustic composites, Phys. Rev. B 60, 11993. 27. Khelif, A., Aoubiza, B., Mohammadi S., Adibi, A., and Laude V. (2006). Complete band gaps in two-dimensional phononic crystal slabs, Phys. Rev. E 74, 046610. 28. Khelif, A., Djafari-Rouhani, B., Laude, V., and Solal, M., Coupling characteristics of localized phonons in photonic crystal fibers, J. Appl. Phys. 94, 7944---7946 (2003). 29. Kokkonen, K., Kaivola, M., Benchabane, S., Khelif, A., and Laude, V. (2007). Scattering of surface acoustic waves by a phononic crystal revealed by heterodyne interferometry, Appl. Phys. Lett. 91, 083517. 30. Kushwaha, M. S. and Halevi, P. (1994) Band-gap engineering in periodic elastic composites, Appl. Phys. Lett. 64 ,1085---1087. 31. Kushwaha, M. S. and Halevi, P. (1997). Stop-bands for periodic metallic rods: Sculptures that can filter the noise, Appl. Phys. Lett. 70, 3218---3220. 32. Kushwaha, M. S., Halevi, P., Dobrzynski, L., and Djafari-Rouhani, B. (1993). Acoustic band structure of periodic elastic composites, Phys. Rev. Lett. 71, 2022---2025.
Lamb Waves in Phononic Band Gap Structures
45
33. Kushwaha, M. S., Halevi, P., Martinez, G., Dobrzynski, L., and Djafari-Rouhani, B. (1994). Theory of acoustic band structure of periodic elastic composites, Phys. Rev. B 49, 2313---2322. 34. Laude, V., Wilm, M., Benchabane, S., and Khelif A. (2005). Full band gap for surface acoustic waves in a piezoelectric phononic crystal, Phys. Rev. E 71, 036607. 35. Li, X., Wu, F., Hu, H., Zhong, S., and Liu, Y. (2003). Large acoustic band gaps created by rotating square rods in two-dimensional periodic composites, J. Phys. D: Appl. Phys. 36, L15---L17. 36. Liu, Z., Chan, C. T., and Sheng, P. (2000). Elastic wave scattering by periodic structures of spherical objects: Theory and experiment, Phys. Rev. B 62, 2446---2457. 37. Liu, Z., Chan, C. T., and Sheng, P. (2002) Three-component elastic wave band-gap material, Phys. Rev. B 65, 165116. 38. Mei, J., Liu, Z., Shi, J., and Tian, D. (2003). Theory for elastic wave scattering by a two-dimensional periodical array of cylinders: An ideal approach for band-structure calculations, Phys. Rev. B 67, 245107. 39. Meseguer, F., Holgado, M., Caballero, D., Benaches, N., Sanchez-Dehesa, J., Lo´pez, C., and Llinares, J. (1999). Rayleigh-wave attenuation by a semi-infinite two-dimensional elastic-band-gap crystal, Phys. Rev. B 59, 12169---12172. 40. Mohammadi, S., Eftekhar, A. A., Hunt, W. D., and Adibi, A. (2009). High-Q micromechanical resonators in a two-dimensional phononic crystal slab, Appl. Phys. Lett. 94 051906. 41. Mohammadi, S., Eftekhar, A. A., Khelif, A., Hunt, W. D., and Adibi, A. (2008). Evidence of large high frequency complete phononic band gaps in silicon phononic crystal plates, Appl. Phys. Lett. 92, 221905. 42. Montero de Espinosa, F. R., Jimenez, E., and Torres, M. (1998). Ultrasonic band gap in a periodic two-dimensional composite, Phys. Rev. Lett. 80, 1208---1211. 43. Olsson III, R. H. and El-Kady, I. (2009). TOPICAL REVIEW: Microfabricated phononic crystal devices and applications, Meas. Sci. Technol. 20, 012002. 44. Olsson III, R. H., El-Kady, I. F., Su, M. F., Tuck, M. R., and Fleming, J. G. (2008). Microfabricated VHF acoustic crystals and waveguides, Sensors and Actuators A 145, 87---93. 45. Ostrovskii, I. V., Klymko, V. A., and Nadtochiy, A. B. (2009). Plate wave stopbands in periodically poled lithium niobate, J. Acoust. Soc. Am. Express Letters 125, EL129---EL133. 46. Psarobas, I. E. and Stefanou, N. (2000). Scattering of elastic waves by periodic arrays of spherical bodies, Phys. Rev. B 62, 278---291. 47. Pennec, Y., Djafari Rouhani, B., Larabi, H., Akjouj, A., Gillet, J. N., Vasseur, J. O., and Thabet, G. (2009). Phonon transport and waveguiding in a phononic crystal made up of cylindrical dots on a thin homogeneous plate, Phys. Rev. B 80, 144302. 48. Pennec, Y., Djafari-Rouhani, B., Larabi, H., Vasseur, J. O., and Hladky-Hennion, A. C. (2008). Low-frequency gaps in a phononic crystal constituted of cylindrical dots deposited on a thin homogeneous plate, Phys. Rev. B 78, 104105.
46
T.-T. Wu, J.-C. Hsu and J.-H. Sun
49. Profunser, D. M., Muramoto, Matsuda, E., O., Wright, O. B., and Lang, U. (2009). Dynamic visualization of surface acoustic waves on a two-dimensional phononic crystal, Phys. Rev. B 80, 014301. 50. Russell, P. St. J., Marin, E., and Díez, A. (2003). Sonic band gaps in PCF preforms: enhancing the interaction of sound and light, Optics Express 11, 2555---2560. 51. Sainidou, R., Djafari-Rouhani, B., and Vasseur, J. O. (2008). Elastic properties of finite three-dimensional solid phononic-crystal slabs, Photonics and Nanostructures – Fundamentals and Applications 6, 122---126. 52. Sainidou, R., Stefanou, N., and Modinos, A. (2002). Formation of absolute frequency gaps in three-dimensional solid phononic crystals, Phys. Rev. B 66, 212301. 53. Sesion Jr., P. D., Albuquerque, E. L., Chesman, A. C., and Freire, V. N. (2007). Acoustic phonon transmission spectra in piezoelectric AlN/GaN Fibonacci phononic crystals, Eur. Phys. J. B: Conden. Matter Complex System 58, 379. 54. Sigalas, M., Kushwaha, M. S., Economou, E. N., Kafesaki, M., Psarobas, I. E. and Steurer, W. (2005). Classical vibrational modes in phononic lattices: theory and experiment, Z. Kristallogr. 220, 765---809. 55. Sigalas, M. M. and Economou, E. N. (1992). Elastic and acoustic wave band structure, J. Sound Vib. 158, 377---382. 56. Sigalas, M. M. and Economou, E. N. (1996). Attenuation of multiple-scattered sound, Europhys. Lett. 36, 241---246. 57. Sun, J.-H. and Wu, T.-T. (2007). Propagation of acoustic waves in phononic-crystal plates and waveguides using a finite-difference time-domain method, Phys. Rev. B 76, 104304. 58. Sun, J.-H. and Wu, T.-T. (2009). Propagation of bending waves in phononic crystal thin plates with a point defect, IEEE Trans. Ultrason. Ferroelectr. Freq. Contr. 56, 121---128. 59. Tanaka, Y. and Tamura, S. (1998). Surface acoustic waves in two-dimensional periodic elastic structures, Phys. Rev. B 58, 7958---7965. 60. Tanaka, Y. and Tamura, S. (1999) Acoustic stop bands of surface and bulk modes in two-dimensional phononic lattices consisting of aluminum and a polymer, Phys. Rev. B 60, 13294---13297. 61. Tanaka, Y., Tomoyasu, Y., and Tamura, S. (2000). Band structure of acoustic waves in phononic lattices: Two-dimensional composites with large acoustic mismatch, Phys. Rev. B 62, 7387---7392. 62. Torres, M., Montero de Espinosa, F. R., and Aragón, J. L. (2001). Ultrasonic wedges for elastic wave bending and splitting without requiring a full band gap, Phys. Rev. Lett. 86, 4282---4285. 63. Torres, M., Montero de Espinosa, F. R., Garcia-Pablos, D., and Garcia, N. (1999). Sonic band gaps in finite elastic media surface states and localization phenomena in linear and point defects, Phys. Rev. Lett. 82, 3504---3057.
Lamb Waves in Phononic Band Gap Structures
47
64. Vasseur, J. O., Deymier, P. A., Chenni, B., Djafari-Rouhani, B., Dobrzynski, L., and Prevost, D. (2001). Experimental and theoretical evidence of absolute acoustic band gaps in two-dimensional solid phnonic crystals, Phys. Rev. Lett. 86, 3012---3015. 65. Vasseur, J. O., Deymier, P. A., Djafari-Rouhani, B., Pennec, Y., and HladkyHennion, A-C. (2008). Absolute forbidden bands and waveguiding in twodimensional phononic crystal plates, Phys. Rev. B 77, 085415. 66. Vasseur, J. O., Deymier, P. A., Frantziskonis, G., Hong, G., Djafari-Rouhani, B., and Dobrzynski, L., Experimental evidence for the existence of absolute acoustic band gaps in two-dimensional periodic composite media, J. Phys.: Condens. Matter. 10, 6051---6064 (1998). 67. Vasseur, J. O., Djafari-Rouhani, B., Dobrzynski, L., Kushwaha, M. S., and Halevi, P. (1994). Complete acoustic band gaps in periodic fiber reinforced composite materials: the carbon/epoxy composite and some metallic systems, J. Phys.: Condens. Matter. 6, 8759---8770. 68. Vines, R. E., Wolfe, J. P., Every, A. G. (1999). Scanning phononic lattices with ultrasound, Phys. Rev. B 60, 11871---11874. 69. Wang, Y.-Z., Li, F.-M., Huang, W.-H., and Wang, Y.-S. (2008). The propagation and localization of Rayleigh waves in disordered piezoelectric phononic crystals, J. Mech. and Phys. of Solids 56, 1578---1590. 70. Wang, Y.-Z., Li, F.-M., Kishimoto, K., Wang, Y.-S., and Huang, W.-H. (2009). Elastic wave band gaps in magnetoelectroelastic phononic crystals, Wave Motion 46, 47---56. 71. Wilm, M., Khelif, A., Ballandras, S., and Laude, V. (2003). Out-of-plane propagation of elastic waves in two-dimensional phononic band-gap materials, Phys. Rev. E 67, 065602. 72. Wu, F., Liu, Z., and Liu, Y. (2002). Acoustic band gaps created by rotating square rods in a two-dimensional lattice, Phys. Rev. E 66, 046628. 73. Wu, F., Liu, Z., and Liu, Y. (2002). Acoustic band gaps in 2D liquid phononic crystals of rectangular structure, J. Phys. D: Appl. Phys. 35, 162---165. 74. Wu, T.-C., Wu, T.-T., and Hsu, J.-C. (2009). Waveguiding and frequency selection of Lamb waves in a plate with a periodic stubbed surface, Phys. Rev. B 79, 104306. 75. Wu, T.-T., Hsu, Z.-C., and Huang, Z.-G (2005). Band gaps and the electromechanical coupling coefficient of a surface acoustic wave in a twodimensional piezoelectric phononic crystal, Phys. Rev. B 71, 064303. 76. Wu, T.-T., Huang, Z.-G., and Lin, S. (2004). Surface and bulk acoustic waves in two-dimensional phononic crystals consisting of materials with general anisotropy, Phys. Rev. B 69, 094301. 77. Wu, T.-T., Huang, Z.-G., Tsai, T.-C., and Wu, T.-C. (2008). Evidence of complete band gap and resonances in a plate with periodic stubbed surface, Appl. Phys. Lett. 93, 111902.
48
T.-T. Wu, J.-C. Hsu and J.-H. Sun
78. Wu, T.-T., Wang, W.-S., Sun, J.-H., Hsu, J.-C., and Chen, Y.-Y. (2009). Utilization of phononic-crystal reflective gratings in a layered surface acoustic wave device, Appl. Phys. Lett. 94 101913. 79. Wu, T.-T., Wu, L.-C., and Huang, Z-G. (2005). Frequency band-gap measurement of two-dimensional air/silicon phononic crystals using layered slanted finger interdigital transducers, J. Appl. Phys. 97, 094916. 80. Yao, Z.-J., Yu, G.-L., Wang, Y.-S., and Shi, Z.-F. (2009). Propagation of bending waves in phononic crystal thin plates with a point defect, Int’l J. of Solids and Struct. 46, 2571---2576. 81. Zhang, X., Jackson, T., and Lafond, E. (2006) Evidence of surface acoustic wave band gaps in the phononic crystals created on thin plates, Appl. Phys. Lett. 88, 041911. 82. Zhao, H., Liu, Y., Wang, G., Wen, J., Yu, D., Han, X., and Wen, X. (2006). Resonance modes and gap formation in a two-dimensional solid phononic crystal, Phys. Rev. B 72, 012301.
CHAPTER 2 ON GENERALIZATION OF THE PHASE RELATIONS IN THE METHOD OF REVERBERATION-RAY MATRIX†
W. Q. Chen1, Y.-H. Pao2,‡, Y. Q. Guo3 and J. Q. Jiang4 1
Department of Engineering Mechanics, Zhejiang University Yuquan Campus, Hangzhou, Zhejiang 310027, China E-mail:
[email protected] (corresponding author)
2
College of Civil Engineering and Architecture, Zhejiang University Yuquan Campus, Hangzhou, Zhejiang 310027, China E-mail:
[email protected]
3
Key Laboratory of Mechanics on Disaster and Environment in Western China Ministry of Education, and School of Civil Engineering and Mechanics Lanzhou University Lanzhou, Gansu 730000, China E-mail:
[email protected] 4
Department of Civil Engineering, Zhejiang University Zijingang Campus, Hangzhou, Zhejiang 310058, China E-mail:
[email protected]
The method of reverberation-ray matrix (MRRM) has been developed and successfully applied to analyses of dynamic responses of framed structures and transient wave propagation in layered media. However, attention has been paid mainly to solving various practical problems, and there is a lack of study on the theoretical aspects of the method
† This work is supported by the National Project of Scientific and Technical Supporting Programs Funded by Ministry of Science & Technology of China (No. 2009BAG12A01A03-2) and the National Basic Research Program of China (No. 2009CB623204). It is also partly supported by the National Natural Science Foundation of China (Nos. 10972196 and 10725210). ‡ Professor Emeritus, National Taiwan University and Cornell University, USA.
49
50
W. Q. Chen et al.
itself. In this paper, we intend to explore the local phase relations in MRRM based on the theory of first-order linear differential equations with variable coefficients. By using the coordinate transform, a generalized form of the local phase relations is presented, which includes the effect of inhomogeneous terms appearing in the differential equations. These terms may correspond to distributive external loads in a mechanics problem. When the coefficients of the differential equations are constant, the local phase relations will take a simple form. Furthermore, to ensure numerical stability, the phase relations should be rearranged properly, and a modification of the scattering relations is also suggested. These aspects are illustrated by considering the wave propagation in an arbitrarily anisotropic elastic laminate.
1. Introduction In the recent decade, a new method, called the method of reverberationray matrix (abbreviated as MRRM hereafter), has been proposed by Professor Pao and his associates for the analysis of dynamic responses of framed structures2,12,21 and wave propagation in stratified media.22,24 Numerous applications1,23,26-29,34-37 show that the MRRM exhibits certain advantages in vibration and wave propagation analyses when compared to other methods such as the method of transfer matrix (MTM),18 the method of exact dynamic stiffness matrix (MEDSM),31 the finite element method (FEM)39, the spectral element method (SEM)4, the traveling wave approach (TWA),30 etc. Just like the MTM and MEDSM, the MRRM is also based on the exact solutions of the governing equations, and hence there is no need to divide a single uniform member in a structure (or a single layer in a stratified medium) further into elements of small size, as in the FEM. However, the MRRM is basically quite different from the MTM and MEDSM: (1) the unknowns in the MRRM are the wave amplitudes in the traveling wave solutions of the governing equations, instead of the state variables in the MTM or the displacements in the MEDSM; and (2) there is a unique characteristic in the MRRM that two local coordinate systems are employed simultaneously for a single element (a structural member in a framed structure or a layer in a stratified medium). For a comprehensive comparison between the
On Generalization of the Phase Relations
51
MRRM and MTM, the reader may be referred to the paper of Pao et al.20 Recently, Pao and Chen19 summarized the most important features of the MRRM for the dynamic analysis of three-dimensional framed structures. In the formulations of MRRM, two important kinds of relations are needed in the construction of the reverberation-ray matrix: The (local) phase relations represent the phase differences between different wave components in the two local coordinate systems persistent to each single element, and the (local) scattering relations reflect the compatibility and equilibrium conditions at each node of the system (an end of a member in a structure or a surface/interface in a stratified medium). The treatment of the scattering relations is trivial, but the use of two local coordinates leads to a much simpler form of the compatibility and equilibrium conditions at the node, which does not involve the length of the element since each node always coincides with the origin of a local coordinate system. Thus, the scattering relations in the MRRM should be free of the numerical instability phenomenon associated with the large product of element length and frequency as encountered in the conventional MTM.18 To the MRRM, the local phase relations seem to be theoretically more important and also interesting. Most available studies1-3,9,12,21-24,26-29,32-37 deal with concentrated external loads only, which act on the nodes of the system. Then the governing equations for any element are homogeneous, and only the complementary traveling wave solutions are used. In such cases, the phase relations are also homogeneous. In practice, however, we frequently encounter cases of distributive loadings. It is therefore necessary to extend the current formulations of MRRM so that the distributive loads can be taken into consideration. For a simple bar model, Jiang and Chen16 showed that it is possible to develop the formulations of MRRM when there are distributive or moving loads acting on the bar. In such a case, the phase relations become inhomogeneous. Recently, Jiang et al.15 further presented the procedure of deriving the inhomogeneous phase relations for a Timoshenko beam subject to distributive loads. A detailed discussion can also be found in Ref. [14].
52
W. Q. Chen et al.
In the available studies,2,12,21,22,24 the governing equations are usually assumed to have constant coefficients. If a non-uniform element is encountered, then we must divide it into many sub-elements so that in each sub-element the geometric and material properties can be regarded as constant.15,16,32 The constant coefficients in the governing differential equations enable us to use complementary traveling wave solutions in a simple form. Nevertheless, whether it is possible or not to establish the formulations of MRRM for non-uniform elements directly is always a concern of the related studies. On the other hand, a solid theoretical basis of the MTM has already been put forward using the concept of propagator by Gilbert and Backus,7 who demonstrated that, for any linear differential system the formulations of MTM can be established, at least theoretically. Inspired by their work, this paper intends to consider a general linear differential system as well, but is focused on the MRRM. The layout of the paper is as follows. Section 2 presents some basic formulations for first-order linear ordinary differential equations, whose coefficients may be functions of the independent variable. In particular, the solution of the inhomogeneous differential system is presented in two different forms, both will be employed for relevant analysis purposes. The property of the propagator7 is also utilized in our analysis. As mentioned above, the MRRM employs two local coordinate systems (the dual coordinate system) for each element. The relations of various vectors and matrices, including the propagator, in this dual coordinate system are discussed in Sec. 3 through coordinate transform. We then, in Sec. 4, derive the generalized phase relations for any linear differential equations with variable coefficients as well as inhomogeneous terms. Section 5 considers the particular case of homogeneous linear differential equations for wave propagation in arbitrarily anisotropic elastic laminated plates. Some particular features of the linear system with constant coefficients are emphasized. Phase relations are shown to take a very simple form. A proper rearrangement of the phase relations and a modification of the scattering relations are suggested in the consideration of numerical stability. The paper finally ends with a summary drawn in Sec. 6.
On Generalization of the Phase Relations
53
2. System of First-Order Differential Equations and the Solution Consider the following set of first-order ordinary differential equations: d f ( z) = C( z )f ( z ) + g ( z ) , ( z0 ≤ z ≤ z1 ) , dz
(1)
where f(z) is the vector to be determined, whose n elements are unknown functions of z, C( z ) is the coefficient matrix of order n × n , and g(z) is the inhomogeneous vector, which is known. For a mechanics problem, Eq. (1) is usually called the state equation of the physical system, and f(z), C( z ) and g(z) are the state vector, the system matrix, and source (input) vector, respectively. Since for most mechanics problems we usually encounter partial differential equations rather than ordinary ones, proper methods should be employed to transform the original set of governing equations to the form as given in Eq. (1). For example, for dynamic problems of framed structures, Fourier (or Laplace) transforms may be used to eliminate the time-dependence and Eq. (1) corresponds to that in the frequency domain19,20; and for wave propagation in infinite layered media, Fourier (or Hankel) transforms may be used so that Eq. (1) represents the corresponding equation in the joint wave numberfrequency domain.22,24 In the first example, the state vector, system matrix and source vector will also depend on the frequency, in addition to the independent spatial variable z, while in the second example, they all are further related to the wave numbers. The latter will be considered in great detail in Sec. 5. It is also noted that, for a general mechanics problem, n must be an even number, and f(z) shall consist of n/2 (generalized) displacement components and n/2 (generalized) force (stress) components. The complete solution to Eq. (1) can be expressed as6 z
f ( z ) = B( z , z0 )f ( z0 ) + ∫ B( z , ζ )g (ζ )d ζ , z0
(2)
where B( z, z0 ) = M ( z )M −1 ( z0 ) is the propagator,7 M ( z ) is any fundamental matrix, whose columns are n linearly independent complementary solutions of Eq. (1), and f ( z0 ) is the state vector at z0 , i.e. the initial state vector. We have the following property of the propagator (see Ref. [6])
54
W. Q. Chen et al.
B( z2 , z1 )B( z1 , z0 ) = B( z2 , z0 ) , B( z1 , z0 )B( z0 , z1 ) = I .
(3)
The first term on the right-hand side of Eq. (2) corresponds to the complementary solution, while the second is the particular solution of Eq. (1). The complementary solution, denoted by fc , can also be written as a1 a fc ( z ) = M ( z ) 2 ≡ M ( z ) A , ⋮ an
(4)
where ai are arbitrary constants to be determined, and A is the corresponding unknown amplitude vector. By comparing Eq. (4) with Eq. (2), we obtain A = M −1 ( z0 )f ( z0 ) or f ( z0 ) = M ( z0 ) A ,
(5)
which gives the relation between the amplitude vector A and the initial state vector f ( z0 ) . Making use of Eq. (5), the complete solution can be rewritten as z
f ( z ) = M ( z ) A + ∫ M ( z )M −1 (ζ )g (ζ ) d ζ , z0
(6)
which will be employed in Sec. 4 to derive the generalized phase relations for the general physical system that can be described by the linear differential equations as shown in Eq. (1). If the coefficient matrix C is a constant matrix, we have M ( z ) = exp(Cz ) = V exp( Λz )V −1 ,
(7)
where V = [V1 , V2 , ⋯ , Vn ] , with Vi being the eigenvector corresponding to the eigenvalue λi of C ,a and Λ = diag(λ1 , λ2 , ⋯, λn ) is an n-th order diagonal matrix. The complete solution to Eq. (1) then becomes z
f ( z ) = V exp[ Λ( z − z0 )]V −1f ( z0 ) + ∫ V exp[ Λ( z − ζ )]V −1g(ζ )d ζ z0
(8)
or z
f ( z ) = V exp( Λz )V −1 A + ∫ V exp[ Λ ( z − ζ )]V −1g (ζ )d ζ , z0
(9)
a For the sake of simplicity, it is assumed in this paper that all eigenvalues of the constant matrix C are distinct, as in cases of most dynamic problems.
On Generalization of the Phase Relations
55
where A = V exp(− Λz0 )V −1f ( z0 ) .
(10)
On the other hand, for a constant coefficient matrix C , the complementary solution to Eq. (1) may also be written as n
f c ( z ) = ∑ ai′Vi exp(λi z ) = V exp( Λz ) A′ ,
(11)
i =1
which has been employed to study the elastic surface waves in a general semi-finite anisotropic medium by Ingebrigtsen and Tonning.13 Here A′ is a new amplitude vector and ai′ are the corresponding elements. Clearly, we can derive the following relation between the two amplitude vectors A′ = V −1A = exp(− Λz0 )V −1f ( z0 ) .
(12)
The complementary solution in the form of Eq. (11) will be used in Sec. 5. There are still many other choices of the form of the complementary solution5 because of the flexibility in defining the amplitude vector through the initial state vector. 3. Dual Coordinate System and Coordinate Transform As a unique characteristic of the MRRM,12,19-22 we shall use two local (or dual) coordinate systems for a single element in the physical system. As mentioned earlier, a practical problem is usually governed by a set of partial differential equations, and hence transform methods may be employed to reduce the governing equations to ordinary differential equations. Let z be the only retained independent variable so that the transformed equations can be recast into the state equation in Eq. (1). Furthermore, assume that the z-direction coincides with the depth direction of a stratified medium or the axial direction of a beam, as depicted in Fig. 1. No matter which case is considered, we will use a one-dimensional element as shown in Fig. 2, to represent the problem described by Eq. (1). The two ends (nodes) of the element, denoted as I and J, correspond to the upper and lower surfaces of a layer in Fig. 1(a) or the left and right end sections of a beam in Fig. 1(b). This element will be referred to as element IJ, which has the length h IJ . Two local
56
W. Q. Chen et al.
coordinate systems are used with z IJ pointing from I to J, and z JI pointing from J to I. Thus z IJ and z JI are opposite in direction. In the following, the superscript IJ or JI will be affixed to all physical quantities to denote the corresponding coordinate system ( z IJ or z JI ) and the associated element, except those that are independent of the element and the coordinate systems. According to Eq. (2), the complete solution to Eq. (1) with respect to coordinate z IJ is, f IJ ( z IJ ) = B IJ ( z IJ ,0)f IJ (0) + ∫
z IJ 0
B IJ ( z IJ , ζ )g IJ (ζ )d ζ .
(13)
Note that the origin of z IJ has been taken to coincide with end I so that we have z0IJ = 0 . Similarly, in the coordinate z JI we have f JI ( z JI ) = B JI ( z JI ,0)f JI (0) + ∫
z JI 0
B JI ( z JI , ζ )g JI (ζ )d ζ .
(14)
Solutions (13) and (14) may be called dual solutions of the inhomogeneous differential system represented by Eq. (1).
z z (a)
(b)
Figure 1. The z coordinate of (a) a layer in a stratified medium, and (b) a beam in a framed structure.
I
zIJ
zJI
J
hIJ Figure 2. The dual coordinate system for a typical element in the system.
The unknown state vector f and the inhomogeneous vector g usually take different values in different coordinates. Note that, although the x and y-axes are not shown in Figs. 1 and 2, their directions also have an
On Generalization of the Phase Relations
57
influence on the values of these vectors and the associated transformation matrices. For a practical physical problem, since the state vector f contains both (generalized) displacements and (generalized) forces or stress components, as will be seen in Sec. 5, its transformation does not follow the rule for a vector in Euclidean space. We denote the transformation matrix between f JI and f IJ at a same point ( z JI = h IJ − z IJ ) as Tf , i.e. f JI (h IJ − z IJ ) = Tf f IJ ( z IJ ) .
(15)
Note that the transformation matrix only depends on the direction cosines between the corresponding coordinate axes of the two local coordinate systems. Similarly, we assume the transformation matrix between g JI and g IJ to be Tg , and that between C JI and CIJ to be TC , i.e. g JI (h IJ − z IJ ) = Tg g IJ ( z IJ ) , C JI (h IJ − z IJ ) = TC C IJ ( z IJ ) .
(16)
Then we can deduce the following relations directly from Eq. (1) CIJ ( z IJ ) = −Tf−1TC CIJ ( z IJ )Tf = −Tf−1C JI (h IJ − z IJ )Tf
(17a)
C JI (h IJ − z IJ ) = −Tf C IJ ( z IJ )Tf−1 ,
(17b)
Tf−1Tg = −I , or Tg = −Tf ,
(18)
or and where I is an identity matrix. The minus sign in the above equations results from the fact z IJ = h IJ − z JI for any point between I and J. Equations (17) and (18) show that the transforms for the state vector, the inhomogeneous vector as well as the system matrix are not independent of each other. The transform of f is the negative of that of g ; both are similar to, but not the same as that of, a first-rank tensor. The transform of C resembles that of a second-rank tensor, and the transformation matrix TC is a combined result of the left-multiplied transformation matrix −Tf and the right-multiplied matrix Tf−1 . Once a differential system is mathematically formulated, its propagator is determinate as well. Hence, we will pay attention to the behavior of the propagator matrix under the coordinate transform discussed above. To do so, we consider the homogeneous system with its
58
W. Q. Chen et al.
solution given by f c ( z ) = B( z, z0 )f ( z0 ) , which is rewritten here for the local dual coordinate system f IJ ( z IJ ) = B IJ ( z IJ ,0)f IJ (0) , f JI ( z JI ) = B JI ( z JI ,0)f JI (0) .
(19)
With the aid of Eq. (15), we can obtain the following relation B IJ ( z IJ , h IJ ) = Tf−1B JI (h IJ − z IJ ,0)Tf .
(20)
This relation of the propagator holds for the inhomogeneous system as well. In the derivation of Eq. (20), we have utilized the property of the propagator as shown in Eq. (3). Incidentally, the transform in Eq. (20) can also be obtained directly using the Magnus expansion.17 Moreover, with Eq. (20) we can easily deduce Eq. (13) from Eq. (14) and vice versa. 4. Generalized Phase Relations The transformation rules presented in the previous section can be used to derive the relation between two amplitude vectors A IJ and A JI , if the consistency in physical reality (or compatibility) of the dual solutions is noticed. To this end, according to Eq. (6), we rewrite the dual solutions in Eqs. (13) and (14) as:
{ {
f IJ ( z IJ ) = M IJ ( z IJ ) A IJ + ∫ f JI ( z JI ) = M JI ( z JI ) A JI + ∫
z IJ 0
z JI 0
} ζ }.
[M IJ (ζ )]−1 g IJ (ζ )d ζ ,
(21)
[M JI (ζ )]−1 g JI (ζ )d
(22)
In general, the fundamental matrix M ( z ) also varies with the coordinate system. It should be noted that, although all columns of the matrix M ( z ) satisfy Eq. (1), the transform of these columns between different coordinate systems does not follow the same rule as that of f ( z ) because M ( z ) are associated with arbitrary constants. Since B( z, z0 ) = M ( z )M −1 ( z0 ) , we deduce from Eq. (20) M IJ ( z IJ ) = Tf−1M JI (h IJ − z IJ )[M JI (0)]−1 Tf M IJ (h IJ ) ,
and in view of Eqs. (15)-(18), we obtain from Eq. (22)
(23)
59
On Generalization of the Phase Relations
{
f IJ ( z IJ ) = Tf−1M JI (h IJ − z IJ ) A JI + ∫
h IJ − z IJ 0
[M JI (ζ )]−1 g JI (ζ ) d ζ
}
= M IJ ( z IJ )[M IJ ( h IJ )]−1 Tf−1M JI (0) ×
{
× A JI − ∫
IJ
h −z 0
(24)
}
IJ
[M JI (ζ )]−1 Tf g IJ ( h IJ − ζ ) d ζ .
The above equation can be further transformed, by variable substitution, to f IJ ( z IJ ) = M IJ ( z IJ )[M IJ ( h IJ )]−1 Tf−1M JI (0) ×
{
× A JI + ∫ IJ
z IJ h IJ
[M JI ( h IJ − ζ )]−1 Tf g IJ (ζ ) d ζ
}
(25)
IJ
= M (z ) ×
{
× [M IJ (h IJ )]−1 Tf−1M JI (0) A JI + ∫
z IJ h IJ
}
[M IJ (ζ )]−1 g IJ (ζ ) d ζ .
Combining this result with Eq. (21) gives A IJ = [M IJ (h IJ )]−1 Tf−1M JI (0) A JI − ∫
h IJ 0
[M IJ (ζ )]−1 g IJ (ζ ) d ζ .
(26)
This equation establishes the connection between the two amplitude vectors of the dual solutions in the dual coordinate system. As we will show in Sec. 5, this relation just expresses the phase differences between various waves existing in element IJ for dynamic response (wave propagation) problems. We here refer to them as generalized (local) phase relations because Eq. (26) is suitable for all problems that can be represented mathematically by Eq. (1). As seen from Eq. (26), the generalized phase relations are quite different from those reported earlier.1-3,9,12,21-24,26-29,32-37 First, the current phase relations are derived in a mathematically more strict sense by adopting the concept of coordinate transforms, while those in the previous works are obtained based on a direct physical interpretation. Second, the current phase relations are also suitable for systems with variable coefficients, which have not been tackled before. Third, the phase relations given by Eq. (26) include inhomogeneous terms, which result directly from the inhomogeneous vector g in Eq. (1). The vector g evidently corresponds to the external stimuli of a given mechanics problem. Note that, in most studies on the MRRM,1-3,9,12,21-24,26-29,32-37 only concentrated external loads applied on
60
W. Q. Chen et al.
the nodes (joints) are considered. When a concentrated load is applied in the interior of an element, one may use subdivision to make the load act just on the additional node, and hence the analysis procedure outlined in Pao et al.2,12,21,22,24 remains valid. Only recently, Jiang and Chen16 and Jiang et al.15 demonstrated through simple physical models (bar or beam) that problems involving distributive loads can be readily dealt with within the framework of MRRM. Their analyses show that consideration of distributive loads will lead to inhomogeneous phase relations, as just shown here for a more general differential system. If the physical problem with n dimensions involves one element only as illustrated in Fig. 1, there will be n / 2 conditions at each end of the element, providing a total of n linear or nonlinear algebraic equations. Combining these conditions with Eq. (26), we are able to solve for the total 2n unknowns in A IJ and A JI . For problems involving many elements, compatibility (of displacements) and equilibrium conditions (of forces) at every node should be taken into consideration. These conditions give another set of relations between the amplitude vectors; these are called (local) scattering relations in dynamic response/wave propagation problems. The treatment of scattering relations is more trivial than the phase relations, as to be seen in the next section for the problem of wave propagation in a layered medium. If the boundary conditions are also linear, then the whole problem is linear. For linear problems, we can group all local phase relations for each element and all local scattering relations at each node to form the global phase relations and the global scattering relations for the whole system. Then, according to Pao et al.,2,12,21,22,24 we can construct the reverberation-ray matrix, which is particularly useful in the calculation of transient responses along with the Neumann series expansion. To avoid numerical instability associated with the classical MTM, the local phase relations given in Eq. (26) should be rearranged in a proper way in the construction of the reverberation-ray matrix.19,20 The scattering relations are also suggested to take a slightly different form from the available works2,12,21,22,24 to ensure unconditional numerical stability. These aspects will be discussed in the next section for the problem of wave propagation in anisotropic elastic laminates.8,10
61
On Generalization of the Phase Relations
5. Wave Propagation in Anisotropic Laminates To illustrate the analysis for any linear differential system with constant coefficients, we consider in this section the problem of wave propagation in an arbitrarily anisotropic laminated plate of N homogeneous elastic layers (Fig. 3) subject to no external loads. The bonds between two adjacent layers are assumed to be perfect. The z coordinate of the local dual coordinate system for the i-th layer, with the upper and lower surfaces denoted as I and J respectively, are also depicted in Fig. 3. All local coordinates (x, y, z) are right-handed, with their x-axes (not shown in the figure) identical to the X-axis of the global coordinate system (X, Y, Z) as shown in the panel on the left.
1 2
X Y zIJ
i
zJI zJK
I (i) i J (i+1)
N Z Figure 3. An N-layered plate and the local dual coordinate system for a typical layer. i(ωt − k x x − k y y )
Consider a plane harmonic wave with a common factor e in ω k k all physical quantities. Here is the circular frequency, and x and y are the wave numbers in the x and y-directions, respectively. There are several ways to establish the state Eq. (1) for such a problem of wave propagation in anisotropic elastic media. Here we follow the derivation put forward by Ingebrigtsen and Tonning13 but omit details. We choose u x , u y , u z , σ xz , σ yz , and σ zz as the primary variables, and their amplitudes which vary with the z coordinate only, constitute the state vector f ( z ) = [u x , u y , u z , σ xz , σ yz , σ zz ]T
62
W. Q. Chen et al.
in Eq. (1), where the overbar indicates the amplitude, the transformed counterpart in the joint wave number-frequency domain, −1 −1 i C33 L ⋮ C33 C = ⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯⋯ ⋯ ⋯⋯⋯ , (27) −1 −1 − ρω 2 I 3 + k x2C11 + k y2 C22 + k x k y (C12 + C21 ) − LT C33 L ⋮ i LT C33
in which L = k x C31 + k y C32 , and c1i1 j Cij = c2i1 j c3i1 j
c1i 2 j c2 i 2 j c3i 2 j
c1i 3 j c2 i 3 j . c3i 3 j
(28)
In Eqs. (27) and (28), ρ and cijkl are the mass density and elastic constants of the anisotropic material, respectively, both assumed to be constant within a single layer, and I n is an identity matrix of order n × n . Obviously, we have Cij = CTji because of the symmetry property of the elastic tensor. For the current problem, the transformation matrix in Eq. (15) for the state vector is 1 0 0 0 0 −1 0 0 0 0 −1 0 Tf = J 2 = 0 0 0 −1 0 0 0 0 0 0 0 0
0 0 0 0 0 0 . 0 0 1 0 0 1
(29)
where J 2 represents a diagonal matrix with components being ±1 only. We have J 2 = J 2−1 = J 2T . It is straightforward to verify that C JI = − J 2 C IJ J 2 .
(30)
as a result of the coordinate transform of elastic constants, which agrees with Eq. (17) which has been derived based on the set of linear differential equations, Eq. (1). Assume that λ IJ is an eigenvalue of CIJ , i.e. (C IJ − λ IJ I 6 ) VλIJ = 0 . IJ
(31)
with Vλ being the corresponding eigenvector. Multiplying Eq. (31) on the left by J 2 , gives
On Generalization of the Phase Relations
J 2 CIJ VλIJ − λ IJ J 2 VλIJ = J 2 CIJ J 2 J 2 VλIJ − λ IJ J 2 VλIJ = (J 2CIJ J 2 − λ IJ I 6 )J 2 VλIJ .
63
(32)
In view of Eq. (30), we immediately conclude that −λ IJ is the eigenvalue of C JI with the corresponding eigenvector J 2 VλIJ .b Thus, for monoclinic materials (materials have one plane of symmetry parallel to the x − y plane such that C JI = C IJ = C ), as a special case of this result, we know that if λ is an eigenvalue of C , −λ is also an eigenvalue of it. The generalized local phase relations in Eq. (26), which are suitable for a general differential system as in Eq. (1), are derived based on the dual solutions given in Eqs. (21) and (22). When the system has constant coefficients we prefer to use, in view of Eqs. (9) and (11), the following formulae f IJ ( z IJ ) = V IJ exp( Λ IJ z IJ ) A IJ + exp(CIJ z IJ ) ∫ f JI ( z JI ) = V JI exp( Λ JI z JI ) A JI + exp(CJI z JI )∫
z IJ 0 z JI 0
exp(−C IJ ζ )g IJ (ζ )d ζ , (33) exp(−CJI ζ )g JI (ζ )d ζ . (34)
These differ from Eqs. (21) and (22) only by the arbitrary constants involved, as noticed in Sec. 1. For the current problem, the integral terms in Eqs. (33) and (34) should vanish since there are no sources at all (i.e. g = 0 ). Using Eqs. (33) and (34), and noticing Eq. (20), we obtain the local phase relations as A IJ = exp(− Λ IJ h IJ )(V IJ ) −1 Tf−1V JI A JI ,
(35)
which can, of course, be derived from Eq. (26) by proper substitution. According to the above discussion on eigenvalues and eigenvectors, we can make the following arrangement of eigenvalues Λ IJ = diag(λ1IJ , λ2IJ , ⋯, λ6IJ ) , Λ JI = diag(λ1JI , λ2JI , ⋯, λ6JI ) = diag(−λ1IJ , − λ2IJ , ⋯, − λ6IJ ) ,
b
(36)
This conclusion is extendable to the general differential system, Eq. (1). The statement is then modified as, if λ IJ is an eigenvalue of CIJ ( z IJ ), with eigenvector VλIJ , then −λ IJ is an eigenvalue of C JI (h IJ − z IJ ), with eigenvector Tf VλIJ . In this case, the coordinate z IJ is seen as a parameter.
64
W. Q. Chen et al.
so that V JI = J 2 V IJ .
(37)
Equation (35) then can be significantly simplified to A IJ = exp(− Λ IJ h IJ ) A JI .
(38)
Equation (38) is actually valid for any linear differential system in Eq. (1) with constant coefficients. The local scattering relations are straightforward to derive; they are obtained directly from the compatibility and equilibrium conditions at the nodes. For the current problem, using the solutions given in Eq. (33) or (34), and noticing the continuity conditions at the interface, we can derive V JI A JI = J 2 V JK A JK , V IJ A JI = V JK A JK ,
(39)
where Eq. (37) has been utilized. The local scattering relations at the surfaces can also readily be obtained according to different boundary conditions imposed there,8,22,24 as will be shown later. As can be seen from Eq. (39), the element length does not enter the local scattering relations, which therefore are free from numerical instability as encountered by the conventional MTM.18 With the local phase relations for all layers and scattering relations at all interfaces/surfaces as shown in Eq. (38) and (39), respectively, we are able to construct the reverberation-ray matrix in the MRRM. However, this should be done in a proper way to avoid numerical instability and other problems. First, let us check the eigenvalues ( η ) of the matrix E = i C , which satisfy the following characteristic equation13 C33η 2 − (LT + L)η − ρω 2 I 3 + k x2C11 + k y2C22 + k x k y (C12 + C21 ) = 0 . (40)
Once η is determined from this equation, the corresponding eigenvalue of C , λ , is readily calculated as λ = − i η . The eigenvectors of C are thus identical to those of E . If ω , k x and k y are all real, Eq. (40) is a sextic algebraic equation about η with real coefficients. Thus, we have the following four possibilities for the six λ s: (a) all are complex ( λ j = a j + i b j , λ j +3 = −a j + i b j , j = 1, 2,3 , where a j > 0 and b j are real numbers); (b) all are pure imaginary ( λ j = i b j , j = 1, 2,⋯,6 , where b j are
On Generalization of the Phase Relations
65
real numbers); (c) four are complex and two are purely imaginary, in the forms as in (a) and (b), respectively; (d) two are complex and four are purely imaginary. The purely imaginary λ corresponds to a homogeneous wave, and the complex one to an inhomogeneous wave. According to Synge,25 the solutions may be divided into two different catalogues for each situation. One half of the complex solutions (corresponding to λ j = a j + i b j ) represent inhomogeneous waves which grow exponentially with z, while the other half decay exponentially. One half of the pure imaginary solutions (corresponding to λ j = i b j , with b j < 0 ) represent homogeneous waves with energy transport in the positive z-direction, while the others ( b j > 0 ) in the negative z-direction. Hence, let A −IJ and λ jIJ− ( j = 1,2,3) be, respectively, the amplitudes and eigenvalues of the exponentially growing waves or the arriving plane waves with energy transport in the negative z IJ -direction, and A +IJ and λ jIJ+ ( j = 1,2,3) those of the exponentially decaying waves or the departing plane waves with energy transport in the positive z IJ -direction. For a semi-infinite space, A +IJ should, therefore, correspond to the ‘physically acceptable solutions’, and A −IJ to the ‘physically unacceptable solutions’, as discussed by Ingebrigtsen and Tonning.13 For the current problem, both types of solutions are acceptable and should be retained in the analysis. The property λ jJI = −λ jIJ ( j = 1,2,⋯,6) as shown in Eq. (36) indicates that the solution in the coordinate z JI corresponding to the eigenvalue λ jJI and the one in the coordinate z IJ corresponding to the eigenvalue λ jIJ represent waves of the same mode but traveling in opposite directions. Thus, we can recast Eq. (38) into A −IJ exp( − Λ −IJ h IJ ) A +JI 0 = IJ , 0 exp( − Λ +IJ h IJ ) A −JI A +
(41)
Λ −IJ = diag(λ1IJ− , λ2IJ− , λ3IJ− ) , Λ +IJ = diag(λ1IJ+ , λ2IJ+ , λ3IJ+ ) .
(42)
where By rearrangement, we obtain IJ IJ 0 exp(− Λ −IJ h IJ ) A +IJ A − IJ A + JI = JI = P JI . IJ IJ 0 A − exp( Λ + h ) A + A +
(43)
66
W. Q. Chen et al.
where P IJ is the local phase matrix. Equation (43) expresses the right phase relations between arriving homogeneous waves and departing homogeneous waves in the layer IJ. If inhomogeneous waves exist, the corresponding elements in the phase matrix P IJ involve exponential functions whose indices have negative real parts only. Thus, the large product of element length and frequency encountered in the MTM, which causes numerical instability,18 is eliminated from the proper phase relations shown in Eq. (43). For monoclinic materials, since Λ +IJ = − Λ −IJ , we simply have IJ 0 exp(− Λ −IJ h IJ ) A +IJ A − JI = JI . IJ IJ 0 A − exp(− Λ − h ) A +
(44)
By grouping all local phase relations for each layer together, we get A12 A12 − + 12 21 21 P ⋯ 0 A+ A − ⋮ ⋮ ⋮ = ⋮ ⋱ A ( N −1) N 0 ⋯ P ( N −1) N A ( N −1) N + − N ( N −1) N ( N −1) A − A +
(45a)
a = Pd ,
(45b)
or which expresses the global phase relations, where a and d are the global arriving wave vector and the global departing wave vector, respectively, and P is the global phase matrix. Accordingly, the local scattering relations at the interface, Eq. (39), can be rewritten as V1IJ− IJ V2−
V1IJ+ A +JI V1JK − = V2IJ+ A −JI V2JK−
A −JK V1JK + , JK JK V2+ A +
(46)
which, in turn, gives V1IJ− IJ V2 −
A +JI −V1IJ+ −V1JK + = −V2JK+ A +JK −V2IJ+ S +J
or
A −JI V1JK − V2JK− A −JK S −J
(47a)
On Generalization of the Phase Relations JI JI A + J A− S +J JK = S − JK , A + A −
67
(47b)
where V1IJ− and V1IJ+ , etc. are the partitioned matrices given by V IJ V IJ = 1IJ− V2−
V1IJ+ . V2IJ+
(48)
For invertible matrix S +J Eq. (47b) may also be written as JI A +JI A − J −1 J J JK = (S + ) S − JK = S A + A −
A −JI JK , A −
(49)
where S J is the local scattering matrix.22,24 However, in some particular cases, the matrix inversion is impossible because S +J may be singular.8 Thus, for the free wave propagation problem, it is suggested to use Eq. (47) directly. Note that Eq. (47) is only valid for J = 2, 3,⋯ , N − 1 , i.e. at the interfaces between two adjacent layers. At the uppermost and lowermost surfaces, where J = 1 and J = N , if free surface conditions are assumed, we have 12 12 N ( N −1) V212+ A12 A +N ( N −1) = −V2N−( N −1) A −N ( N −1) + = − V2− A − , V2+
(50a)
1 12 N N ( N −1) S1+ A12 = S −N A −N ( N −1) . + = S− A− , S+ A+
(50b)
or Combing Eq. (47b) for J = 2, 3,⋯ , N − 1 with Eq. (50b) gives the global scattering relations S1+ 06× 3 ⋮ 06×3 0 3×3
S1− 03×6 ⋯ 03×6 03×3 A12 + 2 21 S + ⋮ 06× 6 06× 3 A + 06×3 ⋮ ⋱ ⋮ ⋮ ⋮ = ⋮ 06×6 ⋯ S +N −1 06×3 A (+N −1) N 06×3 03×6 ⋯ 03×6 S +N A +N ( N −1) 03×3
03×6 ⋯ 03×6 03×3 A12 − S −2 ⋮ 06× 6 06× 3 A 21 − ⋮ ⋱ ⋮ ⋮ ⋮ 06×6 ⋯ S −N −1 06×3 A (−N −1) N 03×6 ⋯ 03×6 S −N A −N ( N −1)
(51a)
or Sd d = Saa .
(51b)
The following equation then can be derived from Eqs. (45b) and (51b) as
68
W. Q. Chen et al.
(S d − S a P)d = 0 ,
(52)
from which we obtain the secular equation Sd − S a P = 0 .
(53)
From this relation the dispersion characteristics of the waves in an arbitrarily anisotropic elastic laminated plate can be determined. Obviously, the reverberation-ray matrix in the MRRM2,12,21,22,24 can be obtained as R = S −d1S a P , which contains no large product of element length and frequency as mentioned earlier. The secular equation then takes the form I − R = 0 as in Refs. [2,12,21,22,24]. However, the matrix inversion will constitute a severe numerical problem when S d is singular or nearly singular. On the other hand, the modified form as given by Eq. (53) is numerically unconditionally stable.8 For illustration, we display in Fig. 4 the dispersion curves of a twolayered cross-ply laminate with surfaces traction-free and interface perfectly bonded. The constituent 0° and 90° laminas are assumed to have equal thickness and the material properties are given in Table 1. Waves propagating in the X-direction are considered for illustration. Denote the frequency, wave number and phase velocity by ω , k and va respectively. The corresponding dimensionless quantities are defined as Ω = ω H / (2π vs ) , γ = kH / (2π ) and V = va / vs , where H is the total thickness of the laminate and vs = (c55 ρ )0° is the shear wave velocity with (c 55 )0° and ( ρ )0° the stiffness coefficient and material density of the 0° lamina. It is interesting to note from Fig. 4 that all Lamb-type propagated and attenuated wave modes as well as the asymptotic lines can be predicted based on Eq. (53). Numerically stable results at high frequencies can be found in Ref. [10]. Table 1. Material properties of the 0° or 90° lamina. Lamina 0° 90° Lamina 0° 90°
ρ 1200 1200 c23 0.0692 0.0644
c11 1.6073 0.1392 c33 0.1392 0.1392
c12 0.0644 0.0644 c44 0.035 0.0707
c13 0.0644 0.0692 c55 0.0707 0.035
Note: The unit of density is kg/m3 and that of stiffness is 1011 N/m2.
c22 0.1392 1.6073 c66 0.0707 0.0707
69
On Generalization of the Phase Relations
Dimensionless circular frequency Ω
2.0
1.6
1.2
0.8
0.4
0.0 -2.0
-1.5 -1.0 -0.5 0.0 Dimensionless imaginary wave number γI
0.5 1.0 1.5 2.0 Dimensionless real wave number γR
(a) Frequency vs. wave number
6.0
Dimensionless phase velocity va/vs
5.0
4.0
3.0
2.0
1.0
0.0 0.0
0.5
1.0 Dimensionless wavenumber γR
1.5
(b) Phase velocity vs. wave number
Figure 4. Dispersion spectra of a [0°/90°] laminated plate.
2.0
70
W. Q. Chen et al.
The derivation and numerical results presented in this section shows that, with the general method described in Sec. 4 to derive the local phase relations, we are able to deal with any linear differential system with constant coefficients within the framework of MRRM. In fact, the wave propagation analysis for arbitrarily anisotropic laminates can be as simple as that for isotropic24 or transversely isotropic29 materials. Investigation of wave propagation in piezoelectric laminates has also been conducted recently using the modified MRRM.11 6. Summary This paper presents a solid theoretical basis for the establishment of the phase relations for a general linear differential system based on the concept of coordinate transforms. The generalized phase relations are inhomogeneous when the differential equations are inhomogeneous. Hence, theoretically, we are able to deal with any distributive dynamic loading case within the framework of the method of reverberation-ray matrix. Recent efforts using simple bar and beam models have indicated such a feasibility, although for practical problems care should be taken of the particular solutions in order to gain an easy numerical scheme for the MRRM. For concentrated loads, previous studies usually assume that they are applied just on the nodes of a mechanics system. When they are applied in the interior of an element, then the load point can be taken as an additional node so that the conventional MRRM2,12,21,22,24 can be adopted as well. In this case, we will obtain homogeneous phase relations for the two elements connected to this additional node, but the local scattering relations at the node are inhomogeneous2,12,21,22,24. In fact, by eliminating the unknown amplitudes related to the two coordinate systems of which the origins coincide with the node, one may derive the relations between the amplitudes related to the other two coordinate systems, which are also pertinent to the two elements. These are just the generalized phase relations of the undivided element and are inhomogeneous! The detailed verification of the equivalence between the two formulations for a bar element subject to a concentrated load can be found in Jiang14. Interestingly, following the same idea of eliminating the
On Generalization of the Phase Relations
71
unknown amplitudes at the internal nodes, we can arrive at a numerically very efficient recursive formula for the MRRM.38 The phase relations can be simplified significantly for a linear differential system with constant coefficients. It is demonstrated in this paper by considering the problem of wave propagation in a general anisotropic elastic laminated plate. To ensure unconditional numerical stability, the phase relations as well as the scattering relations must be organized in a proper way. Specifically, any exponentially growing factor should be eliminated from the phase matrix, while the matrix inversion operation should be avoided in obtaining the scattering relations. For the linear system with constant coefficients, we have for simplicity confined ourselves to the case of distinct eigenvalues. This usually happens for most dynamic problems. But such a formulation excludes cases such as static deformations of structures. Therefore, it seems important to consider the cases when there are equal eigenvalues. Acknowledgments The junior authors (W.Q. Chen, Y.Q. Guo and J.Q. Jiang) are very grateful to Professor Pao for his advice and guidance. Numerous discussions with him have greatly helped them in understanding the method of reverberation-ray matrix as well as associated topics. Professor Pao once proposed to use “reverberator” instead of the reverberation-ray matrix that is currently employed. The property of the “reverberator” itself has not been discussed in this paper, and needs further investigation. A part of this paper is based upon unpublished notes of a lecture (co-authored by Y.H. Pao, W.Q. Chen, and J.Q. Jiang), delivered by Y.H. Pao at the 4th International Symposium on Environmental Vibration: Prediction, Monitoring and Evaluation (ISEV2009), held at Beijing on October 28-30, 2009. The lecture was invited by Professor H. Xia of Beijing Jiao Tong University, General Chairman of the Symposium, to whom the authors are indebted.
72
W. Q. Chen et al.
References 1. G. Q. Cai and G. H. Nie, Reverberation-ray analysis and program implementation on beam structure in space, Computer Aided Engineering 18, 12 (2009). (in Chinese) 2. J. F. Chen and Y.-H. Pao, Effects of causality and joint conditions on method of reverberation-ray matrix, AIAA Journal 41, 1138 (2003). 3. W. Q. Chen, H. M. Wang and R. H. Bao, On calculating dispersion curves of waves in functionally graded elastic plate, Composite Structures 81, 233 (2007). 4. J. F. Doyle, Wave Propagation in Structures (2nd ed.) (Springer, New York, 1997). 5. J. W. Dunkin, Computations of modal solutions in layered elastic media at high frequencies, Bulletin of the Seismological Society of America 55, 335 (1965). 6. F. R. Gantmacher, The Theory of Matrices (II) (Chelsea Publishing Company, New York, 1960). 7. F. Gilbert and G. E. Backus, Propagator matrices in elastic wave and vibration problems, Geophysics 31, 326 (1966). 8. Y. Q. Guo, The Method of Reverberation-Ray Matrix and its Applications, Ph.D. Thesis (Zhejiang University, Hangzhou, 2008). (in Chinese) 9. Y. Q. Guo and W. Q. Chen, Dynamic analysis of space structures with multiple tuned mass dampers, Engineering Structures 29, 3390 (2007). 10. Y. Q. Guo and W. Q. Chen, On free wave propagation in anisotropic layered media, Acta Mechanica Solida Sinica 21, 500 (2008). 11. Y. Q. Guo, W. Q. Chen and Y. L. Zhang, Guided wave propagation in multilayered piezoelectric structures, Science in China (Series G) 52, 1094 (2009). 12. S. M. Howard and Y.-H. Pao, Analysis and experiments on stress waves in planar trusses, ASCE Journal of Engineering Mechanics 124, 884 (1998). 13. K. A. Ingebrigtsen and A. Tonning, Elastic surface waves in crystal, Physical Reviews 184, 942 (1969). 14. J. Q. Jiang, Reverberation-Ray Analysis of Inhomogeneous Elastodynamic Equations and Nondestructive Inspection of Structures, Ph.D. Thesis (Zhejiang University, Hangzhou, 2008). (in Chinese) 15. J. Q. Jiang, Y. X. Bao and W. Q. Chen, Extension and application of reverberationray matrix method to dynamic responses of structures, Journal of Zhejiang University (Engineering Science) 43, 1063 (2009). (in Chinese) 16. J. Q. Jiang and W. Q. Chen, Reverberation-ray analysis of moving or distributive loads on a non-uniform elastic bar, Journal of Sound and Vibration 319, 320 (2009). 17. W. Magnus, On the exponential solution of differential equations for a linear operator, Communications on Pure and Applied Mathematics 7, 649 (1954). 18. E. C. Pestel and F. A. Leckie, Matrix Methods in Elasto Mechanics (McGraw-Hill, New York, 1963). 19. Y.-H. Pao and W. Q. Chen, Elastodynamic theory of framed structures and reverberation-ray matrix analysis, Acta Mechanica 204, 61 (2009).
On Generalization of the Phase Relations
73
20. Y.-H. Pao, W. Q. Chen and X. Y. Su, The reverberation-ray matrix and transfer matrix analyses of unidirectional wave motion, Wave Motion 44, 419 (2007). 21. Y.-H. Pao, D. C. Keh and S. M. Howard, Dynamic response and wave propagation in plane trusses and frames, AIAA Journal 37, 594 (1999). 22. Y.-H. Pao, X. Y. Su and J. Y. Tian, Reverberation matrix method for propagation of sound in a multilayered liquid, Journal of Sound and Vibration 230, 743 (2000). 23. Y.-H. Pao and G. J. Sun, Dynamic bending strains in planar trusses with pinned or rigid joints, ASCE Journal of Engineering Mechanics 129, 324 (2003). 24. X. Y. Su, J. Y. Tian and Y.-H. Pao, Application of the reverberation-ray matrix to the propagation of elastic waves in a layered solid, International Journal of Solids and Structures 39, 5447 (2002). 25. J. L. Synge, Flux of energy for elastic waves in anisotropic media, Proceedings of the Royal Irish Academy 58, 13 (1956). 26. J. Tian and Z. Xie, A hybrid method for transient wave propagation in a multilayered solid, Journal of Sound and Vibration 325, 161 (2009). 27. J. Y. Tian and X. Y. Su, The transient waves in frame structures, Explosion Shock Waves 21, 98 (2001). (in Chinese) 28. J. Y. Tian and X. Y. Su, The vibration suppression of joint mass-damping in frame structures, Acta Mechanica Solida Sinica 23, 47 (2002). (in Chinese) 29. J. Y. Tian, W. X. Yang and X. Y. Su, Transient elastic waves in a transversely isotropic laminate impacted by axisymmetric load, Journal of Sound and Vibration 289, 94 (2006). 30. A. H. von Flotow, Disturbance propagation in structural networks, Journal of Sound and Vibration 106, 433 (1986). 31. F. W. Williams and W. H. Wittrick, An automatic computational procedure for calculating natural frequencies of skeletal structures, International Journal of Mechanical Sciences 12, 781 (1970). 32. W. Yan, W. Q. Chen, J. B. Cai and C. W. Lim, Quantitative structural damage detection using high frequency piezoelectric signatures via the reverberation matrix method, International Journal for Numerical Methods in Engineering 71, 505 (2007). 33. W. Yan, C. W. Lim, W. Q. Chen and J. B. Cai, A coupled approach for damage detection of framed structures using piezoelectric signature, Journal of Sound and Vibration 307, 802 (2007). 34. Y. Y. Yu, Y. X. Bao and Y. M. Chen, Damage detection of frame structures based on the reverberation matrix method, China Civil Engineering Journal 38, 53 (2005). (in Chinese) 35. Y. Y. Yu, Y. X. Bao and Y. M. Chen, Discussion on quality testing of frame structure embedded partially in soil, Acta Mechanica Sinica 38, 339 (2006). (in Chinese) 36. Y. Y. Yu and Y. M. Chen, Flexural wave of finite pile in a non-uniform soil, Acta Mechanica Solida Sinica 26, 429 (2005). (in Chinese)
74
W. Q. Chen et al.
37. Y. Y. Yu, Y. H. Pao and Y. M. Chen, Wave analysis of damaged frame structures, Journal of Vibration Engineering 17, 20 (2004). (in Chinese) 38. J. Zhu, W. Q. Chen, G. R. Ye and C. F. Lü, Recursive formulations for the method of reverberation-ray matrix and the application, Science in. China (Series G) 52, 293 (2009). 39. O. C. Zienkiewicz, The finite element method: from intuition to generality, Applied Mechanics Reviews 23, 249 (1970).
CHAPTER 3 SURFACE-WAVE NONLINEARITY MEASURED WITH EMAT FOR FATIGUED STEELS
Hirotsugu Ogi* and Masahiko Hirao* *Graduate
School of Engineering Science, Osaka University Toyonaka, Osaka 560-8531, Japan E-mail:
[email protected]
A nonlinear acoustic measurement technique is studied for fatigue damage monitoring. An electromagnetic acoustic transducer (EMAT) magnetostrictively couples to a surface-shear wave resonance along the circumference of a rod specimen during rotating bending fatigue of carbon steels. Excitation of the EMAT at the half of the resonance frequency caused the standing wave containing only the secondharmonic component, which was received by the same EMAT to determine the second-harmonic amplitude. Thus measured surfacewave nonlinearity always showed two distinct peaks approximately at 60% and 85% of the total fatigue life. We attribute the earlier peak to the crack nucleation and growth, and the later peak to an increase of free dislocations associated with crack extension in the final stage of life. This non-contact resonance-EMAT measurement can monitor the evolution of the surface-shear-wave nonlinearity throughout the metal’s fatigue life and is able to detect the pertinent precursors of the eventual failure.
1. Introduction Acoustic nonlinearity holds the potential of becoming the primary means of characterizing fatigue in materials, because it is capable of probing the process of crack nucleation and growth,1,10,11,21 and dislocation movement.2,4-6 The sensitivity to microstructural attributes during the incubation period is often higher than that of the linear properties 75
76
H. Ogi and M. Hirao
(velocity and attenuation). Richardson21 analyzed the second-harmonic amplitude generated by the opening and closing effect of an interface subjected to a bias pressure with a passing acoustic wave. An optimum pressure exists for the second-harmonic generation. Buck et al.1 and Morris et al.10 demonstrated that Richardson’s model is basically applicable to fatigue cracks in metals; the second-harmonic amplitudes showed maxima with a low external compressive stress. Thus, small fatigue cracks act as generators of harmonics, and the efficiency should exhibit a maximum during crack growth because they are partly closed by compressive residual stress and large cracks are fully open, producing no higher harmonics. Concerning dislocations, Hikata et al.4 studied the harmonic generation caused by their anelastic vibration and showed that the second-harmonic amplitude increases with the increase of dislocation density and loop length. We recently revealed that a dislocation-structure change occurs in the later stages of rotating bending fatigue, during which much mobile dislocations temporally arise.13,17 These theoretical and experimental studies predict a nonlinearity peak responding to this event. We then expect two nonlinear peaks during a fatigue life; one associated with cracks and the other with dislocations. However, such a study has not been reported in the literature. We note two key points. First, surface waves should be used to focus on the specimen surfaces, where damage progresses. Second, a contactless acoustic transduction should be used to avoid background nonlinearity caused by coupling agents and the transducer itself. Among the previous nonlinear acoustic studies for the characterization of materials, few satisfied these requirements and none for detecting the fatigue damage. A notable exception is reported by Hurley8, who used a comb transducer for excitation and an interferometric laser detector to accurately measure Rayleigh wave harmonics in aluminum; it is a quasi-contactless technique. In the present study, we apply an electromagnetic acoustic transducer (EMAT) to monitoring the surface-shear-wave nonlinearity throughout the metal’s fatigue life. The use of an EMAT makes the
Surface-Wave Nonlinearity Measured with EMAT for Fatigued Steels
77
contactless transduction possible. However, EMATs lack a large enough transduction efficiency, which the nonlinear measurement needs. To overcome this limitation, we choose using acoustic resonance so as to excite and enhance the standing wave of the second-harmonic component around a rod specimen. Coherent superposition produces a highly magnified amplitude of the second harmonics. Thus measured surface-wave nonlinearity detected the two peaks in the fatigue life for the first time, one caused by microcracking and the other by dislocations. 2. Materials We used commercial steel rods containing 0.25 and 0.35 mass percent carbon. They were heated at 880 °C for one hour and cooled in air. Specimens were 600 mm long. The diameter was 14 mm at the center; it smoothly increased from 14 mm to 20 mm at the end portions with a large curvature to bring about damage and failure at the minimum diameter. We prepared the measuring surface by electropolishing. 3. Resonance-EMAT for Nonlinear Acoustics 3.1. Spectroscopy Measurement The EMAT consists of a solenoid coil to introduce the bias magnetic field H0 in the specimen’s axial direction and a meander-line coil, having equal meandering period of 0.9 mm, to induce the dynamic field Hω in the circumference direction (see Fig. 1). When a sinusoidal current is applied to the meander-line coil, the total field Ht oscillates about the axial direction at the same frequency as the driving current and produces the shearing vibration through the magnetostriction effect to excite the surface shear wave propagating along the circumference with the axial polarization.16 This is called the axial shear wave.9 After the excitation, the same meander-line coil receives the axial shear wave through the reversed magnetostrictive effect.
78
H. Ogi and M. Hirao
Fig. 1. Axial-shear-wave EMAT consisting of a solenoid coil and a meander-line coil surrounding the cylindrical surface. The magnetostriction mechanism causes the axially polarized surface SH wave in the cylindrical specimen.
Driving the meander-line coil with long tone bursts causes interference among the axial shear waves traveling around the cylindrical surface, and a frequency scan detects resonance peaks at unequal frequency intervals, at which all the waves overlap coherently to produce large amplitudes. We used the first resonance mode around fr=3.9 MHz, whose amplitude distribution has the maximum at the surface and a steep radial gradient; the penetration depth is estimated to be 0.5 mm.13 We defined the maximum amplitude of the first resonance peak as the fundamental amplitude, A1 (Fig. 2). We then excited the axial shear wave by driving the EMAT at the half of the resonance frequency ( fr /2), keeping the input power unchanged. In this case, the driving frequency does not satisfy the resonance condition and the fundamental component does not make a detectable signal. However, the second-harmonic component having the double frequency ( fr) satisfies the resonance condition, and the resonance spectrum of the received signal contains a peak at the original resonance frequency as shown in Fig. 2. We defined this peak height as the second-harmonic amplitude, A2, to calculate the nonlinearity A2/A1. The magnitude of A2/A1 varied in the order of 10-3. These measurements were made possible with the system for nonlinear acoustic phenomena (SNAP) manufactured by RITEC Inc., which provided the burst signals to the meander-line coil and detected the resonance peaks.
Surface-Wave Nonlinearity Measured with EMAT for Fatigued Steels
79
3.2. Second Power Law In metals without flaws, the higher harmonics arise from the nonlinear elasticity caused by lattice anharmonicity and the anelasticity due to dislocation movement. These two effects are inseparable in actual nonlinear measurements. Both generate the higher harmonics, among which the second harmonic usually dominates. The amplitude is then given by the square of the fundamental wave amplitude,4 that is, the second power law. The present definition of nonlinearity A2/A1 is based on the second power law and the proportionality between A1( fr) and the true fundamental amplitude, say A1’( fr/2). The second harmonic amplitude A2 is of course proportional to the square of the true fundamental amplitude A1’, that is, A2∝A1’2. This component lasts in a very short time after excitation and vanishes through mutual interference in the off-resonance condition. Within this short time, it generates the second harmonics following the second power law. But, we cannot measure A1’. Instead, we measured A1 and A2, both at fr, by changing the driving voltage. Figure 3 shows the result for 0.25 mass%C steel before and after applying 1.5% tensile plastic deformation. It demonstrates the linear relationship between A2 and A12, proving that A1∝A1’. (The larger slope after the deformation is caused by the dislocation multiplication. The slope is proportional to the dislocation density Λ times the fourth power of the loop length L; A2/A12 ∝ΛL4.5) Normalization of A2 in terms of A1 removes the influences of liftoff, frequency dependence of the transduction efficiency, and other anomalies. 3.3. Background Nonlinearity by Magnetostrictive Effect Magnetostrictively coupling EMAT entails background nonlinearity caused by the metal’s magnetostrictive response to the applied magnetic field. We have to estimate this effect. The preceding study15 derived a relationship between the bulk-shear-wave amplitude and the metal’s magnetostrictive characteristics by assuming isovolume magnetostriction and reversible magnetization rotation. Similar approach is possible in the present situation by assuming a flat surface. The resultant surface-shear-
80
H. Ogi and M. Hirao
0.01
A1 0.008 0.006
A2
0.5
0.004 0.002
0
3.88
3.9
3.92
Amplitude (a.u.)
Amplitude (a.u.)
1
0 3.94
Frequency (MHz) Fig. 2. Resonance spectra for the fundamental and the second-harmonic components.
4 as annealed 1.5% deformed
A2 (a.u.)
3
2
1
0 0
0.5
1
A12 (a.u.)
1.5
Fig. 3. Verification of the second power law before and after plastic elongation. The second-harmonic amplitude is proportional to the square of the fundamental amplitude.
10-2
2
10-3
0
10-4
-2
10-5 0
2 4 Static Field (104 A/m)
81
A2/A1
measurement approximation A2/A1
4
-6
Magnetostriction (10 )
Surface-Wave Nonlinearity Measured with EMAT for Fatigued Steels
6
Fig. 4. Measured magnetostriction (solid circle), the fitted function (solid line), and the calculated nonlinearity in transmission (broken line). A 0.25 mass%C steel was used after demagnetization.
wave amplitude, USH, is related to the magnetostriction εM and its field derivative (dε M dH ) as U SH ∝
jωt 3µ dε M 2 2 sin θ cos θe ε M cos 2θ cos θ + 3H 0 H0 dH
(1)
Here, µ denotes the shear modulus, θ=tan-1(Hω/H0) and Hω=hωexp( jωt) with the dynamic-field amplitude hω. When the static field H0 is much larger than the dynamic field Hω, i.e., │θ(t)│ << 1, Eq. (1) is approximated as U SH ∝
3µ H0
2 dε M hω jωt jωt 3 e e ε + M dH H 0
(2)
The amplitude ratio A2/A1 is then given by
A2 hω2 3 dε M = A1 ε M dH H = H 0 H 0
(3)
82
H. Ogi and M. Hirao
Figure 4 shows the magnetostriction measured by changing the applied field with 0.25 mass%C steel. We obtained the fitting function to the measurements; ε M ( H ) = 3H 0.15 exp(−0.457 H + 1.14) − 2.75
(4)
with εM in microstrain (10-6) and H in 104 A/m. Thompson22 derived the magnitude of the tangential dynamic-field amplitude hω induced by driving a meander-line coil, which, in the first approximation, reduces to hω =
2 I sin (πa/D ) 2πG exp − D πa/D D
(5)
where I denotes the current, D the meander-line period, a the width of the coil lines, and G the air gap between the coil and specimen surface. In this study, D=0.9 mm and a=0.01 mm. Substitution of Eqs. (4) and (5) into Eq. (3) results in the normalized second-harmonic amplitude A2/A1 in Fig. 4 for the typical values of I=0.5 A and G=0.3 mm. We see the infinite transduction nonlinearity at zero static field and also at H0~ 3.1×104 A/m, both of which result from zero magnetostriction. An ideal static field should provide large transduction efficiency and a minimum nonlinearity. We fixed the static field at H0=2.1×104 A/m throughout this study so that the background nonlinearity remains one order of magnitude smaller than the fatigue-induced nonlinearity, keeping a sufficient efficiency (the efficiency is nearly proportional to εM). It should be noted that the background nonlinearity continues to be unchanged during the fatigue test so far as the constant static field is applied. 4. Rotating Bending Fatigue 4.1. Measurements The measurement setup of the rotating bending fatigue test was the same as that developed in the previous study.13 The meander-line coil surrounded the specimen’s center part. We rotated the specimen at
Surface-Wave Nonlinearity Measured with EMAT for Fatigued Steels
83
240 rpm (4 Hz). Four-point bending configuration gave the maximum bending stress of 280 MPa (=0.84σy, σy: yield strength) for 0.25 mass%C steel, and 357 and 382 MPa for 0.35 mass%C steel (0.79σy and 0.85σy, respectively). The cycle to failure, NF, was on the order of 104. We measured the nonlinearity, attenuation, and phase velocity of the surface shear wave by interrupting the cyclic loading and releasing the bending stress. Details of the velocity and attenuation measurements have appeared in Ref. 15. Along with the acoustic measurements, we observed the surface crack nucleation and growth by replication.13 We then restarted the cyclic loading. This procedure was repeated until failure. 4.2. Results Figure 5 shows typical evolutions of the attenuation coefficient α, the phase velocity v, and the normalized nonlinearity A2/A1 for the first axialshear-wave resonance during the fatigue test. Individual measurements of A1 and A2 are also plotted there. Because the specimen diameter remains unchanged throughout, the resonance frequency change equals the velocity change (v-v0)/v0, where v0 denotes the velocity before fatiguing. The horizontal axis in Fig. 5 is the cycle number N normalized by the failure cycle number, NF. More than forty specimens were tested with various loading conditions. Common observations are as follows: I. Over the first half of the life, α is almost stable, but v monotonically decreases. Approaching 70% of the lifetime, α starts to decrease below the initial value. II. Then, α increases to a maximum and immediately drops. At the maximum, v pauses or slightly increases. III. The peak of α is very sharp. Its width is only a few percent of the life. (The peak sometimes appears twice.) The ratio N/NF appears to be around 0.85, being independent of the carbon content and the bending stress. This indicates a potential capability of evaluating the remaining life. IV. The nonlinearity shows two peaks; the first one appears around 60% and the second one around 85% of NF, being almost the same time as the attenuation peak. Again, their appearances are independent of the carbon content and the bending stress (see Fig. 6).
84
H. Ogi and M. Hirao 10
(v-v0)/v0 6
-2
4
α
-3
-2
-1
8
-1
α (10 µs )
(v-v0)/v0 (%)
(a)
2 -4 0
(b)
-3
A2/A1 (10 )
5 4 3 2 1 0
A1
(d) 300
density
4
200 2
100 length 0
0 0
0.2
0.4
0.6
0.8
1
Maximum crack length (mm)
A2
-1
Crack density (mm )
Amplitude (a.u.)
(c)
N/NF Fig. 5. Typical evolutions of (a) velocity and attenuation, (b) the nonlinearity, (c) the fundamental and second-harmonic amplitudes, and (d) surface cracks for 0.25mass%C steel (NF =56,000, σ =0.84σy). Arrows indicate the first and the second nonlinearity peaks.
Surface-Wave Nonlinearity Measured with EMAT for Fatigued Steels
85
10
4
Np (10 )
8
first peak second peak
6
4
2
0
2
4
6
4
8
10
NF (10 )
Fig. 6. Relationship between the failure cycle number NF and the cycle numbers at the first and the second nonlinearity peaks (NP) for 0.25 mass%C and 0.35 mass%C steels.
Figure 5(d) shows the result of the surface crack observation. The crack density was calculated from the total crack length divided by the viewing area with replicated films. Cracks were observable with an optical microscope as early as at about 30% of NF; the average length being about 50µm at this stage. The crack density increased linearly with N, while the maximum length remained unchanged until about 70%, which was followed by a rapid increase toward the failure. This observation indicates that the cyclic loading between N/NF =0.3 and 0.7 was spent mainly by the crack nucleation, not the crack growth, producing many small cracks of nearly uniform size. This appears reasonable from the metal’s work hardening and the bending stress gradient. After N/NF =0.7, these small cracks coalesce with each other to form longer cracks, resulting in growth inward in the final stage and then fracture. 5. Discussions Detailed investigation of the attenuation/nonlinearity peaks and observations of dislocation structure by transmission-electron micrographs (TEM) revealed that the attenuation peak responds to a temporal increase of free dislocations.7,13,18,19 According to the string
86
H. Ogi and M. Hirao
model of vibrating dislocations,3,20 α (∝ΛL4) increases and v (∝-ΛL2) decreases with an increase of the free dislocations that vibrate with the acoustic wave. The crack coalescence at the later stage produces high stress zones ahead of the crack tips and gives rise to rearrangement of the dislocation cell structure, release of pinned dislocations from obstacles, and dislocation multiplication. This involves the production of free dislocations, which raises α several times of the initial measurement. The attenuation coefficient, however, decreases soon because of the dislocation tangling and piling up again. This process takes place simultaneously at many sites within the thin surface layer of the specimen. The velocity change can be interpreted in the same way, although it is less sensitive to the microstructure. 5.1. First Nonlinearity Peak We attribute the first nonlinearity peak to the crack nucleation and growth. Small fatigue cracks are partly closed along the tips due to the compressive residual stress, which arises from local plastic deformation (plasticity-induced closure).12 The residual stress is large near the tip, which tightly closes the crack faces. It diminishes when approaching the crack mouth and there is a thin band where the crack faces are in contact at very small pressure.10 These thin bands open and close when the acoustic wave impinges on them, distort the waveform, and make a strong nonlinearity source. In the present case, there are many sites of them, since the crack shape is shallow semi-ellipsoidal with relatively uniform size. The nonlinearity should then increase as the number of small cracks increases. In Fig. 5, the crack density correlates with the nonlinearity prior to the first peak, where the crack length remains unchanged and the number of cracks monotonically increases. In the course of fatigue, the small cracks coalesce with each other and start to grow inward. This process reduces the area of weakly touching crack faces because of the asperity contact across the shallower faces, resulting in the drop of the nonlinearity. The crack length starts to increase after this nonlinearity peak in Fig. 5(d), being compatible with the above interpretation. Thus, the first nonlinearity peak indicates the beginning of the crack coalescence.
Surface-Wave Nonlinearity Measured with EMAT for Fatigued Steels
87
5.2. Second Nonlinearity Peak We interpret the second nonlinear peak to be caused by the above crackdislocation interaction because it synchronizes with the attenuation peak. Anelastic behavior of the free dislocations is the common source of attenuation and nonlinearity. A supplemental experiment of lowering the free dislocations confirms this viewpoint. We stopped the fatigue test right at the second nonlinearity peak (also attenuation peak), exposed the specimen to 300°C for 1hr, and observed a nonlinearity drop to that before the peak. The attenuation coefficient and the velocity also returned to the previous values by this heat treatment. We finally remark on the effect of attenuation in the nonlinearity A2/A1. Roughly speaking, α and A1 vary in the reversed ways in Fig. 5, implying that attenuation governs A1. However, A2 is independent of the attenuation variation and A2 dominates in the first nonlinearity peak. The amplitude A2 originates from A1’, which is considered to be constant throughout the fatigue tests since we used the equal input power to the EMAT. Indeed, no linkage is found between A1 and A2 until the attenuation peak. They then change in a similar way, indicating the common influence from dislocations. The normalization by A1 is still useful because the electromagnetic properties of metal such as conductivity may change during fatigue and affect the EMAT’s transduction efficiency. The most important observation in this study is that the nonlinearity peaks appear at the fixed fractions to NF in Fig. 6. The same is true for the indicative decline in the phase velocity v and the sharp peak(s) of the attenuation coefficient α These measurements detect the pertinent microstructural changes inside the metal, which closely correlates with the degradation process and leads to the final fracture. 6. Conclusion Combination of the magnetostrictive EMAT and the resonance method enables us to detect the second harmonic amplitude of the surface-shear wave without contact.
88
H. Ogi and M. Hirao
Unique measurement of acoustic nonlinearity obeys the second power law and shows the expected sensitivity to cracks and dislocations. The nonlinearity shows two distinct peaks during rotating bending fatigue of carbon steels, at 60% and 85% of the total life. These percentage lives are unchanged in the stress region of 0.79 to 0.85 of the yield strength. The resonance-EMAT method then holds promise for development into means to predict the remaining life of a fatigued steel. The first nonlinearity peak responses to the crack nucleation and then coalescence, which is supported by the surface-crack observation. The second nonlinearity peak synchronizes with the attenuation peak and is induced by the temporal increase of free dislocations. References 1. Otto Buck, W. L. Morris, and John M. Richardson, Acoustic harmonic generation at unbonded interfaces and fatigue cracks, Appl. Phys. Lett., 33, 371-373 (1978). 2. John H. Cantrell and William T. Yost, Acoustic Harmonic Generation from FatigueInduced Dislocation Dipoles, Phil. Mag. A, 69, 315-326 (1994). 3. Andrew V. Granato and Kurt Lücke, Theory of Mechanical Damping Due to Dislocations, J. Appl. Phys., 27, 583-593 (1956). 4. Akira Hikata, Bruce B. Chick, and Charles Elbaum, Dislocation Contribution to the Second Harmonic Generation of Ultrasonic Wave, J. Appl. Phys., 36, 229-236 (1965). 5. Akira Hikata and Charles Elbaum, Generation of Ultrasonic Second and Third Harmonics Due to Dislocations. I, Phys. Rev., 144, 469-477 (1966). 6. Akira Hikata, F. A. Sewell, Jr., and Charles Elbaum, Generation of Ultrasonic Second and Third Harmonics Due to Dislocations. II, Phys. Rev., 151, 442-449 (1966). 7. Masahiko Hirao and Hirotsugu Ogi, EMATs for Science and Industry Noncontacting Ultrasonic Measurements (Kluwer/Springer Publishers, Boston, 2003) , Chap.15. 8. Donna C. Hurley, Nonlinear propagation of narrow-band Rayleigh waves excited by a comb transducer, J. Acoust. Soc. Am., 106, 1782-1788 (1999). 9. Ward Johnson, Bert A. Auld, and George A. Alers, Application of Resonant Modes of Cylinders to Case Depth Measurement, Review of Progress in QNDE, Vol.13, eds. D. O. Thompson and D. E. Chimenti (Plenum Press, New York, 1994), 1603-1609. 10. W. L. Morris, Otto Buck, and R. V. Inman, Harmonic Generation due to Fatigue Damage in High-Strength Aluminum, J. Appl. Phys., 50, 6737-6741 (1979).
Surface-Wave Nonlinearity Measured with EMAT for Fatigued Steels
89
11. Peter B. Nagy, Fatigue Damage Assessment by Nonlinear Ultrasonic Materials Characterization, Ultrasonics, 36, 375-381 (1998). 12. James C. Newman, Jr. and Wolf Elber, Eds., Mechanics of Fatigue Crack Closure, (ASTM STP 982, Philadelphia, 1988). 13. Hirotsugu Ogi, Takayuki Hamaguchi, and Masahiko Hirao, Ultrasonic Attenuation Peak in Steel and Aluminum Alloy During Rotating Bending Fatigue, Metal. Mater. Trans., 31A, 1121-1128 (2000). 14. Hirotsugu Ogi, Field Dependence of Coupling Efficiency between Electromagnetic Field and Ultrasonic Bulk Waves, J. Appl. Phys., 82, 3940-3949 (1997). 15. Hirotsugu Ogi, Masahiko Hirao, and Takashi Honda, Ultrasonic attenuation and grain size evaluation using electromagnetic acoustic resonance, J. Acoust. Soc. Am., 98, 458-464 (1995). 16. Hirotsugu Ogi, Masahiko Hirao, and Kiyoshi Minoura, Generation of Axial Shear Acoustic Resonance by Magnetostrictively Coupled EMAT, Review of Progress in QNDE, Vol.15, eds. D. O. Thompson and D. E. Chimenti (Plenum Press, New York, 1996), 1939-1944. 17. Hirotsugu Ogi, Masahiko Hirao, and Kiyoshi Minoura, Noncontact Measurement of Ultrasonic Attenuation during Rotating Fatigue Test of Steel, J. Appl. Phys., 81, 3677-3684 (1997). 18. Hirotsugu Ogi, Masahiko Hirao, and Shinji Aoki, Noncontact Monitoring of Surface-Wave Nonlinearity for Predicting the Remaining Life of Fatigued Steels, J. Appl. Phys., 90, 438-442 (2001). 19. Hirotsugu Ogi, Yoshikiyo Minami, and Masahiko Hirao, Acoustic Study of Dislocation Rearrangement at Later Stages of Fatigue: Noncontact Prediction of Remaining Life, J. Appl. Phys., 91, 1849-1854 (2002). 20. Toshihiro Ohtani, Hirotsugu Ogi, and Masahiko Hirao, Noncontact Evaluation of Surface-Wave Nonlinearity for Creep Damage in Cr-Mo-V Steel, Jap. J. Appl. Phys., 48, 07GD02-1-6 (2009). 21. John M. Richardson, Harmonic Generation at an Unbonded Interface – I. Planar Interface between Semi-Infinite Elastic Media, Int. J. Engn. Sci., 17, 73-85 (1979). 22. R. Bruce Thompson, Physical Principles of Measurements with EMAT Transducers, Physical Acoustics, Ed. R. N. Thurston and A. D. Pierce (Academic Press, New York, 1990), Vol. 19, 157-200.
This page is intentionally left blank
CHAPTER 4 ACOUSTOELASTIC LAMB WAVES AND IMPLICATIONS FOR STRUCTURAL HEALTH MONITORING
Jennifer E. Michaels*, Navneet Gandhi and Sang Jun Lee School of Electrical and Computer Engineering Georgia Institute of Technology 777 Atlantic Drive, NW, Atlanta, Georgia, USA 30332-0250 *E-mail:
[email protected] Acoustoelasticity, or the change in elastic wave speeds with stress, is a well-studied phenomenon for bulk waves. The effect of stress on Lamb waves is not as well understood, although it is clear that anisotropic stresses will produce anisotropy in the Lamb wave dispersion curves. Here the theory of acoustoelastic Lamb wave propagation is considered, and it is shown that, as expected, dispersion curves change anisotropically for an aluminum plate under uniaxial tension. Theoretical predictions are compared to experimental results for the S0 Lamb wave mode at 250 kHz for a uniaxial load. Theoretical dispersion curves are then used to simulate Lamb waves propagating in a bounded plate subjected to various loading conditions. Signal changes are examined, and the implications for structural health monitoring are discussed.
1. Introduction Guided waves in plates, which are known as Lamb waves, are of interest for nondestructive evaluation (NDE) because of their ability to propagate relatively long distances while still maintaining sensitivity to damage24. However, using Lamb waves for NDE can be problematic because not only can multiple modes simultaneously exist, they are dispersive. The concomitant shape changes with propagation distance
91
92
J. E. Michaels, N. Gandhi and S. J. Lee
make signal interpretation challenging, and thus most ultrasonic NDE methods are based upon nondispersive bulk and surface waves. The situation for structural health monitoring (SHM) is much different because transducers are spatially fixed and it is not possible to scan them to cover a large spatial extent. By necessity, transducers are permanently mounted to the structure being monitored, and it is thus advantageous to use guided waves so that the number of transducers can be minimized21. A practical difficulty is that the transducers and propagating waves are subjected to the same environmental and operating conditions as the structure itself. Because of the difficulties in performing controlled experiments on large structures subjected to realistic conditions, progress has been slow in terms of understanding the effects of such conditions on measured guided wave signals. Homogeneous temperature changes were the first operational conditions to be considered, primarily because temperature variations are unavoidable even in a laboratory environment13. Many proposed SHM methods using guided waves require comparison of current signals to baselines recorded when the structure was known to be free of damage, and differences are presumed to be caused by damage. Many studies have now shown that even very small temperature changes (~1-2 °C) can compromise the ability of an SHM system to detect damage if a direct signal comparison is made3,13. A common technique to at least partially overcome this problem is to record multiple baseline signals over a range of temperatures, select the best baseline in the least squares sense, and adjust this best baseline by stretching or compressing it to best match the signal of interest2,4,13. This technique is effective because a homogeneous temperature change causes isotropic changes in dimensions and wave speeds, and the first order signal change, at least for a small temperature change, is a simple stretch without much of an effect on shape28. Another common environmental variation is surface wetting. One approach to surface wetting is to choose guided wave modes with minimal out-of-plane motion so that signals are relatively insensitive to fluid loading1. Another approach is to utilize differential features that are not sensitive to small surface perturbations but that are sensitive to damage (if such features can be found)14. A general approach to surface wetting has not been reported.
Acoustoelastic Lamb Waves and Implications for Structural Health Monitoring
93
Stresses caused by normal operation of the structure also have a significant effect on ultrasonic guided waves15,16. Similar to temperature, stresses cause changes in both dimensions and wave speeds, but unlike a homogeneous temperature change, a homogeneous stress field is in general anisotropic. Individual echoes simply shift in time as the stress is varied, but a complicated signal typical of a real structure will not exhibit the stretching or compressing caused by a temperature change. The subject of this current work is the effect of acoustoelasticity on Lamb wave propagation, where the term acoustoelasticity refers to the change of elastic wave speeds with stress. Unlike bulk shear and longitudinal waves, which demonstrate a simple shift in velocity with stress, Lamb waves demonstrate a mode and frequency dependence on stress, which is described by a change in dispersion curves. These changes in dispersion can then be used to simulate what would happen to a complicated signal comprising many individual echoes. 2. Background The theory of acoustoelasticity was first developed in 1953 by Hughes and Kelly8, who applied the Murnaghan theory of finite deformation to propagation of elastic waves in a predeformed but initially isotropic solid. They give expressions for shear and longitudinal wave speed in terms of material properties and applied stress for both a uniaxial compressive stress and hydrostatic pressure. Toupin and Bernstein27 extended the results of Hughes and Kelley to materials of arbitrary symmetry. These early investigations were partially motivated by using acoustoelasticity measurements to determine third order elastic constants, whereas later investigations were largely motivated by the possibility of measuring both applied and residual stresses11,26. The article by Pao et al.18 provides a comprehensive treatment of acoustoelasticity, particularly its application to stress measurements. Acoustoelastic Rayleigh waves have also been investigated5,10, but much less work has been done regarding acoustoelastic Lamb waves. Husson9 presented a theoretical treatment of acoustoelasticity for Lamb waves and predicted that, to first order, they will be sensitive only to the symmetric part of the stress field and thus will not be affected by a
94
J. E. Michaels, N. Gandhi and S. J. Lee
bending stress. He also predicted a strong frequency dependence of the acoustoelastic constants. Qu and Liu20 generated dispersion curves for waves propagating in a stressed aluminum plate, but they did not consider the directional dependence or wave speed changes relative to the unloaded state. Rizzo and Lanza di Scalea22 experimentally considered acoustoelastic effects for guided waves propagating in bars, and acoustoelastic constants show a strong frequency dependence, even changing in sign. Lematre et al.12 developed theory for Lamb wave propagation in stressed piezoelectric plate structures. They show numerical results for propagation along the direction of uniaxial loading with and without taking piezoelectricity into account, but do not present experimental results. Recent measurements by Michaels et al.15 using sensors bonded on a wing panel for a structural health monitoring experiment clearly show changes in wave speed as a function of direction of propagation, but no theory is presented. Here we consider theory, experiments, and numerical simulations to illustrate the potential impact of Lamb wave acoustoelasticity on structural health monitoring. 3. Theory The theory for acoustoelastic Lamb waves is based upon the combination of acoustoelastic bulk waves and propagation of Lamb waves in anisotropic plates. Details of the theory are given by Gandhi6 and Gandhi et al.7, and an overview is presented here. 3.1. Acoustoelastic Bulk Waves The theory of acoustoelasticity is reviewed here following the development of Pao and Gamer19. As shown in Figure 1, consider the deformation of a body from its unstressed, or natural, state, to a statically deformed, or initial, state. Wave motion superposed on the initial state is the final state. Coordinates of a material point in the natural, initial and final states are given by the position vectors ξ, X and x, respectively. The deformations between the various states are given by Eq. (1), where ui is the deformation from the natural state to the initial state, uf is from the natural to final states, and u is from the initial to final states.
Acoustoelastic Lamb Waves and Implications for Structural Health Monitoring
95
u x
Final state ((wave motion) ti )
X ui Initial state (stressed)
Natural state (unstressed) Fig. 1. Coordinates of a material point as it moves from the natural state (ξ) to the initial state (X) to the final state (x).
u i (ξ ) = X − ξ, u f (ξ, t ) = x − ξ, f
(1) i
u(ξ, t ) = x − X = u (ξ, t ) − u (ξ ). Understanding wave propagation in a stressed medium requires obtaining the equation of motion for u, the incremental deformation between the initial and final states. Lagrangian strain tensors in the initial and final states are, i ∂u i ∂u i 1 ∂u i ∂u i Eαβ = α + β + λ λ 2 ∂ξ β ∂ξα ∂ξα ∂ξ β
,
f 1 ∂u f ∂uβ ∂uλf ∂uλf = α + + 2 ∂ξ β ∂ξα ∂ξα ∂ξ β
.
f Eαβ
(2)
96
J. E. Michaels, N. Gandhi and S. J. Lee
Following the convention used by Pao and Gamer19, Greek subscripts in this and subsequent equations indicate that the quantities are expressed in terms of the natural, or unstressed coordinates. Also, summation over repeated indices is implied. If the wave motion is small compared to the static predeformation, the incremental strain tensor, which is the difference between Ei and Ef, is given approximately by,
∂uβ ∂uλi ∂uλ ∂uλi ∂uλ 1 ∂u f i − Eαβ = α + + + Eαβ = Eαβ . 2 ∂ξ β ∂ξα ∂ξα ∂ξ β ∂ξ β ∂ξα
(3)
A constitutive equation relating the Lagrangian strain tensor to the second Piola-Kirchoff stress tensor can be derived by assuming that the material is hyperelastic. Only the second and third order elastic constants are retained, which results in, i Tαβi = Cαβγδ Eγδi + 12 Cαβγδεη Eγδi Eεη ,
(4)
f Tαβf = Cαβγδ Eγδf + 12 Cαβγδεη Eγδf Eεη .
A constitutive equation relating incremental stresses and strains is obtained by subtracting the above equations and discarding higher order terms, Tαβ = Cαβγδ Eγδ + Cαβγδεη eγδi eεη ,
(5)
where the infinitesimal initial and incremental strain tensors are, i 1 ∂u ∂u i eγδi = γ + δ 2 ∂ξδ ∂ξγ
∂uη 1 ∂u , eεη = ε + 2 ∂ξη ∂ξε
.
(6)
Acoustoelastic Lamb Waves and Implications for Structural Health Monitoring
97
The equation of motion for the incremental displacement is obtained by subtracting the equation of equilibrium for the static predeformation from the equation of motion for the final state and neglecting one higher order term. Prior to subtraction, these equations are, ∂ ∂ξ β
i i ∂ i ∂uα Tβα + Tβγ = 0, ∂ξγ ∂ξ β
f 2 f f f ∂uα 0 ∂ uα , Tβα + Tβγ =ρ ∂ξγ ∂t 2
(7)
where ρ0 is the density in the natural state. After subtraction and simplification, the resulting equation of motion for the incremental displacement is, ∂ ∂u i ∂ 2u i ∂uα + Tβγ α = ρ 0 2α . Tαβ + Tβγ ∂ξ β ∂ξγ ∂ξγ ∂t
(8)
If it is assumed that both the material and the static predeformation are homogeneous, then substitution of Eq. (5), the incremental constitutive equation, into Eq. (8) results in,
Aαβγδ
∂ 2uγ
∂ξ β ∂ξδ
= ρ0
∂ 2 uα , ∂t 2
(9)
where
i Aαβγδ = Cαβγδ + Cβδεη eεη δαγ + Cαβλδ
∂uγi ∂ξ λ
+ Cλβγδ
∂uαi i + Cαβγδεη eεη . ∂ξλ
(10)
Equation (9) is the acoustoelastic equation of motion for wave propagation in a predeformed solid with the tensor A given by Eq. (10). It should be noted that A does not have the same symmetries as the stiffness tensor C, with the only symmetry being Aijkl = Aklij; in general, Aijkl ≠ Ajikl ≠ Aijlk.
98
J. E. Michaels, N. Gandhi and S. J. Lee
x 2' x2
x1 is the direction of Lamb wave propagation x1' and x2' are the principal stress directions
x1
x 1'
x3
d
x1
Fig. 2. Geometry for propagation of Lamb waves in a stressed plate.
3.2. Acoustoelastic Lamb Waves Propagation of Lamb waves in a prestressed plate requires combining the equation of motion and constitutive equation developed for bulk waves in the previous section with the theory for Lamb wave propagation in an anisotropic plate. The approach taken here follows closely that of Nayfeh and Chimenti17 for anisotropic Lamb waves. Figure 2 illustrates the geometry considered here, which consists of a Lamb wave propagating at an arbitrary angle φ to the principal stress directions. Referring to the figure, the initial stresses are specified in the primed coordinate system, and the Lamb wave is propagating along the x1 direction. The initial stress tensor is,
σ 11 0 T ' = 0 σ 22 0 0
0 0 . 0
(11)
Acoustoelastic Lamb Waves and Implications for Structural Health Monitoring
99
It can be expressed in the unprimed system via a rotational transformation, i ′ Tiji = β im β jnTmn ,
(12)
where βij is the cosine of the angle between the xi and the xj' axes. In this and subsequent equations, lower case Roman subscripts are used instead of Greek subscripts for convenience. It must be kept in mind that all quantities are expressed in terms of the natural coordinates, which are expressed as xj instead of ξj. If the assumption is made that the initial strains are small, the constitutive equation relating initial stresses and strains given in Eq. (4) can be simplified to, Tiji = Cijkl ekli .
(13)
The strains are calculated by inverting this equation. The strains in the unprimed system can be computed by either direct computation from the stresses in the unprimed system, or by a tensor rotation of the strains in the primed system. From this point forward, all quantities are expressed in the unprimed system. The A tensor as given by Eq. (10) is in an inconvenient form because there are both initial strains and derivatives of initial displacements. The displacement derivatives can be expressed in terms of strains by noting that for the stress as given in Eq. (11), the rotation terms are zero for all angles φ. Thus, ∂u ij ∂xk
= e ijk ,
(14)
100
J. E. Michaels, N. Gandhi and S. J. Lee
and i i i i Aijkl = Cijkl + C jlmn emn δ ik + Cijml ekm + Cmjkl eim + Cijklmn emn .
(15)
The incremental stress-strain relation of Eq. (5) can similarly be simplified to Tij = Bijkl
∂uk i i . , Bijkl = Cijkl + Cijml ekm + Cijklmn emn ∂xl
(16)
Acoustoelastic Lamb wave propagation for a homogeneous and biaxial stress field requires solving the wave equation for the incremental displacements as given by Eq. (9), with the tensor A given by Eq. (15), subject to stress-free boundary conditions at x3 = ± d / 2 with the stresses given as per Eq. (16). This problem differs from Lamb wave propagation in anisotropic media in two regards: (1) as previously noted, the tensor A does not have the same symmetries as the stiffness tensor C, and (2) the boundary conditions at the plate surfaces are different. The approach to solving this problem is to assume solutions of the form, u j = U j e iξ ( x1 +α x3 − ct ) ,
(17)
where ξ is the wavenumber in the x1 direction, α is the ratio of x3 to x1 wavenumbers, and c is the phase velocity along the x1 axis. For a specific value of c, these solutions correspond to upgoing and downgoing bulk waves in the x1-x3 plane of the plate, which are then summed together to form the Lamb wave. This approach is sometimes referred to as the partial wave method23. Substitution of Eq. (17) into Eq. (9) yields a form of the Christoffel equations, K mn (α )U n = 0 ,
(18)
Acoustoelastic Lamb Waves and Implications for Structural Health Monitoring 101
where the parameters Kmn are given by,
K11 = c 2 ρ 0 − A1111 − α 2 A1313 , K 22 = c 2 ρ 0 − A1212 − α 2 A2323 , K33 = c 2 ρ 0 − A1313 − α 2 A3333 ,
(19)
K12 = K 21 = − A1112 − α 2 A1323 , K13 = K31 = −α ( A1133 + A1331 ), K 23 = K 32 = −α ( A1233 + A1332 ).
To obtain non-trivial solutions for the displacement amplitudes Un, the determinant of the K matrix must go to zero, which produces a 6th order equation in α. The coefficients of the odd powers of α are all zero, resulting in a cubic equation in α2 (details are given in the paper by Gandhi et al.7). For a specific value of the Lamb wave phase velocity c, solving this cubic equation yields six values of α, which correspond to three upgoing bulk waves and three downgoing bulk waves. The next step is to satisfy the stress-free boundary conditions on the plate surfaces as per Eq. (16). The approach taken here is similar to that of Nayfeh and Chimenti17. The strategy is to define displacement ratios of U2 and U3 to U1 for each of the six partial wave solutions, Vq =
U 2q U1q
, Wq =
U 3q U 1q
, q = 1,2… 6 .
(20)
Equation (18) enables each ratio to be expressed as a function of the Kmn and the corresponding αq. The total displacement field of the Lamb wave is constructed by summing the six partial waves, 6
{u1 , u2 , u3 } = ∑ {1,Vq ,Wq }U1q e
iξ ( x1 +α q x3 − ct )
.
(21)
q =1
A similar expression can be derived for the stresses by substituting Eq. (21) into Eq. (16). The three stress components needed to satisfy the boundary conditions are given as
102
J. E. Michaels, N. Gandhi and S. J. Lee 6
{T33 , T13 , T23} = ∑ iξ {D1q , D2 q , D3q }U1q e
iξ ( x1 +α q x3 − ct )
,
(22)
q =1
where the parameters Dmq are, D1q = B3311 + B3312Vq + α q B3333Wq ,
D2 q = α q ( B1313 + B1323Vq ) + B1331Wq ,
(23)
D3q = α q ( B1323 + B2323Vq ) + B1332Wq . Application of the boundary conditions; i.e., setting T13, T23 and T33 to zero at x3 = ± d / 2, produces six equations in terms of the six displacement amplitudes, U1q, of the six partial waves, q = 1,2…6. The determinant of coefficients, which is given below, must go to zero to obtain nontrivial solutions for these six displacement amplitudes, D11 E1 D21 E1 D31 E1 D Eɶ 1
D12 E2 D22 E2 D32 E2 D Eɶ 2
D13 E3 D23 E3 D33 E3 D Eɶ 3
D14 E4 D24 E4 D34 E4 D Eɶ 4
D15 E5 D25 E5 D35 E5 D Eɶ
D16 E6 D26 E6 D36 E6 = 0, D Eɶ
D21 Eɶ1 D31 Eɶ1
D22 Eɶ 2 D32 Eɶ 2
D23 Eɶ 3 D33 Eɶ 3
D24 Eɶ 4 D34 Eɶ 4
D25 Eɶ 5 D35 Eɶ 5
D26 Eɶ 6 D36 Eɶ 6
11
12
iξα d /2
13
14
15
5
16
(24)
6
− iξα d /2
where Eq = e q and Eɶ q = e q . It can be shown that after a number of row and column operations, Eq. (24) decouples into the following two equations, where the first one is for the symmetric modes and the second is for the antisymmetric modes, f s (ω , c) = D11G1 cot(γα1 ) + D13G3 cot(γα 3 ) + D15G5 cot(γα 5 ) = 0,
(25)
f a (ω , c) = D11G1 tan(γα1 ) + D13G3 tan(γα 3 ) + D15G5 tan(γα 5 ) = 0.
(26)
Acoustoelastic Lamb Waves and Implications for Structural Health Monitoring 103
In these equations γ = ξd / 2 = ωd / (2c), where ω is the angular frequency. The parameters Gm are given as
G1 = D23 D35 − D33 D25 , G3 = D31D25 − D21D35 , G5 = D21 D33 − D31D23 . (27) Solving Eqs. (25) and (26) yields the dispersion curves relating phase velocity and angular frequency for the symmetric and antisymmetric Lamb wave modes, respectively. It should be noted that solving these equations is not trivial. Details regarding the solution method are given by Gandhi6.
4. Numerical Results Numerical results are presented for Lamb waves propagating in an aluminum plate of thickness 6.35 mm that has isotropic crystal symmetry. Its material constants are listed in Table 1, where the third order elastic constants were determined by Stobbe25. All results shown are for the S0 Lamb wave mode to match experimental results and numerical simulations.
Table 1. Density (ρ0), Lamé second order elastic constants (λ and µ), and Murnaghan third order elastic constants (l, m and n) for aluminum that were used to generate numerical results. Parameter 0
Value
ρ
2800 kg/m3
λ
54.9 GPa
µ
26.5 GPa
l
-252.2 GPa
m
-324.9 GPa
n
-351.2 GPa
104
J. E. Michaels, N. Gandhi and S. J. Lee
5500 5060 5050
Phase Velocity (m/s)
5000
5040 5030
4500
5020 5010
4000 3500 3000 2500 0
5000
0 MPa 20 MPa 40 MPa 60 MPa 80 MPa 100 MPa 120 MPa
200
4990 240
245
400 600 Frequency (kHz)
250
255
800
260
1000
Fig. 3. Dispersion curves for the Lamb wave S0 mode (fundamental symmetric mode) propagating at an angle of 45° with respect to the direction of an applied uniaxial load.
4.1. Stress Dependence at a Fixed Propagation Angle The most obvious acoustoelastic effect for all wave types is the change of wave speed as a function of stress. Figure 3 shows a set of dispersion curves for the S0 mode propagating at an angle of 45° to the direction of an applied uniaxial load. The tensile load ranges from 0 to 120 MPa, and it can be seen that the changes are very small. The zoomed window is centered at a frequency of 250 kHz, and here the small change in phase velocity with load is clearly shown. Figure 4 shows the change of phase velocity at 250 kHz as a function of load. The relation is linear and, as is customary for bulk waves, an acoustoelastic constant can be defined to relate the change in phase velocity to the applied load,
Acoustoelastic Lamb Waves and Implications for Structural Health Monitoring 105
Change in Phase Velocity (m/s)
0
Theory Linear Fit −5
−10
−15
−20 0
20
40
60 Load (MPa)
80
100
120
Fig. 4. Change in phase velocity versus load for the S0 mode at 250 kHz propagating at an angle of 45° with respect to the direction of an applied uniaxial load.
∆c = K S0 (ϕ , ω )σ 22 . c
(28)
The superscript “S0” indicates that the constant is mode-dependent, and it is also a function of propagation angle and frequency. For the data shown in Figure 4, KS0 = −2.99×10-11 Pa−1. 4.2. Angle Dependence at a Fixed Stress
The next most noteworthy observation is the dependence of phase velocity on the direction of propagation for a fixed stress, here taken to be a uniaxial load of 46 MPa applied in the x2 direction (σ22 = 46 MPa). Figure 5 shows a set of dispersion curves for the S0 mode propagating at angles ranging from 0 to 90°, where 90° is along the direction of the applied load and 0° is the transverse direction. The zero load curve is also shown for comparison. As before, it can be seen that the changes are
106
J. E. Michaels, N. Gandhi and S. J. Lee
5500
5080 5060
5000
Phase Velocity (m/s)
5040
4500
5020
No Load
4000
0° 15° °
3500 3000
5000 4980 240
245
250
255
260
30
45° 60° 75° 90°
2500 0
200
400 600 Frequency (kHz)
800
1000
Fig. 5. Dispersion curves for the S0 mode as a function of angle of propagation for an applied uniaxial load of 46 MPa, where the load is applied along the 90° direction. The zero load curve is shown as a reference.
very small. The zoomed window is centered at a frequency of 250 kHz, which allows the phase velocity changes with angle to be seen. Figure 6 shows the change of phase velocity relative to zero load as a function of angle at a fixed frequency of 250 kHz. The data points are as computed from the theory, and the smooth curve is the result of fitting these points to a function of the form, ∆c(ϕ ) = a0 + a1 cos(2ϕ ) + a2 sin(2ϕ ) .
(29)
This relation is expected from bulk wave acoustoelasticity but is not explicit in the Lamb wave formulation. The data points match this function almost perfectly.
Acoustoelastic Lamb Waves and Implications for Structural Health Monitoring 107
Change in Phase Velocity (m/s)
5
0
−5
−10
−15
−20 −90
Theory Curve Fit −45
0 Propagation Angle (degrees)
45
90
Fig. 6. Change in phase velocity for the S0 mode at 250 kHz versus propagation angle with respect to the direction of a 46 MPa uniaxial load applied along the x2 axis (φ = 90°).
5. Experimental Results
Experiments were performed to measure changes in phase velocity as a function of load for the S0 mode at 250 kHz propagating in a plate of thickness 6.35 mm. Changes in recorded signals are analyzed to obtain the time shift with load, and measured time shifts are converted to changes in phase velocity assuming that the zero load dispersion curves and transducer separation distances are known. 5.1. Description of Measurements
Ten transducers consisting of 7 mm diameter PZT disks were bonded to the surface of a 6.35 mm thick aluminum plate on the circumference of a 218 mm diameter circle. The dimensions of the plate were 305 mm × 610 mm, and the transducer circle was located in the center of the plate. The plate was mounted in a testing machine, and ultrasonic signals were recorded between nine pairs of transducers at loads ranging from 0 to 57.5 MPa at 5.75 MPa steps. The load was applied along the long
108
J. E. Michaels, N. Gandhi and S. J. Lee Table 2. Angles of propagation and separation distances for the nine transducer pairs used to measure changes in phase velocity with load. Pair Number 1
Separation Distance (mm) 218.00
Propagation Angle (degrees) 90.00
2
213.81
-78.75
3
218.00
-67.50
4
181.26
-56.25
5
218.00
45.00
6
181.26
-33.75
7
218.00
-22.50
8
213.81
-11.25
9
218.00
0
dimension of the plate, and the ends of the plate were machined so that they could be effectively gripped during the loading process. The transducer pairs were selected to include all available angles of propagation between the ten individual sensors while keeping the separation distances as large as possible; distances and angles are summarized in Table 2. The excitation was a 250 kHz, 5-cycle, Hanning windowed tone burst with an amplitude of 10 volts, and signals were recorded by averaging 50 waveforms to reduce incoherent noise. 5.2. Signal Analysis and Results
Signals were analyzed to obtain the time shift of the first arrival as a function of load by tracking the time of a zero crossing near the center of the echo corresponding to the direct arrival from transmitter to receiver. This time shift is predominantly caused by changes in phase velocity rather than group velocity3. A linear fit was performed with the slope equal to the change in time per load. The time for a load of 46 MPa was calculated and then normalized by the transducer separation distance. Results are shown in Figure 7 along with the trigonometric curve fit.
Acoustoelastic Lamb Waves and Implications for Structural Health Monitoring 109
Time Change per Distance ( µs/m)
0.6
Experiment Curve Fit 0.4
0.2
0
−0.2 −90
−45
0 Propagation Angle (degrees)
45
90
Fig. 7. Time change per unit distance plotted as a function of propagation angle relative to a 46 MPa uniaxial load applied along the x2 axis (φ = 90°).
The most likely source of experimental error is the possibility that the zero crossing being tracked was not from the direct S0 arrival. At the measurement frequency of 250 kHz, the only other mode is the A0 mode, which has a slower group velocity. However, it is possible that either the direct A0 arrival or an S0 reflection could overlap with the direct S0 arrival. The quality of the experimental data can be assessed by comparing the measurements with the expected trigonometric relationship. The fact that there is excellent agreement gives confidence that the correct zero crossing was tracked as a function of load. The time change per distance data were then used to compute the change in phase velocity. If a plane wave propagates a distance d, the time for a point of constant phase to travel this distance is simply d/c. Small changes in time can be caused by small changes in either or both distance and phase velocity,
∆t =
∂t ∂t 1 d ∆d + ∆c = ∆d − 2 ∆c . c c ∂d ∂c
(30)
110
J. E. Michaels, N. Gandhi and S. J. Lee
Change in Phase Velocity (m/s)
5
0
−5
−10 Experiment Curve Fit Theory
−15
−20 −90
−45
0 Propagation Angle (degrees)
45
90
Fig. 8. Time change per unit distance plotted as a function of propagation angle relative to a 46 MPa uniaxial load applied along the x2 axis (φ = 90°).
Referring back to Figure 6, the change in phase velocity does not take into account that the propagation distance has changed because the theory was developed using natural coordinates (i.e., in the unstressed state). Since the transducers used for the experiments were attached to the plate, measured time changes are also relative to natural coordinates. That is, the distance is taken to be the same as in the unstressed state. Thus, when comparing experiment to theory in natural coordinates, it is necessary to set ∆d = 0 in Eq. (30). Changes in phase velocity can then be calculated as ∆c = −c 2
∆t . d
(31)
Figure 8 compares phase velocities determined from measurements with those of Figure 6 that were calculated from theory. The nominal phase velocity at 250 kHz is 5048.8 m/s, and the time shift per distance (∆t/d) values are taken from Figure 7. The agreement is very good, particularly considering the uncertainty in the third order elastic constants25.
Acoustoelastic Lamb Waves and Implications for Structural Health Monitoring 111
R Receiver i
Source
Fig. 9. Geometry considered for numerical simulations of guided waves propagating from the source to the receiver. The four paths of propagation, which include the direct arrival and three edge reflections, are shown by the various lines in the figure. Note that the direct arrival (dashed line) and the reflection from the bottom edge (grey line) overlap.
6. Simulations and Discussion
Although the primary effect of an applied load is to shift an echo in time, a secondary effect can be seen via simulations that include edge reflections. It is usually impossible to avoid edge and other geometrical reflections for structural health monitoring applications, and it is these reflections that cause a considerable increase in complexity of measured guided wave signals. As previously mentioned, a key observation that has been made for temperature changes is that received signals exhibit only minor shape changes as the temperature is varied with the main effect being a stretching (or compressing) of the overall signal. It is instructive to compare this behavior with that due to applied loads. Figure 9 illustrates the geometry used for numerical simulations, which consists of a source located at (x,y) = (250 mm, 100 mm) and a receiver at (250 mm, 400 mm), where both sets of coordinates are given relative to a corner reflector. The corner causes three additional echoes to be received after the direct arrival, and it is assumed that the reflection
112
J. E. Michaels, N. Gandhi and S. J. Lee Zero Load
Uniaxial
Hydrostatic
0.05
0
−0.05 50
100
150 Time (µs)
200
250
Fig. 10. Simulated S0 signals at 250 kHz under the conditions of no load, a 46 MPa uniaxial load along the y direction, and a 2D hydrostatic load of 46 MPa.
coefficient is equal to -1. The simulation is carried out by using the dispersion curves for the specified load and direction to propagate each pulse from the source to the receiver via each of the four paths considered. The initial pulse shape was taken to be a 250 kHz, 7 cycle, Hanning windowed tone burst, and the final signal is computed as the sum of the four components. A geometric decay inversely proportional to the square root of the distance is also incorporated. Three cases were simulated that correspond to (1) no applied load, (2) an applied uniaxial load of 46 MPa along the y direction, and (3) an applied 2D hydrostatic load of 46 MPa. Figure 10 shows the three signals in a 200 µs time window, where it can be seen that changes are rather small. The nature of the changes can be assessed by magnifying several time windows as shown in Figure 11. The top window is centered on the first arrival, and, as expected, this echo shifts in time by approximately the same amount for both the uniaxial and hydrostatic loads; any shape changes are negligible. The second window includes edge reflections that
Acoustoelastic Lamb Waves and Implications for Structural Health Monitoring 113
partially overlap the tail of the first arrival. In this plot, it can be seen that the shape of the signal corresponding to the uniaxial load has significantly deviated from the other two signals, and the time shift is not only different from that of the hydrostatic load, it does not systematically increase (or decrease) with time. The third plot is the latest one in time and the signal in this time window is predominantly from the corner reflection. Here the wave shapes are all similar, but the time shift for the uniaxial load is quite a bit smaller than that for the hydrostatic load. By examining the signals in detail, it can be seen that for the hydrostatic load, the time shifts consistently increase with time and shape changes are minimal, as is the case for a temperature change. Both of these conditions are isotropic, and it is expected that the corresponding signal changes would be similar. However, that is not the case for the uniaxial load, which is anisotropic. Since different propagation paths correspond to different dispersion curves because of the anisotropic strains, time shifts are inconsistent. For the case shown here, there are only four echoes, but for a realistic structure, there are many geometrical reflectors, which cause received signals to be composed of multiple, overlapping echoes. The effect of an anisotropic load is to not only shift individual echoes differently even if they arrive at the same time, but to change the shapes of the signals15. There are several implications for structural health monitoring. First of all, time shifts due to applied stresses are significant when comparing current signals to baseline data and cannot be ignored. Second, the time shifts of the first arrivals can be used to deduce that there is an anisotropic stress field if signals from enough directions of propagation are available. If it is possible to account for any isotropic time shifts due to temperature, then it should be possible to estimate the applied stress if the stress field is homogeneous. Third, shapes of complex signals change with load, particularly for later times where the number of overlapping echoes increases. Existing methods for temperature compensation that rely upon stretching or compressing signals will not be effective for anisotropic loads; however, they should work for 2D hydrostatic loads.
114
J. E. Michaels, N. Gandhi and S. J. Lee Zero Load
Uniaxial
Hydrostatic
0.05
0
−0.05 75
80
85 Time (µs)
90
95
140
145 Time (µs)
150
155
180
185 Time (µs)
190
195
0.05
0
−0.05 135
0.05
0
−0.05 175
Fig. 11. Magnified time windows of the simulated signals shown in Figure 10.
Acoustoelastic Lamb Waves and Implications for Structural Health Monitoring 115
7. Conclusion and Future Work
Described here is the theory for acoustoelastic Lamb wave propagation, and both numerical and experimental results are shown for the S0 mode. These results show the expected linear dependence of phase velocity with load and sinusoidal dependence with angle of propagation, all at a single frequency. Unlike bulk waves, the acoustoelastic behavior is frequency dependent. Experimental results that measure the change in phase velocity as a function of both load and propagation angle agree well with theory. Numerical simulations show that the complicated signals resulting from wave propagation in a bounded structure change shape as a result of a uniaxial load, whereas for a 2D hydrostatic load shape changes are much less significant. The conclusion is that signal changes due to applied loads for guided wave structural health monitoring are expected to be significant and cannot be ignored. Future work should consider biaxial applied stresses, additional Lamb wave modes, continued experimental validation of loading effects, development of methods to compensate for applied stresses, and determination of third order elastic constants from Lamb wave measurements. Acknowledgments
This work was supported by the Air Force Research Laboratory under Contract No. FA8650-09-C-5206. References 1. T. Cicero, P. Cawley, M. J. S. Lowe and F. Simonetti, “Effects of liquid loading and change of properties of adhesive joints on subtraction techniques for structural health monitoring,” Rev. Prog. Quant. NDE 28A, Eds. D. O. Thompson and D. E. Chimenti (American Institute of Physics, 2009), 1006. 2. T. Clarke, P. Cawley, P. D. Wilcox and A. J. Croxford, “Evaluation of the damage detection capability of a sparse-array guided-wave SHM system applied to a complex structure under varying thermal conditions,” IEEE Trans. Ultrason. Ferroelect. Freq. Contr. 56(12), 2666 (2009). 3. A. J. Croxford, P. D. Wilcox, B. W. Drinkwater, and G. Konstantinidis, ‘‘Strategies for guided-wave structural health monitoring,’’ Proc. R. Soc. A 463, 2961 (2007).
116
J. E. Michaels, N. Gandhi and S. J. Lee
4. A. J. Croxford, J. Moll, P. D. Wilcox and J. E. Michaels, “Efficient temperature compensation strategies for guided wave structural health monitoring,” Ultrasonics 50, 517 (2010). 5. M. Duquennoy, M. Ouaftouh, M. Ourak and F. Jenot, “Theoretical determination of Rayleigh wave acoustoelastic coefficients: comparison with experimental values,” Ultrasonics 39, 575 (2002). 6. N. Gandhi, Determination of Dispersion Curves for Acoustoelastic Lamb Wave Propagation, M.S. Thesis, Georgia Tech (2010). 7. N. Gandhi, J. E. Michaels and S. J. Lee, “Acoustoelastic Lamb wave propagation in biaxially stressed plates,” (in preparation). 8. D. S. Hughes and J. L. Kelly, “Second-order elastic deformation of solids,” Phys. Rev. 92, 1145 (1953). 9. D. Husson, “A perturbation theory for the acoustoelastic effect of surface waves,” J. Appl. Phys. 57(5), 1562 (1985). 10. Y. Iwashimizu and O. Kobori, “Rayleigh wave in a finitely deformed isotropic elastic material,” J. Acoust. Soc. Am. 64(3), 910 (1978). 11. R. B. King and C. M. Fortunko, “Determination of in-plane residual stress states in plates using horizontally polarized shear waves,” J. Appl. Phys. 54(6), 3027 (1983). 12. M. Lematre, G. Feuillard, T. Delaunay and M. Lethiecq, “Modeling of ultrasonic wave propagation in integrated piezoelectric structures under residual stress,” IEEE Trans. Ultrason. Ferroelect. Freq. Contr. 53(4), 685 (2006). 13. Y. Lu and J. E. Michaels, “A methodology for structural health monitoring with diffuse ultrasonic waves in the presence of temperature variation,” Ultrasonics 43(9), 717 ( 2005). 14. Y. Lu and J. E. Michaels, “Feature extraction and sensor fusion for ultrasonic structural health monitoring under changing environmental conditions,” IEEE Sensors Journal 9(11), 1462 (2009). 15. J. E. Michaels, T. E. Michaels and R. Martin, “Analysis of global ultrasonic sensor data from a full scale wing panel test,” Rev. Prog. Quant. NDE 28A, Eds. D. O. Thompson and D. E. Chimenti (American Institute of Physics, 2009), 950. 16. J. E. Michaels, S. J. Lee and T. E. Michaels, “Effects of applied loads and temperature variations on ultrasonic guided waves,” Proc. 2010 European Workshop on Structural Health Monitoring, June 29 - July 2, 2010, Sorrento, Italy, pp. 1267-1272. 17. A. H. Nayfeh and D. E. Chimenti, “Free wave propagation in plates of general anisotropic media,” J. Appl. Mech. 56, 881 (1989). 18. Y.-H. Pao, W. Sachse and H. Fukuoka, "Acoustoelasticity and ultrasonic measurement of residual stress," in Physical Acoustics Volume XVII, Eds. W. P. Mason and R. N. Thurston (Academic Press, New York, 1984), 61. 19. Y.-H. Pao and U. Gamer, “Acoustoelastic waves in orthotropic media,” J. Acoust. Soc. Am. 77, 806 (1985).
Acoustoelastic Lamb Waves and Implications for Structural Health Monitoring 117 20. J. Qu and G. Liu, “Effects of residual stress on guided waves in layered media,” Rev. Prog. Quant. NDE 17, Eds. D. O. Thompson and D. E. Chimenti (Plenum Press, 1998), 1635. 21. A. Raghavan and C. E. S. Cesnik, “Review of guided-wave structural health monitoring,” Shock Vib. Digest 39, 91 (2007). 22. P. Rizzo and F. Lanza di Scalea, “Effect of frequency on the acoustoelastic response of steel bars,” Exp. Tech. 27(6), 40 (2003). 23. J. L. Rose, Ultrasonic Waves in Solid Media (Cambridge University Press, 1999). 24. J. L. Rose, “Guided wave nuances for ultrasonic nondestructive evaluation,” IEEE Trans. Ultrason. Ferroelect. Freq. Contr. 47(3), 575 (2000). 25. D. M. Stobbe, Acoustoelasticity in 7075-T651 Aluminum and Dependence of Third Order Elastic Constants on Fatigue Damage, M.S. Thesis, Georgia Tech (2005). 26. R. B. Thompson, S. S. Lee and J. F. Smith, “Angular dependence of ultrasonic wave propagation in a stressed orthorhombic continuum: Theory and application to the measurement of stress and texture,” J. Acoust. Soc. Am. 80(3), 921 (1986). 27. R. A. Toupin and B. Bernstein, “Sound waves in deformed perfectly elastic materials. Acoustoelastic effect,” J. Acoust. Soc. Am. 33, 216 (1961). 28. R. L. Weaver and O. I. Lobkis, “Temperature dependence of diffuse field phase,” Ultrasonics 38, 491 (2000).
This page is intentionally left blank
February 25, 2011
13:57
World Scientific Review Volume - 9in x 6in
CHAPTER 5 SOURCE SYNTHESIS FOR INVERSE PROBLEMS IN WAVE PROPAGATION
William W. Symes Computational and Applied Mathematics, MS 134, Rice University Houston TX 77005, USA E-mail:
[email protected] Extended sources, both physical and synthetic, have a long history in acoustics and seismology. Recently, extended synthetic sources have been used to substantially reduce the computational complexity of reflection seismic waveform inversion (model-driven data fitting). Most proposed synthetic sources for waveform inversion are random in some way. A different, deterministic extended source selection strategy has met with success in application to other continuum-mechanical inverse problems. This paper reports preliminary work on a translation of the method to the reflected wave imaging setting, a basic theoretical framework for analysing its behaviour, and a numerical example. The example shows the very strong tendency of normalized source energy to insert large high angle components into the acoustic field; this tendency is also predicted by the theory. Such high angle waves produce the largest possible discrimination between the responses of two different models, but may not optimally illuminate the subsurface for inversion updates. The paper ends with suggestions for modification of the source selection strategy to better adapt it to inversion applications.
5.1. Introduction Many acoustic and seismic imaging technologies employ localized sources and sensors for economic or practical reasons. Examples are the transducers used in ocean acoustics or the airguns and other source devices used in reflection seismology. It is sometimes possible to use a number of such localized sources simultaneously to create an extended physical source. Also, linearity of the underlying wave processes permits a posteriori synthesis of 119
Chapter5
February 25, 2011
120
13:57
World Scientific Review Volume - 9in x 6in
W. W. Symes
the signals that would have been created by extended arrays of sources, through the principle of superposition. Synthetic extended source processing has a long history in reflection seismology. Early applications used deterministic synthesis principles, such as plane wave synthesis [Diebold and Stoffa (1981); Treitel et al. (1982)]. In the last few years, the renewed interest in model-driven full waveform inversion of reflection seismic data has motivated considerable work on both physical and synthetic extended sources [Beasley et al. (1998); Berkhout (2008); Blacquiere et al. (2009); Romero et al. (2000); Verschuur and Berkhout (2009); Krebs et al. (2009)]. The benefits are clear: simultaneous recording of physical sources reduces recording time hence cost. Rolling together the responses to many sources in a digital synthesis and processing the resulting summary data can reduce computational costs dramatically. Krebs et al. (2009) for example demonstrate more than an order of magnitude speedup in full waveform inversion over collective inversion of all localized source data. Most of the recently suggested synthesis methods rely on some form of randomization, possibly motivated by recent work in compressive sensing (an early example is [Romero et al. (2000)]). This paper presents an initial exploration of an alternative, deterministic synthesis method based on spectral theory. The principle underlying the deterministic choice of synthesis filter is simple: amongst all admissible synthesis filters, choose the one which makes the predicted data at the current model differ as much as possible from the corresponding synthesized target data. This maximization problem is equivalent to finding the singular vector with largest singular value of the difference of operators mapping source coefficients to data (precise definitions below) for the current model estimate and the “real” model (the latter being computed by proxy synthesis from measured localized source data), or equivalently its operator norm. Thus the inverse problem is implicitly reformulated as minimization of the error, measured in the operator norm, between source-to-data operators, rather than between the usual collective mismatch of data measured somehow. Various numerical methods can be employed to approximate the operator norm, the simplest being the power method. The author learned this idea through the work of D. Isaacson on electrical impedance tomography, a biomedical inverse problem similar to resistivity imaging [Isaacson (1987)]. Essentially the same idea has been used to good effect in ultrasonic and microwave array processing, synthetic aperture radar, underwater acoustics, and other imaging technologies [Borcea et al.
Chapter5
February 25, 2011
13:57
World Scientific Review Volume - 9in x 6in
Source Synthesis for Inverse Problems in Wave Propagation
Chapter5
121
(2007); Fink and Prada (2001)]. In biomedical EIT, the current-voltage relation amongst a set of electrodes is measured. This relation depends on the conductivity interior to the body to which the electrodes were attached, but the dependence is very weak. Only a few current patterns produce voltage responses capable of distinguishing interior conductivity distributions. Therefore it is extremely important to find these optimal patterns. The mathematical setup is the same: the voltage response is linear in the applied currents at the various electrodes, and the best pattern can be found without knowing the internal conductivity structure - by solving a singular value problem. Our initial exploration of this concept has uncovered an obstacle apparently peculiar to wave problems. Identification of singular values requires that metrics (in fact, inner products) be identified in both domain and range of the operator. The obvious choice of metric for the source coefficients leads to an overemphasis on wave fields propagating nearly horizontally - these correspond to the largest singular values, and so dominate the sources synthesized by the operator norm principle. These sources appear to be less-than-optimal from the viewpoint of imaging or inversion. We explain the origin of this phenomenon in the Theory section below, illustrate it in the Numerical Example section, and suggest a possible amendment to the method in the Conclusion section, via definition of a different metric, which may lead to source estimates better-suited to waveform inversion. This deterministic source synthesis method shares with its random cousins a potential drawback: its formulation requires “fixed spread” acquisition geometry, at least in principle: that is, every receiver records data from every source. Not every data acquisition modality conforms to this constraint, a notable violator being towed streamer acquisition in marine seismology. While this restriction can be relaxed to some extent, it appears to constitute an important limitation. 5.2. Theory Linear acoustodynamics takes two natural forms. The first order system pairs the linear acoustic constitutive law with Newton’s law for small amplitude motions of a fluid continuum: 1 ∂ p(t, x) = −∇ · v(t, x) + s(t, x), κ(x) ∂t ∂ ρ(x) v(t, x) = −∇p(t, x). ∂t
(5.1) (5.2)
February 25, 2011
122
13:57
World Scientific Review Volume - 9in x 6in
Chapter5
W. W. Symes
The bulk modulus κ and material density ρ are functions of position x. The right-hand side s in the first equation represents the source of acoustic energy as a constitutive law defect. Eliminating the velocity field from (5.1) via (5.2) leads to the secondorder wave equation for pressure: 1 ∂2 ∂s 1 ∇ p(t, x) = (t, x). (5.3) − ∇ · κ ∂t2 ρ ∂t In this work I assume that the fluid supporting wave motion occupies the full Euclidean space R3 , and that the source distribution is confined to a plane, for convenience z = 0. Thus s(t, x) = f (t, x, y)δ(z).
(5.4)
Also all fields are causal: p, v, and f vanish for t < 0, that is, p(0, x) = 0, v(0, x) = 0. Of course this source representation is an idealization - physically realizable sources are reasonably well-represented as superpositions of point sources, and the representation (5.4) is only reasonable if sampling is adequate, an issue which this paper does not address. Idealized observations of the pressure are the time series recorded on the same plane z = 0, for a time interval 0 ≤ t ≤ T fixed for the remainder of the discussion: d(t, x, y) = p(t, x, y), 0 ≤ t ≤ T. The relation between the energy source s and the data d is linear, but is affected nonlinearly by the material parameter fields κ and ρ. Thus the linear operator relating f to d is also a function of κ and ρ: d = Λ[κ, ρ]f
(5.5)
I will suppress κ and ρ from the notation occasionally when they are playing no explicit role. Note that in application to field data due to sources fl (t, x, y; xs ) localized at many positions xs , say dl (t, x, y, xs ), one would naturally represent the relation (5.5) implicitly. The relation between the localized sources and the synthetic source f is represented as a convolution and summation: f (t, x, y) = dτ dxs a(t − τ, xs )fl (τ, x, y; xs ) as is the relation between localized source and synthetic source data: d(t, x, y) = dτ dxs a(t − τ, xs )dl (τ, x, y; xs ). (5.6)
February 25, 2011
13:57
World Scientific Review Volume - 9in x 6in
Source Synthesis for Inverse Problems in Wave Propagation
Chapter5
123
That is, dl embodies the action of the localized source simulation operator depending on the same material parameters as does d and the localized source distribution. Since the localized operator is unknown in an inverse problem context, it is important to note that its effect has already been accounted for in the measured (localized source) data. For the rest of the paper I will use the representation (5.5) for the target data, as if the target model were known; the reader will understand that the alternative representation (5.6) is actually to be used in applications presenting experimental localized-source data. Assuming that the localized sources fl are known (estimating these is yet another interesting problem!), the filter kernel a then parametrizes f , and in effect it is a that is to be determined somehow. In the present, preliminary study, I will suppress this complication. As explained in the introduction, the objective of the present study is the determination of source distributions f giving rise to the largest possible difference between the pressure field responses for two different media, say (κ, ρ) and (κ0 , ρ0 ). That is, ∆[κ, ρ, κ0 , ρ0 ]f ≡ Λ[κ, ρ]f − Λ[κ0 , ρ0 ]f is to be made as large as possible. To turn this request into a solvable problem, some meaning has to be assigned to the word “large”. A natural measure of size for the acoustic state (p, v) at a given time is the stress-energy 2 p 1 dx + ρv · v (x, t). (5.7) E(t) = 2 κ However, the energy in the field at a given time does not necessarily bound the field restricted to a positive codimension manifold such as that defined by z = 0. The relation between field energy and the “energy” (in various senses) of such restrictions has been studied from various points of view, see eg. [Symes (1983)] for an extensive formal analysis. To capture the essence of the relation, a less formal approach is useful. Via Radon transform, any source waveform f (x, y, t) can be expressed as a sum of plane wave components of the form F (t − ξx − ηy). Assuming for the moment that the material is homogeneous, that is, that κ, ρ are independent of x, it is reasonable to guess that for z = 0, p is also a plane wave, with the same wave vector (1, ξ, η) in (t, x, y). Thus, set p(t, x, y, z) = P± (t − ξx − ηy − ζ± z) for ± z > 0.
February 25, 2011
124
13:57
World Scientific Review Volume - 9in x 6in
W. W. Symes
In order that p, so defined, solves the homogeneous wave equation for z = 0 and is causal, set ρ − ξ2 − η2 . ζ± = ±ζ, ζ = κ Of course, for the plane wave defined by (ξ, η, ζ) to propagate, the argument of the square root must be positive, meaning that the plane wave propagation direction makes a nonzero angle with z = 0. The pressure field must be a generalized solution of the wave equation, since the right hand side in (5.3) is singular. Applying the definition of weak solution [Lax (2006)] and making the standard integration-by-parts argument using smooth test functions, the connection between P± and F follows: 1 F (t). (5.8) P+ (t) = P− (t) = 2ζ Plane waves have infinite energy, of course, but the energy per unit volume is finite if the waveform is square-integrable. It follows from (5.8) that the energy of the plane wave acoustic field in a spatial volume at time T is proportional to the mean-square of the derivative of F over a related volume and time < T . The presence of ζ in the denominator of (5.8) implies that in general the energy density in the plane wave field is not bounded by the mean-square of the source waveform F (t), as ζ may be arbitrarily small (positive). As it turns out, neither does the energy of the field bound the mean-square of the source. Examples illustrating the failure of any such bounds may be found in Symes (1983). However, the anomalously large or small fields involve high-frequency pulses propagating near (“grazing”) z = 0, that is, to phenomena confined to a small zone around the source support plane. In contrast, the energy of the solution over any finite subvolume of {x : z ≥ zmin > 0} for any positive minimum depth zmin is bounded by a fixed multiple of the source mean-square. Even with this restriction to wave energy propagating into the region |z| > 0, however, the energy contributed to the acoustic field per unit volume for a unit mean square source field f (t, x, y) should be larger for f that are rich in plane wave components with high angle of incidence (small ζ). This effect will be evident in the examples to come. Further insight into the nature of Λ may be gained through study of the map from data to field at the (final) time T: set Γ[κ, ρ]f (x) = (p(x, T ), v(x, T )).
Chapter5
February 25, 2011
13:57
World Scientific Review Volume - 9in x 6in
Source Synthesis for Inverse Problems in Wave Propagation
Chapter5
125
Given a scalar field φ(x) and a vector field ψ(x), define (q(x, t), w(x, t)) to be the dual acoustic field satisfying 1 ∂q = −∇ · w, κ ∂t ∂w = −∇q, ρ ∂t q(T, x) = φ(x),
(5.9) (5.10) (5.11)
w(T, x) = ψ(x).
(5.12)
Compare Γf with (φ, ψ) using the bilinear form associated with the energy: 1 Γf, (φ, ψ)E ≡ dx pq + ρv · w (T, x) (5.13) κ
T
=
d dx dt
dt −∞
T
=
dt
dx
−∞
+ =
∂v 1 ∂p q+ρ ·w κ ∂t ∂t
dt
dx
−∞
+ =
dt
dx
−∞
+
−∞
1 ∂p q − ∇p · w κ ∂t
1 ∂p q + p∇ · w κ ∂t
dt
+
(t, x)
(t, x)
(t, x)
1 ∂q p + q∇ · v (t, x) κ ∂t
T
=
1 ∂q p − ∇q · v (t, x) κ ∂t
T
1 pq + ρv · w (t, x) κ
1 ∂q ∂w p+ρ · v (t, x) κ ∂t ∂t
T
dx
1 ∂p +∇·v q κ ∂t
1 ∂q − ∇ · w p (t, x) κ ∂t
February 25, 2011
13:57
World Scientific Review Volume - 9in x 6in
126
Chapter5
W. W. Symes
T
=
dt
dx (f (t, x, y)δ(z)) q(x, y, z)
−∞
T
dt
=
dx
dyf (t, x, y)q(t, x, y, 0).
−∞
It follows that the formal adjoint of Γ (with respect to the energy as metric for the acoustic Cauchy data (output), and the RMS metric for the input (RHS data), is given by ΓT [κ, ρ](φ, ψ) = q(t, x, y, 0), 0 ≤ t ≤ T. Suppose (φ, ψ) = (p, v)t=T = Γ[κ, ρ]f . Then the uniqueness theorem for the wave equation Cauchy problem [Lax (2006)] implies that actually q = p in the time range 0 ≤ t ≤ T , from which we deduce that Λ[κ, ρ] = ΓT [κ, ρ]Γ[κ, ρ].
(5.14)
Thus Λ is self-adjoint and positive (semi-)definite. The analysis presented in [Symes (1983)] implies that Λ has arbitrarily large and small positive eigenvalues. However, the very large and very small eigenvalues are related to solutions which vanish away from z = 0. Therefore, if two acoustic models κ, ρ and κ0 , ρ0 are identical near z = 0, then the eigenvectors corresponding to very large and very small eigenvalues of Λ[κ, ρ] and Λ[κ0 , ρ0 ] should be the same, and should combine to produce near-zero eigenvalues of ∆[κ, ρ, κ0 , ρ0 ]. That is, these extreme source effects should not affect the size of the operator ∆, which can be regarded as a bounded self-adjoint operator. A natural measure of size of ∆ is its operator norm, as mentioned in the Introduction, that is, the least upper bound of the absolute value of its spectrum. 5.3. Numerical Example The power method is the simplest method for estimating an approximate eigenpair f, λ approximating the spectral upper bound. The power method applies to bounded self-adjoint positive (semi)definite operators, whereas the difference operator ∆ is only bounded self-adjoint. Therefore I apply the power method to ∆2 , which is positive semidefinite. To update a current
February 25, 2011
13:57
World Scientific Review Volume - 9in x 6in
Source Synthesis for Inverse Problems in Wave Propagation
Chapter5
127
estimate f, λ to an improved estimate f + , λ+ , g = ∆f +
λ f
+
f+
T
(5.15)
= g g
(5.16)
= ∆g
(5.17)
← f + / (f + )T f +
(5.18)
The last step normalizes the eigenvector estimate f + to have unit norm. To make this prescription into an iterative algorithm for approximating the spectral upper bound, replace f, λ with f + , λ+ and repeat until the increase λ+ − λ falls below a user-chosen tolerance, or a fixed number of iterations are accomplished. Implementation of this algorithm requires an approximate solver for the acoustic wave system (5.1,5.2). I have used IWAVE, a staggered-grid finite difference solver developed in my group, which supports high (up to 14th) order difference approximation and parallel execution, amongst other features [Terentyev (2009)]. For the small 2D example described here, 4th order spatial difference approximation and serial execution sufficed. Implementation of algorithms such as the power method also requires an algebraic realization of simulators such as IWAVE in the form of operators, with all of the features that operators (linear, in this case) commonly possess. I have employed the operator interfaces defined by the Rice Vector Library [Padula et al. (2009)], via wrapper code which my group also uses to construct various inversion algorithms [Sun and Symes (2010a,b)]. Figures 5.1 and 5.2 show bulk modulus (κ) and density (ρ) fields used in this example. The values of density are typical of shallow sediments overlain by a water layer. The reference bulk modulus and density were constants approximating those of sea water (κ0 = 2250 MPa, ρ0 = 1000 kg/m3 ). The bulk modulus is actually computed as κ = ρv2 , where v ≡ 1.5km/s is a homogenous “water” velocity - this accounts for the very similar appearance of κ and ρ. That is, the wave velocities in the target and reference materials are the same. This choice produces an easier material parameter estimation problem, as the wave velocity is by far the most difficult parameter to estimate. Even though this paper will not address this material parameter inverse problem directly, the intent of the work reported here is to produce good selections of sources for this inverse problem, so the example is set up to produce a relatively easy instance. The domain depicted in the figures is the computational domain used in this example. Absorbing boundary conditions of PML type [Cohen (2002)]
February 25, 2011
13:57
World Scientific Review Volume - 9in x 6in
128
W. W. Symes
0
x (km) 4
2
6
z (km)
0
1
3000
Fig. 5.1.
4000 MPa
5000
Bulk modulus model used in numerical example.
0
2
x (km) 4
6
z (km)
0
1
1000
Fig. 5.2.
1500 kg/m^3
2000
Density model used in numerical examples.
Chapter5
February 25, 2011
13:57
World Scientific Review Volume - 9in x 6in
Chapter5
Source Synthesis for Inverse Problems in Wave Propagation
129
simulate a full (2D) Euclidean space, consistent with the theory. The source plane is z = 40 m, in the coordinate system of the figures. The distributed source is represented by a grid function in the horizontal direction with the same sampling (20 m) as the fields. Data were simulated and recorded for 2 s, at a sample rate of 2 ms. The initial source distribution is a tapered plane wave at normal incidence, depicted in Figure 5.3. The Rayleigh quotient for this source is 173.9 (dimensionless, as it is a ratio of norms of similar vectors). After 20 iterations of the power method, the Rayleigh quotient had grown to 3785.5, with increase of well under 1% for the last five iterations. Thus this final Rayleigh quotient can be taken as a reasonably accurate estimate of the operator norm of ∆[κ, ρ, κ0 , ρ0 ]2 . Clearly the initial source estimate is far from an eigenvector with maximum eigenvalue. The final optimized source f is displayed in Figure 5.4. Note the very high angle waves which dominate this source. The output, that is ∆f , is displayed in Figure 5.5. High angle waves also predominate in this display. As noted before, it is possible to confine an oscillating pulse near the surface with arbitrarily large Rayleigh quotient, but such solutions would be identical for both the target (κ, ρ) and reference (κ0 , ρ0 ) models, hence would cancel to yield a near-null vector for the difference operator ∆. Instead,
x (km)
t (s)
0
0
2
4
6
1
2 Fig. 5.3. Initial truncated plane wave source used in numerical example. Waveform is trapezoidal bandpass filter with corner frequencies 2.5, 5, 12.5, and 20 Hz, tapered to zero at the edges of the domain with a cosine shoulder.
February 25, 2011
13:57
World Scientific Review Volume - 9in x 6in
130
Chapter5
W. W. Symes
x (km)
t (s)
0
0
2
4
6
1
2 Fig. 5.4. Source estimate, approximating eigenvector with largest eigenvalue of ∆2 , produced by 20 iterations of the power method starting with the truncated plane wave source of Figure 5.4. Note dominance of high-angle waves - these are the traces on z = 40 of nearly horizontally propagating waves.
x (km)
t (s)
0
0
2
4
6
1
2 Fig. 5.5. Output of ∆ applied to the optimized source of Figure 5.4, that is, the difference of responses between (κ, ρ) (Figures 5.1, 5.2) and the homogeneous background model. Displays high-angle energy, propagating just far enough into z > 0 to interrogate the water bottom with maximum energy.
February 25, 2011
13:57
World Scientific Review Volume - 9in x 6in
Source Synthesis for Inverse Problems in Wave Propagation
Chapter5
131
the data of Figure 5.5 is that of a solution propagating nearly horizontally but penetrating the half-space z > 0 sufficiently to interact with the largest feature differentiating the two models, that is, the water bottom at depth 400 m. It is somewhat easier to appreciate the wave geometry of this example through examination of time slices of the field. Figures 5.6 and 5.7 show time slices of the pressure field produced by the optimized source (Figure 5.4) in the homogeneous reference model, at 1 s and 2 s. It is clear that most of the energy in the acoustic field is involved in near-horizontal propagation, but that the angle is steep enough to cause a substantial reflection from the water bottom interface. To assess the impact of this emphasis on high-angle waves on inversion for the material parameters, it is natural to examine the gradient of the least-squares objective (in 2D) (κtrial , ρtrial ) →
T
dt 0
dx|Λ[κtrial , ρtrial ]f − d|2 .
This gradient is known as the reverse-time migration image in reflection seismology, and may be regarded as a scaled, bandlimited image of the model difference [Tarantola (1987); Symes (2008)]. I take d = Λ[κ, ρ]f as the data created by the target model (Figures 5.1, 5.2). Figure 5.8 shows the bulk modulus component of the resulting gradient at κtrial = κ0 , ρtrial = ρ0 for the plane wave source (Figure 5.3). The first interface (water bottom) is correctly positioned, as the sound velocity in the water layer is correct in the reference model. Deeper features are also correct in overall shape, though mispositioned due to the incorrect velocity in the reference model below the water bottom. The gradient for the optimized source, while almost three orders of magnitude larger, is visually much less appealing (Figure 5.9). It does provide a bandlimited image of the central part of the water bottom, hence has captured some of the difference between the two models. However it does not possess the aperture of the initial source image. The reduced bandwidth evident in the figure is easy to understand: the horizontal interface has been interrogated with high-angle waves, which it “sees” as being lower in frequency content as a consequence of their larger vertical velocity. It is not possible to dismiss this direction in model space as unconstructive, but it is far from obvious that the optimized source will lead to more rapidly convergent inversion iterations.
February 25, 2011
13:57
World Scientific Review Volume - 9in x 6in
132
W. W. Symes
0
2
x (km) 4
6
z (km)
0
1
Fig. 5.6. Pressure field response to optimized source (Figure 5.4), propagating in homogeneous model (κ0 , ρ0 ), at t = 1 s.
0
2
x (km) 4
6
z (km)
0
1
Fig. 5.7. Pressure field response to optimized source (Figure 5.4), propagating in homogeneous model (κ0 , ρ0 ), at t = 2 s.
Chapter5
February 25, 2011
13:57
World Scientific Review Volume - 9in x 6in
Source Synthesis for Inverse Problems in Wave Propagation
0
2
x (km) 4
Chapter5
133
6
z (km)
0
1
Fig. 5.8. Bulk modulus component of least-squares gradient (also known as reverse time migration image) at homogeneous reference model (κ0 , ρ0 ), for initial source estimate (Figure 5.3). This field represents the bulk modulus perturbation that most rapidly increases the least squares difference between the pressure trace responses to current trial estimate (κ0 , ρ0 ) and the target model (κ, ρ). As is evident, it is also an image of the zones of rapid change in the model.
0
2
x (km) 4
6
z (km)
0
1
Fig. 5.9. Bulk modulus component of least squares gradient at homogeneous reference model, for optimized source (Figure 5.4) produced by 20 iterations of the power method.
February 25, 2011
13:57
134
World Scientific Review Volume - 9in x 6in
W. W. Symes
5.4. Conclusion In the preceding pages, I have explained a translation of Isaacson’s distinguishability algorithm to the reflected wave problem, and explored its properties. The emphasis on high-angle energy, both predicted by the theory and evident in the numerical experiments, appears unlikely to be appropriate for application to waveform inversion. The sources produced by approximating the largest eigenvalue of the difference simulation operator (∆ above) certainly produce the largest differences between responses per unit input energy, but do not seem to result in particularly good material parameter updates. At least one obvious avenue for improvement of this algorithm is evident. The phrase “per unit energy” is the key: the mean square of the distributed source field f (t, x, y) is not an energy, nor does it correspond directly to the actual energy in the field. A natural avenue for improving the correspondence between “best” source and useful interrogation of the acoustic medium may be the use of the field energy as a source metric. Within the power method, this simply amounts to a redefinition of the source normalization. A straightforward way to implement this renormalization of the difference operator may be via the identity (5.14) linking the simulation operator Λ to the “final state” operator Γ. As discussed in the theory section, the (in principle unboundedly) large eigenvalues of Λ, or singular values of Γ, correspond to solutions which do not significantly leave the plane of the source. Thus insertion of a window function wR in Γ, restricting attention to a zone R contained in z > 0, converts Γ into a bounded operator. If the “energy” of the source is measured as actual energy in R: 2 1 T p + ρv · v (T, x) dt dx wR (x) ER (f )2 = 2 0 κ then the tendency of high angle energy to generate bigger fields is automatically compensated, and moreover the source selection is focused on producing large fields in the window region R. A further investigation along these lines is in progress. Acknowledgments I am grateful to the sponsors of The Rice Inversion Project for their longterm support of my research, and to the organizers of the International
Chapter5
February 25, 2011
13:57
World Scientific Review Volume - 9in x 6in
Source Synthesis for Inverse Problems in Wave Propagation
Chapter5
135
Symposium on Engineering Mechanics for the opportunity to present my evolving work on source synthesis. My understanding of the theoretical basis of this subject relies in the work I performed in the early 1980’s as a member of Prof. Y.-H. Pao’s research group at Cornell University. My appreciation of the central importance of source mechanisms in everything having to do with wave motion dates from my time with Prof. Pao and reflects his strong influence on my work. References Beasley, C. J., Chambers, R. E. and Jiang, Z. (1998). An new look at simultaneous sources, in Expanded Abstracts (Society of Exploration Geophysicists), pp. 133–135. Berkhout, G. (2008). Changing the mindset in seismic data acquisition, The Leading Edge 27, pp. 924–938. Blacquiere, G., Berkhout, G. and Verschuur, E. (2009). Survey design for blended acquisition, in Expanded Abstracts (Society of Exploration Geophysicists), pp. 56–61. Borcea, L., Papanicolaou, G. and Tsogka, C. (2007). Optimal waveform design for array imaging, Inverse Problems 23, pp. 1973–2020. Cohen, G. C. (2002). Higher Order Numerical Methods for Transient Wave Equations (Springer, New York). Diebold, J. and Stoffa, P. (1981). The traveltime equation, tau-p mapping, and inversion of common midpoint data, Geophysics 46, pp. 238–254. Fink, M. and Prada, C. (2001). Acoustic time-reversal mirrors, Inverse Problems 17, pp. R1–R38. Isaacson, D. (1987). Distinguishability of conductivities by electric current computed tomography, IEEE Transactions on Medical Imaging MI-5, pp. 91– 95. Krebs, J. R., Anderson, J. E., Hinkley, D., Neelamani, R., Lee, S., Baumstein, A. and Lacasse, M.-D. (2009). Fast full-waveform seismic inversion using encoded sources, Geophysics 74, pp. WCC177–WCC188. Lax, P. D. (2006). Hyperbolic Partial Differential Equations (Courant Lecture Notes) (American Mathematical Society, Providence, RI). Padula, A. D., Symes, W. W. and Scott, S. D. (2009). A software framework for the abstract expression of coordinate-free linear algebra and optimization algorithms, ACM Transactions on Mathematical Software 36, pp. 8:1–8:36. Romero, A., Ghiglia, D. C., Ober, C. C. and Morton, S. A. (2000). Phase encoding of shot records in prestack migration, Geophysics 65, pp. 426–436. Sun, D. and Symes, W. W. (2010a). IWAVE implementation of Born simulation, Tech. Rep. 10-05, Department of Computational and Applied Mathematics, Rice University, Houston, Texas, USA, URL http://www.caam.rice.edu. Sun, D. and Symes, W. W. (2010b). IWAVE implementation of the adjoint state method, Tech. Rep. 10-06, Department of Computational and Ap-
February 25, 2011
136
13:57
World Scientific Review Volume - 9in x 6in
W. W. Symes
plied Mathematics, Rice University, Houston, Texas, USA, URL http: //www.caam.rice.edu. Symes, W. W. (1983). Trace theorem for solutions of the wave equation and the remote determination of acoustic sources, Math. Methods in Appl. Sciences 5, pp. 131–152. Symes, W. W. (2008). Approximate linearized inversion by optimal scaling of prestack depth migration, Geophysics 73, pp. R23–35. Tarantola, A. (1987). Inverse Problem Theory (Elsevier). Terentyev, I. (2009). A software framework for finite difference simulation, Tech. Rep. 09-07, Department of Computational and Applied Mathematics, Rice University, Houston, Texas, USA, URL http://www.caam.rice.edu. Treitel, S., Gutowski, P. and Wagner, D. (1982). Plane-wave decomposition of seismograms, Geophysics 47, pp. 1375–1401. Verschuur, D. J. and Berkhout, G. (2009). Target-oriente least-squares imaging of blended data, in Expanded Abstracts (Society of Exploration Geophysicists), pp. 2889–2894.
Chapter5
CHAPTER 6 AN INTRODUCTION TO AN ADAPTIVE DATA ANALYSIS METHOD
Norden E. Huang Research Center for Adaptive Data Analysis National Central University Chungli, Taiwan, ROC E-mail:
[email protected] The existing methods of data analysis, either the probability theory or the spectral analysis, are all developed by mathematicians or based on their rigorous mathematical rules. For analyzing data from the real physical world, we have to face the reality of nonstationarity and nonlinearity in the processes. The traditional analysis methods are based on rigorous approach, but cannot fully accommodate these conditions. A new adaptive data analysis was introduced by Huang et al (1998), which was designated by NASA as the Hilbert-Huang Transform (HHT). It consists of Empirical Mode Decomposition (EMD) and the Hilbert Spectral Analysis (HSA) methods; both were introduced recently by Huang et al. (1996, 1998, 1999 and 2003). The method is adaptive, and specifically designed for analyzing data from nonlinear and nonstationary processes. Since its introduction over ten years ago, the HHT has been applied to a wide range of applications, covering (among many others) biology, geophysics, ocean research, engineering, radar and medicine (see, for example, Huang and Attok-Okine, 2005; Huang and Shen, 2005; Huang and Wu, 2008). Yet, up to this time, a rigorous mathematical foundation is still lacking. Under this condition, progresses are still empirically based. An introduction and some of the recent advances on Ensemble Empirical Mode Decomposition (Wu and Huang, 2009), Instantaneous frequency and trend computations, Time-dependent Intrinsic Correlation (Chen,
137
138
N. E. Huang
2010) and its extension to multi-dimensional data (Wu et al. 2009) are summarized and briefly discussed. These advances have made the HHT method much more robust and mature.
1. Introduction Data analysis is indispensable to every science and engineering endeavor. In fact collecting and analyzing data; synthesizing and theorizing on the analyzed results are the core activities of scientific studies. As a result, data analyzing is a key link in this continuous loop of these activities. As Poincaré had famously put it: Science is built up of facts, as a house is built of stones; but an accumulation of facts is no more a science than a heap of stones is a house. Here ‘facts’ are in fact data. Yet, just data are still useless. Undigested or un-analyzed data are just like a heap of stones. To convert the facts as part of the edifice of science, we have to analyze, synthesize and even theorize on the data. The logic of this argument seems so plain, yet our present practice is anything but logical. Instead of analyzing data, we process them. There are fundamental differences between data processing and data analysis. All the data processing methods are based on well established algorithms, which are developed through rigorous mathematics approaches. To be rigorous, we are forced to idealize the real physical conditions consummating with the mathematical assumptions used in establishing the algorithms. Unfortunately, the physical conditions are never ideal. Indeed, Einstein had once stated: ‘As far as the laws of mathematics refer to reality, they are not certain; and as far as they are certain, they do not refer to reality.’ In pursuing mathematical rigor, we are forced to live in a pseudo-real world, in which all processes are idealized to conform to mathematician’s
An Introduction to an Adaptive Data Analysis Method
139
conditions and requirements. There is always a conflict between reality and the ideal world. The most prevailing assumptions in the data processing algorithms are linearity and stationarity. All the statistical and probabilistic studies are based on these assumptions. Yet, most physical processes, either in sciences or engineering studies, are both nonstationary and nonlinear. As research becomes increasingly sophisticated, the inadequacies of the traditional data analysis methods based on the linear and stationary assumptions have become glaringly clear; consequently, the algorithms have become woefully inadequate. The only viable way is to break away from the traditional limitations and establish a new paradigm in the data analysis method. Take the powerful spectral analysis as an example. The foundation of the traditional spectral analysis methods were built on a priori bases, be it Fourier or wavelet analyses. Indeed an a priori basis is needed for a rigorous analysis. Unfortunately, the a priori basis would exact a heavy toll on the methodology: Once the a priori basis is chosen, the results produced from the processing would be limited to a simple convolution computation or an integral transform. The consequences are the uncertainty on time-frequency resolutions and spurious harmonics. To overcome these inherited limitations, we have to explore new approaches, for example to use adaptive basis; i.e., with the basis founded on and derived from the data. This approach is still in its infancy. The general mathematical frame work has not been established. For lack of rigorous mathematical foundation, this approach is operated empirically. Yet some preliminary results had amply demonstrated the usefulness of this approach to analyze data from nonlinear and nonstationary processes. In this paper, an empirically based data analysis will be introduced. Some simple applications will be presented also, which will be followed by a general discussion and a sketch for the future developments. 2. The Hilbert-Huang Transform The Hilbert-Huang Transform (HHT) is the NASA designated name for the combination of Empirical Mode Decomposition (EMD) and Hilbert Spectral Analysis (HSA) methods developed recently by Huang et al.
140
N. E. Huang
(1996, 1998, 1999, 2003 and 2010) and Wu and Huang (2009). The method is adaptive, and specifically for analyzing data from nonlinear and nonstationary processes. The motivations for HHT are obvious: First, the natural physical processes are mostly nonlinear and nonstationary. The available methods are either for linear but nonstationary (such as the wavelet analysis, Wagner-Ville, and various short-time Fourier spectrograms as summarized by Priestley (1988), Cohen (1995), Daubechies (1992) and Flandrin (1999)), or nonlinear but stationary and statistically deterministic processes (such as the various phase plane representations and time-delayed imbedded methods as summarized by Tong (1990), Diks (1997) and Kantz and Schreiber (1997)). Even on the more recent treatment on nonlinear time series analysis by Fan and Yao (2005), the treatment is limited to deal with stochastic properties of the time series when they were consisted of various ARIMA models. Second, the nonlinear processes need special treatment. Other than periodicity, we want to learn the detailed dynamics in the processes from the data. One of the typical characteristics of nonlinear processes, proposed by Huang et al. (1998), is the intra-wave frequency modulation, which indicates that the instantaneous frequency would change within one oscillation cycle. Let us examine a very simple nonlinear system given by the non-dissipative Duffing equation as d2x + x + ε x 3 = γ cos ω t . dt 2
(1)
This equation could be re-written as d2x + x 1 + ε x 2 = γ cos ω t . dt 2
(
)
(2)
Then the quantity within the parenthesis in equation (2) can be regarded as a variable spring constant, or a variable pendulum length. With this view, we can see that the frequency should be changing from location to location, and time to time, even within one oscillation cycle. Indeed with ε set at -1, the virtue pendulum would have its length shortening as it swings away from the central equilibrium position. Therefore, the period would be longer near x = 0, and shorter near x at
An Introduction to an Adaptive Data Analysis Method
141
Fig. 1. The solution of Duffing equation with ε = -1.
extrema positions. The numerical solution for the equation is shown in Figure 1. As pointed out by Huang et al. (1998), this intra-frequency frequency variation is the hallmark of nonlinear systems. In the past, the best we could do with these distorted wave data was to represent it with harmonics; hence the term harmonic distortion. Harmonic distortion is the result of perturbation analysis, which is obtained by imposing a linear structure on a nonlinear system. Those harmonics only have mathematical meaning: to fit the profiles of a distorted wave; they have no physical meaning. Take deep water surface waves for example. The solution based on small perturbation was derived by Stokes in 1847 with many harmonic terms. As the solution was based on the assumption of a wave with permanent form, all the harmonics must propagate at the same phase velocity as the fundamental, an impossible physical condition for the harmonics to meet, for water waves are dispersive: the wave speed is a function of frequency or wave number. Therefore, physically, the higher harmonics should lag behind the fundamental. The physically meaningful way to describe the system should be in terms of the instantaneous frequency as discussed by Huang et al. (1998, 2009). To illustrate the effect of instantaneous frequency, let us consider the mathematical model as in Huang (1998):
x (t ) = cos (ω t + ε sin 2ω t ) .
(3)
142
N. E. Huang
Fig. 2. The wave profile for the mathematical model with ε = 0.3.
For ε set at 0.3, we would have the oscillation given in Figure 2. It has been shown by Huang et al (1998) that the wave form given in Equation (3) is equivalent to
ε ε x (t ) ≈ 1 − cos ω t + cos 3ω t + ... , 2 2
(4)
while Equation (3) gives an instantaneous frequency of
Ω (t ) = ω (1 + 2ε cos 2ω t ) .
(5)
Equation (4) gives a series of spurious harmonics. Indeed the pointed crests and troughs of the waves in Figure 2 clearly representing the instantaneous frequency fluctuation. As the instantaneous frequency is defined through a derivative, it is strictly local and can be used to describe the detailed variation of frequency, including the intra-wave frequency variation. The easiest way to compute the instantaneous frequency is by the Hilbert transform. However, as discussed by Huang et al (2009),
An Introduction to an Adaptive Data Analysis Method
143
computing instantaneous frequency through the Hilbert Transform is limited by the Bedrosian (1963) and Nuttall (1966) theorems. As a result, it can only give an approximation. There are many other methods that could give the exact instantaneous frequency. The first is the quadrature method (Huang et al, 2009), which consists of an empirical AM-FM signal separation, such that x (t ) ; x (t ) = a (t ) cos θ (t ) ⇒ cos θ (t ) = (6) a (t ) therefore,
a 2 (t ) − x 2 ( t ) ; a( t )
sin θ (t ) = 1 − cos 2 θ (t ) =
(7) a ( t ) − x (t ) . x( t ) 2
⇒ θ (t ) = arc tan
2
With the phase function given, the instantaneous frequency could be computed easily. An alternative to this approach is proposed by Wu and Huang (2009) by examining the following equation: cos θ i +1 − cos θ i −1 cos(θ i + ∆θ i ) − cos (θ i − ∆θ i ) = sin θ i sin θ i =
[ cos θ i cos ∆θ i − sin θ i sin ∆θ i ] − [ cos θ i cos ∆θ i + sin θ i sin ∆θ i ] sin θ i
(8)
= 2 sin ∆θ i ≈ 2 ∆θ i . Once the ∆θi as a function of time is know, the instantaneous frequency can be computed by dividing it by ∆t. Still another approach is proposed by Hou et al (2009). In this approach, no normalization is required. Let the data be x(t). Using Taylor’s expansion, we can write
x ( t j +1 ) = ( a (t j ) + ∆ t a' (t j ) ) cos (θ (t j ) + ∆ t θ ' (t j ) ) + ... ; x ( t j −1 ) = ( a (t j ) − ∆ t a' (t j ) ) cos (θ (t j ) − ∆t θ ' (t j ) ) + ... . Therefore, we have x (t j +1 ) + x (t j −1 ) = cos ( ∆ t θ ' ( t j ) ) + o( ∆ t ) . 2 x(t j )
(9)
(10)
144
N. E. Huang
Thus, we should have the instantaneous frequency as the derivative of the phase function given as
θ ' (t j ) =
x (t j +1 ) + x (t j −1 ) 1 arccos . 2 x (t j ) ∆t
(11)
No matter which method, we have to reduce the function to the Intrinsic Mode Function (IMF), defined as any function having the same numbers of extrema and zero-crossings or at most differed by 1; moreover, the function should have symmetric envelopes defined by all the maxima and minima separately. To reduce an arbitrary function as a sum of IMFs calls for the Empirical Mode Decomposition (EMD) method, an adaptive decomposition method proposed by Huang et al (1998, 1999). The EMD is implemented through the following steps: For any data, we first identify all the local extrema, and then connect all the local maxima by a cubic spline line as the upper envelope. Then, we repeat the procedure for the local minima to produce the lower envelope. The upper and lower envelopes should cover all the data between them. Their mean is designated as m1,1, and the difference between the data and m1,1 is the first proto-IMF (PIMF) component, h1,1:
x (t ) − m1,1 = h1,1 .
(12)
By construction, this PIMF, h1,1, should satisfy the definition of an IMF, but the change of its reference frame from a rectangular to a curvilinear coordinate can cause anomalies, where multi-extrema between successive zero-crossings could still exist, mostly due to some inflection points having become local extrema. To eliminate such anomalies, the sifting process has to be repeated as many times as necessary to get rid of the riding waves. In the subsequent sifting process steps, h1,1 is treated as the data. With the iteration of the steps, we have h1,1 ( t ) − m1,2 ( t ) = h1,2 (t ) ;
⋅⋅⋅ ⋅⋅⋅ h1,k −1 (t ) − m1,k (t ) = h1,k (t ) ; ⇒ h1,k (t ) = c1 (t ) ,
(13)
An Introduction to an Adaptive Data Analysis Method
145
After each repetition, we will check the results against a ‘stoppage criterion.’ Various criteria had been proposed by Huang et al (1998, 2003), Flandrin (2004) and Wu and Huang (2004), for example. Whenever the stoppage criterion is satisfied, the PIMF is designated as an IMF, c1(t). The ‘stoppage criterion’ actually determines the number of sifting steps to produce an IMF; it is thus of critical importance in a successful implementation of the EMD method. Furthermore, the spline function used is also of critical importance, for x (t ) − m1,1 ( t ) = h1,1 ( t ) ; h1,2 ( t ) = h1,1 ( t ) − m1,2 ( t ) = x ( t ) − ( m1,1 + m1,2 ) ; ⋅⋅⋅ ⋅⋅⋅ h1,k (t ) = h1,k −1 (t ) − m1,k (t ) = x (t ) − ( m1,1 + m1,2 + ... + m1,k ) ;
(14)
⇒ c1 (t ) = x (t ) − ( m1,1 + m1,2 + ... + m1,k ) . Thus, c1(t) is the result of the data minus a sum of spline functions. From Equation (14), we can see that if we subtract the first IMF, c1(t), from the data, the residual is the sum of spline functions. Mathematically, we have
x (t ) − c1 (t ) = r1 (t ) .
(15)
Therefore, the residual, r1(t), contains all the remaining information in the data other than the first IMF, c1(t). We can, therefore, repeat the above operation using r1(t) as the next set of data, and re-iterate the process, we should have r1 ( t ) − c 2 ( t ) = r2 ( t ) ; r2 (t ) − c 3 ( t ) = r3 ( t ) ; ⋅⋅⋅ ⋅⋅⋅
(16)
rn −1 (t ) − c n ( t ) = rn (t ) ; n
⇒ x (t ) −
∑ c (t ) = r ( t ) . j
n
j =1
By summing up Equations (15) and (16), we finally obtain the last equation in (16), which means that the data could be decomposed into a
146
N. E. Huang
sum of IMFs and a residue, rn(t), which can be either a constant, a monotonic mean trend or a curve having only one extremum. Thus, we have succeeded in expanding the given function in terms of IMFs. With this operation, it is easily shown that k1
ci (t ) = ri −1 (t ) − ri (t ) =
∑m
i −1, j
j =1
k2
−
∑m
i, j
(17)
j =1
Thus all IMFs are combinations of spline functions. Therefore, the EMD is critically dependent on the spline function selected. Recent studies by Flandrin et al. (2004) and Wu and Huang (2004) have established that the EMD is a dyadic filter, and it is equivalent to an adaptive wavelet. Being adaptive, we have avoided the pitfalls of using an a priori-defined basis, and also avoided the spurious harmonics that would have resulted, had the a priori basis been employed. As the representation is dyadic, therefore, the maximum number of the IMFs is limited by log2N, with N as the total number of data points; thus, this expansion is a sparse representation. The components of the EMD are usually physically meaningful, if there is no scale mixing, defined as mixed characteristic scales in a single IMF component. To avoid the scale mixing, Wu and Huang (2009) had proposed an Ensemble EMD (EEMD), which is essentially the same EMD procedure, except that the procedure will be repeated n times each with a different white noise added to the data. The procedures are these: a. add a white noise series to the targeted data; b. decompose the data with added white noise into IMFs; c. repeat step 1 and step 2 again and again, but with different white noise series each time; and d. obtain the (ensemble) means of corresponding IMFs of the decompositions as the final result. The critical concept advanced in the EEMD is based on the following observations: a. A collection of white noise will cancel each other out in a time domain ensemble mean; therefore, only the signal can survive and persist in the final noise added signal ensemble mean.
An Introduction to an Adaptive Data Analysis Method
147
b. Finite, not infinitesimal, amplitude white noise is necessary to force the ensemble to exhaust all possible solutions; the finite magnitude noise will make the different scale signals to reside in the corresponding IMF dictated by the dyadic filter banks, and render the resulting ensemble mean more meaningful. c. The true and physically meaningful answer of the EMD is not the set without noise; it should be the ensemble mean of a large number of trials consisted of the noise added signal. This new method proposed in EEMD has utilized all these important statistical characteristics of noise. The critical conclusion here is that the final IMFs obtained is not the one with zero noise, but should be the ensemble mean of infinite many trials each with different perturbation of finite amplitude of noise. The EEMD indeed represents a major improvement over the original EMD. EEMD has fully utilized the statistical characteristics of noise assisted data analysis. As the level of the added noise is not of critical importance, as long as it is of finite amplitude to enable a fair ensemble of all the possibilities, the EEMD can be used without any subjective intervention; thus, it provides a truly adaptive data analysis method. By eliminating the problem of mode mixing, it also produces a set of IMFs that bears the full physical meaning for the signal, and a time-frequency distribution without transition gaps. EMD, with the Ensemble approach, has become less adaptive due to the fixed dyadic window dedicated by the added noise. Yet, the EEMD has become a more robust and mature tool for nonlinear and nonstationary time series analysis.
3. A Significant Product of HHT: The Determination of Trend Other than the decomposition, HHT could help in the determination of the trend and detrending the data set, which are important steps in data analysis. Yet definitions of “trend” tend to be imprecise, and no logical algorithm has been established for extracting the trend; it is similar in determining the regression of a data set (Fan and Yao, 2005). As a result, various ad hoc extrinsic methods have been used to determine the trend,
148
N. E. Huang
or to effect a detrend operation. Such a confused state prompt Stock and Watson (1988) to lament: “one economist’s ‘trend’ can be another’s ‘cycle’.” A new procedure to determine trend based on HHT was proposed by Wu et al. (2007). They first gave the trend a precise definition as: Within the given data span, the trend is an intrinsically fitted monotonic function, or a function in which there can be at most one extremum. This definition is identical to the definition of the residue used in the Empirical Mode Decomposition method (Huang et al., 1996, 1998, 2003), where the residue had been treated as the overall trend. But the concept of the trend given here is far more general. Before proceeding to the details, there are several subtle, but important, points to be clarified. First, the trend should be an intrinsic property of the data; it should be part of the data, and driven by the same mechanisms that generate the observed or measured data. Most of the available methods, however, define trend by using an extrinsic approach, such as using pre-selected simple functional forms. Being intrinsic, therefore, requires that the method used in defining the trend must be adaptive, so that the trend extracted is derived from and based on the data. Second, the trend exists only within a given data span; therefore, it should be local and should be associated with a local length scale in terms of data length. Consequently, the trend can only be valid within that part of the data, which should be shorter than a full local wavelength. If there were two or more extrema of the same sense, the data would share that characteristic with a cycle. Thus the difficulty expressed by Stock and Watson (1988) could be resolved. Unfortunately, in most applications, the trend is taken as a simple straight line. Let us take the global warming phenomenon as an example. This case has been studied by Wu et al (2007). According to the 4th Assessment Report of the Intergovernmental Panel on Climate Change (IPCC, 2007), the global temperature anomaly is given in Figure 3, in which various trends are determined from the annual averaged data.
An Introduction to an Adaptive Data Analysis Method
149
Fig. 3. Instrumental climate data covering the period 1856 to 2007. Various trends and temperature increasing rates are determined for different data lengths: 25 (yellow), 50 (brown), 100 (black) and 150 (red) years.
Based on the slopes of the various trends, IPCC reached this conclusion: “Note that for shorter recent periods, the slope is greater, indicating accelerated warming.” If we use EMD to analyze the original monthly data, we would get 12 different sets of IMFs each representing a specific month of the years such as January, February and so on as shown in Figure 4. Clearly we have a clear trend given by the residue after all the oscillations were removed. From the twelve different monthly results, we could get an annual mean and the scattering range indicating the confidence limit of this decomposition. Having obtained this result, we will consider an alternative of the straight line trends given by IPCC. Instead of drawing the 25-year trend at the end of the data, we can draw 25-year trend centered at any point. The slope values would be a variable now. The result is plotted in Figure 5. Clearly, the running 25-years trends showed a fluctuating
150
N. E. Huang
pattern with the period roughly 65 years based on this study; we concluded that the true warming rate of the climate is only half the rate as determined by the IPCC. Therefore, the recent global warming is partially contributed by natural fluctuation with a 65 years cycle corresponding to the multi-decadal oscillation as suggested by Schlesinger and Ramankutty (1994). IMFs of each downsample 0.5 IMF 1
0 -0.5 0.2
IMF 2
0 -0.2 0.2
IMF 3
0 -0.2 0.2
IMF 4
0
Nonlinear Trend
-0.2
0.5 0 -0.5 1840
1860
1880
1900
1920 1940 Time: Year
1960
1980
2000
2020
Fig. 4. The various IMFs representing each specific month of the year from the original monthly data. We could get the mean and the scattering bound from these results.
151
An Introduction to an Adaptive Data Analysis Method Various Rates of Global Warming 0.025 0.02
Warming Rate: oC/Yr
0.015 0.01 0.005 0 -0.005 -0.01 -0.015 -0.02 1840
1860
1880
1900
1920 1940 Time : Year
1960
1980
2000
2020
Fig. 5. The 25-year trends determined with any 25 period. The thin jaged dashed line gives the running value determined from the annual mean data. The mean of the corresponding slopes determined from the specific monthly data is given as a solid curve line with roughly 70 years cycle and the scattering given by the shaded band. The slopes from the various straight line trends are given by the corresponding colored dashed lines (25-year, 50-year, 100-year and 150-year all in dashed lines), each is roughly the mean values over the time span. The slope from the EMD determined trend is given by the solid line with the scattering band in gray that is only half the value as given by IPCC.
4. Some Applications In this section, we will use two examples to illustrate the usefulness of the adaptive data analysis approach in time series data. The first example demonstrates the prowess of the method to represent chirp signals and the second example demonstrates the application of the method in speech analysis. Finally, we also use a 2-dimensional image to illustrate the Multi-dimensional EEMD.
152
N. E. Huang
Fig. 6. Comparisons among Fourier, wavelet and Hilbert Spectral analysis for a chirp wave. The labels for the prospective figures are these: time axes are from 0 to 1000 and frequency axes are from 0 to 0.01 Hz.
a. The Chirp Data Let us consider the following model equation 2π t t x (t ) = sin , for t=0:1024 . 256 1024
(18)
The wave shape and the analyzed results based on Fourier, Morlet wavelet, and Hilbert spectral analysis are given in Figure 6. The model wave clearly has frequency variation along the time axis. Yet the Fourier spectrum totally missed the temporal variation, for in the analysis both amplitude and frequency are constant over the whole time domain. Wavelet certainly could capture the temporal variation to some
An Introduction to an Adaptive Data Analysis Method
153
degree. Yet, the leakage and fixed basis limit the power of frequency resolution. Only Hilbert spectral analysis could give a clean and crisp frequency variation as a function of time.
b. Speech Signal Analysis Any utterance of speech could be transcribed as time variation of the pressure field. Figure 7 gives the recorded signal when the word, ‘Hello,’ is uttered. The recording was taken at a sampling rate of 22050 Hz. Again, various methods of analyses were used on this set of data. The results are given in Figure 8. Here the top row shows the results from the Fourier Spectrogram. For narrow and wide band representations. The narrow band result, on the left, is computed with a window size of 1024 data points; while the wide band result, on the right, is computed with a window size of 64 data points. The narrow band representation can certainly resolve the
Fig. 7. The signal of the word, ‘Hello,’ recorded at 22050 Hz.
154
N. E. Huang Narrowband Spectrogram Hello 1024
4000
3500
3500
3000
3000
F re q u e n c y : H z
F re q u e n c y: H z
4000
2500 2000 1500
2000 1500 1000
500
500 0
0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.55 Time: second
4000
0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.55 Time: second
Wavelet Spectrum: Hello 5:2048
4000
3500
3500
3000
3000
F re q u e n c y: H z
F re q u e n c y: H z
2500
1000
0
2500 2000 1500
Wideband Spectrogram Hello 64
Hilbert Spectrum: Hello
2500 2000 1500
1000
1000
500
500 0
0 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.55 Time: second
0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.55 Time: second
Fig. 8. Fourier spectrograms, Wavelet and Hilbert spectral analyses for the ‘Hello’ signal. The figures in the top row are the result from Fourier Analysis (narrow band on the left, wide band on the right); the bottom left is Morlet wavelet and the right one is from HHT.
frequency better, but the time location is smeared. The wide band representation can resolve the time location better, but the frequency is smeared. The inability to represent both time and frequency to an arbitrary degree of precision is known as the uncertainty principle. This uncertainty principle is very different from the counter part in physics, known as the Heisenberg Uncertainty Principle, which has full physical meaning. The uncertainty principle here is purely the result of using integral transformation for the frequency determination. It is artificially inflected on the result because an improper analysis method is used.
An Introduction to an Adaptive Data Analysis Method
155
The Wavelet transform seems to be an improvement over the Fourier transform in this representation. Yet, the resolution is still poor. Furthermore, the continuous Wavelet analysis used here is over redundant and not energy conserving. The discrete Wavelet analysis, unfortunately, could not be used to recognize or extract physical features of the data. The solution for the true time-frequency analysis is undoubtedly the Hilbert Spectral analysis. Here the frequency is determined by differentiation; therefore, its value and location is precise. The adaptive basis also enables the frequency to be a function of time and totally eliminated the need of the spurious harmonics. The intra-wave frequency modulation fully represents the nonlinear characteristics of the speech sound production processes.
5. Conclusion HHT is a new method in data analysis. Its power is in the adaptive approach, which results in the adaptive basis. This offers a new and valuable view of nonstationary and nonlinear data analysis methods. With the recent developments on the normalized Hilbert transform, the confidence limit, and the statistical significance test, the Ensemble EEMD and the more faithful instantaneous frequency computation methods have made HHT a more robust tool for data analysis, and it is now ready for a wide variety of applications. The development of HHT, however, is not complete yet. There is still the need for a more rigorous mathematical foundation for the general adaptive methods for data analysis. As any new method in mathematics, such as calculus, it always started from a more or less empirical approach. We hope the final establishment of this method on rigorous and solid mathematical foundation would not keep us wait for too long.
Acknowledgements The author would like to acknowledge the generous help he received from his colleagues, specifically Professors Zhaohua Wu of FSU and Xianyao Chen of FIO, Qingdao, for allowing the author to quote their
156
N. E. Huang
work freely. He would also like to thank the supports by a grant from Federal Highway Administration, DTFH61-08-00028, and grants from NSC, NSC95-2119-M-008-031-MY3, NSC97-2627-B-008-007, and finally a grant from NCU 965941 that have made the conclusion of this study possible.
References 1. Bedrosian, E., On the Quadrature Approximation to the Hilbert Transform of Modulated Signals. Proc. IEEE, 51, 868-869 (1963). 2. Cohen, L., Time-frequency Analysis, 299pp. Prentice Hall, Englewood Cliffs, NJ (1995). 3. Diks, C., Nonlinear Time Series Analysis, 209pp. World Scientific Press, Singapore (1997). 4. Daubechies, I., Ten Lectures on Wavelets, 357pp. Philadelphia SIAM (1992). 5. Fan, J. and Q. Yao, Nonlinear Time Series: Nonparametric and Parametric methods. 551pp. Springer, New York (2005). 6. Flandrin, P, Time-Frequency / Time-Scale Analysis. 386pp. Academic Press, San Diego, CA (1999). 7. Flandrin, P., Rilling, G. and Gonçalves, P., Empirical mode decomposition as a filter bank. IEEE Signal Proc. Lett. 11 (2): 112-114 (2004). 8. Hou, T. Y., M. P. Yan and Z. Wu, A variant of EMD Method for Multi-Scale data. Adv. Adap. Data Analy., 1, 483-516 (2009). 9. Huang N. E., S. R. Long, and Z. Shen, Frequency Downshift in Nonlinear Water Wave Evolution. Advances in Appl. Mech. 32, 59-117 (1996). 10. Huang, N. E., Shen, Z., Long, S. R., Wu, M. C., Shih, S. H., Zheng, Q., Tung, C. C. and Liu, H. H., The empirical mode decomposition method and the Hilbert spectrum for non-stationary time series analysis, Proc. Roy. Soc.London, A454, 903-995 (1998). 11. Huang, N. E., Z. Shen, R. S. Long, A New View of Nonlinear Water Waves – The Hilbert Spectrum, Ann. Rev. Fluid Mech. 31, 417-457 (1999). 12. Huang, N. E., Wu, M. L., Long, S. R., Shen, S. S. P., Qu, W. D., Gloersen, P. and Fan, K. L., A confidence limit for the empirical mode decomposition and the Hilbert spectral analysis, Proc. of Roy. Soc. London, A459, 2317-2345 (2003). 13. Huang, N. E. and N. O. Attoh-Okine, Hilbert-Huang Transforms in Engineering, 313 pp, CRC Taylor and Francis Group (2005). 14. Huang, N. E. and S. S. P. Shen, Hilbert-Huang Transform and Its Applications. Ed. N. E. huang and S. S. P. Shen, 311pp, World Scientific, New Jersey (2005). 15. Huang, N. E. and Z. Wu, A review on Hilbert-Huang transform: Method and its applications to geophysical studies, Rev. Geophys., 46, doi:10.1029/2007RG000228 (2008).
An Introduction to an Adaptive Data Analysis Method
157
16. Huang, N. E., Wu, Z., Long, S. R., Arnold, K. C., Blank, K., Liu, T. W., On Instantaneous Frequency, Adv. Adap. Data Analy. 1,. 177-229 (2009). 17. IPCC, Intergovernmental Panel on Climate Change, The Fourth Assessment Report “Climate Change 2007” (2007). Available http://www.ipcc.ch/ 18. Kantz, H. and T. Schreiber, Nonlinear Time Series Analysis, 304pp. Cambridge University Press, Cambridge (1997). 19. Nuttall, A. H., On the quadrature approximation to the Hilbert Transform of modulated signals, Proceedings of IEEE, 54, 1458-1459 (1996). 20. Priestley, M. B., Nonlinear and nonstationary time series analysis, 237pp. Academic Press, London (1988). 21. Schlesinger, M. E. and N. Ramankutty, An oscillation in the global climate system of period 65-70 years. Nature, 367, 723-726 (1994). 22. Stock, J. H. and M.W. Watson, Testing for Common Trends. J. Am. Statist. Assoc., 83, 1097-1109 (1988). 23. Tong, H., Nonlinear Time Series Analysis, 552pp. Oxford University Press, Oxford (1990). 24. Wu, Z. and Huang, N. E., A study of the characteristics of white noise using the empirical mode decomposition method, Proc. Roy. Soc. London, A460, 1597-1611 (2004). 25. Wu, Z., N. E. Huang, S. R. Long, and C.-K. Peng, On the trend, detrending, and variability of nonlinear and nonstationary time series. Proc. Natl. Acad. Sci., 104, 14,889-14,894 (2007). 26. Wu Z. and N. E. Huang, Ensemble Empirical Mode decomposition: A Noise-Assisted Data Analysis Method. Adv. Adap. Data Analy. 1, 1-42 (2009). 27. Wu Z., N. E. Huang and X. Chen, The Multi-Dimensional Ensemble Empirical Mode Decomposition Method. Adv. Adap. Data Analy. 1, 339-372 (2009).
This page is intentionally left blank
CHAPTER 7 COMPUTATIONAL FLUID DYNAMICS BASED ON THE UNIFIED COORDINATES — AN EXPOSE∗
W. H. Hui Emeritus Professor of Applied Mathematics, University of Waterloo, Canada and Hong Kong University of Science and Technology, Hong Kong E-mail:
[email protected] Computational Fluid Dynamics (CFD) uses large scale numerical computation to solve problems of fluid flow. Traditionally, it uses either the Eulerian or the Lagrangian coordinate system. These two systems are numerically non-equivalent, but each has its advantages as well as drawbacks. A unified coordinate system (UC) has recently been developed which combines the advantages of both Eulerian and Lagrangian systems, while avoiding their drawbacks. This paper gives an expose of the CFD using the unified coordinates. Specifically, it will be shown that: (1) Governing equations of fluid flow in any moving coordinates can be written as a system of closed conservation PDEs; consequently, the effects of moving mesh on the flow are fully accounted for. (2) The system of Lagrangian gas dynamics equations is written in conservation PDE form for the first time. (3) The Lagrangian gas dynamics equations in 2-D and 3-D are shown to be theoretically non-equivalent to the Eulerian ones. (4) The UC is superior to both Eulerian and Lagrangian systems in that contact discontinuities are resolved sharply without mesh tangling. (5) Additionally, the UC is superior to the Eulerian as it avoids the tedious ∗
In Honor of Prof. Y. H. Pao on his 80th Birthday. I first met Professor Y. H. Pao in May 1985 and, at his invitation, spent one semester in 1990 as a Visiting Professor in his Institute of Applied Mechanics, National Taiwan University to begin my new research area: computational fluid dynamics, especially searching for a new coordinate system to do better computation in CFD. It is fitting to celebrate Professor Pao’s 80th birthday by giving an account on what my collaborators and I have achieved in this area since then. 159
160
W. H. Hui
and time-consuming task of mesh generation: the mesh in UC is automatically generated by the flow. Many examples are given to demonstrate these properties of the UC.
1. CFD as Numerical Solution to Nonlinear Hyperbolic PDEs – An Overview of Major Developments The objective of this paper is to give an exposé of the recently developed unified coordinate system (UC) in CFD mostly by the author and his collaborators;13-28,30,32,34-40,42,44,46,47,51 details can be found in the upcoming monograph bearing the same title by W. H. Hui and K. Xu.29 To put it in perspective we shall first give an overview of the major developments of CFD as numerical solution to the initial value problem of nonlinear hyperbolic partial differential equations before discussing the role of coordinates in CFD in Sect. 2. The theoretical foundation was laid by Riemann in his pioneering work (Riemann43, 1862) where he introduced the concept of Riemann invariants and posed the special initial value problem – since been known as the Riemann problem. It turns out that the Riemann problem plays a central role in all numerical methods of CFD. Nothing very significant happened during the following six decades until Richardson proposed weather prediction by numerical process (L. F. Richardson, Cambridge University Press, 1922). Even without an electronic computer, wanting to find numerical solutions to nonlinear hyperbolic PDEs immediately raises many interesting theoretical and practical questions, and progresses are made in answering them. The first of these is the discovery of the CFL condition (Courant, Friedrichs and Lewy5, 1928). It simply says that in a time-marching process to find a numerical solution, marching too fast causes numerical instability and destroys the solution. Practical methods for computing solutions with shock discontinuities are developed: the artificial viscosity method which smears shock discontinuities (von Neumann & Richtmyer49, 1950); the Godunov method which reduces the general initial value problem to a sequence of Riemann problems with cell-averaging data (Godunov9, 1959); the Glimm random choice method which also reduces the general initial
CFD Based on the Unified Coordinates — An Expose
161
value problem to a sequence of Riemann problems but with data of randomly chosen representative states (Glimm8, 1965; Chorin4, 1968); and the shock-fitting method (G. Moretti41, Polytechnic Institute of Brooklyn Report, 1972). The last two methods are not easy to extend the three-dimensional flow. A very important discovery was made by Lax & Wendroff 33 (1960) that in order to numerically capture shock discontinuities correctly, the governing PDE must be written in conservation form to begin with. This is easily done in Eulerian coordinates (in any dimensions) and also for one-dimensional flow in Lagrangian coordinates. But for a long time, it was not known how to use Lagrangian coordinates to write the governing PDEs for multidimensional flows in conservation form. This problem was solved by Hui et al (Ref. 18, 1999). To extend Godunov’s method to higher order accuracy, the important concepts of limiters and TVD were introduced which avoid non-physical oscillations in high resolution schemes (Boris and Book2, 1973; Van Leer48, 1973). From the onset of CFD, it was known that the numerical solution to a given flow depends on the coordinates (mesh) used to compute it; hence great efforts have been devoted to search for the optimum coordinate system: the Particle-in-Cell method (Harlow10, 1955); the ArbitraryLagrangian-Eulerian method (Hirt et al12, 1974); various moving mesh methods (Brackbill and Saltzman3, 1982); and the unified coordinate method (Hui et al18, 1999). To compute a flow past a body, which is the central problem in fluid dynamics, it is necessary to construct a body-fitted mesh prior to computing the flow. Even after decades of research, mesh-generation remains tedious and time-consuming. The unified coordinate approach to CFD has opened up a way of automatic mesh-generation (Hui et al25, 2005). 2. The Role of Coordinates in CFD 2.1. Theoretical Issues For more than 200 years, two coordinate systems have existed for describing fluid flow: The Eulerian system is fixed in space, whereas the
162
W. H. Hui
Lagrangian system follows the fluid. An immediate question is ‘are they equivalent to each other theoretically?’ This question must have been asked by numerous researchers in fluid dynamics, and the answer presumably was positive. Surprisingly, the first mathematical proof of equivalency, meaning the existence of a one-to-one map between the two sets of weak solutions obtained by using the two systems, was given as 50 late as 1987 by Wagner and holds only for one-dimensional flow (Note: in the presence of a vacuum, the definition of weak solution for the Lagrangian equations must be strengthened to admit test functions which are discontinuous at the vacuum). Even more surprisingly, for two- and three-dimensional flows, Hui et al18,20 showed that they are not equivalent to each other theoretically (see Sect. 6). 2.2. Computational Issues Computationally, Eulerian and Lagrangian systems are not equivalent even for one-dimensional (1-D) flow. However, we shall show19,21,22,32 in Sect. 4 that for 1-D flow, the Lagrangian system plus a shock–adaptive Godunov scheme14,34 is superior to the Eulerian system. The situation in two- and three-dimensional flows is more complicated: Eulerian and Lagrangian systems have advantages and drawbacks. In general, the Eulerian method is relatively simple, because the gas dynamics equations can be easily written in conservation PDE form, which provides the theoretical foundation for shock-capturing computation. However, it has two drawbacks: (a) it smears contact discontinuities badly and (b) it needs generating a body-fitted mesh prior to computing flow past a body, but mesh generation is tedious, time-consuming and requires specialized training. The Lagrangian method, by contrast, resolves contact discontinuities (including material interfaces and free surfaces) sharply, because they coincide with Lagrangian coordinates. It, too, has two drawbacks: (a) it may break down due to cell deformation, because a Lagrangian computational cell is literally a fluid particle with finite — though small — size and hence deforms with the fluid and (b) the gas dynamics equations could not be written in conservation partial differential
CFD Based on the Unified Coordinates — An Expose
163
equations (PDE) form, rendering numerical computation complicated. In this regard, we note that as late as 1999 Serre45 stated ‘Writing the equations of gas dynamics in Lagrangian coordinates is very complicated if (dimension) D ≥ 2’. Interestingly enough, it was shown18 in the same year that using the unified coordinates it is quite easy to derive the Lagrangian gas dynamics equations in conservation PDE form (see Sect. 6). This motivates us to search for an ‘optimal’ coordinate system, leading to the unified coordinate system. 2.3. The “Optimal Coordinate System” Can we have a coordinate system that combines the advantages of Eulerian and Lagrangian systems, while avoiding their drawbacks? Such a system would be ‘optimal’ in some sense (whether or not a system is optimal depends on the criteria, which are necessarily subjective). Specifically, we want the system to possess the following properties for compressible flow computation: (a) Conservation PDE form exists, as in Eulerian; (b) Contact discontinuities are sharply resolved, as in Lagrangian; (c) Mesh can be automatically generated to fit given body shapes; (d) Mesh is orthogonal; (e) …… The unified coordinate system satisfies these requirements as shown in Sects. 4 to 7. 3. The Unified Coordinate System We introduce arbitrary coordinates, (λ , ξ ,η , ζ ) , via a transformation from Cartesian (t , x, y , z ) as follows: dt = dλ , dx = Udλ + Adξ + Ldη + Pdζ , dy = Vdλ + Bdξ + Mdη + Qdζ , dz = Wdλ + Cdξ + Ndη + Rdζ .
(1)
164
W. H. Hui
For (1), we request ξ DQ η = 0 , Dt ς
where
DQ
Dt
≡
∂ + Q ⋅ ∇x . ∂t
(2)
So the coordinates (ξ ,η , ζ ) , and hence the computational cells, move with velocity Q = (U , V , W ). It should be noted that the time transformation in (1) is not trivial: while both t and λ are physical time, they are not equal to each other. For instance, in the Lagrangian case (see Eqs. (10) and (8) below), t is Eulerian time while λ is Lagrangian time. On the other hand, for 2-D steady flow, (1) reduces to31 dx = hu dλ + Adξ , dy = hv dλ + Bdξ
(1’)
where q = (u, v ) is the fluid velocity of two-dimensional flow and h is arbitrary. In this case, the Eulerian time t disappears by definition of steady flow, but λ still exists, representing the time travelled by the fluid particle along the streamline ξ = const. There are two special cases: Eulerian when Q = 0 and Lagrangian when Q = q . In the general case, we have a coordinate system with three degrees of freedom: U , V and W are arbitrary. On the other hand, the nine coefficients A, B,…, R in the transformation are not arbitrary, but must satisfy a set of compatibility conditions for dx, dy and dz to be total differentials. These conditions are:
∂A ∂U ∂L ∂U ∂P ∂U = , = , = , ∂λ ∂ξ ∂λ ∂η ∂λ ∂ζ ∂B ∂V ∂M ∂V ∂Q ∂V = , = , = , ∂λ ∂ξ ∂λ ∂η ∂λ ∂ζ ∂C ∂W ∂N ∂W ∂R ∂W = , = , = . ∂λ ∂ξ ∂λ ∂η ∂λ ∂ζ
(3a)
∂A ∂L = , ∂η ∂ξ ∂B ∂M = , ∂η ∂ξ ∂C ∂N = , ∂η ∂ξ
(3b)
∂A ∂P = , ∂ζ ∂ξ ∂B ∂Q = , ∂ζ ∂ξ ∂C ∂R = , ∂ζ ∂ξ
∂L ∂P = , ∂ζ ∂η ∂M ∂Q = , ∂ζ ∂η ∂N ∂R = . ∂ζ ∂η
CFD Based on the Unified Coordinates — An Expose
165
We note that of the eighteen conditions in (3), only nine are independent. We shall take the first nine conditions, (3a), which all involve λ-derivatives and are called time-evolution (also called geometric conservation laws), to be the independent conditions; the remaining nine, called differential constraints, then hold for all time provided they do initially. The coordinates (λ , ξ ,η , ζ ) are called unified in the sense that (a) they combine the geometric conservation laws with the physical conservation laws; and (b) they unify the Lagrangian and Eulerian coordinates (including ALE, but simpler). One way to choose the mesh-moving velocity Q = (U ,V ,W ) is as follows:
1-D
2-D 3-D
Dq ξ
Dt Dq ξ
Dt Dq ξ
Dt Dqη
Dt
=0
thus U = u , but A is arbitrary (in the classical Lagrangian case: U = u , A = 1 / ρ ), plus adaptive Godunov scheme (Lepage & Hui34, 1995).
=0
plus mesh-angle (hence orthogonality) preserving.
=0
plus mesh-skewness preserving.
=0
Here q is the fluid velocity, u is its x -component, and ρ is the fluid density. With the above choice for the mesh-moving velocity (U ,V , W ) , we see that (a) UC is a generalization of Lagrangian coordinates; (b) Since ξ is a material coordinate, UC resolves contacts sharply without mesh tangling; (c) ξ is a level-set function, as it is a material function.
4. One-Dimensional Flow It is shown32 that UCs (Lagrangian coordinates plus shock-adaptive Godunov scheme34) are superior to the Lagrangian system which, in turn, is superior to the Eulerian system.
166
W. H. Hui
We begin with the Euler equations of gas dynamics in Cartesian coordinates (t , x ) for a γ-law gas ρu ρ ∂ ∂ (4a) ρu + ρ u 2 + p = 0, ∂t ρe
e=
∂x u ( ρ e + p )
1 2 1 p u + . 2 γ −1 ρ
(4b)
Here u is velocity, p pressure, ρ density and e specific total energy. System (4) is hyperbolic and in conservation form. We transform (4) to the Lagrangian coordinates (λ , ξ ) by dt = dλ , dx = udλ + 1 dξ , ρ
(5)
to get ∂ ∂λ
1 ρ ∂ u + ∂ξ e
−u p = 0. up
(6)
Example 1. A Riemann problem System (6) is also hyperbolic and in conservation form, and Wagner50 showed that weak solutions to (4) and to (6) are equivalent theoretically. However, they give different numerical solutions, as demonstrated in Figs. 1a and 1b19,21 for a Riemann problem, whose solution consists of two shocks and a contact discontinuity in between. It is seen that, while shock resolutions are similar, Lagrangian computation resolves the contact much better than Eulerian. This is because a contact line is a material line and hence coincides with the Lagrangian coordinate, whereas it does not coincide with the Eulerian coordinate. The error near the contact discontinuity (Fig. 1b) in Lagrangian computation can be remedied using the shock-adaptive Godunov scheme34 instead of the classical Godunov scheme. In the shock-adaptive
CFD Based on the Unified Coordinates — An Expose
167
A Riemann Problem (Godunov-MUSCL scheme) (a)
(Eulerian)
ρ
(b)
(Lagrangian)
ρ
(c)
(UC)
ρ
Fig. 1. Numerical solutions to a Riemann problem. Solid line: exact, dots: computed. (a) Eulerian, (b) Lagrangian, (c) UC (Lagrangian with shock-adaptive Godunov scheme).
Godunov scheme shocks are fitted, using the Riemann solution with no extra cost, so we can replace the energy conservation equation by the entropy conservation equation which holds in the smooth flow regions. Accordingly, (6) becomes 1 − u ∂ ρ ∂ u + p = 0. ∂λ p ∂ξ γ 0 ρ
(7)
168
W. H. Hui
Computations using (7) resolve both shock and contact discontinuities sharply, as shown in Fig. 1c. Indeed, UC can cure all known defects of Eulerian and Lagrangian shock-capturing methods32, namely: (a) contact smearing; (b) slow moving shocks; (c) sonic-point glitch; (d) wall-overheating†; † (e) start-up errors ; † (f) low-pressure flow ; (g) strong rarefaction waves†. We therefore conclude that UC computation is superior to both Lagrangian and Eulerian computation, and is numerically optimal for 1-D flow. Furthermore, UC reduces the mathematical problem of 2-D steady supersonic flow to that of 1-D unsteady flow by marching in the flow direction (instead of time), so it is also numerically optimal for that flow.
5. Multi-Dimensional Flow For simplicity, we consider 2-D flow, for which the Euler equations of gas dynamics in Eulerian coordinates are ∂E ∂F ∂G + + = 0, ∂t ∂x ∂y
where ρv ρu ρ 2 ρuv ρu + p ρ u E = , F = ρuv , G = ρv 2 + p . ρv ρu e + p ρv e + p ρe ρ ρ
†
also shared in Lagrangian computation.
(8)
CFD Based on the Unified Coordinates — An Expose
169
Under transformation (1), we obtain ∂ E ∂ F ∂G + + = 0, ∂λ ∂ξ ∂η
(9)
where ρX ρY ρJ Ju Xu pM Yu pB + − ρ ρ ρ ρJv ρXv − pL ρYv + pA ρJe ρXe + p(uM − vL ) ρYe + p(vA − uB ) E= , F = , G = 0 −U A B 0 −V 0 −U L M 0 V −
with J = AM – BL, X = (u - U )M – (v - V )L and Y = (v – V )A – (u – U )B. The first four equations of (9) are the physical conservation laws and the last four the geometric conservation laws. We note that this system of equations is: (a) a closed system of PDE in conservation form, and (b) hyperbolic in λ, except for the special case of Lagrangian coordinates when U = u and V = v, which will be discussed in Sect. 6.
Remarks: The consequences of having a closed system of governing equations in conservation form are: • Effects of moving mesh on the flow are fully accounted for, making the UC superior to other existing moving mesh methods. • System (9) can be solved as easily as the Eulerian system with one extra work: at each time step the mesh velocity (U, V ) is either given or computed. 6. Lagrangian Case For U = u and V = v, we get ∂ E ∂ F ∂G + + = 0, ∂λ ∂ξ ∂η
(10)
170
W. H. Hui
where 0 0 ρJ pM ρJu − pB ρJv − pL pA ρJe p(uM − vL ) p (vA − uB ) E= , F = , G = . −u 0 A B −v 0 −u 0 L M v − 0
Remarks: • The gas dynamics equations in Lagrangian coordinates are here written in conservation form for the first time18. • It thus provides a foundation for developing Lagrangian schemes as moving mesh schemes6,7. • The system of Lagrangian gas dynamics equations is only weakly hyperbolic18: all eigenvalues are real, but there is no complete set of linearly independent eigenvectors. This is also true for the 3-D case20. Hence, the Lagrangian system of coordinates is not equivalent to the Eulerian system. 7. Automatic Mesh-Generation As one of its coordinates is a material coordinate, the UC provides the basis for automatic mesh-generation for flow past a body. We shall illustrate the computational procedure via an example. In all examples below γ = 1.4 is used, unless otherwise stated.
Example 2. Mach 0.8 steady flow past a NACA 0012 airfoil28 We compute the solution to equations (9) by time marching in λ. The computation procedure is as follows: Initialization stage — automatic generation of body-fitted mesh in a computational window.
CFD Based on the Unified Coordinates — An Expose
171
Given the grid sizes, ∆x and ∆y , and the number of cells, M × N , in the window, a flow-chart of the computations looks as follows: (1) Begin with a column of N orthogonal cells, representing the given uniform flow in the x-direction (Fig. 2a). This gives the initial values of ( A, B, M , L ) = (1, 0, 0,1) . We also take (U ,V ) = (u, v ) initially. (2) Compute the solution to (9) by marching in time λ using operator splitting: splitting into two 1-D systems in λξ and λη . Each of them is solved using the standard Godunov/ MUSCL scheme with minmod limiter. (Details are given in27). After one time step ∆λ , this column of cells moves to the right by a distance equal to U∆λ . (3) After several time steps when the initial column of cells has moved to the right by a distance equal to ∆x , add one new column of cells on the left that is identical to the initial column. (4) Repeat this process of adding cell columns on the left of the computational region until the leading column of cells meets the body surface and afterwards (Fig. 2b), we then impose the boundary condition of zero normal relative fluid velocity on the body surface by solving a boundary Riemann problem. (5) Continue this process until after the columns of cells cover the whole body surface and further downstream, when we have M columns of cells in the window (Fig. 2c). This completes the initialization stage, and we now have an airfoil-fitted mesh and a flow field around it in the window of M × N cells. The flow-generated mesh (Fig. 2d) is seen body-fitted and orthogonal, as predicted. It is also fairly uniform in the x -direction. The flow field computed (Fig. 2e) so far is, however, only a rough approximation to the correct one, partly because it has not reached the steady state and partly because the downstream boundary condition used in the transient times, e.g., in Fig. 2b, are obviously inaccurate as the computational regions at those times have not yet reached the full window. To compute further, one could use the body-fitted orthogonal mesh generated so far to perform an Eulerian computation with the associated flow field as an initial solution. This can be easily done by
172
W. H. Hui
Fig. 2. Mach 0.8 flow past a NACA 0012 airfoil at zero angle of attack. (a) – (c) flow-generated meshes, (d) close view of preliminary mesh, (e) preliminary surface pressure, (f) close view of final mesh, (g) final surface pressure.
CFD Based on the Unified Coordinates — An Expose
173
putting U = V = 0 in the subsequent iterations towards a steady state. In this way, the UC approach plays the role of mesh generation for Eulerian computation. An alternative and better way is to continue the UC computation in the Main stage — iteration with flow-adjusted mesh (6) To iterate the computation towards a steady state, whenever we add a new column of cells on the left we also simultaneously delete the right-most column of cells from the computation window, thus keeping the window approximately in the same size. To improve the solution, we may also use26 the information of the flow field at every time step, e.g., the surface pressure gradient, so that the mesh is refined in regions of high pressure gradient (Fig. 2f).
Example 3 Figure 3 shows a sample computation28 for a Mach 2.2 steady flow of air-SF6 past a NACA 0012 airfoil at an angle of attack of 8° using the procedure explained above. It is seen that without any special treatment, the interface between the two fluids is sharply resolved.
Example 4 This is an unsteady supersonic flow, M ∞ = 3.0 , past a diamond-shaped airfoil with 10° vertex angle which is oscillating about its vertex according to θ = 2° sin 30t , where θ is the instantaneous pitching angle. In Eulerian computation it is necessary to generate a body-fitted mesh at every time step, which is time-consuming. Mesh-generation at every time step is completely avoided using the unified coordinate 25 approach. The flow-generated meshes at different times, computed using UC, are plotted in Fig. 4. It is seen that at all times of the oscillation, the flow-generated meshes are body-fitted and are almost orthogonal, as predicted, although the mesh on the expansion side of the airfoil is coarsened.
174
W. H. Hui
(a)
(b)
(c)
(d)
(e)
(f)
(g)
Fig. 3. Mach 2.2 flow of air-SF6 over a NACA 0012 airfoil at 8° angle of attack. Flow-generated meshes and density (and pressure) contours at different times are shown here.
175
CFD Based on the Unified Coordinates — An Expose
t=
t =0
t=
6 T 10
t = 10T
3 t = (10 + )T 4
1 T 10
t=
t=
8 T 10
1 t = (10 + )T 4
3 T 10
t =T
2 t = (10 + )T 4
t = 11T
Fig. 4. Flow-generated meshes for oscillating diamond-shape airfoil. Apex angle = 10°, pitching motion about the apex: θ(t) = 2°sin(2πt/T ), the period of oscillation T = 2π/30, free stream Mach number M = 3.0.
176
W. H. Hui
Fig. 5. Fixed computational mesh around a rectangular plate.
(b)
Fig. 6. Trajectories of the falling rectangular plate. (a) Computed trajectory, (b) The tumbling phase: from experiment (black) and as computed (red).
CFD Based on the Unified Coordinates — An Expose
177
8. Aerodynamics of Falling Leaves This is the problem of unsteady incompressible aerodynamics of a freely falling leave. There is rich dynamic behavior, such as fluttering and tumbling (Refs. 1, 31). It is a problem of interaction between the unsteady motion of the leave and the air flow around it, and is rather cumbersome to compute using the Eulerian coordinate system. Numerical and experimental studies have been presented in Ref. 1 and, more recently, 31 Jin and Xu have elegantly used the unified coordinate formulation, 52 together with a gas-kinetic BGK solver , to obtain very accurate results. In Ref. 31 the computational mesh is fixed rigidly with the leave (a plate) (Fig. 5), hence the mesh velocity is determined according to the translational and rotational velocity of the plate which, in turn, is
Fig. 7. Computed vortex fields of a falling rectangular plate at four instants during a full rotation.
178
W. H. Hui
determined by the unsteady aerodynamic forces and the gravitational force according to Newton’s laws of motion. In the UC formulation, Eqs. in (9), the effects of the mesh movement are fully accounted for in Eqs. (3) as pointed out earlier, while the effects of viscosity are fully included in the gas-kinetic BGK solver. Computed trajectories of the falling rectangular plate are plotted in Fig. 6b and are seen to agree well 1 with the experimental measurements in . The computed vortex fields of the same motion are also given in Fig. 7, showing the complex vortex shedding of a falling rectangular plate at four different instants during a full rotation. 9. Conclusions (1) For 1-D flow, UC is superior to Eulerian and Lagrangian systems and completely satisfactory. Also, with UC, 2-D steady supersonic flow computation is reduced to that of 1-D unsteady flow, hence is also completely satisfactory. (2) The governing equations in any moving coordinates can be written as a system of closed conservation PDE; consequently, effects of moving mesh on the flow are fully accounted for (hence it is better than any moving-mesh method). (3) The Lagrangian system of equations of gas dynamics is written in conservation PDE form for the first time, thus providing a foundation for developing Lagrangian schemes as moving mesh schemes. (4) The Lagrangian system of gas dynamics equations in 2-D and 3-D are shown to be only weakly hyperbolic, in contrast to the Eulerian system which is fully hyperbolic; hence the two systems are not equivalent to each other. (5) The UCs retain the advantages of the Lagrangian system: contact discontinuities are resolved sharply. (6) Finally, in using the UC there is no need to generate a body-fitted mesh prior to computing flow past a body: the mesh is automatically generated by the flow.
CFD Based on the Unified Coordinates — An Expose
179
References 1. Andersen, A., Pesavento, U. and Wang, Z.J., “Unsteady aerodynamics of fluttering and tumbling plates.” J. Fluid Mech., 541, 65 (2005). 2. Boris, J.P. and Book, D.L., “Flux corrected transport I, SHASTA, a fluid transport algorithm that works.” J. Comp. Phys., 11, 38 (1973). 3. Brackbill, J.U. and Saltzman, J.S., “Adaptive zoning for singular problems in two dimensions.” J. Comput. Phys., 46, 342 (1982). 4. Chorin, A.J., “Random choice solution of hyperbolic systems.” J. Comput. Phys., 22, 517 (1968). 5. Courant, R., Friedrichs, K.O. and Lewy, H., Physik. Math. Ann., 100, 32 (1928). 6. Despres, B. and Mazeran, C., “Symmetrization of Lagrangian gas dynamics in dimensions two and multidimensional solvers.” Comptes Rendus (Mecanique), 331, 475 (2003). 7. Despres, B. and Mazeran, C., “Lagrangian gas dynamics in two dimensions and Lagrangian systems.” Arch. Rational Mech. Anal., 178, 327 (2005). 8. Glimm, J., “Solutions in the large for nonlinear hyperbolic systems of equations.” Comm. Pure and Appl. Math, 18, 697 (1965). 9. Godunov, S.K., “Difference method of numerical computations of discontinuous solutions in hydrodynamic equations.” Math. Sbornik, 47, 271 (1959). 10. Harlow, F.H., Los Alamos Scientific Laboratory Report, LAMS-1956, (1955). 11. Hafez, M.M., Osher, S. and Whitlow, W., “Improved finite different schemes for transonic potential calculations.” AIAA Paper 84-0092, AIAA 22nd Aerospace Sciences Meeting, (1984). 12. Hirt, C.W., Amsden, A.A., and Cook, J.L., “An arbitrary Lagrangian-Eulerian computing method for all flow speeds.” J. Comput. Phys., 14, 227 (1974). 13. Hui, W.H., and Loh, C.Y., “A new Lagrangian method for steady supersonic flow computation, Part II: slip-line resolution.” J. Comput. Phys., 103, 450 (1992). 14. Hui, W.H., and Loh, C.Y., “A new Lagrangian method for steady supersonic flow computation, Part III: strong shocks.” J. of Comput. Phys., 103, 465 (1992). 15. Hui, W.H., and Zhao, Y.C., “A generalized Lagrangian method for solving the Euler equations.” in “Nonlinear Hyperbolic Problems: Theoretical, Applied, and Computational Aspects”, (Eds.) Donato, A. and Oliveri, F., Vieweg, (1993), p336. 16. Hui, W.H., and Chu, D.L., “Optimal grid for the steady Euler equations.” Comput. Fluid Dyn. J., 4, 403 (1996). 17. Hui, W.H., and He, Y., “Hyperbolicity and optimal coordinates of the three-dimensional steady Euler equations.” SIAM J. Appl. Maths., 57, 893 (1997). 18. Hui, W.H., Li, P.Y., and Li, Z.W., “A unified coordinate system for solving the two-dimensional Euler equations.” J. Comput. Phys., 153, 596 (1999). 19. Hui, W.H., and Koudriakov, S., “Role of coordinates in the computation of discontinuities in one-dimensional flow.” Comput. Fluid Dyn. J., 8, 495 (2000).
180
W. H. Hui
20. Hui, W.H., and Kudriakov, S., “A unified coordinate system for solving the three-dimensional Euler equations.” J. Comput. Phys., 172, 235 (2001). 21. Hui, W.H., and Kudriakov, S., “On contact overheating and other computational difficulties of shock-capturing methods.” Comput. Fluid Dyn. J., 10, 192 (2001). 22. Hui, W.H., and Kudriakov, S., “Computation of shallow water waves using the unified coordinates.” SIAM J. Sci. Comput., 23, 1615 (2002). 23. Hui W.H. and Zhao, G.P., “Capturing contact discontinuities using the unified coordinates.” in “Proceedings of the 2nd MIT Conference on Computational Fluid and Solid Mechanics”, (Ed: K J Bathe) vol. 2, (2003), p 2000. 24. Hui, W.H., Wu, Z.N. and Gao, B., “Preliminary extension of the unified coordinate approach to computation of viscous flows.” accepted by J. Sci. Comput., in print, (2006). See also Gao, B. and Wu, Z.N., Physically related coordinate system for compressible flow, Modern Phys. Letters, B19, 1455 (2005). 25. Hui, W.H., Zhao, G.P., Hu, J.J. and Zheng, Y., “Flow-generated-grids — gridless computation using the unified coordinates.” “Proceedings of the 9th International Conference on Numerical Grid Generation”, San Jose, CA, 11-18 June, (2005), p 111. 26. Hui, W.H. and Hu, J.J., “Space-marching gridless computation of steady supersonic/hypersonic Flow.” Int’l J. of Comput. Fluid Dyn., 20, 55 (2006). 27. Hui, W.H., “The unified coordinate system in computational fluid dynamics, Review Article.” Communications in Computational Physics, 2, 577 (2007). 28. Hui, W.H., Hu, J.J. and Shyue, K.M., “Role of coordinates in Computational Fluid Dynamics.” Proceedings of the 14th Canadian CFD Conference, Session # 6, Queen’s University, Kingston ON Canada, 16-18 July, 2006. To be published in the Int’l J. of Comput. Fluid Dyn., (2007). 29. Hui, W.H. and Xu, K., “Computational Fluid Dynamics based on the Unified Coordinates.” Springer, (2010). 30. Jia, P., Jiang, S. and Zhao, G.P., “Two-dimensional compressible multi-material flow calculations in a unified coordinate system.” Computers & Fluids, 35, 168 (2006). 31. Jin, C. and Xu, K., “A unified moving grid gas-kinetic method in Eulerian space for viscous flow computation.” J. Comput. Phys., 218, 68 (2006). 32. Kudriakov, S. and Hui, W.H., “On a new defect of shock-capturing methods.” Journal of Computational Physics, 227, 2105 (2008). 33. Lax, P.D. and Wendroff, B., “Systems of conservation laws.” Comm.Pure Appl. Math., 13, 217 (1960). 34. Lepage, C.Y., and Hui, W.H., “A shock-adaptive Godunov scheme based on the generalized Lagrangian formulation.” J. Comput. Phys., 122, 291 (1995). 35. Liou, M.S., “An extended Lagrangian method.” J. Comput. Phys., 118, 294 (1995). 36. Loh, C.Y., and Hui, W.H., “A new Lagrangian method for steady supersonic flow computation Part I: Godunov scheme.” J. Comput Phys., 89, 207 (1990).
CFD Based on the Unified Coordinates — An Expose
181
37. Loh, C.Y. and Liou, M.S., “A new Lagrangian method for solving the 2-D steady flow equations for real gas.” J. Comput. Phys., 104, 150 (1993). 38. Loh, C.Y. and Liou, M.S., “A new Lagrangian method for three-dimensional steady supersonic flows.” J. Comput. Phys., 113, 224 (1994). 39. Loh, C.Y., Liou, M.S. and Hui, W.H., “An investigation of random choice method for 3-D steady supersonic flow.” Int’l J. for Numer. Methods in Fluids., 29, 97 (1999). 40. Loh, C.Y. and Hui, W.H., “A new Lagrangian method for time-dependent inviscid flow computation.” SIAM J. on Sci. Comput., 22, 330 (2001). 41. Moretti, G., “Thoughts and Afterthoughts About Shock Computations.” Polytechnic Institute of Brooklyn PIBAL Report, 72 (Dec. 1972). 42. Niu, Y.Y., Lin, Y.H., Hui, W.H. and Chang, C.C., “Development of a moving artificial compressibility solver on unified coordinates.” Int’l J. for Numerical Methods in Fluids, 65, 1029 (2009). 43. Riemann, G.F.B. “Ueber die Fortpflanzung ebener Luftwellen von endlicher Schwingungsweite”, Abh. König. Gesell. Wiss. Göttingen, 8, 43 (1860). 44. Rogers, C., Schief, W.K. and Hui, W.H., On complex-lamellar motion of a Prim gas, J. Math Anal. Appl., 266, 55 (2002). 45. Serre, D., Systems of Conservation Laws, Cambridge University Press, (1999). 46. So, R.M.C., Liu, Y. and Lai, Y.G., “Mesh shape preservation for flow-induced vibration problems.” Journal of Fluids and Structures, 18, 287 (2003). 47. Tai, Y.C. and Kuo, C.Y., “A new model of granular flows over general topography with erosion and deposition.” Acta Mech., 199, 71 (2008). 48. Van Leer, B., “Towards the ultimate conservative difference scheme I. The quest of monotonicity.” Springer Lecture Notes in Physics, 18, 163 (1773). 49. von Neumann, J. and Richtmyer, R.D., “A method for the numerical calculation of hydrodynamic shocks.” J. Appl. Phys., 21, 232 (1950). 50. Wagner, D.H., “Equivalence of Euler and Lagrangian equations of gas dynamics for weak solutions.” J. Differential Equations, 68, 118 (1987). 51. Wu, Z.N., “A note on the unified coordinate system for computing shock waves.” J. Comput Phys., 180, 110 (2002). 52. Xu, K., “A gas-kinetic BGK scheme for the Navier-Stokes equations and its connection with artificial dissipation and Godunov method.” J. Comput. Phys., 171, 289 (2001).
This page is intentionally left blank
CHAPTER 8 TOWARDS GREEN’S FUNCTION RETRIEVAL FROM IMPERFECTLY PARTITIONED AMBIENT WAVE FIELDS: TRAVEL TIMES, ATTENUATIONS, SPECIFIC INTENSITIES, AND SCATTERING STRENGTHS
Richard L. Weaver Department of Physics, University of Illinois at Urbana-Champaign 1110 W. Green St., Urbana Illinois 61801, USA E-mail:
[email protected] Recent theorems have established that fully diffuse wave fields permit, on cross-correlating the noisy signal detected at different positions, retrieval of the Green’s function, and in particular, retrieval of wave travel times. Application to seismology has led to the creation of maps of seismic surface wave velocity with exceptional resolution. What is perhaps perplexing is that the ambient seismic noise fields that are used for this are rarely fully diffuse; the conditions for the theorems are not satisfied. It is shown here that this may be understood theoretically by a simple asymptotic argument. It is further shown that incompletely diffuse noise fields may permit us to robustly recover more than just travel times, but also ray arrival amplitudes, the ambient field’s specific intensity, the strength and density of its scatterers if any, and most importantly attenuation. A theoretical basis for this is presented.
1. Introduction Interest in diffuse elastic waves has seen an extraordinary growth over the last decade, driven largely by applications in seismology and exploration geophysics. One key to this growth was the observation22,23,25 that fully diffuse wave fields permit, on cross-correlating the ostensibly random material displacements at different positions, retrieval of the Green’s function, where the Green’s function is the signal one would 183
184
R. L. Weaver
have at one receiver if the other were replaced with an impulsive force. Further theoretical arguments24,17,13,20 and laboratory demonstrations7,1,8 in support of this conclusion quickly appeared. The essential theorem states that the (time derivative of the) cross correlation between fully diffuse white noise fields ψ at two points x , and y C xy′ (τ ) = limT →∞
(1) d 1 d < ψ ( x , t )ψ ( y , t + τ ) >t ψ ( x , t )ψ ( y, t + τ ) dt = ∫ T dτ T dτ
is equal to the (time symmetrized) Green’s function G between points x and y . C ′xy (τ ) = E (G ( x , y ,τ ) − G ( x , y, −τ )) , (2)
where E is a measure of the energy density in the diffuse field. The white noise condition is not onerous. If the diffuse field is band limited, then the cross correlation is merely the Green’s function convolved with the spectrum of the ambient field, i.e. the factor E becomes a convolution. The time-symmetrization is not onerous either, as G is causal, so the two terms in Eq. (2) have distinct supports in τ . The amount of time T over which one must integrate to obtain sufficiently converged C has received considerable attention. In practice seismologists find that a period of months usually suffices. Theory suggests that the amount of time needed scales with the distance between stations, but is greater in the presence of attenuation. Considerable thought has gone into creatively weighting the time-average to prevent periods of high energy density from dominating the average. One method is to replace the ambient field with its sign before cross correlating. If, as in elastodynamics, the fields are vector-valued, the above expressions generalize easily. The relevance of this to seismology and exploration geophysics,21,6 ocean acoustics12 and structural acoustics,2,15 was immediately apparent. Applications in long-period seismology have been particularly striking. Tomographic maps of Rayleigh wave velocity, in which different frequencies reveal information from different depths, have been
Towards Green’s Function Retrieval
185
constructed with extraordinary spatial resolution.16,14 The literature also reports retrieval of waveforms due to Love waves and bulk waves. High spatial resolution arises because the method frees the seismologist from needing a controlled source (impractical in seismology where one needs an earthquake or a nuclear explosion and in exploration geophysics where one uses chemical explosives or thumper trucks.) Dense large arrays consisting of up to hundreds of seismic stations are a recent development; the world-wide availability of their daily records for periods of years lends itself naturally to the necessary kind of data processing. The early work made it clear that the theoretical basis for the relation is restrictive; the notion of a fully diffuse wave field is a demanding one and few natural fields are fully diffuse to the required degree. In practice, ambient surface seismic waves at frequencies below 1 Hz are not fully diffuse. They appear to have their sources in localized ocean storms that are often distant from the seismic detection network.18,4,10 Therefore the fields at the detectors retain strong directionality. (Scattering from seismic heterogeneities mitigates this somewhat but not entirely). The fields are diffuse in the sense that they lack coherent wave fronts and are superpositions of uncorrelated waves traveling in different directions. They are not fully diffuse in that their distributions of intensity are not isotropic, and further in that the energy amongst different wave types, bulk and surface, is probably not equipartitioned. Nonetheless numerous researchers report highly robust retrieval of travel times between distant stations, both in the lab and in the earth’s surface. It transpires that an isotropic distribution of intensity is not a prerequisite for retrieving travel times. Retrieval of Green’s functions per se does require both isotropy and equipartition amongst wave types. But both experience16,14 and theory26,3,5 show that retrieval of travel times is robust against these failures of full diffusivity. This is the basis for the extraordinary quality of the maps of seismic Rayleigh wave velocity in spite of the failures of the correlations C′ to correspond exactly to G . This raises the further question of whether it is possible using the imperfectly diffuse fields with which seismology has to deal and for which C ′ ≠ G , to retrieve not only travel times, but also meaningful measures of amplitude. If so, we could extract maps of attenuation and
186
R. L. Weaver
scattering strength as well as seismic velocity. It is towards this goal that the present communication is directed.
2. Imperfectly Diffuse Fields Ambient seismic fields fail to be fully equipartitioned in at least two ways. They often fail to be equipartitioned amongst all wave types. The ambient field is dominated by surface waves. For this reason, retrieval of bulk waves from correlations has been more difficult than retrieval of surface waves. (This is in part also due to the lower amplitude of typical P and S contributions to Green’s functions between surface stations.) Retrieval of surface wave ray arrivals has proven far more feasible, however the ambient field's surface waves also fail to be fully equipartitioned in practice: the ambient surface waves are generally not distributed isotropically, some directions have greater intensity than others. Recently Weaver et al26 investigated the effect of such a distribution, or “ponderosity” B (θ ) , on retrieval of ray arrival times. Here we summarize that argument, and extend it to the effect on ray arrival amplitudes. Consider two receivers, one at the origin, the other a distance x from the origin along the x-axis. We distribute incoherent sources s (θ , t ) over an annular region of (large) radius R around the receivers, with < s > = 0 , and < s (θ ) s * (θ ′ ) > = B (θ )δ (θ − θ ′ ) . These give rise to a fieldψ (r , t ) whose Fourier transform
ψɶ (r ,ω ) = ∫ψ ( r , t ) exp ( −iω t ) dt is a superposition of cylindrical waves from the many sources s . At a point on the x-axis it is
ψɶ ( x, ω ) = ∫ s (θ ) dθ exp (iω R − iω x cosθ ) / Ri (ω − iε ) ,
(3)
the angle θ being defined relative to the strike line connecting the receivers. The cylindrical waves have been written in a form valid for asymptotically large ω R . Thus the derivation is restricted to noise whose sources are in the far field. ε is an infinitesimal positive quantity. Units are used such that wave speed c is unity.
Towards Green’s Function Retrieval
187
The time derivative of the correlation between receivers at r = 0 and ˆ r ′ = ix has Fourier transform Cɶ 0,' x (ω ) ≡ iω ∫ C0, x (τ ) exp( −iωτ )dτ = iω < ψɶ (0, ω ) *ψɶ ( x,ω ) > .
(4)
The relevant expectation is
< ψɶ (0) *ψɶ ( x) >=< ∫ s(θ ')dθ '∫ s(θ )dθ exp( −iω x cos θ ) / R ω 2 + ε 2 > = ∫ B(θ )dθ exp(−iω x cos θ ) / R ω 2 + ε 2 .
(5)
On returning to the time domain and removing the uninteresting factor of
R, C0,' x (τ ) = ∫ B (θ ) dθ i sgn(ω )exp(−iω x cos θ ) exp(iωτ )d ω .
(6)
In practice, source fields and correlation waveforms have finite bandwidth. In this case Eq. (6) must be modified to include a smooth factor of spectral weight, assumed independent of angle,
C0,' x (τ ) = ∫ B (θ )dθ i sgn(ω ) exp(−iω x cos θ ) exp(iωτ ) S (ω )d ω
(7)
In accord with typical seismic applications we take the spectrum to be band-limited with central frequency ω o , in the form of a sum of (assumed non-overlapping) parts at positive and negative frequency, S (ω ) = Aɶ (ω − ω 0 ) + Aɶ (ω + ω 0 ) ,
(8)
where Aɶ (ω ) is the Fourier transform of a real envelope function A (t ) that has its maximum at t = 0 . Without loss of generality we take the maximum to be unity. The autocorrelation, C′ at x = 0 , is then readily and exactly evaluated to be ' C0,0 (τ ) = ∫ B(θ )dθ
∫ i sgn(ω ) exp(iωτ )S (ω )dω
= −2{ ∫ B(θ )dθ } A(τ )sin ω oτ ,
(9)
which shows that, unlike the cross correlation below, the auto-correlation receives significant contributions from waves in all directions θ .
188
R. L. Weaver
At x ≠ 0 , and if ponderosity B is independent of angle, i.e. if ambient energy is incident uniformly from all directions, the integral (7) over θ is recognized as giving a Bessel function and one exactly recovers the usual 2-d Green’s function. If B is θ -dependent, then the integral can be evaluated analytically for asymptotically large ω x . One recognizes that the dominant contribution comes from the vicinity of θ = 0 , i.e. from the ambient waves that are incident in the on-strike direction from the origin towards x . Then, to leading order we have C0,' x (τ ) ~ ∫ {B (0) + 12 B "(0)θ 2 + 241 B ''''(0)θ 4 + ..} ×
exp( −iω x{1 − 12 θ 2 }) i sgn(ω ) exp(iωτ ) S (ω ) dω dθ .
(10)
Higher order corrections to the exponent (e.g. exp(−iω xθ 4 / 24) ) will affect the asymptotic corrections, but such effects will be of the same (small) order for the leading term B (0) and the first correction proportional to its derivatives. Thus we neglect them. Asymptotically, then C ′ becomes (c.c. representing complex conjugate) ∞
C0,' x (τ ) ~ i ∫ exp(iωτ ) Aɶ (ω − ω o )d ω 2π / | ω | x × 0
(11)
1 1 exp( −iω x + iπ / 4)[ B(0) − B ''(0) − B ''''(0) 2 2 + ..] + c.c. 2iω x 8ω x At leading order, where we neglect the B '' and B '''' terms, the correlation becomes C0,' x (τ ) / B(0) 2π / | ω o | x ∞
~ exp(−iω o x + iπ / 4) i ∫ exp(iωτ ) Aɶ (ω − ωo )d ω + c.c. 0
∞
~ exp(iω o (τ − x ) + iπ / 4)i ∫ exp(i β (τ − x )) Aɶ ( β )d β + c.c. 0
= exp(iω o (τ − x ) + iπ / 4)iA(τ − x ) + c.c. = −2 A (τ − x ) sin(ω o (τ − x ) + 3π / 4).
(12)
Towards Green’s Function Retrieval
189
C' is therefore the usual Green’s function at asymptotically large ωx. It has an arrival at time x / c = x , and an amplitude proportional to the noise intensity B (0) on strike. The higher order term B '' can be seen in Eq. (11) to affect the phase of the ray arrival. This has been discussed at length elsewhere26,3,5, where it is established that the practical consequences for retrieval of ray travel times are usually negligible. Both the term in B '' and B '''' affect the amplitude of the apparent ray arrival. A small bit of algebra establishes that the amplitude B (0) in Eq. (12) should be replaced by
B '' (0)2 B '''' ( 0) 1 B (0) 1 + − 2 2 + .. . 2 B ( 0) 8ωo x B (0)
(13)
At practical station separations of more than two wavelengths, the difference is usually negligible.
3. Retrieval of Attenuation While retrieval of travel times is largely independent of the uniformity of ponderosity, retrieval of attenuation requires that it be considered. A full examination of the amplitude of a correlation waveform will supplement Eq. (12) with a factor related to attenuation, and another for the so-called ‘site effect’. The amplitude of the ray arrival in the correlation between the fields at site i and site j is then, on neglecting the asymptotically negligible corrections to B(0) indicated in Eq. (13),
X ij = 2 si s j Bi (nˆi → j ) 2π / ω o | xi − x j | exp(−α | xi − x j |) .
(14)
If the attenuation were to vary spatially, one would replace the exponent with an integral over the line between sites i and j . Here Bi is the ponderosity in the direction from i towards j evaluated at the position of station i . Here we have neglected additional geometric factors related to focusing and defocusing due to spatially varying wave speeds. In principle these effects can be important, and they can be accounted for once one has the velocity map, although Prieto et al9 suggested that they average out.
190
R. L. Weaver
The expression above may be supplemented by the corresponding expression for the amplitude of the autocorrelation
X ii = 2si2 ∫ Bi (θ ) dθ .
(15)
Site factors si , in which the sensitivity of a seismic station to an incident seismic wave depends on the geology near the station, are commonly assumed to be scalar and real. The assumption of reality means that they do not affect apparent ray arrival times. They do, however, affect the amplitude of a seismic station record and in principle must be accounted for if one is attempting to compare amplitudes between different stations in order to recover attenuation. If the seismic signal at a station i is contaminated by so-called ‘local noise’ that does not propagate into the interior and from there to other stations, then the autocorrelation includes an additional term related to the energy of the local noise. The goal is to recover the attenuations, and by implication the B and the site factors, from measurements of the amplitudes of the ray arrivals X in the cross correlations amongst all the stations of a region. This goal is not new. It has been suggested that the cross correlations between two seismic stations could be normalized by the noise auto-correlation at the nominal receiver, by considering quantities like X ij / X ii X jj ,
(16)
which has the virtue of eliminating the site factors s . Unfortunately this normalizes with respect to the angle-integrated B , not to the B on strike. Thus, this construction does not correct for non-isotropy in the ambient noise fields. Another suggestion is to identify a linear array, along a direction nˆ , of several seismic stations for which the amplitudes of cross correlations X for j > i should be xj
X i < j = 2 si s j Bi ( nˆi → j ) 2π / ω o | xi − x j | exp(− ∫ α dx). xi
(17)
Taking N to be the number of stations along this array, one notes that there are at most N ( N − 1) / 2 such ray-amplitudes and up to 3N
Towards Green’s Function Retrieval
191
unknowns to be extracted (the site factors, the B and the local attenuations). Rays in direction − nˆ can provide up to N ( N − 1) / 2 additional amplitudes and N additional unknowns B . It is not difficult then to show that one needs from 5 to 7 stations along this line before one has an over-determined set of conditions. A model for the spatial dependence of the α, and a model that assumes B( nˆ ) diminishes exponentially with distance like exp(-2α x), and also geometrically due to spreading from its distant source, but is not replenished by scattering into direction nˆ from other directions, might help constrain the parameters and reduce the demand for a large number of stations. The approach is intuitive and may well suffice in many circumstances. It does, however, rely on identifying a linear array of sufficient length, and/or upon assumptions on how attenuation and noise vary in space. More problematic in this researcher’s eyes however is that it does not make use of all the information that is in principle available. How might one use sets of seismic stations that are not aligned along a linear array? Can one make use of autocorrelations Xii ? How quickly ought one permit the diffuse intensity implicit in the above formulation to vary in direction and from place to place? Is other information extractable? It is the contention here that certain conditions on diffuse intensity, in particular how it is permitted to vary in space and direction, can be introduced that will further constrain the model and thus lead to more reliable recovery of seismic attenuation and diffuse intensity. The appropriate condition is that the noise field ought to obey a Radiative Transfer Equation19,11 or RTE, nˆ ⋅ ∇B (r , nˆ ) + 2α (r ) B (r , nˆ ) = p2(πr ) ∫ B (r , nˆ ')dnˆ '+ P(r , nˆ ) (18) The ponderosity B of the earlier sections is thus interpretable as the socalled ‘specific intensity’ of Radiative Transfer Theory. It is the diffuse Rayleigh wave intensity in direction nˆ at position r . In Eq. (18) α is the usual attenuation. It includes intrinsic absorption, and also losses due to scattering to bulk waves and to other modes, and also scattering of Rayleigh waves into Rayleigh waves going in other directions. The left side indicates that intensity decays like exp(-2α x); this is the coherent loss. The right side shows that the specific intensity is
192
R. L. Weaver
augmented by sources P (which are generally negligible for long period seismology on the continents and may therefore be confined to the boundaries of the region of interest) and by scattering, with strength p , into direction nˆ from direction nˆ ′. Here p is the (assumed for now isotropic) scattering strength. Scattering p contributes to attenuation; indeed on recognizing that 2α = p + intrinsic losses + scattering into other modes, one can demand the inequality 2α ≥ p ≥ 0 . The RTE governs how quickly specific intensity B can vary, in directions or position. Arbitrary differences in ambient intensity at different stations are not permitted. It is proposed that Eqs. (14), (15) and (18) together with measurements of ray amplitudes X (as many as 104 in an array of 100 stations) and reasonable models for attenuation and scattering p, constitute a large global and overdetermined set of conditions for recovering the noise field B, the site factors s, the attenuation, the diffuse scattering strength p and the source distribution P. Inasmuch as the X are always determined with some readily estimated uncertainty, this may be formulated as a nonlinear least square minimization in a space of hundreds of parameters. Unless good first guesses are given, such minimizations are notoriously poor at converging to the global best fit. Towards that end the above procedures entailing linear arrays may be of use to generate starting points. Preliminary numerical trials on a set of six stations and 36 amplitudes with spatially constant B (θ ) have shown that, even with random initial guesses, a simple conjugate gradient minimization sufficed to find the correct solution.
Acknowledgments The author’s work in diffuse field seismology has been supported by the National Science Foundation, in particular by grant no. EAR-05-43328.
References 1. A. Derode, E. Larose, M. Tanter, J. de Rosny, A. Tourin, M. Campillo and M. Fink, “Recovering the Green’s function from field-field correlations in an open scattering medium,” J. Acoust. Soc. Am. 113, 2973 (2003).
Towards Green’s Function Retrieval
193
2. A. Duroux, K. G. Sabra, J. Ayers, M. Ruzzene, “Extracting guided waves from cross-correlations of elastic diffuse fields: Applications to remote structural health monitoring,” J. Acoust. Soc. Am., 122, 204 (2010). 3. B. Froment, M. Campillo, P. Roux, P. Gouédard, A. Verdel and R. L. Weaver, “Estimation of the effect of non-isotropically distributed energy on the apparent arrival time in correlations” Geophysics 75, SA85 (2010). 4. P. Gerstoft and T. Tanimoto, “A year of microseisms in southern California,” Geophs. Res. Lett. 34 L20304 (2007). 5. O. Godin, “Accuracy of the deterministic travel time retrieval from crosscorrelations of non-diffuse ambient noise,” J. Acoust. Soc. Am., 126 EL183 (2009). 6. P. Gouedard, L. Stehly, F. Brenguier, M. Campillo, Y. C. de Verdiere, E. Larose, L. Margerin, P. Roux, F. J. Sanchez-Sesma, N. M. Shapiro and R. L. Weaver, “Cross-correlation of random fields: mathematical approach and applications,” Geophys. Prospecting, 56, 375 (2008). 7. E. Larose, A. Derode, M. Campillo and M. Fink, “Imaging from one-bit correlations of wideband diffuse wave fields,” J. Appl. Phys. 95, 8393 (2004). 8. A. E. Malcolm, J. A. Scales and B. A. van Tiggelen, “Extracting the Green function from diffuse, equipartitioned waves,” Phys. Rev. E 70, 015601(R) (2004). 9. G. A. Prieto, J. F. Lawrence and G. C. Beroza, “Anelastic Earth structure from the coherency of the ambient seismic field,” J. Geophys. Res. 114, B07303 (2009). 10. J. Rhie and B. Romanowicz, “Excitation of Earth’s continuous free oscillations by atmosphere–ocean–seafloor coupling,” Nature, 431, 552 (2004). 11. L. Ryzhik, G. Papanicolaou and J. B. Keller, “Transport equations for elastic and other waves in random media,” Wave Motion, 24, 327 (1996). 12. P. Roux and W. A. Kuperman, “Extracting coherent wave fronts from acoustic ambient noise in the ocean,” J. Acoust. Soc. Am. 116, 1995 (2004). 13. P. Roux, K. G. Sabra, W. A. Kuperman and A. Roux, “Ambient noise cross correlation in free space: Theoretical approach,” J. Accoust. Soc. Am., 117, 79 (2005). 14. K. G. Sabra, P. Gertoft, P. Roux, W. A. Kuperman and M. C. Fehler, “Surface wave tomography from microseisms in Southern California,” Geophys. Res. Lett. 32, L14311 (2005). 15. K. G. Sabra, A. Srivastava, F. L. di Scalea and I. Bartoli, “Structural health monitoring by extraction of coherent guided waves from diffuse fields,” J. Acoust. Soc. Am. 123, EL8-EL13 (2008). 16. N. M. Shapiro, M. Campillo, L. Stehly and M. H. Ritzwoller, “High-resolution surface-wave tomography from ambient seismic noise,” Science, 307, 1615 (2005). 17. R. Snieder, “Extracting the Green’s function from the correlation of coda waves: A derivation based on stationary phase,” Phys. Rev. E, 69, 046610 (2004). 18. L. Stehly, M. Campillo and N. M. Shapiro, “A study of the seismic noise from its long-range correlation properties,” J. Geophys. Res., 111, B10306 (2006).
194
R. L. Weaver
19. J. A. Turner and R. L. Weaver, “Radiative Transfer of Ultrasound,” J. Acoust. Soc. Am., 96, 3654 (1994). 20. K. Wapenaar, “Retrieving the elastodynamic Green’s function of an arbitrary inhomogeneous medium by cross correlation,” Phys. Rev. Lett., 93, 254301 (2004). 21. K. Wapenaar, D. Draganov and J. Robertsson (eds) Seismic Interferometry supplement to Geophysics 71 (2006). 22. R. L. Weaver and O. I. Lobkis, “Ultrasonics without a source, Thermal fluctuation correlations at MHz frequencies,” Phys. Rev. Lett., 87 art. no. 134301 (2001). 23. R. L. Weaver and O. I. Lobkis, “On the emergence of the Greens function in the correlations of a diffuse field,” J. Acoust. Soc. Am., 110, 3011 (2001). 24. R. L. Weaver and O. I. Lobkis, “Diffuse waves in open systems and the emergence of the Greens’ function,” J. Accoust. Soc. Am., 116, 2731-4 (2004). 25. R. L. Weaver, “Perspectives Geophysics: Information from Seismic Noise,” Science, 307, 1568 (2005). 26. R. L. Weaver, B. Froment and M. Campillo, “On the correlation of non-isotropically distributed ballistic scalar diffuse waves,” J. Acoust. Soc. Am, 126 1817 (2009).
CHAPTER 9 STUDY ON TWO SCALE DESIGN OPTIMIZATION OF STRUCTURES AND MATERIALS WITH PERIODIC MICROSTRUCTURE*
Gengdong Cheng1,* and Jun Yan1,* *State
Key Laboratory of Structural Analysis of Industrial Equipment1 Dalian University of Technology, Dalian, China, 116024 E-mail:
[email protected]
Two scale design optimization for structures and materials with periodic microstructure is studied in the present paper. The aim is to obtain optimal configurations of macro scale structures and microstructure of materials for optimizing structural performance. The link between macro effective material properties and material micro structure is established by a homogenization approach. Structural topology optimization and inverse homogenization is applied to obtain the optimum structural topology design and optimum microstructure. Global structural performance such as minimum compliance design and maximizing fundamental frequency and local performance such as stress concentration are investigated. The proposed method and computational model are validated by a set of numerical examples. A number of issues for further development in two scale design are addressed.
1. Introduction Light weight structures can reduce the long-run energy consumption of aeronautic crafts. With shortage of energy and resources and intense competitions light weight structures have attracted more and more This paper is dedicated to Professor Yih-Hsing Pao on the occasion of his 80th birthday. This work is supported by the program (90816025, 10902018) of NSFC. 195
196
G. Cheng and J. Yan
attention in various industries. For high speed vehicles, structural weight is the decisive factor for their essential performances such as speed, efficiency and maneuverability. Optimum design of light weight structures has been the focus in the area of structural optimization. Structural topology optimization3,4 is a powerful approach to create innovative light weight structural design. A well known alternative approach to reduce structural weight is to use light material such as Al-alloy, Al-Mg alloy, Ti-Alloy and composite materials in the aeronautic, aerospace and car industry. With the rapid development in manufacturing techniques, more ultra-light porous metal materials such as truss-like materials, linear cellular metals (LCMs) and metal foams are emerging in the engineering practice. Because of their porous periodic microstructure in the size of tiny structure, they are called structural materials. They have received increasing attention for their high stiffness-weight and strength-weight ratios together with the design potential for multifunctional applications9. More importantly, their properties can be engineered to some extent to suit the application by changing their microstructure. As is pointed out by Ashby1 that innovation, often, takes the form of replacing a component made of one material by another one, then redesigning the product to exploit, to the maximum, the potential offered by the change. The emergence of various structured materials provides a vast opportunity for optimum design of structures and components by concurrent design of material and structures. It can be expected that the global optimum design, or the ideal light weight structure, should have the best material for every component, and the ideal light weight structural component should have the best material for every material point. That is, the material of ideal structures varies point-wise in an optimum way. In reality, material of natural biological structures such as animal’s bone and plant’s stalk has a point-wise varying microstructure8. Manufactured by new technology such as laser melting deposition, functional gradient material varies its composition gradually, which alleviates thermal stresses and enhances fracture toughness. Rodrigues et al24, Coelho et al.6 propose a hierarchical approach for the optimization design of two-dimensional and three-dimensional macro structures composed of porous materials
Study on Two Scale Design Optimization
197
which are inhomogeneous at the macro scale (material microstructure varies point-wise). Nevertheless, it is not only expensive but extremely difficult to control the material microstructure variation point-wise. On the other hand, material with macro homogeneous microstructure such as metal foam material is routinely produced nowadays although random variation is unavoidable. The present paper introduces our attempt to achieve ultra-light structures composed of ultra-light structured materials by utilizing topology optimization in both structural and material scales. To improve the possibility of practical applications, manufacturing factors are strongly underlined by assuming homogeneity of the material micro-structures at the macro scale. Multi scale analysis is an important research area and has received great interest in recent years13,19,21. Mechanical performance of structures made of structured material can be obtained by multi scale analysis. Developments in multi scale analysis and structural optimization provide the basis for two scale design optimizations of structures and materials with periodic microstructure. Two scale design optimization of ultra-light structures composed of structured materials is studied in the present paper. The aim is to obtain optimal configurations of macro scale structures and microstructure of materials for optimizing structural performance. Global structural performance such as minimum compliance and maximum fundamental frequency design and local performance such as stress concentration are investigated. The present paper is organized as follows. In Section 2, a concurrent multi scale topology optimization formulation is proposed. Macro densities in the structural design domain and micro densities in the material unit cell are introduced as design variables to independently represent the macro structure and material microstructure topology. For global performance, the homogenization method is applied to establish effective material properties from the material microstructure. Penalization approaches together with new filter techniques are adopted at both scales to ensure clear topologies, i.e. SIMP2 (Solid Isotropic Material Penalization) at micro-scale and PAMP (Porous Anisotropic Material Penalization) at macro scale. Optimization and sensitivity analysis with respect to both macro and micro densities in two scales are
198
G. Cheng and J. Yan
integrated into one system. The problem formulation and computational model are illustrated in more detail by minimum compliance design under mechanical load. Minimum compliance design under thermomechanical loads and maximum frequency design are presented with less detail. In Section 3, concurrent optimization of material and structure for the local performance is studied. Optimum stress distribution around a hole is investigated for hollowed plates composed of a truss-like material with a micropolar continuum representation. Relative density and cell size distribution of truss-like materials are determined by optimization. The relation between stresses in two scales is discussed. In Section 4, a number of issues for further development in two scale design optimization are discussed. 2. Two Scale Design Optimization for Global Structural Performance Let us consider design optimization of ultra-light structures composed of structured materials. The Design domain and boundary conditions of the macro structure to be designed are given. The objective and constraints include the structural weight and its global performances such as the structural compliance and fundamental vibration frequency. Both the macro structural topology, including size and shape, and the micro structural topology of the structured material is to be designed. This is a two scale design optimization. Considering today’s reality of manufacture, we will assume that the structure are made of a material with macro homogeneous porous micro structure. The mathematical formulation and basic approach of solving the two scale design optimization is described as follows. Figure 1 illustrates a macro structure composed of a porous anisotropic material with homogeneous microstructure and its material micro-structure in unit cell. There are two materials involved in our study, the base material and the porous anisotropic material. The base material, black in the figure, could be any type of solid materials such as aluminum and alloy. The porous anisotropic material, grey in the figure, sometimes merely called ‘material’ for short in the following discussion, is assumed to be made of the base material and to have periodic
199
Study on Two Scale Design Optimization
microstructure free of any restriction (e.g. cavities in linear cellular materials and ranked laminates with regular shape). For the structural analysis and optimization, the macro design domain Ω and micro design domain Y , i.e. the unit cell, is meshed into Porous anisotropic material
Characterization
Magnify
Base material
y2
x2 F
A unit cell
y1
x1
Fig. 1. Structure composed of a porous anisotropic material and its unit cell.†
Base material
Porous anisotropic material B
D B Thermal expan. coeff. α Elasticity Matrix
H
D B Effective thermal expan. coeff. α Elasticity matrix
∆T
MI
α
B
D = ρ ⋅D αB Solid Isotropic Material Penalization †
α
H
D MA =Ρ⋅ D MA α H B β =Ρ⋅ D ⋅α Porous Anisotropic Material Penalization
F
The topology optimizations of structures are normally based on relatively coarse resolution of the
design Fig. 2. Penalization-based concurrent optimization two classes of density computational grid, which then naturally lead to step-likewith boundaries of computed domains. The variables. figures in this paper represent the resolution at which the computations have been done actually.
200
G. Cheng and J. Yan
N and n elements, respectively, see Figure 2. To describe the macro structural topology and material micro structural topology, the element in macro-scale and micro-scale is assigned a density Ρi and ρ j , respectively. Their values are either zero or one. Elements with zero material density, white in the figure, are void. The macro element of unit density is occupied by the real porous anisotropic material. The micro element of unit density is composed of the base solid material. In this way, topology optimization of macro structure and micro structure in a unit cell is replaced by finding the optimum element density distribution in the two scales. In the terminology of mathematical programming, this is a 0-1 programming because the design variables Ρ i and ρ j can only have the values zero and one. To reduce the computational burden, the 0-1 programming is transformed into mathematical programming with continuous design variables by relaxing the constraint on density. That is, intermediate densities, i.e., 0 < Ρ i < 1 , 0 < ρ j < 1 are allowed. Physically, the element of intermediate density is artificial. In the well known SIMP approach15 of structural topology optimization, the elasticity matrix of material with artificial density is assumed to be a mathematical function of the elasticity matrix of its parent material of unit density. To ensure clear topology, elements of intermediate density should not exist in the final optimum design; a penalty on the elasticity matrix of the artificial material of intermediate density is introduced. In micro scale this is done by defining
D MI = ρ α ⋅ D B .
(1)
D MI , D B are elasticity matrices for the base material of artificial density
ρ and base solid material of unit density, respectively. α is a penalty exponent which is set equal to 3 in this research. For the given base material, D B is prescribed. Since the real material is solid base material, this basic idea represented by equation (1) is the well known approach SIMP. In macro scale, a similar penalization is adopted
D MA = Pα ⋅ D H .
(2)
Study on Two Scale Design Optimization
201
D MA and D H are the effective elasticity matrices for the porous anisotropic material of artificial density P, and the porous material of unit density, respectively. Since it penalizes the intermediate density of the porous anisotropic material, this approach is called as PAMP (Porous Anisotropic Material with Penalty)16. The key step in two scale design optimization is to establish a relation between objective/constraint function and the design variables, in particular the relation between the structural performance at the macro scale and the micro element density at the micro scales. To this end, D H serves as a crucial link between the two scales: on the one hand it is a representation of effective material properties depending on the microstructural configuration, and on the other hand it is also involved in the macrostructural analysis. The computation of D H could follow the classical homogenization procedures by implementing the following two steps2: Firstly, analyze the unit cell subjected to periodic boundary conditions and body forces corresponding to uniform strain fields k⋅u =
∫
bT D MI dY ,
(3)
b T ⋅ D MI ⋅ bdY .
(4)
Y
k=
∫
Y
Here k is the stiffness matrix of the microstructure in a unit cell with periodic boundary conditions, u is the microstructural nodal displacement vector, b is the strain/displacement matrix and D MI is defined in Eq. (1) and depends on the micro element density ρ , which in turn describes the microstructure in a unit cell. Secondly, compute the effective elasticity matrix by performing the integration over the domain of a unit cell DH =
1 Y
∫D
MI
⋅ ( I − b ⋅ u )dY .
(5)
Y
By virtue of Eqs. (1)-(5), we have established the link between the effective elasticity matrices of porous anisotropic material with artificial density and microstructural topology in a unit cell. For the structural performance other than mechanical performance such as heat conductivity and seepage capacity, a similar homogenization
202
G. Cheng and J. Yan
method is available for computing the corresponding effective properties from the microstructure in a unit cell. The present approach can directly be extended to two scale design optimizations for structural thermal performance and seepage performance, respectively. 2.1. Minimum Compliance Design of Structure Subject to Mechanical Loads The first problem is the optimum design of elastic structures composed of macro homogeneous porous anisotropic materials. The macro structure with Ω denoting its design domain is subjected to external forces F . The structural compliance is minimized with specific base material volume. With the SIMP and PAMP approaches, the formulation of minimum compliance design can be expressed as16,
∫
Minimize : C = F ⋅ Ud Ω ,
(6)
Ω
ρ PAM ⋅ Constraint I : ς =
∫
Pd Ω
Ω MA
≤ ς , ρ PAM =
∫ ρdy = ς Y
V V MI Constraint II : 0 < δ ≤ P ≤ 1, 0 < δ ≤ ρ ≤ 1 .
MI
,
(7) (8)
Here, C denotes the structural compliance and U is the structural nodal displacement under static load F . ρ PAM *P and ρ PAM are the relative densities of the artificial and real porous anisotropic material, respectively. Constraint I sets a limit on the total available base material by defining relative density ς and its upper bound ς . V MI , V MA are the areas of the macro and micro design domains, respectively. Constraint II sets bounds for the density variables at two scales to avoid numerical singularities, where δ is a small predetermined positive number, chosen here as δ = 0.001 here. The structural nodal displacement U can be obtained from the following structural governing equation K⋅U = F
(9)
Here the structural stiffness matrix K is the assemblage of the element stiffness matrices k e , which depend on the effective elasticity matrix
Study on Two Scale Design Optimization
203
DeMA = Peα DH of macro element composed of porous anisotropic material with artificial density K = AeN=1k e ,
ke =
∫
Ωe
Pe :
BT ⋅ D MA ⋅ Bd Ω =Peα
∫
Ωe
BT ⋅ D H ⋅ Bd Ω . (10)
Based on Eqs. (1-6), (9) and (10), the sensitivity of the objective function with respect to the artificial densities of macro element and micro element could be derived as ∂C α ⋅ Ci =− , ∂Ρ i Ρi
(11)
∂C ∂D H = − Peα ⋅ UTe ⋅( BT ⋅ ⋅ Bd Ω) ⋅ U e , ∂ρ j ∂ρ j e =1
(12)
N
∑
∫
Ωe
where the derivative of D
H
with respect to ρ j is given by
H
∂D = αρ αj −1 ∫ (I − b ⋅ u j )T ⋅ D B ⋅ (I − b ⋅ u j )dY . ∂ρ j Yj
(13)
Now, we have completed the structural analysis and sensitivity analysis. With the sensitivity information available, gradient-based optimization algorithms such as MMA (Moving Asymptotic Algorithm), SLP (Sequential Linear Programming) and SQP (Sequential Quadratic Programming) can be applied to find the optimum design of Eqs. (6-8). To illustrate the formulation, an example problem is shown here. The MBB beam in Figure 3 is loaded with a concentrated vertical force of P = 1000 at the centre of the top edge and is supported on rollers at the bottom-right corner and on simple supports at the bottom-left corner. The base material is assumed to have Young’s modulus E = 2.1 × 105 and Poisson’s ratio v = 0.3 . Geometric parameters are L = 4 and h = 1 . As we are only interested in qualitative results, the dimensions and loads for this problem are chosen in non-dimensional units. Due to the axial symmetry of the problem, only the right half part is considered as macro design domain. The mesh is 50×25 for the macro design domain and 25×25 for the microstructure (8-node planar isoparametric element). OPTIMUM SEARCH is implemented by the optimization package DOT using the SQP algorithm.
204
G. Cheng and J. Yan
Table 1 shows the optimal designs with varying ς (available base material) and specified ς MI = 0.4. The resulting microstructures are all porous anisotropic, the topology varies considerably among the different P = 1000
Micro design domain A unit cell
?
h =1
?
Macro design domain
L=4
Fig. 3. MBB Beam.
Mixed cell
Triangular cell
Fig. 4. Two microstructures for Linear Cellular Material9.
cases. When ς = 0.25, the microstructure after a rotation of 45 degrees closely resembles the so-called ‘Triangular cell’ (see Figure 4) which is said to have superior in-plane mechanical properties9. Alternatively, when ς = 0.075, the microstructure after a rotation of 45 degrees is somewhat similar to the ‘Mixed cell’ (see Figure 4), which is considered as another competitive microstructure for linear cellular material. The resemblance between optimized results and some existing superior microstructures implies that the proposed method indeed generates optimum micro-topology for porous materials. In macro-scale, the design also changes with the increase of ς . For the compliance value, it is reasonable that better system performance is achieved with more available base material, see the third column of Table 1.
Study on Two Scale Design Optimization
205
Table 1. Results for varying ς (available base material) and specified ς MI = 0.4.
ς
Compliance
0.075
9855
0.12
5676
0.18
3707
0.25
2234
Structural topology
Microstructural topology
Table 2. Results for varying ς MI (specified material volume fraction of microstructure) and specified ς = 0.12.
ς MI
Compliance
0.2
8880
0.3
6210
0.4
5676
Structural topology
Microstructural topology
206
G. Cheng and J. Yan
One interesting problem is the trade-off between macro-scale design and micro-scale design, namely the allocation of material between the two scales which can be characterized by ς /ς MI and ς MI . To illustrate the discussion, the two-scale design problem will be solved again with predetermined available base material ς = 0.12 and varying ς MI . As shown in Table 2, a larger value of ς MI leads to a lower system compliance with more base material consumption in micro-scale design and less porous material consumption in macro-scale design. This means in this case, it is advantageous to strengthen the porous material rather than enhancing the macro structure. 2.2. Minimum Compliance Design of Structure Subject to Mechanical and Thermal Loads
The second problem is optimum design of thermo-elastic structures composed of macro homogeneous porous anisotropic materials. The optimization aims at minimum compliance design under mechanical and thermal loads by optimizing configurations of the macro structures and the micro structures of the material. The base material volume is given. The structure is subjected to external forces F and an uniform temperature increase ∆T . With SIMP and PAMP approaches, the formulation of the minimum compliance design can be expressed as29,
∫
∫
Ω
Ω
Minimize : C = F ⋅ Ud Ω + β MA ⋅ ε ⋅ ∆Td Ω
ρ PAM ⋅ Constraint I : ς =
V
∫
Ω MA
Pd Ω
≤ ς , ρ PAM =
∫
(14)
ρ dy
Y
V MI
Constraint II : 0 < δ ≤ P ≤ 1, 0 < δ ≤ ρ ≤ 1 .
,
(15) (16)
Here, C denotes the structural compliance and U is the structural nodal displacement vector under thermal and static mechanical load. ε , β MA represent structural strain and the effective thermal stress matrix of the porous material in macro scale, respectively, and β MA = Pα ⋅ D H ⋅ α H .
207
Study on Two Scale Design Optimization
A Unit Cell
?
? Micro design domain
∆T
H=47.7cm
Macro Design Domain
L=72cm F=500000
Fig. 5. Numerical example of beam with two fixed ends.
According to reference17 no matter what configuration of the micro H B unit cell has, we always have α = α for porous anisotropic materials composed of one pure base material, α B being the matrix of the coefficient of thermal expansion of the base material. The other constraints have been explained in 2.1. To illustrate the formulation, an example is given here. The initial design domain and boundary conditions of the structure to be designed are shown in Fig. 5. A vertical load F = 500000 N and a uniform temperature increment ∆T are applied. The volume constraint is equal to 5% of the total volume. Due to the symmetry, only the left half part is considered as macro design domain. The mesh is 60×30 for the macro design domain and 25×25 for the micro structure (8-Node Planar Element). Optimum design is obtained by the SQP algorithm. Influences of the temperature differential on the two scale design results are listed in Table 3. Two observations from Table 3 are very interesting, more or less, out of expectation. First, it can be seen from the second row of Table 3 that the optimum material was solid if no thermal load exists. In other words, no porous material was needed for minimum compliance design if the structure was only subjected to a mechanical load, and both macro structural topology and micro material topology were allowed to be optimized. However, the structure made of porous material gives
208
G. Cheng and J. Yan Table 3. Influences of temperature differential on the concurrent design results.
∆T
Structural topology
Microstructural topology Unit Cell
4 × 4 arrays
0°C
1°C
20°C
40°C
lower compliance if both thermal load and mechanical load are applied. Second, as temperature differential increased from 1° to 40° the macro structural topologies were greatly changed. More test examples confirmed that in most cases isotropic solid material was the best configuration for microstructure to improve the stiffness of the structure with only mechanical loads, except the uniaxially stressed structures. Moreover, this assertion is valid under the following two conditions: (1) only macro uniform material, even with microstructure, is allowed and both macro structural topology and micro
Study on Two Scale Design Optimization
209
structural topology are optimized. (2) The optimization aims at minimum compliance under mechanical loads. The reason for superiority of the solid configuration could be understood with the sensitivity analysis. When ∆T = 0 , the sensitivity of compliance to two designing densities can be written as ∂C α T ui BT DiMA B d Ω e ui , =− ∂Pi Pi e Ω
∫
N
∂C = − uTr ∂ρ j r =1
∑
H T α ∂D r r B B P d Ω u r . r r ∂ρ j Ω
∫
(17)
(18)
It could be observed from the above two equations that the sensitivity of compliance to macro/micro density was always negative, which means that the compliance decreased with an increase of available base material volume. At the same time, the sensitivity of compliance to micro density was the sum of the corresponding values over all N macro elements. Moreover, its counterpart at the macro scale is related only with one local macro element. Then generally, the sensitivity of the micro scale was much larger than that of the macro scale in absolute value. This fact means that it was more efficient to distribute material of the micro scale than that of the macro scale to decrease the structural compliance, or the isotropic solid material was the best choice for the microstructure of the unit cells. It should be pointed out that the Michell truss is not contradicting the above statement because its microstructure varies point-wise. Furthermore, the above statement is consistent with the prediction of Lakes (1993)15, who suggested that the stiffness to weight ratio of framework-type structures decreases with increasing structural hierarchy. For structures with both static and thermal loads, it was found that PAM porous materials ( ρOpt < 1 ) could help to reduce the compliance of structures. Some explanations of the cellular configuration could also be given by sensitivity analysis. Observing the sensitivity analysis for compliance optimization with both mechanical and thermal loads, Eq. (19) and Eq. (20), it was found out that the sensitivity of the compliance w.r.t macro/micro density was not always negative because
210
G. Cheng and J. Yan
they were composed of two terms corresponding to static and thermal loads, respectively. That is, the sensitivity of compliance to densities might be positive, which means hat the compliance might increase as the available base material volume increases. In numerical implementations, it was found that there was indeed an optimum material volume fraction. The optimum macro/micro volume fractions and configurations remained relatively stable and the volume constraint, constraint Ι in Eq. (15), was no longer active after the optimum volume fraction was reached. ∂C α α = − ui T BT DiMA Bd Ωe ui + 2 ⋅ ui T BT βi MA ∆Td Ωe , ∂ Pi Pi Pi e Ω Ωe
∫
N
∂C = − uiT ∂ρ j i =1
∑ ∫
∫
α
BT P i
Ωi
∂DiH Bd Ωi u i + 2 ⋅ ∂ρ j
N
∑ ∫ uTi
i =1
α
BT P i
Ωi
(19)
∂DiH B α ∆Td Ωi . (20) ∂ρ j
Since these two observations were little out of expectation, the optimum design of truss structures with artificial porous material under similar condition was studied. The results confirmed our observations. Due to the space limitation, they are not shown here. 2.3. Maximum Fundamental Frequency Design
The third problem shown here is the two-scale topology design for maximization of fundamental frequency of vibrating elastic structures composed of porous anisotropic material. With above approaches, the problem can be formulated as a max-min problem as follows20: Find : X = {P, ρ} ,
{
Obj : max min
j =1,…, J
{λ
j
(21)
= ω 2j
}} ,
(22)
S .t. : Kφ j = λ j Mφ j , φ Tj Mφk = δ jk , k , j = 1,… , J ,
ς=
ρ PAM ⋅
∫
Ω MA
Pd Ω
≤ ς , ρ PAM =
∫
ρdY
= ς MI , V V MI 0 < P ≤ Pi ≤ P, i = 1,… , NE ; 0 < ρ ≤ ρ l ≤ ρ , l = 1,… , n. Y
(23)
211
Study on Two Scale Design Optimization
In equations (21)-(23), ω 2j denotes the jth eigenfrequency and φ j the corresponding eigenvector, and K and M are the symmetric and positive definite stiffness and mass matrices. Figure 6 shows an example problem, a beam-like structure with L = 80m, H = 50m and M0 = 216000kg for maximization of the fundamental frequency. Table 4 lists the topological design of the macro and micro structure with the variation of the available base material at a certain specific micro volume ς MI = 40%.
Fig. 6. Numerical example of a cantilever beam. Table 4. Topological design of macro and micro structure for maximizing fundamental frequency. ς
λ1
10%
156.81
20%
487.99
25%
593.81
Macro structure
The first mode
Micro structure Cell
4 × 4 array
212
G. Cheng and J. Yan
3. Two-Scale Optimization for Local Performance
Structural local performance such as local stress concentration and fracture is very important for structural safety. Structural optimizations with local performance have been well studied. One of many difficulties is the large number of constraints. The difficulty of a singular optimum is typical for the structural topology optimization with stress constraint; it needs special constraint relaxation5,10. For two scale structure/material design optimization, an additional difficulty is how to define stress. Homogenized stress could be computed from a macro structural analysis based on the effective material properties. However, from a physical viewpoint, micro stress could be equally interesting. In the following, the optimum stress distribution around a hole is investigated for a hollow plate composed of a linear cellular metal, see Fig. 7. Instead of the classical continuum, the linear cellular metal is homogenized with a micropolar continuum representation14,30 which introduces independent rotational degrees of freedom and couple stresses in addition to the usual Cauchy-type stresses at each material point for a higher calculation precision of stresses. Two classes of design variables, a relative density and cell size distribution of LCMs are defined in one element. This is different from the traditional topology optimization with only an element material density as design variable. The finite element method is applied to discretize the micropolar continuum. The material distribution or the arrangements of the cells is uniform in each element and varies from element to element. Concurrent designs of materials and structures are carried out through three different optimization formulations. They are minimizations of the stress around the hole, the highest stress within the specified point-set and the ratio of stress over the corresponding yield strength. The third formulation actually maximizes the strength reservation and seems more rational. The three formulations are as follows30, Model I: To minimize the stress around a hole in order to reduce the stress concentration. Its mathematical formulation is Find
S.t
( )
ρi , Li , Min
∫
Ω
Ψ = σC
ρ i d Ω = S0 ,
0 < ρ ≤ ρi ≤ 0.3;
2
Li (1 − 1 − ρi ) ≥ h ;
L ≤ Li ≤ L
(24)
213
Study on Two Scale Design Optimization
Model II: To minimize the maximum stress at a specified point-set about 2
which we care most, i.e., Min
Ψ = Max σ i , V0 i ∈V0
is the set of
specified points. Model III: To maximize the strength reservation of the part around the hole, or equivalently, to minimize the ratio of stress around the hole over the effective yield strength. The effective yield strength for LCMs with square unit cells can be written as σ pl σ ys = 0.5 ρ , and the
objective can be written as Min
( )
Ψ = σC
2
(0.5ρC ) .
Here the
subscripts p and s refer to LCMs and cell wall material, respectively. σ C and ρ C denote the stresses and relative density of LCMs at stress concentration points, such as point P shown in Fig. 5. Model II and Model III have the same constraints as model I.
pressure = 1.0
φ =0 X2
P X1 R = 20
1 element ○
φ =0
Fig. 7. Model of geometry and FEM for a quarter of a circular plate with circular hole.
214
G. Cheng and J. Yan
A hollowed circular plate of radius 400mm with FEM mesh is shown in Fig. 7. The plate is made of orthotropic LCMs with square unit cells, which is schematically shown in the inner square of Fig. 7. The outer edge is subjected to a prescribed uniform pressure p = 1.0 . We assume that the orientations of square unit cells coincide with the global coordinates. The relative density and cell size of initial design is uniform over the domain, ρ = 0.1 and cell size is L = 5mm . The prescribed material volume is 10% of the design space. h = 0.1mm , ρ = 0.01 and L = 0.1mm; L = 15mm . Due to symmetry we only study a quarter plate. 45 elements along the axial direction are chosen. The maximum stress σ C = σ yP appears at point P in the X 2 direction when the plate is composed of homogeneous cells. This stress is to be reduced. And the nodes on the horizontal edge along the X 1 axis are chosen as specified point-set V0 in model II. We assume that all elements on the same circumference have identical material properties. Max The results are summarized in Table 5. σ X2 is the maximum stress in the X2 direction of point-set V0 . Results in Table 5 show that for the uniform initial design the stress around the hole is 7.57 and the objective values are 57.37, 57.37 and 1147.41 for Models I, II and III, respectively. In Model I, the stress around the central hole is minimized to 0.14. However, the maximum stress emerges at another point on the horizontal axis and reaches 5.14, which is certainly not desired. In Model II, the maximum stress in the specified point set V0 can be reduced to 2.52. It must be mentioned that the above stress is the stress at the equivalent continuum. A mapping method has been studied to obtain stress at the micro scale. Table 5. Comparison of stress between uniform design and optimum design. Model
Initial objective function value
Objective function value after optimization
σ XMax 2
Model I
57.37 (stress around the hole 7.57)
0.34 (stress around the hole 0.58)
5.14
Model II
57.37 (stress around the hole 7.57)
6.36 (stress around the hole 2.52)
2.52
Model III
1147.41 (stress around the hole 7.57)
151.64 (stress around the hole 4.77)
4.77
Study on Two Scale Design Optimization
215
4. Concluding Remarks Two scale design optimizations of macro structure and micro-structure of structured materials for structural global and local performances were studied in this paper. To improve the possibility of practical applications, manufacturing factors were strongly underlined by assuming homogeneity of the material microstructures at the macro scale. Structural topology optimization approaches such as SIMP, PAMP, homogenization-based two scale analysis method, inverse homogenization method for material microstructure design and new filter techniques were integrated to develop their mathematical formulation and obtain optimum structures and materials. Numerical examples illustrated the effectiveness of the strategy. In addition, a number of interesting observations were obtained from the numerical examples. For example, for the cases in which only mechanical loads apply, microstructure of isotropic solid material is best for achieving the minimum structural compliance; while for the cases in which both mechanical and thermal loads apply, the configuration of porous material is more effective to reduce the system compliance. The fundamental frequency and the stress distribution of structures composed of porous materials also can be optimized with the presented two scale optimization method. The method was also extended to more realistic applications considering multifunctional performances, e.g. active or passive heat transfer26, seepage flow rate28, or incorporated non-local effects of material model18. The two scale topology optimization suffers more from the checkerboard and gray element than the classic single scale optimization. In order to prevent these numerical instabilities and obtain a clear design, various regularization methods such as perimeter control, linear and nonlinear filters have been applied in our study. Due to the space limitation, they were not detailed here. Interesting readers are referred to relevant work12,25,27. Further development in this direction is also needed. Many challenges remain for multi scale design optimization. For example, the current homogenized continuum models suffer from their inability to capture the correct underlying deformation and local strength mechanisms, a more advanced multi scale continuum theory is needed.
216
G. Cheng and J. Yan
Many excellent works7,13,19,21,31 on multi scale analysis has deepened our understanding of material strength and failure mechanisms down to crystal, molecular or even smaller scale. Based on such a theory, it is expected that one can integrate the more realistic constraints, such as local fracture, plasticity failure into the multi scale design optimization. Material microstructure design at the smallest level could be considered too. This would be extremely exciting. Another important aspect in multi scale design optimization is to consider the size effect. The present work assumes that the unit cell is infinitely small, but if the two scales of structure and microstructure are comparable, size effects need to be considered. Especially for the design optimization problem with wave absorption or wave transmissivity constraint/objective, wave propagation in the structure composed of structured materials must be studied. When the frequency of the external excitations is high, the wave length approaches the intrinsic size of the microstructures of structured materials; so, the actual size of the microstructure will affect the results of the analysis and optimization greatly. The excellent work11,22,23 of Prof. Pao and his colleagues in dynamics and wave propagation in truss and frame structures may provide the benchmark test for approximate numerical methods. References 1. M.F. Ashby and D.R.H. Jones, Engineering Materials 2, Butterworth Heinemann (1998).
2. M.P. Bendsoe, M.P., O.Sigmund, Topology Optimization: Theory Methods and Applications. Springer-Verlag. Berlin (2003).
3. M.P. Bendsoe, N. Kikuchi, Generating optimal topologies in structural design using a homogenization method. Computer Methods in Applied Mechanics and Engineering, 71(2): 197 (1988). 4. G.D. Cheng, On non-smoothness in optimal design of solid elastic plates. International Journal of Solids and Structures, 17: 795 (1981). 5. G.D. Cheng, X. Guo, Epsilon-relaxed approach in structural topology optimization. Structural Optimization, 13: 258 (1997). 6. P.G. Coelho, P.R. Fernandes, J.M.Guedes, A hierarchical model for concurrent material and topology optimisation of three-dimensional structures. Structural and Multidisciplinary Optimization, 35(2): 107 (2008).
Study on Two Scale Design Optimization
217
7. J. Fish and K. Shek, Multiscale analysis of composite materials and structures. Composites Science and Technology, 60: 2547 (2000).
8. P. Fratzl and R. Weinkamer, Nature's hierarchical materials. Progress in Materials Science, 52: 1263 (2007).
9. L.J. Gibson and M.F. Ashby, Cellular Solids: Structure and Properties. Cambridge, Cambridge University Press (1997).
10. X. Guo, G. D. Cheng, A new approach for the solution of singular optimum in structural topology optimization. Acta Mechanica Sinica, 13: 171 (1997).
11. Y.Q. Guo, W.Q. Chen, Y.H. Pao, Dynamic analysis of space frames: The method of
12.
13.
14.
15. 16. 17.
18. 19. 20.
21.
22. 23. 24.
reverberation-ray matrix and the orthogonality of normal modes. Journal of Sound and Vibration, 317: 716 (2008). J.K. Guest, J.H. Prevost and T. Belytschko, Achieving minimum length scale in topology optimization using nodal design variables and projection functions. International Journal for Numerical Methods in Engineering, 61 (2): 238 (2004). S. Hao, W.K. Liu, B. Moran, V. Franck, G.B. Olson, Multiple-scale constitutive model and computational framework for the design of ultra-high strength, Computer Methods in Applied Mechanics and Engineering, 193(17-20): 1865 (2004). R.S. Kumar and M.C. Dowell, D.L., Generalized continuum modeling of 2-D periodic cellular solids. International Journal of Solids and Structures, 41:7399 (2004). R. Lakes, Materials with structural hierarchy. Nature, 361: 511 (1993). L. Liu, J. Yan, G.D. Cheng, Optimum structure with homogeneous optimum trusslike material. Computers and Structures, 86: 1417 (2008). S.T. Liu, G.D. Cheng, Homogenization-based method for predicting thermal expansion coefficients of composite materials. Journal of Dalian University of Technology, 35(5): 451 (1995). S.T. Liu, W.Z. Su, Topology optimization of couple-stress material structures. Structural and Multidisciplinary Optimization, 40: 319 (2010). W.K. Liu and C. McVeigh, Predictive multiscale theory for design of heterogeneous materials. Computational Mechanics, 42(2): 147 (2008). B. Niu, J. Yan, G.D. Cheng, Optimum structure with homogeneous optimum cellular material for maximum fundamental frequency. Structural and Multidisciplinary Optimization, 39: 115 (2009). H.S. Park and W.K. Liu, An introduction and tutorial on multiple-scale analysis in solids. Computer Methods in Applied Mechanics and Engineering, 193: 1733 (2004). Y.H. Pao, D.C. Keh, S.M. Howard, Dynamic response and wave propagation in plane trusses and frames. AIAA Journal 37: 594 (1999). Y.H. Pao, W.Q. Chen, X.Y. Su, The reverberation-ray matrix and transfer matrix analyses of unidirectional wave motion. Wave Motion, 44: 419 (2007). H. Rodrigues, J.M. Guedes and M.P. Bendsoe, Hierarchical optimization of material and structure. Structural and Multidisciplinary Optimization, 24:1 (2002).
218
G. Cheng and J. Yan
25. O. Sigmund, Morphology-based black and white filters for topology optimization. Structural and Multidisciplinary Optimization, 33 (4-5): 401 (2007).
26. B. Wang, G.D. Cheng, L.Jiang, Design of multi-tubular heat exchanger for optimum efficiency of heat dissipation. Engineering Optimization, 40:767 (2008).
27. S. Xu, Y. Cai and G.D. Cheng, Volume preserving nonlinear density filter based on
28.
29.
30.
31.
heaviside functions. Structural and Multidisciplinary Optimization, 41 (4): 495 (2010). S.L. Xu, G.D. Cheng, Optimum material design of minimum structural compliance under seepage constraint. Structural and Multidisciplinary Optimization, 41 (4): 575 (2010). J. Yan, G.D. Cheng, L. Liu, A uniform optimum material based model for concurrent optimization of thermoelastic structures and materials. International Journal for Simulation and Multidisciplinary Design Optimization, 2: 259 (2008). J. Yan, Liu, L., G.D. Cheng, and S.T. Liu, Concurrent material and structural optimization of hollow plate with truss-like material. Structural and Multidisciplinary Optimization, 35: 153 (2008). S. Zhang, R. Khare, Q. Lu, T. Belytschko, A bridging domain and strain computation method for coupled atomistic-continuum modeling of solids. International Journal for Numerical Methods in Engineering, 70: 913 (2007).
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
CHAPTER 10 A CONTINUUM FORMULATION OF LAVA FLOWS FROM FLUID EJECTION TO SOLID DEPOSITION∗
Kolumban Hutter† and Olivier Baillifard‡ c/o Laboratory of Hydraulics, Hydrology and Glaciology Swiss Federal Institute of Technology, Zurich, Switzerland ∗ This
paper is dedicated to Prof. Y.-H. Pao on the occasion of his 80th birthday.
A continuum formulation of lava flow down the mountain side of a volcano is presented. Three regimes are differentiated: Hot, temperate and cold lava. Hot lava is treated as a non-linear viscous heat conducting fluid; temperate lava as a binary solid-fluid frictional mixture and cold lava as a frictional heat conducting granular solid. These regions are separated from one another by mass-exchanging phase change surfaces. At the base, erosion, deposition processes and at the free surface heat transfer by radiation and sensible and latent heat are accounted for. The paper has fundamental character as a model for this kind of gravity flow is presented.
10.1. Introduction 10.1.1. Extended Summary A continuum formulation of lava flow down the mountain side of a volcano is presented. The lava, ejected from a volcanic vent, is treated as a non-Newtonian, strongly thermo-mechanically coupled density preserving fluid, which loses heat at the free surface and the ground by radiation and conduction, respectively. The moving lava will partly solidify at the † Retired
Professor of Mechanics from Darmstadt University of Technology, Germany. Corresponding author:
[email protected] ‡ Now at: Ch. du Byai 19, CH-1934 Bruson,
[email protected]
219
Chapter10
March 31, 2011
15:34
220
World Scientific Review Volume - 9in x 6in
K. Hutter and O. Baillifard
free surface, and this solidification will move into the flowing mass further down. Thus, a new region will form, a binary mixture of solid and fluid constituents at the freezing temperature. This mixture is called temperate lava. Both constituents can be modeled as non-Newtonian viscous fluids. In the temperate region the solid volume fraction will grow until at some distance downstream, close to the deposition area all lava will be solidified. The depositing lava will be treated as a one-constituent body with solid properties. This cold lava will also be modeled as a very viscous nonNewtonian material, but having some plastic (rate independent) properties. The three regimes of hot, temperate and cold lava are bounded by the free and ground surfaces and two interfaces, bounding (i) the hot fluid and the fluid-solid mixture and (ii) the solid-fluid mixture and the cold solid. Of these last two interfaces the first, called hot-temperate transition surface (HTS), has a very low solid fraction on its mixture side, the second, called cold-temperate transition surface (CTS), has a high solid fraction on its mixture side. All boundary and transition surfaces are treated as non-material surfaces. The free surface may be subjected to precipitation of non-buoyant particles of ejected magma, and wind stress and atmospheric pressure. Heat exchange, will involve radiation, sensible and latent heat. At the basal surface, erosion of solids and deposition of fluid and solid constituents give rise to induced topographic changes by the flow. Across the HTS and CTS, mass is exchanged by solidification. They are defined as singular surfaces at which the temperature and the tangential components of the velocities are continuous. 10.1.2. Description of the Physical Problem Avalanches, debris and mudflows are natural phenomena which occur in many regions on our globe. Volcanic eruptions also generate avalanching motions of debris. When first expelled from a volcanic vent, lava is a liquid at temperatures from 1000 K to 1500 Ka . Although lava is quite viscous, with about 100’000 times the viscosity of water, it can flow great distances before cooling and solidifying, because of both its thixotropic and shear thinning properties. The correct description of such lava flow is faced with very difficult thermo-mechanical complexities: a Sources:
www.wikipedia.com and Pinkerton & al. 2002 [74].
Chapter10
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
221
• Mass, momentum and heat may be exchanged (i) at the ground via mass and momentum fluxes and conduction of heat and (ii) at the free surface by precipitation of granular material from the air-borne dust cloud both associated with a momentum and energy inflow. • Phase changes may occur between the fluidized lava and its solid phase, and when conditions are right, also the rocky particles may melt and solidify again. These interior phase changes take place because the moving lava exchanges heat with the atmosphere and the ground. • Lava is made up of a variety of minerals dependent on the tectonic setting of the volcano from which it erupts. This complicates its thermo-dynamic description considerably. Indeed, the fluid-solid phase change processes depend in their details upon how the material is composed. If the material is a pure substance like H2 O, then (H2 O)solid and (H2 O)fluid transform into each other at a well defined temperature. This remains so also for polycrystalline ice that is e.g. formed from fallen snow which transforms into a solid ice polycrystal and may be isotropic or have any other distinguished symmetry according to its history of deformation. The crystallographic compound of ice has a well defined melting temperature of a pure substance. Not so for lava! It consists of several different minerals, a compound substance. The phase change processes in lava occur, in general, selectively because the various chemicals or crystallites respond differently in a transition from the fluid to the solid thermodynamic state. So, in reality there exists a temperature interval TL ≤ T ≤ TU within which lava exhibits properties between a solid and a fluid. At temperature T = TL (L for ‘lower’) or below, the composition of the cell elements comprising lava exhibits properties of a solid. Similarly, at the temperature T = TU (U for ‘upper’) or above, all elements comprising lava exhibit fluid properties. For T ∈ T := [TL , TU ] the material is in an aggregation state which neither exhibits fluid nor solid behavior. This thermodynamic behavior for a polycrystal will be simplified. We assume T to be very small on the temperature scale to be effectively reduced to a single point. This is tantamount to interpreting lava as a pure substance such as water/ice and not a polycrystal consisting of different crystallographic elements. More formally: it will be assumed that our ‘model lava’ is a pure substance having a well defined melting/freezing point.
March 31, 2011
15:34
222
World Scientific Review Volume - 9in x 6in
K. Hutter and O. Baillifard
Thus, lava in a gravity flow may in disjoint regions be a very hot fluid, a mixture of a fluid intermingled with solidified components or a solid body. The mixture will be assumed to be saturated. The solid components within the mixture will, in general consist of particles of different sizes; however, this size difference will be ignored. So, our model lava as a fluid-solid mixture will be assumed to be a saturated binary continuous mixture of which all particles have the same nominal size. Important quantities in the subsequent developments will be the constituent volume fractions, νf and νs , of the fluid and the solid, respectively: να is defined as the volume fraction of constituent α(= s, f ) per unit volume element, να = ∆volα /∆volmix . With this definition, the saturation condition reads α να = 1 and for the binary mixture νs + νf = 1. 10.1.3. Gravity Flow Specifics When lava is flowing out of a volcanic vent, then its state is likely a fluid, Fig. 10.1. The fluid is bounded by rigid rock at the walls of the vent and by the equally rigid bottom of the volcano’s sloping mountain sides, and from above by the free surface, of which the form and position follows from the mass flow of magma M˙ and the pressure conditions at the lava-atmosphere interface. Since heat loss at the free surface is intensive, solidification will first occur at a point (P in Fig. 10.1) in the sloping regime of the free surface. From this point, a surface entering the flow region will mark the transition of the single-constituent region of fluid behavior into the two-constituent region of simultaneous existence of fluid and solid lava. This surface inside the flowing mass separates a pure fluid layer overlain by a fluid-solid mixture, but eventually it will reach the bottom surface at point Q (in Fig. 10.1). Downstream of this interior ‘separatrix’ the gravity flow is at phase change: The fluid and solid ‘particles’ have the same freezing or melting temperature dictated by the ‘Clausius-Clapeyron relation’. At this point it is advantageous to define the following denotations: • Single constituent fluid lava will be called hot lava. • The binary mixture consisting of fluid and solid lava constituents at the phase change temperature will be called temperate lava. • Single constituent solid lava will be called cold lava. The energy balance in the region of temperate lava does not adjust the temperature but solidify some fluid or melt solid grains. These phase transition processes will release or consume latent heat and govern the mass
Chapter10
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
A Continuum Formulation of Lava Flows
Fluid
223
P Atmosphere
Vent
Free surface
Hot lava
Temperate lava S
Basal surface
HTS Rock & soil bed
CTS Cold lava
Fluidsolid mixture Q R
Solid
M
Fig. 10.1. The picture shows an idealized situation where hot lava leaves the vent ˙ . At PQ and beyond some of the fluid lava solidifies and forms a with a mass rate M fluid-solid mixture at the freezing/melting temperature. At SR the solid volume fraction reaches νs = 1. So, beyond, the lava is a solid. The different material constituents are called hot, temperate and solid lava and the interfaces are the hot-temperate transition surface (HTS) and the cold-temperate transition surface (CTS). (Principal sketch)
production of solid mass and the annihilation of fluid mass or vice-versa. The region of temperate lava exists where the solid volume fraction νs lies in the interval 0 < νs < 1; νs = 1 is likely first reached at point S (see Fig. 10.1) of the free surface. Downstream of this point a surface layer of cold lava develops and a new internal surface is formed which eventually will reach the rigid bed in point R (of Fig. 10.1). It is now advantageous to introduce the following nomenclature: • The interior interface separating hot lava from temperate lava (surface PQ in Fig. 10.1) is called hot-temperate transition surface (HTS). • The interior interface separating temperate lava from cold lava (surface SR in Fig. 10.1) is called cold-temperate transition surface (CTS). Remarks • The existence of a region of temperate lava is a conjecture. A simpler assumption would be, if the HTS and CTS would coincide so that fluid lava would directly change over to solid lava on the single interface HTS = CTS.
March 31, 2011
15:34
224
World Scientific Review Volume - 9in x 6in
K. Hutter and O. Baillifard
• Because we assume the ‘model lava’ to be a pure substance, the fluid and solid constituents of temperate lava have the same temperature at the melting/freezing point. • HTS and CTS are non-material surfaces at which phase change processes take place. Therefore we request 1. Across HTS and CTS the temperature is continuous. 2. The phase change processes at the HTS and CTS are reversible. 3. On the basis of the viscous nature of lava flow, the tangential velocities of the solid and fluid constituents on the temperate side of the interfaces HTS and CTS are requested to be the same. 4. The tangential velocities of the fluid on the HTS and of the solid at the CTS are continuous. There are two other singular surfaces, which bound the flow, (i) the free surface and (ii) the basal surface. If precipitation of solid mass from ejected magma is ignored, then the free surface is material. Alternatively, the basal surface may be evolving in time and space because soil material may be eroded from the ground and entrained into the lava mass or lava may be deposited to the ground. 10.1.4. Earlier Theoretical Approaches Effusive eruptions produce lava flows with features that mainly depend on magma rheological properties, effusion rates, and topography (Hulme 1974 [36], Hess 1980 [33], Dobran & al. 1990 [17]). Such flows typically have very complex behavior, such as (i) non-Newtonian rheology, (ii) transitions between different flow regimes (iii) change in the topography during the flow, (iv) lava cooling governed by the coupling of energy transport in the flowing lava with heat transfer from the lava surface into the atmosphere by convection and radiation (Griffiths 2000 [24]), (v) dramatic changes in the kinematic and dynamic behavior with temperature variations because of the strong temperature dependence of the viscosity (Wylie & Lister 1998 [96], Costa & Macedonio 2003 [11]). A complete simulation of all these phenomena does not exist, and current models describe only a part of the observed phenomena. In avalanche and debris-flow research thermal effects are ignored, Pudasaini & Hutter 2004 [76]. The approaches summarized by them aim at describing the velocity distribution, stress and strain rates in the course of an avalanche sliding down a mountain topography. However, none of these describe the temperature behavior inside the avalanche. This simplification of the physical model description works well in snow avalanches, landslides and debris flows, but temperature variations
Chapter10
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
225
will become important for hot materials such as lava or pyroclastic flows, particularly when phase changes occur. 10.1.4.1. Deterministic Models based on Transport Theory Lava flow is currently modeled by a set of highly nonlinear differential equations due to thermomechanical coupling, non-Newtonian behavior, presence of a free surface, and nonlinear boundary conditions in the energy equation (heat loss by radiation). During cooling, levees form laterally, and the flow may exhibit regime transitions. Current computational models commonly assume lava flows as a Bingham liquid with a non-null yield strength, (e.g. Miyamoto & Sasaki 1997 [64]; Robson 1967 [80]; Walker 1967 [91]). Its advantage is that the stress-strain rate relation is linear when the shear stress is sufficiently large. So the equations of motion for a Newtonian fluid can still be used. Isothermal Models Isothermal models of lava flows are a reasonable approximation in describing a limited segment of the flow, where the temperature can be considered approximately uniform (Dragoni 1989 [18]). If one assumes a flow rate and an initial temperature of the liquid at the eruption vent, the temperature decrease due to heat radiation and the consequent change in the rheological parameters can be computed along the flow. The equations of motion are solved by imposing a slow downslope change in the flow parameters, yielding flow thicknesses and velocities as functions of the distance from the eruption vent (Park & Iversen 1984 [72], Dragoni 1989 [18], Ishihara & al. 1989 [46]). Three-Dimensional Models The LavaSIM code (Fujita & al. 2004, [21]) is a three-dimensional CFD model for lava flow simulations based on the algorithm described by Hidaka & Ujita 2001 [34] and Hidaka & al. 2002 [35]. LavaSIM is able to account for three-dimensional convection, spreading and solidification, temperature dependent viscosity of magma and minimum spreading thickness. The model was applied to simulate lava flows at Mt. Fudji. Currently, the high complexity of the dynamics of actual lava flows has not permitted satisfactory development and efficient three-dimensional CFD codes. Thus, simplified models are usually adopted, e.g., 2D models and cellular automata (CA), see Crisci & al. 1986 [13]; Barca & al. 1987 [4]; DiGregorio & Serra, 1999 [23]; Cannataro & al. 1995 [7].
March 31, 2011
15:34
226
World Scientific Review Volume - 9in x 6in
K. Hutter and O. Baillifard
Two-dimensional Models based on Depth-Averaged Equations Costa & Macedonio 2005 [12] presented depth-averaged equations obtained by integrating mass, momentum and energy equations over depth from the bottom to the free surface. Such shallow water equations have been applied to hazard assessments from flood simulation (Burguete & al. 2002 [6]) and tsunami propagation (Heinrich & al. 2001 [31]). For lava flows the viscosity is strongly temperature dependent. Costa & Macedonio 2005 [12] adopted a heuristic equation for the depth-averaged temperature using a simple exponential viscosity temperature relation and applied it to the 1991-1993 Etna eruption. The model showed fair agreement with field observations. Simplified Models Flowgo (Harris & Rowland 2001 [28]) is a self adaptative numerical model that follows lava elements down an open channel. Its basis is the estimation of lava velocity as a Bingham fluid flowing in a channel. At each step in the calculation, heat loss and gain are calculated in order to determine their effects on lava rheology. The model was applied and calibrated by simulating flow fields at Mauna Loa, Kilauea and Etna. Its principal limits are the drastic assumptions of a one-dimensionality and steadiness. 10.1.4.2. Probabilistic Models based on the Maximum Slope Macedonio & al. 1990 [58] assume that topography plays the major role in controlling the lava flow path. The identification of the different zones which are potentially invaded by lava is performed by computing the probability of invasion by using a Monte-Carlo algorithm. The flow is allowed to propagate along random paths, but cannot propagate upward. It must be noted that this scheme does not solve the physical equations but allows fast computations. The program was used to model the eruptions of Mt. Etna of 1983, 1985, 1987 and 1989. 10.2. Kinematics, Balance Laws, Saturation and Constituent Density Constraints Henceforth, scalars will be written as italics, vectors and tensors will be set in bold-faced letters, but if advantageous, indexed Cartesian tensor notation paired with Einstein’s summation convention will be used. Basic continuum mechanical concepts will be explained on the basis of mixture
Chapter10
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
227
theory, from which the single phase expressions are deduced by reduction to just one component or restricting considerations to the mixture as a whole. Particular components of the mixture are identified by lower case Greek subscript labels α, β, etc. 10.2.1. Kinematics of Multi-Phase Mixtures We start from configurations and motions of a three-dimensional non-empty continuum B, the so-called material body. As we are dealing with mixtures, we must consider configurations one for each constituent and motions of a collection of n non-empty continua Bα (for each constituent one), that constitute the body B. An element, Xα , of continuum Bα is understood as a name tag for a specific material particle of constituent Kα . An open set of these elements is called Qα , and its surface is ∂Qα . Every Bα has a set of configurations {καθ }θ∈I with I ⊂ R that are bijective mappings from Bα into connected and compact regions, {Rαθ }θ∈I , in the Euclidian space, E 3 . In other words, one specific καθ assigns a vector X αθ to the material particle Xα at a fixed time θ, i.e., καθ : Bα → E 3 , X αθ = καθ (Xα ) ∈ Rαθ ,
for θ fixed .
(10.1)
X αθ is called the position vector of the corresponding material particle and Rαθ is the region occupied by constituent Kα . Now, let us choose any element from the set of configurations and call it a reference configuration, κα0 . Rαθ and X αθ corresponding to κα0 are then Rα0 and X α0 , respectively. κα0 can be different for every constituent, Truesdell 1984 [89]. The present configuration can be obtained by sequentially applying the mappings κ−1 α0 and καt (see Fig. 10.2), where καt is the mapping from Bα to the present configuration. However, all constituents are located at the same present configuration Rt , i.e. all Rαt fall together to one Rt and thus, in the present configuration we have only one position vector x. This can be expressed as x = καt ◦ κ−1 or x = χα (X α0 , t) , (10.2) α0 (X α0 ) , where καt ◦ κ−1 α0 denotes a mapping that results from sequentially applying and then καt . The symbol χα denotes a vector-valued function of first κ−1 α0 X α0 and t with value x. χα is called the motion of the constituent material body Bα . With these definitions we can now specify the constituent velocity v α , the material derivative dα (·)/dt following the motion of constituent Kα , the constituent acceleration aα , the constituent velocity gradient Lα ,
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
228
Fig. 10.2. Material body Bα of constituent Kα . An open set Qα ⊂ Bα becomes in the reference configuration the material region Ωα0 ⊂ Rα0 ⊂ E 3 with boundary ∂Ωα0 , and the material particle Xα is mapped onto X α0 ∈ Rα0 . Similarly, in the present configuration, Qα is mapped into the open set Ωαt := ωα ∈ Rαt ⊂ E 3 with boundary ∂Ωαt := ∂ωα , and the material particle Xα is mapped onto x ∈ Rαt . {Eαi } (i = 1, 2, 3) is a basis for Kα in the reference configuration, and {ei } (i = 1, 2, 3) is a basis for Kα in the present configuration. Note all different Xα0 (α = 1, 2 , . . . , n) are mapped in the present configuration onto the same point x. So ωα and ∂ωα are the same region ω and boundary ∂ω for all α.
its symmetric, Dα , and skewsymmetric, W α , parts, the constituent deformation gradient F α and the constituent left Cauchy-Green deformation tensor , B α . They are defined as ∂ χα (X α0 , t) d = χ (X α0 , t) v ˆα (X α0 , t) := ∂t dt α X α0 dα χα χ−1 α (x, t) , t = ´α (x, t) , (10.3) =: v ˜α (x, t) =: x dt ∂ 2 χα (X α0 , t) d2 a ˆ α (X α0 , t) := = 2 χα (X α0 , t) 2 ∂t dt X α0 dα2 χα χ−1 α (x, t) , t =: a ˜α (x, t) , = (10.4) d2 t Lα := ∇v α = Dα + W α , F α :=
∂ χα (X α0 , t) , ∂X α0
D α := sym (∇v α ) , B α := F α F T α,
W α := skw (∇v α ) .
Bα = BT α .
(10.5)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
229
´ is an abbreviation for dα (·)/dt. Moreover, |X In these formulae (·) α0 α indicates that the indexed quantity is held fixed among the variables that are indexed; the functions v ˆα and v ˜α take the same values for the same X α0 , t and x evaluated from (10.2)2 , but they are different functions, as they depend on different position vectors. The same is true for the accelerations. 10.2.2. Balance Equations Let ψ α be the true density of a physical quantity. It stands, in particular, for the mass density, momentum density, energy density and entropy density per unit volume of constituent Kα at every point within a region ¯α , which is defined as the ωα . This density must be distinguished from ψ physical quantity per unit mixture volume. In the sequel we shall denote densities carrying an overbar as partial densities and those without a bar as true densities. The two are related by ¯ = να ψ , ψ α α
(10.6)
in which να is the volume fraction density of constituent α. The physical quantity contained in ωα is given by additivity over the volume as ¯α (x, t)dv . Gα (t) = (10.7) ψ ωα
We now postulate the balance law dGα (10.8) = Pα + Sα + Fα , dt where Pα , Sα , Fα are the production, supply and flux densities of Gα , which are given by ψ ¯ψ ¯ψ ¯ α (x, t) + γ Pα (t) = (x, t) dv , S (t) = σ π α α α (x, t)dv , ωα
ωα
(10.9) Fα (t) =
∂ωα
¯ ψ (x, t) · nda , φ α
(10.10)
¯ψ ¯ψ ¯ψ ¯ψ in which π α, γ α, σ α and φ α are partial densities per unit mixture volume and unit mixture area, respectively. The partial production density has ¯ψ been divided into two contributions. We interpret henceforth π α as the ψ ¯ ¯ α is the production (self) production rate of ψ α by constituent Kα , whilst γ ¯α by all constituents other than Kα . For mass, linear and angular rate of ψ
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
K. Hutter and O. Baillifard
230 Table 10.1. Balance Mass Momentum Moment of momentum Total energy Entropy
Chapter10
Densities for the constituent balance relations. ψ
ψα
πα
ψ
φα
ψ
ψ
σα
γα
ρα
0
0
0
ρα cα
ρα vα
0
tα
bα
mα
x × ρα vα ρα εα + 12 vα·vα
0
x × tα
x × bα
Mα
0
tα vα − qα
rα + bα · vα
eα
ρη σα
ρη γα
ρα ηα
ρη πα
φρη α
momentum, energy and entropy these quantities are defined in columns 3 ¯ψ and 6 of Table 10.1. Analogously, σ α is the partial density of the supply ¯α , a source of ψ ¯α outside the body volume ωα . rate of ψ ¯α through the boundary of the body The partial density of the flux of ψ ψ is defined per unit area on the surface ∂ωα and is denoted by φ¯ α . Thus, the total flux through the boundary ∂ωα into the body volume ωα is the surface integral (10.10). In writing down the flux quantity (10.10), Cauchy’s Lemma has been used to express the flux of ψ through the surface as a linear function of the unit vector perpendicular to the surface. The flux and supply terms are listed in columns 4 and 5 of Table 10.1. ¯ ψ (x, t, n) indicates that the flux density The bar in the representation φ α ¯ψ = in (10.10) is referred to a unit mixture area. To assume the relation φ α ψ να φ α , we are postulating in fact that areal and volume fractions are the same. With this we have ψ ψ ψ ψ ψ ¯ψ ¯ ψ , γ ¯α , π ¯ψ . (10.11) ψ α , φα , σ α ¯ α = να ψ α , π α , φ α , σ α , γ α The local balance law for a physical quantity of constituent Kα takes the form ¯α ∂ψ ¯ψ ¯ ¯ψ ¯ψ ¯ψ (10.12) −∇· φ α − ψα ⊗ vα − π α −σ α −γ α = 0 . ∂t The requirement that the mixture as a whole obeys local balance laws as if it were a single material implies the local balance law ∂ψ (10.13) = π ψ + ∇ · φψ − ψ ⊗ v + σ ψ , ∂t for the mixture variables which are related to the constituent quantities via
ψ ¯α , ¯α − ψ ¯α ⊗ v α , ψ := ψ φ φψ − ψ ⊗ v := (10.14)
ψ ψ ψ ψ π := π ¯α , σ := σ ¯α .
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
A Continuum Formulation of Lava Flows
Here and henceforth ρ :=
ρ¯α ,
ρv :=
231
ρ¯α v α .
(10.15)
define the mixture density and the barycentric velocity. The physical balance laws can be written as follows: ρα v α ) − ρ¯α cα = 0 , Rα := ∂ ρ¯α + ∇ · (¯
• mass :
(10.16)
¯α = 0, ¯α−b ρα v α ) − ∇ · (¯tα − ρ¯α v α ⊗ v α ) − m • momentum : Mα := ∂ (¯ (10.17) • energy :
E := ∂ (ρε) + ∇ · (q + ρεv) − t · ∇v − ρr = 0 ,
(10.18)
• entropy :
H := ∂ (ρη) + ∇ · (φρη + ρηv) − σ ρη = π ρη ,
(10.19)
in which we require tα = tT α , which holds true if exchange of angular momentum of constituent α is only due to exchange of linear momentum of the same constituent. In the above
¯ α = 0, (10.20) uα := v α − v , ρ¯α uα = 0 , ρ¯α cα = 0, m α
t :=
tI := q :=
¯tα −
ρ¯α uα ⊗ uα = tI + tD ,
¯tα ,
q¯ α −
α
tD := −
¯tα uα +
(10.21)
ρ¯α uα ⊗ uα ,
ρ¯α εα +
1 2
(10.22)
(uα · uα ) uα = q I + q D , (10.23)
q I :=
q¯ α , q D := −
¯tα uα +
ρ¯α εα +
1 2
(uα · uα ) uα . (10.24)
So, the local balance equations for mass, linear- and angular momentum, energy and entropy for the mixture as a whole are ∂ρ + ∇ · (ρv) = 0, ∂t
(10.25)
∂ (ρv) + ∇ · (ρv ⊗ v) = ∇ · t + ρg, ∂t
(10.26)
∂ (ρ(x × v)) + ∇ · (x × ρv ⊗ v) = ∇ · (x × t) + ρ(x × g), (10.27) ∂t ∂ 1 ( 2 ρv·v+ρ)+∇· ( 12 ρv · v + )v = −(∇·q)+∇·(vt)+ρ(v·g+r) (10.28) ∂t
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
232
∂ρη + ∇ · (ρηv) = −∇ · φρη + πρη + ρςρη . ∂t
(10.29)
Notice that (10.27) can be replaced by the statement t = tT . 10.2.3. Saturation and Constituent Density Constraints Of the three variables ρ¯α , ρα and να , related by ρ¯α = να ρα , να is treated as an internal variable, for which the following balance law for the volume fractions να is postulated Nα =
∂να ¯ α = 0. + ∇ · (να v α ) − n ∂t
(10.30)
Here, n ¯ α is the volume fraction production rate density, for which a constitutive equation must be formulated. Because of the saturation condition, α να = 1, the volume fraction balance law for α = n is given by Nn := −
n−1
n−1
β=1
β=1
∂ ¯n = 0 . νβ + ∇ · v n − ∇ · νβ v n − n ∂t
(10.31)
Obviously, only να , α = 1, . . . , n − 1 are independent fields. In a binary mixture, N2 = −
∂ ν1 + ∇ · ((1 − ν1 )v 2 ) − n ¯ 2 = 0, ∂t
N1 =
∂ ν1 + ∇ · (ν1 v 1 ) − n ¯1 = 0 ∂t
(10.32)
(10.33)
are obtained. In a single constituent material ‘density preserving’ is defined as ρ˙ = 0, i. e., the density does not change along particle trajectories. Via mass balance this is equivalent to volume preserving. In a mixture, we commonly define density preserving of constituent α as the statement ρα = const., α = m + 1, . . . , n (0 ≤ m ≤ n) If the constituent densities ρα are independent constitutive variables, these constraint conditions reduce the number of independent constitutive variables by (n − m). In particular, comparison of the constituent mass balance laws with the volume fraction balance laws implies ¯α , c¯α = n
or
cα = nα ,
for all density preserving constituents.
(α = m + 1, . . . , n)
(10.34)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
233
10.3. Constitutive Laws for Different Aggregation States of Lava The basic concept about the gravity current in this work is that hot lava is a single constituent fluid, viscous and thermodynamically coupled. The material properties of this fluid ensue from a thermodynamic analysis of a heat conducting fluid, which is nearly density preserving. Similarly, cold lava will also be treated as a single-constituent material; however it possesses the properties of a solid, likely with some granular structure. In such materials the formation of an equilibrium heap at zero motion can be achieved if elastic and/or frictional, i.e. plastic effects are accounted for. Most likely, this material is porous with constant true density. By contrast, temperate lava will be treated as a binary mixture of nearly incompressible fluid and solid constituents at the same temperature. That the fluid and solid constituents possess the same temperature is a consequence that hot and cold lava are pure substances and temperate lava is at the melting point. Therefore, only the energy equation for the mixture as a whole must be formulated, and this energy equation is an evolution equation for the (volumetric) mass production rate density cs . Below we discuss in due order the postulations for the constitutive behavior of hot (§10.3.1), cold (§10.3.2) and temperate lava (§10.4). 10.3.1. Material Equations for Hot Lava Hot lava is treated here as a one component body ejected from the volcano and flowing down the mountain side. We assume that this material is a viscous, heat conducting and nearly density preserving fluid. A viscoelastic heat conducting compressible fluid is given by Ψ = Ψ(ρ, D, θ, ∇θ), where Ψ ∈ {, η, q, t}, D = 12 (L + LT ). If elastic effects are ignored, a dependence on ρ is dropped: Ψ = Ψ(D, θ, ∇θ), where Ψ ∈ {, η, q, t}. This is the constitutive class of viscous heat conducting fluids. Hot lava is (nearly) density preserving, however it is not recommended to model it at this stage of the development as strictly density preserving; the reason is that there would be no Clausius-Clapeyron equation in this case. Because at the hot-temperate transition surface the pressure dependence of the melting/freezing temperature may be important, we shall treat hot lava as a compressible fluid for which the compressibility in the bulk may be negligibly small, but density jumps across phase change surfaces will be accounted for.
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
234
Constitutive relations must also satisfy the second law of thermodynamics. It can be shown that the second law implies the following properties: Define the Helmholtz free energy as ψ := − θη and decompose the Cauchy stress tensor t = −p1 + tE into a pressure tensor −p1 and the so-called extra stress tensor tE . Then, for a viscoelastic heat conducting fluid we have 1 1 2 ∂ψ d + pd , , p = ρ , dη = {, ψ, η, p} = fcts(ρ, θ), η = − ∂ψ ∂θ ∂ρ θ ρ {tE , q} = fcts(ρ, θ, D, ∇θ),
tE |E = 0,
q |E = 0.
(10.35) Here, and q |E are the values of t and q in thermodynamic equilibrium. We now wish to specify the material equations for tE , q and ψ if these are of the class Ψ(ρ, D, θ, ∇θ). • Cauchy stress tensor: An isotropic representation of tE of this class is given by tE |E
E
tE = α1+βD +γD 2 +δ ∇θ ⊗∇θ +ξ sym(∇θD ⊗∇θ)+ζ sym(∇θD 2 ⊗∇θ) (10.36) where α, β, γ, δ, ξ, ζ are scalar coefficients which are functions of S, where S is given by S = {ID , IID , IIID , ∇θ, Isym(∇θD⊗∇θ) , IIsym(∇θD⊗∇θ) , IIIsym(∇θD⊗∇θ) , Isym(∇θ D2 ⊗∇θ) , IIsym(∇θD2 ⊗∇θ) , IIIsym(∇θD2 ⊗∇θ) , θ, ρ }.
(10.37)
The symbols I, II, III are the principal invariants of the subscripted second order tensors. For simplicity we assume that the Cauchy stress tensor does not depend on the temperature gradient. Thus, (10.36), (10.37) reduce to tE = α1 + βD + γD 2 ,
(10.38)
where α, β, γ are functions of SRed = {ID , IID , IIID , θ, ρ}.
(10.39)
It is advantageous to separate the spherical and deviatoric parts of the extra stress and stretching tensors
tE = −pN 1 + tE , D = e1 ˙ + D ,
pN := − 31 (tr tE ), e˙ := 13 (tr D),
tr tE = 0, tr D = 0.
(10.40)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
A Continuum Formulation of Lava Flows
235
Substituting these in (10.38) and separating the spherical and deviatoric components yields pN = −α − β e˙ − γ e˙ 2 − 13 γ(tr D2 ), (10.41)
˙ + γ(D )2 . tE = − 13 γ(tr D 2 )I + (β + 2γ e)D
We now simplify these relations by conjecturing that tE and D are collinear, thus γ ≡ 0. Therefore, the non-equilibrium pressure pN and the extra-stress deviator are given by
tE = βD ,
pN = − (α + β e) ˙ ,
(10.42)
in which α and β are functions of e, ˙ IID and IIID such that α|E = 0. Further simplification is achieved by assuming that ˙ α=α ˜ (e, ˙ IID , ρ, θ)e,
α ˜ + β = ζ(e, ˙ ρ, θ),
1 2β
= µ(IID , ρ, θ).
(10.43)
With this, the parameterizations (10.42) take the forms pN = −ζ(e, ˙ ρ, θ)e, ˙
tE = 2µ(IID , ρ, θ)D .
(10.44)
ζ and µ are, respectively, a non-linear bulk and shear viscosity. Note, a dependence of µ on IIID is omitted. We wish to make the following remarks: (i) A density dependence of
the bulk and shear viscosity is generally omitted, pN = −ζ(e, ˙ θ)e, ˙ tE = 2µ(IID , θ)D . (ii) The assumption that the coefficient γ of the constitutive relation is negligibly small has strong support in the literature. Polycrystal melts of metals and alloys are modeled with constitutive relations of the class (10.44). Similarly, polymers and flows of glycerine with particle suspensions and glacier ice, which is a polycrystalline material, very viscous heat conducting fluid, is described by a law of class (10.44). (iii) To ignore a dependence of µ on the third invariant has ample support in the plastic deformation of (polycrystalline) metals and alloys. It has also formal support in deformation experiments of ice. (iv) Because hot lava is a melt of a polycrystal we believe that (10.44) may have a good chance to be a realistic stress model, simply on the basis of analogy. (v) The law (10.44)
is non-linear, in general, despite the fact that tE is affine to D. Linearity only prevails when µ = const. The phenomenological parameter µ is a function of two scalars, IID and θ. It can be product-decomposed as follows: µ(IID , θ) = g(IID )B(θ),
(10.45)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
236
where good choices for g and B are g = (IID )
n−1 2
,
B=
β20
Q . exp − kB θ
(10.46)
Here Q is the activation energy, kB the Boltzmann constant if Q is given in eV and θ the Kelvin temperature. Moreover, n is the power law exponent. • Fourier heat law: The most general isotropic law for the heat flux vector of the class Ψ(ρ, D, θ, ∇θ) has the form (10.47) q = − κ + κ1 D + κ2 D2 ∇θ, where (κ, κ1 , κ2 ) are scalar functions of the parameter set S in (10.37). We now postulate that the heat flux vector does not depend on the strain rate tensor D: q = −κ (θ, ∇θ) ∇θ,
(10.48)
in which κ = κ(θ, ∇θ) is the heat conductivity. However, a dependence on ∇θ is unlikely. • Thermodynamic quantities: We postulate the internal energy of the compressible fluid in the form θ ¯ θ¯ cv (θ)d (10.49) (ρ, θ) = 0 (ρ) + θ0
with a specific heat which depends linearly on temperature cv (θ) = c1v + c2v θ,
(10.50)
implying (ρ, θ) = c1v (θ − θ0 ) +
c2v 2 (θ − θ02 ) + 0 (ρ). 2
(10.51)
θ − θ0 θ0 θ − θ0 ψ 0 c2v + (θ − θ0 )2 − 0 (ρ) + (ρ)θ, (10.52) − ψ = c1v θ ln θ θ0 2 θ0 θ0 η
0 − c2v (θ − θ0 ) + η0 , = −c1v ln θθ0 + 2 θ−θ θ0
η0 := η|θ=θ0 = − ψθ00 (ρ) + ψ0 = 0 − η0 θ0 ,
0 (ρ) , θ0
(10.53)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
237
ψ˜0 (v) θ ∂˜0 (v) θ − θ0 − , ∂v θ0 v θ0 (10.54) in which v := 1/ρ is the specific volume. We introduce the isothermal compressibility as p = ρ2
∂ψ = ρ2 ∂ρ
κθ = −
θ ∂ψ0 ∂0 θ − θ0 (ρ) − (ρ) θ0 ∂ρ ∂ρ θ0
=
1 ∂v 1 1 1 . =− =− 2 v ∂p v ∂p/∂v ∂ ˜0 θ − θ0 ∂ 2 ψ˜0 θ v − ∂v2 θ0 ∂v 2 θ0
(10.55)
With the choices (1)
(2)
˜0 = c0 + ˜0 (v − v0 ) + 12 ˜0 (v − v0 )2 , (1) (2) ψ˜0 = d0 + ψ˜0 (v − v0 ) + 12 ψ˜0 (v − v0 )2 ,
(10.56)
(1) (1) (2) (2) in which c0 , d0 , ˜0 , ψ˜0 , ˜0 , ψ˜0 are constants, we obtain
1 κθ = − θ − θ 0 (2) (2) θ v ˜0 − ψ˜0 θ0 θ0
(10.57)
and
(1) (2) 0 ˜(1) + ψ˜(2) (v − v0 ) θ , − ψ p = ˜0 + ˜0 (v − v0 ) θ−θ 0 0 θ0 θ0 (1) (1) (2) 1 1 ˜(2) ˜ η0 = − θ0 (d0 − c0 ) + (ψ0 − ˜0 )(v − v0 ) + 2 (ψ0 − ˜0 )(v − v0 )2 . (10.58) The number of parameters in these expressions can be reduced by as(2) (2) (1) (1) (2) (2) 0 , ψ˜0 )|| with (˜ 0 , ψ˜0 ) = suming that (i) ||(˜ 0 , ψ˜0 )(v − v0 )|| ||(˜ (1) ˜(1) O(1/ε), 0 < ε 1, with ε so small that ˜0 , ψ0 can be ignored in com(2) (2) (2) (2) 0 for some positive parison to (˜ 0 , ψ˜0 )(v − v0 ). (ii)c0 = d0 , (iii)ψ˜0 = λ˜ 3 λ. With these restrictions we get for (v − v0 ) = O(ε ) κθ = O(ε), (2)
p = ˜0 (v − v0 ) (2)
η0 =
˜ − 21 θ00
θ−θ0 θ0
− λ θθ0
= O(ε),
(10.59)
2
(λ − 1)(v − v0 ) .
10.3.2. Material Equations for Cold Lava In what follows solid lava will be modelled as an isotropic elasto-visco-plastic one-constituent continuum with vanishing porosity. Viscoplastic behaviour will be modeled as a hypoplastic frictional material, first formulated in a thermodynamic setting by Svendsen & al. 1999 [86]. According to this procedure “the phenomenological generalization of the Mohr-Coulomb
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
238
model for internal friction in a granular material at low energy and high grain volume fraction, [here νs = 1] [...] is based upon a Euclidian frame indifferent, stress-like, symmetric-tensor-valued spatial internal variable Z. [...] In the current phenomenological setting, Z is modeled constitutively by an incremental relation of the formb ◦
Z:= Z˙ − [Ω, Z] = Φ
or
◦
Z :=Z −Φ = 0.
(10.60)
Here, Φ represents the constitutive part of (10.60), and the left hand side is a so-called ‘co-rotational’ objective time derivative of the spatial tensor field Z, Ω being the corresponding spin. For example, Ω is given by W in the Jaumann case (relevant, e.g. to the case of hypoplasticity)”, Svendsen & al. 1999 [86]. To this we add that the evolution equation (10.60) has no flux and no external source terms. Apart from (10.60) the governing equations are the physical balance laws • momentum:
ρv˙ − ∇ · t − ρg
• energy:
ρ˙ − ∇ · q − tr (tD) − r = 0,
= 0, (10.61)
in which ρ.v, t, g, , q, D, r are the density, velocity vector, Cauchy stress tensor t = tT , gravity vector g, internal energy, heat flux vector, stretching tensor D = sym∇v, and radiation density, later be set to zero. If x = χ(X, t) then F = ∂χ/∂X is the deformation gradient and L := F˙ F −1 = ∇v = D+W is the spatial gradient of the velocity field and W = skwL the vorticity tensor. Balance of mass has not been written in equations (10.61), because ρ = ρR (det F )−1 , where ρR is the reference density. Thus, since F is necessarily one of the independent constitutive variables and ρR will be taken as a constant, ρ can be treated as known, once F is determined. Constitutive equations are postulated to be of the class Ψ = ˙ ∇θ), where B = F F T is the left Cauchy-Green deΨ(B, D, Z, θ, θ, formation tensor. The second law of thermodynamics is based here on the entropy inequality ρη˙ + ∇ · φ − ς ≥ 0,
(10.62)
in which η is the entropy and φ the entropy flux, whilst ς is the entropy supply and η and φ are constitutive variables. Imagine that concrete, b We use the Lie, [·, ·], and Jacobi, ·, · brackets defined by [A, B] := AB − BA, A, B := AB + BA.
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
239
explicit constitutive relations for {t, q, , η, φ, Φ} are prescribed and assume that they are substituted in (10.61) and (10.60). Then, (10.61) and (10.60) define 10 functional evolution equations for the 10 unknown fields {x, θ and Z}. The entropy inequality must be satisfied for all fields {x, θ and Z}, which satisfy the balance laws (10.61), (10.60) and constitutive relations for {t, q, , η, φ, Φ}. A theorem of Liu then says that the second law of thermodynamics is fulfilled, if the extended entropy inequality ρη˙ + ∇ · φ − ς − λv · {ρv˙ − ∇ · t − ρg} ◦
−λ {ρ˙ + ∇ · q − tr (tD) − r} − tr {ΛZ (Z −Φ)} = π ≥ 0
(10.63)
is satisfied for arbitrary differentiable fields. Here, λv , λ , ΛZ are Lagrangean multipliers, which scalarly premultiply the balance statements (10.61) and (10.60). The exploitation of this extended inequality has been performed by Svendsen et al. 1999 [86]. We quote here their results in a number of propositions. Proposition 10.1. For any admissible thermodynamic process of the outlined constitutive model of cold lava the following relations hold true: (1) The Lagrange multipliers are given by λv = 0,
ΛZ = PZ ,
ˆ (θ, θ), ˙ λ = λ
(10.64)
˙ is a materially independent (universal) function of the ˆ (θ, θ) where λ empirical temperature and its time-derivative. It is called coldness function. (2) Entropy, internal energy, entropy flux and heat flux vectors are related by the one-forms ρdη = λ ρd + P,
where
dφ = λ dq + F , ∂λ q · d∇θ, P = Pθ dθ + PB · dB + PZ · dZ − ∂ θ˙ ∂λ ∂λ ˙ dθ}. F = q{ dθ + ∂θ ∂ θ˙
(10.65) (10.66) (10.67)
(3) If the rotation tensor Ω = W = skwL, then [PB , B] = skw(∇θ ⊗ Pg ),
[PZ , Z] = 0,
(10.68)
but if Ω = W , then [PB , B] + [PZ , Z] = skw(∇θ ⊗ Pg ).
(10.69)
March 31, 2011
15:34
240
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
(4) Heat flux vector and entropy flux vector are collinear, φ = λ q,
(10.70)
ˆ (θ) = 1/θ corresponds to the Duhem relation, which in the case λ = λ ¨ller 1985 [66]. see Mu (5) With the above properties the residual entropy inequality takes the form ∂λ ˙ sym(q ⊗ ∇θ) · D π := Pθ θ + PB , B + λ t + ∂ θ˙ ∂λ q · ∇θ + PZ · Φ ≥ 0. + (10.71) ∂θ Relations (10.65)1 and (10.66) can be combined to yield the Gibbs relation that is appropriate to this theory, namely ∂λ ρ −λ ρd − dη = Pθ dθ + PB · dB + PZ · dZ − q · d∇θ. (10.72) λ ∂ θ˙ This suggests to introduce the generalized Helmholtz free energy ψ = − η/λ and using this expression to eliminate from (10.72). This implies Proposition 10.2. (1) The generalized Helmholtz free energy is not a function of D, ∂ψ = 0. (10.73) ∂D (2) ψ determines all elements of the one-form P, the non-equilibrium entropy η and the heat flux vector as follows: ⎫ ⎧ ∂ψ ⎪ ∂ψ ⎪ ⎪ ⎪ ⎬ ⎨ ∂ 1 ˙ ∂ θ ∂θ , (10.74) − Pθ = ρλ 1 ⎪ ∂ ∂ 1 ∂θ λ ⎪ ⎪ ⎪ ⎭ ⎩ ∂θ λ ∂ θ˙ λ ∂ψ PB = −ρλ , (10.75) ∂B ∂ψ PZ = −ρλ , (10.76) ∂Z Pθ 1 ∂ψ ∂ψ 1 + , (10.77) η=− =− ∂ ∂ 1 ∂ θ˙ 1 ∂θ ρλ ˙ λ ∂θ λ ∂θ ∂ψ λ . (10.78) q = ρ ∂λ ∂∇θ ∂ θ˙
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
241
We remark that all quantities (10.74) - (10.78) are known if the coldness function is known and the generalized Helmholtz free energy is satisfied. Moreover, if θ˙ is not among the independent constitutive variables, then it can be shown that Pθ = 0. Moreover, λ = 1/θ, where θ is the Kelvin temperature and ∂ 1 ∂ψ ( ) = 1 and η = − . ∂θ λ ∂θ (10.78) implies ∂ψ/∂∇θ = 0 for q to be regular, but q is now independent of ψ. Finally, the above proposition gives no information about Φ, the production rate density of Z. Thermodynamic equilibrium Further restrictions are obtained by exploring the imbalance (10.71) for thermodynamic equilibrium processes for which no entropy can be produced. If the independent constitutive variables are partitioned into ‘equilibrium variables’, e = (θ, B, Z), and ‘process vari˙ D) with n = 0 in equilibrium, then the entropy producables’, n = (∇θ, θ, ˆE (e) = π ˆ (e, n = 0) = 0, which tion in equilibrium must vanish, viz., πE = π implies, owing to (10.71), ΦE = Φ(e, 0) = 0 ⇒ Φ = ΦN (e, n), ΦN |E = 0. Here, fE and fN denote equilibrium and non-equilibrium parts of f = fE + fN and (·)|E will signify evaluation of (·) at equilibrium. Since π in the imbalance (10.71) is a differentiable, non-negative function of the process variables n with minimum value at n = 0, we have ∂π =0 ∂n E
n·
and
∂ 2 π n ≥ 0. ∂n∂n E
(10.79)
These assignments imply Proposition 10.3. (1) It follows from the isotropy of the vectorial constitutive function ˙ ∇θ, B, Z, D) that the heat flux vector in thermodynamic equiˆ (θ, θ, q librium vanishes qE = 0
⇒
q = qN .
(10.80)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
242
(2) Performing (10.79)1 yields, in view of (10.80)1 , ∂π ∂Φ = 0 ⇒ ∂∇θ E ∂∇θ = 0, E ∂π ∂Φ = 0 ⇒ P | = −P · , θ E Z|E ∂ θ˙ E ∂ θ˙ E T (10.81) ∂π ∂Φ = 0 ⇒ λ t = −P , B − P , E B|E Z|E E ∂D E ∂ D |E P|E = −PZ|E · ∂∂Φ dθ + P · dB + P Z · dZ. B|E θ˙ E We shall now write 1 , (10.82) θ where from now on θ is identified with the Kelvin temperature. The equilibrium version of Propositions 10.2 & 10.3 then read: ˙ λ = λ E (θ) + λ N (θ, θ),
λ N (θ, 0) = 0,
λ E =
Proposition 10.4 (equilibrium). (1) The equilibrium Helmholtz free energy function is a function of θ, B, Z, viz., (10.83) ψE = ψˆE (θ, B, Z). (2) In thermodynamic equilibrium, with the help of (10.75) and (10.76), (10.68) reduce to ∂ψE ∂ψE ∂ψE ∂ψE B=B , Z=Z , if Ω = W (10.84) ∂B ∂B ∂Z ∂Z and ∂ψE ∂ψE ,B + , Z = 0, if Ω = W . (10.85) ∂B ∂Z (3) ρ ∂ψE ρ ∂ψE , PZ|E = − , PB|E = − θ ∂Z θ ∂B ∂ψE ∂Φ ∂ψE + · , ηE = − ∂θ ∂Z ∂ θ˙ E T ∂Φ ∂ψE ∂ψE ,B + ρ tE = ρ ∂B ∂D |E ∂Z
(10.86) (10.87) (10.88)
(4) The production rate density Φ of the stress like variable Z satisfies the conditions ∂Φ ˙ ∇θ, B, Z, D) = 0. (10.89) Φ = ΦN (θ, θ, ∂∇θ E
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
A Continuum Formulation of Lava Flows
243
The equilibrium Cauchy stress consists of two terms, ∂ψE ρ ,B is the elastic contribution, (10.90) ∂B T ∂Φ ∂ψE is the frictional equilibrium stress. (10.91) ρ ∂D |E ∂Z It is our thesis that the static stress in a volcanic heap is much more of frictional than elastic nature. So, in practical applications (10.91) must be kept, whilst (10.90) may be set to zero. So, (∂ψE /∂B) = 0 may perhaps be a tolerable assumption, whilst ∂Φ/∂D|E = 0, ∂ψE /∂Z = 0 are mandatory ˙ |E = 0. This will requirements. On the other hand, we may set (∂Φ/∂ θ) bring back relation (10.87) to the classical formula ηE = −∂ψE /∂θ. In anticipation of results to be derived in non-equilibrium we choose the Helmholtz free energy in equilibrium in the form E E (θ, IB , IIB ) + ψ12 (θ, IIIB , Z). ψE = ψ1E (θ, B, Z) = ψ11
(10.92)
If we choose E (θ, IB , IIB ) = C1 (IB − 3) + C2 (IIB − 3), ψ11 ρR E E ψ12 (θ, IIIB , Z) = ψ˜12 θ, ,Z , ρ
(10.93)
with constant C1 and C2 the elastic equilibrium stress tE E takes the form E E tE E = (tE )1 + (tE )2
= 2ρ (C1 + IB C2 )B − C2 B 2 + ρR
E ∂ ψ˜12 ∂(ρR /ρ)
! 1.
(10.94)
The first term is due to Mooney & Rivlin (cf. Rivlin & Saunders 1951 [77]) the last term is of frictional origin. If C1 = 12 µ ¯( 12 +β), C2 = 12 µ ¯( 12 −β), then the choice β = 1/2 leads to (tE E )1 = µB,
µ := ρ¯ µ.
(10.95)
which is known as Neo-Hookean behaviour. The last term in (10.94) ˜E gives a further contribution (tE E )2 which is due to ψ12 and defines an elastic E ˜ pressure due to friction. It only arises because ψ12 has been made to depend on (ρR /ρ).
March 31, 2011
15:34
244
World Scientific Review Volume - 9in x 6in
K. Hutter and O. Baillifard
To determine the contribution of the frictional stress to the equiE E librium stress (10.87)2 ∂ψE /∂Z is needed. Writing ψ˜12 as ψ˜12 = E ψ˜12 (θ, ρR /ρ, IZ , IIZ , IIIZ ) one may deduce E E E E ∂ ψ˜12 ∂ ψ˜12 ∂ ψ˜12 ∂ ψ˜12 = + IZ + IIZ 1 ∂Z ∂IZ ∂IIZ ∂IIIZ E E E ∂ ψ˜12 ∂ ψ˜12 ∂ ψ˜12 Z+ − Z + IZ Z 2. (10.96) ∂IIZ ∂IIIZ ∂IIIZ With this representation and a given function Φ the third equilibrium stress contribution is computable as E ∂Φij ∂ ψ˜12 E , (10.97) (tE )3ij = ρ ∂Dkl ∂Zkl E
E which is determined once the constitutive parameterization for Φ and ψ˜12 are chosen.
There remains to choose a parametrization for the production rate density ˙ ∇θ, B, Z, D). The selection Φ(θ, θ, ˆ θ, ˙ ∇θ, B, Z)D + N ˙ ∇θ, B, Z)|D|, ˆ (θ, θ, Φ = L(θ, (10.98) (in which L is a fourth order tensor and N is second order with symmetries ˆ = 0) Lijkl = Ljikl = Lijlk , Nij = Nji ) is able to generate hypoelastic (N ˆ and hypoplastic (N = 0) stress relations. The first term on the righthand side of (10.98) is linear in D, the second is non-linear, since |D| := (D · D)1/2 . In the literature, restricted dependencies of (10.98) with L = ˆ ˆ (B, Z) have been proposed. It follows from (10.98) that L(B, Z), N = N ∂Φ ˆ +N ˆ (·) ⊗ D . = L(·) (10.99) ∂D |D| The equilibrium stress tE can then be formally computed by applying formula (10.87)2 . This leads to ∂ψE ∂ψE ∂ψE D , B + ρLT|E + N |E · . (10.100) tE = ρ ∂B ∂Z ∂Z |D| In this relation, the first term on the right-hand side is the elastic stress, the second represents the hypoelastic contribution and the third is the nonlinear hypoplastic part. With (10.92), (10.94) and (10.97) tE in (10.100) may also be written as ! ˜E ∂ ψ˜12 D E E E T ∂ ψ12 + N |E · . (10.101) tE = (tE )1 + (tE )2 + (tE )3 + ρL|E ∂Z ∂Z |D|
Chapter10
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
A Continuum Formulation of Lava Flows
245
Note that neither (10.99) nor (10.100) are differentiable with respect to D at D = 0 through the non-linear term involving the derivative D/|D| of the Euclidian norm, which is not defined in the limit as D → 0. For a procedure how this can be handled, see Schneider & Hutter 2009 [83]. The literature on hypoplasticity is vastc and concentrates on the parameterization of the operators L and N . These have the general structure ˆ B, Z) = c1 (θ, B)(tr Z)14 + c2 (θ, B) Z ⊗Z , L(θ, (tr Z ) 2 2 (Z ) Z ˆ N (θ, B, Z) = c3 (θ, B)(tr Z) tr Z + c4 (θ, B) dev , (tr Z )
(10.102)
used by Wu & al. 1996 [90] in modeling various types of soils. In the " ij δkl , and the maabove, 14 represents the fourth order unity tensor, 14 =δ terial coefficients c1−4 depend on B through the void ratio; a temperature dependence has been added here for the application of lava flow. Non-Equilibrium Constitutive Parameterization Proposition 10.2 determines non-equilibrium values for Pθ , PB , PZ , η ˙ and ψ(θ, B, Z, θ, ˙ q) are prescribed, whilst and q, if the functions λ (θ, θ) Propositions 10.3 and 10.4 give the corresponding expressions in equilibrium. In line with the results obtained up to now, we may choose (1) λ =
1 θ
˙ − λ N θ,
λ N |E ≥ 0,
(2) t = tE + tN ,
tE given in (10.87)2 & (10.100) tN |E = 0,
(3) q = q E + q N ,
q E = 0,
q N |E = 0,
(4) Φ = ΦE + ΦN ,
ΦE = 0,
ΦN |E = 0,
∂ ΦN ∂∇θ
= 0. E
(10.103)
The choice ˙ − ψ = ψ1 (θ, B, Z, θ)
1 λ
∂λ ∂ θ˙
ψ2 (θ, g 2 ),
g 2 := ∇θ · ∇θ
(10.104)
produces, via (10.78) 2 ∇θ = −κ(θ, g 2 )∇θ, q = −2ρ ∂ψ ∂g 2
κ|E ≥ 0,
(10.105)
in which κ := 2ρ∂ψ2 /∂g 2 is the heat conductivity. Obviously (10.105) is a non-linear version of Fourier’s law of heat conduction. c See
for example Kolymbas 1991 [49], Kolymbas 1993 [50], Wu & al. 1996 [90].
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
K. Hutter and O. Baillifard
246
To construct a parametrization of the non-equilibrium viscous stress we ˙ ∇θ, D). Its isotropic representamake the restrictive choice tN = ˆtN (θ, θ, tion takes the form tN = K1 ID 1 + K2 D + K3 D2 + K4 G + K5 sym(GD) + K6 sym(GD 2 ), (10.106) (j = 1, . . . , 6) are functions of S = where G = ∇θ ⊗ ∇θ and K j ˙ θ, θ, ID , IID , IIID , IG , IGD , IGD2 , in which IA , IIA , IIIA are the principal invariants of A. Notice that with (10.106) the requirement tN |E = 0 is automatically satisfied. To this we add the following remarks: First, most parametrizations in the literature ignore a dependence of tN on G and also omit the D 2 term of the ensuing Reiner-Rivlin constitutive law. This then yields tN = ζID 1 + 2µ(D − 13 ID 1).
(10.107) ˙ ID , IID , IIID . in which ζ = (K1 + K32 )(S ), µ = 12 K2 (S ), S = θ, θ, The coefficients ζ and µ may be interpreted as non-linear bulk and shear m viscosities. Granular shear experiments suggest µ ∝ IID , −1 < m < 0. Then, the dominant dependence on µ is likely on IID with infinite viscosity µ as IID → 0. Such power laws are sometimes referred to as infinite viscosity laws. Because of the singularities of ζ and µ it is computationally advantageous to postulate the constitutive relation for non equilibrium stress inversely as follows
(10.108) D = Fb (ItN )1 + Fs tN − 13 (ItN )1 ,
where Fb and Fs are now a bulk fluidity and a shear fluidity. Second, the above qualitative analysis suggests to postulate that ˆ ID ), ζ = ζ(e, µ=µ ˆ(e, IID , IIID ) (10.109) ˆ Fs = Fˆs (e, IIt , IIIt ) Fb = Fb (e, It ), where D = D − 13 ID 1, tN = tN − 13 It 1, and e is a measure of the porosity, in soil science the so-called void ratio. Explicit formulae for ζ and µ are proposed in Schneider & Hutter 2009 [83], pp 174-182. 10.4. Theory for Temperate Lava 10.4.1. Preliminaries We shall consider temperate lava as a mixture of fluid and solid constituents at the freezing/melting temperature. This binary mixture should be regarded as frictional because such contacts will certainly be present in the
Chapter10
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
247
vicinity of the CTS where the solid volume fraction is close to, or at, unity. A mixture theory of this complexity has been presented very recently by Schneider & Hutter 2009 [83] with a brief summarizing review by Hutter & Schneider 2010 [43] and an application to debris avalanches by Hutter & Schneider 2010 [44]. We can only give a brief account of this theory. A large part of the ensuing presentation of the model requires detailed notation and painstaking reproduction of rather complicated results from [43], [44]. We shall at first address general facts and results mentioned in [43], [44] which are valid for arbitrary unidentified constituents Kα (α = 1, ..., n is the identifier of the constituent). The balance laws of mass and linear momentum for each constituent and the energy and entropy balances of the mixture can be written in the forms ∂ ρ¯α + ∇ · (¯ ρα v α ) − ρ¯α cα = 0; Rα := (10.110) ∂t Mα :=
∂ ρ¯α v α ¯ α = 0. + ∇ · (¯ ρα v α ⊗ vα − ¯tα ) − ρ¯α g α − m ∂t
E :=
∂ρ + ∇ · (q + ρv) − t · ∇v − ρr = 0; ∂t
(10.111) (10.112)
∂ρη + ∇ · (φρη + ρηv) − σ ρη = πρη . (10.113) ∂t Volume fractions να are treated as generic variables, for which we postulate the flux free balance law (10.30). Moreover, just as for cold lava, we will also introduce for each constituent a frictional stress like variable according to ◦ ◦ ¯ ¯ α = 0, ¯ α Ωα ). ¯α − Φ ¯ α = ∂ Z α + (∇Z ¯α − Z ¯ α )v α − (Ωα Z Z Zα := Z ∂t (10.114) Formula (10.114)2 is an objective time derivative and Ωα = skwLα is the skew-symmetric spin tensor of constituent α. It is natural to apply the frictional tensors Z α only to the solid constituents while they are set to zero for the fluid ones. The mixture model which we envisage to be appropriate for temperate lava is also assumed to satisfy the saturation condition n
να = 1. (10.115) H :=
α=1
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
248
It states that there is no material free space within the mixture. Equation (10.115) means that only n − 1 volume fractions are independent: νn = n−1 1 − α=1 να . Obviously, then ⎞ ⎞ ⎛ ⎛ n−1 n−1
∂ ⎝ ⎠ ¯ n = 0. (10.116) νβ + ∇ · v n − ∇ · ⎝ νβ v n ⎠ − n Nn := − ∂t β=1
β=1
We also assume that (n − m) constituents are density preserving, ρα = const,
α = m + 1, . . . , n
(0 ≤ m ≤ n),
(10.117)
i.e. these true densities remain constant. Condition (10.117) reduces the number of independent constitutive variables by (n − m). In particular, comparison of (10.110) with (10.117) yields c¯α = n ¯ α , or cα = nα (α = m + 1, . . . , n) for all density preserving constituents. This is an important result for temperate lava since for the solid cs (α = s) is the solid production rate and for the fluid cf = −cs is the annihilation (negative production) rate of the fluid due to freezing. Furthermore, in a mixture of which all constituents are density preserving the barycentric velocity is not necessarily solenoidal. This follows from ρ¯α = να ρα , because with constant true density the partial density can still vary owing to variations of the volume fractions. Since in this case (10.111) and (10.117) collapse to the same equations, one of them is simply omitted as a field equation. Schneider & Hutter [83] chose the constitutive equations to be of the form ˆ Ψ = Ψ(S), ˙ ∇θ, ρ, ∇ρ, ν , ∇ν, v , B, D, W ,Z S = θ, θ,
ν := ν1 , . . . , νn−1 ρ := ρ1 , . . . , ρm , := B 1 , . . . , B n , v := v 1 , . . . , v n , B := D 1 , . . . , Dn , := W 1 , . . . , W n , D W := Z 1 , . . . , Z n , Z in which every dependent constitutive quantity of the set
¯α ¯ α , q, φρη , tα , Φ ¯α, m Ψ = , η, cα , n is an isotropic function of the variables S.
(10.118)
(10.119)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
249
10.4.2. Entropy Principle The union of the balance laws of mass, linear momentum, volume fraction and stress like second order tensor variables and constitutive relations for the variables (10.119) form a set of non-linear partial differential equations for the variables ¯ 1, . . . , Z ¯ n. θ, ρ1 , . . . , ρm , ν1 , . . . νn−1 , v 1 , . . . , v n , Z ' () * ' () * compr.
(10.120)
saturation
These equations must be in conformity with the second law of thermodynamics, here formulated as an entropy inequality, ∂(ρη) − ∇ · (φρη − ρηv) − σ ρη ≥ 0. ∂t
(10.121)
Satisfaction of (10.121) subject to the constraints (10.110)-(10.114) and (10.116) is equivalent to the fulfillment of the extended inequality ∂(ρη) ∂t
−
m − ∇ · (φρη − ρηv) − σ ρη − α=1 λρα Rα n−1 λνα · Mα − λ E − α=1 λνα Nα − λνn Nn − λZ α · Zα ≥ 0
(10.122)
for arbitrary independent fields, Liu [56]. In this imbalance the various λ’s are Lagrange multipliers to be determined together with restrictions of constitutive quantities as results of the exploitation of (10.122). The sum n mation in (10.122) must (and will henceforth) be interpreted as α=1 . The explicit exploitation of inequality (10.122) is long and tedious; in principle, it works as follows: The source free balance laws (10.110)-(10.114) and (10.116) are substituted in (10.122) (with σρη = 0) together with the constitutive relations (10.119) of the form (10.118) and in the emerging inequality all differentiations of constitutive quantities are performed. In this final expression it is convenient to introduce the one-forms P=
K
PxI dxI := d(ρη) − λ d(ρ),
(10.123)
I=1
F=
K
I=1
F xI dxI := dφ(ρη) + λ (dq) −
(d¯tα )λνα ,
(10.124)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
250
in which xI (I = 1, . . . , K) are the constitutive variables, and the auxiliary variables ρα := λρα + λvα · v α , αρI := ρα + 12 λ (uα · uα ), ν ρ ν ανI := να + 12 ρα (uα · uα ) − k, α := ρα α + λα , T −1 k := 12 λ ρn (un · un ), Γ := λ ρ t + (η − λ )1 = Γ , s := −νn , s := sv n + ρn Γun , (10.125) as well as the so-called extra entropy flux vector
¯tα λvα . k := φρη + λ q − (10.126) After this formidable transformation the entropy inequality can be expressed as ∗
X(Y) · Y +Γ(Y) ≥ 0,
∗
∀ Y∈ RKD .
(10.127)
Here, Y collects the independent constitutive variables arranged as a vector ∗
of dimension K, Y are time and space derivatives of Y, again arranged as a vector of dimension KD > K; moreover X is a matrix of dimension K × KD and ‘·’ is the scalar product in RKD . Finally, Γ(Y) is a scalar function of ∗
Y. Since (10.127) is linear in Y, its solution is given by ∗
∗
X(Y) · Y ≡ 0 ∀ Y∈ RKD ,
and
Γ(Y) ≥ 0.
(10.128)
Relations (10.128)1 are called Liu-identities, whilst (10.128)2 is referred to as residual entropy inequality. 10.4.3. Inferences Implied by the Liu-Identities (10.128)1 represents a great number of identities. To be able to derive explicit results from them, the following assumptions are needed: (i) λ is a function of the empirical temperature and its total time derivative ˆ (θ, θ). ˙ The function λ ˆ (θ, θ) ˙ is called the coldness function. (ii) The λ = λ Lagrange multiplier of the linear momentum of constituent α is selected as λvα = −λ uα , in which uα := v α − v is the diffusion velocity. ¨ller Both assumptions have proven precursors in simpler theories, Mu 1985 [66]. It is also convenient to introduce the following quantities: (1) The Helmholtz-like free energy 1 G , ψ G := − η = ψIG + ψD λ
ψIG := I −
1 η, λ
G ψD := D ;
(10.129)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
251
(2) the partial thermodynamic pressure of constituent α p¯G α := ρρα
∂ψIG , ∂ρα
α = 1, . . . , m;
(10.130)
α = 1, . . . , n − 1;
(10.131)
(3) the configuration pressure βαG = ρ
∂ψIG , ∂να
(4) the saturation pressure νn 1 s = − = ρn un · v n − (ρn λρn + λνn ) ; λ λ λ (5) the inner parts of the constituent Gibbs free energies ς :=
µαGI := −
1 ρ λ αI
(α=1,...,m)
=
ψIG +
pG α . ρα
(10.132)
(10.133)
On the basis of the above assumptions and definitions Schneider & Hutter [83] prove Proposition 10.5. (i) The internal energy , entropy η and Helmholtz-like free energy ψIG ∇ν, D cannotdepend on ∇ρ, and W and are thus only function of ˙ ∇θ, ρ, ν , v , B, Z . Moreover, λvα = −λ uα also rules out SR = θ, θ, a dependence of ψIG on any of the v α : ψIG = ψIG (SR \ {v }). ˆ (θ) = 1/θ, where θ is now the Kelvin If θ˙ ∈ / S, then λ = λ temperature. Furthermore, = (SRR ) and η = η(SRR ), whilst G G ˙ ∇θ . ψI = ψI (SRR \ {v }), with SRR = SR \ θ, G G (ii) p¯G α , βα and µα I as defined in (10.130), (10.131) and (10.133) are all derivable from ψIG and are functions of SR \ {v }. Alternatively, using the auxiliary variables (10.125) and (10.132), (10.133) it follows that ⎧ ∂(ρψIG ) ∂(ρψG ) ⎪ ⎪ ρα ∂ραI = να ∂να + ια + ς I , α = 1, . . . , m, ⎨ G ρ¯α µαGI = να ∂(ρψI ) + ια + ς I , α = m + 1, . . . , n − 1, ∂να ⎪ ⎪ ⎩ α=n να (ια + ς I ), (10.134) where λν ς I = ς − 12 ρn (un · un ). (10.135) ια := α , λ
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
252
(iii) All Lagrange multipliers except λνn can be expressed in terms of thermodynamic quantities. Specifically, with the Heaviside step function H, pG α + 12 uα · uα ), α = 1, . . . , m, (10.136) ρα λνα = (−ρα ρα )H(m − α) − λ (ς + (ρα − ρn )ψIG + βα (10.137)
λρα = λ (uα · v α − ψIG −
+ 12 (ρα uα · uα − ρn un · un )) = λ ια ,
α = 1, . . . , n − 1
(iv ) The coefficient functions of the one-form P and the entropy η are given as follows: Pθ˙ P∇ρ P∇να PD α PW α
= 0, = 0, = 0, = 0, = 0,
α = 1, . . . , m, α = 1, . . . , n − 1, α = 1, . . . , n, α = 1, . . . , n,
Pρα = ¯αρI ,
α = 1, . . . , m,
Pνα = ¯ανI + s,
α = 1, . . . , n − 1,
Pθ = −ρλ
∂ ∂θ
1 λ
η+
∂ψIG
⇒
∂θ
η=−
∂ψ G
G ∂ψI ∂θ
∂ ∂θ
P
+ λθρ
1 λ
.
I , P∇θ = −ρλ ∂∇θ ⎧ ∂ψ G ∂ρ ⎪ α = 1, . . . , m, ⎨ −ρλ ∂ BIα − λ να ψIG ∂ Bαα , PB α = ⎪ ⎩ −ρλ ∂ψIG , α = m + 1, . . . , n, ∂B
PZ¯ α (= λZ α ) = −ρλ
∂ψIG ¯ , ∂Z α
and
α
α = 1, . . . , n, (10.138)
η=−
∂ ∂ θ˙
1
1 λ
∂ψIG ∂ θ˙
,
(10.139)
provided θ˙ is an independent constitutive variable. If θ˙ ∈ / S, then Pθ = 0. Thus, from (10.138)8 , η=−
∂ ∂θ
1
1 λ
∂ψIG ∂θ
( λ1 =θ)
=
−
∂ψIG . ∂θ
(10.140)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
253
Moreover, the only undetermined field variable is a quantity proportional to −νn , defined in (10.132) s = −νn or ς = s/λ and ς is called saturation pressure. It enters the theory as a new independent field quantity and replaces the nth volume fraction, that is lost as an independent variable by the saturation condition. The Liu identities also involve expressions of the vector one-form F . Exploring these identities, Schneider & Hutter [83] prove Proposition 10.6. (i) The coefficient functions of the one form F are given as F ∇θ F ∇ρα F ∇νβ F Dα
= = = =
0, 0, α = 1, . . . , m, 0, β = 1, . . . , n − 1, 0, α = 1, . . . , n,
FWα F θ˙ F Z¯ α F Bα
= 0, α = 1, . . . , n, = P∇θ , = −uα ⊗ PZ¯ α , α = 1, . . . , n, = −uα ⊗ PB α , α = 1, . . . , n, (10.141) (ii) The extra entropy flux vector possesses the representation (10.142) k = − 12 λ K α (S)uα , 1 K α = ¯tα − tr (¯tα )1 + {F vα − tr (F vα )1} ; λ
(10.143)
(iii) the entropy flux vector is given by
⎧ 1 1 ⎨ −λ q + ( ¯tα − 3 tr (¯tα )1)+ 3 tr (¯tα )1 uα if k = 0, 3 ¯ 1 ¯ φρη = −λ q + 2 tα − 3 tr (tα )1 uα ⎩ 1 −1
0, + 2 (λ ) {F vα − tr (F vα )1} uα , if k = (10.144) (iv ) In view of (10.141)6 an alternative expression for P∇θ is given by ∂λ ∂λ ¯ P∇θ = F θ˙ = − q − 32 tα − 13 tr (¯tα )1 uα (10.145) ˙ ˙ ∂θ ∂θ
∂ − 21 λ (¯tα − tr (¯tα )1)uα ∂ θ˙
∂ − 21 (F v¯ α − tr (F v¯ α )1)uα , ∂ θ˙ which reduces to P∇θ = 0 when θ˙ ∈ / S.
So, k, φρη and P∇θ are given by combinations of the diffusion velocities with tensorial coefficients which are given by constitutive quantities. Moreover,
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
254
the entropy flux vector is not collinear to the heat flux vector, not even when k = 0. Interestingly also, k, φρη depend on F vα and P∇θ on derivatives ˙ of it with respect to θ. Most interestingly also in view of (10.138)9 and (10.145), two different expressions for P∇θ are at our disposal, which obviously must be equal. This yields the following expression for the “heat flux vector”: Proposition 10.7. (i) If θ˙ ∈ S the heat flux vector in temperate lava is determinable from λ ∂ψIG
− q=ρ · Qα (S)uα , (10.146) ∂λ ∂∇θ α ∂ θ˙ 1 λ ∂ ¯ 3(¯tα − 13 tr (¯tα )1) + ∂λ Qα = (t − tr (¯tα )1) (10.147) ˙ α 2 ∂ θ ˙ ∂θ 1 ∂ (10.148) (F − tr (F vα )1) . + ∂λ ˙ vα ˙ ∂θ ∂θ
(ii) If the Helmholtz-like free energy is assumed to have the form ψIG
=
(ψIG )1 (SRR
\ {v α }) −
∂λ ∂ θ˙ ψ G (|∇θ|2 , θ), ρλ 2
(10.149)
then formula (10.146) produces the law ∂ψ2G q := −2ρ (|∇θ|2 , θ) ∇θ − ∂|∇θ|2 ' () * = κ, Fourier contribution
'
α
Qα (S)uα ()
(10.150)
*
Diffusive contribution
(iii) If θ˙ ∈ / S, then the Helmholtz-like free energy does not depend on ∇θ ˆ (θ) = 1/θ, P∇θ = 0, F ˙ = 0 [see Proposition 10.5, item (i)], λ = λ θ and the heat flux vector cannot be determined from the Liu identities. In this case information of the diffusive contribution to q is lost and a simple Fourier law ensues. The results (10.146)-(10.150) are significant, because they split q into a conductive Fourier contribution and a diffusive contribution, with the structure of flux of ‘power of working’. So, it is more appropriate to call q ‘energy flux’ with a (heat) conductive and a convective-diffusive contribution.
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
255
10.4.4. Implications of the Residual Entropy Inequality The residual entropy inequality (10.128)2 , when written in long hand, shows that the entropy production Γ is a function of the variable set S, see (10.118). This set can be written as S = e ∪ n, where e := ˙ ∇θ, v , D, ∇ν, B, Z , n := θ, W are called the equilibrium θ, ρ, ν , ∇ρ, and non-equilibrium variables, respectively. A process is now called a thermodynamic equilibrium, if no entropy is produced, Γ = 0. This is the case, if ˙ (i) the non-equilibrium variables, n vanish i.e. n = θ, ∇θ, v , D, W = 0, (ii) the constitutive quantities of the frictional production rate densities, Φα , (α = 1, . . . , n), are zero, lim n→0 Φα = Φα |E = 0 (α = 1, . . . , n), (iii) the interaction rate densities for mass and volume fraction also vanish lim n→0 cα := cα |E = 0, lim n→0 nα := nα |E = 0, (α = 1, . . . , n). It follows that Γ|E = 0, while else Γ ≥ 0. So, Γ takes a minimum in thermodynamic equilibrium. Of necessity then, ∂Γ =0 ∂nI E
∂ 2 Γ ∀nI ∈ n, is non-neg. definite, {nI , nJ } ∈ n. ∂nI ∂nJ E (10.151) Evaluation of (10.151)1 is straightforward; it implies Proposition 10.8. The following equilibrium properties can be derived from (10.151)1 : • If nI = v β : m
1 − (λε )−1 k,ρ v ∇ρα ¯ β E = δαβ − ξ¯β να (ρα )−1 pG m α E α β E E α=1
+
n−1
α=1
−
δαβ − ξ¯β ζα E 1 − c,vβ E − (λε )−1 E k,να vβ E ∇να
n
¯ α ,v ρ ψIG ,Z¯ α E Φ β E
α=1
−
n
ρ¯α µGI E (cα ),v β E − ια E (¯ . n α ),v β E α
α=1
(10.152)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
256
• If nI = Dβ : ¯tβ = −ζ¯β 1 + 2ρ sym (ψIG ),B β B β + (λε )−1 k,vβ E E E E E + +
¯ α ),D ρ ψIG ,Z¯ α E (Φ β E
ρ¯α µαGI E (cα ),D β E − ια E (¯ nα ),D β E .
(10.153)
˙ • If nI = θ: Pθ E =
n
¯ α ), ˙ ρ ψIG ,Z¯ α E · (Φ θ E
λε E
α=1
n
ρ¯α µαGI E (cα ),θ˙ E − ια E (¯ +λ E nα ),θ˙ E . ε
α=1
(10.154) • If nI = ∇θ: n
¯ α ),∇θ ρ ψIG ,Z¯ α E · (Φ (λε ),θ E q E = λε E E α=1
n
ρ¯α µαGI E (cα ),∇θ E − ια E (¯ + λε E nα ),∇θ E . (10.155) α=1
in which the following abbreviations have been used:
−1 G ∆∗α 1 + 12 (uα · uα )1, (10.156) tD − (λε )−1 sym (P∇θ ⊗ ∇θ) − ψD D := ρ ζα :=
βαG − ρn ψIG + ς,
α = 1, . . . , n − 1,
−ρn ψIG + ς,
α = n,
c := ρn ∆∗n D + ζn 1 un ,
ρ¯α uα ⊗ uα . tD := −
(10.157) (10.158) (10.159)
α
¯ β |E exhibit dependences on ∇ρα The interaction forces in equilibrium, m (α = 1, . . . , m) and ∇νβ (β = 1, . . . , n − 1), including dependences on the thermodynamic configuration and saturation pressures via pG α (α = 1, . . . , m), ζγ , (γ = 1, . . . , n) and the extra entropy flux, k. If all constituents are density preserving, then no equilibrium interaction force contributions involving ∇ρα arise.
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
257
The thermodynamic pressures do not enter the constituent equilibrium stresses; only the configuration pressure arises through the pressure like variable ζβ |E . Compressibility can be accounted for by the elastic stress 2ρsym((ψIG )Bβ )|E B β . This stress has both dilatation and shearing contributions. However, equilibrium stresses are less due to elastic and more due to frictional effects, which will be expressed through the production rate ¯ α. density Φ Let us next concentrate on the contributions to (10.152)-(10.155) due ¯ α , cα and n to the production rate densities, Φ ¯ α . These contributions all ¯ α , cα and n ¯ α with prefactors which appear as products of derivatives of Φ ¯ β|E , ¯tα|E , Pθ|E and q |E . These factors are the same in the expressions of m are ¯α ρ ψIG ,Z¯ α |E for the friction term Φ ρ¯α µGI |E
for the mass production cα
ια |E
for the volume fraction production n ¯α .
α
(10.160)
and they are, in turn, multiplied ¯ iβ |E for m for ¯tα |E for Pθ |E for q|E
¯ α ,v |E , with Φ β ¯ with Φα ,D β |E , ¯ α , ˙ |E , with Φ θ ¯ α ,∇θ |E , with Φ
(cα ),vβ |E , (¯ nα ),v β |E (cα ),Dβ |E , (¯ nα ),Dβ |E (cα ),θ˙ |E ,
(¯ nα ),θ˙ |E
(10.161)
(cα ),∇θ |E , (¯ nα ),∇θ |E
and subsequently summed to reveal the corresponding representations arising in (10.152)-(10.155). The two lists (10.160) and (10.161) disclose the thermodynamic structure of the various contributions to the equilibrium quantities very clearly and make the rationals of simplifications rather sim ¯ α , cα , n ple. For instance, if we have reason to assume that Φ ¯ α ,θ˙ = ˙ ¯ ¯ α ,∇θ = 0, while θ, ∇θ -dependences are kept in the ener0 , Φ α , cα , n
gies I , ψIG , η then, provided θ is identified with the Kelvin temperature (λ |E = 1/θ), then η|E = −∂ψIG /∂θE and q|E = 0, which are most welcome properties. A frictional contribution to the equilibrium stress is important for temperate lava, but corresponding contributions due to melting/freezing and
March 31, 2011
15:34
258
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
¯ α /∂Dβ |E = volume fraction are unlikely. So, we must assume that ∂ Φ 0 for some α and β but may choose ∂cα /∂Dβ = 0, ∂ n ¯ α /∂Dβ = 0. Anal¯ α , cα and ogously, one may have reason to ignore any dependences of Φ i ¯ β|E are independent of n ¯ α on the velocities v α , which then implies that m ¯ α , cα and n Φ ¯ β . Caution is, however, recommended for temperate lava with such ad hoc assumptions since the melting, freezing rate is dictated by the energy equation. 10.4.5. Simplifying Assumptions for Lava Flows The preceding theoretical results hold for a rather general mixture theory with an arbitrary number of constituents. We now apply them to the simplified situation appropriate for lava flows. The simplifications we wish to impose are: • restriction to a binary mixture of solid (α = s) and fluid (α = f ) constituents, • both constituents are density preserving, ρs = const, ρf = const. ⇒ ns = cs , • mass production is exclusively due to melting/freezing such that cs > 0 ⇔ cf < 0 for solidification. Because both constituents are density preserving, mass and volume fraction balance laws collapse to the same equations. The two mass balance equations take the forms ∂νs + ∇ · (νs v s ) = νs cs , ∂t ∂νs − + ∇ · (v f ) − ∇ · (νs v f ) = −νf cs , ∂t and adding the two equations yields ∇ · νs v s + (1 − νs )v f = cs (νs − νf ) = (2νs − 1)cs . ' () * v vol
(10.162) (10.163)
(10.164)
So, the balance of mass equations can be used as (10.162) & (10.163) or as (10.162) & (10.164), respectively. Equation (10.164) states that the volume averaged mixture velocity v vol is only solenoidal when cs ≡ 0, i.e., when no melting and no freezing occur. • We ignore θ˙ as an independent constitutive variable. Then, according to Proposition 10.5 (i) ψIG = ψˆIG (SRR \ {v}) = ψIG (θ, ν , v , B, Z), (ii) λ = 1/θ, where θ is the Kelvin temperature, (iii) η = −∂ψIG /∂θ (Pθ = 0), (iv) P∇θ =
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
A Continuum Formulation of Lava Flows
259
0, F θ˙ = 0. (v) The heat flux vector cannot be determined from the Liu identities, and generally it contains only a Fourier component, cf. Proposition 10.7 item (iii). It is in this case, however still meaningful to compose the ‘energy flux vector’ q = q c + q l of two components q c the conductive heat, and q l , the latent heat due to melting as explained below. • It will be assumed that only the solid constituent exhibits contact frictional properties described by the symmetric tensor variable Z s ; Z f = 0. • The extra entropy flux vector will be set to zero, k ≡ 0, cf (10.144). 10.4.6. Parameterization of the Constituent Equilibrium Stresses and Interaction Forces All constitutive quantities can additively be split into equilibrium and nonequilibrium parts, ¯ts,f = ¯ts,f |E + ¯ts,f |N ,
¯s = m ¯ s|E + m ¯ s|N . m
(10.165)
The equilibrium contributions of (10.165) are stated as (10.152) and (10.153) and reduce with the above simplifications to n−1
¯ β E = δαβ − ξ¯β ζα E 1 − c,vβ E ∇να m
(10.166)
α=1
n n
G ¯ ρ¯α µGI E (cα ),vβ E , − ρ ψI ,Z¯ α E Φα ,vβ E − α=1
α=1
¯tβ = −ζ¯β 1 + 2ρ sym (ψIG ),B β B β E E E +
α
¯ α ),D + ρ¯α µαGI E ρ ψIG ,Z¯ α E (Φ β E
(10.167) (cα ),D β E ,
∗α −1 G − (λε )−1 sym (P∇θ ⊗ ∇θ) − ψD 1, c := ρn ∆∗n D + ζn 1 un , ∆D := ρ ζα :=
βαG − ρn ψIG + ς, α = 1, . . . , n − 1, −ρn ψIG + ς,
α = n,
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
260
with α, β = 1, 2 and n = 2; we shall identify α = 1 with the solid and α = 2 with the fluid. The solid stress in equilibrium consists of the contributions G ¯ts|E = −ζ¯s|E 1 + 2ρsym(ψI,B )B s + ρψ G ¯ (Φs ),D s |E ' () * ' () s * ' I,Zs|E() * ¯ tcs
+ '
ρ¯s µsGI
¯ tes
(cs ),Ds |E + ρ¯f µGI f |E ()
¯ tf ric
|E
(cf ),D s |E
¯ tsmelt
(10.168) *
Using (10.134)-(10.137), a somewhat lengthly calculation shows that G ¯tsmelt = νs ∂(ρψI ) + (ρs − ρf )ψIG + βs (cf ),D s |E . (10.169) ∂νs (10.168) and (10.169) require further modelling. To this end we choose a free energy function ψIG which allows ‘separation’ of the different influences and assume G G (B s ) + ψˆef (B f ). ψIG = ψˆfGric (νs , Z s ) + ψˆes
(10.170)
Here the indices ‘fric’, ‘es’, and ‘ef’ stand for ‘friction’, ‘elastic-solid’ and ‘elastic-fluid’. The last two terms in (10.170) are thought to account for elastic contributions of the solid and fluid, respectively, and ψˆf ric subsumes the frictional dependences which are known to strongly depend on the packing of the solid grains. G G Solid elastic stress: If ψes = ψˆes (IB s , IIB s ) is expressed as a function of the first two invariants of B s (but not of IIIB s for simplicity), then it is relatively easy to show, see Hutter & Schneider 2010 [43], that ! G ˆG ˆG ∂ ψ ∂ ψˆes ∂ ψ es es ¯tes = 2ρ B s − 2ρ + IB s B2. (10.171) ∂IBs ∂IIBs ∂IIB s s G = C1 (IB s −3)+C2 (IIB s − For a Mooney-Rivlin elastic body we have ρψˆes 3), (cf. [77]) which yields the solid elastic stress as
¯tes = 2(C1 + C2 IB s )B s − 2C2 B 2s
C2 =0, 2C1 =µ
=⇒
¯tes = µB s . (10.172)
Fluid elastic stress: Since the fluid stress in equilibrium must be spherical G is on the third invariant of B f . Thus the only possible dependence of ψef G G ψef = ψˆef (IIIB f ) and G ¯tef = 2ρsym(ψI,B )B f = 2ρ f
G G ∂ ψˆef ∂ ψˆef ∂IIIB f B f = 2ρIIIB f 1. ∂IIIB f ∂B f ∂IIIB f (10.173)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
261
This parameterization accounts for the compressibility of lava. Even though it must be small, it is responsible for the pressure dependence of the melting temperature. A fairly reasonable parameterization would be IIIB f , (C − IIIB f ) (10.174) which generates a stress singularity when IIIB f approaches C, a measure for the densest possible packing. Because 1 < IIIB f < C for a compacting fluid volume, (10.174) generates a negative stress ¯tef (= pressure). G ψˆef = A ln(B(C − IIIB f )2 ),
C > 0,
⇒
¯tef = −4ρA
Frictional part of the solid stress: This stress is given by G ¯tcs = ρψI,Z (Ψ),D s |E , s |E
(10.175)
and the difficulty is to find ψˆIG (ν, Z s ) such that the stress ¯tcs is given by an expression that agrees with the stress relations of hypoplasticity. To this end we choose, see Teufel [87], ¯tcs = ρδZ s ,
δ = const.
(10.176)
Substituting (10.176) into the evolution equation for Z s , (10.114), leads to 1 ˚ ¯ s ( 1 ¯tcs , ·). ¯tcs + ν ρs − ρf (∇ · v s )¯tcs = Φ (10.177) ρδ ρ ρδ This is an evolution equation for ¯tcs ; by postulating an adequate expression for Φs we may generate an evolution equation of hypoplastic behavior. We choose
1/2 ¯ s )|D s | , ¯ s = f1 (·) L(Z¯s )D s + f2 (·)N (Z |D s | := tr (D 2s ) , Φ (10.178) in which f1 and f2 are the so-called coefficients of barotropy and pyknotropy which may depend on S. More realistic is to restrict this dependence to νs . L is a fourth order tensor. If we additionally require homogeneity of ¯ s with respect to Z ¯ s , then (10.177) with (10.178) and (10.176) can be Φ reduced to the form ˚ ¯tcs +νs ρs − ρf (∇·v s )¯tcs = tr (·) {L(t¯cs )D s + f2 (·)N (¯tcs )|D s |} . (10.179) ρ This evolution equation for the frictional stress deviates from that proposed for hypoplastic behavior by Wu & Kolymbas [90] by the second term on the left-hand side. For a dry granular material (ρf = 0), (10.179) agrees with the proposal by Svendsen & al. [86]. Schneider & Hutter 2009 [83]
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
262
show that an evolution equation for ¯tcs can be obtained which is free of the second term on the left hand side, if the evolution equation required to hold ¯ is not (10.114), but for Z
¯ s, ¯ s= Φ Z
˚ ¯ s − νs ρs − ρf (∇ · v s )Z¯s . ¯ s := Z Z ρ
(10.180)
The introduction of this new evolution equation for Z s does change the thermodynamic analysis, but only the solid equilibrium pressure changes and now takes the form G ¯ ζs → ζsnew = νs (βs − ρf ψIG + ς) − νs (ρs − ρf )(ψI, ¯ s · Z s ). Z
(10.181)
The second, new, term on the right hand side is a frictional pressure. We still need to prescribe the operators L and N; Wu & al. 1996 [90] write ¯ ¯ ¯ s ) = c1 (νs )tr (Z ¯ s )14 + c2 (νs ) Z s ⊗ Z s , L(νs , Z ¯ s) tr (Z 2 D ¯ 2 ¯ ¯ s ) = c3 (νs ) Z s + c4 (νs ) (Z s ) . N (νs , Z ¯ ¯ s) tr (Z s ) tr (Z
(10.182) (10.183)
with material coefficients c1 −c4 which are parameterized for soil. Therefore the laws (10.182)-(10.183) can only serve as a first hint when being applied to lava. In summary we have ¯s ¯ts|E = −ζ¯new 1 + µB s + ρδ Z ∂(ρψIG ) +νs + (ρs − ρf )ψIG + βsG (cs ),Ds |E , (10.184) ∂νs IIIB f ¯tf |E = −(1 − νs )(−ρf ψIG + ς) − 4ρA , (10.185) (C − IIIB f )
¯ ,v s )|E ¯ s|E = βsG (1 − ξ¯s ) − ρf ψfG + ς ∇νs + ρ(ψIG ),Z¯ s (Φ m ∂(ρψIG ) +νs + (ρs − ρf )ψIG + βsG (cs ),vs |E . (10.186) ∂νs ¯ s satisfies the evolution equation (10.180). However, an In these relations Z expression for cs has still not been given from which (cs ),Ds |E and (cs ),vs |E can be computed. This will be done last. Prior to this end, we need nonequilibrium expressions for the stresses and the interaction forces.
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
A Continuum Formulation of Lava Flows
263
10.4.7. Parameterization for the Non-Equilibrium Stresses and Interaction Forces A detailed treatment has been given by Schneider & Hutter 2009 [83] and 2010 [43]. Here, we restrict attention to the simplest possible case and assume ¯ts|N = ¯ts|N (D s , νs , θ), ¯tf |N = ¯tf |N (D f , νf , θ), m ¯ s|N = m ¯ s|N (v f −v s , νs , θ). More specifically, we postulate the quasi-linearizations ¯ts,f |N = κs,f IDs,f 1 + 2µs,f (D s,f ),
¯ s|N = mD (v f − v s ). (10.187) m
D s,f is the deviator of Ds,f ; moreover, κf,s are bulk viscosities and µs,f are shear viscosities, whilst mD is a permeability of a Darcy-type constitutive relation for the interaction force. The phenomenological coefficients in (10.187) are assumed to have the dependencies ˆs,f (θ, νs , ID s,f ), µs,f = µ ˆ s,f (θ, νs , IID s,f , IIID s,f ), κs,f = κ mD = m ˆ D (θ, |v f − vs |, νs ).
(10.188)
Apart from the dependences on temperature and volume fraction, the bulk viscosities only depend on IDs,f , which are measures of compaction or dilatation. Similarly, the shear viscosities primarily depend on IID s,f which are the ‘shearing’ invariants of Ds,f . Dependences on the third invariants IIID s,f are thought to be less significant and may, in a first attempt be omitted. Moreover, if we suppose the fluid to have Newtonian type behavior with vanishing bulk viscosity, then (10.188) reduce to κf ≡ 0,
µf = µ ˆ f (θ, νs ) = µ0f = const.
(10.189)
κs = κ ˆ s (θ, νs , ID s ),
µs = µ ˆs (θ, νs , II
(10.190)
D s
).
To parametrize the viscosities for the solid constituents, we introduce the scaled variable νscale :=
νs − νscrit , νsmax − νs
νscrit < νsmax .
(10.191)
in which νscrit is the solid volume fraction at the critical packing at which the mean distance of the particles does not give rise to solid-solid contact; νsmax , on the other hand, is the solid volume fraction at densest packing. We conjecture that for νs νscrit , the solid bulk viscosity is smallest and probably constant, whilst at densest packing the viscous dynamic stress associated with it, becomes infinite, so that densest packing can never be
March 31, 2011
15:34
264
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
achievedd . According to (10.191) νscale = (νscrit ) = 0 and νscale (νsmax ) = ∞. We choose κs {1 + (νscale )m } νscale > 0, κs = m > 0, (10.192) κs νscale < 0, in which κs may be taken as a constant or as κs = κs (θ, IDs ). A first choice is certainly κs = constant. For the solid shear viscosity we write ηs (IID s ) µs = µs (θ, νs )ˆ
(10.193)
with a monotonic dependence of ηs on IID s . For a power law representation we have n 0
0, p > 1. (10.195) µs = νscale < 0, µs 10.4.8. Parameterization of the Melting/Freezing Rate We are still missing a functional relation for the melting-freezing rate cf = −cs in the constituent mass balance relations. We now show that an explicit relation for cf , say, can be obtained from the energy equation for the mixture as a whole. Let us commence with the balance law of mass for the fluid constituent ∂ ρ¯f + ∇ · (¯ ρf v f ) = ρ¯f cf . (10.196) ∂t Introducing the mass fraction ξ¯f = ρ¯f /ρ, it is straightforward to show that this equation can also be written as ρ
dξ¯f = −∇ · j f + ρ¯f cf , dt
d The
j f := ρξ¯f (v f − v) = ρξ¯f uf = ρξ¯f νs (v f − v s ), (10.197)
corresponding parameterizations in Schneider & Hutter 2009 and Hutter & Schneider 2010 [43] assume κs (νsmax ) = 0, which is not realistic; κs (νsmax ) should rather become very large.
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
265
where v = νs v s + (1 − νs )v f has been used. j f may be interpreted as the fluid mass flux due to melting and, consequently, q l = Lj f = Lξ¯f (v f − v)
(10.198)
is the latent heat flux due to it. It follows, the total energy flux of the mixture, q = q c + q l , is given by the conductive Fourier heat flux plus the latent heat flux due to melting, q = −κ∇θM + Lξ¯f (v f − v) = −κ∇θM + Lξ¯f uf .
(10.199)
The balance law of the internal energy of the mixture as a whole is given by d (10.200) ρ = −∇ · q + tr (ttot D), dt in which q is prescribed in (10.199), ttot is the mixture stress and D the stretching tensor of the barocentric velocity field, ρ¯f 1 ρ¯s ρf } , Ds + D f + {(1 − νs )(v s − vf ) ⊗ ∇ρ¯s − νs (v s − v f ) ⊗ ∇¯ ρ ρ ρ (10.201) so that the second term on the right-hand side of (10.200) is known. In temperate lava the growth of the internal energy can be written as D=
d dξ¯f dθM =L + cv (θM ) . dt dt dt
(10.202)
Here, cv is the specific heat of the mixture at constant volume, cv = νs csv + (1−νs)cfv , evaluated at the melting temperature and the second term on the right-hand side only contributes, if the melting temperature changes in time and space due to a corresponding pressure variation. [A term −p(1/ρ)· has been ignored]. Substituting (10.202) into (10.200) and then using (10.197) and (10.199) yields dξ¯f d dθM = ρL + ρcv (θM ) ρ dt dt dt dθM (10.197) = −L∇ · j f + Lρ¯f cf + ρcv (θM ) dt = −∇ · q + tr (ttot D) (10.199)
=
∇ · (κ∇θM ) − L∇ · j f + tr (ttot D).
Comparing the second and fourth line yields the desired result, 1 dθM ρ¯f cf = tr (ttot D) + ∇ · (κ∇θM ) − ρcv (θM ) . L dt
(10.203)
(10.204)
March 31, 2011
15:34
266
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
This equation can be interpreted as follows: The energy available for melting, ρ¯f cf L, is composed of three terms, (i) the heat generated by stress power, (ii) the heat conducted to the point under consideration and (iii) the heat becoming consumed by the change of the melting temperature, the latter being negative for positive dθM /dt. The result is both interesting and disturbing, which we shall now explain. (i) We assume L > 0 and know vaguely that θM does not vary much. So we may assume that in (10.204) the two terms involving ∇θM and θ˙M are small. This means ρ¯f cf ≈ tr (ttot D)/L. (ii) It is clear that in thermodynamic equilibrium D = 0 and hence cf = 0. However, in a nonequilibrium process it is likely that tr (ttot D) > 0, so, since L > 0, cf > 0. (iii) Therefore, it is at least very likely that tr (ttot D)/L is positive. So, under this assumption only melting occurs. This is a bad result, because it says that, if a temperate lava region exists, it will eventually become hot lava and turn into a single constituent fluid. This means that lava in a cooling process does most likely not develop a temperate region. Only hot and cold regions probably exist which are separated by a singular surface, the phase change surface from hot to cold. However, if we have a heap of cold lava that is subject to hot fluid lava, then cf < 0 may well exist and then temperate lava may survive. For the present situation we cannot prove that the right-hand side of (10.204) is really positive. The only statement we are able to make is that this conclusion is likely. A proof would have to come from an explicit numerical computation. 10.5. Kinematic and Dynamic Boundary Conditions 10.5.1. General Jump Conditions Fig. 10.3 shows that the entire mass and the three regions of distinct thermodynamic states are separated from one another by many boundaries and interface surfaces. For each of these surfaces, kinematic and dynamic boundary conditions must be formulated. ¯αγ , ζ¯αγ , φ¯γα be the partial density of the physical quantity γα , its Let γ¯α , π production, supply and flux rates, see Section 2. Let, moreover, w be a velocity of the surface point on, and nσ the unit normal vector to, the surface σ. These variables satisfy the jump condition ¯ γα nσ = ℘¯γα , ¯ γα (v α − w) · nσ − φ α α
(10.205)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
A Continuum Formulation of Lava Flows
267
z
- +- ++ -
Cold temperate transition surface, Fc(x,t)=0
fluid ++ - +
-+-+
- ++ --+
ns v HT
S
n
w
Free surface, Fs(x,t)=0
+ solid + fluid +- + C - -+ TS
Bottom surface, Fb(x,t)=0
solid
Hot temperate transition surface, Fh(x,t)=0
x-y
Fig. 10.3. Lava mass, sketched as a gravity flow down the mountain side of a volcano. This mass is divided into three regions of hot fluid, temperate fluid-solid mixture and cold solid lava. The moving masses are separated from one another by boundaries and/or internal interfaces as indicated.
in which ℘γα is the surface production rate density of γ. Summation of (10.205) over all constituents yields the jump condition for the mixture as a whole n
¯ γ nσ = ℘, ¯ γα (v α − w) · nσ − φ α
α=1
where
℘ :=
N
℘¯γα .
(10.206)
α=1
The corresponding jump conditions of mass, linear- and angular momentum, energy and entropy for each component are ¯ ρα (v α − w) · nσ = µ ¯α ; ρα v α ((v α − w) · nσ ) + ¯tα nσ = τ¯ α ; (¯ ρα v α ⊗ (v α − w))nσ + ¯tα nσ = ¯ ¯ α ) · nσ = ζ¯α ; ¯ ρα 12 v 2α + α (vα − w) + (v α¯tα − q ¯ · nσ = −¯ pσα ; ¯ ρα η(v α − w) − φ α (10.207)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
268
and for the mixture as a whole ρ(v − w)nσ = µ; (ρv ⊗ (v − w))nσ + tnσ = ρv((v − w) · nσ ) + tnσ = τ ; ρ 12 v 2 + (v − w) + (vt − q) · nσ = ζ; ρη(v − w) − φ · nσ = −pσ ≤ 0. (10.208) in which µ=
N
α=1
µ ¯ α = 0,
τ =
N
α=1
τ¯ α ,
ζ=
N
¯
α=1 ζα .
It is presently not physically transparent whether all these mixture surface quantities can a priori be set to zero. The different surfaces which arise in our description of lava flow are (see Fig. 10.3): (i) The free surface which is here treated as material. This surface can be decomposed into 3 sub-surfaces, one for each transition between the hot-, temperate- and cold lava, respectively, and the atmosphere (A). These surfaces are mathematically defined by F sHA (x, t) = 0, F sT A (x, t) = 0, F sCA (x, t) = 0. (ii) The two transition surfaces (HTS and CTS) F HT S (x, t) = 0, F CT S (x, t) = 0. (iii) The bottom surface which can also be decomposed into 3 sub-surfaces. F bHB (x, t) = 0, F bT B (x, t) = 0, F bCB (x, t) = 0. 10.5.2. Kinematic and Dynamic Boundary Conditions at the Free Surface Kinematic surface equation: The subsequent calculations require expressions for the exterior surface normal vector ns . If x, y, z are Cartesian coordinates and F s = z − hs (x, y, t) = 0, where hs is the height of the free surface above the (x, y)-plane, then, since the free surface is material, ∂F s + ∇F s v s · ns = 0. ∂t With the material velocity as the convected velocity, this leads directly to ∂hs ∂hs ∂hs dhs (x, y, t) dz − = u+ v−w+ =0 at F s = 0. dt dt ∂x ∂y ∂t (10.209) The dynamic boundary conditions for this surface must also be satisfied. These conditions follow from the jump conditions of the physical balance laws.
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
A Continuum Formulation of Lava Flows
269
Since the free surface, F s (x, t) = 0, is material for which v · n = w · n, there is no mass flow across it and the jump condition for the density is trivially satisfied: ρ(w − v) · ns = 0. For the momentum jump we now have ρv(w − v) · ns +tns = 0 () * '
⇒
tns = −pns + tE ns = 0.
=0
(10.210) The statement on the right says that the traction (stress vector) is continuous across the material singular surface. It is not difficult to show that patm ≈ 105 P a and τwind < 10 P a. Both are considerably smaller than a typical stress paval ≈ 109 P a. The traction at the atmosphere side is therefore negligibly small; equation (10.210) can be written as 0 = −paval · ns + tE ns
tns = 0.
or
(10.211)
For the energy jump condition at the free surface we have vt · ns − q · ns = 0 +
+
s
−
−
s
−
s
+
(10.212)
s
t n* − v t n* +q · n − q · n = 0. 'v ·() ' ·() ' () *
(tE ns )atm ≈0
=0 see (10.211)
:=Q⊥ atm
With this definition a positive value of Q⊥ atm corresponds to a flow of energy into the atmosphere. So, s rad q s · ns = Q⊥ atm = Qatm + Qatm + Qatm ,
(10.213)
where Qatm , Qsatm and Qrad atm are the latent, sensible and radiative heat flows, respectively. The latent heat flow is the heat transferred to the atmosphere when phase transitions occur at the free surface; that is, when hot lava solidifies, it releases an amount of energy to the atmosphere surface. The latent heat flux can be parametrized as Qatm = f (Θsurf aval , v∗ ),
freeze surf Θsurf aval := (θaval − θatm )
(10.214)
The wind velocity function f is usually taken to be linear or quadratic in v∗ . This contribution is likely small and can be ignored in a first approximation. The sensible heat flow at the free surface describes the energy transferred from the avalanche to the air by convection or conduction. Analogously to the latent heat flux, it is parametrized by surf surf , v∗ )(θaval − θaval ). Qsatm = f˜(θaval
(10.215)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
270
f˜(·) is a non-linear heat transfer coefficient. The last term arising in Q⊥ atm is the radiative heat flow, Qrad atm . This term describes the thermal (electromagnetic) radiation which is emitted from the surface of an object which is due to its temperature: Q
4 4 − Tatm ) = σα(T0 + θaval )4 − σα(T0 + θatm )4 σα(Tsurf
= Taylor
=
surf surf σα(T04 + 4T03 θaval + ...) − σα(T04 + 4T03θatm + ...).
Therefore, surf surf Qrad atm ≈ catm (θaval − θatm ),
catm = 4σαT03 .
(10.216)
T is the temperature given in degrees Kelvin, T0 ≈ 300K, α is the emissivity of the grey body and σ the Stephan-Boltzmann constant. In temperate lava, the emissivity is a combination of the two, likely with weighting factors depending on the solid volume fractions αtemp = (1 − νs )αf + νs αs . 10.5.3. Kinematic and Dynamic Boundary Conditions at the HTS Let F = 0 be the equation for the HTSe . As a non-material surface, it has the evolution equation ± ∇F M dF = ∇F (v − w)± · = ∇F a± ⊥ = ∇F ± , (10.217) dt ∇F ρ in which ± ± M := ρ± a± ⊥ = ρ (v − w) · n,
(10.218)
We define the surface of phase change as a special singular surface at which the temperature and tangential velocity are continuous. Thus θ = 0
and
v || := v − (v · n)n = 0,
(10.219)
Straightforward computations, in which (10.219) and (10.208)3 are com¨ hnk [41]) that bined, show (see e.g. Hutter & Jo n · T Esh n = 0 with T e We
Esh
:=
θη − −
1 2 (v
or
µ = 0.
± 1 − w) · (v − w) 1 + t ρ
(10.220)
(10.221)
assume here for simplicity that φ = q/θ. The more general case is treated elsewhere.
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
µ := − θη − 12 (v − w) · (v − w) +
p⊥ ρ
±
,
Chapter10
271
p⊥ = −n · tn
(10.222)
must hold across any phase change surface. T Esh is called Eshelby tensor and µ surface chemical potential. In words, the normal component of the Eshelbi stress traction or the non-equilibrium chemical potential is continuous across the phase change surface. For general dynamical conditions further inferences are difficult to derive but when quadratic contributions in the velocity can be ignored, (10.208)2,3 and (10.220) reduce to θ = 0, v || = 0, ⊥
(and M = 0)
(10.223)
p = 0, µ = 0, in which p⊥ = −nσ · tnσ = 0, which is the continuity requirement of the normal component of the total stress. Notice that (10.223) holds for both thermodynamic equilibrium and non-equilibrium conditions, for a ¨ hnk [41]. derivation, see Hutter & Jo Finally, recall that the latent heat at a phase boundary is given by ⊥ 1 L|E := θη = |E + p|E 1/ρ. (10.224) L := θη = + p ρ Subsequently we shall identify the dynamic L with the equilibrium latent heat of fusion: L = L|E . As an application, consider a body consisting of a mixture of two compressible viscous heat-conducting bodies. We assume that on one side of the surface of phase change, fluid lava and solid lava will coexist while on the other side, only the fluid lava component will be present (see Fig. 10.4a). For the mixture as a whole, we have in particular q · nσ = −θηρ(v − w) · nσ where η = ηs + ηf . Only ηs will suffer a jump across the HTS. Denoting the mass fraction of the solid lava in the fluid-solid-lava mixture by ξ¯s we have, ξ¯s =
ρ¯s , ρ¯s + ρ¯f
η + = (1 − ξ¯s+ )ˆ ηf + ξ¯s+ ηˆs ,
η − = ηf .
(10.225)
Here, η + and η− are the specific entropies per unit mixture volume on the (+)- and (–)-sides of the singular surface, and ηs , ηf are the specific entropies of the pure substances ‘solid lava’ and ‘fluid lava’, respectively.
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
272 a)
+ HTS
Solid + fluid lava
Fluid lava only
b)
Solid lava only
+ CTS
Solid + fluid lava
Fig. 10.4. Phase change surface, separating a solid-fluid mixture lava from, (a) fluid lava, and (b) solid lava.
Thus, (10.225) implies, since ξ¯s− = 0, θη = ξ¯s θ(ˆ ηs − ηˆf ) = −ξ¯s L. Consequently, because of reversibility from (10.208)4, we obtain q · n = −Lξ¯s ρ(v − w)nσ = −Lξ¯s M.
(10.226)
The same equation can be shown to hold also for the CTS (see Fig. 10.4b). In terms of the total mass density and the barycentric ve locity defined as ρ = α ρα , ρv = α ρα v α the solid-constituent mass balance can be written as ρξ¯˙s = −∇ · j s + ρξ¯s cs ,
j s · nσ + ρξ¯s (v − w) · nσ = 0, j s · nσ + ξ¯s M = 0.
(10.227) (10.228)
For j s = 0, (10.226) and (10.227) can be related by q · nσ = Lj s · nσ
(10.228)
=
−Lξ¯s M = −Lξ¯s ρ± a± ⊥.
(10.229)
This relation states that the jump in heat flow across the phase boundary is given by the latent heat times the solid mass flow across the surface. In a computational solution scheme equation (10.229) serves as an equation for the determination of a⊥ or M. This melting or freezing rate is needed in the evolution equation for the HTS. Indeed, with F HT S = z − hHT S (x, y, t), the kinematic equation takes the form + 2 HT S 2 ∂hHT S ∂hHT S ∂h ∂hHT S ∂hHT S − + + 1. + u+ v−w = a⊥ ∂t ∂x ∂y ∂x ∂y (10.230) This sign choice makes a⊥ positive for freezing, i.e. for a⊥ > 0 the hot lava flow direction is from the hot to the temperate region. An equation analogous to (10.229) also hold for hCT S .
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
A Continuum Formulation of Lava Flows
273
Lava (hot, temp or cold) Mb > 0
Mb < 0
zb-z=0 nb
+
soil or rock Fig. 10.5. Moving basal surface separating the soil or rock bed from hot, temperate or cold lava.
10.5.4. Kinematic and Dynamic Boundary Conditions at the Bottom For the basal boundary conditions, we want to take into account erosion and deposition processes. With F b (x, t) ≡ z b (x, y, t) − z = 0, the kinematic equation can then be written as, see Fig. 10.5, + b 2 b 2 b ∂z ∂z ∂z b ∂z b − ∂z b − M − + u + v −w = 1+ + (10.231) ∂t ∂x ∂y ∂x ∂y ρ¯b with (¯ ρf,s (v f,s − w) · nb )− = Mbf,s , (ρ(v − w) · nb )− = Mbf + Mbs = Mb .
(10.232)
We note that Mf > 0 (Ms > 0) means deposition of fluid (solid) lava to the ground and Mf < 0 (Ms < 0) means erosion of fluid (solid) lava from the ground. Let us now look in detail how the jump conditions for the basal surface look like: The linear momentum jump conditions at the base for the fluidand solid lava and for the mixture as a whole are v f,s Mbf,s + ¯tf,s nb = τ¯ f,s ,
vMb + tnb = 0,
(10.233)
where we used the fact that the sum of the constituent interface momentumexchange terms τ f,s is zero, i.e. τ¯ s + τ¯ f = 0, ensuring momentum balance in the mixture as a whole at the interface. The formulae (10.233) require basal stress and mass flow parameterizations to close the system. A first approach is to postulate the no-slip condition, requiring that the jumps of the tangential velocities vanish, (1 − nb ⊗ nb )v f,s base = 0,
(10.234)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
K. Hutter and O. Baillifard
274
z
z
a) Fig. 10.6.
v
b)
v
Possible flow velocity distribution.
see Fig. 10.6a. For τ¯ s = 0, (10.233) then implies that the basal shear traction is continuous across the (moving) basal surface. This tangential − shear traction (1 − nb ⊗ nb )(¯tf,s nb ) then follows as part of the solution of the boundary value problem for the lava flow. We conjecture that the velocity distribution is more like that in Fig. 10.6b, namely a more or less uniform flow over most depth of the avalanching lava with a very strong shearing layer and postulate a frictional sliding law. Defining at the base ||
||
b b ¯ b ¯ ,v (T f,s ¯ f,s ) := (1−n ⊗n )(tf,s n , v f,s ),
b ¯ b p¯⊥ f,s := −n ·tf,s n , (10.235)
a frictional sliding law is a functional relation ¯ || := Tf,s (v || f,s , p⊥ , θ, . . .). T f,s Specific parameterizations, which are popular in geodynamics are ¯ f,s = −ρ¯f,s cf,s (| [v [ f,s]] |, p⊥ , θ)v || f,s , T
cf,s (·) = cf,s |v f,s | (10.236)
with dimensionless drag coefficients cf,s of order 10−3. At high solid concentration a Mohr-Coulomb contribution can be added on the right hand side of (10.236). The above formulae apply for hot, temperate and cold || || lava (with the assumption v f = vs at the base.) There remains the parameterization of the quantities τ s , Mbf and Mbs . As a first guess we set τ s = 0. Mbs and Mbf represent the solid and fluid mass flows through the surface. Deposition and entrainment of fluid and solid constituents occur according to whether > 0 → deposition, b (10.237) (v − − w) · n f,s < 0 → erosion.
Chapter10
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
275
So, if H is the Heaviside function, we may write dep − b b eros Mf,s = H((v − f,s − w) · n ) Mf,s − H(−(v f,s − w) · n ) Mf,s . (10.238) ' ' () * () *
=
1 for deposition 0 for erosion
=
0 for deposition 1 for erosion
dep/eros
For the fluid we conjecture Mf = 0. It means physically that the basal surface of temperate lava is material with respect to the fluid. To dep/eros we assume that erosion is dormant as long as the parameterize Ms solid second stress deviator invariant evaluated at the base is below a prescribed threshold value. Basal shear is certainly a key factor in determining whether material ,can be torn off from - the ground surface. This is the reason why J2 := 12 tr (ts − 13 (tr ts )1)2 |base is the decisive variable. Let J2thres be a positive constant (with the dimension of (stress)2 ). Then we assume − b that deposition occurs when J2thres − J2 > 0 and (v − s − v f ) · n > 0. The deposition rate of the solid in temperate lava is postulated to be − − b dep b = H(J2thres −J2 )H((v − ρ¯s ((v − Mdep s s −v f )·n ) α s − v f ) · n ) (10.239) where H is the Heaviside function and αdep a parameter characterizing the free fall of the particles in the fluid. Similarly, if (J2thres − J2 ) < 0, we conjecture = −H(J2 − J2thres )cs |J2 − J2thres |, |v||s |, . . . Meros s = −H(J2 − J2thres )αeros |J2 − J2thres |n |v ||s |m ,
(10.240)
where αeros , n, m are constant parameters. Exchange of heat between the lava avalanche and the ground is formulated by use of the energy jump condition + 21 v·vM+q·nb −v·tnb = 0, in which M > 0 is a net deposition of mass. If we let v + base = 0, q + · nb = −Qgeoth then, ⊥ q − · nb = −v− · (tnb ) − Qgeoth + ( − 12 (v − · v − ))M. ⊥
(10.241)
For hot lava M = Mf = 0 was assumed. Thus, (10.241) implies −q − · nb + v − · (tnb ) − Qgeoth = 0. ' () * ' () * ' ⊥() * (i)
(ii)
(iii)
(10.242)
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
Chapter10
K. Hutter and O. Baillifard
276
For cold lava (10.241) takes the form −q− · nb + v− · (tnb ) − Qgeoth + − 12 (v − · v − ) M = 0. ⊥ ' () * ' () * ' () * ' () * (i)
(ii)
(iii)
(10.243)
(iv)
In the above the various contributions account for (i) conductive transfer of heat, (ii) frictional heat due to basal sliding, (iii) geothermal heat provided by the rock formation of the volcano, and (iv) added mass-heat transfer due to the deposited/eroded mass; for this case ≈ 0. For temperate lava, the term corresponding to (iv) in (10.243) is split up in two contributions dep melt − 12 (v − · v − ) eros Mbs + − 12 (v − · v − ) f reeze Mmelt (10.244) f reeze ' () * ' () * (iv)1
(iv)2
(iv)1 is the heat transfer contribution due to deposition/erosion of solid material, and the energy jump, dep eros ≈ 0, may be ignored. The second part, (iv)2 , is the corresponding contribution due to phase changes. Since s + (1 − νs )ˆ f we have + = ˆs , − = νs ˆ ⊥ 1 melt f reeze = −(1 − νs )(f − s ) = −(1 − νs ) L − p ≈ −νs L. (10.245) ρ ' () * negl.
It follows that the energy jump condition at the basal boundary with deposition/erosion plus melting/freezing taking place is given by −q − · nb + v− · (tnb ) −Qgeoth ⊥
−
1 − 2 (v
(10.246) melt 1 − − · v )Ms − νs L + 2 (v · v ) Mf reeze = 0, (10.247) −
in which νs follows from the solid mass balance in the form j s · nb + νs Ms + Mmelt f reeze = 0, with
− − j s = −ρs (v − s − v ) = −ρs us
(10.248) (10.249)
see (10.228). The only additional unknown in equations (10.246) and melt (10.248) is Mmelt f reeze . So, these equations serve to determine Mf reeze . This concludes the derivation of the boundary conditions at the basal surface.
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
277
10.6. Conclusion, Outlook 10.6.1. Summary Lava is a multi-phase and chemically heterogeneous system. This entails a characteristic, non-Newtonian behavior, which is emphasized by the fact that the rheological parameters are strongly temperature dependent and therefore affected by progressive cooling of lava after effusion. From the time when effusion starts, a complex thermal interaction begins with the environment, producing the gradual cooling of lava. Thermal exchange processes include conduction toward the ground and the atmosphere, radiation into the atmosphere and convection in the atmosphere above the flow. Heat is produced in the flow as a result of viscous dissipation and latent heat of crystallization. Thermal and rheological boundary layers are formed progressively and thermically insulate an inner core of fluid lava, slowing down the process. In order to correctly model the process, the flow was divided into three regions: (i) a hot fluid region, (ii) a temperate region composed of a mixture of fluid and solid lava and (iii) cold solid lava region. Hot lava was treated as a non-Newtonian, thermomechanically coupled density preserving fluid; a viscoelastic-plastic heat conducting model was used to describe cold lava. To include the plastic response of cold lava, the hypoplastic class of material concepts, appropriate for the description of frictional behavior, was used and extended to a mixture of different constituents. The derivation of the sub-models of hot, temperate and cold lava showed that these models are ‘equipped’ each with a number of phenomenological coefficients. Some of these have been found in the literature; however, numerical values for others must still be determined. This by itself will be an important and time consuming procedure for which numerical computations, perhaps paired with inverse procedures will most likely be involved. This work is built on earlier known theoretical models of deformable continuous frictional bodies and is in large parts based upon specialized application of these concepts. It uses the thermodynamically consistent formulation of Schneider & Hutter 2009 [83] and Hutter & Schneider [43] [44]. Novel is the incorporation of phase change processes occurring as melting and solidification, which in earlier debris flow models have played no role. New, at least in emphasis, is also the incorporation and careful
March 31, 2011
15:34
278
World Scientific Review Volume - 9in x 6in
K. Hutter and O. Baillifard
consideration of the thermal sub-processes which describe the cooling as an exchange phenomenon of heat between the lava mass and the atmosphere and the ground. All these required a setting of the model within a complete thermodynamic exploitation of the second law of thermodynamics. In this regard, the derivation of the solid mass production rate density due to solidification via the mixture energy equation demonstrated a particularly nice but also disturbing aspect of the model. The beauty is its determination from a fundamental law of physics, the concern with the deduced result is that the temperate region of the moving lava may likely not form in a pure solidification scenario. A strict proof of this statement could, however, not be given. It should also be mentioned that the early assumption that lava is treated as a pure substance is a very drastic limitation of the physical capability of the model. It eliminates the multi-crystal composition of lava and makes descriptions of segregation of the crystal composition and the development of induced anisotropies of the polycrystalline material impossible. 10.6.2. Outlook It transpires from the above that the content of this thesis presents a possible model of which the validation must still be performed. This step would entail the completion and preparation of the model for numerical implementation by presenting numerical values for all phenomenological coefficients. This involves not only the parametrization of the coefficients of the continuum models of hot, temperate and cold lava, but also concerns the entrainment-deposition parametrization and the heat exchange processes with the atmosphere and the ground. A major step in achieving many of these important informations is the development of computational software for lava flow in form of sub-models, and eventually the entire complex model. The requirements are dictated by the fact that HTS and CTS represent singular surfaces which may not exist at the beginning of the process but may be formed in time. So, shock capturing schemes must be used in the numerical implementation. Moreover, because the flow goes through large displacements, large shearing and tremendous distortion of a selected initial grid, the flexibility of an Eulerian and Lagrangean formulation will be of advantage.
Chapter10
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
279
Acknowledgements The authors thank Profs. Ioana Luca, Yih-Chin Tai and an anonymous referee for constructive criticism of an earlier version of this manuscript. References [1] O. Baillifard. A Continuum Formulation of Lava Flows From Fluid Ejection to Solid Deposition. Master Thesis ETH Zurich, Switzerland, 2010. [2] O. Baillifard and K. Hutter. A continuum formulation of dense granular pyroclastic flows term paper for ethz d-phys. 2009. [3] D. Barca, G. Crisci, S. DiGregorio, S. Marabini, and F. Nicoletta. Nuovo modello cellulare per flussi lavici: Colate di Pantelleria. Bollettino del Gruppo Nazionale per la Vulkanologia: Roma Italy, 1988. [4] D. Barca, G. Crisci, S. DiGregorio, and F. Nicoletta. Lava flow simulation by cellular automata: Pantelleria’s example. International Applied Modeling and Simulation: Cairo, Egypt, 1987. [5] G. Blatter. Thermodynamik: Skript zur Vorlesung. ETHZ, 2007. [6] J. Burguete, P. Garcia-Navarro, and R. Aliod. Numerical simulation of runoff from extreme rainfall events in a mountain water catchment. Natural Hazards and Earth System Sciences, 2002. [7] M. Cannataro, S. D. Gregorio, S. Rongo, R. Spataro, W. Spezzano, and D. Talia. A parallel cellular automata environment on multicomputers for modeling and simulation. Parallel Computing, 1995. [8] C. Carath´eodory. Untersuchungen u ¨ber die Grundlagen der Thermodynamik. Math. Ann. 69, 1909. [9] H. Carslaw and J. Jaeger. Conduction of heat solids. Cladenron Press, Oxford, 1947. [10] L. Collier, J. Neuberg, N. Lensky, V. Lyakhovsky, and O. Navon. Attenuation in gas-charged magma. J. Volcanology and Geoth. Res., 2006. [11] A. Costa and G. Macedonio. Viscous heating in fluids with temperaturedependent viscosity: Implications for magma flows. Nonlinear Processes in Geophysics, 2003. [12] A. Costa and G. Macedonio. Numerical simulation of lava flows based on depth-averaged equations. Geophysical Research Letter, 2005. [13] G. Crisci, S. DiGregorio, O. Pindaro, and G. Ranieri. Lava flow simulation by a discrete cellular model: First implementation. International Journal of Modeling and Simulation, 1986. [14] R. Davis and G. Mullenger. A rate-type constitutive model for soil with critical state. Anal. Geomech. 2, 255-282, 1978. [15] D. Dingwell. Recent experimental progress in the physical description of silicic magma relevant to explosive volcanism. Geol. Soc. London, 1994. [16] G. Dissertori. Physik 1 & 2: Skript zur Vorlesung. ETHZ, 2005. [17] F. Dobran and G. Macedonia. Modeling of volcanological processes and simulation of volcanic eruptions. Gruppo Nazionale per la Vulkanologia (GNV), Italy, 1990.
March 31, 2011
15:34
280
World Scientific Review Volume - 9in x 6in
K. Hutter and O. Baillifard
[18] M. Dragoni. A dynamical model of lava flows cooling by radiation. Bull. Volcanol., 1989. [19] T. Erismann and G. Abele. Dynamics of Rockslides and Rockfalls. Springer, 2001. ´ esit¨ [20] G. Farkas. A Fourier-f´ele mechanikai elv alkamaz´ ai. Term´eszet. Ert´ o 12, 457-472, 1894. [21] E. Fujita, M. Hidaka, S. Umino, and A. Goto. Lava flow simulation code: Lavasim and by VTFS-project. Transactions, American Geophys. Union, 2004. [22] G. M. Graf. Kontinuumsmechanik: Skript zur Vorlesung. ETHZ, 2009. [23] S. D. Gregorio and R. Serra. An empirical method for modelling and simulating some complex macroscopic phenomena by cellular automata. Future Generation Computer Systems, 1999. [24] R. Griffiths. The dynamics of lava flows. Annual review of fluid mechanics, 2000. [25] G. Gudehus. A comparison of some constitutive laws for soil under symmetric loading and unloading. Proceedings of the 3rd International Conference on Numerical Methods in Geomechanics, pp 1309-1323, 1979. [26] M. Gurtin. Thermodynamics of evolving phase boundaries in the plane. Oxford University Press, 1993. [27] W. Haeberli, C. Huggel, A. K¨ aa ¨b, S. Oswald, A. Polkvoj, I. Zotikov, and N. Osokin. The Kolka-Karmadon rock/ice slide of 20 September 2002 – an extraordinary event of historical dimensions in North Ossetia (Russian Caucasus). Journal of Glaciology 50, 2004. [28] A. Harris and S. Rowland. FLOWGO: A kinematic thermo-rheological model for lava flowing in a channel. Bulletin of Volcanology, 2001. [29] R. Hauser and N. Kirchner. A historical note on the entropy principle of M¨ uller and Liu. Continuum Mech. Thermodyn. 14, 223-226, 2002. [30] H. Hebert, F. Hemberger, and J. Fricke. Thermophysical properties of a volcanic rock material. High temperatures – Hight pressures, 2002. [31] P. Heinrich, A. Piatanesi, and H. H´ebert. Numerical modelling of tsunami generation and propagation from submarine slumps: The Papua New Guinea event. Geophysical Journal International, 2001. [32] H. Herrmann. Introduction to Computational Physics: lecture notes. ETHZ, 2010. [33] P. Hess. Polymerization model for silicate melts. Princeton University Press, 1980. [34] M. Hidaka and H. Ujita. Verification for flow analysis capability in the model of three-dimensional natural convection with simultaneous spreading, melting and solidification for debris coolability analysis module in the severe accident analysis code ’SAMPSON’ (I). Journal of Nuclear Science and Technology, 2001. [35] M. Hidaka and H. Ujita. Verification for flow analysis capability in the model of three-dimensional natural convection with simultaneous spreading, melting and solidification for debris coolability analysis module in the severe accident analysis code ’SAMPSON’ (II). Journal of Nuclear Science and Technology, 2002.
Chapter10
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
281
[36] G. Hulme. The interpretation of lava flow morphology. Geophys. J. Roy. Astr. Soc., 1974. [37] S. Hurwitz and O. Navon. Bubble nucleation in rhyolitic melts: Experiments at high pressure, temperature, and water content. Earth Planet Sci. Letter, 1994. [38] K. Hutter. The foundations of thermodynamics, its basic postulates and implications. A review of modern thermodynamics. Acta Mechanica, 1977. [39] K. Hutter. Theoretical glaciology; Material Science of Ice and the Mechanics of Glaciers and Ice Sheets. D. Reidel Publishing Company, 1983. [40] K. Hutter. Fluid und Thermodynamik. Springer, 1995. [41] K. Hutter and K. J¨ ohnk. Continuum Methods of Physical Modeling. Springer, 2004. [42] K. Hutter and K. Rajagopal. On flows of granular materials. Cont. Mech. and thermodyn., 1994. [43] K. Hutter and L. Schneider. Important Aspect in the Formulation of SolidFluid Debris-Flow Models. Part 1: Thermodynamic Implications. Cont. Mech. and Thermodyn., 2010. [44] K. Hutter and L. Schneider. Important Aspect in the Formulation of SolidFluid Debris-Flow Models. Part 2: Constitutive Modelling. Cont. Mech. and Thermodyn., 2010. [45] K. Hutter and L. Schneider. Important Aspects in the formulation of solidfluid debris-flow models. Part I: thermodynamic implications. Cont. Mech. and Thermodynamics, 2010. [46] K. Ishihara, M. Iguchi, and K. Kamo. Numerical simulation of lava flows on some volcanoes in Japan, in Fink, J. ed., Lava flows and domes. Springer, 1989. [47] J. Jaeger and N. Cook. Fundamentals of rock mechanics 3rd ed. Chapman and Hall, 1979. [48] C. Kittel. Physique de l’´etat solide. Dunod, 2007. [49] D. Kolymbas. An outline of hypoplasticity. Ing. Arch. 61, 1991. [50] D. Kolymbas. Introduction to Hypoplasticity. Modern Approach to plasticity, 1993. [51] L. Landau and E. Lifshitz. Course of Theoretical Physics vol. 5: Statistical Physics. Butterworth-Heinemann, 1980. [52] L. Landau and E. Lifshitz. Course of Theoretical Physics vol. 6: Fluid Mechanics. Butterworth-Heinemann, 1980. [53] L. Landau and E. Lifshitz. Course of Theoretical Physics vol. 7: Theory of Elasticity. Butterworth-Heinemann, 1980. [54] I. Liu. Method of Lagrange multipliers for exploitation of the entropy principle. Arch. Rat. Mech. Anal. 94, 291-305, 1972. [55] I.-S. Liu. On entropy flux-heat flux relation in thermodynamics with Lagrange multipliers. Cont. Mech. Thermodyn. 8, 1996. [56] I.-S. Liu. Continuum Mechanics. Springer, 2002. [57] K. Lucas. Thermodynamik: die Grundgesetze der Energie- und Stoffumwandlung. Springer, 2007.
March 31, 2011
15:34
282
World Scientific Review Volume - 9in x 6in
K. Hutter and O. Baillifard
[58] G. Macedonio, M. Pareshi, and R. Santacroce. A simple model for lava hazard assessment: Mount Etna. International Association of Volcanology and Chemistry of the Earth’s Interior, 1990. [59] B. Marsh. Magmatic Process. Review in Geophysics, 1987. [60] A. McBirney and T. Murase. Rheological properties of magmas. Ann. Rev. Earth Planet, 1984. [61] G. McDonald. Volcanoes. Prentice-Hall, 1972. [62] C. D. Meyer. Matrix Analysis and Applied Linear Algebra. SIAM, 2004. [63] H. Minkowski. Geometrie der Zahlen. Teubner, 1896. [64] H. Miyamoto and S. Sasaki. Simulating lava flows by an improved cellular automata method. Computer and Geoscience, 1997. [65] I. M¨ uller. Entropy, Absolute Temperature, and Coldness in Thermodynamics: Boundary Conditions in Porous Materials. Springer, 1971. [66] I. M¨ uller. Thermodynamics. Pitman, 1985. [67] J. E. Mungall, N. S. Bagdassarov, C. Romano, and D. B. Dinwell. Numerical modelling of stress generation and microfracturing of vesicle walls in glassy rocks. J. Volcanology and Geoth. Res., 1996. [68] S. Murrell and S. Chakravarty. Some new rheological experiments in igneous rocks at temperatures up to 1200◦ C. Roy. Astr. Soc. Geophys. Journal, 1973. [69] W. Noll. A mathematical theory of the mechanical behavior of continuous media. Arch. Rational Mech. Anal. 2, 1958. [70] G. Norton and H. Pinterkon. The physical properties of carbonite lavas: implications for planetary volcanism. Lunar and Planetary Science, 1992. [71] G. Norton, H. Pinterkon, and J. Dawson. New measurements of the physiochemical properties of natrocarbonite lavas. Lunar and Planetary Science, 1990. [72] S. Park and J. Iversen. Dynamic of lava flow: thickness growth characteristic of steady two dimensional flow. Geophys. Res. Lett., 1984. [73] H. Pinkerton. Rheological properties of geological material. Pergamon Press, 1993. [74] H. Pinkerton, M. James, and A. Jones. Surface temperature measurements of active lava flows on Kilauea volcano, Hawaii. Journal of Volcanology and Geothermal Research 113, 2002. [75] S. Pudasaini and K. Hutter. Avalanche Dynamics. Springer, 2004. [76] S. P. Pudasaini and K. Hutter. Avalanche Dynamics. Springer, 2004. [77] R. Rivlin and D. Saunders. Experiments on the deformation of rubber. Phil. Trans. Roy. Soc. 243, 1951. [78] R. Rivlin and D. Saunders. Large elastic deformations of isotropic materials. Phil. Trans. Roy. Soc.A 243, 1951. [79] A. Robert. Non-standard Analysis. John Wiley Sons, 1988. [80] G. Robson. Thickness of Etnean lavas. Nature 216, 1967. [81] C. Romano, J. E. Mungall, T. Sharp, and D. Dingwell. Tensile strength of hydrous vesicular glasses: An experimental study. American mineralogist, 1996. [82] M. Romano. A continuum theory for granular media with a critical state. Arch. Mech. 20, 1011-1028, 1974.
Chapter10
March 31, 2011
15:34
World Scientific Review Volume - 9in x 6in
A Continuum Formulation of Lava Flows
Chapter10
283
[83] L. Schneider and K. Hutter. Solid-Fluid Mixtures of Frictional Materials in Geophysical and Geotechnical Context. Springer, 2009. [84] H. Shaw. Rheology of lava in the melting range. Journal of Petrology, 1969. [85] A. Stutz. Comportement elasto-plastique des milieux granulariens. Foundation of plasticity, pp 33-49, 1972. [86] B. Svendsen, K. Hutter, and L. Laloui. Constitutive models for granular materials including quasi-static frictional behaviour : Toward a thermodynamic theory of plasticity. Continuum Mech.Thermodyn., 4:263–275, 1999. [87] A. Teufel. Simple flow configurations in hypoplastic abrasive materials. Master’s Thesis Institute f¨ ur Mechanik Technische Universit¨ at Darmstadt, Germany., 2001. [88] P. A. Tipler and G. Mosca. Physik f¨ ur Wissenschaftler und Ingenieure. Elsevier, 2004. [89] C. Truesdell. Rational Thermodynamik. Springer, 1984. [90] E. B. W. Wu and D. Kolymbas. Hypoplastic constitutive model with critical state for granular materials. Mech. Mat. 23, 45-69, 1996. [91] G. Walker. Thickness and viscosity of Etnean lavas. Nature 213, 1967. [92] C. Wang. A new representation theorem for isotropic functions: Parts I and II. Arch. Rational Mech. Anal. 36, 1970. [93] C. Wang. Corrigendum to my recent paper on ”Representations for isotropic functions”. Arch. Rational Mech. Anal. 43, 1971. [94] D. Waugh. Geography: An integrated Approach. Nelson Thornes, 2002. [95] H. William and A. McBirney. Volcanology. Treeman Cooper, 1979. [96] J. Wylie and J. Lister. The stability of straining flow with surface cooling and temperature dependent viscosity. Journal of fluid mechanics, 2000. [97] P. Young and G. Wadge. Flowfront: simulation of lava flow. Computer and Geoscience, 1990.
This page is intentionally left blank
CHAPTER 11 A TRIBUTE TO A COLLEAGUE AND FRIEND: PROFESSOR Y. H. PAO RIGOROUS MECHANICS AND ELEGANT MATHEMATICS ON THE FORMULATION OF CONSTITUTIVE LAWS FOR COMPLEX MATERIALS: AN EXAMPLE FROM BIOMECHANICS
Van C. Mow*, Xin L. Lu* and Leo Q. Wan* *Department of Biomedical Engineering, Columbia University 351 Eng. Terra. 1210 Amsterdam Ave, New York, NY 10027 USA E-mail: [email protected]
Congratulations to Professor Y. H. Pao on this 80th birthday celebration. Lao Pao, as I have called him since I first met him in December of 1954! He was a graduate student at RPI and later Columbia University, and a school mate with my older brother, Dr. Harry C. C. Mow. He has been a constant friend to my family and me, a “brother”, and a professional colleague. Professor Pao’s works in mechanics have always represented rigorous developments in mechanics and elegant mathematical solutions, particularly in the analyses of elastodynamics problems. In 1971, he co-authored a monograph on Diffraction of Elastic and Dynamic Stress Concentrations with Harry. In this contribution for this occasion, I hope to demonstrate these same characteristics in my own biomechanics studies as a tribute to him. Again, congratulations, best wishes for a happy and prosperous future. Van C. Mow Abstract: A unified mechanical-physicochemical theory, i.e., a tertiary mixture theory, for modeling a class of biological materials characterized as hydrated-charged-soft tissues has been developed, along with applications to articular cartilage biomechanics studies are presented. Particular emphasis is placed on a linearization procedure 285
286
V. C. Mow, X. L. Lu and L. Q. Wan
for this highly nonlinear constitutive law, and the application of the resultant equations toward determining tissue mechanical behavior and biochemical composition by mechanical indentation. For the first application, the tested specimen is modeled as a homogeneous and isotropic charged-hydrated-soft tissue; in our second application, the specimen will be modeled as a layered laminate and in-plane orthotropic material. In this presentation a descriptive explanation is presented to motivate the need for a tertiary mixture theory to represent the behavior of biological tissues that are comprised of interstitial water (~75%), a porous-permeable charged organic solid matrix of collagen fibers and proteoglycan supra-macromolecules (~25%), and dissolved electrolytes (e.g., Na+, Cl− of negligible%). The viscoelastic behavior of such hydrated soft tissues has two major components: 1) a flow independent component due to the intrinsic macromolecular viscoelasticity; and 2) a flow dependent component due to frictional dissipation from flow through the porous-permeable matrix. A mathematical method of solution is suggested with the introduction of transformation of variables resulting in a new “generalized correspondence principle” between charged-hydrated-soft tissues and linear-isotropic-elastic materials (i.e., elasticity theory). This principle makes the employment of triphasic theory as straightforward as using an elasticity theory.
1. Introduction Articular cartilage is a load-bearing soft tissue that is vital for the maintenance of normal joint functions. It can absorb mechanical shocks during joint motion, spread the applied load onto the underlying subchondral bone, and provide a relatively smooth surface for the decades-long, highly loaded and repetitive gliding motions between articulating bones. However, cartilage has a limited ability to repair itself when injuries and degenerative joint diseases such as osteoarthritis (OA) occur [17]. OA affects a large population (~27% of the U.S. population under the age of 70) and causes joint deformities, pain, reduced joint motion, and even total loss of joint function (Figure 1). Therefore, tremendous efforts not only in the U.S. but also around the world have been taken to regenerate or repair cartilage tissue with an ultimate aim of the restoration of cartilage and joint mechanical function [13].
Cartilage Biomechanics
287
Fig. 1. An opened knee joint with glistening, smooth, and intact articular cartilage (left). In advanced OA, articular cartilage is entirely lost at weight-bearing sites over the joint surface and spurs grow out from the edge of the bone (right).
Articular cartilage is composed of an electrically charged and hydrated organic matrix of collagen and proteoglycan, where a sparse population of chondrocytes reside. Biomechanically the cartilage is widely considered as a mixture consisting of solid matrix and interstitial fluid, the interaction of which was formulized, and experimentally validated by the well-known biphasic theory [20, 21]. Normal cartilage contains on average approximately 75% water within the pores of the collagen-proteoglycan solid matrix that are estimated to be 50-65 Å in diameter; even though its water content is high, its hydraulic permeability is extremely low at approximately 10-15 m 4 / ( N ⋅ s ) . During joint movement, the cartilage undergoes volumetric deformation with interstitial fluid moving through the open-connected-pores of the solid matrix and carrying large mechanical loads, which, otherwise, would damage the solid matrix and be detrimental to embedded chondrocytes. The solid matrix of articular cartilage consists mainly of collagen fibrils and proteoglycans (Fig. 2) [20]. While the former is a slender, long, relatively stiff and neutral molecule, the latter is more like a globular, highly negatively charged, soft gel like substance with well defined and complex molecular architecture in solution. The charges on proteoglycans mainly derive from the presence of carboxyl (COO−) and sulfate groups (SO3−) along with the chondroitin and keratan sulfate chains. These negative fixed charges in such porous-permeable interstitial space require a high concentration of water, counter-ions
288
V. C. Mow, X. L. Lu and L. Q. Wan
Fig. 2. A schematic diagram indicating the fiber-reinforced, collagen-proteoglycan matrix in cartilage.
(Na+) for electro-neutrality to be maintained and also introduce an imbalance of total mobile ion concentration between the fluid compartment inside the tissue and the bathing fluid outside the tissue. The physico-chemical colligative property of this difference in total ion concentration is the Donnan osmotic pressure, which causes the tissue to swell both dimensionally and by weight [18, 20]. To analytically model the above mentioned interstitial fluid pressurization, and osmotic swelling behaviors, together with dynamic electrochemical events such as streaming and diffusion potentials, the triphasic mixture theory was developed based on a continuum point of view with an additional phase (electrolyte ions) introduced along with the charges fixed to the solid porous-permeable matrix [12, 14]. This theory was first introduced in 1991, and expanded in 1997. The triphasic theory, is not only entirely consistent with the well recognized earlier biphasic theory, but it can also explain many complex mechano-electrochemical (MEC) phenomena inside the tissue, such as solid matrix deformation, fluid flow, electrolyte transport, electrical potential, and swelling pressure. Due to the complexity of the triphasic theory, to date, however, there are only a handful of mathematical solutions to triphasic theory available in the literature using complex
Cartilage Biomechanics
289
numerical computational methods, not to mention being used by bioengineers who are often lack a rigorous mathematical training. In this chapter, we present the triphasic theory, a novel application of the theory to determine the biochemical composition of cartilage, recent major advances derived from a linearization of the triphasic formulations, and applications in two distinct areas: cartilage indentation – a simple poke that arouses spatio-temporally dependent mechanical, chemical and electrical events, and tissue curling – an important phenomena of cartilage that is dictated by the inherent inhomogeneities and anisotropies. 2. Triphasic Mixture Theory: A Unified Model for Soft Tissue The triphasic theory was developed in 1991 under continuum mixture theory framework [14]. It is mainly based on the general theory for a mixture of N phases with one incompressible solid phase and N-1 incompressible fluid phases developed by Bowen [4, 5]. Considering the structure and composition of articular cartilage, it is natural to treat the collagen and charged proteoglycans as the solid phase, the water as fluid phase, and the free electrolytes as a single fluid phase. This selfcontained and self-consistent field theory can describe both the known mechanical behaviors and the key electrochemical and physicochemical behaviors of the charged soft tissues. The triphasic theory may be conceptually summarized with the following constitutive assumptions: • All three phase are assumed to be incompressible. The fluid phases include a water phase and an ion phase, capable of individual free movement. The ions only occupy negligible volume. • The negative charges fixed on proteoglycan are attached to the whole solid phase. The fixed charge density (FCD) depends on the volume of solid matrix. At any point inside the tissue, the net charge should be zero. • The driving forces for fluid phases are the gradients of the chemical potential of the fluid phase (including the fluid pressure) and the electrochemical potentials of the ion phase. • Frictional drag between different phases exists when there is relative velocity. The drag forces between ions and solid, ion and ion, are negligibly small.
290
V. C. Mow, X. L. Lu and L. Q. Wan
The triphasic constitutive equations are thermodynamically permissible. To include the fixed charges into the model, Helmholtz energy is assumed to be dependent on the FCD value. The incorporation of traditional chemical potential for electrolyte and polyelectrolyte solutions provides a theoretical bridge between the continuum mixture theory and physicochemical theory. 2.1. Continuity Equation and Electroneutrality Condition If ϕ α denotes the volumetric fraction of α phase in the mixture, then we have:
ϕ α = ρ α / ρTα ,
(1)
α
where ρT is the true density of α phase and ρα is the apparent density. According to mixture theory, the water volume fraction ϕ w determines the apparent densities of w, +, and – phases through:
ρ w = ρTwϕ w , ρ + = ϕ wc + M + , ρ − = ϕ wc − M − ,
(2) α
where Mα is the α-ion molecular weight (g/mol), and c is the concentration of ion in fluid phase. Generally, it is assumed that the solution is dilute enough so that the volumetric fractions of ion phases are negligible, i.e. ϕ + , ϕ − ≈ 0 , then we have:
ϕw +ϕs =1.
(3)
When no chemical reaction takes place, the balance of mass equation for each individual fluid phase is:
∂ρ α + div( ρ α vα ) = 0, ∂t
α = w, +, − ,
(4)
α
where v is the velocity of α phase. When using the condition of intrinsic incompressibility for both the solid phase and the fluid phase of cartilage and again neglecting the volume of cations and anions, the continuity equation for entire mixture can be written as:
div(ϕ s v s + ϕ w v w ) = 0 .
(5)
291
Cartilage Biomechanics
According to assumption 2, the net electric charge at every spatial point within the tissue must be zero to satisfy the electroneutrality condition:
c+ = c− + c F ,
(6)
F
where c is the fixed charge density (FCD) raised by the negative charges on proteoglycans. This density changes with the volume deformation of the solid matrix according to the following equation:
c F = c0F (1 −
trE
ϕ 0W
),
(7)
where E is the infinitesimal strain tensor which defines the deformation of the solid matrix. Here trE represents the dilatation of solid matrix, c0F , ϕ0w are the fixed charge density and water volume fraction at the initial reference state, respectively. The conservation of ions and fixed charges, and the electro-neutrality condition yield:
div ( I e ) = 0
(8)
where I e is the electrical current density given by:
I e = Fcϕ w [c + ( v + − v s ) − c − ( v − − v s )] ,
(9)
where Fc is the Faraday constant. 2.2. Momentum Equations The balance of momentum equations for the tissue and each fluid phase, under a quasi-static condition, with inertia neglected, are given by: (10) ∇ ⋅σ = 0,
− ρ α ∇µ α + ∑ fαβ ( v β − vα ) = 0,
(α ≠ solid phase)
(11)
β
where σ is the total stress acting in the tissue; µα is the chemical potential for water or the electrochemical potentials for cations and anions. The summation in Eq. (11) is carried out for β over the solid, water, cation, and anion phases ( α , β = s, w, +, − ), and fαβ is the diffusive drag coefficient (per unit of mixture volume) between α and β phases and satisfies the Onsager’s reciprocity relations: fαβ = fβα and fαβ ≡ 0 if α = β . Usually the diffusive drag between anions and cations, and
292
V. C. Mow, X. L. Lu and L. Q. Wan
the diffusive drag between ions and the solid matrix are neglected (i.e., f +− ≈ f s + ≈ f s − ≈ 0 ). The remaining diffusive drag coefficients relate to the ion diffusivities ( D + , D − ) and interstitial fluid permeability k through:
f ws =
ϕw k
2
ϕ w RTc −
, f w− =
D−
, f w+ =
ϕ w RTc + D+
,
(12)
where R is the universal gas constant, T is the absolute temperature. 2.3. Constitutive Equations The constitutive equations for the total stress σ , chemical potential for water and electrochemical potentials for cations and anions are given by:
σ = − pI + σ e ,
µ w = µ 0w +
1
ρTw
[ p − RTΦ (c + + c − )] ,
(13) (14)
µ + = µ0+ +
Fc Ψ RT ln(γ + c + ) , + M+ M+
(15)
µ − = µ0− −
Fc Ψ RT ln(γ − c − ) , + M− M−
(16)
where σe is the elastic stress tensor for the solid matrix of cartilage tissue, and p is the interstitial fluid pressure, Fc is Faraday’s constant, Ψ is the electric potential across the tissue, Φ is the osmotic coefficient of the solution which is generally assumed as unity, γ+, γ- are the activity α coefficients of cation and anion inside the tissue, respectively. Also, µ0 is the chemical or electrochemical potential at a chosen reference state. If we assume the solid matrix as linear, isotropic elastic material undergoing infinitesimal deformation, then σ e turns out to be:
σ e = λs tr ( E ) I + 2µ s E .
(17)
Here, λs and µs are the Lames’ coefficients in elasticity of the porouspermeable solid matrix.
Cartilage Biomechanics
293
2.4. Boundary Conditions The complexity and variety of boundary conditions are responsible for much of the complications associated with a triphasic analysis. Before pursuing the triphasic solution, it is necessary to clarify the general boundary rules for triphasic mixture. It is natural to define the boundary of triphasic mixture to be fixed to the solid phase. When a triphasic soft tissue interfaces with an external bathing solution, this boundary allows free exchange of the interstitial fluid and ions with the external solution. The jump conditions of a triphasic mixture, corresponding to the balances of mass, momentum, and energy, with inertia neglected, are:
[[ v s ]] = 0 ,
(18)
[[ϕ w ( v w − v s )]] ⋅ n = 0 ,
(19)
[[ϕ wc + ( v + − v s )]] ⋅ n = 0 ,
(20)
[[ϕ wc − ( v − − v s )]] ⋅ n = 0 ,
(21)
[[σ ]] ⋅ n = 0 .
(22)
Assuming the interface is ideally singular with continuous temperature, the entropy jump condition can generate:
[[ µ α ]] = 0 ,
α = w, + , −
(23)
3. Determining the Proteoglycan Content: A Novel Mechanical Method 3.1. A Generalized Correspondence Principle of Triphasic Theory The objective of the following analysis is to establish a closed-form relationship between the displacements, stresses, strains, charge densities, pressures, and ionic concentrations within the loaded triphasic tissue at its final steady state. If an arbitrary piece of ideal triphasic medium subjected to a Heaviside loading function, the tissue would creep, due to fluid and ion exudation from the tissue, until it reaches equilibrium after the initial transient phase, both mechanically and thermodynamically. This configuration represents most mechanical tests and theoretical
294
V. C. Mow, X. L. Lu and L. Q. Wan
analyses employed in the literature, such as confined compression, unconfined compression, and indentation tests, etc. In some models, it is also common to assume that the material properties of cartilage are homogeneous and that the solid matrix strain remains infinitesimally small. Since the strain is small, a regular perturbation method can be employed to perform the analysis [27]. The perturbation sequence for the unknown parameters such as cF, p and σ are:
Q = Q0 + ε 0Q1 + ε 02 Q2 + ... ,
(24)
where Q0 is the value of unknowns at reference state as defined previously, and the linearized perturbation amount for a specific unknown will be:
δ Q = Q − Q0 ≈ ε 0Q1 .
(25)
At thermodynamic equilibrium, three of the boundary conditions at the surface of triphasic material are:
µ w = µ w* , µ + = µ +* , µ − = µ −*.
(26)
By combining the constitutive equations for the electrochemical potentials and the electroneutrality condition, the boundary conditions in Eq. (26) require that: 2
k
c =
F 2
(c )
γ + 4 ∗ c∗ . γ±
(27)
Thus the linearized equation between the FCD and the sum of ion concentrations is:
c0F
δ ck =
(c ) F o
2
+4
γ∗ ∗ 2 (c ) γ±
δ cF .
(28)
Similarly, the chemical potential of water within the tissue must also be equal to the chemical potential in the bathing solution. This condition yields the following relationship on p:
δ p = Φ RT δ c k ,
(29)
where δ p is the linearized perturbation to the fluid pressure resulting k from the deformation, and δ c is the total difference of ions
295
Cartilage Biomechanics
concentration inside and outside the tissue. According to Eq. (7), the linearized change in fixed charge density is related to the elastic deformation of solid matrix by:
δ cF = −
c0F
ϕ0w
δe,
(30)
where δ e is the solid matrix dilatation relative to the initial swollen state. It is equal to the trace of the perturbation to the strain tensor. Here c0F , ϕ0w are the fixed charge density and water volumetric fraction at the reference state, respectively. Combining Eqs. (27-30) yields an important relationship between δ p and δ e :
− δp= w ϕ0
Φ RT ( c0F )
(c ) F 0
2
+4
2
γ∗ ∗ 2 (c ) γ±
δ e .
(31)
This new and important equation relates the change of the osmotic pressure with the solid matrix dilatation e. The term inside the square brackets can be treated as the rate of change of the osmotic pressure with dilatation. In the loaded triphasic material, the total stress consists of both the solid matrix stress and the fluid pressure. The perturbation to the total tissue stress tensor resulting from a small solid matrix deformation is given by:
δ σ = −δ pI + δ σ e .
(32)
Substituting Eq. (31) into Eq. (32) and assuming a linear isotropic solid matrix for cartilage tissue, the total stress inside the tissue may be written as:
δ σ = 2µs*δ E + λs*δ eI,
(33)
where:
* λs = λs + Π, and Π = w ϕ0
Φ RT ( c0F )
(c )
µs* = µs .
F 0
2
+4
2
γ∗ ∗ 2 (c ) γ±
,
(34)
(35)
296
V. C. Mow, X. L. Lu and L. Q. Wan
Fig. 3. Comparison of the calculated FCD values (ordinate) from correspondence principle and those obtained from biochemical GAG assay (abscissa) using a commercial kit.
Again, µs and λs are the intrinsic shear modulus and Lame’ constant of * the solid matrix. Note that the parameter λs is independent of the spatial position within the deformed tissue. The situation for shear modulus is even simpler. It remains equal to its intrinsic value. More importantly, Eq. (33) is identical in mathematical form to the generalized Hooke’s law for an elastic material. Therefore, the mechanical response of a linear triphasic medium at final steady state will be indistinguishable from that * * of a linear elastic material exhibiting Lame’ coefficients λs and µs . Eqs. (34-35) define the generalized correspondence principle for triphasic theory. They make the employment of triphasic analysis as straightforward as using an elastic model to solve equilibrium problems, and yet being able to properly account for effects due to the FCD and the osmotic effect on the charged porous-permeable solid matrix. 3.2. Determination of Fixed Charge Density For osteoarthritic articular cartilage at its early stage, one of the changes is in the morphology and micro-structure of proteoglycans. This makes
Cartilage Biomechanics
297
proteoglycan/FCD an excellent biochemical marker for early OA diagnosis [6, 7, 20]. Numerous methods, such as chemical or MR imaging techniques were developed to determine proteoglycan content [3, 8, 19]. None of the techniques, however, can determine this biochemical composition change and simultaneously with mechanical properties of tissue at the same spatial location. According to correspondence principle, with the knowledge of both intrinsic and apparent mechanical properties, the FCD can be easily calculated [15]. A commonly employed method to obtain intrinsic properties of cartilage is to perform a mechanical test by submerging the tissue in a hypertonic environment (with high external solution concentration, such as 2.0 M NaCl) [9, 10]. In a hypertonic solution (2.0 M), the Donnan osmotic pressure induced by the FCD asymptotically approaches zero (as can be theoretically shown [15]). Thus, the intrinsic mechanical properties (without the effects of the osmotic swelling pressure) can be extracted by analyzing mechanical testing data obtained with the specimen submerged in 2.0 M solution. In contrast, when cartilage is equilibrated in physiological condition (0.15 M NaCl solution), the mechanical loading is supported by both the solid matrix mechanical stress and the osmotic pressure in the fluid phase. Therefore, the mechanical properties obtained directly has the osmotic effect lumped into it, which are known as the apparent properties. With the knowledge of both intrinsic and apparent mechanical properties, the fixed charge density can be calculated directly from Eq. (34). Figure 3 shows the comparison of the calculated FCD based on this strategy and those from independent biochemical GAG assay. The intrinsic and apparent mechanical properties were obtained from indentation creep tests [16]. A linear regression analysis showed that the results from correspondence principle and biochemical test are remarkably consistent with each other. This not only proves the validity and applicability of triphasic theory, but also provided an important and easy biomechanical alternative method for non-invasively determining the FCD in articular cartilage simultaneously with mechanical stiffness using a pure mechanical technique.
298
V. C. Mow, X. L. Lu and L. Q. Wan
4. Curling Behavior of Articular Cartilage: Anisotropic Lamination Model and Triphasic Theory With the introduction of biomechanical complexities in tissue microstructure such anisotropy and inhomogeneity, the triphasic theory can be further used to describe a variety of experimentally known mechanical behaviors of classes of cartilage, and other biological tissues as blood vessels. Here we demonstrate that with the aforementioned simplification (i.e., the linearization of the triphasic theory) the application of triphasic theory in explaining tissue curling behavior would rather be easy and straightforward. It has been known for decades that adult articular cartilage sample will curl and warp toward its articular surface after removal from the underlying subchondral bone. This is particularly important for grafting cartilage a procedure used in common plastic surgery (e.g., rhinoplasty). The curvature is found to vary with the saline concentration in the external bathing solution [25]. This curling behavior is an evidence of inhomogeneous residual stresses and strains inside the solid matrix of articular cartilage, and is believed to be contributed to by both swelling properties and layered inhomogeneous ultrastructure of articular cartilage [24]. Residual stresses and strains play important roles in the biomechanical function of the load-bearing tissues such as vein, cardiac ventricle, and cartilage [11].
Fig. 4. The layer structure of articular cartilage strip with its dimensions and the predominant orientation of collagen fibrils.
Cartilage Biomechanics
299
The layered inhomogeneties of chemical contents and macromolecular organization in situ cause variations of osmotic pressure and solid matrix stresses throughout the depth of the tissue, and thus giving rise to the often-observed curling behaviors of articular cartilage. Many polarized light and electron microscopy studies have been reported on the variation of collagen fibrillar (or fibrous) architectural organization through the depth of articular cartilage [20]. Collagen fibrils are aligned tangentially to the surface in the superficial tangential zone (SZ), randomly in the middle zone (MZ), and vertically in the deep zone (DZ) (Figure 4). These micro-structural features of collagen seem to correlate well with the relative amounts of the substances found within the matrix, such as water and collagen contents and fixed charge density (FCD); in other words, composition also varies layer-wise throughout the tissue depth from the surface to the deep layer attached to the bone. As a result, mechanical properties and swelling effects of the tissue also show the depth dependency, leading to the curling behavior of articular cartilage. While the intuitive picture for the cartilage curling behavior has been known for many years [18], the successful predictions from a theoretical model with experimental data for cartilage curling behavior remained a big challenge. However, with the development of the triphasic theory, this problem can now be modeled. The focus of this research is therefore to develop a quantitative model based on the triphasic constitutive law to describe the swelling and curling behaviors of articular cartilage; and to compare the predicted deformation on the curling behavior of thin strips of cartilage specimens with previous experimental results. 4.1. Mathematical Modeling for Cartilage Curling Again, at the hypertonic state (asymptotically c* →∞), we assume the swelling effects associated with the FCD are negligible and the entire tissue (length a × width b × thickness h) is assumed to be flat initially as shown in Figure 5(A). All physical parameters after swelling are determined relative to those at this hypertonic reference state (HRS). The material properties and chemical parameters are assumed to vary with depth (i.e., between the three (~150µm originally thin-flat layers), but to be homogenous within each layer. If these three layers were separated
300
V. C. Mow, X. L. Lu and L. Q. Wan
Fig. 5. Schematic explanations for the origin of curling behavior of articular cartilage. Three layers (A) have different swelling potentials (B), which lead to curling behavior of cartilage strips that space multiple layers (C).
Table 1. The intrinsic parametric values for articular cartilage, based on previous literatures. Position STZ MZ DZ
Thickness FCD Porosity λ (MPa) µ (MPa) (mm) (mEq/ml) 0.3 0.7 0.12 0.2 0.4 1.2 0.5
0.8 0.6
0.25 0.20
0.2 0.1
0.4 0.15
λ+1 (MPa)
λ-1 (MPa)
10
3.0
MZ=Isotropic 3.0
― 0.9
from each other, and allowed to swell independently, then each layer would experience a different deformation as shown in Figure 5(B) and remain straight [22]. However, if the layers were joined together before swelling, then the in-plane strains (in x-y plane) at the interface would have to match, which would produce curling (Figure 5(C)). The triphasic theory [14] is used to model the swelling behavior (Figure 5: A→B) of each layer. The total stress (σ) consists of elastic stresses inside the solid matrix (σs) and the osmotic pressure (p) as shown below: σ = –p I+σs
with p = π – П e
(36)
where π = ΦRT(ck –2c*) and Π = ΦRT(crF)2/(φrwck) [2, 27]. Here φrw and crF are the porosity and FCD at HRS, respectively. An orthotropic constitutive equation with tension-compression nonlinearity is used to describe the mechanical property of the solid matrix for each layer based on the collagen fibril microstructure [1, 22]. Since the cartilage strip is assumed to be very thin, i.e., h << a and h << b, the problem can be solved based on a classical lamination theory and the usual assumption of pure bending (Figure 5: B→C). To compare
Cartilage Biomechanics
301
with Setton et al’s previous experimental results [25], the typical parameters adopted for patellar cartilage are as the base case and shown in Table 1 [20, 23], in which λ+1 and λ-1, the elastic moduli for tension and compression, respectively, account for the tension-compression nonlinearity of fibrous-composite tissue solid matrix [26]. The elastic modulus was set to be 3MPa for the SL in the direction perpendicular to the split lines [23]. 4.2. Modeling Results and Discussion Figures 6 and 7 represent variations of surface stretch (Λ = 1 + ε ) and curvature (κ ) as a function of the external ion concentrations in the directions both parallel and perpendicular to the predominant collagen fibril directions [23]. It can be seen that the tissue at the bottom of the sample is under tension (i.e., convex side with stretch Λ > 1), while at the top (concave side) the strain will be very close to zero, or even slightly negative (i.e., in compression with stretch Λ ≤ 1). The curvature increases with the decrease of external saline concentration, and is always larger in the collagen fibril direction (x). The predicted stretch and curling are comparable to experimental data reported previously by Setton et al [25]. Just as shown in their study, the in-plane strains (εx and εy) at the articular surface is predicted to be very close to zero for the direction parallel to the split lines (the predominant collagen fiber direction at the surface) as well as for the direction perpendicular to the split lines. The experiment results also show that the strain changes about 0.07-0.11 in the deep zone of the strip when the ion concentration varies from 2M to 0.15M, which implies that the overall curvatures changes about 0.035-0.055mm-1 for a 2mm thick sample. Our values for stretch change (~0.1) and curvature change (~0.05 mm-1) are in these ranges. Considering that the typical material parameters adopted in this study are based on experimental data of chemical contents and mechanical properties in previous literatures, indeed our models have captured the actual mechanism of swelling and curling behavior of articular cartilage and therefore be used to predict mechano-electrochemical events happening inside the tissue during swelling.
302
V. C. Mow, X. L. Lu and L. Q. Wan
Fig. 6. Variation of stretch (Λ=dx/dX) of cartilage strip in different external ion concentrations (0.015M, 0.05M, 0.15M, 0.5M, and 2M). The stretch is defined as the ratio of the length after deformation (dx) and the original length (dX).
Fig. 7. Variation of curvature of cartilage strip in different ion concentrations for the base case parameters listed in Table 1.
Cartilage Biomechanics
303
5. Summary Within the past decade, significant advances have been made in experimental, theoretical, and biological studies of the basic sciences relating to articular cartilage, and other layered biological tissues. New testing techniques have emerged such that articular cartilage can be studied in greater detail in terms of its nonlinear and viscoelastic behaviors, depth-dependent inhomogeneity of the mechanoelectrochemical properties, and anisotropy characteristics. New advanced theories have been developed that encompass these experimental findings. The knowledge obtained from both fronts, experimental and theoretical, provides enrichment and in-depth understanding of the structure-function relationship in articular cartilage and in all softhydrated-charged connective tissues as well. There is now no doubt that each phase (the charged solid matrix, water and ions) of the cartilage contributes to its compressive, tensile, electrokinetic, and transport behaviors. The triphasic mixture theory has been successfully used to describe the flow-dependent and flow-independent viscoelastic behaviors, swelling behaviors, and electrokinetic behaviors of charged, hydrated soft tissues in the last two decades. It is widely considered in the biomechanics literature as the unified theory for such materials. The generalized correspondence principle presented in this review bridged the powerful yet complex triphasic mixture theory with the simple linear-isotropic-elastic model. The simultaneously measured mechanical properties and biochemical composition will inevitably provide significant new insights toward the understanding of OA etiology, especially during the early stages of the disease process. As the cartilage is anisotropic, inhomogeneous, and nonlinear mixture material, to demonstrate the capability of triphasic theory, we further combined it with the orthotropic lamination model for analyzing the well-known curling behavior of the tissue. The simulation results matched with experimental data well. These experimental confirmations further give confidence that the unified mechanical-physicochemical model for soft tissue, i.e., the triphasic theory, can be easily extended and strengthened to meet the requirement of modern biomedical research.
304
V. C. Mow, X. L. Lu and L. Q. Wan
Acknowledgments This work was supported in part by Stanley Dicker Endowed Chair for Biomedical Engineering, the Liu Ping Functional Tissue Engineering Laboratory funds, and the Whitaker Foundation Special Development Award. References 1. Akizuki, S., V.C. Mow, F. Muller, J.C. Pita, D.S. Howell, and D.H. Manicourt, Tensile properties of human knee joint cartilage: I. Influence of ionic conditions, weight bearing, and fibrillation on the tensile modulus. J Orthop Res, 1986. 4(4): p. 379-92. 2. Ateshian, G.A., N.O. Chahine, I.M. Basalo, and C.T. Hung, The correspondence between equilibrium biphasic and triphasic material properties in mixture models of articular cartilage. Journal of Biomechanics, 2004. 37(3): p. 391-400. 3. Bashir, A., M.L. Gray, R.D. Boutin, and D. Burstein, Glycosaminoglycan in articular cartilage: in vivo assessment with delayed Gd(DTPA)(2-)-enhanced MR imaging. Radiology, 1997. 205(2): p. 551-8. 4. Bowen, R.M., Theory of mixture., in Continuum Physics, A.E. Eringen, Editor. 1976, Academic Press: New York. p. 1-127. 5. Bowen, R.M., Incompressible porous media models by use of the theory of mixtures. Internation Journal of Engineering Science, 1980. 18: p. 182-185. 6. Buckwalter, J.A., K.E. Kuettner, and E.J.-M. Thonar, Age-related changes in articular proteoglycans: Electron microscopic studies. J Orthop Res, 1985. 3: p. 251-257. 7. Buckwalter, J.A., H.J. Mankin, and A.J. Grodzinsky, Articular cartilage and osteoarthritis. Instr Course Lect, 2005. 54: p. 465-80. 8. Buckwalter, J.A. and L.C. Rosenberg, Electron microscopic studies of cartilage proteoglycans. Electron Microsc Rev, 1988. 1(1): p. 87-112. 9. Chahine, N.O., F.H. Chen, C.T. Hung, and G.A. Ateshian, Direct measurement of osmotic pressure of glycosaminoglycan solutions by membrane osmometry at room temperature. Biophys J, 2005. 89(3): p. 1543-50. 10. Flahiff, C.M., D.A. Narmoneva, J.L. Huebner, V.B. Kraus, F. Guilak, and L.A. Setton, Osmotic loading to determine the intrinsic material properties of guinea pig knee cartilage. J Biomech, 2002. 35(9): p. 1285-90. 11. Fung, Y.C., Biomechanics: Motion, Flow, Stress and Growth. 1990, New York, NY: Springer-Verlag. 12. Gu, W.Y., W.M. Lai, and V.C. Mow, A triphasic analysis of negative osmotic flows through charged hydrated soft tissues. Journal of Biomechanics, 1997. 30(1): p. 71-78.
Cartilage Biomechanics
305
13. Guilak, F., D.L. Butler, and S.A. Goldstein, Functional tissue engineering: the role of biomechanics in articular cartilage repair. Clin Orthop Relat Res, 2001 (391 Suppl): p. S295-305. 14. Lai, W.M., J.S. Hou, and V.C. Mow, A triphasic theory for the swelling and deformation behaviors of articular cartilage. J Biomech Eng, 1991. 113(3): p. 245-58. 15. Lu, X.L., C. Miller, F.H. Chen, X. Edward Guo, and V.C. Mow, The generalized triphasic correspondence principle for simultaneous determination of the mechanical properties and proteoglycan content of articular cartilage by indentation. J Biomech, 2007. 40(11): p. 2434-41. 16. Lu, X.L., D.D. Sun, X.E. Guo, F.H. Chen, W.M. Lai, and V.C. Mow, Indentation determined mechanoelectrochemical properties and fixed charge density of articular cartilage. Ann Biomed Eng, 2004. 32(3): p. 370-9. 17. Mankin, H.J., V.C. Mow, J.A. Buckwalter, J.P. Iannotti, and A. Ratcliffe, Articular cartilage structure, composition, and function, in Orthopaedic Basic Science: Biology and Biomechanics of the Musculoskeletal System, J.A. Buckwalter, T.A. Einhorn, and S.R. Simon, Editors. 2000, American Academy of Orthopaedic Surgeons Publishers: Rosemont, IL, USA. 18. Maroudas, A., Physicochemical properties of articular cartilage, in Adult Articular Cartilage, M.A.R. Freeman, Editor. 1979, Pitman Medical: Kent, UK. p. 215-290. 19. Maroudas, A. and H. Thomas, A simple physicochemical micromethod for determining fixed anionic groups in connective tissue. Biochim Biophys Acta, 1970. 215(1): p. 214-6. 20. Mow, V.C., W.Y. Gu, and F.H. Chen, Structure and function of articular cartilage and meniscus, in Basic orthopaedic biomechanics and mechano-biology, V.C. Mow and R. Huiskes, Editors. 2005, Lippincott Williams & Wilkins: Philadelphia. p. 181-258. 21. Mow, V.C., S.C. Kuei, W.M. Lai, and C.G. Armstrong, Biphasic creep and stress relaxation of articular cartilage in compression? Theory and experiments. J Biomech Eng, 1980. 102(1): p. 73-84. 22. Myers, E.R., W.M. Lai, and V.C. Mow, A continuum theory and an experiment for the ion-induced swelling behavior of articular cartilage. J Biomech Eng, 1984. 106(2): p. 151-8. 23. Roth, V. and V.C. Mow, The intrinsic tensile behavior of the matrix of bovine articular cartilage and its variation with age. J Bone Joint Surg, 1980. 62(7): p. 1102-17. 24. Setton, L.A., W.Y. Gu, V.C. Mow, and W.M. Lai, Predictions of the swellinginduced pre-stress in articular cartilage, in Mechanics of Porous Media, A.P.S. Selvadurai, Editor. 1995, Kluwer Acad Press. p. 229-322. 25. Setton, L.A., H. Tohyama, and V.C. Mow, Swelling and curling behaviors of articular cartilage. J Biomech Eng, 1998. 120(3): p. 355-61.
306
V. C. Mow, X. L. Lu and L. Q. Wan
26. Soltz, M.A. and G.A. Ateshian, A conewise linear elasticity mixture model for the analysis of tension-compression nonlinearity in articular cartilage. J Biomech Eng, 2000. 122(6): p. 576-586. 27. Wan, L.Q., C. Miller, X.E. Guo, and V.C. Mow, Fixed electrical charges and mobile ions affect the measurable mechano-electrochemial properties of chargedhydrated biological tissues: the articular cartilage paradigm. Mechanics & Chemistry of Biosystems, 2004. 1(1): p. 81-99.
CHAPTER 12 PROFESSOR PAO’S INFLUENCE ON RESEARCH IN COUPLED FIELD PROBLEMS, CHIRALITY AND ACOUSTIC AND ELECTROMAGNETIC METAMATERIALS AND THEIR APPLICATIONS
Vasundara V. Varadan George & Boyce Billingsley Chair and Distinguished Professor Director – Microwave & Optics Laboratory for Characterization & Imaging Department of Electrical Engineering University of Arkansas E-mail: [email protected] This paper reviews the theme “Unity in Diversity” that envelopes acoustics, electromagnetics and elastodynamics. The Helmholtz decomposition of vector fields in terms of scalar and vector potentials serves to unify all continuum fields while the constitutive models that have been developed to explain the observed experimental behavior of the fields introduces diversity in wave propagation. This serves as a fertile ground for consideration of various coupled field problems. In this paper, the term coupling is used to refer to the coupling of solenoidal and irrotational fields as well as the coupling of electric and magnetic fields or elastodynamics and electromagnetic fields. It will be clear from this paper that developments during the last quarter century or more from consideration of coupled fields have lead to the development of many exciting applications using piezoelectric, piezomagnetic, magneto-electric, chiral and currently metamaterials. This paper reviews contributions in these areas for applications to sonar and radar coatings, NDE, smart materials and structures, electromagnetic and elastodynamics chirality and electromagnetic and acoustic metamaterials.
307
308
V. V. Varadan
Preamble This paper traces my evolution as a researcher after I was mentored by Professor Yih-Hsing Pao. I was fortunate to be his post-doctoral fellow during 1974-1977. As a freshly minted Ph.D. my research background was in electromagnetics, wave propagation and scattering and statistical mechanics. I had a vague knowledge of mechanics and elastic waves from hanging around the Theoretical & Applied Mechanics group at Northwestern University where my now husband was a graduate student. That is how I saw Professor Pao’s advertisement for a post-doctoral fellowship on a notice board at Northwestern. I went to meet Professor Pao at the Applied Mechanics Congress in Boulder and requested him to consider me for the position. The rest as they say is history. Professor Pao is a visionary and recognized back in the early 70s, the importance of coupled field problems involving elastic, acoustic, electromagnetic (EM) fields. He was eager to explore the literature in all three fields and could see great opportunities for theoretical and advanced device development. He asked me to develop a numerical method for solving elastic wave scattering problems based on the T-matrix method that had just been developed by P.C. Waterman for acoustic and electromagnetic waves. I realized that surface integral representations including the extinction theorem had to be developed for the elastodynamic field in addition to proofs of reciprocity and optical theorems for scattering problems analogous to those for acoustic and EM fields. The papers we published on these topics have been well cited. Since I did not know anything about computer programming, Professor Pao hired a graduate student to teach me Fortran for a month. The development of the T-matrix method offered the first numerical solution to the scattering of elastic waves by 2D and 3D bodies of arbitrary shape when the size of the body was comparable to the incident wavelength (Pao and Varadan (1975), Varadan and Pao (1976), Varadan (1977), Varadan and Varadan (1979)). My research has spanned the gamut of acoustics, elastodynamics and electromagnetics and this paper will review some of the contributions I have been able to make because of professor Pao’s mentorship. Topics include: (a) development and applications of the T-matrix method;
On Research in Coupled Field Problems
309
(b) Smart Structures for wireless sensing of stress/strain, temperature measurements and active noise/vibration cancellation; (c) acoustic and microwave chirality and applications; (d) electromagnetic and optical metamaterials and their applications. The common thread in these application areas is the knowledge of dynamical fields and waves and an intuitive understanding of the coupling of mechanical, electric and magnetic fields. I acknowledge with humility Professor Pao’s guidance in enabling me to see the beauty of continuum field theory that spans across all classical fields. 1. Helmholtz Decomposition of Vector Fields and Implications on Acoustic, Electromagnetic and Elastodynamic Fields The Helmholtz theorem for vector fields in conjunction with Gauss’s theorem is a powerful statement that serves to provide a unifying platform for acoustic, electromagnetic and elastodynamic fields. Any vector field F, which is finite, uniform and continuous and that also decays faster than 1/r at infinity may be expressed as the sum of the gradient of a scalar function and the curl of a vector function, Morse & Feshbach (1953), F = −∇Φ + ∇ × A
(1)
The proof of this decomposition is fairly straightforward as presented in many text books on mathematical physics. For (1) to be true, we must show that the scalar and vector potentials can be derived from the vector field F. It is clear that ∇ 2 Φ = f (r) ,
(2)
which is the Poisson’s equation with a scalar source function f(r). The Green’s function for this equation is known and the solution can be written as Φ(r) = ∫ V
f (r ') dV ' 4π r − r '
(3)
310
V. V. Varadan
Similarly, we can show that the vector field must satisfy a vector Poisson equation with F(r) = ∫ V
f(r ') dV ' 4π r − r '
(4)
Solutions in (3) and (4) are unique as long as the integral of these fields over all space is finite which is guaranteed by the required decay of F as 1/r at infinity. We can define an auxiliary vector function G(r) as G(r) = ∫ V
F(r ') dV ' 4π r − r '
(5)
By taking the divergence and curl of G, it is easily seen that (1) must be true since (5) is the solution of ∇2 G = F(r)
(6)
Using the Helmholtz decomposition of vector fields as given in (1), we can now discuss the ramifications of substituting field and constitutive equations that describe acoustic, electromagnetic and elastic fields in linear media. Acoustic Waves in Viscous and Inviscid Fluids The acoustic pressure field is irrotational and isotropic, hence, the particle velocity v(r,t) and the pressure p(r,t) can be represented in terms of a scalar potential Φ v(r, t) = −∇Φ; p(r, t) = ρ0
∂Φ ∂t
(7)
Conservation of momentum in a volume of fluid (Newton’s Law) leads to ρ0
∂v + ∇p = 0 ∂t
(8)
On Research in Coupled Field Problems
311
Substitution of (7) into (8) leads to the wave equation for the acoustic potential ∇2 Φ −
1 ∂2 Φ 1 = 0; c 2 = ; 2 2 κ S ρ0 c ∂t
(9)
where κS is the adiabatic compressibility and ρ0 is the mass density. For viscous fluids (Varadan & Varadan 1992), the constitutive equation in (7) must be changed to include shear motion and the pressure is no longer isotropic and must be replaced by
2 3
σ = − pI + η ( ∇ v + v∇ ) − I ∇ i v + ξ I ∇ i v ,
(10)
where I is the unit dyadic. Newton’s law for continuum force and displacement fields, known as Euler’s equation, can be written as ∂ ( ρ v) ∇iσ − − ∇ i ( vρ v ) = f , ∂t
(11)
where f is the body force. In the absence of body force, we can substitute (10) into (11), and obtain the linearized Navier-Stokes equation for acoustic fields in viscous liquids as
ρ0
∂v 4 + η∇ × (∇ × v ) − (ξ + η )∇ (∇ i v) + ∇p = 0 3 ∂t
(12)
The Helmholtz decomposition of the velocity field may now be invoked v = v l + v t ; ∇ i v t = 0; ∇ × v l = 0
(13)
Substitution of the decomposition into (12) yields uncoupled equations for the irrotational field vl and the solenoidal field vt
312
V. V. Varadan
ρ0
∂v l 4 − (ξ + η )∇ (∇i v l ) + ∇p = 0 , 3 ∂t ∂v ρ0 t + η∇ × (∇ × v t ) = 0 ∂t
(14) (15)
It is clear, that (14) can be transformed into a damped wave equation by using (8); it leads to acoustic wave propagation in the viscous fluid with redefined wave velocity that involves the viscosity. Equation (15) for the irrotational part of the velocity on the other hand leads to a diffusion equation. Elastic Waves in Solids Euler’s equation for a continuum solid is formally the same as (11) given
for a continuum fluid. The stress tensor T in the elastic solid for linear
isotropic materials is related to the strain field by generalized Hooke’s law that may be stated as
T = λ I∇ • u + µ ( ∇u + u∇ ) = λ I∇ • u + µ [ 2∇u + I × (∇ × u)] ,
(16)
where we have used some well known vector identities to obtain the second form of Hooke’s law. In (15), λ and µ are known as Lame constants and describe the elastic response of the material. Substituting (15) into Euler’s equation (10), we obtain (λ + 2 µ )∇(∇iu ) − µ∇ × (∇ × u ) − ρ0
∂ 2u ∂t 2
=0
(17)
Equation (17) is amenable to the application of Helmholtz decomposition for the displacement field u. Consequently, we write, u = ∇Φ + ∇ × Ψ; ∇ iΨ = 0
(18)
In (18), Φ and Ψ are irrotational and solenoidal potential fields, that are scalar and vector respectively. Substitution of (17) into (16) yield the
On Research in Coupled Field Problems
313
well known uncoupled wave equations of pressure waves (P-waves) and shear waves (S-waves) as follows
∇2Φ −
1 ∂2Φ c 2p
∇2 Ψ −
∂t
2
= 0; c 2p =
1 ∂2Ψ cs2
∂t
2
λ + 2µ ; ρ0
(19)
µ ρ0
(20)
= 0; cs2 =
We may remark that a linear isotropic elastic solid is similar to an optically active material in that for the same direction of propagation, the wave speed is different for different polarizations. In an elastic solid, the difference in polarization is more distinct since the P-wave results from an irrotational field whereas the S-wave results from a solenoidal field. In an optically active material on the other hand, the two wave speeds correspond to left and right circularly polarized waves both resulting from solenoidal fields.
Electromagnetic Waves Maxwell’s equations comprising of the two static equations for the electric and magnetic fields called Gauss’s Laws and the two dynamical equations called Faraday- Lenz Law and the Maxwell-Ampere Law, form the foundations of electromagnetic field theory. These equations together with Newton’s Laws then become the whole basis of all of classical Physics and lead to the rich plethora of wave phenomena when coupled with the constitutive behavior of the materials under different conditions of temperature and excitation. The two static Maxwell’s equations arising from the Coulomb force law between electric charges and the absence of point magnetic charges. They are very similar to the equations that can result from Newton’s law of gravitation, the only difference being the existence of positive and negative charges and both force laws refer to instantaneous action at a distance. The similarity of the static fields can be deduced very clearly by deriving an equation for the gravitational acceleration at a point
314
V. V. Varadan
(Brown, 2009). From the force law, we obtain the acceleration vector at the position r due to a mass m as
g=−
m r3
r
(21)
It is easy to verify that the divergence of the acceleration per unit mass vanishes at any point away from the mass, leading to
∇ig = −ρ g
(22)
Equation (22) describes the equation for the gravitational force field per unit mass in the region and ρg is the mass density. Gauss’s Laws for the electric and magnetic force fields can be written as
∇iD = ρV ,
(23)
where D is the electric displacement field (charge flux density) and ρv is the volume charge density. Since there are no magnetic charges, we similarly obtain
∇i B = 0 ,
(24)
where B is the magnetic flux density. We have complete analogy between the gravitational force law for mass and Coulomb’s law for electric charges leading to instantaneous force at a distance. Based on this concept, we would never know about the magnetic field, since the only way to experience the force due to a magnetic field is with a moving charge or an electric current as given by the Lorentz force law f = qE + qv × B
(25)
There is no analogy for a steady electric current, which consists of moving charges but can be produced by a static magnetic field in Newtonian field theory.
On Research in Coupled Field Problems
315
The triumph of Maxwell is the coupling of the electric and magnetic fields by stating laws that relate not only to the distribution of charge at a given “instant”, but also the movement of charge (the current density) with the rates of change of the E and B fields themselves. This is a major departure from the Newtonian theory of simultaneity of force and reaction and the concept of causality (reaction occurs at or after application of action) that is required in electromagnetic field theory. The time dependent Maxwell’s equations are the Faraday-Lenz law and the Maxwell-Ampere Law that result in ∂B ; ∂t ∂D ∇×H = J + ∂t ∇×E = −
(26) (27)
Equations (26) and (27) show the dynamical linking of the E and B fields and their interdependence explicitly involves how they change with respect to time and how the charge density is changing spatially and with respect to time (current density). Since Maxwell’s equations contain partial differential equations, two of which have only spatial derivatives and two of which have both spatial and temporal derivatives and do not contain any constant coefficients, then we require for dimensional consistency that the ratio of electrostatic to electromagnetic units must be the same as the ratio of space units to time units. This ratio turns out to be the speed of electromagnetic waves in vacuum, c = √ε0µ0 = 3×108 m/s. Maxwell was able to predict the speed of light accurately to nine decimal places but also presented a great challenge, namely that although Maxwell’s equations are invariant in all inertial reference frames, they violate Galilean transformation rules. This of course vexed the then young Einstein, all of 16 years old, still in high school in 1895. A few years later, he concluded that c is a fixed constant of nature according to the classical theory of relativity. This is the major difference between classical Newtonian theory for purely mechanical fields and classical Maxwellian theory for electromagnetic fields. Simultaneous or instantaneous action and reaction implied by Newton’s law of gravitation and Coulomb’s law for charges
316
V. V. Varadan
and the dynamical action implied by the dynamical Maxwell equations which then lead us to invoke the principles of causality. Unlike acoustic and elastodynamic fields, the construction of Maxwell’s equations already hint at the solenoidal nature of the electric and magnetic fields in source free regions; i.e., where ρv and J are zero. In that case, the first two Maxwell equations already state that the fields are divergence free. If time dependence is absent, then the last two Maxwell equations (25) and (26) state that the fields are irrotational in a source free region and the first two Maxell equations state further that E and H are irrotational. In order to arrive at uncoupled wave equations for the E and H fields, we have to invoke constitutive equations similar to Hooke’s law that give the response of the medium to the applied force fields E and H. At this juncture, we restrict the discussion to linear isotropic materials and use
D = ε 0 E + P = ε 0 [1 + χ E ]E = ε E ;
(28)
B = µ0 [H + M] = µ0 [1 + χ M ]H = µ H ;
(29)
where P and M are the responses of the medium to the externally applied E and H fields. They are respectively the electric and magnetic polarization fields and they would be absent in vacuum. From (28) and (29) we can identify the constants ε0 and µ0 as the permittivity and permeability of free space or vacuum and these are the same constants that enter into the definition of the speed of light c =1/√ε0µ0; ε and µ are the dielectric permittivity and magnetic permeability of the medium. Taking the curl of (26) and substituting (23), (24), (27) – (29) and similarly taking the curl of (27) and substituting (23), (24), (26), (28) and (29), we obtain
E 1 ∂2 E 1 ∇ 2 − 2 2 = 0; ν = µε H ν ∂t H
(30)
Equation (30) states that E and H satisfy the same wave equation and the fields travel with the same wave speed v.
On Research in Coupled Field Problems
317
In summary, we can say that in linear isotropic materials, mechanical fields obey wave equations starting with Newton’s law, suitable linear constitutive relations and Helmholtz decomposition. Electromagnetic fields obey wave equations starting from Maxwell’s equations which embody the force laws for charges and currents in the presence of electric and magnetic fields, constitutive models and the concept of irrotational and solenoidal fields. The mathematical unity in the description of the three types of fields is very obvious. Mathematically, it would appear that the behavior of elastic waves in solids is a combination of acoustic waves and electromagnetic waves that at material boundaries can convert from one to the other. Of course the underlying physics of the fields and the boundary conditions they obey have a very physical basis, nevertheless it is very illuminating to look at an exposition of the three classical fields and the wave equations they obey using Helmholtz decomposition of the fields. In the introductory chapter of the book, “Diffraction of Elastic Waves and Dynamic Stress Concentrations” (1973), Professors Pao and Mow give a very interesting description of how the history of electromagnetic waves and the speed of light, the quest for a magical ether and the elastic solid theory of light finally culminate in a parting of ways of electromagnetics and elastodynamics. The conjoined beginning is very much due to the mathematical unity of the fields that make it appear that elastic waves are a mathematical combination of acoustic waves and electromagnetic waves. As Pao and Mow point out, once the transverse (purely solenoidal) nature of electromagnetic waves far from their sources was established, the elastic solid theory of light fell by the wayside but was nevertheless instrumental in the birth of elastodynamics in its own right following the seminal work of Navier and Cauchy. 2. Chirality and Its Effect on the Propagation of Electromagnetic and Elastic Waves Professor Pao did not contribute directly to this field, but this author was greatly aided by his mentoring and tutelage on the nature and properties of elastodynamic fields and the conversion of P- to S- waves and vice versa at boundaries. This allowed her to clearly think about the various
318
V. V. Varadan
unsolved electromagnetic (EM) boundary value problems involving chiral media where right circularly polarized (RCP) EM waves are converted to left circularly polarized (LCP) EM waves and vice versa. More importantly, it was very definitely the knowledge of elastodynamics learned from Professor Pao that allowed the author to develop the theory for wave propagation in chiral elastic solids and to experimentally characterize samples of such solids. Chirality is the lack of reflection symmetry in a structure and by extension to a material. The word chiral originates from the Greek Cheir (χερι) for hand. Chiral materials may be isotropic or anisotropic but must contain in their microstructure both right handed and left handed (mirror images) of the structure. Only solenoidal fields can be sensitive to the handedness of the microstructure of a material since they are axial vectors whereas irrotational fields are polar vectors that are invariant under reflection. Chirality was first studied in optics by Louis Pasteur with his experimental studies on light propagation in solutions of dextrose and levose, right and left handed sugar molecules. He observed that such solutions rotated the polarization direction of polarized light in opposite directions, a round trip through such media resulted in no rotation and the racemic mixtures of the two sugar solutions also resulted in no polarization rotation. This was called optical activity but this is an electromagnetic effect that can occur in any part of the electromagnetic spectrum provided we have microstructure of the appropriate electrical size that lack reflection symmetry. Indeed, Tinoco demonstrated in the 1960s that the polarization of low frequency (~600 MHz) microwaves could be rotated by propagating them through very large Cu helices embedded in Styrofoam. In the 80s and 90s, active research was pursued to develop applications for electromagnetic (EM) applications at microwave frequencies by many groups, Varadan et al., Tretyakov et al., Sihvola et al., Jaggard et al., Lakhtakia et al. among others. Many electromagnetic problems including Green’s functions for chiral media, solution of many boundary value and scattering problems for chiral media and very importantly fabrication of chiral materials for microwave applications in the 2-40 GHz frequency range were all achieved during these two decades. Another significant achievement was the direct measurement of the chirality parameter that had never been measured
On Research in Coupled Field Problems
319
directly for a chiral medium. Optical experiments only measured the polarization rotation and the difference in refractive indices between left and right circularly polarized EM waves. In the late 70s and early 80s, with the introduction of the microwave vector network analyzer, it became possible to make accurate measurements of the phase of the reflected and transmitted waves and Varadan et al. completely characterized left handed, right handed and racemic dispersions of miniature copper helices embedded in a polymer matrix using a very innovative free space measurement system that mimicked an optical bench with spectroscopy and polarimetry capabilities. Development of theory and supporting experiments to explain wave propagation in chiral elastic solids, i.e., solids that lack reflection symmetry in their microstructure was an interesting contribution made by the Varadan group. This development again illustrates the wonderful mathematical unity present in all classical continuum field theories.
Electromagnetic Waves in Chiral Materials Chiral materials lack a center of inversion symmetry in their microstructure. This would imply that the response fields D and B in such materials must both be axial vectors. The simple constitutive equation given in (28) relating D to the applied electric field E via the dielectric permittivity is no longer sufficient for linear isotropic media without reflection symmetry. The simplest linear constitutive model that can be proposed for such media is as follows
D = ε E + β ∇ × E, B = µ H + β ∇ × H.
(31)
In (31), β (m-1) is the chirality parameter and this form of the constitutive equation is attributed to Federov (1959 a,b). An alternate form that is also commonly used is due to Post (1962) D = ε P E + iξ H , B = µ P H − iξ E.
(32)
320
V. V. Varadan
The two constitutive models are equivalent if we make the identification ε P = ε ; µ P = µ / (1 − ω 2 εµβ ); ξ = ωεβ ,
(33)
for time harmonic fields. These constitutive equations describe a material that is reciprocal. But the form in (32) is intuitively appealing for understanding optical activity or more correctly electromagnetic activity since it directly involves the curl of a vector field that is not invariant under reflection. In source free media, substitution of (31) or (32) into Maxwell’s equations and performing the curl operation as before does not lead to uncoupled wave equations for E and H in the chiral medium. This indicates that E and H are not eigenvectors for such a medium. Bohren (1974, 1975) devised a transformation that is appealing from a field theoretic viewpoint; and it established the eigenvectors for EM fields in chiral materials and leads to uncoupled wave equations. Application of the integrity basis as proposed by Spencer (1971) would have also resulted in Bohren’s decomposition. The decomposition may be stated as follows E = Q1 + a R Q2 ; H = a L Q1 + Q 2 ;
(34)
a R = −i µ / ε ; a L = −i ε / µ .
It may be seen that aR has the units of impedance and aL that of admittance. It is clear that in an unbounded chiral medium, LCP and RCP fields can exist independently and it follows that E = i µ / ε H for LCP fields; E = −i µ / ε H for RCP fields.
(35)
Substitution of the Bohren’s decomposition (34) into source free Maxwell’s equations and performing a second curl operation on the two curl Maxwell equations for time harmonic fields of frequency ω, results in
On Research in Coupled Field Problems
321
∇ 2 Q1 + γ 12 Q1 = 0; ∇ 2 Q2 + γ 22 Q2 = 0;
γ1 =
(36)
k k ; γ2 = ; k = ω µε . 1 − kβ 1 + kβ
From Maxwell’s equations we may also conclude that ∇ × Q1 = γ 1Q1; ∇ × Q2 = −γ 2Q2 .
(37)
This establishes that Q1 and Q2 are, respectively, independent LCP and RCP fields propagating with different wave speeds cL = c/1–kβ and cR = c/1+kβ. For β >0, cL cR (left handed medium). Unlike non-chiral media, linear combinations of the eigenfields Q1 and Q2 cannot lead to linearly polarized waves since the wave speeds for the two fields are different. It may also be noted that EM waves in chiral media are intrinsically dispersive, there is no static chirality since it must involve the coupling of electric and magnetic fields. Ro et al. (1992) and Varadan et al. (1994) were the first to perform the complete characterization of EM waves in chiral media by studying left handed, right handed and racemic media. Many applications of chirality were realized for electromagnetic shielding and absorption. Recently, it has been shown theoretically that chiral materials can result in negative refraction and near field focusing, and Monzon (2005) and Zhang et al (2009) provided experimental verification. Indeed, it has been shown that even weak chirality can result in a high negative refractive index under certain conditions.
Elastic Waves in Non-Centro Symmetric Solids Voigt (1887) first proposed media where particle interactions included a force and a torque. Cosserat and Cosserat (1909) were the first to study the effects of the lack of inversion symmetry in elastic continua that allow microrotation or spin. Eringen (1976) and Nowacki (1986) formalized the field theory of non-centro symmetric elastic solids and
322
V. V. Varadan
called them micropolar elasticity. Such a solid would allow point couples or microrotations resulting in a stress tensor that is non-symmetric. Portigal and Burstein (1968) first proposed acoustic activity and Varadan et al. (1988) proposed practical geometries that could be fabricated to experimentally realize elastically active solids. The dispersion equation for the simplest possible elastic chiral solid were solved by Lakhtakia et al. (1988) that established the RCP and LCP polarization states for shear waves and Yang et al. (1991), Varadan et al.(1992) showed experimentally that the polarization of propagating shear waves is rotated in opposite directions in left and right handed media. The experimental work for elastic waves in chiral elastic solids is very analogous to the experimental work Varadan et al. (1994) and Ro et al. (1992) for EM waves. We also refer to a review articles by Lakes (1995) on experimental methods for Cosserat media. This derivation given by Lakhtakia et al. (1988) is based on Spencer’s (1971) integrity basis that is determined by the symmetry properties of the material. For non-centro symmetric solids, the displacement u as well as the microrotation vector ω describe the motion of every point in the continuum. The strain tensor can be written as 1 1 S = ∇u = S s + S A = (∇u + u∇ ) + (∇ × u) × I , 2 2
(38)
where SS is the symmetric part of the strain tensor and S A is the antisymmetric part of the strain tensor. The stress tensor for a linear solid may now be written as
T = λ∇ • u + 2µS s + χ∇ω
(39)
The stress tensor T depends on both the symmetric strain as well as the microrotation. The couple stress tensor M for a linear material may be written as 1 M = α I (∇ • ω ) + β (ω∇ ) + γ (∇ω ) + χ [S s + (∇ × u) × I + I × ω ] (40) 2
On Research in Coupled Field Problems
323
This is the minimum constitutive model that can include the lack of inversion symmetry and contains six material constants α , β, γ , χ, λ, µ . The parameter χ is the chirality parameter and couples the microrotation to the stress tensor and the symmetric strain to the couple stress tensor. In source free regions Newton’s force and torque equations may be written as (λ + 2µ )∇(∇ • u) − µ∇ × (∇ × u) + χ [∇(∇ • ω ) −∇× (∇ × ω )] + ρω 2u = 0;
(41)
(α + β + γ )∇(∇ • ω ) − γ ∇ × (∇ × ω ) + χ [2(∇ × ω ) + ∇(∇ • u) −∇ × (∇ × u)] + ρω 2ω = 0.
(42)
These equations are quite complicated when compared to solids that have a center of inversion symmetry. We can obtain the dispersion equations corresponding to the two equations of motion (41) and (42) by assuming plane wave solution for the displacement vector and microrotation vector with unknown vector amplitudes A and B respectively and wavenumber k. Substituting in to (41) and (42), we obtain k 4 (λ + 2 µ )(α + β + γ ) − χ 2 − k 2ω 2 ρ [λ + 2 µ + α + β + γ ] − ρ 2ω 4 = 0
(43) The two solutions of (42) are k12 =
ρω 2 ρω 2 ; k22 = λ + 2µ α + β +γ
(44)
The eigenvectors corresponding to (44) yield 2 2 Az ρω − k (λ + 2 µ ) = , Bz k 2χ
(45)
324
V. V. Varadan
which correspond to longitudinal modes. One of the modes is identical to P-waves in non-chiral solids. The second mode may be called a slow wave since we expect the parameters α , β, γ to be quite small. Another dispersion equation results from substitution of plane wave solutions into (41) and (42) 2
2
k 4 χ 2 − γµ − k 2ω 2 ρ [γ + µ ] − ρ 2ω 4 − 2k χ ( µ k 2 − ρω 2 ) = 0 (46) The quartic equation for k has exact solutions that are quite complicated. But if we make the approximation γ 2 << χµ , we can get the following simple solutions that are physically intuitive and allow us to make connections with P-and S- waves in elastic solids and LCP and RCP waves as for EM waves in chiral materials. The first two solutions are degenerate and have the form k32 =
ρω 2 µ
(47)
The above solution is the ordinary transverse S-wave in elastic solids that is non-dispersive. It can have two polarizations, commonly referred to as the SV- and SH- polarizations. The next two solutions have the form k4 2 =
χ γ { 1
2 3
1/ 2
+ ρω 2γ
}
− χ 3 ; k5 2 =
χ γ { 1
2 3
1/ 2
+ ρω 2γ
+ χ3
}
(48)
It can be seen that these two solutions specifically depend on the chirality parameter χ and are dispersive as are EM waves in chiral materials. The eigenvectors corresponding to the last two solutions, k4 and k5 are as follows Ax Bx = = ±i Ay B y
(49)
On Research in Coupled Field Problems
325
This clearly indicates that these two solutions corresponding to LCP and RCP transverse waves. However, since 6 material constants are involved, a minimum of six independent experiments will be necessary and since the LCP/RCP modes are dispersive, the material constants are complex and more experiments are needed. Varadan et al. (1992) did some interesting experiments with specially configured fiber reinforced laminated composites. Three cubical samples were constructed. The first sample was right handed, from layer to layer, the fiber directions were rotated by 10° in a clockwise manner, so that after 36 layers had been put down, one complete turn of a helix was formed by the fibers. The second sample was made into a left handed one by rotating the fibers in a counterclockwise manner from layer to layer and the third sample was a racemic sample, alternate layers had fibers at ±10°. Varadan and Yang (1991) showed experimentally that shear waves propagating through such samples underwent clockwise and counter clockwise rotations respectively whereas there was no rotation of the polarization in the racemic sample. This is the very first experimental proof of activity in chiral elastic solids. Since it has been shown that chirality can lead to negative refraction for EM waves, it will not be too long before a researcher shows that it is so for elastic waves in chiral solids. That would be a tribute to Professor Pao’s vision of the unity in diversity of all continuum fields. This author hopes to pursue this problem actively in the near future. 3. Coupled Elastic-Electric Field Problems
Dating back to the mid 60s, Professor Pao was extremely interested in theoretical problems involving the coupling of stress and strain fields with electric and magnetic fields. He was involved with some of the very early work involving piezoelectrics (also called deformable dielectrics) and piezomagnetic materials. As is well known, in the linear constitutive models for such media, the stress field depends not only on the strain but also on the applied electric or magnetic fields. Due to the great difference in wave speeds, vibrational frequencies of solids even at so called ultrasonic frequencies are low for electromagnetic coupling and the
326
V. V. Varadan
electric and magnetic fields are considered as quasi-static and irrotational (conservative). This type of elasto-electric coupling occurs in ferroelectric crystals of metal oxides. At and below the Curie temperature, the metal atoms are displaced with respect to the oxygen atoms creating electric dipole moments even in the absence of applied fields. When a field is applied, the dipoles orient in the direction of the electric field and this large net dipole moment in a microscopic volume of the materials results in a measurable strain field. The effect is reciprocal, when the material is stressed, the molecular charges are displaced creating an electric displacement field. Such materials are necessarily anisotropic. Thus the constitutive equations must be modified as follows
T = C : S E − e iE; D = ε iE S + e : S
(50)
where the new material tensor e is a third rank tensor that provides coupling between the stress tensor T and the electric field E and also between the electric displacement field D and the symmetric strain field. Due to the large disparity in elastic and EM wavespeeds, we make the following assumption for the electric in source free regions ∇ • D = 0; ∇ × E = 0 ∇ • D = 0; ∇ × E = 0
(51)
Although, we have assumed that the field is irrotational, we still allow for a time dependent electric field that does not couple to the magnetic field and no displacement current contributes to the magnetic flux density B a indicated in (26) and (27). For time harmonic fields, we introduce the following quasistatic model E = (∇Φ )e − iω t
(52)
As described in Auld (1990), plane wave solutions for the elastic displacement u and Φ can be substituted into (50) and then (51) and (50) for materials of a defined symmetry class. Since the material is anisotropic, it is not possible to solve the resulting equations known as
On Research in Coupled Field Problems
327
the Christoffel equations for general symmetry classes. However, Auld (1990) has derived analytic solutions for hexagonal and cubic crystals and solved a number of boundary value problems. It is assumed in this treatment that the applied stress and electric fields are small enough so that the linear response model is valid. Piezoelectric materials find a number of applications as ultrasonic transducers and transducer arrays for non-destructive evaluation of flaws and medical imaging, Surface Acoustic Waves (SAW) devices and Bulk Acoustic Wave (BAW) resonators for circuits and sensors, ink jet printers, digital cameras, space telescopes and a number of other applications where precise positioning is require and more recently in touch screen pads for handheld communication devices. 4. Electromagnetic and Acoustic Metamaterials Veselago (1968) conjectured that Maxwell’s equations permitted solutions wherein the vectors E, H and k form a left handed orthogonal triad instead of right handed triad. This would imply that the k vector would be opposite in sign to the Poynting vector (EM power flux density) S = E× ×H. hence the phase velocity and power flow will be antiparallel. A negative phase velocity implies that the refractive index n given by k = c/Re(n) is negative and since n = √µε, and this has implications on the sign of the real part of the permittivity and permeability of the material. Although there is a general misunderstanding that both the real part of permittivity and permeability have to be negative, it has been shown by many researchers, that it is sufficient if only one of them is negative (see for example Depine and Lakhtakia, 2004). Such materials are called metamaterials, and are engineered metallic microstructures (<λ/10 in size) that are embedded in a dielectric medium. At the resonance frequency of the geometry (generally when the circumference or other length parameter becomes λ/2, the electrons in the metallic structure undergo collective oscillations at the metal/dielectric interface much like plasmons at optical frequencies. By definition, metamaterials are highly dispersive at the resonance frequency.
328
V. V. Varadan
Pendry (1996, 1998, 1999) is generally credited with proposals to realize materials with negative permittivity using very thin, long metallic wires periodically distributed in a dielectric and sub wavelength size split ring resonators (SRR) to realize materials with negative permeability. Shelby et al. (2001) demonstrated negative refraction experimentally using a combination of SRRs and thin wires. Electrical engineers argue that thin wires and SRRs were used nearly half a century ago to realize materials with negative permittivity and permeability. Indeed Schelkunoff and Friis in their book Antenna Theory and Practice (1952) have a section titled “Methods for increasing the permeability of dielectrics” wherein they propose the use of split ring resonators and prove that the resonance frequency is related to inductance of the ring and the capacitance of the gap and that hence the magnetic susceptibility and permeability can become very high at the resonance frequency. Today, research in metamaterials is not confined just to electromagnetic metamaterials but much progress has been made for acoustic metamaterials also. This is not surprising if we were to use circuit analogies for inductance, capacitance and resistance for mass, springs and dampers. Lee et al. (2009) experimentally proved that one can achieve an effective negative modulus by performing an interesting acoustics experiment in a 1D transmission line (an acoustic duct) that had a periodic distribution of side holes (microstructure) that provided the resonance characteristics needed to exhibit the negative modulus. Unlike an array of Helmholtz resonators, the side holes do not resonate by themselves but only when coupled to the tube. The novel behavior of acoustic waves in the tube is due to the motion of air inside the side tubes. A very simple1D theory has been proposed to show that the effective bulk modulus is negative for frequencies less than the resonance frequency defined by BnS/AM where B is the bulk modulus of air, n is the number of side holes per unit length, S is the area of cross section of the holes, A is the area of cross section of the tube and M is the mass of air in the side hole. Thus we see there is an exact analogy between the LC oscillator model of a split ring resonator and an acoustic tube with a periodic distribution of side holes. This section once again proves the importance of the unified vision that Professor Pao conceived for all continuum fields and although
On Research in Coupled Field Problems
329
Professor Pao has not directly contributed to research in metamaterials, legions of students, postdoctoral fellows and visiting faculty whose thinking he influenced has had a positive effect on the development of acoustic metamaterials. Applications of acoustic metamaterials and electromagnetic metamaterials are advancing rapidly and may be considered as one of the very ‘hot’ areas of research. Postscript This review article has traced a very personal outlook on some of the developments in acoustic, elastodynamic and electromagnetic fields that was very much influenced by a unified approach to all such fields starting with the Helmholtz potential decomposition of fields, waves propagate in these materials and their characteristics are determined by whether they are solenoidal or irrotational and the circuit analogies that exist between electrical and mechanical circuits. As Professor Pao has himself proved very many times, such thinking allows us discover very rich physical phenomena and the very exciting practical applications that follow. This has definitely been the case with this author, as she has freely crossed the lines between acoustics, electromagnetics and elastodynamics many times in her career, each time coming back to one or other of the fields with exciting new ideas gleaned from the other. The frontier is wide open, continuum fields with exciting constitutive models that can be proposed and then realized in practice will always lead to new worlds to explore. Professor Yih Hsing Pao is one of the early pioneers of such explorations. Acknowledgements The author expresses her sincere gratitude to many former graduate students and post-doctoral fellows who have contributed to the research reviewed in this article. This work would not have been possible without their contributions.
330
V. V. Varadan
References 1. Auld, B.A., Acoustic Fields and Waves in Solids, Volumes I and II, Krieger (1990). 2. Bohren, C.F., “Light scattering by an optically active sphere”, Chemical Physics Letters 29, 458 (1974). 3. Bohren, C.F., “Scattering of electromagnetic waves by an optically active spherical shell”, Journal of Chemical Physics 62, 1566 (1975). 4. Brown, K., Reflections on Relativity, Lulu Publishing (2009). 5. Cosserat, E. and F. Cosserat, Theories des Corps Deformables, A. Hermann et Fils (1909). 6. Depine, R.A., LA. Lakhtakia, “A new condition to identify isotropic dielectricmagnetic materials displaying negative phase velocity”, MOTL 41, 315, 2004. 7. Eringen, A.C. and C.B. Kafadar, “Polar Field Theories” in Continuum Physics IV, Ed. A.C. Eringen, Academic press, NY (1976). 8. Federov, F.I., “On the theory of optical activity in crystals. I. The law of conservation of energy and the optical activity tensors”, Optical Spectroscopy 6, 237 (1959a). 9. Federov, F.I., “On the theory of optical activity in crystals. II. Crystals of cubic symmetry and planar classes of central symmetry”, Optical Spectroscopy 6, 49 (1959b). 10. Jaggard, D., A.R. Mickelson and C.T. Pappas, “On electromagnetic waves in chiral media”, Applied Physics 18, 211 (1979). 11. Lakes, R.S. and R.L. Benedict, “Noncentrosymmetry in micropolar elasticity”, International Journal of Engineering Science 20, 1161 (1982). 12. Lakhtakia, A., V.K. Varadan and V.V. Varadan, Time Harmonic Electromagnetic Fields in Chiral Media, Lecture Notes in Physcis 335, Springer-Verlag (1989). 13. Lakhtakia, A., V.V. Varadan and V.K. Varadan, “Equivalent dipole moments of helical arrangements of small, isotropic, point–polarizable scatters: Application to chiral polymer design”, J. Appl. Phys. 63, 5246-5250 (1988). 14. Lee, S.H., C.M. Park, Y.M. Seo, Z.G. Wang and C.K. Kim, “Acoustic metamaterial with negative modulus”, J. Phys.: Condensed Matter 21, 175704 (2009). 15. Li, J. and C.T. Chan, “Double negative acoustic metamaterial”, Phys. Rev. E 70, 055602 (2004). 16. Monzon, C. and D.W. Forester, “Negative Refraction and Focusing of Circularly PolarizedWaves in Optically Active Media”, Physical Review Letters 95, 123904 (2005). 17. Morse, P.M. and H. Feshbach, Methods of Theoretical Physics Part I, McGraw Hill (1953). 18. Nowacki, W., Theory of Asymmetric Elasticity, Pergamon, Oxford (1986). 19. Pao, Y.H. and V. Varatharajulu (V.V. Varadan), “Huygens’ Principle, Radiation Conditions and Integral Formulas for the Scattering of Elastic Waves”, Journal of the Acoustical Society of America 59, l36l-37l (l976).
On Research in Coupled Field Problems
331
20. Pendry, J.B., A.J. Holden, D.J. Robbins and W.J. Stewart, J. Phys.: Condensed Matter 10, 4785 (1998). 21. Pendry, J.B., A.J. Holden, D.J. Robbins and W.J. Stewart, IEEE Trans. Microwave Theory Techniques 47, 2075 (1999). 22. Pendry, J.B., A.J. Holden, W.J. Stewart and I. Youngs, Phys. Rev. Lett. 76, 4773 (1996). 23. Portigal, D.L. and E. Burstein, Physical Review 170, 673 (1968). 24. Post, E.J., Formal Structure of Electromagnetics, Amsterdam: North-Holland (1962). 25. Ro, R., V.V. Varadan, V.K. Varadan, “Electromagnetic Activity and Absorption in Microwave Chiral Composites”, IEE Proceedings Part H 139, 441-448 (1992). 26. Shelby, R.A., D.R. Smith and S. Schultz, Science 292, 77 (2001). 27. Sihvola, A., Electromagnetic Mixing Formulas and Applications, IEE (1999). 28. Spencer, A.J.M., Theory of Invariants in Continuum Physics Volume I, Ed. A.C. Eringen, Academic Press, NY (1971). 29. Tinoco Jr., I. and M.P. Freeman, Journal of Physical Chemistry 61, 1196 (1957). 30. Tretyakov, S.A., IEEE Trans. Antennas & Propagation 44, 1006 (1996). 31. Varadan, V.V. and V.K. Varadan, “Scattering Matrix for Elastic Waves. III. Application to Spheroids”, Journal of the Acoustical Society of America 65, 896-905 (1979). 32. Varadan, V.V., “Scattering Matrix for Elastic Waves II. Application to Elliptic Elastic Cylinders”, Journal of the Acoustical Society of America 63, l0l4-024 (l978). 33. Varadan, V.V., A. Lakhtakia and V.K. Varadan, editors, Field Representations and Introduction to Scattering, Amsterdam: North–Holland, 1991. 34. Varadan, V.V., R. Ro and V.K. Varadan, “Measurement of the Electromagnetic Properties of Chiral Composite Materials in the 8–40 GHz range”, Radio Science 29, 9-22 (1994). 35. Varadan, V.V., S.K. Yang, A. Lakhtakia and V.K. Varadan, “Rotation of elastic shear waves in laminated, structurally chiral composites”, J. Sound & Vibration 159, 403-420 (1992). 36. Varatharajulu, V. (V.V. Varadan) and Y.H. Pao, “Scattering Matrix for Elastic Waves I. Theory”, Journal of the Acoustical Society of America 60, 556-66 (l976). 37. Veselago, V.G., “The electrodynamics of substances with simulataneously negative values ε and µ”, Sov. Phys.—Usp. 10, 509-514 (1968). 38. Voigt, W., “Uber Medien ohne innere Krafte und eine durch sie gelieferte mechanische Deutung der Maxwell-Hertzchen Gleichungen”, Abh. Ges. Wiss. 34, Gottingen (1887). 39. Yang, S.K., V.V. Varadan, A. Lakhtakia and V.K. Varadan, “Reflection and transmission of elastic waves by a structurally chiral arrangement of identical uniaxial layers”, J. Phys. D: Applied Physics 24, 1601 (1991). 40. Zhang, S. et al., “Negative refractive index in chiral metamaterials”, Physical Review Letters 102, 023901 (2009).
This page is intentionally left blank
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
CHAPTER 13 TRANSIENT RESPONSE OF AN ELASTIC HALF SPACE BY A MOVING CONCENTRATED TORQUE
Chau-Shioung Yeh Institute of Applied Mechanics, National Taiwan University Taipei, Taiwan E-mail: [email protected] The propagation of transient waves in an elastic half-space generated by a moving torque is investigated in this paper. The torque is suddenly applied and moves rectilinearly at a constant speed in the surface. The responses for torques moving at superspeed, transpeed and subspeed relative to the distortional wave speed are analyzed in detail. Closed-form solutions for displacements in the half-space are obtained through the application of the extended Cagniard-de Hoop method and are expressed in terms of elementary functions.
1. Introduction Ground motions generated by a torque may be essential for engineering applications and may also be interesting in studying theoretical seismogram. In this investigation, we study the propagation of transient waves in an elastic half-space subjected to a torque moving rectilinearly on the surface with a constant speed. Problems of torsional loads applied on the surface of a half-space have been considered by several investigators. Miller and Pursey [1] have treated the problem of a harmonic torsional loading. Mitra [2] has considered the case of a finite distributed impulsive twisting moment and Eason [3] has also considered this problem but analyzed it more extensively. For studying the theoretical seismograms due to well-defined sources, Pekeris et al. [4] have investigated the ground motion of a layer. The investigations cited above fall into a category of three-dimensional transient axisymmetric problem. 333
Chapter13
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
Chapter13
C.-S. Yeh
334
The problem which we consider here can be classified as a threedimensional non-axisymmetric deformation. For this class of problems in elastodynamics, only few solutions have been obtained. Chao [5] has analyzed the disturbance generated by a tangential surface point load. Papadopoulos [6] considered the case of a buried tangential point load and a couple-force. Payton [7] has derived the transient displacements by applying the elastodynamic reciprocal theorem. Lansing [8] has rederived some of Paytons result by using a Duhamel integral. Afandi and Scott [9] have obtained the closed-form solutions for the surface displacement due to a surface dipole with a ramp time-dependence. These investigations only gave the details of the transient response at a surface. Norwood [10] has analyzed the transient velocities due to a distributed load over a rectangular area of the surface of the half-space. By extending the Cagniard-de Hoop method [11, 12], Gakenheimer and Miklowitz [13] first derived the transient displacements for the interior of a half-space excited by a suddenly applied moving normal load and expressed them by the sum of single integrals and algebraic expressions. In this paper, we first formulate the problem through the elastodynamic theory and obtain the formal solution by applying Laplace and double Fourier transforms. The solutions for the transformed displacements are expressed with a single integral. Then, we employ the extended Cagniardde Hoop method [13, 14] to invert the integral and obtain the closed-form solution for displacements expressed in terms of elementary functions. The meaning of each term is identified in due course. 2. Governing Equations Let Cartesian coordinates x, y, z be such that the homogeneous, isotropic and elastic half-space is represented by z ≥ 0 as shown in Fig. 1. A concentrated torque T is considered to move on the surface along the positive x-axis at a constant speed c. The torque is assumed to be suddenly applied at x = 0 and t = 0. The motion of the half-space without body forces can be described by the wave equations [15] ∇2 φ =
1 ∂ 2φ , c2p ∂t2
∇2 ψi =
1 ∂ 2 ψi , c2s ∂t2
∇ · ψ = 0,
(2.1) i = x, y, z,
(2.2) (2.3)
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
Transient Response of an Elastic Half Space by a Moving Concentrated Torque
T r
Chapter13
335
T
T
x
y z Fig. 1.
A moving torque on an elastic half space.
where ∇2 = ∂ 2 /∂x2 + ∂ 2 /∂y 2 + ∂ 2 /∂z 2 is the Laplacian and cp and cs denote the dilatational and distortional wave speeds, respectively, which are represented by the Lam´e constants λ, µ and the mass density ρ as cp = (λ + 2µ) /ρ, cs = µ/ρ. The Lam´e potentials φ and ψi are related to the displacements ui by ui = ϕ,i + eijk ψk,j ,
(2.4)
where eijk is the alternating tensor. By employing Hooke’s law, the stresses τij can be expressed as τij = λuk,k δij + µ (ui,j + uj,i ) ,
(2.5)
in which δij is Kronecker’s delta. The boundary conditions on the surface are given by τzx (x, y, 0, t) = −T δ(x − ct)δ (y)H(t)/2,
(2.6)
τzy (x, y, 0, t) = T δ (x − ct)δ(y)H(t)/2,
(2.7)
τzz (x, y, 0, t) = 0,
(2.8)
where the prime denotes differentiation with respect to the argument. δ() and H() are the Dirac delta function and the Heaviside step function, respectively. The initial conditions are φ(x, y, z, 0) =
∂ ∂ φ(x, y, z, 0) = ψi (x, y, z, 0) = ψi (x, y, z, 0) = 0, (2.9) ∂t ∂t
and the radiation conditions require that the potentials φ and ψi and their first order space derivatives vanish at infinity.
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
Chapter13
C.-S. Yeh
336
3. Formal Solution In order to find a solution, we first apply the one-sided Laplace transform with respect to time and a double Fourier transform with respect to the space variables x and y. By using the initial conditions (2.9) and the radiation conditions, the Laplace transform solution of Eq. (2.1) can be written as s2 φ¯ = 4π2
∞
−∞
∞
Ae−s(pz−ihx−iky) dhdk,
(3.1)
−∞
and the solutions of Eq. (2.2) with condition (2.3) can be written as s2 ψ¯x = 4π 2 s2 ψ¯y = 4π 2
∞
∞
−∞ ∞
−∞ ∞
−∞ 2 ∞
s ψ¯z = 4π 2
Be−s(mz−ihx−iky) dhdk,
(3.2)
Ce−s(mz−ihx−iky) dhdk,
(3.3)
−∞ ∞
( −∞
−∞
Bih + Cik −s(mz−ihx−iky) dhdk, )e m
(3.4)
where s is the Laplace transform parameter, h and k are Fourier transform parameters, A, B, and C are arbitrary functions of h, k, and s, and p(h, k) = m(h, k) =
h2 + k 2 + u2 , h2 + k 2 + υ 2 ,
1 , cp 1 υ= , cs u=
(3.5) (3.6)
are chosen branches such that Re p ≥ 0 and Re m ≥ 0, respectively. The Laplace transform for the boundary conditions (2.6), (2.7) and (2.8) can be written as τ zx (x, y, 0, s) = −
T s2 8π2
τ zy (x, y, 0, s) = −
Ts 8π2
τ zz (x, y, 0, s) = 0.
∞
∞
−∞ 2 ∞
−∞ ∞
−∞
−∞
ik eis(hx+ky) dhdk, (1 + ich)
(3.7)
ih eis(hx+ky) dhdk, (1 + ich)
(3.8) (3.9)
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
Transient Response of an Elastic Half Space by a Moving Concentrated Torque
Chapter13
337
By employing relations (2.4) and (2.5), the Laplace transform of the stresses τ¯zx , τ¯zy , τ¯zz can be obtained as follows, µs4 τ zx (·) = 4π2
∞
−∞
∞
−∞
− (2ihp)Ae−spz
+ [(2hk)B − (2h2 + υ 2 )C]e−smz eis(hx+ky) dhdk, µs4 ∞ ∞ τ zy (·) = − (2ikp)Ae−spz 4π2 −∞ −∞ + [(2k2 + υ 2 )B − (2hk)C]e−smz eis(hx+ky) dhdk, µs4 ∞ ∞ 2 τ zz (·) = (m + h2 + k2 )Ae−spz 4π2 −∞ −∞ + [(2ikm)B − (2ihm)C]e−smz eis(hx+ky) dhdk,
(3.10)
(3.11)
(3.12)
with (·) = (x, y, z, s). Substituting Eqs. (3.10)–(3.12) into boundary conditions (3.7)–(3.9), we have A = 0, iT h , 2µs2 υ 2 (1 + ich) iT k C= . 2µs2 υ 2 (1 + ich)
B=
(3.13) (3.14) (3.15)
In view of above results, Eqs. (3.1)–(3.4), we should mention that the moving torque can only generate the distortional wave described by ψx , ψy and ψz which contribute to responses of the half-space. Thus, using relation of Eq. (2.4) and Eqs. (3.1)–(3.4), we can obtain the Laplace transformed displacements T ∂ I(x, y, z, s), 8π2 µ ∂y −T ∂ uy = I(x, y, z, s), 8π2 µ ∂x uz = 0,
ux =
(3.16) (3.17) (3.18)
where
∞
∞
I(x, y, z, s) = −∞
−∞
1 e−s(mz−ihx−iky) dhdk. (1 + ich)m
(3.19)
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
Chapter13
C.-S. Yeh
338
4. Inversion of I¯ To find the inversion of Laplace transform for displacements, u ¯x and u¯y represented by Eqs. (3.16)–(3.17), respectively, we consider the integral (3.19). ¯ y, z, s), we adopt a method originally due to In order to determine I(x, Cagniard [11] and later modified by de Hoop [12]. The details of the procedure to find the solution are very similar to those given by Gakenheimer and Miklowitz [13, 14] in their analysis of the responses of an elastic halfspace excited by a suddenly applied moving point load. In essence , the method consists of converting the integral I¯ into the Laplace transform of certain explicit functions of time, and then obtaining the inversion of the transform by inspection. To insure the uniqueness of the solution if it exists, s is considered to be a real and positive number [16]. As first used by de Hoop [12] in his analysis of three-dimensional acoustic waves, we introduce the transformation h = υ(q cos θ − w sin θ), k = υ(q sin θ − w cos θ).
(4.1)
Then I¯ in Eq. (3.19) can be expressed in circular cylindrical coordinates (r, θ, z) as M ∞ I(r, θ, z) = lim K(q, w, θ)e−sυ(m1 z−iqr) dqdw, (4.2) M→∞
−M
−∞
where 1 , x[γ + i(q cos θ − w sin θ)]m1 m1 = q 2 + w2 + 1,
K(q, w, θ) =
2
2
2
r =x +y , cs γ= . c
(4.3) (4.4) (4.5) (4.6)
It should be mentioned that the inner integral of Eq. (4.2) is treated by fixing w in the outer integral. In order to convert the q-integral, the Cagniardde Hoop transformation defined by t = υ(m1 z − iqr)
(4.7)
is taken to map v(m1 z − iqr) into t through a contour integration in a complex q-plane. The singularities of the integrand of I¯ are the branch
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
Chapter13
Transient Response of an Elastic Half Space by a Moving Concentrated Torque
339
Im q branch cut
t=f
t=f q-
qc
*3
+
qs
*2
q
+
q +c
t=tw
tan-1(r/z)
*1
tan-1(r/z)
*
Fig. 2.
Re q
Contour of integration in the q-plane.
points of m1 , and the poles
q = qs± ≡ ±i w2 + 1, q=
qc±
≡ (±w sin θ + iγ)/cos θ.
(4.8) (4.9)
For convergence of the q-integral, we choose the contour Γ + Γ1 + Γ2 + Γ3 in the upper half of the q-plane shown in Fig. 2. In view of making m1 singlevalued and Re m1 > 0, a branch cut along the imaginary axis √from the √ branch point qs+ is introduced. If t ≥ υR w2 + 1 = tw , with R = r2 + z 2 , Eq. (4.7) represents a branch of a hyperbola which is given by 1 q = q± ≡ (±z t2 − t2w + irt), t ≥ tw , (4.10) 2 υR with asymptotes making the angles arg q = ± arctan(r/z ), with the Re q-axis, and the vertex at t = tw , q = qυ ≡ ir w2 + 1/R.
(4.11)
(4.12)
As r/R < 1 for z > 0, it can be shown from Eqs. (4.8) and (4.12) that |qv | < |qs± |, and the hyperbola does not cross the branch cut. As comprehensively discussed by Miklowitz [14], the conditions which determine whether qc will be interior to the contour may define three regions, Region I: x > 0, x/R > γ The pole at q = qc lies inside the contour for w ∈ [0, ∞).
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
Chapter13
C.-S. Yeh
340
Region II: x > 0, x/R < γ The poles at q = qc± lie inside the contour for w ∈ (w0 , ∞), and they lie outside the contour for w ∈ [0, ∞), where w02 = (γ 2 R2 − x2 )z 2 /r2 n2 and n2 = y 2 + z 2 . Region III: x < 0 No poles lie inside the contour for w ∈ [0, ∞). The inversion for each region can be carried out by employing the Cauchy-Goursat theorem and residue theory. By Jordans lemma, it can be shown that the contributions from Γ1 and Γ2 for |q| → ∞ are zero. Superspeed Cases γ < 1 Region I: I(r, θ, z, s) = F (r, θ, z, s) + G(r, θ, z, s), where
∞
∞
F (r, θ, z, s) = −∞
2π G(r, θ, z, s) = c cos θ
K(q + , w, θ)
tw
∞ −∞
(4.13)
dq + dq − −st − K(q − , w, θ) e dtdw, dt dt (4.14)
1 −sυ(mc z−iqc r) e dw, mc
(4.15)
with mc = [m1 ]q=qc .
(4.16)
F¯ (r, θ, z, s) is contributed from the contour integration along Γ2 , and ¯ θ, z, s) is the contribution of the residue at the pole q = qc . InterG(r, changing the order of integration in Eq. (4.14), we have ∞ ∞ F (r, θ, z, s) = D(r, θ, z, t, w)H(t − tw )dw e−st dt, (4.17) 0
−∞
where D(r, θ, z, t, w) =
D1 D2
υR2 (γυR2 − tr cos θ)(D1 + D2 ), c
=
t2
−
t2w [(γυR2
−
tr cos θ)2
(4.18)
1 . + (z cos θ t2 − t2w ∓ υR2 w sin θ)2 ] (4.19)
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
Transient Response of an Elastic Half Space by a Moving Concentrated Torque
Chapter13
341
Im w branch cut Region I -
w
ws+ wc
C2 C3
w+ C1
t=tc -wo t=tL
Region II
Wt
Ws-
Fig. 3.
ws-
wo t=tL
C
Ws+
Re w ws± for c>cs Ws± for c
Contour of integration in the w-plane.
The function H(t − tw ) in the inner integral of Eq. (4.17) restricts the limit of integration as −Ts < w < Ts , Ts = (t/ts )2 − 1, where ts = vR, and w = 0 gives tw = ts ; Eq. (4.17) becomes
∞ Ts F (r, θ, z, s) = D(r, θ, z, s)dw e−st dt. −t
−Ts
Thus, after lengthy manipulations, we have 2πR H(t − υR) F (x, y, z, t) = . c (γυR2 − tx)2 + (t2 − υ 2 R2 )n2
(4.20)
In fact, F (x, y, z, t) is valid for any value of γ and represents a hemispherical wave emanating from the starting point of the applied torque and its front arrives at t = ts . ¯ y, z, t) can also be obtained by the Cagniard-de The inversion of G(x, Hoop method. The particular contour in the w-plane as shown in Fig. 3 is given by solving t = υ(mc z − iqc r),
(4.21)
for w, i.e., w = w± ≡ −iγ sin θ +
γ cos θ (iξy ± αz), n2
t ≥ tc ,
(4.22)
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
C.-S. Yeh
342
where α=
ξ 2 − (υ 2 c2 − 1)n2 ,
ξ = ct − x,
and
1 2 2 (n υ c − 1 + x) c represents the arrival time of a conical wave which is similar to the Mach cone encountered in supersonic aerodynamics. This wave trails behind the torque as shown in Fig. 4(a). Equation (4.22) defines one branch of a hyperbola with vertex at y cos θ w = wv ≡ −iγ sin θ + i 1 − γ 2, (4.23) n and asymptotes argw=arctan(y/z) with the Re w-axis. The branch points arise from mc = 0 and are located at w = ws± ≡ i −γ sin θ ± 1 − γ 2 cos θ . (4.24) tc =
The singularities, branch cuts and the contour of integration are also shown in Fig. 3. Since (x/R) > γ for γ < 1, and (y/n) < 1, w± intersects the Im w-axis below the branch point w = ws+ and above Re w-axis. According to Jordans lemma, the contributions of the integral along C1 and C3 are zero. ¯ θ, z, s) can be written as Thus G(r, ∞ 4π −st G(r, θ, z, s) = (4.25) e dt, α tc which gives 4π H(t − tc ), (4.26) α representing a conical wave as mentioned earlier, which gives response when t ≥ tc . G(x, y, z, t) =
Region II: On the basis of the condition on w for this region, the pole at q = qc coalesces on q ± at t = tL as w → w0 , where tL = R2 /cx is the arrival time of a hemispherical wave with center at x = ct/2. Thus ∞ ∞ dq + dq − −st K(q + , w, θ) − K(q − , w, θ) e dtdw, F (r, θ, z, s) = P dt dt −∞ tw (4.27)
Chapter13
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
Transient Response of an Elastic Half Space by a Moving Concentrated Torque
Chapter13
343
and −w0 ∞ 2π 1 −sv(mc z−iqc r) ( G(r, θ, z, s) = + ) e dw , c cos θ −∞ w0 mc
(4.28)
where P means ‘Cauchy principal value’. Adopting the same procedure as previously applied for region I, we have F (x,y,z,t)=
H(t − vR) 2πR . 2 c (γvR )2 + (t2 − v 2 R2 )n2
(4.29)
¯ is the same as that for The contour integration in the w-plane for G region I, but with the modification in the geometry of the w-plane. The contour starts and ends at w = −w0 and w = w0 , respectively, on the Re w-axis as shown in Fig. 3 marked region II. The singularity ws+ may lie below Re w-axis, but still on Im w-axis, which is not shown in Fig. 3. Since this contour does not intersect the Im w-axis, this has no contribution to the result for this region. Therefore we have G(x, y, z, t) =
4π H(t − tL ). α
(4.30)
Equation (4.30) represents a hemispherical wave with center at (ct/2, 0, 0), giving response when t ≥ tL . Region III: Since no pole exists inside the contour in the q-plane for this region, ¯ to I¯ is nil and I can be expressed by F therefore the contribution of G only. Thus the inversion of I¯ can be written as I(x, y, z, t) = −
4π 2πR H(t − ts ) − H(t − tc )H(t − tL )H(x), c|x|Q α
(4.31)
where Q=
(t − tL )2 + (t2 − t2s )(n/x)2 .
It is noted that as z → 0, the contour in the q-plane and w-plane wrap around the branch point q = qs+ and w = ws+ , however, the contributions of these points to I¯ vanish. Therefore the validity of expression (4.31) can be extended to z = 0. The wave pattern for this case is depicted in Fig. 4(a).
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
Chapter13
C.-S. Yeh
344 x
(a)
t=to
c
t=tc t=tL
I
x r
cs c
II
x
I III
t=tL
T
t=to
t=tc
II z
x R
cs c
y
III
t=ts
x
(b)
t=to
c
t=tL t=ts II
T x
t=to y
III
III
II
t=tL t=ts
z
x-z plane
x-y plane x
t=to
(c)
c
t=tL t=ts II
III
t=to y
T
x
t=tL
III
II
t=ts
z
x-y plane Fig. 4.
x-z plane
Wave patterns for (a) c > cS , (b) c = cS , and (c) c < cS .
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
Transient Response of an Elastic Half Space by a Moving Concentrated Torque
Chapter13
345
Transpeed Case (γ = 1) and Subspeed Case (γ > 1) When the moving speed of the torque c ≤ cs , then (x/R) ≤ (cs /c); so only the regions II and III exist. In these cases, the inversion of I¯ proceeds exactly as described for γ < 1, except that only regions II and III are applicable and the geometry of the w-plane is changed as shown in Fig. 3. The locations of the singularities depend on γ, i.e., w = Wt = −i sin θ,
(4.32)
which locates on the Im w-axis and below the Re w-axis, and w = Ws± = −iγ sin θ ±
γ 2 − 1 cos θ,
f or
γ > 1,
(4.33)
which also locate below Re w-axis. However, the contour of integration in the w-plane for region II remains the same as for any value of γ. Thus no new results arise for region II and the F in Eq. (4.29) and G in Eq (4.30) are still valid for the case γ ≥ 1. The inversion of I¯ for region III is independent of γ and the result is the same as obtained for γ < 1 and the wave patterns for c = cs are depicted in Figs. 4(b) and 4(c), respectively. Finally, Eq. (4.31), the expression for I(x, y, z, t) is valid for any c provided H(t − tc ) = 1
f or
c ≤ cs .
5. Displacements Based on Eqs. (3.16) and (3.17), we have T ∂ I(x, y, z, t), 8π2 µ ∂y T ∂ I(x, y, z, t). uy (x, y, z, t) = − 2 8π µ ∂x
ux (x, y, z, t) =
(5.1) (5.2)
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
C.-S. Yeh
346
The substitution of Eq. (4.31) into Eqs. (5.1) and (5.2) leads to the results as follows: −y T (c2 − c2 )yR3 [ δ(t − ts ) + 2 2 s 3 3 H(t − ts ) 4πµc c|x|Q c cs |x| Q 2 2 2 c − cs yH(x) δ(t − tc )H(t − tL ) − ccs nS 4yH(x) − δ(t − tL )H(t − tc ) cxS 2(c2 − c2s )yH(x) + H(t − tc )H(t − tL )], (5.3) c2 c2s S 3 −T −sgn(x) R3 (t − t0 ) [ δ(t − ts ) + uy (x, y, z, t)= H(t − ts ) 4πµc cQ c|x|3 Q3 2δ(x) 2H(x) H(t − tc )H(t − tL ) − δ(t − tc )H(t − tL ) + S cS 2 2 2(n − x )H(x) δ(t − tL )H(t − tc ) + cx2 S 2(t − t0 )H(x) + H(t − tc )H(t − tL )], (5.4) cS 3
ux (x, y, z, t)=
where S=
(t − t0 )2 − (tc − t0 )2 ,
sgn(x) =
t0 = x/c,
+1, x > 0, −1, x < 0,
and H(t − tc ) = 1,
δ(t − tc ) = 0
f or
c ≤ cs .
As we described previously, the closed-form solutions for the displacements, Eqs. (5.3) and (5.4) are valid for z ≥ 0. Thus we can obtain the displacements at the surface under a moving torque simply by setting z = 0. The numerical results for surface displacements, showing the non-dimensional displacements in the r- and θ-directions versus the dimensionless quantity ct/r for (a) γ = 2.0, (b) γ = 1.0, and (c) γ = 0.5, are presented in Figs. 5.
Chapter13
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
Chapter13
Transient Response of an Elastic Half Space by a Moving Concentrated Torque
347
1
(a)
Ur
J=2.0
PU rr2 / T
0.5
135q
0q, 180q
0
90q
45q
-0.5
-1 0
1
2
1
UT
ct / r
3
4
5
4
5
J=2.0
0q
P U T r2 / T
0.5
45q
90q 0 135q, 180q
0q -0.5
-1 0
1
2
ct / r
3
Fig. 5. The non-dimensional displacements in the r- and θ-directions versus the dimensionless quantity ct/r for (a) γ = 2.0, (b) γ = 1.0, and (c) γ = 0.5.
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
Chapter13
C.-S. Yeh
348
(b)
1
Ur
J=1.0
0.5
PU rr2 / T
0q, 180q
135q
0
45q 90q
-0.5
-1 0
1
2
ct / r
3
4
5
4
5
1
UT
J=1.0
PU T r 2 / T
0.5
90q
135q, 180q
0
45q
-0.5
0q -1 0
1
2
Fig. 5.
ct / r
(Continued )
3
13:47
World Scientific Review Volume - 9in x 6in
Chapter13
Transient Response of an Elastic Half Space by a Moving Concentrated Torque
(c)
1
Ur
PU rr2 / T
0.5
J=0.5
45q
135q
0q, 180q 0
90q 65q
-0.5
-1 0
1
2
ct / r
3
4
5
4
5
2
UT
J=0.5
1.5 1
PUTr2 / T
March 25, 2011
0.5
135q, 180q
90q
0 -0.5
45q
0q
65q
-1 -1.5 -2 0
1
2
Fig. 5.
ct / r
3
(Continued )
349
March 31, 2011
16:23
World Scientific Review Volume - 9in x 6in
Chapter13
C.-S. Yeh
350
6. Stationary Concentrated Torque When the speed of the moving torque equals to zero, the problem considered here reduces to that of the dynamic responses of a half-space under a stationary concentrated torque T varying as a Heaviside step function of time. The solution for displacements of this problem can be obtained by letting c → 0 in Eqs. (5.3) and (5.4), which gives y T y ux (x, y, z, t) = − δ(t − t ) + H(t − t ) , (6.1) s s 4πµ cs R2 R3 T x x δ(t − t ) + H(t − t ) , (6.2) uy (x, y, z, t) = s s 4πµ cs R2 R3 or when expressed in cylinderical coordinates, ur (r, θ, z, t) = 0, uθ (r, θ, z, t) =
Tr 4πµR2
1 1 δ(t − ts ) + H(t − ts ) , cs R
(6.3) (6.4)
which are independent of θ. When t approaches to infinity, Eq. (6.4) reduces to the statical displacement uθ (r, z) =
Tr , 4πµR3
(6.5)
which is identical to the result derived by Chowdhury [17] through a similarity transformation for the analysis of the static problem. 7. Conclusion By employing the extended Cagniard-de Hoop method, this paper investigates the transient response of an elastic half space, excited by an impulsive moving concentrated torque. Three different speeds of the moving torque, namely superspeed, transpeed, and subspeed relative to the distortional wave speed are analyzed and their closed-form solutions for displacements in the half-space are obtained. Acknowledgments The author acknowledges K.C. Chen for helpful discussions, and W.S. Shyu, J.H. Sun, and C.H. Lin for help with the figures. This work was supported by the R.O.C. National Science Council under Grant NSC-97-2221-E-002119.
March 25, 2011
13:47
World Scientific Review Volume - 9in x 6in
Transient Response of an Elastic Half Space by a Moving Concentrated Torque
Chapter13
351
References [1] G. F. Miller and H. Pursey, The field and radiation impedance of mechanical radiators on the free surface of a semi-infinite isotropic solids, Proc. Royal Soc. London A 223, 521–541, (1954). [2] M. Mitra, Disturbance produced in an elastic half-space by an impulsive twisting moment applied to an attached rigid circular disc, Z. A. Math. und Mech. 38, 40–43, (1958). [3] G. Eason, On the torsional impulsive loading of an elastic half-space, Q. J. Mech. and Appl. Math. 17, 279–292, (1964). [4] C. Pekeris, Z. Alterman, and F. Abramovici, Propagation of an SH-torque pulse in a layered solid, Bul. Seism. Soc. Am. 53, 39–57, (1963). [5] C. C. Chao, Dynamic response of an elastic half-space to tangential surface loading, J. Appl. Mech. 27, 559–567, (1960). [6] M. Papadopoulos, The use of singular integrals in a semi-infinite elastic medium, Proc. Royal Soc. London A 276, 204–237, (1963). [7] R. G. Payton, An application of the dynamic Betti-Rayleigh reciprocal theorem to moving point loads in elastic media, Q. Appl. Math. 21, 299–313, (1964). [8] D. L. Lansing, The displacements in an elastic half-space due to a moving concentrated normal load, NASA TECH. Report NASA TR R-238, (1966). [9] O. F. Afandi and R. Scott, Excitation of an elastic half-space by a timedependent dipole– I. The surface displacements due to a surface dipole, Int. J. Solids Struct. 8, 146–1161, (1972). [10] F. R. Norwood, Exact transient response of an elastic half-space loaded over a rectangular region of its surface, J. Appl. Mech. 36(3), 516–522, (1969). [11] L. Cagniard, Reflection and Refraction of Progressive Seismic Waves. Translated by E. A. Flinn and C. H. Dix. (McGraw-Hill., N.Y., 1962). [12] A. T. de Hoop, A modification of Cagniard’s method for solving seismic pulse problem, Appl. Sci. Res., B 8, 349–356, (1959). [13] D. C. Gakenheimer and J. Miklowitz, Transient excitation of an elastic halfspace by a point load travelling on the surface, J. Appl. Mech. 36(3), 505– 515, (1969). [14] J. Miklowitz, The Theory of Elastic Wave and Waveguides. (North-Holland Pub. Co., Amsterdam, 1978). [15] J. D. Achenbach, Waves Propagation in Elastic Solids. (North-Holland Pub. Co., Amsterdam, 1973). [16] H. S. Carslaw and J. C. Jaeger, Operational Methods in Applied Mathematics. (Oxford University Press, 1941). [17] K. L. Chowdhury, Solution of the problem of a concentrated torque on a semi-space by similarity transformations, J. Elasticity 13, 87–90, (1983).
This page is intentionally left blank
CHAPTER 14 MAGNETIC FORCE MODELS FOR MAGNETIZABLE ELASTIC BODIES IN THE MAGNETIC FIELDa
Xiaojing Zheng* and Ke Jin Key Laboratory of Mechanics on Western Disaster and Environment Ministry of Education, PRC, Department of Mechanics and Engineering Science College of Civil Engineering and Mechanics, Lanzhou University Lanzhou, Gansu 730000, China *E-mail: [email protected] This paper reviews the magnetic force models of magnetizable bodies in a magnetic field as developed in recent decades by comparing their premises, modeling processes and formulae as well as deficiencies, and it discusses their physical significances and scopes of application. Then, a special case of magnetoelastic behavior of ferromagnetic plates subjected to a complex magnetic field is considered. The discrepancies in the magnetoelastic behavior of ferromagnetic plates predicted by different models have been analyzed in detail, and the principal reasons for these differences have been investigated. There are many coexisting models describing the magnetic force exerted on a magnetizable body; they may be attributed to the different approximations and assumptions adopted during the modeling process. Since the distribution of magnetic forces obtained by various models is different in distinct models, the predicted magnetoelastic behavior based on these models is distinct, too. Furthermore, there may in some magnetoelastic problems be a qualitative difference between the predictions of the mechanical behavior and experimental phenomena. The conclusion drawn from this paper may provide a credible theoretical basis for investigating the magnetoelastic behavior of magnetizable elastic structures.
a
This work is supported by the National Basic Research Program of China (No.2007CB607506), the NSFC’s program (No.90405005), the Ph. D Fund (No.20050730016). 353
354
X. Zheng and K. Jin
1. Introduction The interaction between applied magnetic fields and magnetizable elastic bodies has become a focus of attention in the field of science and engineering. This is attributed to the widespread application of electromagnetic theory, leading to the formation of magneto-solid mechanics, which is a subject to study the influence of magnetic forces on the deformation, stresses, dynamics and stability.44,45 Electricity and magnetism have always been considered an inseparable part, however, it is difficult to produce strong magnetic fields. Great complexity exists in the interaction between applied magnetic fields and magnetizable elastic bodies, resulting in far fewer research works in electromagnetic structures related to ‘magnetism’ than those related to ‘electricity’. It should be noticed that the interaction between magnetic fields and magnetizable elastic bodies is accomplished by couplings. The molecular magnetic moment in magnetizable elastic bodies will rotate to some extent along the direction of the external applied magnetic field, a process by which the body is magnetized. On the other hand, magnetic force and torque, exerted on a body by the magnetic field, can cause deformation and movement; moreover, the magnetic field around the magnetized body will be changed by the magnetized body acting as an inductive ‘source’, due to the magnetic field inside the body as well as the influence of the deformation and movement of the magnetized elastic body.1,52 Therefore, it is difficult and complex to study problems in magneto-solid mechanics, which is often ignored in the literature.23,26,27,66 In fact, the coupled interaction between a magnetic field and magnetizable elastic body plays a key role in magneto-solid mechanics. Therefore, it is necessary to represent the magnetic force exerted on the magnetizable body, which includes body force, magnetic body couple, and traction (hereinafter referred to as magnetic force collectively). The distribution of the magnetic force exerted on the magnetizable elastic body cannot be directly measured by experiments; it can only be indirectly deduced based on fundamental laws of electromagnetism. However, it has been found that different predictions of mechanical characteristics of magnetoelastic interaction are based on different distributions of the magnetic force. Therefore, modeling the magnetic
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 355
force exactly and (or at least reasonably) is a primary problem in magneto-solid mechanics. The concept of magnetic stress goes back to Faraday,44,45 who is conscious of the magnetic force that is exerted on a magnetized body inside the magnetic field lines. He and imagines that lines of tension act along field lines and pressure forces act normal to field lines. However, his ideas are not expressed in a proper mathematical form.38 Faraday’s experiments and ideas were brought on a solid mathematical foundation by Maxwell in 1873 in his famous treatise.38 In this treatise, he derives the formula for the magnetic force on a magnetized body inside the magnetic field, and proves that the magnetic stress he introduced is a tensor, called ‘Maxwell stress tensor’ in the subsequent literature.23,66 This theory has greatly promoted the study of electromagnetomechanics, and is considered in some problems of geophysics, electric power engineering and microelectronics, where soft magnetic materials are used. Solving of the mechanics problems needs the combined solution of electrodynamics and continuum mechanics. The foundation of such a combined approach was initiated by Maxwell, who introduced the ‘Maxwell stress tensor’.59 In this combined process, electromagnetic energy-momentum tensors,51 and material energy-stress tensor,16,83,84 etc. are derived based on the Maxwell stress tensor to solve many magnetoelastic problems under complex circumstances, including the mechanical behavior of magentoelastic membranes and the propagation of electromagnetic waves,19,57,72 etc. Subsequently, some other typical models are used to represent magnetic forces. In the mid 20th century, the magnetic dipole model,4,7,11,17,24,64,66 was developed to describe the magnetic force in magnetized matter. Owing to the unresolved question whether the additional postulates are necessary, formulations are full of speculations and controversies. The conclusion which is drawn is that ‘the dipole formula should be sufficient, without additional postulates’.5,6 Considering that the dipole formula does not lead to any unique expression for the ‘effective field intensity’ inside a magnetized body, Brown presented a set of magnetic force formulae by definition of the ‘long-rang forces’, and developed a phenomenological theory of magnetoelasticity on the basis of the large deformation theory of elasticity and the classical theory of ferromagnetism in his treatise
356
X. Zheng and K. Jin
(1966).8 It is mainly because the representation of the magnetic force by Brown is convenient to be used, that his model has soon become popular to study the deformation, stability and fracture, etc, of magnetizable structures,9,21,49,50,65,73 some nonlinear magnetoelastic problems,9,21 and some problems in rigid magnetized bodies.3 Tiersten in 1964,78 presented a parallel theory to the theory by Brown for magnetically saturated insulators based on the assumption of two superposed, interacting continua, a ‘lattice continuum’ and an ‘electronic spin continuum’. In his theory, an electromagnetic part is contained in the stress term and his free energy depends, in addition to the deformation gradient, on the magnetization and its spatial gradient. Following this ‘direct’ approach, which uses conservation laws of continuum mechanics, Eringen and Maugin et al,12,14,36 present governing equations for deformable bodies subject to magnetoelastic interaction and derive the electromagnetic body force and electromagnetic stress tensor from Lorentz’ theory of electrons. Their works are ‘motivated primarily by an interest in the effect of electric and magnetic fields on the constitutive or stress-strain relations and do not arise out of a specific engineering application’,45 but even so, this approach is also applied to study some magnetoelastic problems in many articles, such as Chen,10 Dunkin and Eringen,13 Maugin,37 and others. Although the general theory as given by Brown,8 or Tiersten,78 is systematic, it is still rather complicated. Therefore, very few quantitative results are given by it, except for a few cases, such as the investigation of the coupling of elastic waves with spin waves,74,80 In order to maintain a magnetoelastic theory which is convenient to be used, Pao and Yeh in 1973,53 present a general theory of linear magnetoelasticity for soft ferromagnetic materials for which the effects due to the magnetization gradient and hysteresis are neglected. In their theory, the displacement-gradients are assumed small and perturbation methods are used. The magnetic force and field equations adopted are essentially the same as those given in Brown’s treatise. This linearized magnetoelastic theory has been widely applied to coupled magnetoelastic problems such as deformation, fracture and instability of magnetizable structures,2,18,20,29-32,53-62,67-71,95,96,98; it has greatly promoted the development of magnetoelastic mechanics.
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 357
A perfect work, combining quantitative analysis with experimental measurements is magnetoelastic buckling of a ferromagnetic plate in a uniform transverse magnetic field, investigated first by Moon and Pao.46 As they show in the experiments, magnetoelastic buckling of a cantilever plate inserted into a uniform magnetic field will occur when the magnetic field reaches a critical level. Only a magnetic couple, assumed to be proportional to the rotation of deflection of the plate, is retained in the magnetic force system, exerted on the ferromagnetic plate. The critical magnetic field for buckling of a ferromagnetic plate, calculated by Moon and Pao, is higher than the experimental values by up to a factor of 2. Several different explanations have been offered to explain the discrepancy between the theoretical and experimental values.15,28,43,53,101,102 Nevertheless, the Moon-Pao’s model has been widely used in magnetoelastic problems due to its convenience. Examples are: magnetoelastic behavior of a ferromagnetic plate subjected to a transverse or oblique magnetic field by Moon and Pao;47,48 deformation and buckling as well as instability of a beam-plate and cylindrical shell by Miya et al;39-43 post-buckling of a magnetoelastic beam by Popelar,63 etc. Furthermore, in recent years, Moon and Pao’s model has been modified by considering the effect of the inductive current,33,34 and thermal loads.92-94 A number of other related work about magnetoelastic buckling problem are presented in Ref. 22,85-88. According to the models reviewed above, the approaches of studying the magnetoelastic interaction can be broadly classified into three categories: (i) ‘based on the direct method which uses conservation laws of continuum mechanics’,25 i.e. based on rational mechanics and the axiomatic method, e.g. Tiersten,78 Maugin and Eringen,35,36 Pao and Yeh,53; (ii) based on the energy approach through a variational principle, e.g. Tiersten,80-82 Brown,8; (iii) based on empirical formulae which use the approach of mechanics of materials, e.g. Moon and Pao46. Moreover, the adopted magnetic force modeling approaches can also be classified into three categories:52 (i) derived from Maxwell’s model; (ii) Minkowski’s approach which is based on the theory of relativity; (iii) derived from Lorentz’s theory of electrons, according to the postulates and the principles on which the model is based, Pao (1978),52 discussed the validity of each model applied to deformable bodies from the view
358
X. Zheng and K. Jin
point of mathematical rigor and soundness of the assumptions. Hutter et al22 compared five different descriptions of deformable polarizable and magnetizable continue, and showed that they are equivalent to each other and differ only in terms which can be negligible somehow in the nonrelativistic approximation. However, the discrepancies in the expressions of the magnetic force among the models due to the various adopted assumptions have not been analyzed in detail. For magnetizable bodies or structures, apparently different representations for the magnetic force will lead to different mechanical behavior. In the pursuit of a comprehensive, systematic and convenient representation of the magnetic force model and the approach of magnetoelastic interation, the choice whether a model or theory can describe the experimental phenomena is also a key to evaluate the validity of the model or theory. Zhou and Zheng,103-105 derive an expression of the magnetic body force and traction, which is different from those of existing models. They establish the governing equations of magnetoelastic interaction for soft ferromagnetic deformable bodies, based on a variational principle and energy approach. Their methods were soon extended to establish a model of magneto-thermo-elasticity for nonlinearly magnetizable bodies.90,91 They were also applied to magnetoelastic buckling, post-buckling and vibration of magnetizable structures subjected to various magnetic fields.89,97,99-102,106,107 Zhou and Zheng,103 pointed out that many magnetic models fail to characterize the two phenomena of magnetoelastic experiments; however, they did not present the roots causing the different predictions for these magnetoelastic characteristics of magnetizable bodies based on different magnetic force models. In this paper, we classify the existing typical magnetic force models into four categories, according to their representations: (i) Typical physical models, including Faraday’s magnetic stress assumption, Maxwell’s stress tensor, as well as the magnetic dipole model; (ii) Brown’s model and Eringen-Maugin’s model; (iii) Moon-Pao’s model; (iv) Zhou-Zheng’s model. The procedure adopted here is as follows: First, we review the various existing magnetic force models by comparing the premises and modeling approaches as well as their formulae and discuss their physical significance and validity. Then, a special case of magnetoelastic behavior of ferromagnetic plates subjected
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 359
to complex magnetic field is considered. Qualitative and quantitative discrepancies between the predictions based on these theoretical models and two experimental phenomena are analyzed; subsequently, the major reasons for these discrepancies will be discussed. 2. Analysis of Magnetic Forces in Existing Models In the preceding sections, we have reviewed the history of the magnetic force model and some of their applications. With the hope of getting a comprehensive and profound understanding of the magnetic forces exerted on magnetizable bodies placed in an external magnetic field, we shall now review and discuss four types of models to discriminate their definitions and physical significance. Considerable attention will be devoted to the fundamental concepts, such as the magnetic field (induction) B in the model. When a magnetizable body is placed in an external magnetic field, the magnetic field in the body and near the body is different from the external magnetic field due to magnetization and deformation without the body. To illustrate this clearly, notations B+ and B- are used to denote the magnetic field inside and outside the magnetized body, respectively. Thus, the magnetic field intensity H0, H+ and H-, corresponding to the external magnetic field, the magnetic field inside and outside the magnetized body, respectively, can be obtained through the constitutive equations outside and inside the body, e.g. B 0 = µ 0 H 0 (or B − = µ0 H − ) and B+ = µ0 µr H+ , where µr is the relative permeability of the magnetizable body. 2.1. Typical Physics Models 2.1.1. Faraday’s Magnetic Stress Assumption According to Faraday’s assumption, a magnetizable body placed in the external magnetic field may be subject to an electromagnetic stress on its surface, and lines of tension Tm act along field lines and pressure forces Pm act normal to field lines.44 That is Tm =
Bn2 , 2 µ0
Pm =
Bτ2 , 2µ0
(1)
360
X. Zheng and K. Jin
where Bn and Bτ are the normal and tangential components of the magnetic field to the surface of the magnetizable body, µ0 is the vacuum permeability. We deduce from this that the magnetic stress on the surface of the magnetizable body is approximately B2 B2 σn = n − τ n , 2 µ0 2 µ0
(2)
where the unit vector n is normal to the surface of the body. On this basis the surface of discontinuity of the magnetic stress is given by ( Bn+ ) 2 ( Bτ+ )2 ( Bn− )2 ( Bτ− ) 2 n − − − n . 2 µ0 2 µ0 2 µ0 2µ0
[σ n ] =
(3)
Because the normal component of B is continuous across the surface and the jump of the tangential component of H is given by the surface current K, we have,66 n i( B + − B − ) = 0 ,
(4a)
n × (H + − H − ) = K ,
(4b)
Thus, Eq. (3) can be derived as
[σ n ] = −
1 2 µ0
{( B
) − ( Bτ− ) 2 } n = −
+ 2
τ
µ0 χ m ( µ r + 1) 2
( Hτ+ )2 n ,
(5)
where χ m is the magnetic susceptibility. We can conclude that Faraday’s magnetic stress assumption is only an empirical representation of stress exerted on the surface of the body, based on some ideas from a large amount of experiments, without any mathematical basis. Furthermore, Faraday only focused on the stress on the surface of a body, without any details of the stress distribution inside the body. Therefore, Faraday’s magnetic stress can be considered as ‘magnetic tractions’ on the magnetizable body under a given magnetic field. For an unmagnetizable body, the constitutive equation relating B and H can be expressed as B+ = µ0H+; then, we have [σn] = 0, i.e. there is no magnetic traction exerted on an un-magnetizable body in the magnetic field. The
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 361
treatment of Eqs. (1)-(6) is derived for a rigid body. However, this treatment requires no modification for the deformable case, but all the coordinates in the treatment must be the instantaneous coordinates of the body after deformation. 2.1.2. Maxwell Stress Tensor Maxwell derives the force acting on a magnetizable body by the variation of stationary magnetic energy and Ampère’s law. Maxwell writes, when a magnetizable body carrying a current ‘magnetized with an intensity whose components are Mx, My, Mz and placed in a field of magnetic intensity whose components are Hx, Hy, Hz,38 the force on the body may for convenience of calculation be divided into two parts, that is, a force due to the presence of the current and another force due to magnetisation,
f = J × H + µ0 (M ⋅∇)H ,
(6)
where J is the conduction current on the body. For an un-magnetizable body, the force is just dependent on the conduction current, i.e. the first part on the right side of Eq. (6). It should be noted that many authors, including Maxwell himself,17,23,66 did not define which magnetic field H should be inserted in Eq. (6), that corresponding to the magnetic field intensity related to the external magnetic field B0, or that related to the total magnetic field inside the magnetizable body B+ or that related to the magnetic field outside the body B-? We decided that the magnetic field intensity H, meant by Maxwell initially, is the total magnetic field inside the magnetizable body, H+. With the aid of the constitutive relation B+ = µ0 (H + + M) and Ampère’s law ∇ × H + = J , the force can also be written as f = J × B + + µ0 (∇H + )iM .
(7)
It is clear, from Eq. (7), that the force exerted on a magnetizable body carrying a current, magnetized with intensity M and placed in a magnetic field of intensity H+, can also be expressed as being due to two parts: the force on the total current J × B + and the force on the magnetic
362
X. Zheng and K. Jin
moment density µ0 (∇H+ ) ⋅ M .52 With (B + − µ0 H + ) replacing M and ∇ × H + replacing J, we obtain 1 f = ∇i(B+ H + − µ0 H + 2δ) , 2
(8)
where, δ is the unit tensor. Maxwell subsequently proved that the quantity in parentheses is a tensor, which is the famous ‘Maxwell stress tensor’. Hence, employing the divergence theorem, the resultant force on the body can be calculated by integration over a surface surrounding it, 1 1 F = ∫ ∇ i(B + H + − µ0 H + 2δ)dτ = ∫ ni(B + H + − µ0 H + 2δ)dS . 2 2 S Ω
(9)
The treatment above requires no modification for deformable bodies; but all the coordinates in this treatment must be the instantaneous coordinates of the particles after deformation.8 Some works let the surface S expand to the outside of the body;8,23 hence, B and H in Eq. (9) can be replaced by B- and H-, since the surface integration is now carried out in the vacuum instead at the boundary of the body. Thus, with the relation B− = µ0 H − , we have F=
1
µ0
∫ n i( B B −
S
−
1 − B − 2δ)dS . 2
(10)
However, the magnetic induction at a point outside and near the magnetized body is not easy to calculate since the external magnetic field around the body will be influenced by the magnetization. We can also see from Eq. (9) that the magnetic traction at the boundary of the magnetized media is not taken into account, since formula (9) simply transforms the body force to the force over the surface. Moreover, the force due to the magnetic moment density is expressed as µ0 (∇H + ) ⋅ M , which can be considered to be due to the effect of non-uniform distribution of the magnetic field intensity, ignoring the part due to the non-uniform distribution of magnetization. Thus, Maxwell’s model cannot describe the magnetic force comprehensively. 2.1.3. Magnetic Dipole Model According to Ampère’s molecular current view, each magnetic molecule can be considered as a current circuit. Then, the force exerted on these
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 363
‘small’ loops (‘By small we mean that the largest dimension of the loop is small compared to the distance from it to a point of observation, a condition very well fulfilled when we wish to calculate the large-scale effects of atomic magnetism, and one that seems to be valid at all distances for the magnetic effects associated with spin’,66 Smallness of S means here that B is to be taken as constant over the loop.) is mainly the part on the magnetic dipole from Lorentz’s law, thus, a magnetic body force (per unit volume), plus a body couple (per unit volume) may be defined as
f = [∇B] ⋅ M ,
(11)
c = M×B .
(12)
In the derivation of Eq. (11), ∇ ⋅ B = 0 is used. If a linear constitutive relation for a magnetizable body is maintained, i.e. M = χ mB / µ0 µr , in which χ m denotes the susceptibility, c = 0 can be obtained. Since the force in Eq. (11) is motivated from the force exerted on such a molecular current circuit, we can say that the magnetic induction B in the above formulae refers to the true magnetic field inside the body, B+, which is called ‘effective magnetic field’ by some authors.6 Furthermore, the magnetic body force can be considered as the force exerted on the magnetic moment density due to the effect of non-uniform distribution of magnetic induction, which is more comprehensive compared with the force given by Maxwell (Eq. 7). However, the magnetic traction is neither taken into account. Therefore, both the magnetic dipole model and the Maxwell stress tensor are developed for an infinite magnetizable body, and have nothing to do with their boundary shape. 2.2. Brown’s Model and Eringen-Maugin’s Model 2.2.1. Brown’s Model In Brown’s treatise,8 the magnetic force exerted on a magnetized and conduction-current-carrying body subjected to an externally applied magnetic field was first discussed by consideration of the external
364
X. Zheng and K. Jin
magnetic induction B0 instead of the real magnetic induction B at points inside the body. Thus, Eq. (11) can be expressed as
f (r ) = [M(r ) ⋅∇]B 0 ,
(13)
where ∇ × B 0 = 0 has been used. The resultant force on the body can be expressed by integration as
F = ∫ M ⋅∇B 0 dτ .
(14)
Alternative expressions for the resultant force on the body can be derived by use of the vector identities ∇ ⋅ B = 0 (always) and ∇ × B 0 = 0 in the external magnetic field:
F = ∫ n ⋅ MB 0 ds − ∫ (∇ ⋅ M )B 0 dτ ,
(15)
F = − ∫ (n × M ) × B 0 ds + ∫ (∇ × M ) × B 0 dτ .
(16)
The magnetic body couple (per unit volume) due to Eq. (12) is given by
c = µ0M × H 0 = M × B0 .
(17)
Eqs. (15), (16) are called the pole model and Ampère-current model, respectively. The above three representations (Eqs. (14),(15),(16)) of the resultant force are equivalent, whereas the distribution of the magnetic forces are distinctly different. Moreover, the magnetic induction B in these three models are the external magnetic induction B0, in which the magnetic induction contributed by matter in the magnetized body is not taken into consideration. Then, Brown modified the model by evaluating the long-range part of the magnetic force exerted on matter in an arbitrary volume of a magnetized body by all sources outside the volume, including the remainder of the body. In Fig. 1, τ 1 is the volume of the magnetized body, and τ 2 is the arbitrary volume of τ 1 ; S1 and S2 are closed surfaces surrounding τ 1 and τ 2 , respectively. Define
H 0 = H − H12 , B0 = B − B12 ,
(18)
in which H0 and B0 are the external magnetic field intensity and induction, whilst H and B are the corresponding fields at points inside
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 365 S1
S2
τ1 − τ 2 τ2
Fig. 1. Method of calculating the force on a part of a body.8
τ 2 (denoted by H+ and B+); H12 and B12 are the parts of them contributed by the matter in τ 1 , calculated at points inside τ 2 . Thus, F = µ0 ∫ M ⋅∇H 0 dτ = µ0 ∫ M ⋅∇H + dτ + F1 ,
(19)
F1 = − lim {µ0 ∫ n 2 ⋅ m 2 H12 dS 2 + µ0 ∫ (−∇ 2 ⋅ M 2 )H12 dτ 2 } .
(20)
where S1S2 → S
S2
τ2
Here the unit vector n2 is normal to the surface S2. With the aid of the jump condition of H across the surface S, we obtain: 1 (21) F1 = µ0 ∫ nM n2 dS . 2
1 (22) F = µ0 ∫ M ⋅∇H + dτ + µ0 ∫ nM n2 dS , 2 where Mn is the normal component of the magnetization intensity. Formula (22) can be rewritten by setting B + = µ0 (H + + M ) as 1 (23) F = ∫ M ⋅∇B + + M × (∇ × B + ) dτ − µ0 ∫ nM t2 dS , 2 where Mt is the normal component of magnetization intensity. Formulae (22) and (23) are equivalent resultant force expressions considering magnetization of a magnetizable body. Since M × (∇× B+ ) = ∇B+ ⋅ M − (M ⋅∇)B+ , Eq. (23) can be simplified to 1 (24) F = ∫ ∇B + ⋅ Mdτ − µ0 ∫ nM t2 dS . 2 Therefore,
366
X. Zheng and K. Jin
The resultant force expressed in Eqs. (22),(24) can be separated into the representation of magnetic body force (per unit volume) and magnetic traction (per unit area):
f em = µ0 M ⋅∇H + , F em =
1 µ0 M n2n , 2
1 2 and magnetic body couple (per unit volume) is given by
f em = ∇B + ⋅ M , F em = − µ 0 M t2n , c = µ0 M × H + = M × B + .
(25.a,b) (26.a,b)
(27)
The resultant magnetic force calculated by formulae (22) and (24) are equivalent, but the related two sets of magnetic body force and traction (Eqs. (25),(26)) are different, i.e. there are distinct distributions of magnetic forces. The representation of the magnetic body force expressed by the magnetic induction B (Eq. (26.a)) is consistent with that in the magnetic dipole model (Eq. (11)), however, the representation for the magnetic traction is added (Eq. (26.b)). As can be seen from the above derivation, the procedure for obtaining the model by Brown is the following: First, calculate the magnetic force exerted on the magnetized body by the source of the field entirely outside it, without considering the magnetic induction due to magnetization; thus, formulae (15),(16) and (17) are obtained; then modify these resulting expressions by calculating the long-rang part of the magnetic force contributed by the matter in the body; thus, formulae (22),(24)-(26) are obtained. Though the representation of the magnetic body force is the same as for the dipole model, the magnetic traction is simply calculated by carrying the volume integral of the magnetic body force enclosing a region which is separated by a discontinuity surface. Therefore, the magnetic traction calculated in this manner is a representation of the magnetic body force on the boundary surface, not the physically significant magnetic traction. 2.2.2. Eringen-Maugin’s Model In the book by Eringen & Maugin,14 the magnetic force (called ‘ponderomotive force’ by them) and couple arise from the interation
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 367
between the external magnetic field and the magnetizable body, are constructed from Lorentz’ theory of electrons. They deduce the macroscopic effects by applying the classical Lorentzian averaging procedure to an assembly of bound point charges contained within a microelement of volume ∆V . The force acting on a point charge δ qα in the volume microelement is given by
1 c
δ f α = δ qα e ( xα ) + vα × b ( xα )
(28)
where e ( xα ) and b ( xα ) are the microscopic electromagnetic fields evaluated at xα ∈ ∆V α , the velocity of the charge can be resolved as ɺ ɺ vα = v + ξɺα + ξˆ α , v = xɺ , where ξˆ α << ξɺ α is the fluctuation velocity of δ qα , with nonzero average, due to thermal agitations. The mean position of the point charge δ qα can be expressed as xα = x + ξ α , here x is the centroid of the volume microelement, and ξ α is the internal coordinate of δ qα . Introducing the microscopic fields E ( x) ≡ e( x), B ( x) ≡ b( x) , and admitting a Taylor series expansion about x, retaining at most terms of the order of ε 2 , we can obtain the electromagnetic force and couple by some simplifications 1 1 1 ∂ f = q f E + J × B + (∇E) ⋅ P + (∇B) ⋅ M + [ (P × B) vk ],k + ( P × B) , (29) c c c ∂t v C = P × E + M × B + × ( P × B) , c
(30)
where E is the electric field intensity and P is the polarization. Then, f can be expressed by an electromagnetic tensor tkl with an electromagnetic momentum Gk
tkl ,k −
∂Gk = fl ∂t
(31)
where 1 tkl = Pk ε l − Bk M l + Ek El + Bk Bl − ( E 2 + B 2 − 2MiB)δ kl 2
(32)
1 G ≡ E× B . c
(33)
368
X. Zheng and K. Jin
The electromagnetic stress tensor t is the sum of symmetric electromagnetic stress tensor F tkl and another tensor
tkl = F tkl + M tkl ,
(34)
1 t = Ek El + Bk Bl − ( E 2 + B 2 )δ kl . 2
(35)
where F kl
Since Eringen-Maugin’s representation of the electromagnetic force and couple are derived by the definition of the microscopic fields inside the magnetizable body, the magnetic field here is the total magnetic field, calculated at points inside body. Eqs. (29), (30) are the general expressions of ponderomotive force and couple in moving electromagnetic bodies. In these two equations, the differences between electric and magnetic effects in the ponderomotive force and couple are apparent. For a stationary magnetic field B without electric field, the charge distribution and conduction current, the magnetic force and couple can be written as Eqs. (28)-(33) are expressed in Gaussian units, but we shall use MKSA units for convenience)
f = (∇B + ) ⋅ M ,
(36)
C = M × B+ .
(37)
Then, the magnetic stress tensor is tkl = − Bk+ M l +
1
1 1 Bk+ Bl+ − ( B + 2 − 2M iB + )δ kl . µ0 2 µ0
(38)
The magnetic traction can be calculated by the discontinuity of the body force on the surface 1 F em = − µ 0 M t2n . 2
(39)
With these understandings, the magnetic traction derived above is analogous to that by Brown; it is also a representation of the magnetic body force on the boundary surface, but not the physically significant magnetic traction.
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 369
2.3. Moon-Pao’s Model
Based on the Moon-Pao’s assumption,46 the magnetic induction inside a magnetizable body is approximately equal to that outside the body, B + ≈ B 0 , the magnetic body force and couple (per unit volume) exerted on the body can, respectively, be derived from the magnetic dipole model,
f = M ⋅∇B 0 ,
(40)
c = M × B0 .
(41)
In a steady magnetic field, the magnetic body force is zero due to the uniform distribution of the external magnetic field B0. We note that the difference between the magnetic field inside the magnetizable body B+, due to magnetization, and the external magnetic field B0 are ignored, i.e. the magnetization of the magnetizable body is not taken into account in the Moon-Pao’s model. Furthermore, Moon-Pao’s magnetic couple model is only valid for a infinite body since the magnetic traction is omitted. 2.4. Zhou-Zheng’s Model
Zhou and Zheng chose a variational principle to obtain the governing equations for magnetoelastic interactions.103,104 They formulated the energy due to the magnetic fields and mechanical deformations in the system of magnetoelastic interaction associated with the external work of the applied magnetic fields. The total energy including both the magnetic energy and the mechanical strain energy can be expressed as 1 1 Π [u, φ ] = µ 0 µ r (∇φ + ) 2 dV + ∫ µ0 (∇φ − ) 2 dV ∫ 2 Ω+ ( u ) 2 Ω− ( u ) , (42) 1 − + ∫ n ⋅ B 0φ dS + ∫ ε : tdV 2 Ω+ s0 where u represents the displacement, and φ + and φ − are the magnetic scalar potential inside and outside the magnetizable body, respectively. Ω + and Ω − represent the regions of the body and the vacuum outside the body, respectively; therefore, s0 is a closed surface which surrounds the body and is far away from the magnetizable body. Considering the
370
X. Zheng and K. Jin
variations of φ and u independent of each other, we may write according to the arithmetic of variational approach,
δ Π = δφ Π + δ u Π ,
(43)
in which δ φ Π and δ u Π denote the variations of Π caused by δφ and δ u , respectively. Considering an incompressible and uniformly magnetizable body, we have
∫
δΠ=−
∫
µ0 µ r (∇2φ + )δφ + dV −
Ω+ ( u )
µ0 (∇2φ − )δφ − dV
Ω− ( u )
∂φ ∂φ − − , ∂φ µ µ δφ + − + dS ∫ S 0 r ∂n ∂n ∫s µ0 ∂n + n ⋅ B0 δφ dS 0 +
−
∫ Ω+ ( u)
−
(44)
em (∇ ⋅ t + f em ) ⋅ δ udV + ∫ (n ⋅ t − F ) ⋅ δ udS S
in which the magnetic body force and traction can be obtained as
f em = F em = −
µ0 µr χ m 2
∇( H + ) 2 ,
µ0 χ m ( µr + 1) 2
( Hτ+ ) 2 n + .
(45) (46)
The governing equation of both magnetic and mechanical sub-systems for a magnetizable body in an applied magnetic field can be derived from the variational principle δ Π = 0 . Comparing the representations of the magnetic body force (Eq. (45)) and magnetic traction (Eq. (46)) with Faraday’s magnetic stress assumption (Eq. (5)), the magnetic dipole model (Eq. (11)) and Brown’s model (Eq. (26)), respectively, it can be seen that, the representation of the magnetic body force derived by Zhou-Zheng is consistent with the expression given by Brown. However, their magnetic tractions have the same direction, but different values. The difference between them is the factor ( µr + 1) / ( µr − 1) in the expression of the magnetic traction. Although this difference is small for a large-permeability material ( µr ≫ 1 ), it cannot be ignored when the permeability of the material is relatively small. As has already been mentioned, the calculation of the magnetic traction in Brown’s model is only the representation of the magnetic body force at the boundary surface, it cannot fully describe the
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 371
boundary effects of the magnetic force. Since the description of the magnetic traction obtained by Zhou and Zheng is based on the energy method, it is more comprehensive. By comparing the formulae (5) and (46), it is found that the magnetic traction formula in Zhou-Zheng’s model inosculates with the representation of Faraday’s magnetic stress. As far as we know, the expression of Faraday’s magnetic stress is just an assumption extracted from a large number of experiments; therefore, it has physical significance, but no mathematical basis. Zhou-Zheng’s model is established with a rigorous mathematical derivation, which may endow formula (5) with a mathematical garment, the variational principle. Therefore, we can decide that the expression of the magnetic traction (46) is more comprehensive to describe the magnetic traction of a magnetizable elastic body in the magnetic field. 2.4. Discussions
We concluded from the analysis above that different versions of the magnetic force exist, because of the different derivations, assumptions and simplifications, which directly influence the structural deformation of the magnetizable bodies. The expression of Faraday’s magnetic stress, as a first attempt of calculating the magnetic force suffered by magnetizable bodies, is just an assumption with physical significance obtained from a large number of experiments, but little if any mathematical basis. Maxwell was the first to offer a mathematical description of the magnetic force for a magnetizable body in the magnetic field. However, the magnetic force only contains the body force, but no magnetic traction. Apart from this, consideration of internal magnetization for a magnetizable body is sketchy; effects of the nonuniform distribution of magnetization on the magnetic force are ignored. By contrast, the magnetic dipole model, in which the true magnetic induction inside the magnetizable body has been adopted, is more comprehensive and its physical significance is specific. These three typical physical models are available for an infinite magnetizable body, without regard of the boundary shape of the magnetizable body and the boundary effects of the magnetic forces. Brown’s model and EringenMaugin’s model have been deduced from magnetic dipoles and
372
X. Zheng and K. Jin
microscale charges, respectively. However, in the two models, not only the expression of the magnetic body force and that of the magnetic traction are identical, but also the description of the magnetic tractions as representations of the magnetic body force on the boundary surface, which cannot describe the boundary effect of the magnetic force at the discontinuous surface comprehensively. The Moon-Pao’s model is comparatively simple on account of the approximation that the total magnetic field inside the magnetizable body is replaced by the external magnetic field. However, by not considering the magnetization of the magnetizable body and by omitting a description of the magnetic tractions, resulted in a model which is only valid for infinite unmagnetizable bodies. Comparing with other models, both the differences between the external magnetic field and the total magnetic field inside the magnetizable body due to magnetization, and the boundary effect in the magnetic force were accounted in Zhou-Zheng’s model, in which the formulae of the magnetic body force and the magnetic traction were consistent with the magnetic couple model and Faraday’s magnetic stress representation. Furthermore, the formulae of the magnetic body force and magnetic traction derived from the energy method by Zhou and Zheng, have a clear physical significance and a rigorous mathematical basis. Therefore, they can be considered as a comprehensive and reasonable description of the magnetic force for magnetizable bodies in magnetic fields. 3. Applications to Special Cases
It has been shown by Zhou and Zheng,103-105 that many magnetic models fail to characterize the two phenomena of magnetoelastic experiments, (i) magnetoelastic buckling of a ferromagnetic plate in a uniform transverse magnetic field, observed first by Moon and Pao (1968),46 and (ii) the increase of the natural frequency of a ferromagnetic plate with low susceptibility in an in-plane magnetic field, detected by Takagi et al. (1993,1995),75,76 However, these authors did not point out the causes of different predictions of these magnetoelastic characteristics of magnetizable bodies based on different magnetic force models. To prove that the different predictions of magnetoelastic characteristics of
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 373
magnetizable bodies arise from different distributions of the magnetic force in each magnetic force model, we shall apply these models to the cases of ferromagnetic plates in complex magnetic fields. The governing equations for the deflection of thin plates may be written as
D∇ 2∇ 2ω ( x, y ) = qzem ( x, y ) ,
(47)
where D is the flexural rigidity of the plate, ω ( x, y ) denotes the deflection of the plates and qzem ( x, y ) is the equivalent transverse magnetic force, which can be expressed by the body force f em, the body couple c, and the normal components of the traction force F em on the upper and bottom surfaces of the plate,77
qzem ( x, y ) = ∫
h/2 − h /2
f zem ( x, y , z ) dz +
∂ h/2 c y ( x, y , z ) dz ∂x ∫ − h /2
, (48)
∂ h/2 − cx ( x, y , z ) dz + Fzem ( x, y , h / 2) + Fzem ( x , y , − h / 2) ∫ h / 2 − ∂y
where h denotes the thickness of the plate. Substitution of the magnetic force formulae of each model, we have Maxwell’s model: qzem ( x, y ) = +
µ0 χ m 2
µ0 χ m 2
{[ H n+ ( x, y , h 2)]2 − [ H n+ ( x , y , − h 2)]2 } +
2
+
;
(49)
2
{[ H τ ( x , y , h 2)] − [ H τ ( x, y , − h 2)] }
Brown’s (or Eringen-Maugin’s) model: qzem ( x, y ) = +
µ0 µr χ m 2
µ0 χ m 2
{[ H n+ ( x, y , h 2)]2 − [ H n+ ( x, y , − h 2)]2 }
;
(50)
{[ Hτ+ ( x, y, h 2)]2 − [ Hτ+ ( x, y , − h 2)]2 }
Moon-Pao’s model: qzem ( x, y ) = χ m B0 i H + ( x, y, h / 2) − H + ( x, y, −h / 2) ;
(51)
374
X. Zheng and K. Jin
Zhou-Zheng’s model: qzem ( x, y ) = −
µ0 µr χ m 2
µ0 χ m 2
{[ H n+ ( x, y , h 2)]2 − [ H n+ ( x, y , − h 2)]2 } +
2
+
.
(52)
2
{[ H τ ( x, y , h 2)] − [ Hτ ( x, y , − h 2)] }
(1) For a rigid ferromagnetic plate, when it is placed in an uniform transverse or in-plane magnetic field, since H n+ ( x, y , h 2) = H n+ ( x, y , − h 2) , Hτ+ ( x, y, h 2) = Hτ+ ( x, y, − h 2) , we get qzem ( x, y ) = 0 , i.e. the equivalent transverse magnetic force equals zero. When this plate is placed in a non-uniform transverse or an in-plane magnetic field, then, we have H n+ ( x, y, h 2) ≠ H n+ ( x, y, − h 2) and Hτ+ ( x, y, h 2) ≠ Hτ+ ( x, y, − h 2) , consequently, qzem ( x, y ) ≠ 0 . However, the equivalent transverse magnetic force is different in the various models, due to the different factors in the Eqs. (49)-(52). (2) For an elastic ferromagnetic plate, Case 1: when it is placed in a uniform transverse magnetic field, we know H n+ >> Hτ+ , thus, the equivalent transverse magnetic force of different models can be expressed as Maxwell’model: qzem ( x, y ) ≈
µ0 χ m 2
{[ H n+ ( x, y , h 2)]2 − [ H n+ ( x, y, − h 2)]2 } ;
Brown’s (or Eringen-Maugin’s) model: µµχ qzem ( x, y ) ≈ 0 r m {[ H n+ ( x, y , h 2)]2 − [ H n+ ( x, y , − h 2)]2 } ; 2
(53)
(54)
Moon-Pao’s model: q zem ( x, y ) = χ m B0 H z+ ( x, y , h / 2) − H z+ ( x , y , − h / 2) ;
Zhou-Zheng’s model: µµχ qzem ( x, y ) ≈ 0 r m {[ H n+ ( x, y, h 2)]2 − [ H n+ ( x, y, − h 2)]2 } . 2
(55)
(56)
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 375
Note that the equivalent transverse magnetic forces in the middle plane of the plate derived from Brown’s (or Eringen-Maugin’s) model and Zhou-Zheng’s model are nearly the same, but that derived from the Maxwell stress tensor is different by a factor of µr . Not only the magnitudes are different from those given by other models, but also the distribution of the equivalent transverse magnetic forces in the middle plane of the plate, given by Moon-Pao’s model. Case 2: when the plate is placed in a uniform in-plane magnetic field, we know H n+ << Hτ+ , thus, the equivalent transverse magnetic force of different models can be expressed as
Maxwell’model: µ0 χ m
qzem ( x, y ) ≈
{[ Hτ+ ( x, y , h 2)]2 − [ Hτ+ ( x, y , − h 2)]2 ;
2
(57)
Brown’s (or Eringen-Maugin’s) model: qzem ( x, y ) ≈
µ0 χ m 2
{[ Hτ+ ( x, y, h 2)]2 − [ Hτ+ ( x, y , − h 2)]2 } ;
(58)
Moon-Pao’s model: qzem ( x, y ) = χ m B0 H x+ ( x, y, h / 2) − H x+ ( x, y, −h / 2) ;
(59)
Zhou-Zheng’s model: qzem ( x, y ) ≈ −
µ0 χ m 2
{[ Hτ+ ( x, y, h 2)]2 − [ Hτ+ ( x, y, − h 2)]2 } .
(60)
Comparing Eqs. (58) and (60), it can be seen that the equivalent transverse magnetic force derived from Zhou-Zheng’s model and that given by Brown are equal and opposite. The magnetoelastic behavior in an in-plane magnetic field predicted with the aid of the governing equations for the deflection of thin plates, (47) are different in the different formulations, because of the opposite directions of the equivalent transverse magnetic forces in the middle plane of the plate.
376
X. Zheng and K. Jin
The characteristic predictions based on Zhou-Zheng’s model agree well with the experiment, i.e. the natural frequency increases with an increase of the external magnetic field. By contrast, the results based the other models are opposite.104 We discussed the difference in representations of the magnetic force between the Zhou-Zheng’s model and the Brown model in section 2.4. Accordingly, the representation of the magnetic body force derived by Zhou and Zheng is consistent with that given by Brown; moreover, their magnetic tractions have the same direction but different values of a factor (µr +1)/ (µr −1) . It is this factor which leads to the opposite directions of the equivalent transverse magnetic forces in the middle plane of ferromagnetic plates, which directly affects the different predictions of the magnetoelastic behavior. The equivalent transverse magnetic forces derived from Maxwell's and Brown’s models, respectively, are different from that derived by Zhou-Zheng’s model, both in magnitude and direction. 4. Conclusions
In this paper, magnetic force models of magnetizable bodies in a magnetic field were reviewed. They appeared in the literature in recent decades by comparing the premises, modeling processes and formulae as well as deficiencies, and also discussed their physical significances and scopes of applications. It can be concluded that the distributions of the magnetic force described by different models are not the same. This implies great influences on the predictions of the magnetoelastic behavior of magnetizable structures. The discrepancies of the magnetic force models are attributed to inappropriate approximations and modeling approaches. There are quantitative, sometimes even qualitative, differences in the predictions of magnetoelastic behavior, despite apparent little difference in the formulae of the magnetic body forces or magnetic tractions. Comparing with other models, the magnetic body force and traction in Zhou-Zheng’s model have a clear physical significance and a rigorous mathematical basis. Furthermore, the predictions based on Zhou-Zheng’s model agree well with the experimental phenomena of ferromagnetic plates. The conclusion drawn from this paper can provide a credible theoretical basis for investigating
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 377
the magnetoelastic behavior of magnetizable elastic structures. However, the validity of various models in a general thermodynamic process is not taken into account in this paper. These models may be in conflict with thermodynamic relations. Moreover, the disagreements between theoretical predictions and experimental results still exist. Therefore, a thermodynamic setting with a nonlinear constitutive relation of the magnetizable materials may be a breakthrough point in the further study. Acknowledgments
This work was supported by the National Basic Research Program of China (No.2007CB607506), the National Science Foundation of China (No.90405005), the Ph. D Fund of the Ministry of Education of China (No.20050730016). We would like to thank K. Hutter and reviewers for valuable comments and suggestions on the early drafts. References 1. Ambartsumian S. A., Magneto-Elasticity of thin plate and shells, Appl. Mech. Rev. 35, 1 (1982). 2. Bagdasarian G. Y. and Hasanian D. J., Magnetoelastic interaction between a soft ferromagnetic elastic half-plane with a crack and a constant magnetic field, Int. J. Solids Struct. 37, 5371 (2000). 3. Bhattacharya K., Mathematical derivation of the continuum limit of the magnetic force between two parts of a rigid crystalline material, Arch. Rational Mech. Anal. 176, 227 (2005). 4. Birss R. R., Electric and magnetic forces, American Elsevier, New York (1968). 5. Brown W. F., Electric Forces: A direct calculation. I&II, Am. J. Phys. 19, 290 (1950); 19, 333 (1951). 6. Brown W. F., Magnetic energy formulas and their relation to magnetization theory, Rev. Mod. Phys. 25, 131 (1953). 7. Brown W. F., Micromagnetism, Willey, NY (1963). 8. Brown W. F., Magnetoelastic interactions, Springer (1966). 9. Bustamante R., Dorfmann A. and Ogden R. W., On variational formulations in nonlinear magnetoelastostatics, Math. Mech. Solids 13, 725 (2008). 10. Chen X., On magneto-thermo-viscoelastic deformation and fracture, Int. J. NonLinear Mech. 44, 244 (2009). 11. de Groot S. R. and Suttorp L. G., Foundations of electrodynamics, North-Holland Pub. (1972).
378
X. Zheng and K. Jin
12. Dixon R. C. and Eringen A. C., A dynamic theory of polar elastic dielectrics, I, II, Int. J. Engng. Sci. 3, 359, 379 (1965). 13. Dunkin J. W. and Eringen A. C., Wave motion in magnetizable deformable solids, Int. J. Eng. Sci. 1, 461 (1963). 14. Eringen A. C. and Maugin G. A., Electrodynamics of continua, Springer, New York (1989). 15. Eringen A. C., Theory of electromagnetic elastic plates, Int. J. Eng. Sci. 27, 363 (1989). 16. Eshelby J. D., Force on an elastic singularity, Phil. Tran. Roy. Soc. Lond. 244, 87 (1951). 17. Fano R. M., Chu L. J. and Adler R. B., Electromagnetic fields, energy, and forces, John Wiley & Sons, Inc (1960). 18. Gao C. F., Mai Y. W. and Wang B. L., Effects of magnetic fields on cracks in a soft ferromagnetic material, Eng. Fract. Mech. 75, 4863 (2008). 19. Hasanyan D. J. et al., Magneto-thermo-elastokinetics of geometrically nonlinear laminated composite plates. Part 1: foundation of the theory, J. Sound and Vibration 287, 153 (2005). 20. Hasanyan D. J. and Piliposian G. T., Stress–strain state of a piecewise homogeneous ferromagnetic body with an interfacial penny-shaped crack, Int. J. Fract. 133,183 (2005). 21. Hasanyan D. J. and Harutyunyan S., Magnetoelastic interactions in a soft ferromagnetic body with a nonlinear law of magnetization: Some applications, Int. J. Solids Struct. 46, 2172 (2009). 22. Hutter K., van de Ven A. A. F. and Ursescu A., Electromagnetic field matter interactions in thermoelastic solids and viscous fluids, Springer (2006). 23. Jackson J. D., Classical Electrodynamics, John Wiley & Sons, New York (1976). 24. Jeans J. H., The mathematical theory of electricity and magnetism. Cambridge University Press (1925). 25. Kankanala S. V. and Triantafyllidis N., On finitely strained magneto- rheological elastomers, J. Mech. Phys. Solids 52, 2869 (2004). 26. Landau L. D. and Lifshitz E. M., Mechanics and electrodynamics, Pregamon Press (1972). 27. Landau L. D., Lifshiftz E. M. and Pitaevskii L. P., Electrodynamic of continuous Media, Elsevier (1960). 28. Lee J. S., Zheng X. J. and Zhou Y. H., Proceedings of the Twelfth US National Congress of Applied Mechanics, Seattle, Washington, 562 (1994). 29. Liang W., Fang D. N., Shen Y. P. and Soh A. K., Nonlinear magnetoelastic coupling effects in a soft ferromagnetic material with a crack, Int. J. Solids Struct. 39, 3997 (2002). 30. Liang W., Shen Y. P. and Zhao M., Magnetoelastic formulation of soft ferromagnetic elastic problems with collinear cracks: energy density fracture criterion, Theor. Appl. Fract. Mech. 34, 49 (2000).
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 379 31. Lin C. B. and Lin H. M., The magnetoelastic problem of cracks in bonded dissimilar materials, Int. J. Solids Struct. 39, 2807 (2002). 32. Lin C. B. and Yeh C. S., The magnetoelastic problem of a crack in a soft ferromagnetic solid, Int. J. Solids Struct. 39, 1 (2002). 33. Liu M. F. and Chang T. P., Vibration analysis of a magneto-elastic beam with general boundary conditions subjected to axial load and external force, J. Sound Vibration 288, 399 (2005). 34. Lu Q. S., To C. W. S. and Huang K. L., Dynamic stability and bifurcation of an alternating load and magnetic field excited magnetoelastic beam, J. Sound Vibration 181, 873 (1995). 35. Maugin G. A. and Eringen A. C., Deformable magnetically saturated media I. Field equations, J. Math. Phys. 13, 143 (1972). 36. Maugin G. A. and Eringen A. C., On the equations of the electrodynamics of deformable bodies of finite extent, Journal de Mécanìque. 16, 102 (1977). 37. Maugin G. A., A continuum approach to magnonphonon couplings I, Int. J. Eng. Sci. 19, 321 (1981). 38. Maxwell J. C., A treatise on electricity and magnetism, Oxford university press (1881). 39. Miya K., Hara K. and Someya K., Experimental and theoretical study on magnetoelastic buckling of a ferromagnetic cantilevered beam-plate, ASME J. Appl. Mech. 45, 355 (1978). 40. Miya K., Hara K. and Tabata Y., Finite element analysis of experiment on dynamic behavior of cylinder due to electromagnetic force, Nucl. Eng. Des. 59, 401 (1980). 41. Miya K. and Takagi T., Finite element analysis of electromagnetoelastic buckling of a superconducting magnet in a magnetic fusion reactor, ASME J. Press Ves. Tech. 100:I-24 (1978). 42. Miya K., Takagi T. and Ando Y., Finite element analysis of electro-magnetoelastic buckling of a superconducting magnet in a magnetic fusion reactor, ASME J. Appl. Mech. 47, 377 (1980). 43. Miya K., Takagi T. and Ando Y., Finite element analysis of magnetoelastic buckling of a ferromagnetic beam-plate, ASME J. Appl. Mech. 47, 377 (1980). 44. Moon F. C., Magneto-solid mechanics, Wiley, New York (1984). 45. Moon F. C., Problems in magneto-solid mechanics. Mechanics Today, Vol. 4, Ed. Nemat-Nasser, Pergamon Press Inc, New York, 306 (1978). 46. Moon F. C. and Pao Y. H., Magnetoelastic buckling of a thin plate, ASME J. Appl. Mech. 35, 53 (1968). 47. Moon F. C. and Pao Y. H., Vibration and dynamic instability of a beam-plate in a transverse magnetic field, ASME J. Appl. Mech. 36, 141 (1969). 48. Moon F. C., The mechanics of ferroelastic plates in a uniform magnetic field, ASME J. Appl. Mech. 37, 153 (1970). 49. Nobili A. and Tarantino A. M., A hard ferromagnetic and elastic beam-plate sandwich structure, Z. Angew. Math. Phys. 58, 137 (2007).
380
X. Zheng and K. Jin
50. Nobili A. and Tarantino A. M., Magnetostriction of a Hard Ferromagnetic and Elastic Thin-Film Structure, Math. Mech. Solids 13, 95 (2008). 51. Obukhov Y. N. and Hehl F. W., Electromagnetic energy–momentum and forces in matter, Physics Letters A 311, 277 (2003). 52. Pao Y. H., Mechanics Today, Ed. Nemat-Nasser, Pergamon Press Inc, New York, 4, 209 (1978). 53. Pao Y. H. and Yeh C. S., A linear theory for soft ferromagnetic elastic solids, Int. J. Engng. Sci 11, 415 (1973). 54. Pao Y. H. and Hutter K., Electrodynamics for moving elastic solids and viscous fluids, Proc. IEEE 63, 1011 (1975). 55. Podilchuk Yu. N., The stress state of a ferromagnetic with an elliptic crack in a homogeneous magnetic field, Int. Appl. Mech. 37, 213 (2001). 56. Podilchuk Yu. N. and Dashko O. G., Stress–strain state of a ferromagnetic with a paraboloidal inclusion in a homogeneous magnetic field, Int. Appl. Mech. 40, 517 (2004). 57. Podilchuk Yu. N. and Dashko O. G., Stress state of a soft ferromagnetic with an ellipsoidal cavity in a homogeneous magnetic field, Int. Appl. Mech. 41, 283 (2005). 58. Podilchuk Yu. N. and Podilchuk I. Yu., The stress state of a transversely isotropic ferromagnetic with a parabolic crack in a homogeneous magnetic field, Int. Appl. Mech. 39, 935 (2003). 59. Podilchuk Yu. N. and Podilchuk I. Yu., Stress intensity coefficients in the transversally-isotropin ferromagnetic with the elliptical crack under the uniform magnetic field, J. Mech. A Solids. 24, 207 (2005). 60. Podilchuk Yu. N. and Tereshchenko L. N., A magnetoelastic field in a ferromagnetic medium with a spherical cavity, Int. Appl. Mech. 37, 352 (2001). 61. Podilchuk Yu. N. and Tereshchenko L. N., A magnetoelastic field in a ferromagnetic with an elliptic inclusion, Int. Appl. Mech. 38, 585 (2002). 62. Podilchuk Yu. N. and Tereshchenko L. N., Stress concentration in a soft ferromagnetic with an elliptic inclusion in a homogeneous magnetic field, Int. Appl. Mech. 40, 61 (2004). 63. Popelar C. H., Post-buckling analysis of a magnetoelastic beam, J. Appl. Mech. 39, 207–212 (1972). 64. Purcell E. M., Electricity and magnetism. Educational Services Incorporated (1963). 65. Raikher Yu. L. and Stolbov O. V., Magnetodeformational effect in ferrogel samples, J. Mag. Mag. Mater. 258–259, 477 (2003). 66. Scott W. T., The physics of electricity and magnetism, John Wiley & Sons, Inc (1959). 67. Shindo Y., The linear magnetoelastic problem for a soft ferromagnetic elastic solid with a finite crack, ASME J. Appl. Mech. 44, 47 (1977).
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 381 68. Shindo Y., Magnetoelastic interaction of a soft ferromagnetic elastic solid with a penny-shaped crack in a constant axial magnetic field, ASME J. Appl. Mech. 45, 291 (1978). 69. Shindo Y., Singular Stresses in a Soft Ferromagnetic Elastic Solid with Two Coplanar Griffith Crack, Int. J. Solids. Struct. 16, 537 (1980). 70. Shindo Y., The linear magnetoelastic problem of two coplanar Griffith cracks in a soft ferromagnetic elastic strip, ASME J. Appl. Mech. 49, 69 (1982). 71. Shindo Y., Komatsu T., Narita F. and Horiguchi K., Magnetic stress intensity factor for an edge crack in a soft ferromagnetic elastic half-plane under tension, Acta Mech. 182, 183 (2006). 72. Steigmann D. J., Equilibrium theory for magnetic elastomers and magnetoelastic membranes, Int J Non-Linear Mech 39, 1193 (2004). 73. Stolbov O. V. and Raikher Yu. L., Deformation of a ferrovesicle in a uniform magnetic field, J. Mag. Mag. Mater. 300, 199 (2006). 74. Strauss W., Physical Acoustics, vol. 4, part B, Academic Press (1968). 75. Takagi T., Tani J., Matsubara Y. and Mogi I., Pro 2nd Int. Workshop on Electromagnetic forces and related effects on blankets and other structures surrounding the Fission Plasma Torus., Miya, Ed. 81-90 (1993). 76. Takagi T., Tani J., Matsubara Y. and Mogi I., Dynamic behavior of fusion structural components under strong magnetic fields, Fusion Eng. Des. 27, 481 (1995). 77. Timoshenko S. and Krieger S. W., Theory of plates and shells, McGraw-Hill, New York (1959). 78. Tiersten H. F., Coupled magnetomechanical equations for magnetically saturated insulators, J. Math. Phys. 5, 1298 (1964). 79. Tiersten H. F., Variational principle for saturated magnetoelastic insulators, J. Math. Phys. 6, 779 (1965). 80. Tiersten H. F., Surface mechanics, Proceedings of a Symposium of the A.S.M.E., New York (1969). 81. Tiersten H. F., A development of the equations of electromagnetism in material continua, Springer, New York (1990). 82. Tiersten H. F. and Tsai C. F., On the interaction of the electromagnetic field with heat conducting deformable insulators, J. Math. Phys. 13, 361 (1971). 83. Trimarco C., Stresses and momenta in electromagnetic materials, Mechanics Research Communications 29, 485 (2002). 84. Trimarco C., The structure of material forces in electromagnetic materials, Rend. Sem. Mat. Univ. Pol. Torino 58, 2 (2000). 85. Van de Ven A. A. F., Magnetoelastic buckling of thin plates in a uniform transverse magnetic field, J. of Elasticity 8, 297 (1978). 86. Van de Ven A. A. F., Magnetoelastie buckling of magnetically saturated bodies, Acta Mech. 47, 229 (1983).
382
X. Zheng and K. Jin
87. Van de Ven A. A. F., Magnetoelastic buckling of a beam of elliptic cross-section, Acta Mech. 51, 119 (1984). 88. Van de Ven A. A. F., Tani J., Otoma K. and Shindo Y., Magnetoelastic buckling of two nearby ferromagnetic rods in a magnetic field, Acta Mech. 75, 191 (1988). 89. Wang X. Z., Lee J. S. and Zheng X. J., Magneto-Thermo-Elastic instability of ferromagnetic plates in thermal and magnetic fields, Int. J. Solids Struct. 40, 6125 (2003). 90. Wang X. Z. and Zheng X. J., Analyses on nonlinear coupling of magneto-thermoelasticity of ferromagnetic thin shell—I: Generalized variational theoretical modeling, Acta Mech. Solida Sin. 22, 189 (2009). 91. Wang X. Z., Zhou Y. H. and Zheng X. J., A generalized variational model of magnetothermoelasticity for nonlinearly magnetized ferroelastic bodies, Int. J. Eng. Sci. 40, 1957 (2002). 92. Wu G. Y., The analysis of dynamic instability and vibration motions of a pinned beam with transverse magnetic fields and thermal loads, J. Sound Vibration 284, 343 (2005). 93. Wu G. Y., The analysis of dynamic instability on the large amplitude vibrations of a beam with transverse magnetic fields and thermal loads, J. Sound Vibration 302, 167 (2007). 94. Wu G. Y., The analysis of dynamic instability of a bimaterial beam with alternating magnetic fields and thermal loads, J. Sound Vibration 327, 197 (2009). 95. Yeh C. S., Magnetic fields generated by a mechanical singularity in a magnetized elastic half plane, ASME J. Appl. Mech. 56, 89 (1989). 96. Yeh C. S. and Ren C. W., The magnetoelastic problem for a soft ferromagnetic elastic half-plane with a crack and a constant magnetic induction, J. Mech. 25, 95 (2009). 97. Zheng X. J., Zhou Y. H., Wang X. Z. and Lee J. S., Instability of superconducting partial torus with two pin supports, J. Eng. Mech. 125, 180 (1999). 98. Zhao S. X. and Lee K. Y., Interfacial crack problem in layered soft ferromagnetic materials in uniform magnetic field, Mech. Res, Commun. 34, 19 (2007). 99. Zhou Y. H. and Miya K., A theoretical prediction of natural frequency of a ferromagnetic beam-plate with low susceptibility in in-plane magnetic field, ASME J. Appl. Mech. 65, 121 (1998). 100. Zhou Y. H., Gao Y. W., Zheng X. J. and Jiang Q., Buckling and post-buckling of a ferromagnetic beam-plate induced by magnetoelastic interactions, Int. J. NonLinear Mech. 35, 1059 (2000). 101. Zhou Y. H., Zheng X. J. and Miya K., Magnetoelastic bending and snapping of ferromagnetic plates in oblique magnetic fields, Fusion Eng. Des. 30, 325 (1995). 102. Zhou Y. H. and Zheng X. J., A theoretical model of magnetoelastic buckling for soft ferromagnetic thin plate, Acta Mech. Sin. 12, 213 (1996). 103. Zhou Y. H. and Zheng X. J., A general expression of magnetic force for soft ferromagnetic plates in complex magnetic fields, Int. J. Engng Sci. 35, 1405 (1997).
Magnetic Force Models for Magnetizable Elastic Bodies in the Magnetic Field 383 104. Zhou Y. H. and Zheng X. J., A variational principle on magnetoelastic interaction of ferromagnetic thin plates, Acta Mech. Solida Sin. 10, 1 (1997). 105. Zhou Y. H. and Zheng X. J., A generalized variational principle and theoretical model for magnetoelastic interaction of ferromagnetic bodies, Science in China Series A 42, 618 (1999). 106. Zhou Y. H., Wang X. Z. and Zheng X. J., Magnetoelastic bending and stability of soft ferromagnetic rectangular plates, Appl. Math. and Mech. 19, 669 (1998). 107. Zhou Y. H., Zheng X. J. and Harik I. E., Free vibration analysis of compressed clamped circular plates, ASCE J. Eng. Mech. 121, 1372 (1995).
This page is intentionally left blank
CHAPTER 15 PRINCIPLES OF NONLINEAR VIBRO-WIND ENERGY CONVERSION
Francis C. Moon Joseph C. Ford Professor of Mechanical Engineering Cornell University, Ithaca New York, USA Member US National Academy of Engineering E-mail: [email protected] In this paper, the principles and feasibility of ‘vibro-wind power’ are explored. Vibro- wind denotes the harvesting of energy from the wind as it flows around vibrating structures as an alternative to conventional rotary wind turbines. The basic science involves energy extraction from bodies induced to vibrate due to the nonlinear action of fluid flow and vortices around flexible structures. This research combines research in fluid dynamics, fluid-structure dynamics, nonlinear dynamics and electromechanical energy harvesting to create a new paradigm in energy research as well as to create a potential new energy technology. Two key topics will be addressed: (1) how to take out energy from flow-induced vibrations without quenching the nonlinear fluid-elastic instability, and (2) how to maximize energy conversion from the vibration of a large array of oscillators into electrical energy. A target application is for architectural facades similar to, and as a complement to, solar energy panels.
Dedication This paper was presented at ISEM 2010 Taiwan as a tribute to Professor Yih Hsing Pao who was my research advisor at Cornell University from 1962 to 1966. Professor Pao has since been my friend and colleague for over forty years. As one of his first PhD students I am indebted to his tutelage and wise council not only during my student years but during 385
386
F. C. Moon
my career as a professor. Professor Pao taught me the value of careful analysis from basic principles as well as the need for experiments to verify the theory. He stressed the importance of prior research and the history of science which has served me later in my career in my study of the history of machines. At the same time Professor Pao was always looking for new problems in mechanics based on interdisciplinary science such as magneto-mechanics. It is with much gratitude that I celebrate his 80th birthday. 1. Introduction ‘Vibro-wind power’ is the harvesting of energy from the wind as it flows around commercial and residential buildings through the mechanism of vibrating structures. The basic science involves energy extraction from bodies induced to vibrate by the action of fluid flow and vortices around flexible structures. Our approach will be to consider the effects of wind on single or multiple interacting flexible structures, such as hundreds of small cantilevers mounted to a surface. In our application the wind excites dozens or thousands of small vibrating elements on panels attached to the structure (Figure 1) converting the kinetic energy into electrical energy that can be used in the operation of the building. There are two crucial steps in this process; one is the conversion of wind energy into vibration and the second is the conversion of mechanical vibratory kinetic energy into electrical energy. Our estimates of power output are comparable to solar panels and may compliment solar panel systems during the night time or serve as an alternative to solar panels for building applications, especially in urban areas. Building integrated power generation (BIPG) is an active area of architectural design. Its goal is to provide energy without a significant ecological footprint. In our application of BIPG, the wind might excite dozens or thousands of small vibrating elements on panels attached to the building converting the kinetic energy into electrical energy that can be used in the operation of the building. Vibro-wind façade technology has an advantage over turbines on buildings because it avoids rotating dynamic loads on the structure as well as noise problems.
Principles of Nonlinear Vibro-Wind Energy Conversion
387
Fig. 1. Design Concept for Multi-Oscillator Vibro-Wind Panel.
2. Cornell University Collective Research in Vibro-Wind Energy There are six faculty working collectively on vibro-wind energy at Cornell University under a grant from the Cornell Center for a Sustainable Future (CCSF); the Author, F.C. Moon, Professors E. Garcia, H. Lipson, C. Williamson, W. Sachse, of Mechanical and Aerospace Engineering, and Professor K. Pratt of Architecture. Garcia and Lipson are exploring alternative concepts of vibro-wind fluid structure dynamics, Williamson is investigating the fluid-structure physics, Sachse is exploring vibration-electrical energy conversion and Pratt is designing building facades for vibro-wind technology. Most of the research is conducted by undergraduate and masters level students. Experiments have been conducted in small and medium size wind tunnels as well as atop one of Cornell’s buildings using a prototype vibro-wind panel of multi-oscillators. 3. Physics of Vibro-Wind Power Generation Wind Power in an unimpeded flow. The flow of wind power P [Watts/meter2] past an area A normal to the flow velocity V
388
F. C. Moon
is proportional to the air density and given by (see e.g. Leithead, 2007)
P = ρV 3A / 2 With the density of air of 1.2 kg/m3, the power density of wind at V = 10 m/s, is 0.6 kW/m2. This density is the same for either rotary wind turbine systems or vibro-wind systems. One cannot hope to capture all of this energy. However it might be possible to convert 30% of this power into structural vibration energy with a density of P = 180 Watts/m2 [V = 10 m/s]. If one were to scavenge 30% of the structural vibration into electrical energy our figure of merit would be P = 54 Watts/m2. Commercial solar photo-voltaic panels have an area power density of around 60-110 Watts/m2. So the power output of vibro-wind panels on buildings might be comparable to solar photo-voltaic technology. Although vibro-wind panels may be on the low end of solar panel power levels, the integrated energy may be comparable to or greater than solar since wind is available for 24 hours on a daily basis. There are several modes of Vibro-Wind excitation 1. 2. 3. 4. 5.
Galloping Vibrations [Parkinson & Smith 1964, Den Hartog 1932]; Vortex Resonance [Karman Vortex Shedding, Blevins 1978]; Bi-modal Flutter Instability; Wind transient vibrations; Membrane wave-like vibrations; ‘flag flutter’.
For low velocities the fluid will move around the obstacle in a steady pattern, however for larger velocities or Reynold’s number, Re, the flow becomes unsteady and alternating vortex patterns move behind the obstacle that in turn generates non-steady pressure forces. If the obstacle is constrained by a flexible structure vibratory motions will occur from which we can then generate electric energy. If the vortex shedding frequency (given by the Strouhal Number) is close to the natural frequency of the vibratory structure, the mechanism is like a linear resonance excitation. However, there are effective negative damping dynamics of wind interacting with blunt bodies called galloping that do not depend on vortex resonance.
Principles of Nonlinear Vibro-Wind Energy Conversion
389
In the vortex shedding model there are two non-dimensional parameters; the Reynolds number, proportional to the velocity, and the Strouhal number, S = fD/U, that characterizes the vortex frequency. Characteristics of fluid-structure dynamics have been summarized in the book by Blevins (1977) Fluid-Structure Vibration. For obstacles on the order of 50 mm, and velocities on the order of 10 m/s, the Reynold’s number is around 30,000. In this regime, the alternating vortex flow behind a cylinder-type obstacle is well established with a given frequency. In this regime one can show that; for 102 < Re < 105, S = 0.15 where S = fsD/U, where fs is the vortex shedding frequency in cycles per second. For an obstacle or flat plate of width D = 50 mm, and U = 5 m/s, fs = 15 Hz. If the shedding frequency is in resonance with the oscillator frequency, a vibration amplitude on the order of 0.2 D is possible due to vortex shedding forces, (Blevins 1977). For structural natural frequencies below the vortex shedding frequency, there is another self-excitation mechanism for structural oscillations. 4. Literature Review This project uses results from several research areas in fluid mechanics, fluid-structure dynamics, and vibration energy harvesting. Each of these areas has a large literature and we can only mention a few of the relevant papers and books. Already mentioned above is the bluff body galloping and vortex resonance excitation (see Blevins, 1977). The galloping problem dates back at least to the 1930’s and the work of Den Hartog (1932). A detailed theoretical quasi-steady aerodynamic model with experiments was published by Parkinson and Smith (1964) and reviewed later by Parkinson (1989). An extension of the problem was undertaken by van Oudheusden (1995) to include rotation and translation. Bluntbody vortex dynamics has been reviewed by Williamson at Cornell, (Khalak and Williamson, 1999). Williamson (1996) has also presented a detailed description of vortex dynamics behind blunt bodies. The use of vibration to harvest energy from fluid or wind energy also has a decades old history as reviewed in the preprint paper by Tang, Paidoussis and DeLaurier from McGill University. [Available online in PDF form.] For example DeLaurier in 1978 proposed a “Wingmill”
390
F. C. Moon
using wing flutter. This was published by McKinney and DeLaurier in 1981 in Journal of Energy. Another related paper in the same journal is by Ly and Chasteau (1981). More recently Allen and Smits (2001) have proposed a “vibrating eel” behind a rigid blunt body in a fluid. A US patent for a similar “Energy harvesting eel” has been issued (US Pat No. 6424079). Another scheme is a vortex induced vibration of a blunt cylinder in fluid flow by M. Bernitsas called VIVACE or University of Michigan aimed at water flow. Another idea is a so-called “Windbelt” described on the web as a single membrane that undergoes flutter and generates energy. A similar device with more documentation is the work at McGill University called “Flutter Mill” by Tang, Paidoussis and Delaurier (2008). None of these uses an array of blunt-body oscillators that is posited in this proposal. The dynamics of flow-induced vibration of arrays of cylinders in crow flow and axial flow has a long history related to the design of heat exchangers. The work of Paidoussis of McGill as the editor of the Journal of Fluids and Structures has been a mainstay in this area. The Principal Investigator has looked at nonlinear dynamics of tube arrays in several papers and we hope to make use of some of these results in the Vibro-wind concept. [See e.g. Thothadri and Moon, 1998, 1999, Kuroda and Moon, 2001, 2007.] Priya et al. (2005) has recently proposed a ‘piezoelectric windmill’ using kinematic mechanisms. Recently Li and Lipson (2009) of Cornell have designed a flapping-leaf tree type array of oscillators for Vibro-Wind applications. Recently, a theoretical study of galloping and energy harvesting has been published by Barrero-Gil et al (2010). Finally, the subject of vibration energy harvesting has a large literature of hundreds of papers, reviews and books. A noteworthy review is by Sodano, Inman and Park (2004) and the book by Roundy, Wright and Rabaey (2004). Four different physics principles are usually discussed in energy harvesting, electromagnetic, ceramic piezoelectric, piezo-polymer materials, and electrostatic methods. Most reviews rate ceramic piezoelectric materials as having the highest transduction efficiency. Some rank electromagnetic has having the next best efficiency while others cite electrostatic methods. There are also studies
Principles of Nonlinear Vibro-Wind Energy Conversion
391
optimizing the electronic coupling to a power load and energy storage and several impedance matching schemes show energy transfer from the vibrating structure upwards of 30% or higher. We will also try to design the placement of the vibro-wind panels on the building to take advantage of the special aerodynamics of building structures in natural wind flows. Energy in an Array of Structural Oscillators As an estimate of the available kinetic energy we imagine N oscillators on a square meter panel. For an array of dozens or hundreds of structural oscillators with blunt bodies, each oscillator is contributing to the energy transfer from wind to structural kinetic energy. We assume a vibrating mass ‘m’ for one elastic structure of frequency ‘f’. For a sinusoidal oscillation of amplitude ∆, the kinetic energy of the oscillator is given by T = 2 π 2 mf 2 ∆ 2 cos2 (2 πft ). If this energy were absorbed in one cycle, a figure of merit of oscillator power availability would be Pelastic = 2 π 2 m∆ 2 f 3 . If there were N oscillators per square meter, then the power available would be NPelastic. For a mass of 18 grams per oscillator, and 100 oscillators per square meter, (one oscillator in a 10 cm x 10 cm area) each with a natural frequency of 30 Hz and ∆ = 1 cm, the power density would be 100 Watts/m2. From the fluid-structure dynamics, these vibration parameters are possible in a flow of around 10 m/s. This power density of 100 W/m2 is on the same order as that of commercial solar panels.
Key Research Issues in Vibro-Wind Energy Harvesting are: 1. Exploration of ways to optimize the geometry of the vibrating elements and the fluid flow around them in order to achieve maximum mechanical motion. 2. Evaluation of transduction principles such as piezoelectric, electromagnetic, or others to optimize the electrical power generation from the vibrating array configuration.
392
F. C. Moon
3. Optimization of the electrical power output of the large array of wind energy scavengers and evaluating the economic costs of the system compared to solar systems. 4. Investigation of how a large number of vibro-wind elements or panel arrays should be optimally integrated into a building energy system to obtain a practical power source. 5. Explore whether large-scale wind energy systems from vibrating structures can be utilized in design of buildings.
Fluid Mechanics Research in Vibro-Wind Systems One of the key issues is the flow around many vibrating bodies with a goal to determine whether “constructive” interference might enhance resonant fluid-structure dynamics leading to increased energy extraction. The author has published earlier work on dynamics of arrays of fluid oscillators that has been shown to exhibit nonlinear collective wave-like modes. (Thothodri and Moon 1998, 1999, Kuroda and Moon, 2002, 2007) Another key fluids issue is the discovery of the optimum blunt body shape that will transfer steady and non-steady wind energy into vibrations with minimum hysteresis and at low wind speeds. Nonlinear Models for Vibro-Wind Systems: The single degree of freedom model was first posited by Den Hartog around 1932 and later developed by Parkinson and Smith (1964) and van Oudheusden (1995) using a nonlinear lift force model with a seventh order lift force model.
ɺɺ + ryɺ + ky = C y ρ V 2 hL / 2 my 3
5
yɺ yɺ yɺ yɺ Cy = A − B + C − D U U U U
7
This model leads to a self-sustained limit cycle oscillation of the subcritical Hopf bifurcation type illustrated in the second graph below. This model has hysteresis in the amplitude-flow velocity curve for some flow conditions.
Principles of Nonlinear Vibro-Wind Energy Conversion
393
Fig. 2. Fluid-Structure Vibration Amplitude as a function of flow velocity: Top – super critical Hopf bifurcation, Bottom – Subcritical Hopf bifurcation.
Another model advanced by van Oudheusden (1995) incorporates rotation and translation in a single degree of freedom model (Figure 3) that leads to a coupling of angle of attack, translation velocity and the rotation angle. This model has a similar equation of motion to the Parkinson model above. This model also assumes a so-called quasisteady aerodynamic theory that allows one to measure the lift coefficient as a function of the angle of attack. This empirical approach has been effective in predicting the amplitude of motion of galloping vibrations for a single oscillator. Induced Angle of attack: In the standard Den Hartog galloping model, the vertical translational velocity induces a virtual angle of attack. However when the blunt body is attached to a flexible structure, one can have combined effects of displacement rotation and velocity, (see Figure 3). Oudheusden’s linearized equation of motion has a dependence on both the rotation angle as well as the angular velocity as shown below. In the energy harvesting model, the electrical energy generation is similar to the passive damping term on the right hand side of the equation of motion. One of our goals in this research to be able to extract the maximum energy (damping) without quenching the galloping limit cycle motion.
394
F. C. Moon
Fig. 3. Blunt Body on a cantilevered beam showing translation induced angle of attack as well as velocity induced angle of attack.
θɺɺ + θ = −2ζθɺ + a
µU U θ − θɺ ) + [ Electrical Coupling ] ( 2ω0 R ω0 R
Multiple Oscillator Model [Thothadri and Moon, 1998, 1999] In the vibro-wind concept, there will be many blunt-body oscillators in close proximity so that we can sum up hundreds of energy scavengers in a single power panel. For a simulation model, we are studying a model used by the Author in earlier research in which a row of coupled bodies is exposed to cross flow. (Thothadri and Moon, 1998, 1999) In our earlier work we developed a nonlinear system identification model for coupled oscillators in cross flow that captured the limit cycle bifurcation behavior. For blunt bodies there will be not only a nonlinear lift term, but also a nearest neighbor interaction between bodies. Such coupling has been shown to lead to a wave synchronized motion of the entire row in the cross flow. This synchronization may be useful in extracting energy from a large array of oscillators.
5. Preliminary Experimental Results As exploratory research, we have built 2 oscillator, four [2x2] oscillator arrays (Figure 4) of blunt-body oscillators in a 25 cm x 25 cm wind
Principles of Nonlinear Vibro-Wind Energy Conversion
395
tunnel in flow speeds in the range of 4–10 m/s. We have also built a 25 [5x5] oscillator array for study in a larger wind tunnel as well as for outdoor experiments. One of the blunt-body shapes chosen is the square cylinder 2 cm x 2 cm with 6 cm length attached to steel cantilevered ‘feeler gages’. We have attached Piezo Systems piezo benders to each of the beams and connected each oscillator to a full rectifier bridge and attempted to scavenge energy into a capacitor. These preliminary experiments have been successful and have encouraged us to proceed with larger arrays. The galloping mode oscillator is a nonlinear limit cycle instability (Moon, 1992) that often exhibits a hysteretic ‘Amplitude-Wind Speed’ behavior as shown in Figure 5. However we were able to optimize the shape of the blunt body and the structural design of the composite piezo-beam to effectively eliminate the hysteresis as shown in Figure 6.
Fig. 4. Four oscillator array of blunt bodies on piezo cantilever beams with rectifier circuit board in the background. Wind direction is from lower right to upper left.
396
F. C. Moon
Fig. 5. Displacement vs wind speed. Experimental data for vibration amplitude shows hysteresis typical of a sub-critical Hopf bifurcation.
Fig. 6. Vibration Amplitude versus wind speed for a blunt body with trapezoidal shape cross-section showing minimal hysteresis and low wind speed instability.
Energy Converters for Vibro-Wind Systems Possible candidates for vibration to electrical converters are electromechanical, piezoelectric, piezopolymer and electrostatic phenomena, (see e.g. Wu et al. 2009). Electromagnetic devices depend on coils and magnets while piezo devices are material property based.
Principles of Nonlinear Vibro-Wind Energy Conversion
397
One key thrust of this project is the collection of electrical power from an array of mechanical oscillators. Current research on vibration energy scavenging has focused on the single oscillator problem. The Cornell research differs from those efforts in that we wish to extract energy from an array of oscillators using the collective vibration behavior and any wave-like motions that may exist in multi-array vibratory systems. In our experiments on a 25 oscillator array each oscillator is coupled to a full wave rectifier and the charge is pumped into a capacitor energy storage system, which has been used to light up a LED device. (Figure 7)
Multi-Blunt Body Aerodynamic Induced Wave Motion Prior research findings of the Author and co-researchers have explored the complex spatial and temporal dynamics of many close-packed oscillators on cross-flow. For a single row of cylinders [see M. Thothadri and FC. Moon, “Helical Wave Oscillations in a Row of Cylinders in a Cross Flow”, J. Fluids and Structures, 12, 591-613 (1998)] we have shown a collective nonlinear mode for seven cylinders. In another set of papers with a Japanese researcher [see Kuroda, M. and Moon, F.C.
Fig. 7. Voltage vs Time for 30 mF storage capacitor for 4 oscillator array; wind speed 4 m/s.
398
F. C. Moon
“Local Complexity and Global Nonlinear Modes in Large Arrays of Fluid-Elastic Oscillators” Proc. IMECE2002, ASME Congress, 2002.] we examined the dynamics of 300 and 1000 rod-like oscillators in a close packed array. Again we found global modal patterns for cross-flow conditions. This suggests that a close-packed array of blunt-bodies in a vibro-wind panel will exhibit collective behavior from which we can optimize the extraction of energy from the air flow. A photo from the 1000 rods experiments is shown below looking down on the tops of the vibrating rods.
6. Architectural Issues in Vibro-Wind Energy Harvesting There is a great interest today in architectural building design in “building integrated power generation” or BIPG. In order for this new technology to be successful, the design concept of putting vibrating structures on building facades must be acceptable to the professional architecture community.
Fig. 8. 1000 rods in cross flow showing a collective oscillator nonlinear mode. [Kuroda and Moon, 2002, 2007]
Principles of Nonlinear Vibro-Wind Energy Conversion
399
Fortunately there is precedent for dynamic elements in architecture originating in the kinetic sculpture or kinetic art world. As early as the 1970’s the sculptor Harry Bertoia was using hundreds of vibrating rods of 1-2 meters in length in architectural environments to produce musical sounds in the flow of wind around buildings, (see e.g. Nelson, 1970). In the 1990’s the Japanese kinetic sculptor, Susumu Shingu, had installed large oscillating wind vanes on roofs, towers and domes that vibrated in the wind. One of these sculptures can be seen at an outdoor underground station in Cambridge, Mass, (see, Shingu, 1997). More recently, the kinetic artist and designer Ned Kahn has designed panels on the scale of 2m by 12m with 80,000 vibrating plates for architectural projects in Charlotte NC and Winterthur, Switzerland. These thousands of small vibrating plates are designed to produce visual wave-like effects as the wind blows around the buildings, (see Streitfeld, 2008). Video examples can be seen by through Google or YouTube with the field “Ned Kahn”. Thus there is precedent for the use of vibrating elements in architectural design. Our goal will be to produce a technology that produces energy for internal building use as well as create an aesthetic visual effect. Besides the determination of optimal shape of the mass-like obstacles to the flow there are architectural issues that a vibro-wind system must address Compatibility with building façade design practices Installation issues Weather resistant design Maintainability Vibro-wind panel lifetime Fatigue of vibrating elements Noise and acoustics Aesthetic design Integration into the building energy or grid system Compatibility with local building codes
400
F. C. Moon
7. Summary There is emerging a new set of wind energy concepts based on vibratory motion and not rotary motion. These systems, while not as efficient as rotary machines, are more analogous to solar panel technology and have the potential for application to architectural design for building energy use, especially in urban areas where rotary turbines are not practical. We have shown that vibro-wind energy harvesting depends on the nonlinear characteristics of fluid-structure dynamics. However another key technology is the transfer of energy from mechanical to electrical storage. Researchers at Cornell University have built several prototype Vibro-wind systems that show the potential of this emerging alternative energy technology.
References 1. Allen, J. J. and Smits, A. J., J. Fluids and Structures, Vol. 15, pp 629-640 (2001). 2. Barrero-Gil, A. Alonso, G., Sanz-Andres, “Energy harvesting from transverse galloping” J. Sound and Vibration Vol. 329, pp 2873-2883 (2010). 3. Blevins, R. D., Flow-Induced Vibration, Van Nostrand, NY (1977). 4. Govardhan, R. and Williamson, C. H. K., “Modes of vortex formation and frequency response of a freely vibrating cylinder” J. Fluid Mechanics, Vol. 420, pp 85-130 (2000). 5. Den Hartog, J. D., Mechanical Vibrations, McGraw Hill, NY (1934). 6. Khalak, A. and Williamson, C. H. K., “Motions, Forces, and Mode Transitions in Vortex-induced Vibrations at Low Mass-Damping” J. Fluids and Structures, Vol. 13, pp 813-851 (1999). 7. Kuroda, M. and Moon, F. C., “Local Complexity and Global Nonlinear Modes in Large Arrays of Fluid-Elastic Oscillators” Proc. IMECE2002, ASME Congress, 2002. 8. Kuroda, M. and Moon, F. C., “Experimental Reconsideration of Spatio-Temporal Dynamics Observed in Fluid-elastic Oscillator Arrays from a Complex System Point of View” Complexity Vol. 12, No. 4, pp 36-47 (2007). 9. Leithead W. E., “Wind Energy”, Energy for the Future: Phil. Transactions of the royal Society, Vol. 365, No. 1853, pp 957-970, 15 April 2007 (2007). 10. Li, S. and Lipson, H., “Vertical-Stalk Flapping-Leaf Generator for Parallel Wind Energy Harvesting” Proc. ASME/AIAA 2009 Conference on Smart Materials and Adaptive structures, SMASIS2009 (2009). 11. Ly, K. H. and Chasteau, V. A. L. J. Energy (1981).
Principles of Nonlinear Vibro-Wind Energy Conversion
401
12. McKinney, W., DeLaurier, J. D., “Wingmill- An Oscillating-wing Windmill” J. Energy (1981). 13. Moon, F. C., Chaotic and Fractal Dynamics, Wiley, NY (1992). 14. Nelson, J. K., Harry Bertoia, Sculptor, Wayne State Univ. Press, Detroit (1970). 15. Parkinson, G. and Smith, J. D., “The square prism as an aeroelastic oscillator” Quart. J. Mech. & Appl. Math. Vol. 17, pp 225-239 (1964). 16. Parkinson, G., “Phenomena and Modeling of Flow-Induced Vibrations of Bluff Bodies” Progress in Aerospace Sciences, Vol. 26, pp 169-224 (1989). 17. Priya, S., Chen, Chih-Ta, Fye, D., and Zahnd, F., “Piezoelectric Windmill: A Novel Solution to Remote Sensing” Japanese J. Applied Physics, Vol. 44, No. 3, pp L104L107 (2005). 18. Roundy, S., Wright, P. K. and Rabaey, J. M., Energy Scavenging for Wireless Sensor Networks, Kluwer, Boston (2004). 19. Soldano, H. A., Inman, D. J. and Park, Gyuhae, “A review of power harvesting from vibration using Piezoelectric materials” Shock and Vibration Digest, May 2004, Vol. 36 Issue 3 pp 197-205. 20. Streitfeld, L. P., “Ned Kahn: Sculpting Consciousness in Time and Space” Sculpture, Vol. 27, pp 43-49. April 2008 (2008). 21. Susumu Shingu, Shingu: Message From Nature, Abbeville Press (1997). 22. Tang, L., Paidoussis, M. P., DeLaurier, J. D., “Flutter-Mill A New Energy Harvesting Device” McGIll University PrePrint. Available online (2008). 23. Thothadri, M. and F. C. Moon, “Helical Wave Oscillations in a Row of Cylinders in a Cross Flow”, J. Fluids and Structures, 12, 591-613 (1998). 24. Thothadri, M. and F. C. Moon, “An Investigation of Nonlinear Models for a Cylinder Row in a Cross Flow”, ASME J. Pressure Vessel Technology, 121, 133-141 (1999). 25. Van Oudheusden, B. W., “On the Quasi-Steady Analysis of One-Degree-ofFreedom Galloping with Combined Translational and Rotational Effects” Nonlinear Dynamics, Vol. 8, pp. 435-451, Kluwer Acad. Publ (1995). 26. Wu, W. J, Wickenheiser, A. M., Reissman, T., Garcia, E., “Modeling and experiments on synchronized discharging techniques for power harvesting from piezoelectric transducers,” Smart Materials and Structures Journal, in review. 27. Williamson, C., “Vortex Dynamics in Wakes” Ann Rev Fluid Mech. 1996.
From Waves in Complex Systems to Dynamics of Generalized Continua Tributes to Professor Yih-Hsing Pao on His 80th Birthday The book reviews recent research activities in applied mechanics and applied mathematics such as the fields of solid £t fluid constitutive modeling for coupled fields, applications of geophysical and environmental context in judicious numericalcomputational implementations. The book aims to merge foundation aspects of continuum mechanics with modern technological applications, notably on reviewing recent advances in the treated subjects in an attractive presentation accessible to a wide readership of engineering and applied sciences. Key Features: * Covers waves and dynamics in complex continuous systems with applications to modern engineering and environmental-geophysical processes " Presents the topics with fundamental understanding of the foundations of mathematics and physics ' Pushes the presented subjects into forms allowing immediate transposition that are relevant to engineering and applied sciences