Lecture Notes in Computer Science Commenced Publication in 1973 Founding and Former Series Editors: Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen
Editorial Board
David Hutchison, Lancaster University, UK
Takeo Kanade, Carnegie Mellon University, Pittsburgh, PA, USA
Josef Kittler, University of Surrey, Guildford, UK
Jon M. Kleinberg, Cornell University, Ithaca, NY, USA
Alfred Kobsa, University of California, Irvine, CA, USA
Friedemann Mattern, ETH Zurich, Switzerland
John C. Mitchell, Stanford University, CA, USA
Moni Naor, Weizmann Institute of Science, Rehovot, Israel
Oscar Nierstrasz, University of Bern, Switzerland
C. Pandu Rangan, Indian Institute of Technology, Madras, India
Bernhard Steffen, TU Dortmund University, Germany
Madhu Sudan, Microsoft Research, Cambridge, MA, USA
Demetri Terzopoulos, University of California, Los Angeles, CA, USA
Doug Tygar, University of California, Berkeley, CA, USA
Gerhard Weikum, Max-Planck Institute of Computer Science, Saarbruecken, Germany
5910
Ivan Lirkov, Svetozar Margenov, Jerzy Waśniewski (Eds.)
Large-Scale Scientific Computing
7th International Conference, LSSC 2009
Sozopol, Bulgaria, June 4-8, 2009
Revised Papers
Volume Editors

Ivan Lirkov, Svetozar Margenov
Bulgarian Academy of Sciences, Institute for Parallel Processing
Acad. G. Bonchev, block 25A, 1113 Sofia, Bulgaria
E-mail: {ivan, margenov}@parallel.bas.bg

Jerzy Waśniewski
Technical University of Denmark, Department of Informatics and Mathematical Modelling
Richard Petersens Plads, Building 321, 2800 Kongens Lyngby, Denmark
E-mail: [email protected]
Library of Congress Control Number: 2010924431
CR Subject Classification (1998): G.1, D.1, D.4, F.2, C.2, I.6, J.2, J.6
LNCS Sublibrary: SL 1 – Theoretical Computer Science and General Issues
ISSN 0302-9743
ISBN-10 3-642-12534-4 Springer Berlin Heidelberg New York
ISBN-13 978-3-642-12534-8 Springer Berlin Heidelberg New York
This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer. Violations are liable to prosecution under the German Copyright Law. springer.com © Springer-Verlag Berlin Heidelberg 2010 Printed in Germany Typesetting: Camera-ready by author, data conversion by Scientific Publishing Services, Chennai, India Printed on acid-free paper 06/3180
Preface
The 7th International Conference on Large-Scale Scientific Computations (LSSC 2009) was held in Sozopol, Bulgaria, June 4–8, 2009. The conference was organized and sponsored by the Institute for Parallel Processing at the Bulgarian Academy of Sciences.

The conference was devoted to the 70th birthday anniversary of Professor Zahari Zlatev. The Bulgarian Academy of Sciences awarded him the Marin Drinov medal on ribbon for his outstanding results in environmental mathematics and for his contributions to the Bulgarian mathematical society and the Academy of Sciences.

The plenary invited speakers and lectures were:
– P. Arbenz, "μFinite Element Analysis of Human Bone Structures"
– Y. Efendiev, "Mixed Multiscale Finite Element Methods Using Limited Global Information"
– U. Langer, "Fast Solvers for Non-Linear Time-Harmonic Problems"
– T. Manteuffel, "First-Order System Least-Squares Approach to Resistive Magnetohydrodynamic Equations"
– K. Sabelfeld, "Stochastic Simulation for Solving Random Boundary Value Problems and Some Applications"
– F. Tröltzsch, "On Finite Element Error Estimates for Optimal Control Problems with Elliptic PDEs"
– Z. Zlatev, "On Some Stability Properties of the Richardson Extrapolation Applied Together with the θ-method"

The success of the conference and the present volume in particular are an outcome of the joint efforts of many partners from various institutions and organizations. First we would like to thank all the members of the Scientific Committee for their valuable contribution in forming the scientific face of the conference, as well as for their help in reviewing contributed papers. We especially thank the organizers of the special sessions. We are also grateful to the staff involved in the local organization.

Traditionally, the purpose of the conference is to bring together scientists working with large-scale computational models of environmental and industrial problems, and specialists in the field of numerical methods and algorithms for modern high-performance computers. The invited lectures reviewed some of the advanced achievements in the field of numerical methods and their efficient applications. The conference talks were presented by university researchers and practicing industry engineers, including applied mathematicians, numerical analysts and computer experts. The general theme for LSSC 2009 was "Large-Scale Scientific Computing" with a particular focus on the organized special sessions.
The special session organizers were:
– Multilevel and Multiscale Preconditioning Methods — J. Kraus, S. Margenov, M. Neytcheva
– Upscaling and Multiscale Methods — M. Katsoulakis, R. Lazarov
– Industrial and Biomedical Multiscale Problems — Y. Efendiev, O. Iliev, P. Popov
– Environmental Modelling — A. Ebel, K. Georgiev, Z. Zlatev
– Control and Uncertain Systems — M. Krastanov, V. Veliov (this session was dedicated to the 60th anniversary of Asen Donchev)
– Applications of Metaheuristics to Large-Scale Problems — F. Luna, S. Fidanova
– Monte Carlo: Methods, Applications, Distributed Computing — I. Dimov, V. Mitin, M. Nedjalkov
– Grid and Scientific and Engineering Applications — A. Karaivanova, E. Atanassov, T. Gurov
– Reliable Numerical Methods for Differential Equations — I. Faragó, J. Karátson, S. Korotov
– Discretization and Fast Solution Techniques for Large-Scale Physics Applications — P. Vassilevski, L. Zikatanov
– Least Squares Finite Element Methods — P. Bochev, T. Manteuffel
– Unconventional Uses of Optimization in Scientific Computing — P. Bochev, D. Ridzal

More than 140 participants from all over the world attended the conference, representing some of the strongest research groups in the field of advanced large-scale scientific computing. This volume contains 99 papers submitted by authors from 24 countries.

The 8th International Conference on Large-Scale Scientific Computations (LSSC 2011) will be organized in June 2011.

January 2010
Ivan Lirkov, Svetozar Margenov, Jerzy Waśniewski
Table of Contents
I Plenary and Invited Papers
An Efficiency-Based Adaptive Refinement Scheme Applied to Incompressible, Resistive Magnetohydrodynamics . . . . . . . . . . . . . . . . . . . . J. Adler, T. Manteuffel, S. McCormick, J. Nolting, J. Ruge, and L. Tang
1
Discontinuous Galerkin Subgrid Finite Element Method for Heterogeneous Brinkman’s Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Oleg P. Iliev, Raytcho D. Lazarov, and Joerg Willems
14
Stochastic Simulation for Solving Random Boundary Value Problems and Some Applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Karl Sabelfeld
26
On Finite Element Error Estimates for Optimal Control Problems with Elliptic PDEs . . . . . Fredi Tröltzsch
40
On Some Stability Properties of the Richardson Extrapolation Applied Together with the θ-Method . . . . . Zahari Zlatev, István Faragó, and Ágnes Havasi
54
II Multilevel and Multiscale Preconditioning Methods
On the Use of Aggregation-Based Parallel Multilevel Preconditioners in the LES of Wall-Bounded Turbulent Flows . . . . . 67
Andrea Aprovitola, Pasqua D'Ambra, Daniela di Serafino, and Salvatore Filippone
An Additive Matrix Preconditioning Method with Application for Domain Decomposition and Two-Level Matrix Partitionings . . . . . 76
Owe Axelsson
Numerical Study of AMLI Methods for Weighted Graph-Laplacians . . . . . 84
Petia Boyanova and Svetozar Margenov
A Scalable TFETI Based Algorithm for 2D and 3D Frictionless Contact Problems . . . . . 92
Zdeněk Dostál, Tomáš Brzobohatý, Tomáš Kozubek, Alex Markopoulos, and Vít Vondrák
Multilevel Preconditioning of Crouzeix-Raviart 3D Pure Displacement Elasticity Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Ivan Georgiev, Johannes Kraus, and Svetozar Margenov
100
Element-by-Element Schur Complement Approximations for General Nonsymmetric Matrices of Two-by-Two Block Form . . . . . . . . . . . . . . . . . . Maya Neytcheva, Minh Do-Quang, and He Xin
108
Numerical Simulation of Fluid-Structure Interaction Problems on Hybrid Meshes with Algebraic Multigrid Methods . . . . . . . . . . . . . . . . . . . . Huidong Yang and Walter Zulehner
116
III Multilevel and Multiscale Methods for Industrial Applications
Recent Developments in the Multi-Scale-Finite-Volume Procedure . . . . . 124
Giuseppe Bonfigli and Patrick Jenny
Boundary Element Simulation of Linear Water Waves in a Model Basin . . . . . 132
Clemens Hofreither, Ulrich Langer, and Satyendra Tomar
Numerical Homogenization of Bone Microstructure . . . . . 140
Nikola Kosturski and Svetozar Margenov
Multiscale Modeling and Simulation of Fluid Flows in Highly Deformable Porous Media . . . . . 148
P. Popov, Y. Efendiev, and Y. Gorb

IV Environmental Modelling
Assimilation of Chemical Ground Measurements in Air Quality Modeling . . . . . 157
Gabriele Candiani, Claudio Carnevale, Enrico Pisoni, and Marialuisa Volta
Numerical Simulations with Data Assimilation Using an Adaptive POD Procedure . . . . . 165
Gabriel Dimitriu, Narcisa Apreutesei, and Răzvan Ştefănescu
Game-Method Model for Field Fires . . . . . 173
Nina Dobrinkova, Stefka Fidanova, and Krassimir Atanassov
Joint Analysis of Regional Scale Transport and Transformation of Air Pollution from Road and Ship Transport . . . . . 180
K. Ganev, D. Syrakov, G. Gadzhev, M. Prodanova, G. Jordanov, N. Miloshev, and A. Todorova
Runs of UNI-DEM Model on IBM Blue Gene/P Computer and Analysis of the Model Performance . . . . . 188
Krassimir Georgiev and Zahari Zlatev
Sensitivity Analysis of a Large-Scale Air Pollution Model: Numerical Aspects and a Highly Parallel Implementation . . . . . 197
Tzvetan Ostromsky, Ivan T. Dimov, Rayna Georgieva, and Zahari Zlatev
Advanced Results of the PM10 and PM2.5 Winter 2003 Episode by Using MM5-CMAQ and WRF/CHEM Models . . . . . 206
Roberto San José, Juan L. Pérez, José L. Morant, F. Prieto, and Rosa M. González
Four-Dimensional Variational Assimilation of Atmospheric Chemical Data – Application to Regional Modelling of Air Quality . . . . . 214
Achim Strunk, Adolf Ebel, Hendrik Elbern, Elmar Friese, Nadine Goris, and Lars Peter Nieradzik
Numerical Study of Some High PM10 Levels Episodes . . . . . 223
A. Todorova, G. Gadzhev, G. Jordanov, D. Syrakov, K. Ganev, N. Miloshev, and M. Prodanova

V Control and Uncertain Systems
Hausdorff Continuous Viscosity Solutions of Hamilton-Jacobi Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Roumen Anguelov, S. Markov, and F. Minani
231
Stochastic Skiba Sets: An Example from Models of Illicit Drug Consumption . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Roswitha Bultmann, Gustav Feichtinger, and Gernot Tragler
239
Classical and Relaxed Optimization Methods for Nonlinear Parabolic Optimal Control Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . I. Chryssoverghi, J. Coletsos, and B. Kokkinis
247
Exponential Formula for Impulsive Differential Inclusions . . . . . . . . . . . . . Tzanko Donchev
256
Directional Sensitivity Differentials for Parametric Bang-Bang Control Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Ursula Felgenhauer
264
Estimates of Trajectory Tubes of Uncertain Nonlinear Control Systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Tatiana F. Filippova
272
Asymptotics for Singularly Perturbed Reachable Sets . . . . . . . . . . . . . . . . . Elena Goncharova and Alexander Ovseevich
280
On Optimal Control Problem for the Bundle of Trajectories of Uncertain System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Mikhail I. Gusev
286
High-Order Approximations to Nonholonomic Affine Control Systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Mikhail I. Krastanov and Vladimir M. Veliov
294
VI Application of Metaheuristics to Large Scale Problems
A Parametric Multi-start Algorithm for Solving the Response Time Variability Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Albert Corominas, Alberto Garc´ıa-Villoria, and Rafael Pastor
302
Enhancing the Scalability of Metaheuristics by Cooperative Coevolution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Ciprian Cr˘ aciun, Monica Nicoar˘ a, and Daniela Zaharie
310
Hybrid ACO Algorithm for the GPS Surveying Problem . . . . . . . . . . . . . . Stefka Fidanova, Enrique Alba, and Guillermo Molina
318
Generalized Nets as Tools for Modeling of the Ant Colony Optimization Algorithms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Stefka Fidanova and Krassimir Atanassov
326
A Scatter Search Approach for Solving the Automatic Cell Planning Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Francisco Luna, Juan J. Durillo, Antonio J. Nebro, and Enrique Alba
334
Some Aspects Regarding the Application of the Ant Colony Meta-heuristic to Scheduling Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Ioana Moisil and Alexandru-Liviu Olteanu
343
High-Performance Heuristics for Optimization in Stochastic Traffic Engineering Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Evdokia Nikolova
352
Optimization of Complex SVM Kernels Using a Hybrid Algorithm Based on Wasp Behaviour . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Dana Simian, Florin Stoica, and Corina Simian
361
VII Monte Carlo: Methods, Applications, Distributed Computing
Enabling Cutting-Edge Semiconductor Simulation through Grid Technology . . . . . 369
Asen Asenov, Dave Reid, Campbell Millar, Scott Roy, Gareth Roy, Richard Sinnott, Gordon Stewart, and Graeme Stewart
Incremental Reuse of Paths in Random Walk Radiosity . . . . . 379
Francesc Castro and Mateu Sbert
Monte Carlo Adaptive Technique for Sensitivity Analysis of a Large-Scale Air Pollution Model . . . . . 387
Ivan T. Dimov and Rayna Georgieva
Parallel Implementation of the Stochastic Radiosity Method . . . . . 395
Roel Martínez and Jordi Coma
Monte-Carlo Modeling of Electron Kinetics in Room Temperature Quantum-Dot Photodetectors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Vladimir Mitin, Andrei Sergeev, Li-Hsin Chien, and Nizami Vagidov
403
Particle Model of the Scattering-Induced Wigner Function Correction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . M. Nedjalkov, P. Schwaha, O. Baumgartner, and S. Selberherr
411
Analysis of the Monte Carlo Image Creation by Uniform Separation . . . . . 419
A.A. Penzov, Ivan T. Dimov, L. Szirmay-Kalos, and V.N. Koylazov
The Role of the Boundary Conditions on the Current Degradation in FD-SOI Devices . . . . . 427
K. Raleva, D. Vasileska, and S.M. Goodnick
Gamma Photon Transport on the GPU for PET . . . . . 435
L. Szirmay-Kalos, B. Tóth, M. Magdics, D. Légrády, and A.A. Penzov
Transport in Nanostructures: A Comparative Analysis Using Monte Carlo Simulation, the Spherical Harmonic Method, and Higher Moments Models . . . . . 443
M. Vasicek, V. Sverdlov, J. Cervenka, T. Grasser, H. Kosina, and S. Selberherr
Thermal Modeling of GaN HEMTs . . . . . 451
D. Vasileska, A. Ashok, O. Hartin, and S.M. Goodnick
VIII Grid and Scientific and Engineering Applications
Tuning the Generation of Sobol Sequence with Owen Scrambling . . . . . 459
Emanouil Atanassov, Aneta Karaivanova, and Sofiya Ivanovska
Applying the Improved Saleve Framework for Modeling Abrasion of Pebbles . . . . . 467
Péter Dóbé, Richárd Kápolnai, András Sipos, and Imre Szeberényi
Information Flow and Mirroring in an Agent-Based Grid Resource Brokering System . . . . . 475
Maria Ganzha, Marcin Paprzycki, Michal Drozdowicz, Mehrdad Senobari, Ivan Lirkov, Sofiya Ivanovska, Richard Olejnik, and Pavel Telegin
Scatter Search and Grid Computing to Improve Nuclear Fusion Devices . . . . . 483
Antonio Gómez-Iglesias, Miguel A. Vega-Rodríguez, Francisco Castejón-Magaña, Miguel Cárdenas-Montes, and Enrique Morales-Ramos
Service-Oriented Integration of Grid Applications in Heterogeneous Grids . . . . . 491
Radoslava D. Goranova
Grid Based Environment Application Development Methodology . . . . . . . Dorian Gorgan, Teodor Stefanut, Victor Bacu, Danut Mihon, and Denisa Rodila
499
User Level Grid Quality of Service . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Anastas Misev and Emanouil Atanassov
507
Dictionary Compression and Information Source Correction . . . . . . . . . . . D´enes N´emeth, M´ at´e Lakat, and Imre Szeber´enyi
515
A Parallelization of Finite Volume Method for Calculation of Gas Microflows by Domain Decomposition Methods . . . . . 523
Kiril S. Shterev and Stefan K. Stefanov
Background Pollution Forecast over Bulgaria . . . . . 531
D. Syrakov, K. Ganev, M. Prodanova, N. Miloshev, G. Jordanov, E. Katragkou, D. Melas, A. Poupkou, and K. Markakis
Climate Change Impact Assessment of Air Pollution Levels in Bulgaria . . . . . 538
D. Syrakov, M. Prodanova, N. Miloshev, K. Ganev, G. Jordanov, V. Spiridonov, A. Bogatchev, E. Katragkou, D. Melas, A. Poupkou, and K. Markakis
Effective Algorithm for Calculating Spatial Deformations of Pre-stressed Concrete Beams . . . . . 546
János Török, Dénes Németh, and Máté Lakat

IX Reliable Numerical Methods for Differential Equations
Structurally Stable Numerical Schemes for Applied Dynamical Models . . . . . 554
Roumen Anguelov
Matrix and Discrete Maximum Principles . . . . . 563
István Faragó
On a Bisection Algorithm That Produces Conforming Locally Refined Simplicial Meshes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Antti Hannukainen, Sergey Korotov, and Michal Kˇr´ıˇzek
571
A Discrete Maximum Principle for Nonlinear Elliptic Systems with Interface Conditions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . J´ anos Kar´ atson
580
Inverse Problem for Coefficient Identification in Euler-Bernoulli Equation by Linear Spline Approximation . . . . . . . . . . . . . . . . . . . . . . . . . . . Tchavdar T. Marinov and Rossitza S. Marinova
588
A Fully Implicit Method for Fluid Flow Based on Vectorial Operator Splitting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Rossitza S. Marinova, Raymond Spiteri, and Eddy Essien
596
Discrete Maximum Principle for Finite Element Parabolic Operators . . . Miklos E. Mincsovics
604
Climate Change Scenarios for Hungary Based on Numerical Simulations with a Dynamical Climate Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Ildik´ o Pieczka, Judit Bartholy, Rita Pongr´ acz, and Adrienn Hunyady
613
Numerical Simulations of Reaction-Diffusion Systems Arising in Chemistry Using Exponential Integrators . . . . . . . . . . . . . . . . . . . . . . . . . . . R˘ azvan S ¸ tef˘ anescu and Gabriel Dimitriu
621
On the Discretization Time-Step in the Finite Element Theta-Method of the Two-Dimensional Discrete Heat Equation . . . . . . . . . . . . . . . . . . . . . Tam´ as Szab´ o
629
X Novel Applications of Optimization Ideas to the Numerical Solution of PDEs
A Locally Conservative Mimetic Least-Squares Finite Element Method for the Stokes Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Pavel Bochev and Max Gunzburger
637
Additive Operator Decomposition and Optimization–Based Reconnection with Applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Pavel Bochev and Denis Ridzal
645
Least-Squares Spectral Element Method on a Staggered Grid . . . . . 653
Marc Gerritsma, Mick Bouman, and Artur Palha
Mimetic Least-Squares Spectral/hp Finite Element Method for the Poisson Equation . . . . . 662
Artur Palha and Marc Gerritsma
Adaptive Least Squares Finite Element Methods in Elasto-Plasticity . . . . . 671
Gerhard Starke
The Automatic Construction and Solution of a Partial Differential Equation from the Strong Form . . . . . 679
Joseph Young

XI Contributed Papers
Zienkiewicz-Type Finite Element Applied to Fourth-Order Problems . . . Andrey B. Andreev and Milena R. Racheva
687
Acceleration of Convergence for Eigenpairs Approximated by Means of Non-conforming Finite Element Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . Andrey B. Andreev and Milena R. Racheva
695
A Two-Grid Method on Layer-Adapted Meshes for a Semilinear 2D Reaction-Diffusion Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Ivanka T. Angelova and Lubin G. Vulkov
703
Comparative Analysis of Solution Methods for Delay Differential Equations in Hematology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Gergana Bencheva
711
Solving Non-linear Systems of Equations on Graphics Processing Units . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Lubomir T. Dechevsky, Børre Bang, Joakim Gundersen, Arne Laks˚ a, and Arnt R. Kristoffersen
719
Computing n-Variate Orthogonal Discrete Wavelet Transforms on Graphics Processing Units . . . . . 730
Lubomir T. Dechevsky, Joakim Gundersen, and Børre Bang
Wavelet Compression, Data Fitting and Approximation Based on Adaptive Composition of Lorentz-Type Thresholding and Besov-Type Non-threshold Shrinkage . . . . . 738
Lubomir T. Dechevsky, Joakim Gundersen, and Niklas Grip
On Interpolation in the Unit Disk Based on Both Radon Projections and Function Values . . . . . 747
Irina Georgieva and Rumen Uluchev
Comparison between the Marching-Cube and the Marching-Simplex Methods . . . . . 756
Joakim Gundersen, Arnt R. Kristoffersen, and Lubomir T. Dechevsky
Transitions from Static to Dynamic State in Three Stacked Josephson Junctions . . . . . 765
Ivan Christov, Stefka Dimova, and Todor Boyadjiev
Factorization-Based Graph Repartitionings . . . . . 771
Kateřina Jurková and Miroslav Tůma
An Efficient Algorithm for Bilinear Strict Equivalent (BSE)-Matrix Pencils . . . . . 779
Grigoris I. Kalogeropoulos, Athanasios D. Karageorgos, and Athanasios A. Pantelous
Two-Grid Decoupling Method for Elliptic Problems on Disjoint Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Miglena N. Koleva and Lubin G. Vulkov
787
A Method for Sparse-Matrix Computation of B-Spline Curves and Surfaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Arne Laks˚ a
796
Parallel MIC(0) Preconditioning for Numerical Upscaling of Anisotropic Linear Elastic Materials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Svetozar Margenov and Yavor Vutov
805
The Bpmpd Interior Point Solver for Convex Quadratically Constrained Quadratic Programming Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Csaba M´esz´ aros
813
On Shape Optimization of Acoustically Driven Microfluidic Biochips . . . Svetozara I. Petrova
821
Block Preconditioners for the Incompressible Stokes Problem . . . . . . . . . . M. ur Rehman, C. Vuik, and G. Segal
829
Author Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
837
An Efficiency-Based Adaptive Refinement Scheme Applied to Incompressible, Resistive Magnetohydrodynamics

J. Adler, T. Manteuffel, S. McCormick, J. Nolting, J. Ruge, and L. Tang

University of Colorado at Boulder, Department of Applied Mathematics, Boulder, CO
Abstract. This paper describes the use of an efficiency-based adaptive mesh refinement scheme, known as ACE, on a 2D reduced model of the incompressible, resistive magnetohydrodynamic (MHD) equations. A first-order system least squares (FOSLS) finite element formulation and algebraic multigrid (AMG) are used in the context of nested iteration. The FOSLS a posteriori error estimates allow the nested iteration and ACE algorithms to yield the best accuracy-per-computational-cost. The ACE scheme chooses which elements to add when interpolating to finer grids so that the best error reduction with the least amount of cost is obtained, when solving on the refined grid. We show that these methods, applied to the simulation of a tokamak fusion reactor instability, yield approximations to solutions within discretization accuracy using less than the equivalent amount of work needed to perform 10 residual calculations on the finest uniform grid. Keywords: Magnetohydrodynamics, adaptive mesh refinement, algebraic multigrid, nested iteration.
1 Introduction
Magnetohydrodynamics (MHD) is a model of plasma physics that treats the plasma as a charged fluid. As a result, the set of partial differential equations that describe this model is a time-dependent, nonlinear system of equations. Thus, the equations can be difficult to solve and efficient numerical algorithms are needed. This paper shows the use of such an efficient algorithm on the incompressible, resistive MHD equations. A first-order systems least-squares (FOSLS) [1,2] finite element discretization is used along with nested iteration and algebraic multigrid (AMG) [3,4,5,6,7,8]. The main focus of this paper is to show that if an efficiency-based adaptive mesh refinement (AMR) scheme is used, within the nested iteration algorithm, then a nonlinear system of equations, such as the MHD equations, can be solved in only a handful of work units per time step. Here, a work unit is defined as the equivalent of one relaxation sweep on the finest grid. In other words, the accuracy-per-computational-cost for solving the MHD equations can be maximized by the use of nested iteration and AMR. As
is shown in the results section, we were able to resolve an island coalescence instability in less than 10 work units per time step. The MHD system and the FOSLS methodology applied to it are discussed in detail in the companion paper [9], so we include only a brief description here in section 2. The nested iteration algorithm has also been described in [10] and [11], so we only briefly discuss it here in section 3. This section also discusses the efficiency-based AMR method known as ACE, which was developed in [12,13,14]. Finally, in section 4, numerical results are shown for a 2D reduced model that simulates plasma instabilities in a tokamak reactor. These results confirm that the AMR algorithm greatly reduces the amount of work needed to solve the MHD systems.
2 The MHD Equations and FOSLS Formulation
The resistive MHD equations are time-dependent and nonlinear, and involve several dependent variables. The system is a coupling of the incompressible Navier-Stokes and Maxwell's systems. The primitive variables are defined to be the fluid velocity, u, the fluid pressure, p, the magnetic field, B, the current density, j, and the electric field, E. In addition, a resistive form of Ohm's law,

j = σ(E + u × B),
(1)
is used to eliminate the electric field, E, from the equations. After a nondimensionalization using Alfvén units, the following equations for incompressible resistive MHD are obtained (i.e., Navier-Stokes coupled with Maxwell's equations) [15,16]:

∂u/∂t + u · ∇u − j × B + ∇p − (1/Re) ∇²u = f,    (2)
∂B/∂t − B · ∇u + u · ∇B + (1/SL) (∇ × j) = g,    (3)
∇ × B = j,    (4)
∇ · B = 0,    (5)
∇ · u = 0,    (6)
∇ · j = 0.    (7)
Here, Re is the fluid Reynolds Number and SL is the Lundquist Number, both of which are assumed to be constants and adjusted for different types of physical behavior. Using the FOSLS method [1,2], the system is first put into a differential first-order system of equations. This is done based on a vorticity-velocity-pressure-current formulation [17,18,19]. A scaling analysis is performed in [9] for the full MHD system. The resulting scaling yields a nice block structure of the MHD system, which results in good AMG convergence of the linear systems obtained, while still preserving the physics of the system.
Vorticity, ω = ∇ × u, is introduced and the final formulation in 3D used is

(1/√Re) ∇ × u − √Re ω = 0,    (8)
(1/√Re) ∇ · u = 0,    (9)
√Re ∇ · ω = 0,    (10)
(1/√Re) ∂u/∂t − u × ω − j × B − √Re ∇p + (1/√Re) ∇ × ω = f,    (11)
(1/√SL) ∇ × B − √SL j = 0,    (12)
(1/√SL) ∇ · B = 0,    (13)
√SL ∇ · j = 0,    (14)
(1/√SL) ∂B/∂t + (1/√(Re SL)) (u · ∇B − B · ∇u) + (1/√SL) ∇ × j = g.    (15)

The above system is denoted by L(u) = f, where u = (u, ω, p, B, j)^T represents a vector of all of the dependent variables that should not be confused with the vector fluid velocity, u. Then, the L² norm of the residual of this system is minimized. This is referred to as the nonlinear functional,

F(u) = ||L(u) − f||₀.    (16)
In general, we wish to find the argmin of (16) in some solution space V. In the context of this paper, we choose V to be an H 1 product space with boundary conditions that are chosen to satisfy the physical constraints of the problem as well as the assumptions needed for the FOSLS framework. In practice, a series of nested finite subspaces, V h , are used to approximate the solution in V. However, in the Newton-FOSLS approach [20,21], system (8)-(15) is first linearized using a Newton step before a FOSLS functional is formed and minimized. This results in the weak form of the problem that produces symmetric positive definite algebraic systems when the problem is restricted to a finite-dimensional subspace, V h . In addition, proving continuity and coercivity of the resulting bilinear form is equivalent to having H 1 equivalence of the FOSLS functional. Moreover, the FOSLS functional yields a sharp a posteriori local error estimate, which is used to make the algorithm more robust and, under the right conditions, produces algebraic systems that are solved easily by multilevel iterative solvers. Our choice here is algebraic multigrid (AMG) [3,4,5,6,7,8], which, when applied to the FOSLS discretization, has been shown to be an optimal (O(n)) solver [1,2,6]. Using the formulation above, and with appropriate boundary conditions, H 1 equivalence of the linearized FOSLS functional is shown in [11]. Therefore, the FOSLS functionals can be a good measure of the error, or at least the semi-norm of the error, in the solution space. Thus, they can be used to develop an efficient solution algorithm and as aids in the adaptive refinement process. By measuring
the functional norm of the error in each element of the domain, information on where refinement is needed is readily available.
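To make the preceding description a little more concrete, the Newton-FOSLS step can be written generically as follows. This is the standard Gauss-Newton form of the step sketched above, in our own notation for the iterate u_k and the finite element space V^h; the actual implementations in [20,21] may differ in details such as damping. Given the current approximation u_k, compute

δu = argmin_{v ∈ V^h} ||L′(u_k)[v] − (f − L(u_k))||²₀,    u_{k+1} = u_k + δu,

which is equivalent to the symmetric positive definite variational problem

⟨L′(u_k)[δu], L′(u_k)[v]⟩ = ⟨f − L(u_k), L′(u_k)[v]⟩    for all v ∈ V^h.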
3 Solution Algorithm
In [10], an algorithm is devised to solve a system of nonlinear equations, L(u) = f. Starting on a coarse grid, given an initial guess, the system is linearized and the linearized FOSLS functional is then minimized on a finite element space. At this point, several AMG V-cycles are performed until there is little to gain in accuracy-per-computational-cost. The system is then relinearized and the minimum of the new linearized FOSLS functional is searched for in the same manner. After each set of linear solves, the relative difference between the computed linearized functional and the nonlinear functional is checked. If they are close and near the minimum of the linearized functional, then it is concluded that we are close enough to the minimum of the nonlinear functional and, hence, we have a good approximation to the solution on the given grid. Next, the approximation is interpolated to a finer grid and the problem is solved on that grid. This process is repeated until an acceptable error has been reached, or until we have run out of computational resources, such as memory. If, as in the case of the MHD equations, it is a time-dependent problem, the whole process is performed at each time step. This algorithm is summarized in the flow chart, figure 1.
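The nested iteration strategy itself can be illustrated on a much simpler problem. The toy Python script below is our own illustration, not the FOSLS-MHD code: it applies the same idea to a 1D nonlinear two-point boundary value problem, doing most of the Newton work on the coarsest grid, interpolating to each finer grid, and taking a single Newton step there. A dense direct solve stands in for AMG, uniform refinement stands in for adaptive refinement, and the plain nonlinear residual stands in for the FOSLS functional.

```python
import numpy as np

# Toy nested iteration: -u'' + u^3 = f on (0,1), u(0) = u(1) = 0,
# discretized by finite differences; exact solution u(x) = sin(pi*x).
exact = lambda x: np.sin(np.pi * x)
rhs = lambda x: np.pi**2 * np.sin(np.pi * x) + np.sin(np.pi * x) ** 3

def residual(u, h, f):
    r = np.empty_like(u)
    r[0], r[-1] = u[0], u[-1]                       # Dirichlet rows
    r[1:-1] = (-u[:-2] + 2 * u[1:-1] - u[2:]) / h**2 + u[1:-1] ** 3 - f[1:-1]
    return r

def jacobian(u, h):
    n = u.size
    J = np.zeros((n, n))
    J[0, 0] = J[-1, -1] = 1.0
    for i in range(1, n - 1):
        J[i, i - 1] = J[i, i + 1] = -1.0 / h**2
        J[i, i] = 2.0 / h**2 + 3.0 * u[i] ** 2
    return J

def newton(u, h, f, steps):
    for _ in range(steps):
        u = u - np.linalg.solve(jacobian(u, h), residual(u, h, f))
    return u

x = np.linspace(0.0, 1.0, 9)                        # coarsest grid
u = np.zeros_like(x)                                # coarse initial guess
for level in range(6):
    h = x[1] - x[0]
    # several Newton steps on the coarsest grid, a single step per finer grid
    u = newton(u, h, rhs(x), steps=3 if level == 0 else 1)
    print(f"level {level}: n = {x.size - 1:4d}, max error = {np.abs(u - exact(x)).max():.2e}")
    if level < 5:                                   # interpolate to the next finer grid
        xf = np.linspace(0.0, 1.0, 2 * (x.size - 1) + 1)
        u, x = np.interp(xf, x, u), xf
```

Because the interpolated coarse-grid solution is already a good initial guess, the error on each finer grid decreases at roughly the discretization rate even though almost no nonlinear work is done there, which is exactly the effect the nested iteration algorithm exploits.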
3.1 Adaptive Refinement
In the nested iteration algorithm, we decide when to stay on a current mesh and iterate further or interpolate to a finer grid. In [10], it was assumed that the grid is uniformly refined. In other words, it was assumed that there are 2d more points on the next grid than the one before, where d is the dimension of the problem.
Fig. 1. Flow chart of nested iteration algorithm
This is generally not the case when the grids are refined locally. On a given mesh, after enough Newton steps and linear iterations have been performed, the nonlinear functional is calculated in each element. This indicates in which region of the domain the functional and, hence, the error is large compared to the rest of the domain. Then, the best use of new degrees of freedom is to concentrate them where the error is large. Since the goal of the algorithm is to increase the accuracy-per-computational-cost, we do not want to over solve in areas where the error is already small. The adaptive scheme that we describe here is an efficiency-based refinement method, called ACE, that was developed in [12,13,14]. This scheme estimates both the reduction in the functional and the computational cost that would result from any given refinement pattern. These estimates are used to establish a refinement pattern that attempts to optimize the Accuracy-per-Computational cost (Efficiency), which gives rise to the acronym ACE. The square of the functional value on each element, ε_i, is computed and is ordered such that the local functional value is decreasing:

ε₁ ≥ ε₂ ≥ · · · ≥ ε_{N_l},    (17)
where N_l is the total number of elements on level l. Next, we predict the reduction of the squared functional and the estimated computational work that would result if we were to refine a given percentage of the elements with the most error. Denote the percentage by r ∈ (0, 1] and the number of elements to be refined by r∗N_l. Define f(r) to be the fraction of the squared functional contained in the r∗N_l elements with largest error, that is,

f(r) = Σ_{i ≤ rN_l} ε_i² / Σ_{i=1}^{N_l} ε_i².    (18)

The predicted reduction of the squared functional, if these elements are refined once, is then

γ(r) = 1 − f(r) + (1/2)^{2p} f(r).    (19)
Here, p is the order of the finite element basis. In other words, the predicted error reduction only comes from the regions where the elements have been refined. The predicted error in the remaining elements is left unchanged. In general, this underestimates the reduction. In practice, the unrefined elements can have some error reduction as well. On the other hand, the predicted work is assumed to be a function of the total number of elements on the refined grid. Since each refined element yields 2^d children, we predict the number of elements on the refined grid to be

N_{l+1} = N_l (1 − r + 2^d r).    (20)
Fig. 2. Error distribution among elements. Left plot shows uniformly distributed error. Right plot shows distribution where a few elements contain most of the error.
The work required to solve the linear system on the refined grid is assumed to be a function of the number of elements, N_{l+1}, the error reduction factor, γ(r), and the AMG convergence factor, ρ. Assume that the cost of one AMG V-cycle is C_1 work units and the overall setup cost for FOSLS and AMG is C_0 work units. The number of V-cycles needed to reduce the error by γ(r) using AMG is

n = log γ / log ρ.    (21)
Thus, the overall predicted work of solving on a refined grid is

W(r) = (C_0 + C_1 log γ / log ρ) N_{l+1}.    (22)

Replacing N_{l+1} using (20) yields

W(r) = (C_0 + C_1 log γ / log ρ)(1 − r + 2^d r) N_l.    (23)
With these relations in mind, we choose the r that minimizes the predicted effective functional reduction,

γ_eff(r) = γ(r)^{1/W(r)}.    (24)
Therefore, we refine only the r∗N_l elements on each level that give us the best error reduction for the added cost. In addition, one could allow for multiple refinements of each element. For example, the ACE scheme could call for r_1 N_l elements to be refined once and r_2 N_l elements to be refined twice. This changes the predicted error reduction, γ(r_1, r_2), to

γ(r_1, r_2) = 1 − f(r_1) + (1/2)^{2p} (f(r_1) − f(r_2)) + (1/2)^{4p} f(r_2).    (25)

Then, the new number of elements on the refined grid is

N_{l+1} = N_l (1 − r_1 + 2^d (r_1 − r_2) + 2^{2d} r_2),    (26)
and the predicted work estimate, W(r_1, r_2), becomes

W(r_1, r_2) = (C_0 + C_1 log γ / log ρ)(1 − r_1 + 2^d (r_1 − r_2) + 2^{2d} r_2) N_l.    (27)
Now, the optimal pair, 0 ≤ r_2 ≤ r_1 ≤ 1, is found to minimize the effective error reduction, equation (24). This allows for more aggressive refinement. In practice, it has been found that using even more refinement, such as triple refinement, is unnecessary. The ACE scheme fits in nicely with the nested iteration approach. Not only do we try to get the best accuracy-per-computational-cost for each linearization and each AMG cycle, but we also take this into account when interpolating to finer grids. A more detailed explanation of the ACE algorithm can be found in [13,14]. Numerical results in section 4 show that using adaptive refinement yields the same accuracy in the MHD test problems while using fewer degrees of freedom.
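To make the single-refinement selection concrete, the following small Python sketch implements the r-selection loop. It is our own illustration rather than code from [13,14]; it assumes the normalized reading of f(r) in (18) and uses the single-refinement formulas (19), (20), (23) and (24), with the model constants C0, C1 and ρ supplied by the user.

```python
import numpy as np

def ace_fraction(elem_sq_err, p, d, C0, C1, rho):
    """Pick the refinement fraction r that minimizes gamma(r)**(1/W(r))."""
    eps2 = np.sort(np.asarray(elem_sq_err, dtype=float))[::-1]   # decreasing order, cf. (17)
    N = eps2.size
    r = np.arange(1, N + 1) / N                  # candidate fractions r = k/N
    f = np.cumsum(eps2) / eps2.sum()             # error fraction in the r*N worst elements, cf. (18)
    gamma = 1.0 - f + 0.5 ** (2 * p) * f         # predicted functional reduction, cf. (19)
    N_next = N * (1.0 - r + 2**d * r)            # predicted number of elements, cf. (20)
    W = (C0 + C1 * np.log(gamma) / np.log(rho)) * N_next   # predicted work, cf. (23)
    k = int(np.argmin(gamma ** (1.0 / W)))       # minimize the effective reduction, cf. (24)
    return r[k], gamma[k], W[k]

# toy data: a few elements carry most of the error (cf. the right plot of Fig. 2)
rng = np.random.default_rng(0)
errs = np.concatenate([rng.uniform(1e-8, 1e-7, 1000), rng.uniform(1e-2, 1e-1, 30)])
print(ace_fraction(errs, p=2, d=2, C0=5.0, C1=1.0, rho=0.5))
```

For this toy data, where the error is concentrated in a few elements, the minimizer picks a small r, i.e., only those elements are refined; for nearly uniform error the optimum drifts toward r = 1, i.e., uniform refinement.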
4 Numerical Results
In this section, we investigate an MHD test problem to show that the nested iteration Newton-FOSLS-AMG method works well with the addition of the ACE adaptive mesh refinement. We show that these methods are capable of solving the complex nonlinear systems in a minimal amount of work units, or fine-grid relaxation equivalents. The full algorithm, as in figure 1, was applied to a tokamak test problem [22,23,24,25]. From the papers by Chacón, Knoll, and Finn [22] and Philip [24], a reduced set of MHD equations is obtained. These equations simulate a "large aspect-ratio" tokamak, with non-circular cross-sections. Here, the magnetic B-field along the z-direction, or the toroidal direction, is very large and mostly constant. In this context, we are able to look at plasma behavior in the poloidal cross-section. This was described in the companion paper, [9]. The reduced model is equivalent to the 2D version of equations (8)-(15). The x-direction denotes the periodic poloidal direction in the tokamak, whereas the y dimension represents a thin annulus in the poloidal cross section. In this 2D setting, vorticity, ω, and current density, j, are both scalar variables. We now apply our methodology to a test problem known as the island coalescence problem.

4.1 Test Problem: Island Coalescence
This test problem simulates an island coalescence in the current density arising from perturbations in an initial current density sheet. A current density sheet in the toroidal direction of the tokamak is perturbed, resulting in an instability that causes a reconnection in the magnetic field lines and merging of two islands in the current density field. This produces a sharp peak in current density where the magnetic field lines reconnect. This region is known as the reconnection zone and the point at which the magnetic field lines break is known as the X point. See [26,27,24] for more details. We choose a low enough resistivity
(i.e. Lundquist number above 50,000) in order to observe the interesting physics. For the following simulations, we define Ω = [−1, 1] × [−1, 1], Re = SL = 50,001. The initial conditions at equilibrium are

B0(x, y) = (1 / (cosh(2πy) + k cos(2πx))) (sinh(2πy), k sin(2πx))^T,    (28)
u0(x, y) = 0,    (29)
ω0(x, y) = 0,    (30)
j30(x, y) = ∇ × B0 = 2π(k² − 1) / (cosh(2πy) + 0.2 cos(2πx))²,    (31)
p0(x, y) = 1 + (1/2) (1 − k²) / (cosh(2πy) + 0.2 cos(2πx))²,    (32)
where k = 0.2. These initial conditions are perturbed away from equilibrium as follows:

δB0(x, y) = ε ( −(1/π) cos(πx) sin(πy/2), (1/(2π)) cos(πy/2) sin(πx), 0 )^T,    (33)
δj30(x, y) = ε cos(πy/2) cos(πx),    (34)

where ε = −0.01. The boundary conditions are periodic in x and Dirichlet for the current density and vorticity on the top and bottom of the domain. We also have n · u and n · B known on the top and bottom. Again, the FOSLS formulation, (8)-(15), is H¹ elliptic.

Results. The problem was run to time 10τA with a timestep of 0.1τA using a BDF-2 implicit time-stepping scheme. By the 80th time step, or time 8τA, the islands have coalesced and the large peak in current density has occurred at the reconnection point. Using both uniform refinement and the ACE scheme, we were able to capture the instability. With ACE employed, the grids evolve over time to refine in areas with steeper gradients. In this problem, as time progresses, a steep gradient occurs at the reconnection point. This is seen in the bottom graph in figure 3. We expect, then, that most of the refinement occurs in this region, which is indeed the case. To get an idea of how much more efficient adaptive refinement is, we compare the work performed using ACE to that of using uniform refinement. The work at one time step is calculated by first determining the work of all the V-cycles on a given refinement level. Then these values times the number of matrix nonzeros
for the level are summed over all grids and divided by the number of nonzeros on the finest refinement level for the given problem. In table 1, the work unit values given are with respect to the finest level of the given refinement scheme. They are an average over all time steps. To compare the two schemes, the average work unit value is multiplied by the fine-grid nonzeros for that scheme and then the ratio is taken. This ratio is defined as the Work Ratio in table 1. Similarly, the Element Ratio column is the ratio of elements on the finest grid of the adaptive scheme compared to the number of elements on the finest grid of the uniform scheme.
Fig. 3. Numerical solution using 10 levels of adaptive refinement. SL = Re = 50, 001. Top Left: Current Density at Time 4τA . Top Right: Current Density at Time 8τA . Bottom: Zoomed in plot of current density peak at Time 8τA .
Table 1. Average number of work units per timestep using uniform refinement versus ACE refinement. A total of 100 time steps were performed.

            Work Units   Nonzeros      Work Ratio   Element Ratio
Uniform     66.87        51,781,975
ACE         78.23        7,283,047.4   0.16         0.07
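As a quick check of the Work Ratio entry, the 0.16 figure can be reproduced from the averages in the table, assuming the ratio is formed exactly as described above (ACE work units times ACE fine-grid nonzeros, divided by the same product for uniform refinement):

```python
ace = 78.23 * 7_283_047.4          # ACE: work units * fine-grid nonzeros
uniform = 66.87 * 51_781_975       # uniform refinement
print(round(ace / uniform, 2))     # -> 0.16
```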
Table 2. Results for a single time step. AMG convergence factor, ρ, total work units (total work units with respect to uniform refinement in parenthesis), and the nonlinear functional norm are given.

Level   Elements   Nonzeros    Newton Steps   ρ      Work Units       Nonlinear Functional
2       4          17,493      2              0.35   0.054 (0.006)    4.919
3       16         63,063      2              0.55   0.355 (0.038)    1.287
4       40         155,575     1              0.40   0.427 (0.046)    0.604
5       79         299,145     1              0.46   0.682 (0.073)    0.284
6       175        640,871     1              0.58   3.328 (0.356)    0.112
7       337        1,161,741   1              0.50   3.929 (0.420)    0.049
8       658        2,263,261   1              0.79   19.548 (2.090)   0.023
9       1,078      3,624,089   1              0.67   20.789 (2.222)   0.013
10      1,696      5,535,089   1              0.71   29.495 (3.153)   0.008
Total                                                78.697 (8.398)
The results show that using adaptive refinement greatly reduces the amount of work needed, compared to that of using uniform refinement as is done in [10]. ACE requires 10% of the work that uniform refinement requires. The physics is more localized in this problem, especially by time 8τA and, thus, the refinement is more localized. Looking at one specific time step, as in table 2, one can see how the nested iteration algorithm along with ACE solves the problem efficiently. By the finest levels, only one Newton step and a handful of work units are needed. The values in parenthesis show the relative work with respect to a grid that was uniformly refined for the given number of levels. Thus, using nested iteration Newton-FOSLS-AMG with ACE yields a good approximation to the solution of the island coalescence problem in less than 10 work units, or the equivalent of 10 relaxation sweeps on a 128 × 128 bi-quadratic uniform grid.
5 Discussion
We showed that using an efficiency-based adaptive refinement scheme, such as ACE, along with the FOSLS finite element method and nested iteration, yields a highly effective method for solving the complicated, current-vorticity form of resistive MHD. Real world MHD applications are solved very efficiently when the focus is on accuracy-per-computational-cost. The use of FOSLS greatly aids this process. Its sharp, a posteriori error estimate allows parameters to be computed that are used to estimate the current accuracy-per-computational-cost. From this, judgements as to what further computation is necessary are made. Such decision making facilitates an efficient local adaptive refinement process, which in turn reduces the amount of cost needed. Several aspects still need to be studied. First, the linear system solvers are what dictate the overall efficiency of the NI-Newton-FOSLS method. For this paper, the algebraic systems are solved with a classical algebraic multigrid method. Deteriorations in the algebraic convergence for increased timestep size as well as
Reynolds and Lundquist numbers are observed. In fact, in table 2, the convergence factor ρ increases slightly as h gets smaller. The current AMG algorithm can be improved in several ways. One might develop an improved AMG for the above type of systems of PDEs. This might involve the use of newly developed adaptive multigrid algorithms described more in [28,29,30]. In the test problems addressed in this paper, tensor product grids are used, with local refinement. Therefore, one might employ a geometric multigrid solver instead that takes into account the hierarchy of grids used, or one that is based on block structured grids. Specifically, using the same multigrid method to solve on a uniform mesh as for an adaptively refined mesh is not ideal. One would prefer a solver that takes into account the unstructured nature of the grid. In addition, many aspects of the adaptive refinement algorithm can be improved. In the above tests, the ACE algorithm requires more levels than uniform refinement to reach the same accuracy because it does not refine as aggressively as uniform refinement. Currently, we are examining modifications to ACE that dictate the number of unknowns on the refined grid. If this number is fixed, then the ACE algorithm chooses where to put these new nodes. This is accomplished by allowing ACE to refine an element more than once at each step. This approach allows for fewer refinement steps and, hence, fewer computations overall. Finally, there are many other MHD problems to be tested, as well as other time-dependent problems in fluid dynamics that have large nonlinearities. Doing most of the hard work on the coarser grids allows us to solve these problems more efficiently. Using a first-order system least squares formulation, we were able to resolve the above MHD physics, and we believe that, with a careful formulation, it can be used for many other time-dependent nonlinear systems. This, with the addition of a parallel implementation, could allow us to tackle even more complicated problems such as Extended or Hall MHD, as well as other complex fluid problems. Acknowledgments. This work was sponsored by the Department of Energy under grant numbers DE-FG02-03ER25574 and DE-FC02-06ER25784, Lawrence Livermore National Laboratory under contract numbers B568677, and the National Science Foundation under grant numbers DMS-0621199, DMS-0749317, and DMS-0811275.
References 1. Cai, Z., Lazarov, R., Manteuffel, T., McCormick, S.: First-Order System Least Squares for Second-Order Partial Differential Equations. SIAM J. Numer. Anal. 31, 1785–1799 (1994) 2. Cai, Z., Manteuffel, T., McCormick, S.: First-Order System Least Squares for Second-Order Partial Differential Equations. II. SIAM J. Numer. Anal. 34, 425–454 (1997) 3. Brandt, A., McCormick, S.F., Ruge, J.: Algebraic Multigrid (AMG) for automatic multigrid solutions with application to geodetic computations. Report, Inst. for Computational Studies, Fort Collins, CO (1982)
4. Brandt, A., McCormick, S.F., Ruge, J.: Algebraic Multigrid (AMG) for sparse matrix equations. Cambridge University Press, Cambridge (1984) 5. Brandt, A.: Algebraic Multigrid Theory: The Symmetric Case. Appl. Math. Comput. 19(1-4), 23–56 (1986) 6. Briggs, W.L., Henson, V.E., McCormick, S.F.: A Multigrid Tutorial. Society for Industrial and Applied Mathematics (SIAM), Philadelphia (2000) 7. Oosterlee, C., Schuller, A., Trottenberg, U.: Multigrid. Academic Press, London (2000) 8. Ruge, J., St¨ uben, K.: Algebraic Multigrid (AMG). In: McCormick, S.F. (ed.) Multigrid Methods (1986) 9. Adler, J., Manteuffel, T., McCormick, S.F., Ruge, J.: First-order system least squares for incompressible resistive Magnetohydrodynamics. SIAM J. Sci. Comp. (to appear, 2009) 10. Adler, J., Manteuffel, T., McCormick, S., Ruge, J., Sanders, G.: Nested Iteration and First-Order System Least Squares for Incompressible, Resistive Magnetohydrodynamics. SIAM J. on Sci. Comp. (SISC) (submitted) (2009) 11. Adler, J.: Nested Iteration and First Order Systems Least Squares on Incompressible Resistive Magnetohydrodynamics. PhD thesis, University of Colorado at Boulder (2009) 12. Berndt, M., Manteuffel, T., McCormick, S.F.: Local error estimates and adaptive refinement for first-order system least squares (FOSLS). E.T.N.A. 6, 35–43 (1998) 13. DeSterck, H., Manteuffel, T., McCormick, S., Nolting, J., Ruge, J., Tang, L.: Efficiency-based h- and hp-refinement strategies for finite element methods. J. Num. Lin. Alg. Appl. 15, 249–270 (2008) 14. Nolting, J.: Efficiency-based Local Adaptive Refinement for FOSLS Finite Elements. PhD thesis, University of Colorado at Boulder (2008) 15. Nicholson, D.R.: Introduction to Plasma Theory. John Wiley and Sons, New York (1983) 16. Ullrich, P.: Dynamics and Properties of the Magnetohydrodynamics Equations (unpublished) (2005) 17. Bochev, P., Cai, Z., Manteuffel, T., McCormick, S.: Analysis of Velocity-Flux FirstOrder System Least-Squares Principles for the Navier-Stokes Equations: Part I. SIAM J. Numer. Anal. 35, 990–1009 (1998) 18. Bochev, P., Cai, Z., Manteuffel, T., McCormick, S.: Analysis of Velocity-Flux FirstOrder System Least-Squares Principles for the Navier-Stokes Equations: Part II. SIAM J. Numer. Anal. 36, 1125–1144 (1999) 19. Heys, J., Lee, E., Manteuffel, T., McCormick, S.: An Alternative Least-Squares Formulation of the Navier-Stokes Equations with Improved Mass Conservation. J. Comp. Phys. 226(1), 994–1006 (2007) 20. Codd, A.: Elasticity-Fluid Coupled Systems and Elliptic Grid Generation (EGG) Based on First-Order System Least Squares (FOSLS). PhD thesis, University of Colorado at Boulder (2001) 21. Codd, A., Manteuffel, T., McCormick, S.: Multilevel First-Order System Least Squares for Nonlinear Elliptic Partial Differential Equations. SIAM J. Numer. Anal. 41, 2197–2209 (2003) 22. Chacon, L., Knoll, D.A., Finn, J.M.: An Implicit, Nonlinear Reduced Resistive MHD Solver. J. of Computational Physics 178, 15–36 (2002) 23. Chacon, L., Knoll, D.A., Finn, J.M.: Nonlinear Study of the Curvature-Driven Parallel Velocity Shear-Tearing Instability. Physics of Plasmas 9, 1164–1176 (2002)
24. Philip, B., Chacon, L., Pernice, M.: Implicit Adaptive Mesh Refinement for 2D Reduced Resistive Magnetohydrodynamics. J. Comp. Phys. 227(20), 8855–8874 (2008) 25. Strauss, H.: Nonlinear, Three-Dimensional Magnetohydrodynamics of Noncircular Tokamaks. Physics of Fluids 19, 134–140 (1976) 26. Bateman, G.: MHD Instabilities. MIT Press, Cambridge (1978) 27. Knoll, D.A., Chacon, L.: Coalescence of Magnetic Islands, Sloshing, and the Pressure Problem. Physics of Plasmas 13(1) (2006) 28. Brezina, M., Falgout, R., MacLachlan, S., Manteuffel, T., McCormick, S., Ruge, J.: Adaptive smoothed aggregation (αsa) multigrid. SIAM Review (SIGEST) 47, 317–346 (2005) 29. Brezina, M., Falgout, R., Maclachlan, S., Manteuffel, T., McCormick, S., Ruge, J.: Adaptive algebraic multigrid. SIAM J. on Sci. Comp. (SISC) 27, 1261–1286 (2006) 30. MacLachlan, S.: Improving Robustness in Multiscale Methods. PhD thesis, University of Colorado at Boulder (2004) 31. Ruge, J.: Fospack users manual, version 1.0. (unpublished) (2000)
Discontinuous Galerkin Subgrid Finite Element Method for Heterogeneous Brinkman's Equations

Oleg P. Iliev¹,², Raytcho D. Lazarov²,³, and Joerg Willems¹,³

¹ Fraunhofer ITWM, Kaiserslautern, Germany
² Inst. Mathematics, Bulgarian Academy of Sciences, Sofia, Bulgaria
³ Dept. Mathematics, Texas A&M University, College Station, TX 77843, USA
Abstract. We present a two-scale finite element method for solving Brinkman's equations with piece-wise constant coefficients. This system of equations models fluid flows in highly porous, heterogeneous media with complex topology of the heterogeneities. We make use of the recently proposed discontinuous Galerkin FEM for Stokes equations by Wang and Ye in [12] and the concept of subgrid approximation developed for Darcy's equations by Arbogast in [4]. In order to reduce the error along the coarse-grid interfaces we have added an alternating Schwarz iteration using patches around the coarse-grid boundaries. We have implemented the subgrid method using the Deal.II FEM library, [7], and we present the computational results for a number of model problems. Keywords: Numerical upscaling, flow in heterogeneous porous media, Brinkman's equations, subgrid approximation, mixed FEM.
1
Introduction
In this paper we consider the Brinkman’s equations for the velocity u and the pressure p: −μΔu + ∇p + μκ−1 u = f in Ω, ∇ · u = 0 in Ω, (1) u = 0 on ∂Ω. Here μ > 0 is the viscosity, Ω is a bounded simply connected domain in Rn , n = 2, 3, with Lipschitz polyhedral boundary having the outward unit normal vector n, 0 < κ ∈ L∞ (Ω) is the permeability and f ∈ L2 (Ω)n is a forcing term. Then problem (1) has unique weak solution (u, p) ∈ (H 1 (Ω)n , L20 (Ω)). Brinkman’s equations adequately describe flows in highly porous media. They are used for modeling many industrial materials and processes such as industrial filters, with porosity above 0.9, thermal insulators made of glass or mineral wool with porosity 0.99, or open foams with porosity above 0.95, see Fig. 1. Equation (1) was introduced by Brinkman in [6] in order to reduce the deviations between the measurements for flows in highly porous media and the Darcybased predictions. This was done without direct link to underlying microscopic I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 14–25, 2010. c Springer-Verlag Berlin Heidelberg 2010
Discontinuous Galerkin Subgrid Finite Element Method
15
Fig. 1. Microstructure of industrial foams
Fig. 2. Microstructure and marcostructure of mineral wool
behavior of the flow process, but as a constitutive relation involving a dissipative term scaled by the viscosity. However, advances in homogenization theory made it possible to rigorously derive Brinkman’s equations from Stokes’ equations in the case of slow viscous fluid flow in the presence of periodically arranged solid obstacles, see e.g., [1,2,9,11]. Also, system (1) has been considered from the point of view of fictitious domain or penalty formulation of flows of incompressible liquids around solid or porous obstacles; in this case the coefficients are piecewise constant: the viscosity μ is constant and κ is “small” in the solid obstacles and “infinity” in the fluid, (see, e.g. [3]). In this paper we derive and study numerical methods for solving Brinkman’s equations (1) assuming that the coefficient κ is piece-wise constant and has large jumps. Moreover, the structure of the subdomains, where the coefficient is constant, has quite a complicated geometry, see, e.g. Figure 1. For such problems we shall construct and numerically test two-scale finite element approximations. In the case of problems with scale separation the method captures well the coarse scale behavior of the solution and enhances it with fine scale features. Also, we extend such approximations to numerically treat problems without scale separation. Enhancing the method by subdomains around the coarse-grid edges we devise an iterative alternating Schwarz methods, which converges to the fine-grid approximate solution. All constructions are implemented within
16
O.P. Iliev, R.D. Lazarov, and J. Willems
the Deal.II finite element library. As a byproduct of this development we also obtaine a subgrid finite element approximation of Darcy’s problem, a method proposed and justified by Arbogast [4], and the discontinuous Galerkin method for Stokes’ equations, proposed and studied by Wang and Ye [12]. In this note we present derivation of a subgrid approximation of Brinkman equations and test the method on a number of model problems. Our further goals include theoretical and practical study of the error of the method, testing its performance on more realistic three-dimensional problems, and development and justification of efficient solution methods.
2
Preliminaries
Here we use the standard notation for spaces of scalar and vector-functions defined on Ω. L2 (Ω) is the space of measurable square integrable functions in Ω and L20 (Ω) is its subspace of functions with mean value zero. The Sobolev spaces H 1 (Ω) and H 1 (Ω)n consist of scalar and vector-functions, respectively, with weak first derivatives in L2 (Ω) and L2 (Ω)n . Similarly, H01 (Ω)n := {v ∈ H 1 (Ω)n : v = 0 on ∂Ω}, H(div; Ω) := {v ∈ L2 (Ω)n : ∇ · v ∈ L2 (Ω)}, H0 (div; Ω) := {v ∈ H(div; Ω) : v · n = 0 on ∂Ω}. Further, Pk denotes the set of polynomials of degree k ≥ 0 and Pkn the set of vector functions in Rn with components in Pk . The two-scale finite element method uses various (mixed) finite element spaces, which are defined below. Let TH and Th be quasi-uniform quadrilateral triangulations of Ω with mesh-parameter H and h, respectively, such that each TH ∈ TH is an agglomeration of elements in Th . We will refer to TH and Th as coarse and fine triangulation, respectively. Let EH denote the set of all edges (n = 2) and faces (n = 3) of TH , respectively. Also, we define E˚H to be the set of all internal edges/faces of TH , i.e., E˚H := {eH ∈ EH | eH ∂Ω} and E˚h to be the set of all edges/faces of Th that are internal for the coarse-grid cells TH ∈ EH . Furthermore, for each TH ∈ TH we denote by Th (TH ) the restriction of Th to the coarse finite element TH , Th (TH ) is referred to as a fine triangulation of TH . We denote by (VH , WH ) and (Vh , Wh ) the mixed finite element spaces corresponding to TH (Ω) and Th (Ω), resprectively. Likewise, for each TH ∈ TH (Ω) and eH ∈ E˚H let (δVh (TH ), δWh (TH )) ⊂ (H0 (div; TH ), L20 (TH ))
(2)
be mixed finite element spaces corresponding to Th (TH ). We also consider the direct sums of these local finite element spaces and set (δVh , δWh ) := (δVh (TH ), δWh (TH )), TH ∈TH
Discontinuous Galerkin Subgrid Finite Element Method
17
where functions in (δVh (TH ), δWh (TH )) are extended by zero to Ω\TH . We, furthermore, assume that the finite element spaces satisfy the following properties: ∇ · δVh = δWh δWh ⊥ WH
and ∇ · VH = WH ,
(3a)
in the L2 -inner-product,
(3b)
and VH ∩ δVh = {0}.
(3c)
We note that if we choose (VH , WH ) and (δVh (TH ), δWh (TH )), with TH ∈ TH , to be Brezzi-Douglas-Marini (BDM1) mixed finite element spaces of order one (cf. e.g. [5, Section III.3, p. 126]), then (3) is satisfied. The velocity for the BDM1 space in 2-D is given by P12 + span{curl(x21 x2 ), curl(x1 x22 } = P12 + span{(x21 , −2x1 x2 ), (2x1 x2 , −x22 )} on each cell, with the restriction that the normal component is continuous across cell boundaries. In 3-D the spaces are defined in a similar manner, see, e.g. [5, Section III.3, p. 127]. Due to (3b) and (3c) the following direct sum is well-defined (VH,h , WH,h ) := (VH , WH ) ⊕ (δVh , δWh ) .
(a) (Vh , Wh ).
(4)
(b) (VH,h , WH,h ) with 2 coarse cells.
Fig. 3. Degrees of freedom of different mixed finite element spaces corresponding to BDM1 elements for the velocity and piece-wise constant functions for the pressure
3
Subgrid Method for Brinkman’s Equations
Now we outline the numerical subgrid approach for problem (1) in the way T. Arbogast applied it to Darcy’s problem in [4]. It is well known that the mixed variational formulation of (1) reads as follows: Find (u, p) ∈ (H01 (Ω)n , L20 (Ω)) such that for all (v, q) ∈ (H01 (Ω)n , L20 (Ω)) f · vdx, (5) a (u, v) + b (v, p) + b (u, q) = Ω
18
O.P. Iliev, R.D. Lazarov, and J. Willems
where
∇ · vqdx,
(6a)
(μ∇u : ∇v + μκ−1 u · v)dx.
(6b)
b (v, q) := − a (u, v) :=
Ω
Ω
Now we consider a finite element approximation of (5) with respect to (VH,h , ⊂ H01 (Ω)n , and therefore it is a nonconforming finite WH,h ). Note, that VH,h element space. However, we have that VH,h ⊂ H0 (div; Ω) and we can use the discontinuous Galerkin approximation of Stokes equations, derived in [12]: Find (uH,h , pH,h ) ∈ (VH,h , WH,h ) such that for all (vH,h , qH,h ) ∈ (VH,h , WH,h ) we have ah (uH,h , vH,h ) + b (vH,h , pH,h ) + b (uH,h , qH,h ) = F (vH,h ), where ah (u, v) :=
∇u : ∇v + κ−1 u · v dx
Th ∈Th Th
+
e∈Eh
(7)
α u v − {{ε(u)}} v − {{ε(v)}} u ds e |e|
with the average {{ε(·)}} and the jump · defined by 1 + + − − on e ∈ E˚h , 2 n · ∇(u|Th+ · τ ) + n · ∇(u|Th− · τ ) {{ε(u)}} := + + n · ∇(u|T + · τ ) on e ∈ Eh∂
(8)
(9a)
h
and v :=
v|T + · τ + + v|T − · τ − on e ∈ E˚h , h h v|T + · τ + on e ∈ Eh∂ .
(9b)
h
Here α > 0 is a sufficiently large stabilization parameter, n and τ are normal and tangential vectors to the edge e (with right orientation), the superscripts + and − refer to the elements on either side of edge e, and E˚h and Eh∂ denote the sets of all internal and boundary edges, respectively. If H = h, i.e. δVh = ∅, δWh = ∅, we have a single grid approximation of Brinkman’s equations, a method proposed and studied by Wang and Ye in [12] for Stokes equations. The approximate solution obtained on a single grid for H sufficiently small will be further called a reference solution. The subsequent derivation, which follows the reasoning in [4], is the core of the numerical subgrid approach and essentially yields a splitting of (7) into one coarse global and several fine local problems.Due to (4) we know that each element in (VH,h , WH,h ) may be uniquely decomposed into its components from (VH , WH ) and (δVh , δWh ). Thus, (7) may be rewritten as ah (uH + δuh , v + δvh ) + b (v + δvh , pH + δph ) = F (v + δvh ), b (uH + δuh , q + δqh ) = 0, ∀(v, q) ∈ (VH , WH ) and ∀(δvh , δqh ) ∈ (δVh , δWh ).
(10)
Discontinuous Galerkin Subgrid Finite Element Method
19
By linearity we may decompose (10) into ah (uH + δuh , v) + b (v, pH + δph ) + b (uH + δuh , q) = F (v), ∀(v, q) ∈ (VH , WH ), ah (uH + δuh , δvh ) + b (δvh , pH + δph ) + b (uH + δuh , δqh ) = F (δvh ),
∀(δvh , δqh ) ∈ (δVh , δWh ).
(11a)
(11b)
Due to (3a), (3b), and (3c) we may simplify (11) to obtain the equation: ah (uH + δuh , v) + b (v, pH ) + b (uH , q) = F (v) ∀ (v, q) ∈ (VH , WH ) ah (uH + δuh , δvh ) + b (δvh , δph ) + b (δuh , δqh ) = F (δvh ), ∀ (δvh , δqh ) ∈ (δVh , δWh ).
(12a)
(12b)
Remark 1. This last step is crucial to ensure the solvability of (12b). In fact, the equivalence of (11b) and (12b) is a major reason for requiring the properties of the finite element spaces in (3). The requirements (3), however, significantly limit the possible choices of finite elements that might be used in the subgrid method. Indeed, the BDM1 finite element is the only choice that we are aware of that satisfies all constraints and can be used for discretizing (1).
h , δp + δp ) and using Now, by further decomposing (δuh , δph ) = (δuh + δu h h superposition, (12b) may be replaced by the following systems of equations sat h , δp ) and (δuh , δp ), respectively, for all (δvh , δqh ) ∈ (δVh , δWh ): isfied by (δu h h
h , δvh + b δvh , δp
h , δqh = 0 + b δu (13a) ah uH + δu h and
ah δuh , δvh + b δvh , δph + b δuh , δqh = F (δvh ).
(13b)
h , δp ) = (δu
h (uH ), δp (uH )) is a linear operWe easily see by (13a) that (δu h h ator in uH . Unfortunately, as written, these two problems do not lead to local (over the coarse elements) computations since they are connected through the
h and δuh on ∂TH . We achieve penalty involving the tangential component of δu the desired localization by considering equations (13) over each coarse grid cell with zero tangential component of the velocity imposed weakly by the penalty term (for more details, see [13]). Keeping this in mind we further use the same
h (uH ), δp (uH )) as the notation. In the following we refer to (δuh , δph ) and (δu h responses to the right hand side and uH , respectively.
h (uH ) into (12a) we arrive at an upscaled equation, which Plugging δuh + δu is entirely posed in terms of the coarse unknowns, i.e. for all (v, q) ∈ (VH , WH )
h (uH ), v + b (v, pH ) = F (v) − ah δuh , v , ah uH + δu (14) b (uH , q) = 0.
20
O.P. Iliev, R.D. Lazarov, and J. Willems
h (v), that Now, due to (13a) we see, by choosing δvh = δu
h (uH ), δu
h (v) + b δu
h (v), δp (uH ) = 0. ah uH + δu h (uH ) = 0 for any
h (v), δp The second equation in (13a) in turn yields b δu h v ∈ VH . Combining these two results with (14) we obtain the symmetric upscaled problem: find (uH , pH ) ∈ (VH , WH ) so that for all (v, q) ∈ (VH , WH )
h (uH ), v + δu
h (v) + b (v, pH ) + b (uH , q) = F (v) − ah δuh , v , ah uH + δu (15)
h (uH ) One can set up this linear system by first computing the responces δu with uH being a coarse-grid basis function. This could be done in advance and in parallel. Once (15) is solved for (uH , pH ) we obtain the solution of (7) by
h (uH ) + δuh , pH + δp (uH ) + δp ). (uH,h , pH,h ) = (uH + δu h h
4
(16)
Subgrid Method and Alternating Schwarz Iterations
As noted in the previous section we presented a special way of computing the solution of (7), i.e., the finite element solution corresponding to the space (VH,h , WH,h ). The difference between the spaces (VH,h , WH,h ) and (Vh , Wh ) is that the former has no fine degrees of freedom across coarse cell boundaries. Thus, fine-scale features of the solution (u, p) across those coarse cell boundaries are poorly captured by functions in (VH,h , WH,h ). Algorithm 1 addresses this problem by performing alternating Schwarz iterations between the spaces (VH,h , WH,h ) and (Vhτ (eH ), Whτ (eH )) that consist of fine-grid functions defined on overlapping subdomains around each coarse-mesh interface eH ∈ E˚H of size H. Now, problem (17) is of exactly the same form as (7). Thus, by the same reasoning as in the previous section we may replace (17) by the following two problems: Find (δeh , δeh ) ∈ (δVh , δWh ) such that for all (δvh , δqh ) ∈ (δVh , δWh ) j+ 13
ah (δeh , δvh ) + b (δvh , δeh ) = F (δvh ) − ah (uh b (δeh , δqh ) =
j+ 1 −b(uh 3 , δqh ).
j+ 13
, δvh ) − b(δvh , ph
),
(19a)
Find (eH , eH ) ∈ (VH , WH ) such that for all (v, q) ∈ (VH , WH ) we have j+ 13
aH (eH , v) + b (v, eH ) = F (v) − ah (uh b (eH , q) =
j+ 1 −b(uh 3 , q).
j+ 13
, v) − b(v, ph
) − ah (δeh , v),
(19b)
Here, (19a) and (19b) correspond to (13b) and (15), respectively, and analogous to (16), (eH,h , eH,h ) from (17) is obtained by
h (eH ), δp (eH ) + (δeh , δeh ) . (eH,h , eH,h ) = (eH , eH ) + δu (20) h
Discontinuous Galerkin Subgrid Finite Element Method
21
Algorithm 1. Alternating Schwarz extension to the numerical subgrid approach for Darcy’s problem — first formulation. 1: Set (u0h , p0h ) ≡ (0, 0). 2: for j = 0, . . . until convergence do 3: if j > 0 then 4: Starting from (ujh , pih ) perform an additive Schwarz step with respect to j+1/3 j+1/3 , ph ). (Vhτ (eH ), Whτ (eH )) for all eH ∈ EH to get (uh 5: else 1/3 1/3 6: (uh , ph ) = (u0h , p0h ) 7: end if 8: Find (eH,h , eH,h ) ∈ (VH,h , WH,h ) such that for all (v, q) ∈ (VH,h , WH,h ) we have ⎧ ⎨ ah (eH,h , v) + b (v, eH,h ) = F (v) − ah uj+1/3 , v − b v, pj+1/3 , h h (17) j+1/3 ⎩ ,q . b (eH,h , q) = −b uh 9:
Set
j+1/3
j+1 (uj+1 h , ph ) = (u h
j+1/3
, ph
) + (eH,h , eH,h ).
(18)
10: end for
Algorithm 2. Alternating Schwarz extension to the numerical subgrid approach for Darcy’s problem — second formulation.
h for all coarse velocity basis functions, i.e., solve (13a) with uH re1: Compute δu placed by basis functions. 2: Set (u0h , p0h ) ≡ (0, 0). 3: for j = 0, . . . until convergence do 4: Steps 3:–7: of Algorithm 1 5: Solve (19a) for (δeh , δeh ). j+2/3 j+2/3 j+1/3 j+1/3 6: Set (uh , ph ) = (uh , ph ) + (δeh , δeh ). 7: Solve (22) for (eH , eH ). j+2/3 j+1
h (eH ), pj+2/3 + eH + δp (eH )). + eH + δu 8: Set (uj+1 h h , ph ) = (u h h 9: end for
Now, define
j+ 23
(uh
j+ 23
, ph
j+ 13
) := (uh
j+ 13
, ph
) + (δeh , δeh ).
Combining this with (18) and (20) we obtain j+ 23
j+1 (uj+1 h , ph ) = (uh
j+ 23
, ph
h (eH ), δp (eH )). ) + (eH , eH ) + (δu h
(21)
We observe that due to (3a) and (3b) we may simplify (19b) to obtain the equality for all (v, q) ∈ (VH , WH ) aH (eH , v) + b (v, eH ) +b (eH , q) j+ 23
= F (v) − ah uh
j+ 23
, v − b(uh
j+ 23
, q) − b(v, ph
Thus, we may rewrite Algorithm 1 in form of Algorithm 2.
).
(22)
22
O.P. Iliev, R.D. Lazarov, and J. Willems
Remark 2. Algorithm 2 also has a different interpretation than just being some equivalent formulation of Algorithm 1. It is straightforward to see that for j = 0, 2 2 j+ 2 j+ 2 (uh3 , ph3 ) = (δuh , δph ), i.e. it is the solution of (13b). For j ≥ 1 (uh 3 , ph 3 ) is the solution of (13b) with the homogeneous boundary conditions being replaced j+ 1
j+ 1
by (in general) inhomogeneous ones defined by (uh 3 , ph 3 ). Besides, (22) is of exactly the same form as (15). Thus, Algorithm 2 can be viewed as a subgrid algorithm that iteratively improves the local boundary conditions of the response to the right hand side. Remark 3. As a byproduct of our consideration we have a subgrid approximation of the Darcy’s equation. In this case the above algorithms reduce to an overlapping Schwarz domain decomposition method. Such methods were studied in details in [8,10] where a convergence rate independent of H has been established.
(a) Periodic geometry.
(b) Media with obstacles. (c) Media vuggs.
with
large
Fig. 4. Three test geometries. The dark regions indicate obstacles or porous media, i.e. κ−1 is large, and the light regions indicate (almost) free flow regions, i.e. κ−1 is small.
(a) Reference solution.
(b) Subgrid solution.
(c) Schwarz DD, 5 it.
Fig. 5. Velocity u1 for media with periodically arranged obstacles of Figure 4(a)
Discontinuous Galerkin Subgrid Finite Element Method
5
23
Numerical Results
We consider the following example to test the numerical subgrid approach for Brinkman’s problem and the enhanced version with alternating Schwarz iterations. The model problem has non-homogeneous boundary conditions, namely, u = g on ∂Ω. Extending this subgrid method to problems with non-homogeneous boundary data is possible under the condition that g is contained in the trace space of VH on the boundary ∂Ω. This has been elaborated in the PhD thesis of J. Willems, [13]. Example 1. We choose Ω = (0, 1)2 and 1 1e3 in dark regions, f ≡ 0, g ≡ , μ = 1, κ−1 = 0 1 in light regions, where the position of the obstacles is shown in Figure 4(a) (periodic case) and Figure 4(b) (media with small obstacles), and Figure 4(c) (media with relatively large vuggs), respectively. We have chosen a fine mesh of 128 × 128 cells that resolve the geometry. For the subgrid algorithm we choose a coarse 8 × 8 mesh and each coarse-cell is further refined to 16 × 16 fine cells. On all figures we plot the numerical results for the velocity u1 for Example 1. On Figure 5 we report the numerical results for periodically arranged obstacles: Figure 5(a) shows the reference solution, computed on the global fine 128×128 grid, Figure 5(b) — the solution of the subgrid method, and Figure 5(c) — the solution after five iterations of overlapping Schwarz domain decomposition method (Schwarz DD). The relative L2 -norm of the error for the the subgrid solution shown in Figure 5(b) is 7.14e-2, while after five Schwarz iterations (see Figure 5(c)) the L2 -norm of the error is an order of magnitude better. Likewise, on Figure 6 we show the numerical results for the two geometries shown on Figures 4(b) and 4(c). The first row represents the reference solution computed on the global fine grid with 128 × 128 cells. In the second row we give the solution of the subgrid method and in the third row we show the solution obtained after few Schwarz overlapping domain decomposition iterations. In all cases we clearly see the error of the subgrid solution along the coarsegrid boundaries. This error is due to the prescribed zero values for the fine-grid velocity on the coarse-grid boundaries and is characteristic for the multiscale methods (e.g. [14]). As we see in the numerical experiments, few Schwarz iterations substantially decrease the error. For the geometry shown on Figure 4(c) the improvement is achieved after just one iteration, while for the geometry shown on Figure 4(b) the substantial improvement is achieved after 10 iterations. In all cases the approximate solution of Schwarz overlapping method obtained in few iterations is very close to the reference one.
24
O.P. Iliev, R.D. Lazarov, and J. Willems
(a) Reference solution
(b) Reference solution
(c) Subgrid FE solution
(d) Subgrid FE solution
(e) Schwarz DD after 1 iteration
(f) Schwarz DD after 10 iterations
Fig. 6. Velocity component u1 for Example 1; on the left – the results for geometry of Figure 4(b) and the right row – the results for geometry of Figure 4(c)
Discontinuous Galerkin Subgrid Finite Element Method
25
Acknowledgments. The research of O. Iliev was supported by DAAD-PPP D/07/10578 and award KUS-C1-016-04, made by King Abdullah University of Science and Technology (KAUST). R. Lazarov has been supported by award KUS-C1-016-04, made by KAUST, by NSF Grant DMS-0713829, and by the European School for Industrial Mathematics (ESIM) sponsored by the Erasmus Mundus program of the EU. J. Willems was supported by DAAD-PPP D/07/10578, NSF Grant DMS-0713829, and the Studienstiftung des deutschen Volkes (German National Academic Foundation). Part of the research was performed during the visit of O. Iliev to Texas A&M University. The hospitality of the Institute of Applied Mathematics and Computational Science, funded by KAUST, and the Institute for Scientific Computing are gratefully acknowledged. The authors express sincere thanks to Dr. Yalchin Efendiev for his valuable comments and numerous discussion on the subject of this paper.
References 1. Allaire, G.: Homogenization of the Navier-Stokes Equations in Open Sets Perforated with Tiny Holes. I: Abstract Framework, a Volume Distribution of Holes. Arch. Rat. Mech. Anal. 113(3), 209–259 (1991) 2. Allaire, G.: Homogenization of the Navier-Stokes Equations in Open Sets Perforated with Tiny Holes. II: Non-critical Size of the Holes for a Volume Distribution and a Surface Distribution of Holes. Arch. Rat. Mech. Anal. 113(3), 261–298 (1991) 3. Angot, P.: Analysis of Singular Perturbations on the Brinkman Problem for Fictitious Domain Models of Viscous Flows. Math. Methods Appl. Sci. 22(16), 1395– 1412 (1999) 4. Arbogast, T.: Analysis of a Two-Scale, Locally Conservative Subgrid Upscaling for Elliptic Problems. SIAM J. Numer. Anal. 42(2), 576–598 (2004) 5. Brezzi, F., Fortin, M.: Mixed and Hybrid Finite Element Methods. Springer Series in Comput. Mathematics, vol. 15. Springer, New York (1991) 6. Brinkman, H.C.: A Calculation of the Viscose Force Exerted by a Flowing Fluid on a Dense Swarm of Particles. Appl. Sci. Res. A1, 27–34 (1947) 7. deal.II: A Finite Element Differential Equations Analysis Library, http://www.dealii.org/ 8. Ewing, R., Wang, J.: Analysis of the Schwarz Algorithm for Mixed Finite Element Methods. Math. Modelling and Numer. Anal. 26(6), 739–756 (1992) 9. Hornung, U. (ed.): Homogenization and Porous Media, 1st edn. Interdisciplinary Applied Mathematics, vol. 6. Springer, Heidelberg (1997) 10. Mathew, T.: Schwarz Alternating and Iterative Refinement Methods for Mixed Formulations of Elliptic Problems, Part II: Convergence Theory. Numer. Math. 65, 469–492 (1993) 11. Ochoa-Tapia, J.A., Whitaker, S.: Momentum Transfer at the Boundary Between a Porous Medium and a Homogeneous Fluid. I. Theoretical development. Int. J. Heat Mass Transfer 38, 2635–2646 (1995) 12. Wang, J., Ye, X.: New Finite Element Methods in Computational Fluid Dynamics by H(div) Elements. SIAM J. Numer. Anal. 45(3), 1269–1286 (2007) 13. Willems, J.: Numerical Upscaling for Multi-Scale Flow Problems, Ph.D. Thesis, Technical University of Kaiserslautern (2009) 14. Wu, X.H., Efendiev, Y., Hou, T.Y.: Analysis of Upscaling Absolute Permeability. Discrete Contin. Dyn. Syst., Ser B 2, 185–204 (2002)
Stochastic Simulation for Solving Random Boundary Value Problems and Some Applications Karl Sabelfeld Institute Comp. Math. & Math. Geoph., Novosibirsk, Lavrentiev str, 6, 630090 Novosibirsk, Russia
[email protected]
Abstract. I present in this paper (first given as a plenary lecture at the International conference Large-Scale Scientific Computations, Sozopol 2009) an overview of stochastic simulation methods for solving boundary value problems for high dimensional PDEs with random fields as input data. Three classes of boundary value problems are considered: (1) PDEs with random coefficients, in particular, the Darcy equation with random hydraulic conductivity, (2) PDEs with random source terms, in particular, the elasticity problems governed by the Lam´e equation with random loads, and (3) PDEs with random boundary conditions, in particular, the Laplace and Lam´e equations with random velocities prescribed on the boundary. Here of particular interest are applications related to the analysis of dislocations behavior in crystals. In the modern problems of mathematical physics stochastic models governed by PDEs with random parameters attract more and more attention. One of the reasons is the growing complexity of the modern measurement techniques which enables to cover multiscale character of the studied processes. As an example, we mention the X-ray diffraction technique for the analysis of the crystal surfaces, and the flows in porous media. A direct Monte Carlo approach to solve the random boundary value problem ensemblewise is well known, but it takes a huge amount of computer time even in quite simple cases. We give a different approach based on a combined space-ensemble ergodic averaging. To this end, we develop ergodic models of random fields, in particular, the Fourier-wavelet models. We present some applications, in particular, the long-term correlations of the transport in porous media, and the X-ray diffraction measurements of the dislocations in crystals. We discuss also some inverse problems of elastography in a stochastic formulation, governed by the elasticity Lam´e equation with random loads and random displacements prescribed on the boundary. Comparisons are made against some exactly solvable results. Finally we suggest a stochastic fractal model for the mass density distribution in the Universe.
1
Introduction
Random fields provide a useful mathematical framework for representing disordered heterogeneous media in theoretical and computational studies. I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 26–39, 2010. c Springer-Verlag Berlin Heidelberg 2010
Stochastic Simulation
27
Example 1. Fully developed turbulent flows. The velocity field representing the turbulent flow is modeled as a random field v(x, t) with statistics encoding important empirical features, and the temporal dynamics of the position X(t) and velocity V(t) = dX dt of immersed particles is then governed by m dV(t) = − γ V(t) − v(X(t), t) dt + 2kB T γ dW(t), where m is the particle mass, γ is its friction coefficient, kB is the Boltzmann constant, T is the absolute temperature, and W(t) is a random Wiener process representing molecular collisions. See [1,11,12]. Example 2. Transport through porous media, such as groundwater aquifers, in which the hydraulic conductivity K(x) is modeled as random field reflecting the empirical variability of the porous medium, see [14,15]. The Darcy flow rate q(x) in response to pressure applied at the boundary is governed by the Darcy equation q(x) = −K(x) grad φ(x),
div q = 0.
Example 3. Stochastic elastostatics problem governed by the Lam´e equation μ Δu(x) + (λ + μ) grad div u(x) = f (x) ,
x∈D ,
u(y)|y∈∂D = g(y)
where u is the displacement vector, and μ and λ are the elastic Lam´e coefficients (scalar random fields), f is a random vector load, and g is a random displacement vector prescribed on the boundary. Example 4. Smoluchowski coagulation equations. Our fourth example deals with a nonlinear coagulation dynamics. In the processes of crystal growth, the kinetics of an ensemble of atom islands that diffuse on the surface and irreversibly merge as they touch each other can be described by the system of Smoluchowski equations: ∞ dcn 1 = Kij ci cj − cn Kjn cj . dt 2 i+j=n j=1 Here cn is the number of islands containing n units (vacancies of atoms in our case) per unit area. In Fig. 1 we show a sample of simulated facetted islands of atoms (upper panel), and the relevant size distributions, see for details [16,17]. What should be evaluated ? In the first example, important characteristics is c(x), the average concentration of particles which is in fact a one-particle statistics. More complicated is the fluctuation of concentration, which is a twoparticle statistics related to the mean square separation ρ2 (t). Flows in porous media: even the simplest statistical characteristics, the mean flow, is a non-trivial function to be calculated. More general Lagrangian statistical characteristics are necessary, to evaluate the mean concentration and its flux, see [11-15]. For tracking the trajectories in the extremely heterogeneous velocity field, one needs a multiscale resolution through a stochastic synthesis of random fields.
28
K. Sabelfeld
Fig. 1. Formation of facetted adatom islands by solving the Smoluchowski equation
Fig. 2. Two samples of a random velocity field, with a unit correlation length L = 1 (left panel) and L = 0.5 (right panel)
Stochastic Simulation
29
Fig. 3. A sample of a random hydraulic conductivity for an isotropic porous medium
Elasticity problems: Standard statistical characteristics are the mean displacements ui , the second moments u2i , and the probability that the displacements exceed some critical value: P rob(ui > ucr ). The same statistics - for the strain and stress tensors in the elasticity problem εij = (ui,j + uj,i )/2,
τij = 2μεij + λδij divu .
In the analysis of dislocations in crystals, the goal is to evaluate the x-ray diffraction peak profiles from distributions of misfit and threading dislocations. The x-ray scattering amplitude from a film of thickness d is given by an integral of the form ∞ A(qx , qz ) =
d dx
−∞
dz exp {i[qx x + qz z + V (x, z)]} 0
where V (x, z) =
K(x − x , z)u(x , ω)dx ,
is the total displacement field due to all misfit dislocations, and the kernel K(x− x , z) is the Green function given explicitly or to be calculated numerically, and m the random field u(x , ω) is defined by u(x, ω) = δ(x − ωj ) where δ is the j=1
Dyrac delta-function, the random points ω1 , . . . , ωm are distributed on [0, L] with a density p(ω) which is independent coordinate x. of the spatial The scattered intensity I(qx , qz ) = |A(qx , qz )|2 can be directly calculated by the Monte Carlo double randomization method, as explained above, by a randomized evaluation of the integral representation
30
K. Sabelfeld
∞ I(qx , qz ) =
d d dx
−∞
0
dz1 dz2 ei[qx x+qz (z1 −z2 )] ei[V (x1 ,z1 )−V (x2 ,z2 )] .
0
The random points ω1 , ω2 , . . . , ωm may have quite different distribution on [0, L], say, they may be all independent of each other, they may form a Markov chain with a certain transition probability density, or they may be placed almost periodically, with small but correlated random shifts ωj from their mean positions Xj = j Δx, j = 1, . . . , m: ωj+1 = Δj + ωj . The mean density of points on the [0, L] is ρ = m/L so that Δx = ρ−1 , and the random field u(x, ω) is stationary with the mean u(x, y; ω) = ρ. For details, see [16,17]. Smoluchowski coagulation equation: dX(t) = V(t)dt, dVi (t) = ai (X(t), V(t))dt + σij (X(t), t)dBj (t) 1 dnl = Kij ni nj − Kli nl ni . dt 2 i≥1
i+j=l
Important quantities are the average of the mean size ¯ n(t), as well as the average size spectrum nk (t). More complicated functionals: what are the average and variance of the random time when a “solution explosion” happens? (the so-called gelation phenomenon). Droplet growth in turbulent flows: The coagulation kernel in the case of turbulent coagulation regime has the form: ε(x, t) 3 1/3 r1 (i + j 1/3 )3 Kij = 1.364 ν where ε(x, t) is a random field of the dissipation rate of the turbulent kinetic energy, ν is the kinematic viscosity, and r1 is the monomer size.
2
Spectral Representations of Homogeneous Random Fields
Under quite general conditions, a real-valued Gaussian homogenous random field u(x) can be represented through a stochastic Fourier integral ˜ (dk) u(x) = e2πik·x E 1/2 (k)W (1) R Id
˜ (dk) is a complex-valued white noise random measure on IRd , with where W ˜ (B) = W ˜ (−B), W ˜ (B) = 0, and W ˜ (B)W ˜ (B ) = μ(B ∩ B ) for Lebesgue W measure μ and all Lebesgue-measurable sets B, B . The spectral density E(k) is a nonnegative even function representing the strength (energy) of the random field associated to the wavenumber k, meaning the length scale 1/|k| and direction k/|k|.
Stochastic Simulation
31
Multiscale random fields will have a multiscale spectral density, meaning that E(k) will have substantial contributions over a wide range of wavenumbers kmin |k| kmax , with kmax /kmin 1. This poses a challenge for efficient simulation. More generally, we deal with homogeneous Gaussian l-dimensional vector random fields (see [1-3]) u(x) = (u1 (x), . . . , ul (x))T , x ∈ IRd with a given correlation tensor B(r): Bij (r) = ui (x + r) uj (x), i, j = 1, . . . l, (2) or with the corresponding spectral tensor F : −i 2π k·r Bij (r) dr, Bij (r) = ei 2π r·k Fij (k) dk, Fij (k) = e R Id
i, j = 1, . . . l .
R Id
We assume that the condition
(3) |Bjj (r)| dr < ∞ is satisfied which ensures that
R Id
the spectral functions Fij are uniformly continuous with respect to k. Here Bjj is the trace of B. Let Q(k) be an l × n matrix defined by Q(k)Q∗ (k) = F (k),
¯ Q(−k) = Q(k) .
(4)
Here the star stands for the complex conjugate transpose which is equivalent to taking two operations, the transpose T , and the complex conjugation of each entry. Then the spectral representation of the random field is written as follows u(x) = ei 2π kx Q(k) Z(dk) (5) R Id
where the column-vector Z = (Z1 , . . . Zn )T is a complex-valued homogeneous n-dimensional white noise on IRd with a unite variance and zero mean: Z(dk) = 0,
Zi (dk1 ) Z¯j (dk2 ) = δij δ(k1 − k2 ) dk1 dk2
(6)
¯ satisfying the condition Z(−dk) = Z(dk). Different simulation formulae based on series representations and Fourierwavelet expansions can be found in our recent papers [1-3].
3
Random Walks and Double Randomization Method
Assume we have to solve a PDE which includes a random field σ, say in a right-hand side, in coefficients, or in the boundary conditions: Lu = f,
u|Γ = uγ .
32
K. Sabelfeld
Suppose we have constructed a stochastic method for solving this problem, for a fixed sample of σ. This implies, e.g., that an unbiased random estimator ξ(x| σ) is defined so that for a fixed σ, u(x, σ) = ξ(x| σ) where · stands for averaging over the random trajectories of the stochastic method (e.g., a diffusion process, a Random Walk on Spheres, or a Random Walk on Boundary). Let us denote by Eσ the average over the distribution of σ. The double randomization method is based on the equality: Eσ u(x, σ) = Eσ ξ(x| σ) . The algorithm for evaluation of Eσ u(x, σ) then reads: 1. Choose a sample of the random field σ. 2. Construct the random walk over which the random estimator ξ(x| σ) is calculated. 3. Repeat 1. and 2. N times, and take the arithmetic mean. Suppose one needs to evaluate the covariance of the solution. Let us denote the random trajectory by ω. It is not difficult to show that u(x, σ) u(y, σ) = E(ω1 ,ω2 ,σ) [ξω1 (x, ω)ξω2 (y, ω)] . The algorithm for calculation of u(x, σ) u(y, σ) follows from this relation: 1. Choose a sample of the random field σ. 2. Having fixed this sample, construct two conditionally independent trajectories ω1 and ω2 , starting at x and y, respectively, and evaluate ξω1 (x, ω)ξω2 (y, ω). 3. Repeat 1. and 2. N times, and take the arithmetic mean.
4
Random Loads
Here we solve a problem from elastography: measuring the correlation functions of random displacements under isotropic random loads we try to recover the elastic properties of a molecule consisting of 17 atoms, see Fig. 5, left picture. The spectral tensor of an isotropic random load has the following general structure: Sij (k) = [SLL (k) − SN N (k)]
ki kj + SN N (k)δij , k2
k = |k| .
Here SLL and SN N are longitudinal and transversal spectra, respectively. We consider incompressible random field, SLL = 0, hence in this case, SN N (k) = E(k)/(2πk)
Stochastic Simulation
33
1.8
Longitudinal corr. B(u) (x) 11
1.6 1.4 1.2 1
Transverse corr. B(u)(x) 22
0.8 0.6 0.4
σ=1, L=0.5; α=2
0.2 0 −0.5
0
0.5
1
1.5
x
2
Fig. 4. Longitudinal and transverse correlation functions of the elastic displacements, for fixed intensity fluctuations σ, and different correlation lengths L and elasticity constant α
0.015 0.01
α = 0.90 = 0.75 = 0.50 = 0.36
B (x) 21
0.005
solid dash dash−dot dot
0 −0.005 B (x)
−0.01 −0.015
12
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
X
5
Fig. 5. Elastic molecule consisting of 17 atoms (left panel), and the cross correlation function of the elastic displacements for different values of the elastic constant α
where E(k) is the energy spectrum. We choose √ 2 2 L −Lk2 E(k) = σ √ e , k = |k| , π here L is a positive parameter, characterizing the correlation length. The Lam´e equation was solved by the Random Walk on spheres method, the calculated correlation functions are plotted in Figures 4 and 5 which show that the elasticity constant can be recovered from the measurements of the correlation functions. For details see [4].
34
5
K. Sabelfeld
Respond to Boundary Excitations
In this section we consider PDEs with random boundary conditions, in particular, the Dirichlet problem for Laplace and Lam´e equations with white noise excitations on the boundary. For details and proofs of Theorems 1-5 see [5,7]. 5.1
Inhomogeneous Random Fields and Karhunen-Lo` eve Expansion
The Karhunen-Lo`eve expansion of an inhomogeneous random field u(x) has the form ∞ u(x) = λk ξk hk (x) . k=1
By definition, B(x1 , x2 ) is bounded, symmetric and positive definite, so the ∞ Hilbert-Schmidt theory says that B(x1 , x2 ) = λk hk (x1 ) hk (x2 ) where the k=1
eigen-values and eigen-functions are the solutions of the following eigen-value problem for the correlation operator: B(x1 , x2 ) hk (x1 ) dx1 = λk hk (x2 ) . G
The eigen-functions form a complete orthogonal set
hi (x) hj (x) dx = δij and
G
1 ξk = √ λk
u(x) hk (x) dx ,
E ξk = 0,
Eξi ξj = δij .
G
Theorem 1. The solution of the Dirichlet problem for the Laplace equation in a disc K(x0 , R) with the white noise boundary function g(y) is an inhomogeneous 2D Gaussian random field uniquely defined by its correlation function u(r1 , θ1 ) u(r2 , θ2 ) = Bu (ρ1 , θ1 ; ρ2 , θ2 ) =
1 1−ρ21 ρ22 2π 1−2ρ1 ρ2 cos(θ2 −θ1 )+ρ21 ρ22
which is harmonic, and it depends only on the angular difference θ2 − θ1 and the product of radial coordinates ρ1 ρ2 = r1 r2 /R2 . The random field u(r, θ) is thus homogeneous with respect to the angular coordinate θ, and its partial discrete spectral density has the form fθ (0) = 1/2π, fθ (k) = (ρ1 ρ2 )k /π, k = 1, . . . . The Karhunen-Lo`eve expansion of the solution has the form ∞ 1 k ξ0 ρ ξk cos(k θ) + ηk sin(k θ) u(r, θ) = √ + √ π 2π k=1
where {ξk }, {ηk } are sets of mutually independent standard Gaussian random variables.
Stochastic Simulation
35
Theorem 2. Assume the boundary function g in the Dirichlet problem is a homogeneous random process with a continuous correlation function Bg (ψ). Then the solution of this problem is partially homogeneous with respect to the angular coordinate, and its correlation function Bu (ρ1 , θ1 ; ρ2 , θ2 ) depends on the angular difference ψ = θ2 − θ1 and the product ρ1 ρ2 , and is explicitly given by the convolution Bu = K ∗ Bg , i.e. , by the Poisson formula 1 Bu (ρ1 ρ2 ; ψ) = 2π
2π
K(ρ1 ρ2 ; ψ − ψ ) Bg (ψ ) dψ
0
which implies that the correlation function Bu (ρ, θ) is harmonic in the unit disc, and it is the unique solution of the Dirichlet boundary value problem ΔBu = 0,
Bu |ρ→1 = Bg .
Theorem 3. Let u(x, y), x = (x1 , . . . xn−1 ) be a random field defined in the halfspace D+ = IRn+ as a harmonic function with the boundary condition u|y=0 = g where g is a zero mean homogeneous random field on the boundary {y = 0} with the correlation function Bg (x) which is bounded in dimension n = 2, or tends to zero as |x| → ∞ if n > 2. Then Bu (x, y) = Bu (x2 − x1 , y1 + y2 ), the correlation function of the solution, is a harmonic function in IRn+ , and is related to Bg by the Poisson type formula: (y1 + y2 ) Bg (x ) dS(x ) Γ (n/2) Bu (x2 − x1 , y1 + y2 ) = . π n/2 [(x − (x2 − x1 ))2 + (y1 + y2 )2 ]n/2 ∂D+
Remark. In practice, it is often important to know the statistical structure of the gradient of the solution. Using the Fourier transform we find that Buxi = −
∂ 2 Bu , ∂xi
i = 1, . . . , n − 1,
Buy =
∂ 2 Bu . ∂2y
Note that since the correlation function Bu is harmonic, this implies the following n−1 remarkable property: Buy = Buxi . So in dimension two, Buy = Bux . i=1
Theorem 4. The exact Karhunen-Lo`eve representations for the covariance tensor and the random field (ur , uθ )T which solves the Lam´e equation under the boundary white noise excitations are given by Bu (ρ1 , θ1 ; ρ2 , θ2 ) = ⎧ ⎫ ∞ ∞ ρ1 ρ2 1 1 k k k k ⎪ ⎪ ⎪ ⎪ + Λ ρ ρ cos [k(θ − θ )] Λ ρ ρ sin [k(θ − θ )] 11 1 2 2 1 12 1 2 2 1 ⎪ ⎪ 2π π π ⎪ ⎪ ⎪ ⎪ k=1 k=1 ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ , ⎪ ⎪ ⎪ ⎪ ∞ ∞ ⎪ ⎪ ⎪ 1 ⎪ ρ1 ρ2 1 k k k k ⎪ ⎪ ⎩ π Λ21 ρ1 ρ2 sin [k(θ2 − θ1 )] + Λ ρ ρ cos [k(θ − θ )] ⎭ 22 2 1 1 2 2π π k=1
k=1
36
K. Sabelfeld
Λ11 = λ11 (ρ1 , k) λ11 (ρ2 , k) + λ12 (ρ1 , k) λ12 (ρ2 , k), Λ12 = λ11 (ρ1 , k) λ21 (ρ2 , k) − λ12 (ρ1 , k) λ22 (ρ2 , k), where the functions λij are known explicitly, see [5], e.g., λ11 (ρ, k) =
2(λ + 2μ) k (λ + μ)(1 − ρ2 ) 1 2μ ρ + + , 2(λ + 3μ) ρ ρ
and ur (r, θ) =
∞ 1 ξ0 ρ + λ11 ρk ξk cos kθ + ηk sin kθ 2π π k=1
+
∞ 1 λ12 ρk − ηk cos kθ + ξk sin kθ , π k=1
uθ (r, θ) =
∞ 1 ξ0 ρ + λ21 ρk − ηk cos kθ + ξk sin kθ 2π π k=1
+
∞ 1 λ22 ρk ξk cos kθ + ηk sin kθ , π k=1
where {ξk , ηk } and {ξk , ηk }, k = 0, 1, 2, . . . are two independent families of standard independent gaussian random variables. Thus the random field is homogeneous with respect to the angular variable, and the respective partial spectra are: = m the spectrum is pure Smm (k) = π1 Λmm ρk1 ρk2 , Smm (0) = ρ1 ρ2 /2π, and for n imaginary: Smn (k) = i π1 Λmn ρk1 ρk2 . The results of calculations for the longitudinal and cross correlations are shown in Fig. 6 for different values of the elasticity constant. These can be used for recovering the elastic properties of the body by measuring the correlations in respond to random boundary excitations.
6
Fractal Density of the Universe
Let us consider a fractional model of the density of the Universe 1−a
(−Δ) 2 u(x, y) = 0, u|∂Rn+ = f (ξ)
x ∈ IRn ,
y ∈ IR+
In [7], we have given the following statement which proves an existence of a model for the mass distribution in the Universe with a correlation function having very heavy tails characterizing the fractal nature of this mass distribution, see Fig. 7 (left panel) and a sample of the fractal density of the Universe (right panel).
Stochastic Simulation
37
Fig. 6. The longitudinal (left panel) and the cross correlation (right panel) functions, for different values of the elastic constant
Theorem 5. The random field u(X), X ∈ IRn × [0, ∞) solving the boundary value problem for the Lam´e equation with the Gaussian white noise f is partially isotropic, that is, its correlation function depends on x = |x1 −x2 |, and on y1 , y2 . It is uniquely defined by its correlation function which has the form 1 Bu (X1 , X2 ) =
2 Cn,a
A(y1 , y2 ) 0
t
n−a−1 2
(1 − t)
n−a−1 2
dt 2 n +1−a y1 t + y22 (1 − t) + t(1 − t)|x1 − x2 |2 2
π n/2 Γ n2 + 1 − a 1−a 1−a A(y1 , y2 ) = n+1−a 2 y1 y2 . Γ 2
where
The partial spectral function is explicitly given by S(k; y1 , y2 ) = C 2 (|k|y1 )
1−a 2
(|k|y2 )
1−a 2
K 1−a (2π|k|y1 ) K 1−a (2π|k|y2 ) 2
2
1−a
where C = 2π 2 /Γ ( 1−a 2 ) , and Kν (z) is the modified Bessel function of the −Iν (z) second kind, also known as the Macdonalds function: Kν (z) = π2 Iνsin(νπ ), where ν Iν (z) = i Jν (iz) is the modified Bessel function in turn defined via Jν , the Bessel function of a pure imaginary argument iz. The random field u(x, y) has the following Randomized Spectral approximation: u(x, y) ≈ V (x, y) = C
(|k|y)
1−a 2
K 1−a (2π |k|y) 2 ξ cos(2πk · x) + η sin(2πk · x) p(k)
where k is a random vector in IRn distributed with a density p(k) which can be chosen quite arbitrarily, satisfying the condition p(k) = 0 if K 1−a (|k|y) = 0, and 2 ξ, η are independent standard Gaussian random variables.
38
K. Sabelfeld
Fig. 7. Correlations of the fractal density of the Universe, for different values of the fractal parameter a, in 3D (left panel) and a sample of the fractal density (right panel)
Acknowledgments The author thanks the organizers of the conference, and acknowledges the support of the RFBR under Grants N 06-01-00498, 09-01-12028-ofi-m, and a joint BMBF and Bortnik Funds Grant.
References Random field simulation. 1. Kramer, P., Kurbanmuradov, O., Sabelfeld, K.: Comparative Analysis of Multiscale Gaussian Random Field Simulation Algorithms. Journal of Computational Physics 226, 897–924 (2007) 2. Kurbanmuradov, O., Sabelfeld, K.K.: Stochastic spectral and Fourier-wavelet methods for vector Gaussian random field. Monte Carlo Methods and Applications 12(5-6), 395–446 (2006) 3. Kurbanmuradov, O., Sabelfeld, K.: Convergence of Fourier-Wavelet models for Gaussian random processes. SIAM Journal on Numerical Analysis 46(6), 3084– 3112 (2008) Random loads in elasticity. 4. Sabelfeld, K., Shalimova, I., Levykin, A.: Stochastic simulation method for a 2D elasticity problem with random loads. Probabilistic Engineering Mechanics 24(1), 2–15 (2009) Random boundary excitations. 5. Sabelfeld, K.: Expansion of random boundary excitations for some elliptic PDEs. Monte Carlo Methods and Applications 13(5-6), 403–451 (2007)
Stochastic Simulation
39
6. Sabelfeld, K.K.: Stokes flows under random boundary velocity excitations. J. Stat. Physics 133(6), 1107–1136 (2008) 7. Sabelfeld, K.: Expansion of random boundary excitations for the fractional Laplacian. Journal of Cosmology and Astroparticle Physics 10(004) (2008), doi:10.1088/ 1475-7516/2008/10/004 8. Sabelfeld, K.K., Shalimova, I.A.: Elastic half-plane under random displacement excitations on the boundary. J. Stat. Physics 132(6), 1071–1095 (2008) Nucleation, coagulation processes and Ostwald ripening. 9. Kaganer, V.M., Ploog, K.H., Sabelfeld, K.K.: Dynamic coalescence kinetics of facetted 2D islands. Physical Review B 73(11) (2006) 10. Kaganer, V.M., Braun, W., Sabelfeld, K.K.: Ostwald ripening of faceted twodimensional islands. Physical Review B 76, 075415 (2007) Footprint problem in canopy. 11. Vesala, T., Kljun, N., Rannik, U., Rinne, J., Sogachev, A., Markkanen, T., Sabelfeld, K., Foken, T., Leclerc, M.Y.: Flux and concentration footprint modeling: state of the art. Environmental Pollution 152(3), 653–666 (2008) Turbulence simulation and transport in porous media. 12. Suciu, N., Vamos, C., Vereecken, H., Sabelfeld, K., Knabner, P.: Memory Effects Induced by Dependence on Initial Conditions and Ergodicity of Transport in Heterogeneous Media. Water Resources Research 44, W08501 (2008) 13. Sabelfeld, K.: Random Field Simulation and Applications. In: Niederreiter, H., Heinrich, S., Keller, A. (eds.) Invited paper of the Plenary Lecture. Proceedings of the 7th International Conference on Monte Carlo and Quasi-Monte Carlo Methods in Scientific Computing, Ulm, Germany, August 14-18, 2006, pp. 143–166. Springer, Heidelberg (2007) 14. Sabelfeld, K., Kurbanmuradov, O., Levykin, A.: Stochastic simulation of particle transport by a random flow inside a porous cylinder. Monte Carlo Methods and Applications 15(1) (2009) 15. Kurbanmuradov, O., Sabelfeld, K.: Stochastic flow simulation and particle transport in a 2D layer of random porous medium. Submitted to Transport in Porous media (2009) X-Ray diffraction peaks in the dislocation analysis. 16. Kaganer, V., Brandt, O., Riechert, H., Sabelfeld, K.: X-ray diffraction of epitaxial films with arbitrarily correlated dislocations: Monte Carlo calculation and experiment. Phys. Rev. B 80, 033306 (2009) 17. Kaganer, V., Sabelfeld, K.: X-ray diffraction peaks from partially ordered misfit dislocations. Phys. Rev. B 80, 184105 (2009)
On Finite Element Error Estimates for Optimal Control Problems with Elliptic PDEs Fredi Tr¨ oltzsch Technische Universit¨ at Berlin, Institut f¨ ur Mathematik 10623 Berlin, Str. d. 17. Juni 136, Sekr. MA 4-5, Germany Abstract. Discretizations of optimal control problems for elliptic equations by finite element methods are considered. The problems are subject to constraints on the control and may also contain pointwise state constraints. Some techniques are surveyed to estimate the distance between the exact optimal control and the associated optimal control of the discretized problem. As a particular example, an error estimate for a nonlinear optimal control problem with finitely many control values and state constraints in finitely many points of the spatial domain is derived.
1
Introduction
In this paper, we consider optimal control problems of the type min J(y, u) := subject to
1 λ y − yd 2 + u2 2 2
−Δy = u y=0
in Ω on Γ,
(1)
(2)
where also further constraints u ∈ Uad (control constraints) and y ∈ Yad (state constraints) may be given. In this setting, Ω ⊂ IR2 is a convex, bounded and polygonal domain, yd ∈ L2 (Ω) a given desired state, and λ > 0 is a fixed regularization parameter. By · , the natural norm of L2 (Ω) is denoted. In the last section, B(u, ρ) ⊂ IRm is the open ball of radius ρ around u. In the paper, c is a generic constant that is more or less arbitrarily adapted. Our main issue is to estimate the error arising from a finite element discretization of such problems. First, we consider the problem without control and state constraints. Next, we explain how the error can be estimated, if control constraints are given. We briefly survey recent results on probems with state constraints and discuss finally a problem with finite-dimensional control space and state constraints given in finitely many points of the spatial domain.
2 2.1
Optimal Control Problem with Control Constraints The Unconstrained Optimal Control Problem
The Problem. Let us start the tour through error estimates by a quite simple approach for the unconstrained problem (1), (2), where no further constraint on u or y are given, i.e. Uad = L2 (Ω) and Yad = L2 (Ω). I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 40–53, 2010. c Springer-Verlag Berlin Heidelberg 2010
On Finite Element Error Estimates for Optimal Control Problems
41
For all u ∈ L2 (Ω), there is exactly one state y = y(u) ∈ H 2 (Ω) ∩ H01 (Ω) and the mapping Λ : L2 (Ω) → H 2 (Ω) ∩ H01 (Ω), Λ : u → y(u), is continuous. We consider Λ also with range in L2 (Ω) and denote this “solution operator” by S, i.e. S = EH 2 →L2 Λ, where EH 2 →L2 denotes the continuous injection of H 2 (Ω) in L2 (Ω). By S, we are able to formally eliminate the PDE and to transform the problem to the control reduced quadratic optimization problem (P )
min f (u) := J(Su, u) =
u∈L2 (Ω)
1 λ Su − yd 2 + u2 . 2 2
The existence of a unique (optimal) solution of this problem is a standard result. In all what follows, we denote by u¯ the unique optimal control and by y¯ = y(¯ u) the associated optimal state. All further results on the necessary optimality conditions for (P) and its version with additional control constraints are stated without proofs. They are discussed extensively in the forthcoming textbook [1]. Necessary and Sufficient Optimality Condition. It is clear that f (¯ u) = 0 is necessary for the optimality of u ¯, hence u) = S ∗ (S u ¯ − yd ) + λ¯ u=0 f (¯ must hold, where S ∗ denotes the adjoint operator of S. It is very useful to introduce an auxiliary function p by p¯ := S ∗ (S u ¯ − yd ). This function p¯ is called adjoint state associated with u ¯. Therefore, we have p¯ + λ¯ u = 0.
(3)
The adjoint state p¯ is the solution of the adjoint equation −Δp = y¯ − yd p=0
in Ω on Γ.
(4)
To determine the unknown triplet (¯ y, u ¯, p¯), we have to solve the optimality system (2), (4), (3). Invoking (3), i.e. inserting u = −λ−1 p in the state equation, the optimality system −Δy + λ−1 p = 0 y = 0,
−Δp − y = −yd p=0
in Ω on Γ
(5)
is obtained. Having solved this, we obtain the optimal control by u¯ := −λ−1 p. Discretized Problem and Error Estimate. We assume a regular triangulation T of Ω with mesh size h, triangles Ti , and piecewise linear and continuous ansatz functions Φi , i = 1, . . . , n, which generate the finite-dimensional subspace Vh = span {Φ1 , . . . , Φn } ⊂ H01 (Ω). We do not explain the standard notion of
42
F. Tr¨ oltzsch
regularity of the triangulation. Instead, we just assume that the standard finite element error estimate (7) below is satisfied. In the discretized problem, the state yh associated with u is determined by yh ∈ Vh and ∇yh · ∇Φi dx = u Φi dx ∀i = 1, . . . , n. (6) Ω
Ω
To each u ∈ L2 (Ω), there exists exactly one solution yh ∈ H01 (Ω) of (6) denoted by yh (u). From the finite element analysis for regular grids, the error estimate h yh (u) − y(u)H 1 (Ω) + yh (u) − y(u) ≤ c h2 u
(7)
is known for all sufficiently small h > 0, where the constant c does not depend on u or h. Let us introduce the mapping Sh : L2 (Ω) → L2 (Ω), Sh : u → yh (u). In terms of S and Sh , (7) is equivalent to S − Sh L2 (Ω)→L2 (Ω) ≤ c h2 .
(8)
Analogously to the former section, the discretized optimal control problem can be formulated in control reduced form as (Ph )
min fh (u) :=
1 λ Sh u − yd 2 + u2. 2 2
This discretized problem has a unique optimal control denoted by u¯h with associated state y¯h . The reader might be surprised that the control u in (Ph ) is not discretized. In fact, we do not need this here, since the optimality conditions will automatically imply uh ∈ Vh . ¯. We write down the necessary optiIt is easy to estimate the error ¯ uh − u mality conditions for both optimal controls, S ∗ (S u ¯ − yd ) + λ¯ u=0 ¯h − yd ) + λ¯ uh = 0, Sh∗ (Sh u multiply them scalarly by u¯ − u¯h , subtract the results and re-order. This yields u−u ¯h )2 + λ¯ u−u ¯h 2 ≤ | (yd , (S − Sh )(¯ u − u¯h )) | Sh (¯ ≤ yd c h2 ¯ u−u ¯h , where (· , ·) denotes the natural inner product of L2 (Ω). Consequently, we have the L2 error estimate (9) ¯ u−u ¯h ≤ c λ−1 h2 yd . This was easy, since we considered the unconstrained case that is not really interesting in optimization. Let us include also constraints on the control.
On Finite Element Error Estimates for Optimal Control Problems
2.2
43
Constraints on the Control
Optimality Conditions. Let ua < ub be two real numbers. Consider the control-constrained problem (P C)
min f (u) =
u∈Uad
1 λ Su − yd 2 + u2 2 2
with Uad = {u ∈ L2 (Ω) : ua ≤ u(x) ≤ ub a.e. in Ω}. Again, the problem has a unique optimal control u¯. However, due to the conu) = 0. Instead, the variational inequality straints, we cannot expect that f (¯ f (¯ u)(u − u ¯) ≥ 0 ∀u ∈ Uad is necessary and sufficient for optimality of u¯. It expresses the intuitively clear observation that, in a minimum, the function f cannot decrease in any feasible direction. In terms of S, this means ¯ − yd ) + λ¯ u, u−u ¯) ≥ 0 (S ∗ (S u or equivalently
∀u ∈ Uad
(10)
p(x) + λ¯ u(x) (u(x) − u ¯(x)) dx ≥ 0 ∀u(·) ∈ Uad .
Ω
A simple pointwise discussion of this inequality reveals that almost everywhere u(x) > 0 ua if p(x) + λ¯ u¯(x) = (11) u(x) < 0 ub if p(x) + λ¯ u(x) = 0. From this, one and we have, of course, u¯(x) = −λ−1 p(x) if p(x) + λ¯ derives with some effort the well-known projection formula 1 a.e. in Ω, (12) u ¯(x) = IP[ua ,ub ] − p(x) λ where IP[ua ,ub ] : IR → [ua , ub ] denotes projection onto [a, b]. This projection ¯ can exhibit corners in the formula shows that, although we have p ∈ H 2 (Ω), u points, where the bounds ua and ub are reached. Hence in general we can only expect u ∈ H 1 (Ω). Moreover, (¯ y, u ¯, p¯) cannot be obtained from a smooth coupled system of PDEs as (5). Therefore, error estimates are more difficult. They also depend on the way how the control function u is discretized. Discretization by Piecewise Constant Controls. The most common way of control discretization is working with piecewise constant controls. Here, the set of admissible discretized controls is defined by h Uad = {ua ≤ u(·) ≤ ub : u is constant on each triangle Ti }
44
F. Tr¨ oltzsch
and the associated discretized problem is (P Ch )
min fh (uh ),
h uh ∈Uad
where fh and yh are defined as in the last section. The difference to (Ph ) consists h . Let u¯h denote the unique optimal control of (P Ch ) in the appearance of Uad and let y¯h be the associated (discretized) state. Then a discrete counterpart to the variational inequality (10) must be satisfied, h (Sh∗ (Sh u¯h − yd ) + λ¯ uh , uh − u ¯h ) ≥ 0 ∀uh ∈ Uad .
By the discrete adjoint state ph := Sh∗ (Sh u ¯h − yd ), this is equivalent to 1 u ¯(x) = IP[ua ,ub ] − ph (x) dx ∀ x ∈ Ti , ∀ i = 1, . . . , M, λ |Ti | Ti
(13)
(14)
where $|T_i|$ is the area of the triangle $T_i$. Now we cannot derive an error estimate in the same way as before. First, $\bar u$ cannot be inserted in (13), as it is not piecewise constant. As a substitute, we use the interpolant $\Pi_h\bar u$ defined by
$$(\Pi_h\bar u)(x) := \frac{1}{|T_i|}\int_{T_i}\bar u(x)\,dx \quad \forall x \in T_i. \qquad (15)$$
It holds that $\Pi_h\bar u \in U_{ad}^h$ and, with some $c > 0$ not depending on $h$,
$$\|\bar u - \Pi_h\bar u\| \le c\,h. \qquad (16)$$
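As a small illustration of (15)-(16), the following Python sketch (a toy 1D analogue with hypothetical names; the paper itself works on a 2D triangulation) computes the cell-average interpolant of a given control and confirms numerically that the interpolation error behaves like O(h).

import numpy as np

def cell_average_interpolant(u, a, b, M):
    """Piecewise constant interpolant: on each cell the value is the mean
    of u over that cell (1D analogue of formula (15))."""
    edges = np.linspace(a, b, M + 1)
    means = np.empty(M)
    for i in range(M):
        xs = np.linspace(edges[i], edges[i + 1], 50)   # fine quadrature points
        means[i] = np.mean(u(xs))
    return edges, means

def interpolation_error(u, a, b, M):
    """Discrete L2-type error ||u - Pi_h u|| evaluated on a fine grid."""
    edges, means = cell_average_interpolant(u, a, b, M)
    x = np.linspace(a, b, 20 * M, endpoint=False) + (b - a) / (40 * M)
    idx = np.minimum(((x - a) / (b - a) * M).astype(int), M - 1)
    return np.sqrt(np.mean((u(x) - means[idx]) ** 2) * (b - a))

u = lambda x: np.clip(np.sin(3 * x), -0.5, 0.8)    # a control with kinks
for M in (10, 20, 40, 80):
    print(M, interpolation_error(u, 0.0, 1.0, M))   # error roughly halves as h does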
Now, we might insert $\bar u_h$ in (10) and $\Pi_h\bar u$ in (13), add the two inequalities obtained, and resolve for $\|\bar u - \bar u_h\|$ to estimate the error. This procedure only yields a non-optimal estimate of the order $\sqrt{h}$. Some further tricks avoid the square root, cf. [2] or the early papers [3,4]. Below, we explain a perturbation approach. Its main idea goes back to [5] and was applied in [6] to nonlinear elliptic problems. The function $\Pi_h\bar u$ will only fulfill the variational inequality (13) if it is optimal for $(PC_h)$ by chance. However, it satisfies the variational inequality for the perturbed control problem
$$(PC_{\zeta_h}) \qquad \min_{u_h \in U_{ad}^h}\Big( f_h(u_h) + \int_\Omega \zeta_h(x)\,u_h(x)\,dx\Big),$$
if $\zeta_h(x)$ is defined such that $\Pi_h\bar u$ satisfies the associated projection formula
$$\Pi_h\bar u(x) = \mathbb{P}_{[u_a,u_b]}\Big(-\frac{1}{\lambda\,|T_i|}\int_{T_i}\big(p_h(x) + \zeta_h(x)\big)\,dx\Big) \quad \forall x \in T_i,\ \forall i = 1, \dots, M.$$
Notice that the derivative of the perturbed functional at a function $u$ is equal to $S_h^*(S_h u - y_d) + \zeta_h + \lambda u = p_h + \zeta_h + \lambda u$, so that $p_h + \zeta_h$ plays the role of the former $p_h$. How must the function $\zeta_h$ be constructed? If $u_a < \Pi_h\bar u(x) < u_b$ holds in a triangle $T_i$, then $\lambda|T_i|\,\Pi_h\bar u(x) + \int_{T_i}(p_h + \zeta_h)\,dx = 0$ must hold on $T_i$. This follows from (14), applied to $\Pi_h\bar u$ and $(PC_{\zeta_h})$. Therefore, we define $\zeta_h$ by
$$\zeta_h(x) \equiv -\lambda\Big(\Pi_h\bar u(x) + \frac{1}{\lambda\,|T_i|}\int_{T_i} p_h\,dx\Big) \quad \text{on } T_i.$$
If, on $T_i$, $\Pi_h\bar u(x) \equiv u_a$, then $\lambda|T_i|\,\Pi_h\bar u(x) + \int_{T_i}(p_h + \zeta_h)\,dx \ge 0$ must hold on $T_i$ (adapt (11) to $(PC_{\zeta_h})$). To compensate negative values, we define $\zeta_h$ by
$$\zeta_h(x) \equiv \lambda\Big[\Pi_h\bar u(x) + \frac{1}{\lambda\,|T_i|}\int_{T_i} p_h\,dx\Big]_- \quad \text{on } T_i,$$
where $a_- = (|a| - a)/2 \ge 0$ denotes the negative part of a real number $a$. Analogously, we define $\zeta_h := -[\dots]_+$ via the associated positive part to compensate a positive value, if $\Pi_h\bar u(x) \equiv u_b$. It is not difficult to show that
$$\|\zeta_h\| \le c\,\|\bar u - \Pi_h\bar u\| \le \tilde c\,h.$$
Now we proceed similarly as in the last section. We insert $\Pi_h\bar u$ in the variational inequality (13) for $\bar u_h$ and insert $\bar u_h$ in the perturbed variational inequality for $\Pi_h\bar u$ to obtain
$$(S_h^*(S_h\bar u_h - y_d) + \lambda\bar u_h,\, \Pi_h\bar u - \bar u_h) \ge 0,$$
$$(S_h^*(S_h\Pi_h\bar u - y_d) + \zeta_h + \lambda\Pi_h\bar u,\, \bar u_h - \Pi_h\bar u) \ge 0.$$
Adding both inequalities, we obtain after some re-ordering, and ignoring the term $\|S_h(\bar u_h - \Pi_h\bar u)\|^2$,
$$\|\bar u_h - \Pi_h\bar u\|^2 \le \lambda^{-1}(\zeta_h,\, \bar u_h - \Pi_h\bar u) \le \lambda^{-1}\|\zeta_h\|\,\|\bar u_h - \Pi_h\bar u\|.$$
In view of (16), an obvious application of the triangle inequality finally yields
$$\|\bar u_h - \bar u\| \le c\,h \qquad (17)$$
with some $c > 0$ not depending on $h$. This is the optimal error estimate for piecewise constant approximations of $\bar u$. The same order of the error can be derived for problems with semilinear elliptic equations, both for distributed and boundary controls, [6,7]. However, due to non-convexity, the situation is more delicate. Different locally optimal controls might appear. They should satisfy a second-order sufficient optimality condition in order to have a unique approximating locally optimal control in a certain neighborhood, cf. also the discussion in the last section. The error analysis for piecewise linear controls is more difficult. We refer only to [8] for Neumann and to [9,10,11] for Dirichlet boundary control problems. In the Neumann case, the order $h^{3/2}$ can be expected for the error.
Variational Discretization. The situation is easier if the control functions are (formally) not discretized, i.e. if we consider the discretized problem
$$(P_h) \qquad \min_{u \in U_{ad}} f_h(u).$$
At first glance, this consideration seems to be useless. How should one be able to compute the optimal control without any kind of discretization? However, take a look at the finite element approximation of the optimality system
$$-\Delta y = \mathbb{P}_{[u_a,u_b]}\{-\lambda^{-1}p\}, \quad -\Delta p - y = -y_d \ \text{ in } \Omega, \qquad y = 0, \quad p = 0 \ \text{ on } \Gamma, \qquad (18)$$
where $u$ is eliminated by the projection formula (12). This nonsmooth nonlinear system can be solved numerically to obtain the (discrete) state $y_h$ and adjoint state $p_h$, [12]. Then $\bar u_h$ is found by $\bar u_h = \mathbb{P}_{[u_a,u_b]}\{-\lambda^{-1}p_h\}$. It is piecewise linear but does not in general belong to $V_h$. This approach of variational discretization is also useful in iterative methods of optimization, cf. [13]. For this variational discretization, an error estimate of the order $h^2$ is easily derived: we repeat a similar procedure as before and insert $\bar u_h$ in the variational inequality for $\bar u$, and $\bar u$ in the variational inequality for $\bar u_h$ (all $u \in U_{ad}$ are now admitted!),
$$(S^*(S\bar u - y_d) + \lambda\bar u,\, \bar u_h - \bar u) \ge 0, \qquad (S_h^*(S_h\bar u_h - y_d) + \lambda\bar u_h,\, \bar u - \bar u_h) \ge 0.$$
Next, we add both inequalities and re-order as we proceeded to derive (9),
$$\|S_h(\bar u - \bar u_h)\|^2 + \lambda\|\bar u - \bar u_h\|^2 \le |(y_d, (S - S_h)(\bar u - \bar u_h))| \le \|y_d\|\,c\,h^2\,\|\bar u - \bar u_h\|.$$
Consequently, we have the optimal $L^2$-estimate
$$\|\bar u - \bar u_h\|_{L^2(\Omega)} \le c\,\lambda^{-1}h^2\,\|y_d\|.$$
The same optimal order can be obtained under natural assumptions with piecewise constant controls by one smoothing step, cf. [14].
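To make the variational discretization concrete, the following sketch (a 1D finite-difference analogue on the unit interval, with hypothetical parameter values; the paper works with finite elements in 2D) solves the nonsmooth optimality system (18) by a simple fixed-point iteration and recovers the control from the adjoint state via the projection formula.

import numpy as np

def solve_dirichlet_poisson(rhs, h):
    """Solve -w'' = rhs on a uniform interior grid with w = 0 at both ends."""
    n = rhs.size
    A = (np.diag(2.0 * np.ones(n)) - np.diag(np.ones(n - 1), 1)
         - np.diag(np.ones(n - 1), -1)) / h**2
    return np.linalg.solve(A, rhs)

def variational_discretization(yd, lam, ua, ub, h, iters=50):
    """Fixed-point iteration for the 1D analogue of system (18):
    -y'' = P_[ua,ub](-p/lam),  -p'' = y - yd,  homogeneous Dirichlet data."""
    p = np.zeros(yd.size)
    for _ in range(iters):
        u = np.clip(-p / lam, ua, ub)          # projection formula, cf. (12)
        y = solve_dirichlet_poisson(u, h)      # state equation
        p = solve_dirichlet_poisson(y - yd, h) # adjoint equation
    return np.clip(-p / lam, ua, ub), y, p

n, h = 99, 1.0 / 100
x = np.linspace(h, 1 - h, n)
u_opt, y_opt, p_opt = variational_discretization(np.sin(np.pi * x), lam=0.1,
                                                 ua=-0.5, ub=0.5, h=h)

The fixed-point map is contractive here because lam is not too small; for smaller regularization parameters a (semismooth) Newton method would be the natural choice.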
3 Problems with State Constraints
3.1 Available Error Estimates
Here, we also admit state constraints. Now the error analysis is more difficult. Currently, this is a very active field of research. A detailed survey of the relevant contributions would go beyond the scope of this paper. We mention first results for problems with finitely many state constraints in [15] and the convergence of discretizations for pointwise state constraints of the type $y(x) \le c$ a.e. in $\Omega$ in [16]. To allow a comparison with the results of the next section, we also state recent results for elliptic problems with pointwise state constraints: in [17], for $\|\bar u - \bar u_h\|$ the order $h^{1-\varepsilon}$ was derived for $\Omega \subset \mathbb{R}^2$ and $h^{1/2-\varepsilon}$ for $\Omega \subset \mathbb{R}^3$. Recently, this was improved in [18] to $h|\log h|$ for 2D and $h^{1/2}$ for 3D domains.
3.2 Control in $\mathbb{R}^m$ and State Constraints in Finitely Many Points
Let us discuss a simpler problem with a semilinear elliptic equation, controls in $\mathbb{R}^m$ and state constraints in finitely many points $x_1, \dots, x_\ell$ of $\Omega$:
$$(PS) \qquad \begin{cases} \displaystyle \min_{u \in U_{ad}} J(y_u, u) = \frac{1}{2}\|y_u - y_d\|^2 + \frac{\lambda}{2}|u|^2\\[2pt] \text{subject to}\\ g_i(y_u(x_i)) = 0 \quad \text{for all } i = 1, \dots, k,\\ g_i(y_u(x_i)) \le 0 \quad \text{for all } i = k+1, \dots, \ell, \end{cases}$$
where $y_u$ is the solution to the state equation
$$-\Delta y(x) + d(y(x), u) = 0 \ \text{ in } \Omega, \qquad y(x) = 0 \ \text{ on } \Gamma, \qquad (19)$$
and $U_{ad} = \{u \in \mathbb{R}^m : u_a \le u \le u_b\}$ with given $u_a \le u_b$ in $\mathbb{R}^m$. We assume $\ell \ge 1$ and set $k = 0$ if only inequality constraints are given and $k = \ell$ if only equality constraints are given. We assume for short that $d : \mathbb{R}^2 \to \mathbb{R}$ and $g_i : \mathbb{R} \to \mathbb{R}$, $i = 1, \dots, \ell$, are twice differentiable with locally Lipschitz second-order derivatives and that $d$ is monotone non-decreasing with respect to $y$. In [19], the problem is considered in a slightly more general setting. Thanks to our assumptions, the mapping $u \mapsto y_u$ is continuous from $\mathbb{R}^m$ to $H_0^1(\Omega) \cap C(\bar\Omega)$, hence the values $y_u(x_i)$ are well defined. Therefore, we consider $S : u \mapsto y_u$ as a mapping from $\mathbb{R}^m$ to $H_0^1(\Omega) \cap C(\bar\Omega)$. To convert (PS) into a finite-dimensional nonlinear programming problem, we again define $f : \mathbb{R}^m \to \mathbb{R}$ of class $C^{2,1}$ by $f(u) = J(y_u, u) = J(S(u), u)$. Thanks to our assumptions, in particular the Lipschitz properties of the second derivatives of $d$ and the $g_i$, the mapping $S$ has a locally Lipschitz continuous second-order derivative $S''$. Moreover, we define $G : \mathbb{R}^m \to \mathbb{R}^\ell$ by
$$G(u) := [g_1(y_u(x_1)), \dots, g_\ell(y_u(x_\ell))]^\top. \qquad (20)$$
To cover the equality and inequality constraints in a unified way, we introduce the convex cone $K = \{z \in \mathbb{R}^\ell : z_i = 0,\ i = 1, \dots, k;\ z_i \le 0,\ i = k+1, \dots, \ell\}$ and write $z \le_K 0$ iff $z \in K$. By these definitions, (PS) becomes equivalent to the nonlinear programming problem
$$(N) \qquad \min f(u), \quad G(u) \le_K 0, \quad u \in U_{ad}. \qquad (21)$$
The discretized optimal control problem $(PS_h)$ is defined by substituting $y_u$ by its finite-element approximation $y_{h,u}$, obtained from
$$\int_\Omega \nabla y_h \cdot \nabla v_h\,dx + \int_\Omega d(y_h, u)\,v_h\,dx = 0 \quad \forall v_h \in V_h.$$
Introducing $G_h(u) := [g_1(y_{h,u}(x_1)), \dots, g_\ell(y_{h,u}(x_\ell))]^\top$, we express this problem as the finite-dimensional nonlinear programming problem
$$(N_h) \qquad \min f_h(u), \quad G_h(u) \le_K 0, \quad u \in U_{ad}. \qquad (22)$$
Let $\Omega_0$ and $\Omega_1$ be open sets such that $\{x_1, \dots, x_\ell\} \subset \Omega_0$ and $\bar\Omega_0 \subset \Omega_1 \subset \bar\Omega_1 \subset \Omega$. Then there exists a constant $c > 0$, independent of $h$ and $u \in U_{ad}$, such that
$$\|y_u - y_{h,u}\|_{L^\infty(\Omega_0)} \le c\,\big(h^2|\log h|\,\|y_u\|_{W^{2,\infty}(\Omega_1)} + h^2\|y_u\|_{H^2(\Omega)}\big), \qquad (23)$$
cf. [19]. Thanks to this estimate and to our assumptions, it holds that
$$|f(u) - f_h(u)| + |f'(u) - f_h'(u)| + |f''(u) - f_h''(u)| \le c\,h^2,$$
$$|G(u) - G_h(u)| + |G'(u) - G_h'(u)| + |G''(u) - G_h''(u)| \le c\,h^2|\log h| \qquad (24)$$
for all $u \in U_{ad}$, with some constant $c$ not depending on $h$ and $u$, [19]. In all that follows, $\bar u$ is a locally optimal reference solution of (PS), hence also of (N). We show the existence of an associated locally optimal solution $\bar u_h$ of $(PS_h)$ (or $(N_h)$) converging to $\bar u$ as $h \downarrow 0$. Our main aim is to estimate $|\bar u - \bar u_h|$. For short, we use the abbreviation $\alpha(h) = h^2|\log h|$. Our error analysis is based on three main assumptions. To formulate them, we need some standard definitions of nonlinear programming which are explained below. We first extend the vector $G(u) \in \mathbb{R}^\ell$ to $\hat G(u) \in \mathbb{R}^{\ell+2m}$ by including the box constraints defining $U_{ad}$. We add the $2m$ further components
$$G_{\ell+i}(u) = u_{a,i} - u_i, \qquad G_{\ell+m+i}(u) = u_i - u_{b,i}, \qquad i = 1, \dots, m,$$
and put $\hat G(u) := (G_i(u))_{i=1,\dots,\ell+2m}$. Then all constraints of the problem can be unified as $\hat G(u) \le_K 0$, where $K$ is re-defined accordingly. We define the Lagrangian function $L : \mathbb{R}^m \times \mathbb{R}^{\ell+2m} \to \mathbb{R}$ by
$$L(u, \nu) = f(u) + \sum_{i=1}^{\ell+2m} \nu_i\, G_i(u).$$
The index set $A(\bar u)$ of active constraints at $\bar u$ is defined by $A(\bar u) = \{i \in \{1, \dots, \ell+2m\} : G_i(\bar u) = 0\}$. We now formulate the main assumptions:
Robinson regularity condition: at $\bar u$, it holds that $0 \in \operatorname{int}\{G(\bar u) + G'(\bar u)(U_{ad} - \bar u) + K\}$, where the set in braces is defined as $\bigcup\{G(\bar u) + G'(\bar u)(u - \bar u) + k \mid u \in U_{ad},\ k \in K\}$. It is known that this regularity assumption is sufficient for the existence of a Lagrange multiplier $\bar\nu$ associated with the (locally) optimal solution $\bar u$.
Strong second-order sufficient optimality condition: for the pair $(\bar u, \bar\nu)$, it holds that
$$v^\top \frac{\partial^2 L(\bar u, \bar\nu)}{\partial u^2}\, v > 0 \quad \forall v \in C_{\bar u},\ v \ne 0,$$
where $C_{\bar u} \subset \mathbb{R}^m$ is defined by $C_{\bar u} = \{v \mid G_i'(\bar u)v = 0\ \ \forall i \in \{1, \dots, k\} \cup \{i \in \{k+1, \dots, \ell+2m\} : \bar\nu_i > 0\}\}$.
Linear independence condition of active gradients: this condition is satisfied if all vectors of the set $\{\nabla G_i(\bar u) \mid i \in A(\bar u)\}$ are linearly independent.
Theorem 1. Under the three assumptions stated above, there exists a constant $c > 0$ not depending on $h$ such that, for all sufficiently small $h > 0$, a unique locally optimal control $\bar u_h$ exists in a neighborhood of $\bar u$ and it holds that
$$|\bar u - \bar u_h| \le c\,h^2|\log h|.$$
We do not show this result here in its entirety. Instead, we show an estimate of the order $h|\log h|$. In this way, we prepare the proof of the full order in [19]. First, we approximate admissible vectors for (N) by admissible ones for $(N_h)$ with the order $\alpha(h)$, and vice versa.
Lemma 1. Suppose that $\bar u$ is feasible for (N) and satisfies the Robinson regularity condition. Then there are $c > 0$ independent of $h$ and $h_0 > 0$ such that, for each $h \in (0, h_0)$, an admissible $u_h$ for problem $(N_h)$ exists with
$$|\bar u - u_h| \le c\,\alpha(h). \qquad (25)$$
Proof. To be consistent with the notation of [20], we write
$$G(h, u) = \begin{cases} G_h(u), & \text{if } u \in U_{ad} \text{ and } h > 0,\\ G(u), & \text{if } u \in U_{ad} \text{ and } h = 0,\\ \emptyset, & \text{if } u \notin U_{ad}. \end{cases}$$
Thanks to (24), $G$ and $\partial G/\partial u$ are continuous at the point $(h, u) = (0, \bar u)$. Moreover, we have $G(0, \bar u) \le_K 0$. In view of the Robinson regularity condition, the assumptions of the generalized implicit function theorem in [20] are fulfilled. We obtain the existence of neighborhoods $\mathcal N$ of $h = 0$ and $\mathcal O$ of $\bar u$ such that, for all $h \in \mathcal N$, the inequality $G(h, u) \le_K 0$ has a solution $u \in \mathcal O$, and it holds that
$$\operatorname{dist}[v, \Sigma(h)] \le c\,|G(h, v)_+| \quad \forall h \in \mathcal N,\ \forall v \in \mathcal O, \qquad (26)$$
where $\Sigma(h) = \{u \in U_{ad} \mid G(h, u) \le_K 0\}$ is the solution set of the inequality and dist denotes the Euclidean distance of a point to a set. The value $|G(h, v)_+|$ is the distance of the set $G(h, v) + K$ to the origin and measures the residual of $v$ with respect to the inequality $G(h, v) \le_K 0$, cf. [20], p. 498. Inserting $v = \bar u$ in (26), we deduce
$$\operatorname{dist}[\bar u, \Sigma(h)] \le c\,|G(h, \bar u)_+| \le c\big(|G(0, \bar u)_+| + |G(h, \bar u)_+ - G(0, \bar u)_+|\big) \le 0 + c\,\alpha(h).$$
Hence, there exists $u_h \in \Sigma(h)$ with $|\bar u - u_h| \le c\,\alpha(h)$. The statement is shown.
Lemma 2. Let the reference solution $\bar u$ satisfy the linear independence condition. Then, for all given $\rho > 0$ and all sufficiently small $h > 0$, the auxiliary problem
$$(N_{h,\rho}) \qquad \begin{cases} \min f_h(u)\\ G_h(u) \le_K 0,\\ u \in U_{ad} \cap \operatorname{cl} B(\bar u, \rho) \end{cases} \qquad (27)$$
is solvable. If $\bar u_h$ is any optimal solution to this problem, then an admissible element $v_h$ for (N) exists satisfying, with some $c > 0$ independent of $h$,
$$|\bar u_h - v_h| \le c\,\alpha(h). \qquad (28)$$
Proof. (i) Solvability of $(N_{h,\rho})$: For a positive $h_0$ and all $h \in (0, h_0)$, the admissible set of $(N_{h,\rho})$ is not empty, because the $u_h$ constructed in Lemma 1 satisfies all constraints. The existence of an optimal $\bar u_h$ follows immediately. We have to find $v_h$ in $U_{ad}$ with $G(v_h) \le_K 0$ and $|v_h - \bar u_h| \le c\,\alpha(h)$. Below, we cover the inequality constraints of $U_{ad}$ by the extended vector function $\hat G(u) : \mathbb{R}^m \to \mathbb{R}^{\ell+2m}$ introduced two pages before. In this proof we set $G := \hat G$ and $\ell := \ell + 2m$ to avoid an extensive use of the hat sign. Hence we have to construct $v_h$ such that
$$G_i(v_h) = 0, \quad i = 1, \dots, k, \qquad G_i(v_h) \le 0, \quad i = k+1, \dots, \ell.$$
(ii) Construction of an equation for $v_h$: Notice that $\bar u_h \in \operatorname{cl} B(\bar u, \rho)$ for all $h \le h_0$. Therefore, if $\rho$ is taken small enough, all inactive components $G_i(\bar u)$ are inactive for $\bar u_h$ as well, and there exists $\varepsilon > 0$ such that
$$G_i(\bar u_h) \le -\varepsilon < 0 \quad \forall i \in I,\ \forall h \le h_0, \qquad (29)$$
where $I$ is the set of all inactive indices $i$ of $\bar u$ in $\{k+1, \dots, \ell\}$. Suppose that $r$ constraints are active at $\bar u$, $k \le r \le m$. After renumbering, if necessary, we can assume that those with the numbers $1 \le i \le r$ are active, hence $G_1(\bar u) = \dots = G_r(\bar u) = 0$. By the independence condition, the associated gradients $\nabla G_i(\bar u)$ are linearly independent. If $\rho$ is small enough, $\nabla G_1(\bar u_h), \dots, \nabla G_r(\bar u_h)$ are linearly independent as well. Consider the matrix $B_h = [\nabla G_1(\bar u_h), \dots, \nabla G_r(\bar u_h)]^\top$. Since $B_h$ has full rank $r$, we find an invertible submatrix $D_h$ such that (after renumbering of the components of $u$, if necessary) $B_h = [D_h, E_h]$ holds with some matrix $E_h$. Define $F_h : \mathbb{R}^r \to \mathbb{R}^r$ by
$$F_{h,i}(w) := G_i(w, \bar u_{h,r+1}, \dots, \bar u_{h,m}) - G_{h,i}(\bar u_h), \quad i = 1, \dots, r.$$
To find $v_h$, we fix its $m - r$ last components by $v_{h,i} := \bar u_{h,i}$, $i = r+1, \dots, m$, and determine the first $r$ components as the solution $w$ of the system
$$F_h(w) = 0, \qquad (30)$$
i.e. we set $v_{h,i} := w_i$, $i = 1, \dots, r$.
(iii) Solvability of (30): In this part of the proof, we follow a technique used by Allgöwer et al. [21]. For convenience we define $\bar w_h := (\bar u_{h,1}, \dots, \bar u_{h,r})^\top$, $\bar w := (\bar u_1, \dots, \bar u_r)^\top$ and have
$$|F_h(\bar w_h)| \le c\,\alpha(h), \qquad (31)$$
since $|G_i(\bar u_h) - G_{h,i}(\bar u_h)| \le c\,\alpha(h)$ holds for all $1 \le i \le r$. Thanks to (24) and the Lipschitz assumptions, there exist $\gamma > 0$, $\beta > 0$ with
$$\|F_h'(w_1) - F_h'(w_2)\| \le \gamma\,|w_1 - w_2| \quad \forall w_1, w_2 \in B(\bar w, \rho), \qquad \|(F_h'(w))^{-1}\| \le \beta \quad \forall w \in B(\bar w, \rho)$$
for all $0 \le h \le h_0$, if $\rho$ is taken sufficiently small. Notice that $\partial G(w)/\partial w$ is then close to $\partial G(\bar w)/\partial w$, and this matrix is invertible. Define $\eta > 0$ by $\eta := \beta\,|F_h(\bar w_h)|$. Then (31) implies $\beta\gamma\eta/2 \le 1$ for all $0 < h < h_0$, if $h_0$ is sufficiently small.
Proceeding as in [21], the Mysovskij theorem, cf. Ortega and Rheinboldt [22], p. 412, ensures that the Newton method starting at $w_0 := \bar w_h$ generates a solution $w$ of (30) in the ball $\operatorname{cl} B(\bar w_h, c_0\eta)$, where $c_0$ is a certain constant. It follows from our construction that
$$G_i(v_h) = G_{h,i}(\bar u_h) \begin{cases} = 0, & i = 1, \dots, k,\\ \le 0, & i = k+1, \dots, r. \end{cases}$$
Moreover, if $h$ is small, $G_i(v_h) < 0$ holds for $r < i \le \ell$. Therefore, we have that $G(v_h) \le_K 0$ and $v_h \in U_{ad}$. From $w \in \operatorname{cl} B(\bar w_h, c_0\eta)$ it follows that $|w - \bar w_h| \le c_0\eta \le c\,\alpha(h)$, hence also $|v_h - \bar u_h| \le c\,\alpha(h)$.
Lemma 3. If $\rho > 0$ is taken sufficiently small and $h \in (0, h_0(\rho))$, then all solutions $\bar u_h$ of the auxiliary problem $(N_{h,\rho})$ belong to $B(\bar u, \rho)$. Therefore, they are also locally optimal for the problem $(N_h)$.
Proof. First, we compare the solution $\bar u_h$ of $(N_{h,\rho})$ with the $u_h$ defined in Lemma 1, which is admissible for $(N_{h,\rho})$ and approximates $\bar u$ with the order $\alpha(h)$. We get
$$f_h(\bar u_h) \le f_h(u_h) \le |f_h(u_h) - f_h(\bar u)| + |f_h(\bar u) - f(\bar u)| + f(\bar u).$$
By $|f_h(\bar u) - f(\bar u)| + |u_h - \bar u| + |f_h(\bar u_h) - f(\bar u_h)| \le c\,\alpha(h)$ and by the uniform Lipschitz property of $f_h$, we find
$$f(\bar u_h) \le f(\bar u) + c_1\alpha(h). \qquad (32)$$
Next, we compare $\bar u$ with $v_h$ taken from Lemma 2. The assumed second-order sufficient optimality condition implies a quadratic growth condition. Inserting $v_h$ in this condition, we obtain for small $h$
$$f(v_h) \ge f(\bar u) + \omega\,|\bar u - v_h|^2.$$
From $|\bar u_h - v_h| \le c\,\alpha(h)$ we deduce
$$f(\bar u_h) + c_2\alpha(h) \ge f(\bar u) + \omega\,|\bar u - \bar u_h|^2. \qquad (33)$$
Combining the inequalities (32)-(33), it follows that
$$f(\bar u) + c_1\alpha(h) \ge f(\bar u) + \omega\,|\bar u - \bar u_h|^2 - c_2\alpha(h),$$
and hence we obtain the stated auxiliary error estimate
$$|\bar u - \bar u_h| \le c\,\sqrt{\alpha(h)}. \qquad (34)$$
For all sufficiently small $h$, this estimate implies $|\bar u - \bar u_h| < \rho$, so that $\bar u_h$ does not touch the boundary of $B(\bar u, \rho)$. In view of this, $\bar u_h$ is locally optimal for $(N_h)$.
The error estimate (34) is not optimal. We can get rid of the square root. Moreover, we are able to show the stated local uniqueness of $\bar u_h$, i.e. uniqueness of local optima of $(N_h)$ in a neighborhood of $\bar u$. Both tasks can be accomplished by the stability theory for optimality systems written as generalized equations. This would go beyond the scope of this paper, and we refer the reader to the detailed presentation in [19], where we complete the proof of the optimal error estimate stated in the theorem. Moreover, we mention the recent monograph [23], where the theory of generalized equations and associated applications are discussed extensively. The same estimate can also be shown for the associated Lagrange multipliers.
References
1. Tröltzsch, F.: Optimal Control of Partial Differential Equations: Theory, Methods and Applications. Graduate Studies in Mathematics. American Math. Society (to appear, 2010)
2. Casas, E., Tröltzsch, F.: Error estimates for linear-quadratic elliptic control problems. In: Barbu, V., et al. (eds.) Analysis and Optimization of Differential Systems, pp. 89-100. Kluwer Academic Publishers, Boston (2003)
3. Falk, R.S.: Approximation of a class of optimal control problems with order of convergence estimates. J. Math. Anal. Appl. 44, 28-47 (1973)
4. Geveci, T.: On the approximation of the solution of an optimal control problem governed by an elliptic equation. R.A.I.R.O. Analyse numérique / Numerical Analysis 13, 313-328 (1979)
5. Dontchev, A.L., Hager, W.W., Poore, A.B., Yang, B.: Optimality, stability, and convergence in nonlinear control. Applied Math. and Optimization 31, 297-326 (1995)
6. Arada, N., Casas, E., Tröltzsch, F.: Error estimates for the numerical approximation of a semilinear elliptic control problem. Computational Optimization and Applications 23, 201-229 (2002)
7. Casas, E., Mateos, M., Tröltzsch, F.: Error estimates for the numerical approximation of boundary semilinear elliptic control problems. Computational Optimization and Applications 31, 193-220 (2005)
8. Casas, E., Mateos, M.: Error estimates for the numerical approximation of boundary semilinear elliptic control problems. Computational Optimization and Applications 39, 265-295 (2008)
9. Casas, E., Raymond, J.P.: Error estimates for the numerical approximation of Dirichlet boundary control for semilinear elliptic equations. SIAM J. Control and Optimization 45, 1586-1611 (2006)
10. Vexler, B.: Finite element approximation of elliptic Dirichlet optimal control problems. Numer. Funct. Anal. Optim. 28, 957-973 (2007)
11. Deckelnick, K., Günther, A., Hinze, M.: Finite element approximation of Dirichlet boundary control for elliptic PDEs on two- and three-dimensional curved domains. Preprint SPP1253-08-05, DFG Priority Programme 1253 Optimization with Partial Differential Equations (2008)
12. Neitzel, I., Prüfert, U., Slawig, T.: Strategies for time-dependent PDE control with inequality constraints using an integrated modeling and simulation environment. Numerical Algorithms 50(3), 241-269 (2009)
13. Hinze, M.: A variational discretization concept in control constrained optimization: the linear-quadratic case. J. Computational Optimization and Applications 30, 45-63 (2005)
14. Meyer, C., Rösch, A.: Superconvergence properties of optimal control problems. SIAM J. Control and Optimization 43, 970-985 (2004)
15. Casas, E.: Error estimates for the numerical approximation of semilinear elliptic control problems with finitely many state constraints. ESAIM, Control, Optimisation and Calculus of Variations 8, 345-374 (2002)
16. Casas, E., Mateos, M.: Uniform convergence of the FEM. Applications to state constrained control problems. J. of Computational and Applied Mathematics 21, 67-100 (2002)
17. Meyer, C.: Error estimates for the finite element approximation of an elliptic control problem with pointwise constraints on the state and the control. Preprint 1159, WIAS Berlin (2006)
18. Deckelnick, K., Hinze, M.: Numerical analysis of a control and state constrained elliptic control problem with piecewise constant control approximations. In: Kunisch, K., Of, G., Steinbach, O. (eds.) Proceedings of ENUMATH 2007, the 7th European Conference on Numerical Mathematics and Advanced Applications, Graz, Austria. Numerical Mathematics and Advanced Applications, pp. 597-604. Springer, Heidelberg (2007)
19. Merino, P., Tröltzsch, F., Vexler, B.: Error estimates for the finite element approximation of a semilinear elliptic control problem with state constraints and finite dimensional control space. Accepted for publication by ESAIM, Control, Optimisation and Calculus of Variations
20. Robinson, S.M.: Stability theory for systems of inequalities, part II: differentiable nonlinear systems. SIAM J. Numer. Analysis 13, 497-513 (1976)
21. Allgower, E.L., Böhmer, K., Potra, F.A., Rheinboldt, W.C.: A mesh-independence principle for operator equations and their discretizations. SIAM Journal on Numerical Analysis 23, 160-169 (1986)
22. Ortega, J.M., Rheinboldt, W.C.: Iterative Solution of Nonlinear Equations in Several Variables. SIAM Publ., Philadelphia (2000)
23. Dontchev, A.L., Rockafellar, R.T.: Implicit Functions and Solution Mappings. A View from Variational Analysis. Springer Monographs in Mathematics. Springer, New York (2009)
On Some Stability Properties of the Richardson Extrapolation Applied Together with the θ-Method
Zahari Zlatev¹, István Faragó², and Ágnes Havasi³
¹ National Environmental Research Institute, Aarhus University, Frederiksborgvej 399, P.O. Box 358, DK-4000 Roskilde, Denmark
[email protected]
² Department of Applied Analysis and Computational Mathematics, Eötvös Loránd University, Pázmány P. s. 1/C, H-1117 Budapest, Hungary
[email protected]
³ Department of Meteorology, Eötvös Loránd University, Pázmány P. s. 1/C, H-1117 Budapest, Hungary
[email protected]
Abstract. Consider the system of ordinary differential equations (ODEs) $dy/dt = f(t, y)$ where (a) $t \in [a, b]$ with $b > a$, (b) $y$ is a vector containing $s$ components and (c) $y(a)$ is given. The θ-method is applied to solve approximately the system of ODEs on a set of prescribed grid-points. If $N$ is the number of time-steps that are to be carried out, then this numerical method can be defined by using the following set of relationships: $y_n = y_{n-1} + h[(1-\theta)f(t_{n-1}, y_{n-1}) + \theta f(t_n, y_n)]$, $\theta \in [0.5, 1.0]$, $n = 1, 2, \dots, N$, $h = (b-a)/N$, $t_n = t_{n-1} + h = t_0 + nh$, $t_0 = a$, $t_N = b$. As a rule, the accuracy of the approximations $\{y_n \mid n = 1, 2, \dots, N\}$ can be improved by applying the Richardson Extrapolation, under the assumption that the stability of the computational process is preserved. Therefore, it is natural to require that the combined numerical method (Richardson Extrapolation plus the θ-method) is in some sense stable. It can be proved that the combined method is strongly A-stable when θ ∈ [2/3, 1.0]. This is the main result of the paper. The usefulness of this result in the solution of many problems arising in different scientific and engineering areas is demonstrated by performing a series of experiments with an extremely badly-scaled and very stiff atmospheric chemistry scheme which is actually used in several well-known large-scale air pollution models.
1 Richardson Extrapolation
Consider the classical initial value problem for systems of ordinary differential equations (ODEs):
$$\frac{dy}{dt} = f(t, y), \quad t \in [a, b], \quad b > a, \quad y \in \mathbb{R}^s, \quad f \in \mathbb{R}^s, \quad s \ge 1, \qquad (1)$$
with a given initial value $y(a) = y_0$.
Assume that $N \ge 1$ is a given integer and that $h = (b-a)/N$ is a real constant. It is convenient to consider the equidistant grid defined by
$$T_N = \{t_n,\ n = 0, 1, \dots, N \mid t_0 = a,\ t_n = t_{n-1} + h\ (n = 1, 2, \dots, N),\ t_N = b\}, \qquad (2)$$
but the results could easily be extended to the case where non-equidistant grids are used. It will also be assumed that $y_n$, $z_n$ and $w_n$ are some approximations of the exact solution $y(t)$ of (1) at the grid-point $t = t_n \in T_N$. Furthermore, when the Richardson Extrapolation is studied, it is also appropriate to consider an approximation $w_{n-0.5}$ of $y(t)$ at the intermediate grid-point $t = t_{n-0.5} = t_n - 0.5h$. The simplest version of the Richardson Extrapolation can be formulated as follows. Assume that two approximations $z_n$ and $w_n$ of $y(t_n)$ have been calculated by using a numerical method of order $p$ and stepsizes $h$ and $0.5h$, respectively. The following two relationships can be written by exploiting the fact that the order of the selected numerical method is $p$:
$$y(t_n) = z_n + h^p K + O(h^{p+1}), \qquad (3)$$
$$y(t_n) = w_n + (0.5h)^p K + O(h^{p+1}), \qquad (4)$$
where $K$ is some quantity depending on the numerical method applied in the calculation of $z_n$ and $w_n$. Eliminating $K$ from (3) and (4) gives:
$$y(t_n) = \frac{2^p w_n - z_n}{2^p - 1} + O(h^{p+1}). \qquad (5)$$
Denote
$$y_n = \frac{2^p w_n - z_n}{2^p - 1}. \qquad (6)$$
It is clear that the approximation $y_n$, being of order $p+1$, will in general be more accurate than both $z_n$ and $w_n$ (at least when the stepsize $h$ is sufficiently small). Thus, the Richardson Extrapolation can be used in the efforts to improve the accuracy of the approximate solution. There are two ways of implementing the Richardson Extrapolation. Assume that the three approximations $z_n$, $w_n$ and $y_n$ have already been calculated and it is necessary to calculate the next approximation $y_{n+1}$. The first implementation is based on the use of $z_n$ and $w_n$ as initial values in the calculation of $z_{n+1}$ and $w_{n+1}$, respectively; after that the improved approximation $y_{n+1}$ can be calculated by utilizing the obtained values of $z_{n+1}$ and $w_{n+1}$. This computational device is called Passive Richardson Extrapolation. In the second approach, the calculated improvement $y_n$ is used as an initial value in the calculation of both $z_{n+1}$ and $w_{n+1}$. After that, $z_{n+1}$ and $w_{n+1}$ are again used to calculate $y_{n+1}$. This device is called Active Richardson Extrapolation. It must be emphasized here that only the Active Richardson Extrapolation will be studied in the remaining part of this paper. Moreover, the word "active" will always be omitted.
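As an illustration, the following Python sketch (with hypothetical helper names, not taken from the paper) performs one active Richardson Extrapolation step for an arbitrary one-step method of order p, directly implementing formula (6); the explicit Euler method is used only as a simple example of an underlying method.

def richardson_step(one_step, f, t, y, h, p):
    """One active Richardson Extrapolation step, cf. formula (6).
    one_step(f, t, y, h) advances y from t to t+h with a method of order p."""
    z = one_step(f, t, y, h)                      # one large step
    w_half = one_step(f, t, y, 0.5 * h)           # two small steps
    w = one_step(f, t + 0.5 * h, w_half, 0.5 * h)
    return (2**p * w - z) / (2**p - 1)            # improved, order p+1

# Example with explicit Euler (p = 1) on dy/dt = -y, y(0) = 1.
euler = lambda f, t, y, h: y + h * f(t, y)
f = lambda t, y: -y
y, t, h = 1.0, 0.0, 0.1
for n in range(10):
    y = richardson_step(euler, f, t, y, h, p=1)   # now second-order accurate
    t += h
print(y)   # close to exp(-1.0)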
2 Combining the Richardson Extrapolation and the θ-Method
The Richardson Extrapolation can be combined with the θ-method by using a computational process containing three stages:
Stage 1. Perform a large time-step:
$$z_n = y_{n-1} + h[(1-\theta)f(t_{n-1}, y_{n-1}) + \theta f(t_n, z_n)]. \qquad (7)$$
Stage 2. Perform two small time-steps:
$$w_{n-0.5} = y_{n-1} + 0.5h[(1-\theta)f(t_{n-1}, y_{n-1}) + \theta f(t_{n-0.5}, w_{n-0.5})], \qquad (8)$$
$$w_n = w_{n-0.5} + 0.5h[(1-\theta)f(t_{n-0.5}, w_{n-0.5}) + \theta f(t_n, w_n)]. \qquad (9)$$
Stage 3. Calculate an improved approximation:
$$y_n = 2w_n - z_n \ \text{ when } \theta \ne 0.5 \quad \text{and} \quad y_n = \frac{4w_n - z_n}{3} \ \text{ when } \theta = 0.5. \qquad (10)$$
It is clear that at any time-step $n$ ($n = 1, 2, \dots, N$) it is necessary to perform (i) one large step by applying (7), (ii) two small steps by applying (8)-(9) and (iii) to combine the calculated approximations by using (10). The θ-method reduces to the second-order Trapezoidal Rule when $\theta = 0.5$. Therefore, (6) is used with $p = 2$ for $\theta = 0.5$. The θ-method is of order one when $\theta \ne 0.5$, and (6) has to be used with $p = 1$ in this case.
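For the scalar test equation dy/dt = λy the implicit stages (7)-(9) can be solved in closed form, so one combined step looks as follows (a minimal Python sketch with hypothetical names; for general stiff systems each stage would instead require a Newton iteration).

def theta_step(y_prev, lam, h, theta):
    """One implicit theta-method step for dy/dt = lam*y, solved exactly:
    y = y_prev + h*[(1-theta)*lam*y_prev + theta*lam*y]."""
    return y_prev * (1 + (1 - theta) * h * lam) / (1 - theta * h * lam)

def combined_step(y_prev, lam, h, theta):
    """Stages (7)-(10): one large step, two half steps, then extrapolation."""
    z = theta_step(y_prev, lam, h, theta)             # Stage 1
    w_half = theta_step(y_prev, lam, 0.5 * h, theta)  # Stage 2
    w = theta_step(w_half, lam, 0.5 * h, theta)
    if theta == 0.5:                                  # Stage 3 with p = 2
        return (4 * w - z) / 3
    return 2 * w - z                                  # Stage 3 with p = 1

y, lam, h, theta = 1.0, -2.0, 0.1, 0.75
for n in range(10):
    y = combined_step(y, lam, h, theta)
print(y)   # close to exp(-2.0) = 0.1353...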
3 Studying the Stability Properties
The scalar linear test-equation (proposed originally by Dahlquist in 1963, [4])
$$\frac{dy}{dt} = \lambda y, \qquad (11)$$
where $\lambda$ is a given complex constant, is often used in stability studies. The application of formulae (7)-(9) to the numerical solution of (11) results in
$$z_n = \frac{1 + (1-\theta)\mu}{1 - \theta\mu}\, y_{n-1} \qquad (12)$$
and
$$w_n = \left(\frac{1 + (1-\theta)(0.5\mu)}{1 - \theta(0.5\mu)}\right)^2 y_{n-1}, \qquad (13)$$
where $\mu = \lambda h$ is a complex number. The application of the Richardson Extrapolation leads to the following relationship:
$$y_n = R(\mu)\, y_{n-1} \qquad (14)$$
with
$$R(\mu) = 2\left(\frac{1 + (1-\theta)(0.5\mu)}{1 - \theta(0.5\mu)}\right)^2 - \frac{1 + (1-\theta)\mu}{1 - \theta\mu} \quad \text{for } \theta \ne 0.5 \qquad (15)$$
and
$$R(\mu) = \frac{4}{3}\left(\frac{1 + 0.25\mu}{1 - 0.25\mu}\right)^2 - \frac{1}{3}\,\frac{1 + 0.5\mu}{1 - 0.5\mu} \quad \text{for } \theta = 0.5. \qquad (16)$$
The last three formulae show clearly that the application of the Richardson Extrapolation combined with the θ-method to the solution of the test-problem (11) is equivalent to the application of a one-step method for solving ODEs with stability function $R(\mu)$. We are mainly interested in solving stiff systems of ODEs. Therefore, it is important to preserve the stability of the computational process. Several definitions related to the stability properties of numerical methods with stability function $R(\mu)$ are proposed in the literature (see, for example, [2,3,6,7,8,11]). The following three definitions will be used in this paper.
Definition 1. Consider the set $S$ containing all values of $\mu = \alpha + i\beta$ for which $|R(\mu)| \le 1$. If $S \supset C = \{\nu \mid \nu = \gamma + i\delta,\ \gamma \le 0\}$, then the method with stability function $R(\mu)$ is called A-stable (see, for example, [6]).
Definition 2. A numerical method with stability function $R(\mu)$ is said to be strongly A-stable if it is A-stable and if the relationship
$$\lim_{\mu\to\infty}|R(\mu)| \le \zeta, \qquad (17)$$
where $\zeta < 1$ is some non-negative constant, holds (see [7]).
Remark 1. If $\mu \in \mathbb{C}$, then $\mu \to \infty$ will always mean that $|\mu|$ grows beyond any assigned positive real number.
Definition 3. A numerical method with a stability function $R(\mu)$ is called L-stable if it is A-stable and the relationship
$$\lim_{\mu\to\infty}|R(\mu)| = 0 \qquad (18)$$
holds (this definition was originally proposed in [5]; see also [6,7]).
It should be pointed out here that the class of L-stable methods is a sub-class of the class of strongly A-stable methods, which in their turn form a sub-class of the class of A-stable methods. The use of numerical methods which are at least A-stable is desirable when the system of ODEs is stiff. It is well-known (see, for example, [7]) that the θ-method is
– A-stable when θ = 0.5 (the Trapezoidal Rule),
– strongly A-stable when θ ∈ (0.5, 1.0),
– L-stable when θ = 1.0 (the Backward Euler Formula).
Therefore, it is assumed in this paper that θ ∈ [0.5, 1.0]. The following results related to the stability of the combination consisting of the Richardson Extrapolation and the θ-method can be formulated and proved (see [14,15]).
Theorem 1. The combined numerical method consisting of the Richardson Extrapolation and the Trapezoidal Rule is not A-stable.
Theorem 2. The combined numerical method consisting of the Richardson Extrapolation and the Backward Euler Formula is L-stable.
Theorem 3. The numerical method consisting of a combination of the Richardson Extrapolation and the θ-method is strongly A-stable when θ ∈ (2/3, 1.0].
Theorem 1 shows that the Richardson Extrapolation may cause stability problems when it is combined with the Trapezoidal Rule and stiff systems of ODEs are solved. Theorem 2 and Theorem 3 show that the Richardson Extrapolation has stronger stability properties when the Backward Euler Formula is used instead of the θ-method with θ ∈ (2/3, 1.0). The following theorem indicates that, if accuracy requirements are also taken into account, it might nevertheless be preferable to apply the θ-method with θ ∈ (2/3, 1.0).
Theorem 4. The principal part of the local truncation error of the θ-method with θ = 0.75 is half as large as that of the Backward Euler Formula.
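A quick numerical check of these statements can be made by evaluating the stability function (15)/(16) along the imaginary axis and for large |μ| (a sketch; since R is analytic in the closed left half-plane, its supremum there is attained on the imaginary axis or at infinity).

import numpy as np

def R(mu, theta):
    """Stability function (15)/(16) of Richardson Extrapolation + theta-method."""
    small = (1 + (1 - theta) * 0.5 * mu) / (1 - theta * 0.5 * mu)
    large = (1 + (1 - theta) * mu) / (1 - theta * mu)
    if theta == 0.5:
        return (4 * small**2 - large) / 3
    return 2 * small**2 - large

beta = np.linspace(-1e4, 1e4, 200001)       # sample points on the imaginary axis
for theta in (0.5, 0.75, 1.0):
    on_axis = np.max(np.abs(R(1j * beta, theta)))
    at_inf = abs(R(-1e12, theta))           # approximates |R(mu)| for mu -> infinity
    print(theta, on_axis, at_inf)
# theta = 0.50: values exceed 1 (the combination with the Trapezoidal Rule is not A-stable)
# theta = 0.75: max |R| <= 1 on the axis and the limit is 5/9 (strongly A-stable)
# theta = 1.00: the limit is 0 (L-stable)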
4 Introduction of an Atmospheric Chemistry Scheme
An atmospheric chemistry scheme containing s = 56 species was applied in the experiments whose results will be presented below. This chemistry scheme is used in several well-known environmental models (for example, in the EMEP models, [9], and UNI-DEM, [12,13]). The chemistry scheme is described mathematically as a non-linear system of ODEs of type (1). This example is extremely difficult because (a) it is badly scaled and (b) some species vary very quickly during the periods of changes from day-time to night-time and from night-time to day-time. The bad scaling is demonstrated by the results given in Table 1, where the maximal, minimal and mean values of the concentrations of several chemical species during a time-interval of 24 hours are given.

Table 1. Maximal, minimal and mean concentrations of some species during a time-period of 24 hours. The units are numbers of molecules per cubic centimetre.

Species         NO        NO2       Ozone     OH        Isoprene
Maximal value   2.5E+09   2.4E+10   1.8E+12   2.3E+07   3.7E+09
Minimal value   8.4E+04   3.7E+08   1.4E+12   3.3E+04   1.1E+06
Mean value      5.5E+08   4.3E+09   1.5E+12   6.3E+06   1.5E+09
Fig. 1. Diurnal variation of the concentrations of the chemical species OP (P state of oxygen)
It is clearly seen that, while some species (such as nitrogen dioxide and ozone) do not vary too much, other species vary over a very wide range (sometimes by many orders of magnitude; see also Fig. 1 and Fig. 2). The steep gradients of some of the concentrations in the critical parts of the time-interval (changes from day-time to night-time and from night-time to day-time) are demonstrated in the plots drawn in Fig. 1 and Fig. 2. The concentrations of some species grow during the day (an example is given in Fig. 1), while other concentrations grow during the night (see Fig. 2).
5 Organization of the Numerical Experiments
The atmospheric chemistry scheme discussed in the previous section is handled on the time-interval [a, b] = [43200, 129600]. The value a = 43200 corresponds to twelve o'clock at noon (measured in seconds starting from midnight), while b = 129600 corresponds to twelve o'clock on the next day. Thus, the length of the time-interval is 24 hours (86400 seconds) and it contains the important changes from day-time to night-time and from night-time to day-time, when most of the chemical species vary quickly. In each experiment the first run is performed by using N = 168 time-steps (this means that the time-stepsize is h ≈ 514.285 seconds). After that the stepsize h is halved eighteen times (which implies that the number of time-steps, N, is
Fig. 2. Diurnal variation of the concentrations of the chemical species N O3
doubled in every successive run). The behavior of the errors in this sequence of 19 runs is studied. The error made in each run is measured in the following way. Denote
$$ERR_m = \max_{k=1,2,\dots,56}\ \frac{|y_{m,k} - y_{m,k}^{ref}|}{\max(|y_{m,k}^{ref}|,\,1.0)}, \qquad (19)$$
where $y_{m,k}$ and $y_{m,k}^{ref}$ are the calculated value and the reference solution of the $k$-th chemical species at time $t_m = t_0 + m h_0$ (where $m = 1, 2, \dots, 168$ and $h_0 \approx 514.285$ is the time-stepsize that has been used in the first run). The reference solution was calculated by using a three-stage fifth-order L-stable fully implicit Runge-Kutta algorithm with $N = 998244352$ and $h_{ref} \approx 6.1307634\mathrm{E}{-}05$. This algorithm is often called RADAU IIA (the formulae on which it is based are listed, for example, in [2,3,6,8]). It is clear from the above discussion that only the values of the reference solution at the grid-points of the coarse grid used in the first run have been stored and applied in the evaluation of the error (it is, of course, also possible to store all values of the reference solution, but such an action would increase the storage requirements tremendously). The global error made during the computations is estimated by
$$ERR = \max_{m=1,2,\dots,168}\,(ERR_m). \qquad (20)$$
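In code, the two error measures are simple reductions over the stored solution arrays (a sketch; the array names are hypothetical).

import numpy as np

def global_error(y, y_ref):
    """Implements (19) and (20).
    y, y_ref: arrays of shape (168, 56) holding the computed and reference
    concentrations of the 56 species at the 168 coarse grid-points."""
    denom = np.maximum(np.abs(y_ref), 1.0)
    err_m = np.max(np.abs(y - y_ref) / denom, axis=1)   # (19): one value per t_m
    return err_m.max()                                   # (20): global error ERR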
It is desirable to eliminate the influence of the rounding errors when the quantities involved in (19) and (20) are calculated. Normally, this task can easily be
accomplished when double precision arithmetic is used. Unfortunately, this is not true when the atmospheric chemistry scheme is handled. The difficulty can be explained as follows. If the problem is stiff, and the atmospheric chemistry scheme selected by us is as mentioned above a stiff non-linear system of ODEs, then implicit numerical methods are to be used. The application of such numerical methods leads to the solution of systems of non-linear algebraic equations, which are normally treated at each time-step by the Newton Iterative Method (see, for example, [6]). This means that long sequences of systems of linear algebraic equations are to be handled. Normally this does not cause great problems. However, the atmospheric chemistry schemes are very badly scaled and the condition numbers of the involved matrices are very large. It was found, by applying a LAPACK subroutine for calculating eigenvalues and condition numbers ([1]), that the condition numbers of the matrices involved in the Newton Iterative Process during the numerical integration of the atmospheric chemistry scheme on the interval [a, b] = [43200, 129600] vary in the range [4.56E + 08, 9.27E + 12]. Simple application of error analysis arguments from [10] indicates that there is a danger that the rounding errors will affect the fourth significant digit of the approximate solution on most of the existing computers when double precision arithmetic is used. Therefore, all computations reported in the next section were performed by using quadruple-precision (i.e. by using REAL*16 declarations for the real numbers and, thus, about 32-digit arithmetic) in order to eliminate the influence of the rounding errors on the first 16 significant digits of the computed approximate solutions.
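The nonlinear algebraic systems mentioned above arise from the implicit θ-stages. For a general ODE system dy/dt = f(t, y), one such stage can be solved by a Newton iteration of roughly the following form (a minimal dense-algebra sketch with hypothetical function names; production air-pollution codes use sparse solvers and analytic Jacobians, and the conditioning issues discussed above are not addressed here).

import numpy as np

def theta_stage(f, jac, t_old, y_old, t_new, h_eff, theta, tol=1e-12, maxit=20):
    """Solve y = y_old + h_eff*[(1-theta)*f(t_old,y_old) + theta*f(t_new,y)]
    by Newton's method; jac(t, y) returns the Jacobian of f with respect to y."""
    y = y_old.copy()
    explicit_part = y_old + h_eff * (1 - theta) * f(t_old, y_old)
    for _ in range(maxit):
        residual = y - explicit_part - h_eff * theta * f(t_new, y)
        if np.linalg.norm(residual) < tol:
            break
        J = np.eye(y.size) - h_eff * theta * jac(t_new, y)
        y -= np.linalg.solve(J, residual)
    return y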
6 Results from the Numerical Experiments
The results presented in the next three sub-sections will demonstrate that the combined method (consisting of the Richardson Extrapolation and a strongly A-stable θ-method) has the following properties: (a) it behaves as a second-order numerical method, (b) for some values of θ the results produced by this method are more accurate than the results produced by the combination consisting of the Richardson Extrapolation and the Backward Euler Formula (which is obtained when θ = 1.0), and (c) it is much more efficient (in terms of computing time) when a prescribed accuracy is required.
6.1 Achieving Second Order of Accuracy
Results obtained by using the θ-method directly and in combination with the Richardson Extrapolation are given in Table 2. The value θ = 0.75 is selected, which means that (17) is satisfied with ζ = 5/9. It is clearly seen that the θ-method performs as (a) a first-order method when it is applied directly and (b) a stable second-order method when it is used as the underlying method in the Richardson Extrapolation.
Table 2. Numerical results obtained in (a) 19 runs in which the direct implementation of the θ-method with θ = 0.75 is used and (b) 19 runs in which the combination consisting of the Richardson Extrapolation and the θ-method with θ = 0.75 is applied. The errors obtained by (20) are given in the columns under "Accuracy". The ratios of two successive errors are given in the columns under "Rate".

Job No.  Time-steps   Direct use of the θ-method   Richardson Extrapolation
                      Accuracy    Rate             Accuracy    Rate
 1           168      1.439E-00    -               3.988E-01    -
 2           336      6.701E-01   2.147            5.252E-02   7.593
 3           672      3.194E-01   2.098            1.503E-02   3.495
 4          1344      1.550E-01   2.060            3.787E-03   3.968
 5          2688      7.625E-02   2.033            9.502E-04   3.985
 6          5376      3.779E-02   2.018            2.384E-04   3.986
 7         10752      1.881E-02   2.009            5.980E-05   3.986
 8         21504      9.385E-03   2.005            1.499E-05   3.989
 9         43008      4.687E-03   2.002            3.754E-06   3.993
10         86016      2.342E-03   2.001            9.394E-07   3.996
11        172032      1.171E-03   2.001            2.353E-07   3.993
12        344064      5.853E-04   2.000            6.264E-08   3.756
13        688128      2.926E-04   2.000            1.618E-08   3.873
14       1376256      1.463E-04   2.000            4.111E-09   3.935
15       2752512      7.315E-05   2.000            1.036E-09   3.967
16       5505024      3.658E-05   2.000            2.601E-10   3.984
17      11010048      1.829E-05   2.000            6.514E-11   3.993
18      22020096      9.144E-06   2.000            1.628E-11   4.001
19      44040192      4.572E-06   2.000            4.051E-12   4.019

6.2 Comparing Results Obtained by the Backward Euler Formula and the θ-Method
Theorem 4 indicates that one should expect, as stated above, the θ-method with θ = 0.75 to be more accurate than the Backward Euler Formula. Some results are shown in Table 3. It is seen that the results obtained by using the θ-method with θ = 0.75 are indeed considerably more accurate than those obtained by the Backward Euler Formula (see the figures given in the third and the fifth columns of Table 3). It is remarkable that the accuracy is improved precisely by a factor of two when the time-stepsize becomes sufficiently small. It is not clear how to derive corresponding expressions for the principal parts of the local truncation error when the Richardson Extrapolation is used together with these two numerical methods for solving systems of ODEs. However, the results in Table 3 show that the accuracy is in general improved by a factor greater than two when the θ-method with θ = 0.75 is used as the underlying method instead of the Backward Euler Formula.
Table 3. Comparison of the accuracy achieved when the Backward Euler Formula and the θ-method with θ = 0.75 are run with 19 different time-stepsizes. The errors obtained by (20) are given in the columns of this table. The ratios (the errors obtained when the θ-method with θ = 0.75 is used divided by the corresponding errors obtained when the Backward Euler Formula is used) are given in brackets.

Job No.  Time-steps   Backward Euler Formula       The θ-method with θ = 0.75
                      Direct      Richardson       Direct              Richardson
 1           168      2.564E-00   3.337E-01        1.439E-00 (0.561)   3.988E-01 (1.195)
 2           336      1.271E-00   1.719E-01        6.701E-01 (0.527)   5.252E-02 (0.306)
 3           672      6.227E-01   5.473E-02        3.194E-01 (0.513)   1.503E-02 (0.275)
 4          1344      3.063E-01   7.708E-03        1.550E-01 (0.506)   3.787E-03 (0.491)
 5          2688      1.516E-01   1.960E-03        7.625E-02 (0.503)   9.502E-04 (0.484)
 6          5376      7.536E-02   5.453E-04        3.779E-02 (0.501)   2.384E-04 (0.437)
 7         10752      3.757E-02   1.455E-04        1.881E-02 (0.501)   5.980E-05 (0.411)
 8         21504      1.876E-02   3.765E-05        9.385E-03 (0.500)   1.499E-05 (0.398)
 9         43008      9.371E-03   9.583E-06        4.687E-03 (0.500)   3.754E-06 (0.392)
10         86016      4.684E-03   2.418E-06        2.342E-03 (0.500)   9.394E-07 (0.389)
11        172032      2.241E-03   6.072E-07        1.171E-03 (0.500)   2.353E-07 (0.388)
12        344064      1.171E-03   1.522E-07        5.853E-04 (0.500)   6.264E-08 (0.411)
13        688128      5.853E-04   3.809E-08        2.926E-04 (0.500)   1.618E-08 (0.425)
14       1376256      2.926E-04   9.527E-09        1.463E-04 (0.500)   4.111E-09 (0.432)
15       2752512      1.463E-04   2.382E-09        7.315E-05 (0.500)   1.036E-09 (0.435)
16       5505024      7.315E-05   5.957E-10        3.658E-05 (0.500)   2.601E-10 (0.437)
17      11010048      3.658E-05   1.489E-10        1.829E-05 (0.500)   6.514E-11 (0.437)
18      22020096      1.829E-05   3.720E-11        9.144E-06 (0.500)   1.628E-11 (0.438)
19      44040192      9.144E-06   9.273E-12        4.572E-06 (0.500)   4.051E-12 (0.437)

6.3 Comparing Computing Times Needed to Obtain Prescribed Accuracy
Three time-steps (one large and two small) with the underlying numerical method are necessary when one time-step of the Richardson Extrapolation is performed. This means that if the Richardson Extrapolation and the underlying numerical method are used with the same time-stepsize, then the computational cost of the Richardson Extrapolation will be about three times greater than that of the underlying numerical method. In many practical situations this factor will be less than three, but considerably larger than two (because the number of Newton iterations needed for each of the two small time-steps will normally be smaller than the corresponding number for the large time-step). However, the use of the Richardson Extrapolation leads also to an improved accuracy of the calculated approximations (see Table 2). Therefore, it is not relevant (and not fair either) to compare the Richardson Extrapolation with the underlying method under the assumption that both devices are run with equal number of time-steps. It is much more relevant to investigate how much work is needed in order to achieve the same accuracy in the cases where (a) the θ-method with θ = 0.75 is applied directly and (b) when it is combined with the Richardson
Table 4. Comparison of the computational costs (measured by the CPU times given in hours) needed to achieve prescribed accuracy in the cases where (a) the θ-method with θ = 0.75 is implemented directly and (b) the Richardson Extrapolation with the θ-method is used. The computing times measured in hours are given in the columns under "CPU time". The numbers of time-steps needed to obtain the desired accuracy are given in the columns under "Time-steps".

Accuracy              The θ-method with θ = 0.75    Richardson Extrapolation
                      CPU time    Time-steps        CPU time    Time-steps
[1.0E-2, 1.0E-1)        0.0506         2688           0.0614          336
[1.0E-3, 1.0E-2)        0.1469        21504           0.0897         1344
[1.0E-4, 1.0E-3)        1.1242       344064           0.1192         2688
[1.0E-5, 1.0E-4)        6.6747      2752512           0.2458        10752
[1.0E-6, 1.0E-5)       43.0650     22020096           0.6058        43008
Extrapolation. The computing times needed to achieve prescribed accuracy are given in Table 4. If the desired accuracy is $10^{-k}$ ($k = 1, 2, 3, 4, 5$), then the computing time of the first run in which the quantity from (20) becomes less than $10^{-k}$ is given in Table 4. This means that the error is in the interval $[10^{-(k+1)}, 10^{-k})$ when accuracy of order $10^{-k}$ is required. The results shown in Table 4 demonstrate clearly that the Richardson Extrapolation is not only a powerful tool for improving the accuracy of the underlying numerical method, but is also extremely efficient with regard to the computational cost (this being especially true when the accuracy requirement is stringent). It must be emphasized here that accuracy better than $10^{-6}$ was not achieved in the 19 runs when the θ-method is used directly, while accuracy better than $10^{-11}$ can be achieved when this method is combined with the Richardson Extrapolation (see Table 2).
7 Concluding Remark
Important properties of the Richardson Extrapolation were studied in the previous sections of this paper. Theorems related to the stability of the computational process and the accuracy of the results were formulated. Numerical experiments were carried out to demonstrate (a) the improvement of the accuracy by applying the Richardson Extrapolation, (b) the better accuracy properties of the combination Richardson Extrapolation plus the θ-method with θ = 0.75 when compared with the accuracy achieved by using the combination the Richardson Extrapolation plus the Backward Euler Formula and (c) the great savings in computing time achieved when a prescribed accuracy is required. There are still many open problems which will be studied in the near future. Four of the open problems are listed below: – It seems plausible to conjecture that Theorem 3 could be extended for some implicit or diagonally-implicit Runge-Kutta methods for solving systems of ODEs when these are to be applied together with the Richardson Extrapolation.
– It is desirable to obtain some stability results related to the Richardson Extrapolation applied together with strongly stable or L-stable numerical methods for solving systems of ODEs that are of higher order, i.e., for combined numerical methods with p ≥ 3. – Theorem 4 and the results shown in Table 3 indicate that not only should one select methods of a given order, but it is also necessary to try to improve the results by designing methods with a smaller (in absolute value) principal parts of the local truncation error. – Splitting procedures can be useful in many applications. It is interesting to try to prove in a rigorous way when the splitting procedures preserve the stability properties of the combined methods (i.e. when the combination consisting of the Richardson Extrapolation plus some splitting procedure plus the numerical methods for solving system of ODEs has good stability properties).
Acknowledgements The research of Z. Zlatev was partly supported by the NATO Scientific Programme (Collaborative Linkage Grants: No. 980505 “Impact of Climate Changes on Pollution Levels in Europe” and No. 98624 “Monte Carlo Sensitivity Studies of Environmental Security”). The Danish Centre for Supercomputing gave us access to several powerful parallel computers for running a long sequence of numerical experiments. ´ The research of Istv´ an Farag´ o and Agnes Havasi was supported by Hungarian Scientific Research Fund (OTKA), grants No. K67819 and F61016, respectively.
References 1. Anderson, E., Bai, Z., Bischof, C., Demmel, J., Dongarra, J., Du Croz, J., Greenbaum, A., Hammarling, S., McKenney, A., Ostrouchov, S., Sorensen, D.: LAPACK: Users’ Guide. SIAM, Philadelphia (1992) 2. Burrage, K.: Parallel and Sequential Methods for Ordinary Differential Equations. Oxford University Press, Oxford (1995) 3. Butcher, J.C.: The Numerical Analysis of Ordinary Differential Methods: RungeKutta and General Linear Methods, 2nd edn. Wiley, New York (2003) 4. Dahlquist, G.: A special stability problem for linear multistep methods. BIT 3, 27–43 (1963) 5. Ehle, B.L.: On Pad´e approximations to the exponential function and A-stable methods for the numerical solution of initial value problems. Research Report CSRR 2010, Dept. AACS, University of Waterloo, Ontario, Canada (1969) 6. Hairer, E., Wanner, G.: Solving Ordinary Differential Equations: II Stiff and Differential-Algebraic Problems. Springer, Berlin (1991) 7. Hundsdorfer, W., Verwer, J.G.: Numerical Solution of Time-Dependent AdvectionDiffusion-Reaction Equations. Springer, Berlin (2003) 8. Lambert, J.D.: Numerical Methods for Ordinary Differential Equations. Wiley, New York (1991)
9. Simpson, D., Fagerli, H., Jonson, J.E., Tsyro, S.G., Wind, P., Tuovinen, J.-P.: Transboundary Acidification, Eutrophication and Ground Level Ozone in Europe, Part I. Unified EMEP Model Description. EMEP/MSC-W Status Report 1/2003, Norwegian Meteorological Institute, Oslo, Norway (2003) 10. Wilkinson, J.H.: The algebraic eigenvalue problem. Oxford University Press, Oxford (1965) 11. Zlatev, Z.: Modified diagonally implicit Runge-Kutta methods. SIAM Journal on Scientific and Statistical Computing 2, 321–334 (1981) 12. Zlatev, Z.: Computer treatment of large air pollution models. Kluwer Academic Publishers, Dordrecht (1995) 13. Zlatev, Z., Dimov, I.: Computational and Environmental Challenges in Environmental Modelling. Elsevier, Amsterdam (2006) ´ Some stability properties and applications of the 14. Zlatev, Z., Farag´ o, I., Havasi, A.: Richardson Extrapolation (submitted, 2009) ´ Stability of Richardson Extrapolation and the 15. Zlatev, Z., Farag´ o, I., Havasi, A.: θ-method (submitted, 2009)
On the Use of Aggregation-Based Parallel Multilevel Preconditioners in the LES of Wall-Bounded Turbulent Flows
Andrea Aprovitola¹, Pasqua D'Ambra², Daniela di Serafino³, and Salvatore Filippone⁴
¹ Engine Institute (IM) — CNR, Via Marconi 8, I-80125 Naples, Italy
[email protected]
² Institute for High-Performance Computing and Networking (ICAR) — CNR, Via Pietro Castellino 111, I-80131 Naples, Italy
[email protected]
³ Department of Mathematics, Second University of Naples, Via Vivaldi 43, I-81100 Caserta, Italy
[email protected]
⁴ Department of Mechanical Engineering, University of Rome Tor Vergata, Viale del Politecnico 1, I-00133 Rome, Italy
[email protected]
Abstract. This work is concerned with the application of algebraic multilevel preconditioners in the solution of pressure linear systems arising in the large eddy simulation of turbulent incompressible flows in wallbounded domains. These systems, coming from the discretization of elliptic equations with periodic and Neumann boundary conditions, are large and sparse, singular, compatible, and nonsymmetric because of the use of non-uniform grids taking into account the anisotropy of the flow. Furthermore, they generally account for a large part of the simulation time. We analyse, through numerical experiments, the effectiveness of parallel algebraic multilevel Schwarz preconditioners, based on the smoothed aggregation technique, in the iterative solution of the above pressure systems. We also investigate the behaviour of a variant of the smoothed aggregation technique, recently developed to efficiently deal with nonsymmetric systems.
1 Introduction
Projection methods are widely used to solve the Navier-Stokes (N-S) equations for incompressible flows [8]. A main task of these methods consists in computing a velocity field which is divergence-free in some discrete sense; this requires the solution of a discretized elliptic equation, known as pressure-correction or simply pressure equation, which is derived from the continuity equation. In this paper we consider a projection method applied to suitably filtered N-S equations in the context of the Large Eddy Simulation (LES) of wall-bounded turbulent flows.
The related pressure systems are large and sparse, singular, compatible, with symmetric nonzero pattern and nonsymmetric values. The computational cost of solving these systems is very high if large Reynolds numbers are considered, thus calling for high-performance solvers. Krylov methods coupled with multilevel preconditioners have shown their effectiveness in the context of CFD, as well as in a variety of other applications. Increasing interest has been devoted to algebraic multilevel domain decomposition preconditioners because of their general applicability and their suitability to modern parallel and distributed computing systems. The MultiLevel Domain Decomposition Parallel Preconditioners Package based on PSBLAS (MLD2P4) [7] implements algebraic multilevel preconditioners using Schwarz domain decomposition methods and the smoothed aggregation coarsening technique; the preconditioners can be used with the Krylov solvers available in the PSBLAS library [9]. MLD2P4 has been developed following an object-based design translated into Fortran 95, to pursue goals such as extensibility and flexibility, while preserving runtime efficiency and immediate interfacing with Fortran codes. The package has been integrated in a Fortran 90 LES code implementing the above projection method, to analyse the behaviour of different multilevel preconditioners in the solution of the pressure equations, with the final goal of identifying the most effective ones in the parallel simulation of wall-bounded turbulent flows. We note that the classical smoothed aggregation technique produces very effective multilevel algorithms for symmetric positive-definite systems, where good characterizations of the so-called algebraically smooth error are available. Recently, a generalization of the smoothed aggregation has been developed to efficiently deal with nonsymmetric systems and has been implemented in the ML package [10] available in the Trilinos C++ framework [11]. In order to analyse the impact of this aggregation strategy on the pressure systems arising in our LES application, we performed numerical experiments with ML preconditioners using such strategy. In this case the analysis was carried out “offline”, i.e. by extracting the linear systems at certain steps of the simulation. The paper is organized as follows. In Section 2 we outline the LES approach and the approximate projection method used in our application, focusing on the pressure equation and the related linear system. In Section 3 we briefly describe the algebraic multilevel Schwarz preconditioners, focusing on the smoothed aggregation technique and its generalization. In Section 4 we discuss the results of the numerical experiments carried out with MLD2P4 and ML. In Section 5 we draw some conclusions and future work.
2
Pressure Systems in the LES of Turbulent Channel Flows
We consider incompressible and homothermal turbulent flows modelled by initial boundary-value problems for the N-S equations. In our LES approach, a tophat filter and an approximate deconvolution operator are applied to the N-S
Parallel Multilevel Preconditioners in the LES of Wall-bounded Flows
69
equations in non-dimensional weak conservation form, obtaining the following formulation: ∂ v · n dS = 0, v A−1 (1) = fconv + fdif f + fpress , x ∂t ∂Ω(x)
= Ax (¯ where Ω(x) is a finite volume contained into the region of the flow, v v) is the unknown velocity field, with Ax approximate differential deconvolution ¯ resolved velocity field, and fconv , fdif f and fpress are the convecoperator and v tive, diffusive and pressure fluxes, respectively. Note that the pressure unknown is introduced in the N-S equations through the fpress term. An implicit subgridscale modelling is used, thus the unresolved subgrid-scale terms do not appear explicitly in the equations. For details the reader is referred to [2]. We focus on the solution of channel flows, with periodic boundary conditions assigned in the stream-wise (x) and span-wise (z) directions, and no-slip boundary conditions on the walls (y direction). The domain is discretized by using a structured Cartesian grid, with uniform grid spacings in the x and z directions, where the flow is assumed to be homogeneous, and a non-uniform grid spacing in the y direction, with refinement near the walls. The equations (1) are discretized by using a finite volume method, with flow variables co-located at the centers of the control volumes; a third-order multidimensional upwind scheme is used for the fluxes. A time-splitting technique based on an approximate projection method [3] is used to decouple the velocity from the pressure in the deconvolved ˜ is computed, at each time momentum equation; the unknown velocity field v step, through the following predictor-corrector formula: n+1 = v∗ − Δt∇φn+1 ; v where v∗ is an intermediate velocity field, Δt is the time step, and φ is a scalar field such that ∇φ is an O(Δt) approximation of the pressure gradient. The intermediate velocity v∗ is computed applying a second-order AdamsBashforth/Crank-Nicolson semi-implicit scheme to the deconvolved momentum equation, where the pressure term is neglected. The correction term, needed which is divergence-free in a discrete sense, is obto have a velocity field v tained by computing φn+1 as the solution of the so-called pressure equation, which is a Poisson-type equation with periodic and non-homogeneous Neumann boundary conditions. By discretizing this equation with a second-order central finite-volume scheme we obtain a linear system Ax = b,
(2)
where A = (aij ) ∈ n×n is a sparse matrix with symmetric sparsity pattern, but nonsymmetric values because of the use of a non-uniform grid in the y direction. System (2) is singular but compatible, since a discrete compatibility condition is enforced according to the boundary conditions. The dimension n of the matrix A depends on the number of cells of the discretization grid and hence, owing to
70
A. Aprovitola et al.
resolution needs, increases as the Reynolds number grows; therefore the efficient solution of the above linear system is a key point in the LES simulations. We note that R(A) ∩ N (A) = {0}, where R(A) and N (A) are the range space and the null space of A; this property, coupled with the compatibility of the linear system, ensures that in exact arithmetic the GMRES method computes a solution before a breakdown occurs [4]. Usually, the Restarted GMRES (RGMRES) method is used and the application of an effective preconditioner that reduces the condition number of the restriction of A to R(A) is crucial to decrease the number of iterations to achieve a required accuracy in the solution.
3
Algebraic Multilevel Domain Decomposition Preconditioners
Multilevel domain decomposition preconditioners are based on the idea of applying basic domain decomposition methods to a hierarchy of matrices that represent, in increasingly coarser spaces, the matrix A to be preconditioned, and of combining contributions coming from each space. The use of multiple spaces (levels) allows to correct “errors” arising in the application of the basic domain decomposition methods to the original matrix. These preconditioners are built from two main components: a basic domain decomposition preconditioner (smoother), acting on a single matrix in the hierarchy, and a coarse-space correction, which computes a correction to the basic preconditioner by transferring suitable information from the current space to the next coarser one and vice versa, and by solving a linear system involving the coarse-space matrix. Among the multilevel preconditioners, the algebraic ones generate the hierarchy of matrices using only information on A, without assuming any knowledge of the problem from which it arises. A nice feature of these preconditioners is that they combine a wide applicability with the capability of automatically adapting to specific features of the problem to be solved. A description of algebraic multilevel domain decomposition preconditioners is beyond the scope of this paper. For details the reader is referred, e.g., to [14,15]. We only outline the smoothed aggregation technique for the construction of the hierarchy of matrices and of the related coarse-to-fine and fine-to-coarse transfer operators [5,16], which is used by the multilevel preconditioners discussed in this paper. Given the set of row (column) indices Ω k = {1, 2, . . . , nk } of the matrix Ak concerning a certain level k in the hierarchy, the smoothed aggregation generates a coarser index set Ω k+1 = {1, 2, . . . , nk+1 } by suitably grouping strongly k coupled indices into disjoint subsets called aggregates. Two indices i, j ∈ Ω are considered strongly coupled if |akij | ≥ · |akii akjj |, where > 0 is a given threshold. Then a tentative prolongator P k ∈ nk ×nk+1 is built, where each column identifies an aggregate. According to the general theory on aggregation-based multilevel methods, P k should be such that its range contains the near null space of the matrix Ak , i.e. the eigenspace associated to the smallest eigenvalue of Ak . We consider the following definition of P k
Parallel Multilevel Preconditioners in the LES of Wall-bounded Flows
Pk = ( pkij ),
pkij =
1 0
if point i is in the aggregate j , otherwise
71
(3)
which is a piecewise constant interpolation operator whose range includes the space spanned by the vector of all ones, i.e. the near null space of the matrix Ak related to the pressure linear system of our LES application. The actual prolongator P k is obtained by applying a damped Jacobi smoother to P k , P k = (I − ω k (Dk )−1 Ak )Pk , where ω k is a suitable damping parameter and Dk is the diagonal part of Ak . The coarse matrix Ak+1 is built by using the Galerkin approach, i.e. Ak+1 = Rk Ak P k , with Rk = (P k )T . Convergence results concerning multilevel methods based on the previous aggregation strategy confirm the effectiveness of the above approach for symmetric positive definite matrices [17]. Parallel algebraic multilevel domain decomposition preconditioners are implemented in MLD2P4 [6,7]. Additive Schwarz methods are used as basic preconditioners and the coarse-level corrections are built through the previously described smoothed aggregation, using the tentative prolongator defined in (3). These components can be combined in different multilevel frameworks, obtaining purely additive and hybrid (i.e. multiplicative among the levels and additive within each level) preconditioners; the latter apply basic Schwarz methods as pre-, post- or two-side (i.e. V-cycle) smoothers. Different solvers are available at the coarsest level, including LU and ILU factorizations as well as block-Jacobi solvers with ILU or LU on the blocks. A generalization of the classical smoothed aggregation technique has been recently proposed to obtain on nonsymmetric systems the effectiveness achieved on the symmetric ones [13]. In the new approach the coarse matrix Ak+1 is built using a restriction operator Rk different from (P k )T , which is defined as k (I − Ak W k (Dk )−1 ), Rk = R k = (P k )T is a tentative restrictor and W k is a diagonal matrix whose where R k . Local dampi-th diagonal entry is a damping parameter for the i-th row of R ing parameters are also used in the definition of the smoothed prolongator, i.e. P k = (I − (Dk )−1 Ω k Ak )Pk , where Ω k is a diagonal matrix. This nonsymmetric smoothed aggregation is currently available in the ML package of parallel multilevel preconditioners [10], included in the C++ Trilinos computational framework [11]. We are working to implement it in MLD2P4.
4
Numerical Experiments
Multilevel preconditioners based on the smoothed aggregation techniques described in Section 3 have been applied to the pressure systems arising in the simulation of a bi-periodical channel flow with the LES approach outlined in Section 2. The problem domain is a box of size 2πl × 2l × 2πl, where l is the
72
A. Aprovitola et al.
channel half-width, and the Reynolds number referred to the shear velocity is Reτ = 1050; the initial flow is a Poiseuille flow, with a random Gaussian perturbation. A computational grid with 64 × 96 × 128 cells is considered, with cell aspect ratio varying approximately from 3 to 500. The resulting pressure matrix has dimension 786432 and 5480702 nonzero entries. The time step Δt is 10−4 , to meet stability requirements. Preliminary results concerning the performance of multilevel preconditioners available in MLD2P4, coupled with the Restarted GMRES (RGMRES) from PSBLAS, in the solution of the pressure linear systems arising in the LES simulation, are reported in [1]. In that case, the threshold used in the aggregation algorithm was set to zero and the multilevel algorithms used only post-smoothing. Many other experiments have been performed, varying the type of multilevel cycle, the number of levels, the aggregation threshold, the basic Schwarz preconditioner, and the coarsest-level solver. A significant improvement in the convergence behaviour has been observed in passing from = 0 to > 0, according to the mesh anisotropy; on the other hand, only modest changes have been obtained using different small positive values of . For the sake of space, we report here the results concerning multilevel preconditioners of V-cycle type, with 2, 3 and 4 levels, using = 0.01, the block-Jacobi preconditioner as basic preconditioner, and four sweeps of the block-Jacobi solver at the coarsest level (the ILU(0) factorization is applied on the local submatrices within the block-Jacobi preconditioner/solver). On the average, this type of preconditioner shows on our problem the best results in terms of iterations and execution time of the coupled Krylov solver. In the following, the above preconditioners are denoted by 2LMLD2P4, 3L-MLD2P4, and 4L-MLD2P4. We also report, for comparison, the results obtained with the one-level block-Jacobi preconditioner, using ILU(0) on the local blocks (BJAC-MLD2P4). The results presented in this Section concern the solution of the pressure systems arising in the first ten time steps of the simulation procedure. Furthermore, since the pressure matrix does not change during the whole simulation, the cost for the construction of the multilevel preconditioners is neglected in our discussion. In order to analyse the impact on the pressure systems of the nonsymmetric smoothed aggregation described in Section 3, we have applied to those systems some multilevel preconditioners using the new aggregation strategy, which are available in the Trilinos-ML package. All the other features of the ML preconditioners have been chosen as in the MLD2P4 preconditioners, except the coarsestlevel solver, set to 4 sweeps of the pointwise Jacobi method, since ML does not include the block-Jacobi one. These preconditioners are referred to 2L-ML-NS, 3L-ML-NS, 4L-ML-NS. To focus on the aggregation technique, removing the dependence from the coarsest-level solver in our analysis, we have also applied the corresponding ML preconditioners with classical smoothed aggregation (2L-ML, 3L-ML, 4L-ML). The ML preconditioners have been coupled with the RGMRES solver provided by the Aztec-OO package included in Trilinos. Note that no significative influence on the convergence behaviour of the above ML preconditioners
Parallel Multilevel Preconditioners in the LES of Wall-bounded Flows
73
Table 1. MLD2P4: mean number of iterations and mean execution time (in brackets) Procs 1 2 4 8 16 32 64
BJAC-MLD2P4 91 (62.77) 96 (37.89) 98 (19.80) 98 (9.59) 102 (4.45) 101 (2.09) 109 (1.33)
2L-MLD2P4 3L-MLD2P4 22 (32.51) 14 (23.31) 23 (19.96) 12 (10.85) 22 (9.72) 12 (5.42) 22 (4.81) 16 (3.56) 23 (2.57) 12 (1.45) 23 (1.37) 16 (1.03) 27 (0.99) 19 (0.80)
4L-MLD2P4 13 (21.83) 13 (11.46) 13 (5.59) 14 (3.29) 13 (1.48) 19 (1.34) 21 (1.03)
has been observed varying the aggregation threshold in the decoupled coarsening strategy. This different behaviour between MLD2P4 and ML is probably due to a different local coarsening strategy and it is currently under investigation. All the previous preconditioners have been used as right preconditioners and the restarting parameter of RGMRES has been set to 30. The null vector has been chosen as starting guess and the RGMRES iterations have been stopped when the ratio between the 2-norms of the residual and of the right-hand-side was smaller than 10−7 ; a maximum number of 500 iterations has been also set. A rowblock distribution of the pressure matrix has been considered, with right-hand side and solution vectors distributed accordingly. The experiments have been carried out on a HP XC 6000 Linux cluster with 64 bi-processor nodes, operated by the Naples branch of ICAR-CNR. Each node comprises an Intel Itanium 2 Madison processor, with clock frequency of 1.4 Ghz, and is equipped with 4 GB of RAM; it runs HP Linux for High Performance Computing 3 (kernel 2.4.21). The main interconnection network is Quadrics QsNetII Elan 4, which has a sustained bandwidth of 900 MB/sec. and a latency of about 5 μsec. for large messages. The GNU Compiler Collection, v. 4.3, and the HP MPI implementation, v. 2.01, have been used. MLD2P4 1.1 and and PSBLAS 2.3.1 have been installed on top of ATLAS 3.6.0 and BLACS 1.1. The release 8.0.5 of Trilinos, including ML 5.0, has been installed on top of ATLAS 3.6.0 and LAPACK 3.0. In Table 1 we report the mean number of iterations and the mean execution time, in seconds, obtained with the MLD2P4 preconditioners over the ten linear systems, on 1, 2, 4, 8, 16, 32 and 64 processors. We see that the MLD2P4 multilevel preconditioners show good algorithmic scalability, with a low increase in the number of iterations when the number of processors increases. Similar iteration counts and execution times are obtained with three and four levels, showing that three levels are sufficient to reduce low-energy error modes in this case. Satisfactory speedup values can be observed for all the preconditioners. A significant reduction of the number of iterations and of the execution time with respect to the block-Jacobi preconditioner is obtained with all the multilevel preconditioners. In Table 2 we report the mean number of iterations and the mean execution time for the ML preconditioners based on the classical smoothed aggregation and on the nonsymmetric one. We observe that RGMRES with the ML
74
A. Aprovitola et al.
Table 2. ML: mean number of iterations and mean execution time (in brackets) Procs 1 2 4 8 16 32 64
2L-ML 3L-ML 4L-ML 2L-ML-NS 3L-ML-NS 4L-ML-NS 39 (73.70) 25 (57.68) 24 (57.84) 38 (77.72) 18 (54.00) 9 (39.98) 31 (42.89) 22 (36.46) 21 (36.57) 31 (44.80) 17 (35.54) 9 (28.79) 31 (21.50) 22 (17.79) 21 (17.99) 31 (22.70) 18 (18.26) 9 (13.97) 31 (10.27) 22 (8.42) 21 (8.57) 31 (10.79) 18 (8.75) 10 (6.62) 31 (5.01) 22 (3.92) 22 (4.08) 31 (5.31) 17 (3.95) 12 (3.15) 32 (2.47) 22 (1.88) 26 (2.20) 32 (2.63) 18 (1.91) 15 (1.75) 32 (1.31) 24 (1.03) 27 (1.32) 32 (1.61) 17 (1.02) 15 (0.95)
preconditioners based on the classical smoothed aggregration has a general convergence behaviour similar to that obtained with the MLD2P4 preconditioners (the different number of iterations is due to the different coarsest-level solver). In particular, 3L-ML and 4L-ML have close iteration counts. Conversely, the preconditioners based on the nonsymmetric aggregation strategy produce a large reduction of the iterations when three and four levels are used. The best results are obtained with 4L-ML-NS, which gets the smallest number of iterations and the smallest execution time with respect to the other ML preconditioners. These results show that the aggregation strategy for nonsymmetric matrices can enhance the performance of multilevel preconditioners in the solution of the linear systems arising in our LES application and its inclusion in MLD2P4 is desirable. Finally, we see that the multilevel preconditioners from MLD2P4 generally require smaller execution times than the ML preconditioners; the same is true if we consider the time per single iteration. On the other hand, the ML preconditioners show higher speedup values on 32 and 64 processors; this is mainly due to their large sequential execution times.
5
Conclusions and Future Work
In this work we analysed the performance of parallel algebraic multilevel Schwarz preconditioners, based on aggregation, in the solution of nonsymmetric pressure systems arising in the LES of incompressible wall-bounded flows. Our work was aimed at identifying effective multilevel preconditioners to be used in a scientific code implementing the LES approach described in Section 2, where MLD2P4 has been already integrated. The results obtained show that the MLD2P4 preconditioners based on the classical smoothed aggregation achieve good algorithmic scalability and parallel performance on the systems under consideration. However, experiments with a recently proposed nonsymmetric smoothed aggregation technique indicate that the effectiveness of multilevel algorithms on such systems can be improved. Based on these considerations, we planned to implement the new aggregation technique into the next release of MLD2P4.
Parallel Multilevel Preconditioners in the LES of Wall-bounded Flows
75
References 1. Aprovitola, A., D’Ambra, P., Denaro, F.M., di Serafino, D., Filippone, S.: Application of Parallel Algebraic Multilevel Domain Decomposition Preconditioners in Large Eddy Simulations of Wall-Bounded Turbulent Flows: First Experiments, ICAR-CNR Technical Report RT-ICAR-NA-07-02 (2007) 2. Aprovitola, A., Denaro, F.M.: On the Application of Congruent Upwind Discretizations for Large Eddy Simulations. J. Comput. Phys. 194(1), 329–343 (2004) 3. Aprovitola, A., Denaro, F.M.: A Non-diffusive, Divergence-free, Finite Volumebased Double Projection Method on Non-staggered Grids. Internat. J. Numer. Methods Fluids 53(7), 1127–1172 (2007) 4. Brown, P.N., Walker, H.F.: GMRES on (nearly) singular systems. SIAM J. Matrix Anal. Appl. 18(1), 37–51 (1997) 5. Brezina, M., Vanˇek, P.: A Black-Box Iterative Solver Based on a Two-Level Schwarz Method. Computing 63(3), 233–263 (1999) 6. D’Ambra, P., di Serafino, D., Filippone, S.: On the Development of PSBLAS-based Parallel Two-Level Schwarz Preconditioners. Applied Numerical Mathematics 57, 1181–1196 (2007) 7. D’Ambra, P., di Serafino, D., Filippone, S.: MLD2P4: a Package of Parallel Multilevel Algebraic Domain Decomposition Preconditioners in Fortran 95. ICAR-CNR Technical Report RT-ICAR-NA-09-01 (2009), http://www.mld2p4.it 8. Ferziger, J.H., Peric, M.: Computational Methods for Fluid Dynamics. Springer, Heidelberg (1996) 9. Filippone, S., Buttari, A.: PSBLAS 2.3: User’s and Reference Guide (2008), http://www.ce.uniroma2.it/psblas/ 10. Gee, M.W., Siefert, C.M., Hu, J.J., Tuminaro, R., Sala, M.G.: ML 5.0 Smoothed Aggregation User’s Guide (May 2006), http://trilinos.sandia.gov/packages/ml/publications.html 11. Heroux, M.A., et al.: An Overview of the Trilinos Project. ACM Trans. Math. Soft. 31(3), 397–423 (2005) 12. Keyes, D.: Domain Decomposition Methods in the Mainstream of Computational Science. In: Proc. of the 14th International Conference on Domain Decomposition Methods, pp. 79–93. UNAM Press, Mexico City (2003) 13. Sala, M., Tuminaro, R.: A New Petrov-Galerkin Smoothed Aggregation Preconditioner for Nonsymmetric Linear Systems. SIAM J. Sci. Comput. 31(1), 143–166 (2008) 14. Smith, B., Bjørstad, P., Gropp, W.: Domain Decomposition: Parallel Multilevel Methods for Elliptic Partial Differential Equations. Cambridge University Press, Cambridge (1996) 15. St¨ uben, K.: Algebraic Multigrid (AMG): an Introduction with Applications. In: Sch¨ uller, A., Trottenberg, U., Oosterlee, C. (eds.) Multigrid. Academic Press, London (2000) 16. Vanˇek, P., Mandel, J., Brezina, M.: Algebraic Multigrid by Smoothed Aggregation for Second and Fourth Order Elliptic Problems. Computing 56, 179–196 (1996) 17. Vanˇek, P., Mandel, J., Brezina, M.: Convergence of Algebraic Multigrid based on Smoothed Aggregation. Numer. Math. 88, 559–579 (2001)
An Additive Matrix Preconditioning Method with Application for Domain Decomposition and Two-Level Matrix Partitionings Owe Axelsson Department of Information Technology, Uppsala University, Sweden & Institute of Geonics, AS CR, Ostrava, The Czech Republic
[email protected]
Abstract. Domain decomposition methods enable parallel computation during the iterative solution of partial differential equations of elliptic type. In order to limit the number of iterations one must then use an efficient preconditioner which can significantly reduce the condition number of the given problem and, at the same time, is highly parallelizable. In this paper we describe and analyse such a preconditioner, which is based on local subdomain inverse matrices and is applicable for various types of domain decomposition methods.
1
Introduction
The paper analyses a particular preconditioner,primarily to be used in a conjugate gradient method, for nonsingular matrices in the form ⎤ ⎡ A1 0 A13 (1) A = ⎣ 0 A2 A23 ⎦ A31 A32 A3 Such matrices arise in many applications of which we here just mention the following three: (i) If a domain of definition (Ω) of an elliptic problem discretized by a finite element method, is divided in two parts, the matrix blocks Ai , i = 1, 2 correspond to interior node points in the subdomains Ω1 and Ω2 ,respectively and A3 corresponds to the nodepoints on the common border, Ω¯1 ∩ Ω¯2 between the two subdomains, i.e. an edge in a 2D and a face in a 3D problem. Unless one applies some parallelizable method for the sub-blocks in A3 , the parallelism is here limited to two sub-blocks. Domain decomposition methods, also in a much more general context, have been presented in a number of publications, see e.g. [5,7,9,10]. (ii) If the domain is partitioned in a number of odd-even ordered subdomains then Ai , i=1,2 correspond to the odd- respectively the even-numbered subdomains and A3 to the common edges in a 2D and faces/edges in a 3D I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 76–83, 2010. c Springer-Verlag Berlin Heidelberg 2010
An Additive Matrix Preconditioning Method
77
problem. A 3D domain can be first partitioned consecutively in 2D domains, where we use alternatively an odd-even and even-odd orderings. The sub-blocks Ai , i = 1, 2 consist themselves of uncoupled, diagonal matrix sub-blocks, one for each subdomain. Therefore this method can contain a high degree of parallelism. Here, however, A3 has a fairly large size, and the solution of the arising corresponding systems needs special attention. This problem is avoided if the following variant of this DD method is used. (iii) Partition first the given matrix in two-by-two blocks, where the lower diagonal block corresponds to a set of coarse,vertex mesh points and the upper, pivot block to the remaining (fine) mesh nodes. Preconditioners for such matrix partitionings have been discussed in a number of publications, see e.g. [4], and will not be commented on further in this paper. Instead we want to apply the subdomain partitioning to just the pivot block matrix, i.e. the matrix that remains if the solution at all coarse mesh nodes have been chosen to be zero. This is an important part in the preconditioning of matrices partitioned in two-by-two block form, see e.g. [4,1,2]. In this case the corresponding matrix A3 for the pivot block consists of uncoupled sub-blocks and the solution of the arising systems with this matrix can be done efficiently in parallel as well. The sub-blocks are actually not always uncoupled, as some nodes on two different edges might be coupled. If such a coupling exists it can be deleted and compensated for by the method of deletion and diagonal compensation, see [3] for details.This is done only to construct the preconditioner,the given matrix will be unchanged. As will be seen in Section 2, the preconditioner will be of additive form, where each term corresponds to a subdomain. Such preconditioners have been discussed in [2,7],see also references cited therein. Because of the additive form, they can be taken in any order. In fact, the initial odd-even partitioning has been done only to simplify the analysis of the condition number of the resulting preconditioned matrix and is not essential for the actual implementation of the method. We shall show that by use of certain perturbations of the sub-blocks in positions (3,3) of the inverse matrices, we can derive a condition number bound, κ of the preconditioned matrix which, for symmetric and positive definite (spd) matrices depends −1/2 −1/2 1/2 −1 )} , only on the CBS constant, γ = {ρ(A3 (A31 A−1 1 A13 + A32 A2 A23 )A3 2 1/2 as κ =1/(1-γ ) ,where ρ(·) denotes the spectral radius. Note that the CBS constant measures the strength of the off-diagonal blocks in relation to the diagonal blocks of the matrix in (1). The method can be extended by recursion in various ways. In the domain decomposition method in (i), each of the subdomains Ω1 , Ω2 can themselves be partitioned in two sub-blocks and so on. In the context of a recursive odd-even subdomain ordering, the arising set of coarse mesh vertex points can be ordered in such subsets. In Section 2 of the paper we describe the additive preconditioner and derive an expression for a resulting error matrix. The analysis of the corresponding condition number is found in Section 3. Some concluding remarks are found in Section 4.
78
O. Axelsson
The notation A ≥ B, where A and B are positive semidefinite, used in the following, means that A − B is positive semidefinite.
2
Error Matrix for an Additive Preconditioning Method (1)
(2)
(i)
Given a matrix in the form (1), we partition A3 = A3 + A3 , where A3 , i = 1, 2 arise from contributions to edge nodes from the odd and even numbered elements, respectively. The preconditioner takes the form B = B1 + B2 , where ⎡ ⎤ ⎡ ⎤ B11 0 B13 I1 0 0 ⎢ ⎥ 1 = ⎣ 0 0 0 ⎦ B1 = ⎣ 0 0 0 ⎦ and B1 A −1 (1) 0 0 I3 B31 0 S3 ⎡ ⎤ ⎡ ⎤ 0 0 0 0 0 0 ⎢ ⎥ 2 = ⎣0 I2 0 ⎦ B2 = ⎣0 B22 B23 ⎦ and B2 A (2) −1 0 0 I3 0 B32 S3 2 − E, where 1 + A Here the matrix A has been split as A = A ⎡ ⎡ ⎤ ⎤ ⎡ ⎤ A1 0 A13 0 0 0 00 0 1 = ⎣ 0 0 0 ⎦ , A 2 = ⎣0 A2 A23 ⎦ , E = ⎣0 0 0 ⎦ A A31 0 A3 0 A32 A3 0 0 A3 and S3 = A3 − A3i A−1 1 Ai3 , i = 1, 2. The matrices Bi,j need not be specified here as they will not appear in the final result. A computation shows that (i)
1 + B1 (A 2 + B2 (A 2 − E) + B2 A 1 − E)) − A ABA − A = A(B1 A ⎡ ⎤ 00 0 2 − E) + B2 (A 1 − E)) − A = A(I + ⎣0 0 0 ⎦ + B1 (A 0 0 I3 ⎤ ⎡ ⎤ ⎤ ⎡ ⎤⎡ ⎤⎡ ⎡ 0 0 0 I1 0 0 0 0 0 A1 0 A13 0 0 A13 = ⎣0 0 A23 ⎦ + ⎣ 0 0 0 ⎦ ⎣0 A2 A23 ⎦ + ⎣0 I2 0 ⎦ ⎣ 0 0 0 ⎦ 0 0 I3 0 0 I3 A31 0 0 0 0 A3 0 A32 0 2 − E)B1 (A 2 − E) + (A 1 − E)B2 (A 1 − E) +(A ⎤⎡ ⎤⎡ ⎤ ⎡ ⎤ ⎡ 00 0 0 0 A13 0 0 0 0 0 0 ⎢ ⎥ = ⎣ 0 0 A23 ⎦ + ⎣0 A2 A23 ⎦ ⎣0 0 0 ⎦ ⎣0 A2 A23 ⎦ −1 (1) A31 A32 A3 0 A32 0 0 A32 0 0 0 S3 ⎤⎡ ⎡ ⎤⎡ ⎤ 00 0 A1 0 A13 A1 0 A13 ⎢ ⎥ + ⎣ 0 0 0 ⎦ ⎣0 0 0 ⎦ ⎣ 0 0 0 ⎦ (2) −1 A31 0 0 A31 0 0 0 0 S3
An Additive Matrix Preconditioning Method
⎡
i.e.
⎢ ABA − A = ⎣
(2) −1
A13 S3
A13
0 A31
A13
0
79
⎤
⎥ (1) −1 A23 S3 A32 A23 ⎦ := F A32 A3
(2)
Hence it holds ABA − A = F , where F is an error matrix and, if A is spd, A1/2 BA1/2 − I = A−1/2 F A−1/2 .
3
Condition Number Bound for a Perturbed Preconditioner (1)
(2)
(1)
From now on we assume that A is spd. Since A3 = A3 + A3 , where A3 and (2) A3 are the contributions to A3 from odd-and even-numbered elements, respec⎤ ⎡ ⎤ ⎡ 0 0 0 A1 0 A13 tively, it holds that ⎣ 0 0 0 ⎦ and ⎣0 A2 A23 ⎦ are symmetric and positive (1)
(2)
A31 0 A3
0 A32 A3
semidefinite. Hence A3 − A3i Ai −1 Ai3 , i=1,2 are also positive semidefinite, so (i)
(1)
S3
= A3 + A3 − A31 A1 −1 A13 ≥ A3
(1) −1 A3
≥
(2)
(1)
(2)
(2) −1 S3
Hence, (2) −1
A13 S3
(1) −1
A31 ≤ A13 A3
(1) −1 A23 S3 A32
so, by (2)
(2) −1
or A3
≤
A31 (2) −1 A23 A3 A32
(1) −1
≥ S3
and, similarly,
≤ A1 ≤ A2
⎤ A1 0 A13 ABA − A ≤ ⎣ 0 A2 A23 ⎦ = A A31 A32 A3 ⎡
Therefore A1/2 BA1/2 ≤ 2I, and λmax (BA) ≤ 2. To derive a lower bound we shall use perturbations. Let then B be computed as above and let = B + Δ, B (3) ⎤ ⎡ 00 0 to be used as a preconditioner to A, where Δ is such that AΔA = δ ⎣0 0 0 ⎦, 0 0 A3 δ ≥ 0. It follows that ⎡ ⎤⎡ ⎤ −1 ⎤ ⎡ 0 0 A−1 0 0 0 00 0 1 A13 S3 −1 ⎦ ⎣ 0 0 0 ⎦. 0 0 0 ⎦⎣ Δ = δ ⎣0 0 A−1 2 A23 S3 −1 −1 −1 −1 −1 0 0 A3 S3 A31 A1 S2 A32 A2 −S3 −1 0 0 −S3 Here the arising actions of Ai can be computed locally elementwise. The Schur −1 complement matrix S3 = A3 − A31 A−1 1 A13 − A32 A2 A23 can be assembled from local Schur complements exactly. However,the solution of the arising systems
80
O. Axelsson
with S3 can be too costly to implement for just a perturbation matrix. Therefore, this method is less practical and should be seen only as a benchmark method to show what can be ideally achieved. Due to space limitations we will not discuss other alternatives here. One method which can be used in a multilevel context,is to replace S3 with a corresponding version on a coarser mesh and interpolate the so computed values to finer meshpoints. The intention is to keep δ sufficiently big but also sufficiently small not to − A = ABA − A + AΔA. increase the upper bound too much. It holds ABA Let now ξ be a positive number sufficiently big to make (ξ +1)ABA−A ≥ 0. It holds (ξ+1)ABA−A = ξA+(ξ+1)(ABA−A)+(ξ+1)AΔA. Here ABA−A = F, ⎡ ⎤ 00 0 where F is defined in (2) and AΔA = δ ⎣0 0 0 ⎦. 0 0 A3 Hence ⎡ ⎤ A1 0 αA13 ˜ −A= ξ⎣ 0 A2 αA23 ⎦ (ξ + 1)ABA αA31 αA32 α2 D3 ⎡ ⎤ (2) −1 A31 0 βA13 A13 S3 ⎢ ⎥ (1) −1 +(ξ + 1) ⎣ ⎦ 0 A23 S3 A32 βA23 βA31
(1)
(2)
βA32 ⎡ β 2 (S⎤ 3 + S3 ) 00 0 +((2ξ + 1) + (ξ + 1)δ − 2β 2 (ξ + 1)) ⎣0 0 0 ⎦ 0 0 A3 (4) (1) (2) −1 where D3 = A31 A−1 A + A A A and S + S = 2A − D . 13 32 2 23 3 3 1 3 3 Further ξ, α, β are positive numbers that satisfy the following system of equations, ξα + (ξ + 1)β = 2ξ + 1, here equating the off-diagonal entries here equating the D3 -terms ξα2 = (ξ + 1)β 2 , 2(ξ + 1)β 2 = (2ξ + 1) + (ξ + 1)δ, here equating the A3 -terms.
(5)
Theorem 1. Assume that A, defined in (1), is symmetric and positive definite be defined by (3).Then λmin (BA) ≥ 1/(ξ + 1), λmax (BA) ≤ 2 + δ/(1 − and let B −1/2 2 γ ) where ξ = ξ(δ) satisfies the equations (5) and γ = {ρ(A3 (A31 A−1 1 A13 + −1/2 1/2 −1 A32 A2 A23 )A3 ) . As δ → 0,then ξ → ∞, and the condition number is 1 δ 1/2 asymptotically bounded by κ(BA) ≤ 2√ (2δ −1/2 + 1−γ 2 } and minδ≥0 κ(BA) ≤ 2 1/(1 − γ 2 )1/2 .
Proof. It holds ⎡ ⎤ ⎡ ⎤⎡ ⎤ ⎤⎡ 0 0 I1 A1 0 αA13 A1 0 0 I1 0 αA1 −1 A13 ⎣ 0 0 I2 0 ⎦ ⎣ 0 A2 0⎦ ⎣ 0 I2 αA2 −1 A23 ⎦ A2 αA23 ⎦ = ⎣ −1 −1 αA31 αA32 α2 D3 0 0 0 αA31 A1 αA32 A2 I3 0 0 I3
An Additive Matrix Preconditioning Method
81
and ⎡ ⎢ ⎣
(2) −1
A13 S3
0 βA31
A13
0
⎤
βA13
⎥ (1) −1 ⎦= A23 S3 A32 βA23 (1) (2) 2 βA32 β (S3 + S3 )
⎤⎡ ⎤⎡ ⎤ 00 0 0 0 A13 0 0 0 ⎥ ⎣0 0 0 ⎦ ⎢ ⎣0 0 0 ⎦ ⎣ 0 0 0 ⎦ + −1 (2) (2) (2) 0 0 βS3 A31 0 βS3 0 0 S3 ⎤⎡ ⎡ ⎤⎡ ⎤ 00 0 00 0 0 0 0 ⎥ ⎣0 0 A23 ⎦ ⎢ 0 ⎦ ⎣0 0 0 ⎦ ⎣0 0 (1) (1) (1) −1 0 0 βS3 0 A32 βS3 0 0 S3 ⎡
It follows that the first two terms in (4) are positive semidefinite. Since by (5) ≥ A. Further it holds the last term is zero, it follows that (ξ + 1)ABA T = sup xT ABAx/x λmax (BA) Ax ≤ 2 + sup xT AΔAx/xT Ax x=0
= 2 + δ sup xT3 A3 x3 /xT Ax
x=0
x=0
= 2 + δ sup xT3 A3 x3 /xT3 S3 x3 = 2 + δ/(1 − γ 2 ), x=0
−1 where S3 = A3 − A31 A−1 1 A13 − A32 A2 A23 . A computation, using (5), shows 1/2 1/2 β, (ξ + 1)√ ((ξ + 1)1/2 + ξ 1/2 )β = 2ξ + 1, and 1/(ξ + 1) = that α = ((ξ + 1)/ξ)
1/2 (1 + ζ + 2ζ )/ (1 + ζ)δ ≤ 2 2δ, where ζ = ξ/(ξ + 1). Hence ζ < 1 and as √ ≥ 1/(ξ + 1) = 2 2δ. For the δ → 0 it holds ξ → ∞ and ζ → 1, so λmin (BA) condition number it holds,
≤ κ(BA)
2 + δ/(1 − γ 2 ) √ , 2 2δ
˜ ≤ √ which is minimized for δ = 2(1 − γ 2 ), so asymptotically, min κ(BA δ>0
1 1−γ 2
Remark 3.1. If we replace A3 in the perturbation matrix AΔA with the matrix S3 , then ⎡ ⎤ −1 ⎤ ⎡ 0 0 A−1 0 0 0 1 A13 S3 −1 ⎦ ⎣ ⎦ 0 0 0 Δ = δ ⎣0 0 A−1 (6) 2 A23 S3 −1 −1 A31 A−1 A A −I 0 0 −S3 32 3 1 2 ≥ 1/(ξ + Using the same notations as before,it can be shown that λmin (BA) 1) = δ and that the upper eigenvalue bound now becomes λmax (BA) ≤ 2 + δ. ≤ 2/δ + 1, i.e., does Therefore,in this case the condition number satisfies κ(BA) not depend on γ. Typically we let δ = 1. However,we can choose δ arbitrarily large. In the limit, δ → ∞, we get a method which just corresponds to solving the −1 reduced,Schur complement equation, S3 x3 = f3 − A31 A−1 1 x1 − A32 A2 x2 , where
82
O. Axelsson
Ax = F , F = (f1 , f2 , f3 )T , which is an alternative to using the perturbation method. In the latter however, we do not need to solve the Schur complement system very accurately, which is an advantage of it. Note that each action of Δ in (6), in addition to the two local element solutions with matrices Ai , i = 1, 2 requires only one Schur complement equation solution.
4
Concluding Remarks
For domains partitioned in two parts it is known, see e.g.[9,10] that γ 2 = 1−O(h), and hence for partitioning in subdomains, that γ 2 = 1 − O(h/H), where h, H are the characteristic meshsizes for the finest elements and for the macroelements, respectively. Hence it follows from Theorem 1 that the condition num = O((h/H)−1/2 . Therefore, the number of conjugate gradient iteraber κ(BA) tions to solve a system with matrix A, using the perturbed preconditioner with = B(δ), B δ = 2(1 − γ 2 ) = O(h/H), increases at most as O((h/H)−1/4 ), h → 0, which implies a fairly modest increase. For instance, if h = H/16 decreases to h = H/256, then the number of iterations approximately only double. An elementwise construction of preconditioners can be applied also for preconditioning of the Schur complement matrix, which arises in the construction of preconditioners for matrices partitioned in two-by-two blocks, see [2], [6] and [8] for further details. When the method is extended by recursion, an optimal sequence of mesh division parameters mk , where hk+1 = hk /mk , mk ≥ 2 has been derived in [2]. The method approaches then rapidly, asymptotically an optimal order method, i.e., the total computational complexity becomes proportional to the degrees of freedom at the finest mesh level, when the number of levels, k increases When solving the arising block matrix systems, one can use inner iterations for the subblocks, see e.g., [2]. The preconditioner can also be used for nonsymmetric matrices but the corresponding convergence theory is less developed, see however [8] for an approach. Here one must in general use some form of a generalized conjugate gradient method.
Acknowledgements The work of the author was supported by the Academy of Sciences of the Czech Republic through projects AV0Z30860518 and 1ET400300415.
References 1. Axelsson, O.: Iterative Solution Methods. Cambridge University Press, New York (1994) 2. Axelsson, O., Blaheta, R., Neytcheva, M.: Preconditioning for boundary value problems using elementwise Schur complements. SIAM J. Matrix Analysis (to appear) (2009) 3. Axelsson, O., Kolotilina, L.: Diagonally compensated reduction and related preconditioning methods. Numer. Linear Algebra Appl. 1(2), 155–177 (1994)
An Additive Matrix Preconditioning Method
83
4. Axelsson, O., Vassilevski, P.S.: Variable-step multilevel preconditioning methods. I. Selfadjoint and positive definite elliptic problems. Numer. Linear Algebra Appl. 1(1), 75–101 (1994) 5. Dryja, M., Sarkis, M.V., Widlund, O.: Multilevel Schwarz methods for elliptic problems with discontinuous coefficients in three dimensions. Numer. Math. 72(3), 313–348 (1996) 6. Kraus, J.K.: Algebraic multilevel preconditioning of finite element matrices using local Schur complements. Numer. Linear Algebra Appl. 13(1), 49–70 (2006) 7. Mandel, J., Dohrmann, C.R.: Convergence of a balancing domain decomposition by constraints and energy minimization. Numer. Linear Algebra Appl. 10(7), 639–659 (2003) 8. Neytcheva, M., B¨ angtsson, E.: Preconditioning of nonsymmetric saddle point systems as arising in modelling of viscoelastic problems. ETNA 29, 193–211 (2007– 2008) 9. Toselli, A., Widlund, O.: Domain decomposition methods — algorithms and theory. Springer Series in Computational Mathematics, vol. 34. Springer, Berlin (2005) 10. Quarteroni, A., Valli, A.: Domain decomposition methods for partial differential equations. Numerical Mathematics and Scientific Computation. Oxford University Press, New York (1999)
Numerical Study of AMLI Methods for Weighted Graph-Laplacians Petia Boyanova and Svetozar Margenov Institute for Parallel Processing Bulgarian Academy of Sciences Acad. G. Bonchev, bl. 25A 1113 Sofia, Bulgaria
[email protected],
[email protected]
Abstract. We consider a second-order elliptic problem in mixed form that has to be solved as a part of a projection algorithm for unsteady Navier-Stokes equations. The use of Crouzeix-Raviart non-conforming elements for the velocities and piece-wise constants for the pressure provides a locally mass-conservative approximation. Since the mass matrix corresponding to the velocities is diagonal, these unknowns can be eliminated exactly. We address the design of efficient solution methods for the reduced weighted graph-Laplacian system. Construction of preconditioners based on algebraic multilevel iterations (AMLI) is considered. AMLI is a stabilized recursive generalization of two-level methods. We define hierarchical two-level transformations and corresponding block 2x2 splittings locally for macroelements associated with the edges of the coarse triangulation. Numerical results for two sets of hierarchical partitioning parameters are presented for the cases of two-level and AMLI methods. The observed behavior complies with the theoretical expectations and the constructed AMLI preconditioners are optimal.
1
Introduction
The aim of the current article is to study the convergence of algebraic multilevel iterations (AMLI) method for weighted graph-Laplacian systems resulting from mixed finite element approximation of a second-order elliptic boundary value problem with Crouzeix-Raviart non-conforming elements for the vector valued unknown function and piece-wise constants for the scalar valued unknown function. The considered problem: u + ∇p = f
in Ω,
∇ · u = 0 in Ω,
u · n = 0 on ∂Ω,
(1)
where Ω is a convex polygonal domain in IR2 and f is a given smooth vector valued function, has to be solved at the projection step of a composite time stepping method for unsteady Navier-Stokes equations. In the paper we will use the CFD terminology and refer to the vector u as a velocity and p as a I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 84–91, 2010. c Springer-Verlag Berlin Heidelberg 2010
Numerical Study of AMLI Methods for Weighted Graph-Laplacians
85
pressure. The chosen FEM discretization provides a locally mass-conservative algorithm (see [3] and the references therein). The Crouzeix-Raviart mass matrix is diagonal and we can eliminate the velocity unknowns exactly. The reduced matrix for the pressure has a structure of a weighted graph-Laplacian (see [8]). The rest of the paper is organized as follows. In Section 2 we introduce the general framework of AMLI methods. In Section 3 we present the preconditioner construction for the considered case of weighted graph-Laplacian systems. Section 4 is devoted to analysis of the numerical results of applying a preconditioned conjugate gradient (PCG) method with two-level and AMLI preconditioners for the 2-D model problem. The final Section 5 contains a summary of the observed behavior and suggestions for future work.
2
AMLI Method
The framework of the algebraic multilevel iterations (AMLI) method was originally proposed in [1] for the case of linear conforming FEs. More recently, AMLI methods for non-conforming FEs and discontinuous Galerkin (DG) systems were developed, see [5,6,7,9] and the references therein. AMLI is a recursive generalization of two-level preconditioners that has optimal behavior due to a proper Schur complement stabilization using Chebyshev polynomials (see also [2]). We consider a sequence of nested triangulations T0 ⊂ T1 ⊂ · · · ⊂ Tl of the domain Ω, constructed by recursive uniform refinement of a given initial mesh. Next we define the corresponding spaces of piece-wise constant functions V (0) ⊂ V (1) ⊂ · · · ⊂ V (l) , the spaces of degrees of freedom V(0) , V(1) , · · · , V(l) , the numbers of degrees of freedom n0 < n1 < · · · < nl , and the system matrix associated with each triangulation level A(0) , A(1) · · · A(l) . We are interested in the solution of the discrete problem (2) at the finest mesh. A(l) x = b
(2)
Let us define a hierarchical 2x2 block partitioning of the system matrix: (k) A (k) }nk − nk−1 T A (k) (k) (k) (k) 11 12 =J A J A = , (k) A (k) }nk−1 A 21 22
(3)
where J (k) is a properly defined nonsingular hierarchical transformation matrix. The generated splitting in the space of degrees of freedom is characterized by the constant γ (k) in the strengthened Cauchy-Bunyakowski-Schwarz (CBS) inequality, associated with the cosine between the two subspaces. Regarding (0) this partitioning, the AMLI method is defined = A(0) and for C as follows: −1 (k) (k) A (k) −1 C −T 0 IC 11 12 11 k = 1, . . . , l, C (k) = J (k) J (k) , where the (k) (k−1) A˜ 0 I A 21 (k) are symmetric positive definite approximations of A (k) , that satisfy blocks C 11
11
v ≤ vt C v ≤ (1 + b)vt A v, for all v ∈ Rnk −nk−1 , vt A 11 11 11 (k)
(k)
(k)
(4)
86
P. Boyanova and S. Margenov
The Schur complement approximation is stabilized by
˜(k−1)−1 = I − Pβ C (k−1)−1 A(k−1) A(k−1)−1 . A
The acceleration polynomial is explicitly defined by
1 + α − 2t 1+α Pβ (t) = 1 + Tβ / 1 + Tβ , 1−α 1−α
where Tβ is the Chebyshev polynomial of degree β with L∞ -norm 1 on (−1, 1). The main result concerning the efficiency of the AMLI preconditioner was originally stated in [1] and is reformulated in the following theorem. Theorem 1. Let the following three assumptions be fulfilled: (i) The hierarchical (k) = A(k−1) ; (ii) There exists an transformation matrices are sparse and A 22 (k) absolute constant γ such that γ ≤ γ < 1 for all k > 0; (iii) √ 1 2 < β < ρ, 1−γ
where ρ = min nk /nk−1 . k
Then there exists α ∈ (0, 1), such that the AMLI preconditioner C (l) has opti−1 mal relative condition number κ(C (l) A(l) ) = O(1), and the total computational complexity is O(nl ). −1
1+b It is known that the relative condition number κ(C (l) A(l) ) = O( 1−γ 2 ). For the case β = 2 the coefficients q0 and q1 of the optimal stabilization polynomial Q1 = (1 − P2 (t))/t = q0 + q1 t, which has to be evaluated in the AMLI W-cycle, are given by (see [1,4]):
q0 =
3
2 −1
, q1 = (5) 2 2 2 −b + 1 + b + b − γ 1 − γ + b(1 + 2b − 2 1 + b + b2 − γ 2 )
Preconditioner Construction for Weighted Graph-Laplacian
In the case of general triangular mesh the reduced weighted graph-Laplacian (k) (k) A(k) = {ai,j } has the following properties: ai,j = 0 only for elements with in(k)
dexes i and j that share a common edge. Then ai,j = −wi,j , i = j, where wi,j > (k) 0 are weights that depend on the angles of the triangles and ai,i = i=j wi,j . In this paper we consider 2-D uniform mesh of right-angled triangles. Then the reduced system for the pressure of the discretized problem (1) corresponds to the T-shaped four point stencil shown in Fig. 1. We assume that the finest triangulation is obtained by consecutive refinement of a coarse triangulation. Let each refined mesh be obtained by dividing the current triangles in four congruent ones connecting the midpoints of the sides. We use an approach proposed in [9] and define two-level hierarchical transformations and corresponding 2x2 block partitionings at each refinement level locally for macroelements associated with the edges of the coarser triangulation. Let us
Numerical Study of AMLI Methods for Weighted Graph-Laplacians
87
−1 4 −1
−2
Fig. 1. Schur complement four point stencil for the pressure
consider two consecutive triangulations Tk−1 ⊂ Tk and a decomposition of the (k) (k) (k) = weighted graph-Laplacian, A(k) = A e∈E Ae , e∈E Ae , as a sum (assembly) of local matrices associated with the set of edges E of the coarser (k) one. We define Ae using a weight parameter t ∈ (0, 1) so that the contribution of the links between the interior nodes among the (macroelement) edge matrices of the current coarse triangle are distributed correctly. Following the numbering (k) from Fig. 2, we introduce the local (macroelement) matrix Ae , corresponding to a hypotenuse, in the form: ⎡
(k)
A(k) = Ae;H e
t + 1 −2t
⎢ ⎢ −2t ⎢ ⎢ t−1 ⎢ 2 ⎢ t−1 ⎢ ⎢ 2 =⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣
t−1 2
⎤
t−1 2
2t 5−t 2 5−t 2
t + 1 −2t −2t −2 −2 (k)
2t
t−1 2 t−1 2
⎥ ⎥ ⎥ ⎥ −2 ⎥ ⎥ −2 ⎥ ⎥ ⎥. ⎥ t−1 t−1 ⎥ 2 2 ⎥ ⎥ ⎥ ⎥ ⎥ 5−t ⎦ 2 5−t 2
(k)
The local (macroelement) matrix Ae = Ae;C , corresponding to a cathetus, is introduced in a similar way. Then we define the (macroelement) local transfor(k) mation matrix Je as ⎡
Je(k)
⎤ 1 p q q ⎢1 q p q ⎥ ⎢ ⎥ ⎢1 q q p ⎥ ⎢ ⎥ ⎢ 1 p q q ⎥ ⎥, =⎢ ⎢ 1 q p q ⎥ ⎢ ⎥ ⎢ 1 q q p ⎥ ⎢ ⎥ ⎣r r r r ⎦ r r r r
√ where p and q are parameters of the splitting, and r = 2/2 is a scaling factor. (k) A (k) T A (k) (k) (k) (k) e:11 e:12 , and γ 2 ≤ max{γ 2 , γ 2 } = γ 2 e = Je Ae Je Then A = e;H e;C M (k) A (k) A e:21
e:22
(see e.g. [9]). Local analysis has been performed in [10] for the studied case of 2-D regular right-angled triangulation that derived the following estimates for 2 γ: γM = 0.73 for the parameter setting (p = 1, q = −0.5), and the improved
88
P. Boyanova and S. Margenov
Fig. 2. Macroelement of two adjacent triangles from Tk with a common hypotenuse 2 estimate γM = 0.58 for (p = 1, q = −0.1). The AMLI preconditioners for both parameter settings comply with the three assumptions of Theorem 1 for β = 2 or β = 3.
4
Numerical Results
The aim of the performed experiments is to convey a numerical study of the properties of the proposed hierarchical partitioning of weighted graph-Laplacians and the behavior of the corresponding AMLI method. We use PCG method to solve the system (2) for the case of Dirichlet boundary conditions in the unit square Ω = (0, 1) × (0, 1), discretized with a mesh size h. The collected data in the tables that follow represent number of PCG iterations needed to reduce the residual norm by a factor for a zero right hand side and random initial guess. As the constructed AMLI preconditioner is a recursive generalization of a two-level method with a two-level partitioning that has the same properties at each level, we first examine the case of PCG with two-level multiplicative preconditioner based on the splitting (3). The two cases of partitioning parameters (p = 1, q = −0.5) and (p = 1, q = −0.1) are considered. As can be seen in Table 1, the number of PCG iterations for both settings do not depend on the size of the problem but only on the reduction factor , namely linearly w.r.t. ln(1/ ). These results agree with the theoretical expectations, as it is known that the number of iterations nit needed for achieving a relative error for the PCG method is −1
−1
2
nit ≥ 12 κ(C (k) A(k) ) ln (2/ ) + 1 and κ(C (k) A(k) ) ≤ 1/(1 − γ (k) ) where γ (k) ≤ γ < 1 for all k > 0. Next, we consider PCG with AMLI preconditioner with polynomial degree 2 β = 2 and polynomial coefficients calculated by (5) using the estimate γ 2 ≤ γM . In all presented results, the coarsest level of triangulation has mesh size h = 1/16, that corresponds to 512 degrees of freedom. For the experiments summarized in Table 2 we use inner iterations to achieve high accuracy solution for the pivot block systems that let us assume b = 0 in (4). In both cases of splitting parameters the number of iterations is stabilized but for the setting that has a better CBS constant estimate this happens earlier and the convergence rates are considerably better. As opposed to a preconditioner that does not use pivot approximation, we test the convergence of AMLI when the pivot blocks are approximated by their scaled diagonals. It is known that b is uniformly bounded and does not depend
Numerical Study of AMLI Methods for Weighted Graph-Laplacians
89
Table 1. Two-level method: number of PCG iterations PCG stop criteria
(p = 1, q = −0.5) (p = 1, q = −0.1)
= 10−3 = 10−6 = 10−9 = 10−3 = 10−6 = 10−9
32 (2048) 4 7 11 4 7 10
h−1 (Number of degrees of freedom) 64 128 256 (8192) (32768) (131072) 4 4 4 8 8 8 12 12 12 4 4 3 7 7 7 10 10 10
512 (524288) 4 8 12 3 7 10
Table 2. W-cycle, no pivot approximation: number of PCG iterations PCG stop criteria
(p = 1, q = −0.5, 2 γM = 0.73) (p = 1, q = −0.1, 2 γM = 0.58)
= 10−3 = 10−6 = 10−9 = 10−3 = 10−6 = 10−9
1 (2048) 4 7 11 4 7 10
Number of levels of refinement (Number of degrees of freedom) 2 3 4 (8192) (32768) (131072) 8 8 9 16 18 19 25 28 29 4 4 4 8 8 8 11 12 11
5 (524288) 8 18 28 4 8 11
on the problem size (see e.g. [10]). Therefore it is enough to calculate it for a rather coarse mesh and use the obtained value in (5). We construct diagonal pivot approximation with b = 4.67 for (p = 1, q = −0.5) and b = 6.2 for (p = 1, q = −0.1). The AMLI method results are presented in Table 3. As expected, the two parameter settings differ in convergence rates, the one with better CBS constant estimate achieving better performance. The difference in number of iterations for the two AMLI preconditioner types recorded in Table 2 and Table 3 is the theoretically expected. The results presented so far concern AMLI preconditioners that have strong theoretical analysis background. However, the observations of the performed numerical experiments let us think we can go further and try and improve the AMLI method for the setting (p = 1, q = −0.5), using feedback from the program runs. One motivation is that the two-level method is a good practical indicator for the qualities of the hierarchical partitioning defined. As can be seen from Table 1, the PCG for the two cases of parameter settings perform in a very similar way, which let us assume that they also have similar splitting characteristics. The second motivation for the tests that follow, is that we have both the theoretical estimate of the CBS constant for (p = 1, q = −0.5) and the practical numerical proof that the defined two-level and AMLI preconditioners behave in the ex2 = 0.73 is too pected stable way. Next, we assume that the derived estimate γM pessimistic for the case (p = 1, q = −0.5) and we could use the one, obtained for
90
P. Boyanova and S. Margenov Table 3. W-cycle, diagonal pivot approximation: number of PCG iterations PCG stop criteria
(p = 1, q = −0.5 2 γM = 0.73) (p = 1, q = −0.1 2 γM = 0.58)
= 10−3 = 10−6 = 10−9 = 10−3 = 10−6 = 10−9
1 (2048) 8 16 24 9 20 30
Number of levels of refinement (Number of degrees of freedom) 2 3 4 (8192) (32768) (131072) 18 18 20 38 45 47 58 69 74 9 9 9 20 20 20 30 30 30
5 (524288) 19 47 74 9 20 30
Table 4. W-cycle, (p = 1, q = −0.5, γ 2 = 0.58): number of PCG iterations PCG stop criteria
no pivot approx. diagonal pivot approx.
= 10−3 = 10−6 = 10−9 = 10−3 = 10−6 = 10−9
1 (2048) 4 7 11 8 16 24
Number of levels of refinement (Number of degrees of freedom) 2 3 4 (8192) (32768) (131072) 4 4 4 8 8 8 11 13 12 9 9 9 19 19 19 28 28 29
5 (524288) 4 8 12 9 19 29
2 the case (p = 1, q = −0.1), namely γM = 0.58. The results presented in Table 4 are for an AMLI preconditioner with polynomial coefficients calculated using this more optimistic bound. As can be seen, the PCG converges, and the AMLI is stabilized at lower iteration numbers.
5
Concluding Remarks
The purpose of the performed numerical study was to practically examine the properties of AMLI method for weighted graph-Laplacians, following three logical steps. First the two-level method was considered. The results show that the proposed two-level hierarchical transformation defines a proper partitioning of the system matrix and the corresponding two-level preconditioner performs as expected. This fact let us proceed with the multilevel generalization - the AMLI method. The qualities of the preconditioner are most transparently observed when b = 0. Then for both examined splitting parameter sets, the PCG with 2 = 0.58). AMLI preconditioner behaves similarly to the two-level method (for γM Preconditioners with no pivot block approximations are too expensive to use in practice, due to the high computational cost of solving systems with the pivot blocks. That is why, as a next step, we construct a preconditioner with a diagonal pivot approximation. This is a good first choice for pivot block approximation and the AMLI is stabilized again, though at greater iteration numbers.
Numerical Study of AMLI Methods for Weighted Graph-Laplacians
91
When choosing pivot block approximations, it is important to find the balance between their good conditioning and the computational efficiency of solving systems with them. The first affects the convergence rates of the AMLI method, and the second — the computational cost of each iteration. This is the reason why one important topic for future work is the examination of possibilities for improvement of the pivot block approximation for weighted graph-Laplacians, so that the overall computational complexity is decreased. Another important aspect of our future work is the incorporation of the constructed optimal AMLI method for weighted graph-Laplacians in the composite time stepping algorithm for unsteady Navier-Stokes equations. The results presented in this paper concern regular meshes but this approach is applicable for more general irregular triangulation. In the later case the convergence of the AMLI method would depend on the properties of the initial grid, namely, the angles of the coarsest elements used. Acknowledgments. This work is supported by the Bulgarian Science Foundation grants DO02–147 and DO02–338.
References 1. Axelsson, O., Vassilevski, P.S.: Algebraic multilevel preconditioning methods II. SIAM J. Numer. Anal. 27, 1569–1590 (1990) 2. Vassilevski, P.S.: Multilevel Block Factorization Preconditioners: Matrix-based Analysis and Algorithms for Solving Finite Element Equations. Springer, Heidelberg (2008) 3. Bejanov, B., Guermond, J., Minev, P.: A locally div-free projection scheme for incompressible flows based on non-conforming finite elements. Int. J. Numer. Meth. Fluids 49, 239–258 (2005) 4. Georgiev, I., Kraus, J., Margenov, S.: Multilevel preconditioning of rotated bilinear non-conforming FEM problems. RICAM-Report No. 2006–03 (January 2006) 5. Blaheta, R., Margenov, S., Neytcheva, M.: Robust optimal multilevel preconditioners for non-conforming finite element systems. Numer. Lin. Alg. Appl. 12(5-6), 495–514 (2005) 6. Kraus, J., Margenov, S.: Multilevel methods for anisotropic elliptic problems. Lectures on Advanced Computational Methods in Mechanics, Radon Series Comp. Appl. Math. 1, 47–87 (2007) 7. Kraus, J., Tomar, S.: A multilevel method for discontinuous Galerkin approximation of three-dimensional anisotropic elliptic problems. Num. Lin. Alg. Appl. 15, 417–438 (2008) 8. Margenov, S., Minev, P.: On a MIC(0) preconditioning of non-conforming mixed FEM elliptic problems. Mathematics and Computers in Simulation 76, 149–154 (2007) 9. Lazarov, R., Margenov, S.: CBS constants for multilevel splitting of graphLaplacian and application to preconditioning of discontinuous Galerkin systems. J. Complexity 23(4-6), 498–515 (2007) 10. Boyanova, P., Margenov, S.: Multilevel Splitting of Weighted Graph-Laplacian Arising in Non-conforming Mixed FEM Elliptic Problems. In: Margenov, S., Vulkov, L.G., Wa´sniewski, J. (eds.) NAA 2008. LNCS, vol. 5434, pp. 216–223. Springer, Heidelberg (2009)
A Scalable TFETI Based Algorithm for 2D and 3D Frictionless Contact Problems Zdenˇek Dost´ al1 , Tom´aˇs Brzobohat´ y2 , Tom´aˇs Kozubek1 , 2 Alex Markopoulos , and V´ıt Vondr´ a k1 1
2
ˇ FEECS VSB-Technical University of Ostrava, Department of Applied Mathematics, 17. listopadu 15, CZ-708 33 Ostrava, Czech Republic, ˇ FME VSB-Technical University of Ostrava, Department of Mechanics, 17. listopadu 15, CZ-708 33 Ostrava, Czech Republic
Abstract. We report our recent results in the development of theoretically supported scalable algorithms for the solution of large scale complex contact problems of elasticity. The algorithms combine the TFETI based domain decomposition method adapted to the solution of 2D and 3D frictionless multibody contact problems of elasticity with our in a sense optimal algorithms for the solution of the resulting quadratic programming problems. Rather surprisingly, the theoretical results are qualitatively the same as the classical results on scalability of FETI for the linear elliptic problems. The efficiency of the method is demonstrated by the results of numerical experiments with parallel solution of both coercive and semicoercive 2D and 3D contact problems.
1
Introduction
Observing that the classical Dirichlet and Neumann boundary conditions of contact problems are known only after the solution has been found, it is not surprising that the solution of contact problems is more costly than the solution of the related linear problems with classical boundary conditions. In particular, since the cost of the solution of any problem increases at least linearly with the number of the unknowns, even if we should only copy the results, it follows that the development of a scalable algorithm for contact problems is a challenging task which requires to identify the contact interface in a sense for free. In spite of this, a number of interesting results have been obtained by modifications of the methods that were known to be scalable for linear problems, in particular multigrid (see, e.g., Kornhuber [19], Kornhuber and Krause [20], and Wohlmuth and Krause [24]) and domain decomposition (see, e.g., Dureisseix and Farhat [13], Dost´ al, Gomes, and Santos [7], and Avery et al. [1]. Sch¨ oberl [23] even proved the optimality for his approximate variant of the projection method using a domain decomposition preconditioner and a linear multigrid solver on the interior nodes. For the multigrid-based algorithms, it seems that it was the necessity to keep the coarse grid away from the contact interface (see also Iontcheva and Vassilevski [17]) that prevented the authors to prove the optimality results similar to the classical results for linear problems. I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 92–99, 2010. c Springer-Verlag Berlin Heidelberg 2010
A Scalable TFETI Based Algorithm
93
The point of this paper is to report our optimality results for coercive contact problems of elasticity using TFETI (Total FETI) [9], a variant of the FETI method introduced by Farhat and Roux [16] for parallel solution of linear problems that enforces the prescribed displacements by Lagrange multipliers. For linear problems, the method was considered earlier by Justino, Park, and Felippa [18] and Park, Felippa, and Gumaste [22]. See also the thesis by Of [21]. We use an in a sense optimal “natural coarse grid preconditioning” introduced for linear problems by Farhat, Mandel, and Roux [15]. Since the preconditioning by the “natural coarse grid” uses a projector to the subspace with the solution [8], its application to the solution of variational inequalities does not turn the bound constraints into general bounds and can be interpreted as a variant of the multigrid method with the coarse grid on the interface. This unique feature, as compared with the standard multigrid preconditioning for the primal problem, reduces the development of scalable algorithms for the solution of variational inequalities to the solution of bound and equality constrained quadratic programming problems with the rate of convergence in terms of bounds on the spectrum. For the sake of simplicity, we consider only the frictionless problems of linear elasticity with the linearized, possibly non-matching non-interpenetration conditions implemented by mortars, but the results may be exploited also for the solution of the problems with friction [12] or large deformations with more sophisticated implementation of the kinematic constraints [3]. The basic idea works also in the framework of boundary element methods [2].
2
TFETI and Contact Problems
Assuming that the bodies are assembled from Ns subdomains Ω (s) , the equilibrium of the system may be described as a solution u of the problem min j(v)
subject to
Ns s=1
(s)
BI v (s) ≤ gI
and
Ns s=1
(s)
BE v (s) = o,
(1)
where o denote the zero vector, j(v) is the energy functional defined by j(v) =
Ns 1 s=1
2
T
T
v (s) K (s) v (s) − v (s) f (s) ,
where v (s) and f (s) denote the admissible subdomain displacements and the subdomain vector of prescribed forces, K (s) is the subdomain stiffness matrix, (s) (s) T T BI and BE are the blocks of the matrix B = BIT , BE that correspond to Ω (s) , and gI is a vector collecting the gaps between the bodies in the reference configuration. The matrix BI and the vector gI arise from the nodal or mortar description of the non-penetration conditions, while BE describes the “gluing” of the subdomains into the bodies and the Dirichlet boundary conditions. To simplify the presentation of the basic ideas, we can describe the equilibrium in terms of the global stiffness matrix K, the vector of global displacements u, and the vector of global loads f . In the TFETI method, we have
94
Z. Dost´ al et al.
⎤ u(1) ⎥ ⎢ u = ⎣ ... ⎦ , u(Ns ) ⎡
K = diag(K (1) , . . . , K (Ns ) ),
⎡
⎤ f (1) ⎢ ⎥ and f = ⎣ ... ⎦ , f (Ns )
where K (s) , s = 1, . . . , Ns , is a positive semidefinite matrix. The energy function reads 1 j(v) = v T Kv − f T v 2 and the vector of global displacements u solves min j(v)
subject to
BI v ≤ gI
and BE v = o.
(2)
Alternatively, the global equilibrium may be described by the Karush–Kuhn– Tucker conditions (see, e.g., [6]) Ku = f − B T λ,
λI ≥ o,
λT (Bu − g) = o,
(3)
T T where g = gIT , oT and λ = λTI , λTE denotes the vector of Lagrange multipliers which may be interpreted as the reaction forces. The problem (3) differs from the linear problem by the non-negativity constraint on the components of the reaction forces λI and by the complementarity condition. We can use the left equation of (3) and the sparsity pattern of K to eliminate the displacements. We shall get the problem to find λI ≥ o and RT (f − B T λ) = o,
(4)
1 1 Θ(λ) = − λT BK † B T λ + λT (BK † f − g) − f K † f, 2 2
(5)
max Θ(λ)
s.t.
where
K † denotes a generalized inverse that satisfies KK † K = K, and R denotes the full rank matrix whose columns span the kernel of K. The action of K † can be effectively evaluated by a variant of LU–SVD decomposition [14,10]. Recalling the FETI notation F = BK † B T ,
e = RT f,
G = RT B T ,
d = BK † f − g,
we can modify (4) to min θ(λ)
s.t. λI ≥ o and Gλ = e,
(6)
where 1 θ(λ) = λT F λ − λT d. 2 To replace the equality constraint in (6) by a homogeneous one without changing the non-negativity constraint, we can use the following lemma.
A Scalable TFETI Based Algorithm
95
T Lemma 1. Let GI = RT BIT , GE = RT BE , and G = [GI , GE ] arise from the TFETI decomposition of a coercive contact problem. Then GE is a full rank matrix and
oI λ= (7) GTE (GE GTE )−1 e
I = oI and Gλ = e. satisfies λ Proof. First observe that G = [GI , GE ], GTE ξ
so it is enough to prove that = BE Rξ = o implies ξ = o. Since BE Rξ denotes the jumps of the vector Rξ across the auxiliary interfaces and the violations of the prescribed Dirichlet boundary conditions, it follows that the vector u = Rξ satisfies both the discretized Dirichlet conditions and the “gluing” conditions, but belongs to the kernel of K, which contradicts our assumption that the problem (1) arises from the discretization of a coercive contact problem. Lemma 1 is an important ingredient in the proof of the numerical scalability of our algorithm. Let us point out that the proof is the only place where we use the assumption that our model problem (1) is coercive, i.e., that the prescribed displacements prevent the body from floating. and substituting into (6), we get Denoting λ = μ + λ 1 + const. θ(λ) = μT F μ − μT (d − F λ) 2 After returning to the old notation, problem (6) is reduced to min
1 T λ F λ − λT d 2
s.t. Gλ = o and λI ≥ o
with d = d − F λ.
3
Optimality
Our final step is based on the observation that the last problem is equivalent to min θ(λ)
s.t. Gλ = o and λI ≥ o,
(8)
where θ(λ) =
1 T (λ QF Q + ρP )λ − λT Q d, 2
P = GT (GGT )−1 G,
Q = I − P,
(9)
and ρ > 0. The regularization term ρP is introduced in order to simplify the reference to the results of quadratic programming that assume the regularity of the Hessian matrix of the quadratic form. Problem (8) turns out to be a suitable starting point for the development of an efficient algorithm for variational inequalities due to the following classical estimates of the extreme eigenvalues due to Farhat, Mandel, and Roux [15].
96
Z. Dost´ al et al.
Theorem 1. If the decompositions and the discretizations of given contact problems are sufficiently regular, then there are constants C1 > 0 and C2 > 0 independent of the discretization parameter h and the decomposition parameter H such that C1 ≤ λmin (QF Q|ImQ)
and
λmax (QF Q|ImQ) ≤ QF Q ≤ C2
H , h
(10)
where λmin and λmax denote the extremal eigenvalues of corresponding matrices. The theorem states that if we fix the regularization parameter ρ, then problems (8) resulting from the application of various discretizations and decompositions have the spectrum of the Hessian matrices confined to a positive interval, nonnegativity constraints, and homogeneous equality constraints. If we use SMALBE, our variant of the augmented Lagrangian method with adaptive precision control for the solution of quadratic programming problems with bound and equality constraints, to the solution of (8), then we reduce the solution of (8) to a sequence of bound constrained problems with uniformly bounded spectrum of the Hessian matrix [8,4]. SMALBE enforces the equality constraints by the Lagrange multipliers generated in the outer loop, while the auxiliary bound constrained problems can be solved approximately in the inner loop by MPRGP proposed by Dost´al and Sch¨ oberl [11], an active set based algorithm which uses the conjugate gradient method to explore the current face, the fixed steplength gradient projection to expand the active set, the adaptive precision control of auxiliary linear problems, and the reduced gradient with the optimal steplength to reduce the active set. The unique feature of SMALBE with the inner loop implemented by MPRGP when used to (8) is the bound on the number of iterations whose cost is proportional to the number of variables, so that it can return an approximate solution for the cost proportional to the number of variables. More information about the algorithms and their implementation can be found in [6].
4
Numerical Experiments
The algorithms reported in this paper were tested with the aim to verify their optimality and capability to solve the real-life problems. We first tested the scalability on a classical Hertz 2D problem of Fig. 1 with varying discretizations and decompositions using structured grids. We kept the ratio H/h of the decomposition and the discretization parameters approximately constant so that the assumptions of Theorem 1 were satisfied. The solution with the discretization and the traces of decomposition is in Fig. 2. We used the stopping criterion max {g P (λk , μk , ρ), Gλk } ≤ 10−4 P d, where g P (λk , μk , ρk ) denotes the projected gradient of the augmented Lagrangian L(λk , μk , ρk ) = θ(λk ) + (μk )T Gλk +
ρk Gλk 2 . 2
A Scalable TFETI Based Algorithm
97
20 18 16 14 12 10 8 6 4 2 0 0
Fig. 1. 2D Hertz problem
5
10
Fig. 2. Solution of 2D Hertz problem: total displacement
Table 1. Scalability of algorithm for 2D Hertz problem Primal Dual Subdomains Null space dimension dimensions 40000 640000 10240000
600 11200 198400
2 32 512
6 96 1536
Matrix–vector multiplications
Time (sec)
45 88 134
10 78 1300
The results of the computations are in Table 1. We can observe that the number of matrix–vector multiplications varies only mildly with the increasing dimension of the problem in agreement with our theory. We have also tested our algorithms on real-life problems, such as the analysis of the stress in the ball bearings of Fig. 3. The problem is difficult because the traction acting on the lower part of the inner ring is distributed throughout the nonlinear interface of the cage and balls to the outer ring. The solution of the problem discretized by 1688190/408196 primal/dual variables required 2364 matrix–vector multiplications. It took 5383 seconds to identify 20843 active constraints. Though this number is not small, we were not able to resolve the problem by a commercial software, including ANSYS. We believe that we shall get better results by enhancing the standard FETI preconditioners and some recently proposed improvements. We conclude that the results of numerical experiments indicate that the algorithm can be useful for an effective solution of real-life problems.
98
Z. Dost´ al et al.
Fig. 3. Ball bearings
5
Fig. 4. Von midplane
Mises
stress
in
the
Comments and Conclusions
The TFETI method turns out to be a powerful engine for the solution of contact problems of elasticity. Results of numerical experiments comply with our recent theoretical results and indicate high efficiency of the method reported here. Future research will include adaptation of the standard preconditioning strategies, problems with friction in 3D, and dynamic contact problems.
Acknowledgements This research has been supported by the grants GA CR No. 201/07/0294 and 103/09/H078 and ME CR No. MSM6198910027.
References 1. Avery, P., Rebel, G., Lesoinne, M., Farhat, C.: A numerically scalable dual–primal substructuring method for the solution of contact problems – part I: the frictionless case. Comput. Methods Appl. Mech. Eng. 193, 2403–2426 (2004) 2. Bouchala, J., Dost´ al, Z., Sadowsk´ a, M.: Theoretically Supported Scalable BETI Method for Variational Inequalities. Computing 82, 53–75 (2008) 3. Dobi´ aˇs, J., Pt´ ak, S., Dost´ al, Z., Vondr´ ak, V.: Total FETI based algorithm for contact problems with additional non-linearities. Advances in Engineering Software (in print, 2009), doi:10.1016/j.advengsoft.2008.12.006 4. Dost´ al, Z.: An optimal algorithm for bound and equality constrained quadratic programming problems with bounded spectrum. Computing 78, 311–328 (2006) 5. Dost´ al, Z.: Inexact semi-monotonic augmented Lagrangians with optimal feasibility convergence for quadratic programming with simple bounds and equality constraints. SIAM J. Numer. Anal. 43(1), 96–115 (2005) 6. Dost´ al, Z.: Optimal Quadratic Programming Algorithms, with Applications to Variational Inequalities, 1st edn. Springer, US (2009)
A Scalable TFETI Based Algorithm
99
7. Dost´ al, Z., Gomes, F.A.M., Santos, S.A.: Solution of contact problems by FETI domain decomposition with natural coarse space projection. Comput. Methods Appl. Mech. Eng. 190(13-14), 1611–1627 (2000) 8. Dost´ al, Z., Hor´ ak, D.: Theoretically supported scalable FETI for numerical solution of variational inequalities. SIAM J. Numer. Anal. 45, 500–513 (2007) 9. Dost´ al, Z., Hor´ ak, D., Kuˇcera, R.: Total FETI - an easier implementable variant of the FETI method for numerical solution of elliptic PDE. Commun. Numer. Methods Eng. 22, 1155–1162 (2006) 10. Dost´ al, Z., Kozubek, T., Markopoulos, A., Menˇs´ık, M.: Combining Cholesky decomposition with SVD to stable evaluation of a generalized inverse of the stiffness matrix of a floating structure (submitted, 2009) 11. Dost´ al, Z., Sch¨ oberl, J.: Minimizing quadratic functions over non-negative cone with the rate of convergence and finite termination. Comput. Optim. Appl. 30(1), 23–44 (2005) 12. Dost´ al, Z., Vondr´ ak, V.: Duality Based Solution of Contact Problems with Coulomb Friction. Arch. Mech. 49(3), 453–460 (1997) 13. Dureisseix, D., Farhat, C.: A numerically scalable domain decomposition method for solution of frictionless contact problems. Int. J. Numer. Methods Eng. 50(12), 2643–2666 (2001) 14. Farhat, C., G´eradin, M.: On the general solution by a direct method of a large scale singular system of linear equations: application to the analysis of floating structures. Int. J. Numer. Methods Eng. 41, 675–696 (1998) 15. Farhat, C., Mandel, J., Roux, F.-X.: Optimal convergence properties of the FETI domain decomposition method. Comput. Methods Appl. Mech. Eng. 115, 365–385 (1994) 16. Farhat, C., Roux, F.-X.: A method of finite element tearing and interconnecting and its parallel solution algorithm. Int. J. Numer. Methods Eng. 32, 1205–1227 (1991) 17. Iontcheva, A.H., Vassilevski, P.S.: Monotone multigrid methods based on element agglomeration coarsening away from the contact boundary for the Signorini’s problem. Numer. Linear Algebra Appl. 11(2-3), 189–204 (2004) 18. Justino, M.R.J., Park, K.C., Felippa, C.A.: The construction of free–free flexibility matrices as generalized stiffness matrices. International Journal for Numerical Methods in Engineering 40, 2739–2758 (1997) 19. Kornhuber, R.: Adaptive monotone multigrid methods for nonlinear variational problems. Teubner–Verlag, Stuttgart (1997) 20. Kornhuber, R., Krause, R.: Adaptive multigrid methods for Signorini’s problem in linear elasticity. Comput. Vis. Sci. 4(1), 9–20 (2001) 21. Of, G.: BETI - Gebietszerlegungsmethoden mit schnellen Randelementverfahren und Anwendungen. Ph.D. Thesis, University of Stuttgart (2006) 22. Park, K.C., Felippa, C.A., Gumaste, U.A.: A localized version of the method of Lagrange multipliers. Computational Mechanics 24, 476–490 (2000) 23. Sch¨ oberl, J.: Solving the Signorini problem on the basis of domain decomposition techniques. Computing 60(4), 323–344 (1998) 24. Wohlmuth, B., Krause, R.: Monotone methods on nonmatching grids for nonlinear contact problems. SIAM J. Sci. Comput. 25, 324–347 (2003)
Multilevel Preconditioning of Crouzeix-Raviart 3D Pure Displacement Elasticity Problems Ivan Georgiev1 , Johannes Kraus1 , and Svetozar Margenov2 1
Johann Radon Institute for Computational and Applied Mathematics, Austrian Academy of Sciences, Altenbergerstraße 69, A-4040 Linz, Austria
[email protected],
[email protected] 2 Institute for Parallel Processing, Bulgarian Academy of Sciences, Acad. G. Bonchev, Bl. 25A, 1113 Sofia, Bulgaria
[email protected]
Abstract. In this study we demonstrate how some different techniques which were introduced and studied in previous works by the authors can be integrated and extended in the construction of efficient algebraic multilevel iteration methods for more complex problems. We devise an optimal order algorithm for solving linear systems obtained from locking-free discretization of 3D pure displacement elasticity problems. The presented numerical results illustrate the robustness of the method for nearly incompressible materials.
1
Introduction
In this paper we consider the pure displacement elasticity problem 3 ∂σ ij + fi = 0 ∂x j j=1 u=0
x∈Ω
i = 1, 2, 3
x ∈ ΓD = ∂Ω
where Ω is a polyhedral domain in R3 and ∂Ω is the boundary of Ω. The stresses σij and the strains εij are defined by the classical Hooke’s law, i.e. 3 1 ∂ui ∂uj σij (u) = λ εkk (u) δij + 2μεij (u), εij (u) = + . 2 ∂xj ∂xi k=1
The unknowns of the problem are the displacements uT = (u1 , u2 , u3 ). A generalization to nonhomogeneous boundary condition is straightforward. The Lam´e coefficients are given by λ=
νE , (1 + ν)(1 − 2ν)
μ=
E , 2(1 + ν)
where E stands for the elasticity modulus, and ν ∈ [0, 12 ) is the Poisson ratio. We use the notion nearly incompressible for the case ν = 12 − δ (δ > 0 is a I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 100–107, 2010. c Springer-Verlag Berlin Heidelberg 2010
Multilevel Preconditioning of 3D Pure Displacement Elasticity Problems
101
small parameter). Note that the boundary value problem becomes ill-posed when ν = 12 (the material is incompressible). The weak formulation of the 3D linear elasticity problem (homogeneous boundary conditions are assumed) reads as follows: For f = (f1 , f2 , f3 )T ∈ 1 (L2 (Ω))3 , find u ∈ (HD (Ω))3 = {v ∈ (H 1 (Ω))3 , v |ΓD = 0} such that 1 A(u, v) = f T vdx ∀v ∈ (HD (Ω))3 . (1) Ω
The bilinear form A(u, v) is of the form 3 A(u, v) = λdiv(u)div(v) + 2μ εij (u)εij (v)dx. Ω
We rewrite (2) as
(2)
i,j=1
A(u, v) =
Cd(u), d(v)dx Ω
and consider also the modified bilinear form s A (u, v) = C s d(u), d(v)dx Ω
where ⎡
⎤ ⎡ ⎤ λ+2μ 0 0 0 λ 0 0 0 λ λ+2μ 0 0 0 λ+μ 0 0 0 λ+μ ⎢ 0 μ 0μ 0 0 0 0 0 ⎥ ⎢ 0 μ 00 0 0 0 0 0 ⎥ ⎢ ⎥ ⎢ ⎥ ⎢ 0 0μ 0 0 0μ 0 0 ⎥ ⎢ 0 0μ 0 0 00 0 0 ⎥ ⎢ ⎥ ⎢ ⎥ ⎢ 0 μ 0μ 0 0 0 0 0 ⎥ ⎢ 0 0 0μ 0 0 0 0 0 ⎥ ⎢ ⎥ ⎢ ⎥ s ⎥ ⎢ ⎥ C =⎢ ⎢ λ 0 0 0λ+2μ 0 0 0 λ ⎥, C = ⎢ λ+μ 0 0 0λ+2μ 0 0 0 λ+μ ⎥, ⎢ 0 0 0 0 0 μ 0μ 0 ⎥ ⎢ 0 0 0 0 0 μ 00 0 ⎥ ⎢ ⎥ ⎢ ⎥ ⎢ 0 0μ 0 0 0μ 0 0 ⎥ ⎢ 0 00 0 0 0μ 0 0 ⎥ ⎢ ⎥ ⎢ ⎥ ⎣ 0 0 0 0 0 μ 0μ 0 ⎦ ⎣ 0 0 0 0 0 0 0μ 0 ⎦ λ 0 0 0 λ 0 0 0λ+2μ λ+μ 0 0 0 λ+μ 0 0 0λ+2μ and
d(u) =
∂u1 ∂u1 ∂u1 ∂u2 ∂u2 ∂u2 ∂u3 ∂u3 ∂u3 , , , , , , , , ∂x1 ∂x2 ∂x3 ∂x1 ∂x2 ∂x3 ∂x1 ∂x2 ∂x3
T .
1 Then due to the pure displacement boundary conditions, as for u, v ∈ (HD (Ω))3 we have ∂ui ∂vj ∂ui ∂vj dx = dx, ∂x ∂x j i Ω Ω ∂xi ∂xj 1 one finds that A(u, v) = As (u, v) for all u, v ∈ (HD (Ω))3 . s The matrix C is positive definite. It is also important, that the rotations are excluded from the kernel of the related (stabilized) Neumann boundary conditions operator. As a direct result, the nonconforming Crouzeix-Raviart FEs are straightforwardly applicable to the variational problem (1)–(2). Let us remind that locking-free error estimates for the 2D pure displacement problem discretized by Crouzeix-Raviart FEs are presented, e.g., in [5,6,7]. The same scheme of analysis is applicable to the 3D case as well.
102
I. Georgiev, J. Kraus, and S. Margenov
Fig. 1. Reference macroelement of 6 Crouzeix-Raviart elements
2
Composite FR Algorithm
Let us assume that the domain Ω is covered by a tetrahedral mesh Th based on cubic (macro)elements, each of which is split into 6 tetrahedra. We also suppose that the edges of the cubes are parallel to the coordinate axes. The sequence of nested cubic meshes T1 ⊂ T2 ⊂ · · · ⊂ T is obtained by uniform refinement of the coarser macroelements into 8 finer cubes. The construction of the robust algebraic multilevel (AMLI) preconditioner is based on a hierarchical two- and multilevel splitting of the finite element space, see [1,2]. One such algorithm is the so-called First Reduce (FR) splitting, see [3,4,11]. Here we present a composite FR algorithm for AMLI preconditioning of the stiffness matrix Ah , which corresponds to the FEM discretization of the modified elasticity problem (1)–(2), using Crouzeix-Raviart elements defined on the finest mesh Th . The algorithm is described on a macroelement level. The global stiffness matrix Ash is written in the form Ash =
AsE ,
E∈T
where E ∈ T are the cubic macroelements. In what follows we use the numbering of nodes from Fig. 1 where the interior nodes 1 to 6 have the coordinates {(xi , yi , zi )}6i=1 = {( 13 , 13 , 13 ), ( 23 , 13 , 13 ), ( 13 , 23 , 13 ), ( 23 , 13 , 23 ), ( 13 , 23 , 23 ), ( 23 , 23 , 23 )}. Let φ1 , . . . , φ18 be the standard (scalar) nonconforming linear finite element nodal basis functions on the macroelement E. Then for the 3D elasticity problem (1) (2) (3) we use the basis functions φi = (φi , 0, 0)T , φi = (0, φi , 0)T , and φi = T (0, 0, φi ) , i = 1, . . . , 18. The vector of the macroelement basis functions T (1) (2) (3) (1) (2) (3) (1) (2) (3) ΦE = φ1 , φ1 , φ1 , φ2 , φ2 , φ2 , . . . , φ18 , φ18 , φ18
Multilevel Preconditioning of 3D Pure Displacement Elasticity Problems
103
E = JE ΦE . Folis transformed into a vector of new hierarchical basis functions Φ lowing the FR procedure, we consider a transformation matrix JE corresponding (E) = V 0 (E) ⊕ V 1 (E) ⊕ V 2 (E), to the splitting V 0 (E) = span {φ(k) , φ(k) , φ(k) , φ(k) , φ(k) , φ(k) }3 , V 1 2 3 4 5 6 k=1 1 (E) = V (k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
(k)
span {φ8 −φ7 , φ10 −φ9 , φ12 −φ11 , φ14 −φ13 , φ16 −φ15 , φ18 −φ17 }3k=1 , 2 (E) = V (k)
span {φ8 +φ7 , φ10 +φ9 , φ12 +φ11 , φ14 +φ13 , φ16 +φ15 , φ18 +φ17 }3k=1 . Accordingly, JE transforms the macroelement stiffness matrix AsE into the s = JE As J T , hierarchical form A E E E ⎤ ⎡ 0 (E) s s s V A E:00 AE:01 AE:02 ⎥ s s s s = ⎢ A ⎣ AE:10 AE:11 AE:12 ⎦ V1 (E) . E 2 (E) s s s V A A A E:20
E:21
E:22
The corresponding global hierarchical stiffness matrix s = s A A h
E
E∈Th
has a 3 × 3 block structure
⎡
s A h:00 ⎢ s h = ⎣ A s A h:10 s A h:20
s A h:01 s A h:11 s A h:21
⎤ s A h:02 s ⎥ A h:12 ⎦ , As
(3)
h:22
s which is induced by the partitioning on macroelement level. The block A h:00 corresponds to the interior nodal unknowns with respect to the macro-elements s s E ∈ T . The matrices A h:11 and Ah:22 correspond to certain differences and aggregates of nodal unknowns (basis functions) related to the faces of E. The first step of the FR algorithm is to eliminate locally (static condensation) the h , and get the Schur first block of the unknowns [4]. For this purpose we factor A s complement Bh in the form ⎡ ⎤ s B s B s = ⎣ h:11 h:12 ⎦ , B (4) h s B s B h:21 h:22 where its first block corresponds to the differences of the (two) basis functions s corresponds to the sum of related to each macroelement face. The matrix B h:22 the (same) basis functions from each macroelement face, and thus is associated with the coarse grid. Here, “coarse grid” means the grid of cubic elements associated with Th . After applying a two-level method to (4) the problem is reduced
104
I. Georgiev, J. Kraus, and S. Margenov
0.50
Ν0.4999 Ν0.499 0.45
Ν0.49 Ν0.4
0.40
0
2
4
6
8
10
12
14
2 Fig. 2. Numerically computed γE as a function of the number of coarsening steps
s . This is the end of the first step to a system with the coarse level matrix B h:22 of our composite algorithm. s The next observation to note is that B h:22 has the same structure as the related Rannacher-Turek FE stiffness matrix. This allows us to apply the FR method from [9] as a second step of the algorithm. The convergence of the AMLI algorithm crucially depends on, and is controllable by, the related CBS constants. The next Fig. 2 shows the multilevel behavior of the local CBS constant γE varying the Poisson ratio [10]. The first important observation is that γE is uniformly bounded when the Poisson ratio tends to the incompressibility limit of ν = 1/2. We see also that in the multilevel setting, the local CBS constant at the first step of the composite FR algorithm is considerably smaller. Then there is a steep 2 (but bounded by 0.52) increase of γE , corresponding to the first aggregation step of the second stage (where the Rannacher-Turek FR construction is applied) of the composite algorithm. This peak is followed by a monotonically decreasing 2 (k), which tends to a certain fixed value (close to 0.4), uniformly sequence of γE with respect to the Poisson ratio.
3
Numerical Tests
We consider the pure displacement linear elasticity problem in the unit cube Ω = (0, 1)3 . The robustness of the composite AMLI algorithm is studied varying the mesh size h and for a Poisson ratio ν approaching 1/2. The presented results are for the second step of the composite algorithm, i.e., s in (4) for the solution of the linear systems with the coarse level matrix B h:22 which we use different variants of the AMLI algorithm. Note that the first step of the solution procedure results in an optimal order process if the two-level preconditioner for (4) has a uniformly bounded condition number. We will comment on this issue at the end of this section. If we implement Dirichlet boundary conditions without elimination of variables, e.g., by equating the corresponding
Multilevel Preconditioning of 3D Pure Displacement Elasticity Problems
105
Table 1. Convergence results for AMLI V-cycle # voxels 163 323 643
ν = 0.49 ν = 0.4999 ν = 0.499999 12 [18] 10 [14] 7 [11] 15 [24] 12 [19] 8 [14] 19 [31] 15 [25] 9 [17]
Table 2. Convergence results for linear AMLI W-cycle # voxels 163 323 643
ν = 0.49 ν = 0.4999 ν = 0.499999 9 [14] 7 [11] 6 [9] 10 [15] 8 [13] 6 [10] 10 [16] 8 [14] 6 [10]
rows and columns with the identity matrix, the dimension of the full system to be solved is NT × NT where NT = 18n2 (1 + 2n) and n = 1/h. The number of interior DOF on voxel level is NI = 18n3 . Thus the size of the condensed matrix (4) is NC × NC , NC = 18n2 (1 + n), and, finally, its lower right block, to which we apply the recursive multilevel method, yields a linear system of dimension N = 9n2 (1 + n). In the tables we list the number of outer iterations that are required to reduce the A-norm of the initial error by a factor 106 . The right-hand side vector is the vector of all zeros and the iteration is initialized with a random initial guess. The approximate inverses of the pivot blocks we realize in all cases by static condensation of the interior DOF on macro element level followed by an incomplete factorization without any additional fill-in, i.e., ILU(0) applied to the Schur complement. The iteration counts are for the multiplicative and in brackets for additive AMLI. In Table 3 we also report the solution time on a Fujitsu Siemens Primergy RX600 S3 workstation with 4 dual core Intel Xeon MP processors (3.4 GHz) performing the linear AMLI W-cycle method. We observe that our method is completely robust with respect to the Poisson ratio ν; The convergence even becomes faster when ν approaches 1/2. In case of the V-cycle method the number of PCG iterations increases moderately according to the increase of the condition number of the preconditioner when adding levels of approximate factorization. The W-cycle method in both cases, linear and nonlinear AMLI, stabilizes the number of outer iterations and thus yields an optimal order solution process. In case of linear AMLI this is achieved by a proper second-degree stabilization polynomial, in case of nonlinear AMLI by two inner (generalized) conjugate gradient iterations at each intermediate level. Table 3. Solution time for linear AMLI W-cycle ν = 0.4999 # voxels ν = 0.49 163 0.48 [0.90] 0.39 [0.71] 323 4.97 [8.92] 3.96 [7.67] 43.11 [86.97] 34.59 [71.56] 643
ν = 0.499999 0.33 [0.58] 3.00 [5.34] 26.25 [51.41]
106
I. Georgiev, J. Kraus, and S. Margenov Table 4. Convergence results for nonlinear AMLI W-cycle # voxels 163 323 643
ν = 0.49 ν = 0.4999 ν = 0.499999 8 [13] 7 [11] 5 [9] 8 [14] 7 [12] 5 [9] 8 [15] 7 [12] 5 [9]
14 Ν0.4999
12 Ν0.499
10 Ν0.49
8 Ν0.4
6 4
0
2
4
6
8
10
12
14
s h:11 Fig. 3. Numerically computed condition number of the first block B of the Schur complement (4) as a function of the number of coarsening steps
s of the Schur complement (4) as well as the pivot Remark 1. The first block B h:11 blocks of the recursively computed coarse-level matrices after static condensation of the interior unknowns are well-conditioned with a condition number bound that is uniform with respect to the Poisson ratio ν ∈ (0, 1/2). This can be shown by a local analysis on (macro) element level, see Fig. 3.
4
Concluding Remarks
The starting point of this paper is a generalization of locking-free discretizations of 2D pure displacement elasticity problems by nonconforming Crouzeix-Raviart finite elements to 3D problems. Then we propose a robust optimal order composite multilevel algorithm for the iterative solution of the related systems of linear algebraic equations. The presented method can also be used for solving efficiently certain problems resulting from mixed finite element discretization, e.g. the Stokes problem. By applying the augmented Lagrangian method one can reduce the indefinite problem to a nearly singular system. In case of Stokes flow the nearly singular system corresponds to a linear elasticity problem for a nearly incompressible material. For details see [8,12]. Note that the demonstrated robustness of the presented preconditioners (with respect to the Poisson ratio) results in an optimal order solution algorithm for this kind of nearly singular systems even without using any regularization techniques.
Multilevel Preconditioning of 3D Pure Displacement Elasticity Problems
107
Acknowledgments The authors gratefully acknowledge the support by the Austrian Academy of Sciences. This work has also been partially supported by the Austrian Science Foundation (FWF Project P19170-N18) and the Bulgarian Science Foundation (grants DO02–147 and DO02–338).
References 1. Axelsson, O., Vassilevski, P.S.: Algebraic Multilevel Preconditioning Methods I. Numer. Math. 56, 157–177 (1989) 2. Axelsson, O., Vassilevski, P.S.: Algebraic Multilevel Preconditioning Methods II. SIAM J. Numer. Anal. 27, 1569–1590 (1990) 3. Blaheta, R., Margenov, S., Neytcheva, M.: Aggregation-based multilevel preconditioning of non-conforming FEM elasticity problems. In: Dongarra, J., Madsen, K., Wa´sniewski, J. (eds.) PARA 2004. LNCS, vol. 3732, pp. 847–856. Springer, Heidelberg (2004) 4. Blaheta, R., Margenov, S., Neytcheva, M.: Uniform estimate of the constant in the strengthened CBS inequality for anisotropic non-conforming FEM systems. Numerical Linear Algebra with Applications 11(4), 309–326 (2004) 5. Brenner, S., Scott, L.: The mathematical theory of finite element methods. Texts in applied mathematics, vol. 15. Springer, Heidelberg (1994) 6. Brenner, S., Sung, L.: Linear finite element methods for planar linear elasticity. Math. Comp. 59, 321–338 (1992) 7. Falk, R.S.: Nonconforming finite element methods for the equations of linear elasticity. Math. Comp. 57, 529–550 (1991) 8. Fortin, M., Glowinski, R.: Augmented Lagrangian methods: Applications to the numerical solution of boundary value problems, vol. 15. North-Holland Publishing Co., Amsterdam (1983) 9. Georgiev, I., Kraus, J., Margenov, S.: Multilevel algorithms for Rannacher-Turek finite element approximation of 3D elliptic problems. Computing 82, 217–239 (2008) 10. Kraus, J., Margenov, S.: Robust Algebraic Multilevel Methods and Algorithms. Radon Series Comp. Appl. Math. 5 (2009) 11. Kraus, J., Margenov, S., Synka, J.: On the multilevel preconditioning of CrouzeixRaviart elliptic problems. Numer. Lin. Alg. Appl. 15, 395–416 (2008) 12. Lee, Y., Wu, J., Xu, J., Zikatanov, L.: Robust subspace correction methods for nearly singular systems. Mathematical Models and Methods in Applied Sciences 17(11), 1937–1963 (2007)
Element-by-Element Schur Complement Approximations for General Nonsymmetric Matrices of Two-by-Two Block Form Maya Neytcheva1 , Minh Do-Quang2 , and He Xin1 1
2
Department of Information Technology, Uppsala University, Sweden,
[email protected],
[email protected] Department of Mechanics, Royal University of Technology, Stockholm, Sweden
[email protected]
Abstract. We consider element-by-element Schur complement approximations for indefinite and general nonsymmetric matrices of two-by-two block form, as arising in finite element discretized systems of PDEs. The paper provides some analysis of the so-obtained approximation and attempts to quantify the quality of the underlying two-by-two matrix splitting in a way similar to that used for symmetric positive definite matrices. The quality of the approximation is illustrated numerically.
1
Introduction
This paper targets the efficient solution of systems of equations as arising in multiphase flow models, where various patterns and interface motion have to be resolved. Multiphase processes advance through free contact surfaces (sharp fronts) to be accurately tracked by the numerical methods. Some existing models however fail to provide an efficient treatment of important problem characteristics such as the so-called wetting contact angle at the walls of a solid in contact with a liquid. A good alternative turns out to be the Phase-Field model (PFM), which allows to easily implement the combination of large scale simulation of the hydrodynamic interaction with the constrain of micro scale properties (such as wetting). There, the mathematical tool used is the Cahn-Hilliard equation. We assume that the discretization is done using the conforming Finite Element method (FEM) on standard or adaptive meshes. For the solution of the so-arising linear systems of equations we consider iterative methods and describe a preconditioner, based on the classical two-bytwo block factorization of the matrix of the underlying algebraic system. The novel moment is the analysis of the behaviour of the preconditioner for general nonsymmetric matrices. We briefly describe the physical problem and the mathematical model in Section 2. Section 3 describes the suggested preconditioner and some of its properties for the general nonsymmetric case. Section 4 presents some numerical illustrations. Finally, in Section 5, we state some conclusions and directions for future research. I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 108–115, 2010. c Springer-Verlag Berlin Heidelberg 2010
Element-by-Element Schur Complement Approximations
2
109
The Physical Problem and Mathematical Models Used
To illustrate some of the effects of multiphase flow we choose to consider a ball falling into water. To understand the effect of throwing a solid object (say, a sphere) into a liquid and in particular, the splashing phenomena observed when the object penetrates the surface of the liquid, has been a challenge the researchers since more than 100 years. When we drop a ball in water, numerous physical phenomena take place, which we want to understand, measure or compute, and control: the force acting on the sphere, the shape of the produced air cavity, the sound effects during the impact between solid and liquid phase, pulsation of the cavity volume, existence of critical velocity, which was found to be a function of the so-called wetting contact angle. During the years, various mathematical models have been considered to describe the above mentioned phenomena. The model, which has shown to have advantages over others is the Phase-Field model (PF). Constituting the model goes back to 1893 (van der Waals). To shortly characterize the PF model, consider two different phases A and B. Let C be the corresponding relative concentration of each phase, thus C takes a distinct constant value in each of the bulk phases, say −1 and 1, and changes rapidly but in a continuous manner from one to the other in an interface strip of certain thickness. The profile of the interface, which is what we are mostly interested in, is obtained by minimizing the so-called free energy of the system F = f dV , where f is the free energy density, expressed in terms of the concentration, C, cf. for instance [10]. Originally, the FP model has been derived for stationary problems. It has been extended to model time-dependent problems (Cahn, 1961), and later on, further modified by Bates and Fife (1993) to model the creation, evolution and dissolution of phase-field interfaces as a result of diffusive processes. As a next step, to enable incorporating fluid motion, the velocity u has been added to the model. The latter version is referred to as the Cahn-Hilliard equation, which reads as follows: ∂C + (u · ∇)C = k∇2 (βΨ (C) − α∇2 C) in Ω. (1) ∂t Here k stands for the mobility of the fluid and Ψ is a a double-well function with two minima at ±1, corresponding to the two stable phases1 . In order to determine the velocity u, one has to solve the time-dependent Navier-Stokes (NS) equations, where the forcing term depends on C and on the so-called surface tension Φ. The function Φ is also referred to as the chemical potential of the phase mixture and is defined as the rate of change of the free energy F , namely, δF = β Ψ (C) − α∇2 C Φ= δC Here α and β are control parameters (cf, e.g., [10]). In this paper we do not describe the model any further. We state the resulting coupled system of equations in its finalized form, obtained after nondimensionalization. 1
Examples of such functions are Ψ (C) = (C + 1)2 (C − 1)2 and Ψ (C) = 1/4(1 − C 2 )2 .
110
M. Neytcheva, M. Do-Quang, and H. Xin
∂C 1 2 + (u · ∇)C = ∇ (Ψ (C) − Cn2 ∇2 C) ∂t P e ∂u 1 Re + (u · ∇)u = −∇p + ∇2 u − C∇Φ ∂t Ca Cn ∇·u= 0
(2) (3) (4)
As is seen, there are four problem parameters involved in the model: Re is the Reynolds number, P e is the Peclet number, Ca is the Capillary number (defined as the ratio between the viscous and surface tension forces) and the Cahn number Cn, which is a measure of the ratio between mean field thickness and characteristic length. We note that the convective Cahn-Hilliard equation (2) is mass-preserving. The system (2)-(4) is usually solved using operator splitting techniques, namely, one solves (2) first, then solves the NS problem (3)-(4) and repeats until a proper convergence criterium is met. In this work we address only the solution of equation (2). It is seen from the expression in (2), that the CH equation is a fourth order nonlinear parabolic PDE. A classical approach to avoid discretizations of the high derivatives is to decompose (2) into a coupled system of two second-order PDEs, which after some manipulations takes the form ψ = f (φ) + ∇2 φ,
∂φ + (b · ∇)φ = ∇ · (k∇Φ). ∂t
(5)
Here Φ is a continuous scalar variable that describes the diffusive interface profile. It has constant value in each phase, ±1. There are two straightforward approaches to solve (5). The first one is the operator splitting method when the two equations are solved autonomously in an alternating manner, which is simpler and many well-studied solution techniques are available. However, it is restricted to very small timesteps which entails very long simulation times. Alternatively, (5) can be solved as a system, discretized by mixed FEM which allows for larger timesteps (cf. [6] and the references therein). Either of the approaches requires the use of a suitable numerical solution method for the so arising linear or linearized large scale systems of algebraic equations, which most often governs the computational efficiency of the FEM models. In many cases those linear systems are solved by sparse direct solution methods. Despite their general applicability and robustness, direct solution methods become very costly for large problems and do remain the most time consuming part of the numerical simulations. For example, with one of the best direct solvers, UMFPACK [4], the numerical simulations (in 2D) in [6] required more than 60 hours of computation. Aiming at reducing the simulation time, we consider iterative methods. The major task in this paper is to test and analyse a preconditioning strategy for systems in a two-by-two block form, such as in (5), based on the structure of the matrix and utilizing the discretization, in this case, standard conforming FEM.
Element-by-Element Schur Complement Approximations
3
111
Preconditioning Strategy and Estimates
When discretizing a system of PDEs, the resulting linear (or linearized) system of algebraic equations admits itself a two-by-two block structure ((6), left), A11 A12 A11 0 I1 Z12 A= , B= . (6) A21 S 0 I2 A21 A22 To solve systems with A we consider preconditioned iterative methods, such as pcg, minres, gmres or gcg (cf. [1]). The suggested preconditioner utilizes the matrix block structure and is of block-multiplicative form ((6), right), where S is an approximation of the Schur complement SA of A, namely, SA = A22 − A21 A−1 11 A12 , Z12 is either exactly equal or approximates the matrix product A−1 A 11 12 and I1 , I2 are identity matrices of corresponding order. The issues on how to solve systems with the pivot block A11 and how to handle the block Z12 are out of scope of this paper. We discuss only the construction of an approximation of SA , for the case when the discretization of the PDE system is done using conforming FEM. We note that when we discretize systems of PDEs as in (5) with FEM, the local stiffness matrices, corresponding to one finite element, admit also a twoM T R(m) A(m) R(m) where by-two block form, namely, A = m=1
A
(m)
(m) (m) A11 A12 }n1 = (m) (m) A21 A22 }n2
(7)
Here M denotes the number of the finite elements in the discretization mesh. The matrices R(m) (n, N ) are the standard Boolean matrices which provide the local-to-global correspondence of the numbering of the degrees of freedom. Let S=
M
(m) T
R2
(m)
S (m) R2
(8)
m=1 (m)
(m)
(m) −1
(m)
(m)
where S (m) = A22 − A21 A11 A12 and R2 are the parts of R(m) corresponding to the degrees of freedom in A22 . In several works, e.g. [8,9,2], it has been shown that S is a high quality approximation of SA . However, so far this has been shown rigorously only in the case of symmetric positive matrices, see [8,2]. Here we suggest a framework to study the quality of the preconditioned system S −1 SA for general nonsymmetric matrices. Define the auxiliary matrix ⎡ ⎤ (1) (1) (1) A12 R2 A11 ⎢ ⎥ (2) (2) (2) A11 A12 R2 ⎢ ⎥ ⎢ ⎥ .. .. ⎢ ⎥ . =⎢ ⎥ . A (9) ⎢ ⎥ (M) (M) (M) ⎢ ⎥ A R A 11 12 2 ⎢ ⎥ M ⎣ (1) T (1) (2) T (2) ⎦ (M) T (M) (m) T (m) (m) R2 A21 R2 A21 · · · R2 A21 R2 A22 R2 m=1
112
M. Neytcheva, M. Do-Quang, and H. Xin
is constructed in such a way that its Schur complement is exactly S. Clearly, A have the same 22-block. Further, A and A Note that A and A can be related C 0 algebraically in the following way. We define a matrix C = T1 12 , where 012 I2 I2 is the identity matrix of order N2 , 012 is a rectangular zero matrix of order (M n1 , N2 ). The block C1 is of order (M n1 , N 1) and is constructed as follows T T T C1T = R1(1) R1(2) · · · R1(M) . A straightforward computation reveals that 11 C1 C T A C1T A T 1 12 A = C AC = . 21 C1 A A22 An additional observation is that C1T C1 = D, where D is a diagonal matrix of order N1 with entries equal to 1 or 2, corresponding to the fact that two local (m ) (m ) element matrix blocks A11 1 , A11 2 can share only one mesh point (in 2D). Applying Sherman-Morrison-Woodbury formula to S −1 and using the rela we obtain: tions between A and A −1 −1 A −1 A S −1 SA = I2 + A (I1 − W )A 22 12 (Γ − I1 ) 11 12 (= I2 + E),
−1 A −1 A 11 12 A22 A21
(10)
T C1 A−1 11 C1 A11 .
and W = Observe that W C1 = C1 , where Γ = thus W > 1. Further, simple computation reveals that W k = W for any k. Equality (10) shows that there holds S −1 SA ≤ 1 + δ, −1 −1 A −1 A where δ = A (W − I1 )A 22 12 (Γ − I1 ) 11 12 . Below, for spd matrices, we show a relation between (10) and the result from [2] which latter reads:
(1 − γ 2 )SA ≤ S ≤ SA
(11)
with γ of the corresponding hierarchical hasis function representation. Denoting 12 A−1 A 21 ), U = A 1/2 , −1 A −1/2 A12 A−1/2 and W =A 1/2 C1 A−1 C T A γ = ρ(A 11
22
11
22
and applying a spectral transformation on S −1 SA we obtain
11
11
−1/2 = I2 + U T (I1 − U U T )−1 (I1 − W 1/2 S −1 SA A )U. A 22 22
1
11
(12)
T
Taking into account that γ = ρ(U U ), we obtain 2
λmax (S −1 SA ) ≤ 1 +
γ 2 2 1 ≤ 1 + γ I − W = . 1 2 2 1−γ 1− γ 1−γ 2
(13)
= 0, the upper bound in (13) is attained and admits the same form For W measures the error in approximating A−1 by as in (11). However, since W 11 T −1 C1 A11 C1 , which is never zero, we see that the actual upper bound for λmax (S −1 SA ) is smaller than 1/(1 − γ 2 ) and does not deteriorate with γ → 1, confirmed also by the numerical results. We also see from (10) and (12) that in the general case, γ itself is not sufficient to characterize the quality of the two-by-two splitting and the approximation S, considered here.
Element-by-Element Schur Complement Approximations
4
113
Numerical Illustrations
We consider two test problems — one with symmetric and indefinite system matrix as arising from the stationary Stokes problem and the second with a general nonsymmetric two-by-two block matrix as arising from the discretized system (5). All numerical experiments are done in Matlab. Problem 1. We consider a model of Oseen’s problem in 2D for finding velocity u and pressure p satisfying −νΔu + (w · ∇)u + ∇p = f (x, y) ∇·u= 0 on the unit square Ω = [0, 1]2 . Here ν is the viscosity and w is the wind. We consider two cases. Case 1: ν = 1, w = 0 (the Stokes problem). For testing purposes, in this case we choose a solution in advance and compute f and the boundary conditions for the velocity correspondingly. Case 2: (Oseen’s driven lid problem, see [5] and the references there in) ν = 1/20, ν = 1/80 and ν = 1/320, and w = 2(2y − 1) ∗ (1 − (2x − 1)2 ) . The boundary conditions are u1 = u2 = 0 for −2(2x − 1) ∗ (1 − (2y − 1)2 ) x = 0, x = 1 and y = 0, u1 = 1, u2 = 0 for y = 1. The problem is discretized using a square mesh and the so-called modified Taylor-Hood (MTH) elements, i.e., bilinear basis functions for the velocity on a mesh with a meshstep h and bilinear basis functions for the pressure on a mesh with a meshstep 2h. It is known (cf. e.g. [3]) that MTH provides a stable discretization for which the LBB condition holds. Problem 2. Simulation of a moving interface with a constant speed by using the Cahn-Hilliard equation as in (5). Stokes problem is solved by the minres method, preconditioned by the preconditioner (6), right. Oseen’s problem is solved by the gcg-mr method. We compare the quality of the element-by-element Schur complement approximation (S) with the pressure mass matrix (M ) which is known to be a good approximation of the negative Schur complement, in this case equal to A21 A−1 11 A12 . Table 1 shows a comparison of the two preconditioning strategies. For the Stokes problem we present the spectral bounds of M −1 SA and S −1 SA , respectively. For these experiments, A−1 11 A12 is used as Z12 and the blocks A11 and S, respectively M , are solved directly in order to see only the influence of the approximation of SA . In Table 2, W and E are the same as in (10). The results for Stokes problem, Table 1 (left), illustrate that S is at least as good approximation of SA , as the mass matrix M . For Oseen’s problem, Table 1 (right), when the matrices become nonsymmetric, the advantages of the proposed approach are better seen. The obtained iteration counts with S as a preconditioner are slightly better than those, reported in [5].
114
M. Neytcheva, M. Do-Quang, and H. Xin Table 1. Problem 1 Stokes
Size of S 81 289 1089 4225 16641
M eig(SA , M ) min-max 0.166-0.986 0.164-0.996 0.164-0.999 -
Iter A, B 8 7 6 4 3
S eig(SA , S) min-max 0.101-0.634 0.098-0.635 0.096-0.636 -
ν = 1/20 Iter A, B 8 6 5 5 4
M 28 30 29 27 24
S 22 24 24 22 19
Oseen ν = 1/80 Iter M S 66 37 106 65 117 88 116 89 110 89
ν = 1/320 M 81 219 427 522 514
S 45 96 210 330 398
Table 2. Problem 2 Size of S 153 561 2145 8385 33153
−1 eig(SA S) min(Re)max max(Re) abs(Im)
0.98-1 0.68-1 0.26-1 0.07-1 -
0.0001 0.0002 7.2e-6 0 -
W
E/γ
1.25 1.25 -
0.04/0.38 0.35/1.49 -
Iter SA , S A, B (gmres) (gcg) 3 6 11 22 -
4 7/4 13/7 25/12 47/24
Table 2 illustrates the quality of S constructed for Problem 2. Column 6 in Table 2 shows the iteration counts to solve a system with SA , preconditioned by S (using the Matlab-gmres method with a relative stopping criterion 10−6 ). Column 7 in Table 2 shows the iterations required to solve a system with A, preconditioned by B as in (6), using the gcg - minimal residual method with a relative stopping criterion 10−6 and 10−3 , respectively. Again, the block Z12 is taken to be equal to A−1 11 A12 . Systems with S and A11 are solved directly using the Matlab \ operator. In this way we monitor only the quality of the approximation S. The experiments in Table 2 indicate that (i) cond(S −1 SA ) for the two-level method depends on the discretization parameter h and, thus, is not of optimal order as for spd problems; (ii) γ can take values larger than 1 and seems not to be the proper quality indicator for nonsymmetric matrices; (iii) the norm of W seems to have an upper bound independent of h; (iv) the error matrix E remains relatively small for all the numerical tests, which makes S still an attractive preconditioner to SA , obtained and applied as a sparse matrix on a low computational cost.
5
Conclusions and Some Open Problems
The numerical experiments complement those from earlier works and show that the element-by-element Schur approximation S is a good quality approximation
Element-by-Element Schur Complement Approximations
115
of the exact Schur complement SA for a broad spectrum of problems. The results in Section 3 indicate a way to theoretically quantify the relation between S and −1 SA using the spectral radius of the same matrix product A−1 11 A12 A22 A21 as in the symmetric positive definite case. The study for the general nonsymmetric case needs more systematic analysis to get a better understanding and insight both in the target application problem and in the resulting approximation of the Schur complement matrix. Another important question to answer is how to efficiently solve the 11-block, which will then make the preconditioner not only numerically but also computationally efficient.
Acknowledgements This research is partly supported by a grant from the Swedish Research Council (VR), Finite element preconditioners for algebraic problems as arising in modelling of multiphase microstructures, 2009–2011.
References 1. Axelsson, O.: Iterative Solution Methods. Oxford University Press, Oxford (1994) 2. Axelsson, O., Blaheta, R., Neytcheva, M.: Preconditioning for boundary value problems using elementwise Schur complements. SIMAX 31, 767–789 (2009) 3. Braess, D.: Finite Elements, 2nd edn. Cambridge University Press, Cambridge (2001) 4. Davis, T.A.: A column pre-ordering strategy for the unsymmetric-pattern multifrontal method. ACM Transactions on Math. Software 30, 165–195 (2004) 5. de Niet, A.C., Wubs, F.W.: Two preconditioners for saddle point problems in fluid flows. Int. J. Numer. Meth. Fluids 54, 355–377 (2007) 6. Do-Quang, M., Amberg, G.: The splash of a ball hitting a liquid surface: Numerical simulation of the influence of wetting. Journal Physic of Fluid (2008) (accepted) 7. femLego (Numerical simulation by symbolic computation), http://www.mech.kth.se/~ gustava/femLego 8. Kraus, J.: Algebraic multilevel preconditioning of finite element matrices using local Schur complements. Num. Lin. Alg. Appl. 13, 49–70 (2006) 9. Neytcheva, M., B¨ angtsson, E.: Preconditioning of nonsymmetric saddle point systems as arising in modelling of visco-elastic problems. ETNA 29, 193–211 (2008) 10. Villanueva, W., Amberg, G.: Some generic Capillary-driven flows. Int. J. of Multiphase Flow 32, 1072–1086 (2006)
Numerical Simulation of Fluid-Structure Interaction Problems on Hybrid Meshes with Algebraic Multigrid Methods Huidong Yang1 and Walter Zulehner2 1 Johannes Kepler University Linz, Altenberger Strasse 69, 4040 Linz, Austria/Europe
[email protected] http://www.numa.uni-linz.ac.at/~huidong 2 Johannes Kepler University Linz, Altenberger Strasse 69, 4040 Linz, Austria/Europe
[email protected] http://www.numa.uni-linz.ac.at/~zulehner
Abstract. Fluid-structure interaction problems arise in many application fields such as flows around elastic structures or blood flow problems in arteries. The method presented in this paper for solving such a problem is based on a reduction to an equation at the interface, involving the so-called Steklov-Poincar´e operators. This interface equation is solved by a Newton-like iteration. One step of the Newton-like iteration requires the solution of several decoupled linear subproblems in the structural and the fluid domains. These subproblems are spatially discretized by a finite element method on hybrid meshes. For the time discretization implicit first-order methods are used for both subproblems. The discretized equations are solved by algebraic multigrid methods.
1 1.1
Problem Setting of the Fluid-Structure Interaction Geometrical Description
Let Ω0 denote the initial domain at a time t = 0 consisting of the structural and fluid sub-domains Ω0s and Ω0f , respectively. The domain Ω(t) at a time t is composed of the deformable structural sub-domain Ω s (t) and the fluid sub-domain Ω f (t). The corresponding interface Γ (t) is evolving from the initial interface Γ0 . The evolution of Ω(t) is obtained by an injective mapping, the so-called the arbitrary Lagrangian Eulerian (ALE) mapping (Fig. 1.): x : Ω0 × R+ → R3 .
(1)
The position of a point x0 ∈ Ω0s at a time t is given by the mapping for the structure domain xst : Ω0s → Ω s (t), (2) given by xst (x0 ) ≡ xs (x0 , t) = x(x0 , t) = x0 +ds (x0 , t) for x0 ∈ Ω0s , where ds (x0 , t) denotes the displacement ds (x0 , t) of the structural domain at a time t. I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 116–123, 2010. c Springer-Verlag Berlin Heidelberg 2010
Numerical Simulation of Fluid-Structure Interaction Problems
Γ0d
Γ0n
Ω0s
Γ0
Γin
Ω s (t)
xs (·, t)
Γout
Γ (t)
Γin (t)
Ω0f
117
Γout (t)
Ω (t) f
x (·, t) f
Ω0
Ω(t)
Fig. 1. ALE mapping
Correspondingly, the position of any point x0 ∈ Ω0f at a time t is given by the mapping for the fluid domain xft : Ω0f → Ω f (t)
(3)
given by xft (x0 ) ≡ xf (x0 , t) = x(x0 , t) = x0 + df (x0 , t) for x0 ∈ Ω0f , where df (x0 , t) denotes the displacement in fluid domain. It is defined as an extension of the structural displacement ds at the interface Γ0 : df = Ext(ds|Γ ),
(4)
0
e.g. the harmonic extension, given by: ⎧ f ⎪ ⎨ −Δd = 0 df = ds ⎪ ⎩ df = 0
in Ω0f , on Γ0 ,
(5)
on Γin (t) ∪ Γout (t).
Furthermore, we introduce the domain velocities by ws (x0 , t) :=
∂ds ∂xs (x0 , t) = (x0 , t) ∂t ∂t
wf (x0 , t) :=
∂df ∂xf (x0 , t) = (x0 , t) ∂t ∂t
and
for the structural and the fluid domain respectively. 1.2
The Physical Model in Strong Form
We describe the interface conditions which have to be satisfied for this coupled problem.
118
H. Yang and W. Zulehner
Interface Conditions. At the interface between the structure and the fluid domain we assume no-slip conditions: df Γ = ds |Γ0 (6) 0
and the equilibrium of normal stresses: (σf nf ) ◦ xft + σs ns = 0,
(7)
where σs is the first Piola-Kirchoff stress tensor, σf is the Cauchy stress tensor, nf and ns are the outward normals of Ω f (t) and Ω0s , respectively. Structure and Fluid Sub-problems. With prescribed Dirichlet data λ for the displacement at the interface Γ0 , we compute the Neumann data σs ns at the interface Γ0 by solving the structure problem ⎧ ∂ 2 ds ⎪ ⎪ ρ − div(σs (ds )) = 0 in Ω0s , s ⎪ 2 ⎪ ⎪ ⎨ ∂t σs (ds )ns = 0 on Γ0n , (8) ⎪ s d ⎪ ⎪ d =0 on Γ0 , ⎪ ⎪ ⎩ ds = λ on Γ0 , where ρs is the density. We will concentrate on a linear Saint-Venant Kirchoff s s T ) elastic model, i.e. σs (ds ) = 2μl ε(ds ) + λl div(ds )I with ε(ds ) = ∇d +(∇d and 2 l l the Lam´e constants λ , μ . We introduce Ss as the Dirichlet-to-Neumann mapping Ss : H 1/2 (Γ0 ) → H −1/2 (Γ0 ),
(9)
given by Ss (λ) = σs (d )ns , with an appropriate function space H (Γ0 ) and its dual space H −1/2 (Γ0 ). Let u(x, t) denote the Eulerian velocity of the fluid. The ALE time derivative of u(x, t) is introduced in order to overcome the difficulty for evaluating the time derivative of velocity u(x, t) under the Eulerian framework in a moving domain. −1 Let x ∈ Ω f (t) with x0 = xft (x), then the ALE time derivative is given by ∂u d (x, t) = u(xft (x0 ), t) . (10) ∂t x0 dt s
1/2
Analogously, with prescribed Dirichlet data ∂λ ∂t for the velocity at the interface Γ0 , we compute the Neumann data (σf nf ) ◦ xft at the interface Γ0 by solving the fluid problem ⎧ ∂u ⎪ f ⎪ in Ω f (t), ⎪ ⎪ ρf ∂t + ρf (u − w ) · ∇ u − 2μdivε(u) + ∇p = 0 ⎪ ⎪ x 0 ⎪ ⎪ ⎪ ⎪ divu = 0 in Ω f (t), ⎨ σf (u, p)nf = gin on Γin (t), (11) ⎪ ⎪ ⎪ ⎪ on Γout (t), σf (u, p)nf = 0 ⎪ ⎪ ⎪ ⎪ ⎪ ∂λ ⎪ ⎩ on Γ0 , u ◦ xft = ∂t
Numerical Simulation of Fluid-Structure Interaction Problems
119
where ρf is the density of the fluid, μ its dynamic viscosity, the stress tensor T σf (u, p) = −pI + 2με(u), the pressure p, the strain tensor ε(u) = ∇u+(∇u) . 2 In a similar way as before, we introduce the Dirichlet to Neumann mapping Sf : H 1/2 (Γ0 ) → H −1/2 (Γ0 ),
(12)
given by Sf (λ) = (σf (u, p)nf ) ◦ xft . With these notations the equilibrium condition (7) can be written as S(λ) := Sf (λ) + Ss (λ) = 0,
(13)
which is the so-called Steklov-Poincar´e equation and will be solved by iterative method in Section 2. 1.3
Weak Formulations
For the weak formulation spaces V s = [H 1 (Ω0s )]3 , V0s = we need thes function s s s d s {v ∈ V |v = 0 on Γ0 Γ0 }, and Vg = {v ∈ V s |v s = λ(t) on Γ0 } for the structure. For the fluid, we define Df = [H 1 (Ω0f )]3 , D0f = {d ∈ Df |d = 0 on Γ0 }, Dgf (t) = {d ∈ Df |d = λ(t) on Γ0 }, V f (t) = {v f |v f ◦ xft ∈ H 1 (Ω0f )3 }, V0f (t) = {v f ∈ V f (t)|v f ◦ xft = 0 on Γ0 }, Vgf (t) = {v f ∈ V0f |v f ◦ xft = wf ◦ xft on Γ0 }, and Qf (t) = {q f |q f ◦ xft ∈ L2 (Ω0f )}, where H 1 (Ω0s ) and H 1 (Ω0f ) denote the standard Sobolev spaces. Then we obtain: The Weak Form of the Structure Problem. Find ds ∈ Vgs such that for all v s ∈ V0s , a(ds , v s ) = 0 (14) with
∂ 2 ds a(d , v ) = ρs 2 · v s dx0 + ∂t Ω0s s
s
Ω0s
[λl divds divv s + 2μl ε(ds ) : ε(v s )]dx0 . (15)
The Weak Form for the Harmonic Extension. Find df ∈ Dgf (t) such that for all φ ∈ D0f , (16) a(df , φ) = 0
with a(df , φ) =
Ω0f
∇df : ∇φdx0 .
(17)
The computational fluid domain Ω f (t) is then given by Ω f (t) = Ω0f + df .
(18)
120
H. Yang and W. Zulehner
The Weak Form of the Fluid Problem. Find (u, p) ∈ Vgf (t) × Qf (t) such that for all (v f , q f ) ∈ V0f (t) × Qf (t), a(u, v f ) + b1 (v f , p) = F f , v f , (19) b2 (u, q f ) − c(p, q f ) = Gf , q f , where
⎧ d ⎪ ⎪ a(u, v f ) = ρf u · v f dx − divwf ρf u · v f dx ⎪ ⎪ dt f (t) f (t) ⎪ Ω Ω ⎪ ⎪
⎪ ⎪ ⎪ f f ⎪ + ρ (u − w ) · ∇ u · v dx + 2μ ε(u) : ε(v f )dx, ⎪ f ⎨ Ω f (t) Ω f (t)
⎪ f f ⎪ ⎪ b (v , p) =b (v , p) = − p divv f dx, 1 2 ⎪ ⎪ f (t) ⎪ Ω ⎪ ⎪
⎪ ⎪ ⎪ f f f ⎪ gin · v f ds, Gf , q f = 0. ⎩ c(p, q ) =0, F , v =
(20)
Γin (t)
The Weak Form of the Equilibrium Condition. Find λ(t) ∈ H 1/2 (Γ0 ) such that, for all v f × v s ∈ V f (t) × V s such that Sf (λ), v f Γ (t) + Ss (λ), v s Γ0 = 0.
(21)
where v f ◦ xft = v s on Γ0 and ., . denotes the corresponding dual product. 1.4
Discretization
The spatial discretization was done by a finite element method on a hybrid mesh consisting of tetrahedra, hexahedra, pyramids and prisms. These elements, seethe first line of Fig. 2, are splitted into pure tetrahedral elements by introducing artificial points at the volume and face centers, see the second line of Fig. 2. We then construct the finite elements based on pure tetrahedral meshes. The introduced additional degrees of freedom are locally eliminated by the mean value approximation, for details [5]. Concerning time discretization we proceeded as proposed in [1]: For the fluid problem an implicit Euler scheme is used with a semi-implicit treatment of the non-linear convective term. The structure problem is discretized by an implicit first-order scheme. Good stability properties of this scheme were reported in [1].
2
Iterative Methods for the Interface Equation
We apply a preconditioned Richardson method to (13): given λ0 , for k ≥ 0, λk+1 = λk + ω k Pk−1 −Ss (λk ) − Sf (λk ) , (22) with the relaxation parameter ω k and a proper preconditioner Pk .
Numerical Simulation of Fluid-Structure Interaction Problems
Number of splitted tetrahedra:
14
24
121
4
Fig. 2. Splitting of hybrid elements into tetrahedral elements
The Newton algorithm applied to (13) is obtained by using iteration (22) and choosing the preconditioner at step k (see [1]): Pk = Ss (λk ) + Sf (λk ). In our computation, we instead use an approximation of the full tangent operators Pˆk ≈ Ss (λk ) + Sf (λk ) which includes the Fr´echet derivative for the structure operator Ss and the classical Fr´echet derivative part which does not take into account the shape change for the fluid operator Sf . In each step of the iterative method a problem of the form (23) Pˆk μk = − Ss (λk ) + Sf (λk ) has to solved. For this we used a preconditioned GMRES method with precon ditioner Ss (λk ) (see [1]). Summarizing, the method can be described as follows: for k ≥ 0, 1. update the residual Ss (λk ) + Sf (λk ) by solving the structure and fluid subproblems, 2. solve the tangent problem (23) via GMRES method, 3. update the displacement λk+1 , if not accurate enough, go to step 1. Note that Step 1 can be parallelized due to the independence of the sub-problems for given interface boundary conditions. Step 2 requires solving the linearized structure and fluid problems several times during the GMRES iteration, for details we refer to [5]. The algebraic multigrid method (AMG) is used for the fluid and structure sub-problems, see [3] and [4].
3
Numerical Results
We simulate a pressure wave in a cylinder of length 5 cm and radius 5 mm at rest. The thickness of the structure is 0.5 mm. The structure is considered linear and clamped at both the inlet and outlet. The fluid viscosity is set to μ = 0.035,
122
H. Yang and W. Zulehner
(a) coarse mesh
(b) fine mesh
Fig. 3. Fine and coarse meshes for simulations
(a) t = 5 ms
(b) t = 10 ms
(c) t = 15 ms
(d) t = 20 ms
Fig. 4. Simulation results at time t = 5 ms (upper left), t = 10 ms (upper right), t = 15 ms (lower left) and t = 20 ms (lower right)
Numerical Simulation of Fluid-Structure Interaction Problems
123
the Lam´e constants to μl = 1.15 × 106 and λl = 1.73 × 106 , the density to ρf = 1.0 and ρs = 1.2. The fluid and structure are initially at rest and a pressure of 1.332 × 104 dyn/cm2 is set on the inlet for a time period of 3 ms. Two meshes (Fig.3.) are used for simulations: For all simulations, we use the same time step size δt = 1 ms and run the simulation until the same end time t = 20 ms as in [1]. For visualization purposes the deformation is amplified by a factor of 12. Fig.4 shows the pressure wave propagation on the fine mesh at different time levels. A relative error reduction by a factor of 10−5 is achieved in 2-3 outer iterations. Each of these iterations requires 6-8 GMRES iterations for a relative error reduction by a factor of 10−5 . For solving the structure problem, about 10 preconditioned conjugate gradient iterations with AMG preconditioning are needed for a relative error reduction by a factor of 10−8 , for the fluid problem about 5 AMG iterations for a relative error reduction by a factor of 10−8 . Almost the same numbers of iterations were observed for the coarse and the fine mesh. For the doubled time step size δt = 2 ms the number of inner AMG iterations for the fluid problem increased to about 10. In future work it is planned to implement an adaptive time stepping strategy to improve the efficiency of the method.
References 1. Deparis, S., Discacciati, M., Fourestey, G., Quarteroni, A.: Fluid-structure algorithms based on Steklov-Poincar´e operators. J. Comput. Methods Appl. Mech. Engrg. 105, 5799–5812 (2006) 2. Fern´ andez, M.A., Moubachir, M.: A Newton method using exact Jacobians for solving fluid-structure coupling. J. Comput. and Struct. 83(2-3), 127–142 (2005) 3. Reitzinger, S.: Algebraic Multigrid Methods for Large Scale Finite Element Equations. PhD thesis, Johannes Kepler Unversity Linz (2001) 4. Wabro, M.: Algebraic Multigrid Methods for the Numerical Solution of the Incompressible Navier-Stokes Equations. PhD thesis, Johannes Kepler Unversity Linz (2003) 5. Yang, H.: Numerical Simulations of Fluid-structure Interaction Problems on Hybrid Meshes with Algebraic Multigrid Methods. PhD thesis, Johannes Kepler Unversity Linz (in preparation) (2009)
Recent Developments in the Multi-Scale-Finite-Volume Procedure Giuseppe Bonfigli and Patrick Jenny ETH-Zurich, CH-5092 Zurich, Switzerland
[email protected]
Abstract. The multi-scale-finite-volume (MSFV) procedure for the approximate solution of elliptic problems with varying coefficients has been recently modified by introducing an iterative loop to achieve any desired level of accuracy (iterative MSFV, IMSFV). We further develop the iterative concept considering a Galerkin approach to define the coarse-scale problem, which is one of the key elements of the MSFV and IMSFVmethods. The new Galerkin based method is still a multi-scale approach, in the sense that upscaling to the coarse-scale problem is achieved by means of numerically computed basis functions resulting from localized problems. However, it does not enforce strict conservativity at the coarsescale level, and consequently no conservative velocity field can be defined as in the IMSFV-procedure until convergence to the exact solution is achieved. Numerical results are provided to evaluate the performance of the modified procedure.
1
Introduction
The multi-scale-finite-volume (MSFV) procedure by Jenny et al. [1] is a method for the approximate solution of discrete problems resulting from the elliptic partial differential equation ∇ · λ∇φ = q,
x ∈ Ω, λ > 0,
(1)
when the operator on the left-hand side is discretized on a Cartesian grid with standard five-point stencils. The coefficient λ can be an arbitrary function of the spatial coordinate x. Successive developments of the MSFV-approach have been proposed by Lunati and Jenny [2], Hajibeygi et al. [3] and Bonfigli and Jenny [4]. The iterative version, i.e. the IMSFV-method, has been introduced in [3] and modified in [4] to improve its efficiency in cases in which the coefficient λ varies over several orders of magnitude within the integration domain. The IMSFV-method as presented in [4] foresees two steps for the computation of the approximate solution at each level of the iterative loop: the fine-scale step or fine-scale relaxation and the coarse-scale step. The fine-scale step requires the solution of several localized problems on subdomains of the original grid with approximated boundary conditions. This allows to capture the finescale features in the spatial distributions of the coefficient λ and of the source I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 124–131, 2010. c Springer-Verlag Berlin Heidelberg 2010
Recent Developments in the Multi-Scale-Finite-Volume Procedure
125
term q. However, due to the local character of the considered problems, the interdependence of any two points in the integration domain, as implied by the elliptic character of equation (1) (global coupling), can not be accounted for at this stage. Analogies are found here with domain decomposition approaches, in particular if Dirichlet boundary conditions for the localized problems are extracted in the IMSFV-method from the solution at the previous iteration [4]. If the subdomains are reduced to a grid line or to a single grid point, standard line or Gauss-Seidel-relaxation is recovered. The coarse-scale step provides the global coupling missing in the fine-scale step. An upscaled problem with a strongly reduced number of degrees of freedom (coarse-scale problem) is defined by requiring the integral fulfillment of the conservation law (1) for a set of coarse control volumes. This provides an approximate solution on a coarse grid spanning the whole integration domain, which is then prolonged to the fine grid as in a two-level multi-grid approach [3,4]. The main novelty of the IMSFV-procedure considered as an elliptic solver is represented by the strategy used for restriction and prolongation, i.e. to define the coarse-scale problem and to interpolate its solution back to the fine grid. Both steps are achieved by means of the basis functions computed by solving numerically localized problems at the fine-scale level, such that the fine-scale features of the coefficient λ are approximatively accounted for. Even if the efficiency of the MSFV and IMSFV-methods have been demonstrated for several applications, some aspects still deserve further investigation. In particular: – The stability of the iterative procedure for critical λ-distributions has been progressively improved [3,4], but problems where the procedure diverges may still be found. – For large cases, the size of the coarse-scale problem may become significant, thus making a direct solution impracticable. The introduction of a further upscaling step, in analogy with multi-grid methods, would be an attractive alternative. The critical aspect with respect to the first point is clearly the quality of the coarse-scale operator. At the same time the structure of the coarse-scale problem also matters for the development of a multi-level procedure. The present investigation considers an alternative definition of the coarse-scale problem based on a Galerkin approach, which, as usual for Galerkin methods, leads to a symmetric positive-definite or semi-definite system matrix. This is clearly a desirable feature in view of the targeted development of a multi level procedure. Due to its analogy with a Galerkin-based finite-element approach we call the new procedure iterative-multi-scale-finite-element method (IMSFE). As a preliminary step towards a multi-level method, we compare the performance of the IMSFE-procedure with that of the original IMSFV procedure, when the coarse-scale problem is solved directly. The base-line formulation of the IMSFVprocedure is the one presented in [4]. We point out that Galerkin, also iterative, approaches have been already used in the context of multi-scale methods for elliptic problems, beginning with the
126
G. Bonfigli and P. Jenny
seminal work by Hou and Wu [5] (see also [6]). The novelty of our contribution is rather their application in the context of the iterative strategy from the IMSFVmethod, i.e. considering basis functions as defined in the IMSFV-procedure and including line or Gauss-Seidel relaxation to converge at the fine-scale level.
2
The IMSFV-Procedure
An extensive discussion of the IMSFV-procedure can be found in [3,4]. Here we only present the basics. The different grids considered for the formulation of the IMSFV-procedure are sketched in figure 1. The so-called coarse and dual grids, are staggered with respect to each other and their cells (coarse and dual cells) are defined as rectangular blocks of cells from the original grid (fine grid). Neighbouring dual cells share one layer of fine-grid cells (right bottom figure). Let P be the index set for an indexing of the fine-grid cells. The unknowns of the fine-scale problem are the values φj , j ∈ P, of the discrete solution φ at centroids xj of the fine cells. The unknowns of the coarse-scale problem are the values φj , j ∈ P¯ ⊂ P at fine-grid nodes corresponding to the corners of dual cells (coarse-grid nodes, crosses in figure 1). Given the approximate solution φ[n] the improved solution φ[n+1] at the next iteration step is given by φ[n+1] = S ns (φ[n] + δφ),
(2)
where S in the fine-scale relaxation operator (we only consider line and GaussSeidel relaxation), which is applied ns times after adding the correction δφ from the coarse-scale step to the old solution. In turn, the correction δφ is defined as the linear superposition δφ = δφj Φj (3) ¯ j∈P
¯ provided by of the basis functions Φj , scaled with the node-values δφj , j ∈ P, the solution of the coarse-scale problem.
¯j Ω ˜j Ω
¯j , solid lines) and dual grid (cells Ω ˜j , dashed lines) with enlargeFig. 1. Coarse (cells Ω ment of the corner region of coarse (top) and dual cells (bottom). Thin lines mark the boundaries of fine-grid cells, thick lines the boundaries of coarse and dual cells.
Recent Developments in the Multi-Scale-Finite-Volume Procedure
127
Once the basis functions are assigned, the coarse-scale problem, i.e. a linear ¯ is obtained by requiring the system for the coarse-scale unknowns δφj , j ∈ P, conservation law represented by equation (1) to be fulfilled by (φ[n] + δφ) for all ¯j . By considering the divergence theorem and the definition (3) of coarse cells Ω δφ, one obtains the following equivalent sets of equations: ¯j Ω
¯ k∈P
∇ · λ∇(φ[n] + δφ)dV =
δφk
qdV, ¯j Ω
¯j ∂Ω
λ∇Φk · ndS = −
¯j ∂Ω
¯ j ∈ P,
(4a)
¯ j ∈ P,
(4b)
λ∇φ[n] · ndS +
qdV, ¯j Ω
of which the latter is usually implemented in the IMSFV-procedure. Of course, the discrete version of the equations is considered in the numerical procedure, where the integration and differentiation operators are replaced by the corresponding discrete operators. The equivalence of equations (4a) and (4b) is preserved also in the discrete case, since the divergence theorem holds exactly for the considered finite-volume discretization. In conclusion, we detail the definition of the basis functions Φj , each of which is non-zero only within the dual cells intersecting the corresponding coarse cell ¯j . Fix j ∈ P¯ so that no face of Ω ¯j lies on the domain boundary ∂Ω and let Ω ˜k Ω ˜ ¯ ¯ be a dual cell so that Ωk ∩ Ωj = ∅. Let furthermore C ⊂ P be the set of indices of ˜k . The restriction of Φj to the coarse-grid nodes xl corresponding to corners of Ω ˜k is then determined by the following localized elliptic problem with so-called Ω reduced boundary conditions: ˆ ∇ · λ∇Φ j = 0,
˜k , x∈Ω
(5a)
Φj = 0, Φj = 1, ˆ t Φj = 0, ∇t · λ∇
x = x l , l ∈ C \ {j}, x = xj , ˜k . x ∈ ∂Ω
(5b) (5c) (5d)
Thereby, the tangential nabla operator ∇t = ∇ − n(n · ∇) is defined at boundary ˜k analogously to ∇, but ignores derivatives in direction normal points x ∈ ∂ Ω ˆ = max{λ, Λ}, ˆ Λˆ > 0, follows from to the boundary. The modified coefficient λ λ after removing very low values to avoid bad conditioning of the coarse-scale problem [4]. For numerical implementation, the discretized version of equation ˜k , leading (5d) is imposed in place of (5a) for cells adjacent to the boundary ∂ Ω ˜ to 1-d problems along the edges of ∂ Ωk , with Dirichlet-like boundary conditions ˜k lie on the domain given by (5b) and (5c). Finally, if one or more edges of Ω boundary, the homogeneous counterpart of the boundary condition prescribed for the solution φ are imposed there in place of equation (5d). The consistency of the definition of Φj at fine-grid nodes shared by more than one dual cell and their suitability to define the coarse system are discussed in [3,4].
128
3
G. Bonfigli and P. Jenny
The Galerkin Approach
The coarse-scale problem in its form (4a) can be interpreted as a weak formulation of the elliptic equation (1) considering the test functions ¯j 1, x ∈ Ω ¯ χΩ¯ j = , j ∈ P. (6) ¯j 0, x ∈ /Ω The obvious alternative considered in this work is represented by the Galerkin approach, in which the basis functions themselves are considered as test functions [5,6]. The following equivalent coarse-scale problems are derived: ¯ Φj [∇ · λ∇(φ[n] + δφ)]dV = Ψj qdV, j ∈ P, (7a) Ω Ω ¯ δφk Φj ∇ · λ∇Φk dV = Φj (−∇ · λ∇φ[n] + q)dV, j ∈ P, (7b) Ω
¯ k∈P
¯ k∈P
Ω
δφk
∇Φj · λ∇Φk dV = Ω
Φj (−∇ · λ∇φ[n] + q)dV,
¯ j ∈ P.
(7c)
Ω
Summations needed to evaluate the volume integrals in the discrete case, may be limited to cells in the support of Φj and their immediate neighbours. The formulation (7c) of the coarse-scale problem shows that the system matrix is symmetric and positive definite, except for problems with Neumann conditions on the whole boundary, for which it is positive semi-definite, with one single eigenvalue equal to zero. Constant φ-distributions are in this case the only non-trivial elements of the matrix null-space [4]. Correspondingly, the system is uniquely solvable if the boundary conditions are not only Neumann. In the opposite case, the solution exists, and is determined up to an additive constant, only if the usual compatibility constraint between Neumann boundary conditions and source term is fulfilled. The effects of the Galerkin approach on the performance of the complete iterative procedure are not as evident. Numerical experiments investigating this aspect for a challenging distribution of the coefficient λ are presented in the following section. We point out that the original IMSFV procedure provides at any convergence level of the iterative procedure an approximation g˜ for the gradient ∇φ, so that ∇ · g˜ = q (conservative vector field). This is not true for the Galerkin-based IMSFE-procedure, for which only the fully converged solution is conservative.
4
Numerical Tests
We consider the elliptic problem of the form (1), which has to be solved in the simulation of incompressible flows in porous media. The coefficient λ corresponds to a 2-d slice of the permeability field for the SPE-10-bottom-layer test case for reservoir simulations [7]. The fine grid contains 220 × 60 cells, equally divided
Recent Developments in the Multi-Scale-Finite-Volume Procedure 60 y
Log(λ) 44 3 22 1 00 -1 -2 -2
40 20 00
129
50
100
150
x 200
p 0.01 0.00 -0.01 0y
50 x
0
100 150 25
50
200
Fig. 2. Permeability distribution and solution. Lines in left figure correspond to the boundaries of the coarse cells, circles to the coarse-grid nodes.
into 22 × 6 coarse cells. Concentrated sinks and sources of equal intensity and opposite sign are set at two opposite corners of the integration domain. The distribution for λ and the fine-scale solution are displayed in figure 2. Convergence rates Γ for the IMSFV and IMSFE-procedures are presented in figure 3 for different values of the lower bound Λˆ (see equation (5)) and for different stretching factors α = Δx/Δy of the fine-grid cells. The coefficient λ is assumed to be constant within fine cells for the discrete formulation of the problem, and the value associated to each cell is not modified when varying α. Result are presented for computations in which fine-scale relaxation is obtained either by line (ns = 10) or Gauss-Seidel relaxation (ns = 20, each line-relaxation step includes two steps comparable to a Gauss-Seidel step, one for each spatial direction). The convergence rate Γ is defined as Γ = 1/N , where N is the average number of iterations needed to reduce the error = ||φ[n] − φ||∞ by one order of magnitude. We evaluate Γ considering iterations for which 10−5 ≤ ≤ 10−3 . If α 1, the connectivity between fine cells sharing edges normal to the y-axis is much larger than for cells sharing faces normal to the x-axis (anisotropic problems). This is known to be a critical situation, and similar problems could be handled successfully by IMSFV only after including line-relaxation in the finescale step [3]. The robustness of the iterative procedure in this respect was enˆ for the computation of basis hanced by introducing the clipped coefficient field λ functions [4]. Figure 3 shows that a further significant improvement may be accomplished by considering the Galerkin-based coarse system. On the other hand, the stability envelope of the original procedure can be enlarged only marginally by increasing the number ns of relaxation steps. Convergence rates for stable computations depend only moderately on the strategy considered to define the coarse-scale problem. However, the improved ˆ which stability of the Galerkin approach simplifies the choice of lower bound Λ, in the standard procedure is not easy, since, for a given α, the optimal value of Λˆ is often close to the limit of the stability region. When using the Galerkinbased approach, a reasonable choice for moderate α would be Λˆ = 0 (no lower bound). For very large stretching factors α, the performance is not satisfactory, neither for the IMSFV, nor for the IMSFE-procedure. The highest convergence ˆ rates are obtained for Λ/||λ|| ∞ = 1, i.e. when the basis functions are computed
130
G. Bonfigli and P. Jenny
3
3
α
α
2
2
1
1
0
-6
-4
-2
ˆ Λ/||λ|| ∞
0
0
3
3
α
α
2
2
1
1
0
-6
-4
-2
0
0 ˆ Λ/||λ|| ∞
-6
-4
-2
-6
-4
-2
ˆ Λ/||λ|| ∞
0
0 ˆ Λ/||λ|| ∞
log10 (Γ ) Fig. 3. Convergence rates for the IMSFE (left colum) and the IMSFV-procedures (right column) considering line relaxation (ns = 10, top row) or Gauss-Seidel relaxation (ns = 20, bottom row) for fine-scale relaxation. Blanked values indicate divergence of the iterative procedure.
10
0
0
10
Γ
ns ns ns ns ns
Γ 10-1
10-1
=5 = 10 = 15 = 20 = 300
10-2 10
-2
10-3 10
-3
10
-4
10
ns ns ns ns ns
=5 = 10 = 15 = 20 = 300
10-4
-5
0
10
1
10
2
3
α
10
10
0
10
10
1
2
10
α
10
3
Fig. 4. Convergence rates for the IMSFE (lines) and the IMSFV-procedures (lines with symbols) as functions of the number ns of fine scale relaxation steps and of the grid −2 ˆ stretching α with Λ/||λ|| . Fine-scale relaxation is achieved either by means ∞ = 10 of line-relaxation (left) of Gauss-Seidel relaxation (right).
for a homogeneous λ-distribution. In this case the IMSFV-approach performs slightly better. If Λˆ is decreased, the IMSFV-procedure becomes unstable, while convergence rates for the Galerkin based approach decrease to the level provided by the fine-scale relaxation step. Better results for anisotropic problems, could
Recent Developments in the Multi-Scale-Finite-Volume Procedure
131
be achieved by modifying the number of fine cells per coarse cell so to keep the aspect ratio of the latter close to one. Convergence rates are consistently higher if line relaxation is used in place of Gauss-Seidel relaxation. The dependence of Γ on the number ns of relaxation steps is investigated in −2 ˆ figure 4 for Λ/||λ|| . Convergence rates are found in all cases to grow ∞ = 10 proportionally to ns . Since fine-scale relaxation represents a relevant portion of the computational costs, no determinant gain of performance can be attained by increasing ns .
5
Conclusions
A Galerkin formulation has been proposed for the coarse-scale problem of the IMSFV-procedure, leading to a symmetric positive-definite or semi-definite matrix. This is equivalent to using a finite-element approach at the coarse level [5,6]. Numerical testing has been carried out considering one challenging spatial distribution of the coefficient in the elliptic problem (permeability field according to the terminology used for incompressible flows in porous media). If the coarse system is solved directly as usual in IMSFV, the resulting IMSFE-procedure is significantly more stable than the original one. For cases where both procedure converge, differences in their convergence rates are moderate. These results seem to indicate that the Galerkin-based approach, due to the favourable features of the coarse-scale-system matrix, could be a good starting point for the generalization of IMSFV as a multi-level procedure. Further tests considering different permeability distributions are needed to allow conclusive statements.
References 1. Jenny, P., Lee, S.H., Tchelepi, H.A.: Multi-scale finite-volume method for elliptic problems in subsurface flow simulation. J. Comp. Phys. 187, 47–67 (2003) 2. Lunati, I., Jenny, P.: Multiscale finite-volume method for density-driven flow in porous media. Comput. Geosci. 12, 337–350 (2008) 3. Hajibeygi, H., Bonfigli, G., Hesse, M.A., Jenny, P.: Iterative multiscale finite-volume method. J. Comp. Phys. 277, 8604–8621 (2008) 4. Bonfigli, G., Jenny, P.: An efficient multi-scale poisson solver for the incompressible navier-stokes equations with immersed boundaries. J. Comp. Phys. (accepted, 2009) 5. Hou, T.Y., Wu, X.H.: A multiscale finite element method for elliptic problems in composite materials and porous media. J. Comp. Phys. 134, 169–189 (1997) 6. Efendiev, Y., Hou, T.Y.: Multiscale Finite Element Methods: Theory and Application, 1st edn. Springer, Heidelberg (2009) 7. Christie, M.A., Blunt, M.J.: 10th SPE comparative solution project: a comparison of upscaling techniques. In: SPE 66599 (February 2001)
Boundary Element Simulation of Linear Water Waves in a Model Basin Clemens Hofreither1 , Ulrich Langer2 , and Satyendra Tomar3 1
3
DK Computational Mathematics, JKU Linz Altenberger Straße 69, 4040 Linz, Austria 2 Institute of Computational Mathematics, JKU Linz Altenberger Straße 69, 4040 Linz, Austria Johann Radon Institute for Computational and Applied Mathematics (RICAM) Altenberger Straße 69, 4040 Linz, Austria
Abstract. We present the Galerkin boundary element method (BEM) for the numerical simulation of free-surface water waves in a model basin. In this work, as a first step we consider the linearized model of this timedependent three-dimensional problem. After time discretization by an explicit Runge-Kutta scheme, the problem to be solved at each time step corresponds to the evaluation of a Dirichlet-to-Neumann map on the free surface of the domain. We use the Galerkin BEM for the approximate evaluation of the Dirichlet-to-Neumann map. To solve the resulting large, dense linear system, we use a data-sparse matrix approximation method based on hierarchical matrix representations. The proposed algorithm is quasi-optimal. Finally, some numerical results are given.
1
Introduction
Numerical simulation of free-surface water waves in a model test basin is important when maritime structures, such as freight carriers, ferries, and oil rigs are tested on a model scale. Before the actual construction, the owners and designers need precise information about the hydrodynamic properties of their design. Several numerical algorithms have been proposed for such problems, see e.g. [8] (a combination of finite element method and finite difference method), or [7] (a complete discontinuous Galerkin finite element method). For such problems, only the discretization of the surface is of interest, which is a typical characteristic of the boundary element method (BEM). Unfortunately, the resulting system of linear equations from BEM is dense. On the other hand, while the finite element method (FEM) requires the discretization of the whole domain, the resulting matrices are sparse, and fast iterative solvers are available for these discrete systems. Thus, asymptotically, FEM has been considered the favorable choice as compared to BEM, at least, for three-dimensional problems, see e.g. [4]. However, data-sparse approximations of the dense BEM matrices can overcome this drawback of the BEM, see [2,5] and the references wherein. In this paper, we present the Galerkin BEM for the numerical simulation of linear free-surface water waves in a model basin. We use a linearized model for I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 132–139, 2010. c Springer-Verlag Berlin Heidelberg 2010
Boundary Element Simulation of Linear Water Waves in a Model Basin
133
representing the dynamics of the water, which is introduced in Section 2. Its basic structure is that of an operator ordinary differential equation involving a Dirichlet-to-Neumann map on its right-hand side. For the evaluation of this operator, we employ a boundary element method, which is briefly described in Section 3. The resulting large, dense matrices are approximated via data-sparse H-matrix techniques, as outlined in Section 4. Finally, we present numerical results in Section 5 and conclude our findings in Section 6.
2
Modeling
We briefly outline the mathematical model used to describe the behavior of a model basin. The detailed derivation can be found in [9]. We start from the Navier-Stokes equations for an ideal, incompressible fluid. By assuming an irrotational flow in a simply connected domain, we may introduce a potential φ(x, y, z, t) for the velocity v = ∇φ. If the amplitude of the waves is small in comparison to the depth of the basin, we may linearize the problem; our computational domain Ω ⊂ R3 is then time-independent. Without loss of generality, let Ω be oriented such that gravity applies in negative z direction with a magnitude g, and the free surface ΓF of Ω is a subset of the plane at z = 0. The remainder of the surface, ΓN = ∂Ω \ ΓF , represents the walls of the basin, where we prescribe a given Neumann boundary condition gN , possibly time-dependent, for the potential φ. We denote the vertical perturbation due to waves at the free surface by a scalar function ζ(x, y, t) with (x, y, 0) ∈ ΓF ; see Figure 1 for a sketch. Under these assumptions, we obtain the following system of equations. The first and the second condition on the free surface are called the kinematic and the dynamic boundary condition, respectively. ⎫ ∂φ ⎬ = gN (t) on ΓN , ⎪ Δφ = 0 in Ω, ∂n (1) ∂φ ∂ζ ∂φ ⎪ ⎭ = on ΓF , = −gζ on ΓF . ∂t ∂n ∂t
z=0
ζ(x, y, t)
Ω Fig. 1. A sketch of the free surface parametrization ζ
134
C. Hofreither, U. Langer, and S. Tomar
We now introduce the Dirichlet-to-Neumann map or Steklov-Poincar´e operator S(t) : H 1/2 (ΓF ) → H −1/2 (ΓF ). For given Dirichlet data gD ∈ H 1/2 (ΓF ), let u ∈ H 1 (Ω) be the weak solution of the mixed boundary value problem Δu = 0
in Ω,
∂u = gN (t) ∂n
on ΓN ,
and u = gD
on ΓF .
(2)
We then define S(t) to be the Dirichlet-to-Neumann map ⏐ ∂u ⏐ ⏐ . S(t) : gD → ∂n ⏐ΓF Using this operator, the behavior of the model problem (1) on the free surface may be specified in the form of a system of two coupled operator ordinary differential equations, d φF 0 −g φF = , (3) S(t) 0 ζ dt ζ where φF represents the trace of the potential φ on ΓF . As initial values, we use constant zero functions for both φF and ζ. In the context of our model problem, this corresponds to the free surface being undisturbed and at rest initially. For the solution of this ODE system in a time interval [0, T ], we introduce a discretization of the time axis, 0 = t0 < t1 < . . . < tN = T , with the step sizes τ (n) := tn+1 − tn . After spatial discretization, the Steklov-Poincar´e operator S exhibits moderate stiffness with a Lipschitz constant of the order O(h−1 ). The use of an explicit scheme for time integration, e.g. a classical fourth-order RungeKutta method, is thus justifiable as long as the time step size is not chosen too large. For the discretization of the Steklov-Poincar´e operator S, we use a boundary element method, as motivated in the introduction.
3
The Boundary Element Method
The BEM operates only on the Cauchy data, that is, on the Dirichlet and Neumann traces on the boundary of the computational domain. The Cauchy data are related to each other via an integral equation on the boundary. This equation is then solved via e.g. a collocation or Galerkin approach. For a thorough treatment of the boundary element method, we refer the reader to e.g. [5]. For the remainder of this section, let u refer to the Dirichlet values of the ∂u |Γ refer to its Neumann solution of the PDE on the boundary Γ , and let v := ∂n data. 1 1 of the Laplace With the help of a fundamental solution E(x, y) = 4π |x−y| equation in 3D, we now define the boundary integral operators V : H −1/2 (Γ ) → H 1/2 (Γ ), K : H 1/2 (Γ ) → H 1/2 (Γ ), D : H 1/2 (Γ ) → H −1/2 (Γ )
Boundary Element Simulation of Linear Water Waves in a Model Basin
135
called the single layer potential operator, double layer potential operator and hypersingular operator, respectively. We have the relation 1 u u I −K V u =C := 2 , (4) 1 D v v v 2I + K where the two-by-two block operator C is called the Calder´ on projector. Consider now the mixed boundary value problem (2) for a fixed t. We define extensions g˜D ∈ H 1/2 (Γ ) and g˜N ∈ H −1/2 (Γ ) of the given boundary values and choose the ansatz u = g˜D + uN , v = g˜N + vD with uN ∈ H 1/2 (Γ ), vD ∈ H −1/2 (Γ ). Substituting this in (4) and restricting the equations to suitable parts of the boundary yields 1 g D + K gD − V gN 2 1 = g N − D gD − K g N 2
V vD − KuN = K vD + DuN
in H 1/2 (ΓD ),
(5)
in H −1/2 (ΓN ).
(6)
−1/2 (ΓD ) × H 1/2 (ΓN ), where We then choose a trial space Λ := H
1/2 (Γ ) := v = v˜|Γ : v˜ ∈ H 1/2 (Γ ), supp v˜ ⊂ Γ , H
−1/2 (Γ ) := H 1/2 (Γ ) H for any open subset Γ ⊂ Γ . With an arbitrary test function (s, t) ∈ Λ, we multiply (5) by s and (6) by t in order to obtain a variational formulation. We discretize Ω by a quasi-uniform and shape-regular triangulation with mesh size h. On this mesh, we define the space of piecewise linear and continuous 1/2 (ΓN ) on the Neumann boundary, and the space of functions Sh1 (ΓN ) ⊂ H −1/2 (ΓD ) on the Dirichlet boundary. piecewise constant functions Sh0 (ΓD ) ⊂ H This gives us the natural choice Λh := Sh0 (ΓD ) × Sh1 (ΓN ) ⊂ Λ for a finitedimensional trial space. We thus obtain the Galerkin variational formulation: Find (vDh , uN h ) ∈ Λh such that for all (sh , th ) ∈ Λh , there holds a(vDh , uNh ; sh , th ) = F (sh , th ),
(7)
with the bilinear form a(v, u; s, t) = V v, sΓD − Ku, sΓD + K v, tΓN + Du, tΓN and an analogous abbreviation F (s, t) of the right-hand side. The resulting linear system may be solved by a suitable Krylov subspace method, e.g. MINRES.
4
Data-Sparse Approximation
The system matrices obtained from (7) are dense. By discretizing the boundary only, we obtain NBEM = O(h−2 ) unknowns, resulting in a fully populated system
136
C. Hofreither, U. Langer, and S. Tomar
matrix with O(h−4 ) non-zero entries. Even with an optimally preconditioned iterative solver, the solution of the corresponding linear system will thus require at least O(h−4 ) arithmetical operations. In contrast to the BEM, the FEM, where we have to discretize the entire domain, results in a system with NFEM = O(h−3 ) unknowns. However, the finite element stiffness matrices are sparse, i.e. they have only O(h−3 ) non-zero entries. With an optimal preconditioner, a numerical solver with O(h−3 ) operations is attainable. However, it is possible to avoid the difficulties associated with classical BEM discretization by using data-sparse approximation of the involved system matrices. In particular, we apply hierarchical matrix (or H-matrix ) techniques to represent the system matrix. This results in a considerable reduction of the memory demand for matrix coefficients than the storage of the full matrix would require; see e.g. Bebendorf [2]. The core idea is to approximate a matrix A∈ Rm×n by a sum of outer prodr ucts of vectors ui ∈ Rm , vi ∈ Rn , i.e. A ≈ A˜ = i=1 ui viT . We call this a low-rank approximation of A with rank (at most) r. One way to construct such approximations is to compute a singular value decomposition of A and then discard all singular values which are below a certain threshold. This is called truncated singular value decomposition. While this produces optimal approximations in the spectral norm [2], it is quite slow. Therefore, faster methods have been developed in recent years. We only mention here the Adaptive Cross Approximation (ACA) method. Its concept is to construct A˜ from a sum of crosses, that is, outer products of a column and a row of A. Once a desired error threshold is reached, the process is stopped. The approximation rank is thus determined adaptively. Our BEM system matrices share the common property that they have a nearsingularity along the diagonal. For such matrices, typically no suitable low-rank approximation with desirable accuracy exists. Thus, the algorithms described above fail to give useful results if applied to the entire matrix. Instead, it is required to split the matrix recursively into sub-matrices which may be better approximated. This is done by clustering the degrees of freedom which are represented by the matrix rows and columns into so-called cluster trees. Pairs of clusters are then chosen according to some admissibility condition for low-rank approximation. By this method, we obtain a so-called H-matrix approximation. Various matrix operations like addition, factorization or inversion may be generalized to H-matrices. We refer the interested reader to the comprehensive monograph [2] for a detailed discussion of the techniques mentioned above. We also mention that software libraries which implement the methods sketched above are available, e.g. HLib [3] and ahmed [1]. The latter was used in our numerical experiments. Data-sparse approximation is also used for preconditioning. From (7), we get the system of boundary element equations in the form −f Dh −Vh Kh v Dh = . KhT Dh uN h f Nh
Boundary Element Simulation of Linear Water Waves in a Model Basin
137
Table 1. Performance of the algorithm for T=80 # triangles (N = O(h−2 )) 704 2816 11264 45056
init 9 62 520 4162
solve 20 370 3214 20473
total 29 432 3734 24635
ratio – 16.78 8.78 6.60
As a preconditioner for this block matrix, we choose I 0 −Vh Kh C= , 0 Dh −KhT Vh−1 I where hierarchical Cholesky factorizations of Vh and Dh are used to apply this preconditioner in an approximate way.
5
Numerical Results
In the following, we extend a numerical example from [7] into three dimensions. We assume that we have a test basin with the dimensions Ω = (0, 10) × (0, 1) × (−1, 0). The free surface at rest is the quadrilateral ΓD = (0, 10) × (0, 1) × {0}. On the remaining walls ΓN , we prescribe the normal velocity gN (x, y, z, t) that is (1 + z)a sin(ωt) for all (x, y, z) ∈ {0} × (0, 1) × (−1, 0) and 0 otherwise. That is, the left wall of the basin is assumed to be equipped with a wave maker which exhibits periodic oscillations with maximum amplitude a = 0.02 and frequency ω = 1.8138. The other walls are assumed to be stationary. Note that the oscillations exhibit maximum amplitude at the top of the basin and vanish at the bottom. For time discretization, we use a fixed time step τ = 0.1 over the time interval [0, T = 80]. This results in 800 time steps, each of which requires four solutions of the mixed boundary value problem in the domain Ω. The surface Γ is discretized by triangles using the software package netgen. We use a series of uniformly refined boundary meshes where each refinement step quadruples the number of triangles. The computations are performed on a machine with four Opteron-852 processors and 32 GB of RAM. Table 1 summarizes the performance of our data-sparse BEM algorithm. The first column indicates the number of triangles in the boundary mesh. The second and third columns show the CPU time (in seconds) used for initialization and solution of the problem, while the fourth column shows total CPU time. Finally, the fifth column gives the ratio between total time for the current and the previous smaller problem. Note that there is a significant cost for the generation and factorization of the system matrices. This however has to be performed only once at startup. The scheme is thus better suited for long simulations where many time iterations are to be performed. Also, parallelization was used for this initialization phase, so
138
C. Hofreither, U. Langer, and S. Tomar
(a) Resulting wave profile at t = 20.0, t = 38.0, t = 67.0, and t = 120.0 (left to right)
(b) Asymmetric mesh employed in the computations Fig. 2. Wave profiles computed on an asymmetric mesh
actual measured times were lower. No appreciable parallelization overhead could be measured, so the CPU times given above can be taken as wall clock times for serial execution on one CPU. As discussed at the beginning of Section 4, a FEM-based implementation of the Dirichlet-to-Neumann map would have O(h−3 ) unknowns and thus, if optimally preconditioned, a time complexity of O(h−3 ) = O(N 3/2 ). Since evaluating the Dirichlet-to-Neumann map is the main bottleneck in the numerical 3 simulation, we could then expect a constant ratio 4 2 = 8 in the last column of Table 1. The numbers we have obtained here thus suggest that our scheme may outperform a FEM-based approach for large problems. To investigate the stability of the proposed scheme over a long period of time, we performed the simulations up to T = 120 with N = 11264 and N = 45056 triangles, and did not face any stability problems like those reported in [8]. Figure 2(a) shows the computed wave profile with 45056 boundary triangles at times t = 20.0, 38.0, 67.0 and t = 120.0. At t = 20.0 the wave starts approaching the wall opposite to the wave maker, at t = 38.0 the wave gains full height, and t = 67.0 the wave reflected from the wall is affecting the pattern in the basin. The wave is also exhibiting a very nice pattern at t = 120.0, without any stability problems arising from the numerical scheme. Since no experimental data is available to validate our results (for a 3-D linear problem), we can rely only on visual results. However, they are in excellent agreement with a behavior similar to that of the 2-D problem reported in [7]. It is also important to note here that for a non-uniform/asymmetric mesh (Figure 2(b) shows the structure of the mesh employed in our computations) we do not need any artificial stabilization in our numerical scheme. This property is highly desirable for simulations over a long period of time. A detailed analysis of the effect of mesh-asymmetry in the numerical computations was carried out in [6], and the resulting mechanism was used to stabilize the numerical scheme in [6,7].
Boundary Element Simulation of Linear Water Waves in a Model Basin
6
139
Conclusion
We have presented a numerical scheme for solving a linearized, time-dependent potential flow problem using the fourth-order explicit Runge-Kutta method and the boundary element method for the time and space discretizations, respectively. Data-sparse approximations of the resulting large-scale, dense matrices were used to make an efficient solution feasible. While this technique incurs a large one-time overhead for setting up the hierarchical matrices, it is worthwhile when many boundary value problems are to be solved on a fixed geometry, as the results in Section 5 indicate. For the same reason, however, a direct generalization to the case of a non-linearized, time-dependent domain seems to be more problematic at the first glance, since the system matrices would have to be recomputed at every iteration step. This problem may be alleviated by using the same preconditioner in every iteration and taking advantage of the fact that the matrix generation step may be trivially parallelized. The numerical scheme does not require a separate velocity reconstruction, or different order polynomials for velocity field and potential to preserve the accuracy in the wave height. A rigorous analysis of the numerical scheme proposed in this paper is still in progress.
References 1. Bebendorf, M.: ahmed. Another software library on hierarchical matrices for elliptic differential equations, Universit¨ at Leipzig, Fakult¨ at f¨ ur Mathematik und Informatik 2. Bebendorf, M.: Hierarchical Matrices. Springer, Heidelberg (2008) 3. B¨ orm, S., Grasedyck, L.: HLib. A program library for hierarchical and H 2 -matrices, Max Planck Institute for Mathematics in the Sciences, Leipzig 4. Cai, X., Langtangen, H.P., Nielsen, B.F., Tveito, A.: A finite element method for fully nonlinear water waves. J. Comput. Physics 143(2), 544–568 (1998) 5. Rjasanow, S., Steinbach, O.: The Fast Solution of Boundary Integral Equations. Mathematical and Analytical Techniques with Applications to Engineering. Springer-Verlag New York, Inc. (2007) 6. Robertson, I., Sherwin, S.: Free-surface flow simulation using hp/spectral elements. J. Comput. Phys. 155(1), 26–53 (1999) 7. Tomar, S.K., van der Vegt, J.J.W.: A Runge-Kutta discontinuous Galerkin method for linear free-surface gravity waves using high order velocity recovery. Comput. Methods Appl. Meth. Engrg. 196, 1984–1996 (2007) 8. Westhuis, J.-H.: The Numerical Simulation of Nonlinear Waves in a Hydrodynamic Model Test Basin. PhD thesis, Universiteit Twente (2001) 9. Whitham, J.: Linear and Nonlinear Waves. John Wiley, New York (1974)
Numerical Homogenization of Bone Microstructure Nikola Kosturski and Svetozar Margenov Institute for Parallel Processing, Bulgarian Academy of Sciences
Abstract. The presented study is motivated by the development of methods, algorithms, and software tools for μFE (micro finite element) simulation of human bones. The voxel representation of the bone microstructure is obtained from a high resolution computer tomography (CT) image. The considered numerical homogenization problem concerns isotropic linear elasticity models at micro and macro levels. Keywords: μFEM, microstructure.
1
nonconforming
FEs,
homogenization,
bone
Introduction
The reference volume element (RVE) of the studied trabecular bone tissue has a strongly heterogeneous microstructure composed of solid and fluid phases (see Fig. 1). In a number of earlier articles dealing with μFE simulation of bone structures (see, e.g., [8,10]), the contribution of the fluid phase is neglected. This simplification is the starting point for our study and for the related comparative analysis. In this study the fluid phase located in the pores of the solid skeleton is considered as almost incompressible linear elastic material. This means that the bulk modulus Kf is fixed and the related Poisson ration νf is approaching the incompressibility limit of 0.5. The elasticity modulus Es and the Poisson ratio νs of the solid phase as well as the bulk modulus of the fluid phase Kf are taken from [4]. Let Ω ⊂ IR3 be a bounded domain with boundary Γ = ΓD ∪ ΓN = ∂Ω and u = (u1 , u2 , u3 ) be the displacements in Ω. The components of the small strain tensor are 1 ∂ui ∂uj εij = + , 1 ≤ i, j ≤ 3 2 ∂xj ∂xi and the components of the Cauchy stress tensor are 3 σij = λ εkk δij + 2μεij , 1 ≤ i, j ≤ 3. k=1
Here λ and μ are the Lam´e coefficients, which can be expressed by the elasticity modulus E and the Poisson ratio ν ∈ (0, 0.5) as follows λ=
Eν , (1 + ν)(1 − 2ν)
μ=
E . 2 + 2ν
I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 140–147, 2010. c Springer-Verlag Berlin Heidelberg 2010
Numerical Homogenization of Bone Microstructure
141
Fig. 1. Microstructure of a trabecular bone specimen
Now, we can introduce the Lam´e’s system of linear elasticity (see, e.g., [2]) 3 ∂σij j=1
∂xj
+ fi = 0,
i = 1, 2, 3
(1)
equipped with the boundary conditions 3
ui (x) = gi (x),
x ∈ ΓD ⊂ ∂Ω,
σij (x)nj (x) = hi (x),
x ∈ ΓN ⊂ ∂Ω,
j=1
where nj (x) are the components of the normal vector of the boundary for x ∈ ΓN . The remainder of the paper is organized as follows. The applied locking-free nonconforming FEM discretization is presented in the next section. Section 3 contains a description of the applied numerical homogenization scheme. Selected numerical results are given in Section 4. The first test problem illustrates the locking-free approximation when the Poisson ratio tends to the incompressibility limit. Then, a set of numerical homogenization tests for real-life bone microstructures are presented and analyzed. Some concluding remarks are given at the end.
2 Let
Locking-Free Nonconforming FEM Discretization (H01 (Ω))3 = u ∈ (H 1 (Ω))3 : u = 0 on ΓD , (Hg1 (Ω))3 = u ∈ (H 1 (Ω))3 : u = g on ΓD .
The weak formulation of (1) can be written in the form (see, e.g., [3]): for a given f ∈ (L2 (Ω))3 find u ∈ (Hg1 (Ω))3 such that for each v ∈ (H01 (Ω))3 a(u, v) = − (f , v)dx + (h, v)dx. Ω
ΓN
142
N. Kosturski and S. Margenov
In the case of pure displacement boundary conditions, the bilinear form a(·, ·) can be written as (Cd(u), d(v)) = (C ∗ d(u), d(v)) = a∗ (u, v), (2) a(u, v) = Ω
where
d(u) =
Ω
∂u1 ∂u1 ∂u1 ∂u2 ∂u2 ∂u2 ∂u3 ∂u3 ∂u3 , , , , , , , , ∂x1 ∂x2 ∂x3 ∂x1 ∂x2 ∂x3 ∂x1 ∂x2 ∂x3
T
and the matrices C and C ∗ are ⎤ ⎡ λ + 2μ 0 0 0 λ 000 λ ⎢ 0 μ0μ 0 000 0 ⎥ ⎢ ⎥ ⎢ 0 ⎥ 0 μ 0 0 0 μ 0 0 ⎢ ⎥ ⎢ 0 μ0μ 0 ⎥ 0 0 0 0 ⎢ ⎥ ⎥, C =⎢ λ 0 0 0 λ + 2μ 0 0 0 λ ⎢ ⎥ ⎢ 0 ⎥ 0 0 0 0 μ 0 μ 0 ⎢ ⎥ ⎢ 0 ⎥ 0 μ 0 0 0 μ 0 0 ⎢ ⎥ ⎣ 0 000 0 μ0μ 0 ⎦ λ 000 λ 0 0 0 λ + 2μ ⎤ ⎡ λ + 2μ 0 0 0 λ + μ 0 0 0 λ + μ ⎢ 0 μ00 0 000 0 ⎥ ⎥ ⎢ ⎢ 0 0μ0 0 000 0 ⎥ ⎢ ⎥ ⎢ 0 00μ 0 000 0 ⎥ ⎢ ⎥ ⎥ C∗ = ⎢ ⎢ λ + μ 0 0 0 λ + 2μ 0 0 0 λ + μ ⎥ . ⎢ 0 000 0 μ00 0 ⎥ ⎢ ⎥ ⎢ 0 000 0 0μ0 0 ⎥ ⎢ ⎥ ⎣ 0 000 0 00μ 0 ⎦ λ + μ 0 0 0 λ + μ 0 0 0 λ + 2μ Let us note, that the modified bilinear form a∗ (·, ·) is equivalent to a(·, ·) if ∂ui ∂uj ∂ui ∂uj dx = dx (3) ∂x ∂x j i Ω Ω ∂xi ∂xj is fulfilled. It is also important, that the rotations are excluded from the kernel of the modified (corresponding to a∗ (·, ·)) Neumann boundary conditions operator. As a result, the nonconforming Crouzeix-Raviart (C.–R.) FEs are applicable to the modified variational problem, providing a locking-free discretization. Locking-free error estimates of the 2D pure displacement problem discretized by C.–R. FEs are presented, e.g., in [3]. A similar analysis is applicable to the 3D case as well. It is easy to check that the equality (3) holds true also for the RVE with Dirichlet boundary conditions with respect to the normal displacements only. As we will see in the next section, this is exactly the case in the studied numerical homogenization scheme.
Numerical Homogenization of Bone Microstructure
143
Fig. 2. Splitting a voxel into six tetrahedra
In order to apply C.–R. FE discretization of RVE, we split each voxel (macroelement) into six tetrahedra, see Fig. 2. For a RVE divided into n × n × n voxels, the number of unknowns in the resulting linear system is 18n2 (2n + 1). A block factorization (static condensation) is applied first on the macroelement level, eliminating the unknowns associated with the inner nodes. This step reduces the total number of unknowns to 18n2 (n + 1).
3
Numerical Homogenization
We study the implementation of a standard numerical homogenization scheme of Dirichlet boundary conditions type [6]. The RVE boundary value problem has zero normal displacements on five of the faces of the cube and a (small) nonzero constant normal displacement on the sixth face (see Fig. 3). The right hand side is also supposed to be zero. Nonconforming C.–R. finite elements (based on the modified bilinear form a∗ (·, ·) from (2)) are used to get a locking-free discretization. By symmetry arguments, it simply follows that the homogenized stress and the strain tensors have zero shear components, and therefore the following relation between the homogenized normal stress and strain components holds ⎤ ⎡ 1 σ ˆx E(1 − ν) ν ⎣σ ⎦ ⎣ ˆy = (1 + ν)(1 − 2ν) 1−ν ν σ ˆz 1−ν ⎡
ν ν 1−ν 1−ν ν 1 1−ν ν 1−ν 1
⎤ εˆx ⎦ ⎣ εˆy ⎦ . εˆz ⎤⎡
Let us consider now the case, where the nonzero constant normal displacement (in z direction) is applied on the top face of the RVE cube. Then εˆx = εˆy = 0 and therefore σ ˆx = σ ˆy =
Eν εˆz , (1 + ν)(1 − 2ν)
and σ ˆz =
E(1 − ν) εˆz . (1 + ν)(1 − 2ν)
144
N. Kosturski and S. Margenov
Fig. 3. Boundary conditions of the RVE problem
From these relations we can directly calculate the homogenized elasticity coefficients as follows ν=
1 , 1+p
where p=
E=
(1 + ν)(1 − 2ν) r, 1−ν
σ ˆz σ ˆz = , σ ˆx σ ˆy
r=
σ ˆz . εˆz
ˆy . In the case of numerical homogenization, the average values In theory σ ˆx = σ of the stresses σx and σy over all tetrahedral finite elements are used. In the case of numerical homogenization of strongly heterogeneous materials (like bone tissues), the average stresses σ ˆx and σ ˆy are not equal. This is why we set 1 σ ˆx = σ ˆy = 2
Nel Nel 1 1 σxi + σi Nel i=1 Nel i=1 y
when computing the homogenized elasticity modulus and Poisson ratio. Similar relations hold when the nonzero displacements are alternatively applied in the x and y directions. To determine the elasticity coefficients more precisely, we apply the above numerical scheme in all three cases, and finally average the computed elasticity modulus and Poisson ratio.
4
Numerical Experiments
A set of numerical tests illustrating the accuracy of the locking-free FEM approximation in the almost incompressible case are presented first. The second part of the section contains results of numerical homogenization of three real-life
Numerical Homogenization of Bone Microstructure
145
Table 1. Relative error on a fixed 32 × 32 × 32 mesh for ν → 0.5 ν 0.4 0.49 0.499 0.4999 0.49999
r∞ 0.033733 0.052206 0.551943 5.551980 55.552900
f ∞ 214407 1381450 13125600 1.31 × 108 1.31 × 109
r∞ /f ∞ 1.57331 × 10−7 3.77904 × 10−8 4.20509 × 10−8 4.24652 × 10−8 4.25009 × 10−8
bone microstructure RVEs. The (additional) response of the fluid phase within the studied linear elasticity model is analyzed. The studied methods and algorithms are addressed to the case of μFEM numerical homogenization which leads in general to large-scale linear systems. The developed solver is based on the Preconditioned Conjugate Gradient (PCG) [1] method, where BoomerAMG1 is used as a preconditioner (see also [7] for an alternative locking-free AMLI solver in 2D). In the presented numerical tests, the relative stopping criterion for the PCG method is rTk C −1 rk ≤ 10−12 rT0 C −1 r0 . The numerical tests are run on an IBM Blue Gene/P supercomputer. 4.1
Model Problem
A homogeneous material with fixed elasticity modulus E = 0.06 GPa is considered. The computational domain Ω is the unit cube [0, 1]3 . A pure displacement problem is considered with Dirichlet boundary conditions on ΓD = ∂Ω corresponding to the given exact solution u1 (x, y, z) = x3 + sin(y + z), u2 (x, y, z) = y 3 + z 2 − sin(x − z), u3 (x, y, z) = x2 + z 3 + sin(x − y). The right hand side f is obtained by substituting the exact solution and the corresponding material coefficients in Lam´e’s system (1). The relative errors presented in Table 1 well illustrate the robustness (locking-free approximation) of the C.–R. nonconforming FEM approximation of almost incompressible elasticity problems. 4.2
Homogenization of Bone Microstructures
Three test cases of different RVE are considered. The related voxel representations (16 × 16 × 16, 32 × 32 × 32 and 64 × 64 × 64) of the bone microstructure are extracted from a high resolution computer tomography (CT) image [9]. The used mechanical properties of bones are from [4]. The elasticity modulus of solid phase is Es = 14.7 GPa and the related Poisson ratio is νs = 0.325. The fluid phase is considered as almost incompressible where the bulk modulus Kf = 2.3 GPa is 1
BoomerAMG is a parallel algebraic multigrid implementation from the package Hypre, developed in LLNL, Livermore [5].
146
N. Kosturski and S. Margenov Table 2. Solid skeleton: Es = 14.7 GPa, νs = 0.325 Solid phase Ehom νhom
16 × 16 × 16 47 % 3.21 GPa 0.118
32 × 32 × 32 26 % 1.10 GPa 0.141
64 × 64 × 64 19 % 0.58 GPa 0.158
Table 3. Solid skeleton with incompressible fluid filling: RVE of 16 × 16 × 16 voxels Solid phase Es νs 14.7 GPa 0.325 14.7 GPa 0.325 14.7 GPa 0.325 14.7 GPa 0.325 14.7 GPa 0.325
Fluid phase Ef νf 1.38 × 108 Pa 0.49 1.38 × 107 Pa 0.499 1.38 × 106 Pa 0.4999 1.38 × 105 Pa 0.49999 1.38 × 104 Pa 0.499999
Homogenized Ehom νhom 5.05 GPa 0.299 4.82 GPa 0.307 4.79 GPa 0.309 4.79 GPa 0.309 4.79 GPa 0.309
Table 4. Solid skeleton with incompressible fluid filling: RVE of 32 × 32 × 32 voxels Solid phase Es νs 14.7 GPa 0.325 14.7 GPa 0.325 14.7 GPa 0.325 14.7 GPa 0.325 14.7 GPa 0.325
Fluid phase Ef νf 1.38 × 108 Pa 0.49 1.38 × 107 Pa 0.499 1.38 × 106 Pa 0.4999 1.38 × 105 Pa 0.49999 1.38 × 104 Pa 0.499999
Homogenized Ehom νhom 2.24 GPa 0.376 1.96 GPa 0.391 1.88 GPa 0.396 1.86 GPa 0.396 1.86 GPa 0.396
Table 5. Solid skeleton with incompressible fluid filling: RVE of 64 × 64 × 64 voxels Solid phase Es νs 14.7 GPa 0.325 14.7 GPa 0.325 14.7 GPa 0.325 14.7 GPa 0.325 14.7 GPa 0.325
Fluid phase Ef νf 1.38 × 108 Pa 0.49 1.38 × 107 Pa 0.499 1.38 × 106 Pa 0.4999 1.38 × 105 Pa 0.49999 1.38 × 104 Pa 0.499999
Homogenized Ehom νhom 1.54 GPa 0.408 1.19 GPa 0.428 1.07 GPa 0.435 1.05 GPa 0.436 1.05 GPa 0.436
known. In the presented tests, the Poisson ratio is νf varied between 0.49 and 0.5 − 10−6 , and respectively Ef = 3Kf (1 − 2νf ). Tables 3, 4, and 5 contain the homogenized material coefficients of three bone specimens with different mesh sizes. The case when the fluid phase is neglected is presented in Table 2, including the percentage of the solid phase of the considered three RVEs. This is the starting point of our comparative analysis. The last three tables clearly demonstrate the significant contribution of the fluid phase to the homogenized parameters (Ehom , νhom ). The numerical stability with respect to νf → 0.5 is also well expressed. High accuracy of the homogenization procedure is achieved for νf ∈ {0, 4999, 0.499999}.
Numerical Homogenization of Bone Microstructure
147
The following next steps in the numerical homogenization of bone microstructures are planned: a) anisotropic macro model; b) poroelasticity micro model. The goal is to improve further (quantitatively and qualitatively) our understanding of the underlying phenomena.
Acknowledgments This work is partly supported by the Bulgarian NSF Grants DO02-115/08 and DO02-147/08. We kindly acknowledge also the support of the Bulgarian Supercomputing Center for the access to the supercomputer IBM Blue Gene/P.
References 1. Axelsson, O.: Iterative solution methods. Cambridge University Press, Cambridge (1994) 2. Axelsson, O., Gustafsson, I.: Iterative methods for the Navier equations of elasticity. Comp. Meth. Appl. Mech. Engin. 15, 241–258 (1978) 3. Brenner, S., Sung, L.: Nonconforming finite element methods for the equations of linear elasticity. Math. Comp. 57, 529–550 (1991) 4. Cowin, S.: Bone poroelasticity. J. Biomechanics 32, 217–238 (1999) 5. Lawrence Livermore National Laborary, Scalable Linear Solvers Project, https://computation.llnl.gov/casc/linear_solvers/sls_hypre.html 6. Kohut, R.: The determination of Young modulus E and Poisson number ν of geocomposite material using mathematical modelling, private communication (2008) 7. Kolev, T., Margenov, S.: Two-level preconditioning of pure displacement nonconforming FEM systems. Numer. Lin. Algeb. Appl. 6, 533–555 (1999) 8. Margenov, S., Vutov, Y.: Preconditioning of Voxel FEM Elliptic Systems. TASK, Quaterly 11(1-2), 117–128 (2007) 9. Vertebral Body Data Set ESA29-99-L3, http://bone3d.zib.de/data/2005/ESA29-99-L3 10. Wirth, A., Flaig, C., Mueller, T., M¨ uller, R., Arbenz, P., van Lenthe, G.H.: Fast Smooth-surface micro-Finite Element Analysis of large-scale bone models. J. Biomechanics 41 (2008), doi:10.1016/S0021-9290(08)70100-3
Multiscale Modeling and Simulation of Fluid Flows in Highly Deformable Porous Media P. Popov1, Y. Efendiev2 , and Y. Gorb2 1
Institute for Parallel Processing, Bulgarian Academy of Sciences, Sofia, Bulgaria
[email protected] 2 Dept. of Mathematics, Texas A&M University, College Station, TX 77843 {efendiev,gorb}@math.tamu.edu
Abstract. In this work a new class of methods for upscaling FluidStructure Interaction (FSI) problems from the pore-level to a macroscale is proposed. A fully coupled FSI problem for Stokes fluid and an elastic solid is considered at the pore-level. The solid, due to coupling with the fluid, material nonlinearities, and macroscopic boundary conditions, can deform enough so that the pore-space is altered significantly. As a result, macroscopic properties such as the permeability of the porous media become nonlinearly dependent on the fine-scale displacements. Therefore, classical upscaled models, such as Biot’s equations, can no longer be applied. We propose a series of numerical upscaling models in the context of the Multiscale Finite Element Method (MsFEM) which couple this fine-scale FSI problem to a nonlinear elliptic equation for the averaged pressure and displacements at the coarse scale. The proposed MsFEM schemes correctly transfer the appropriate physics from the fine to the coarse scale. Several numerical examples which demonstrate the methods are also presented.
1
Introduction
In this work we consider a Multiscale Finite Element framework for modeling flows in highly deformable porous media. The physical processes under consideration span two length-scales. On the macroscopic level, one has fluid, diffusing through a nonlinear porous solid. At the microscale the solid has a complex pore geometry and interacts with a Stokes flow. We assume good scale separation, with the usual small parameter ε being the ratio of the fine to the coarse length scales. We denote the fine scale domain by Ωε0 , which contains two subdomains — a fluid part Fε0 and solid part Sε0 . The superscript 0 indicates the reference, or undeformed configuration of the body. The interface between the solid and fluid domains is denoted by Γε0 = ∂Fε0 ∩ ∂Sε0 . The physics is described by the strongly coupled, stationary fluid structure interaction problem [4]: Find the interface Γε , velocity vε , pressure pε and displacements uε such that: Γε = X + uε (X)| ∀ X ∈ Γε0 , I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 148–156, 2010. c Springer-Verlag Berlin Heidelberg 2010
(1)
Multiscale Modeling and Simulation of Fluid Flows
149
the Stokes and Elasticity equations are satisfied: −∇pε + μΔvε + f = 0,
∇ · vε = 0
in Fε
(2)
Sε0
(3)
on Γε0 .
(4)
in
−∇ · S(Eε ) = f with the interface condition −T
det (I + ∇uε ) (−pε I + 2μDε ) (I + ∇uε )
n0 = S(Eε )n0
Here X is the material and x the spatial coordinate, μ is the fluid viscosity, f is the body force. Further, S is the first Piola-Kirchhoff stress tensor in the solid. It depends, linearly or nonlinearly, on the strain E(u) = 12 ∇u(X) + ∇u(X)T . −T Recall, that given a CauchyT stress σ one has S = det(I + ∇uε )σ (I + ∇uε ) . 1 D(v) = 2 ∇v(x) + ∇v(x) is the rate of stretching tensor, and n0 is the reference normal to the interface. Note, that the interface condition (4) introduces a nonlinearity into the problem, regardless of the constitutive form for S.
2
Multilevel Algorithm
The combination of poroelasticity with large pore-level deformation is a complex problem. The main difficulty in upscaling it is when Fε deforms substantially at the fine-scale, breaking the assumptions underlying classic homogenized models such as Darcy and Biot [1,6,5,2]. This necessitates a two-scale numerical approach, such as MsFEM. For developing MsFEM, one needs to (1) identify relevant macroscopic parameters, (2) develop downscaling operators and (3) derive macroscopic variational formulation for poroelastic media. The proposed multiscale method follows the general MsFEM framework. In the present study, the coarse pressure p0 and displacement u0 are taken as the macroscopic variables. For these macroscopic variables, the downscaling operator E MsF EM is introduced which maps the macroscopic quantities to microscopic ones [3]. In particular we define the downscaling map ˜ε , u ˜ ε) , E MsF EM : (p0 , u0 ) → (˜ pε , v
(5)
where the subscript ε refers to the fine-scale quantities. Note that p˜ε = p˜ε (p0 , u0 ), ˜ε = v ˜ε (p0 , u0 ), u ˜ε = u ˜ ε (p0 , u0 ), where ˜· is used to differentiate the downscaled v quantities from the true solution. For simplicity of the presentation, we avoid using finite element discretizations of the variables. In our simulations, (p0 , u0 ) ˜ε , u ˜ ε ) are discretized are discretized on the coarse grid (mesh size H) and (˜ pε , v on the fine grid (mesh size h). The choice of macroscopic variables is motivated by using formal asymptotic expansion of the solid and fluid subproblems. Following a standard scaling argument [5], the Stokes system (2) is expanded in Fε : vε (x) = ε2 v0 (x, y) + ε3 v1 (x, y) + ..., pε (x) = p0 (x) + εp1 (x, y) + ...
y ∈ YF .
150
P. Popov, Y. Efendiev, and Y. Gorb
˜ ε as the first-order correctors for the One can now introduce formally p˜ε and v pressure and velocity, respectively: p˜ε (x) = p0 (x) + εp1 (x, y),
˜ ε = ε2 v0 v
(6)
˜ ε (x) as approximation for the fine scale For the solid problem (3) one considers u displacement uε (x) defined as follows: ˜ ε (x) = u0 (x) + εu1 (x, y), u
y ∈ YS0 ,
(7)
where the zero order term u0 contains the smooth part of the displacement, while the Y − periodic part u1 (x, y) approximates the oscillatory part of the solution. The approximation (7) also defines a fine-scale fluid domain F˜ε : ˜ ε (X)| ∀ X ∈ Γ 0 Γ˜ε = X + u (8) ˜ ε ) are defined. Together Γ˜ε , p˜ε , v ˜ε , u ˜ ε define which is the domain where (˜ pε , v an oscillatory approximation to the solution of the FSI problem (1)-(4). To obtain an FSI cell problem in a representative element of volume Y = YF ∪ YS , with YF being the fluid part, YS the solid, one uses the expansion for the Stokes equation to obtain the classical cell problem [5,6]: −∇y p1 + μΔy v0 + f − ∇x p0 = 0
∇y · v0 = 0
in YF˜ ,
(9)
which in our case is defined in an a-priori unknown unit sized fluid domain YF˜ . As usual the subscript y indicates that the respective differential operators are taken with respect to the fast variable y in a unit sized domain [5]. The above cell problem is subject to periodic boundary conditions for the velocity v0 and pressure p1 on ∂Y and no-slip condition for v0 on the interface Γ˜ε . Similarly, nonlinear homogenization of the solid part (3) results in the cell problem [3]: −∇y · S (Ex (u0 ) + Ey (u1 )) = 0
in YS0 ,
(10)
which, in general, is nonlinear. The main difficulty comes from interface condition ˜ε , u ˜ ε it reads: (4). If one substitutes in (4) the approximations p˜ε , v −T
det (I + ∇˜ uε ) [−˜ pε I + 2μD (˜ vε )] (I + ∇˜ uε )
n0 = S (E (˜ uε )) n0
on εYΓ0 (11)
As before, n0 is the unit outer normal to the surface YΓ0 . Note, that this condition is nonlinear and cannot be separated into different powers of ε. If one wishes to write it in a unit sized domain, the fluid and solid stresses have to be rescaled in the same way. Thus, one introduces Uε (y) := ε−1 uε (x). which preserves the strain, that is: Ey (Uε (y)) = Ex (uε (x)) ,
(12)
Multiscale Modeling and Simulation of Fluid Flows
151
Moreover, by defining Sy (·) := ε−1 S (·), interface condition (11) can now be rescaled, keeping the correct stress scaling on both sides: −T ˜ ε [−p0 I + ε (−p1 I + 2μDy (v0 ))] I + ∇y U ˜ε det I + ∇y U n0 = ˜ ε n0 on YΓ Sy Ey U (13) 0 Observe that the left side of the last equation is the fluid traction at the rescaled interface YΓ0 , which we denote by ˜sε . The FSI cell problem consists of (8)-(11) and is nonlinear, even if the constitutive response of the material is linear. Note also that (7) does not put restrictions on the presence of finer scales in u1 , for example ε2 . The reason for this is again the interface condition, which cannot be split into powers of ε. Thus it is not possible to determine which orders in the displacement will be influenced by the ˜ fε = p˜ε I + 2μD (˜ vε ) which approximates σ fε up to first order in ε, fluid stress σ cf. the left side of (11). The macroscopic equations which correspond to (9) and (10) are (cf. [7]): ∇ · (K (p0 , u0 ) ∇p0 ) = f ,
(14)
∇ · S0 (E (u0 ) , p0 ) = f .
(15)
The first is macroscopic balance of mass, with K being the standard Darcy permeability tensor for the cell geometry YF˜ , for details see [6]. Since p0 and u0 drive the cell FSI problem, they determine YF˜ and therefore K = K (p0 , u0 ). Similarly, the macroscopic homogenized stress S0 is given by (cf. [3]): S0 := S (Ex (u0 ) + Ey (u1 )) ,
(16)
where · is the Y − averaging operator. In the general case, the local problem (10) is nonlinear, so u1 depends on u0 and therefore S0 is an implicit function of Ex (u0 ). Since closed form expressions for K and S0 cannot be obtained for general geometries one can instead linearize (14) and (15), for example as ∇ · (Kn (x)∇pn+1 )=f 0
∇ · (Ln (x)∇un+1 ) = f. 0
(17)
In the case of linearized elastic solids, L = L (p0 , u0 ) is simply the homogenized elasticity moduli, which similarly to K depends on the macroscopic variables. In the case of nonlinear elasticity, one can use any appropriate linearization of S0 around (p0 , u0 ). This suggests the following two-level MsFEM iteration: Algorithm 1. Fully coupled FSI model: Initialize p0 and u0 . For n = 1, 2, . . . until convergence, do (n)
(n)
(n)
(n)
(n)
˜ε 1. Given p0 and u0 solve for v0 , p1 , u (n) (8), (9), (10) and (13) and define YΓ˜ . (n)
which satisfy the cell problem
2. Based on YΓ˜ , find the permeability K(n) and elasticity tensor L(n) . 3. Based on the macroscopic linearization (17) find a new upscaled pressure (n+1) (n+1) and upscaled displacement u0 . p0
152
P. Popov, Y. Efendiev, and Y. Gorb
The key to this algorithm is the ability to solve the fine-scale FSI problem numerically (step 1). The numerical algorithm for that has been presented elsewhere [4]. It is based on successive solutions to fluid and solid subproblems: one starts with a guess for the fluid domain, solves the Stokes equation there, and computes the fluid stress on the interface. This stress is then used to solve an elasticity problem, whose solution is then used to deform the fluid domain. The process is repeated until convergence. Given this scheme, one can further consider a variant of the above two-level algorithm, where in step 1, the FSI cell problem is not solved exactly, but only a few iterations are done, the extreme case being a single one. This can be summarized in the following algorithm: Algorithm 2. Two-scale linearized FSI model: Initialize p0 and u0 . For n = 1, 2, . . . until convergence, do: (n)
˜ ε which also defines the cell geometry 1. Given the downscaled displacement u (n) (n+1) (n) YF˜ , compute K and solve the first equation in (17) to obtain p˜0 . (n+1)
2. Based on p˜0
(n+1)
solve (9) for p˜ε
3. Compute the fluid traction (n) (n+1) (n+1) ˜ε and v . YF˜ , p˜ε (n)
(n+1) ˜sε
(n+1)
˜ε and v
(n)
using YF˜
as fluid domain.
(cf. the left hand side of (13)) based on
(n+1)
˜ 0 and ˜sε 4. Based on u compute a new linearized moduli L(n) . Then solve (n+1) ˜0 . the second equation in (17) to obtain u (n+1) (n+1) ˜0 and u , solve (10), (13) for the downscaled displacement 5. Using ˜sε (n+1) (n+1) (n+1) ˜ε u , which defines a new interface Yi,Γ˜ and fluid domain YF˜ at the (n+1) (n+1) and fluid domain F˜ε . cell level and therefore downscaled interface Γ˜ε In practice, either of these proposed algorithms are realized as follows. First, a macroscopic discretization of (17) is constructed, for example by the Finite Element Method (FEM). The macroscopic problems are constructed at a number of macroscopic locations which provide a good interpolation of K(n) and L(n) , typically the integration points associated with the macroscopic discretization. The computation of the cell problems is done at each iteration of Algorithm 1 or 2. Note that these problems require only the transfer of macroscopic information, e.g. p0 and u0 , so the computation is very easy to parallelize. An important simplification of Algorithms 1 and 2 can be obtained if the solid is not connected, e.g. one has flow past obstacles. This allows to eliminate the macroscopic momentum balance from (15) and from the linearization (17). While this situation is not very common in practice it provides a rich model problem, both in terms of numerical experiments and rigorous analysis. The analysis of both Algorithms 1 and 2, that is convergence of the iterative process, error bounds in terms of ε and mesh parameters is a complex task. A partial analysis for the simplified case of flows past deformable obstacles will be presented in an upcoming publication.
Multiscale Modeling and Simulation of Fluid Flows
153
1
Fluid 0.8
Rigid support 0.6
0.4
Elastic Solid
0.2
0 0
(a) Unit Cell
0.2
0.4
0.6
0.8
1
(b) Macroscopic domain
Fig. 1. Unit cell (a) and macroscopic domain (b) used in computations
3
Numerical Examples
Several numerical examples were considered in order to test the proposed algorithms. The basic fine-scale domain is a 2D periodic arrangement of elastic obstacles (Figure 1). The unit cell (Figure 1(a)) consists of circular linear elastic material, surrounded by fluid. The elastic material under consideration is isotropic with Young’s modulus E = 1.44 and Poisson’s ration ν = 0.1. It is supported rigidly in the center. The unit cell is arranged periodically to form the macroscopic domain. A series of macroscopic domains with ε−1 = 4, 8, 16, 32 were considered. Shown in Figure 1(b) is the fine-scale domain with ε−1 = 16. The convergence of the nonlinear iterative processes involved in our two-level algorithms is demonstrated first. This is done by considering a set of boundary value problems (BVPs) in which a uniform pressure Pl is applied at the left side of the macroscopic domain. The pressure at the right side is 0 and no-flow boundary conditions are applied at the top and bottom sides of the domain. Both MsFEM algorithms took 6 iterations to converge for Pl = 0.1 and 8 with Pl = 0.2, uniformly with respect to ε or the macroscopic mesh size H. Also, they proved insensitive to the number of iterations performed on the cell FSI problem (step 1 of Algorithm 1), including the extreme case of a single one. Secondly, thanks to the simple geometry and boundary conditions, it was possible to obtain fine-scale solutions for the same set of BVPs via a direct numerical simulation (DNS). A typical finescale DNS solution and a first-order corrected MsFEM solution are shown in Figure 2 for the case ε = 1/8 and Pl = 0.1. The results demonstrate convergence with respect to ε of the fine-scale approximations obtained via our two-scale algorithm and the DNS results (Table 1). A number of more complicated two-dimensional flows were also computed. Shown in Figure 3 is an example of corner point flow, driven by a pressure difference of 0.2. Again, as in the previous examples both algorithms 1 and 2 converged to the same solution. Unlike the previous example, however, quadratic finite elements were used for the macroscopic pressure. The permeability field was evaluated at the nodal points for the macroscopic pressure and was also interpolated by piecewise quadratic polynomials in each element. The number
154
P. Popov, Y. Efendiev, and Y. Gorb
(a) Fine-scale solution uε
˜ε (b) First order corrector u
˜ε (c) Error, uε − u Fig. 2. Displacements in a typical MsFEM model problem. The macroscopic domain has 8 × 8 unit cells, and due to periodicity in the y direction, only one horizontal row of unit cells is shown. The exact fine scale displacements (a) can be compared with the first order corrector (b). The difference between the two is shown in (c). P=0
1
0.8
0.6
0.4
0.2
P = 0.1 0 0
0.2
0.4
0.6
0.8
1
(a) BCs and discretization
(b) Permeability, K11
(c) Coarse pressure, p0
Fig. 3. Corner flow driven by a pressure difference
1
0.8
P = −0.1 0.6
P = 0.1 0.4
0.2
0 0
0.2
0.4
0.6
0.8
1
(a) BCs and discretization
(b) Permeability, K11
(c) Coarse pressure, p0
Fig. 4. Flow driven by a pressure difference between two parallel plates
Multiscale Modeling and Simulation of Fluid Flows
155
Table 1. Error in the fine-scale displacements when compared to DNS results 1/4 1/8 1/16 1/32
Iterations 6 6 6 6
L∞ Error L∞ 1.23 × 10−3 3.18 × 10−4 8.07 × 10−5 2.03 × 10−5
1/4 1/8 1/16 1/32
Iterations 8 8 8 8
L∞ Error L∞ 2.96 × 10−3 7.94 × 10−4 2.06 × 10−4 5.25 × 10−5
Pl = 0.1 Rel. Error 0.18 0.10 0.053 0.027 Pl = 0.2 Rel. Error 0.22 0.126 0.068 0.035
L2 Error L2 Rel. Error 2.48 × 10−4 0.23 4.39 × 10−5 0.13 7.75 × 10−6 0.069 1.37 × 10−6 0.0351 L2 Error L2 4.93 × 10−4 8.78 × 10−5 1.56 × 10−5 2.75 × 10−6
Rel. Error 0.22 0.127 0.067 0.034
of macroscopic iterations for both algorithms was 6. The results indicate a permeability field (Figure 3(b)) that is strongly correlated with the macroscopic pressure. It is highest where the pressure is highest (lower left-corner) and lowest, where the pressure is lowest (upper-left corner). In another example we consider a 2D section of flow generated by pressure difference between to parallel plates (Figure 4). A unit domain with the same microstructure as in the previous examples is considered. A pressure difference of 0.2 is again applied at the two interior plates (Figure 4(a)) and no-flow boundary conditions are specified at the outer boundary. The computation shown in Figure 4 was performed with ε = 1/8 and both algorithms took 9 iterations to converge. As can be seen from the figure, the permeability plot is strongly correlated to the macroscopic pressure field. The highest (K11 = 7.86 × 10−6 ) is observed at the left plate, where the pressure is highest and the lowest value (K11 = 2.38 × 10−6 ) is observed at the right plate, where the pressure is lowest. Unlike the previous examples where the pressure was always positive, here negative values of the pressure leads to expansion of the fine-scale inclusion (cf. Figure 1(a)). This in turn decreases the permeability below the value for rigid obstacle. These two examples further indicate that the proposed algorithms are robust and can solve effectively multiscale FSI problems.
Acknowledgments This work was partially funded by the European Commission, grant FP7PEOPLE-2007-4-3-IRG-230919 as well as the US National Science Foundation, grant NSF-DMS-0811180. The authors would like to specially thank Dr. Steve Johnson for his support in using the Calclab cluster at the Dept. of Mathematics, Texas A&M University. Last but not least, we are particularly indebted to Dr. Richard Ewing for the generous funding and support that he provided through the Institute for Scientific Computing during the initial stage of this research.
156
P. Popov, Y. Efendiev, and Y. Gorb
References 1. Biot, M.A.: General theory of three dimensional consolidation. J. Appl. Phys. 12, 155–164 (1941) 2. Burridge, R., Keller, J.B.: Poroelasticity equations derived from microstructure. Journal of the Acoustical Society of America 70, 1140–1146 (1981) 3. Efendiev, Y., Pankov, A.: Numerical homogenization of nonlinear random parabolic operators. Multiscale Modeling and Simulation 2(2), 237–268 (2004) 4. Iliev, O., Mikelic, A.: P Popov. On upscaling certain flows in deformable porous media. SIAM Multiscale Modeling and Simulation 7, 93–123 (2008) 5. Sanchez-Palencia, E.: Non-Homogeneous Media and Vibration Theory. Lecture Notes in Physics, vol. 127. Springer, Berlin (1980) 6. Sanchez-Palencia, E., Ene, H.I.: Equations et ph´enom`enes de surface pour l’´ecoulement dans un mod`ele de milieu poreux. Journal de M´ecanique 14, 73–108 (1975) 7. Zhikov, V.V., Kozlov, S.M., Oleinik, O.A.: Homogenization of Differential Operators and Integral Functionals. Springer, Berlin (1994)
Assimilation of Chemical Ground Measurements in Air Quality Modeling Gabriele Candiani, Claudio Carnevale, Enrico Pisoni, and Marialuisa Volta Department of Electronics for Automation University of Brescia Viale Branze, 38 – 25123 – Brescia, Italy
[email protected]
Abstract. Regional authorities need tools helping them to understand the spatial distribution of an air pollutant over a certain domain. Models can provide spatially consistent air quality data, as they consider all the main physical and chemical processes governing the atmosphere. However, uncertainties in formalization and input data (emission fields, initial and boundary conditions, meteorological patterns) can affect simulation results. On the other hand, measurements, which are considered more accurate, have a limited spatial coverage. The integration of monitoring data and model simulations can potentially increase the accuracy of the resulting spatial fields. This integration can be performed using either deterministic techniques or algorithms which allow for uncertainties in both modeled and observed data, to obtain the best physical assessment of concentration fields. In this study ozone daily fields, simulated by the Transport Chemical Aerosol Model (TCAM), are reanalyzed from April to September 2004. The study area covers all the Northern Italy, divided in 64x41 cells with a spatial resolution of 10x10 km2 . Ozone data measured by ground stations are assimilated using Inverse Distance Weighting (IDW), kriging, cokriging and Optimal Interpolation (OI) in order to obtain reanalyzed ozone fields. Results show that these methods highly improve the accuracy of the retrieved ozone maps.
1
Introduction
An accurate description of the air quality state is a very challenging task due to the complexity and non-linearity of the mechanisms which take place in atmosphere. This is especially true when dealing with secondary pollutants, such as ozone. In order to describe the distribution of an air pollutant over a certain area, regional authorities can use either the data from a monitoring station network or the results of simulations performed by a deterministic modeling system. Both of these approaches have advantages and drawbacks. Monitoring stations are considered to be more accurate than models but, since they lack in spatial coverage, the accuracy of the resulting pollutant field is limited by the number and the position of the stations. On the other hand, multiphase chemical and transport models can describe the pollutant distribution with a good spatial resolution. I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 157–164, 2010. c Springer-Verlag Berlin Heidelberg 2010
158
G. Candiani et al.
However, because of the complexity of these models and the need for detailed meteorological and emission inputs, model results are affected by uncertainties [6]. The integration between models and observations can therefore have the potential to decrease the uncertainties in the resulting pollutant fields [7]. In this study, ozone daily fields simulated by the Transport Chemical Aerosol Model (TCAM) are reanalyzed off-line, assimilating ozone ground measurements. The paper is structured as follows: section 2 presents the TCAM features and the theoretical formalization of the assimilation methodologies; section 3 shows the experiment setup; in section 4 the results of the reanalysis are presented and commented for the four techniques.
2
Methodology
The objective of the assimilation is to estimate as accurately as possible the state of atmosphere at a given time. The reanalysis is a particular case of assimilation which can be formalized as [1] xa = xb + δx
(1)
where: – xa is the analysis field; – xb is the model field (also called background or first guess or a priori field); – δx is the analysis increment. The increment δx is the product of a weight matrix K and the differences between the observations y and the background at the monitoring station locations: δx = K(y − H(xb )) = Kψ
(2)
where H(x) is an operator that maps the model state space x into the observed state space y, and ψ is often called residual vector. In literature, reanalysis algorithms differ in the way they compute the matrix K. In this study, Inverse Distance Weighting (IDW) [7], kriging [5], cokriging [5] and Optimal Interpolation (OI) [1] are used to estimate δx over the domain grid points. 2.1
Background Fields
The background fields are provided by the simulations of TCAM [2], a 3-D multiphase Eulerian model. It is a part of the Gas Aerosol Modeling Evaluation System (GAMES) [15] which also includes the meteorological pre-processor PROMETEO and the emission processor POEM-PM [3]. TCAM solves time by time a PDE system which describes the horizontal/vertical transport, the multiphase chemical reactions and the gas to particle conversion phenomena using a splitting operator technique [12]. The horizontal transport is solved by a chapeau function approximation [13] and the non linear Forester filter [8], while the vertical transport PDE system is solved by an hybrid implicit/explicit scheme.
Assimilation of Chemical Ground Measurements in Air Quality Modeling
159
The gas chemistry is described by the COCOH97 scheme [16]. The ODE chemical kinetic system is solved by means of the Implicit-Explicit Hybrid (IEH) solver, which splits the species in fast and slow ones, according to their reaction rates. The system of fast species is solved by means of the implicit Livermore Solver for Ordinary Differential Equations (LSODE) [4] implementing an Adams predictor/corrector method in the non-stiff case, and the Backward Differentiation Formula method in the stiff case. The slow species system is solved by the Adams-Bashfort method [17]. The thermodynamic module is derived by SCAPE2 [11]. 2.2
Algorithms
Inverse Distance Weighting. One of the simplest technique to compute K = [ki,p ] is the IDW. The weights used in this algorithm are inverse functions of the distance di,p (di,q ) between the grid point i and the station location p (q), according to the equation K = [ki,p ] =
1 dβ i,p n 1 q=1 dβ
(3)
i,q
where n is the number of stations and β is the weighting power. Ordinary kriging. In ordinary kriging, weights K = [ki,p ] are based on the knowledge of the semivariogram, which describes the expected squared increment of values ψ between locations p and q. First, an empirical semivariogram γψ is constructed as: 1 ¯ = [ψ(p) − ψ(q)]2 (4) γψ (d) ¯ 2N (d) (p,q)
¯ is the number of station pairs (p, q) separated by d. ¯ Then, the where N (d) empirical semivariogram is fitted by a model semivariogram γˆψ (e.g. spherical, gaussian, exponential, ...), which is used to solve the system of n + 1 equations [10] n
ki,p γˆψ (dq,p ) + μ = γˆψ (dq,i )
q = 1, ..., n
(5)
p=1 n
ki,p = 1
(6)
p=1
where: – n is the number of stations; – dq,p and dq,i are the distances between a station point (p) and another station point (q) or a grid point i, respectively; – μ is the Lagrange multiplier which ensures the unbiasedness condition (6).
160
G. Candiani et al.
Ordinary Cokriging. Ordinary cokriging is an extension of ordinary kriging. It allows to use an auxiliary variable called the secondary variable, which is correlated to the primary variable and is usually more densely sampled. In this case, the increment δx (2) is not only a linear combination of primary variable ψ, but also a function of the secondary variables η weighted by the matrix L, according to the following equation: δx = K(y − H(xb )) + Lη = Kψ + Lη.
(7)
The weights K = [ki,p ] and L = [li,s ] are estimated on the base of the two semivariograms γψ and γη (estimated as in (4)) and the crossvariogram γψη , which describes the correlation between the two variables ψ and η according to ¯ = γψη (d)
1 (ψ(p) − ψ(q))(η(p) − η(q)) ¯ 2N (d)
(8)
(p,q)
¯ is the number of station pairs (p, q) separated by d. ¯ Model semivarwhere N (d) γψη ), are then used to compute iograms (ˆ γψ and γˆη ) and model crossvariogram (ˆ the set of weights solving the following n + m + 2 equations [10] n
ki,p γˆψ (dp,q ) +
p=1 n p=1
m
li,s γˆψη (ds,q ) s=1 m
ki,p γˆψη (dp,t ) +
+ μ1 = γˆψ (di,q )
li,s γˆη (ds,t ) + μ2 = γˆψη (di,t )
q = 1, ..., n t = 1, ..., m
(9)
(10)
s=1 n
ki,p = 1
p=1 m
li,s = 0
(11)
(12)
s=1
where: – n and m are respectively the number of data for primary and secondary variable; – dp,q and dp,t are the distances between a point (p) of the primary variable and another point (q) of the primary variable or a point (t) of the secondary variable, respectively; – ds,q and ds,t are the distances between a point (s) of the secondary variable and a point of the primary variable (q) and another point of the secondary variable (t), respectively; – di,q and di,t are the distances between a grid point i and a point of the primary variable q or the secondary variable t, respectively; – μ1 and μ2 are the Lagrange parameters which ensure the unbiasedness condition (11) and (12).
Assimilation of Chemical Ground Measurements in Air Quality Modeling
161
Optimal Interpolation. In the Optimal Interpolation algorithm [1], the weight matrix K (also called gain matrix) is calculated as K = BH T (HBH T + R)−1
(13)
where B and R are the covariance matrices for background and observation errors respectively. The difference from the algorithms presented above is that now the weight matrix K takes into account the errors associated to the model and the observations. The B matrix is of utmost importance as it weights the model error against the competing observation error and how the information is distributed over the domain. In this work, B is estimated using the Gaussian exponential function B = [bi,j ] = exp(−
d2i,j )v = D(i, j)v, 2L2h
(14)
where di,j is the distance between the grid cells i and j, Lh is a parameter defining the decay of covariance with respect to the horizontal distance, and v is the model error variance estimate, computed on the basis of previous simulation results. This approximation states that the model error variance is constant for each cell of the domain, while the error covariance between 2 grid points (i and j) is a function of the horizontal distance between them. The R matrix can be considered diagonal when the measurements performed by the monitoring stations are independent. Moreover, if the same type of instruments are used for the measurements, it could be possible to assume that all the monitoring stations have the same error variance r, thus R could be rewritten as R = rI. Under these assumptions eq. 13 can be written as r K = D(i, j)vH T (v(HD(i, j)H T + R))−1 v = D(i, j)H T (HD(i, j)H T + σR)−1
(15)
where the only degree of freedom is the ratio between the observation and model error variances σ.
3
Experiment Setup
In this study, the maps of the maximum of 8 hour mean (MAX8H) ozone concentration, simulated by TCAM, are reanalyzed with the assimilation of ground measurement data. The study area covers all the Northern Italy (Fig. 1), divided in 41x64 cells (10x10 km2 resolution). Northern Italy is a densely inhabited and industrialized area, where high anthropogenic emissions, frequent stagnating meteorological conditions and Mediterranean solar radiation regularly cause high ozone levels, in summer months. The reanalysis covers the period between April and September 2004. The background fields are provided by means of TCAM simulations. The emission fields were estimated by POEM-PM pre-processor starting from the
162
G. Candiani et al.
Fig. 1. Study area
regional emission inventory collected by ISPRA (Italian Agency for Environmental Protection and Research), while the meteorological fields are computed by means of MM5 model [9]. The boundary conditions are computed starting from continental scale simulation of CHIMERE model [14]. The observation data are represented by 134 stations, collected from the regional monitoring networks for Italy, and from AirBase database for France and Switzerland. The 75% of the stations (crosses in Figure 1) are used to estimate the daily analysis fields, while the remaining stations (stars in Figure 1) are used for the validation. The parameters used in the algorithms are chosen on the basis of a previously sensitivity analysis. Thus, β in IDW is equal to 2, secondary variable η used in cokriging is the altitude, while Lh and σ in the OI are equal to 50000 m and 0.5, respectively.
4
Results and Discussions
The validation of the reanalyzed MAX8H ozone fields is performed comparing the results of the four techniques with the values observed at the monitoring stations chosen for the validation. The same comparison is also performed for the non-reanalyzed TCAM fields, in order to better appreciate the improvements due to the reanalysis. Figure 2 shows the results of validation performances in terms of mean error, root mean square error, and correlation. Looking at the mean error results (Fig. 2(a)), even if the median of TCAM is the closest to zero, the analysis algorithms present a low variability, suggesting a better accuracy of the estimates. Significant improvements can be seen for the
Assimilation of Chemical Ground Measurements in Air Quality Modeling
163
35 Root Mean Square Error
Mean Error
20 10 0 −10
25 20 15 10
−20 background IDW
30
kriging cokriging
background IDW
OI
(a)
kriging cokriging
OI
(b)
0.9
Correlation
0.8 0.7 0.6 0.5 0.4 0.3 background IDW
kriging cokriging
OI
(c) Fig. 2. Mean error (a), Root Mean Square Error (b), and Correlation (c) for TCAM, IDW, kriging, cokriging, and OI
RMSE index (Fig. 2(b)), where the TCAM value of 27-28 μgm−3 becomes less than 20 μgm−3 for IDW and cokriging and close to 15 μgm−3 for kriging and OI. The improvement is confirmed also by the results for the correlation coefficients (Fig. 2(c)), where the 0.6 value of TCAM increases to 0.85 in IDW, kriging and OI. The improvement seen in cokriging is less marked than the other techniques, but is still close to 0.8. Overall, the implemented methodologies ensure a large improvement in the accuracy of the retrieved ozone concentration fields.
Acknowledgments The research has been developed in the framework of the Pilot Project QUITSAT (Contract I/035/06/0 — http://www.quitsat.it), sponsored and funded by the Italian Space Agency (ASI).
References 1. Bouttier, F., Courtier, P.: Data assimilation concepts and methods. In: Meteorological Training Course Lecture Series (2002) 2. Carnevale, C., Decanini, E., Volta, M.: Design and validation of a multiphase 3D model to simulate tropospheric pollution. Science of the Total Environment 390, 166–176 (2008)
164
G. Candiani et al.
3. Carnevale, C., Gabusi, V., Volta, M.: POEMPM: an emission model for secondary pollution control scenarios. Environmental Modeling and Software 21, 320–329 (2006) 4. Chock, D., Winkler, S., Sun, P.: A comparison of stiff chemistry solvers for air quality modeling. In: Proceedings of Air and waste management association 87th annual meeting (1994) 5. Cressie, N.A.C.: Statistics for spatial data. Wiley, Chichester (1991) 6. Cuvelier, C., et al.: CityDelta: A model intercomparison study to explore the impact of emission reductions in European cities in 2010. Atmospheric Environment 41, 189–207 (2007) 7. Denby, B., Hor´ alek, J., Walker, S.E., Eben, K., Fiala, J.: Interpolation and assimilation methods for Europe scale air quality assessment and mapping. Part I: review and recommendations. Technical report, ETC/ACC Technical Paper 2005/7 (2005) 8. Forester, C.: Higher order monotonic convection difference schemes. Journal of Computational Physics 23, 1–22 (1977) 9. Grell, G., Dudhia, J., Stauffer, D.: A description of the Fifth-generation Penn State/NCAR Mesoscale Model (MM5). Technical report, NCAR Tech Note TN398 + STR, 122 p. (1994) 10. Isaaks, E.H., Srivastava, R.M.: Applied Geostatistics. Oxford University Press, Oxford (1989) 11. Kim, Y., Seinfeld, J., Saxena, P.: Atmospheric gas aerosol equilibrium I: thermodynamic model. Aerosol Science and Technologies 19, 157–187 (1993) 12. Marchuk, G.: Methods of Numerical Mathematics. Springer, Heidelberg (1975) 13. Pepper, D.W., Kern, C.D., Long Jr., P.E.: Modeling the dispersion of atmospheric pollution using cubic splines and chapeau functions. Atmospheric Environment 13, 223–237 (1979) 14. Schmidt, H., Derognat, C., Vautard, R., Beekmann, M.: A comparison of simulated and observed ozone mixing ratios for the summer of 1998 in Western Europe. Atmospheric Environment (2001) 15. Volta, M., Finzi, G.: GAMES, a comprehensive Gas Aerosol Modeling Evaluation System. Environmental Modelling and Software 21, 587–594 (2006) 16. Wexler, A., Seinfeld, J.: Second-generation inorganic aerosol model. Atmospheric Environment 25, 2731–2748 (1991) 17. Wille, D.: New stepsize estimators for linear multi step methods. In: Proceedings of MCCM (1994)
Numerical Simulations with Data Assimilation Using an Adaptive POD Procedure Gabriel Dimitriu1 , Narcisa Apreutesei2 , and R˘ azvan S ¸ tef˘anescu1 1
2
“Gr. T. Popa” University of Medicine and Pharmacy, Department of Mathematics and Informatics 700115 Ia¸si, Romania
[email protected],
[email protected] “Gh. Asachi” Technical University, Department of Mathematics 700506 Ia¸si, Romania
[email protected]
Abstract. In this study the proper orthogonal decomposition (POD) methodology to model reduction is applied to construct a reduced-order control space for simple advection-diffusion equations. Several 4D-Var data assimilation experiments associated with these models are carried out in the reduced control space. Emphasis is laid on the performance evaluation of an adaptive POD procedure, with respect to the solution obtained with the classical 4D-Var (full model), and POD 4D-Var data assimilation. Despite some perturbation factors characterizing the model dynamics, the adaptive POD scheme presents better numerical robustness compared to the other methods, and provides accurate results.
1
Introduction
Data assimilation represents a methodology to combine the results of a largescale numerical model with the measurements available to obtain an optimal reconstruction of the state of the system. The four-dimensional variational data assimilation (4D-Var) method has been a very successful technique used in operational numerical weather prediction at many weather forecast centers ([6,8,9,10]). Proper orthogonal decomposition technique has been used to obtain low dimensional dynamical models of many applications in engineering and science ([1,2,3,4,5,7]). Basically, the idea starts with an ensemple of data, called snapshots, collected from an experiment or a numerical procedure of a physical system. The POD technique is then used to produce a set of basis functions which spans the snapshots collection. The goal of the approach is to represent the ensemble of data in terms of an optimal coordinate system. That is, the snapshots can be generated by a smallest possible set of basis functions. The drawback of the POD 4D-Var consists of the fact that the optimal solution can only be sought within the space spanned by the POD basis of background fields. When observations lay outside of the POD space, the POD 4D-Var solution may fail to fit observations sufficiently. The above limitation of the POD 4D-Var can be improved by implementing an adaptive POD 4D-Var scheme. In I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 165–172, 2010. c Springer-Verlag Berlin Heidelberg 2010
166
G. Dimitriu, N. Apreutesei, and R. S ¸ tef˘ anescu
this study an adaptive proper orthogonal decomposition (A-POD) procedure is applied to set up a reduced-order control space for the one-dimensional diffusionadvection equations. The outline of this paper is as follows. Section 2 is dedicated to the numerical model under study and reviewing the 4D-Var assimilation procedure, together with POD 4D-Var method and its adaptive variant. Section 3 contains numerical results from data assimilation experiments using 4D-Var, POD 4D-Var and adaptive POD 4D-Var methods. The paper ends with some conclusions.
2
Numerical Model and Adaptive POD 4D-Var Assimilation Procedure
Our model under study is a one-dimensional diffusion-advection equation defined by the following partial differential equation: ∂2c ∂c ∂c =a 2 +b + f (x, t) , ∂t ∂x ∂x
(x, t) ∈ Ω × (0, T ),
c(x, 0) = x(1 − x) .
(1)
Here, the spatial domain Ω = [0, 1], and the coefficients a and b are positive constants. The function f is chosen to be f (x, t) = (−x2 + (1 + 2b)x + 2a − b)et . Then, the exact (analytical) solution of (1) is given by cexact (x, t) = x(1 − x)et . Details about the numerical implementation of a data assimilation algorithm for a similar model of (1) is presented in [10]. For a successful implementation of the POD 4D-Var in data assimilation problems, it is of most importance to construct an accurate POD reduced model. In what follows, we briefly present a description of this procedure (see [2,3,7]). For a temporal-spatial flow c(x, t), we denoted by c1 , . . . , cn a set adequately chosen n in a time interval [0, T ], that is ci = c(x, ti ). Defining the mean c = 1 i=1 ci , we expand c(x, t) as n cPOD (x, t) = c(x) +
M
βi (t)Φi (x) ,
(2)
i=1
where Φi (x) (the ith element of POD basis) and M are appropriately chosen to capture the dynamics of the flow as follows: n 1. Calculate the mean c = n1 i=1 ci ; 2. Set up the correlation matrix K = [kij ], where kij = Ω (ci − c)(cj − c) dx; 3. Compute the eigenvalues λ1 ≥ λ2 ≥ · · · ≥ λn ≥ 0 and the corresponding orthogonaleigenvectors v1 , v2 , . . . , vn of K; n 4. Set Φi := j=1 vji (ci − c) . Now, we introduce a relative information content to select a low-dimensional basis of size M n, by neglecting modes corresponding to the small eigenvalues. Thus, we define the index k λi I(k) = i=1 n i=1 λi
Numerical Simulations with Data Assimilation
167
and choose M , such that M = argmin{I(m) : I(m) ≥ γ}, where 0 ≤ γ ≤ 1 is the percentage of total information captured by the reduced space X M = span{Φ1 , Φ2 , . . . , ΦM }. The tolerance parameter γ must be chosen to be near the unity in order to capture most of the energy of the snapshot basis. The reduced order model is then obtained by expanding the solution as in (2). Generally, an atmospheric or oceanic model is usually governed by the following dynamic system dc = F (c, t) , dt
c(x, 0) = c0 (x) .
(3)
To obtain a reduced model of (3), we first solve (3) for a set of snapshots and follow above procedures, then use a Galerkin projection of the model equations onto the space X M spanned by the POD basis elements (replacing c in (3) by the expansion (2), then multiplying Φi and integrating over spatial domain Ω): dβi = dt
F
c+
M
βi Φ i , t , Φ i
,
βi (0) =< c(x, 0)−c(x), Φi (x) > . (4)
i=1
Equation (4) defines a reduced model of (3). In the following, we will analyze applying this model reduction to 4D-Var formulation. In this context, the forward model and the adjoint model for computing the cost function and its gradient represent the reduced model and its corresponding adjoint, respectively. At the assimilation time interval [0, T ], a prior estimate or ‘background estimate’, cb of the initial state c0 is assumed to be known and the initial random errors (c0 − cb ) are assumed to be Gaussian with covariance matrix B. The aim of the data asimilation is to minimize the square error between the model predictions and the observed system states, weighted by the inverse of the covariance matrices, over the assimilation interval. The initial state c0 is treated as the required control variable in the optimization process. Thus, the objective function associated with the data assimilation for (3) is expressed by J(c0 ) = (c0 − cb )T B−1 (c0 − cb ) + (Hc − yo )T R−1 (Hc − yo ) .
(5)
Here, H is an observation operator, and R is the observation error covariance matrix. In POD 4D-Var, we look for an optimal solution of (5) to minimize the cost function J(cM 0 ) = J(β1 (0), . . . , βM (0)) given by POD − cb )B−1 (cPOD − cb ) + (HcPOD − yo )R−1 (HcPOD − yo ) , (6) J(cM 0 ) = (c0 0
where cPOD is the control vector. 0 In (6), cPOD = cPOD (x, 0) and cPOD = cPOD (x, t) are expressed by 0 cPOD (x) = c(x) + 0
M i=1
βi (0)Φi (x),
cPOD (x) = c(x) +
M i=1
βi (t)Φi (x) .
168
G. Dimitriu, N. Apreutesei, and R. S ¸ tef˘ anescu
Therefore, in POD 4D-Var the control variables are β1 (0), . . . , βM (0). As explained later, the dimension of the POD reduced space could be much smaller than the original space. As a consequence, the forward model is the reduced model (4) which can be very efficiently solved. The adjoint model of (4) is then used to calculate the gradient of the cost function (6) and that will significantly reduce both the computational cost and the programming effort. The POD model in POD 4D-Var assimilation is established by construction of a set of snapshots, which is taken from the background trajectory, or integrate original model (3) with background initial conditions. The A-POD algorithm used in our numerical experiments is presented below: A-POD Algorithm: Step 1: Set k = 1, the iteration level for POD procedure, and the initial guess controls ck0 . Step 2: Set up the snapshots ensemble from the solution of the full forward model, with the controls ck0 . Step 3: Compute the POD bases (the number of POD bases is chosen to capture a prescribed energy level, γ, mentioned above). Step 4: Project the controls ck0 on the reduced space βk,iter (iter = 1). Step 5: Optimize the initial controls βk,iter (here, iter denotes the iteration of the optimization process, completely carried out on the reduced space). Step 6: (i) Check the value of the cost function (5). If |Jiter | < ε (where ε is the tolerance for the optimization), then STOP; (ii) If |Jiter | > ε and |Jiter − Jiter−1 | > 10−3 , then set iter = iter + 1 and go to Step 5; (iii) If |Jiter | > ε and |Jiter − Jiter−1 | < 10−3 , then update the POD bases: βk,iter
3
by projecting the optimization controls (a) Find the new controls ck+1 0 onto the original domain. (b) Set k = k + 1 and go to Step 2.
Numerical Results
This section is devoted to the numerical results of several data assimilation experiments carried out to examine the performance of A-POD 4D-Var procedure, by comparing it with the full 4D-Var and POD 4D-Var data assimilation. All the performed tests have used as ‘true’ (exact) solution c0 of the assimilation problem, that one computed from the analytical solution cexact of (1) given in Section 2. We set in our approach T = 1, and used 31 discretization points in space, and 69 points in time interval [0, 1]. By means of a perturbed initial condition we generated the observed state yo . We assumed that the correlation matrices in (5) and (6) are diagonal matrices, given by B = σb2 I and R = σo2 I, with I denoting the identity matrix of appropriate order. We set σb2 = 0.052 and σo2 = 0.12 , representing the variances for the background and observational errors, respectively.
12 snapshots
−0.5
log10(rel. energy)
log10(rel. energy)
Numerical Simulations with Data Assimilation
−1 −1.5 −2
169
18 snapshots
−0.5 −1 −1.5 −2 −2.5
−2.5 2
4
6
8
10
12
5
23 snapshots −1 −2 −3
5
10
15
Mode index
10
15
Mode index
20
log10(rel. energy)
log10(rel. energy)
Mode index
−5 35 snapshots −10 −15 10
20
30
Mode index
Fig. 1. Relative energy spectrum for POD modes, corresponding to different sets of snapshots for the system state
The numerical tests have been performed with the following values of the diffusion and advection parameters: a = 0.0001 and b = 1. The background value cb and the observation yo were obtained by perturbing the exact solution of the state equation. Figure 1 illustrates the decay of the eigenvalues, in case when the spatial correlation matrix, K, was calculated using 12, 18, 23, and 35 snapshots, respectively. The dimension of the POD reduced model depends on the number of basis functions. We found that only few basis functions are required to capture a high percentage from the dynamic of the system. For the presented cases, we can remark in Table 1, that more than 90% from the full variability characterizing the model dynamics can be captured with 4, 16, and 18 POD modes, respectively. Thus, choosing M = 16 modes and n = 36 snapshots, the captured energy represents 98.53% of the total energy, while when M = 18, the captured energy percentage is 99.16%. Table 1. The values of the index I(k) for different numbers of snapshots (n), and POD modes (M ) Number of Number of Index snapshots (n) modes (M ) I(M ) 12 4 91.74% 18 16 99.89% 36 16 98.53% 36 18 99.16%
170
G. Dimitriu, N. Apreutesei, and R. S ¸ tef˘ anescu
Table 2. The values of the RMS errors (RMSE) calculated for POD and A-POD procedures, corresponding to different numbers of snapshots and POD modes Number of Number of RMSE RMSE snapshots (n) modes (M ) for POD for A-POD 12 4 0.1193 0.0874 12 12 0.0730 0.0532 18 16 0.0684 0.0476 36 16 0.0640 0.0420 23 22 0.0444 0.0403 36 22 0.0427 0.0351
We performed several numerical tests by implementing the three distinct procedures for data assimilation problem (1), (5): the standard (full) 4D-var, the POD 4D-Var, and the adaptive POD (A-POD) scheme. In all cases, the performance of POD scheme with respect to A-POD was evaluated by computing RMS errors of the assimilation solution (see Table 2). The numerical solution of the optimal control problem was obtained using fminunc — the Matlab unconstrained minimization routine. Its algorithm is based on the BFGS quasi-Newton method with a mixed quadratic and cubic line search procedure. The decreasing
[Figure 2 consists of four panels, for n = 18 snapshots with M = 10 modes and n = 35 snapshots with M = 25 modes, plotting J/J0 and Gradient/Gradient0 against the number of iterations for 4D-Var, POD and A-POD.]
Fig. 2. The decrease of the cost function J and of the gradient ∂J/∂c0, for different values of snapshots and POD modes, compared for the 4D-Var method with the full order model and the reduced order model using the POD 4D-Var and adaptive POD 4D-Var techniques
[Figure 3 consists of two panels, for n = 23 snapshots with M = 22 modes and n = 36 snapshots with M = 22 modes, plotting the assimilated C0 against the number of space points for the exact solution, 4D-Var, POD and A-POD.]
Fig. 3. Comparative results on the assimilation of c0, maintaining the same number of POD modes (M = 22) and increasing the number of snapshots from n = 23 to n = 36

The decreasing values of the cost function J, as well as of its gradient ∂J/∂c0, normalized to their values at the first iteration, are presented in Figure 2. The comparative assimilation results associated with our experiments for certain selections of M and n are depicted in Figure 3. Obviously, one can conclude that better results are obtained when more POD modes and more snapshots are used. We also remark that the variance of the observational errors, chosen twice as large as the variance level of the background errors, caused a relatively high instability of the assimilation solution (even for the best case, where the reduced model was constructed with n = 36 snapshots and M = 22 modes; see Figure 3, bottom plot). A certain contribution to this oscillatory behaviour of the solution could also be attributed to the fact that advection strongly dominates diffusion in our model.
4 Conclusions
In this study, we applied to a simple diffusion-advection model both a standard 4D-Var assimilation scheme and a reduced order approach to 4D-Var assimilation using an adaptive POD procedure. The numerical results from several POD models (constructed with different numbers of snapshots and POD modes) are compared with those of the original model. Our numerical tests showed that the variability of the original model could be captured reasonably well by applying an adaptive POD scheme to a low dimensional system set up with 36 snapshots and 22 leading POD basis functions. At the same time, the available data should
be both representative and sufficiently accurate in order to ensure the desired improvements. In the case when the available data consist of measurements, some information about the uncertainties of the measurements would be very helpful for the assimilation procedure. Several issues, including more general models (containing distributed diffusion and advection parameters, and also nonlinear source terms, directly depending on state variable, c) are under investigation.
Acknowledgments The paper was supported by the project ID 342/2008, CNCSIS, Romania.
Game-Method Model for Field Fires

Nina Dobrinkova 1, Stefka Fidanova 2, and Krassimir Atanassov 3

1 Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, [email protected]
2 Institute for Parallel Processing, Bulgarian Academy of Sciences, [email protected]
3 Centre of Biomedical Engineering, Bulgarian Academy of Sciences, [email protected]
Abstract. Every year about 45000 forest fires occur in Europe, burning half a million hectares of land, some of which are protected zones with rare species of flora and fauna. The existing methods for wildland fire modeling are very complicated and their realizations need a lot of computational capacity. That is why we use another approach, based on the game-method theory, which consumes fewer computational resources.
1 Introduction
The study of wildland fire should begin with the basic principles and mechanisms of the combustion process: fire fundamentals. Fire behavior is what a fire does, the dynamics of the fire event. An understanding of the fundamentals of wildland fire is important for some very practical reasons. The combustion process can be manipulated to some extent: retardants can be applied to affect the combustion process; fuel arrangement can be altered for hazard reduction; and appropriate environmental conditions can be chosen for prescribed fire to reduce smoke impacts, achieve the desired fuel reduction, and still retain control of the fire. The need to understand wildland fire fundamentals is even more pressing than it was in the past. In earlier times the focus was on describing the aspects of fire that are important to suppression efforts. That continues to be a high priority. In addition, there is now increasing emphasis on characterizing fire for its effect on vegetation and for the smoke it produces [4]. The basis for the fire modeling is the Rothermel model for the behavior of surface fires [5]. It calculates, for any given point, local intensity and spread parameters for the head of a surface fire. Inputs for the model are a two-dimensional wind field, terrain parameters, fuel moisture and a detailed description of the fuel bed; the fire growth is based on the local behavior output by the Rothermel model and on a model for the local shape of fire spread. The mathematical models require descriptions of fuel properties as inputs to calculations of fire danger indices or fire behavior potential. The set of parameters describing the fuel characteristics has become known as fuel models and can be organized into four groups: grass, shrub, timber, and slash. Fuel models for fire danger rating have increased to twenty, while fire behavior predictions and
applications have utilized the thirteen fuel models tabulated by Rothermel [5] and Albini [1]. Each fuel model is described by the fuel load and the ratio of surface area to volume for each size class; the depth of the fuel bed involved in the fire front; and fuel moisture, including that at which fire will not spread, called the moisture of extinction. The fire models used nowadays by wildland fire managers are specialized for distinct types of fire behavior. There are separate models for surface fires, crown fires, spotting, and point-source fire acceleration. These behaviors are really abstractions of an overall three-dimensional process of unconfined combustion that links implicitly to the environment through heat and mass transfer feedbacks. In this work we focus on the game-method principle, which is easy to understand, use, and implement when one is trying to model the spreading of a wildland fire. This model can also be used by non-specialists in the forest fire field for learning about and describing this phenomenon. The game-method model differs from the other models in its use of a small amount of computational resources. Most of the developed mathematical models require descriptions of fuel properties as inputs to calculations of fire danger indices or fire behavior potential. They are so sophisticated that their resulting simulations have predictions which cannot be tested analytically. In nature, the natural consistency of a given model is usually tested by calculations of sets of simple experiments (e.g. a hydrodynamics code is always tested by calculations of shock tube experiments). However, often such tests are impossible or ambiguous. In such cases a more basic question arises, the correctness question, in the sense that all applied conditions and restrictions are valid, no additional conditions and restrictions are applied, and the model rules are not contradictory among themselves. In other words, we mean correctness in the mathematical and logical sense (completeness, conservativeness and consistency). That is why in this paper we focus on the game-method model [2] and apply it to field fires.
2 Description of the Game-Method Model
The combinatorial game-method has mostly theoretical applications, in astronomy for example, but we will try to show its possible application to the fire spreading problem. Let a set of symbols S and an n-dimensional simplex (in the sense of [3]) comprising n-dimensional cubes (at n = 2, a two-dimensional net of squares) be given. Let material points be found in some of the vertices of the simplex and let a set of rules A be given, containing:
– rules for the motions of the material points along the vertices of the simplex;
– rules for the interactions among the material points.
Let the rules from the i-th type be marked as i-rules, where i = 1, 2. Each material point is given by a number, an n-tuple of coordinates characterizing its location in the simplex, and a symbol from S reflecting the peculiarity of the material point (e.g. in physical applications: mass, charge, concentration, etc.).
We shall call an initial configuration every set of (n + 2)-tuples with the initial component being the number of the material point; the second, third, etc. until the (n + 1)-st, its coordinates; and the (n + 2)-nd, its symbol from S. We shall call a final configuration the set of (n + 2)-tuples having the above form and which is a result from a (fixed) initial configuration by a given number of applications of the rules from A. One application of a rule from A over a given configuration K will be called an elementary step in the transformation of the model and will be denoted by A1(K). In this sense, if K is an initial configuration and L a final configuration derived from K through m applications of the rules from A, then configurations K0, K1, ..., Km will exist, for which K0 = K, Ki+1 = A1(Ki) for 0 ≤ i ≤ m − 1, Km = L (the equality "=" is in the sense of coincidence of the configurations), and this will be denoted by L = A(K) ≡ A1(A1(...A1(K)...)). Let a rule P be given, which juxtaposes to a combination of configurations M a single configuration P(M) being the mean of the given ones. We shall call this rule a concentrate one. The concentration can be made over the values of the symbols from S for the material points, as well as over their coordinates, however, not over both of them simultaneously. For each choice of the rule P one should proceed from physical considerations. For example, if the k-th element of M (1 ≤ k ≤ s, where s is the number of elements of M) is a rectangle with p × q squares and if the square standing in the (i, j)-th place (1 ≤ i ≤ p, 1 ≤ j ≤ q) contains the number d^k_{i,j} ∈ {0, 1, ..., 9}, then in the (i, j)-th place of P(M) stands:

– the minimal number   d_{i,j} = [ (1/s) Σ_{k=1}^{s} d^k_{i,j} ],
– the maximal number   d_{i,j} = ⌈ (1/s) Σ_{k=1}^{s} d^k_{i,j} ⌉,
– the average number   d_{i,j} = [ (1/s) Σ_{k=1}^{s} d^k_{i,j} + 1/2 ],

where, for a real number x = a + α with a a natural number and α ∈ [0, 1), [x] = a, while ⌈x⌉ = a if α = 0 and ⌈x⌉ = a + 1 if α > 0. Let B be a criterion defined from physical or mathematical considerations. Given two configurations K1 and K2, it answers the question whether they are close to
each other or not. For example, for two configurations K1 and K2 having the form from the above example,

B(K1, K2) = (1/(p·q)) Σ_{i=1}^{p} Σ_{j=1}^{q} |d^1_{i,j} − d^2_{i,j}| < C1

or

B(K1, K2) = ( (1/(p·q)) Σ_{i=1}^{p} Σ_{j=1}^{q} (d^1_{i,j} − d^2_{i,j})^2 )^{1/2} < C2,

where C1 and C2 are some constants. For the set of configurations M and the set of rules A we shall define the set of configurations A(M) = {L | (∃K ∈ M)(L = A(K))}. The rules A will be called statistically correct if, for a great enough (from a statistical point of view) natural number N:

(∀m > N)(∀M = {K1, K2, ..., Km}) (B(A(P(M)), P({Li | Li = A(Ki), 1 ≤ i ≤ m})) = 1).   (1)
The essence of the method is the following: the set of rules A, the proximity criterion B and the concentrate rule P are fixed preliminarily. A set of initial configurations M is chosen and the set of the corresponding final configurations is constructed. If equation (1) is valid, we can assume that the rules from the set A are correct within the frames of the model, i.e. they are logically consistent. Otherwise we replace part (or all) of them with others. Once the rules become correct, we can add some new ones to the set or transform some of the existing ones, permanently checking the correctness of the newly constructed system of rules. Thus, in consecutive steps, extending and complicating the rules in A and checking their correctness, we construct the model of the given process. Afterwards we may check the temporal development (with respect to the final system of rules A) of a particular initial configuration. In modelling one generally works with one or several particular configurations; here, however, we first check the correctness of the modelling rules and only then proceed to the actual modelling. This is largely due to the fact that we work with discrete objects and rules convenient for computer realization. Thus a series of checks of equation (1) can be performed before the configuration A(K) is constructed for a given configuration K and a set of rules A. Here we start describing our model. Let us have an N×M matrix, which is an abstract description of the surface of a forest. The visualization is an N×M net of bins containing numbers from 0 to 9, which represent the burning coefficient. The value 0 corresponds to a bin that is totally burned or has nothing to burn (a river, rock, lake, etc.), and the values from 1 to 9 give the thickness of the forest trees in each bin. Another
parameter will be the wind vector, with its direction and intensity. In our software application we include a flag showing the state of each bin at the moment the fire starts. The flag is 1 if the bin is burning and 0 if not. In our simulation the fire starts from the central bin, although of course it can start from any bin. The bin coefficients represent how long a bin will burn until it is totally burned. In this paper we consider the simplest case, without wind, so the fire spreads in all directions with the same speed. If the state of a bin is 1 (burning), then at every time step (iteration) its burning coefficient decreases by 1 until it becomes 0 (totally burned). If the state of a bin is 0 (not burning) and the states of all neighboring bins are 0, then its burning coefficient stays unchanged. If the state of a bin is 0 and the state of any of the neighboring bins was 1 in the preceding iteration, then the state of the bin becomes 1. Thus with this model we can trace the development of the fire at every time moment.
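A minimal sketch of one such update step, assuming a 4-neighbour neighbourhood and assuming that a bin with coefficient 0 (nothing to burn) cannot ignite and that a bin stops burning once its coefficient reaches 0; these details are illustrative choices, not taken from the paper.

import numpy as np

def fire_step(coeff, burning):
    """coeff: N x M burning coefficients (0..9); burning: N x M flags (0 or 1)."""
    new_coeff = coeff.copy()
    new_burning = burning.copy()
    n, m = coeff.shape
    for i in range(n):
        for j in range(m):
            if burning[i, j] == 1:
                new_coeff[i, j] = max(coeff[i, j] - 1, 0)   # burn down by 1 per step
                if new_coeff[i, j] == 0:
                    new_burning[i, j] = 0                   # totally burned (assumption)
            else:
                neighbours = [(i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1)]
                ignited = any(0 <= a < n and 0 <= b < m and burning[a, b] == 1
                              for a, b in neighbours)
                if ignited and coeff[i, j] > 0:             # cannot ignite an empty bin (assumption)
                    new_burning[i, j] = 1
    return new_coeff, new_burning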
3 Experimental Results
In this section we prepare several test problems. Our test area is divided into N × M bins. In a real problem one bin can correspond to 10 square meters. To generate tests we draw random numbers from 1 to 9 for the burning coefficients. We start the burning from the central bin and the burning process continues until the fire reaches the borders of the area. We generate K tests (simulations). To check the correctness of the algorithm, we calculate the average area by averaging every bin with respect to all simulated areas:

b_{ij} = (1/K) Σ_{n=1}^{K} a_{ijn}.   (2)

We do the same for the burned (resulting) areas:

c_{ij} = (1/K) Σ_{n=1}^{K} ā_{ijn},   (3)

where ā_{ijn} denotes the burned area resulting from the n-th simulation.
After that we apply the method on the average initial area and the result is the burned area d_{ij}. We compare c_{ij} with d_{ij} for every i and j.

Table 1. The development of the fire (three successive states of the 9 × 9 burning-coefficient area)

3 3 3 9 8 4 6 1 3
9 4 2 5 8 4 2 9 9
8 5 8 9 2 5 8 1 9
9 3 5 4 8 8 1 6 5
2 9 2 8 2 6 4 2 9
6 4 4 0 0 0 5 1 1
4 9 7 6 4 1 9 1 1
2 3 5 8 3 1 1 3 5
9 7 8 8 7 5 4 7 2
3 3 3 9 8 4 6 1 3
9 4 2 5 8 4 2 9 9
8 5 8 9 2 5 8 1 9
9 3 5 3 7 7 1 6 5
2 9 2 7 1 5 4 2 9
6 4 4 0 0 0 5 1 1
4 9 7 6 4 1 9 1 1
2 3 5 8 3 1 1 3 5
9 7 8 8 7 5 4 7 2
3 3 3 9 8 4 6 1 3
9 4 2 5 8 4 2 9 9
8 5 7 8 1 4 7 1 9
9 3 4 2 6 6 0 6 5
2 9 1 6 0 4 3 2 9
6 4 3 0 0 0 4 1 1
4 9 7 6 4 1 9 1 1
2 3 5 8 3 1 1 3 5
9 7 8 8 7 5 4 7 2
Fig. 1. Differences between the average of the result areas and the result on the average initial area; the differing bins are marked with X
On Figure 1 the different bins between the average result areas and the result on the average initial area are marked with X. We prepared an example where the area consists of 143 × 143 bins. We generated 30 simulations. We apply the method to every simulation and to the average area, in which the value of bin (i, j) is the average of the bins (i, j) of all 30 simulations. We average the resulting areas and compare them with the result of the average area (Figure 1). Only 914 bins, which are 4% of the whole area, are different, and they lie on the boundary (the so-called boundary effect). So we can conclude that the results are statistically similar and the method is correct.

Table 2. The area after the third time step

o o o o o o o o o
o o o o o o o o o
o o o o o o o o o
o o o o o o o o o
o o o o X o o o o
o o o o o o o o o
o o o o o o o o o
o o o o o o o o o
o o o o o o o o o
o o o o o o o o o
o o o o o o o o o
o o o o o o o o o
o o o X X X o o o
o o o X X X o o o
o o o o o o o o o
o o o o o o o o o
o o o o o o o o o
o o o o o o o o o
o o o o o o o o o
o o o o o o o o o
o o X X X X X o o
o o X X X X O o o
o o X X O X X o o
o o X o o o X o o
o o o o o o o o o
o o o o o o o o o
o o o o o o o o o
o o o o o o o o o
o X X X X X X X o
o X X X X X X O o
o X X X X X O X o
o X O X O X X X o
o X X o o o X O o
o X X X o O X O o
o o o o o o o o o
o o o o o o o o o
We prepare a small example with 9 × 9 bins. We want to show how the method works and how the fire disperses. The first block of Table 1 is the initial area with the burning coefficients in every bin; a coefficient 0 marks a bin with nothing to burn. In Table 1 we observe the decrease of the burning coefficients. In Table 2 we show the fire dispersion and how the fire surrounds the unburned area. The bins with o are without fire, the bins with X are burning bins, and the bins with O are totally burned.
4 Conclusion
In this paper we describe a method of modelling wildland fire using the game-method theory. We focused on a simplified example in which the area is flat and there is no wind. Our model can trace the fire and predict its development, which is very important in order to bring it under control and to prevent it from causing large damage. In future work we will extend the model to include areas which are not flat and meteorological data such as wind and humidity. Thus our model will become more flexible and applicable in different situations. We will use real data from a past wildland fire and compare how well we model the development of the process.
References 1. Albini, F.A.: Estimating wildfire behavior and effects. USDA For. Serv. Gen. Tech. Rep. INT-30, 92 p. lntermt. For. and Range Exp. Stn., Ogden, Utah (1976) 2. Atanassov, K., Atanassova, L., Sasselov, D.: On the combinatorial game-metod for modelling in astronomy (1994) 3. Kuratovski, K.: Topology. Academic Press, New York (1966) 4. Pyne, S.J., Andrews, P.L., Laven, R.D.: Introduction to Wildland Fire, p. 3. Wiley, New York (1996) 5. Rothermel, R.C.: A mathematical model for predicting fire spread in wildland fuels. Research Paper INT-115. Ogden, UT: US Department of Agriculture, Forest Service, Intermountain Forest and Range Experiment Station, pp. 1–40 (1972)
Joint Analysis of Regional Scale Transport and Transformation of Air Pollution from Road and Ship Transport

K. Ganev 1, D. Syrakov 2, G. Gadzhev 1, M. Prodanova 2, G. Jordanov 1, N. Miloshev 1, and A. Todorova 1

1 Geophysical Institute, Bulgarian Academy of Sciences, Sofia, Bulgaria
2 National Institute of Meteorology and Hydrology, Bulgarian Academy of Sciences, Sofia, Bulgaria
Abstract. The objective of the present work is to study in detail the dilution processes of the plumes and the chemical transformations of pollutants jointly generated by road and ship transport. The numerical simulations are carried out using the MODELS-3 system (SMOKE/ MM5/CMAQ) nesting abilities, downscaling to a horizontal resolution of 30 km, the innermost domain including regions with very intensive road and ship transport and different climate (photochemistry reactions intensity). The CMAQ “Process rate analysis” option is applied to discriminate the role of different dynamic and chemical processes to the pollution levels formation. A large number of numerical experiments were carried out, which makes it possible to distinguish the relative contribution of different air pollution factors. Careful and detailed analysis of the obtained results was made, outlining the influence of the domain specific physiographic characteristics, road and ship emission impacts on the pollution characteristics.
1 Introduction
The objective of the present work is to study the regional scale dilution and chemical transformation processes of pollutants generated by road and ship transport. More precisely, the study aims at clarifying the interaction between the pollution from road and ship emissions, their mutual impacts and their contribution to the overall pollution. It is expected that the results of the current work will give some clues for the specification of the "effective emission indices" linking emission inventories to the emissions to be used as input in large scale models.
2 Modelling Tools
The US EPA Models-3 system was chosen as a modelling tool because it appears to be one of the most widely used models with proven simulation abilities. The
system consists of three components: MM5 – the fifth generation PSU/NCAR Meso-meteorological Model MM5 ([6,7]) used as meteorological pre-processor; CMAQ – the Community Multiscale Air Quality System ([5,2]); SMOKE – the Sparse Matrix Operator Kernel Emissions Modelling System ([4,8,3]).
3 Methodology
US NCEP Global Analyses data were used for the meteorological background input. The data have a 1 × 1 degree grid resolution covering the entire globe, with a time resolution of 6 hours. The Models-3 system nesting abilities were applied for downscaling the problem to a resolution of 30 km for the domain discussed further. Two sets of emission data are used in the present study:
– the data set created by Visschedijk et al. ([9]) was used for all the emissions except those from ship transport;
– the ship transport emissions were taken from the inventory created by Wang et al. ([10]).
To prepare the CMAQ emission input file, these inventory files were handled by a specially prepared computer code. Two main procedures were performed. The temporal allocation is made on the basis of daily, weekly, and monthly profiles, provided by Builtjes in [1]. The temporal profiles are country-, pollutant- and SNAP (Selected Nomenclature for Air Pollution)-specific. A specific approach for obtaining speciation profiles is used here. The US EPA data base is intensively exploited. Typical sources for every SNAP were linked to similar source types from the US EPA nomenclature. The weighted averages of the respective speciation profiles are accepted as SNAP-specific splitting factors. In such a way VOC and PM2.5 speciation profiles are derived. The biogenic emissions of VOC were estimated by the model SMOKE. The study was based on a joint analysis of the results from the following emission scenarios:
1. simulations with all the emissions in the simulation domain, the corresponding arbitrary characteristic Φ (concentration, deposition, columnar value, process contribution, etc.) being denoted by Φall;
2. simulations with the emissions from road transport reduced by a factor of α, the corresponding characteristic Φ denoted by Φreduced road;
3. simulations with the emissions from ship transport reduced by a factor of α, the corresponding characteristic Φ denoted by Φreduced ship;
4. simulations with the emissions from both road and ship transport reduced by a factor of α, the corresponding characteristic Φ denoted by Φreduced road and ship.
The most natural properties which can be constructed from these scenarios are the relative (in %) contributions of road/ship emissions to the formation of the characteristic Φ:
Φroad = (1/(1 − α)) · (Φall − Φreduced road)/Φall · 100 [%],
Φship = (1/(1 − α)) · (Φall − Φreduced ship)/Φall · 100 [%].
Some more "sophisticated" properties can also be defined, like the impact of road transport on the pollution from ship emissions (the ratio Φship|road of the pollution from ship emissions when the road emissions are also excluded to the pollution from ship emissions with the road emissions present), or vice versa (Φroad|ship):

Φship|road = (Φreduced road − Φreduced road and ship)/(Φall − Φreduced ship) · 100 [%],
Φroad|ship = (Φreduced ship − Φreduced road and ship)/(Φall − Φreduced road) · 100 [%].
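The sketch below (assumed, not part of the paper's code) shows how these point-wise contributions could be evaluated from the four scenario fields; phi_all, phi_red_road, phi_red_ship and phi_red_both are arrays holding the characteristic Φ from each scenario and alpha is the reduction factor.

import numpy as np

def contributions(phi_all, phi_red_road, phi_red_ship, phi_red_both, alpha):
    """Relative road/ship contributions and the 'ship-to-road' / 'road-to-ship' ratios, in %."""
    phi_road = (phi_all - phi_red_road) / ((1.0 - alpha) * phi_all) * 100.0
    phi_ship = (phi_all - phi_red_ship) / ((1.0 - alpha) * phi_all) * 100.0
    phi_ship_given_road = (phi_red_road - phi_red_both) / (phi_all - phi_red_ship) * 100.0
    phi_road_given_ship = (phi_red_ship - phi_red_both) / (phi_all - phi_red_road) * 100.0
    return phi_road, phi_ship, phi_ship_given_road, phi_road_given_ship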
MM5 and CMAQ simulations were carried out for the periods January 2001– 2005 and July 2001–2005. Averaging the fields over the respective month produces a diurnal behavior of given pollution characteristic, which can be interpreted as “typical” for the month (respectively season). The characteristic, which will be mostly demonstrated and discussed as an example further in this paper is the surface concentration c. Moreover, what will be shown and discussed concerns not only separate pollutants, but also some aggregates like nitrogen compounds (GNOY = NO + NO2 + NO3 + 2*N2O5 + HONO + HNO3 + PNA), organic nitrates (ORG N = PAN + NTR), hydrocarbons (HYDC = PAR + ETH + OLE + TOL + XYL + ISO), CAR PHE = FORM + ALD2 + MGLY + CRES, aerosol NH4, SO4 and H2O, PM2.5 and PMcoarse = PM10 – PM2.5. The Models-3 “Integrated Process Rate Analysis” option is applied to discriminate the role of different dynamic and chemical processes for the pollution from road and ship transport. The processes that are considered are: advection, diffusion, mass adjustment, emissions, dry deposition, chemistry, aerosol processes and cloud processes/aqueous chemistry.
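For illustration only, the aggregated families quoted above can be assembled from the individual species fields as follows; the dictionary-based interface is an assumption, and only the GNOY and PMcoarse formulas given in the text are shown.

import numpy as np

def aggregates(c):
    """c maps species names to concentration arrays of equal shape."""
    gnoy = (c["NO"] + c["NO2"] + c["NO3"] + 2.0 * c["N2O5"]
            + c["HONO"] + c["HNO3"] + c["PNA"])          # nitrogen compounds
    pm_coarse = c["PM10"] - c["PM2.5"]                   # PMcoarse = PM10 - PM2.5
    return {"GNOY": gnoy, "PMcoarse": pm_coarse}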
4 Results and Discussions
The respective "typical" concentrations can also be averaged over the day, and these daily means will be (due to volume limitations) most of the illustrations shown, as in Fig. 1. The impact of road/ship emissions is rather complex and would take many pages to describe. One cannot help noticing, however, how well the configurations of the most intensive ship/road transport are displayed. Due to the non-linearity of the processes, their impact can be negative as well. Some big cities and the main ship routes are particularly well displayed as sinks in the July ozone plots. The same plots for a "typical" January are demonstrated in Fig. 2. The impact of road/ship emissions is again very complex. The difference, both qualitative and quantitative, between "summer" and "winter" fields is quite evident for all
Fig. 1. Averaged for a "typical" day in July: fields of croad [%] (a) and cship [%] (b) for ozone, PM2.5, nitrogen compounds, hydrocarbons, CAR PHE, PMcoarse, organic nitrates, and SO2
the species, though the configurations of the most intensive ship/road transport are again very well displayed. The relative contribution of ship/road emissions does have a diurnal course, which is well displayed by Fig. 3. Again the temporal behavior is complex and cannot be described by some very general statements. The most intensive ship/road sources are well visible as sinks, and the sink locations do not change dramatically during the day. The temporal variations of the shape and location of the areas where ship/road transport emissions have a positive impact on the ozone levels are more prominent. It should be noted that the detailed spatial distribution shows that the contribution of road/ship transport emissions to the surface concentration of some compounds can change sign from point to point; averaged over large enough domains, however, the respective contributions vary with time but their sign, though different for different compounds and aggregated families, does not change during the day.
Fig. 2. Averaged for a "typical" day in January: fields of croad [%] (a) and cship [%] (b) for ozone, PM2.5, nitrogen compounds, hydrocarbons, CAR PHE, PMcoarse, organic nitrates, and SO2
The more complex and sophisticated "road-to-ship" and "ship-to-road" contributions are illustrated in Fig. 4. It is really hard to describe the plots, but they are a very good demonstration of how complex the interactions of the pollutants are and how strange and surprising the impact of one type of transport on the air pollution from the other can be. The absolute value of these contributions at some points could be very large, but generally for most of the compounds the contributions vary within relatively narrow margins around 100%. The corresponding plots for January are of course different, but the most general features mentioned above are valid as well. To better understand the interaction of pollutants from road and ship transport, one should closely examine the contributions of the different processes to the total pollution, in particular the "road-to-ship" and "ship-to-road" contributions to the respective processes. That is why the CMAQ "Integrated Process Rate Analysis" option was applied and the "portions" of the major processes which
Fig. 3. Diurnal course of croad [%] (a) and cship [%] (b) for ozone for a typical July day
Fig. 4. Fields of cship|road [%] (a) and of croad|ship [%] (b) for ozone, PM2.5, nitrogen compounds and hydrocarbons, July
form the hourly concentration changes were calculated. The process "portions" Δc(proc) can be treated like the concentrations, and the respective "road", "ship", "road-to-ship", and "ship-to-road" contributions can be estimated. Plots of the horizontal fields of the "road-to-ship" Δc(chem)ship|road and "ship-to-road" Δc(chem)road|ship contributions to the chemical transformation "portion" of the hourly change of the surface ozone concentrations are shown in Fig. 5. The very complex and "mosaic" texture of the plots can hardly be discussed in detail. This is simply an illustration of how discouragingly complex the mutual impacts of pollution from different source types are and
Fig. 5. Diurnal course of Δc(chem)ship|road [%] (a) and of Δc(chem)road|ship [%] (b) for ozone, July
how difficult the answer to a question like "What is the impact of the pollution from road transport on the chemical transformation of the ozone from ship transport?", or vice versa, could be.
5 Conclusion
The numerical experiments performed produced a huge volume of information, which has to be carefully analyzed and generalized before final conclusions can be made. The conclusion that can be made at this stage of the studies is that the transport type/process interactions are indeed very complex. The results produced by the CMAQ "Integrated Process Rate Analysis" demonstrate the very complex behavior and interaction of the different processes: the process contributions change very quickly with time, and these changes for the different points on the plane hardly correlate at all. The analysis of the behavior of the different processes does not give a simple answer to the question of what the impact of the pollution from a given source type on the processes that form the pollution from another source type could be.
Acknowledgments. The present work is supported by EC through the 6FP NoE ACCENT (GOCE-CT2002-500337), the SEE-GRID-SCI project, contract No. FP7-RI-211338, COST Action 728, as well as by the Bulgarian National Science Fund (grants No. DO02-161/16.12.2008 and DO02-115/2008). The contacts within the framework of the NATO SfP Grant ESP.EAP.SFPP 981393 were extremely stimulating as well.
References 1. Builtjes, P.J.H., van Loon, M., Schaap, M., Teeuwisse, S., Visschedijk, A.J.H., Bloos, J.P.: Project on the modelling and verification of ozone reduction strategies: contribution of TNO-MEP, TNO-report, MEP-R2003/166, Apeldoorn, The Netherlands (2003) 2. Byun, D., Schere, K.L.: Review of the Governing Equations, Computational Algorithms, and Other Components of the Models-3 Community Multiscale Air Quality (CMAQ) Modeling System. Applied Mechanics Reviews 59(2), 51–77 (2006) 3. CEP: Sparse Matrix Operator Kernel Emission (SMOKE) Modeling System, University of Carolina, Carolina Environmental Programs, Research Triangle Park, North Carolina (2003) 4. Coats Jr., C.J., Houyoux, M.R.: Fast Emissions Modeling with the Sparse Matrix Operator Kernel Emissions Modeling System, TheEmissions Inventory: Key to Planning, Permits, Compliance, and Reporting, Air and Waste Management Association, New Orleans (September 2006) 5. Dennis, R.L., Byun, D.W., Novak, J.H., Galluppi, K.J., Coats, C.J., Vouk, M.A.: The Next Generation of Integrated Air Quality Modeling: EPA’s Models-3. Atmosph. Environment 30, 1925–1938 6. Dudhia, J.: A non-hydrostatic version of the Penn State/NCAR Mesoscale Model: validation. Mon. Wea. Rev. 121, 1493–1513 (1993) 7. Grell, G.A., Dudhia, J., Stauffer, D.R.: A description of the Fifth Generation Penn State/NCAR Mesoscale Model (MM5). NCAR Technical Note, NCAR TN-398STR, 138 p. (1994) 8. Houyoux, M.R., Vukovich, J.M.: Updates to the Sparse Matrix Operator Kernel Emission (SMOKE) Modeling System and Integration with Models-3, The Emission Inventory: Regional Strategies for the Future, Raleigh, NC, Air and Waste Management Association (1999) 9. Visschedijk, A.J.H., Zandveld, P.Y.J., Denier van der Gon, H.A.C.: A High Resolution Gridded European Emission Database for the EU Integrate Project GEMS, TNO-report 2007-A-R0233/B, Apeldoorn, The Netherlands (2007) 10. Wang, C., Corbett, J.J., Firestone, J.: Modeling Energy Use and Emissions from North American Shipping: Application of the Ship Traffic, Energy, and Environment Model. Environ. Sci. Technol. 41(9), 3226–3232 (2007)
Runs of UNI-DEM Model on IBM Blue Gene/P Computer and Analysis of the Model Performance

Krassimir Georgiev 1 and Zahari Zlatev 2

1 Institute for Parallel Processing, Bulgarian Academy of Sciences, Acad. G. Bonchev, bl. 25A, 1113 Sofia, Bulgaria, [email protected]
2 National Environmental Research Institute, Aarhus University, Frederiksborgvej 399, P.O. Box 358, DK-4000 Roskilde, Denmark, [email protected]
Abstract. UNI-DEM is an Eulerian model for studying the long range transport of air pollutants. The computational domain of the model covers Europe and some neighbouring parts of the Atlantic Ocean, Asia and Africa. The model is mainly developed at the National Environmental Research Institute of Denmark, located at Roskilde. If the UNI-DEM model is to be applied on a large space domain by using fine grids, then its discretization leads to a huge computational problem. If the space domain is discretized by using a (480 x 480) grid and the number of chemical species studied by the model is 35, then several systems of ordinary differential equations containing 8 064 000 equations have to be treated at every time-step (the number of time-steps being typically several thousand). If a three-dimensional version of the same air pollution model is to be used, then the figure above must be multiplied by the number of layers. This implies that a model such as UNI-DEM must be run only on high-performance computer architectures, like the IBM Blue Gene/P. The implementation of such a complex large-scale model on each new computer is a non-trivial task. An analysis of the runs of UNI-DEM performed until now on the IBM Blue Gene/P computer is presented, and some preliminary results on performance, speed-ups and efficiency are discussed.
1 Introduction
The environmental models are normally described by systems of partial differential equations (PDEs). Often, the number of equations is equal to the number of chemical species studied by the models, and the unknown functions are the concentrations of these species. There are five basic stages in the development of a large environmental model:
• one has to choose carefully the physical and chemical processes that are to be taken into account during the development of the model,
• the selected processes must be described mathematically,
• the resulting system of PDEs must be treated by numerical methods, • the reliability of the obtained results must be evaluated, and • conclusions should be drawn. It is important to take into account all relevant physical and chemical processes during the development of the models. If more physical and chemical processes are included in the model, then one should expect that more accurate and more reliable results can be calculated. On the other hand, there are two difficulties related to the attempt to include as many as possible physical and chemical processes in the model: • The complexity of the model is increased when more processes are included in it. The treatment of the model on the available computers might become very difficult, and even impossible, when too many processes are taken into account. • Some of the physical and chemical processes are still not well understood, which means that such processes must either be neglected or some simplified mechanisms, based on uncertain assumptions and/or on experimental data, must be developed. It is necessary to find a reasonable compromise related to the number of processes that are to be taken into account when a large environmental model is developed. This explains also why it is necessary to validate the model results. The selected physical and chemical processes have to be described by mathematical terms. There exist some more or less standard rules which can be used for the mathematical description of different processes. Several examples, related to air pollution modes, are listed below: • The transport caused by the wind (which is normally called advection) is described by using terms containing first-order spatial derivatives of the unknown functions (the concentrations of the studied pollutants) multiplied by the wind velocities along the coordinate axes. • The diffusion of the concentrations is expressed by second-order spatial derivatives multiplied by the diffusivity coefficients. • The chemical reactions are represented by non-linear mathematical terms. • The change of the concentrations in time is given by first-order derivatives of the concentrations of the pollutants with respect to time. When all selected physical and chemical processes are expressed by some mathematical terms, then these terms have to be combined in a system of PDEs. The rest of the paper is organized as follows. The mathematical background of UNI-DEM is discussed in Section 2. The importance of the efficient organization of the computational process is outlined in Section 3. Section 4 is devoted to the parallelization strategy and the supercomputer used, analysis of the performed runs of UNI–DEM on IBM Blue Gene/P computer and some preliminary results on the performance, speed ups and efficiency achieved. Finally, short concluding remarks can be found in Section 6.
2 Mathematical Description of the Danish Eulerian Model
The Danish Eulerian Model (DEM) is represented mathematically by the following system of partial differential equations (PDEs):

∂cs/∂t = − ∂(u cs)/∂x − ∂(v cs)/∂y − ∂(w cs)/∂z
         + ∂/∂x (Kx ∂cs/∂x) + ∂/∂y (Ky ∂cs/∂y) + ∂/∂z (Kz ∂cs/∂z)
         + Es − (κ1s + κ2s) cs + Qs(c1, c2, ..., cq),   s = 1, ..., q,   (1)
where cs = cs (t, x, y, z) is the concentration of the chemical specie s at point (x, y, z) of the space domain at time t of the time-interval, u = u(t, x, y, z), v = v(t, x, y, z), and w = w(t, x, y, z) are the wind velocities along Ox, Oy, and Oz directions respectively at point (x, y, z) and time t; Kx = Kx (t, x, y, z), Ky = Ky (t, x, y, z), and Kz = Kz (t, x, y, z) are diffusion coefficients; Es are the emissions, κ1s = κ1s (t, x, y, z) and κ2s = κ2s (t, x, y, z) are the coefficients for dry and wet deposition of the chemical specie s at point (x, y, z) and at time t of the time-interval (for some species these coefficients are non-negative constants, the wet deposition coefficients κ2s are equal to zero when it is not raining), and finally, Qs (t, x, y, z, c1 , c2 , . . . , cq ) are expressions that describe the chemical reactions under consideration. Normally, it is not possible to solve exactly the systems of PDEs by which the large environmental models are described mathematically. Therefore, the continuous systems of type (1) are to be discretized. Assume that the space domain, on which (1) is defined, is a parallelepiped (this is as a rule the case when environmental models are to be handled) and that x ∈ [a1 , b1 ], y ∈ [a2 , b2 ], z ∈ [a3 , b3 ] and t ∈ [a, b]. Consider the grid-points (tn , xj , yk , zm ), where xj = a1 + jΔx, j = 0, 1, 2, . . . , Nx , yk = a2 + kΔy, k = 0, 1, 2, . . . , Ny , zm = a3 + mΔz, m = 0, 1, 2, . . . , Nz , and tn = a + nδt, n = 0, 1, 2, . . . , Nt . Assume also that the initial values cs (a, x, y, z) are given on all points of the spatial grid defined above. Then the task of finding the exact solution of the unknown functions cs at all points (t, x, y, z) of the domain (infinite number of points) is reduced to the task of finding approximations of the values of the functions ci at the points (tn , xj , yk , zm ); the number of these points can be very large (up to many millions), but it is finite. This means that the original task is relaxed in two ways: the number of points at which the problem is treated is reduced to the number of grid-points and it is required to find approximate solutions instead of the exact solution. In the example given above equidistant grids are introduced (i.e. Δx, Δy, Δz and Δt are constants). Non-equidistant grids can also be used. The vertical grids are normally not equidistant. It is assumed here that Cartesian coordinates have been chosen. Other coordinates, for example spherical coordinates, can also be used.
The above two remarks illustrate the fact that the discretization can be performed in different ways. The important thing is that the main idea remains the same: one considers approximate values of the unknown functions at a finite number of grid-points, defined by the discretization chosen, instead of the exact solution of (1) on the whole continuous space domain. Numerical methods must be used to find approximate values of the solution at the grid-points. It is also appropriate to split the model, the system of PDEs of type (1), into several sub-models (sub-systems), which are in some sense simpler. There is another advantage when some splitting procedure is applied: the different sub-systems have different mathematical properties and one can try to select the best numerical method for each of the sub-systems. It is clear that if some splitting procedure and appropriate numerical methods are already chosen, then any continuous system of type (1), which represents an environmental model, is replaced by several discrete sub-models which have to be treated on the available computers. More details about the mathematical description of the physical and chemical processes involved in UNI-DEM can be found in [19] or [20].
3 Need for Efficient Implementation of the Computations
The discretization of the system of PDEs by which the environmental models are described mathematically leads to huge computational tasks. The following example illustrates clearly the size of these tasks. Assume that
• Nx = Ny = 480 (when a 4800 km × 4800 km domain covering Europe is considered, this choice of the discretization parameters leads to horizontal 10 km × 10 km cells),
• Nz = 10 (i.e. ten layers in the vertical direction are introduced), and
• Ns = q = 56 (the chemical scheme contains 56 species).
Then the number of equations that are to be handled at each time-step is (Nx + 1)(Ny + 1)(Nz + 1)Ns = 142 518 376. A run over a time-period of one year with a time step-size Δt = 2.5 minutes will result in Nt = 213 120 time-steps. When studies related to climatic changes are to be carried out, it is necessary to run the models over a time-period of many years. When the sensitivity of the model to the variation of some parameters is studied, many scenarios (up to several hundreds) are to be run. This short analysis demonstrates the fact that the computational tasks arising when environmental studies are to be carried out by using large-scale models are enormous. Therefore, it is necessary:
• to select fast but sufficiently accurate numerical methods and/or splitting procedures,
• to exploit efficiently the cache memories of the available computer,
• to parallelize the code
in the effort to make a large environmental model tractable on the available computers. It should be mentioned that it may be impossible to handle some
very large environmental models on the computers available at present even when the above three conditions are satisfied. The treatment of UNI–DEM on different supercomputers is discussed in [2,5,6,16,17,18,19,20]. This model has been used to study pollution levels in (a) Bulgaria ([24,25]), (b) Denmark ([19,22,23]), (c) England ([1]), (d) Europe ([3,4,19,22,20]), (e) Hungary ([9,10]) and (f) the North Sea ([8]).
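The problem sizes quoted above and in the abstract can be verified with a few lines of arithmetic; the script below is an illustrative check, not part of UNI-DEM.

nx = ny = 480          # horizontal grid
nz = 10                # vertical layers (3-D version)
ns = 56                # chemical species in this example

equations_3d = (nx + 1) * (ny + 1) * (nz + 1) * ns
print(equations_3d)    # 142518376 equations per time-step, as stated above

equations_2d = nx * ny * 35
print(equations_2d)    # 8064000 equations, the figure quoted in the abstract for 35 species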
4 Parallel Implementation of DEM

4.1 IBM Blue Gene/P Computer
The supercomputer which is used in this study is the first Bulgarian supercomputer, an IBM Blue Gene/P (see http://www.bgsc.acad.bg/). As reported, this machine consists of two racks, 2048 PowerPC 450 based compute nodes, 8192 processor cores and a total of 4 TB of random access memory. Each processor core has a double-precision, dual-pipe floating-point core accelerator. Sixteen I/O nodes are connected via fibre optics to a 10 Gb/s Ethernet switch. The smallest partition size currently available is 128 compute nodes (512 processor cores). The theoretical performance of the computer is 27.85 Tflops, while the maximum LINPACK performance achieved is 23.42 Tflops (≈ 84%).
4.2 Parallelization Strategy
The MPI (Message Passing Interface, [7]) standard library routines are used to parallelize this model. One of the most important advantages of MPI is that it can be used on a much wider class of parallel systems, including shared-memory computers and clustered systems (each node of the cluster being a separate shared-memory machine). Thus it provides a high level of portability of the code. Our MPI parallelization is based on the space domain partitioning. The space domain is divided into several sub-domains (the number of the sub-domains being equal to the number of MPI tasks). Each MPI task works on its own sub-domain. On each time step there is no data dependency between the MPI tasks on both the chemistry and the vertical exchange stages. This is not so with the advection-diffusion stage. Spatial grid partitioning between the MPI tasks requires overlapping of the inner boundaries and exchange of certain boundary values on the neighboring subgrids for proper treatment of the boundary conditions. This leads to two main consequences: (i) certain computational overhead and load imbalance, leading to lower speedup of the advection-diffusion stage in comparison with the chemistry and the vertical transport; (ii) communication necessity for exchanging boundary values on each time step (done in a separate communication stage).
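UNI-DEM itself is an MPI code; purely as an illustration of the boundary-value exchange described above, the following mpi4py sketch performs the halo exchange for a 1-D strip partitioning. The array shapes and variable names are assumptions made for the example and do not correspond to the actual UNI-DEM data structures.

import numpy as np
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank, size = comm.Get_rank(), comm.Get_size()

nx_local, ny = 40, 480                   # local strip height and full width (assumed)
c = np.zeros((nx_local + 2, ny))         # one halo row on each side of the sub-domain
c[1:-1, :] = rank                        # stand-in for the local concentration field

up = rank - 1 if rank > 0 else MPI.PROC_NULL
down = rank + 1 if rank < size - 1 else MPI.PROC_NULL

# send the first interior row up and receive the halo row from below, and vice versa
comm.Sendrecv(c[1, :].copy(), dest=up, recvbuf=c[-1, :], source=down)
comm.Sendrecv(c[-2, :].copy(), dest=down, recvbuf=c[0, :], source=up)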
Table 1. CHUNKSIZE runs on IBM Blue Gene/P computer

Size of chunks        |   1  |   2  |   4  |   8  |  16  |  32  |  64
Case 1 (CPU in hours) |  9.8 | 10.3 |  9.4 |  8.7 |  8.4 |  8.2 |  8.4
Case 2 (CPU in hours) | 43.2 | 37.7 | 33.4 | 30.4 | 30.1 | 31.1 | 37.0
Table 2. Characteristics of some parallel runs of UNI-DEM on the IBM Blue Gene/P computer

Computational process | Measured quantity | 15 procs | 30 procs | 60 procs | 120 procs
Advection             | CPU time          |  16.67   |   8.55   |   4.50   |   2.36
                      | Speed-up          |    -     |   1.95   |   3.70   |   7.06
                      | Percent           |  35.6    |  34.6    |  31.6    |  26.3
Chemistry             | CPU time          |  24.77   |  12.51   |   6.20   |   3.10
                      | Speed-up          |    -     |   1.98   |   4.00   |   7.99
                      | Percent           |  53.0    |  50.1    |  43.6    |  34.2
Overhead              | CPU time          |   5.28   |   3.54   |   3.52   |   3.53
                      | Speed-up          |    -     |   1.49   |   1.50   |   1.50
                      | Percent           |  11.4    |  15.3    |  24.8    |  39.2
Total                 | CPU time          |  46.72   |  24.60   |  15.21   |   9.00
                      | Speed-up          |    -     |   1.90   |   3.07   |   5.19
                      | Percent           | 100.0    | 100.0    | 100.0    | 100.0
The subdomains are usually too large to fit into the fast cache memory of the target processor. In order to achieve good data locality, the smaller low-level tasks are grouped in chunks. A parameter CHUNKSIZE is provided in the chemical-emission part of the DEM code, which should be tuned with respect to the cache size of the target machine (see Table 1). Remark: in Table 1 the following model parameters are used: (i) in both cases a 480 × 480 grid (10 km × 10 km cells) and 120 processors; (ii) Case 1: advection time step 150 s, chemical time step 150 s; Case 2: advection time step 90 s, chemical time step 9 s. The time for pre-processing and post-processing is, in fact, overhead introduced by the MPI partitioning strategy. Moreover, this overhead grows with the number of MPI tasks, and little can be done to process it in parallel. Thus the relative weight of these two stages grows with the number of MPI tasks, which eventually affects the total speed-up and efficiency of the MPI code (see the results reported in Table 2).
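The cache-blocking idea behind CHUNKSIZE can be sketched as follows (an assumed illustration, not the DEM code): the grid cells owned by one MPI task are processed in blocks of CHUNKSIZE cells so that the working set of the chemistry kernel stays in cache.

import numpy as np

CHUNKSIZE = 48
cells = np.arange(480 * 480 // 120)        # cells assigned to this MPI task (assumed count)

def chemistry_step(block):
    return block                            # placeholder for the chemical-emission kernel

for start in range(0, cells.size, CHUNKSIZE):
    block = cells[start:start + CHUNKSIZE]
    chemistry_step(block)                   # all low-level work for this chunk is done together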
4.3 Numerical Results
Results of some preliminary experiments for a one year period on the IBM Blue Gene/P computer described above are shown in Table 2. Let us mention the following details of the performed runs: 1. The computing times are measured in hours. 2. Discretization parameters are as follows: (a) Advection step: 150 s.; (b) Chemical step: 150 s., 480 × 480 grid in the spatial domain (10km × 10km cells). 3. The percentages are related to the total times for the processors used. 4. The speed-up factors are related to the computing times obtained when 15 processors are used. 5. All runs were performed by using chunks of length 48.
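As a small illustrative post-processing step (not part of the paper), the speed-ups in Table 2 relative to the 15-processor run, and the corresponding parallel efficiencies, can be recomputed from the total CPU times.

procs = [15, 30, 60, 120]
total_cpu_hours = [46.72, 24.60, 15.21, 9.00]    # "Total" row of Table 2

for p, t in zip(procs, total_cpu_hours):
    speedup = total_cpu_hours[0] / t             # relative to the 15-processor run
    efficiency = speedup / (p / procs[0])
    print(p, round(speedup, 2), round(100 * efficiency, 1))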
5 Conclusions
By using the highly parallel IBM Blue Gene/P computer to run the variable grid-size version of the Danish Eulerian Model, detailed air pollution results for a large region (the whole of Europe) and for a very long period (one or several years) can be obtained within a reasonable time. The use of many processors (several hundreds) gives the possibility to use a finer time resolution in both the advection-diffusion and the chemistry-deposition parts. Some preliminary runs with the combinations of time steps 90 s (advection/diffusion) and 9 s (chemistry/deposition), and 9 s (advection/diffusion) and 0.9 s (chemistry/deposition), gave very encouraging results. The unified parallel code of DEM, UNI-DEM, created by using the MPI standard library, appears to be highly portable and shows good efficiency and scalability on the IBM Blue Gene/P machine.
Acknowledgments This research is supported in part by grant DO02-115/2008 from the Bulgarian NSF and the Bulgarian Supercomputer Center giving access to IBM Blue Gene/P computer.
References 1. Abdalmogith, S., Harrison, R.M., Zlatev, Z.: Intercomparison of inorganic aerosol concentrations in the UK with predictions of the Danish Eulerian Model. Journal of Atmospheric Chemistry 54, 43–66 (2006) 2. Alexandrov, V., Owczarz, W., Thomsen, P.G., Zlatev, Z.: Parallel runs of large air pollution models on a grid of SUN computers. Math. and Comp. in Simulation 65, 557–577 (2004)
3. Ambelas Skjøth, C., Bastrup-Birk, A., Brandt, J., Zlatev, Z.: Studying variations of pollution levels in a given region of Europe during a long time-period. Systems Analysis Modelling Simulation 37, 297–311 (2000) 4. Bastrup-Birk, A., Brandt, J., Uria, I., Zlatev, Z.: Studying cumulative ozone exposures in Europe during a 7-year period. Journal of Geophysical Research 102, 23917–23935 (1997) 5. Georgiev, K., Zlatev, Z.: Running an advection–chemistry code on message passing computers. In: Alexandrov, V.N., Dongarra, J. (eds.) PVM/MPI 1998. LNCS, vol. 1497, pp. 354–363. Springer, Heidelberg (1998) 6. Georgiev, K., Zlatev, Z.: Parallel sparse matrix algorithms for air pollution models. Parallel and Distributed Computing Practices 2, 429–442 (2000) 7. Gropp, W., Lusk, E., Skjellum, A.: Using MPI: Portable programming with the message passing interface. MIT Press, Cambridge (1994) 8. Harrison, R.M., Zlatev, Z., Ottley, C.J.: A comparison of the predictions of an Eulerian atmospheric transport chemistry model with experimental measurements over the North Sea. Atmospheric Environment 28, 497–516 (1994) ´ Boz´ 9. Havasi, A., o, L., Zlatev, Z.: Model simulation on transboundary contribution to the atmospheric sulfur concentration and deposition in Hungary. Id¨ oj´ ar´ as 105, 135–144 (2001) ´ Zlatev, Z.: Trends of Hungarian air pollution levels on a long time-scale. 10. Havasi, A., Atmospheric Environment 36, 4145–4156 (2002) 11. Hesstvedt, E., Hov, Ø., Isaksen, I.A.: Quasi-steady-state approximations in air pollution modeling: comparison of two numerical schemes for oxidant prediction. Int. Journal of Chemical Kinetics 10, 971–994 (1978) 12. Hov, Ø., Zlatev, Z., Berkowicz, R., Eliassen, A., Prahm, L.P.: Comparison of numerical techniques for use in air pollution models with non-linear chemical reactions. Atmospheric Environment 23, 967–983 (1988) 13. Marchuk, G.I.: Mathematical modeling for the problem of the environment. Studies in Mathematics and Applications, vol. 16. North-Holland, Amsterdam (1985) 14. McRae, G.J., Goodin, W.R., Seinfeld, J.H.: Numerical solution of the atmospheric diffusion equations for chemically reacting flows. J. Comp. Physics 45, 1–42 (1984) 15. Ostromsky, T., Owczarz, W., Zlatev, Z.: Computational Challenges in Large-scale Air Pollution Modelling. In: Proc. 2001 International Conference on Supercomputing in Sorrento, pp. 407–418. ACM Press, New York (2001) 16. Ostromsky, T., Zlatev, Z.: Parallel Implementation of a Large-scale 3-D Air Pollution Model. In: Margenov, S., Wa´sniewski, J., Yalamov, P. (eds.) LSSC 2001. LNCS, vol. 2179, pp. 309–316. Springer, Heidelberg (2001) 17. Ostromsky, T., Zlatev, Z.: Flexible Two-level Parallel Implementations of a Large Air Pollution Model. In: Dimov, I.T., Lirkov, I., Margenov, S., Zlatev, Z. (eds.) NMA 2002. LNCS, vol. 2542, pp. 545–554. Springer, Heidelberg (2003) 18. Owczarz, W., Zlatev, Z.: Running a large-scale air pollution model on fast supercomputer. In: Chok, D.P., Carmichael, G.R. (eds.) Atmospheric Modelling, pp. 185–204. Springer, Heidelberg (2000) 19. Zlatev, Z.: Computer treatment of large air pollution models. Kluwer, Dordrecht (1995) 20. Zlatev, Z., Dimov, I.: Computational and Environmental Challenges in Environmental Modelling. Elsevier, Amsterdam (2006) 21. Zlatev, Z., Dimov, I., Georgiev, K.: Three-dimensional version of the Danish Eulerian Model. Zeitschrift f¨ ur Angewandte Mathematik und Mechanik 76, 473–476 (1996)
196
K. Georgiev and Z. Zlatev
22. Zlatev, Z., Dimov, I., Ostromsky, T., Geernaert, G., Tzvetanov, I., Bastrup-Birk, A.: Calculating losses of crops in Denmark caused by high ozone levels. Environmental Modelling and Assessment 6, 35–55 (2001) 23. Zlatev, Z., Moseholm, L.: Impact of climate changes on pollution levels in Denmark. Environmental Modelling 217, 305–319 (2008) 24. Zlatev, Z., Syrakov, D.: A fine resolution modelling study of pollution levels in Bulgaria. Part 1: SOx and NOx pollution. International Journal of Environment and Pollution 22(1-2), 186–202 (2004) 25. Zlatev, Z., Syrakov, D.: A fine resolution modelling study of pollution levels in Bulgaria. Part 2: High ozone levels. International Journal of Environment and Pollution 22(1-2), 203–222 (2004)
Sensitivity Analysis of a Large-Scale Air Pollution Model: Numerical Aspects and a Highly Parallel Implementation

Tzvetan Ostromsky1, Ivan Dimov1, Rayna Georgieva1, and Zahari Zlatev2

1 Institute for Parallel Processing, Bulgarian Academy of Sciences, Acad. G. Bonchev, bl. 25A, 1113 Sofia, Bulgaria
[email protected], [email protected], [email protected]
http://www.bas.bg/clpp/
2 National Environmental Research Institute, Department of Atmospheric Environment, Frederiksborgvej 399, P.O. Box 358, DK-4000 Roskilde, Denmark
[email protected]
http://www.dmu.dk/AtmosphericEnvironment
Abstract. The Unified Danish Eulerian Model (UNI-DEM) is a powerful air pollution model, used to calculate the concentrations of various dangerous pollutants and other chemical species over a large geographical region (mostly Europe). It takes into account the main physical and chemical processes between these species, the emissions, the deposition, and the advection and diffusion, depending on the changing meteorological conditions. This large and complex task is split into submodels responsible for the main physical and chemical processes. In the chemical submodel there are a number of parameters responsible for the rates of the corresponding chemical reactions. By simultaneous variation of these parameters we produced a set of multidimensional discrete functions. These are necessary for variance-based sensitivity analysis using adaptive Monte Carlo approaches, which is the subject of another paper. These huge computational tasks require extensive storage and CPU time. A highly parallel implementation of UNI-DEM has been created for this purpose and run on the new IBM Blue Gene/P supercomputer, the most powerful parallel machine installed in Bulgaria so far. Some details of this implementation and a set of results obtained by it are presented in this paper.
1 Introduction
Huge computational tasks arise in the treatment of large-scale air pollution models, and great difficulties appear even when modern high-performance computers are used. Therefore, it is an advantage to be able to simplify the model as much as possible, while keeping control over the accuracy and reliability of the results it produces. A detailed sensitivity analysis can help us to find out which simplifications can be made with minimum loss of accuracy. On the other hand, it is important to analyze the influence of variations of the initial conditions,
the boundary conditions, the chemical rates, etc., as there is always a certain level of uncertainty in their values. This knowledge can show us which parameters are most critical for the results and, therefore, how we can effectively improve the results by improving the accuracy of certain input data items.
2 Sensitivity Analysis Concept
Sensitivity analysis (SA) is the study of how much the uncertainty in the input data of a model (due to any reason: inaccurate measurements or calculations, approximation, data compression, etc.) is reflected in the accuracy of the output results [4]. Two kinds of sensitivity analysis are considered in the existing literature: local and global SA. Local SA studies how small variations of the inputs around a given value change the value of the output. Global SA takes into account the whole variation range of the input parameters and apportions the output uncertainty to the uncertainty in the input data. The subject of our study in this paper is global sensitivity analysis. Several sensitivity analysis techniques have been developed and used throughout the years [4]. In general, these methods rely heavily on special assumptions about the behaviour of the model (such as linearity, monotonicity and additivity of the relationship between the input and output parameters of the model). Among the quantitative methods, variance-based methods are the most often used. Their main idea is to evaluate how the variance of an input or a group of inputs contributes to the variance of the model output.
2.1 Definition of the Total Sensitivity Indices (TSI)
Assume that the model is represented by the following model function: u = f(x), where the input parameters x = (x_1, x_2, \ldots, x_d) \in U^d \equiv [0,1]^d are independent (non-correlated) random variables with a known joint probability distribution function (p.d.f.). In this way the output u also becomes a random variable (as a function of the random vector x) with its own p.d.f. and mathematical expectation E. Let D[E(u|x_i)] be the variance of the conditional expectation of u with respect to x_i, and D_u the total variance of u. Their ratio S_i = D[E(u|x_i)]/D_u is called the first-order sensitivity index by Sobol' [5], or sometimes the correlation ratio. The Total Sensitivity Index (TSI) [5] of an input parameter x_i, i \in \{1, \ldots, d\}, is the sum of the complete set of mutual sensitivity indices of any order (main effect, two-way interactions (second order), three-way interactions (third order), etc.):

S_{x_i}^{tot} = S_i + \sum_{l_1 \ne i} S_{i l_1} + \sum_{l_1, l_2 \ne i,\; l_1 < l_2} S_{i l_1 l_2} + \cdots + S_{i l_1 \ldots l_{d-1}},   (1)

where S_{i l_1 \ldots l_{j-1}} is the j-th order sensitivity index for the parameter x_i (1 \le j \le d) and S_i is the "main effect" of x_i. According to the values of their total sensitivity indices, the input parameters are classified in the following way: very important (0.8 < S_{x_i}^{tot}), important (0.5 < S_{x_i}^{tot} < 0.8), unimportant (0.3 < S_{x_i}^{tot} < 0.5), irrelevant (S_{x_i}^{tot} < 0.3).
In most practical problems the high-dimensional terms can be neglected, thus significantly reducing the number of summands in (1).
2.2 Sobol' Approach, Based on HDMR and ANOVA
The Sobol' method is one of the most often used variance-based methods. It is based on a unique decomposition of the model function into orthogonal terms (summands) of increasing dimension and zero means. Its main advantage is that not only the first-order indices but also the higher-order indices are computed in a uniform way (quite similar to the computation of the main effects). The total sensitivity index can then be calculated with just one Monte Carlo integral per factor. The Sobol' method for global SA, applied here, is based on the so-called HDMR (High Dimensional Model Representation) (2) of the (integrable) model function f in the d-dimensional factor space:

f(x) = f_0 + \sum_{s=1}^{d} \sum_{l_1 < \cdots < l_s} f_{l_1 \ldots l_s}(x_{l_1}, x_{l_2}, \ldots, x_{l_s}),   (2)

where f_0 is a constant. The representation (2) is not unique. Sobol' has proven that under the conditions (3) on the right-hand-side functions

\int_0^1 f_{l_1 \ldots l_s}(x_{l_1}, x_{l_2}, \ldots, x_{l_s}) \, dx_{l_k} = 0,   1 \le k \le s,   s = 1, \ldots, d,   (3)

the decomposition (2) is unique and is called the ANOVA (ANalysis Of VAriances) HDMR of the model function f(x). Moreover, the functions on the right-hand side can be defined in a unique way by multidimensional integrals [6].
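In practice, the first-order and total indices can be estimated by plain Monte Carlo sampling over the factor space. The following sketch is illustrative only: it is not part of the original study and uses the standard Saltelli/Jansen estimators rather than the adaptive Monte Carlo approaches mentioned in the abstract, and the toy model function is an arbitrary placeholder.

import numpy as np

def sobol_indices(f, d, n=20000, seed=1):
    """Crude Monte Carlo estimates of the first-order (S_i) and total (S_i^tot) indices."""
    rng = np.random.default_rng(seed)
    A = rng.random((n, d))               # two independent sample matrices in U^d = [0,1]^d
    B = rng.random((n, d))
    fA, fB = f(A), f(B)
    var = np.var(np.concatenate([fA, fB]))
    S, S_tot = np.empty(d), np.empty(d)
    for i in range(d):
        ABi = A.copy()
        ABi[:, i] = B[:, i]              # A with the i-th column taken from B
        fABi = f(ABi)
        S[i] = np.mean(fB * (fABi - fA)) / var            # first-order index (Saltelli estimator)
        S_tot[i] = 0.5 * np.mean((fA - fABi) ** 2) / var  # total index (Jansen estimator)
    return S, S_tot

# toy model: x_0 dominates, x_2 is irrelevant
S, S_tot = sobol_indices(lambda x: x[:, 0] + 0.3 * x[:, 1] ** 2, d=3)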
3 The Danish Eulerian Model
In this section we briefly describe the Danish Eulerian Model (DEM) [8,9]. It is mathematically represented by the following system of partial differential equations, in which the unknown concentrations of a large number of chemical species (pollutants and other chemically active components) take part. The main physical and chemical processes (horizontal and vertical wind, diffusion, chemical reactions, emissions and deposition) are represented in the following system:

\frac{\partial c_s}{\partial t} = - \frac{\partial (u c_s)}{\partial x} - \frac{\partial (v c_s)}{\partial y} - \frac{\partial (w c_s)}{\partial z} + \frac{\partial}{\partial x}\left( K_x \frac{\partial c_s}{\partial x} \right) + \frac{\partial}{\partial y}\left( K_y \frac{\partial c_s}{\partial y} \right) + \frac{\partial}{\partial z}\left( K_z \frac{\partial c_s}{\partial z} \right) + E_s + Q_s(c_1, c_2, \ldots, c_q) - (k_{1s} + k_{2s}) c_s,   s = 1, 2, \ldots, q   (4)
where
– c_s — the concentrations of the chemical species;
– u, v, w — the wind components along the coordinate axes;
– K_x, K_y, K_z — diffusion coefficients;
– E_s — the emissions;
– k_{1s}, k_{2s} — dry/wet deposition coefficients;
– Q_s(c_1, c_2, \ldots, c_q) — non-linear functions describing the chemical reactions between the species under consideration.
The above rather complex system is split into three subsystems (submodels), according to the major physical and chemical processes:

\frac{\partial c_s^{(1)}}{\partial t} = - \frac{\partial (u c_s^{(1)})}{\partial x} - \frac{\partial (v c_s^{(1)})}{\partial y} + \frac{\partial}{\partial x}\left( K_x \frac{\partial c_s^{(1)}}{\partial x} \right) + \frac{\partial}{\partial y}\left( K_y \frac{\partial c_s^{(1)}}{\partial y} \right) = A_1 c_s^{(1)}(t)   (horizontal advection & diffusion)

\frac{\partial c_s^{(2)}}{\partial t} = E_s + Q_s\left( c_1^{(2)}, c_2^{(2)}, \ldots, c_q^{(2)} \right) - (k_{1s} + k_{2s}) c_s^{(2)} = A_2 c_s^{(2)}(t)   (chemistry, emissions & deposition)

\frac{\partial c_s^{(3)}}{\partial t} = - \frac{\partial (w c_s^{(3)})}{\partial z} + \frac{\partial}{\partial z}\left( K_z \frac{\partial c_s^{(3)}}{\partial z} \right) = A_3 c_s^{(3)}(t)   (vertical transport)
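A schematic of how such a splitting is advanced over one time step, applying the three submodel operators one after another, might look as follows. This is an illustrative first-order sequential splitting only; the three operator arguments are placeholders standing in for the actual DEM submodel solvers.

def advance_one_step(c, dt, horizontal_adv_diff, chemistry_deposition, vertical_transport):
    """First-order sequential operator splitting: apply A1, A2 and A3 in turn.
    `c` holds the concentration fields; each argument is a callable submodel solver."""
    c = horizontal_adv_diff(c, dt)     # A1: horizontal advection & diffusion
    c = chemistry_deposition(c, dt)    # A2: chemistry, emissions & deposition
    c = vertical_transport(c, dt)      # A3: vertical transport
    return c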
The Danish Eulerian Model is designed to calculate the pollution over a large geographical region (4800 km × 4800 km), covering the whole of Europe. Spatial and temporal discretization makes each of the above submodels a huge computational task, and high-performance computing is vital for its real-time solution. That is why parallelization has always been a key point in the computer implementation of DEM since its very early stages. A coarse-grain parallelization strategy based on decomposition of the spatial domain appears to be the most efficient and well-balanced on the widest class of present-day parallel machines (with not too many processors), although some restrictions apply. Other parallelizations are also possible and suitable for certain classes of supercomputers [2,3]. For the purpose of this study, a distributed-memory parallelization of the model via MPI is used [1,2]. The parallelization is based on domain decomposition of the horizontal grid, which implies certain restrictions on the number of MPI tasks and requires communication at each time step. Improved data locality for more efficient cache utilization is achieved by using chunks to group properly the small tasks in the chemistry-deposition and vertical exchange stages. Additional pre-processing and post-processing stages are needed for scattering the input data and gathering the results, causing some overhead.
Table 1. Choosable parameters for selecting an optional UNI-DEM version

Parameter   Description        Optional values
NX = NY     Grid size          96 × 96     288 × 288   480 × 480
            Grid step          50 km       16.7 km     10 km
NZ          # layers (2D/3D)   1           10
NEQUAT      # chem. species    35          56          168
The following numerical methods are used in the solution of the submodels:
– Advection–diffusion part: finite elements, followed by predictor–corrector schemes with several different correctors.
– Chemistry–deposition part: an improved version of the QSSA (Quasi Steady-State Approximation); a generic variant of this scheme is sketched after this list.
– Vertical transport: finite elements, followed by theta-methods.
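A generic QSSA-type update for a single species with production rate P and loss rate L (dc/dt = P − L·c) is sketched below; this is a textbook variant given only for illustration, not the exact scheme implemented in the model.

import numpy as np

def qssa_step(c, P, L, dt, eps=1e-30):
    """One QSSA-type update for dc/dt = P - L*c (element-wise over species/grid cells).
    Fast species relax towards the quasi-steady state P/L; slow species follow the
    exponential solution of the linearised equation."""
    L = np.maximum(L, eps)          # guard against division by zero
    c_ss = P / L                    # quasi-steady-state concentration
    return c_ss + (c - c_ss) * np.exp(-L * dt)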
4 DEM, UNI-DEM, and SA-DEM
The development and improvement of DEM throughout the years have led to a variety of versions with respect to grid size/resolution, 2D–3D layering and the number of species in the chemical scheme. The most promising of them have been united under a common driver routine, called UNI-DEM. It provides uniform and easy user access to the available up-to-date versions of the model. These versions and their governing parameters are shown in Table 1. SA-DEM is a modification of UNI-DEM, specially adjusted for sensitivity analysis studies. The main differences are the direct user access to the chemical rates (constants in the original model) and the ability to modify them, either separately or in groups, depending on the dimension of the particular sensitivity analysis study. As a large number of experiments have to be done in order to produce the necessary mesh functions, especially in the higher-dimensional studies, a driver routine that automatically changes certain modification coefficients and restarts the model has been developed. Finally, an additional program for extracting the necessary mean monthly concentrations and computing the variance mesh functions has been developed. The last task is much simpler and not computationally intensive, so we leave it beyond the scope of our supercomputer performance analysis.
5 Performance Results
In this section we present some time and speed-up results, showing the scalability of SA-DEM on two parallel supercomputers. (The ratio between the execution times of an algorithm run sequentially (on one processor) and in parallel (on n processors) on the same machine is called the speed-up on n processors for the corresponding algorithm and machine, and is denoted by Sp(n) throughout this paper.)
Table 2. Time and speed-up of SA-DEM on a SUN Sunfire E25000 at DTU, (96 × 96 × 1) grid, 35 species, CHUNKSIZE=48

# proc.   Advection Time [s] (Sp)   Chemistry Time [s] (Sp)   TOTAL Time [s] (Sp)   E [%]
 1         1832     —               7548      —               9956      —            —
 2          826    (2.2)            3756     (2.0)            5167     (1.9)         96%
 4          482    (3.8)            1901     (4.0)            2945     (3.4)         85%
 8          243    (7.5)             939     (8.0)            1654     (6.0)         75%
16          120   (15.3)             474    (15.9)            1072     (9.3)         58%
32           66   (27.8)             244    (30.9)             824    (12.1)         38%
As in UNI-DEM, the basic run of SA-DEM is for a one-year period. Although for the particular SA study we do not use the whole set of results, but usually take the mean monthly concentrations for a typical summer month (when the ozone concentrations reach their highest levels), the results of one-year runs are generally more accurate and reliable due to the minimal impact of the initial conditions. The results of such runs are presented in the tables of this section. The time and the speed-up (Sp) of the main computational stages of the model, as well as in total, are given in separate columns. The total time also includes the times for the pre-processing and post-processing stages and some data transfer procedures, which are either inherently sequential or cannot be efficiently parallelized. In addition, the total efficiency E (in percent) is given in the last column, where E = (Sp(n)/n) · 100% and n is the number of processors (given in the first column). The Sunfire E25000 system consists of 72 dual-core UltraSPARC IV processors (1350 MHz, 2-level cache). The machine (called 'newton') is part of the high-performance cluster of the Technical University of Denmark (DTU). It has been used for a long time for development of and experiments with UNI-DEM, so the codes are well tuned on it. The results of one-year experiments with SA-DEM on the Sunfire E25k are presented in Table 2. The IBM Blue Gene/P is a state-of-the-art high-performance system with 8192 CPUs in total and a theoretical peak performance of more than 23 TFLOPS. It is built of 2048 quad-core PowerPC 450 nodes (4 CPUs, 850 MHz, 2 GB RAM each). There is 8 MB L3 cache per node and 32 KB (private) L1 cache per CPU. The system was installed in autumn 2008 in DAITS, Sofia. The results of one-year experiments with SA-DEM on the Blue Gene/P are presented in Table 3.
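The speed-up and efficiency columns in Tables 2 and 3 follow directly from the measured times; a trivial helper (ours, not part of the SA-DEM code) that reproduces them:

def speedup_and_efficiency(times, procs):
    """times[i] is the wall-clock time on procs[i] processors; procs[0] must be 1."""
    t1 = times[0]
    sp = [t1 / t for t in times]                          # Sp(n) = T(1) / T(n)
    eff = [100.0 * s / n for s, n in zip(sp, procs)]      # E = (Sp(n)/n) * 100%
    return sp, eff

# total times from Table 2 (SUN Sunfire E25000)
sp, eff = speedup_and_efficiency([9956, 5167, 2945, 1654, 1072, 824], [1, 2, 4, 8, 16, 32])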
6 Sensitivity Analysis Experiments
Chemistry is the most difficult and the most time consuming part of the model, as one can see in the tables from the previous section. It is described as a complex non-linear system for calculating the unknown concentrations. However, other important parameters take part in the calculations and among them – the
Table 3. Time and speed-up of SA-DEM on the IBM Blue Gene/P, (96 × 96 × 1) grid, 35 species, CHUNKSIZE=48

# proc.   Advection Time [s] (Sp)   Chemistry Time [s] (Sp)   TOTAL Time [s] (Sp)   E [%]
 1         5456     —               32099     —               38848     —            —
 2         2748    (2.0)            16067    (2.0)            19863    (2.0)         98%
 4         1373    (4.0)             8081    (4.0)            10283    (3.8)         95%
 8          698    (7.8)             4054    (7.9)             5690    (6.8)         85%
16          366   (14.9)             2013   (15.9)             3265   (11.9)         74%
24          278   (19.6)             1355   (23.8)             2482   (15.7)         65%
32          229   (23.8)             1008   (31.8)             2198   (17.6)         55%
48          158   (34.5)              673   (47.7)             1866   (20.8)         43%
chemical rate constants. There are 69 time-dependent and 47 time-independent chemical reactions in the condensed Carbon Bond IV Mechanism (CBM-IV) used in DEM [9]. The intensity of each reaction is determined by the corresponding chemical rate constant. For the purpose of the sensitivity analysis it can be considered as a random variable. In the first stage of our study we had to determine the most important chemical rate constants with respect to a given criterion. Ozone is known to be one of the most dangerous pollutants in the air. That is why the mean monthly ozone concentration was chosen as the primary criterion. By extensive experiments with perturbation of a large number of coefficients of the suspicious chemical reactions, we tried to determine the most critical of them. The results of these experiments revealed the (time-dependent) chemical reaction #22, NO + OH → HNO2, as one of the most critical. Two others were also chosen to take part in the SA study: #3 (time-dependent) and #6 (time-independent). By running SA-DEM with a given set of perturbation factors α = (α_i)_{i=1}^{d} (d = 3 in our case, according to the above choice), we generate an output of the following type:

r_s(\alpha) = \frac{c_s^{\alpha}(a_s, b_s)}{c_s(a_0, b_0)},   \alpha \in \{0.1, 0.2, \ldots, 2.0\}^d,   (5)

where s is the index of the chemical species (s = 1, …, 35) and the denominator is the mean monthly concentration of this species without any perturbation (α = 1) at the point where the maximal mean monthly concentration of a fixed chemical species k over the spatial domain G is achieved (c_k^{max} = c_k(a_0, b_0) = max_{(a,b)∈G} {c_k(a, b)}).
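The mesh functions r_s(α) in (5) are produced by repeatedly re-running the model over a grid of perturbation factors. A schematic of this driver loop is given below; `run_sa_dem` is a cheap synthetic placeholder standing in for a full SA-DEM run, so the sketch only illustrates the bookkeeping, not the actual model code.

import itertools
import numpy as np

def run_sa_dem(perturbations):
    """Placeholder for a full SA-DEM run returning a mean monthly concentration.
    Replaced here by a synthetic response so that the loop is runnable."""
    return 1.0 + 0.1 * sum(a - 1.0 for a in perturbations.values())

factors = np.round(np.arange(0.1, 2.01, 0.1), 1)    # alpha components in {0.1, 0.2, ..., 2.0}
reactions = (22, 3, 6)                               # perturbed reactions, d = 3
baseline = run_sa_dem({r: 1.0 for r in reactions})   # unperturbed run (alpha = 1)

mesh = {}                                            # discrete mesh function r_s(alpha), cf. (5)
for alpha in itertools.product(factors, repeat=len(reactions)):
    mesh[alpha] = run_sa_dem(dict(zip(reactions, alpha))) / baseline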
Fig. 1. July’1996 sensitivity of the mean monthly concentrations of 5 chemical species to the RC22 perturbation factors in the interval [0.1, 2.0]
Fig. 2. Sensitivity of the ozone mean monthly concentrations for 5 consecutive years to the RC22 perturbation factors in the interval [0.1, 2.0]
indicates that the sensitivity to the chemical rate constants is not strongly affected by the variations of the meteorological conditions. By generating the values r_s(α) (mesh functions in G), the first stage of the SA procedure is completed. The second stage consists of two steps: (i) approximation of the mesh functions and (ii) computation of the Sobol' global sensitivity indices, but those are beyond the scope of this paper.
7 Conclusions and Plans for Future Work
A three-stage variance-based sensitivity analysis method is proposed and applied to the Danish Eulerian Model. In the first stage, many numerical experiments with perturbation of some reaction coefficients were executed, producing the necessary mesh functions for the next stage of the proposed sensitivity analysis method. For that purpose a new SA-oriented version of the model (SA-DEM) was created. It was implemented on two very powerful parallel supercomputers, showing good scalability on both machines. The next stage of this SA research includes approximation of the mesh functions by polynomials of third/fourth degree, or by cubic B-spline functions. A Monte Carlo integration method is then applied to the results. The study may lead to improvements in the chemical scheme of the model as well as to its more efficient use in some time-critical applications. These results will be presented in another paper. Other near-future plans include:
– Extending the abilities and improving the performance of SA-DEM;
– Experiments with SA-DEM on the 3D coarse grid (96 × 96 × 10), as well as on finer-resolution grids (storage permitting);
– Studies of the model sensitivity with respect to the emission levels and the boundary conditions.
Acknowledgments. This research is supported by the Bulgarian NSF grants DO 02-115/2008 "Supercomputer Applications" and DO 02-215/2008.
References
1. Dimov, I., Georgiev, K., Ostromsky, T., Zlatev, Z.: Computational challenges in the numerical treatment of large air pollution models. Ecological Modelling 179, 187–203 (2004)
2. Ostromsky, T., Zlatev, Z.: Parallel Implementation of a Large-scale 3-D Air Pollution Model. In: Margenov, S., Waśniewski, J., Yalamov, P. (eds.) LSSC 2001. LNCS, vol. 2179, pp. 309–316. Springer, Heidelberg (2001)
3. Ostromsky, T., Zlatev, Z.: Flexible Two-level Parallel Implementations of a Large Air Pollution Model. In: Dimov, I.T., Lirkov, I., Margenov, S., Zlatev, Z. (eds.) NMA 2002. LNCS, vol. 2542, pp. 545–554. Springer, Heidelberg (2003)
4. Saltelli, A., Tarantola, S., Campolongo, F., Ratto, M.: Sensitivity Analysis in Practice: A Guide to Assessing Scientific Models. Halsted Press, New York (2004)
5. Sobol, I.M.: Sensitivity estimates for nonlinear mathematical models. Mathematical Modeling and Computational Experiment 1, 407–414 (1993)
6. Sobol, I.M.: Global Sensitivity Indices for Nonlinear Mathematical Models and Their Monte Carlo Estimates. Mathematics and Computers in Simulation 55(1-3), 271–280 (2001)
7. Sobol, I.M.: Theorem and examples on high dimensional model representation. Reliability Engineering and System Safety 79, 187–193 (2003)
8. WEB-site of the Danish Eulerian Model, http://www.dmu.dk/AtmosphericEnvironment/DEM
9. Zlatev, Z.: Computer treatment of large air pollution models. Kluwer, Dordrecht (1995)
Advanced Results of the PM10 and PM2.5 Winter 2003 Episode by Using MM5-CMAQ and WRF/CHEM Models

Roberto San José1, Juan L. Pérez1, José L. Morant1, F. Prieto2, and Rosa M. González3

1 Environmental Software and Modelling Group, Computer Science School, Technical University of Madrid - UPM, Campus de Montegancedo, Boadilla del Monte, 28660 Madrid, Spain
[email protected]
http://artico.lma.fi.upm.es
2 Department of Ecology, Building of Sciences, University of Alcalá, 28871 Alcalá de Henares, Madrid, Spain
3 Department of Meteorology and Geophysics, Faculty of Physics, Complutense University of Madrid – UCM, Ciudad Universitaria, 28040 Madrid, Spain
[email protected]

Abstract. During the winter of 2003 there was a special particulate episode over Germany. The MM5-CMAQ model (PSU/NCAR/EPA, US) has been applied to simulate the high PM10 and PM2.5 concentrations during this winter episode in Central Europe. The selected period is January 15 – April 6, 2003. Daily mean concentrations of up to 75 µg m−3, averaged over several monitoring stations in Northern Germany, are found. Additionally, the WRF/CHEM (NOAA, USA) model has been applied. In this contribution we have performed additional simulations to improve the results obtained in our contribution San José et al. (2008) [5]. We have run both models again, but with changes in the emission inventory and the turbulence scheme for MM5-CMAQ. In the case of WRF/CHEM many more changes have been made: the Lin et al. (1983) microphysics scheme has been substituted by the WSM 5-class single-moment microphysics scheme (Hong et al. 2004); the Goddard radiation scheme has been substituted by the Dudhia radiation scheme; and the FTUV photolysis model has been substituted by the J-FAST photolysis model. The results improve substantially the PM10 and PM2.5 patterns in both models. The correlation coefficient for PM10 for the 80-day simulation period and for daily averages has been increased up to 0.851, and in the case of PM2.5 it has been increased up to 0.674.

Keywords: Emissions, PM10 and PM2.5, air quality models, air particles.
1 Introduction
During recent years, investigations related to different pollution episodes have increased substantially. Simulations of elevated PM10 and PM2.5
concentrations have always been underestimated by modern three-dimensional air quality modelling tools. This fact has attracted much more attention among researchers in recent years. Three-dimensional air quality models have been developed during the last 15-20 years and substantial progress has occurred in this research area. These models are composed of a meteorological driver and a chemical and transport module. Examples of meteorological drivers are MM5 (PSU/NCAR, USA) [5], RSM (NOAA, USA), ECMWF (Reading, UK), HIRLAM (Finnish Meteorological Institute, Finland) and WRF [3]; examples of dispersion and chemical transport modules are EURAD (University of Cologne, Germany), EUROS (RIVM, The Netherlands), EMEP Eulerian (DNMI, Oslo, Norway), MATCH (SMHI, Norrkoping, Sweden) [2], REM3 (Free University of Berlin, Germany), CHIMERE (ISPL, Paris, France), NILU-CTM (NILU, Kjeller, Norway), LOTOS (TNO, Apeldoorn, The Netherlands), DEM (NERI, Roskilde, Denmark), the OPANA model based on the MEMO and MM5 mesoscale meteorological models with the chemistry solved on-line [4], STOCHEM (UK Met. Office, Bracknell, UK) [1], and CMAQ (Community Multiscale Air Quality modelling system), developed by EPA (USA). In the USA, CAMx (Environ Inc.), STEM-III (University of Iowa) and the CMAQ model are the most up-to-date air quality dispersion chemical models. In this application we have used the CMAQ model (EPA, USA), which is one of the most complete models and includes aerosol, cloud, and aqueous chemistry. In this contribution we present results from two simulations by two different models. The first air quality modelling system is MM5-CMAQ, a mature modelling system based on the MM5 mesoscale non-hydrostatic meteorological model and the dispersion and chemical transport module CMAQ. The second tool is the WRF/CHEM air quality modelling system, which is an on-line (one code, one system) tool to simulate air concentrations based on the WRF meteorological driver. In WRF/CHEM the chemistry transport and transformations are embedded into WRF as part of the code, so that the interactions between many meteorological and climate variables and the chemistry can be investigated. WRF/CHEM is developed by NOAA/NCAR (USA). The advantage of on-line models is the capability to analyze all variables simultaneously and to account for all interactions (or at least as many as possible) with a fully modular approach.
2 PM10 and PM2.5 Episode
During the period January 15, 2003 to April 5, 2003, in central Europe (mainly northern part of Germany), we observe three high peaks on PM10 and PM2.5 values in several monitoring stations located in the area of North-East of Germany. The daily averages of PM10 concentrations were close to 80 µgm−3 and higher than 70 µgm−3 for PM2.5 concentrations. These values are about 4-5 times higher than those registered as “normal” values. The first peak on PM10 and PM2.5 concentrations has occurred from February 1 to February 15. During this period of time, Central Europe was under the influence of a high-pressure
system coming from Russia through Poland and Southern Scandinavia. In the northern part of Germany we found south-easterly winds and stable conditions with low wind speeds. These meteorological conditions brought daily PM10 concentrations of about 40 µg m−3. The second peak was characterized by a sharp gradient in PM10 concentrations from February 15 to March 7. This episode reached daily PM10 concentrations of up to 70 µg m−3. The meteorological conditions on March 2 (peak values) were characterized by a wind rotation composed of south-easterly winds from Poland over the north of Germany and north-westerly and westerly winds in the central part of Germany. Finally, a third peak, with values of about 65 µg m−3 on March 27, started on March 20 and ended on April 5, 2003, and had a similar structure and causes as the second one. The observational data used to compare with the modelling results are described in San José et al. (2008).
3 Emission Data
In both models, we have applied the TNO emissions as area and point sources with a geographical resolution of 0.125° latitude by 0.25° longitude, covering all of Europe. The emission totals by SNAP activity sectors and countries agree with the baseline scenario of the Clean Air for Europe (CAFE) program. This database gives the PM10 and PM2.5 totals for the primary particle emissions. We also took from CAFE the PM splitting sub-groups, the height distribution and the breakdown of the annual emissions into hourly emissions. The PM2.5 fraction of the particle emissions was split into an unspecified fraction, elemental carbon (EC) and primary organic carbon (OC). The EC fraction of the PM2.5 emissions for the different SNAP sectors was taken from the EMEP emission inventory. For the OC fraction, the method proposed by is applied as follows: an average OC/EC emission ratio of two was used for all sectors, i.e. the OC fraction was set to twice the EC fraction, except if the sum of the two fractions exceeds unity. In this case (fEC > 0.33), fOC was set as fOC = 1 − fEC. With this prepared input, WRF/CHEM and CMAQ took the information as it is. The hourly emissions are derived using sector-dependent monthly, daily and hourly emission factors as used in the EURODELTA (http://aqm.jrc.it/eurodelta/) exercise. The differences from the simulations of [5] for MM5-CMAQ are as follows: Albania, Croatia, Bosnia, and Serbia use the Bulgaria daily factors; Turkey uses the Hungary daily factors; Belarus, Moldavia, Ukraine, and Russia use the Romania daily factors; Germany uses the Federal Republic of Germany daily factors; the Czech Republic uses the Slovakia monthly factors. The VOC to TOC factor is 1.14. In the case of WRF/CHEM the changes are the same as for MM5-CMAQ, but the VOC to TOC factor in the VOC splitting scheme is changed to 3.2.
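The OC/EC splitting rule described above can be written compactly; the sketch below is our reading of that rule, not code from the emission pre-processor:

def oc_fraction(f_ec):
    """OC fraction of PM2.5: twice the EC fraction, capped so that f_EC + f_OC <= 1."""
    return 2.0 * f_ec if f_ec <= 1.0 / 3.0 else 1.0 - f_ec

print(oc_fraction(0.2), oc_fraction(0.4))   # 0.4 and 0.6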
4 MM5-CMAQ and WRF-CHEM Architectures and Configurations
MM5 was set up with two domains: a mother domain with 60 x 60 grid cells at 90 km spatial resolution and 23 vertical layers, and a nested domain with 61 x 61 grid cells at
30 km spatial resolution with 23 vertical layers. The central point is set at 50.0 N and 10.0 E. The model is run with a Lambert Conformal Conic projection. The CMAQ domain is slightly smaller, following the CMAQ architecture rules. We use reanalysis T62 (209 km) datasets as 6-hour boundary conditions for MM5 with 28 vertical sigma levels and nudging with meteorological observations for the mother domain. We run MM5 with its two-way nesting capability. We use the Kain-Fritsch 2 cumulus parameterization scheme, the MRF PBL scheme, the Schultz microphysics scheme and the Noah land-surface model. In CMAQ we use clean boundary profiles for initial conditions, the Yamartino advection scheme, ACM2 for vertical diffusion, the EBI solver and the aqueous/cloud chemistry with the CB05 chemical scheme. Since our mother domain includes significant areas outside of Europe (North of Africa), we have used the EDGAR emission inventory with the EMIMO 2.0 emission model approach to fill those grid cells with hourly emission data. The VOC emissions are treated by SPECIATE Version 4.0 (EPA, USA) and for the lumping of the chemical species we have used the procedure for 16 different groups. We use our BIOEMI scheme for biogenic emission modeling. The classical Aitken, accumulation and coarse modes are used (MADE/SORGAM modal approach). In the WRF/CHEM simulation we have used only one domain with 30 km spatial resolution, similar to MM5. We have used the Lin et al. (1983) scheme for the microphysics and the Yamartino scheme for the boundary layer parameterization and for the biogenic emissions. The MOSAIC sectional approach is used with 4 modes for particle modeling.
5 Changes in Model Configurations
In the case of MM5-CMAQ, the changes in the model simulations compared with the report of [5] affect only the emissions (as explained above) and the Kz (eddy diffusivity) coefficient. The option to use the so-called KZMIN, as detailed in the CMAQ code, is applied. If KZMIN is activated, the Kz coefficient is calculated by

Kz = KzL + (KzU − KzL) · UFRAC,   (1)

where Kz is the eddy diffusivity in m2 s−1, KzL is 0.5 (lowest) and KzU is 2.0 (highest). UFRAC represents the fraction (range 0–1) of urban land use in the grid cell. In the case of WRF/CHEM, the changes affect the microphysics scheme, which is substituted by the WSM (WRF Single Moment) 5-class microphysics scheme; 5 is the number of water species predicted by the scheme. The Goddard/NASA radiation scheme is substituted by the Dudhia radiation scheme. The FTUV photolysis rate model is substituted by the FAST-J scheme.
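Equation (1) amounts to a simple linear blend between the two bounds quoted in the text; an illustrative one-liner (not the CMAQ source):

def kz_min(ufrac, kz_low=0.5, kz_up=2.0):
    """Minimum eddy diffusivity Kz [m^2/s], Eq. (1): Kz = KzL + (KzU - KzL) * UFRAC."""
    return kz_low + (kz_up - kz_low) * ufrac

kz = kz_min(0.3)   # e.g. a grid cell with 30% urban land use -> 0.95 m^2/s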
6 Model Results
The comparison between daily average values (averaged over all monitoring stations) of PM10 concentrations and modeled values has been performed with
Fig. 1. Comparison between daily averaged observed PM10 concentrations and model results produced by MM5-CMAQ. The model gets closer to the maximum peak compared with the previous simulation in [5].
Fig. 2. Comparison between daily averaged observed PM10 concentrations and model results produced by WRF/CHEM. The model captures even better than in the previous simulation [5] the magnitude of the PM10 peaks.
Fig. 3. Comparison between daily averaged observed PM2.5 concentrations and model results produced by MM5-CMAQ. The model gets closer to the simulation performed in [5].
Fig. 4. Comparison between daily averaged observed PM2.5 concentrations and model results produced by WRF/CHEM. The model overestimates a little bit the observed data but the correlation coefficient gets a light improvement (up to 0.759).
several statistical measures: calculated mean/observed mean; calculated STD/observed STD; bias; squared correlation coefficient (R2); RMSE (root mean squared error)/observed mean; percentage within +/− 50%; and the number of data sets. Figure 1 shows the comparison between the PM10 observed averaged daily values and the values modeled by MM5-CMAQ. The results show that for MM5-CMAQ the new configuration of the emission data and eddy diffusivity improves the correlation coefficient from 0.828 to 0.851, and the pattern shows a substantial improvement, with the central peak much closer to the observed data. Figure 2 shows the comparison between observed and modeled average daily data for the episode with the new configuration of the WRF/CHEM model. The results show a much better correlation coefficient, going from 0.782 to 0.852 with the new configuration. Figures 3 and 4 show similar results for PM2.5. In the case of MM5-CMAQ the improvement is from 0.608 to 0.674, and for WRF/CHEM the change is from 0.760 to 0.759. These results show that the new configuration is substantially better than the previous one. New experiments are needed to determine the impact of the emissions and the eddy diffusivity, respectively.
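The comparison statistics listed above are straightforward to compute from paired daily means. A minimal sketch following those definitions (ours, not from the paper; the ±50% criterion is interpreted here relative to the observed value, which is an assumption):

import numpy as np

def comparison_stats(obs, mod):
    obs, mod = np.asarray(obs, float), np.asarray(mod, float)
    rmse = np.sqrt(np.mean((mod - obs) ** 2))
    return {
        "Cmean/Omean": mod.mean() / obs.mean(),
        "Cstd/Ostd": mod.std() / obs.std(),
        "bias": np.mean(mod - obs),
        "R2": np.corrcoef(obs, mod)[0, 1] ** 2,           # squared correlation coefficient
        "RMSE/Omean": rmse / obs.mean(),
        "within_50pct": 100.0 * np.mean(np.abs(mod - obs) <= 0.5 * obs),
        "N": obs.size,
    }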
7 Conclusions
We have implemented and re-run two different models (MM5-CMAQ and WRF/CHEM) for the same episode over the northern part of Germany during the winter period of 2003 (Jan. 15 – Apr. 5, 2003). The comparison between these simulations and those performed in [5] gives the following results: we have substantially improved the correlation coefficients for the daily averages when comparing observed and modelled data for both models. WRF/CHEM continues to show better results than MM5-CMAQ, but the PM10 and PM2.5 peaks for MM5-CMAQ are getting closer to the observed peaks. The patterns for MM5-CMAQ have improved substantially compared with the results obtained in [5]. New experiments are necessary to determine the impact of the eddy diffusivity and the emission inventory on the new results.
Acknowledgements. We would like to thank Dr. Peter Builtjes (TNO, The Netherlands) for his initial guidance and suggestion for this experiment, and also the COST 728 project (EU), where the inter-comparison experiment was proposed. The authors thankfully acknowledge the computer resources, technical expertise and assistance provided by the Centro de Supercomputación y Visualización de Madrid (CeSVIMa) and the Spanish Supercomputing Network.
References
1. Collins, W.J., Stevenson, D.S., Johnson, C.E., Derwent, R.G.: Tropospheric ozone in a global scale 3D Lagrangian model and its response to NOx emission controls. J. Atmos. Chem. 86, 223–274 (1997)
2. Derwent, R., Jenkin, M.: Hydrocarbons and the long-range transport of ozone and PAN across Europe. Atmospheric Environment 8, 1661–1678 (1991)
3. Grell, G.A., Dudhia, J., Stauffer, D.R.: A description of the Fifth-Generation Penn State/NCAR Mesoscale Model (MM5), NCAR/TN-398+STR. NCAR Technical Note (1994)
4. Jacobson, M.Z., Turco, R.P.: SMVGEAR: A sparse-matrix, vectorized GEAR code for atmospheric models. Atmospheric Environment 28(2), 273–284 (1994)
5. San José, R., Pérez, J.L., Morant, J.L., González, R.M.: Elevated PM10 and PM2.5 concentrations in Europe: a model experiment with MM5-CMAQ and WRF/CHEM. WIT Transactions on Ecology and the Environment 116, 3–12 (2008)
Four-Dimensional Variational Assimilation of Atmospheric Chemical Data – Application to Regional Modelling of Air Quality

Achim Strunk, Adolf Ebel, Hendrik Elbern, Elmar Friese, Nadine Goris, and Lars Peter Nieradzik

Rhenish Institute for Environmental Research at the University of Cologne
[email protected]
http://www.riu.uni-koeln.de

Abstract. The chemistry transport model system EURAD-IM and its variational data assimilation implementation are applied to air quality assessment problems. The facility of joint initial value and emission rate optimisation, together with a nested 4d-var module, is applied to the measurement campaign BERLIOZ. To emphasise the benefits of computationally efficient data assimilation for monitoring and simulating air quality, results of an operational 3d-var implementation are also given.
1 Introduction
The main objective of data assimilation (DA) is to provide an image of a system's state on regular grids that is as accurate and consistent as possible. This is to be accomplished by taking into account all available information, consisting of procedural knowledge coded in numerical models and declarative knowledge like climatological information, observations and model forecasts. Due to the variety of information sources, given by 1) observations with heterogeneous accuracy, representativeness and spatial and temporal density, 2) retrieval methods of different kinds and reliability and 3) defective and incomplete model simulation results, only advanced DA techniques – like four-dimensional variational DA (4d-var) – and inverse modelling provide methods for appropriate data fusion and analysis. In traditional (meteorological) DA, the optimal guess of the system's state space variables is the focus of analysis applications, providing new initial conditions for successive forecasts ([9]). Regarding chemical applications in the troposphere, air-surface interactions like emissions have a strong impact on trace gas distributions, while being at the same time insufficiently known ([5,6]). This fact suggests generalising air quality DA to an inversion problem, allowing for the joint analysis of various kinds of model parameters. This exposition will show air quality assessment results from DA applications using variational approaches implemented in the advanced chemistry-transport model (CTM) EURAD-IM.
The paper is dedicated to Zahari Zlatev on the occasion of his 70th birthday.
2 Four-Dimensional Variational Data Assimilation
Using variational data assimilation techniques, the analysis task is formulated as a minimisation problem of the variational calculus. Emission rates are included in the set of control parameters, optimised together with the initial values. Following [5], we want to preserve the daily cycle of each emitted species in each grid cell, while the total amount of emitted mass is scaled by a time-invariant emission factor, which is optimised instead. This serves as a temporal regularisation and provides suitable constraints for the minimisation procedure. The theoretical outline will be discussed briefly, resting on [6]. Climatological information or short-range forecasts provide a background field x^b of the state space, while the analogous background emission estimate e^b is usually obtained from emission inventories. The differences between the observations y_t and the model equivalents H(x_t) at time t are called innovations and are given by d_t = y_t − H(x_t), with the observation operator H mapping from the model state space to the observations. The model state x_t evolves from the initial conditions x_0 by a model integration, using emission factors f. In order to obtain an incremental formulation of a cost function ([3]), we use perturbations of the initial state δx_0 = x_0 − x_0^b and of logarithmically transformed emission factors δu = u − u^b = ln(f) − ln(f^b). With B and K being the background error covariance matrices of the background initial state and the emission factors, respectively, we further transform the control variables to v = B^{-1/2} δx_0 and w = K^{-1/2} δu, in order to suitably precondition the ill-posed optimisation problem. This leads to the scalar-valued cost function J with

J(v, w) = \frac{1}{2} v^T v + \frac{1}{2} w^T w + \frac{1}{2} \sum_{j=0}^{N} [d_j - H \delta x_j]^T R^{-1} [d_j - H \delta x_j],   (1)

with H being a linearised approximation of H and R the observation error covariance matrix. This cost function can be minimised using gradient descent or quasi-Newton algorithms, given the availability of the gradient of the cost function with respect to z = (v, w)^T:

\nabla_z J = \begin{pmatrix} v \\ w \end{pmatrix} - \begin{pmatrix} B^{T/2} & 0 \\ 0 & K^{T/2} \end{pmatrix} \sum_{j=0}^{N} M_j^T H^T R^{-1} [d_j - H \delta x_j],   (2)

where M_j^T is the adjoint operator of the tangent linear M of the model operator M in δx_t = M_{t,0} δz, and H^T is the adjoint of the linearised observation operator.
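A toy numerical illustration of (1)–(2), restricted to the initial-value part only (emission factors omitted) and using small synthetic linear operators. This is not EURAD-IM code; it merely sketches, under those simplifying assumptions, how the preconditioned cost function and its adjoint-based gradient fit together with a quasi-Newton minimiser.

import numpy as np
from scipy.optimize import minimize

n, m, N = 4, 2, 5                                   # state size, obs per time, obs times
rng = np.random.default_rng(0)
M = 0.95 * np.eye(n) + 0.05 * rng.random((n, n))    # toy linear "model" operator
H = rng.random((m, n))                              # toy linearised observation operator
B12 = 0.5 * np.eye(n)                               # square root of B
Rinv = np.eye(m) / 0.1 ** 2                         # inverse observation error covariance
d = [rng.normal(size=m) for _ in range(N + 1)]      # synthetic innovations d_j

def cost_and_grad(v):
    """J(v) and its gradient; delta_x_j = M^j B^{1/2} v, cf. Eqs. (1)-(2)."""
    J, grad = 0.5 * v @ v, v.copy()
    dx, Mj = B12 @ v, np.eye(n)                     # delta_x_0 and M^0
    for j in range(N + 1):
        r = d[j] - H @ dx
        J += 0.5 * r @ Rinv @ r
        grad -= B12.T @ Mj.T @ H.T @ Rinv @ r       # adjoint chain M_j^T H^T R^{-1} r_j
        dx, Mj = M @ dx, M @ Mj
    return J, grad

res = minimize(cost_and_grad, np.zeros(n), jac=True, method="L-BFGS-B")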
3 The EURAD-IM Chemistry Transport Model
The European Air Pollution Dispersion chemistry-transport model and its inverse modelling module (EURAD-IM) is a comprehensive Eulerian model operating from hemispheric to local scale. It involves transport, diffusion, chemical
transformation of about 60 chemical species (depending on the chosen mechanism), and wet and dry deposition of tropospheric trace gases ([7,8]). Moreover, the inverse modelling system allows for the application of variational data assimilation techniques, focussing on episodic scenario analysis and operational forecast enhancements in a near-real-time setup. As meteorological driver the Penn State/NCAR mesoscale model MM5 is used. The emission data in this exposition are produced based on available EMEP inventories of different years of validity. In addition to the (forward) integrating CTM, the 4d-var data assimilation system includes the associated adjoint operators for the gas phase mechanisms and the transport and diffusion schemes. The system is completed by a quasi-Newton minimisation procedure (L-BFGS) and a sophisticated covariance module for the initial value background error covariance matrix (BECMiv), including a suitable preconditioning strategy. The provision of BECMiv, which is hampered by insufficient knowledge and the impracticality of storing a matrix with more than 10^12 entries in the case of a real-world CTM, uses the algorithm proposed by [11]. The authors advise employing a generalised diffusion approach to provide correlations between background state errors by an operator. This technique allows for flexible applications including inhomogeneous and anisotropic correlation features. An emission factor background error covariance matrix has been constructed using emission statistics from EMEP, broken down by polluter groups. It is coded as a block diagonal matrix describing emission rate correlations between emitted constituents. The implementation of the covariance matrices in EURAD-IM is described in detail by [6]. The 4d-var setup with its adjoint operators enforces saving the complete forward (direct) model trajectory for later retrieval during the backward (adjoint) integration. Due to the enormous amount of data to save, the current implementation of EURAD-IM applies a two-level recalculation strategy, resulting in two additional forward integrations during the backward sweep. Therefore, one complete iteration takes about five times the duration of a single forward simulation, necessitating a parallel implementation. This is realised using a domain decomposition method and the communication facilities of MPI. The implementation within EURAD-IM is described in detail in [4] and [6]. Due to discontinuities in the EURAD-IM particle phase module MADE ([1]), the adjoint of this module is not yet available. The examples of data assimilation including particle phase components shown below are thus limited to the application of three-dimensional variational (3d-var) data assimilation.
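A minimal one-dimensional sketch of the generalised diffusion idea of [11]: applying a diffusion operator repeatedly to an impulse yields a smooth, quasi-Gaussian correlation shape. The operational implementation is multidimensional, normalised and allows for inhomogeneity and anisotropy; the snippet below is illustrative only and not part of the EURAD-IM covariance module.

import numpy as np

def apply_diffusion(v, n_steps=50, kappa=0.25):
    """Apply n_steps of explicit 1-D diffusion (periodic boundaries) to the vector v."""
    for _ in range(n_steps):
        v = v + kappa * (np.roll(v, 1) - 2.0 * v + np.roll(v, -1))
    return v

impulse = np.zeros(101)
impulse[50] = 1.0
correlation_shape = apply_diffusion(impulse)   # bell-shaped row of the implied correlation operator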
4 Applications and Results
As mentioned above, one core feature of the 4d-var system is the joint optimisation of initial values and emission rates. A short example of the motivation is given in the following: Figure 1 shows the ozone evolution at the Austrian station St. Pölten on June 4, 2003. Having assimilated the observations of the first six hours only, the properties of optimising the different sets of control parameters are obvious: initial value optimisation strongly improves the forecast skill, while still overestimating the ozone values in the second part of the
Fig. 1. Observed and analysed ozone evolution at the Austrian station St. Pölten using different sets of control parameters. Grey shaded: window of assimilated observations (00-06 UTC). Vertical bars: ozone observations with error estimates. Dashed line: control run without data assimilation. Dotted line: initial value optimisation. Dash-dotted line: emission factor optimisation. Solid line: joint initial value and emission factor optimisation.
day. Emission-based optimisation strongly suppresses ozone precursor emissions (due to the ozone overestimation of the control run during the assimilation interval) and thus leads to almost completely removed ozone at the end of the day. Only jointly adjusting emission rates and initial values allows for an almost perfect fit to the observed ozone evolution. The following two sections will show results for 1) a nested application of the 4d-var system to an ozone episode and 2) the operational implementation of a 3d-var system and its benefits for the forecast skill.
4.1 Nested 4d-Var Application to the BERLIOZ Campaign
The main stimulus of the BERLIOZ campaign ([10]) in July and August 1998 was to analyse the transport and chemical processes responsible for photo-oxidant formation and the contribution of ozone precursors to its formation in the Berlin area. Due to the limited success of analysing reactive emitted constituents like NOx at coarse resolutions, the EURAD-IM simulation setup for the BERLIOZ campaign includes a coarse grid with a horizontal grid spacing of 54 km and two recursively nested domains with a nesting ratio of 3, resulting in a sequence of 54-18-6 km. The deepest nested domain (nest 2) covers large parts of Eastern Germany with the greater Berlin area, which is the region of main interest (see Figure 3). The assimilated observational data include ground-based observations of O3, NO, NO2, CO, SO2, benzene, HNO3, H2O2, CH2O, and PAN within an assimilation window from 06 to 20 UTC on July 20, 1998. Figure 2 shows nitrogen oxides results for two selected stations in the greater Berlin area. A significantly improved simulation skill can be seen when analysis results for nest level 2 are used, even for the subsequent forecast. Hence, observed constituents of strong
Fig. 2. Observations with error estimates (vertical bars) and model realisations on nest level 2 for nitrogen oxides at two selected stations in the greater Berlin area on July 20-21. Dashed line: control run (no assimilation). Dotted line: first guess run (based on analyses for nest 0 and nest 1). Solid line: analysis run (based on the analysis for nest 2). Assimilated observations: 06-20 UTC on July 20 (grey shaded); later observations used for quality control only.
local variability are now sufficiently well predicted by nested analysis results applying joint initial value and emission rate optimisation. Figure 3 exhibits the analysed emission factors for NO2 at nest level 2, showing moderate values close to 1. Thus, the analysis runs do not enforce a vigorous change of the emission inventory to reproduce the observational information, and indicate a reasonably well estimated emission inventory. These optimised emission rates are a direct estimate of improved surface fluxes. The flexibility of the variational technique in taking into account various types of observations, like temporally and spatially dense earth observations, will be emphasised in the following. This is of even more importance since satellite retrievals for tropospheric height levels are a strongly emerging issue in earth
Fig. 3. Optimised emission factors for NO2 on BERLIOZ nest 2, as obtained by joint optimisation of initial values and emission factors. The different optimisation stages in terms of boxes of nest levels 0, 1 and 2 are obvious. Each analysis on the recursive nestlevels refines the previous determined scaling factors, leading to a more and more refined emission rate analysis.
Fig. 4. Assimilation results on the European grid for GOME tropospheric NO2 columns on July 20, 1998. Upper left: retrievals; upper right: total model column (analysis); lower left: model equivalents (background); lower right: model equivalents (analysis). Column densities are in 1015 molec/cm2 .
observation with a focus on air quality monitoring. The results of assimilating nitrogen dioxide tropospheric column retrievals (provided by KNMI, [2]) of GOME observations – which come along with averaging kernel and footprint information – are shown in Figure 4. The observations show features with elevated NO2 column densities coinciding with higher expected NO2 surface concentrations, namely the area from London over the Channel and the Netherlands/Belgium to the German Rhine-Ruhr area, as well as Paris and the Spanish north coast. Although underestimated, the higher NO2 column densities over the Channel and the adjacent areas already become apparent in the model equivalents of the background model simulation. The structures in the observed NO2 column densities are reproduced by the analysis run in a remarkable manner. The remaining discrepancies coincide with very low observed values, which the model does not reproduce. To emphasise the kind of information contained in the observed column densities including footprint information, the total tropospheric model columns as produced by the analysis run are given, too. Here, the locations with high NO2 surface concentrations also show up in the tropospheric column density, but remain concentrated ashore. The higher NO2 column densities over Scotland and the Atlantic Ocean are linked to a cyclonic low transporting polluted air masses – originating from surface levels – over long distances.
Fig. 5. Scatter plots of O3 for the European domain and PM10 for the nested Northrhine-Westfalian grid, showing the benefit of the operational analysis procedure for the forecast skill on November 27, 2008, as given by bias and root-mean-square errors (rmse) for different model realisations. Locations of measuring stations are given by crosses in the upper right corner of the plot areas. See text for further explanations.
4.2 Operational 3d-Var Application for PROMOTE
Within the framework of the European project PROMOTE, an operational 3d-var data assimilation setup is chosen to produce daily analyses and high-quality forecasts on various European domains. Starting with a coarse European domain with a horizontal grid spacing of 45x45 km2 and using one intermediate nest level with 15x15 km2 grid spacing, the target domains include Switzerland, Austria, Ireland, Mecklenburg-Western Pomerania and Northrhine-Westfalia in Germany at a horizontal resolution of 5x5 km2. A special nest is prepared for the Rhine-Ruhr area with a grid spacing of 1x1 km2. Due to the numerical demands of the 4d-var technique, only a spatial analysis algorithm has been chosen, allowing for activation of the aerosol module MADE ([1]) in EURAD-IM. Figure 5 shows scatter plots for O3 on the European domain and PM10 on the Northrhine-Westfalian grid, emphasising the benefits of the data assimilation system for the forecast skill. Four model realisations are given, based on different meteorological and chemical analyses: the simulations forecast day-1 and control day-0 are based on the same chemical analysis, but control day-0 uses a more recent meteorological analysis (by 12 hours). The difference in model performance is thus given by different meteorological conditions only. In comparison, forecast day-0 uses the same meteorology as control day-0, but an updated chemical analysis inferred from the available observations of the day before. The lower bias and rmse values are therefore due to the chemical data assimilation procedure only. A
significantly lower bias and rmse can be stated both for ozone at coarse resolution and for particulate matter in the nested domain. The fourth model simulation, analysis day-0, is given for the sake of completeness, showing the results after performing 3d-var assimilation of the given observations on November 27 with forecast day-0 as model background.
5 Summary
Allowing for the joint analysis of emission rates and initial values, 4d-var DA proves to be a powerful tool for air quality assessments and simulations. The increased ill-posedness of the optimisation problem has been compensated by the design of flexible and precise error covariance matrices for both optimisation parameter sets. The operational application of spatial DA techniques on nested domains leads to improved forecast skill for air-quality-relevant constituents. With the upcoming availability of an adjoint particle phase module, the employment of operational 4d-var experiments and ensemble setups will further enhance the statistical knowledge needed for further improvement of the covariances for both the gas and the particle phase.
References
1. Ackermann, I.J., Hass, H., Memmesheimer, M., Ebel, A., Binkowski, F.S., Shankar, U.: Modal aerosol dynamics model for Europe: Development and first applications. Atmos. Environ. 32, 2981–2999 (1998)
2. Boersma, K.F., Eskes, H.J., Brinksma, E.J.: Error analysis for tropospheric NO2 retrieval from space. J. Geophys. Res. 109(D04311) (2004)
3. Courtier, P., Thépaut, J.N., Hollingsworth, A.: A strategy for operational implementation of 4D-Var, using an incremental approach. Q. J. R. Meteorol. Soc. 120(519), 1367–1387 (1994)
4. Elbern, H., Schmidt, H.: Chemical 4D variational data assimilation and its numerical implications for case study analyses. In: Chock, D.P., Carmichael, G.R. (eds.) IMA Volumes in Mathematics and its Applications, Atmospheric Modeling, vol. 130, pp. 165–184 (2002)
5. Elbern, H., Schmidt, H., Talagrand, O., Ebel, A.: 4D-variational data assimilation with an adjoint air quality model for emission analysis. Environ. Model. and Software 15, 539–548 (2000)
6. Elbern, H., Strunk, A., Schmidt, H., Talagrand, O.: Emission rate and chemical state estimation by 4-dimensional variational inversion. Atmos. Chem. Phys. 7, 1–59 (2007)
7. Jakobs, H.J., Feldmann, H., Hass, H., Memmesheimer, M.: The use of nested models for air pollution studies: An application of the EURAD model to a SANA episode. J. Appl. Meteor. 34(6), 1301–1319 (1995)
8. Memmesheimer, M., Friese, E., Ebel, A., Jakobs, H.J., Feldmann, H., Kessler, C., Piekorz, G.: Long-term simulations of particulate matter in Europe on different scales using sequential nesting of a regional model. Int. J. Environm. and Pollution 22(1-2), 108–132 (2004)
9. Talagrand, O., Courtier, P.: Variational assimilation of meteorological observations with the adjoint vorticity equation. I: Theory. Q. J. R. Meteorol. Soc. 113, 1311– 1328 (1987) 10. Volz-Thomas, A., Geiss, H., Hofzumahaus, A., Becker, K.H.: Introduction to special section: Photochemistry experiment in BERLIOZ. J. Geophys. Res. 108(D4), 8252 (2003) 11. Weaver, A., Courtier, P.: Correlation modelling on the sphere using a generalized diffusion equation. Q. J. R. Meteorol. Soc. 127, 1815–1846 (2001)
Numerical Study of Some High PM10 Levels Episodes A. Todorova1, G. Gadzhev1 , G. Jordanov1, D. Syrakov2, K. Ganev1 , N. Miloshev1 , and M. Prodanova2 1
Geophysical Institute, Bulgarian Academy of Sciences, Sofia, Bulgaria 2 National Institute of Meteorology and Hydrology, Bulgarian Academy of Sciences, Sofia, Bulgaria
Abstract. The study aims at examining the ability and the limitations of US EPA Models 3 system to adequately reproduce air pollution episodes and to evaluate the role of different processes in the PM10 pattern formation. The case study focuses on the meteorological situation in Germany in February and March of 2003 during which three major PM10 episodes could be identified. The simulated meteorological fields agree well with the patterns described in the case study definition. The simulated PM10 concentrations are compared with measurements from the EMEP stations in Germany. The Integrated Process Rate Analysis function of CMAQ is applied for clarifying the role of different processes of transport and transformation in forming PM10 concentration peaks.
1 Introduction
The goal of the study is to examine the ability and the limitations of the US EPA Models-3 system to adequately reproduce air pollution episodes and to evaluate the role of different processes of transport and transformation in forming PM10 concentration peaks. The case study focuses on the meteorological situation in Germany in February and March of 2003, during which three major PM10 episodes could be identified. These episodes had already been used for model intercomparison and for studying model simulation abilities ([12]). Extensive joint model comparison studies are still ongoing within the framework of COST Action 728, mostly aimed at clarifying the reasons for the shortcomings in the simulations and at the choice of optimal model set-ups, inputs and parameters. The present study is part of the COST 728 activities as well, but focuses mostly on studying the role of different processes in the PM10 pattern formation and their contribution to the PM10 peaks in the period of interest.
2 Modeling Tools
The US EPA Models-3 system was chosen as a modelling tool. The system consists of three components: the Meso-meteorological Model MM5 ([7,9]); CMAQ
Fig. 1. Surface concentrations of PM10 for 28 February at 14 and 20h
– the Community Multiscale Air Quality System ([6,2,3,14]); SMOKE – the Sparse Matrix Operator Kernel Emissions Modelling System ([5,10,4,15]).
3 Model Configuration and Brief Description of the Simulations
Since the basic meteorological data for 2003 are the NCEP Global Analysis Data with 1° × 1° resolution, it was necessary to use the MM5 and CMAQ nesting capabilities to downscale to 10 km, with the innermost domain covering the territory of Germany. The TNO inventory ([13]) is used for the simulations. To prepare the CMAQ emission input file, these inventory files were processed by specially prepared computer codes. Two main procedures were performed. The temporal allocation is made on the basis of daily, weekly and monthly profiles, provided by Builtjes in [1]. The temporal profiles are country-, pollutant- and SNAP-specific. The speciation procedure depends on the Chemical Mechanism (CM) used. The Carbon Bond mechanism, version 4 (CB4), is used in the present study ([8]). In the CMAQ version 4.6 used here, CB4 is upgraded with version 1.7 of the ISORROPIA aerosol model ([11]). This requires splitting VOC into 10 lumped pollutants and PM2.5 into 5 groups of aerosol species. A specific approach for obtaining speciation profiles is used here. The US EPA database (see [16]) is used extensively. Typical sources for every SNAP were linked to similar source types from the US EPA nomenclature. The weighted averages of the respective speciation profiles are accepted as SNAP-specific splitting factors. In such a way VOC and PM2.5 speciation profiles are derived.
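The two preprocessing steps can be pictured with the following Python sketch; all profile values, splitting factors and species names below are placeholders chosen for illustration only and do not reproduce the TNO/Builtjes profiles or the US EPA speciation data.

# Temporal allocation: distribute an annual total with monthly/weekly/hourly profiles
annual_voc = 1.0e4                               # annual VOC emission of one SNAP sector [t/yr]
monthly = [1.22] + [0.98] * 11                   # placeholder monthly factors (mean 1)
weekly  = [1.1] * 5 + [0.75] * 2                 # placeholder Mon..Sun factors (mean 1)
hourly  = [0.5] * 8 + [1.5] * 8 + [1.0] * 8      # placeholder diurnal factors (mean 1)

m, w, hr = 1, 0, 9                               # month, weekday and hour of interest
voc_this_hour = (annual_voc / 8760.0) * monthly[m] * weekly[w] * hourly[hr]

# Speciation: split the lumped VOC into CB4-type model species with SNAP-specific factors
split = {"PAR": 0.55, "OLE": 0.05, "TOL": 0.10, "XYL": 0.08, "FORM": 0.02,
         "ALD2": 0.03, "ETH": 0.07, "ISOP": 0.00, "NR": 0.08, "MEOH": 0.02}
assert abs(sum(split.values()) - 1.0) < 1e-9     # splitting factors must sum to one
speciated = {species: factor * voc_this_hour for species, factor in split.items()}
print(voc_this_hour, speciated["PAR"])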
It must be stressed that the biogenic emissions of VOC were not estimated. Bearing in mind that the simulations concern winter episodes in Northern Europe, this is not a serious omission. The CB4 chemical mechanism with aqueous-phase chemistry and the EBI solver (Eulerian iterative method) was used. CMAQ simulations were carried out for the period 1 Jan 2003 – 30 Apr 2003 for the two inner domains with 30 and 10 km resolution. The Models-3 “Integrated Process Rate Analysis” option is applied to discriminate the role of different dynamic and chemical processes in the formation of the observed high PM10 concentration episodes. The processes that are considered are: advection, diffusion, mass adjustment, emissions, dry deposition, chemistry, aerosol processes and cloud processes/aqueous chemistry.
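The way such process-analysis output is used later in Section 5 can be summarised by the short Python sketch below; the hourly contribution values are invented, and reading the actual CMAQ process-analysis files is omitted.

import numpy as np

processes = ["horiz. advection", "vert. advection", "diffusion", "mass adjustment",
             "emissions", "dry deposition", "chemistry", "aerosol", "cloud/aqueous"]
# invented hourly contributions to the PM10 change [ug/m3 per hour], rows = hours
ipr = np.array([[ 2.1, -0.4, -0.6, 0.0, 0.3, -0.2, 0.0, 0.5, -0.1],
                [-1.5,  0.8, -0.3, 0.0, 0.3, -0.2, 0.0, 0.1,  0.6],
                [ 0.4, -0.2, -0.1, 0.0, 0.3, -0.2, 0.0, 0.0, -1.0]])

net_change = ipr.sum(axis=1)                     # the contributions add up to the net hourly change
dominant = [processes[i] for i in np.abs(ipr).argmax(axis=1)]
for h, (d, n) in enumerate(zip(dominant, net_change)):
    print(f"hour {h}: net PM10 change {n:+.2f} ug/m3, dominant process: {d}")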
4 Comparison of the PM10 Simulations with Measurements
The case study focuses on the meteorological situation in Germany in February and March of 2003 during which three major PM10 episodes could be identified: the first between Feb 10 and Feb 14 with observed peak PM10 concentrations from Feb 11 to Feb 13, the core episode between Feb 21 and March 5 with peak PM10 concentrations from Feb 28 to March 4, and the episode between March 24 and March 31 with PM10 maxima on March 27 and March 28. The meteorological simulations are very close to the observed state of the atmosphere: on 10–11 February the main pressure system that influences the domain of interest is a high pressure system over Central and Eastern Europe with winds from S and SE blowing over Germany; from 25 February until 2 March there are two large pressure systems – low pressure over the Atlantic and high pressure over Eastern Europe – and an almost stationary warm front is evident, dividing Germany into two parts with different meteorological (especially wind) conditions. Plots of surface PM concentrations for February 28 are shown in Fig. 1 as an example. The sum is over those aerosol species that have a significant contribution to the total concentrations (> 1 µg/m³), and the hours are chosen so that daytime concentrations are shown as well as the peak at 20h. From February 28 to March 4 a large PM plume is evident in western Germany. This plume moves back and forth in the E–W direction, changing shape as well as location and intensity of the maximum concentration. The modeled and observed temporal evolution of the daily average PM10 concentrations at the different background stations in Germany is shown in Fig. 2. The agreement of simulated with measured concentrations can be rated as good — for all stations the temporal evolution of the concentrations is qualitatively fairly well simulated. The observed peaks, however, are severely underestimated by the model for stations DE02R, DE09R, and DE41R. The comparison between simulations made with horizontal resolutions of 30 and 10 km shows that the higher resolution does not improve the agreement
Fig. 2. Simulated vs. measured PM10 surface concentrations for the following stations DE02R-Langenbrugge, DE03R-Schauinsland, DE04R-Deuselbach, DE05RBrotjacklriegel, DE07R-Neuglobsow, DE08R-Schmucke, DE09R-Zingst, DE41RWesterland Tinnum, DE44R-Melpitz
with measured data. In fact, the 10 and 30 km results barely differ at all. Quite obviously, this is due to the fact that the observed PM10 peaks are a result of some large-scale processes.
5 Process Analysis
To better understand the development of the pollution episodes, one should closely examine the contributions of different processes to the total aerosol concentrations. The analysis of the behavior of different processes points to possible explanations of the genesis of the PM10 concentration peaks. Fig. 3 shows the time evolution of the contribution of each process to the change in PM10 concentration in each of the nine stations between the 57th and 63rd Julian day (February 26 to March 4). In all nine stations advection is the dominant process, with a highly variable impact on PM10 concentrations. For DE02 cloud processes also have
Fig. 3. Temporal evolution of the contribution of different processes to the hourly change of PM10 surface concentrations for the following stations: DE02RLangenbrugge, DE03R-Schauinsland, DE04R-Deuselbach, DE05R-Brotjacklriegel, DE07R-Neuglobsow, DE08R-Schmucke, DE09R-Zingst, DE41R-Westerland Tinnum, DE44R-Melpitz
significant contribution — on the 57th day (positive) and 60th day (negative). DE04 experiences a drop in concentration after 57th day due to horizontal advection and vertical diffusion. Between 57th and 58th day the vertical advection acts in the opposite direction, slowing down the rate of decrease. The increase after 60th day is also due to advection — vertical at first and then horizontal. In DE05 there is a peak on 58th day due to vertical advection and aerosol processes. On the 59th and 60th day cloud processes have significant contribution to the rising PM10 concentration while horizontal advection and vertical diffusion act in negative direction all the time until the 61st day. The large drop after 60th day is due to cloud processes while horizontal advection acts in the opposite direction. Station DE07 has high and variable concentrations for this period and there are lots of processes involved. The variations between 57th and 59th day are
due to the variable impact of horizontal advection, which changes its sign from day to day. After 59th day the positive impact of horizontal advection competes with the negative one of cloud processes and vertical advection. In DE08 there is, again, a pronounced impact of horizontal advection, which is highly variable and frequently changes its sign. Vertical diffusion is responsible for the lower PM10 concentration in the afternoon of 58th and cloud processes play a role in the decrease on 60th day. For DE09 advection is again the dominant process — the influence of vertical advections is predominantly positive, while horizontal advection is highly variable and changes its sign. Vertical diffusion contributes to the rapid decreasing of PM10 concentration on 58th day and cloud processes play an important role on 60th day. For DE41 the decrease in concentrations on 59th day is due to horizontal advection and cloud processes acting in negative direction, while after 60th day the concentration rises again due to horizontal and vertical advection. DE42 shows quick changes in simulated PM10 concentration between 57th and 60th day due to various processes. Cloud and aerosol processes play an important role on 57th , while horizontal advection and vertical diffusion are the processes responsible for the decrease on 58th . On the 59th day the negative influence of vertical advection becomes more pronounced. After day 60 horizontal advection and aerosol processes contribute to the slight increase in PM10 concentration.
6 Conclusions
The main conclusions that can be made from this study are the following: – The simulated meteorological fields agree well with the patterns described in the case study definition. The simulated PM10 agreement with measurements is as good (or “as bad”) as many other model runs demonstrate for many other cases. The qualitative agreement is good and the quantitative agreement for most of the stations is reasonable; – Enhancing the horizontal spatial resolution does not improve the results significantly, so most probably the observed PM10 peaks are a result of some large-scale processes; – The results produced by the CMAQ “Integrated Process Rate Analysis” demonstrate the very complex behavior and interaction of the different processes — process contributions change very quickly with time and these changes for the different stations hardly correlate at all. The analysis of the behavior of different processes does not give clear explanation of the genesis of the PM10 concentration peaks, but at least outlines the most important and dominant processes and points to possible explanations of the genesis.
Acknowledgments The present work is supported by EC through 6FP NoE ACCENT (GOCECT-2002-500337), SEE-GRID-SCI project, contract No. FP7 RI-211338, COST
Actions 728, as well as by the Bulgarian National Science Fund (grants No. DO02-161/16.12.2008 and DO02-115/2008). The contacts within the framework of the NATO SfP Grant ESP.EAP.SFPP 981393 were extremely stimulating as well. Deep gratitude is due to US EPA, US NCEP and EMEP for providing free-of-charge data and software. Special thanks to the Netherlands Organization for Applied Scientific Research (TNO) for providing us with the high-resolution European anthropogenic emission inventory.
References 1. Builtjes, P.J.H., van Loon, M., Schaap, M., Teeuwisse, S., Visschedijk, A.J.H., Bloos, J.P.: Project on the modelling and verification of ozone reduction strategies: contribution of TNO-MEP, TNO-report, MEP-R2003/166, Apeldoorn, The Netherlands (2003) 2. Byun, D., Ching, J.: Science Algorithms of the EPA Models-3 Community Multiscale Air Quality (CMAQ) Modeling System. EPA Report 600/R-99/030, Washington, DC (1999) 3. Byun, D., Schere, K.L.: Review of the Governing Equations, Computational Algorithms, and Other Components of the Models-3 Community Multiscale Air Quality (CMAQ) Modeling System. Applied Mechanics Reviews 59(2), 51–77 (2006) 4. CEP: Sparse Matrix Operator Kernel Emission (SMOKE) Modeling System, University of Carolina, Carolina Environmental Programs, Research Triangle Park, North Carolina (2003) 5. Coats Jr., C.J., Houyoux, M.R.: Fast Emissions Modeling with the Sparse Matrix Operator Kernel Emissions Modeling System, The Emissions Inventory: Key to Planning, Permits, Compliance, and Reporting, Air and Waste Management Association, New Orleans (September 2006) 6. Dennis, R.L., Byun, D.W., Novak, J.H., Galluppi, K.J., Coats, C.J., Vouk, M.A.: The Next Generation of Integrated Air Quality Modeling: EPA’s Models-3. Atmosph. Environment 30, 1925–1938 7. Dudhia, J.: A non-hydrostatic version of the Penn State/NCAR Mesoscale Model: validation (1993) 8. Gery, M.W., Whitten, G.Z., Killus, J.P., Dodge, M.C.: A Photochemical Kinetics Mechanism for Urban and Regional Scale Computer Modeling. Journal of Geophysical Research 94, 12925–12956 (1989) 9. Grell, G.A., Dudhia, J., Stauffer, D.R.: A description of the Fifth Generation Penn State/NCAR Mesoscale Model (MM5). NCAR Technical Note, NCAR TN-398STR, 138 p. (1994) 10. Houyoux, M.R., Vukovich, J.M.: Updates to the Sparse Matrix Operator Kernel Emission (SMOKE) Modeling System and Integration with Models-3, The Emission Inventory: Regional Strategies for the Future, Raleigh, NC, Air and Waste Management Association (1999) 11. Nenes, A., Pandis, S.N., Pilinis, C.: ISORROPIA: A new thermodynamic equilibrium model for multiphase multicomponent inorganic aerosols. Aquat. Geoch. 4, 123–152 (1998) 12. Stern, R., Builtjes, P., Schaap, M., Timmermans, R., Vautard, R., Hodzic, A., Memmesheimer, M., Feldmann, H., Renner, E., Wolke, R., Kerschbaumer, A.: A model inter-comparison study focussing on episodes with elevated PM10 concentrations. Atmospheric Environment 42, 4567–4588 (2008)
13. Visschedijk, A.J.H., Zandveld, P.Y.J., Denier van der Gon, H.A.C.: A High Resolution Gridded European Emission Database for the EU Integrate Project GEMS, TNO-report 2007-A-R0233/B, Apeldoorn, The Netherlands (2007) 14. http://www.cmaq-model.org/ 15. http://www.smoke-model.org/ 16. http://www.epa.gov/ttn/chief/emch/speciation/
Hausdorff Continuous Viscosity Solutions of Hamilton-Jacobi Equations
R. Anguelov1,2, S. Markov2, and F. Minani3
1 Department of Mathematics and Applied Mathematics, University of Pretoria
2 Institute of Mathematics and Informatics, Bulgarian Academy of Sciences
3 Department of Applied Mathematics, National University of Rwanda
Abstract. A new concept of viscosity solutions, namely, the Hausdorff continuous viscosity solution for the Hamilton-Jacobi equation is defined and investigated. It is shown that the main ideas within the classical theory of continuous viscosity solutions can be extended to the wider space of Hausdorff continuous functions while also generalizing some of the existing concepts of discontinuous solutions. Keywords: Viscosity solution, Hausdorff continuous, envelope solution. 2000 Mathematics Subject Classification: 49L25, 35D05, 54C60.
1 Introduction
Hamilton-Jacobi equations are traditionally associated with classical mechanics but in recent times they are also linked to new areas of knowledge, e.g. the modeling of biomotor processes [12]. In their special form of Hamilton-Jacobi-Bellman equations they are associated with problems of optimal control [3], including feedback control problems typical for bioreactors [7,9]. The theory of viscosity solutions was developed in order to accommodate various kinds of nonsmooth solutions of these problems. In its classical formulation, see [6], the theory deals with solutions which are continuous functions. The concept of continuous viscosity solutions is further generalized, e.g. see [3, Chapter V], [5,4], to include discontinuous solutions, with the definition of Ishii given in [8] playing a pivotal role. In this paper we propose a new approach to the treatment of discontinuous solutions, namely, by involving Hausdorff continuous (H-continuous) interval valued functions. We justify the proposed approach by demonstrating that (i) the main ideas within the classical theory of continuous viscosity solutions can be extended to the wider space of H-continuous functions; (ii) the existing theory of discontinuous solutions is a particular case of that developed in this paper in terms of H-continuous functions; (iii) the H-continuous viscosity solutions have a clearer interpretation than the existing concepts of discontinuous solutions, e.g. envelope viscosity solutions [3, Chapter V].
The first author was partially supported by the NRF of South Africa. The second author was partially supported by the Bulgarian NSF Project DO 02-359/2008.
In order to simplify the exposition we will only consider first order Hamilton-Jacobi equations of the form Φ(x, u(x), Du(x)) = 0, x ∈ Ω,
(1)
where Ω is an open subset of Rn, u : Ω → R is the unknown function, Du is the gradient of u and the given function Φ : Ω × R × Rn → R is jointly continuous in all its arguments. The theory of viscosity solutions rests on two fundamental concepts, namely, of subsolution and of supersolution. These concepts are defined in various equivalent ways in the literature. The definition given below is formulated in terms of local maxima and minima. We use the notations

USC(Ω) = {u : Ω → R : u is upper semi-continuous on Ω},
LSC(Ω) = {u : Ω → R : u is lower semi-continuous on Ω}.

Definition 1. A function u ∈ USC(Ω) is called a viscosity subsolution of the equation (1) if for any ϕ ∈ C1(Ω) we have Φ(x0, u(x0), Dϕ(x0)) ≤ 0 at any local maximum point x0 of u − ϕ. Similarly, u ∈ LSC(Ω) is called a viscosity supersolution of the equation (1) if for any ϕ ∈ C1(Ω) we have Φ(x0, u(x0), Dϕ(x0)) ≥ 0 at any local minimum point x0 of u − ϕ.

Without loss of generality we may assume in the above definition that u(x0) = ϕ(x0), exposing in this way a very clear geometrical meaning of this definition: the gradient of the solution u of equation (1) is replaced by the gradient of any smooth function touching the graph of u from above, in the case of a subsolution, and touching the graph of u from below, in the case of a supersolution. This also establishes the significance of the requirement that a subsolution and a supersolution should respectively be upper semi-continuous and lower semi-continuous functions. More precisely, the upper semi-continuity of a subsolution u ensures that any local supremum of u − ϕ is effectively reached at a certain point x0, that is, it is a local maximum, with the geometrical meaning that the graph of u can be touched from above at x = x0 by a vertical translate of the graph of ϕ. In a similar way, the lower semi-continuity of a supersolution u ensures that any local infimum of u − ϕ is effectively reached at a certain point x0, which means that the graph of u can be touched from below at x = x0 by a vertical translate of the graph of ϕ.

Naturally, a solution should incorporate the properties of both a subsolution and a supersolution. In the classical viscosity solutions theory [6] a viscosity solution is a function u which is both a subsolution and a supersolution. Since USC(Ω) ∩ LSC(Ω) = C(Ω), this clearly implies that the viscosity solutions defined in this way are all continuous functions. The concept of viscosity solution for functions which are not necessarily continuous is introduced by using the upper and lower semi-continuous envelopes. For a locally bounded function u these are given by

S(u)(x) = inf{f(x) : f ∈ USC(Ω), u ≤ f} = inf_{δ>0} sup{u(y) : y ∈ B_δ(x)},  (2)
I(u)(x) = sup{f(x) : f ∈ LSC(Ω), u ≥ f} = sup_{δ>0} inf{u(y) : y ∈ B_δ(x)},  (3)
where Bδ (x) denotes the open δ-neighborhood of x in Ω. Using the fact that for any function u : Ω → R the functions S(u) and I(u) are always, respectively, upper semi-continuous and lower semi-continuous functions, a viscosity solution can be defined as follows, [8]. Definition 2. A function u : Ω → R is called a viscosity solution of (1) if S(u) is a viscosity subsolution of (1) and I(u) is a viscosity supersolution of (1). The first important point to note about the advantages of the method in this paper is as follows. Interval valued functions appear naturally in the context of noncontinuous viscosity solutions. Namely, they appear as graph completions. Indeed, the above definition places requirements not on the function u itself but on its lower and upper semi-continuous envelopes or, in other words, on the interval valued function F (u)(x) = [I(u)(x), S(u)(x)], x ∈ Ω,
(4)
which is called the graph completion of u, see [13]. Clearly, Definition 2 treats functions which have the same upper and lower semi-continuous envelopes, that is, the same graph completion, as identical functions. On the other hand, since different functions can have the same graph completion, a function cannot in general be identified from its graph completion, that is, functions with the same graph completion are indistinguishable. Therefore, no generality will be lost if only interval valued functions representing graph completions are considered. Let A(Ω) be the set of all functions defined on an open set Ω ⊂ Rn with values which are closed finite real intervals, that is, A(Ω) = {f : Ω → IR}, where IR = {[a, a] : a, a ∈ R, a ≤ a}. Identifying a ∈ R with the point interval [a, a] ∈ IR, we consider R as a subset of IR. Thus A(Ω) contains, as a subset, the set of all real (point-valued) functions defined on Ω. Let u ∈ A(Ω). For every x ∈ Ω the value of u is an interval [u(x), u(x)] ∈ IR. Hence, the function u can be written in the form u = [u, u] where u, u are real functions on Ω and u(x) ≤ u(x), x ∈ Ω. The function w(u)(x) = u(x) − u(x), x ∈ Ω, is called the width of u. Clearly, u is a real (point-valued) function if and only if w(u) = 0. The definitions of the upper semi-continuous envelope, the lower semi-continuous envelope and the graph completion operator F given in (2), (3) and (4) for real functions u can be extended to interval functions u = [u, u] ∈ A(Ω) in an obvious way, e.g. S(u)(x) = inf_{δ>0} sup{z ∈ u(y) : y ∈ B_δ(x)}. We recall here the concept of S-continuity associated with the graph completion operator, [13]. Definition 3. A function u = [u, u] ∈ A(Ω) is called S-continuous if F(u) = u, or, equivalently, I(u) = u, S(u) = u. Using the properties of the lower and upper semi-continuous envelopes one can easily see that the graph completions of locally bounded real functions on Ω comprise the set F(Ω) of all S-continuous functions on Ω. Following the above discussion we define the concept of viscosity solution for the functions in F(Ω). Definition 4. A function u = [u, u] ∈ F(Ω) is called a viscosity solution of (1) if u is a supersolution of (1) and u is a subsolution of (1).
A second advantage of the method in this paper is as follows. A function u ∈ A(Ω) is a viscosity solution of (1) in the sense of Definition 2 if and only if the interval valued function F (u) is a viscosity solution of (1) in the sense of Definition 4. In this way the level of the regularity of a solution u is manifested through the width of the interval valued function F (u). It is well known that without any additional restrictions the concept of viscosity solution given in Definition 2 and by implication the concept given in Definition 4 is rather weak, [3]. This is demonstrated by the following example, partially discussed in [3]. Example 1. Consider the equation u (x) = 1 , x ∈ (0, 1). The functions v(x) =
x+1 if x ∈ (0, 1)∩Q x if x ∈ (0, 1)\Q
w(x) =
(5) x if x ∈ (0, 1)∩Q x+1 if x ∈ (0, 1)\Q
are both viscosity solutions of equation (5) in terms of Definition 2. The interval valued function z = F (v) = F (w) given by z(x) = [x, x + 1], x ∈ (0, 1), is a solution in terms of Definition 4. With the interval approach adopted here it becomes apparent that the distance between I(u) and S(u) is an essential measure of the regularity of any solution u, irrespective of whether it is given as a point valued function or as an interval valued function. If no restriction is placed on the distance between I(u) and S(u), then we have some quite meaningless solutions like the solutions in Example 1. On the other hand, a strong restriction like I(u) = S(u) gives only solutions which are continuous. In this paper we consider solutions for which the Hausdorff distance, as defined in [13], between the functions I(u) and S(u) is zero, a condition defined through the concept of Hausdorff continuity.
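As a purely numerical illustration (not part of the original paper), the envelopes (2), (3) and the graph completion (4) can be approximated on a grid; the Python sketch below does this for a simple step function, since functions such as v and w of Example 1 cannot be sampled on a grid, and it shows that F(u) takes a proper interval value exactly around the discontinuity.

import numpy as np

x = np.linspace(0.0, 1.0, 1001)
u = np.where(x < 0.5, 0.0, 1.0)                   # jump discontinuity at x = 0.5

def envelopes(u, radius=1):
    # approximate S(u) and I(u) by sup/inf over a one-cell grid neighbourhood
    n = u.size
    S = np.array([u[max(0, i - radius):i + radius + 1].max() for i in range(n)])
    I = np.array([u[max(0, i - radius):i + radius + 1].min() for i in range(n)])
    return I, S

I_u, S_u = envelopes(u)
width = S_u - I_u                                 # width of F(u) = [I(u), S(u)]
j = int(np.argmax(width))
print(f"F(u)({x[j]:.3f}) = [{I_u[j]:.0f}, {S_u[j]:.0f}], width = {width[j]:.0f}")
print("proper interval values on", int((width > 0).sum()), "grid points (a 'small' set)")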
2 The Space of Hausdorff Continuous Functions
The concept of Hausdorff continuous interval valued functions was originally developed within the theory of Hausdorff approximations, [13]. It generalizes the concept of continuity of real function using a minimality condition with respect to inclusion of graphs. Definition 5. A function f ∈ A(Ω) is called Hausdorff continuous, or Hcontinuous, if for every g ∈ A(Ω) which satisfies the inclusion g(x) ⊆ f (x), x ∈ Ω, we have F (g)(x) = f (x), x ∈ Ω. As mentioned in the Introduction the concept of Hausdorff continuity is closely connected with the Hausdorff distance between functions. Let us recall that the Hausdorff distance ρ(f, g) between two functions f, g ∈ A(Ω) is defined as the Hausdorff distance between the graphs of the functions F (f ) and F (g) considered as subsets of Rn+1 , [13]. Then we have f = [f , f ] is H-continuous ⇐⇒ f is S-continuous and ρ(f , f ) = 0 .
Although every H-continuous function f is, in general, interval valued, the subset of the domain Ω where f assumes proper interval values is a set of first Baire category. One of the most surprising and useful properties of the set H(Ω) of all H-continuous functions is its Dedekind order completeness with respect to the pointwise defined partial order [1,10] f ≤ g ⇐⇒ f(x) ≤ g(x), f(x) ≤ g(x), x ∈ Ω. What makes this property so significant is the fact that with very few exceptions the usual spaces in Real Analysis or Functional Analysis are not Dedekind order complete. Hence the space of H-continuous functions can be a useful tool in Real Analysis and in the Analysis of PDEs, particularly in situations involving order. For example, a long outstanding problem related to the Dedekind order completion of spaces C(X) of real valued continuous functions on rather arbitrary topological spaces X was solved through Hausdorff continuous functions [1]. Following this breakthrough a significant improvement of the regularity properties of the solutions obtained through the order completion method, see [11], was reported in [2]. Namely, it was shown that these solutions can be assimilated with the class of Hausdorff continuous functions on the open domains Ω. We may also remark that the concept of viscosity solutions is defined through order. Hence it is natural to be considered in the general setting of a Dedekind order complete space like H(Ω).
3 The Envelope Viscosity Solutions and Hausdorff Continuous Viscosity Solutions
Recognizing that the concept of viscosity solution given by Definition 2 is rather weak the authors of [3] introduce the concept of envelope viscosity solution. The concept is defined in [3] for equation (1) with Dirichlet boundary conditions u|∂Ω = g.
(6)
The concepts of subsolution and supersolution are extended to the problem (1),(6) through respective inequalities on the boundary, namely u|∂Ω ≤ g for a subsolution and u|∂Ω ≥ g for a supersolution. Definition 6. A function u ∈ A(Ω) is called an envelope viscosity solution of (1),(6) if there exist a nonempty set Z1 (u) of subsolutions of (1),(6) and a nonempty set Z2 (u) of supersolutions of (1),(6) such that u(x) =
sup_{f ∈ Z1(u)} f(x) = inf_{f ∈ Z2(u)} f(x),  x ∈ Ω.
It is shown in [3] that every envelope viscosity solution is a viscosity solution in terms of Definition 2. Considering the concept from a geometrical point of view, one can expect that by ’squeezing’ the envelope viscosity solution u between a set of subsolutions and a set of supersolutions the gap between I(u) and S(u) would be small. However, in general this is not the case. The following example shows that the concept of envelope viscosity solution does not address the problem of the distance between I(u) and S(u). Hence one can have envelope viscosity solutions of little practical meaning similar to the one in Example 1.
Example 2. Consider the following problem on Ω = (0, 1):

−u(x)(u′(x))² = 0, x ∈ Ω,  (7)
u(0) = u(1) = 0.  (8)

For every α ∈ Ω we define the functions

φα(x) = 1 if x = α, φα(x) = 0 if x ∈ Ω \ {α};   ψα(x) = 0 if x = α, ψα(x) = 1 if x ∈ Ω \ {α}.

We have φα ∈ USC(Ω), ψα ∈ LSC(Ω), α ∈ Ω. Furthermore, for every α ∈ (0, 1) the function φα is a subsolution of (7) while ψα is a supersolution of (7). Indeed, both functions satisfy (7) for all x ∈ Ω \ {α}, and at x = α we have

−φα(α)p² = −p² ≤ 0 for all p ∈ D⁺φα(α) = (−∞, ∞),
−ψα(α)p² = 0 ≥ 0 for all p ∈ D⁻ψα(α) = (−∞, ∞).

We will show that the function

u(x) = 1 if x ∈ Ω \ Q, u(x) = 0 if x ∈ Q ∩ Ω
is an envelope viscosity solution of (7). Define Z1(u) = {φα : α ∈ Ω \ Q}, Z2(u) = {ψα : α ∈ Q ∩ Ω}. Note that the functions in both Z1(u) and Z2(u) satisfy the boundary condition (8). Therefore these sets consist respectively of viscosity subsolutions and viscosity supersolutions of (7),(8). Further u satisfies u(x) = sup_{w∈Z1(u)} w(x) = inf_{w∈Z2(u)} w(x), which implies that it is an envelope viscosity solution. Clearly neither u nor F(u) is a Hausdorff continuous function. In fact we have F(u)(x) = [0, 1], x ∈ Ω. The next interesting question is whether every H-continuous solution is an envelope viscosity solution. Since the concept of envelope viscosity solutions requires the existence of sets of subsolutions and supersolutions respectively below and above an envelope viscosity solution, an H-continuous viscosity solution is not in general an envelope viscosity solution, e.g. when the H-continuous viscosity solution does not have any other subsolutions and supersolutions around it. However, in the essential case when the H-continuous viscosity solution is a supremum of subsolutions or an infimum of supersolutions it can be linked to an envelope viscosity solution as stated in the next theorem.

Theorem 1. Let u = [u, u] be an H-continuous viscosity solution of (1) and let

Z1 = {w ∈ USC(Ω) : w subsolution, w ≤ u},  Z2 = {w ∈ LSC(Ω) : w supersolution, w ≥ u}.

a) If Z1 ≠ ∅ and u(x) = sup_{w∈Z1} w(x) then u is an envelope viscosity solution.
b) If Z2 ≠ ∅ and u(x) = inf_{w∈Z2} w(x) then u is an envelope viscosity solution.
Proof. a) We choose the sets Z1(u) and Z2(u) required in Definition 6 as follows: Z1(u) = Z1, Z2(u) = {u}. Then we have u(x) = sup_{w∈Z1(u)} w(x) = inf_{w∈Z2(u)} w(x), which implies that u is an envelope viscosity solution. The proof of b) is done in a similar way. Let us note that if the conditions in both a) and b) of the above theorem are satisfied then both u and u are envelope viscosity solutions, and in this case it makes even more sense to consider instead the H-continuous function u.
4 Existence and Uniqueness
One of the primary virtues of the theory of viscosity solutions is that it provides very general existence and uniqueness theorems, [6]. In this section we give existence and uniqueness theorems for H-continuous viscosity solutions in a form similar to the respective theorems for continuous solutions [6, Theorem 4.1], and for general discontinuous solutions [8, Theorem 3.1], [3, Theorem V.2.14].

Theorem 2. (Existence) Assume that there exist Hausdorff continuous functions u1 = [u1, u1] and u2 = [u2, u2] such that u1 is a subsolution of (1), u2 is a supersolution of (1) and u1 ≤ u2. Then there exists a Hausdorff continuous solution u of (1) satisfying the inequalities u1 ≤ u ≤ u2.

The proof of the above theorem, similar to the other existence theorems in the theory of viscosity solutions, uses Perron's method, and the solution is constructed as a supremum of a set of subsolutions, this time the supremum being taken in the poset H(Ω) and not point-wise. We should note that, due to the fact that the poset H(Ω) is Dedekind order complete, it is an appropriate medium for such an application of Perron's method. As in the theory of continuous viscosity solutions, the uniqueness of H-continuous viscosity solutions can be obtained from a comparison principle, which can be formulated for H-continuous functions as follows.

Definition 7. The Hamilton-Jacobi equation (1) satisfies the comparison principle on the set H(Ω) of all H-continuous functions on Ω if for any subsolution u ∈ H(Ω) and any supersolution v ∈ H(Ω) we have (u(x) ≤ v(x), x ∈ ∂Ω) =⇒ (u(x) ≤ v(x), x ∈ Ω).

It is easy to see that the above comparison principle neither follows from nor implies the comparison principle for point-valued functions, e.g. see [3]. The reason is that the partial order of intervals as defined in [10] and used here does not imply a respective ordering between arbitrary points in these intervals. Hence the issue arises of characterizing the equations satisfying the comparison principle in Definition 7. Nevertheless we may note that in the case when u and v are point-valued on the boundary ∂Ω the comparison principle in Definition 7 follows from the comparison principle for point-valued functions.
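Schematically, and in our own formulation rather than a quotation from the paper, the Perron-type construction behind Theorem 2 can be written as

u \;=\; \sup\nolimits_{H(\Omega)} \bigl\{\, w \in H(\Omega) \;:\; w \text{ is a subsolution of (1)},\; u_1 \le w \le u_2 \,\bigr\},

where the supremum exists precisely because H(Ω) is Dedekind order complete.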
5 Conclusion
The H-continuous functions are a particular class of interval-valued functions. Nevertheless, recent results have shown that they can provide solutions to problems formulated in terms of point-valued functions, [1,2]. In this paper H-continuous functions are linked with the concept of viscosity solutions. As shown in the Introduction, the definition of viscosity solution, see Definition 2, has an implicit interval character since it places requirements only on the upper semi-continuous envelope S(u) and the lower semi-continuous envelope I(u). For an H-continuous viscosity solution u the functions I(u) and S(u) are as close as they can be in the sense of the Hausdorff distance ρ defined in [13], namely, we have ρ(I(u), S(u)) = 0. Hence, the requirement that a viscosity solution is H-continuous has a direct interpretation which we find clearer than the requirements related to some other concepts of discontinuous viscosity solutions. Further research will focus on extending other aspects of the classical theory to the case of H-continuous viscosity solutions. In particular this includes characterizing the equations satisfying the comparison principle in Definition 7.
References 1. Anguelov, R.: Dedekind order completion of C(X) by Hausdorff continuous functions. Quaestiones Mathematicae 27, 153–170 (2004) 2. Anguelov, R., Rosinger, E.E.: Solving Large Classes of Nonlinear Systems of PDE’s. Computers and Mathematics with Applications 53, 491–507 (2007) 3. Bardi, M., Capuzzo-Dolcetta, I.: Optimal Control and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations. Birk¨ auser, Basel (1997) 4. Barles, G.: Discontinuous viscosity solutions of first order Hamilton-Jacobi Equations. Nonlinear Anal.: Theory, Methods and Applications 20(9), 1123–1134 (1993) 5. Barron, E.N., Jensen, R.: Semicontinuous viscosity solutions for Hamilton-Jacobi equations with convex Hamiltonians. Communications in Partial Differential Equations 15(12), 1713–1742 (1990) 6. Crandal, M.G., Ishii, H., Lions, P.-L.: User’s guide to viscosity solutions of second order partial differential equations. Bulletin of AMS 27(1), 1–67 (1992) 7. Dimitrova, N.S., Krastanov, M.I.: Stabilization of a Nonlinear Anaerobic Wastewater Treatment Model. In: Lirkov, I., Margenov, S., Wa´sniewski, J. (eds.) LSSC 2005. LNCS, vol. 3743, pp. 208–215. Springer, Heidelberg (2006) 8. Ishii, H.: Perron’s method for Hamilton-Jacobi equations. Duke Mathematical Journal 55(2), 369–384 (1987) 9. Krastanov, M.I., Dimitrova, N.S.: Stabilizing feedback of a nonlinear process involving uncertain data. Bioprocess and Biosystems Engineering 25(4), 217–220 (2003) 10. Markov, S.: Extended interval arithmetic involving infinite intervals. Mathematica Balkanica 6, 269–304 (1992) 11. Oberguggenberger, M.B., Rosinger, E.E.: Solution on Continuous Nonlinear PDEs through Order Completion. North-Holland, Amsterdam (1994) 12. Perthame, B., Souganidis, P.E.: Asymmetric potentials and motor effect: a homogenization approach. Ann. I. H. Poincar´e — AN (2008), doi:10.1016/j.anihpc.2008.10.003 13. Sendov, B.: Hausdorff Approximations. Kluwer, Dordrecht (1990)
Stochastic Skiba Sets: An Example from Models of Illicit Drug Consumption Roswitha Bultmann, Gustav Feichtinger, and Gernot Tragler Vienna University of Technology (TU Wien), Institute for Mathematical Methods in Economics (IWM), Research Unit for Operations Research and Control Systems (ORCOS), Argentinierstr. 8/105-4, A-1040 Wien, Austria
[email protected],
[email protected],
[email protected]
Abstract. Skiba or DNSS sets are an important feature of many deterministic (usually convex or at least non-concave) optimal control models. For stochastic models they have hardly been studied. Using a classical discretization scheme, we consider a time-discrete stochastic reformulation of a well-known optimal control model of illicit drug consumption, in which the retail drug price is influenced by exogenous random forces. We assume that these exogenous forces are described by i.i.d. random variables on a finite probability space. Having set up the model in this way, we use techniques from dynamic programming to determine the optimal solution and describe the transient sets that constitute the stochastic Skiba/DNSS sets. We also show that the DNSS sets expand with the variance, and the optimal policy becomes a continuous function of the state for sufficiently high levels of the variance. Keywords: Australian heroin drought, DNSS point, drug epidemic, drug policy, market disruption, stochastic optimal control, supply shock, treatment.
1 Introduction
More than ten years ago, the first optimal control models of illicit drug epidemics were studied ([1,17]; see also [2,18]). Since then, realism and hence complexity in the models has increased steadily, while the focus has been restricted to a deterministic setup. Motivated by the Australian heroin drought (e.g., [19,10]), [4,5] discussed market disruptions in a two-stage setting, in which the drug price changes exactly once at a given time but is otherwise kept constant. That approach captures the very basics of a supply shock (i.e., high/low price in the drought/glut phase and lower/higher price thereafter), but of course it is unrealistic to know in advance how long a given market disruption will last and what the price will be once the disruption is over. One possible move towards more realism is to consider stochastic optimal control models, which is what we do in
Corresponding author.
Fig. 1. Schematic illustration of a deterministic DNSS point
this paper. To the best of our knowledge, this constitutes the first attempt in that direction. In the simplest case, for deterministic discounted infinite horizon optimal control problems that are autonomous we find a unique optimal steady state solution (often and in what follows mostly referred to as ‘equilibrium’) to which the dynamical system converges when the optimal policy is applied. If, however, the optimized system exhibits multiple equilibria, so-called DNSS or Skiba sets may occur. In such a case, the DNSS sets constitute the borders of the basins of attraction of the steady states. Being at a DNSS point, one is indifferent between at least two optimal policies leading to different equilibria. In recognition of studies by Sethi [13,14], Skiba [15], Dechert and Nishimura [8], these points of indifference are denoted as DNSS (Dechert-Nishimura-Sethi-Skiba) or Skiba points (cf. [11] or http://en.wikipedia.org/wiki/DNSS_point). Fig. 1 schematically illustrates the case of a deterministic DNSS point. For stochastic optimal control models, DNSS points have hardly been studied. According to [9], “the stochastic equivalent of a Skiba point is a transient set between two basins of attraction. The basins of attraction are recurrent sets for the Markov dynamics. The two basins of attraction are separated by a region where there is a positive probability that the dynamics will end up in the lower basin of attraction, and a positive probability they will end up in the upper basin of attraction.” Reformulating this footnote definition very loosely and adapting it to our purposes so that it will fit best to the deterministic case, each of the equilibria is surrounded by some set of points from which these equilibria are reached with certainty (absorbing set), while there also exist other points for which there are positive probabilities that the one or the other equilibrium will be approached. In other words, in stochastic optimal control models there may exist a set of points for which the long-run outcome is uncertain, and for obvious reasons we will call this set the set of uncertainty or stochastic DNSS set. Fig. 2 schematically illustrates the case of a stochastic DNSS set.1 The further contents of this paper are as follows. In Section 2 we present the time-discrete stochastic optimal control model of drug use including a
A remark is in order. In stochastic models, an ‘equilibrium’ in general is described by an invariant set of points which are visited alternatively in the long run with some positive probability. The set of probabilities associated with these points constitutes the limit distribution which describes the long-run frequency of occurrence of these points. For ease of exposition, we stick to the notation ‘equilibrium’.
Fig. 2. Schematic illustration of a stochastic DNSS set
parameterization based on data from the current U.S. cocaine epidemic. Section 3, which is devoted to the analysis via techniques from dynamic programming, reveals that the occurrence of stochastic DNSS points crucially hinges on the variance of the random variables. Section 4 further discusses the results and concludes.
2 The Model
Optimal control models both capture the dynamic aspects of drug epidemics and seek to optimize some given objective via control instruments (e.g., budgets for prevention, treatment, or price-raising law enforcement measures). Here – as in most other existing optimal control models of drug use – the goal is to minimize the discounted stream of social costs arising from drug consumption plus the control costs over an infinite planning horizon subject to the dynamic evolution of the population of drug users. In mathematical terms, we consider the following time-discrete stochastic reformulation of a well-known model of illicit drug consumption, which we obtain by using a classical discretization scheme:

min_{u_t ≥ 0}  IE [ Σ_{t=0}^{∞} h (1 − rh)^t ( κ p_t^{−ω} A_t + u_t ) ],

where h is the time step, (1 − rh)^t the discounting, κ p_t^{−ω} A_t the social cost, and u_t the control spending, subject to

A_{t+1} = A_t + h ( k p_t^{−a} A_t (Ā − A_t) − c u_t^z A_t^{1−z} − μ p_t^b A_t ),

in which the three terms in brackets represent initiation into drug use, outflow due to ‘treatment’, and natural outflow, respectively,
where ut , At , and pt denote control spending (in the present context called ‘treatment’), the number of drug users, and the drug retail price at time t,
Table 1. Model parameters with numerical values and description

Parameter  Value              Description
h          0.001              time step
r          0.04               annual discount rate
κ          3.396819914        social costs including a proportionality constant
p̄          0.12454            drug retail price (in 1000 US-$)
ω          0.5                absolute value of the short-run elasticity of demand
k          0.00000001581272   initiation proportionality constant
a          0.25               absolute value of the elasticity of initiation with respect to drug price
Ā          16,250,000         maximum number of drug users in the model
c          0.043229           treatment proportionality constant
z          0.52               1 − z reflects treatment's diminishing effects
μ          0.181282758        rate of natural outflow including a proportionality constant
b          0.25               elasticity of quitting with respect to drug price
respectively. We want to stress the fact that the considered model is not an approximation to a continuous stochastic model. While in the deterministic model (discussed in detail in its time-continuous version in [11]) the drug price is kept constant over time, in our stochastic reformulation we allow the retail price to be influenced by exogenous random forces. Here, we assume that pt = p¯ · ζt , where p¯ is some (constant) average price and {ζt }t≥0 is a family of positive i.i.d. random variables with IEζt = 1, taking values in a finite set Z. For simplicity we further assume that Z = {Z1 , Z2 , Z3 } with respective probabilities P = {P1 , P2 , P3 }. In non-technical terms, the price evolves randomly but may only take three different values. This simplifying assumption was made primarily to reduce the computation time to a minimum. For Z, we will consider the three cases {0.5, 1, 1.3}, {0.25, 1, 1.45}, and {0.15, 1, 1.51} with corresponding variances 0.12, 0.27, and 0.3468, respectively, but use the same set of probabilities P = {0.3, 0.2, 0.5}. The base case parameter values for the following numerical analysis are given in Table 1 (see [5] for the derivation of these values).
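As a minimal illustration of this setup, the following Python sketch checks the mean and variance of the lowest-variance price distribution and integrates the state equation for a fixed control level; the initial state and the constant control are illustrative assumptions, not values used by the authors.

import numpy as np

h = 0.001
pbar, k, a = 0.12454, 1.581272e-8, 0.25
Abar, c, z = 16_250_000, 0.043229, 0.52
mu, b = 0.181282758, 0.25

Z = np.array([0.5, 1.0, 1.3])                 # price multipliers, lowest-variance case
P = np.array([0.3, 0.2, 0.5])                 # common probabilities
mean = Z @ P
print("E[zeta] =", mean, " Var[zeta] =", ((Z - mean) ** 2) @ P)   # -> 1.0 and 0.12

rng = np.random.default_rng(0)
A, u = 2_000_000.0, 50_000.0                  # illustrative initial users and control level
for t in range(100_000):                      # 100 time units with step h = 0.001
    p = pbar * rng.choice(Z, p=P)             # random retail price p_t = pbar * zeta_t
    A += h * (k * p ** -a * A * (Abar - A)    # initiation into drug use
              - c * u ** z * A ** (1.0 - z)   # outflow due to 'treatment'
              - mu * p ** b * A)              # natural outflow
print("number of users after the simulated horizon:", round(A))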
3 Analysis
We use techniques from dynamic programming to determine the optimal solution and describe the transient sets that constitute the sets of uncertainty. In particular, we use a modified policy iteration algorithm as described in [3], in which each Bellman operator iteration is followed by 1000 policy operator iterations (cf. [9]). For a better performance of the solution algorithm, we have recently started to implement an adaptive space discretization scheme as proposed by [12].
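A much simplified Python sketch of such a modified policy iteration on a uniform state grid is given below; the grid sizes, the control range, the linear interpolation and the iteration counts are illustrative assumptions and do not reproduce the adaptive discretization of [12], nor do they guarantee convergence for this nearly undiscounted problem.

import numpy as np

h, r = 0.001, 0.04
kappa, pbar, omega = 3.396819914, 0.12454, 0.5
k, a, Abar = 1.581272e-8, 0.25, 16_250_000
c, z, mu, b = 0.043229, 0.52, 0.181282758, 0.25
Z, P = np.array([0.5, 1.0, 1.3]), np.array([0.3, 0.2, 0.5])
beta = 1.0 - r * h                            # per-step discount factor

A_grid = np.linspace(0.0, Abar, 401)          # state grid (illustrative resolution)
U_grid = np.linspace(0.0, 2.0e5, 101)         # control grid (illustrative range)
Ep = ((pbar * Z) ** -omega) @ P               # E[p_t^-omega], used in the expected cost

def next_state(A, u, zeta):                   # one Euler step of the state equation
    p = pbar * zeta
    A1 = A + h * (k * p ** -a * A * (Abar - A)
                  - c * u ** z * np.maximum(A, 1e-12) ** (1.0 - z)
                  - mu * p ** b * A)
    return np.clip(A1, 0.0, Abar)

def expected_value(V, u):                     # E_zeta[ V(A_{t+1}) ] via interpolation
    return sum(P[i] * np.interp(next_state(A_grid, u, Z[i]), A_grid, V) for i in range(3))

V = np.zeros_like(A_grid)
for _ in range(50):                           # outer loop: one Bellman (minimization) step ...
    Q = np.stack([h * (kappa * Ep * A_grid + u) + beta * expected_value(V, u)
                  for u in U_grid], axis=1)
    policy = Q.argmin(axis=1)
    V = Q.min(axis=1)
    u_pol = U_grid[policy]
    for _ in range(200):                      # ... followed by many policy-evaluation steps
        V = h * (kappa * Ep * A_grid + u_pol) + beta * expected_value(V, u_pol)

print("feedback control at A = 4e6 users:", u_pol[np.searchsorted(A_grid, 4.0e6)])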
In Fig. 3 we present the space increments f (At , u∗t , ζt ) := At+1 − At , so
f(A_t, u∗_t, ζ_t) = h [ k (p̄ζ_t)^{−a} A_t (Ā − A_t) − c (u∗_t)^z A_t^{1−z} − μ (p̄ζ_t)^b A_t ]

(left panels) and the optimal feedback controls u∗t(At) (right panels) for variances 0.12 (top), 0.27 (center), and 0.3468 (bottom). Note that the space increments f(At, u∗t, ζt) of course depend on the realization of ζt, so we find one curve for each possible value in Z = {Z1, Z2, Z3}. If for a given value of At all f(At, u∗t, ζt) are negative (positive), then the number of users decreases (increases) with certainty for that value of At. If, however, the sign of f(At, u∗t, ζt) is not unique (i.e., we find curves both above and below the horizontal axis for a given value of At), then the state can both decrease and increase, depending on the particular value of ζt. The case with the lowest value of the variance (top panels in Fig. 3) is most similar to the deterministic model. We find a unique DNSS point, which separates the basins of attraction of a low equilibrium at 0 and a high equilibrium denoted by Â. At the DNSS point, the optimal control is discontinuous. While in the deterministic model the equilibria are single points, in our stochastic reformulation the higher equilibrium is an interval, i.e. an invariant set of points. Increasing the variance, the qualitative results remain the same except for the fact that the DNSS point spreads out to an interval, which constitutes our set of uncertainty/stochastic DNSS set, which – in the terminology of [9] – is a transient set between two basins of attraction (center panels in Fig. 3). It is interesting to note, though, that the optimal control still has (only) one jump, which in our case coincides with the upper boundary of the stochastic DNSS set. Finally, when the variance becomes sufficiently high, the DNSS property disappears, i.e., for all initial values we converge to the low equilibrium at 0 (bottom panels in Fig. 3). In this case, the optimal control is continuous over the whole state space. It may, however, take a fairly long time to ‘escape’ the transient set (indicated by the dashed arrow in Fig. 3), which is what [6] denoted as near invariance (see also the newer results in [7]).2
4 Discussion, Extensions, and Concluding Remarks
Motivated by the Australian heroin drought and [9], we were able to compute a stochastic DNSS set as sketched in Fig. 2. From the policy point of view it is interesting to note, though, that there is still a single point in the state space separating two optimal policy strategies, which qualitatively look the same as for the deterministic model and are denoted as ‘eradication’ and ‘accommodation’ strategies, respectively (see the center panels in Fig. 3).3 In our stochastic setting, however, around this point we find an area in which the long-run outcome
For an interpretation of the non-monotonic behavior of the optimal feedback controls as displayed in the right panels of Fig. 3 see, e.g., [17,18]. The eradication/accommodation strategy refers to the left/right branch of the optimal policy and uses extensive/moderate levels of control. In the deterministic model, the eradication/accommodation strategy leads to a low/high equilibrium.
Fig. 3. Space increments f (At , u∗t , ζt ) (left panels; for each of the three curves, ζt is kept constant at Z1 , Z2 , Z3 , respectively) and optimal feedback controls u∗t (right panels) for variances 0.12 (top), 0.27 (center), and 0.3468 (bottom). The vertical lines indicate values of At , where we find a change in the sign of the space increment. Left/right solid arrows indicate a decrease/an increase, respectively, of the number of users under the optimal control. With Aˆ we denote the upper equilibrium, which is an invariant set. Within the stochastic DNSS set (center panel, indicated with a question mark) the long-run outcome is uncertain. Within the area for which we display a dashed arrow (bottom panel), the number of users can both increase and decrease, but sooner or later the system will leave this area at the lower boundary and converge to the low equilibrium at 0; that area is hence a transient set but can be ‘nearly invariant’ in that the system remains there for a very long time.
is uncertain; only when this area is left, either the low-level or the high-level equilibrium will be approached with certainty. That means, even though the eradication strategy may be optimal initially, a sequence of ‘bad’ realizations of the drug price might push the epidemic up so that the accommodation strategy becomes optimal. Analogously, we may find scenarios where the accommodation strategy is optimal initially, but a sequence of ‘good’ realizations of the drug price may pull the epidemic down so that the eradication strategy becomes optimal. However, once this area of uncertainty is left, we can say that the drug epidemic’s fate is sealed. An immediate and indeed positive conclusion is that with our stochastic approach we can confirm well-established results from the deterministic case, but of course the solution structure is much richer and more realistic. We further showed that the size of the variance has a strong impact on the optimal solution. If the variance is small enough, the system behaves as in the deterministic case (except for the fact that one equilibrium extends from a point to an invariant set). For intermediate values of the variance, a stochastic DNSS set emerges from what was previously a single point. Finally, if the variance becomes high enough, the DNSS property disappears due to the loss of invariance of the upper equilibrium and we observe near invariance. These results are consistent with those by [6,16]. The immediate next goal is to compute a full bifurcation diagram with respect to the variance where we expect to be able to determine that bifurcation point where the invariant set constituting the high equilibrium loses its invariance and becomes transient or ‘nearly invariant’. Many extensions seem to be worthwhile. (1) For future work it seems necessary to clarify the theoretical concepts of stochastic DNSS sets to provide a concise definition. Therefore we plan to concentrate on simpler models exhibiting the DNSS property, which may help to work out more precisely what is model specific and what is an inherent property of DNSS sets. (2) The step (back) from the discrete to the continuous case seems important for some following work. (3) We increased the variance of the three-point random variable by increasing and decreasing the extremal values. Alternatively, this could be done by enlarging the probabilities of the extremal values. It will be interesting to see if the latter approach may also give rise to the observed Skiba/DNSS phenomenon.
Acknowledgement This research was partly financed by the Austrian Science Fund (FWF) within the research grant No. P18527-G14 (“Optimal Control of Illicit Drug Epidemics”). We thank three anonymous referees and Lars Grüne from the University of Bayreuth for many constructive ideas for how to improve earlier versions of this paper.
References 1. Behrens, D.A.: The US cocaine epidemic: an optimal control approach to heterogeneous consumption behavior. PhD Thesis, Institute for Econometrics, Operations Research and Systems Theory, Vienna University of Technology (1998)
2. Behrens, D.A., Caulkins, J.P., Tragler, G., Feichtinger, G.: Optimal control of drug epidemics: prevent and treat – but not at the same time? Management Science 46(3), 333–347 (2000) 3. Bertsekas, D.P.: Dynamic Programming and Optimal Control, 2nd edn., vol. II. Athena Scientific, Belmont (2001) 4. Bultmann, R., Caulkins, J.P., Feichtinger, G., Tragler, G.: Modeling Supply Shocks in Optimal Control Models of Illicit Drug Consumption. In: Lirkov, I., Margenov, S., Wa´sniewski, J. (eds.) LSSC 2007. LNCS, vol. 4818, pp. 285–292. Springer, Heidelberg (2008) 5. Bultmann, R., Caulkins, J.P., Feichtinger, G., Tragler, G.: How should policy respond to disruptions in markets for illegal drugs? Contemporary Drug Problems 35(2&3), 371–395 (2008) 6. Colonius, F., Gayer, T., Kliemann, W.: Near invariance for Markov diffusion systems. SIAM Journal of Applied Dynamical Systems 7(1), 79–107 (2008) 7. Colonius, F., Homburg, A.J., Kliemann, W.: Near invariance and local transience for random diffeomorphisms. Journal of Difference Equations and Applications (forthcoming) 8. Dechert, W.D., Nishimura, K.: A complete characterization of optimal growth paths in an aggregated model with a non-concave production function. Journal of Economic Theory 31(2), 332–354 (1983) 9. Dechert, W.D., O’Donnell, S.I.: The stochastic lake game: A numerical solution. Journal of Economic Dynamics and Control 30, 1569–1587 (2006) 10. Degenhardt, L., Reuter, P., Collins, L., Hall, W.: Evaluating explanations of the Australian heroin drought. Addiction 100, 459–469 (2005) 11. Grass, D., Caulkins, J.P., Feichtinger, G., Tragler, G., Behrens, D.A.: Optimal Control of Nonlinear Processes — With Applications in Drugs, Corruption, and Terror. Springer, Heidelberg (2008) 12. Gr¨ une, L.: Error estimation and adaptive discretization for the discrete stochastic Hamilton-Jacobi-Bellman equation. Numerische Mathematik 99, 85–112 (2004) 13. Sethi, S.P.: Nearest feasible paths in optimal control problems: Theory, examples, and counterexamples. Journal of Optimization Theory and Applications 23(4), 563–579 (1977) 14. Sethi, S.P.: Optimal advertising policy with the contagion model. Journal of Optimization Theory and Applications 29(4), 615–627 (1979) 15. Skiba, A.K.: Optimal growth with a convex-concave production function. Econometrica 46(3), 527–539 (1978) 16. Stachursky, J.: Stochastic growth with increasing returns: stability and path dependence. Studies in Nonlinear Dynamics & Econometrics 7(2), Article 1 (2003) 17. Tragler, G.: Optimal Control of Illicit Drug Consumption: Treatment versus Enforcement. PhD Thesis, Institute for Econometrics, Operations Research and Systems Theory, Vienna University of Technology (1998) 18. Tragler, G., Caulkins, J.P., Feichtinger, G.: Optimal dynamic allocation of treatment and enforcement in illicit drug control. Operations Research 49(3), 352–362 (2001) 19. Weatherburn, D., Jones, C., Freeman, K., Makkai, T.: Supply control and harm reduction: Lessons from the Australian heroin “drought”. Addiction 98(1), 83–91 (2002)
Classical and Relaxed Optimization Methods for Nonlinear Parabolic Optimal Control Problems I. Chryssoverghi, J. Coletsos, and B. Kokkinis Department of Mathematics, National Technical University of Athens Zografou Campus, 15780 Athens, Greece
[email protected],
[email protected],
[email protected]
Abstract. A distributed optimal control problem is considered, for systems defined by parabolic partial differential equations. The state equations are nonlinear w.r.t. the state and the control, and the state constraints and cost depend also on the state gradient. The problem is first formulated in the classical and in the relaxed form. Various necessary conditions for optimality are given for both problems. Two methods are then proposed for the numerical solution of these problems. The first is a penalized gradient projection method generating classical controls, and the second is a penalized conditional descent method generating relaxed controls. Using relaxation theory, the behavior in the limit of sequences constructed by these methods is examined. Finally, numerical examples are given.
1
Classical and Relaxed Optimal Control Problems
Let Ω be a bounded domain in Rd with boundary Γ , and I = (0, T ), T < ∞, an interval. Set Q := Ω×I, Σ := Γ ×I, and consider the parabolic state equation yt + A(t)y + a0 (x, t)T ∇y + b(x, t, y(x, t), w(x, t)) = f (x, t, y(x, t), w(x, t)) in Q, y(x, t) = 0 in Σ, y(x, 0) = y 0 (x) in Ω, where A(t) is the second order elliptic differential operator A(t)y := −
d d
(∂/∂xi )[aij (x, t)∂y/∂xj ].
j=1 i=1
We denote by | · | the Euclidean norm in Rn , by (·, ·) and · the inner product and norm in L2 (Ω), by (·, ·)Q and · Q the inner product and norm in L2 (Q), by (·, ·)1 and · 1 the inner product and norm in the Sobolev space V := H01 (Ω), and by < ·, · > the duality bracket between the dual V ∗ = H −1 (Ω) and V . The state equation will be interpreted in the following weak form < yt , v > +a(t, y, v) + (a0 (t)T ∇y, v) + (b(t, y, w), v) = (f (t, y, w), v), ∀v ∈ V, a.e. in I, I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 247–255, 2010. c Springer-Verlag Berlin Heidelberg 2010
248
I. Chryssoverghi, J. Coletsos, and B. Kokkinis
y(t) ∈ V a.e. in I, y(0) = y 0 , where the derivative yt is understood in the sense of V -vector valued distributions, and a(t, ·, ·) denotes the usual bilinear form on V × V associated with A(t) d d ∂y ∂v a(t, y, v) := a (x, t) ∂x dx. Ω ij i ∂xj j=1 i=1
We define the set of classical controls W := {w : Q → U | w measurable} ⊂ L∞ (Q) ⊂ L2 (Q),
where U is a compact subset of Rd , and the functionals Gm (w) := Q gm (x, t, y, ∇y, w)dxdt, m = 0, ..., q. The classical optimal control problem is to minimize G0 (w) subject to the constraints w ∈ W,
Gm (w) = 0,
m = 1, ..., p,
Gm (w) 0,
m = p + 1, ..., q.
It is well known that, even if the set U is convex, the classical problem may have no solutions. The existence of such a solution is usually proved under strong, often unrealistic for nonlinear systems, convexity assumptions such as the Cesari property. Reformulated in the so-called relaxed form, the problem is convexified in some sense and has a solution in a larger space under weaker assumptions. Next, we define the set of relaxed controls (Young measures; for the relevant theory, see [6], [5]) 1 ∗ R := {r : Q → M1 (U ) |r weakly measurable} ⊂ L∞ w (Q, M (U )) ≡ L (Q, C(U )) ,
where M (U ) (resp. M1 (U )) is the set of Radon (resp. probability) measures on U . The set R is endowed with the relative weak star topology of L1 (Q, C(U ))∗ , and R is convex, metrizable and compact. If we identify every classical control w(·) with its associated Dirac relaxed control r(·) = δw(·) , then W may be also regarded as a subset of R, and W is thus dense in R. For φ ∈ L1 (Q, C(U )) = ¯ C(U )) (or φ ∈ B(Q, ¯ U ; R), where B(Q, ¯ U ; R) is the set of Caratheodory L1 (Q, functions in the sense of Warga [6]) and r ∈ L∞ w (Q, M (U )) (in particular, for r ∈ R), we shall use for simplicity the notation φ(x, t, r(x, t)) := U φ(x, t, u)r(x, t)(du), where φ(x, t, r(x, t)) is thus linear (under convex combinations, if r ∈ R) in r. A sequence (rk ) converges to r ∈ R in R iff lim Q φ(x, t, rk (x, t))dxdt = Q φ(x, t, r(x, t))dxdt, k→∞
¯ U ; R), or φ ∈ C(Q ¯ × U ). for every φ ∈ L1 (Q; C(U )), or φ ∈ B(Q, The relaxed optimal control problem is then defined by replacing w by r (with the above notation) and W by R in the continuous classical problem. In what follows, we shall make some of the following groups of assumptions.
Classical and Relaxed Optimization Methods
249
Assumptions 1. The boundary Γ is Lipschitz (e.g. appropriately piecewise C 1 , or polyhedral) if b = 0; else, Γ is C 1 and n 3. The coefficients aij satisfy the ellipticity condition d d
aij (x, t)zi zj θ
j=1 i=1
d i=1
zi2 ,
∀zi , zj ∈ R,
∀(x, t) ∈ Q,
with θ > 0, aij ∈ L∞ (Q). We have a0 ∈ L∞ (Q)d , and the functions b,f are defined on Q×R×U, measurable for fixed y, u, continuous for fixed x, t, and satisfy |b(x, t, y, u)| φ(x, t) + βy 2 , b(x, t, y, u)y 0, |f (x, t, y, u)| ψ(x, t) + γ |y| , ∀(x, t, y, u) ∈ Q × R × U, with φ, ψ ∈ L2 (Q), β, γ 0, b(x, t, y1 , u) b(x, t, y2 , u), ∀(x, t, y1 , y2 , u) ∈ Q × R2 × U, with y1 y2 , |f (x, t, y1 , u) − f (x, t, y2 , u)| L |y1 − y2 | , ∀(x, t, y1 , y2 , u) ∈ Q × R2 × U. Assumptions 2. The functions gm are defined on Q × Rd+1 × U, measurable for fixed y, y¯, u, continuous for fixed x, t, and satisfy 2 |gm (x, t, y, y¯, u)| ζm (x, t) + δm y 2 + δ¯m |¯ y| ,
∀(x, t, y, y¯, u) ∈ Q × Rd+1 × U,
with ζm ∈ L1 (Q), δm 0, δ¯m 0. Assumptions 3. The functions b, by , bu , f, fy , fu (resp. gm , gmy , gm¯y , gmu ) are ˜ ), where U ˜ is an open set containing ˜ (resp. Q × Rd+1 × U defined on Q × R × U U , measurable on Q for fixed (y, u) ∈ R × U (resp. (y, y¯, u) ∈ Rd+1 × U ) and continuous on R × U (resp. Rd+1 × U ) for fixed (x, t) ∈ Q, and satisfy |by (x, t, y, u)| ξ1 (x, t) + η1 |y| , |fy (x, t, y, u)| L1 , |bu (x, t, y, u)| ζ2 (x, t) + η2 y 2 , |fu (x, t, y, u)| ξ3 (x, t) + η3 |y|, ∀(x, t, y, u) ∈ Q × R × U, |gmy (x, t, y, y¯, u)| ζm1 (x, t) + δm1 |y| + δ¯m1 |¯ y| , y| , |gm¯y (x, t, y, y¯, u)| ζm2 (x, t) + δm2 y 2 + δ¯m2 |¯ 2 |gmu (x, t, y, y¯, u)| ζm3 (x, t) + δm3 y 2 + δ¯m3 |¯ y| , ∀(x, t, y, y¯, u) ∈ Q × Rd+1 × U, with ξ1 , ξ2 , ξ3 , ζm1 , ζm2 , ζm3 ∈ L2 (Q), η1 , η2 , η3 , δm1 , δ¯m1 , δm2 , δ¯m2 , δm3 , δ¯m3 0. Proposition 1. Under Assumptions 1, for every control r ∈ R and y 0 ∈ L2 (Ω), the relaxed state equation has a unique solution y := yr such that y ∈ L2 (I, V ) and yt ∈ L2 (I, V ∗ ). Proposition 2. Under Assumptions 1, the operator w
→ yw , from W to = 0, and the operator r
→ yr , from R to L2 (I, V ), and to L2 (I, L4 (Ω)) if b L2 (I, V ), and to L2 (I, L4 (Ω)) if b = 0, are continuous. Under Assumptions 1 → Gm (r) on R, are continuous. and 2, the functionals w
→ Gm (w) on W , and r
Theorem 1. Under Assumptions 1 and 2, if the relaxed problem is feasible, then it has a solution.
250
I. Chryssoverghi, J. Coletsos, and B. Kokkinis
We give below some results concerning necessary conditions for optimality, which can be proved by using the techniques of [6] and [4] (see also [5]). Lemma 1. We suppose that Assumptions 1-3 hold and that the derivatives in u are excluded in Assumptions 3. Dropping the index m in gm , Gm , the directional derivative of the functional G, defined on R, is given by
DG(r, r − r) := lim G(r+ε(r −r))−G(r) ε ε→0+ = Q H(x, t, y, ∇y, z, r (x, t) − r(x, t))dxdt, for r, r ∈ R, where the Hamiltonian H is defined by H(x, t, y, y¯, z, u) := z[f (x, t, y, u) − b(x, t, y, u)] + g(x, t, y, y¯, u), and the adjoint state z := zr satisfies the linear adjoint equation − < zt , v > +a(t, v, z) + (aT0 ∇v, z) + (zby (y, r), v) = (zfy (y, r) + gy (y, r), v) + (gy¯(y, ∇y, r), ∇v), z(t) ∈ V a.e. in I,
∀v ∈ V, a.e. in I,
z(T ) = 0, with y := yr .
The mappings r
→ zr , from R to L2 (Q), and (r, r )
→ DG(r, r − r), from R × R to R, are continuous. Theorem 2. Under Assumptions 1-3 and with the derivatives in u excluded in Assumptions 3, if r ∈ R is optimal for either the relaxed or the classical problem, then r is strongly extremal relaxed, i.e. there exist multipliers λm ∈ R, q m = 0, ..., q, with λ0 0, λm 0, m = p + 1, ..., q, |λm | = 1, such that m=0 q (1) λm DGm (r, r − r) 0, ∀r ∈ R, m=0
(2) λm Gm (r) = 0, m = p + 1, ..., q (relaxed transversality conditions). The condition (1) is equivalent to the strong relaxed pointwise minimum principle (3)
H(x, t, y(x, t), ∇y(x, t), z(x, t), r(x, t))
=
min H(x, t, y(x, t), ∇y(x, t), u∈U
z(x, t), u), a.e. in Q, where the complete Hamiltonian and adjoint H, z are deq fined with g := λm gm . m=0
If in addition U is convex, then the minimum principle (3) implies the weak relaxed pointwise minimum principle (4) Hu (x, t, y, ∇y, z, r(x, t))r(x, t) = min Hu (x, t, y, ∇y, z, r(x, t))φ(x, t, r(x, t)), a.e. in Q, φ
where the minimum is taken over the set B(Q, U ; U ) of Caratheodory functions φ : Q × U → U (see [6]), which in turn implies the global weak relaxed condition (5) Q Hu (x, t, y, ∇y, z, r(x, t))[φ(x, t, r(x, t))−r(x, t)]dxdt 0, ∀φ ∈ B(Q, U ; U ). A control r satisfying the conditions (5) and (2) is called weakly extremal relaxed.
Classical and Relaxed Optimization Methods
251
Lemma 2. Under Assumptions 1-3 and dropping the index m, the directional derivative of the functional G, here defined on W , is given by DG(w, w − w) = lim+ G(w+ε(w ε−w)−G(w) ε→0 = Q Hu (x, t, y, ∇y, z)(w − w)dxdt, for w, w ∈ W , where the adjoint state z := zw satisfies the equation − < zt , v > +a(t, v, z) + (aT0 ∇v, z) + (zby (y, w), v) = (zfy (y, w) + gy (y, ∇y, w), v) + (gy¯(y, ∇y, w), ∇v), z(t) ∈ V a.e. in I,
z(T ) = 0,
∀v ∈ V, a.e. in I,
with y := yw .
The mappings w
→ zw , from W to L2 (Q), and (w, w )
→ DG(w, w − w), from W × W to R, are continuous. Theorem 3. We suppose that Assumptions 1-3 hold and that U is convex. If w ∈ W is optimal for the classical problem, then w is weakly extremal classical, i.e. there exist multipliers λm as in Theorem 2 such that q λm DGm (w, w − w) 0, ∀w ∈ W , (6) m=0
(7) λm Gm (w) = 0,
m = p + 1, ..., q
(classical transversality conditions).
The condition (6) is equivalent to the weak classical pointwise minimum principle (8) Hu (x, t, y, ∇y, z, w(x, t)) w(x, t) = min Hu (x, t, y, ∇y, z, w(x, t)) u, a.e. in Q, u∈U
where the complete Hamiltonian and adjoint H, z are defined with g :=
q
λm gm .
m=0
2
Classical and Relaxed Optimization Methods
l l Let (Mm ), m = 1, ..., q, be positive increasing sequences such that Mm → ∞ l as l → ∞, γ > 0, b , c ∈ (0, 1), and (β ), (ζk ) positive sequences, with (β l ) decreasing and converging to zero, and ζk 1. Define first the penalized functionals on W p q l l Mm [Gm (w)]2 + Mm [max(0, Gm (w))]2 }/2. Gl (w) := G0 (w) + { m=1
m=p+1
It can be easily shown that the directional derivative of Gl is given by DGl (w, w − w) = DG0 (w, w − w) p l + Mm Gm (w)DGm (w, w − w) m=1 q
+
l Mm max(0, Gm (w))DGm (w, w − w).
m=p+1
The classical penalized gradient projection method is described by the following Algorithm, where U is assumed to be convex.
252
I. Chryssoverghi, J. Coletsos, and B. Kokkinis
Algorithm 1 Step 1. Set k := 0, l := 1, and choose an initial control w01 ∈ W . Step 2. Find vkl ∈ W such that 2 2 ek := DGl (wkl , vkl −wkl )+ γ2 vkl − wkl Q = min [DGl (wkl , v¯ −wkl )+ γ2 v¯ − wkl Q ], v ¯∈W
and set dk := DGl (wkl , vkl − wkl ). Step 3. If |dk | β l , set wl := wkl , v l := vkl , dl := dk , el := ek , wkl+1 := wkl , l := l + 1, and then go to Step 2. Step 4. (Modified Armijo Step Search) Find the lowest integer value s ∈ Z, say s¯, such that α(s) = cs ζk ∈ (0, 1] and α(s) satisfies Gl (wkl + α(s)(vkl − wkl )) − Gl (wkl ) α(s)b dk , and then set αk := α(¯ s). l Step 5. Set wk+1 := wkl + αk (vkl − wkl ), k := k + 1, and go to Step 2.
A (classical or relaxed) extremal (or weakly extremal) control is called abnormal if there exist multipliers as in the corresponding optimality conditions, with λ0 = 0.An admissible extremal control is abnormal in rather exceptional situations (see [6]). With wl as defined in Step 3, define the sequences of multipliers l λlm := Mm Gm (wl ), m = 1, ..., p, l λlm := Mm max(0, Gm (wl )), m = p+1, ..., q.
Theorems 4 and 5 below are proved by using the techniques of [4]. Theorem 4. We suppose that Assumptions 1-3 hold and that U is convex. l(k) (i) In the presence of state constraints, if the whole sequence (wk ) generated 2 by Algorithm 1 converges to some w ∈ W in L strongly and the sequences (λlm ) are bounded, then w is admissible and weakly extremal for the classical problem. In the absence of state constraints, if a subsequence (wk )k∈K (no index l) converges to some w ∈ W in L2 strongly, then w is weakly extremal classical for the classical problem. (ii) In the presence of state constraints, if a subsequence (wl )l∈L of the sequence generated by Algorithm 1 in Step 3, regarded as a sequence of relaxed controls, converges to some r in R, and the sequences (λlm ) are bounded, then r is admissible and weakly extremal relaxed for the relaxed problem. In the absence of state constraints, if a subsequence (wk )k∈K (no index l) converges to some r in R, then r is weakly extremal relaxed for the relaxed problem. (iii) In any of the convergences cases (i) or (ii) with state constraints, suppose that the classical, or the relaxed, problem has no admissible, abnormal extremal, controls. If the limit control is admissible, then the sequences (λlm ) are bounded, and this control is also extremal as above. Next, define the penalized discrete functionals on R p q l l Mm [G0 (r)]2 + Mm [max(0, Gm (r))]2 }/2. Gl (r) := G0 (r) + { m=1
m=p+1
Classical and Relaxed Optimization Methods
253
The directional derivative of Gl is given by DGl (r, r − r) = DG0 (r, r − r) p l + Mm Gm (r)DGm (r, r − r) + m=1
q m=p+1
l Mm max(0, Gm (r))DGm (r, r − r).
The relaxed penalized conditional descent method is described by the following Algorithm, where U is not necessarily convex. Algorithm 2 Step 1. Set k := 0, l := 1, choose an initial control r01 ∈ R. Step 2. Find r¯kl ∈ R such that dk := DGl (rkl , r¯kl − rkl ) = min DGl (rkl , r − rkl ). r ∈R
Step 3. If |dk | β l , set rl := rkl , r¯l := r¯kl , dl := dk , rkl+1 := rkl , l := l + 1, and then go to Step 2. Step 4. Find the lowest integer value s ∈ Z, say s¯, such that α(s) = cs ζk ∈ (0, 1] and α(s) satisfies Gl (rkl + α(s)(¯ rkl − rkl )) − Gl (rkl ) α(s)bdk , and then set αk := α(¯ s). l Step 5. Choose any rk+1 ∈ R such that l ) Gl (rkl + αk (¯ rkl − rkl )), Gl (rk+1
set k := k + 1, and go to Step 2. If the initial control r01 is classical, using Caratheodory’s theorem, it can be l shown by induction that the control rk+1 in Step 5 can be chosen, for each iteration k, to be a Gamkrelidze relaxed control, i.e. a convex combination of a fixed number of classical (i.e. Dirac) controls. With rl as defined in Step 3, define the sequences of multipliers l Gm (rl ), m = 1, ..., p, λlm := Mm l l λm := Mm max(0, Gm (rl )), m = p + 1, ..., q.
Theorem 5. We suppose that Assumptions 1-3 hold, with the derivatives in u excluded in Assumptions 3. (i) In the presence of state constraints, if the subsequence (rl )l∈L of the sequence generated by Algorithm 2 in Step 3 converges to some r in R and the sequences (λlm ) are bounded, then r is admissible and strongly extremal relaxed for the relaxed problem. In the absence of state constraints, if a subsequence (rk )k∈K (no index l) converges to some r in R, then r is strongly extremal relaxed for the relaxed problem. (ii) In case (i) with state constraints, suppose that the relaxed problem has no admissible, abnormal extremal, controls. If r is admissible, then the sequences (λlm ) are bounded and r is also strongly extremal relaxed for the relaxed problem.
254
I. Chryssoverghi, J. Coletsos, and B. Kokkinis
In practice, by choosing in the above algorithms moderately growing sequences l (Mm ) and a sequence (β l ) relatively fast converging to zero, the resulting sequences of multipliers (λlm ) are often kept bounded. One can choose a fixed ζk := ζ ∈ (0, 1] in Step 4; a usually faster and adaptive procedure is to set ζ0 := 1, and ζk := αk−1 , for k 1.
3
Numerical Examples
Let Ω = I = (0, 1). Example 1. Define the reference control and state
−1, (x, t) ∈ Ω × (0, 0.25] −1 + 32(t − 0.25)x(1 − x)/3, (x, t) ∈ Ω × (0.25, 1] y¯(x, t) := x(1 − x)et , w(x, ¯ t) :=
and consider the following optimal control problem, with state equation yt − yxx + 0.5y |y| = 0.5¯ y |¯ y | + [x(1 − x) + 2]et + y − y¯ + w − w ¯ in Q, y = 0 in Σ, y(x, 0) = y¯(x, 0) in Ω, control constraint set U := [−1, 1], and cost functional 2 y | + (w − w) ¯ 2 ]dxdt. G0 (w) := 0.5 Q [(y − y¯)2 + |∇y − ∇¯ Clearly, the optimal control, state, and cost are w, ¯ y¯, 0. Algorithm 1, without penalties, was applied to this problem using the finite element method with piecewise linear basis functions and the Crank-Nicolson time scheme (step sizes 0.01), with piecewise constant classical controls, and γ = 0.5, b = c = 0.5. After 20 iterations, we obtained the results G0 (wk ) = 4.732·10−8, dk = −1.133·10−15, εk = 1.523·10−4, ηk = 1.596·10−4, where dk was defined in Step 2 of Algorithm 1, εk is the state max-error at the (x, t)-mesh points, and ηk is the control max-error at the centers of the (x, t)-blocks. Example 2. Introducing the state constraint G1 (w) := Q [y(x, t) − 0.3]dxdt = 0, in Example 1, with U := [−1, 0.8], and applying the penalized Algorithm 1, we obtained G0 (wk ) = 1.332334091748·10−2 , G1 (wk ) = −7.229·10−6, dk = −4.279·10−6. Example 3. Define the reference control and state 1, if t ∈ [0, 0.5] w(x, ¯ t) := y¯(x, t) := x(1 − x)e−t , 1 − 2(t − 0.5)(0.2x + 0.4), if t ∈ (0.5, 1], and consider the following optimal control problem, with state equation ¯ yt − yxx + 0.5y |y| + (1 + w − w)y
Classical and Relaxed Optimization Methods
255
= 0.5 y¯ |¯ y | + y¯ + [−x(1 − x) + 2]e−t + y − y¯ + 3(w − w), ¯ y(x, t) = 0 on Σ, y(0, x) = x(1 − x) in Ω, nonconvex control constraint set U := [0, 0.2] ∪ [0.8, 1](or U := {0, 1}, on/off type control), and nonconvex cost functional 2 G0 (u) := Q {0.5 [(y − y¯)2 + |∇y − ∇¯ y | ] − (w − 0.5)2 + 0.25}dxdt One can easily verify that the unique optimal relaxed control r is given by r(x, t){1} := w(x, ¯ t),
r(x, t){0} := 1 − r(x, t){1},
(x, t) ∈ Q,
with optimal state y¯ and cost 0, and we see that r is concentrated at the two points 1 and 0; r is classical if t ∈ [0, 0.5], and non-classical otherwise. Algorithm 2, without penalties, was applied to this problem using the same approximation schemes and step sizes as in Example 1, here with piecewise constant relaxed controls. After 100 iterations in k, we obtained G0 (rk ) = 4.053 · 10−5 ,
dk = −1.430 · 10−4 ,
ηk = 5.976 · 10−3 ,
where dk was defined in Algorithm 2 and ηk is the max state error at the (x, t)mesh points. Example 4. We introduce the equality state constraint G1 (w) := Q ydxdt = 0, in Example 3. Applying here the penalized Algorithm 2, we obtained after 210 iterations in k G0 (rkl ) = 7.515191183881 · 10−2 , G1 (rkl ) = 3.574 · 10−5 , dk = −3.536 · 10−3 .
References 1. Bartels, S.: Adaptive approximation of Young measure solution in scalar non-convex variational problems. SIAM J. Numer. Anal. 42, 505–629 (2004) 2. Cartensen, C., Roub´ıˇcek, T.: Numerical approximation of Young measure in nonconvex variational problems. Numer. Math. 84, 395–415 (2000) 3. Chryssoverghi, I., Coletsos, J., Kokkinis, B.: Discrete relaxed method for semilinear parabolic optimal control problems. Control Cybernet 28, 157–176 (1999) 4. Chryssoverghi, I., Coletsos, J., Kokkinis, B.: Discretization methods for optimal control problems with state constraints. J. Comput. Appl. Math. 191, 1–31 (2006) 5. Roub´ıˇcek, T.: Relaxation in Optimization Theory and Variational Calculus. Walter de Gruyter, Berlin (1997) 6. Warga, J.: Optimal Control of Differential and Functional Equations. Academic Press, New York (1972) 7. Warga, J.: Steepest descent with relaxed controls. SIAM J. Control Optim. 15, 674–682 (1977)
Exponential Formula for Impulsive Differential Inclusions Tzanko Donchev Department of Mathematics University of Architecture and Civil Engineering 1 “Hr. Smirnenski” str., 1046 Sofia, Bulgaria
[email protected] Abstract. This paper studies the graph of the reachable set of a differential inclusion with non-fixed time impulses. Using approximation in L1 –metric, we derive exponential characterization of the reachable set.
1
Preliminaries
One of the most important questions, studying optimal control problems is the description of the (time dependent) reachable set. Here we study its graph. We consider a system having the form: = τi (x(t)), x(t) ˙ ∈ F (x(t)), x(0) = x0 a.e. t ∈ I = [0, 1], t Δx|t=τi (x(t)) = Si (x), i = 1, . . . , p,
(1)
Here F : Rn ⇒ Rn is a multifunction with nonempty compact and convex values, Si : Rn → Rn are impulses (jumps) and τi : Rn → R are the times of impulses. Notice that the times of impulses of the solution x(·) are the fixed points of the maps t → τi (x(t)), i = 1, . . . , p. For convenience we will write simply τi (not τi (x(t))). Recall (compare [12, p. 34]) that the piecewise absolutely continuous function x(·) is said to be a solution of (1) if: a) x(·) is left continuous and satisfies (1) for almost all t ∈ I, t = τi (x(t)) b) It has (possible) jumps on t = τi (x(t)) (discontinuities of the first kind), defined by: Δxt=τi (x(t)) = x(t + 0) − x(t) = Si (x(t)). The aim of the paper is the characterization of the reachable set of (1). The Euler approximation studied in [2] (when the right-hand side is Lipschitz) and in [5] (under stronger assumptions on impulsive surfaces) is good for the solution set, however, in general it is not appropriate to estimate the reachable set. Now we give some definitions and notation. We refer to [4] for all concepts used in this paper but not explicitly discussed. For a bounded set A ⊂ Rn denote by σ(l, A) = sup l, a its support function. The Hausdorff distance between the a∈A
bounded sets A and B is: DH (A, B) = max{sup inf |a−b|, sup inf |a−b|}. Define a∈A b∈B
b∈B a∈A
|A| = haus(A, {0}). Let Y, K be norm spaces. A multifunction F : Y → K is I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 256–263, 2010. c Springer-Verlag Berlin Heidelberg 2010
Exponential Formula for Impulsive Differential Inclusions
257
said to be upper semi continuous (USC) (lower semi continuous – LSC) at x ∈ X if for every ε > 0 there exists δ > 0 such that F (x + δB) ⊂ F (x0 ) + εB (if for any y0 ∈ F (x0 ) and xi → x0 there exists yi ∈ F (xi ) with yi → y0 ). Here B denotes the unit ball centered in the origin. The multimap F is called continuous if it is continuous with respect to the Hausdorff distance. Denote hF (p, x) = −σ(−p, F (x)) as lower Hamiltonian, HF (p, x) = σ(p, F (x)) as upper Hamiltonian. A multifunction F : Rn ⇒ Rn is said to be one-sided Lipschitz (OSL) with a constant L if for each x, y ∈ Rn HF (x − y, x) − HF (x − y, y) ≤ L|x − y|2 . Standing hypotheses (SH) We suppose that F (·) is USC with linear growth condition i.e. there exist A > 0, B > 0 such that |F (x)| ≤ A + B|x|. A1. τi (·) are N –Lipschitz and Si : Rn → Rn are μ–Lipschitz. A2. τi (x + Si (x)) = τj (x) ∀ j = i for every x ∈ Rn . For i = 1, . . . , p (i ≤ p − 1 if needed) just one of the following hypotheses holds: A3. τk (x) ≥ τk (x + Sk (x)), τi (x) < τi+1 (x) for every x ∈ Rn and there exist constants δ > 0 and μ < 1 such that HF (∇τi (x), x+ δB) ≤ μ at the points where the gradient ∇τi (·) exists. A4. τi (x) ≤ τi (x + Si (x)) and τi (x) > τi+1 (x) for every x ∈ Rn . There exist constants α > 1 and δ > 0 such that hF (∇τi (x), x + δB) ≥ α at the points x, where the gradient exists. Notice that these conditions are needed in fact only for a neighborhood of the domain where the values ⎛ of solutions⎞lie. co ⎝ ∇τi (y)⎠. It is well known that it is Clarke’s Let ∂x τi (x) = δ>0
y∈x+δB
subdifferential of τi (·) (cf. [3]).
Proposition 1. Under the standing hypotheses for every i = 1, . . . , p: L 1) HF (l, x + ε B) ≤ α ∀ l ∈ ∂x τi (x) and all ε < ε in case of A3. 2) hF (l, x + ε B) ≥ κ ∀ l ∈ ∂x τi (x) and all ε < ε in case of A4. The proof is almost obvious. Indeed If δ < ε, then F (y + εB) ⊃ F (x + δB) for every y ∈ x + (ε − δ)B. Furthermore lim σ(y, F (x + δB) ≥ σ(x, F (x + δB). y→x
Lemma 1. Let ε > 0 be such that A1, A2, A4 hold. For every solution of x(t) ˙ ∈ F (x(t) + εB), x(0) = x0 a.e. t ∈ I = [0, 1], t = τi (x(t)), Δx|t=τi (x(t)) = Si (x), i = 1, . . . , p.
(2)
(if it exists) the equation t = τi (x(t)) admits no more than one solution (for t). Proof. The proof s contained in [2]. However, we include it (in case of A4) here.
258
T. Donchev
Let x(·) be a solution. Consider the functions fi (t) = τi (x(t)) − t. If t is a jump point of x(·) then fi (t + 0) ≥ fi (t) due to the assumption A4. Let (s, t) be in an interval of continuity. Thus due to change rule formula for Clarke’s subdifferential: ≥s−t+
t
fi (t) − fi (s) = s − t + τi (x(t)) − τi (x(s))
(3)
−σ(x(r), ˙ ∂x (τi (x(t))) ≥ (t − s)(κ − 1) > 0
(4)
s
Therefore for every i the functions fi (·) are strongly increasing (except the points when some fj (·) with j = i has a jump) and hence fi (t) = 0 has at most one solution, which is a possible jump point.
Corollary 1. Under the conditions of Lemma 1 there exists a constant λ > 0 such that for every solution y(·) of (1) τi+1 (y(t)) − τi (y(t)) ≥ λ for i = 1, 2, . . . , p − 1 and every t ∈ [0, 1]. We let gi (t, x) = τi (x) − t. For X ⊆ Rn , t ∈ I and h > 0 (t + h ≤ 1), the one-step Euler set-valued map is given by E(t, h, X) = {x + hf +
p
χi (t, x, f )Si (x) : x ∈ X, f ∈ F (x)},
i=1
and we write E(t, h, X)1 = E(t, h, X). Here χi (t, x, f ) = 1 when gj (t, x)gj (t + h, x+hf ) > 0 ∀j < i and either gi (t, x)gi (t+h, x+hf ) < 0 or gi (t+h, x+hf ) = 0. In the other cases χi (t, x, f ) = 0. n The k-th step Euler map is defined recursively as E(t, h, X)k = E(t + (k − k=1
1)h, h, X) for k = 2, 3, .... We write E(0, h, x) as E(h, x). T and let ti = ih, i = For a given natural number k and T > 0, set h = k 0, 1, . . . , k be a uniform grid on [0, T ]. The Euler polygonal arcs corresponding to (1) are defined as the linear interpolation of a discrete time trajectory {x(tj )}. These arcs are of the form for i = 1, . . . , p = τi (x(t)), (5) x(t) = x(tj + 0) + (t − tj )fj , fj ∈ F (x(tj )), t ∈ (tj , ti+1 ], t gi (tj , x)gi (tj+1 , x + hfj ) ≤ 0 ⇒ x(tj+1 + 0) = x(tj+1 ) + Si (x(tj+1 )). Here j = 0, 1, . . . , k − 1, x(0) = x0 . The paper is organized as follows: The Euler-type approximation of the solution set of impulsive (ODI) is presented in the next section, and Section 3 is devoted to the semigroup characterization of the reachable set using modified L1 –metric.
2
Euler Approximation of the Solution Set
In this section we study Euler’s discrete approximation of (1) with the scheme (5). We need the following lemma which is Lemma 2 of [11].
Exponential Formula for Impulsive Differential Inclusions
259
Lemma 2. Let a1 , a2 , b, d ≥ 0 and let δ0+ = δ0− = δ0 . If for i = 1, 2, . . . , p δi+ ≤ a1 δi− + d, then δi− ≤ (a2 d + b)
i−1
+ δi− ≤ a2 δi−1 +b
(a1 a2 )j + δ0 (a1 a2 )i , where δ0+ ≥ 0.
j=0
Studying the problem (2) we say that the measure of distance between two solutions Dist(x(·), y(·)) ≤ ε if they intersect successively the impulsive surfaces, i.e. τi (x) ≥ τm (y) is impossible for m > i and vice p versa. Moreover p
+
− − + τi − τi < ε and |x(t) − y(t)| < ε for every t ∈ I \ [τi , τi ] . Here i=1
i=1
τi− = min{τi (x(·)), τi (y(·))} and τi+ is the maximal one. The following theorem is proved in [6] in a more general form: Theorem 1. Under SH for small ε the problem (2) has a nonempty solution set and every solution exists on the whole interval I. Furthermore there exist constants K and M such that |x(t)| ≤ M and |F (x(t) + εB)| ≤ K for every solution x(·) of (2). Theorem 2. (Lemma of Filippov–Plis) Under the standing hypotheses and OSL condition there exists a constant K such that if ε > 0 is sufficiently small, then for for any solution y(·) of √ (2) with y(0) = y0 there exists a solution x(·) of (1) with Dist(x(·), y(·)) ≤ K ε when |x0 − y0 | < ε. Proof. Let y(·) be a solution of (2). It is easy to see that y(t) ˙ ∈ F (¯ y(t)), where |y(t)− y¯(t)| < ε. We are looking for a solution x(·) of (1), satisfying the condition of the theorem. First we assume x(·) does not intersect the i + 1-th surface before y(·) intersects i-th and vice versa. We let δi− = |y(τi− ) − x(τi− )| and δi+ = |y(τi+ + 0) − x(τi+ + 0)|. Define Γ (t, z) := u ∈ F (z) : ¯ y (t) − z, y(t) ˙ − u ≤ L|¯ y(t) − z|2 . It is easy to see that Γ (·, x) is measurable for every x and Γ (t, ·) is USC for a.e. t with nonempty convex and compact values. Let x(·) be a solution of: x(t) ˙ ∈ Γ (t, x(t)), x(0) = y(0).
(6)
− One has that ¯ y (t) − x(t), y(t) ˙ − x(t) ˙ ≤ L|¯ y (t) − x(t)|2 . Let t ∈ (τi+ , τi+1 ). Consequently
y(t) − x(t), y(t) ˙ − x(t) ˙ ≤ L|x(t) − y(t)|2 + ε|x(t) ˙ − y(t)| ˙ + 2|L|ε (|x(t)| + |y(t)| + ε) , 1 d |y(t) − x(t)|2 ≤ L|x(t) − y(t)|2 + 2ε (K + |L|(2M + ε)) . 2 dt Denote r2 (t) = |x(t) − y(t)|2 one has
d 2 r (t) ≤ 2Lr2 + 4(K + 2|L|M + ε)ε. dt
260
T. Donchev
Let a2 = max eLt . Since r2 (t) is less or equal of the solution of s(t) ˙ = 2Ls + t∈I
4(K + 2|L|M + ε)ε, one has that there exists a constant P such that r(t) ≤ √ + √ eL(t−τi ) ri + P ε. Denoting b = P ε we derive r(t) ≤ a2 ri + b with ri = + + + − |x(τi + 0) − y(τi + 0)| = δi . Hence δi+1 ≤ a2 δi+ + b. Assume that τi− = τi (x(τi− )) and τi+ = τi (y(τi+ )) (the method is the same if it is not the case). We have: |y(τi+ ) − x(τi− )| ≤ |y(τi+ ) − y(τi− )| + |y(τi− ) − x(τi− )| ≤ K(τi+ − τi− ) + δi− . τi+ − τi− = τi (y(τi+ )) − τi (x(τi− ))
≥ |τi (y(τi+ )) − τi (y(τi− ))| − |τi (y(τi− )) − τi (x(τi− ))| ≥ |τi (y(τi+ )) − τi (y(τi− ))| − N δi− . Moreover: τi (y(τi+ )) − τi (y(τi− )) =
τi+
τi−
τi+ d τi (y(t)) dt ≥ min l, y(t) ˙ dt. dt l∈∂x τi (x) τi−
Thus τi (y(τi+ )) − τi (y(τi− )) ≥ α(τi+ − τi− ) and N δi− ≥ (α − 1)(τi+ − τi− ) ⇒ τi+ − τi− ≤
N δi− . α−1
(7)
Consequently for δi+ we have: δi+ = |x(τi+ ) − y(τi+ + 0)| ≤ |x(τi− ) − y(τi− )| + |Si (x(τi− )) − Si (y(τi+ ))| τi+ + |x(t) ˙ − y(t)|dt ˙ ≤ δi− + 2K(τi+ − τi− ) + μ(K(τi+ − τi− ) + δi− ) = a1 δi− , τi−
2KN + (α − 1)(1 + μ + μKN ) . Hence Lemma 2 (with d = ε and α−1 a1 , a2 , b just defined) applies. Due to (7) one has that for sufficiently small ε λ τi+ − τi− ≤ , i.e. x(·) and y(·) intersect impulsive surfaces in the same order. 10 The rest of the proof is standard. The proof in case of A3 is similar and it is omitted.
where a1 =
The following theorem is true: Theorem 3. Under SH and OSL condition for k big enough we have: For every solution x(·) of (1) there exists a solution y(·) of (5) such that 1 Dist(x(·), y(·)) ≤ O √ and vice versa. k The proof is very similar to the proof of the corresponding result of [5].
Exponential Formula for Impulsive Differential Inclusions
3
261
L1 –Metric for Piecewise Lipschitz Functions and of the Reachable Sets of Impulsive ODI
Here we will compare the L1 metric and the commonly used distance proving the continuous dependence and approximation of the solutions. Let Imp,L be the set of all L-Lipschitz on (ti (x), ti+1 (x)] functions x(·) with no more than p jump points and ti+1 (x(·)) − ti (x(·) ≥ λ. Proposition 2. The space Imp,L equipped with the usual L1 (I, Rn ) norm becomes a complete metric space. Proof. One must only show that every Cauchy sequence {xk (·)}∞ k=1 converges to a Imp,L function, because Imp,L ⊂ L1 . By Egorov’s theorem if a sequence 1 {y k (·)}∞ k=1 of L-Lipschitz functions converges in L norm to y(·) then the latter is n ∞ also L-Lipschitz. Let {x (·)}n=1 be a Cauchy sequence in Imp,L . Denote by tnj (as j = 1, 2, . . . , k) the times of (possible) jumps of xn (·). Passing to subsequences if necessary one can assume that lim tnj = tj for j = 1, 2, . . . , k. Let 0 ≤ t1 ≤ t2 ≤ n→∞
· · · ≤ tk ≤ 1. Given ε > 0 we consider Iε =
k j=1
tj −
ε ε , tj + . It is easy to see k k
that L1 limit of this (sub)sequence on I \ Iε is L-Lipschitz function. Since it is valid for every ε > 0 one can conclude that the limit function x(·) is L-Lipschitz on every (tj , tj+1 ). The L1 limit is unique and hence x(·) ∈ Imp,L .
∞
Proposition 3. Let the sequence {xm (·)}m=1 of elements of Imp,L converges l with respect to L1 (I, Rn ) to x(·). If {τi }i=1 (l ≤ k) are the impulsive points of x(·), then for every ε > 0 there exists a number n(ε) such that |x(t) − xm (t)| < ε l for every m > n(ε) and every t ∈ I \ A, where A = (τi − ε, τi + ε). i=1
Proof. Due to Proposition 2 x(·) ∈ Imp,L . Hence l ≤ k. Notice that in our settings it is possible the values of some impulses to be 0. The points of impulses of x(·) will be denoted by ψj j = 1, . . . , l. As it was pointed out in the previous proof, due to Egorov’s theorem the limit 1 function x(·) must be L–Lipschitz on every (ψi , ψi+1 ). Fix ε < min (ψj+1 − 6 1≤j≤l−1 ψj ). We have to check that on every interval (ψj + ε, ψj+1 − ε) the convergence is uniform. Suppose the contrary, i.e. ∃ s > 0 such that for every m one has sup t∈(ψj +ε,ψj+1 −ε)
|x(t) − xm (t)| > s.
Consequently for every m there exists tm ∈ (ψj + ε, ψj+1 − ε) such that |x(tm ) − xm (tm )| > s. However x(·) is L–Lipscitz on (ψj + ε, ψj+1 − ε). If tm is not a s such that I = (xm − λ, xm + λ) ⊂ jump point of xm (·), then ∃ 0 < λ < 4L (ψj + ε, ψj+1 − ε) and moreover, both x(·) and xm (·) are L–Lipscitz on I. With trivial calculus one can show that |x(·) − xm (·)|L1 > 2Ls2 - contradiction. If tm
262
T. Donchev
is a jump point of xm (·) then the latter is L–Lipschitz on (xm − λ, xm ) and on (xm , xm + λ), i.e. |x(·) − xm (·)|L1 > Ls2 - a contradiction.
Proposition 4. In Imp,L : lim |xs (·) − x(·)|l1 = 0 if and only if lim Dist(xs (·), x(·)) = 0. s→∞
s→∞
The reachable set of (1) at the time t ∈ I is defined by: Rt (F, x(0)) = {x(t) : t ≥ 0, x(·) is a trajectory of (1) on [0, t] with x(0) = x0 }. If there are two impulsive systems with right-hand sides F (·) and G(·) (initial conditions x0 and y0 ). The reachable sets are Rt (F ) and Rt (G) with graphs D(F, x0 ) and D(G, y0 ) respectively. Let Dt = {x : (τ, x) ∈ D, τ = t} and dist(z, A) := inf |z − a|. a∈A
Definition 1. Let ExL1 (D, G) :=
sup dist ((t, x), G) dt. The L1 distance beI x∈Dt
tween two graphs of multi-functions is
dL1 (D, G) := max{ExL1 (D, G) , ExL1 (G, D)}, Proposition 5. Suppose there exists a constant K such that for every solution x(·) of (1) with initial condition x0 there exists a solution y(·) with initial condition y0 with Dist(x(·), y(·)) ≤ Kδ and vise versa. Then dL1 (D(F, x0 ), D(F, y0 )) < (K + 1)Kδ. The following theorem is a consequence of Theorem 2 and Proposition 3. Theorem 4. Let δ > 0 be sufficiently small and let F (·), satisfy SH. If F (·) is OSL (with a constant L) then there exists a (nonnegative) constant L such that √ if |x0 − y0 | < δ, then dL1 (D(F, x0 ), D(F, y0 )) < L δ. Proof. Due to Theorem 2 there exists a constant K such that for every solution x(·) of (1) with x(0) = x0 there exists a solution y(·) of (1) with y(0) = y0 and p p
− |x(t) − y(t)| ≤ Kδ on I \ [τi− , τi+ ], τi+ < τi+1 and (τi+ − τi− ) ≤ Kδ. Here i=1
i=1
τi− = min{τi (x(·)), τi (y(·))}. Let s ∈ I and let z ∈ (F , x0 )s . Hence there exists a solution x(·) of (1) such that x(s) = z. Let y(·) be the corresponding solution of (1) with initial p condition y(0) = y0 . If s ∈ I \ [τi− , τi+ ], then |x(s) − y(s)| < Rδ. Assume that i=1
− ) we s ∈ [τix , τiy ] (if τiy < τix the proof is very similar), then for every t ∈ (τiy , τi+1 y have |x(t)−y(t)| < Kδ. Furthermore for every ε > 0 there exists tε ∈ (τi , τiy +)ε) and hence tε − s < Rδ + ε. If R = 2K then dist ((t, z), (F , y0 )(I))) < Rδ.
n t For t ∈ I we denote Rk (t) = E t, , {x0 } with graph D(Rk ). k
Exponential Formula for Impulsive Differential Inclusions
263
As a trivial corollary of Theorem 3 and Theorem 4 we obtain: Theorem 5. Under SH and OSL condition: lim du (D(F ), D(Rk )(I)) = 0.
k→∞
Acknowledgment This work is partially supported by the Hausdorff Research Institute of Mathematics Bonn in the framework of the Trimester Program “Computational Mathematics”.
References 1. Aubin, J.-P.: Impulsive Differential Inclusions and Hybrid Systems: A Viability Approach. Lecture Notes, Univ. Paris (2002) 2. Baier, R., Donchev, T.: Discrete Approximation of Impulsive Systems (submitted) 3. Clarke, F.: Optimization and nonsmooth analysis. John Wiley & Sons, Inc., New York (1983) 4. Deimling, K.: Multivalued Differential Equations. De Gruyter, Berlin (1992) 5. Donchev, T.: Approximation of the Solution Set of Impulsive Systems. In: Lirkov, I., Margenov, S., Wa´sniewski, J. (eds.) LSSC 2007. LNCS, vol. 4818, pp. 309–316. Springer, Heidelberg (2008) 6. Donchev, T.: Impulsive differential inclusions with constrains. EJDE 66, 1–12 (2006) 7. Donchev, T., Farkhi, E., Wolenski, P.: Characterizations of Reachable Sets for a Class of Differential Inclusions. Functional Differential Equations 10, 473–483 (2003) 8. Lakshmikantham, V., Bainov, D., Simeonov, P.: Theory of Impulsive Differential Equations. World Scientific, Singapore (1989) 9. Lempio, F., Veliov, V.: Discrete approximations of differential inclusions. Bayreuther Mathematische Schiften, Heft 54, 149–232 (1998) 10. Pereira, F., Silva, G.: Necessary Conditions of Optimality for Vector-Valued Impulsive Control Problems. System Control Letters 40, 205–215 (2000) 11. Plotnikov, V., Kitanov, N.: On Continuous dependence of solutions of impulsive differential inclusions and impulse control problems. Cybernetics System Analysis 38, 749–758 (2002) 12. Plotnikov, V., Plotnikov, A., Vitiuk, A.: Differential Equations with Multivalued Right-Hand Side. Asymptotical Methods. Astro Print, Odessa (1999) (in Russian) 13. Wolenski, P.: The exponential formula for the reachable set of a Lipschitz differential inclusion. SIAM J. Control Optim. 28, 1148–1161 (1990)
Directional Sensitivity Differentials for Parametric Bang-Bang Control Problems Ursula Felgenhauer Brandenburgische Technische Universit¨ at Cottbus, Institut f¨ ur Angewandte Mathematik u. Wissenschaftliches Rechnen, PF 101344, 03013 Cottbus, Germany
[email protected] http://www.math.tu-cottbus.de/~ felgenh/
Abstract. We consider optimal control problems driven by ordinary differential equations depending on a (vector-valued) parameter. In case that the state equation is linear w.r.t. the control vector function, and the objective functional is of Mayer type, the optimal control is often of bangbang type. The aim of the paper is to consider the structural stability of bang-bang optimal controls with possibly simultaneous switches of two or more components at a time. Besides of the local invariance of the number of switches for each component taken separately, existence of their directional parameter–derivatives will be shown.
1
Introduction
The problem to be analyzed for parameter dependency is the following optimal control problem in Mayer form: (Ph )
J(x, u, h) = β(x(1), h) subject to m x(t) ˙ = f (x(t), h) + ui (t)gi (x(t), h) a.e. in [0, 1], minimize
x(0) = a(h), | ui (t)| ≤ 1,
i=1
i = 1, . . . , m, a.e. in [0, 1] .
(1) (2) (3)
The parameter h is taken from a neighborhood H ⊂ Rp of the origin. The functions x : [0, 1] → Rn and u : [0, 1] → Rm stand for the state and control variables of the problem. By g = g(x, h) we denote the (n, m) matrix with columns gi representing the controlled vector fields. It will be assumed that all data functions are sufficiently smooth (i.e. at least C 2 ) functions. We will assume that, for the reference parameter h0 = 0, the above problem 1 has a local minimizer (x0 , u0 ) ∈ W∞ × L∞ . The local optimality is supposed to hold in strong local sense, i.e. on a certain L∞ neighborhood w.r.t. x0 but arbitrary admissible controls. By Pontryagin’s Maximum Principle, an adjoint vector function μ0 : [0, 1] → Rn exists such that (x0 , u0 , μ0 ) satisfy the canonical equations and the maximum condition. We will further assume, that the control I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 264–271, 2010. c Springer-Verlag Berlin Heidelberg 2010
Directional Sensitivity Differentials
265
function u0 is a bang-bang control, i.e. each component u0j is piecewise continuous and takes only extremal values ±1 on the time interval [0, 1]. Let (x(·, z, h), μ(·, z, h), u(·, z, h)) denote a solution of the following system: x(t) ˙ = f (x(t), h) + g(x(t), h)u(t), T
μ(t) ˙ = − {∇x [f (x, h) + g(x, h) u] (t)} μ(t), uj (t) ∈ Sign μ(t)T gj (x(t), h) , j = 1, . . . , m x(1) = z,
μ(1) = −∇x β(z, h).
(4) (5) (6)
(The function Sign herein stands for the set-valued extended signum defined at zero as Sign(0) = [−1, 1].) In case of bang-bang controls, the above system (4) – (6) together with the initial condition T (z, h) = x(0, z, h) = a(h). (7) can be interpreted as a backward shooting formulation of Pontryagin’s necessary optimality conditions. For h = 0, it has the solution (x0 , μ0 , u0 ) corresponding to z 0 = x0 (1). The present paper has to be seen in a line with current work on stability and optimality investigation of bang-bang control problems when some of the u-components may switch at the same time (simultaneous switch). Sufficient second-order conditions for strong local optimality of bang-bang extremals are given in [1] and [10], [9] (see also references therein) for rather general control problems but with restriction to simple control switches. Related sensitivity results including parameter differentials for simple switching times are obtained in [8]. For multiple control switches, the Lipschitz stability of switching points position was shown in [5] and [6]. Recently, first results on sufficient strong local optimality conditions have been found by [11] for at most double switches, see also [12]. The mentioned work could be combined to proving local existence, structural stability and optimality of solutions for problem (Ph ) in case of double switches of the reference control, see [7]. In this paper, simultaneous switches of more than two control components are allowed. The structural stability result from [7] is generalized to this situation (Lemma 1, section 3). Further, it will be accomplished by construction rules for directional sensitivity differentials for switching points positions under parameter perturbation (Theorem 1, section 5). Notations. Let Rn be the Euclidean space of column vectors with norm | · |, and scalar product written as a matrix product aT b. Superscript T is used for transposition of vectors or matrices. The Lebesgue space of order p of vectorvalued functions on [0, 1] is denoted by Lp (0, 1; Rk ). Wpl (0, 1; Rk ) is the related Sobolev space, and norms are given as · p and · l,p , (1 ≤ p ≤ ∞, l ≥ 1), resp. The symbol ∇x denotes (partial) gradients, and [f, g] = ∇x g f − ∇x f g stands for a Lie bracket of f and g. By convM resp. cl M , the convex hull and closure of a set M are described. For characterizing discontinuities, jump terms s are denoted [v] = v(ts +) − v(ts −) where the index s will become clear from the context.
266
2
U. Felgenhauer
Regular Bang-Bang Controls
Consider an extremal (x, u) of (Ph ) which together with the related costate function μ satisfies (4) – (7). Define σj (t) := μ(t)T gj (x(t), h),
(8)
Σj := {t ∈ [0, 1] : σj (t) = 0}.
(9)
We say the control u is a regular strict bang-bang control if the following conditions are fulfilled: (A1) For each j = 1, . . . , m, the function uj is piecewise constant with uj (t) ∈ {−1, 1} a.e. on [0, 1], the set Σj is finite, and t = 0 or 1 do not belong to Σj . (A2) For j = 1, . . . , m, the function σj is strictly monotone near any ts ∈ Σj . Since u(t) = sign σ(t) under the above assumptions, the function σ is the socalled switching function, and Σ = (Σ1 , . . . , Σm ) the vector of switching times, or the switching structure. A switching point ts ∈ Σj is a simple switching point if only the j-th control component changes its sign at ts . In case that more than one component switch simultaneously we call ts a multiple switching point, and the number of related u-components its multiplicity κ. Assumption (A2) is a condition ensuring that all zeros of σj are regular. While we are interested in solution behavior under parameter perturbation, we should give it a more stable form. Let us reconsider the switching functions (8): for piecewise continuous u, from (4) we see that σ ∈ C 0,1 and the time derivatives are given almost everywhere by σ˙ j = μT [f, gj ] +
m
ui μT [gi , gj ].
(10)
i=1
Thus, σ˙ j may have jump discontinuities at points ts ∈ Σi , i
= j. As long as all switching points are simple, each σj is continuously differentiable in ts ∈ Σj . If, however, u has multiple switches then the σ components may be broken at their = 0 at some ts ∈ Σi ∩ Σj . Under perturbation, a zero points whenever [gi , gj ]
multiple switch will generically bifurcate to several (e.g. simple) switching points of a-priori unknown order. Thus, one has to make sure that for each possible local re-ordering of switches the perturbed σ-components remain to be strictly monotone. This can be done with the following assumption (A2’) If ts is κ-mutiple, and j is such that Σj ts then, at t = ts , the terms m ui (ts −) μT [gi , gj ] (t) + νi [ui ]s μT [gi , gj ] (t) σ˙ jν (t) = μT [f, gj ] (t) + i=1
i: ts ∈Σi
are nonzero, and their sign is independent of the binary vector ν ∈ {0, 1}κ.
Directional Sensitivity Differentials
267
Throughout the paper it is supposed that, at h = 0, the reference solution (x0 , u0 ) of (P0 ) and related μ0 satisfy (A1) and (A2’). Let us finish with some definition: for t close to ts (of multiplicity κ), set t σjν (t) = σj (ts ) + σ˙ jν (τ ) dτ ∀ ν ∈ {0, 1}κ, ∀ j : Σj ts .
(11)
ts
Notice that the particular choice ν = e = (1, 1, . . . , 1) yields σ˙ je (ts ) = σ˙ j (ts +).
3
Strong Stability of Bang-Bang Structure
Under assumptions (A1), (A2’) on the solution of (P0 ), the following stability result holds true: Lemma 1. Assume that, for some h close to h0 = 0, the triple (xh , uh , μh ) with xh (1) = z h is a solution of the shooting system (4) – (7). There exists a positive bound δ such that, if | h − h0 | + xh − x0 ∞ < δ, (12) then each uhi satisfies assumptions (A1), (A2) and (A2’). Moreover, each function uhi has the same number and type of switches as u0i , and |Σ h − Σ 0 | =O(δ). In [7], this result was obtained for the case of at most double switches by analyzing the maximized Hamiltonian flow. An earlier result from [5] had been additionally restricted to L1 neighborhoods w.r.t. u0 . In this sense, the given Lemma strengthens former stability statements and puts it into correspondence to the idea of strong local optimality, cf. [1], [11], or [12]. Proof. Let the points of j Σj0 = {θ1 , . . . , θM } be ordered monotonically decreasing, i.e. θk > θl for k < l, and denote θ0 = 1, θM+1 = 0. Suppose ρ to be small enough so that the balls Bk of radius ρ with centres θk , k = 0, . . . , M + 1, do not intersect. The connecting pieces (θk+1 + ρ, θk − ρ) denote by Dk for k = 1, . . . , M − 1, and D0= (θ1 + ρ, 1), DM = (0, θM − ρ) resp. Notice that none of the σj0 vanishes in Dk . Moreover, if ρ is sufficiently small then, almost = 0 for all j such that θk ∈ Σj0 . everywhere on Bk , we have σ˙ j
The proof makes use of a backward inspection of solution pieces from (4) – (6). If δ is sufficiently small then, by (8), |z h − z 0 | < δ yields that each σjh (1) is of the same sign as σj0 (1), and this remains to be true in some neighborhood of t = 1 independently of the value taken by uh . Remembering the sign condition from (5), we obtain uh ≡ u0 on a certain interval [1 − Δ t, 1], and |xh (t) − x0 (t)| + |μh (t) − μ0 (t)| + |σ h (t) − σ 0 (t)| = O(δ).
(13)
Thus, we can repeat the argumentation and, possibly decreasing δ, by continuity deduce that uh ≡ u0 holds true for all t ∈ D0 . Next let us consider Bk (beginning from k = 1): if θk is a simple switch of u0j then σj0 is differentiable on Bk and has the regular zero θk . By the Implicit
268
U. Felgenhauer
Function Theorem, the same holds for σ h , so that it has a regular zero θkh ∈ Bk (with |θkh − θk | =O(δ)) whereas all other σ-components do not change their sign. Thus, on Bk we have u0 − uh 1 =O(δ), and again the estimate (13) holds on Bk . In particular, we get uh (θk − ρ) = u0 (θk − ρ). If more than one control components switch at θk simultaneously, consider the following prediction-correction procedure: solve backwards the state-adjoint equations from system (4) with u ≡ u0 (θk + ρ), and find σ = σ ˜ h from (8). For h e h e ˙ the solutions, |˜ σ (t) − σ (t)| + |σ ˜ (t) − σ˙ (t)| = O(δ) (cf. (11)), so that each compononent σ ˜jh such that θk ∈ Σj0 has a (locally unique and regular) zero in h Bk . Find θk,1 = max {t : ∃ j such that σ ˜jh (t) = 0}. At this point, some of the h h ˜ih (θk,1 ) = 0 and νi = 1 otherwise. u -components switch, and we set νi = 0 if σ ν 0 h Resolve the system with u = u (θk −) + j νj [u0j ]s for t ≤ θk,1 : the correh ν h sponding σ ˜ then is close to σ from (11) on (θk − ρ, θk,1 ), and again, its continued j-th components switch on Bk . The iteration will be repeated until, for h each j such that θk ∈ Σj0 , a new switching point θk,l is found. Assumption (A2’) h − θk | =O(δ). ensures that these points are unique for each related j, and |θk,l h 0 At t = θk − ρ, again we have u = u , so that the argumentation can be equally applied for the next intervals Dk , Bk+1 etc. until t = 0 is reached.
4
Second-Order Condition
In order to formulate appropriate second-order coercivity conditions for (Ph ), we follow [1] and define certain auxiliary finite-dimensional problem w.r.t. switching points variation. For each j, let Σj0 = {t0js : 1 ≤ s ≤ l(j)} be monotonically ordered. Consider vectors Σ = (tjs : j = 1, . . . , m, s = 1, . . . , l(j) ) of the same length L as Σ 0 , and denote by DΣ the neighborhood of Σ 0 ∈ RL where the following condition holds: 0 < t0js < t0ir < 1 ⇒ 0 < tjs < tir < 1. (14) With tj0 = 0, tj,l(j)+1 = 1 for all j, define uj (t, Σ) ≡ (−1)l(j)−s u0j (1)
for t ∈ (tjs , tj,s+1 ).
(15)
The solution of (1) – (2) for u = u(·, Σ) will be denoted x(·, Σ, h). If (xh , uh ) is a local minimizer of (Ph ) satisfying (A1), (A2’) and (12) then, by Lemma 1, the related switching vector Σ h will belong to DΣ and in particular, is a local solution of (OPh )
min φh (Σ) = β (x(1, Σ, h), h)
w.r.t. Σ ∈ DΣ .
As explained in [5], in case ofsimultaneous control switches, the function φh is 0 piecewise C 2 on DΣ . If t0s ∈ m j=1 Σj has multiplicity κs then, locally, we find several bifurcation patterns into κs simple switching points of different order which can be enumerated by qs = 1, . . . , (κs )! . Assembling qs with κs > 1
Directional Sensitivity Differentials
269
into an multi-index q, we can define open subsets Dq of DΣ where the order of switches will be invariant w.r.t. perturbation. On cl Dq set ∇q (∇φh )|Σ =
lim
Σ ∈Dq ,Σ →Σ
∇2Σ φh (Σ ) .
(16)
It is an obvious necessary optimality condition for (P0 ) that all ∇q (∇φ0 ) should be positive semidefinite at Σ = Σ 0 . We will assume the following strong generalized second-order condition to hold for h = 0 at Σ 0 : (A3) ∃ c > 0: v T ∇q (∇φh )|Σ v ≥ c | v |2 that qs ∈ {1, . . . , (κs )!}.
for all v ∈ RL and each q such
It has been shown in [7] for at most double switches that, assuming (A1) – (A3) to be satisfyed for h = 0 and (x0 , u0 , μ0 ), is sufficient to prove existence and uniqueness of strong local optimizers for (Ph ) with h near h0 = 0. The generalization to simultaneous switch of more than two components is yet open.
5
Directional Sensitivity Differentials
Consider the following extended backward shooting approach introduced in [6]: for given (Σ, z, h) close to (Σ 0 , z 0 , 0) define by (15) a control u = u(Σ). For u(Σ) − u0 1 being sufficiently small, the system x(t) ˙ = f (x(t), h) + g(x(t), h) u(t, Σ),
x(1) = z, T
μ(t) ˙ = − {∇x [f (x, h) + g(x, h) u] (t)} μ(t), μ(1) = −∇x β(z, h), 1 on [0, 1]. has an unique solution with x = x(·, Σ, z, h), μ = μ(·, Σ, z, h) ∈ W∞ T As in (8), set σ(·, Σ, z, h) = g(x(·, Σ, z, h), h) μ(·, Σ, z, h). If the solution components satisfy (4)–(7) then
V (Σ, z, h) = x(0, Σ, z, h) = a(h), W (Σ, z, h) = Γ · σ(·, Σ, z, h)|Σ
(17) = 0.
(18)
In the last equation, for j = 1, . . . , m and each α with tα ∈ Σj it is required that α Wα (Σ, z, h) = u0j σj (tα , Σ, z, h) = 0, i.e. Γ stands for the diagonal matrix with control jump entries. Near (Σ 0 , z 0 , 0), V is differentiable and ∇z V is regular (cf. [6]) so that V = 0 locally defines a C 1 map z = z˜(Σ, h). Inserting it into (18), in accordance to [8] and [5] we obtain Z(Σ, h) := W (Σ, z˜(Σ, h), h) = ∇Σ φh (Σ) = 0. (19) Consider the case when Σj contains a number d of multiple switches ti with multiplicities κi . By (16), from within each Dq × H, the derivatives ∇qΣ Z = ∇qΣ (∇Σ φh ) = ∇qΣ W − ∇z W (∇z V )
−1
∇Σ V
270
U. Felgenhauer
exist and are positive definite at (Σ 0 , 0), see assumption (A3). Thus, Z(Σ, h) = ˜ ˜ 0 has the locally unique and Lipschitz continuous solution Σ = Σ(h), and Σ(0) = 0 Σ . Now let us define M q = ∇qΣ (∇Σ φh )|(Σ 0 ,0) ,
P = ∇h (∇Σ φh )|(Σ 0 ,0) ,
and consider the following piecewise quadratic function of s ∈ RL depending on the parameter h ∈ Rp : 1 T q s M s + sT P h if s + Σ 0 ∈ cl Dq , 2 q ∈ Q = {(q1 , . . . , qd ) : qi ∈ {1, . . . , (κi )!} } .
ψh (s) =
(20) (21)
For fixed h, the function is C 1,1 and strictly convex w.r.t s (the proof of this fact follows [6], Appendix 1, but has to be omitted here). Its minima determine ˜ in the following sense: directional sensitivity differentials of Σ Theorem 1. For given Rp r
= 0, let s¯ = s¯(r) denote the solution of min ψr (s)
s.t. s + Σ 0 ∈ DΣ .
(22)
The solution is represented by s¯ = −(M q )−1 P r if and only if s¯ + Σ 0 ∈ cl Dq . ˜ It yields directional derivative information for Σ = Σ(h) from (19) in the form
˜ ∂h Σ(0), r = s¯ = −(M q )−1 P · r. (23) Proof. The definition of ψ and the conic nature of the sets Dq ensure that, at each point s, the function has a generalized Hessian (in Clarke’s sense) such that ∂s (∇s ψ) ⊆ convq∈Q {M q } where Q is given in (21). Since all matrices therein are positive definite, problem (22) has an unique solution s¯ which is characterized by s) ⊆ convq∈Q {M q s¯} + P r. 0 ∈ ∂s ψ(¯ (cf. [14], Theorem 13.24). In case that s¯ ∈ cl Dq , it can be approached by a minimizing sequence from inside Dq so that s¯ = −(M q )−1 P r. ˜ we consider limit terms In order to obtain directional derivatives of Σ ˜ r) − Σ 0 ) γ −1 (Σ(γ
for
γ 0.
(24)
Assume first that ∃ γ¯ , q¯ such that ˜ r) ∈ Dq¯ ∀ γ : 0 < γ < γ¯ . Σ(γ
˜ r = −(M q¯)−1 P r. Then the limit exists and is expressed by ∂h Σ(0),
(25)
Suppose that assumption (25) is not fulfilled, i.e. there exist q1 , . . . , qN ∈ Q such that, for any given n 0, ∀ n ∃ γnk < n :
˜ k r) ∈ Dqk , Σ(γ n
k = 1, . . . , N.
Directional Sensitivity Differentials
271
˜ each subsequence {(γ k )−1 (Σ(γ k r)−Σ 0 )}∞ Due to the Lipschitz continuity of Σ, n n n=1 is bounded and thus has an accumulation point sk ∈ cl Dqk so that M qk sk +P r = 0. The last relation says that the points sk are stationary points of the convex objective from (22) and, consequently, all coincide with the minimizer s¯. Hence the limit in (24) exists and is given by (23). For completeness notice that the representations (23) for s¯ with different qk coincide, too. Acknowlegdement. The results presented in this paper could not have been achieved without the stimulating contact to and cooperation with Gianna Stefani and Laura Poggiolini (Florence) who the author is deeply indepted to. The work has partly been supported by CNR INDAM grant no. 43169224.
References 1. Agrachev, A., Stefani, G., Zezza, P.L.: Strong optimality for a bang-bang trajectory. SIAM J. Control Optim. 41, 991–1014 (2002) 2. Arutyunov, A.V., Avakov, E.R., Izmailov, A.F.: Directional regularity and metric regularity. SIAM J. Optim. 18(3), 810–833 (2007) 3. Clarke, F.: Optimization and nonsmooth analysis. Wiley Inc., New York (1983) 4. Felgenhauer, U.: On stability of bang-bang type controls. SIAM J. Control Optim. 41(6), 1843–1867 (2003) 5. Felgenhauer, U.: Lipschitz stability of broken extremals in bang-bang control problems. In: Lirkov, I., Margenov, S., Wa´sniewski, J. (eds.) LSSC 2007. LNCS, vol. 4818, pp. 306–314. Springer, Heidelberg (2008) 6. Felgenhauer, U.: The shooting approach in analyzing bang-bang extremals with simultaneous control switches. Control & Cybernetics 37(2), 307–327 (2008) 7. Felgenhauer, U., Poggiolini, L., Stefani, G.: Optimality and stability result for bang-bang optimal controls with simple and double switch behavior. Control & Cybernetics 38(4) (2009) (in print) 8. Kim, J.R., Maurer, H.: Sensitivity analysis of optimal control problems with bangbang controls. In: 42nd IEEE Conference on Decision and Control, Hawaii, vol. 4, pp. 3281–3286 (2003) 9. Maurer, H., Osmolovskii, N.P.: Equivalence of second-order optimality conditions for bang-bang control problems. Part I: Main result. Control & Cybernetics 34(3), 927–950 (2005); Part II: Proofs, variational derivatives and representations, Control & Cybernetics 36(1), 5–45 (2007) 10. Milyutin, A.A., Osmolovskii, N.P.: Calculus of variations and optimal control. Amer. Mathem. Soc., Providence (1998) 11. Poggiolini, L., Spadini, M.: Strong local optimality for a bang–bang trajectory in a Mayer problem (to appear) 12. Poggiolini, L., Stefani, G.: State-local optimality of a bang-bang trajectory: a Hamiltonian approach. Systems Control Lett. 53(3-4), 269–279 (2004) 13. Poggiolini, L., Stefani, G.: Sufficient optimality conditions for a bang-bang trajectory. In: 45th IEEE Conference on Decision and Control, San Diego (USA) (December 2006) 14. Rockafellar, R.T., Wets, R.J.-B.: Variational analysis. Springer, Berlin (1998) 15. Sarychev, A.V.: First- and second-order sufficient optimality conditions for bangbang controls. SIAM J. Control Optim. 35(1), 315–340 (1997)
Estimates of Trajectory Tubes of Uncertain Nonlinear Control Systems Tatiana F. Filippova Institute of Mathematics and Mechanics, Russian Academy of Sciences, 16 S. Kovalevskaya str., GSP-384, 620219 Ekaterinburg, Russia
[email protected] http://www.imm.uran.ru
Abstract. The paper is devoted to state estimation problems for nonlinear dynamic control systems with states being compact sets. The studies are motivated by the theory of dynamical systems with unknown but bounded uncertainties without their statistical description. The trajectory tubes of differential inclusions are introduces as the set-valued analogies of the classical isolated trajectories of uncertain dynamical systems. Applying results related to discrete-time versions of the funnel equations and techniques of ellipsoidal estimation theory developed for linear control systems we present approaches that allow to find estimates for such set-valued states of uncertain nonlinear control systems.
1
Introduction
In many applied problems the evolution of a dynamical system depends not only on the current system states but also on uncertain disturbances or errors in modelling. There are many publications devoted to different aspects of treatment of uncertain dynamical systems (e.g., [5,11,12,13,14,15,17]). The model of uncertainty considered here is deterministic, with set-membership description of uncertain items which are taken to be unknown with prescribed given bounds. We consider problems of control and state estimation for a dynamical control system x(t) ˙ = A(t)x(t) + f (x(t)) + G(t)u(t), x(t) ∈ Rn , t0 ≤ t ≤ T,
(1)
with unknown but bounded initial condition x(t0 ) = x0 , x0 ∈ X0 , X0 ⊂ Rn ,
(2)
u(t) ∈ U, U ⊂ Rm , t ∈ [t0 , T ].
(3)
Here matrices A(t) and G(t) (of dimensions n × n and n × m, respectively) are assumed to be continuous on t ∈ [t0 , T ], X0 and U are compact and convex. The nonlinear n-vector function f (x) in (1) is assumed to be of quadratic type f (x) = (f1 (x), . . . , fn (x)), fi (x) = x Bi x, i = 1, . . . , n, I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 272–279, 2010. c Springer-Verlag Berlin Heidelberg 2010
(4)
Estimates of Trajectory Tubes of Uncertain Nonlinear Control Systems
273
where Bi is a constant n × n - matrix (i = 1, . . . , n). Consider the following differential inclusion [6] related to (1)–(3) x(t) ˙ ∈ A(t)x(t) + f (x(t)) + P (t) , t ∈ [t0 , T ],
(5)
where P (t) = G(t)U. Let absolutely continuous function x(t) = x(t, t0 , x0 ) be a solution to (5) with initial state x0 satisfying (2). The differential system (1)–(3) (or equivalently, (5)–(2)) is studied here in the framework of the theory of uncertain dynamical systems (differential inclusions) through the techniques of trajectory tubes X(·, t0 , X0 ) = {x(·) = x(·, t0 , x0 ) | x0 ∈ X0 } of solutions to (1)–(3) with their t-cross-sections X(t) = X(t, t0 , X0 ) being the reachable sets (the informational sets) at instant t for control system (1)–(3). Basing on well-known results of ellipsoidal calculus [14,2,3] developed for linear uncertain systems we present the modified state estimation approaches which use the special structure of the control system (1)–(4). In this paper we continue researches initiated in [8,9].
2 2.1
Preliminaries Basic Notations
In this section we introduce the following basic notations. Let Rn be the n– dimensional Euclidean space and x y be the usual inner product of x, y ∈ Rn with prime as a transpose, x = (x x)1/2 . We denote as B(a, r) the ball in Rn , B(a, r) = {x ∈ Rn : x − a ≤ r}, I is the identity n × n-matrix. Denote by E(a, Q) the ellipsoid in Rn , E(a, Q) = {x ∈ Rn : (Q−1 (x − a), (x − a)) ≤ 1} with center a ∈ Rn and symmetric positive definite n × n–matrix Q. Let h(A, B) = max{h+ (A, B), h− (A, B)}, be the Hausdorff distance for A, B ⊂ Rn , with h+ (A, B) and h− (A, B) being the Hausdorff semidistances between A and B, h+ (A, B) = sup{d(x, B) | x ∈ A}, h− (A, B) = h+ (B, A), d(x, A) = inf{x − y | y ∈ A}. One of the approaches that we will discuss here is related to evolution equations of the funnel type [7,13,16,19]. Note first that we will consider the Caratheodory–type solutions x(·) for (5)–(2)), i.e. absolutely continuous functions x(t) which satisfy the inclusion (5) for a. e. t ∈ [t0 , T ]. Assume that all solutions {x(t) = x(t, t0 , x0 ) | x0 ∈ X0 } are extendable up to the instant T that is possible under some additional conditions ([6], §7, Theorem 2). The precise value of T depending on studied system data is given in [8]. Let us consider the particular case of the funnel equation related to (5)–(2) {x + σ(A(t)x + f (x)) + σP (t)}) = 0, t ∈ [t0 , T ], (6) lim σ −1 h(X(t + σ), σ→+0
x∈X(t)
X(t0 ) = X0 .
(7)
Under above mentioned assumptions the following theorem is true (details may be found in [7,13,16,19]).
274
T.F. Filippova
Theorem 1. The nonempty compact-valued function X(t) = X(t, t0 , X0 ) is the unique solution to the evolution equation (6)-(7). Other versions of funnel equation (6) are known written in terms of semidistance h+ instead of h [14]. The solution to the h+ -equations may be not unique and the ”maximal” one (with respect to inclusion) is studied in this case. Mention here also the second order analogies of funnel equations for differential inclusions and for control systems based on ideas of Runge-Kutta scheme [5,17,18]. Discrete approximations for differential inclusions through a set-valued Euler’s method were developed in [1,4,10]. Funnel equations for differential inclusions with state constraints were studied in [13], the analogies of funnel equations for impulsive control systems were given in [7]. 2.2
Trajectory Tubes in Uncertain Systems with Quadratic Nonlinearity
Computer simulations related to modelling in nonlinear problems of control under uncertainty and based on the funnel equations require in common difficult and long calculations with additional quantization in the state space. Nevertheless in some simpler cases it is possible to find reachable sets X(t; t0 , X0 ) (more precisely, to find their –approximations in Hausdorff metrics) basing on the Theorem 1, related examples are given in [8,9]. In [8,9] we presented also new techniques of constructing the external and internal estimates of trajectory tubes X(·, t0 , X0 ) based on the combination of ellipsoidal calculus [2,3,14] and the techniques of evolution funnel equations. We considered there the control system x˙ = Ax + f˜(x)d, x0 ∈ X0 , t0 ≤ t ≤ T,
(8)
where x ∈ Rn , x ≤ K, d is a given n-vector and a scalar function f˜(x) has a form f˜(x) = x Bx with a symmetric and positive definite matrix B. The following example illustrates the estimating results based on approaches from [8,9]. Example 1. Consider the following control system
x˙ 1 = x1 + x21 + x22 + u1 , , 0 ≤ t ≤ T. x˙ 2 = − x2 + u2 ,
(9)
Here t0 = 0, T = 0.3, X0 = B(0, 1), P (t) = P = B(0, 1). External and internal ellipsoidal estimates of t the rajectory tube X(t) are shown at Fig. 1. In this paper we extend this approach to the case when right-hand sides of the system (5)–(2) contain some quadratic forms with different symmetric positive definite matrices.
Estimates of Trajectory Tubes of Uncertain Nonlinear Control Systems
275
2.5
E(a+12,Q+12)
2 1.5
X0
1
x2
0.5 0 −0.5 −1 −1.5 −2
E(a−12,Q−12)
−2.5 0
X(t)
0.2 t
0.4
−1.5
−1
−0.5
0
0.5
1
1.5
2
2.5
3
x1
Fig. 1. Trajectory tube X(t) and its external and internal ellipsoidal estimates E(a+ (t), Q+ (t)), E(a− (t), Q− (t))
3 3.1
Results Differential System with Uncertainty in Initial States
Consider the following system x˙ = Ax + f (1) (x)d(1) + f (2) (x)d(2) , x0 ∈ X0 , t0 ≤ t ≤ T,
(10)
where x ∈ Rn , x ≤ K, d(1) and d(2) are n-vectors and f (1) , f (2) are scalar functions, f (1) (x) = x B (1) x, f (2) (x) = x B (2) x, with symmetric and positive definite matrices B (1) , B (2) . We assume also that (1) (2) di = 0 for i = k + 1, . . . , n and dj = 0 for j = 1, . . . , k where k (1 ≤ k ≤ n) is fixed. This assumption means that the first k equations of the system (10) contain (1) only the nonlinear function f (1) (x) (with some constant coefficients di ) while (2) f (x) is included only in the equations with numbers k + 1, . . . , n. We will assume further that X0 in (10) is an ellipsoid, X0 = E(a, Q), with a symmetric and positive definite matrix Q and with a center a. We will need some auxiliary results. Lemma 1. The following inclusion is true X0 ⊆ E(a, k12 (B (1) )−1 ) E(a, k22 (B (2) )−1 )
(11)
where ki2 is the maximal eigenvalue of the matrix (B (i) )1/2 Q(B (i) )1/2 (i = 1, 2).
276
T.F. Filippova
Proof. The proof follows directly from the properties of quadratic forms and from the inclusions E(a, Q) ⊆ E(a, k12 (B (1) )−1 ), E(a, Q) ⊆ E(a, k22 (B (2) )−1 ), which should be fulfilled with the smallest possible values of k1 ≥ 0 and k2 ≥ 0. Lemma 2. The following equalities hold true max
z B (1) z≤k12
z B (2) z = k12 λ212 ,
max
z B (2) z≤k22
z B (1) z = k22 λ221 ,
(12)
where λ212 and λ221 are maximal eigenvalues of matrices (B (1) )−1/2 B (2) (B (1) )−1/2 and (B (2) )−1/2 B (1) (B (2) )−1/2 respectively. Proof. The formulas follow from direct computations of maxomal values in (12). Theorem 2. For all σ > 0 and for X(t0 + σ) = X(t0 + σ, t0 , X0 ) we have the following upper estimate X(t0 + σ) ⊆ E(a(1) (σ), Q(1) (σ)) E(a(2) (σ), Q(2) (σ)) + o(σ)B(0, 1), (13) where σ −1 o(σ) → 0 when σ → +0 and a(1) (σ) = a(σ) + σk12 λ212 d(2) , a(2) (σ) = a(σ) + σk22 λ221 d(1) ,
(14)
a(σ) = (I + σA)a + σa B (1) ad(1) + σa B (2) ad(2) ,
(15)
2 (1) −1 ) (I+σR) +(p1 +1)σ 2 ||d(2) ||2 k14 λ412 ·I, (16) Q(1) (σ) = (p−1 1 +1)(I+σR)k1 (B 2 (2) −1 Q(2) (σ) = (p−1 ) (I+σR) +(p2 +1)σ 2 ||d(1) ||2 k24 λ421 ·I, (17) 2 +1)(I+σR)k2 (B
R = A + 2d(1) a B (1) + 2d(2) a B (2)
(18)
and p1 , p2 are the unique positive solutions of related algebraic equations n i=1
n n n 1 1 = = , p1 + αi p1 (p1 + 1) p + β p (p i 2 2 + 1) i=1 2
(19)
with αi , βi ≥ 0 (i = 1, ..., n) being the roots of the following equations det((I + σR)k12 (B (1) )−1 (I + σR) − ασ 2 ||d(2) ||2 k14 λ412 · I) = 0,
(20)
det((I + σR)k22 (B (2) )−1 (I + σR) − βσ 2 ||d(1) ||2 k24 λ421 · I) = 0.
(21)
Proof.
From Theorem 1 we have {x + σ(Ax + f (1) (x)d(1) + f (2) (x)d(2) )}) = 0, lim σ −1 h(X(t0 + σ),
σ→+0
x∈X0
Estimates of Trajectory Tubes of Uncertain Nonlinear Control Systems
therefore X(t0 + σ) ⊆
277
{x + σ(Ax + f (1) (x)d(1) + f (2) (x)d(2) )} + o(σ)B(0, 1),
x∈X0
From Lemma 1 we have the inclusion X(t0 + σ) ⊆ {x + σ(Ax + f (1) (x)d(1) + f (2) (x)d(2) )| x ∈ E(a, k12 (B (1) )−1 ) ∩ E(a, k22 (B (2) )−1 )} + o(σ)B(0, 1). Obviously we have X(t0 + σ) ⊆ ( {x + σ(Ax + f (1) (x)d(1) + f (2) (x)d(2) )|x ∈ E(a, k12 (B (1) )−1 )}) ( {x+ σ(Ax+ f (1) (x)d(1) + f (2) (x)d(2) )|x ∈ E(a, k1 22 (B (2) )−1 )})+ o(σ)B(0, 1). Using the ideas of [8,9], after related calculations we come to the following inclusions: {x + σ(Ax + f (1) (x)d(1) + f (2) (x)d(2) )|x ∈ E(a, k12 (B (1) )−1 )} ⊆ E(a(1) (σ), (I + σR)k12 (B (1) )−1 (I + σR) ) + σ||d(2) ||k12 λ212 B(0, 1), and
(22)
{x + σ(Ax + f (1) (x)d(1) + f (2) (x)d(2) )|x ∈ E(a, k22 (B (2) )−1 )} ⊆ E(a(2) (σ), (I + σR)k22 (B (2) )−1 (I + σR) ) + σ||d(1) ||k22 λ221 B(0, 1).
(23)
We find now the external estimates of the sums of ellipsoids in the right-hand parts of the inclusions (22)-(23) using the results of [2], after that we come finally to the main formula (13). We may formulate now the following scheme that gives the external estimate of trajectory tube X(t) of the system (10) with given accuracy. Algorithm 1. Subdivide the time segment [t0 , T ] into subsegments [ti , ti+1 ] where ti = t0 + ih (i = 1, . . . , m), h = (T − t0 )/m, tm = T . – Given X0 = E(a, Q), take σ = h and define ellipsoids E(a(1) (σ), Q(1) (σ)) and E(a(2) (σ), Q(2) (σ)) from Theorem 2. – Find the smallest (with respect to some criterion [2,14]) ellipsoid E(a1 , Q1 ) which contains the intersection E(a(1) (σ), Q(1) (σ)) E(a(2) (σ), Q(2) (σ)) ⊆ E(a1 , Q1 ). – Consider the system on the next subsegment [t1 , t2 ] with E(a1 , Q1 ) as the initial ellipsoid at instant t1 . – Next steps continue iterations 1-3. At the end of the process we will get the external estimate E(a(t), Q(t)) of the tube X(t) with accuracy tending to zero when m → ∞.
278
3.2
T.F. Filippova
Control System under Uncertainty
Consider the following control system in the form of differential inclusion (1)–(3) x˙ ∈ Ax + f (1) (x)d(1) + f (2) (x)d(2) + P, x0 ∈ X0 = E(a, Q), t0 ≤ t ≤ T, (24) with all previous assumptions being valid. We assume also that P is an ellipsoid, P = E(g, G), with a symmetric and positive definite matrix G and with a center g. In this case the estimate for X(t0 + σ) (the analogy of the formula (13)) takes the form X(t0 + σ) ⊆ E(a(1) (σ), Q(1) (σ)) ∩ E(a(2) (σ), Q(2) (σ)) + σE(g, G) + o(σ)B(0, 1), (25) with all parameters defined in Theorem 2. We should modify now the previous scheme (Algorithm 1) in order to formulate a new procedure of external estimating of trajectory tube X(t) of the system (24). Algorithm 2. Subdivide the time segment [t0 , T ] into subsegments [ti , ti+1 ] where ti = t0 + ih (i = 1, . . . , m), h = (T − t0 )/m, tm = T . – Given X0 = E(a, Q), take σ = h and define ellipsoids E(a(1) (σ), Q(1) (σ)) and E(a(2) (σ), Q(2) (σ)) from Theorem 2. – Find the smallest (with respect to some criterion [2,14]) ellipsoid E(a∗ , Q∗ ) which contains the intersection: E(a(1) (σ), Q(1) (σ)) E(a(2) (σ), Q(2) (σ)) ⊆ E(a∗ , Q∗ ). – Find the ellipsoid E(a1 , Q1 ) which is the upper estimate of the sum [2,14] of two ellipsoids, E(a∗ , Q∗ ) and σE(g, G): E(a∗ , Q∗ ) + σE(g, G) ⊆ E(a1 , Q1 ). – Consider the system on the next subsegment [t1 , t2 ] with E(a1 , Q1 ) as the initial ellipsoid at instant t1 . – Next steps continue iterations 1-3. At the end of the process we will get the external estimate E(a(t), Q(t)) of the tube X(t) with accuracy tending to zero when m → ∞.
4
Conclusion
The paper deals with the problems of state estimation for a dynamical control system described by differential inclusions with unknown but bounded initial state. The solution to the differential system is studied through the techniques of trajectory tubes with their cross-sections X(t) being the reachable sets at instant t to control system. Basing on the results of ellipsoidal calculus developed for linear uncertain systems we present the modified state estimation approaches which use the special nonlinear structure of the control system and simplify calculations.
Estimates of Trajectory Tubes of Uncertain Nonlinear Control Systems
279
Acknowledgments. The research was supported by the Russian Foundation for Basic Research (RFBR) under Projects 06-01-00483, 09-01-00223.
References 1. Chahma, I.A.: Set-valued discrete approximation of state- constrained differential inclusions. Bayreuth. Math. Schr. 67, 3–162 (2003) 2. Chernousko, F.L.: State Estimation for Dynamic Systems. Nauka, Moscow (1988) 3. Chernousko, F.L., Ovseevich, A.I.: Properties of the optimal ellipsoids approximating the reachable sets of uncertain systems. J. Optimization Theory Appl. 120(2), 223–246 (2004) 4. Dontchev, A.L., Farkhi, E.M.: Error estimates for discretized differential inclusions. Computing 41(4), 349–358 (1989) 5. Dontchev, A.L., Lempio, F.: Difference methods for differential inclusions: a survey. SIAM Review 34, 263–294 (1992) 6. Filippov, A.F.: Differential Equations with Discontinuous Right-hand Side. Nauka, Moscow (1985) 7. Filippova, T.F.: Sensitivity Problems for Impulsive Differential Inclusions. In: Proc. of the 6th WSEAS Conference on Applied Mathematics, Corfu, Greece (2004) 8. Filippova, T.F., Berezina, E.V.: On State Estimation Approaches for Uncertain Dynamical Systems with Quadratic Nonlinearity: Theory and Computer Simulations. In: Lirkov, I., Margenov, S., Wa´sniewski, J. (eds.) LSSC 2007. LNCS, vol. 4818, pp. 326–333. Springer, Heidelberg (2008) 9. Filippova, T.F., Berezina, E.V.: Trajectory Tubes of Dynamical Control Systems with Quadratic Nonlinearity: Estimation Approaches. In: Proc. of the Int. Conf. Differential Equations and Topology, Moscow, Russia, pp. 246–247 (2008) 10. H¨ ackl, G.: Reachable sets, control sets and their computation. Augsburger Mathematisch-Naturwissenschaftliche Schriften 7. PhD Thesis, University of Augsburg, Augsburg (1996) 11. Krasovskii, N.N., Subbotin, A.I.: Positional Differential Games. Nauka, Moscow (1974) 12. Kurzhanski, A.B.: Control and Observation under Conditions of Uncertainty. Nauka, Moscow (1977) 13. Kurzhanski, A.B., Filippova, T.F.: On the theory of trajectory tubes — a mathematical formalism for uncertain dynamics, viability and control. In: Kurzhanski, A.B. (ed.) Advances in Nonlinear Dynamics and Control: a Report from Russia. Progress in Systems and Control Theory, vol. 17, pp. 122–188. Birkhauser, Boston (1993) 14. Kurzhanski, A.B., Valyi, I.: Ellipsoidal Calculus for Estimation and Control. Birkhauser, Boston (1997) 15. Kurzhanski, A.B., Veliov, V.M. (eds.): Set-valued Analysis and Differential Inclusions. Progress in Systems and Control Theory, vol. 16. Birkhauser, Boston (1990) 16. Panasyuk, A.I.: Equations of attainable set dynamics. Part 1: Integral funnel equations. J. Optimiz. Theory Appl. 64(2), 349–366 (1990) 17. Veliov, V.M.: Second order discrete approximations to strongly convex differential inclusions. Systems and Control Letters 13, 263–269 (1989) 18. Veliov, V.: Second-order discrete approximation to linear differential inclusions. SIAM J. Numer. Anal. 29(2), 439–451 (1992) 19. Wolenski, P.R.: The exponential formula for the reachable set of a Lipschitz differential inclusion. SIAM J. Control Optimization 28(5), 1148–1161 (1990)
Asymptotics for Singularly Perturbed Reachable Sets Elena Goncharova1 and Alexander Ovseevich2 1
Institute of System Dynamics & Control Theory, Irkutsk, Russia
[email protected] 2 Institute for Problems in Mechanics, Moscow, Russia
[email protected]
Abstract. We study, in the spirit of [1], reachable sets for singularly perturbed linear control systems. The fast component of the phase vector is assumed to be governed by a strictly stable linear system. It is shown in loc.cit. that the reachable sets converge as the small parameter ε tends to 0, and the rate of convergence is O(εα ), where 0 < α < 1 is arbitrary. In fact, the said rate of convergence is ε log 1/ε. Under an extra smoothness assumption we find the coefficient of ε log 1/ε in the asymptotics of the support function of the reachable set.
1
Problem Statement
Consider the following singularly perturbed linear dynamic system x˙ = Ax + By + F u, εy˙ = Cx + Dy + Gu, u ∈ U,
(1)
where ε > 0 is a small parameter, a phase vector z = (x, y) consists of the “slow” component x ∈ X, and the “fast” component y ∈ Y . The matrices A, . . . , G, and convex compacts U are smooth functions of time t, and the parameter ε. The latter means that the support function h = HU of U depends on t, ε smoothly. The matrix D is assumed to be asymptotically stable for any t, and ε ≥ 0. An admissible trajectory of the system is an absolutely continuous solution to (1). We fix an interval of time [0, T ], and study the reachable set Dε (T ) to system (1) as ε → 0. Recall that the reachable set D(T ; τ, M ) of a control dynamic system is a set of the ends at the time moment T of all admissible trajectories, starting at the moment τ at a point of the initial set M . In the linear case the argument M in D(T ; τ, M ) is not very essential, and we put τ = 0, and M = {0}. This issue was addressed in [1] under assumption that D(t)|ε=0 is an asymptotically stable matrix for any t. The main result is that the sets Dε (T ) have a limit with respect to the Hausdorff metric as ε → 0, and the rate of convergence is O(εα ), where 0 < α < 1 is arbitrary. In this paper, under the same assumptions, we present a certain refinement of the result [1]. We show that the rate of convergence is ε log 1/ε. Furthermore, I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 280–285, 2010. c Springer-Verlag Berlin Heidelberg 2010
Asymptotics for Singularly Perturbed Reachable Sets
281
by assuming that the support function of U is smooth outside the origin, we find the exact coefficient of the term ε log 1/ε in the asymptotics for the support function of the reachable set. We have to stress that the smoothness assumption is essential. More precisely, the results are valid when all system data are Lipschitz continuous with respect to time t. One can present some examples, where all the system parameters are H¨ older continuous with respect to time (with any positive exponent less than 1), but the assertions of theorems are wrong.
2
Splitting of Dynamic System
Following [4] we problem by using can simplify the original gauge transformax A B F tions. If z = , A = , and B = one can perform a y C/ε D/ε G/ε substitution z = Xw, where X is an invertible 2 × 2 block matrix, and we get a new control system w˙ = A w + B u, where ˙ and B = X −1 B. A = X −1 AX − X −1 X, If X depends on t, and ε in a smooth way, then such a transformation does not change essentially the behavior of the reachable sets, but allow us to simplify the system matrix A so that the smoothness and stability assumptions are preserved. We aim at reducing the system matrix to a block-diagonal form to separate slow and fast variables. For instance, the block A 21 can be eliminated by using the lower-triangular 10 transformations X = . By applying the upper-triangular transformations L1 1N X = , one can get rid of the block A12 . Here, L(t, ε), and N (t, ε) are 0 1 uniformly bounded smooth matrix functions on [0, T ]. Then, we arrive at the split case: + F u, x˙ = Ax + Gu, u ∈ U, εy˙ = Dy
(2)
where all matrices and the convex compact U depend smoothly on time and parameter ε, and D is asymptotically stable at each time instant. We can study the asymptotic behavior of the reachable set Dε (T ) to the control system (2). This simplified problem is equivalent to the original one related to system (1). F, D, G, U To make a further simplification, assume that the parameters A, of system (2) do not depend on ε. The assumption does not result in a loss of generality. Indeed, if we put ε=0 , F = F |ε=0 , D = D| ε=0 , G = G| ε=0 , U = U |ε=0 , A = A| and form the corresponding control system x˙ = A x + F u, εy˙ = D y + G u, u ∈ U ,
(3)
282
E. Goncharova and A. Ovseevich
then, the corresponding reachable sets Dε (T ) coincide with Dε (T ) with accuracy O(ε). This follows immediately from the stability of D and the Levinson theorem (see [4]). The direct computations show that the parameters of systems (1), and (3) are related by A = (A − BD−1 C)|ε=0 , D = D|ε=0 , F = F − BD−1 G|ε=0 , G = G|ε=0 . The asymptotics we are looking for is, in fact, rather crude. Its remainders are of order o(ε log 1/ε) so that an error of order O(ε) is negligible. In what follows, we state our main results in terms of the original system (1) without appealing to splitting, but, in the proofs, the system splitting is heavily used.
3
Asymptotics of the Support Functions to Reachable Sets
Denote by Hε (ξ, η) the support function of the reachable set Dε (T ) to system (1), where ε > 0, ξ, η are dual to the variables x, y. Define a function H0 (ξ, η) =
T
ht (F (t)∗ Φ(T, t)∗ ξ ) dt +
0
∞
∗
hT (G(T )∗ eD(T ) t η) dt,
0
where ξ = ξ − C ∗ D∗−1 η, the function Φ is a fundamental matrix for linear system x˙ = (A − BD−1 C)x, and h = ht is the support function of the set U = Ut of controls. This is, in fact, the support function for the limit reachable set limε→0 Dε (T ). Theorem 1. Let Hε (ξ, η) be the support function of the reachable set Dε (T ) to system (1), then Hε (ξ, η) → H0 (ξ, η) uniformly on compacts as ε → 0. Moreover, we have the asymptotic equivalence: Hε (ξ, η) = H0 (ξ, η) + O(ε log 1/ε(|ξ| + |η|)) as ε → 0. In the proof, we address the split system (3). The proof is based upon the explicit representation of the support function Hε (ξ, η) =
0
T
1 ht (F (t)∗ Φ(T, t)∗ ξ + G(t)∗ Ψε (T, t)∗ η) dt ε
of the corresponding reachable sets, where Ψε is the fundamental matrix of system εy˙ = Dy. The idea underlying the proof goes back at least to [3], and, basically, says that the reachable set to a linear control system can be decomposed in such a way that stable, unstable and neutral parts of the reachable set are formed by using controls supported on nonoverlapping intervals. We divide the time
Asymptotics for Singularly Perturbed Reachable Sets
283
interval [0, T ] into the two subintervals [0, T − δ] and [T − δ, T ], where δ is a small positive parameter. The controls supported on the “long” interval are responsible for the “slow” part of the reachable set, while the controls supported on the “short” interval form the “fast” part of it. The proper choice of δ is crucial for the accuracy of approximation. We choose a small δ > 0 such that δ → 0 and δ/ε → ∞ as ε → 0, The main difference of the present paper with [1] stems from the final choice δ ∼ ε log 1/ε instead of δ ∼ εα . Our main asymptotic result consists in finding the remainder in the previous theorem in a more precise form c(ξ, η)ε log 1/ε + o(ε log 1/ε). We can do this under an extra assumption that the support function h is C 1 -smooth outside ¯ = hT by ζ, and consider the the origin. Denote the argument of the function h average ¯ ¯ ∗ 1 τ ∂h ∂h (G(T )∗ eD(T ) t η) dt. Av τ ( )(η) = ∂ζ τ 0 ∂ζ One can see that there exists the limit Av(f ) = lim Avτ (f ) for any continuous τ →∞
∗
function f of ζ = 0 of degree zero. Indeed, the function φ(t) = G(T )∗ eD(T ) t η has the form of a vector-valued quasipolynomial, and, therefore, φ(t) = e(Re λ)t tN
eiωt aω + o(e(Re λ)t tN ),
where λ is an eigenvalue of D(T ), N + 1 is the maximal size of the corresponding Jordan block, sum is taken over all real ω such that Re λ + iω is an eigenvalue of D(T ), and aω is a time-independent vector. “Generically”, the trigonometric polynomial φ0 (t) = eiωt aω is comprised either of one or two harmonics. We have 1 τ Av (f ) = lim f (φ0 (t)) dt, τ →∞ τ 0 since f is homogeneous of degree zero. But the latter integrand is an almost periodic function, which can be averaged. Define the function c(ξ, η) by 1 c(ξ, η) = Λ
¯ ∗ ∂h ∗ ¯ (η) − h(F (T ) ξ ) , Av F (T ) ξ , ∂ζ
where ξ = ξ − C(T )∗ D(T )∗−1 η, Λ = Λ(η) is the absolute value of the first ∗ Lyapunov exponent of the function t
→ |eD(T ) t η| (this is the modulus of the real part of an eigenvalue of D(T )). Theorem 2. Assume that the support function ¯h = hT of the control set UT is C 1 outside the origin. Then Hε (ξ, η) = H0 (ξ, η) + c(ξ, η)ε log
1 + o (ε log 1/ε) ε
284
4
E. Goncharova and A. Ovseevich
Concluding Remarks
Note that the coefficient c(ξ, η) of ε log 1ε in the asymptotics is nonpositive. ∂¯ h ¯ ∗ ξ ) for any ζ, and the averaging operation preserves (ζ) ≤ h(F Indeed, F ∗ ξ , ∂ζ the inequality. In the generic case, when the set UT of admissible controls at the time instant T is not a singleton, the coefficient c(ξ, η) is not identically (for ¯ all ξ, η) equal to zero. Otherwise, this would imply the equality h(ζ) = ζ, ϕ , ¯ ∂ h ∗ D∗ t ¯ ϕ = Av ( (G e η)), for all ζ and η. The latter necessarily means that h(ζ) ∂ζ ≡0 is the support function for the singleton UT = {ϕ}. The fact that c(ξ, η) means that the estimate given by Theorem 1 is sharp, i.e., for some ξ, η, we have |Hε (ξ, η) − H0 (ξ, η)| ≥ Cε log 1/ε, where C > 0 does not depend on ε. The above results can be illustrated by the simplest possible example of a singularly perturbed linear system: x˙ = u εy ˙ = −y + u, where x, y, u are scalars, and |u| ≤ 1. This example is also presented in [1]. An easy calculation reveals that in this case the difference of the support functions of the prelimit and limit reachable set equals ΔH = Hε (ξ, η) − H0 (ξ, η) = −2tε |ξ| − |η|(2e−tε /ε − e−T /ε ) provided that ξη < 0. Here, tε = ε log 1ε |η| |ξ| . Thus, for fixed ξ, η in this range, the difference ΔH has the form −2|ξ|ε log 1ε + Cε + r, where C is a constant, and the remainder r is exponentially small as ε → 0. This proves again that the estimate in Theorem 1 is sharp. There arises a natural question: Does a greater smoothness of all data implies an improvement of the asymptotic expansion in Theorem 2? We do not know the ultimate answer, but expect that in general there is no such improvement. More precisely, we expect that there exist singularly perturbed systems with C ∞ smoothness of all input data such that the remainder o(ε log 1ε ) in Theorem 2 can be estimated from below as Ω(αε log 1ε ), where α = α(ε) is an arbitrary function such that α(ε) → 0 as ε → 0. For instance, one cannot expect that the remainder is O(ε log 1ε / log log 1ε ).
Acknowledgements The work was supported by RFBR (grants 08-01-00156, 08-08-00292), and partly by Russian Federation President grant for Scientific Schools 1676.2008.1. The authors are grateful to referees for thoughtful comments.
Asymptotics for Singularly Perturbed Reachable Sets
285
References 1. Dontchev, A.L., Slavov, J.I.: Lipschitz properties of the attainable set of singularly perturbed linear systems. Systems & Control Letters 11(5), 385–391 (1988) 2. Dontchev, A.L., Veliov, V.M.: Singular perturbations in Mayer’s problem for linear systems. SIAM J. Control and Opt. 21, 566–581 (1983) 3. Dontchev, A.L., Veliov, V.M.: On the behaviour of solutions of linear autonomous differential inclusions at infinity. C. R. Acad. Bulg. Sci. 36, 1021–1024 (1983); MSC 1991 4. Kokotovich, P.V.: Singular Perturbation Techniques in Control Theory. SIAM Review 6(4), 501–550 (1984) 5. Ovseevich, A.I.: Asymptotic behavior of attainable and superattainable sets. In: Proceedings of the Conference on Modeling, Estimation and Filtering of Systems with Uncertainty, Sopron, Hungary, 1990, pp. 324–333. Birkha¨ user, Basel (1991) 6. Ovseevich, A.I., Figurina, T.Y.: Asymptotic behavior of reachable sets for singularly perturbed linear time-invariant control systems. Appl. Math. & Mech. 62(6), 977– 983 (1998)
On Optimal Control Problem for the Bundle of Trajectories of Uncertain System Mikhail I. Gusev Institute of Mathematics and Mechanics, Russian Academy of Sciences, 16 S. Kovalevskaya Str., 620219 Ekaterinburg GSP-384, Russia
[email protected] Abstract. The problem of optimal choice of inputs is considered for identification of control system parameters by the results of measurements. The integral of information function is accepted as a criterion of optimality. It is shown that the problem may be reduced to an optimal control problem for the bundle of trajectories of the control system generated by the set of unknown parameters.
1
Introduction
The paper deals with the problem of state and parameters estimation for uncertain control system on the basis of available observations corrupted by noise. The considered model of uncertainty is deterministic, with set-membership description of the uncertain items. Unknown parameters of ODEs presenting the system, and the noise in measurements are assumed to be unknown but bounded with preassigned bounds. The set of parameters, consistent with the system equations, measurements, and a priori constraints called information (feasible) set is considered as the solution of estimation problem [10,4,2,8,5]. This set-valued estimate depends on the input (control) of the system. In this paper we consider the problem of optimal input choice [7] for guaranteed estimation of the parameters of dynamic system on the basis of indirect observation. The information sets in the problem may be described as the level sets for so-called information function (information state) [5,1,6]. An information function is defined as a value function for a certain auxiliary optimal control problem. The integral of information function over the set of a priori constraints on parameters is considered as a criterion of optimality. This allows to avoid, when designing an optimal input, the immediate construction of information sets. It is shown that the considered problem may be reduced to an optimal control problem for the bundle of trajectories of the system generated by the set of unknown parameters. We describe the results of numerical simulations for above problem. Finally, an input synthesis procedure based on sequential usage of the results of observations is proposed.
2
Linear Models
Here we consider the input design problem for a linear model in Hilbert space in order to clarify the details of the scheme used in the paper. In this linear I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 286–293, 2010. c Springer-Verlag Berlin Heidelberg 2010
On Optimal Control Problem for the Bundle of Trajectories
287
model the output y available for measurements depends on unknown vector of parameters q = (q1 , ..., qm ) under the equality y=
m
qi ai (u) + ξ.
i=1
Here ai : U → H are given maps from the given set of control parameters U to the real Hilbert space H, ξ ∈ H is treated as a measurement error, u ∈ U is a control. Assume that all a priori information on q and ξ is given by the conditions q = (q1 , ..., qm ) ∈ Q ⊂ Rm ,
ξ ∈ Ξ = {ξ : ξ, ξ ≤ 1},
(1)
where ·, · is an inner product in H, Q is a compact set in Rm . An input design problem consists of two stages. The fist one is the identification problem, the second is the choice of optimal input, providing the best quality of identification. The identification problem is related to an estimation of unknown value of q on the basis of measurement of the output y. The solution of this problem is the information (feasible) set [2,5] consisting of all values of q consistent with the results of measurement and a priori data. Let y, u be given. An information set is determined as follows ˆ u) = {q ∈ Q : ∃ ξ, ξ, ξ ≤ 1, y = Q(y,
m
qi ai (u) + ξ}.
i=1
ˆ u) contains an unknown true value of q. The set Q(y, The quality of identification usually is characterized by the value of some ˆ u)), which is defined on the class of sets. A radius, scalar functional Ω(Q(y, diameter or a volume of the set may be considered as a such functional. The problem of optimal input choice takes on the following form: to find the control (input) u ∈ U , solving the problem ˆ u)) → min . max Ω(Q(y, y
u∈U
Here the maximum is taken in all possible values of y, or, equivalently, in all pairs q, ξ, satisfying constraints (1). A disadvantage of such approach is a necessity of constructing of information sets for calculation of optimal inputs. The solution of the last problem requires laborious computing procedures (especially in the case of nonlinear identification problems, where an information set may be nonconvex or even nonconnected). Further, we modify the statement of the problem in order to avoid the direct constructing of information sets in the process of calculation of optimal input. The proposed approach is based on the notion of information function (information state) of the problem. In the case of linearmodel this function m V (y, u, q) m is determined by the equality V (y, u, q) = y − i=1 qi ai (u), y − i=1 qi ai (u).
288
M.I. Gusev
ˆ u) = {q ∈ Q : V (y, u, q) ≤ 1}. If for Obviously, under given u, y we have Q(y, the controls u ˆ, u ¯ under given y V (y, u ˆ, q) ≤ V (y, u ¯, q)
(2)
ˆ u ˆ u for every q ∈ Q, then Q(y, ¯) ⊂ Q(y, ˆ). Hence, u ¯ is more preferable than uˆ because it gives more precise estimate of unknown parameter. Consider a scalar functional on the set of information states, which is monotone with respect to relation (2) and is defined in the following way I(y, u) = V (q, y, u)dμ(q), Q
where μ is a nonnegative measure defined on the σ-algebra of Lebesgue measurable subsets of Q with the property μ(Q) = 1. Denote by q ∗ the true value of uncertain parameter and by ξ ∗ – the realization of disturbance in measurements. The output y ∗ is a function of q ∗ , ξ ∗ and input u: y ∗ = y ∗ (q ∗ , ξ ∗ , u). Depending on the way of accounting the dependence of y from parameters the following statements of the problem are possible. Problem 1. Find u ∈ U , maximizing the functional I1 (y ∗ , u) = V (q, y ∗ (q ∗ , ξ ∗ , u), u)dμ(q).
(3)
Q
Problem 2. Find u ∈ U , maximizing the functional ∗ V (q, y ∗ (q ∗ , ξ ∗ , u), u)dμ(q). I2 (q , u) = ∗inf ξ ∈Ξ
(4)
Q
Problem 3. Find u ∈ U , maximizing the functional V (q, y ∗ (q ∗ , ξ ∗ , u), u)dμ(q). I3 (u) = ∗ inf∗ ξ ∈Ξ,q ∈Q
(5)
Q
Introduce the following definitions. Let K = {k ∈ Rm : ki ∈ {0, 1}, i = 1, ..., m}, q¯ ∈ Rm . By pk denote a mapping Rm → Rm defined by the equalities pi (q) = (−1)ki qi , i = 1, ..., m. The set Q is said to be q¯-symmetrical, if q ∈ Q implies pk (q − q¯) + q¯ ∈ Q for every k ∈ K. For the set E ⊂ Q denote E k = pk (E − q¯) + q¯. For measurable E the set E k is also measurable; if Q q¯ is symmetrical then E k ⊂ Q. The measure μ is said to be q¯-symmetrical if μ(E k ) = μ(E) for each measurable E ⊂ Q. Assumption 1. There exists q¯ ∈ Q such that the set Q and the measure μ are q¯-symmetrical.
On Optimal Control Problem for the Bundle of Trajectories
289
¯ = Q − q¯, as μ Let Assumption 1 holds. Denote Q ¯ denote the on m measure i ¯ Q, defined by the equality μ ¯ (E) = μ(E + q ¯ ). Let y ˜ = y − q ¯ a (u) = i i=1 m ¯i )ai (u) + ξ. Then i=1 (qi − q I(y, u) =
V (q, y, u)dμ(q) =
Q
˜ y , y˜dμ(q) − 2
m i=1 Q
Q
+
m
(qi − q¯i )dμ(q)˜ y , ai (u)+
pij (u)
(qi − q¯i )(qj − q¯j )dμ(q),
i,j=1
Q
where pij (u) = ai (u), aj (u). Lemma 1. Under assumption 1 the following equalities hold (qi − q¯i )dμ(q) = 0, (qi − q¯i )(qj − q¯j )dμ(q) = 0, i, j = 1, ..., m, i = j. Q
Q
This lemma follows from the elementary properties of Lebesgue integral. From m lemma 1 it follows that I(y, u) = ˜ y , y˜ + i=1 Ai pii (u), Ai = Q (qi − q¯i )2 dμ. Calculating the infimum in ξ ∗ , we get I2 (q ∗ , u) = ∗inf V (q, y ∗ (q ∗ , ξ ∗ , u), u)dμ(q) ξ ∈Ξ
Q
= φ((q ∗ − q¯) P (u)(q ∗ − q¯)) +
m
Ai pii (u),
i=1
where P (u) is a matrix with the elements pij (u), i, j = 1, ..., m and the function φ(x) is defined by the equality φ(x) =
0 0 ≤ x ≤ 1, √ ( x − 1)2 x ≥ 1.
Calculating the infimum I2 in q ∗ , we have I3 (u) = inf I2 (q ∗ , u) = ∗ q
m
Ai pii (u).
i=1
Proposition 1. Let for every u ∈ U , q ∗ ∈ Q (q ∗ − q¯) P (u)(q ∗ − q¯) ≤ 1, and assumption 1 holds. Then the solutions of problems 2, 3 coincide.
(6)
290
3
M.I. Gusev
Input Design Procedure for Nonlinear Control System
An analog of Problem 2 from the previous part is studied here for nonlinear control system with uncertain parameters. Consider the control system x˙ = f (t, q, x, u(t)), t ∈ [t0 , t1 ], x(t0 ) = x0 ,
(7)
(x ∈ R , u ∈ R ) with the right-hand side f depending on unknown parameter q ∈ Rm . We assume that all a priori information on q is given by the inclusion q ∈ Q where Q is a compact set in Rm . As an admissible control (input) we will consider a Lebesgue-measurable function u : [t0 , t1 ] → U , where U ⊂ Rr . We assume that f (t, q, x, u) is continuous and continuously differentiable in x on [t0 , t1 ] × Q × Rn × U . The solution of system (7) is denoted as x(t, q) (or x(t, q, u(·)). Consider the measurement equation on [t0 , t1 ] n
r
y(t) = g(t, x(t)) + ξ(t),
t ∈ [t0 , t1 ],
(8)
corrupted by unknown but bounded noise ξ(t). Here g : [t0 , t1 ] × R → R is continuous function. A priori information on ξ(t) is assumed to be given by the inclusion ξ(·) ∈ Ξ, where Ξ is a bounded set in the space Lk2 [t0 , t1 ]. Suppose that n
Ξ = {ξ(·) : W (ξ(·)) ≤ 1},
W (ξ(·)) =
t1
k
ξ (t)Rξ(t)dt,
t0
R is a given positive defined matrix. Let y(t) be the result of measurements, generated by unknown ”true” value of q ∗ ∈ Q, input u(t), and measurement error ξ(t). The function q → V (q, y(·), u(·)), defined by the equality V (q, y(·), u(·)) = W (y(·) − g(·, x(·, q))) is said to be an information function(information state) of the problem (7),(8). The set Q(y(·), u(·)) of all parameters q ∈ Q that are consistent with (7), (8) and a priori constraints is referred to as the information set relative to measurement y(t). It follows directly from definitions that Q(y(·), u(·)) = {q ∈ Q : V (q, y(·), u(·)) ≤ 1}. Unknown q ∗ belongs to the information set. We consider an integral of information function as a objective functional of the problem (9) I(y(·), u(·)) = V (q, y(·), u(·))dμ(q). Q
Here μ is a nonnegative measure defined on Lebesgue subsets of Q such that μ(Q) = 1. The functional I is nonnegative, the more value of I corresponds to a more accurate estimate of unknown quantity of parameter q. The integral (9) depends on u(·) and the result of measurements y(·). In turn, y(t) = y ∗ (t) + ξ(t), where y ∗ (t) = g(t, x(t, q ∗ , u(·))) and ξ(t) is the measurement error. In the worst case, the value of I is equal to J(u(·)) = inf V (q, y(·) + ξ(·), u(·))dμ(q). W (ξ(·))≤1 Q
On Optimal Control Problem for the Bundle of Trajectories
291
Direct calculations lead to the following formula for J J(u(·)) = I1 (u(·)) + φ(I2 (u(·))), where
t1 I1 (u(·))) =
(10)
r(t, q) Rr(t, q)dμ(q)dt,
t0 Q
t1 I2 (u(·))) =
r(t, q) dμ(q)R
t0 Q
r(t, q)dμ(q)dt, Q
r(t, q) = g(t, x(t, q ∗ )) − g(t, x(t, q)),
φ(x) =
−x if 0 ≤ x ≤ 1 √ 1 − 2 x if x ≥ 1.
Thus, the optimal input design problem is equivalent to the maximization of the functional J on the tubes of trajectories of uncertain system (7). It is similar in a certain way to the problems of beam optimization [9]. The necessary conditions of optimality for this problem [3] constitute the basis for elaboration of numerical algorithms for constructing optimal inputs. These algorithms are based on conventional gradient method. On Fig. 1 the results of numerical simulation for the system describing oscillations of nonlinear pendulum are presented. The orange and blues lines denote the boundaries of information sets corresponding to optimal and some non optimal inputs. These information sets are constructed for the case of hard (magnitude) constraints on measurement noise: |ξ(t)| ≤ 1, t ∈ [t0 , t1 ].
0
−0.1
−0.2
−0.3
−0.4
−0.5
−0.6
−0.7
−0.8
−0.9
−1
−3
−2
−1
0
1
2
3
4
Fig. 1. Information sets for nonlinear pendulum
5
292
4
M.I. Gusev
Input Synthesis Procedure
The solution of input design problem considered in the previous section depends on unknown value q ∗ of parameter q. In this part we describe the procedure of input construction based on sequential estimates of this parameter. This procedure does not require the knowledge of q ∗ . Consider the partition {τ0 , ..., τN } of the interval [t0 , t1 ]. At the zero step we choose some initial estimate qˆ0 of q ∗ . Define the solution u0 (t) for the problem Iqˆ0 (u(·)) → min . Here notation Iqˆ0 (u(·)) is used for the functional I with qˆ0 being taken instead of q ∗ . Then we apply this control on [τ0 , τ1 ], measure the output y(·) on [τ0 , τ1 ] and go to the next step. Let us describe the k-th step . At this step we have the input u(t) to be given on the interval [t0 , τk ]. Choose the estimate qˆk of q ∗ from the results of measurement of output on [t0 , τk ]. Define the solution uk (t) for the problem Iqˆk (u(·)) → min assuming u(t) on [τ0 , τk ] to be equal to uk−1 (t). Apply this control on [τk , τk+1 ]. Measure the output y(·) on [τk , τk+1 ]. Go to the next step. As an example of application of input synthesis procedure consider the problem of optimal elevator deflection inputs for identifying parameters in the longitudinal shot period equations of aircraft [7]. The second order control system is considered on the interval [0, 4]. The system equations are as follows x˙ 1 = q1 x1 + q2 x2 − 1.66u, x˙ 2 = x1 + 0.737x2 + 0.005u,
(11)
x1 (0) = 0, x2 (0) = 0. Here x1 is the angle of attack, x2 is the pitch rate, u is the elevator command, |u| ≤ 10, t is a time. The coefficients q1 , q2 are unknown and should be specified on the base of observations of y(t) = x1 (t) on the interval [0, 4]. A priori constraints on the measurement error ξ and parameters q = 4 (q1 , q2 ) are given by the relations 0 ξ 2 (t)dt ≤ 1, q ∈ Q, where Q is the square Q = {q : max |qi − qˆi | ≤ 0.4} i
with center qˆ = (−1.25, −0.75). Let the unknown true value of a vector q be equal to q ∗ = (−1.588; −0.562). For μ we take a measure, with equal values at the vertices of Q. Note that in considered example we don’t recalculate the measure μ at each step of the procedure. As the estimate qˆk we take mean value of the corresponding information set. The Figure 2 presents the results of numerical simulation. The blue star on the picture denotes the ”true” value q ∗ of q, and the red crosses denote the sequential estimates qˆk of this parameter. The black lines denote the boundaries of information sets corresponding to q ∗ and qˆk at the final step of procedure. In considered example these sets and corresponding optimal inputs are sufficiently close to each other.
On Optimal Control Problem for the Bundle of Trajectories
293
−0.4 −0.45
q
2
−0.5 −0.55 −0.6 −0.65 −0.7 −0.75 −1.75
−1.7
−1.65
−1.6
−1.55
−1.5
−1.45
−1.4
q1
Fig. 2. Information sets and parameter estimates
Acknowledgment The research was supported by the Russian Foundation for Basic Research (RFBR) under Project 09-01-00589.
References 1. Baras, J.S., Kurzhanski, A.B.: Nonlinear filtering: the set-membership (bounding) and the H-infinity techniques. In: Proc. 3rd IFAC Symp. on nonlinear control systems design, pp. 409–418. Pergamon Press, Oxford (1995) 2. Chernousko, F.L.: State Estimation for Dynamic Systems. CRC Press, Boca Raton (1994) 3. Gusev, M.I.: Optimal Inputs in Guaranteed Identification Problem. Proceedings of the Steklov Institute of Mathematics (suppl. 1 ), S95–S106 (2005) 4. Kurzhanski, A.B.: Control and Observation under Conditions of Uncertainty. Nauka, Moscow (1977) 5. Kurzhanski, A.B., Valyi, I.: Ellipsoidal Calculus for Estimation and Control. Birkhauser, Boston (1997) 6. Kurzhanski, A.B., Varaiya, P.: Optimization Techniques for Reachability Analysis. Journal of Optimization Theory and Applications 108, 227–251 (2001) 7. Mehra, R.K.: Optimal inputs signals for parameter estimation in dynamic systems – Survey and new results. IEEE Trans. on Autom. Control AC–19, 753–768 (1974) 8. Milanese, M., et al. (eds.): Bounding Approach to System Identification. Plenum Press (1995) 9. Ovsyannikov, D.A., Ovsyannikov, A.D.: Mathematical control model for beam dynamics optimization. In: Proceedings of the International Conference on Physics and Control, vol. 3, pp. 974–979 (2003) 10. Schweppe, F.: Uncertain Dynamic Systems. Prentice Hall, Englewood Cliffs (1973)
High-Order Approximations to Nonholonomic Affine Control Systems Mikhail I. Krastanov1 and Vladimir M. Veliov1,2 1
Institute of Mathematics and Informatics, Bulgarian Academy of Sciences Acad. G. Bonchev Str. Bl. 8, 1113 Sofia, Bulgaria
[email protected] 2 Institute of Mathematical Methods in Economics, Vienna University of Technology
[email protected]
Abstract. This paper contributes to the theory of approximations of continuous-time control/uncertain systems by discrete-time ones. Discrete approximations of higher than first order accuracy are known for affine control systems only in the case of commutative controlled vector fields. The novelty in this paper is that constructive second order discrete approximations are obtained in the case of two non-commutative vector fields. An explicit parameterization of the reachable set of the Brockett non-holonomic integrator is a key auxiliary tool. The approach is not limited to the present deterministic framework and may be extended to stochastic differential equations, where similar difficulties appear in the non-commutative case.
1
Introduction
In this paper we present a new result about approximation of an affine control system by appropriately constructed discrete-time systems. The novelty is that the approximation is of second order accuracy with respect to the discretization step, while the vector fields defining the dynamics are not assumed commutative. In the commutative case higher than first order approximations are presented e.g. in [13,4,9]. However, it is well known that the non-commutativity is a substantial problem for second order approximations of control/uncertain systems and of stochastic differential equations (cf. [8]), due to the appearance of mixed multiple integrals in the approximation by Volterra (Chen-Fliess) series. We present an approach for obtaining higher-order approximations that combines the truncated Chen-Fliess series with the constructive description of the reachable set of an auxiliary control system which depends on the bounds on the controls, but is independent of the particular vector fields in the original system. In the particular case of a two-inputs system, on which we focus in this paper, the auxiliary system is the well known Brockett non-holonomic integrator, for the reachable set of which we present an explicit parameterization making use of [12]. The approach is not limited to the present deterministic framework and I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 294–301, 2010. c Springer-Verlag Berlin Heidelberg 2010
High-Order Approximations to Nonholonomic Affine Control Systems
295
may be extended to stochastic differential equations, where a similar difficulty appears. Let us consider an affine control system x˙ =
m
ui fi (x),
x(0) = x0 ∈ Rn ,
u = (u1 , . . . , um ) ∈ U,
t ∈ [0, T ],
(1)
i=1
where T > 0 is a fixed time horizon and U is a compact subset of Rm . Admissible inputs are all measurable functions u : [0, T ] → U , and the set of all such functions is denoted by U = U(0, T ). Under the assumptions formulated in the next section every u ∈ U defines an unique trajectory of (1) on [0, T ], denoted further by x[u](t). In parallel, for every natural number N (presumably large) we consider a discrete-time system y k+1 = gN (y k , v k ), y 0 = x0 ,
v k = (v1k , . . . , vpk ) ∈ V,
k = 0, . . . , N − 1, (2)
where V is a compact subset of Rp . An admissible control of this system is any element v = (v 0 , . . . , v N −1 ) ∈ (V )N , and the set of all such vectors is denoted by VN . For v ∈ VN denote by y[v] = (y 0 [v], . . . , y N [v]) the corresponding solution of (2). System (2) will be viewed as an approximation of (1). To make this precise we denote h = T /N , tk = hk, and link y k with x(tk ). In addition, in order to relate a discrete control v ∈ VN with a continuous-time control u ∈ U, we require that a mapping E : V → U(0, h) is defined associating to any v ∈ V an admissible control u = E(v) on the interval [0, h]. Then any v ∈ VN defines an admissible control from U (denoted by EN (v)) as EN (v)(s) = E(v k )(s − tk ) for s ∈ [tk , tk+1 ). Definition 1. The sequence of systems (2) and mappings EN provides an approximation of order r to (1) if there is a constant c such that for every sufficiently large N the following holds true: (i) for each u ∈ U there exists v ∈ VN for which x[u](tk ) − y k [v] ≤ chr , k = 0, . . . , N ; (ii) for each v ∈ VN it holds that x[EN (v)](tk ) − y k [v] ≤ chr , k = 0, . . . , N . The above definition requires that every trajectory of (1) is approximated with accuracy O(hr ) by a trajectory of (2), and that from every control of (2) one can recover a control of (1) which provides the same order of approximation for the respective trajectories. It is well known (see [3]) that the Euler discretization scheme provides even in a much more general formulation a first order approximation. In our case the m respective specifications are V = U , gN (y, v) = y + h i=1 vi fi (y), E(v)(t) ≡ v. In Section 2 a system of the type of (2) that provides a second order approximation to (1) is implicitly defined. Thanks to the investigation of an auxiliary problem in Section 3, the second order approximation is made explicit in the case of two controls (m = 2) in Section 4.
296
2
M.I. Krastanov and V.M. Veliov
A Second Order Approximation: Basic Idea
Below we assume that the control u is two-dimensional, that the set U ⊂ R2 is compact and that the functions fi : X → Rn are differentiable with Lipschitz n continuous derivatives. Here X ⊂ R is a set that contains in its interior the point x0 and all trajectories of (1) on [0, T ]. At a point x ∈ int X and for a “short” horizon [0, h] one can approximate in a standard way the solution of (1) starting from x and corresponding to a measurable control u by the truncated Volterra (Chen-Fliess) series (see e.g. [7]): 2 m h 1 x(h) = x + fi (x) ui (t) dt + f fi (x) ui (t) dt 2 i=1 i 0 0 i=1 h h + fi fj (x) ui (t) dt uj (t) dt
m
h
0
1≤j
+
0
h
[fi , fj ](x)
ui (t)
0
1≤i<j≤m
t
uj (s) ds dt + O(h3 ),
0
with [fi , fj ] = fi fj − fj fi , and |O(h3 )| ≤ ch3 , where c depends only on an upper bound of |fi |, the Lipschitz constant of fi around x and |U | = maxu∈U |u|. Changing the variables of integration we obtain the representation 1 2 m h2 fi (x) ui (ht) dt + f fi (x) ui (ht) dt x(h) = x + h 2 i=1 i 0 0 i=1 1 1 +h2 fi fj (x) ui (ht) dt uj (ht) dt m
1≤j
+h2
1≤i<j≤m
1
0
(3)
0
1
[fi , fj ](x)
ui (ht) 0
t
uj (hs) ds dt + O(h3 ).
0
This representation is not symmetric, but can easily be put into a symmetric form by summing the versions of the above representation corresponding to all permutations of the control variables. In the case m = 2 considered in this paper the symmetric representation gives the alternative formula (to obtain this representation one can use the explicit formulae obtained in [1], [6] and [10] for product expansion of the Chen-Fliess series) x(h) = gN (x, v) + O(h3 ),
(4)
where gN (y, v) = y + h(f1 (y)v1 + f2 (y)v2 ) +
h2 f fj (y)vi vj + h2 [f1 , f2 ](y)v3 2 i,j=1,2 i
High-Order Approximations to Nonholonomic Affine Control Systems
297
and
1
v1 =
u1 (ht) dt, 0
1
u1 (ht)
v3 = 0
v2 =
1
u2 (ht) dt, 0
t
0
1
u2 (hs) ds dt −
t
u2 (ht) 0
u1 (hs) ds dt.
0
The above relations define a mapping L1 ([0, h] → U ) u −→ F (u) = v that maps the admissible controls on [0, h] onto a set V3 ⊂ R3 . The set V3 is exactly the reachable set at time t = 1 of the auxiliary control system v˙ 1 = u1 , v˙ 2 = u2 , v˙ 3 = u1 v2 − u2 v1 ,
v1 (0) = 0, v2 (0) = 0, v3 (0) = 0.
(u1 , u2 ) ∈ U.
(5)
Let E be any selection mapping of the inverse F −1 , that is V v −→ E(v) ∈ F −1 (v), (thus E “recovers” a control function u = E(v) from a vector v ∈ V3 ). By a standard propagation/accumulation error analysis one can obtain the following result. Theorem 1. The discrete-time system (2) with the specifications (2), V3 , E, provides a second order approximation to the control system (1) for the case m = 2, in the sense of Definition 1. If the original control system (1) contains a drift term f , then one can obtain a respective O(h3 ) local approximation applying (3) with the substitution U := 1 × U . In this case the mapping gN and the discrete control set V = V5 are defined as h2 f f (y) + h (f1 f (y)v1 + f2 f (y)v2 ) (6) 2 +h ([f, f1 ](y)v3 + [f, f2 ](y)v4 ) h2 f fj (y)vi vj + h2 [f1 , f2 ](y)v5 , +h(f1 (y)v1 + f2 (y)v2 ) + 2 i,j=1,2 i
gN (y, v) = y + hf (y) +
where (v1 , . . . , v5 ) ∈ V5 , and V5 is the reachable set of the 5-dimensional control system v˙ 1 v˙ 2 v˙ 3 v˙ 4 v˙ 5
= u1 , = u2 , = v1 , = v2 , = u1 v2 − u2 v1 ,
v1 (0) = 0, v2 (0) = 0, v2 (0) = 0, v4 (0) = 0, v5 (0) = 0.
(u1 , u2 ) ∈ U.
A similar theorem holds true with the specifications of gN in (6), V5 , and E, where u = E(v) is a control with values in U that reaches the point v ∈ V5 .
298
M.I. Krastanov and V.M. Veliov
We stress that the sets V3 and V5 are independent of the vector fields fi , hence they can be “pre-calculated” by a representative set of points for typical control constraining sets U , such as boxes, balls, or ellipsoids. Such a “pre-calculation” is possible for V3 but is hard for V5 . Moreover, the utilization of the approximation from Theorem 1 with a set V represented by a huge number of points is limited to simulation purposes. An explicit parameterization of the discrete control set V , that may make it efficiently usable also in the optimal control context, is discussed in the next sections.
3
An Auxiliary Problem
In this section we investigate the reachable set V3 of the auxiliary control system (5) in the most typical case of U = [−1, 1] × [−1, 1]. This is a particular be-linear system well known in the literature as “Brocket non-holonomic integrator” [2]. The pair of vector fields B1 (v) = (1, 0, v2 ) and B2 (v) = (0, 1, −v1 ) that defines the system is nilpotent of order two and generates an Lie algebra of rank three at the origin. Thus, the system is small-time locally controllable at the origin. Hence, V3 contains the origin in its interior (cf. for example [5] or [11]). The description of the reachable set V3 is complicated by the fact that it is non-convex. More precisely, the following lemma holds for the reachable set of (5) on [0, θ], denoted further by V3 (θ). Lemma 1. The reachable set V3 (θ) is not-convex for each θ > 0. The next lemma is obvious. Lemma 2. The projection of V3 (θ) on the (v1 , v2 )-plane is the square V2 (θ) centered at the origin, with sides parallel to the axes and of length 2θ. The following result plays the key role in the characterization of V3 . Lemma 3. If a point (v1 , v2 , v3 ) belongs to V3 (θ), then also (v1 , v2 , αv3 ) belongs to V3 (θ) for each α ∈ [−1, 1]. This lemma is proved, essentially, in [12], although our proof (to be presented elsewhere) is technically different. It employs the chronological calculus (see [1]) and is applicable for multi-input systems (with m > 2) and for systems with a drift term (see the end of the previous section), in contrast to the geometric approach in [12]. The last two lemmas together imply that the set V3 (θ) can be recovered from its upper boundary, that is, from the surface Γ (θ) = {v ∈ R3 : (v1 , v2 ) ∈ V2 (θ), v3 = max{w : (v1 , v2 , w) ∈ V3 (θ)}}. Notice that according to Lemma 1 this surface is not a graph of a concave function. Next, we shall define a parameterized family of controls that generate the upper boundary Γ (θ) of the reachable set V3 (θ). In doing that we rely on the results in [12].
For any θ > 0 define the sets

P^1(θ) = {(μ, τ) : θ/3 ≤ τ ≤ θ, 0 ≤ μ ≤ τ, θ − 2τ ≤ μ ≤ θ − τ},
P^2(θ) = {(μ, τ) : θ/4 ≤ τ ≤ θ/2, 0 ≤ μ ≤ τ, θ − 3τ ≤ μ ≤ θ − 2τ}.

For each parameter p = (μ, τ) ∈ P^1(θ) we define the control u^{1,p} on [0, θ] as follows:

u^{1,p}(t) = (−1, 1)  for t ∈ [0, μ),
             (1, 1)   for t ∈ [μ, μ + τ),
             (1, −1)  for t ∈ [μ + τ, θ].
Moreover, for each p = (μ, τ) ∈ P^2(θ) we define u^{2,p} as

u^{2,p}(t) = (−1, 1)   for t ∈ [0, μ),
             (1, 1)    for t ∈ [μ, μ + τ),
             (1, −1)   for t ∈ [μ + τ, μ + 2τ),
             (−1, −1)  for t ∈ [μ + 2τ, θ].
Let us define the following symmetries in R^3: for v = (v1, v2, v3),

S0(v) = (v2, v1, v3),   S1(v) = (−v1, v2, v3),   S2(v) = (v1, −v2, v3),
S12(v) = (−v1, −v2, v3),   S012(v) = (−v2, −v1, v3).

Proposition 1. For θ > 0 define the sets

Γ^i(θ) = {v[u^{i,p}](θ) : p ∈ P^i(θ)},   i = 1, 2,
where v[u](·) is the trajectory of (5) corresponding to the control u : [0, θ] → U. Then the following representation holds true:

Γ(θ) = Γ^1 ∪ S0(Γ^1) ∪ S12(Γ^1) ∪ S012(Γ^1) ∪ Γ^2 ∪ S1(Γ^2) ∪ S2(Γ^2) ∪ S12(Γ^2),

where the argument θ is suppressed in the right-hand side. The essence of the above proposition is two-fold. First, it implies that every point of the upper boundary of the reachable set V3 (in fact also of the whole boundary) can be reached by a bang-bang control with at most three switches. However, this information is not enough to obtain a one-to-one parameterization, since three switching times would give a redundant parameterization of the boundary, which is two-dimensional. The proposition claims more, namely that in the case of three switches the distance between the first and the second switch equals the distance between the second and the third switch (see the formula for u^{2,p}(t) above). Hence the needed switching points are always determined by the two parameters μ and τ. Our proof of this fact (in contrast to the geometric proof in [12]) makes use of the chronological calculus [1] and is applicable to higher-order auxiliary systems, in particular to the 5-dimensional system at the end of Section 2.
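Because the controls u^{1,p} and u^{2,p} are piecewise constant, the corresponding trajectories of the Brockett integrator (5) can be integrated exactly segment by segment, which makes it easy to compute points of Γ^1(θ) and Γ^2(θ) numerically. The following Python sketch is our own illustration (the function names are not from the paper).

def integrate_brockett(segments, v=(0.0, 0.0, 0.0)):
    # Exact integration of system (5): v1' = u1, v2' = u2, v3' = u1*v2 - u2*v1.
    # On a segment with constant (u1, u2) the integrand u1*v2 - u2*v1 stays
    # constant (the terms linear in t cancel), so each piece is exact.
    v1, v2, v3 = v
    for dt, (u1, u2) in segments:
        v3 += (u1 * v2 - u2 * v1) * dt
        v1 += u1 * dt
        v2 += u2 * dt
    return (v1, v2, v3)

def u1_control(mu, tau, theta):
    # Bang-bang control u^{1,p} on [0, theta] for p = (mu, tau) in P^1(theta),
    # given as (duration, (u1, u2)) segments.
    return [(mu, (-1.0, 1.0)), (tau, (1.0, 1.0)), (theta - mu - tau, (1.0, -1.0))]

def u2_control(mu, tau, theta):
    # Bang-bang control u^{2,p} on [0, theta] for p = (mu, tau) in P^2(theta).
    return [(mu, (-1.0, 1.0)), (tau, (1.0, 1.0)), (tau, (1.0, -1.0)),
            (theta - mu - 2.0 * tau, (-1.0, -1.0))]

# One point of Gamma^1(1): p = (0.1, 0.5) lies in P^1(1).
print(integrate_brockett(u1_control(0.1, 0.5, 1.0)))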
4
A Constructive Second Order Scheme for Two Non-commutative Fields
Thanks to the results of the previous section we can formulate explicitly a discrete-time system of the type of (2) in the case m = 2 and for U = [−1, 1] × [−1, 1]. Namely,

y^{k+1} = gN(y^k, v^k),   y^0 = x0,   v^k = (v1^k, v2^k, v3^k) ∈ V3 ⊂ R^3,   k = 0, . . . , N − 1,

where

gN(y, v) = y + h (f1(y) v1 + f2(y) v2) + (h^2/2) Σ_{i,j=1,2} fi'fj(y) vi vj + h^2 [f1, f2](y) v3,
and

V3 = V3^1 ∪ S0(V3^1) ∪ S12(V3^1) ∪ S012(V3^1) ∪ V3^2 ∪ S1(V3^2) ∪ S2(V3^2) ∪ S12(V3^2),

V3^1 = {v = (v1, v2, v3) : v1 = 2μ − 1, v2 = 2(μ + τ) − 1, v3 = 2α(1 − τ)τ, (μ, τ) ∈ P^1(1), α ∈ [−1, 1]},

V3^2 = {v : v1 = 1 − 4τ, v2 = 2(μ + τ) − 1, v3 = 2α[(1 − μ − 2τ)(τ − μ) + τ(μ + τ)], (μ, τ) ∈ P^2(1), α ∈ [−1, 1]}.

Notice that the discrete system is not affine, although the original one is. Moreover, the constraint set V3 for the discrete-time control v^k is non-convex. All this makes it somewhat questionable whether the obtained second-order approximation would be useful in an optimal control context. This issue deserves additional investigation and perhaps the development of specific optimization methods. The case where the control constraint U is the unit ball in R^2 also requires additional investigation since, to our knowledge, the reachable set of the auxiliary system (5) has not been constructively parameterized.
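As an illustration of how this parameterization can be used, the following sketch draws points of V3 by sampling (μ, τ) from P^1(1) or P^2(1) by rejection, computing the corresponding v from the formulas for V3^1 and V3^2, and applying one of the symmetry maps. It is only a sampler of points belonging to V3 (the induced distribution is arbitrary), and the function names are ours.

import random

def S0(v):   return ( v[1],  v[0], v[2])
def S1(v):   return (-v[0],  v[1], v[2])
def S2(v):   return ( v[0], -v[1], v[2])
def S12(v):  return (-v[0], -v[1], v[2])
def S012(v): return (-v[1], -v[0], v[2])

def sample_V3(n_points):
    # Draws points of V3 from the parameterizations of V3^1 and V3^2 (theta = 1)
    # and the symmetry maps appearing in the union above.
    points = []
    while len(points) < n_points:
        alpha = random.uniform(-1.0, 1.0)
        if random.random() < 0.5:
            # (mu, tau) in P^1(1): 1/3 <= tau <= 1, 0 <= mu <= tau, 1 - 2*tau <= mu <= 1 - tau
            tau = random.uniform(1.0 / 3.0, 1.0)
            mu = random.uniform(0.0, tau)
            if not (1.0 - 2.0 * tau <= mu <= 1.0 - tau):
                continue
            v = (2.0 * mu - 1.0, 2.0 * (mu + tau) - 1.0, 2.0 * alpha * (1.0 - tau) * tau)
            sym = random.choice([lambda x: x, S0, S12, S012])
        else:
            # (mu, tau) in P^2(1): 1/4 <= tau <= 1/2, 0 <= mu <= tau, 1 - 3*tau <= mu <= 1 - 2*tau
            tau = random.uniform(0.25, 0.5)
            mu = random.uniform(0.0, tau)
            if not (1.0 - 3.0 * tau <= mu <= 1.0 - 2.0 * tau):
                continue
            v3 = 2.0 * alpha * ((1.0 - mu - 2.0 * tau) * (tau - mu) + tau * (mu + tau))
            v = (1.0 - 4.0 * tau, 2.0 * (mu + tau) - 1.0, v3)
            sym = random.choice([lambda x: x, S1, S2, S12])
        points.append(sym(v))
    return points

print(sample_V3(5))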
Acknowledgments This research was supported by the Austrian Science Foundation (FWF) under grant No P18161-N13 and partly by the Bulgarian Ministry of Science and Higher Education – National Fund for Science Research under contract DO 02–359/2008.
References
1. Agrachev, A., Gamkrelidze, R.: The exponential representation of flows and the chronological calculus. Math. USSR Sbornik, N. Ser. 107, 467–532 (1978)
2. Brockett, R.W.: Asymptotic stability and feedback stabilization. In: Differential Geometric Control Theory (Proc. Conf., Mich. Technol. Univ.), Progr. Math., vol. 27, pp. 181–191 (1982)
3. Dontchev, A., Farkhi, E.: Error estimates for discretized differential inclusions. Computing 41(4), 349–358 (1989)
4. Ferretti, R.: High-order approximations of linear control systems via Runge-Kutta schemes. Computing 58, 351–364 (1997)
5. Hermes, H.: Control systems with decomposable Lie algebras. Special issue dedicated to J. P. LaSalle. J. Differential Equations 44(2), 166–187 (1982)
6. Kawski, M., Sussmann, H.J.: Noncommutative power series and formal Lie-algebraic techniques in nonlinear control theory. In: Helmke, U., Prätzel-Wolters, D., Zerz, E. (eds.) Operators, Systems, and Linear Algebra, pp. 111–128. Teubner (1997)
7. Grüne, L., Kloeden, P.E.: Higher order numerical schemes for affinely controlled nonlinear systems. Numer. Math. 89, 669–690 (2001)
8. Kloeden, P.E., Platen, E.: Numerical Solution of Stochastic Differential Equations. Springer, Heidelberg (1992) (third revised printing, 1999)
9. Pietrus, A., Veliov, V.M.: On the Discretization of Switched Linear Systems. Systems & Control Letters 58, 395–399 (2009)
10. Sussmann, H.: A product expansion of the Chen series. In: Byrnes, C.I., Lindquist, A. (eds.) Theory and Applications of Nonlinear Control Systems, pp. 323–335. Elsevier, North-Holland, Amsterdam (1986)
11. Sussmann, H.: A general theorem on local controllability. SIAM Journal on Control and Optimization 25, 158–194 (1987)
12. Vdovin, S.A., Taras'ev, A.M., Ushakov, V.N.: Construction of an attainability set for the Brockett integrator. Prikl. Mat. Mekh. 68(5), 707–724 (2004) (in Russian); translation in J. Appl. Math. Mech. 68(5), 631–646 (2004)
13. Veliov, V.M.: Best Approximations of Control/Uncertain Differential Systems by Means of Discrete-Time Systems. WP–91–45, International Institute for Applied Systems Analysis, Laxenburg, Austria (1991)
A Parametric Multi-start Algorithm for Solving the Response Time Variability Problem

Albert Corominas, Alberto García-Villoria, and Rafael Pastor

Institute of Industrial and Control Engineering (IOC), Universitat Politècnica de Catalunya (UPC), Spain
Abstract. The multi-start metaheuristic has been applied directly or hybridized with other metaheuristics to solve a wide range of optimisation problems. Moreover, this metaheuristic is very easy to adapt and implement for a wide variety of problems. In this study, we propose a parametric multi-start algorithm that keeps its original simplicity. To test the proposed algorithm, we solve the Response Time Variability Problem (RTVP). The RTVP is an NP-hard sequencing combinatorial optimisation problem that has recently been defined in the literature. This problem has a wide range of real-life applications in, for example, manufacturing, hard real-time systems, operating systems and network environments. The RTVP occurs whenever products, clients or jobs need to be sequenced so as to minimise variability in the time between the instants at which they receive the necessary resources. The computational experiment shows the robustness of the proposed multi-start technique.
1
Introduction
The Response Time Variability Problem (RTVP) is a scheduling problem that has recently been defined in the literature [4]. The RTVP occurs whenever products, clients or jobs need to be sequenced so as to minimise variability in the time between the instants at which they receive the necessary resources. The RTVP has a broad range of real-life applications. For example, it can be used to regularly sequence models in the automobile industry [13], for resource allocation in multi-threaded computer systems and network servers [14], to broadcast video and sound data frames of applications over asynchronous transfer mode networks [5], in the periodic machine maintenance problem when the distances between consecutive services of the same machine are equal [1], and in the collection of waste [7]. One of the first problems in which the importance of regular sequencing appeared is the sequencing of the mixed-model assembly production lines at Toyota Motor Corporation under the just-in-time (JIT) production system. One of the most important JIT objectives is to get rid of all kinds of waste and inefficiency and, according to Toyota, the main waste is due to stocks. To reduce the stock, JIT production systems require producing only the necessary models in the necessary quantities at the necessary time. To achieve this, one main goal, as Monden [13] says, is scheduling the units to be produced to
keep constant consumption rates of the components involved in the production process. Miltenburg [12] deals with this scheduling problem and assumes that models require approximately the same number and mix of parts. Thus, only the demand rates for the models are considered. In our experience with practitioners of manufacturing industries, we noticed that they usually refer to a good mixed-model sequence in terms of having distances between the units of the same model that are as regular as possible. Therefore, the metric used in the RTVP reflects the way in which practitioners refer to a desirable regular sequence. The RTVP is an NP-hard combinatorial optimisation problem [4]. Thus, the use of heuristic or metaheuristic methods for solving real-life instances of the RTVP is justified. Two multi-start algorithms to solve the RTVP were proposed in [6,3]. The general scheme of the multi-start metaheuristic consists of two phases. In the first phase an initial solution is generated. Then, the second phase improves the obtained initial solution. These two phases are applied iteratively until a stop condition is reached. The multi-start algorithm proposed in [6] to solve the RTVP consists of generating a random solution in the first phase and then applying a local search to the random solution in the second phase; the stop condition consists of reaching a given execution time. If the quality of the initial solution is low, the execution time required by the local search to find its local optimum increases. An easy and fast way to obtain better initial solutions could be to generate, at each iteration, for example, 5 solutions and to apply the local search only to the best of them. Given an instance, if the execution time allows running 10 iterations, then 50 solutions are generated and 10 solutions are optimised in total. This idea is applied in the multi-start algorithm proposed in [3] to obtain the initial solutions to be optimised. However, a better performance might be obtained if all 50 solutions were generated first and then the 10 best of them optimised; or, for example, by generating, at each iteration, 10 solutions and optimising the 2 best of them, etc. In this paper we propose a multi-start algorithm to solve the RTVP that has two new parameters: the number of initial solutions generated at each iteration and the number of the best generated solutions that are optimised at each iteration. The remainder of this paper is organized as follows. Section 2 presents a formal definition of the RTVP; Section 3 proposes a parametric multi-start algorithm for solving the RTVP; Section 4 provides the results of a computational experiment; finally, some conclusions and suggestions for future research are given in Section 5.
2
The Response Time Variability Problem (RTVP)
The aim of the Response Time Variability Problem (RTVP) is to minimise the variability in the distances between any two consecutive units of the same model. The RTVP is formulated as follows. Let n be the number of models, di the number of units of model i to be scheduled (i = 1, . . . , n) and D the total number of units (D = Σ_{i=1}^{n} di). Let s be a solution of an instance of the RTVP that consists of a circular sequence of units (s = s1 s2 . . . sD), where sj is the
unit sequenced in position j of sequence s. For each model i with di ≥ 2, let tik be the distance between the positions in which units k + 1 and k of model i are found (i.e., the number of positions between them, where the distance between two consecutive positions is considered equal to 1). Since the sequence is circular, position 1 comes immediately after position D; therefore, ti,di is the distance between the first unit of model i in a cycle and the last unit of the same model in the preceding cycle. Let ti be the average distance between two consecutive units of model i (ti = D/di). For each model i with di = 1, ti1 is equal to ti. The objective is to minimise the metric Response Time Variability (RTV), which is defined by the following expression:

RTV = Σ_{i=1}^{n} Σ_{k=1}^{di} (tik − ti)^2        (1)
For example, let n = 3, dA = 2, dB = 2 and dC = 4; thus, D = 8, tA = 4, tB = 4 and tC = 2. Any sequence that contains the symbol i exactly di times (∀i) is a feasible solution. For example, the sequence (C, A, C, B, C, B, A, C) is a solution, where RTV = [(5 − 4)^2 + (3 − 4)^2] + [(2 − 4)^2 + (6 − 4)^2] + [(2 − 2)^2 + (2 − 2)^2 + (3 − 2)^2 + (1 − 2)^2] = 2.00 + 8.00 + 2.00 = 12.00.
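The RTV metric of a circular sequence can be computed directly from definition (1); the following sketch (our own illustration, not taken from the paper) reproduces the value 12.00 for the example above.

def rtv(sequence):
    # Response Time Variability of a circular sequence given as a list of model labels.
    D = len(sequence)
    positions = {}
    for pos, model in enumerate(sequence, start=1):
        positions.setdefault(model, []).append(pos)
    total = 0.0
    for pos in positions.values():
        d = len(pos)
        if d < 2:
            continue  # a model with a single unit contributes 0 (t_i1 equals the average distance)
        t_avg = D / d
        dists = [pos[k + 1] - pos[k] for k in range(d - 1)]
        dists.append(D - pos[-1] + pos[0])  # circular distance closing the cycle
        total += sum((t - t_avg) ** 2 for t in dists)
    return total

print(rtv(["C", "A", "C", "B", "C", "B", "A", "C"]))  # prints 12.0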
3
The Multi-start Algorithm for Solving the RTVP
The multi-start metaheuristic is a general scheme that consists of two phases. The first phase obtains an initial solution and the second phase improves the obtained initial solution. These two phases are applied iteratively until a stop condition is reached. This scheme was first used at the beginning of the 1980s [2,9]. The generation of the initial solutions, the way they are improved and the stop condition can be very simple or very sophisticated. The combination of these elements gives a wide variety of multi-start methods. For a good review of multi-start methods, see [11,8]. The multi-start algorithm proposed in [6] for solving the RTVP is based on generating, at each iteration, a random solution and on improving it by means of a local search procedure. The algorithm stops after it has run for a preset time. Random solutions are generated as follows. For each position, a model to be sequenced is randomly chosen. The probability of each model is equal to the number of units of this model that remain to be sequenced divided by the total number of units that remain to be sequenced. The local search procedure is applied as follows. A local search is performed iteratively in a neighbourhood that is generated by interchanging each pair of consecutive units of the sequence that represents the current solution; the best solution in the neighbourhood is chosen; the optimisation ends when no neighbouring solution is better than the current solution. To improve the obtained results, an improved multi-start algorithm was proposed in [3]. The difference with respect to the aforementioned multi-start algorithm is that, at each iteration, several random solutions are generated and only the best of them is improved by means of the local search procedure.
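The two building blocks just described (the biased random generation of a sequence and the steepest-descent local search over consecutive swaps) can be sketched as follows; this is an illustrative implementation with function names of our own choosing, and the objective is passed in as a parameter (e.g. the RTV metric).

import random

def random_solution(demands):
    # demands maps each model to its number of units; every position receives a
    # model with probability proportional to the units of that model still unsequenced.
    remaining = dict(demands)
    seq = []
    for _ in range(sum(demands.values())):
        models = [m for m in remaining if remaining[m] > 0]
        weights = [remaining[m] for m in models]
        m = random.choices(models, weights=weights)[0]
        remaining[m] -= 1
        seq.append(m)
    return seq

def local_search(seq, objective):
    # Neighbourhood: swap every pair of consecutive units; the best neighbour is
    # taken; the search stops when no neighbour improves the current solution.
    current, current_val = list(seq), objective(seq)
    while True:
        best_neigh, best_val = None, current_val
        for i in range(len(current) - 1):
            cand = list(current)
            cand[i], cand[i + 1] = cand[i + 1], cand[i]
            val = objective(cand)
            if val < best_val:
                best_neigh, best_val = cand, val
        if best_neigh is None:
            return current, current_val
        current, current_val = best_neigh, best_val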
1. Set the values of parameters P and N
2. Let the best solution found X initially be void
3. Let the RTV value of the best solution found be Z = ∞
4. While the execution time is not reached do:
5.   Generate P random solutions
6.   Let Xi (i = 1, . . . , P) be the ith best solution generated at step 5
7.   For each j = 1, . . . , N do:
8.     Apply the local optimisation to Xj and get Xj_opt
9.     If RTV(Xj_opt) < Z, then X = Xj_opt and Z = RTV(Xj_opt)
10.  End For
11. End While
12. Return X

Fig. 1. Pseudocode of the multi-start algorithm
The multi-start technique is also used in [10]. In [10] a multi-start algorithm is compared with seven other metaheuristics (Boltzmann Machine, Evolution Strategy, Genetic Algorithm, Sampling and Clustering, Simulated Annealing, Tabu Search and Immune Networks) when solving the Quadratic Assignment Problem (QAP), which is also a hard combinatorial optimisation problem. Their computational experiment shows the effectiveness of the multi-start approach versus other more complex approaches. We propose a multi-start algorithm that is an interesting and new extension of those proposed in [6,3]. The local search procedure is the same as the one used in [6,3]. The difference of our proposed algorithm is that, at each iteration, instead of generating one or more random solutions and optimising the best one, our algorithm generates P random solutions and optimises the N best of them. P and N are the two parameters of the algorithm, where N ≤ P. The number of iterations is limited by the available execution time, as in [6,3]. To the best of our knowledge, this approach has not yet been tested. Note that the algorithms proposed in [6,3] can be thought of as special cases of our more general parametric algorithm in which P = N = 1 and P ≥ 1 ∧ N = 1, respectively. Figure 1 shows the pseudocode of our algorithm. As mentioned, when the execution time of the algorithm is reached, the algorithm is immediately stopped (that is, the current local optimisation is also stopped).
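A minimal runnable sketch of the parametric multi-start scheme of Fig. 1 is given below; it assumes user-supplied generate, improve and objective functions and, unlike the actual implementation, does not interrupt a local optimisation that is already running when the time limit is reached.

import time

def parametric_multistart(P, N, generate, improve, objective, time_limit):
    # Parametric multi-start of Fig. 1: at each iteration P random solutions are
    # generated and only the N best of them are locally optimised.
    best, best_val = None, float("inf")
    deadline = time.time() + time_limit
    while time.time() < deadline:
        candidates = sorted((generate() for _ in range(P)), key=objective)
        for cand in candidates[:N]:
            improved = improve(cand)
            val = objective(improved)
            if val < best_val:
                best, best_val = improved, val
    return best, best_val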
4
Computational Experiment
The RTVP instances used in the computational experiment are divided into two groups according to their size. For the first group (called G1) 185 instances were generated using a random value of D (number of units) uniformly distributed between 50 and 100, and a random value of n (number of models) uniformly distributed between 3 and 30; for the second group (called G2) 185 instances were
generated using a random value of D between 100 and 200 and a random value of n between 3 and 65. For all instances and for each model i = 1, . . . , n, a random value of di (number of units of model i) is drawn between 1 and ⌊(D − n + 1)/2.5⌋ such that Σ_{i=1}^{n} di = D. The multi-start algorithm was coded in Java and the computational experiment was carried out using a 3.4 GHz Pentium IV with 1.5 GB of RAM. For each instance, the multi-start algorithm was run once for 180 seconds with different values of P (number of random solutions generated at each iteration) and N (number of the best generated solutions optimised at each iteration). We have also used the values P = N = 1, which is equivalent to using the multi-start algorithm proposed in [6]. Tables 1 and 2 show the averages per instance of the number of iterations done by the algorithm (it), the total number of random solutions generated (sg), the total number of solutions optimised (so), and the averages per instance of the RTV values of the best solutions obtained for the G1 and G2 instances, respectively (RTV).
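The following sketch generates instances with the above characteristics; the way the remaining units are distributed is our own simple choice and is not necessarily the exact randomization used by the authors.

import random

def generate_demands(D, n):
    # Gives every model one unit and distributes the remaining D - n units at
    # random, never letting a model exceed floor((D - n + 1) / 2.5).
    upper = int((D - n + 1) / 2.5)
    d = [1] * n
    for _ in range(D - n):
        i = random.choice([k for k in range(n) if d[k] < upper])
        d[i] += 1
    return d

def generate_instance(units_range, models_range):
    D = random.randint(*units_range)
    n = random.randint(*models_range)
    return D, n, generate_demands(D, n)

print(generate_instance((50, 100), (3, 30)))  # a G1-style instance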
Table 1. Average results of the G1 instances for 180 seconds

P    N    N/P     it        sg       so        RTV
1    1    1       1,483.19  1,484    1,483.19  38.90
10   1    0.1     1,746.24  17,470   1,746.24  37.48
50   1    0.02    1,757.40  87,900   1,757.40  37.06
100  1    0.01    1,655.48  165,600  1,655.48  37.37
200  1    0.005   1,455.79  291,200  1,455.79  37.27
400  1    0.0025  1,164.63  466,000  1,164.63  37.08
10   2    0.2     853.70    8,540    1,707.41  37.45
50   10   0.2     172.75    8,650    1,727.52  37.58
100  20   0.2     86.35     8,700    1,727.07  37.62
200  40   0.2     43.15     8,800    1,725.83  37.60
400  80   0.2     21.53     8,800    1,722.69  37.76
10   5    0.5     322.67    3,230    1,613.33  38.02
50   25   0.5     64.77     3,250    1,619.31  38.17
100  50   0.5     32.39     3,300    1,619.51  37.92
200  100  0.5     16.21     3,400    1,620.56  38.43
400  200  0.5     8.10      3,600    1,620.73  38.38
Tables 1 and 2 show that the differences between the average RTV values obtained for different P and N values do not seem significant enough. Thus, the parametric multi-start algorithm seems to be quite insensitive to the values of the parameters P and N. Anyway, note that when P = N = 1 or N/P = 0.5 worse solutions are obtained than when N = 1 or N/P = 0.2. One reason is that fewer solutions are optimised, because the initial solutions are likely to be worse and more time is needed in the local optimisations. Fewer solutions are also optimised when a considerable execution time is spent on generating random solutions (P = 400 and N = 1).
Table 2. Average results of the G2 instances for 180 seconds

P    N    N/P     it      sg      so      RTV
1    1    1       178.82  179     178.82  163.53
10   1    0.1     202.33  2,030   202.33  161.72
50   1    0.02    212.00  10,650  212.00  161.17
100  1    0.01    212.18  21,300  212.18  157.85
200  1    0.005   207.39  41,600  207.39  158.99
400  1    0.0025  195.45  78,400  195.45  159.65
10   2    0.2     99.31   1,000   198.62  161.93
50   10   0.2     20.04   1,050   200.42  160.54
100  20   0.2     10.05   1,100   200.90  160.41
200  40   0.2     5.03    1,200   201.08  161.51
400  80   0.2     2.52    1,200   201.30  161.72
10   5    0.5     37.97   380     189.87  162.87
50   25   0.5     7.64    400     190.99  161.99
100  50   0.5     3.83    400     191.59  161.37
200  100  0.5     1.92    400     192.22  164.20
400  200  0.5     1.00    400     193.59  162.28
Table 3. Averages of the number of times that the best solution is obtained from the ith best initial solution
      P   N    1st   2nd   3rd   4th   5th   6th   7th   8th   9th   10th
G1    10  2    1.40  1.28  *     *     *     *     *     *     *     *
G1    50  10   0.21  0.21  0.26  0.23  0.22  0.22  0.23  0.22  0.22  0.17
G1    10  5    0.43  0.39  0.51  0.57  0.4   *     *     *     *     *
G2    10  2    0.56  0.56  *     *     *     *     *     *     *     *
G2    50  10   0.11  0.09  0.13  0.11  0.11  0.14  0.08  0.12  0.11  0.10
G2    10  5    0.23  0.30  0.17  0.18  0.23  *     *     *     *     *
Therefore, it is recommended to use a quite high value of P (100 ≤ P ≤ 200) and a value of N equal to 1 instead of using the multi-start algorithm proposed in [6] (i.e., P = N = 1). We have observed how many times the best initial solution of the N solutions to be optimised gives the best local optimum found by the algorithm during its run. This observation has also been made for the remaining N solutions to be optimised (i.e., the second, the third, . . . , the Nth best initial solution). The results show that all initial solutions have a similar probability of being optimised into the best solution found by the algorithm. Therefore, the quality of an initial solution is not a sign of the quality of the local optimum. Although the quality of a local optimum is independent of the quality of the initial solution, notice that it is still advisable to obtain good initial solutions because the time needed for the
Fig. 2. Average of the RTV values obtained during the execution time: (a) for G1 instances; (b) for G2 instances
local optimisation is lower, as explained before. Table 3 shows some examples of the number of times, on average, that the best solution found by the algorithm has been obtained from the ith best initial solution. Finally, Fig. 2 shows how the averages of the RTV values of the best obtained solutions for the G1 and the G2 instances decrease over the execution time for the multi-start algorithm (with P = 100 and N = 1).
5
Final Conclusions
The RTVP occurs in diverse environments such as manufacturing, hard real-time systems, operating systems and network environments. The RTVP occurs whenever products, clients or jobs need to be sequenced so as to minimise variability in the time between the instants at which they receive the necessary resources. Since it is an NP-hard combinatorial optimisation problem, heuristic methods are needed for solving real-life instances. We propose a parametric multi-start algorithm for solving an NP-hard sequencing combinatorial optimisation problem such as the RTVP. Better solutions, on average, are obtained compared with the solutions of the multi-start algorithm proposed in [6]. The computational experiment also shows that the proposed algorithm does not seem to be very sensitive to the parameters. Thus, the multi-start technique is robust. Moreover, the multi-start algorithm is very easy to design and to implement. It is also shown that the best local optimum can be obtained from an initial solution independently of its fitness although, as expected, the worse the initial solution is, the more time is spent on the local search. Our future research will focus on testing our proposed parametric multi-start algorithm on other combinatorial optimisation problems and using other local search methods.
Acknowledgment Supported by the Spanish Ministry of Education and Science under project DPI2007-61905; co-funded by the ERDF.
References
1. Anily, S., Glass, C.A., Hassin, R.: The scheduling of maintenance service. Discrete Applied Mathematics 82, 27–42 (1998)
2. Boender, C.G.E., Rinnooy Kan, A.H.G., Stougie, L., Timmer, G.T.: A Stochastic Method for Global Optimization. Mathematical Programming 22, 125–140 (1982)
3. Corominas, A., García-Villoria, A., Pastor, R.: Solving the Response Time Variability Problem by means of Multi-start and GRASP metaheuristics. Frontiers in Artificial Intelligence and Applications 148, 128–137 (2008)
4. Corominas, A., Kubiak, W., Moreno, N.: Response Time Variability. Journal of Scheduling 10, 97–110 (2007)
5. Dong, L., Melhem, R., Mossé, D.: Time slot allocation for real-time messages with negotiable distance constraint requirements. In: Real-time Technology and Application Symposium (RTAS), Denver (1998)
6. García, A., Pastor, R., Corominas, A.: Solving the Response Time Variability Problem by means of metaheuristics. Artificial Intelligence Research and Development 146, 187–194 (2006)
7. Herrmann, J.W.: Fair Sequences using Aggregation and Stride Scheduling. Technical Report, University of Maryland, USA (2007)
8. Hoos, H., Stützle, T.: Stochastic Local Search: Foundations and Applications. Morgan Kaufmann Publishers, San Francisco (2005)
9. Los, M., Lardinois, C.: Combinatorial Programming, Statistical Optimization and the Optimal Transportation Network Problem. Transportation Research 2, 89–124 (1982)
10. Maniezzo, V., Dorigo, M., Colorni, A.: Algodesk: An experimental comparison of eight evolutionary heuristics applied to the Quadratic Assignment Problem. European Journal of Operational Research 81, 188–204 (1995)
11. Martí, R.: Multi-start methods. In: Glover, Kochenberger (eds.) Handbook of Metaheuristics, pp. 355–368. Kluwer Academic Publishers, Dordrecht (2003)
12. Miltenburg, J.: Level schedules for mixed-model assembly lines in just-in-time production systems. Management Science 35, 192–207 (1989)
13. Monden, Y.: Toyota Production System. Industrial Engineering and Management Press, Norcross (1983)
14. Waldspurger, C.A., Weihl, W.E.: Stride Scheduling: Deterministic Proportional-Share Resource Management. Technical Report MIT/LCS/TM-528, Massachusetts Institute of Technology, MIT Laboratory for Computer Science (1995)
Enhancing the Scalability of Metaheuristics by Cooperative Coevolution

Ciprian Crăciun, Monica Nicoară, and Daniela Zaharie

Department of Computer Science, West University of Timişoara, Romania
Abstract. The aim of this paper is to analyze the ability of cooperative coevolution to improve the scalability of population based metaheuristics. An extensive set of experiments on high dimensional optimization problems has been conducted in order to study the particularities and effectiveness of some elements involved in the design of cooperative coevolutionary algorithms: groupings of variables into components, choice of the context based on which the components are evaluated, length of evolution for each component. Scalability improvements have been obtained in the case of both analyzed metaheuristics: differential evolution and harmony search.
1
Introduction
Optimization of high dimensional functions is challenging and results obtained by population-based metaheuristics (e.g. evolutionary algorithms, particle swarm methods, differential evolution etc.) for small or medium size problems are not necessarily valid for high dimensional ones. Cooperative coevolution is a framework developed in the context of evolutionary algorithms in order to enhance their ability to solve complex problems. In the case of large scale problems the idea of cooperative coevolution is to decompose the problem into constituent subproblems and solve them by using cooperative evolutionary processes (e.g. multiple subpopulations, each one corresponding to a subproblem). Designing a cooperative coevolution variant of a metaheuristic raises several issues, the most important ones being that of identifying the components of the problem and that of designing a cooperation strategy (e.g. choosing the context in which to evaluate the components). Since the first cooperative coevolutionary genetic algorithm was proposed in [5], several cooperative approaches for function optimization have been developed [1,7]. In some recent works [10,11] a strategy based on a random grouping of variables, applied to a synchronous differential evolution algorithm, is proposed. The aim of this paper is to extend the studies in [10] by analyzing both synchronous and asynchronous coevolutionary strategies and the impact produced by the length of evolution of each component. The influence of cooperative coevolution on scalability has been tested for both Differential Evolution and Harmony Search, a recent population-based metaheuristic which has not been tested up to now on high dimensional problems.
The main elements of the cooperative coevolutionary approach are presented in Section 2. Section 3 describes the particularities of Differential Evolution and Harmony Search while Section 4 contains the numerical results. Some conclusions are presented in Section 5.
2
Cooperative Coevolution for Large Scale Optimization
Let us consider the problem of minimizing a function f : D = [a1, b1] × . . . × [an, bn] → R. Any evolutionary approach for solving such a problem is based on evolving a population of potential solutions, X = {x1, . . . , xm} ⊂ D, for a given number of generations, G. As n becomes larger, the number of generations needed to approximate the optimum with a given accuracy also becomes larger. If the required G does not depend linearly on n then we face the scalability problem. A possible solution to the scalability issue is to decompose the problem into subproblems by dividing the n-dimensional vector corresponding to a potential solution into smaller size components and by co-evolving these components in a cooperative way [5]. There are two main questions which should be answered when designing a cooperative coevolutionary algorithm: (i) how to divide the population elements into smaller size components? (ii) how to coevolve these components? For each of these questions there are several possible answers which are shortly reviewed in the following. Defining the components. Dividing an n-dimensional vector into K ≤ n components is equivalent to defining a function c : {1, . . . , n} → {1, . . . , K} which assigns to each variable i its corresponding component, c(i). Thus for a vector x = (x1, x2, . . . , xn) the k-th component can be defined as Ck(x) = {xj | c(j) = k}, i.e. the set of all variables xj having their index, j, in the pre-image of k through the function c. The original vector x can be obtained by merging all its components, i.e. x = (C1(x), . . . , CK(x)). If X = {x1, x2, . . . , xm} denotes a population then Ck(X) = {Ck(x1), . . . , Ck(xm)} will denote the (sub)population corresponding to the k-th component. In the traditional cooperative coevolutionary algorithms [5] the number of components is equal to the number of variables (K = n) and the function c satisfies c(i) = i. Such an approach is rather inappropriate in the case of interrelated variables, when an appropriate decomposition should group the highly correlated variables in the same component. However, in practice, it is difficult to know which variables are significantly correlated, therefore a compromise solution is to use random groupings as proposed in [10]. The groupings used in [10] are constructed based on random permutations and are characterized by equally sized groups. The size of each group depends on the number of groups, which should be preset. In order to avoid the choice of the number of groups the same authors recently proposed in [11] an adaptive variant which selects between several possible values for the group sizes (e.g. {5, 10, 25, 50, 100}) according to some selection probabilities (computed by using historical data on the performance of different group sizes). A simpler approach is based on randomly choosing the number of components K ∈ {Kmin, . . . , Kmax} and on randomly assigning the variables to components. Thus the components do not
Algorithm 1. A generic synchronous cooperative coevolutionary algorithm

CCEvolveS(X)
1: Initialize and evaluate X
2: while goal for X not reached do
3:   Choose K
4:   Construct C1(X), . . . , CK(X)
5:   for k = 1, K do
6:     Ck(V) ← evolveS(Ck(X), X)
7:   end for
8:   Evaluate V
9:   X ← select(X, V)
10: end while
return X

evolveS(Ck(X), X)
1: Ck(W) ← Ck(X)
2: while goal for Ck(X) not reached do
3:   Ck(Y) ← generate(Ck(W))
4:   Y ← merge(Ck(Y), X)
5:   evaluate Y
6:   Ck(Y) ← select(Ck(Y), Ck(W))
7:   Ck(W) ← Ck(Y)
8: end while
return Ck(Y)

Algorithm 2. A generic asynchronous cooperative coevolutionary algorithm

CCEvolveA(X)
1: Initialize and evaluate X
2: while goal for X not reached do
3:   Choose K
4:   Construct C1(X), . . . , CK(X)
5:   for k = 1, K do
6:     while goal for Ck(X) not reached do
7:       for each Ck(x) ∈ Ck(X) do
8:         Ck(x) ← evolveA(Ck(x), Ck(X), X)
9:       end for
10:     end while
11:   end for
12: end while
return X

evolveA(Ck(x), Ck(X), X)
1: Ck(y) ← generate(Ck(X))
2: y ← merge(Ck(y), X)
3: evaluate y
4: Ck(y) ← select(Ck(x), Ck(y))
return Ck(y)
necessarily have exactly the same size, but only their average size is the same. The numerical tests we conducted illustrated the fact that the behavior of this simple variant is similar to that of the adaptive variant proposed in [11]. Coevolving the components. Two variants of coevolving the components, a synchronous and an asynchronous one, are presented in Algorithms 1 and 2, respectively. The main difference between these two variants is related to the moment when the context used in evaluating the components is updated. In the synchronous variant the global population X (which represents the evaluation context) is updated only after all components have been evolved while in the asynchronous case the global population is updated after evolving each component. Since the metaheuristic itself can be synchronous or asynchronous we combined these strategies in order to obtain a fully synchronous (Algorithm 1) and a fully asynchronous (Algorithm 2) variant. Numerical experiments have been conducted for both variants and some comparative results are presented in Section 4.
The function evolveS(Ck(X), X) returns the population corresponding to component k obtained by applying a population-based metaheuristic to Ck(X) using the context X. The stopping condition of the while loop inside evolveS is related to the number of generations allowed for each component. On the other hand, the function evolveA(Ck(x), Ck(X), X) returns just one element corresponding to component k. The new element is generated starting from the current element and other elements selected from the population Ck(X) according to the used metaheuristic. The evaluation of the generated element is made using the current global population, X, as context. When implementing the coevolutionary process, besides the choice of a metaheuristic there are two other issues to be solved: (i) how to choose collaborators from the context when evaluating Ck(X)? (ii) how many resources (e.g. number of generations or number of function evaluations) should be allocated to the evolution of each component? The first question has been addressed in several previous works [5,9] and the typical variants to choose the collaborators are: the best element of the population, the current element in the population and a random one. Concerning the second issue, some authors use just one generation for each component [1,5] while others suggest allocating a given number of function evaluations to each component [10]. However, there is no systematic study regarding the influence of the evolution length for each component on the behavior of the coevolutionary approach. Therefore one of the aims of the numerical experiments we conducted is to analyze the influence of the evolution length per component on the behavior of cooperative coevolution.
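The two ingredients discussed above (the simple random grouping of variables and the merge-and-evaluate step that uses a collaborator from the context) can be sketched as follows; this is our own illustration and the function names are not taken from the paper.

import random

def random_grouping(n, k_min, k_max):
    # Picks the number of components K at random and assigns every variable index
    # to a random component; accidentally empty components are simply dropped.
    K = random.randint(k_min, k_max)
    labels = [random.randrange(K) for _ in range(n)]
    groups = [[j for j in range(n) if labels[j] == k] for k in range(K)]
    return [g for g in groups if g]

def evaluate_component(f, indices, values, collaborator):
    # Merges the evolved component values into a copy of the collaborator vector
    # taken from the context population and evaluates the objective f on it.
    full = list(collaborator)
    for idx, val in zip(indices, values):
        full[idx] = val
    return f(full)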
3
Cooperative Coevolutionary Differential Evolution and Harmony Search
In order to analyze the impact of cooperative coevolution on the scalability of metaheuristics we implemented cooperative coevolutionary variants of two metaheuristics: Differential Evolution (DE) [6] and Harmony Search (HS) [3]. The reason for choosing the Differential Evolution technique is the existence of some recent results on coevolutionary DE [10,11], while the reason for choosing Harmony Search is the fact that all reported results concerning HS are for small size problems. Let us shortly describe DE, HS, and their cooperative coevolutionary variants. In the case of the DE technique, for each population element, xi, a so-called trial element yi = (yi^1, . . . , yi^n) is constructed according to (1). The offspring of xi is the better of xi and yi.

yi^j = x_{r1}^j + Fi · (x_{r2}^j − x_{r3}^j)   if U < CRi
       xi^j                                    otherwise,        j = 1, n        (1)

In (1), r1, r2 and r3 are distinct random elements from {1, . . . , m} and U is a random value uniformly generated in [0, 1] for each i and j. The control parameters Fi (scale factors) and CRi (crossover rates) are adapted according to the rules used in Brest's jDE [2] (each element has its own parameters F and CR,
which are randomly initialized and, during the evolutionary process, are reset with a small probability to other random values). Thus the only parameter to be specified is the population size m. There are two main variants for designing the DE evolutionary process: a synchronous and an asynchronous one. The synchronous DE (denoted as sDE in the experiments) replaces the population elements with their offspring only after the offspring have been generated for the entire population. The asynchronous DE (denoted as aDE) corresponds to the steady-state approach in evolutionary algorithms and replaces the current element with its offspring just after the construction of the offspring. For each DE variant the coevolutionary variant is obtained by using the corresponding DE generation rule in Step 3 of evolveS and in Step 1 of evolveA, respectively. Thus one obtains two algorithms: a synchronous cooperative coevolutionary DE (denoted as sCCDE) and the asynchronous version (denoted as aCCDE). Harmony Search (HS) was developed by Z. W. Geem et al. in 2001 [3], starting from an interesting analogy between the music improvisation process and the search for the optimum. Based on this analogy, HS constructs new potential solutions by combining the elements of the current population in a stochastic way and by randomly perturbing them. The rule to construct a new element yi is given in (2). In the classical variant this new element replaces the worst element in the population if it is better. Thus HS follows an asynchronous strategy; therefore the coevolutionary HS (denoted as CCHS) should also be based on the asynchronous approach. Due to the particularities of the coevolutionary approach the new element replaces not the worst but the current one in the population (the ith element).

yi^j = x_r^j + bw · U3    if U1 < p1 and U2 < p2
       x_r^j              if U1 < p1 and U2 ≥ p2,        j = 1, n        (2)
       rand(aj, bj)       otherwise

In (2), r is a random element of {1, . . . , m}, U1 and U2 denote random values uniformly generated in (0, 1) for each j, U3 is randomly generated in (−1, 1), p1 ∈ (0, 1) denotes the probability of taking a variable from the current population (HMCR in [3]), p2 ∈ (0, 1) denotes the probability of perturbing the variable (PAR in [3]) and rand(aj, bj) is a random value in the domain corresponding to the jth variable. The values of these parameters used in our experiments are p1 = 0.99, p2 = 0.75. The parameter bw controls the size of the perturbation, and an effective way of choosing bw based on the variance of the current population has recently been proposed in [4], i.e. bw = var(X^j), where var(X^j) is the variance of the jth variable over the current population X. Despite the fact that HS has been successfully applied to various optimization problems, its behavior on large scale ones has not been studied up to now. The first tests on problems having a size larger than 50 suggested that HS does not scale well with the problem size. Therefore we designed a cooperative coevolutionary variant (CCHS) based on using the generation rule specified in (2) in Algorithm 2.
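For illustration, the two generation rules (1) and (2) can be sketched as follows; this is our own sketch, and the exclusion of the current index i from the DE base indices as well as the per-variable choice of r in HS are common conventions that the text does not spell out.

import random

def de_trial(pop, i, F, CR):
    # DE trial vector for element i following rule (1); r1, r2, r3 are distinct
    # and, by a common convention, different from i (needs at least 4 elements).
    m, n = len(pop), len(pop[0])
    r1, r2, r3 = random.sample([k for k in range(m) if k != i], 3)
    return [pop[r1][j] + F * (pop[r2][j] - pop[r3][j]) if random.random() < CR
            else pop[i][j] for j in range(n)]

def hs_improvise(pop, bounds, p1=0.99, p2=0.75):
    # Harmony Search improvisation following rule (2); bw is set to the variance
    # of the j-th variable over the current population, as described in the text.
    m, n = len(pop), len(pop[0])
    new = []
    for j in range(n):
        if random.random() < p1:
            xrj = pop[random.randrange(m)][j]
            if random.random() < p2:
                mean_j = sum(x[j] for x in pop) / m
                bw = sum((x[j] - mean_j) ** 2 for x in pop) / m
                new.append(xrj + bw * random.uniform(-1.0, 1.0))
            else:
                new.append(xrj)
        else:
            new.append(random.uniform(*bounds[j]))
    return new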
4
Numerical Results
For both classical and coevolutionary DE and HS variants we analyzed two aspects: (i) the amount of resources needed to reach a given performance, i.e. the number of function evaluations, nfe, needed to approximate the optimal value with accuracy ε; (ii) the performance reached by using a given amount of resources, i.e. the best value, f∗, obtained after a given number of function evaluations, nfe∗. In all experiments the accuracy is ε = 10^−10, the maximal number of function evaluations is nfe∗ = 5000 · n, and the reported values are obtained as averages over 30 independent runs. As benchmark we used a subset, consisting of two non-separable functions (Ackley and Griewank) and a separable one (Rastrigin), of the test suite provided in [8]. The experimental design consisted in combining, for each coevolutionary variant (sCCDE, aCCDE, and CCHS), two values for the number of components (K = 10 and K = 20) with six possible values for the length of evolution per component (expressed as number of generations: Gk ∈ {1, 5, 10, 50, 100, n/K}). In a set of preliminary tests we analyzed several variants of choosing the collaborators when evaluating a component: the current element in the population, the best one, and a randomly selected one.

Table 1. Influence of the method, the number of components (K) and length of evolution per component (Gk) on the performance (f∗) reached after 5000 · n function evaluations
Method   Ackley (K, Gk, f∗)                Griewank (K, Gk, f∗)              Rastrigin (K, Gk, f∗)

n = 100
sDE      -, -, (8.8 ± 2.2)·10^−14          -, -, (3.0 ± 0.7)·10^−14          -, -, (3.2 ± 17)·10^−2
aDE      -, -, (9.9 ± 2.0)·10^−14          -, -, (3.1 ± 8.5)·10^−14          -, -, (3.2 ± 17)·10^−2
sCCDE    2, 50, (8.8 ± 1.3)·10^−14         2, 50, (2.9 ± 0.5)·10^−14         2, 50, 0 ± 0
aCCDE    2, 50, (8.5 ± 1)·10^−14           2, 50, 2.8·10^−14 ± 0             2, 50, 0 ± 0
HS       -, -, (8.7 ± 1.1)·10^−5           -, -, (3.3 ± 2.7)·10^−8           -, -, 150 ± 4.5
CCHS     10, 1, 2.7·10^−13 ± 0             10, 1, (5.9 ± 0.8)·10^−14         10, 1, (2.0 ± 2.7)·10^−14

n = 500
sDE      -, -, (1.0 ± 3)·10^−11            -, -, (2.4 ± 1.3)·10^−3           -, -, 1.59 ± 1.50
aDE      -, -, (4.3 ± 6.5)·10^−12          -, -, (2.3 ± 0.1)·10^−2           -, -, 1.8 ± 1.3
sCCDE    5, 100, (4.6 ± 0.2)·10^−13        5, 100, (1.6 ± 0.08)·10^−13       10, 50, (1.3 ± 2.4)·10^−14
aCCDE    5, 100, (4.8 ± 0.3)·10^−13        5, 100, (1.7 ± 0.1)·10^−13        10, 50, (9.4 ± 21)·10^−15
HS       -, -, 16 ± 0.07                   -, -, (1.9 ± 0.04)·10^3           -, -, (1.6 ± 0.01)·10^3
CCHS     20, 1, (9.1 ± 0.1)·10^−13         20, 1, (3.8 ± 1.8)·10^−13         20, 5, (5.1 ± 1.7)·10^−14

n = 1000
sDE      -, -, (4.0 ± 4.6)·10^−1           -, -, (4.1 ± 4.4)·10^−1           -, -, 34.5 ± 14.7
aDE      -, -, (9.3 ± 3.7)·10^−1           -, -, (3.8 ± 5.5)·10^−1           -, -, 47.6 ± 13.6
sCCDE    10, 100, (1.0 ± 0.04)·10^−12      10, 100, (3.6 ± 0.1)·10^−13       20, 5, (4.5 ± 2.2)·10^−14
aCCDE    10, 100, (1.0 ± 0.05)·10^−12      10, 100, (3.7 ± 0.1)·10^−13       20, 50, (4.5 ± 2.2)·10^−14
HS       -, -, 19 ± 0.03                   -, -, (8.6 ± 0.09)·10^3           -, -, (4.6 ± 0.04)·10^3
CCHS     20, 1, (2.5 ± 0.04)·10^−12        20, 1, (5.8 ± 0.1)·10^−13         20, 1, (2.0 ± 0.6)·10^−13
Table 2. Influence of the method on the success ratio (SR), scalability factor (SF = nfe(n)/nfe(100)) and efficiency expressed as the number of function evaluations needed to reach the accuracy ε = 10^−10 (nfe)

Method   Ackley (SR, SF, nfe)              Griewank (SR, SF, nfe)            Rastrigin (SR, SF, nfe)

n = 100
sDE      1, 1, 238766 ± 3399               1, 1, 141766 ± 3726               0.96, 1, 153548 ± 4533
aDE      1, 1, 221766 ± 4533               1, 1, 132433 ± 4229               0.96, 1, 144582 ± 4973
sCCDE    1, 1, 322963 ± 1813               1, 1, 193683 ± 6432               1, 1, 200080 ± 4040
aCCDE    1, 1, 297433 ± 4422               1, 1, 179100 ± 7000               1, 1, 186100 ± 4898
HS       0, -, -                           0, -, -                           0, -, -
CCHS     1, 1, 306433 ± 4818               1, 1, 182433 ± 6155               1, 1, 218433 ± 4533

n = 500
sDE      0.96, 5.9, 1427786 ± 6E5          0.96, 6.4, 910544 ± 4E4           0.36, 6.1, 9456540 ± 6E4
aDE      1, 6.0, 1350200 ± 4E4             0.9, 6.7, 896496 ± 4E4            0.20, 6.5, 9502000 ± 5E4
sCCDE    1, 3.8, 1257610 ± 1E4             1, 4.1, 798360 ± 1E4              1, 5.2, 1050530 ± 8E3
aCCDE    1, 4.0, 1200100 ± 0               1, 4.1, 751766 ± 2E4              1, 5.3, 986766 ± 2E4
HS       0, -, -                           0, -, -                           0, -, -
CCHS     1, 5.2, 1600100 ± 0               1, 5.4, 1001767 ± 8975            1, 7.1, 1551767 ± 1E4

n = 1000
sDE      0.1, 18, 4400200 ± 3E5            0.16, 18, 2560200 ± 2E5           0, -, -
aDE      0.06, 17, 3900200 ± 3E5           0.46, 18, 2400200 ± 2E5           0, -, -
sCCDE    1, 7.9, 2576007 ± 4E4             1, 8.3, 1615047 ± 3E4             1, 10, 2016733 ± 1E4
aCCDE    1, 8.0, 2400100 ± 0               1, 8.4, 1513433 ± 3E4             1, 10, 2000100 ± 0
HS       0, -, -                           0, -, -                           0, -, -
CCHS     1, 14, 4300100 ± 0                1, 15, 2800100 ± 0                1, 20, 4396767 ± 6E4
Since the results of these tests suggested that the variant using the current element as collaborator behaves the best, this is the variant we used in all the experiments presented here. The population size was set to m = 100 for all coevolutionary variants. For the non-coevolutionary variants (sDE, aDE, and HS) m was set to 100 when n < 500, while for n ≥ 500 it was set to m = 200. Table 1 contains, for each method, the results corresponding to the pair (K, Gk) which led to the smallest averaged f∗. The results illustrate that while for DE a small number of components and a large evolution length are more beneficial, in the case of HS the situation is reversed (a larger number of components and a short evolution per component). Table 2 shows that all coevolutionary variants have a good reliability (the success ratio is 1) and the scalability factor, which should be close to n/100, is significantly improved, especially in the case of DE. Concerning the comparison between the synchronous and asynchronous variants, by applying a t-test with a significance level of 0.05 one obtains that the number of function evaluations needed to reach a given accuracy (nfe) is smaller in the case of aCCDE than in the case of sCCDE. On the other hand, with respect to the f∗ values, the synchronous and asynchronous variants behave almost similarly (only in three cases did sCCDE produce results significantly better than aCCDE).
5
Conclusions
Several cooperative coevolutionary variants obtained by combining different ways of updating the contextual population, different numbers of components, different lengths of the evolution/component have been numerically analyzed. The obtained results illustrate the fact that the asynchronous variant behaves slightly better than the synchronous one with respect to the number of function evaluations and that the adequate length of the evolution/component depends on the method (it should be small in the case of HS and can be larger in the case of DE). Concerning the scalability of HS one can conclude that it can be significantly improved by coevolution. Acknowledgment. This work is supported by Romanian project PNCD II 11028/ 14.09.2007 (NatComp).
References
1. van den Bergh, F., Engelbrecht, A.P.: A Cooperative Approach to Particle Swarm Optimization. IEEE Transactions on Evolutionary Computation 8(3), 225–239 (2004)
2. Brest, J., Bošković, B., Greiner, S., Žumer, V., Maučec, M.S.: Performance comparison of self-adaptive and adaptive differential evolution algorithms. Soft Computing 11(7), 617–629 (2007)
3. Geem, Z.W., Kim, J., Loganathan, G.: A New Heuristic Optimization Algorithm: Harmony Search. Simulation 76(2), 60–68 (2001)
4. Mukhopadhyay, A., Roy, A., Das, S., Das, S., Abraham, A.: Population variance and explorative power of Harmony Search: An analysis. In: Proc. ICDIM 2008, pp. 775–781 (2008)
5. Potter, M., De Jong, K.: A cooperative coevolutionary approach to function optimization. In: Davidor, Y., Männer, R., Schwefel, H.-P. (eds.) PPSN 1994. LNCS, vol. 866, pp. 249–257. Springer, Heidelberg (1994)
6. Price, K.V., Storn, R., Lampinen, J.: Differential Evolution. A Practical Approach to Global Optimization. Springer, Heidelberg (2005)
7. Shi, Y., Teng, H., Li, Z.: Cooperative Co-evolutionary Differential Evolution for Function Optimization. In: Wang, L., Chen, K., Ong, Y.S. (eds.) ICNC 2005. LNCS, vol. 3611, pp. 1080–1088. Springer, Heidelberg (2005)
8. Tang, K., Yao, X., Suganthan, P.N., MacNish, C., Chen, Y.P., Chen, C.M., Yang, Z.: Benchmark Functions for the CEC 2008 Special Session and Competition on Large Scale Global Optimization. Technical Report, USTC, China (2007), http://nical.ustc.edu.cn/cec08ss.php
9. Wiegand, R.P., Liles, W.C., De Jong, K.A.: An Empirical Analysis of Collaboration Methods in Cooperative Coevolutionary Algorithms. In: Proc. of the Genetic and Evolutionary Computation Conference, pp. 1235–1242. Morgan Kaufmann Publ., San Francisco (2001)
10. Yang, Z., Tang, K., Yao, X.: Large scale evolutionary optimization using cooperative coevolution. Information Sciences 178, 2985–2999 (2008)
11. Yang, Z., Tang, K., Yao, X.: Multilevel Cooperative Coevolution for Large Scale Optimization. In: Proc. of the 2008 IEEE Congress on Evolutionary Computation, pp. 1663–1670. IEEE Press, Los Alamitos (2008)
Hybrid ACO Algorithm for the GPS Surveying Problem

Stefka Fidanova (1), Enrique Alba (2), and Guillermo Molina (2)

(1) IPP – Bulgarian Academy of Sciences, Acad. G. Bonchev, bl. 25A, 1113 Sofia, Bulgaria
(2) Universidad de Málaga, E.T.S.I. Informática, Grupo GISUM (NEO), Málaga, España
{eat,guillermo}@lcc.uma.es
Abstract. Ant Colony Optimization (ACO) has been used successfully to solve hard combinatorial optimization problems. This metaheuristic method is inspired by the foraging behavior of ants, which manage to establish the shortest routes from their nest to feeding sources and back. In this paper, we propose a hybrid ACO approach to solve the Global Positioning System (GPS) surveying problem. In designing a GPS surveying network, a given set of earth points must be observed consecutively (schedule). The cost of the schedule is the sum of the times needed to go from one point to another. The problem is to search for the best order in which these observations are executed. Minimizing the cost of this schedule is the goal of this work. Our results outperform those achieved by the best-so-far algorithms in the literature, and represent a new state of the art for this problem.
1
Introduction
Satellite navigation systems have an impact on geoscience, and in particular on surveying work, by quickly and effectively determining positions and changes of positions in networks. The most widely known space systems are: the American NAVSTAR global positioning system (also known as GPS), the Russian GLObal NAvigation Satellite System (GLONASS), currently only partially functional, and the forthcoming European satellite navigation system (GALILEO), expected for 2010. GPS satellites continuously transmit radio signals to the Earth while orbiting it. A receiver, with unknown position on Earth, has to detect and convert the signals received from all of the satellites into useful measurements. These measurements allow a user to compute a three-dimensional coordinate position: the location of the receiver [7]. Solving this problem to optimality requires a very high computational time. Therefore, metaheuristic methods are used to provide near-optimal solutions for large networks within an acceptable amount of computational effort [4,6,8]. In this paper, we implement a hybrid Ant Colony Optimization algorithm. We combine the ACO with various local search procedures to improve the achieved
results. Our aim is to find the best parameter settings and the most appropriate local search procedure for this complex problem. We compare our results with the results achieved by the local search used in [8]. The ACO algorithm has a parallel nature; thus the computational time for the GPS problem can be decreased by running the algorithm on supercomputers. The rest of the paper is organized as follows. The general framework for the GPS surveying network problem as a combinatorial optimization problem is described in Section 2. Then, the ACO algorithm and its hybridization with different local search strategies are explained and applied in Section 3. The numerical results are presented and discussed in Section 4. The paper ends with conclusions and directions for future research.
2
Problem Description
The GPS network can be defined as a set of stations (a1, a2, . . . , an), which are co-ordinated by placing receivers (X1, X2, . . .) on them to determine sessions (a1a2, a1a3, a2a3, . . .) among them. The GPS surveying problem consists in searching for the best order in which these sessions can be organized to give the best schedule. Every session must start at the point where the previous one finished. Thus, the schedule can be defined as a sequence of sessions to be observed consecutively. The solution is represented by a linear graph with weighted edges. The nodes represent the stations, the edges represent the sessions and the weights of the edges represent the moving costs. The objective function of the problem is the cost of the solution, which is the sum of the costs (times) to move from one point to another, C(V) = Σ C(ai, aj), where ai aj is a session in solution V. For example, if the number of points (stations) is 4, a possible solution is V = (a1, a3, a2, a4) and it can be represented by the linear graph a1 → a3 → a2 → a4. The moving costs are as follows: C(a1, a3), C(a3, a2), C(a2, a4). Thus the cost of the solution is C(V) = C(a1, a3) + C(a3, a2) + C(a2, a4). The initial data is a cost matrix, which represents the cost of moving a receiver from one point to another. The cost could be evaluated purely upon the time or purely upon the distance; for more details see Dare [4]. This problem is a relaxation of the Travelling Salesman Problem (TSP), since there is no requirement that one ends where one started. Thus the strategies to solve the GPS surveying problem can be different from those for the TSP.
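The cost of a schedule is straightforward to evaluate from the cost matrix; the following sketch uses an illustrative 4-point matrix of our own (not real GPS data) and reproduces the structure of the example above.

def schedule_cost(order, cost):
    # Cost of a session schedule: the sum of the moving costs between consecutive
    # points, C(V) = sum_k C(a_k, a_{k+1}); unlike the TSP, the tour is not closed.
    return sum(cost[order[k]][order[k + 1]] for k in range(len(order) - 1))

# Illustrative 4-point cost matrix (0-based indices for a1..a4).
cost = [[0, 4, 2, 7],
        [4, 0, 3, 1],
        [2, 3, 0, 5],
        [7, 1, 5, 0]]
print(schedule_cost([0, 2, 1, 3], cost))  # C(a1,a3) + C(a3,a2) + C(a2,a4) = 2 + 3 + 1 = 6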
3
Hybrid ACO Algorithm
Real ants foraging for food lay down quantities of pheromone (chemical cues) marking the path that they follow. An isolated ant moves essentially at random but an ant encountering a previously laid pheromone will detect it and decide to follow it with high probability and thereby reinforce it with a further quantity of pheromone. The repetition of the above mechanism represents the auto-catalytic behavior of a real ant colony where the more the ants follow a trail, the more attractive that trail becomes [2].
The ACO algorithm uses a colony of artificial ants that behave as cooperative agents in a mathematical space where they are allowed to search and reinforce pathways (solutions) in order to find the optimal ones [3]. The problem is represented by a graph and the ants walk on the graph to construct solutions. The solutions are represented by paths in the graph. After the initialization of the pheromone trails, the ants construct feasible solutions, starting from random nodes, and then the pheromone trails are updated. At each step the ants compute a set of feasible moves and select the best one (according to some probabilistic rules) to continue the rest of the tour. The structure of the ACO algorithm is shown by the pseudocode below. The transition probability pij, to choose the node j when the current node is i, is based on the heuristic information ηij and the pheromone trail level τij of the move, where i, j = 1, . . . , n:

pij = (τij^α · ηij^β) / Σ_{k∈Unused} (τik^α · ηik^β)        (1)
The higher the value of the pheromone and the heuristic information, the more profitable it is to select this move and resume the search. For the heuristic information we use one over the moving cost from one point to another. Thus the sessions with lower cost are more desirable. In the beginning, the initial pheromone level is set to a small positive constant value τ0; later, the ants update this value after completing the construction stage. ACO algorithms adopt different criteria to update the pheromone level.

Hybrid Ant Colony Optimization
  Initialize number of ants;
  Initialize the ACO parameters;
  while not end-condition do
    for k = 0 to number of ants
      ant k starts from random node;
      while solution is not constructed do
        ant k selects higher probability node;
      end while
    end for
    Local search procedure;
    Update-pheromone-trails;
  end while

In our implementation we use the MAX-MIN Ant System (MMAS) [10], which is one of the best performing ant algorithms. This approach uses a fixed upper bound τmax and a lower bound τmin for the pheromone trails. Thus the accumulation of a large amount of pheromone on part of the possible movements, and the repetition of the same solutions, is partially prevented. The main features of MMAS are:
– Intensification of the search process. This can be achieved by allowing one ant, the best one, to add pheromone after each iteration.
– Diversification of the search process. After the first iteration the pheromone trails are reinitialized to τmax. In the next iteration only the movements that belong to the best solution receive pheromone, while the other pheromone values are only evaporated. The main purpose of using only one solution is to make solution elements which frequently occur in the best found solutions get a large reinforcement. The pheromone trail update rule is given by:

\tau_{ij} \leftarrow \rho \tau_{ij} + \Delta\tau_{ij}, \qquad
\Delta\tau_{ij} = \begin{cases} 1/C(V_{best}) & \text{if } (i,j) \in \text{best solution} \\ 0 & \text{otherwise} \end{cases}

where Vbest is the iteration-best solution and i, j = 1, . . . , n. To avoid stagnation of the search, the range of possible pheromone values on each movement is limited to an interval [τmin, τmax]. τmax is an asymptotic maximum of τij, with \tau_{max} = 1/((1-\rho)\,C(V^*)) and \tau_{min} = 0.087\,\tau_{max} [10], where V* is the optimal solution; since it is unknown, we use Vbest instead.

We hybridize the MMAS algorithm by using Local Search (LS) procedures, and thus try to improve the achieved costs. The main concept of LS is searching the local neighborhood of the current solution [9]. The LS procedure perturbs a given solution to generate different neighborhoods using a move generation mechanism. In general, neighborhoods for large-size problems can be costly to search. Therefore, LS attempts to improve a current schedule V of a GPS network by a small series of local improvements. A move generation is a transition from a schedule V to another one V' ∈ I(V) in one step (iteration). The returned schedule V' may not be optimal, but it is the best schedule in its local neighborhood I(V). A locally optimal schedule is a schedule with the locally minimal cost value. In this paper several local search procedures L(k, l) are applied, where k is the number of generated neighbor solutions and l is the index of the perturbation method used. With L(0, −) we denote the MMAS algorithm without LS. The LS procedure 1 is from [8], and in the next section we compare the results achieved by our LS procedures with it. The LS procedures we prepared are as follows (a small sketch of the edge-deletion moves is given after this list):
1. Nodes Sequential Swaps: for i = 1 to n − 1; for j = i + 1 to n; swap ai and aj [8];
2. Nodes Random Swaps: two nodes are chosen randomly and are swapped;
3. Randomly Delete an Edge: let the current solution be (a1 , a2 , . . . , ai , ai+1 , . . . , an ). The edge (i, i+1) is randomly chosen and deleted. The new solution is (ai+1 , . . . , an , a1 , . . . , ai );
4. Greedy Delete an Edge: the longest (most expensive) edge is deleted. The new solution is constructed as in the previous case;
5. Randomly Delete 2 Edges: let the current solution be (a1 , a2 , . . . , ai , ai+1 , . . . , aj , aj+1 , . . . , an ). The edges (i, i + 1) and (j, j + 1) are randomly chosen and deleted. The new solutions are (ai+1 , . . . , aj , a1 , . . . , ai , aj+1 , . . . , an ),
(aj+1 , . . . , an , ai+1 , . . . , aj , a1 , . . . , ai ), and (a1 , . . . , ai , aj+1 , . . . , an , ai+1 , . . . , aj );
6. Greedy Delete 2 Edges: the two longest edges are deleted. The new solutions are prepared as in the previous case.
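A brief, non-authoritative Python sketch of moves 3 and 5 (the edge deletions), assuming a solution is stored as a list of stations; the indices used in the example are arbitrary.

    import random

    def delete_one_edge(sol, i=None):
        """Move 3: cut the edge after position i and rotate the tail to the front."""
        if i is None:
            i = random.randrange(len(sol) - 1)
        return sol[i + 1:] + sol[:i + 1]

    def delete_two_edges(sol, i, j):
        """Move 5: cut the edges after positions i and j (i < j) and return the
        three block reorderings listed in the text."""
        a, b, c = sol[:i + 1], sol[i + 1:j + 1], sol[j + 1:]
        return [b + a + c, c + b + a, a + c + b]

    sol = ["a1", "a2", "a3", "a4", "a5"]
    print(delete_one_edge(sol, 2))      # ['a4', 'a5', 'a1', 'a2', 'a3']
    print(delete_two_edges(sol, 1, 3))  # three neighbour schedules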
4
Experimental Results
In this section we analyze the experimental results obtained using the hybrid MMAS algorithm described in the previous section. As test problems, we use real data from the Malta and Seychelles GPS networks. The Malta GPS network is composed of 38 sessions and the Seychelles GPS network is composed of 71 sessions. We also use six larger test problems, taken from http://www.informatik.uniheidelberg.de/groups/comopt/software/TSLIB95/ATSP.html. These test problems range from 100 to 443 sessions. For every experiment, the results are obtained by performing 30 independent runs and then averaging the fitness values obtained, in order to ensure statistical confidence in the observed differences. Analysis of the data using ANOVA/Kruskal-Wallis tests has been used to obtain a 95% level of statistical confidence in the results.

Parameter settings have a crucial role for the algorithm behavior. Therefore we first investigate the transition probability parameters α and β. When the parameter α is large, the pheromone value has a strong influence on the choice the ants make. When the value of β is large, the transition probability is highly driven by the heuristic information and the algorithm operates in a greedy way. In Table 1 we show the costs achieved for every test problem; the best values are highlighted in boldface. The values of the parameters (α, β) are taken from the set {(2,1), (1,1), (1,2), (1,3), (1,4)}.

Table 1. Influence of the parameters α and β, L(0, −)

    Tests        sessions   (2,1)   (1,1)   (1,2)   (1,3)   (1,4)
    Malta              38     924     924     920     920     920
    Seychelles         71     938     935     918     923     919
    rro124            100   42124   41980   41454   41471   41457
    ftv170            170    3581    3467    3417    3438    3429
    rgb323            323    1677    1660    1670    1676    1665
    rgb358            358    1692    1679    1683    1688    1682
    rgb403            403    3428    3426    3443    3447    3452
    rgb443            443    3776    3791    3754    3787    3792

Analyzing the results, we conclude that the MMAS algorithm using (α, β) = (1, 2) outperforms the other configurations. The minimal costs are obtained for (α, β) = (1, 2) in most of the cases, except rgb323 and rgb403; for these two tests the configuration with α = β achieves the best costs. For Malta and rgb358 there are no significant differences between (α, β) = (1, 1) and (α, β) = (1, 2). Therefore we decided to continue the tests using (α, β) = (1, 2). The other parameters are set as follows: τ0 = 0.5, ρ = 0.5, number of ants 10, and number of iterations equal to the number of sessions of the tested problem. The value of the parameter τ0 is not important, because after the first iteration (as mentioned in the previous section) all pheromone values are set to τmax. We can use a small number of ants because of the random start of the ants in every iteration, and thus the number of calculations decreases; we decided to fix the number of ants to 10.

The next parameter we determine is the evaporation rate ρ. In Table 2 we show the costs achieved for all tested problems when the parameter ρ takes the values {0.1, 0.2, 0.5, 0.8, 0.9}; the best values are highlighted in boldface.

Table 2. Influence of the evaporation parameter ρ, L(0, −)

    Tests        sessions     0.1     0.2     0.5     0.8     0.9
    Malta              38     924     924     923     902     902
    Seychelles         71     921     921     918     918     919
    rro124            100   42183   41981   41454   41485   41276
    ftv170            170    3578    3460    3417    3340    3356
    rgb323            323    1679    1671    1670    1665    1659
    rgb358            358    1697    1691    1683    1682    1679
    rgb403            403    3443    3452    3443    3438    3427
    rgb443            443    3782    3787    3754    3778    3750

Comparing the results, we conclude that the MMAS algorithm using ρ = 0.9 outperforms the other configurations. The minimal costs are obtained with ρ = 0.9 in five of the eight tests. In the other tests there are no significant differences between ρ = 0.8 and ρ = 0.9. Therefore we decided to continue our research using ρ = 0.9. The other parameters used in these tests take the same values as in the previous case.

We include in this study various kinds of local search procedures to improve the algorithm performance. To keep the running time low, the neighbor set consists of as many solutions as the number of sessions, and the number of iterations is set equal to the size of the problem. Table 3 shows the costs achieved for all tested problems; the best values are highlighted in boldface.

Table 3. Hybrid MMAS algorithms, n is the size of the problem

    Tests        sessions  L(n,1)  L(n,2)  L(n,3)  L(n,4)  L(n,5)  L(n,6)  L(0,−)
    Malta              38     902     902     895     895     872     902     902
    Seychelles         71     920     920     915     915     851     893     919
    rro124            100   41276   41276   41004   41004   39971   41281   41276
    ftv170            170    3356    3356    3229    3290    3266    3320    3356
    rgb323            323    1661    1661    1691    1666    1378    1423    1659
    rgb358            358    1680    1680    1689    1702    1477    1549    1679
    rgb403            403    3428    3428    3401    3401    2408    2710    3427
    rgb443            443    3751    3751    3838    3838    2631    3349    3750

After analysis of the results, we conclude that the LS procedure L(n, 5) outperforms the others. It obtains the best results in most of the tests, except ftv170; on this test problem there are no significant differences between the costs achieved by L(n, 3) and L(n, 5). We observe that there is no relative difference between the costs achieved by L(n, 1), L(n, 2) and L(0, −), and we conclude that L(n, 1) and L(n, 2) do not improve the costs achieved by MMAS.
5
Conclusions and Future Work
We address in this paper the GPS surveying problem. Instances containing from 38 to 443 sessions have been solved using an MMAS algorithm, both in its canonical form and hybridized with several configurations of a Local Search (LS) procedure. For each case, the best parameter settings have been investigated. In most of the test cases, the MMAS+LS hybrid has shown improvements of the results compared to the canonical MMAS. In particular, the best results have been obtained using the LS procedure L(n, 5), where n is equal to the number of sessions. To keep the running time low, the neighbor set of the LS consists of as many solutions as the number of sessions, and the number of iterations is equal to the size of the problem. In general, using larger neighborhoods and more iterations we can achieve better costs. We can state that the proposed algorithm obtains promising results, and that the hybridization with an appropriate LS produces significant improvements of the results. Ant algorithms, including MMAS, are parallel in nature, with a large transfer of data after every iteration, so they are very suitable for parallel implementation on massively parallel architectures and supercomputers. In future work, other metaheuristic algorithms will be developed, applied and compared. Our aim is to obtain a reliable sample of the results that can be expected for this problem when using different kinds of metaheuristics, and to establish a state of the art for this problem.
Acknowledgments This work has been partially supported by the Bulgarian NSF under contract DO 02–115. Guillermo Molina is supported by grant AP2005-0014 from the Spanish government.
References
1. Beckers, R., Deneubourg, J.L., Gross, S.: Trail and U-turns in the Selection of the Shortest Path by the Ants. J. of Theoretical Biology 159, 397–415 (1992)
2. Bonabeau, E., Dorigo, M., Theraulaz, G.: Swarm Intelligence: From Natural to Artificial Systems. Oxford University Press, New York (1999)
3. Corne, D., Dorigo, M., Glover, F. (eds.): New Ideas in Optimization. McGraw Hill, London (1999)
4. Dare, P.J., Saleh, H.A.: GPS Network Design: Logistics Solution Using Optimal and Near-Optimal Methods. J. of Geodesy 74, 467–478 (2000)
5. Dorigo, M., Gambardella, L.M.: Ant Colony System: A Cooperative Learning Approach to the Traveling Salesman Problem. IEEE Transactions on Evolutionary Computation 1, 53–66 (1997)
6. Fidanova, S.: An Heuristic Method for GPS Surveying Problem. In: Shi, Y., van Albada, G.D., Dongarra, J., Sloot, P.M.A. (eds.) ICCS 2007. LNCS, vol. 4490, pp. 1084–1090. Springer, Heidelberg (2007)
7. Leick, A.: GPS Satellite Surveying, 2nd edn. Wiley, Chichester (1995)
8. Saleh, H.A., Dare, P.: Effective Heuristics for the GPS Survey Network of Malta: Simulated Annealing and Tabu Search Techniques. Journal of Heuristics 7(6), 533–549 (2001)
9. Schaffer, A.A., Yannakakis, M.: Simple Local Search Problems that are Hard to Solve. Society for Industrial Applied Mathematics Journal on Computing 20, 56–87 (1991)
10. Stutzle, T., Hoos, H.H.: MAX-MIN Ant System. In: Dorigo, M., Stutzle, T., Di Caro, G. (eds.) Future Generation Computer Systems, vol. 16, pp. 889–914 (2000)
Generalized Nets as Tools for Modeling of the Ant Colony Optimization Algorithms

Stefka Fidanova1 and Krassimir Atanassov2

1 IPP – Bulgarian Academy of Sciences, Acad. G. Bonchev, bl. 25A, 1113 Sofia, Bulgaria
[email protected]
2 CLBME – Bulgarian Academy of Science, Acad. G. Bonchev, bl. 105, 1113 Sofia, Bulgaria
[email protected]
Abstract. Ant Colony Optimization (ACO) has been used successfully to solve hard combinatorial optimization problems. This metaheuristic method is inspired by the foraging behavior of ant colonies, which manage to establish the shortest routes to feeding sources and back. We discuss some possibilities for describing ACO algorithms by Generalized Nets (GNs), which help us to understand the processes in depth and to improve them. An important property of GNs is that they are expandable. For example, some of the GN places can be replaced with a new GN. Thus we can include procedures that improve the search process and the achieved results, or various kinds of estimations of the algorithm behavior.
1
Introduction
Generalized Nets (GNs) [1,2] are extensions of ordinary Petri Nets (PNs) and of their other extensions, such as Time PNs, Color PNs, Stochastic PNs, Self-organizing PNs, etc. They are not only a tool for modelling parallel processes; the GN approach to representing real processes can also play the role of a generator of new ideas for extending the initial processes. In the present paper we discuss some possibilities for applying GNs as a tool for modelling Ant Colony Optimization (ACO) algorithms. Real ants foraging for food lay down quantities of pheromone (chemical cues) marking the path that they follow. An isolated ant moves essentially at random, but an ant encountering a previously laid pheromone will detect it and decide to follow it with high probability, thereby reinforcing it with a further quantity of pheromone. The repetition of the above mechanism represents the auto-catalytic behavior of a real ant colony, where the more the ants follow a trail, the more attractive that trail becomes. ACO is inspired by real ant behavior to solve hard combinatorial optimization problems. Examples of such optimization problems are the Traveling Salesman Problem, Vehicle Routing, Minimum Spanning Tree, Constraint Satisfaction, the Knapsack Problem, etc. The ACO algorithm uses a colony of artificial ants that behave as cooperative agents in a mathematical space where they are allowed to
search and reinforce pathways (solutions) in order to find the optimal ones. The problem is represented by a graph and the ants walk on the graph to construct solutions. The solutions are represented by paths in the graph. After the initialization of the pheromone trails, the ants construct feasible solutions, starting from random nodes, and then the pheromone trails are updated. At each step the ants compute a set of feasible moves and select the best one (according to some probabilistic rules) to continue the rest of the tour. We discuss some possibilities for describing ACO algorithms by GNs, which help us to understand the processes in depth and to improve them. The ACO algorithm has a parallel nature, so the computational time can be decreased by running the algorithm on supercomputers. Through the GN description the parallel implementation of ACO can be improved. The paper is organized as follows. Section 2 gives short remarks on GNs. Section 3 presents GNs as a tool for modeling ACO algorithms. Section 4 gives an example of a GN for ACO applied to the knapsack problem. At the end, some conclusions and directions for future work are given.
2
Short Remarks on Generalized Nets
In this section we shall introduce the concept of a Generalized Net. The GN consists of transitions, places, and tokens. Every GN-transition is described by a seven-tuple (Fig. 1):

Z = \langle L', L'', t_1, t_2, r, M, \Box \rangle,

where:
(a) L' and L'' are finite, non-empty sets of places (the transition's input and output places, respectively); for the transition in Fig. 1 these are L' = {l'_1, l'_2, . . . , l'_m} and L'' = {l''_1, l''_2, . . . , l''_n};
(b) t_1 is the current time-moment of the transition's firing;
(c) t_2 is the current value of the duration of its active state;
(d) r is the transition's condition determining which tokens will pass (or transfer) from the transition's inputs to its outputs; it has the form of an Index Matrix (IM; see [4]):
r = \begin{array}{c|c}
        & l''_1 \;\cdots\; l''_j \;\cdots\; l''_n \\ \hline
 l'_1   & \\
 \vdots & r_{i,j} \quad (r_{i,j} \text{ -- predicate};\ 1 \le i \le m,\ 1 \le j \le n) \\
 l'_m   &
\end{array}

r_{i,j} is the predicate which corresponds to the i-th input and the j-th output place. When its truth value is "true", a token from the i-th input place can be transferred to the j-th output place; otherwise, this is not possible;
Fig. 1. GN-transition
(e) M is an IM of the capacities of the transition's arcs:

M = \begin{array}{c|c}
        & l''_1 \;\cdots\; l''_j \;\cdots\; l''_n \\ \hline
 l'_1   & \\
 \vdots & m_{i,j} \quad (m_{i,j} \ge 0 \text{ -- natural number};\ 1 \le i \le m,\ 1 \le j \le n) \\
 l'_m   &
\end{array}

(f) \Box is an object having a form similar to a Boolean expression. It may contain as variables the symbols which serve as labels for the transition's input places, and is an expression built up of variables and the Boolean connectives ∧ and ∨, whose semantics is defined as follows: ∧(li1, li2, . . . , liu) – every place li1, li2, . . ., liu must contain at least one token; ∨(li1, li2, . . . , liu) – there must be at least one token in at least one of the places li1, li2, . . ., liu, where {li1, li2, . . . , liu} ⊂ L'. When the value of a type (calculated as a Boolean expression) is "true", the transition can become active, otherwise it cannot.

The ordered four-tuple

E = \langle \langle A, \pi_A, \pi_L, c, f, \theta_1, \theta_2 \rangle, \langle K, \pi_K, \theta_K \rangle, \langle T, t^o, t^* \rangle, \langle X, \Phi, b \rangle \rangle

is called a Generalized Net (GN) if:
(a) A is a set of transitions;
(b) πA is a function giving the priorities of the transitions, i.e., πA : A → N, where N = {0, 1, 2, . . .} ∪ {∞};
(c) πL is a function giving the priorities of the places, i.e., πL : L → N, where L = pr1 A ∪ pr2 A, and pri X is the i-th projection of the n-dimensional set, where n ∈ N, n ≥ 1 and 1 ≤ k ≤ n (obviously, L is the set of all GN-places);
(d) c is a function giving the capacities of the places, i.e., c : L → N;
(e) f is a function which calculates the truth values of the predicates of the transitions' conditions (for the GN described here let the function f have the value "false" or "true", i.e., a value from the set {0, 1});
(f) θ1 is a function giving the next time-moment when a given transition Z can be activated, i.e., θ1(t) = t', where pr3 Z = t, t' ∈ [T, T + t*] and t ≤ t'. The value of this function is calculated at the moment when the transition terminates its functioning;
(g) θ2 is a function giving the duration of the active state of a given transition Z, i.e., θ2(t) = t', where pr4 Z = t ∈ [T, T + t*] and t' ≥ 0. The value of this function is calculated at the moment when the transition starts functioning;
(h) K is the set of the GN's tokens. In some cases, it is convenient to consider this set in the form

K = \bigcup_{l \in Q^I} K_l,

where Kl is the set of tokens which enter the net from place l, and Q^I is the set of all input places of the net;
(i) πK is a function giving the priorities of the tokens, i.e., πK : K → N;
(j) θK is a function giving the time-moment when a given token can enter the net, i.e., θK(α) = t, where α ∈ K and t ∈ [T, T + t*];
(k) T is the time-moment when the GN starts functioning. This moment is determined with respect to a fixed (global) time-scale;
(l) t^o is an elementary time-step, related to the fixed (global) time-scale;
(m) t* is the duration of the GN functioning;
(n) X is the set of all initial characteristics the tokens can receive when they enter the net;
(o) Φ is a characteristic function which assigns new characteristics to every token when it makes a transfer from an input to an output place of a given transition;
(p) b is a function giving the maximum number of characteristics a given token can receive, i.e., b : K → N.
A GN may lack some of the components, and such GNs give rise to special classes of GNs called reduced GNs. The omitted elements of the reduced GNs are marked by "*".
3
GNs as Tools for Modelling of ACO Algorithms
An interesting feature of GNs is that they are expandable. A new transition can be included between every two transitions, or at the beginning and the end of the net. Every one of the places of the net can be replaced with a GN. Thus the GN representation can show us the weaknesses of the method and possibilities for improvement. Hybridization of the method is represented by including new transitions or a new GN between two transitions. In [5] the functioning of the standard ACO algorithm is described by a GN. In [6] an extension of the first GN is constructed. It describes the functioning and the
results of the work of a hybrid ACO algorithm including a local search procedure. A next step in the extension of the GN-model is made in [7], where an ACO algorithm with a fuzzy start of the ants on every iteration is described, including a new transition at the beginning of the ACO-GN. Replacing a place of the GN with a new GN, we can include a new function of the ant. Now, we shall present some new modifications of ACO algorithms, the ideas for which come from the GN description of the method.
1. If over the path of some ant in the previous simulation there is a linear part, i.e., a part without branches, and if some ant treads on a node that is the beginning of such a path region, then in the present simulation this whole path can be traversed in one step.
2. We shall discuss introducing elements of intuitionistic fuzzy estimations of the pheromone quantity. Thus we can obtain the degree of existence of the pheromone for a fixed node and the degree of the evaporated pheromone.
3. A GN-model of multiplying ants can be constructed. In this case we can simulate the process of joint ant transfer to a node, where the paths of the different ants will be divided. The GN-model can use some restrictions for this situation, because if the ants multiply at each step, their number will increase exponentially. For example, we can allow multiplication only in the case when the ants are in a near neighborhood of a node where there was a big value in the previous simulation. Another version of this idea is: only the ant that is nearest to such a node can multiply, and at each time-moment this is possible for only one ant.
4. Including "mortal" ants in the algorithm. In this case some of the ants can die, e.g., when some of them tread on a node through which another ant has already gone.
5. A modification of this model will be a model in which some (or all) ants have a limited life-time. In this case we must find the maximal values within a limited time.
4
A GN-Model of ACO for Multiple Knapsack Problem
In this section we construct a GN for the Multiple Knapsack Problem (MKP). A description of the Multiple Knapsack Problem can be found in [3]. The Generalized Net (GN, see [2,4]) has 4 transitions, 11 places, and three types of tokens (α, ε, and β1, β2, . . . — their number determined in advance) (see Fig. 2). These tokens enter, respectively, places l1 with the initial characteristic "volumes of knapsacks" and l2 with the initial characteristic of the i-th β-token "i-th thing, volume, price".

Z_1 = \langle \{l_1\}, \{l_3\},\ \begin{array}{c|c} & l_3 \\ \hline l_1 & W_{1,3} \end{array} \rangle
Fig. 2. GN net model for ACO
where W1,3 = "transition function". The aim of this construction is for token α to go to place l3 when all β-tokens are already in place l5, and therefore we can calculate the common price of the things (let them be N in number):

CP = \sum_{i=1}^{N} pr_3\, x_0^{\beta_i},

where x_s^{\omega} is the s-th characteristic (s ≥ 0; the 0-th characteristic is the initial one) of token ω. Now, token α can obtain as a new characteristic the "middle price per unit volume", i.e., CP / x_0^{\alpha}.
Token α from place l1 enters place l3 with a characteristic "vector of current transition function results ϕ1,cu, ϕ2,cu, . . ., ϕn,cu", while token ε stays only in place l4, obtaining the characteristic "new m-dimensional vector of heuristics with elements – the graph vertices, or new l-dimensional vector of heuristics with elements – the graph arcs".

Z_2 = \langle \{l_2\}, \{l_4, l_5\},\ \begin{array}{c|cc} & l_4 & l_5 \\ \hline l_2 & W_{2,4} & W_{2,5} \end{array} \rangle
where:
W2,4 = "a new m-dimensional vector of heuristics",
W2,5 = "all β-tokens are already collected in place l5".

Z_3 = \langle \{l_3, l_4, l_6, l_9, l_{10}\}, \{l_6, l_7, l_8, l_9\},
  \begin{array}{c|cccc}
         & l_6     & l_7     & l_8     & l_9     \\ \hline
  l_3    & W_{3,6} & W_{3,7} & false   & W_{3,9} \\
  l_4    & W_{4,6} & W_{4,7} & false   & W_{4,9} \\
  l_6    & W_{6,6} & false   & false   & false   \\
  l_9    & false   & W_{9,7} & W_{9,8} & W_{9,9} \\
  l_{10} & true    & true    & true    & false
  \end{array} \rangle
where:
W3,7 = W3,6 = W3,9 = ¬W6,6 ∨ ¬W9,8,
W4,6 = W4,7 = W4,9 = ¬W6,6 ∨ ¬W9,8,
W6,6 = "the current iteration is not finished",
W9,7 = "the current best solution is better than the global best solution",
W9,8 = "the truth-value of the expression C1 ∨ C2 ∨ C3 is true",
W9,9 = ¬W9,8,
where C1, C2 and C3 are the following end-conditions:
C1 – "the computational time (maximal number of iterations) is achieved",
C2 – "the number of iterations without improving the result is achieved",
C3 – "if the upper/lower bound is known, then the current results are close (e.g., less than 5%) to the bound".
Token α from place l3 enters place l6 with a characteristic "S1,cu, S2,cu, . . ., Sn,cu", where Si,cu is the current partial solution for the current iteration, made by the i-th ant (1 ≤ i ≤ n). If W6,6 = true, it splits into three tokens α, α' and α'' that enter place l6 — token α — with a characteristic "new n-dimensional vector with elements — the couples of the new ants' coordinates", place l7 — token α' — with the last α-characteristic, and place l9 — token α'' — with a characteristic "the best solution for the current iteration; its number". Token α'' can enter place l8 only when W9,8 = true, and there it obtains the characteristic "the best achieved result".

Z_4 = \langle \{l_7\}, \{l_{10}, l_{11}\},\ \begin{array}{c|cc} & l_{10} & l_{11} \\ \hline l_7 & W_{7,10} & W_{7,11} \end{array} \rangle

where:
W7,10 = "the truth-value of the expression C1 ∨ C2 ∨ C3 is true",
W7,11 = ¬W7,10.
5
Conclusion
This paper shows a description of an ACO algorithm by a GN. This description shows us the weaknesses of the algorithm and points to possible improvements. An important feature of GNs is that they are expandable, so we can add new transitions and replace places with other GNs. This gives us ideas on how to construct new, modified ACO algorithms and to improve the achieved results. As future work, we shall develop new ACO algorithms and apply them to various difficult optimization problems. The aim is to construct the most appropriate algorithm for a given class of problems. Ant algorithms are parallel in nature, with a large transfer of data after every iteration, so they are very suitable for parallel implementation on massively parallel architectures and supercomputers.
Acknowledgments This work has been partially supported by the Bulgarian NSF under contract DO 02–115.
References
1. Alexieva, J., Choy, E., Koychev, E.: Review and bibliography on generalized nets theory and applications. In: Choy, E., Krawczak, M., Shannon, A., Szmidt, E. (eds.) A Survey of Generalized Nets. Raffles KvB Monograph, vol. 10, pp. 207–301 (2007)
2. Atanassov, K.: Generalized Nets. World Scientific, Singapore (1991)
3. Fidanova, S.: Ant colony optimization and multiple knapsack problem. In: Renard, J.P. (ed.) Handbook of Research on Nature Inspired Computing for Economics and Management, pp. 498–509. Idea Group Inc. (2006) ISBN 1-59140-984-5
4. Atanassov, K.: On Generalized Nets Theory. Prof. M. Drinov Publishing House, Sofia (2007)
5. Fidanova, S., Atanasov, K.: Generalized Net Models of the Process of Ant Colony Optimization. Issues on Intuitionistic Fuzzy Sets and Generalized Nets 7, 108–114 (2008)
6. Fidanova, S., Atanassov, K.: Generalized Net Model for the Process of Hybrid Ant Colony Optimization. Comptes Rendus de l'Academie Bulgare des Sciences 62(3), 315–322 (2009)
7. Fidanova, S., Atanasov, K.: Generalized Net Models of the Process of the Ant Colony Optimization with Intuitionistic Fuzzy Estimations. In: Proc. of the Ninth Int. Conf. of Generalized Nets, Sofia, pp. 41–48 (2008)
A Scatter Search Approach for Solving the Automatic Cell Planning Problem

Francisco Luna, Juan J. Durillo, Antonio J. Nebro, and Enrique Alba

Departamento de Lenguajes y Ciencias de la Computación, E.T.S. Ingeniería Informática, Campus de Teatinos, 29071 Málaga, Spain
{flv,durillo,antonio,eat}@lcc.uma.es
Abstract. Planning a cellular phone network makes engineers face a number of challenging optimization problems. This paper addresses the solution of one of these problems, Automatic Cell Planning (ACP), which lies in positioning the antennae of the network and configuring them properly in order to meet several objectives and constraints. This paper approaches, for the first time, the ACP problem with a Scatter Search technique. The algorithm used is called AbYSS. Three large-scale real-world instances have been used for evaluating the search capabilities of AbYSS on this optimization problem. The results show that AbYSS not only reaches very accurate solutions for the three instances, but also scales well with instances of increasing size.
1
Introduction
Automatic Cell Planning (ACP) [10] is one of the main design tasks that emerges in the deployment of cellular telecommunication networks. Given a set of candidate sites where the antennae of the network can be placed, the problem lies in choosing a subset of these sites and then configuring their installed antennae properly in such a way that both the cost of the entire infrastructure (i.e., the number of installed sites) and the signal interference are minimized while its traffic capacity is maximized. In real-world cellular networks, like those tackled in this work, the ACP problem also becomes a large-scale optimization problem, since the number of sites required to meet the cellular operator requirements (traffic and capacity) is high and each of these sites has many configuration settings. As can be observed, the ACP problem is multiobjective in nature, i.e., contradictory objectives (cost vs. quality of service — measured in terms of traffic capacity and signal interference) are to be optimized simultaneously. Instead of aggregating these objectives into one single function, we have used a multiobjective formulation. Contrary to single-objective optimization, the solution of such a multiobjective problem is not one single solution, but a set of nondominated solutions known as the Pareto optimal set, which is called the Pareto border or Pareto front when it is plotted in the objective space [1,2]. Any solution of this set is optimal in the sense that no improvement can be reached on one objective without worsening at least another one. The main goal in the resolution
of a multiobjective problem is to compute the set of solutions within the Pareto optimal set and, consequently, the Pareto front. Our approach for addressing the ACP problem is a multiobjective Scatter Search (SS) algorithm called AbYSS (Archive-based hYbrid Scatter Search) [9]. To the best of our knowledge, this is the first time an SS method is used for solving this problem. Our goal is therefore twofold. On the one hand, we want to evaluate the suitability of AbYSS for addressing this kind of problem and, on the other hand, to analyze its ability to handle instances of increasing size. For a comparison basis we have used three real-world cellular networks coming from the telecommunication industry (with 249, 568, and 747 sites), and PAES [4], a widely known algorithm in the literature previously used by the authors for this problem [7]. The experiments show that AbYSS reaches very accurate results, providing the cellular network designer with highly efficient configurations for the given instances. The results also show that our approach is able to scale well with the size of the instances. The rest of the paper is structured as follows. The next section provides both an overall view of the ACP problem and the particular approach used in this paper. Section 3 describes AbYSS and the modifications engineered to meet the requirements imposed by the optimization problem at hand. The experimental analysis and evaluation are included in Sect. 4. Finally, Sect. 5 presents the main conclusions and future lines of work.
2
The ACP Problem
Engineering a cellular phone network gives rise to many challenging optimization tasks at different levels of the system deployment (business planning, design, routing, resource allocation, etc.) [12]. The ACP problem, also known as the network dimensioning problem or the capacity planning problem, appears as a major design problem that may have a large impact not only in the early phases of the network deployment but also in subsequent expansions/modifications of existing networks. In general, the ACP problem can be described as follows: given an area where the service has to be guaranteed, determine where to locate the antennae or BTSs (Base Transceiver Stations) and select their configurations (antenna type, number of antennae, emission power, tilt, and azimuth) so that each point (or each user) in the area receives a sufficiently strong signal. Several approaches have been proposed in the literature to address this problem [10]. The one used in this work is based on the so-called "Cell and Test Point Model", in which the working area is discretized into a set of test points that are spread over the whole area. These test points are used to measure the amount of signal strength in the region where the network operator intends to service the traffic demand of a set of customers. Three subsets of test points are defined: Reception Test Points (RTPs), where the signal quality is tested; Service Test Points (STPs), where the signal quality must exceed a minimum threshold to be usable by the customers; and the Traffic Test Points (TTPs), where a certain traffic amount is associated to the customers (measured in Erlangs). The model
ensures that TTP ⊆ STP ⊆ RTP. Instead of allowing the location of BTSs in any position, a set of candidate sites where BTSs could be installed is identified so as to represent the real world scenarios with more realism. Not only the working area is discretized, but also the BTS configuration parameters such as the emission power, tilt, and azimuth only consider a subset of possible values. Finally, three objectives are to be optimized: minimizing both the cost of the network (fcost ) and the interferences between cells (finterf ), and maximizing the traffic held or network capacity (ftraf f ic ). Also, two constraints have to be satisfied: at least 80% of the working area has to be covered (ccov ) and a minimum handover (or cell overlapping to guarantee communication continuity) has to be guaranteed. A fully detailed description of the mathematical model used in this work can be found in [11,13].
3
Setting Up AbYSS
AbYSS (Archive-based hYbrid Scatter Search) [9] is a hybrid metaheuristic algorithm which follows the scatter search template [3] but using mutation and crossover operators coming from the Evolutionary field. This algorithm combines ideas of three state-of-the-art multiobjective evolutionary algorithms. On the one hand, an external archive is used to store the nondominated solutions found during the search, following the scheme applied by PAES [4], but using the crowding distance of NSGA-II as a niching measure instead of the adaptive grid; on the other hand, the selection of solutions from the initial set to build the reference set applies the density estimation of SPEA2 (see [1] for the details on these last two algorithms). The SS template, and thus AbYSS, is defined by five methods: diversification generation, improvement, reference set update, subset generation, and solution combination. These five methods are structured as shown in Fig. 1. The SS template is based on using a small population, known as the reference set, whose
Fig. 1. Outline of the standard scatter search algorithm
individuals are combined to build new solutions which are generated systematically. Furthermore, these new individuals can be improved by applying a local search method. The reference set is initialized from an initial population composed of diverse solutions, and it is updated with the solutions resulting from the local search improvement. The five SS methods are described below, right after detailing how the solutions manipulated by the algorithms are encoded.
3.1
Solution Representation
A tentative solution of the ACP problem encodes the entire cellular network configuration [13]. A multilevel encoding has been used in which level 1 handles the site activation, level 2 sets up the number and type of antennae, and level 3 configures the parameters of the BTS. Then, when a given site is enabled, one or more antennas are activated, always keeping either one single omnidirectional antenna or from one to three directive antennas. Figure 2 displays the hierarchical encoding used (site activation; omnidirectional/directive; BTS 1–3 with loss diagram, power, tilt, and azimuth). As can be seen, it is not a classical encoding, so the genetic operators used by the EAs have to be properly designed.
Fig. 2. Hierarchical encoding
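As a rough illustration of this multilevel encoding, the following Python sketch shows one possible in-memory representation; the class names, field names and concrete values are our assumptions, not the authors' actual data structures.

    from dataclasses import dataclass, field
    from typing import List, Optional

    @dataclass
    class DirectiveBTS:
        loss_diagram: str   # "small_directive" or "large_directive"
        power: float
        tilt: int
        azimuth: int

    @dataclass
    class SiteConfig:
        active: bool = False                       # level 1: site activation
        omnidirectional: Optional[float] = None    # omni antenna power, if that option is used
        directives: List[DirectiveBTS] = field(default_factory=list)  # 1..3 directive BTSs

    # A tentative ACP solution is then simply one SiteConfig per candidate site:
    solution = [SiteConfig() for _ in range(250)]          # e.g. Arno 1.0 has 250 candidate sites
    solution[0] = SiteConfig(active=True,
                             directives=[DirectiveBTS("large_directive", 43.0, 4, 120)])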
The Template Methods
AbYSS was originally proposed for solving continuous multiobjective optimization problems, so the new methods have to be engineered to deal with the new solution encoding:
– Diversification generation: initial solutions are generated by using a fully random strategy (a small sketch of this strategy is given after this list). It works as follows: each site is randomly activated/deactivated. If it is activated, a random configuration for this site is then generated. First, either an omnidirectional antenna or several directive antennas are installed. In the former setting, a random value for the antenna power has to be chosen. In the latter one, the number of BTSs has to be set up (from one to three). Then, for each directive BTS, its loss diagram (either short or large directive) as well as its power, tilt, and azimuth values are randomly selected.
– Improvement: in AbYSS, this method is defined by iteratively changing one given solution by means of a mutation operator [9]. The same improvement approach is used here, but with the multilevel mutation proposed in [8].
– Solution combination: a crossover operator is used as the solution combination method in AbYSS. The geographic crossover presented in [8] has been used.
– Subset generation: the same strategy as in AbYSS is used here.
Fig. 3. Topology of the instances (a) Arno 1.0, (b) Arno 3.0, and (c) Arno 3.1
– Reference set update: this method remains the same as in AbYSS, but one main issue emerges here: how is the distance between solutions measured in order to check whether a given solution becomes a member of the diversity subset of the Ref Set. We have used the distance proposal defined in [6], which is aimed at reporting high values when comparing tentative solutions in which the same candidate sites have very different configurations.
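The diversification generation method described above can be sketched as follows; the discrete value sets for power, tilt and azimuth, and the 50% activation probability, are illustrative assumptions only.

    import random

    POWERS = [30.0, 33.0, 36.0, 39.0, 43.0]          # assumed discrete power levels
    TILTS = list(range(0, 9))
    AZIMUTHS = list(range(0, 360, 30))

    def random_site() -> dict:
        """Randomly activate a site and, if active, draw a random configuration."""
        if random.random() < 0.5:
            return {"active": False}
        if random.random() < 0.5:
            # one single omnidirectional antenna with a random power
            return {"active": True, "omni_power": random.choice(POWERS)}
        bts = [{"loss": random.choice(["small_directive", "large_directive"]),
                "power": random.choice(POWERS),
                "tilt": random.choice(TILTS),
                "azimuth": random.choice(AZIMUTHS)}
               for _ in range(random.randint(1, 3))]   # one to three directive BTSs
        return {"active": True, "bts": bts}

    # An initial (diverse) population for, e.g., a 250-site instance:
    initial_population = [[random_site() for _ in range(250)] for _ in range(20)]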
4
Experimentation
This section is devoted to presenting the experiments conducted in this paper. First, we include the details of the ACP instances tackled. The methodology used is described afterwards. Finally, the results achieved are presented and analyzed. 4.1
ACP Instances
Three real-world instances provided by France Telecom R&D have been tackled here. They correspond with two well distinguished scenarios: the instance Arno 1.0 is a highway environment, whereas the instances Arno 3.0 and Arno 3.1 model two urban areas of different size. Their topologies are displayed in Fig. 3. Table 1 includes the most relevant values that characterize the three instances. It can be seen that they have an increasing size, ranging from 250 sites in Arno 1.0 to 743 sites in the Arno 3.1 instance (almost three times larger). It is important to note that not only the number of sites is increased but also the number of test points used to measure the quality of service in the working area.
Table 1. Several values that characterize the three instances tackled

    Instance                              Arno 1.0   Arno 3.0   Arno 3.1
    Total traffic (Erlangs)               3 210.91   2 988.12   8 089.78
    Number of sites                            250        568        747
    Number of test points (|ST|)            29 955     17 394     42 975
    Number of traffic test points (|T|)      4 967      6 656     21 475
4.2
Algorithm Settings
As mentioned before, PAES has been considered in this preliminary study to check the suitability of AbYSS for addressing the ACP problem. In order to guarantee a fair and meaningful comparison, the two algorithms stop when reaching 25000 function evaluations. The maximum number of nondominated solutions in the approximated Pareto front is 100. PAES also applies the multilevel mutation operator used in AbYSS. In the two algorithms, tentative solutions are always mutated by changing one single site. The geographical crossover (solution combination method in AbYSS) is also applied with a probability of 1.0. The remaining settings of AbYSS are the same as those proposed in [9].
4.3
Methodology
Given that the Pareto fronts of the studied ACP instances are not known, we have used the hypervolume (HV) quality indicator [14], which does not require knowing that information beforehand. It is a Pareto-compliant indicator [5] which measures both convergence towards Pareto optimal solutions and the spread of solutions along the nondominated front. Higher values of the hypervolume indicator are desirable. Since it is not free from an arbitrary scaling of the objectives, all the fronts have been normalized in order to guarantee a fair comparison among the algorithms. Since we are dealing with stochastic algorithms and we want to provide the results with statistical significance, 30 independent runs for each algorithm and each problem instance have been carried out. The HV indicator is then computed for each of the approximated fronts, thus obtaining 30 HV values for each pair algorithm/instance. Next, the following statistical analysis has been carried out. First, a Kolmogorov-Smirnov test is performed in order to check whether the values of the results follow a normal (Gaussian) distribution or not. If so, the Levene test checks for the homogeneity of the variances. If the samples have equal variances (positive Levene test), an ANOVA test is done; otherwise we perform a Welch test. For non-Gaussian distributions, the non-parametric Kruskal-Wallis test is used to compare the medians of the algorithms. We always consider in this work a confidence level of 95% (i.e., a significance level of 5% or a p-value under 0.05) in the statistical tests, which means that the differences are unlikely to have occurred by chance with a probability of 95%.
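One possible realization of this test cascade, sketched with scipy.stats for the two samples of 30 HV values produced by AbYSS and PAES on one instance; the exact test variants and settings used by the authors are not fully specified, so this is only an approximation under stated assumptions.

    import numpy as np
    from scipy import stats

    def is_normal(x, alpha=0.05):
        # Kolmogorov-Smirnov test against a normal distribution fitted to the sample
        return stats.kstest(x, "norm", args=(np.mean(x), np.std(x, ddof=1))).pvalue > alpha

    def significant_difference(hv_a, hv_b, alpha=0.05):
        if is_normal(hv_a, alpha) and is_normal(hv_b, alpha):
            if stats.levene(hv_a, hv_b).pvalue > alpha:                    # equal variances
                p = stats.f_oneway(hv_a, hv_b).pvalue                      # ANOVA
            else:
                p = stats.ttest_ind(hv_a, hv_b, equal_var=False).pvalue    # Welch test
        else:
            p = stats.kruskal(hv_a, hv_b).pvalue                           # Kruskal-Wallis test
        return p < alpha   # True -> difference significant at the 95% confidence level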
Table 2. HV values of AbYSS and PAES for the three tackled instances

    Problem      AbYSS (x̄ ± σn)       PAES (x̄ ± σn)
    Arno 1.0     0.3208 ± 0.0229      0.2343 ± 0.0680   +
    Arno 3.0     0.4157 ± 0.0208      0.1889 ± 0.0290   +
    Arno 3.1     0.3637 ± 0.0237      0.1166 ± 0.0255   +
Fig. 4. Example fronts of the instances (a) Arno 1.0, (b) Arno 3.0, and (c) Arno 3.1 (objectives plotted: held traffic, interferences, number of sites; AbYSS vs. PAES)
4.4
Results
Table 2 includes the mean, x̄, and the standard deviation, σn, of the HV indicator values reached by the approximated Pareto fronts computed by AbYSS and PAES on the three studied ACP instances. The "+" symbol in the last column points out that the differences between the mean HV values obtained are statistically significant (successful tests in all the experiments). We have also used a gray background to highlight the larger (better) HV value obtained for each instance. The first clear conclusion that can be drawn from the table is that AbYSS always reaches approximated Pareto fronts with better (higher) HV values. These are expected results, since the SS approach has a more powerful search engine that uses not only a mutation operator (multilevel) but also a recombination one
(geographic crossover). Table 2 also shows an interesting fact: the larger the instance, the greater the difference between the HV values of the two algorithms. Indeed, the HV obtained by PAES in Arno 1.0, Arno 3.0, and Arno 3.1 are, respectively, 73.02%, 45.43%, and 32.07% of the HV of AbYSS. This means that the latter is more capable of exploring larger search spaces. To graphically show the differences between the two algorithms, Fig. 4 displays the Pareto fronts of the three ACP instances obtained in one single run. This figure clearly points out the enhanced search capabilities of AbYSS, i.e., it is able to cover a larger portion of the Pareto front in the three instances. From the point of view of the network designer, the benefits are really relevant: AbYSS provides a richer set of network configurations (nondominated solutions) which present a wider trade-off between the three contradictory ACP objectives. Specifically, it can be seen that the SS-based algorithm reaches solutions involving a very low cost (low number of sites) and causing little interference, but holding a low amount of traffic.
5
Conclusions and Future Work
This paper presented a preliminary approach of using a Scatter Search algorithm called AbYSS to address the Automatic Cell Planning problem. This is a very relevant planning problem that appears in the design of cellular phone networks. In many real-world instances, like those considered here, it is also a large-scale optimization problem, since these networks may cover large geographical areas and provide a potentially large number of users with a high quality of service. In this preliminary study, AbYSS has been compared to PAES over three increasingly sized real-world instances. The hypervolume indicator has been used to measure the quality of the obtained nondominated sets of solutions. The results have shown that AbYSS reaches approximations of the Pareto front having better (higher) values for this quality indicator. This means that our approach is able to provide the network designer with a wide set of network configurations from which the most appropriate one can be chosen. The experiments have also revealed that AbYSS, contrary to PAES, can better deal with large-scale instances, since the HV values do not diminish as the instance size grows. As future work, the effectiveness of AbYSS with respect to more advanced multiobjective optimizers such as NSGA-II or SPEA2 has to be checked. Further experiments with other configurations of AbYSS and with other problem instances are also planned.
References
1. Coello, C.A., Lamont, G.B., Veldhuizen, D.A.V.: Evolutionary Algorithms for Solving Multi-Objective Problems, 2nd edn. Genetic and Evolutionary Computation Series. Springer, Heidelberg (2007)
2. Deb, K.: Multi-Objective Optimization Using Evolutionary Algorithms. John Wiley & Sons, Chichester (2001)
3. Glover, F.: A template for Scatter Search and Path Relinking. In: Hao, J.-K., Lutton, E., Ronald, E., Schoenauer, M., Snyers, D. (eds.) AE 1997. LNCS, vol. 1363, pp. 13–54. Springer, Heidelberg (1998)
4. Knowles, J., Corne, D.: The pareto archived evolution strategy: A new baseline algorithm for multiobjective optimization. In: IEEE CEC, pp. 9–105. IEEE Press, Los Alamitos (1999)
5. Knowles, J.D., Thiele, L., Zitzler, E.: A tutorial on the performance assessment of stochastic multiobjective optimizers. Technical Report TIK-Report 214, Computer Engineering and Networks Laboratory, ETH Zurich (2006)
6. Luna, F.: Metaheurísticas avanzadas para problemas reales en redes de telecomunicaciones (in Spanish). PhD thesis, University of Málaga (2008)
7. Luna, F., Nebro, A.J., Alba, E.: Parallel evolutionary multiobjective optimization. In: Nedjah, N., Alba, E., de Macedo, L. (eds.) Parallel Evolutionary Computations. Studies in Computational Intelligence, vol. 22, pp. 33–56. Springer, Heidelberg (2006)
8. Meunier, H., Talbi, E.G., Reininger, P.: A multiobjective genetic algorithm for radio network optimization. In: IEEE CEC, pp. 317–324. IEEE Press, Los Alamitos (2000)
9. Nebro, A.J., Luna, F., Alba, E., Dorronsoro, B., Durillo, J.J., Beham, A.: AbYSS: Adapting scatter search to multiobjective optimization. IEEE Transactions on Evolutionary Computation 12(4), 439–457 (2008)
10. Raisanen, L.: Multi-objective site selection and analysis for GSM cellular network planning. PhD thesis, Cardiff University (2006)
11. Reininger, P., Caminada, A.: Model for GSM radio network optimization. In: 2nd ACM Int. Conf. on Discrete Algorithms and Methods for Mobility. ACM Sigmobile, pp. 35–42 (1998)
12. Resende, M.G.C., Pardalos, P.M. (eds.): Handbook of Optimization in Telecommunications. Springer, Heidelberg (2006)
13. Talbi, E.G., Meunier, H.: Hierarchical parallel approach for GSM mobile network design. Journal of Parallel and Distributed Computing 66(2), 274–290 (2006)
14. Zitzler, E., Thiele, L.: Multiobjective evolutionary algorithms: A comparative case study and the strength Pareto approach. IEEE Transactions on Evolutionary Computation 3(4), 257–271 (1999)
Some Aspects Regarding the Application of the Ant Colony Meta-heuristic to Scheduling Problems

Ioana Moisil and Alexandru-Liviu Olteanu

"Lucian Blaga" University of Sibiu, Faculty of Engineering, Str. Emil Cioran, Nr. 4, Sibiu, 550025, Romania
[email protected],
[email protected]
Abstract. Scheduling is one of the most complex problems that appear in various fields of activity, from industry to scientific research, and have a special place among the optimization problems. In our paper we present the results of our computational study i.e. an Ant Colony Optimization algorithm for the Resource-Constrained Project Scheduling Problem that uses dynamic pheromone evaporation.
1
Introduction
Scheduling is one of the most complex problems that appear in various fields of activity, from industry to scientific research, and have a special place among combinatorial optimization problems (COP). Different operating environments, with different constraints and optimization criteria, make the scheduling problem quite challenging. In short, a scheduling problem is a problem where a given set of activities or jobs need to be put in order such that certain optimization criteria are minimized. A job can have many operations and, in general, operations from the same job cannot be processed at the same time. The activities can have precedence constraints, resource needs, start times, deadlines and other restrictions. The number of machines and jobs varies between different kinds of scheduling problems (Single Machine Total Tardiness Problem — SMTTP, Job Shop — JSP, Open Shop — OSP, Group Shop scheduling Problem — GSP, Resource-Constrained Project Scheduling Problems — RCPSP, Multi-Modal Resource Constrained Project Scheduling Problems — MMRCPSP, Multi-Project Scheduling Problems — MPSP, and variants). For any kind of problem, different orders of operations result in different amounts of time to complete all operations (makespan), and in almost all cases the main objective is to minimize the makespan. Most scheduling problems that are of practical interest are NP-hard (nondeterministic polynomial-time hard) problems. There are many methods for finding feasible and optimal solutions for a scheduling problem, ranging from exact methods to metaheuristics [15,5]. Exact methods, such as CPM (critical path method), linear programming or bounded
enumeration (decision trees) can lead to an optimal solution and have the advantage that they can determine if there is no feasible solution, but they become impractical for large-size problems and for large sets of constraints, and cannot cope with the uncertainty and dynamics of real-life problems. Heuristic methods (MIN SLK — minimum slack time; MIN LFT — nearest latest finish time; SFM — shortest feasible duration; LRP — the least resource proportion; hierarchical approaches; simulated annealing, etc.) and metaheuristics (evolutionary algorithms, swarm intelligence [1]) cannot determine if there is no feasible solution. Other methods are evolutionary algorithms, molecular [10], DNA and membrane computing [16], and hybrid approaches. In our paper we consider applying the ACO (Ant Colony Optimization) meta-heuristic to the Resource-Constrained Project Scheduling Problem, bringing a variation to the classic approach and comparing the performance with that obtained using a genetic algorithm and a best search method.
1.1
Resource-Constrained Project Scheduling
In the following we will describe the Resource-Constrained Project Scheduling Problem (RCPSP). The RCPSP is a more general case of the scheduling problems. Let J = {J1 , J2 , . . . , Jn } be a set of n jobs, M = {M1 , M2 , . . . , Mm } a set of m machines, and P = {p1 , p2 , . . . , pn } the processing times of the jobs. These jobs are inter-related in the form of a digraph. The precedence relation is unidirectional: if Ji links to Jj , then the first is a predecessor of the second. The graph is constructed so that no loops are created. Two dummy jobs are added; they have no processing times or resource needs. The starting job S links to all the jobs that have no predecessors, while the finishing job F is linked by all the jobs that have no successors. An example of a precedence graph is shown in Fig. 1. Each job Ji has a set of machines mi ⊆ M that need to work at the same time in order for it to be completed.
Fig. 1. A precedence graph
Fig. 2. Solution space
A valid solution is a chain starting from the first job S and finishing at the ending job F. The jobs need to be chosen in such an order that the precedence relations are not broken. In Fig. 2 we have shown the solution space for a problem with 4 jobs. This figure contains all the chains that can be formed between these jobs if no precedence constraints are active. With the increase in problem size, the solution space grows exponentially. Even if the precedence constraints reduce the number of valid solutions, enforcing them adds another layer of computational complexity, which underlines the complexity of the RCPSP. Fig. 3 shows the resource usage for the optimal solution of the problem with the precedence constraints depicted in Fig. 1.
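For illustration, a small Python sketch (not from the paper) that checks whether a given job order respects the precedence constraints; the jobs and the precedence relation used here are hypothetical.

    # `predecessors` maps each job to the set of jobs that must finish before it.
    def is_valid_order(order, predecessors):
        done = set()
        for job in order:
            if not predecessors.get(job, set()) <= done:
                return False
            done.add(job)
        return True

    predecessors = {"J3": {"J1"}, "J4": {"J2", "J3"}}
    print(is_valid_order(["J1", "J2", "J3", "J4"], predecessors))  # True
    print(is_valid_order(["J3", "J1", "J2", "J4"], predecessors))  # False (J3 before J1)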
2
Problem Solution
We will present further how the Ant Colony Optimization algorithm works, and the changes we have brought to it. Also, in the results section we will show how this affects the overall performance of the algorithm, as well as a comparison with two other methods. 2.1
ACO (Ant Colony Optimization)
The idea behind the ACO metaheuristic was inspired by the foraging behaviour of ant colonies, which are able to find the shortest path to a source of food by communicating indirectly through a substance called pheromone [7,3]. Ants do not have a systematic way of searching for food; they start randomly to explore the area near their nest. When an ant finds a source of food, it determines in some way the quantity and the quality of the food, takes a certain amount to carry to the nest, and on its way back deposits on the path a trail of pheromone, a chemical substance. The quantity of pheromone depends on the quantity and quality of the food. The pheromone trail will then become an indication for other ants of the path to the food. The shorter the path, the more pheromone is
Fig. 3. Optimal solution
deposited [7,13]. This behaviour was used by the model of artificial ant colonies for solving combinatorial optimization problems. In the process of building a solution for a problem, artificial ants move concurrently and asynchronously on an appropriately constructed graph [8]. The ACO metaheuristic was introduced by Dorigo et al. [8]. The basic form of the meta-heuristic is:

    Foreach Iteration
      Construct Solution
      Update Pheromone
      Daemon Actions (optional)
    Endfor

The first CO problem to which ACO was applied was the JSP (Colorni, Dorigo, Maniezzo, and Trubian, 1994). They defined a construction graph made up of the operations to be scheduled, with an additional node representing the empty sequence from which ants start. The pheromone is associated with the edges of the graph. In later applications the pheromone is associated with the absolute position of an operation in the permutation; hence there is no more need of an additional node. The first application of the ACO metaheuristic to the RCPSP was presented by Merkle et al. [13]. In their approach the eligibility of an activity to be scheduled next is based on a weighted evaluation of the latest start time (LST) priority rule and on pheromones that represent the learning effect of previous ants. A pheromone value τij describes how promising it seems to put activity j as the i-th activity into the schedule. Their approach includes separate ants for forward and backward scheduling. There is also a second optimization phase based on local search.
2.2
Building the Solution
In this step the ants form solutions in the form of the order in which the activities are chosen to be assigned to the resources. The steps involved in building a solution are:
Start with S
while not finished
    C = Candidate Nodes
    if C not null
        Choose Next Node
    else
        if C contains F
            finished
        else
            Advance Time
        end if
    end if
end while

The ant agents start from the initial phase and choose the next job from the list of candidate jobs until they reach the final phase. This list contains all the jobs whose predecessors have all finished execution and which have enough resources available to start execution immediately. As we saw in the precedence graph, the final phase F has all the possible terminal jobs connected to it. It will only appear in the candidate list when all the other jobs have finished execution. The ants choose the next node from the candidate list based on:

– Heuristic value — ηij
– Pheromone value — τij

The heuristic value ηij encodes problem-specific knowledge. For our problem, in order to minimize the span of the project, at each decision point jobs that have a longer critical path towards the final phase have a higher probability of being chosen. This is done by giving such a job a higher heuristic value. The heuristic value is the difference between the latest finishing time of all activities, denoted LFTF, and the latest finishing time of the job [13]:

ηij = LFTF − LFTj + 1.   (1)

The value is set at start time for all the jobs. It does not change throughout the execution of the algorithm and is computed by starting from the finishing node and recursively finding the longest path to the current job node. The pheromone value τij is a measure of past experience, and shows whether scheduling job j after job i brought good solutions in the past. The probability of choosing job j to follow job i is

pij = ([ηij]^α · [τij]^β) / (Σ_{h∈D} [ηih]^α · [τih]^β),   (2)

where D is the set of possible jobs that can be scheduled after job Ji.
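As an illustration of Eq. (2), the probabilistic choice of the next job can be sketched as follows (roulette-wheel selection over the candidate set D). This is only a sketch, not the authors' implementation; eta and tau are assumed to hold the heuristic and pheromone values, and all names are illustrative.

import java.util.List;
import java.util.Random;

// Sketch of the probabilistic choice of Eq. (2): pick job j to follow job i
// with probability proportional to eta[i][j]^alpha * tau[i][j]^beta.
final class NextJobChooser {
    static int chooseNext(int i, List<Integer> candidates,
                          double[][] eta, double[][] tau,
                          double alpha, double beta, Random rnd) {
        double[] weights = new double[candidates.size()];
        double total = 0.0;
        for (int k = 0; k < candidates.size(); k++) {
            int j = candidates.get(k);
            weights[k] = Math.pow(eta[i][j], alpha) * Math.pow(tau[i][j], beta);
            total += weights[k];
        }
        // Roulette-wheel selection over the candidate set D.
        double r = rnd.nextDouble() * total;
        double acc = 0.0;
        for (int k = 0; k < candidates.size(); k++) {
            acc += weights[k];
            if (r <= acc) {
                return candidates.get(k);
            }
        }
        return candidates.get(candidates.size() - 1); // numerical safety
    }
}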
2.3 Updating the Pheromone
The pheromone deposit is based on the quality of the solution [13]: the better the solution, the more the pheromone values increase along it. The pheromone deposit formula is

τij = τij + ρ / (2 · T),   (3)

where ρ is the pheromone deposit value, set beforehand, and T is the completion time of the solution. An elitist approach is used: after each iteration only the iteration-best and overall-best solutions are strengthened. At the beginning, poor solutions may be chosen and strengthened, which can make the algorithm converge to such a solution and ignore others. To prevent this, the pheromone evaporates at a constant rate during each iteration [13]. This allows new and better solutions to be found and older ones to be forgotten. The pheromone evaporation formula is

τij = τij · χ,   (4)

where χ is the evaporation rate and holds a value between 0 and 1.
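A minimal sketch of the update step of Eqs. (3) and (4) follows (illustrative names only; the pheromone matrix tau is indexed by pairs of consecutive jobs, matching Eq. (2)):

// Minimal sketch of the pheromone update of Eqs. (3) and (4).
// 'tau' is the pheromone matrix; 'rho' and 'chi' are the deposit value and
// evaporation rate described in the text (names are ours, not the authors').
final class PheromoneUpdate {

    // Eq. (3): strengthen the pheromone along one (elite) solution, where
    // 'solutionSequence' lists the jobs in the order they were scheduled and
    // 'T' is the completion time (makespan) of that solution.
    static void deposit(double[][] tau, int[] solutionSequence, double rho, double T) {
        for (int k = 0; k + 1 < solutionSequence.length; k++) {
            int i = solutionSequence[k];
            int j = solutionSequence[k + 1];
            tau[i][j] += rho / (2.0 * T);
        }
    }

    // Eq. (4): evaporate every entry of the pheromone matrix using factor chi.
    static void evaporate(double[][] tau, double chi) {
        for (double[] row : tau) {
            for (int j = 0; j < row.length; j++) {
                row[j] *= chi;
            }
        }
    }
}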
2.4 Dynamic Evaporation Approach
We thought that having the pheromone evaporate constantly and at a slow rate brings unnecessary computations during each iteration and slows down the algorithm. The purpose of evaporation is to forget suboptimal solutions that have gathered enough pheromone along them to attract other ants unnecessarily. Why not trigger the evaporation only when it is needed? Also, a static evaporation rate may not be optimal for every problem instance, and could bring premature convergence to a particular solution. For this purpose, we monitor the pheromone levels along the solutions that have pheromone deposited on them. These are the most likely to have the highest levels of pheromone in the pheromone matrix. If a pheromone value passes a preset threshold, the evaporation process is triggered. A higher evaporation value is used, since we only want to trigger this step occasionally.
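The triggered evaporation could be sketched as follows (again an illustrative sketch; threshold and strongChi stand for the preset threshold and the higher evaporation value mentioned above):

// Sketch of the threshold-triggered (dynamic) evaporation described above:
// monitor the pheromone along the reinforced solutions and apply Eq. (4),
// with a higher evaporation value, only when some entry exceeds the threshold.
final class DynamicEvaporation {
    static void maybeEvaporate(double[][] tau, int[][] reinforcedSolutions,
                               double threshold, double strongChi) {
        boolean exceeded = false;
        for (int[] seq : reinforcedSolutions) {
            for (int k = 0; k + 1 < seq.length; k++) {
                if (tau[seq[k]][seq[k + 1]] > threshold) {
                    exceeded = true;
                }
            }
        }
        if (exceeded) {
            for (double[] row : tau) {
                for (int j = 0; j < row.length; j++) {
                    row[j] *= strongChi;   // higher evaporation value, applied occasionally
                }
            }
        }
    }
}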
2.5 Results
For our simulations we used the j120 set from the Project Scheduling Problem Library [21]. These benchmarks contain 120 jobs and 4 resource types, with a fixed amount available of each resource type. Each result represents the average over the first 20 instances of the benchmark, each run 50 times with a time budget of 10 seconds on a machine with a 2.2 GHz processor. We set α and β to 1 and χ to 0.01 [14]. ρ was set to 100, and 10 ants per iteration was found to be the best setting.
Fig. 4. Pheromone evaporation approaches
The graph in Fig. 4 shows the difference between the constant evaporation approach and the dynamic one. The first approach has a 1% evaporation rate, while the second has a 20% evaporation rate and a 90% threshold. The results show that the dynamic approach gives slightly better results. We have also compared ACO to other approaches: a genetic algorithm and beam-search. The genetic algorithm has 4 steps: selection, reproduction, evaluation, and forming the new population. It starts from an initial population of solutions built using the same heuristic as ACO, but without the pheromone information. In this implementation we have chosen all the individuals to participate equally in forming the new generation. Pairs are randomly selected and 2 children are formed using a 2-point crossover operator and a mutation operator that interchanges each job in the solution with another with a 50% probability, as long as the precedence constraints are fulfilled. The evaluation step uses directly the solution time of each individual, and the new generation is formed from both old and new individuals by keeping only the best half. The beam-search algorithm is a variation of the branch-and-bound algorithm. It starts from the first node and constructs all the partial solutions that can be formed by incrementally adding one more job. These partial solutions are then compared through the use of another heuristic and only a fixed number k are selected to be expanded further. The heuristic η^k_ij for choosing Jj after Ji for the partial solution k is

η^k_ij = t_k / η_ij,   (5)

where t_k is the current time that solution k has reached and η_ij is the heuristic used in the Ant Colony Optimization approach for choosing Jj after Ji. The Ant Colony Optimization approach clearly performs better than the other algorithms considered.

Fig. 5. Comparison between ACO, genetic, and beam-search algorithms
3 Conclusion
Metaheuristic algorithms have found wide application in scheduling problems. This paper considered applying the ACO metaheuristic to the Resource-Constrained Project Scheduling Problem. We have presented results of numerical experiments involving pheromone evaporation. Dynamic evaporation brings slightly better results, and overall ACO performs much better than the other methods considered. Further research will focus on testing hybrid approaches, combining the ACO metaheuristic with genetic algorithms.
References 1. Abdelaziz, F.B., Krichen, S., Dridi, O.: A Multiobjective Resource-Constrained Project-Scheduling Problem. In: Mellouli, K. (ed.) ECSQARU 2007. LNCS (LNAI), vol. 4724, pp. 719–730. Springer, Heidelberg (2007)
2. Angus, D.: Ant Colony Optimization: From Biological Inspiration to an Algorithmic Framework. Technical Report: TR013, Centre for Intelligent Systems & Complex Processes, Faculty of Information & Communication Technologies, Swinburne University of Technology Melbourne, Australia (2006) 3. Blum, C.: Theoretical and Practical Aspects of Ant Colony Optimization. Dissertations in Artificial Intelligence, vol. 282. Akademische Verlagsgesellschaft Aka GmbH, Berlin (2004) 4. Brucker, P.: Scheduling Algorithms. Springer, Heidelberg (2001) 5. Cook, W.J., Cunningham, W.H., Pulleyblank, W.R., Schrijver, A.: Combinatorial Optimization, 1st edn. John Wiley & Sons, Chichester (1997) 6. Dorigo, M.: Optimization, learning and natural algorithms, Ph.D. Thesis, Dip Elettronica e Informazione, Politecnico di Milano, Italy (1992) 7. Dorigo, M., Birattari, M., St¨ utzle, T.: Ant Colony Optimization. Artificial Ants as a Computational Intelligence Technique, IRIDIA — Technical Report Series Technical Report No. TR/IRIDIA/2006-023 (September 2006) 8. Dorigo, M., St¨ utzle, T.: Ant Colony Optimization. MIT Press, Cambridge (2005) 9. Hartman, S.: A Self-Adapting Genetic Algorithm for Project Scheduling under Resource Constraints. Naval Research Logistics 49, 433–448 (2002) 10. P˘ aun, G.: Membrane computing: some non-standard ideas. In: Jonoska, N., P˘ aun, G., Rozenberg, G. (eds.) Aspects of Molecular Computing. LNCS, vol. 2950, pp. 322–337. Springer, Heidelberg (2003) 11. Lenstra, J.K., Rinnooy Kan, A.H.G.: Complexity of Scheduling under Precedence Constraints. Oper. Res. 26, 22–35 (1978) 12. Daniel, M., Middendorf, M., Schmeck, H.: Pheromone Evaluation in Ant Colony Optimization. In: 26th Annual Conf. of the IEEE, vol. 4, pp. 2726–2731 (2000) 13. Merkle, D., Middendorf, M., Schmeck, H.: Ant Colony Optimization for ResourceConstrained Project Scheduling. IEEE Transactions on Evolutionary Computation 6(4), 333–346 (2002) 14. Olteanu, A.-L.: Ant Colony Optimization Meta-Heuristic in Project Scheduling. In: 8th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases, World Scientific and Engineering Academy and Society (WSEAS), Stevens Point, Wisconsin, USA, pp. 29–34 (2009) 15. Papadimitriou, C.H., Steiglitz, K.: Combinatorial Optimization: Algorithms and Complexity. Dover Pubns. (1998) ISBN 0-486-40258-4 16. Paun, G.: Membrane computing — An Introduction. Natural Computing Series. Springer, Heidelberg (2002) 17. St¨ utzle, T., Dorigo, M.: ACO algorithms for the Traveling Salesman Problem. Evolutionary Algorithms in Engineering and Computer Science. In: Evolutionary Algorithms in Engineering and Computer Science. Recent Advances in Genetic Algorithms, Evolution Strategies, Evolutionary Programming, Genetic Programming and Industrial Applications, ch. 9, pp. 163–184. Wiley, Chichester (1999) 18. Tavares, L.V., Weglarz, J.: Project Management and Scheduling: A Permanent Challenge for OR. European Journal of Operational Research 49(1), 1–2 (1990) 19. Valente, J.M.S., Alves, R.A.F.S.: Beam-search Algorithms for the early/tardy Scheduling Problem with Release Dates Investiga¸ca ˜o – Trabalhos em curso 143 (2004) 20. Wall, M.B.: A Genetic Algorithm for Resource-Constrained Scheduling. MIT Press, Cambridge (1996) 21. Project Scheduling Problem Library, http://129.187.106.231/psplib/main.html
High-Performance Heuristics for Optimization in Stochastic Traffic Engineering Problems

Evdokia Nikolova

Massachusetts Institute of Technology, Cambridge, MA, U.S.A.
[email protected]
Abstract. We consider a stochastic routing model in which the goal is to find the optimal route that incorporates a measure of risk. The problem arises in traffic engineering, transportation and even more abstract settings such as task planning (where the time to execute tasks is uncertain), etc. The stochasticity is specified in terms of arbitrary edge length distributions with given mean and variance values in a graph. The objective function is a positive linear combination of the mean and standard deviation of the route. Both the nonconvex objective and exponentially sized feasible set of available routes present a challenging optimization problem for which no efficient algorithms are known. In this paper we evaluate the practical performance of algorithms and heuristic approaches which show very promising results in terms of both running time and solution accuracy.
1 Introduction
Consider the problem one faces every day to go to work: find the best route between two points in a complex network of exponentially many routes, filled with uncertainty. In the absence of uncertainty, there are known polynomial-time algorithms, as well as numerous metaheuristics developed to yield very fast and practical running times for shortest path computations. However, the case of uncertainty still leaves unresolved theoretical problems and calls for practical metaheuristic methods. Note that with uncertainty it is not even clear how to define the optimal route: is it the route that minimizes the expected travel time? Or its variance, or some other metric? In this paper we consider the natural definition of an optimal route as the one minimizing a convex combination of mean and standard deviation. As it turns out, the solution approaches for finding the optimal route under this metric also yield solutions to the related problem of arriving on time (in which the optimal path maximizes the probability that the route travel time does not exceed a given deadline). The stochasticity is defined in terms of given mean and variance values for each edge in the network, under arbitrary distributions. With this definition of an optimal route, traditional methods of solving shortest path problems fail (the problem no longer has the property that a subpath of an optimal path is optimal, and thus one cannot use dynamic programming techniques to solve it). The structure of the problem allows us to reduce the number of local optima so that we do not need to examine all exponentially many routes. However, in the worst case the local optima are still superpolynomially many. In the current paper we evaluate the performance of several algorithms and heuristic approaches. First, for a class of grid graphs, we provide experimental results showing that the number of local optima is sublinear and thus examining all of them can be done efficiently and yields the exact optimum. Second, for the purpose of practical implementation, it is preferable to examine only a small (constant or logarithmic) number of local optima. To this end, we examine heuristics that pick a small subset of the local optima, and provide bounds and experimental evaluation of the quality of the resulting solution.
2 Problem Statement and Preliminaries
We are given a graph G with n nodes and m edges and are interested in finding a route between a specified source node S and a destination node T. The edge lengths in the graph are stochastic and come from arbitrary independent¹ distributions with given mean μi and variance τi (which can be different for different edges i). Our goal is to find the optimal route that incorporates a measure of risk. As such, we define the optimal route via a natural family of objectives, namely to minimize a convex combination of the route's mean and standard deviation:

minimize   α Σ_{i∈P} μi + (1 − α) √( Σ_{i∈P} τi )   (1)
such that  P is an ST-path.

The summations above are over the edges i in a given ST-route and the minimization is over all valid ST-routes P. The parameter α ∈ [0, 1] specifies the objective function (convex combination) of choice. It will be helpful to consider a continuous formulation of the above discrete problem, by denoting an ST-route by its corresponding incidence vector x = (x1, ..., xm), where xi = 1 if edge i is present in the route and xi = 0 otherwise. Denote also the vector of means of all edges by μ = (μ1, ..., μm) and the vector of variances by τ = (τ1, ..., τm). The set of all feasible routes is represented by {0,1}-vectors x ∈ Rm, which are a subset of the vertices of the unit hypercube in m dimensions. The convex hull of these feasible vectors is called the path polytope. Thus, the continuous formulation of the routing problem is:

minimize   α μT x + (1 − α) √(τT x)   (2)
such that  x ∈ path polytope.

¹ The results here generalize to dependent distributions; however, we focus this presentation on the independent case for clarity and brevity of the mathematical exposition.
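For concreteness, evaluating objective (1) for one fixed path is a direct computation (an illustrative sketch only; the arrays mu and tau of per-edge means and variances and the list of edge indices on the path are assumed inputs):

// Sketch: value of objective (1) for a fixed ST-path, given the means mu[e]
// and variances tau[e] of the edges e on that path and the trade-off alpha.
static double pathObjective(double[] mu, double[] tau, int[] pathEdges, double alpha) {
    double meanSum = 0.0, varSum = 0.0;
    for (int e : pathEdges) {
        meanSum += mu[e];
        varSum += tau[e];   // variances of independent edges add up
    }
    return alpha * meanSum + (1.0 - alpha) * Math.sqrt(varSum);
}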
This objective function is concave for all values of the parameter α ∈ [0, 1], so it attains its minimum at an extreme point² of the feasible set [2]. This is a key property of the objective which establishes that the optimal solution of the continuous problem (whose feasible set is a superset of the original feasible set) will be a valid path and thus will coincide with the optimal solution of the discrete problem. We emphasize this important observation in the following proposition.

Proposition 1. The optimal solution of the continuous formulation (2) is the same as the optimal solution of the discrete problem (1).

Furthermore, the objective function is monotone increasing in the route mean and variance, thus its optimum also minimizes some convex combination of the mean and variance (as opposed to the mean and standard deviation!).

Proposition 2. The optimal solution of the nonconvex problem (2) minimizes the linear objective βμT x + (1 − β)τT x for some β ∈ [0, 1].

This second observation is critical for yielding a subexponential exact algorithm for the stochastic routing problem. The exact algorithm enumerates the candidate set of extreme points or paths (which is a small subset of all extreme points) in time linear in the number of such paths, which is at most n^{O(log n)} in the worst case. In this paper, we seek to reduce this superpolynomial complexity.

Related Work. Our work is most closely related to the work of Nikolova et al. [10], which proposes and analyzes the worst-case running time of the exact algorithm for a related objective (in the above notation, to maximize the objective (t − μT x)/√(τT x), which arises in maximizing one's probability of arriving on time, that is, arriving within a specified time frame t, under normally distributed edge lengths). Our problem is also related to the parametric shortest path problem [3], in which the edge lengths, instead of being stochastic, are deterministic linear functions of a variable λ and the question is to enumerate all shortest paths over all values of the variable λ ∈ [0, ∞). The literature on stochastic route planning and traffic engineering is vast, though most commonly the different models minimize the expected route cost or length (e.g., [12,11,8]). As such, the work in this domain is typically based on stochastic routing formulations of a very different nature and solutions. A sample of work that is closest to the problem we consider is [9,7,5,1]. In particular, Loui [7] considers monotone increasing cost functions; however, the algorithms proposed have exponential worst-case running time. On the other hand, Fan et al. [5] provide heuristics for a different model of adaptive routing, of unknown approximation guarantee. Nikolova et al. [9] prove hardness results for a broad class of objective functions and provide pseudopolynomial algorithms. Lim et al. [6] provide empirical results showing that the independence assumption of edge distributions does not affect the accuracy of the answer by too much.

² An extreme point of a set C is a point that cannot be represented as a convex combination of two other points in the set C.
Fig. 1. Finding and enumerating the candidate solution-paths P1 , P2 , ...
3 Performance of Exact Algorithm
The exact algorithm for solving the problem can be thought of as a search algorithm on the values of the parameter β from Proposition 2. For a given β and a path P with incidence vector x, the linear objective βμT x + (1 − β)τT x can be separated as a sum of edge weights Σ_{i∈P} (βμi + (1 − β)τi) (note that the variance of the path P equals the sum of the variances of the edges along the path, by the independence of the edge length distributions). Thus, finding the ST-route that minimizes the linear objective can be done with any deterministic shortest path algorithm such as Dijkstra, Bellman-Ford, etc. [4] with respect to the edge weights (βμi + (1 − β)τi). Both the exact and heuristic algorithms we consider consist of a number of calls to a deterministic shortest path algorithm of the user's choice, for appropriately chosen values β: this makes our approach very flexible, since different implementations of shortest path algorithms are more efficient for different types of networks and one can thus take advantage of the most efficient implementations available. We thus characterize the running time performance in terms of the number of such calls, or iterations, to an underlying shortest path algorithm.

The exact stochastic routing algorithm first sets β = 1 and β = 0 and solves the resulting deterministic shortest path problems (namely, it finds the route with smallest mean and the route with smallest variance). Denote the mean and variance of the resulting routes P1 and P2 by (m1, s1) and (m2, s2) respectively. We next set β so that the slope of the linear objective is the same as the slope of the line connecting the points P1 and P2 (see figure 1). Denote the resulting path, if any, by P3. We continue similarly to find a path between P1 and P3 and between P3 and P2, etc., until no further paths are found. If there are k extreme points (paths) minimizing some positive linear objective of the mean and variance, then this algorithm finds all of them with 2k applications of a deterministic shortest path algorithm.

In the worst case the number of extreme points k can be n^{1+log n} [10], that is, superpolynomial (albeit subexponential) and too large to yield an efficient algorithm. However, in most networks and mean-variance values of interest, this number seems to be much lower, implying that the exact algorithm may have a good performance in practice. We thus investigate the performance of the algorithm on a class of networks that are the standard first test set for indicating performance on realistic traffic networks. We consider grid networks of size ranging from 10 × 10 (100 nodes) to 250 × 250 (62,500 nodes). For each network size 10z × 10z (where z = 1, ..., 25), we run 100 instances of the stochastic routing problem. In a single instance, we generate the mean values uniformly at random from [0, 1], and the variance values uniformly at random from the interval [0, mean] for the corresponding edge with an already generated mean value. (By scaling all edge means if necessary, we can assume without loss of generality that the maximum mean has value 1.) Out of these 100 simulations per network size, we record the minimum and the maximum number of extreme points and plot them against the square root of the network size (i.e., the square root of the number of nodes in the network). The resulting plots are shown in figure 2.

Fig. 2. Number of local optima (extreme point paths) in grids of size 10×10 to 250×250, with edge mean length μe ← Uniform[0, 1] and variance τe ← Uniform[0, μe]

To put these empirical results in context: the maximum number of extreme points found in the simulations on a network with 10,000 nodes is k = 45 (meaning the exact algorithm consisted of only 2k = 90 iterations of a deterministic shortest path algorithm to find the optimal stochastic route), as opposed to the predicted worst-case value of 10,000^{1+log 10,000} ≈ 10^{57}! Similarly, the highest number of extreme points found in graphs of 40,000 and 62,500 nodes is k = 75 and k = 92 respectively, as opposed to the theoretical worst-case values of 40,000^{1+log 40,000} ≈ 10^{75} and 62,500^{1+log 62,500} ≈ 10^{81}. In other words, despite the pessimistic theoretical worst-case bound of the exact stochastic routing algorithm, it has a good performance in practice that is orders of magnitude smaller.

In figure 2, we have fitted the maximum number of extreme points to a linear function of √n: the extreme points for a graph with n nodes are always strictly less than √n and asymptotically less than 0.4√n. To ensure that √n is the right function to bound the asymptotics, we have also plotted the number of extreme points with respect to n^0.4 and n^0.6 in figure 3. The latter two plots confirm that the rate of growth of extreme points is faster than Θ(n^0.4) and slower than Θ(n^0.6). On the basis of these empirical results, we conclude that a strict upper bound on the number of iterations of the exact algorithm for any grid graph with n nodes is √n. We leave as an intriguing open problem to give a theoretical proof of this performance bound.

Fig. 3. Number of local optima in grids with n nodes vs n^0.4 (left) and n^0.6 (right)
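To make the enumeration concrete, the following is a minimal sketch of the procedure described at the start of this section; it is illustrative only and not the implementation used in the experiments. It assumes a routine solve(β) that runs any deterministic shortest path algorithm with edge weights βμi + (1 − β)τi and returns the resulting path, summarized here only by its mean m and variance s; all class and method names are ours.

import java.util.ArrayList;
import java.util.List;
import java.util.function.DoubleFunction;

// Sketch of the exact enumeration of extreme-point paths (Section 3).
final class ExtremePathEnumeration {

    static final class Path {
        final double m, s;            // path mean and variance
        Path(double m, double s) { this.m = m; this.s = s; }
        boolean sameAs(Path o) { return m == o.m && s == o.s; }   // sketch-level identity check
    }

    static List<Path> enumerate(DoubleFunction<Path> solve) {
        List<Path> result = new ArrayList<>();
        Path pMean = solve.apply(1.0);     // route with smallest mean (beta = 1)
        Path pVar = solve.apply(0.0);      // route with smallest variance (beta = 0)
        result.add(pMean);
        if (!pVar.sameAs(pMean)) {
            result.add(pVar);
            refine(pMean, pVar, solve, result);
        }
        return result;
    }

    // Choose beta so that the level sets of beta*m + (1-beta)*s have the same slope
    // in the mean-variance plane as the segment joining a and b, then recurse.
    static void refine(Path a, Path b, DoubleFunction<Path> solve, List<Path> result) {
        double slope = (b.s - a.s) / (b.m - a.m);   // negative along the lower frontier
        double beta = -slope / (1.0 - slope);       // since the level-set slope is -beta/(1-beta)
        Path c = solve.apply(beta);
        if (c.sameAs(a) || c.sameAs(b)) {
            return;                                  // no new extreme point between a and b
        }
        result.add(c);
        refine(a, c, solve, result);
        refine(c, b, solve, result);
    }
}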
4 High-Performance Heuristics
In this section we present a heuristic for finding the optimal stochastic route, which dramatically reduces the running time of the exact algorithm above. Instead of enumerating all extreme points, we select a very small subset of them, leading to high practical performance of the algorithm. Remarkably, the big reduction of extreme points does not lead to a big sacrifice in the quality of the resulting solution. Our experiments show that even on the large networks of 40,000 nodes, the heuristic examines only 3 to 6 extreme point paths (compared to the experimental 75 and the theoretical worst-case bound of 40,000^{1+log 40,000} ≈ 10^{75} points of the exact algorithm above), and in all our simulations the value of the solution is within a multiplicative factor of 0.0001 = 0.01% of the optimum. The heuristic again utilizes Proposition 2, but instead of searching all possible parameter values β that yield a different extreme point (path), it tests an appropriate geometric progression of values and selects the best of the resulting small set of paths.

Fig. 4. (left) Plot of all extreme point paths of a 10,000-node network. The optimal path is marked with a circle and labelled 'OPT'. The heuristic algorithm minimizes a small number of linear functions as shown, each yielding one extreme point. It then outputs the extreme point (path) with smallest objective function value. (right) Number of iterations in the heuristic vs the exact algorithm, in 10x10 up to 200x200 grid networks. Recall that the number of iterations to find all extreme points in the exact algorithm is two times the number of extreme points, where each iteration is a call to a deterministic shortest path algorithm of the user's choice.

We illustrate the details of the heuristic through figure 4(left). This figure plots all extreme point paths for a 10,000-node network (with mean and variance values of its edges generated as explained in the previous section). Each point in the figure corresponds to one path in the network, with mean equal to the x-coordinate of the point and variance equal to the y-coordinate of the point. In this mean-variance plot, the optimum happens to be a path with mean 49.83 and variance 21.77 (marked by a circle in the figure). By Proposition 2, and as depicted in the figure, the optimal path minimizes the linear objective βμT x + (1 − β)τT x (this corresponds to a line with slope b = −β/(1 − β)) for some range of parameter values β, or equivalently for some range of slopes. If we could guess the optimal slope, then we would find the optimal route with a single iteration of a deterministic shortest path algorithm with respect to the edge weights βμi + (1 − β)τi. Instead, we test a geometric progression of slopes with a multiplicative step a. The smaller the step a, the more linear objectives we end up testing, which increases the accuracy but also increases the running time of the algorithm. In our simulations, we experiment with different multiplicative steps. It turns out that using a multiplicative step of 1.01 results in a very high accuracy of 99.99%, and also very few extreme point tests and iterations (up to 6 for all graphs with 2,500 to 40,000 nodes).

We compare the running time of the heuristic with that of the exact algorithm in figure 4(right). This plot gives the highest number of deterministic shortest path iterations that each algorithm has run over 100 simulations per network size. Recall again that the exact algorithm needs to run twice as many iterations as the total number of extreme points. The plot shows that the high performance and accuracy of the heuristic make it a very practical and promising approach for the stochastic routing problem.
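A minimal sketch of this slope-sweep heuristic, under the same assumptions as the enumeration sketch in Section 3 (the Path record and the solve(β) oracle are reused from there; the slope range bounds and all names are illustrative):

// Sketch of the slope-sweep heuristic (Section 4): test a geometric progression
// of slopes with multiplicative step 'a' (e.g. 1.01), solve one deterministic
// shortest path problem per slope, and keep the path with the smallest value
// of the original objective alpha*m + (1-alpha)*sqrt(s).
final class SlopeSweepHeuristic {
    static ExtremePathEnumeration.Path bestPath(
            java.util.function.DoubleFunction<ExtremePathEnumeration.Path> solve,
            double alpha, double minSlope, double maxSlope, double a) {
        ExtremePathEnumeration.Path best = null;
        double bestValue = Double.POSITIVE_INFINITY;
        for (double slope = minSlope; slope <= maxSlope; slope *= a) {
            double beta = slope / (slope + 1.0);   // level-set slope -beta/(1-beta) = -slope
            ExtremePathEnumeration.Path p = solve.apply(beta);
            double value = alpha * p.m + (1.0 - alpha) * Math.sqrt(p.s);
            if (value < bestValue) {
                bestValue = value;
                best = p;
            }
        }
        return best;
    }
}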
5 Conclusion
We investigated the practical performance of exact and heuristic algorithms for the stochastic routing problem in which the goal is to find the route minimizing a positive linear combination of the route mean and standard deviation. The latter is a nonconvex integer optimization problem, for which no efficient algorithms are known. Our experimental results showed that the exact algorithm, which is based on enumerating all paths that are potential local optima (extreme points of the feasible set), has surprisingly good running time performance O(√n)R on networks of practical interest, compared to its predicted theoretical worst-case performance n^{O(log n)}R, where R is the running time of any deterministic shortest path algorithm of the user's choice. We also showed that a heuristic that appropriately selects a small subset of the potentially optimal paths has very high performance, using a small constant number of deterministic shortest path iterations and returning a solution that has a 99.99% accuracy. Heuristics of this type are thus a very promising practical approach.
References 1. Ackermann, H., Newman, A., R¨ oglin, H., V¨ ocking, B.: Decision making based on approximate and smoothed pareto curves. Theor. Comput. Sci. 378(3), 253–270 (2007) 2. Bertsekas, D., Nedi´c, A., Ozdaglar, A.: Convex Analysis and Optimization. Athena Scientific, Belmont (2003) 3. Carstensen, P.: The complexity of some problems in parametric linear and combinatorial programming. Ph.D. Dissertation, Mathematics Dept., Univ. of Michigan, Ann Arbor, Mich. (1983) 4. Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to Algorithms, 2nd edn. The MIT Press, Cambridge (2001) 5. Fan, Y., Kalaba, R., Moore II, J.E.: Arriving on time. Journal of Optimization Theory and Applications 127(3), 497–513 (2005) 6. Lim, S., Balakrishnan, H., Gifford, D., Madden, S., Rus, D.: Stochastic motion planning and applications to traffic. In: Proceedings of the Eighth International Workshop on the Algorithmic Foundations of Robotics (WAFR), Guanajuato, Mexico (December 2008) (accepted) 7. Loui, R.P.: Optimal paths in graphs with stochastic or multidimentional weights. Communications of the ACM 26, 670–676 (1983) 8. Miller-Hooks, E.D., Mahmassani, H.S.: Least expected time paths in stochastic, time-varying transportation networks. Transportation Science 34, 198–215 (2000)
9. Nikolova, E., Brand, M., Karger, D.R.: Optimal route planning under uncertainty. In: Long, D., et al. (eds.) Proceedings of the Sixteenth International Conference on Automated Planning and Scheduling, pp. 131–141. AAAI, Menlo Park (2006) 10. Nikolova, E., Kelner, J.A., Brand, M., Mitzenmacher, M.: Stochastic shortest paths via quasi-convex maximization. In: Azar, Y., Erlebach, T. (eds.) ESA 2006. LNCS, vol. 4168, pp. 552–563. Springer, Heidelberg (2006) 11. Papadimitriou, C., Yannakakis, M.: Shortest paths without a map. Theoretical Computer Science 84, 127–150 (1991) 12. Polychronopoulos, G., Tsitsiklis, J.: Stochastic shortest path problems with recourse. Networks 27(2), 133–143 (1996)
Optimization of Complex SVM Kernels Using a Hybrid Algorithm Based on Wasp Behaviour

Dana Simian, Florin Stoica, and Corina Simian

University "Lucian Blaga" of Sibiu, Faculty of Sciences, 5-7 dr. I. Raţiu str, 550012 Sibiu, România
Abstract. The aim of this paper is to present a new method for the optimization of SVM multiple kernels. The kernel substitution can be used to define many other types of learning machines distinct from SVMs. We introduce a new hybrid method which uses, in the first level, an evolutionary algorithm based on wasp behaviour and on the co-mutation operator LR − Mijn and, in the second level, an SVM algorithm which computes the quality of the chromosomes. The most important details of our algorithms are presented. Testing and validation show that multiple kernels obtained using our genetic approach improve the classification accuracy up to 94.12% for the "leukemia" data set.
1 Introduction
The classification task is to assign an object to one or several classes, based on a set of attributes. A classification task supposes the existence of training and testing data given in the form of data instances. Each instance in the training set contains one target value, named the class label, and several attributes, named features. The accuracy of the model for a specific test set is defined as the percentage of test set items that are correctly classified by the model. If the accuracy is acceptable, the model can be used to classify items for which the class label is unknown. Two types of approaches for classification can be defined: classical statistical approaches (discriminant analysis, generalized linear models) and modern statistical machine learning (neural networks, evolutionary algorithms, support vector machines — SVM, belief networks, classification trees, Gaussian processes). In recent years, SVMs have become a very popular tool for machine learning tasks and have been successfully applied in classification, regression, and novelty detection. SVM has been applied in various fields: particle identification, face identification, text categorization, bioinformatics, database marketing, classification of clinical data. The goal of SVM is to produce a model which predicts the target value of data instances in the testing set, for which only the attributes are given. Training involves optimization of a convex cost function. If the data set is separable we obtain an optimal separating hyperplane with a maximal margin (see Vapnik [12]). In the case of non-separable data a successful method is the kernel method. Using an appropriate kernel, the data are projected into a higher-dimensional space in which they are separable by a hyperplane [2,12].
Standard SVM classifiers use a single kernel. Many recent studies have shown that multiple kernels work better than single ones. It is very important to build multiple kernels adapted to the input data. For the optimization of the weights, two different approaches are used: one possibility is to use a linear combination of simple kernels and to optimize the weights [1,3,7]; another is to use evolutionary methods [8] for optimizing the weights. Complex nonlinear multiple kernels, built by hybrid approaches combining a genetic algorithm and an SVM algorithm, were proposed in [3,4,5,6,10,11]. The aim of this paper is to introduce a hybrid method for kernel optimization based on a genetic algorithm which uses an improved co-mutation operator within a modified wasp behaviour computational model. The paper is organized as follows: Section 2 briefly presents the kernel substitution method. In Sections 3 and 4 we characterize the co-mutation operator LR − Mijn and the wasp behaviour computational model. Section 5 contains our main results: the construction and evaluation of our method. Conclusions and further directions of study are presented in Section 6.
2 Kernel Substitution Method
The SVM algorithm can solve the problem of binary or multiclass classification. We focus in this paper only on binary classification, due to the numerous methods for generalizing a binary classifier to an n-class classifier [12]. Let the data points xi ∈ Rd, i = 1, . . . , m, and their labels yi ∈ {−1, 1} be given. We are looking for a decision function f which associates to each input data x its correct label y = f(x). For data sets which are not linearly separable we use the kernel method, which projects the input data X into a feature Hilbert space F, φ : X → F; x → φ(x). The kernel represents the inner product of data objects implicitly mapped into a higher-dimensional Hilbert space of features. Kernel functions can be interpreted as representing the inner product of the nonlinear feature space,

K(xi, xj) = ⟨φ(xi), φ(xj)⟩.

The functional form of the mapping φ(xi) does not need to be known. The basic properties of a kernel function are derived from Mercer's theorem. Different simple kernels are often used: linear, polynomial, RBF, sigmoidal. These kernels will be defined in Subsection 5.1. Multiple kernels can be designed using the set of operations (+, ∗, exp), which preserves Mercer's conditions.
3 The Co-mutation LR − Mijn Operator
We introduced in [11] a new co-mutation operator, named LR − Mijn, which makes long jumps, finding the longest sequence of σp elements situated to the left or to the right of the position p. Let us denote by σ a generic string, σ = σ_{l−1} . . . σ_0, where σ_q ∈ A, ∀q ∈ {0, . . . , l − 1}, with A = {a_1, . . . , a_s} a generic alphabet. The set of all sequences of length l over the alphabet A is denoted by Σ = A^l. Through σ(q, i) we denote that on position q within the sequence σ there is the symbol a_i of the alphabet A. σ_{p,j}^z denotes the presence of z symbols a_j within the sequence σ, starting from position p and going left, and σ_{left,m}^{right,n}(p, i) specifies the presence of the symbol a_i on position p within the sequence σ, between right symbols a_n on the right and left symbols a_m on the left. We suppose that σ = σ(l − 1) . . . σ(p + left + 1, m) σ_{left,m}^{right,n}(p, i) σ(p − right − 1, n) . . . σ(0). The expression of LR − Mijn depends on the position of p. There are 4 different cases: p ≠ right, p ≠ l − left − 1; p = right, p ≠ l − left − 1; p ≠ right, p = l − left − 1; and p = right, p = l − left − 1. The expression of LR − Mijn in all these cases can be found in [11].

As an example, let us consider the binary case, the string σ = 11110000 and the randomly chosen application point p = 2. In this case σ_2 = 0, so we have to find the longest sequence of 0 within the string σ, starting from position p. This sequence goes to the right, and because we have reached the end of the string and no occurrence of 1 has been met, the new string obtained after the application of LR − Mijn is 11110111. The most interesting point in the co-mutation operator presented above is the fact that it allows long jumps, which are not possible in the classical Bit-Flip Mutation (BFM); thus the search can reach points very far from where the search currently is. Our co-mutation operator has the same capabilities of local search as ordinary mutation, and it also has the possibility of performing jumps to reach far regions in the search space which cannot be reached by BFM. This leads to a better convergence of an evolutionary algorithm based on LR − Mijn in comparison with classical mutation operators. This statement was verified through experimental results in [11], where a simple evolutionary algorithm based only on selection and LR − Mijn is used for the optimization of some well-known benchmark functions (functions proposed by Rastrigin, Schwefel and Griewangk).
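To make the example concrete, the following sketch reproduces only the binary boundary case illustrated above (the run of σp reaches an end of the string without meeting the other symbol and is flipped); the general operator distinguishes the four cases listed above, whose full expressions are given in [11]. This is our illustration, not the authors' code.

// Illustrative sketch of LR-Mijn for the binary boundary case shown above:
// starting at position p (counted from the right, as in sigma = s_{l-1}...s_0),
// follow the run of the symbol sigma_p towards the string end; if the end is
// reached without meeting the other symbol, flip the whole run.
// Example: sigma = "11110000", p = 2  ->  "11110111".
static String lrMijnBoundaryCase(String sigma, int p) {
    int l = sigma.length();
    char[] s = sigma.toCharArray();
    int idx = l - 1 - p;               // convert "position from the right" to array index
    char symbol = s[idx];

    // Follow the run of 'symbol' to the right of p (towards position 0).
    int end = idx;
    while (end < l && s[end] == symbol) {
        end++;
    }
    if (end == l) {                    // boundary reached: flip the whole run
        for (int k = idx; k < l; k++) {
            s[k] = (symbol == '0') ? '1' : '0';
        }
    }
    // The non-boundary cases (a different symbol terminates the run) and the
    // search towards the left are omitted here; see [11] for the full definition.
    return new String(s);
}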
4 Wasp Behaviour Computational Models
In a real colony of wasps, individual wasp interacts with its local environment in the form of a stimulus-response mechanism, which governs distributed task allocation. An individual wasp has a response threshold for each zone of the nest. Based on a wasp’s threshold for a given zone and the amount of stimulus from brood located in this zone, a wasp may or may not become engaged in the task of foraging for this zone. A hierarchical social order among the wasps of the colony is formed through interactions among individual wasps of the colony. When two individuals of the colony encounter each other, they may with some probability interact in a dominance contest. The wasp with the higher social rank will have a higher probability of dominating in the interaction. Computational analogies of these systems have served as inspiration for wasp behaviour computational
models. An algorithm based on wasp behaviour is essentially a system of agents that simulate the natural behaviour of insects. An artificial wasp probabilistically decides whether or not it bids for a task; the probability depends on the level of the threshold and of the stimulus. A hierarchical order among the artificial wasps is established using a probability function. Models using wasp agents are used for solving large, complex problems with a dynamic character, and for the distributed coordination of resources and manufacturing control tasks in a factory. The elements which particularize an algorithm based on wasp behaviour are the way in which the response thresholds are updated and the way in which the "conflicts" between two or more wasps are solved.
5 Main Results: The Model for Constructing Complex SVM Kernels

5.1 Basic Construction
Our goal is to build and analyze a multiple kernel starting from the following simple kernels:

– Polynomial: K_pol^{d,r}(x1, x2) = (x1 · x2 + r)^d, r, d ∈ Z+
– RBF: K_RBF^γ(x1, x2) = exp(−|x1 − x2|² / (2γ²))
– Sigmoidal: K_sig^γ(x1, x2) = tanh(γ · x1 · x2 + 1)
We use the idea of the model proposed in [5]. The hybrid technique is structured in two levels: a macro level and a micro level. The macro level is represented by a genetic algorithm which builds the multiple kernel. The micro level is represented by the SVM algorithm which computes the quality of the chromosomes. The accuracy rate is computed by the SVM algorithm on a validation set of data. In the first level, we build and evaluate multiple kernels using the set of operations opi ∈ {+, ∗, exp}, i = 1, . . . , 3, and a genetic algorithm based on a modified LR − Mijn operator using a wasp-based computational scheme. In Fig. 1 the multiple kernel (K1 op2 K2) op1 (K3 op3 K4) is represented; if a node contains the operation exp, only one of its descendants is considered (the "left" kernel). We consider in our construction at most 4 simple kernels.

Fig. 1. General representation of multiple kernel

Every chromosome codes the expression of a multiple kernel. The chromosome which codes the multiple kernel described above has the structure given in Fig. 2, where opi ∈ {+, ∗, exp}, i ∈ {1, 2, 3}, ti ∈ {POL, RBF, SIG} represents the type of the kernel, and di, ri, γi are parameters from the definition of the kernel Ki, i ∈ {1, . . . , 4}.

Fig. 2. Chromosome's structure

Each operation opi is represented using two genes, the type of kernel ti is also represented using two genes, 4 genes are allocated for a degree dj, and the parameter ri is represented using 12 genes. If the associated kernel is not polynomial, these 16 genes are used to represent a real value of the parameter γi in place of di and ri. Thus, our chromosome is composed of 78 genes.
5.2 Optimization of the LR − Mijn Operator Using a Wasp-Based Computational Scheme
In this subsection we present our approach to the optimization of the LR − Mijn operator using a scheme based on a computational model of wasp behaviour. Our intention was to allow a faster changing of the operations in the multiple kernel's structure. The representation length for one operation inside the chromosome's structure is 2 genes, significantly less than the representation length for the multiple kernel's parameters (4 and 12 genes). Therefore, usually, in the optimization process, the probability of changing the multiple kernel's parameters is bigger than the probability of changing the operations of the multiple kernel. Each chromosome C has an associated wasp. Each wasp has a response threshold θC. The set of operations coded within the chromosome broadcasts a stimulus SC which is equal to the difference between the maximum classification accuracy (100) and the actual classification accuracy obtained using the multiple kernel coded in the chromosome:

SC = 100 − CAC.   (1)

The modified LR − Mijn operator will perform a mutation that changes the operations coded within the chromosome with probability

P(θC, SC) = SC² / (SC² + θC²).   (2)
The threshold values θC may vary in the range [θmin, θmax]. When the population of chromosomes is evaluated, the threshold θC is updated as follows:

θC = θC − δ, δ > 0,   (3)

if the classification accuracy of the new chromosome C is lower than in the previous step, and

θC = θC + δ, δ > 0,   (4)

if the classification accuracy of the new chromosome C is greater than in the previous step.
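A small sketch of this scheme is given below (names are ours; clamping θC to [θmin, θmax] is our reading of the statement that the threshold varies in that range):

// Sketch of the wasp-based decision of Eqs. (1)-(4): each chromosome C has a
// threshold thetaC; its stimulus is S_C = 100 - classification accuracy, and
// the operations coded in C are mutated with probability S^2 / (S^2 + theta^2).
final class WaspScheme {
    static boolean mutateOperations(double accuracy, double theta, java.util.Random rnd) {
        double s = 100.0 - accuracy;                       // Eq. (1)
        double p = (s * s) / (s * s + theta * theta);      // Eq. (2)
        return rnd.nextDouble() < p;
    }

    // Eqs. (3)-(4): lower the threshold if the accuracy decreased, raise it if
    // it increased, keeping it inside [thetaMin, thetaMax].
    static double updateThreshold(double theta, double delta,
                                  double newAccuracy, double oldAccuracy,
                                  double thetaMin, double thetaMax) {
        if (newAccuracy < oldAccuracy) {
            theta -= delta;
        } else if (newAccuracy > oldAccuracy) {
            theta += delta;
        }
        return Math.max(thetaMin, Math.min(thetaMax, theta));
    }
}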
5.3 SVM Algorithm
The evaluation of the chromosome is made using the SVM algorithm for a particular set of data. To do this we divide the data into two subsets: the training subset, used for problem modeling, and the test subset, used for evaluation. The training subset is further randomly divided into a subset for learning and a subset for validation. The SVM algorithm uses the data from the learning subset for training and the validation subset for computing the classification accuracy, which is used as the fitness function for the genetic algorithm. For the implementation, testing and validation of our method we used the "leukemia" data set from the LIBSVM data sets page [2]. In order to replace the default polynomial kernel from libsvm, we extend the svm_parameter class with the following attributes:

// our kernel is "hybrid"
public static final int HYBRID = 5;
// parameters for multiple polynomial kernels
public long op[];
public long type[];
public long d[];
public long r[];
public double g[];
The class svm_predict was extended with the method predict(long op[], long type[], long d[], long r[], double g[]). The Kernel class was modified to accomplish the kernel substitution. In the k_function method we have the computation of our simple kernels. Then, the kernels are combined using the operations given in the array param.op[]. In the genetic algorithm, the operations, the types of the simple kernels, and all other parameters are obtained from a chromosome, which is then evaluated using the result of the predict method presented above. After the end of the genetic algorithm, the best chromosome gives the multiple kernel, which can be evaluated on the test subset of data. The way of constructing this multiple kernel ensures that it is a veritable kernel, that is, it satisfies Mercer's conditions.
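As an illustration of the combination step, the evaluation of the multiple kernel (K1 op2 K2) op1 (K3 op3 K4) could look as follows. This is a simplified sketch, not the actual libsvm modification; the numeric operation codes (0 for '+', 1 for '*', 2 for 'exp') and all names are our assumptions.

// Simplified sketch of the multiple-kernel evaluation K = (K1 op2 K2) op1 (K3 op3 K4).
// type[i], d[i], r[i], g[i] are the per-kernel parameters decoded from the
// chromosome; op[0..2] are the three operations (assumed codes: 0:'+', 1:'*', 2:'exp').
final class MultipleKernel {
    static final int POL = 0, RBF = 1, SIG = 2;

    static double simpleKernel(int type, double d, double r, double g, double[] x1, double[] x2) {
        double dot = 0.0, dist2 = 0.0;
        for (int i = 0; i < x1.length; i++) {
            dot += x1[i] * x2[i];
            dist2 += (x1[i] - x2[i]) * (x1[i] - x2[i]);
        }
        switch (type) {
            case POL: return Math.pow(dot + r, d);               // (x1·x2 + r)^d
            case RBF: return Math.exp(-dist2 / (2.0 * g * g));   // exp(-|x1-x2|^2 / 2g^2)
            default:  return Math.tanh(g * dot + 1.0);           // tanh(g·x1·x2 + 1)
        }
    }

    // Apply an operation; for 'exp' only the left operand is used, as in Fig. 1.
    static double combine(int op, double left, double right) {
        switch (op) {
            case 0:  return left + right;
            case 1:  return left * right;
            default: return Math.exp(left);
        }
    }

    static double evaluate(int[] op, int[] type, double[] d, double[] r, double[] g,
                           double[] x1, double[] x2) {
        double k1 = simpleKernel(type[0], d[0], r[0], g[0], x1, x2);
        double k2 = simpleKernel(type[1], d[1], r[1], g[1], x1, x2);
        double k3 = simpleKernel(type[2], d[2], r[2], g[2], x1, x2);
        double k4 = simpleKernel(type[3], d[3], r[3], g[3], x1, x2);
        return combine(op[0], combine(op[1], k1, k2), combine(op[2], k3, k4));
    }
}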
5.4 Experimental Results
Using the standard libsvm package, the following classification accuracy is obtained for the "leukemia" data set:

java -classpath libsvm.jar svm_predict leu.t leu.model leu.output
Accuracy = 67.64705882352942% (23/34) (classification)

Multiple kernels obtained using the genetic approach improve the classification accuracy up to 94.12%. In Fig. 3, results from three runs of our genetic algorithm based on the modified LR − Mijn operator are presented. For each execution, the dimension of the population was 35 and the number of generations was 30.
Fig. 3. Classification accuracy using multiple kernels
Fig. 4. Optimal kernels
One “optimal” multiple kernel obtained is depicted in fig. 4 (left), where γ = 1.97, d1 = 3, r1 = 609, d2 = 2, r2 = 3970, d3 = 1, r3 = 3615. Another “optimal” multiple kernel obtained is depicted in fig. 4 (right), where γ = 0.5, d1 = 3, r1 = 633, d2 = 2, r2 = 3970, d3 = 1, r3 = 4095.
6 Conclusions and Further Directions of Study
In this paper we presented a hybrid approach for the optimization of SVM multiple kernels. The idea of using hybrid techniques for optimizing multiple kernels is not new, but it is very recent, and the way in which we designed the first level of the method is original. Modifying the co-mutation operator LR − Mijn using a wasp behaviour model improves the probability of choosing new operations in the structure of the multiple kernels. We observed that the parameter r of the simple polynomial kernels which appear in the optimal multiple kernel is large. After many executions we also observed that a better accuracy and convergence is obtained if we impose an upper limit for the parameter γ of the sigmoidal/RBF simple kernels. Experimental results show that the genetic algorithm based on the modified LR − Mijn co-mutation operator has a better convergence and improves the accuracy compared with the classical genetic algorithm used in [2]. Further numerical experiments are required in order to assess the power of our evolved kernels.
References 1. Bach, F.R., Lanckriet, G.R.G., Jordan, M.I.: Multiple kernel learning, conic duality, and the SMO algorithm Machine Learning. In: Proceedings of ICML 2004. ACM International Conference Proceeding Series 69. ACM, New York (2004)
2. Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines (2001), http://www.csie.ntu.edu.tw/~ cjlin/libsvm 3. Diosan, L., Oltean, M., Rogozan, A., Pecuchet, J.P.: Improving SVM Performance Using a Linear Combination of Kernels. In: Beliczynski, B., Dzielinski, A., Iwanowski, M., Ribeiro, B. (eds.) ICANNGA 2007. LNCS, vol. 4432, pp. 218–227. Springer, Heidelberg (2007) 4. Diosan, L., Rogozan, A., Pecuchet, J.P.: Une approche evolutive pour generer des noyaux multiples (An evolutionary approach for generating multiple kernels). Portal VODEL (2008), http://vodel.insarouen.fr/publications/rfia 5. Diosan, L., Oltean, M., Rogozan, A., Pecuchet, J.P.: Genetically Designed MultipleKernels for Improving the SVM Performance. Portal VODEL (2008), http://vodel.insa-rouen.fr/publications/rfia 6. Nguyen, H.N., Ohn, S.Y., Choi, W.J.: Combined kernel function for support vector machine and learning method based on evolutionary algorithm. In: Pal, N.R., Kasabov, N., Mudi, R.K., Pal, S., Parui, S.K. (eds.) ICONIP 2004. LNCS, vol. 3316, pp. 1273–1278. Springer, Heidelberg (2004) 7. Sonnenburg, S., Ratsch, G., Schafer, C., Scholkopf, B.: Large scale multiple kernel learning. Journal of Machine Learning Research 7, 1531–1565 (2006) 8. Stahlbock, R., Lessmann, S., Crone, S.: Genetically constructed kernels for support vector machines. In: Proc. of German Operations Research, pp. 257–262. Springer, Heidelberg (2005) 9. Simian, D.: A Model For a Complex Polynomial SVM Kernel. In: Proceedings of the 8-th WSEAS Int. Conf. on Simulation, Modelling and Optimization, Santander Spain. Within Mathematics and Computers in Science and Engineering, pp. 164– 170 (2008) 10. Simian, D., Stoica, F.: An evolutionary method for constructing complex SVM kernels. In: Proceedings of the 10th International Conference on Mathematics and Computers in Biology and Chemistry. Recent Advances in Mathematics and Computers in Biology and Chemistry, MCBC 2009, Prague, Chech Republic, pp. 172– 178. WSEAS Press (2009) 11. Stoica, F., Simian, D., Simian, C.: A new co-mutation genetic operator, Advanced topics on evolutionary computing. In: Proceeding of the 9th Conference on Evolutionary Computing, Sofia, pp. 76–82 (2008) 12. Vapnik, V.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1995), http://www.csie.ntu.edu.tw/~ cjlin/libsvmtools/datasets
Enabling Cutting-Edge Semiconductor Simulation through Grid Technology

Asen Asenov, Dave Reid, Campbell Millar, Scott Roy, Gareth Roy, Richard Sinnott, Gordon Stewart, and Graeme Stewart

Device Modelling Group, University of Glasgow
National e-Science Centre, University of Glasgow
Department of Physics & Astronomy, University of Glasgow
[email protected]
Abstract. The progressive CMOS scaling drives the success of the global semiconductor industry. Detailed knowledge of transistor behaviour is necessary to overcome the many fundamental challenges faced by chip and systems designers. Grid technology has enabled the constantly increasing statistical variability introduced by discreteness of charge and matter to be examined in unprecedented detail. Over 200,000 transistors subject to random discrete dopants variability have been simulated, the results of which provide detailed insight into underlying physical processes. This paper outlines recent scientific results of the nanoCMOS project, and describes the way in which the scientific goals have been reflected in the grid-based e-infrastructure. Keywords: nanoCMOS electronics, virtual organization, security, variability.
1 Introduction to NanoCMOS Challenges
The progressive scaling of Complementary Metal Oxide Semiconductor (CMOS) transistors drives the success of the global semiconductor industry. This is at the heart of the famous Moore's Law [5]. As device dimensions approach the nanometer scale, however, chip and systems designers must overcome problems associated with the increasing statistical variability in nano-scale CMOS transistors due to the discreteness of charge and matter. The EPSRC-funded project Meeting the Design Challenges of nanoCMOS Electronics (nanoCMOS) aims to explore and find solutions to the problems caused by the increasing statistical variability and its impact on the system-on-a-chip (SoC) design process. The nanoCMOS e-Science pilot project [25] develops e-Science technology to support the computationally intensive simulations needed to gain a deeper understanding of the various sources of statistical variability and their impact on circuit and system design. These technologies also target the management and manipulation of the large amounts of resultant simulation data. The simulation of statistical variability requires 3D simulation of ensembles of devices to be performed on a statistical scale [16]. The increasing number
of transistors in modern chips also necessitates the simulation of very large statistical samples to allow the study of statistically rare devices with potentially detrimental effects on circuit performance. Previously, the computational complexity of 3D device simulation has restricted studies of variability to small ensembles of approximately 200 devices [24,13]. However, this results in inaccurate predictions of the statistical distribution of transistor parameters [21]. This motivates the deployment of grid technologies for performing the simulations of large statistical samples and for data mining of the large amount of output data generated. In the current 45 nm technology generation [5], the main source of threshold voltage fluctuation in bulk MOSFETs is the random discrete dopants (RDD). The billions of transistors in contemporary chips mean that statistically rare devices (beyond 5-6σ from the mean) are beginning to have a significant effect on designs and yield. In this paper, we present groundbreaking results where ensembles in excess of 100,000 35 nm [19] and 13 nm gate length MOSFETs have been simulated in 3D using a grid-based methodology. In total, these simulations required approximately 20 CPU years on a 2.4 GHz AMD Opteron system.
2 NanoCMOS E-Infrastructure and Simulation Methodology
The nanoCMOS project has adopted a hierarchical simulation methodology, which is shown in Fig. 1. In this paper we are primarily concerned with the bottom level of this food chain, where individual devices are simulated 'atomistically'. Results from this level will be passed up to higher levels of abstraction in the design flow in the form of transistor compact models, which will be used in circuit-level SPICE-like (Berkeley SPICE [1]) simulations. To support these simulations, the project employs a variety of grid middleware, including technologies provided by the Open Middleware Infrastructure Institute (OMII-UK, http://www.omii.ac.uk) and the Enabling Grids for E-Science (EGEE, http://public.eu-egee.org) project. The early phase of work focused upon the development of a family of OMII-UK services which supported the device modelling and compact model generation phases of electronics design. These services were developed to exploit the OMII-UK GridSAM job submission system [4]. The aim of GridSAM is to provide a web service for submitting and monitoring jobs managed by a variety of Distributed Resource Managers (DRMs). This web service interface allows jobs to be submitted from a client in a Job Submission Description Language (JSDL) [11] document and supports retrieval of their status as a chronological list of events detailing the state of the job. GridSAM translates the submission instruction into a set of resource-specific actions (file staging, launching and monitoring) using DRM connectors for each stage. Proof-of-concept nanoCMOS job submission solutions were implemented with GridSAM that showed how access to resources such as the NGS,
Fig. 1. Hierarchical simulation methodology adopted by the nanoCMOS project
Sun Grid Engine clusters and Condor pools at Glasgow could be supported. The main computational resource available to nanoCMOS is the ScotGrid cluster [7] at the University of Glasgow. The ScotGrid cluster supports the EGEE middleware stack, including its own particular flavour of job submission and management software, all based around the gLite middleware. One of the front-end software environments used for EGEE job submission is Ganga [2]. Ganga supports bulk job submission, where the maximum number of concurrent jobs per user is limited to 1000, but since multiple devices can be simulated in a single job this was sufficient for our methodology. The nanoCMOS researchers have become the primary end users of the ScotGrid cluster with a combined CPU usage of 23% of the total resource. Thus far, the simulations undertaken in nanoCMOS have resulted in over 175,000 CPU hours of device simulations run on ScotGrid. More details on the implementation of the simulation framework for nanoCMOS are available in [17]. The project has explored a variety of data management and security solutions. Given that the vast majority of the simulation data is file-based, a performance and security evaluation of the Storage Resource Broker [8] and the Andrew File System (AFS) [15] was undertaken. The results of this are described in [27] along with our justification for the adoption of AFS. A variety of metadata associated with simulations is captured and made available to targeted metadata services, which are in turn coupled with the predominantly file-based AFS data. A range of query interfaces to these metadata systems are also under development. Key to both the data management and simulation frameworks of nanoCMOS is support for fine-grained security. We have identified that no single security solution fulfils the needs of nanoCMOS research. Instead, it has been necessary
Fig. 2. ID-VG characteristics of the 35 nm Toshiba device as produced by the Glasgow simulator at a low drain voltage of 50 mV and a high drain voltage of 850 mV, calibrated against results obtained from both TCAD process simulation and experiment. The continuous doping profile is shown inset.
to integrate a range of security solutions, including Kerberos (for AFS) [20], the Virtual Organisation Membership Service (VOMS) [9], MyProxy [12], PERMIS [14], GSI [3], and Shibboleth [6]. The justification for integrating these technologies and the way in which they have been integrated is described in more detail in [28,29,26].
3 NanoCMOS Scientific Results and Discussion
The actual device simulations were carried out with the Glasgow 3D ‘atomistic’ drift/diffusion (DD) simulator [24]. The DD approach accurately models transistor characteristics in the sub-threshold regime, making it well suited for the study of VT fluctuations. The simulator includes Density Gradient quantum corrections [10], which accurately capture quantum confinement effects and are essential for preventing artificial charge trapping in the sharply resolved Coulomb potential of discrete impurities. Each device is also fully independent, allowing the problem to be easily parallelized using a task farming approach. The 35 nm gate length transistor used in the simulation studies was published by Toshiba [19] and the simulator was calibrated to the experimental characteristics of this device. Structural data for the device was obtained through commercial TCAD process simulation. The doping profile structure and characteristics of the Toshiba device are shown in Fig. 2. Having performed 100,000 3D simulations of the 35 nm MOSFET we were able to investigate its statistics with a high degree of confidence. The distribution of random dopant induced threshold voltage fluctuations in the simulated ensemble can be seen in Fig. 3. The fluctuations arise from the fact that each
Fig. 3. (a) The distribution of simulated VT data compared to Gaussian and Pearson IV distributions. (b) Tails of the VT distribution. The inaccuracy of the Gaussian fit to the distribution in the tails of the distribution can clearly be seen. (Both panels plot probability density, against VT (V) in (a) and VT (σ) in (b).)
device will have both a different number of dopant atoms and a microscopically different configuration of those dopants. It is important to model accurately the tails of the distribution (at 6 σ and greater) so that the impact of statistically rare devices on circuit performance may be properly assessed. This is currently only possible through a ‘brute force’ approach, necessitating very large numbers of simulations. In order to assess the accuracy of the simulation ensemble we have calculated the χ2 errors of the statistical moments of the threshold voltage distribution as a function of the ensemble size. These are shown in Fig. 4, where it can be seen that enabling larger scale simulations using grid technology has greatly improved the accuracy of our description of random dopant fluctuations. From Fig. 3 it is clear that the RDD-induced statistical distribution is asymmetrical, which has also been reported in experimental measurements [22]. A Gaussian distribution, which is typically used to describe the RDD variability, fitted with the data mean and standard deviation yields a relatively large χ2 error of 2.42. Fitting the data using a Pearson Type IV distribution [18] results in a much better fit, with a χ2 error of 0.38. The analytical fits are shown in Fig. 3 for comparison, and
Fig. 4. χ2 error of the statistical moments as a function of sample size. It is assumed that the final values are the population values.
Table 1. Statistical moments of the VT distributions. Note that the mean value has been normalised to the experimental mean threshold.
Fig. 5. Potential landscapes from the extremes of the VT distribution with VG = 225.9 mV
the moments of the distribution and analytical fits are given in Table 1. A more detailed view of the tails is shown in Fig. 3(b), where the systematic error in the Gaussian is more apparent. The effect of RDD on an individual device can be understood by examining the electrostatic potential within the devices. Fig. 5 shows the potential profiles extracted from devices taken from the upper and lower tails, along with a ‘mean’ device chosen from the centre of the distribution for comparison. This clearly illustrates the physical causes of variation in device characteristics, since the behaviour of a MOSFET is determined by the height of the potential barrier in the channel. It can be seen that even at the nominal threshold voltage, the device from the lower tail has already switched on and the device from the upper tail is off. While statistically rare devices can be generated artificially, it is only through the large-scale simulation undertaken here that we can obtain the correct statistical description that governs the occurrence of such devices and through this achieve a better understanding of the underlying physical causes of variability. Each device within the ensemble was analysed by sectioning into 1 nm thick volumes in the x and z directions, as demonstrated in Fig. 6(a). The correlation between the number of dopants in each volume and the measured VT of the device can be calculated. The correlation between dopant position and threshold voltage is illustrated in Fig. 6(b). This allows the definition of the statistically significant region (SSR) where individual dopants can affect the bulk device behaviour, which, as expected, corresponds approximately to the device channel but also includes some small portions of the source and drain regions as well. For a fixed number of dopants in the SSR, there will also be a distribution of threshold voltage arising from the different positional configurations of dopants that can occur in the silicon lattice. The VT distributions for n = 35, 45, 55 dopants (corresponding to lower tail, mean, and upper tail devices) in the SSR can be seen in Fig. 7. From this it is clear that both mean and standard deviation
Fig. 6. (a) The device is divided into 1 nm slices in x and z. (b) The correlation between the number of dopants in each slice and VT is used to determine the SSR.
Fig. 7. Gaussian distribution of VT for selected occurrences of a number of dopants. Note the increasing mean and standard deviation of VT as a function of n.
of VT increase with the number of dopants. By further examining this relationship using all of the available data, it can be determined that the mean and standard deviation depend linearly on the number of dopants [23]. Therefore, this relationship can be extrapolated to arbitrary numbers within the SSR as necessary. From knowledge of the physical processes governing dopant implantation, the number of dopants within the SSR must be governed by a Poisson distribution. Therefore, a VT distribution can be constructed from the convolution of this Poisson distribution with the Gaussian distributions from the random position of dopants [23], which extends to very large values of σ. This calculation is illustrated graphically in Fig. 8. The linear relationship between the number of dopants and the mean and standard deviation of the positional Gaussians allows the distribution to be extrapolated to an arbitrary value of σ, resulting in a significantly more accurate prediction of the probability of finding devices
Fig. 8. Graphical illustration of how the full distribution is built up from the convolution of a Poissonian and the positional Gaussians

Fig. 9. (a) Tails of the convolved distribution and (b) the χ2 errors across the distribution, shown for the convolution, the simulation data and the extrapolated distribution. The convolution yields an excellent fit across the whole distribution.
in the tails of the distribution, where real gains can be made in billion-transistor chips. The accuracy of the distribution resulting from this convolution is shown in Fig. 9(a), along with the extrapolated distribution; the χ2 errors calculated from the comparison of this distribution with simulation data can be seen in Fig. 9(b). The total calculated χ2 errors are 0.94 for the convolution using simulation data and 0.55 for the extrapolated distribution. These values are comparable to the fitting error of the Pearson IV, demonstrating the accuracy of this analytical description.
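To make the construction concrete, the following Python fragment sketches how such a convolved VT distribution could be assembled; the linear coefficients and the mean dopant number are hypothetical placeholders for illustration only, not values taken from the simulations above.

import numpy as np
from scipy.stats import norm, poisson

# Hypothetical linear dependence of the positional Gaussians on the number
# of dopants n in the SSR (placeholder coefficients, not fitted values).
mu0, mu1 = 0.10, 0.0025      # mean(VT)(n) = mu0 + mu1 * n, in volts
sig0, sig1 = 0.005, 0.0004   # std(VT)(n)  = sig0 + sig1 * n, in volts
lam = 45.0                   # mean number of dopants in the SSR (Poisson parameter)

vt = np.linspace(0.0, 0.5, 2001)     # VT axis
pdf = np.zeros_like(vt)
for n in range(0, 200):              # terms far from lam contribute negligibly
    pdf += poisson.pmf(n, lam) * norm.pdf(vt, loc=mu0 + mu1 * n,
                                          scale=sig0 + sig1 * n)
# 'pdf' now approximates the full VT distribution; because the mean and standard
# deviation are linear in n, it can be evaluated arbitrarily far into the tails
# without further 3D device simulations.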
4 Conclusions and Future Work
It is clear that small statistical ensembles do not provide sufficient accuracy and information for design when variations at 6 σ and beyond are important. Large datasets are necessary, as some effects, such as the asymmetry of the VT distribution observed here, are not visible in small datasets and will have seriously detrimental effects if not incorporated into design strategies. We have shown that
by applying grid and e-science technology to the problem of intrinsic parameter fluctuations we have gained a fundamental insight into device behaviour in the presence of random dopants. The large datasets obtained have provided sufficient data to develop and verify an accurate analytical model of the underlying physical processes that affect VT fluctuations in nano-scale semiconductor MOSFETs. Future work will explore the impact of these atomic variations throughout the design process. We are also exploring the atomistic variability of a variety of novel device architectures.
Acknowledgements This work was funded by a grant from the UK Engineering and Physical Sciences Research Council. We gratefully acknowledge their support.
References 1. Berkeley SPICE, http://bwrc.eecs.berkeley.edu/Classes/icbook/SPICE/ 2. Ganga webpage, http://ganga.web.cern.ch/ganga/ 3. Globus Security Infrastructure, http://www.globus.org/security 4. GridSAM — Grid Job Submission and Monitoring Web Service, http://gridsam.sourceforge.net/ 5. International technology roadmap for semiconductors (2005), http://www.itrs.net/ 6. Internet2 Shibboleth Architecture and Protocols, http://shibboleth.internet2.edu 7. Scotgrid webpage, http://www.scotgrid.ac.uk 8. Storage Resource Broker (SRB), http://www.sdsc.edu/srb/index.php 9. Alfieri, R., et al.: VOMS: an authorization system for virtual organizations. In: Fernández Rivera, F., Bubak, M., Gómez Tato, A., Doallo, R. (eds.) Across Grids 2003. LNCS, vol. 2970, pp. 33–40. Springer, Heidelberg (2004) 10. Ancona, M.G., Tiersten, H.F.: Macroscopic physics of the silicon inversion layer. Phys. Rev. B 35(15), 7959–7965 (1987) 11. Anjomshoaa, A., et al.: Job Submission Description Language (JSDL) Specification, Version 1.0 (2005), http://www.ogf.org/documents/GFD.56.pdf 12. Basney, J., Humphrey, M., Welch, V.: The MyProxy Online Credential Repository. Software Practice and Experience 35(9), 801–816 (2005) 13. Brown, A., Roy, G., Asenov, A.: Poly-Si-gate-related variability in decananometer MOSFETs with conventional architecture. IEEE Transactions on Electron Devices 54(11), 3056–3063 (2007)
14. Chadwick, D.W., Otenko, A., Ball, E.: Role-based Access Control with X.509 Attribute Certificates. IEEE Internet Computing 7(2), 62–69 (2003) 15. Edward, R., Zayas, R.: Andrew File System (AFSv3) Programmer's Reference: Architectural Overview. Transarc Corporation, Pittsburgh (1991) 16. Frank, D.J., Taur, Y.: Design considerations for CMOS near the limits of scaling. Solid State Electronics 46, 315–320 (2002) 17. Han, L., Sinnott, R.O., Asenov, A., et al.: Towards a Grid-enabled Simulation Framework for nanoCMOS Electronics. In: 3rd IEEE International Conference on e-Science, Bangalore, India (December 2007)
18. Heinrich, J.: A guide to the Pearson type IV distribution. Technical report, University of Pennsylvania (2004) 19. Inaba, S., Okano, K., et al.: High performance 35 nm gate length CMOS with NO oxynitride gate dielectric and Ni salicide. IEEE Transactions on Electron Devices 49(12), 2263–2270 (2002) 20. Kohl, J.T., Neuman, B.C., T’so, T.Y.: The Evolution of the Kerberos Authentication System. Distributed Open Systems, 78–94 (1994) 21. Millar, C., Reid, D., et al.: Accurate statistical description of random dopant induced threshold voltage variability. IEEE Electron Device Letters 29(8) (2008) 22. Nassif, S., et al.: High performance CMOS variability in the 65 nm regime and beyond. IEDM Digest of Technical Papers, pp. 569–571 (2007) 23. Reid, D., Millar, C., et al.: Prediction of random dopant induced threshold voltage fluctuations in nanoCMOS transistors. In: SISPAD 2008 (2008) (in publication) 24. Roy, G., Brown, A., et al.: Simulation study of individual and combined sources of intrinsic parameter fluctuations in conventional nano-MOSFETs. IEEE Transactions on Electron Devices 53(12), 3063–3070 (2006) 25. Sinnott, R.O., Asenov, A., et al.: Meeting the Design Challenges of nanoCMOS Electronics: An Introduction to an EPSRC pilot project. In: Proceedings of the UK e-Science All Hands Meeting (2006) 26. Sinnott, R.O., Asenov, A., et al.: Integrating Security Solutions to Support nanoCMOS Electronics Research. In: IEEE International Symposium on Parallel and Distributed Processing Systems with Applications, Sydney Australia (2008c) 27. Sinnott, R.O., Bayliss, C., et al.: Secure, Performance-Oriented Data Management for nanoCMOS Electronics. Submitted to e-Science 2008, Indiana, USA (December 2008d) 28. Sinnott, R.O., Chadwick, D., et al.: Advanced Security for Virtual Organizations: Exploring the Pros and Cons of Centralized vs Decentralized Security Models. In: 8th IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2008), Lyon, France (2008a) 29. Sinnott, R.O., Stewart, G., et al.: Supporting Security-oriented Collaborative nanoCMOS Electronics e-Research. In: International Conference on Computational Science, Krakow, Poland (2008b)
Incremental Reuse of Paths in Random Walk Radiosity

Francesc Castro and Mateu Sbert

Graphics and Imaging Laboratory, IIiA, Universitat de Girona
{castro,mateu}@ima.udg.edu
Abstract. Radiosity techniques produce highly realistic synthetic images for diffuse environments. Monte Carlo random walk approaches, widely applied to radiosity, have as a drawback the necessity of a high number of random paths to obtain an acceptable result, involving a high computation cost. The reuse of paths is a strategy to reduce this cost, allowing a path to distribute light power from several light positions and resulting in a noticeable speed-up factor. We present a new strategy for the reuse of paths which allows us, given a previously computed set of radiosity solutions corresponding to n light positions, to add new light positions and accurately compute the radiosity solution at a reduced cost by reusing paths. Our incremental strategy can be applied to light positioning in interior design, allowing the set of authorized light locations to be enriched by adding new positions chosen by the user.
1 Introduction
Radiosity techniques [4] are commonly used in global illumination. They follow the radiosity equations, which model the exchange of radiant energy between the objects in the environment. Monte Carlo random walk approaches have been widely applied to radiosity. A drawback of such approaches is the necessity of a high number of paths to obtain an acceptable result, involving a high computation cost. One way to reduce such a cost is the reuse of paths [10,11]. In the case of shooting random walk, reuse allows a random path to distribute light power from several light positions, resulting in a noticeable speed-up factor. We present in this article a new strategy of reuse of shooting paths. Such a strategy will allow us, given a previously computed set of radiosity solutions corresponding to n light positions, to add new light positions and accurately compute their corresponding radiosity solutions at a reduced cost thanks to the storage and reuse of the paths generated in the computation of the initial set of solutions. Multiple importance sampling [14] will guarantee the unbiasedness of the new solutions. Our incremental strategy can be applied to light positioning in interior design. The set of authorized light locations to be chosen or combined in order to decide
the best lighting can be enriched by adding new light positions. This allows an interactive application in which the user can choose new light locations and, at a reduced computation cost thanks to the reuse, see their corresponding lighting, or light combinations involving the new computed solution. This paper is structured as follows. Section 2 describes previous work in reuse of paths in radiosity. Section 3 describes our contribution, the incremental reuse of paths. Results are presented in section 4, while section 5 outlines the conclusions and some possible lines of future research.
2 Previous Work: Reuse of Shooting Paths in Radiosity
Radiosity [4] is a global illumination algorithm that simulates the multiple reflections of light in a scene. Radiosity assumes that the exiting radiance of a point is direction independent, i.e., it works with diffuse surfaces. The equation for the radiosity B(x) (emitted plus reflected light leaving point x) is:

B(x) = ∫_D ρ(x) F(x, y) B(y) dy + E(x),   (1)
where D stands for the set of all the surfaces in the scene, ρ(x) and E(x) respectively stand for the reflectance (fraction of the incoming light that is reflected) and emittance (emission radiosity) at point x, and F(x, y) stands for the form factor between points x and y, a geometric term (Fig. 1, left) defined in (2):

F(x, y) = cos θ_x cos θ_y V(x, y) / (π r²),   (2)
where θ_x, θ_y are the angles between the line from x to y and the respective normal vectors, r is the distance between x and y, and V(x, y) is the visibility function between x and y (with value 0 or 1). In practice, a discrete version of the radiosity equation is considered, corresponding to a discretization of the environment into small polygons called patches. The radiosity system of equations can be solved using Monte Carlo random walk [8,9,1]. Each path traced from a light source can be seen as a light particle carrying an amount of power equal to Φ/N, N being the number of paths to be traced, and Φ being the emission power of the light. The transition probabilities are simulated by tracing rays from points uniformly distributed on the light surface, using a cosine distributed direction (according to the kernel of the form factor integral). The walk continues from the patch hit by this ray, to which a contribution equal to the power carried by the path is added. A survival test is done at each hit with the reflectance of the hit patch being the probability of survival. This technique is known as collision shooting random walk. The idea of reusing full paths was introduced by Halton in [7], in the context of the random walk solution of an equation system. Reuse of paths in random walks has been exploited in global illumination and radiosity, providing a significant reduction of the computation cost. Let us outline next some approaches in which shooting paths have been reused.
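For illustration, a single path of the collision shooting random walk just described might be traced as in the following Python sketch; the scene and light interfaces (sample_point, normal_at, trace_ray, cosine_sampled_direction) are hypothetical stand-ins for a real renderer's routines, so this is only a sketch of the technique and not an actual implementation.

import random

def shoot_path(scene, light, power_per_path):
    # One collision shooting random walk: the origin is sampled uniformly on
    # the light and the direction is cosine distributed (form-factor kernel).
    x = light.sample_point()
    d = cosine_sampled_direction(light.normal_at(x))   # assumed helper
    while True:
        hit = scene.trace_ray(x, d)                    # nearest intersection, or None
        if hit is None:
            break
        hit.patch.incoming_power += power_per_path     # deposit the carried power
        if random.random() > hit.patch.reflectance:    # survival test
            break
        x = hit.point
        d = cosine_sampled_direction(hit.patch.normal)

# Tracing N such paths, each carrying Phi/N of the emission power Phi,
# yields the collision shooting random walk estimate of the radiosities.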
Fig. 1. (left) Point-to-point form factor geometry. V (x, y) is 1 if points x and y are mutually visible, and 0 otherwise. (right) Reuse of shooting paths. Consider 3 points x1 , x2 , x3 in the corresponding light positions. The path started at x1 (blue/continuous) is reused to distribute power from x2 and x3 , and the same with the paths started at x2 (black/dotted) and x3 (red/dashed). Note that point y2 is not visible from x3 , so the path started at x2 will not contribute to expand any power from x3 .
Sbert et al. [12] presented a fast light animation algorithm based on the virtual lights illumination method, proposed by Wald et al. [15]. They reused paths in all frames, presenting an acceleration factor close to the number of frames of the animation. Also Sbert et al. [11] presented a radiosity algorithm that allowed the reuse of paths for different light positions. Paths generated at a position could be used to distribute power from all the positions, in an each-for-all strategy (Fig. 1, right). Such an idea, which can be useful for light animation and light positioning [3], will be detailed next. Let us consider N points on a light in n different positions, and let x_1^k, ..., x_n^k be the n positions considered for x^k, one of such N points. Points y (first hit of the paths) will be sampled from any of the points x_1^k, ..., x_n^k, which corresponds to using a mixture pdf mix(y) [13] which is just the average of the n form factors F(x_1^k, y), ..., F(x_n^k, y). Thus, the power to be distributed from x_j^k can be estimated, according to MC integration, in the following way:

Φ(x_j^k) = (1/n) Σ_{i=1}^{n} [F(x_j^k, y_i^k) / mix(y_i^k)] × Φ/N,   (3)

which means that each of the n samples considered y_1^k, ..., y_n^k is generated from one of the n positions. The i-th term in the sum in (3) can be seen as the amount of the power from x_j^k carried by the shooting path Γ_i^k, started at x_i^k. Let Φ_j(Γ_i^k) be this power:

Φ_j(Γ_i^k) = [F(x_j^k, y_i^k) / mix(y_i^k)] × Φ/(Nn) = [F(x_j^k, y_i^k) / Σ_{l=1}^{n} F(x_l^k, y_i^k)] × Φ/N.   (4)
We have to note that this strategy involves a bias, because we do not consider the hits of the paths against the light. If we consider small lights, such a bias is rather unnoticeable, however.
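As an illustration of equation (4), the weights with which one stored path distributes power from every light position could be computed as in the short Python sketch below; form_factor is an assumed helper evaluating the point-to-point form factor of equation (2), so this is a minimal sketch rather than the authors' code.

def reused_powers(light_points, y, Phi, N):
    # light_points: the n light positions x_1..x_n; y: first hit of a path.
    # Returns the power Phi_j carried from every position j, following eq. (4):
    # Phi_j = F(x_j, y) / sum_l F(x_l, y) * Phi / N.
    F = [form_factor(x, y) for x in light_points]   # F(x_j, y), assumed helper
    total = sum(F)                                  # equals n * mix(y)
    if total == 0.0:
        return [0.0] * len(light_points)
    return [Fj / total * Phi / N for Fj in F]

Note that a position from which y is not visible has a zero form factor and therefore contributes no power through this path, consistent with Fig. 1 (right).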
3 Our Contribution: Incremental Reuse of the Shooting Paths
We decide to take directional point lights, made up of a differential area plus a normal vector. This involves a simplification of the model with respect to area lights, and carries some advantages. First, the bias described in the previous section disappears. Second, and more important, since there is only one origin point x_i at each light position i, there is a noticeable reduction of the information to be stored to reuse the paths for the new light positions. We aim at computing the radiosity solution for a new light position p = n + 1. Such a new position will be chosen by the user. Next we will show how this new solution can be computed in an unbiased way at a reasonable cost by reusing the paths previously generated at the n original positions. We will consider the paths which distribute the emission power from position p to be generated in n + 1 different ways, that is, from the original n positions plus the new position p. This corresponds to taking as mixture pdf the average of the n + 1 form factors F(x_1, y), ..., F(x_{n+1}, y), weighted by N_i, the number of paths generated at each position, to allow a different number of samples at each position. Let M = Σ_{i=1}^{n+1} N_i. Equation (5) shows the power from the light at p to be distributed by a path Γ_i^k generated at position i ∈ [1, n + 1]. Thus, such an equation allows us to reuse the paths traced during the previous computation of the set of solutions, and it is also valid for the new paths traced from p.

Φ_p(Γ_i^k) = [F(x_p, y_i^k) / mix(y_i^k)] × Φ/M = [F(x_p, y_i^k) / Σ_{l=1}^{n+1} N_l F(x_l, y_i^k)] × Φ   (5)
It is worth mentioning that some new paths have to be cast from the new position p in order not to introduce bias. Multiple importance sampling [14,13] guarantees unbiasedness only if all the points in the domain with non-zero value of the integrand can be generated using the mixture pdf. In practice, note that points y with F(x_p, y) > 0, but non-reachable from the n original positions, can only be sampled by casting paths from the new position p. The reuse at the new position p of the shooting paths cast from the n original positions, according to equation (5), demands some data storage. For each shooting path Γ_k to be reused, we have to store:
– L_k, the list of intersected patches.
– y_k, the first-hit point.
– S_k, the sum of the form factors (weighted by N_i) from each of the n positions to y_k.
In Fig. 2 we outline the reuse algorithm for computing the set of solutions corresponding to the n original positions. Note that the information above is stored. Fig. 3 shows the algorithm where the radiosity solution for a new light position p is computed by taking advantage of the information stored in the algorithm in Fig. 2.
forEach original position i in 1..n do
    forEach k in 1..N_i do
        start path Γ_i^k at x_i
        obtain and store first hit-point y_i^k
        S_i^k = 0
        forEach original position j in 1..n do
            compute F(x_j, y_i^k)
            S_i^k += N_j × F(x_j, y_i^k)
        endfor
        store S_i^k
        forEach original position j in 1..n do
            Φ_i^k[j] = Φ × F(x_j, y_i^k) / S_i^k
        endfor
        continue path Γ_i^k until termination
        store list of patches intersected by Γ_i^k
        forEach original position j in 1..n do
            increase by Φ_i^k[j] the incoming power for each patch in the list
        endfor
    endfor
endfor

Fig. 2. Reuse and storage of paths for the original set of positions

Choose a new position p = n + 1
forEach k in 1..N_p do
    start path Γ_p^k at x_p
    obtain first hit-point y_k
    S_k = 0
    forEach position j in 1..n + 1 do
        compute F(x_j, y_k)
        S_k += N_j F(x_j, y_k)
    endfor
    continue path Γ_k until termination
    increase by Φ × F(x_j, y_k)/S_k the incoming power for each intersected patch
endfor
forEach stored path Γ_k in 1..M (M = Σ_{i=1}^{n} N_i) do
    obtain stored values L_k, S_k, y_k
    compute F(x_p, y_k)
    S_k += N_p F(x_p, y_k)
    At position p, increase by Φ × F(x_p, y_k)/S_k the incoming power for each patch in L_k
endfor

Fig. 3. Reuse of paths for an a-posteriori-added light position
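The second loop of the algorithm in Fig. 3 (reusing the stored paths for the new position) could be expressed in Python roughly as follows; the StoredPath fields (patches, y, S) mirror the data listed above and form_factor is an assumed helper, so this is only an illustrative sketch rather than the authors' implementation.

def reuse_stored_paths(stored_paths, x_p, N_p, Phi, incoming_power_p):
    # stored_paths: the M paths saved while computing the original n solutions;
    # each carries the list L_k of intersected patches, the first hit y_k and
    # the stored weighted sum S_k. incoming_power_p accumulates power at p.
    for path in stored_paths:
        F_p = form_factor(x_p, path.y)    # F(x_p, y_k), assumed helper
        S = path.S + N_p * F_p            # updated weighted sum of eq. (5)
        if S == 0.0:
            continue
        contribution = Phi * F_p / S      # power carried from p by this path
        for patch in path.patches:        # patches in L_k
            incoming_power_p[patch] += contribution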
3.1 Applications
This new strategy can be applied to light positioning in interior design. From a given set of light locations with their respective radiosity solutions, the user
will be able to choose new light locations. Thanks to the storage of the shooting paths, the new radiosity solutions will be obtained fast. This allows the user to see the effect of the resulting illumination for the light placed at the new location, or even to replace an old position by the new one. The strategy can also be useful for heuristic-search-based algorithms of light positioning [6,5,2], in which an optimal configuration of lights is pursued by applying heuristic methods such as local-search, genetic algorithms, etc.
4 Results
We aim at comparing the performance of our strategy of reusing the previously computed paths (B) against the performance of computing the new solution without reuse (A). Computation time and mean square error (MSE) with respect to a reference solution will be taken into account in our comparisons. It is worth mentioning that our reusing strategy is noticeably accelerated [3] by running a pre-process in which the directional hemisphere over each light position is discretized into regions and information about the nearest intersection in the corresponding direction is stored in each region. Such a pre-process, which also has to be run for the newly-added positions, allows us to accelerate the computation of the point-to-point form factors needed for the reuse, at the cost of introducing a small bias. We consider a room (see Fig. 4), discretized into nearly 40000 patches, in which n = 18 light positions have been taken into account in the original position set. Such positions are distributed in a non-regular way on the ceiling and walls of the room. The position added a posteriori is also placed on one of the walls. Table 1 shows the cost due to the newly-added position and the MSE for different numbers of paths cast from this new position. In the case of reuse (B), we have generated 10 times fewer new paths than when no reuse is applied (A). Fig. 4 illustrates the advantage of the reuse of paths for newly added positions. On the left, an image corresponding to the radiosity solution obtained without reuse. On the right, an image corresponding to the solution obtained by reusing the stored paths.

Table 1. A-posteriori addition of a new light. New paths, computation time (in seconds), and MSE with respect to a reference solution. (A) without reuse; (B) reusing the paths from the previous computation of a set of 18 radiosity solutions. Time in (B) includes pre-process, form factor computation, and tracing paths from the new position. The relative efficiency of (B) with respect to (A) is approximately 8 in this experiment. (*) The bias due to the discretization of the directions over the positions has been subtracted from the MSE.

NEW PATHS (A)  TIME (A)  MSE (A)   NEW PATHS (B)  TIME (B)  MSE (B)(*)
100K           32        0.210     10K            8         0.080
200K           64        0.098     20K            16        0.039
400K           128       0.052     40K            31        0.029
800K           258       0.029     80K            60        0.014
Fig. 4. (left) (A) (no reuse). 400K new paths. 128 sec. MSE= 0.052. (right) (B) (reusing paths). 40K new paths. 31 sec. MSE= 0.029. We have reused the paths stored from 18 previously computed solutions corresponding to 18 light positions distributed on the ceiling and walls of the room. Note that the image obtained with reuse is slightly less noisy, while its computation time is about 4 times lower than the one for the image without reuse. Small white squares represent the directional point light sources.
The storage of the information needed for each path to be reused has been done in RAM in the form of dynamic lists. It requires 24 bytes for a path of an average length of 4, resulting in about 172 MB of memory for the run corresponding to the image in Fig. 4 (right), where 400 K paths have been stored for each of the 18 original positions. All the runs have been done on a Pentium IV at 3 GHz with 2 GB of RAM.
5 Conclusions and Future Work
We have presented a new strategy that allows us, starting from a previously computed radiosity solution for a set of light positions, to estimate without bias, at a reduced cost, the solutions for newly-added light positions, thanks to the reuse of the stored paths. Our approach appears to be clearly superior to computing independent new paths. The relative efficiencies, which are about 1 order of magnitude in our experiments, depend on many factors such as the scene geometry, the average reflectance, the number and position of the initial set of locations, the relative position of the newly-added locations, and the number of paths we decide to cast from the new locations. Our strategy can be easily applied to light positioning in interior design, allowing the user to choose a new light location given an initial set of solutions, and, once the new solution is computed, to visualize it independently or combined with previous solutions.
We plan in the future to take into account not only new positions but also orientations. Our strategy can also be applied to animations involving moving lights in which the light trajectory is not known a priori but decided on the fly. Acknowledgment. This project has been funded in part with grant number TIN2007-68066-C04-01 from the Spanish Government.
References 1. Bekaert, P.: Hierarchical and stochastic algorithms for radiosity. Ph.D. Thesis. Catholic Univ. of Leuven (1999) 2. Castro, F., Acebo, E., Sbert, M.: Heuristic-search-based light positioning according to irradiance intervals. In: Butz, A., Fisher, B., Christie, M., Kr¨ uger, A., Olivier, P., Ther´ on, R. (eds.) SG 2009. LNCS, vol. 5531, pp. 128–139. Springer, Heidelberg (2009), http://ima.udg.edu/~ castro/articles/SG09.pdf 3. Castro, F., Sbert, M., Halton, J.: Efficient reuse of paths for random walk radiosity. Computers and Graphics 32(1), 65–81 (2008) 4. Cohen, M., Wallace, J.: Radiosity and Realistic Image Synthesis. Academic Press Professional, London (1993) 5. Delepoulle, S., Renaud, C., Chelle, M.: Improving light position in a growth chamber through the use of a genetic algorithm. Artificial Intelligence Techniques for Computer Graphics 159, 67–82 (2008) 6. Elorza, J., Rudomin, I.: An interactive system for solving inverse illumination problems using genetic algorithms. Computation Visual (1997) 7. Halton, J.: Sequential Monte Carlo techniques for the solution of linear systems. Journal of Scientific Computing 9(2), 213–257 (1994) 8. Pattanaik, S., Mudur, S.: Computation of global illumination by Monte Carlo simulation of the particle model of light. In: Proceedings of third Eurographics WorkShop on Rendering, pp. 71–83. Springer, Heidelberg (1992) 9. Sbert, M.: The use of global random directions to compute radiosity. Global Monte Carlo methods. Ph.D. Thesis. Univ. Polit`ecnica de Catalunya (1997) 10. Sbert, M., Bekaert, P., Halton, J.: Reusing paths in radiosity and global illumination. In: Proceedings of 4th IMACS Seminar on Monte Carlo Methods, Berlin, Germany, vol. 10(3-4), pp. 575–585 (2004) 11. Sbert, M., Castro, F., Halton, J.: Reuse of paths in light source animation. In: Proceedings of CGI 2004, Crete (Greece), pp. 532–535. IEEE Computer Society Press, Los Alamitos (2004) 12. Sbert, M., Szecsi, L., Szirmay-Kalos, L.: Real-time light animation. In: Computer Graphics Forum (proc. EG 2004), vol. 23(3), pp. 291–299 (2004) 13. Veach, E.: Robust Monte Carlo methods for light transport simulation. Ph.D. Thesis. Stanford University (1997) 14. Veach, E., Guibas, L.: Optimally combining sampling techniques for monte carlo rendering. In: ACM SIGGRAPH 1995 proceedings, pp. 419–428. Addison Wesley Publishing Company, Reading (1995) 15. Wald, I., Kollig, T., Benthin, C., Keller, A., Slussalek, P.: Interactive global illumination using fast ray tracing. In: Rendering Techniques 2002. ACM International Conference Proceeding Series, vol. 28, pp. 15–24. Eurographics Association (2002)
Monte Carlo Adaptive Technique for Sensitivity Analysis of a Large-Scale Air Pollution Model

Ivan Dimov1,2 and Rayna Georgieva2

1 ACET, The University of Reading, Whiteknights, PO Box 225, Reading, RG6 6AY, UK
2 IPP, Bulgarian Academy of Sciences, Acad. G. Bonchev 25 A, 1113 Sofia, Bulgaria
[email protected], [email protected]
Abstract. Variance-based sensitivity analysis has been performed to study the contribution of input parameters to the output variability of a large-scale air pollution model — the Unified Danish Eulerian Model. The computation of the numerical indicators of sensitivity, the Sobol' global sensitivity indices, leads to multidimensional integration. Plain and Adaptive Monte Carlo techniques for numerical integration have been analysed and applied. Numerical results for the sensitivity of pollutant concentrations to the variability of chemical rates are presented.
1 Introduction
Sensitivity analysis (SA) is a study of how uncertainty in the output of a model can be apportioned to different sources of uncertainty in the model input [6]. Several sensitivity analysis techniques are available [6]. Variance-based methods deliver sensitivity measures that are independent of the model's behaviour (linearity, monotonicity, and additivity of the relationship between input factors and model output). The aim of our research is to develop an Adaptive Monte Carlo (MC) algorithm for evaluating Sobol' global sensitivity indices, increasing the reliability of the results by reducing the variance of the corresponding Monte Carlo estimator and applying an adaptive concept to the numerical integration of functions with local difficulties. This gives the possibility of a robust SA — in our particular case, a comprehensive study of the influence of variations of the chemical rates on the model results. This helps to identify the most influential parameters and mechanisms and to improve the accuracy. It also helps to simplify the model by ignoring insignificant parameters after a careful complementary analysis of the relations between parameters.
2 Background Studies
The investigations and the numerical results reported in this paper have been obtained by using a large-scale mathematical model called Unified Danish
Eulerian Model (UNI-DEM) [7,8]. This model simulates the transport of air pollutants and has been developed by Dr. Z. Zlatev and his collaborators at the Danish National Environmental Research Institute (http://www2.dmu.dk/AtmosphericEnvironment/DEM/). Both non-linearity and stiffness of the equations are mainly introduced by the chemistry (CBM-4 chemical scheme) [8]. Thus, the motivation to choose UNI-DEM is that it is one of the models of atmospheric chemistry where the chemical processes are taken into account in a very accurate way. Our main interest is to find out how changes in the input parameters of the model influence the model output. We consider the chemical rate constants to be input parameters and the concentrations of pollutants to be output parameters. In the context of this paper, the term “constants” means variables with normal distribution (established experimentally) with mean 1.0.

2.1 Sobol' Global Sensitivity Indices Concept
It is assumed that the mathematical model can be presented as a model function

u = f(x), where x = (x_1, x_2, ..., x_d) ∈ U^d ≡ [0; 1]^d   (1)

is a vector of independent input parameters with a joint probability density function (p.d.f.) p(x) = p(x_1, ..., x_d). The total sensitivity index (TSI) of input parameter x_i, i ∈ {1, ..., d}, is defined in the following way [3]:

S_{x_i}^{tot} = S_i + Σ_{l_1≠i} S_{i l_1} + Σ_{l_1,l_2≠i, l_1<l_2} S_{i l_1 l_2} + ... + S_{i l_1 ... l_{d-1}},

where S_i is called the main effect (first-order sensitivity index) of x_i and S_{i l_1 ... l_{j-1}} is the j-th order sensitivity index (respectively two-way interactions for j = 2, and so on) for parameter x_i (2 ≤ j ≤ d). The variance-based Sobol' method [3] uses these sensitivity measures (indices) and takes into account interaction effects between inputs. An important advantage of this method is that it allows the computation not only of the first-order indices, but also of indices of higher order; in a way similar to the computation of the main effects, the total sensitivity index can be calculated with just one Monte Carlo integral per factor. The computational cost of estimating all first-order and total sensitivity indices via the Sobol' approach is proportional to dN, where N is the sample size and d is the number of input parameters (see [2]). The method is based on a decomposition of an integrable model function f in the d-dimensional factor space into terms of increasing dimensionality:

f(x) = f_0 + Σ_{ν=1}^{d} Σ_{l_1<...<l_ν} f_{l_1...l_ν}(x_{l_1}, x_{l_2}, ..., x_{l_ν}),   f_0 = ∫ f(x) dx,   (2)

where f_0 is a constant. The representation (2) is unique (called the ANOVA representation of the model function f(x) [4]) if

∫_0^1 f_{l_1...l_ν}(x_{l_1}, x_{l_2}, ..., x_{l_ν}) dx_{l_k} = 0,   1 ≤ k ≤ ν,   ν = 1, ..., d.

The quantities

D = ∫_{U^d} f²(x) dx − f_0²,   D_{l_1...l_ν} = ∫ f²_{l_1...l_ν} dx_{l_1} ... dx_{l_ν}

are called variances (total and partial variances, respectively), where f(x) is a square integrable function. Therefore, the total variance
of the model output is partitioned into partial variances [3] in an analogous way to the model function, that is, the ANOVA decomposition:

D = Σ_{ν=1}^{d} Σ_{l_1<...<l_ν} D_{l_1...l_ν}.

The ratios

S_{l_1...l_ν} = D_{l_1...l_ν} / D,   ν ∈ {1, ..., d}   (3)

are called Sobol' global sensitivity indices [3,4]. The results discussed above make clear that the mathematical treatment of the problem of providing global sensitivity analysis consists in evaluating total sensitivity indices and in particular Sobol' global sensitivity indices (3) of corresponding order. This leads to computing multidimensional integrals I = ∫_Ω g(x) p(x) dx, Ω ⊂ R^d, where g(x) is a square integrable function in Ω and p(x) ≥ 0 is a p.d.f. such that ∫_Ω p(x) dx = 1. The procedure for computation of the global sensitivity indices is based on the following representation of the variance D_y:

D_y = ∫ f(x) f(y, z′) dx dz′ − f_0²

(see [4]), where y = (x_{k_1}, ..., x_{k_m}), 1 ≤ k_1 < ... < k_m ≤ d, is an arbitrary set of m variables (1 ≤ m ≤ d − 1) and z is the set of the d − m complementary variables, i.e. x = (y, z). Let K = (k_1, ..., k_m) and let the complement of the subset K in the set of all parameter indices be denoted by K̄. The last equality allows us to construct a Monte Carlo algorithm for evaluating f_0, D, and D_y, where ξ = (η, ζ):

(1/N) Σ_{j=1}^{N} f(ξ_j) → f_0 (in probability),   (1/N) Σ_{j=1}^{N} f²(ξ_j) → D + f_0²,

(1/N) Σ_{j=1}^{N} f(ξ_j) f(η_j, ζ′_j) → D_y + f_0²,   (1/N) Σ_{j=1}^{N} f(ξ_j) f(η′_j, ζ_j) → D_z + f_0².

2.2 Monte Carlo Approach for Small Sensitivity Indices
The standard Monte Carlo algorithm for estimating global sensitivity indices, proposed in [3], is spoilt by a loss of accuracy when D_y ≪ f_0², i.e. in the case of small (in value) sensitivity indices. That is why here we have applied one of the approaches for evaluating small sensitivity indices — the so-called combined approach [2]. The concept of the approach consists in replacing the original integrand (the mathematical model function) with a function of the following type: ϕ(x) = f(x) − c, where c ∼ f_0. The following estimators for the variances have been proposed for this approach:

D_y = ∫ ϕ(x) [ϕ(y, z′) − ϕ(x′)] dx dx′,   D = ∫ ϕ(x) [ϕ(x) − ϕ(x′)] dx dx′.
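A minimal Python sketch of the combined approach with plain Monte Carlo sampling is given below; it assumes a vectorized model function f defined on the unit hypercube (the actual experiments use the domain [0.6; 1.4]^3, which would require a simple rescaling) and a shift c close to f_0, so it only illustrates the estimators above.

import numpy as np

def combined_first_order_index(f, d, subset, c, N, seed=0):
    # Plain Monte Carlo estimate of S_y = D_y / D with the combined approach,
    # phi(x) = f(x) - c; 'subset' lists the indices of the parameters in y.
    rng = np.random.default_rng(seed)
    x  = rng.random((N, d))              # xi  = (y, z)
    xp = rng.random((N, d))              # x'  = (y', z'), independent sample
    yz = xp.copy()
    yz[:, subset] = x[:, subset]         # the point (y, z')
    phi, phi_p, phi_yz = f(x) - c, f(xp) - c, f(yz) - c
    D_y = np.mean(phi * (phi_yz - phi_p))    # partial variance of y
    D   = np.mean(phi * (phi - phi_p))       # total variance
    return D_y / D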
3 Description of the Algorithms
Two Monte Carlo algorithms have been applied: Plain and Adaptive. Plain (Crude) Monte Carlo is the simplest possible MC approach for solving multidimensional integrals [1]. Let us consider the problem of the approximate computation of the integral I = ∫_Ω g(x) p(x) dx. Let ξ be a random point with a p.d.f.
p(x). Introduce the random variable θ = g(ξ), so that Eθ = ∫_Ω g(x) p(x) dx. Let the random points ξ_1, ξ_2, ..., ξ_N be independent realizations of the random point ξ with p.d.f. p(x) and θ_1 = g(ξ_1), ..., θ_N = g(ξ_N). Then an approximate value of I is θ̄_N = (1/N) Σ_{i=1}^{N} θ_i. The last equation defines the Plain Monte Carlo algorithm. There are various Adaptive Monte Carlo algorithms depending on the technique of adaptation [1]. Our Adaptive algorithm uses a posteriori information about the variance. The idea of the algorithm consists in the following: the domain of integration Ω is separated initially into subdomains of identical volume. The corresponding interval in every coordinate dimension is partitioned into M subintervals, i.e. Ω = ∪_j Ω_j, j = 1, ..., M^d. Denote by p_j and I_{Ω_j} the following expressions: p_j = ∫_{Ω_j} p(x) dx and I_{Ω_j} = ∫_{Ω_j} f(x) p(x) dx. Consider now a random point ξ^{(j)} ∈ Ω_j with a density function p(x)/p_j. In this case I_{Ω_j} = E[(p_j/N) Σ_{i=1}^{N} f(ξ_i^{(j)})]. The algorithm starts with a relatively small number M which is given as input data. For every subdomain the integral I_{Ω_j} and the variance are evaluated. Then the variance is compared with a preliminarily given value. The obtained information is used for the next refinement of the domain and for increasing the density of the random points. The subdomain with the largest standard deviation is divided into 2^d new subdomains. The algorithm stops when the standard deviation in all subdomains obtained after division satisfies the preliminarily given accuracy ε (or when a given maximum number of levels, or of subdomains where the stop criterion is not satisfied, has been reached).
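The adaptive refinement just described could be sketched as follows in Python; a uniform density and a vectorized integrand f are assumed, and the stopping rule is simplified to a fixed maximum number of splits, so this is only an illustrative sketch.

import itertools
import numpy as np

def adaptive_mc(f, a, b, d, M, n_sub, eps, max_splits=1000, seed=0):
    # Adaptive Monte Carlo integration of f over [a, b]^d with a uniform
    # density; the subdomain with the largest standard deviation is split
    # into 2^d children until the tolerance eps (or max_splits) is reached.
    rng = np.random.default_rng(seed)

    def estimate(lo, hi):
        x = rng.uniform(lo, hi, size=(n_sub, d))
        vals = f(x)
        vol = np.prod(hi - lo)
        return vol * vals.mean(), vol * vals.std() / np.sqrt(n_sub)

    h = (b - a) / M
    boxes = [(a + h * np.array(i), a + h * (np.array(i) + 1))
             for i in itertools.product(range(M), repeat=d)]
    parts = [estimate(lo, hi) + (lo, hi) for lo, hi in boxes]

    for _ in range(max_splits):
        parts.sort(key=lambda p: p[1])      # ascending standard deviation
        worst = parts[-1]
        if worst[1] <= eps:                 # all subdomains satisfy eps
            break
        parts.pop()
        lo, hi = worst[2], worst[3]
        mid, half = (lo + hi) / 2, (hi - lo) / 2
        for corner in itertools.product(*zip(lo, mid)):   # 2^d children
            c_lo = np.array(corner)
            parts.append(estimate(c_lo, c_lo + half) + (c_lo, c_lo + half))
    return sum(p[0] for p in parts)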
4 Analysis of Numerical Results and Discussion
The first stage of the computations includes the generation of input data for our procedure using UNI-DEM. The model runs have been done for the chemical rate variations with a fixed set of perturbation factors α = {α_i}, i = 1, ..., d, where every α_i corresponds to a chemical rate among the set of 69 time-dependent chemical reactions and 47 constant chemical reactions, and d is the total number of chemical reactions taken into account in the numerical experiments. The generated data are ratios of the following type:

r_s(α) = c_s^α(a_{i_s^max}, b_{j_s^max}) / c_s^max,   α_i ∈ {0.1, 0.2, ..., 2.0},

where the lower index s corresponds to the chemical species (pollutants). The denominator c_s^max = c_s^max(a_{i_s^max}, b_{j_s^max}) is the maximum mean value of the concentration of chemical species s obtained for α = (1, ..., 1), i.e. without any perturbations, a_{i_s^max} and b_{j_s^max} are the coordinates of the point where this maximum has been reached, and i^max, j^max are the mesh indices of this point. The numerator represents the values of the concentrations of the corresponding pollutant for a given set of values of the perturbation parameters α_i ∈ {0.1, ..., 2.0}, computed at the point (a_{i_s^max}, b_{j_s^max}). Thus we consider a set of pollutant concentrations normalized according to the maximum mean value of the concentration of the corresponding chemical species. We also study
Fig. 1. Sensitivity of ozone concentrations to changes of chemical rates. The curves show the ratios (related to k = 1.0) versus the factor applied to the rate of reaction 3 (time-dependent), reaction 22 (time-dependent) and reaction 6 (time-independent).

Fig. 2. Model function for x1 = 1, i.e. the rate related to the 3rd time-dependent reaction is fixed at 1.0; the surface shows the ratios related to ozone over the rates of the 22nd time-dependent and the 6th time-independent reactions.
numerically how different chemical rate reactions influence the concentrations of a given pollutant. An example of how the chemical rates of three different reactions (## 3, 6, and 22 of the CBM-4 chemical scheme) influence ozone concentrations (for July 1998) is presented in Figure 1. One can see that in this particular case the influence of reactions ## 3 and 22 is significant and the influence of reaction 6 is almost negligible. The sensitivity analysis computations consist of two steps: approximation and computation of Sobol' global sensitivity indices. As a result of computations with the use of UNI-DEM we obtain mesh functions in the form of tables of the values of the model function. The first step is to represent the model as a continuous function (1). To do that we use approximation by polynomials of third and fourth degree, where p_s(x) is the polynomial approximating the mesh function that corresponds to the s-th chemical species. The approximation domain Ω = [0.1; 2.0]^3 has been chosen a bit wider than the integration domain Ω = [0.6; 1.4]^3 in order to present the mesh model function more precisely. The squared 2-vector norm of the residual, defined as ‖p_s − r_s‖_2² = Σ_{l=1}^{n} [p_s(x_l) − r_s(x_l)]², x_l ∈ [a; b]^3, in the case of a polynomial of 4-th degree in three variables was ‖p_s − r_s‖_2² = 0.022 for x_l ∈ [0.1; 2.0]^3 and ‖p_s − r_s‖_2² = 0.00022 for x_l ∈ [0.6; 1.4]^3 in our numerical experiments. If one is not happy with the accuracy of the polynomial approximation, other tools should be used. So, we consider polynomials of 4-th degree in three variables as a case study which completely satisfies us at this stage. The approximate function is smooth but it has a single peak at one of the corners of the domain. A section of the model function graph (the first variable is fixed to 1.0) is presented in Figure 2. The adaptive approach seems to be promising for functions like this. The Adaptive Monte Carlo algorithm (see Section 3) has been applied to the problem of numerical integration. The results have been compared with Plain MC (see Section 3). One of the best available random number generators, the SIMD-oriented Fast Mersenne Twister (SFMT) [5] 128-bit pseudorandom number generator of period 2^19937 − 1, has been used to generate the required random points.
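A possible way to obtain such a polynomial approximation is a least-squares fit over all monomials of total degree up to four, as in the Python sketch below; the routine is only illustrative and is not the code used for the experiments.

import itertools
import numpy as np

def fit_polynomial_3d(points, values, degree=4):
    # Least-squares fit of a polynomial of total degree <= 'degree' in three
    # variables to the mesh values r_s; 'points' is an (n, 3) array of the
    # perturbation factors and 'values' are the corresponding ratios.
    exps = [e for e in itertools.product(range(degree + 1), repeat=3)
            if sum(e) <= degree]
    design = np.column_stack([np.prod(points ** np.array(e), axis=1) for e in exps])
    coeffs = np.linalg.lstsq(design, values, rcond=None)[0]

    def p_s(x):                          # the continuous model function p_s(x)
        x = np.atleast_2d(x)
        basis = np.column_stack([np.prod(x ** np.array(e), axis=1) for e in exps])
        return basis @ coeffs
    return p_s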
Table 1. First-order and total sensitivity indices of input parameters estimated using different approaches of sensitivity analysis applying the Plain Monte Carlo algorithm

Quantity    N      Standard (Sobol')         Combined
                   Est. value  Rel. error    Est. value  Rel. error
g0          10^4   0.5155      8e-05         0.2525      0.0002
            10^6   0.5156      3e-05         0.2526      7e-05
            10^7   0.5155      2e-05         0.2526      5e-05
D           10^4   0.2635      0.0002        0.0052      0.0081
            10^6   0.2636      6e-05         0.0052      0.0009
            10^7   0.2636      5e-05         0.0052      0.0012
S1          10^4   0.2657      0.0127        0.5349      0.0048
            10^6   0.2654      0.0116        0.5337      0.0025
            10^7   0.2653      0.0114        0.5325      0.0003
S3          10^4   0.2525      0.0013        0.0012      0.4060
            10^6   0.2521      0.0002        0.0019      0.0333
            10^7   0.2521      0.0002        0.0019      0.0213
S_x1^tot    10^4   0.4183      0.0002        0.5389      0.0021
            10^6   0.4185      0.0003        0.5392      0.0026
            10^7   0.4184      0.0001        0.5380      0.0003
S_x3^tot    10^4   0.4017      0.0002        0.0009      0.6078
            10^6   0.4018      5e-06         0.0022      0.0354
            10^7   0.4018      8e-06         0.0022      0.0192
Results for some first-order and total sensitivity indices obtained by the Plain Monte Carlo algorithm are presented in Table 1. Two approaches for the sensitivity indices have been applied — the standard (Sobol', 2001) and the combined approach. For the implementation of the combined approach a Monte Carlo estimate of f0 has been used — c = 0.51365. The following notation is used in the tables: ε is the desired estimate of the standard deviation, #sub is the number of subdomains after domain division, Nsub is the number of samples in each subdomain, N is the number of samples in the domain of integration (for the Plain algorithm), D is the variance; g0 is the integral over the domain [0.6; 1.4]^3, where the integrand is the model function f(x) for the standard approach and f(x) − c for the combined approach, respectively. The relative error is the absolute error divided by the exact value. Each estimated value is obtained after 10 algorithm runs. The exact values are known for our special case of mesh function approximation. One of the advantages of Sobol'-type approaches has been applied in the implementation of the Plain Monte Carlo algorithm — the possibility to compute first-order and total sensitivity indices of a given input parameter using only one Monte Carlo integral and two independent multidimensional sequences of random numbers. The results obtained confirm the expected decrease of the relative error with the increase of the number of samples. On the other hand, the relative error is larger, by orders of magnitude, for the sensitivity indices than for g0 and the variance D, for both approaches. These quantities are
Table 2. First-order and total sensitivity indices of input parameters estimated using the combined approach applying Plain and Adaptive Monte Carlo algorithms

Estimated   Plain                           Adaptive
quantity    N      Est.value  Rel.error     #sub  Nsub   ε       Est.value  Rel.error
g0          192    0.2516     0.0038        -     -      -       -          -
            7200   0.2524     0.0005        -     -      -       -          -
            32000  0.2524     0.0005        -     -      -       -          -
D           192    0.0055     0.0493        64    3      0.5     0.0056     0.0725
            7200   0.0052     0.0130        180   40     0.0165  0.0051     0.0333
            32000  0.0052     0.0034        64    500    0.1     0.0052     0.0003
S1          192    0.6502     0.2214        64    3      0.5     0.5072     0.0473
            7200   0.5299     0.0046        180   40     0.0165  0.5307     0.0031
            32000  0.5326     0.0004        64    500    0.1     0.5323     0.0001
S3          192    0.0055     1.7695        64    3      0.5     0.0016     0.1790
            7200   0.0009     0.5367        64    500    0.1     0.0027     0.3463
            32000  0.0013     0.3250        64    10^4   0.1     0.0019     0.0581
S_x1^tot    192    0.5875     0.0923        64    3      0.5     0.5108     0.0503
            7200   0.5346     0.0061        180   40     0.0165  0.5345     0.0061
            32000  0.5368     0.0020        64    500    0.1     0.5376     0.0004
S_x3^tot    192    0.0004     1.1693        64    3      0.5     0.0047     1.1013
            7200   0.0006     0.7529        180   40     0.0165  0.0021     0.0895
            32000  0.0018     0.2094        64    500    0.1     0.0022     0.0153
presented by only one (g0) or two (D) integrals, while each sensitivity index (first-order or total) is presented by a ratio of integrals estimated by the Plain Monte Carlo algorithm, which leads to an accumulation of errors. The value of the variance for the combined approach is much smaller than the value of the variance for the standard approach, and the division by that relatively small quantity leads to larger relative errors for total sensitivity indices using the combined approach. Nevertheless, the results for total sensitivity indices obtained by the combined approach are more reliable — the values of the total effects are fully consistent with the expected tendencies according to Figure 1. A comparison between computed first-order and total sensitivity indices obtained by the combined approach using Plain and Adaptive Monte Carlo algorithms is given in Table 2. The concepts of the combined approach and the developed adaptive approach require numerical integration over a 6-dimensional domain, i.e. twice as large as the dimension of the model function. That is why g0 has not been computed in this case. Since the total number of estimated quantities is seven — the variance and the main and total effects of three input parameters — the adaptive procedure applied to all of them would be inefficient with respect to its computational cost. The criterion for achieving the desired accuracy in computing the variance has been adopted as a common criterion in computing the other quantities because all main and total effects depend on the variance. In contrast
of that, two independent random sequences in the 3-dimensional domain [0.6; 1.4]^3 have been used for the implementation of the Plain Monte Carlo algorithm. The number of samples for both Monte Carlo techniques has been chosen following the requirement for consistency of the obtained results, i.e. the number of samples for the Plain algorithm is the product of the average number of subdomains (from several runs) and the number of samples in each subdomain. It has been observed that the computational times using the two Monte Carlo approaches for numerical integration (Plain and Adaptive) to estimate the unknown quantities are comparable. For example, t = 0.073 s for the Plain (N = 32 000) and t = 0.078 s for the Adaptive. Thus, the conclusions about the efficiency of the applied algorithms in computing the desired quantities can be made by comparing the orders of the estimated errors. The Adaptive algorithm has an advantage over the Plain algorithm for a fixed number of samples, which confirms the variance-reducing effect of the applied adaptive technique. Moreover, the approximate values of the quantities are sufficiently close to the exact values even for the smallest chosen number of samples.
Acknowledgment The research reported in this paper is partly supported by NATO Science under grant CLG.982641, as well as by the Bulgarian NSF Grants DO 02-215/2008 and DO 02-115/2008. The authors thank Z. Zlatev and Tz. Ostromsky for providing necessary data for our analysis and for stimulating discussions.
References 1. Dimov, I.: Monte Carlo Methods for Applied Scientists. World Scientific, Singapore (2008) 2. Saltelli, S.: Making best use of model valuations to compute sensitivity indices. Computer Physics Communications 145, 280–297 (2002) 3. Sobol, I.M.: Sensitivity estimates for nonlinear mathematical models. Mathematical Modeling and Computational Experiment 1, 407–414 (1993) 4. Sobol, I.M.: Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates. Mathematics and Computers in Simulation 55(1-3), 271–280 (2001) 5. Saito, M., Matsumoto, M.: SIMD-oriented Fast Mersenne Twister: a 128-bit Pseudorandom Number Generator. Monte Carlo and Quasi-Monte Carlo Methods 2006, pp. 607–622. Springer, Heidelberg (2008) 6. Saltelli, A., Tarantola, S., Campolongo, F., Ratto, M.: Sensitivity Analysis in Practice. A Guide to Assessing Scientific Models. John Wiley & Sons Publishers, Chichester (2004) 7. Zlatev, Z.: Computer Treatment of Large Air Pollution Models. Kluwer Academic Publishers, Dordrecht (1995) 8. Zlatev, Z., Dimov, I.: Computational and Numerical Challenges in Environmental Modelling. Elsevier, Amsterdam (2006)
Parallel Implementation of the Stochastic Radiosity Method Roel Martínez1 and Jordi Coma2 1
Institut d'Informàtica i Aplicacions, Universitat de Girona
[email protected] 2 Escola Politècnica Superior, Universitat de Girona
Abstract. Computing high quality images with many polygons or patches may take hours. For this reason parallel processing is a good option to decrease the computational cost. On the other hand, Monte Carlo methods are well suited to parallelization, given that they decompose naturally into independent subtasks. We have implemented the stochastic method for radiosity on a cluster of PCs. Results are presented for 1 to 8 processors, exhibiting good efficiency and scalability.
1
Introduction
Global illumination algorithms compute the single reflection of the light many times to simulate the multiple reflections. To obtain a single reflection at a point, the estimate of the incoming radiance from a direction should be weighted by the probability of reflection into a given direction and integrated over all possible incoming directions. Consequently, global illumination is basically a numerical integration problem. Thus global illumination is computationally very expensive, which means that obtaining a high quality image, with many polygons or patches, may take hours. This high cost has led researchers in the area to look into parallelization alternatives to reduce it [9]. Local Monte Carlo approaches sample the domain of integration randomly using a probability density p, evaluate the integrand f(x) there, and provide the ratio f/p as the primary estimate of the integral. This estimate is accurate if we can find a p that mimics f precisely, i.e. makes f/p as flat as possible. This strategy, commonly referred to as importance sampling, places more samples where the integrand is high. Since in practice p can be very far from the integrand, the estimator may have high variance. Thus, to get an accurate result, many independent primary estimators should be used and the secondary estimator computed as their average. Global Monte Carlo methods do not rely on finding a good sampling density. Instead, they take advantage of the fact that it is usually easy to evaluate f at a well structured set of sample points x1, ..., xn. The emphasis is on the fact that the simultaneous computation of f(x1), ..., f(xn) is much cheaper than the individual computation of f(x1), ..., and f(xn) by a local method, thus in this way we can have many more samples for the same
computational effort. The drawback of this technique is that finding a probability density that simultaneously mimics the integrand at many points is very difficult, thus practical methods usually use a uniform sampling probability. In this way, global methods are implemented using global uniformly distributed lines, first used in [3], in contrast with "local" lines generated from sampled points in the scene. The stochastic iteration algorithm for radiosity [16] is part of this family of global line algorithms, which has seen further development in [12,10,14,2,5]. On the other hand, task farm (or naive) Monte Carlo parallelization is based on the fact that we can decompose a Monte Carlo computation with n lines or rays into m independent smaller ones with n/m rays each, with no loss in precision [18,11,1]. We have implemented our parallel solution on a cluster of PCs. We chose a high performance cluster to decrease the execution time of the radiosity algorithm. The rest of the paper is organized as follows. In section 2 we present the stochastic iteration radiosity method. In sections 3 and 4 we describe the sequential and parallel implementations of the stochastic method, respectively. In section 5 we show our results, and finally we present our conclusions in section 6.
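To make the local estimator concrete, the sketch below averages N primary estimates f(x)/p(x) with importance sampling on [0, 1]. Both the integrand f and the density p are illustrative stand-ins and are not quantities from the radiosity method itself.

#include <random>
#include <cmath>
#include <iostream>

// Local Monte Carlo estimator: sample x ~ p, evaluate f(x)/p(x) as a primary estimate,
// and average N of them into the secondary estimate of the integral of f over [0,1].
static double f(double x) { return x * std::exp(-x); }
static double p(double x) { return 2.0 * x; }           // assumed density that roughly mimics f

int main() {
    std::mt19937 gen(1);
    std::uniform_real_distribution<double> u(0.0, 1.0);
    const int N = 100000;
    double sum = 0.0;
    for (int i = 0; i < N; ++i) {
        double x = std::sqrt(1.0 - u(gen));   // inverse-CDF sampling of p(x) = 2x
        sum += f(x) / p(x);                   // primary estimate
    }
    std::cout << "secondary estimate: " << sum / N << "\n";
}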
2
Stochastic Iteration Method
Szirmay-Kalos presented in [16] a new method based on a stochastic iteration scheme (see [15]), where a random operator is selected in each iteration. This new method is a finite element based iteration method, which uses a single ray or a set of ray bundles to transfer energy in a single random direction. The concept of stochastic iteration has been proposed and applied to the diffuse radiosity problem in [6,7,8,13]. The basic idea of stochastic iteration is that, instead of approximating the transport operator in a deterministic way, a much simpler random operator is used during the iteration which in most cases has the same behavior as the real operator. Szirmay-Kalos basically used two operators: the first one is a single ray based transport operator and the second one is a ray bundle based operator. The first operator uses a single ray having a random origin and a direction generated with a probability proportional to the cosine weighted radiance at this point. This ray transports the whole energy to the point which is hit by the ray. The second operator transfers the radiance of all surface points of the scene in a single random direction. The algorithm works as follows: the scene is tessellated into patches and it is assumed that a patch has uniform radiance in a given direction (but this does not mean that the patch has the same radiance in every direction, thus the non-diffuse case can also be handled). In order to evaluate the transport operator, a random direction and a plane perpendicular to this random direction are defined, the so-called transillumination direction and plane, respectively. The plane is decomposed into n × n pixels. All patches of the scene are projected onto the transillumination plane and it is computed which patches are visible from a
Fig. 1. Bundle of parallel lines that, exiting patch i, reaches patch j. Pp is a plane perpendicular to the direction of the bundle, and the thick lines represent the common projected area of patches i and j onto the plane Pp.
Fig. 2. First shot method. (a) Scene with a light source (LS) and two objects. (b) The light source sends its energy to the scene. (c) The walls, floor and the objects received energy from the light source. The dash-dot line represents the light source and the thick lines represent surfaces which have received some energy.
given patch, in other words, the projected area of a patch that is visible from a given patch. This computation is done by counting the number of pixels that two face-to-face patches have in common after the projection. Finally, for each patch in the scene the transfer of energy is done using the common projected areas between patches (see figure 1). The stochastic iteration method is only efficient in "smooth" scenes where the emittance occupies a large part of the scene and is more or less equilibrated. For this reason a first shot (see figure 2), distributing the direct illumination before applying the algorithm, is necessary [4,17].
3
Sequential Implementation
The sequential implementation first reads the scene geometry and then the scene is subdivided into patches. Second, the first shot step is computed, with a number of lines predefined by the user. Third, the stochastic method with a predefined number of lines is applied. Finally, the resulting scene is saved. The first shot step and the stochastic method are given by the following pseudo-code:

begin firstShot()
  for i=0 to total number of patches do
    if patch[i] is a source then
      compute number of rays for this source according to its power
      for j=0 to number of rays do
        cast a local line
        transfer energy from the source to the first intersected patch
      endFor
    endIf
  endFor
end

begin stochastic()
  create a bounding sphere for the whole scene
  for i=0 to total number of ray bundles do
    send_Bundle()
  endFor
end
where the send_Bundle() function computes the projection of all patches, for a random direction, onto the projection plane and exchanges the energy between the patches. The most time-consuming parts of our sequential implementation are the firstShot and stochastic steps.
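One ingredient of send_Bundle() that can be shown compactly is the choice of the random transillumination direction and of an orthonormal basis for the plane perpendicular to it. The sketch below is only an illustration under our own naming (Vec3); it is not taken from the authors' code.

#include <random>
#include <cmath>
#include <iostream>

// Pick a transillumination direction uniformly on the unit sphere and build an
// orthonormal basis (u, v) spanning the projection plane perpendicular to it.
struct Vec3 { double x, y, z; };

int main() {
    const double PI = 3.14159265358979323846;
    std::mt19937 gen(7);
    std::uniform_real_distribution<double> U(0.0, 1.0);

    double z = 2.0 * U(gen) - 1.0;                 // cos(theta) uniform in [-1, 1]
    double phi = 2.0 * PI * U(gen);
    double r = std::sqrt(1.0 - z * z);
    Vec3 d{ r * std::cos(phi), r * std::sin(phi), z };

    Vec3 a = (std::fabs(d.x) < 0.9) ? Vec3{1, 0, 0} : Vec3{0, 1, 0};
    Vec3 u{ d.y * a.z - d.z * a.y, d.z * a.x - d.x * a.z, d.x * a.y - d.y * a.x }; // u = d x a
    double n = std::sqrt(u.x * u.x + u.y * u.y + u.z * u.z);
    u.x /= n; u.y /= n; u.z /= n;
    Vec3 v{ d.y * u.z - d.z * u.y, d.z * u.x - d.x * u.z, d.x * u.y - d.y * u.x }; // v = d x u

    std::cout << "direction: " << d.x << " " << d.y << " " << d.z << "\n";
    std::cout << "plane basis: (" << u.x << "," << u.y << "," << u.z << ") and ("
              << v.x << "," << v.y << "," << v.z << ")\n";
}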
4
Parallel Implementation
In our parallel implementation we have two regions in our code: the sequential region and the parallel region. In our case the parallel region is given by the first shot step and the stochastic method. Every cluster node has its own copy of the scene data and of the radiosity solution vector. Thus, every node computes its own solution of the first shot step and of the stochastic method according to the code explained in the previous section. The sequential region is given by three sections. The first section is loading the data at the beginning of the process. The second one is the combination of the results after the first shot and the distribution of this result to all the nodes. The third one is the computation of the final results at the end of the process. The tasks for each node are balanced because the predefined number of rays and of bundles of rays is divided by the number of processors. For every local line cast from the light source the first intersection is computed and then the intersected surface increments its accumulated power. The cost of every local line is almost the same. In the case of bundles of rays the cost can differ because it depends on the number of patches projected on every pixel. Considering that a scene needs hundreds or thousands of projections, the average projection cost will be almost the same at the end of the process. On the other hand, the algorithmic cost depends on the communication cost plus the master node CPU time. Our communication cost is very low because
Fig. 3. Museum stairs scene with 614 778 patches computed using 100 million rays for the first shot step and 3000 bundles of rays for the stochastic method
there are just two communication steps in our implementation. First, a bidirectional communication process is used after the first shot step, where the master node combines the results of all nodes. Second, a unidirectional communication step is applied at the end of the process. Finally, the master node CPU adds, for every scene patch, all the radiosity results and divides them by the number of processors. Thus the execution time in the master node is proportional to the number of patches.
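A minimal sketch of these two communication steps using standard MPI calls is given below. The per-patch data layout (a flat array patchPower) and the scene size are assumptions for illustration; they are not taken from the authors' implementation.

#include <mpi.h>
#include <vector>

int main(int argc, char** argv) {
    MPI_Init(&argc, &argv);
    int rank, nprocs;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &nprocs);

    const int numPatches = 1000;                      // placeholder scene size
    std::vector<double> patchPower(numPatches, 0.0);

    // ... each node runs its share (rays / nprocs) of the first shot ...

    // Step 1: bidirectional exchange after the first shot, so that every node
    // starts the stochastic iterations from the combined first-shot result.
    MPI_Allreduce(MPI_IN_PLACE, patchPower.data(), numPatches,
                  MPI_DOUBLE, MPI_SUM, MPI_COMM_WORLD);

    // ... each node runs its share (bundles / nprocs) of the stochastic method ...

    // Step 2: unidirectional gather on the master node, which then averages
    // the independent per-node solutions over the number of processors.
    std::vector<double> total(numPatches, 0.0);
    MPI_Reduce(patchPower.data(), total.data(), numPatches,
               MPI_DOUBLE, MPI_SUM, 0, MPI_COMM_WORLD);
    if (rank == 0)
        for (double& p : total) p /= nprocs;

    MPI_Finalize();
    return 0;
}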
5
Results
We have used a high performance cluster composed of four HP ProLiant DL145 servers; each server has two AMD Opteron 244 (1.8 GHz) processors and 2 GB of RAM. A 100 Mbps network switch is used. The cluster has been configured with OSCAR (Open Source Cluster Application Resources) on a GNU/Linux SuSE 10.0 OS. We did our implementation in C++ with the LAM/MPI (Message Passing Interface) libraries. We tested our implementation with the museum stairs and airplane cabin scenes (see figures 3 and 4). Speed-up and efficiency are measures that indicate how well a program has been parallelized. Let T(p) be the execution time on p CPUs. The speed-up S(p) and efficiency E(p) are defined as

S(p) = T(1)/T(p),    E(p) = S(p)/p,    p = 1, 2, 3, ..., n.
Fig. 4. Airplane cabin scene with 438 518 patches computed using 100 million rays for the first shot step and 3000 bundles of rays for the stochastic method
Figure 5 shows the speed-up and efficiency of our parallel implementation for the museum stairs scene in figure 3. We can see that the speed-up stays around 90% of the ideal value for different numbers of CPUs. The efficiency E(p) remains nearly constant over different numbers of processors, so our implementation has good scalability. The projection plane was defined with 256 by 256 pixels, so the projection plane simulates more than 65 thousand global lines.
Fig. 5. (a) Speed-up and (b) Efficiency for the museum stairs scene in figure 3
The museum stairs scene was subdivided into 614 778 patches; 100 million rays for the first shot step and 3000 bundles of rays for the stochastic method were cast. The execution time using the sequential implementation was 1395.2 seconds (485.1 seconds for the first shot and 910.1 for the stochastic method), and for our parallel implementation, using 8 CPUs, it was 197.9 seconds (70.3 seconds for the first shot and 127.6 for the stochastic method). The airplane cabin scene was subdivided into 438 518 patches; again 100 million rays for the first shot step and 3000 bundles of rays for the stochastic method were cast. The execution times for the sequential and parallel implementations were 967.3 seconds (337.9 seconds for the first shot and 629.2 for the stochastic method) and 137.3 seconds (47 seconds for the first shot and 90.3 for the stochastic method), respectively. This scene shows similar speed-up and efficiency to the museum stairs scene.
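From these timings the 8-CPU speed-up and efficiency follow directly from the definitions given above: for the museum stairs scene S(8) = 1395.2/197.9 ≈ 7.05 and E(8) ≈ 0.88, and for the airplane cabin scene S(8) = 967.3/137.3 ≈ 7.04 and E(8) ≈ 0.88, in agreement with the roughly 90% efficiency visible in Figure 5.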
6
Conclusions
We have presented a parallel implementation of the stochastic iteration method for radiosity. The implementation has been done on a cluster of PCs. Tests have been carried out on a system containing 8 processors, showing good efficiency and scalability. As future work it is possible to implement other related global line Monte Carlo algorithms. Also, other architectures, like multicore shared memory, are a good option for Monte Carlo methods.
Acknowledgement This project has been funded in part with grant number TIN2007-68066-C04-01 of the Spanish Government.
References 1. Alme, H., Rodrigue, G., Zimmerman, G.: Domain decomposition methods for parallel laser-tissue models with Monte Carlo transport. In: Niederreiter, H., Spanier, J. (eds.) Proceedings of the Third International Conference on Monte Carlo and Quasi-Monte Carlo methods in Scientific Computing, Claremont, California, USA. Springer, Heidelberg (1998) 2. Bekaert, P.: Hierarchical and Stochastic Algorithms for Radiosity. PhD thesis, Department of Computer Science, Katholieke Universiteit Leuven, Leuven, Belgium (1999) 3. Buckalew, C., Fussell, D.: Illumination Networks: Fast Realistic Rendering with General Reflectance Functions. Computer Graphics (ACM SIGGRAPH 1989 Proceedings) 23, 89–98 (1989) 4. Castro, F., Mart´ınez, R., Sbert, M.: Quasi-Monte Carlo and extended first-shot improvement to the multi-path method. In: Szirmay-Kalos, L. (ed.) Proc. Spring Conference on Computer Graphics 1998, Budimerce, Slovakia. Comenius University, pp. 91–102 (1998), http://www.dcs.fmph.uniba.sk/~ sccg/proceedings/1998.index.htm
5. Mart´ınez, R.: Adaptive and Depth Buffer Solutions with Bundle of Parallel Rays for Global Line Monte Carlo Radiosity. PhD thesis, Universitat Polit`ecnica de Catalunya, Barcelona, Spain (2004), http://ima.udg.es/~ roel 6. Neumann, L., Feda, M., Kopp, M., Purgathofer, W.: A New Stochastic Radiosity Method for Highly Complex Scenes. In: Fifth Eurographics Workshop on Rendering, Darmstadt, Germany, pp. 195–206 (1994) 7. Neumann, L.: Monte Carlo Radiosity. Computing 55(1), 23–42 (1995) 8. Neumann, L., Purgathofer, W., Tobler, R.F., Neumann, A., Elias, P., Feda, M., Pueyo, X.: The Stochastic Ray Method for Radiosity. In: Hanrahan, P.M., Purgathofer, W. (eds.) Rendering Techniques 1995 (Proceedings of the Sixth Eurographics Workshop on Rendering), New York, pp. 206–218. Springer, Heidelberg (1995) 9. Reinhard, E., Chalmers, A.G., Jansen, F.W.: Overview of Parallel Photo-Realistic Graphics. In: Eurographics 1998 State of the Art Reports, pp. 1–25 (1998), http://www.cs.bris.ac.uk/Tools/Reports/Authors/alan.html 10. Sbert, M.: The Use of Global Random Directions to Compute Radiosity: Global Monte Carlo Techniques. PhD thesis, Universitat Polit`ecnica de Catalunya, Barcelona, Spain (1997), http://ima.udg.es/~ mateu 11. Sbert, M., Perez, F., Pueyo, X.: Global Monte Carlo: A Progressive Solution. In: Hanrahan, P.M., Purgathofer, W. (eds.) Rendering Techniques 1995 (Proceedings of the Sixth Eurographics Workshop on Rendering), New York, pp. 231–239. Springer, Heidelberg (1995) 12. Sbert, M., Pueyo, X., Neumann, L., Purgathofer, W.: Global Multipath Monte Carlo Algorithms for Radiosity. The Visual Computer 12(2), 47–61 (1996) 13. Szirmay-Kalos, L., Foris, T., Neumann, L., Csebfalvi, B.: An Analysis of QuasiMonte Carlo Integration Applied to the Transillumination Radiosity Method. Computer Graphics Forum (Proceedings of Eurographics 1997) 16(3), C271–C281 (1997) 14. Szirmay-Kalos, L., Purgathofer, W.: Global Ray-Bundle Tracing with Hardware Acceleration. In: Drettakis, G., Max, N. (eds.) Rendering Techniques 1998 (Proceedings of Eurographics Rendering Workshop 1998), New York, pp. 247–258. Springer Wien, Heidelberg (1998) 15. Szirmay-Kalos, L.: Stochastic methods in global illumination — state of the art report. Technical Report TR-186-2-98-23, Vienna University of Technology, Vienna, Austria (1998), http://www.fsz.bme.hu/~ szirmay/puba.html 16. Szirmay-Kalos, L.: Stochastic Iteration for Non-Diffuse Global Illumination. Computer Graphics Forum (Proceedings Eurographics 1999) 18, C-233–C-244 (1999) 17. Szirmay-Kalos, L., Sbert, M., Mart´ınez, R., Tobler, R.F.: Incoming First-Shot for Non-Diffuse Global Illumination. In: Spring Conference on Computer Graphics, Budmerice, Slovakia (2000), http://www.fsz.bme.hu/~ szirmay/puba.htm 18. Zareski, D., Wade, B., Hubbard, P., Shirley, P.: Efficient parallel global illumination using density estimation. In: Proceedings of Visualization 1995 — Parallel Rendering Symposium, pp. 219–230 (1995)
Monte-Carlo Modeling of Electron Kinetics in Room Temperature Quantum-Dot Photodetectors Vladimir Mitin, Andrei Sergeev, Li-Hsin Chien, and Nizami Vagidov University at Buffalo, The State University of New York, Buffalo NY 14260, USA
[email protected]
Abstract. Results of our many-particle Monte-Carlo modeling of the kinetics and transport of electrons in InAs/GaAs quantum-dot infrared photodetectors are reviewed. We studied the dependence of the electron capture time on the electric field at different heights of the potential barriers around the dots. The capture time is almost independent of the electric field up to a critical field of about 1 kV/cm, and then substantially decreases as the field increases. We found that the capture time has an exponential dependence on the inverse of the average electron energy, which is in agreement with theory. Our results show that controllable kinetics in quantum-dot structures may provide a significant increase in the photoconductive gain, device detectivity, and responsivity.
1
Introduction
There are numerous applications, ranging from infrared (IR) cameras and remote sensing to healthcare, which require sensitive scalable IR sensors. Currently, IR technologies are mainly based on quantum-well infrared photodetectors (QWIPs). QWIPs are widely utilized in various sensors and imaging devices operating at 77 K and below. However, QWIPs have drawbacks such as insensitivity to the normal incidence radiation and fast carrier capture at room temperature. Therefore, one of the main goals for the next generation of imaging IR systems is to increase the operating temperature without reduction in sensitivity.
2
Simulation Models
It has been expected that quantum-dot infrared photodetectors (QDIPs) (see Fig. 1) may operate at room temperature. Theory has predicted a long photoexcited electron lifetime due to the reduced relaxation rate associated with the "phonon bottleneck". This effect assumes that only specific phonons can provide electron transitions between the narrow atom-like energy levels in quantum dots (QDs). However, experiments showed that the electron relaxation in QDs is only slightly slower than in quantum wells [2]. Nevertheless, there are
Fig. 1. Schematic cross-section of QD-structure with vertical electron transport
Fig. 2. QD-structure with the local potential barriers around QDs
still some other possibilities to suppress the capture processes. In order to improve the detector performance, one should suppress photoelectron capture into QDs, which means that one should separate the conducting electron states, which provide electron transport, from the localized states, which are excited by IR radiation. The QDs in real space can be separated from the conducting channels by potential barriers, which prevent the photoelectron capture. Moreover, the potential barriers may also be used to suppress the thermionic emission from QDs. In our papers [6,7], two realizations of potential barriers have been suggested: 1) local potential barriers may be formed around each QD (see Fig. 2) and 2) collective potential barriers may be formed in modulation-doped quantum-dot structures (see Fig. 3). In the first case the local potential barriers may be formed by means of homogeneous doping of the interdot space. The potential barriers are formed by the electrons trapped in QDs and by the ions in the depletion regions. There are two main mechanisms of photoelectron capture in QDIP structures. One is due to electron tunneling (see Fig. 4(a)), and the other one is due to thermo-excitation (see Fig. 4(b)). The height of the potential barriers can be controlled over a wide range by varying the host materials and the doping level, and by changing the characteristic length scale associated with the quantum-dot structure. In our model, the detailed form of the potential barriers is not critical. The only important assumption we accept is that the probability of
Fig. 3. Collective barriers and conducting channels in the modulation-doped quantum-dot structure
Fig. 4. Photoelectron capture due to: (a) electron tunneling and (b) thermo-excitation; Vm is the barrier height
Fig. 5. Schematic cross-section of the MD-QDIP structure
Fig. 6. Potential profile of the MD-QDIP structure with lateral electron transport
tunneling processes is small compared with the capture probability via thermo-excitation. The collective barriers may be created if the electrons populating the QDs are taken relatively far from the QDs. The potential barriers shown in Fig. 3 separate the groups of QDs from the electrons with high mobility in the undoped conducting channels. Currently, most QDIP structures use electron tunneling in the vertical direction through the stacked QDs (see Fig. 1). However, in such structures it might be difficult to have high electron mobility and high gain. An alternative structure was suggested, which is based on lateral transport along the heterointerfaces [3,4] (see its cross-section and potential profile in Figs. 5 and 6). This modulation-doped quantum-dot infrared photodetector (MD-QDIP) is one of the possible realizations of structures with collective potential barriers. In such structures long carrier lifetimes can be achieved due to the large distances between the QDs and the heterointerfaces. High responsivity can be achieved due to the long photo-excited carrier lifetimes and the high-mobility transport of carriers. The optimal structures will have the advantages of quantum wells (strong coupling to radiation due to the large number of electrons in localized states), as well as the advantages of QDs (manageable kinetics and a low capture rate), which make them promising candidates for advanced IR detectors.
3
Results of Simulation
We have developed a many-particle Monte-Carlo program for the simulation of three-dimensional electron transport and capture processes by QDs. Such a method is an effective and versatile tool for self-consistent simulations and for the study of transient processes. Our program includes all basic scattering processes, including electron scattering on acoustic, polar optical, and intervalley phonons. The modeling includes electrons in the Γ-, L-, and X-valleys and takes into account the redistribution of carriers between the valleys. In the current paper we present the results of numerical simulations for two types of QDIPs: 1) the one with local potential barriers around QDs (see Fig. 2) and 2) the one with collective barriers and
Fig. 7. Capture time, τcapt , as a function of Vm . The inset shows the radius of a QD, a, and interdot distance, b.
Fig. 8. Capture time, τcapt , as a function of electric field, Ey , at different potential barrier heights, Vm
conducting channels (MD-QDIP structure) (see Figs. 3 and 6). We assume that the intradot relaxation processes can be associated with the inelastic electron-phonon scattering in the InAs dot area (Fig. 4(b)). We consider the electron capture process as a specific scattering process: (i) which is limited in space by the dot volume and (ii) during which a carrier transits from a conducting state to a localized bound state, which is below the potential barrier, Vm. Let us discuss the results of the simulation for the structure with local potential barriers around QDs. Figure 7 demonstrates significant changes in the dependence of the electron capture time, τcapt, on the height of the local barriers, Vm, in fields higher than 1 kV/cm. Figure 8 shows the dependence of the capture time, τcapt, on the electric field, Ey, for different heights of the potential barrier, Vm. The capture time, τcapt, is practically independent of Ey up to a critical field of the order of 1 kV/cm. At fields higher than the critical electric field the capture time substantially decreases as the field increases. Let us analyze the obtained simulation results in the framework of the electron heating model (for the detailed equations, please see Ref. [1]). The dependence of the average electron energy, ε̄, on the electric field, Ey, is shown in Fig. 9. The height of the barriers, Vm, does not change the energy ε̄ gained in the electric field, Ey. Figure 10 shows the exponential dependence of the capture time, τcapt,
Fig. 9. Average electron energy, ε¯, as a function of the electric field, Ey
Fig. 10. Capture time, τcapt, as a function of the inverse average electron energy, 1/ε̄
Fig. 11. Photoconductive gain as a function of the electric field, Ey
on the inverse average electron energy, 1/ε̄. We will compare these simulation results with the analytical ones later. Figure 11 shows the photoconductive gain as a function of Ey for a device with a length of 1 μm. The gain reaches its maximum at Ey ≈ 1 kV/cm, which is the characteristic field for the dependences shown in Figs. 7 and 8. The non-monotonic dependence of the gain has a simple explanation. At low electric fields τcapt is almost constant, while the gain increases due to the decrease of the transit time, ttr. At high electric fields, the gain decreases due to the exponential decrease of τcapt. Let us discuss the results obtained for the MD-QDIP structure with the collective barriers. We have used a slightly simplified potential profile of the GaAs layer: instead of a self-consistent potential profile, shown in Fig. 6 by the solid line, we use its triangular approximation, shown in the same figure by the dash-dotted line. We used a value of the electric field perpendicular to the heterointerface, Ex, equal to 11 kV/cm. Figure 12 shows the dependence of τcapt on the GaAs thickness, d. As expected, τcapt increases as d increases. Two factors are responsible for the increase of τcapt: 1) the increase of the distance between the dots in the x-direction and 2) the increase of the height of the effective potential barrier, Vm. With the increase of d, the electrons are more and more localized at the AlGaAs/GaAs interface. Figure 12 demonstrates an exponential dependence on d, which is in good agreement with
Fig. 12. Electron capture time, τcapt , as a function of GaAs thickness, d
Fig. 13. Electron capture time, τcapt , as a function of longitudinal electric field, Ey
Fig. 14. Electron capture time, τcapt, as a function of the inverse energy 1/ε̄
Fig. 15. The capture time, τcapt , as a function of the inverse temperature, 1/T
the experimental results of [3]. We can compare the results obtained for the model with the collective barriers with those obtained for the model with the local potential barriers. Figure 13 shows the dependence of τcapt on the applied longitudinal electric field, Ey. This dependence has the same characteristic features as the dependences obtained for the model with the local potential barriers (see the dependence with Vm = 0.1 eV in Fig. 8). Figure 14 shows the influence of electron heating on the capture time, τcapt: it shows the exponential dependence of τcapt on the inverse average electron energy, 1/ε̄. This result is almost the same as in Fig. 10, which was obtained for the model with local potential barriers. The comparison of the two results demonstrates that the electron heating model can also be applied to the structures with collective barriers. The operating temperature of MD-QDIPs plays a significant role in the device performance. Figure 15 shows the exponential dependence of τcapt on the inverse temperature, 1/T.
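As a rough illustration of how such many-particle Monte-Carlo simulators advance the electron ensemble, the sketch below shows the standard free-flight / scattering-selection kernel with self-scattering. All rates are arbitrary placeholders and do not correspond to the actual InAs/GaAs scattering rates of our program.

#include <random>
#include <cmath>
#include <iostream>

int main() {
    std::mt19937 gen(3);
    std::uniform_real_distribution<double> U(0.0, 1.0);

    const double gamma = 1.0e14;            // constant total rate incl. self-scattering [1/s]
    const double lamAcoustic = 2.0e13;      // placeholder acoustic-phonon rate [1/s]
    const double lamPolarOptical = 5.0e13;  // placeholder polar-optical-phonon rate [1/s]
    double time = 0.0;

    for (int step = 0; step < 10; ++step) {
        double dt = -std::log(1.0 - U(gen)) / gamma;   // free-flight duration
        time += dt;
        // ... drift the particle in the electric field during dt ...
        double r = U(gen) * gamma;                     // choose the terminating mechanism
        if (r < lamAcoustic)
            std::cout << time << " s: acoustic-phonon scattering\n";
        else if (r < lamAcoustic + lamPolarOptical)
            std::cout << time << " s: polar-optical-phonon scattering\n";
        else
            std::cout << time << " s: self-scattering (state unchanged)\n";
    }
}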
4
Analytical Results
Here, as in our previous works [6,7,1], we have considered a quantum-dot structure with the local potential barriers around QDs. Without tunneling processes, the photoelectron capture rate,

1/τcapt = Nd σ ṽ,    (1)

is given by the equation for the trapping cross-section [1],

σ = παa² exp(−eVm/kT) [1 + (3αa/4l) F(V)]^(−1),    (2)

where

F(V) = a exp(eVm/kT) ∫_a^b exp(−eV(r)/kT) dr/r².    (3)

ṽ is the electron thermal velocity, Nd is the concentration of QDs, a is the radius of the dot, b is the interdot distance, l is the electron mean-free path with respect
to elastic electron scattering, α is the probability for an electron at r ≤ a to be captured by the QD, and Vm is the maximum value of the potential barrier, i.e. Vm = V(a). We would like to emphasize that (2) and (3) are valid for any relation between l, a, and αa as well as for a wide variety of potentials. To compare these results with the conclusions of [5], let us consider the flat-potential approximation V = 0. In this case,

σ = παa² [1 + (3αa/4l)]^(−1).    (4)

Following Ref. [5], we accept that the inelastic intradot relaxation processes are described by the relaxation time τε. In this case, the coefficient α can be evaluated as α ≈ a/lε, where lε = ṽτε and ṽ is the electron thermal velocity in the dot. Then, if a² ≪ l lε, we obtain for the capture rate:

1/τcapt = πNd a³ ṽ/(ṽτε).    (5)
In the opposite case, a² ≫ l lε, the capture rate is independent of the coefficient α and is given by:

1/τcapt = 4πNd Da,    (6)

where D = ṽl/3 is the diffusion coefficient. Both limiting cases are in agreement with the results of [5]. Note that the second term in the brackets in (4) describes the reduction of the carrier concentration near the dot due to the capture processes. As we already discussed, this effect becomes important in the electron capture process if l < αa, which results in (6). In the opposite case, the electron concentration is practically homogeneous in space and the carrier capture is determined by (5). Let us note that in the presence of the potential barriers the second term in the brackets in (2) also describes the reduction of the carrier concentration near the dot. Due to the repulsive potential barriers this effect is increased by the factor F(V) given by (3). Comparing this result with the electron capture by repulsive impurity traps, one can associate F(V) with the Sommerfeld factor, which shows the reduction of the carrier density (electron wavefunction) near the trap. If the local reduction of the carrier density is negligible, the capture rate is equal to:

1/τcapt = πNd a³ (ṽ/(ṽτε)) exp(−eVm/kT),    (7)

where the exponential factor describes the effect of the potential barriers on the capture processes. As seen in Fig. 10, τcapt is proportional to exp(1/ε̄). Therefore, we may conclude that the carrier capture processes in the electric field may be described by (7), where the thermal energy kT has to be replaced by a factor of 2ε̄/3. Besides, Eq. (7) shows that the capture time has an exponential dependence on the inverse thermal energy 1/kT, which corresponds with the result shown in Fig. 15. Thus, our model correctly describes the electron kinetics in MD-QDIPs.
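A quick numerical illustration of Eq. (7) is given below. All parameter values (dot concentration, dot radius, intradot relaxation time, barrier height, temperature, and the ratio of the thermal velocities, taken here as 1) are assumptions chosen for the example only and are not the parameters of the structures simulated above.

#include <cmath>
#include <iostream>

int main() {
    const double PI   = 3.14159265358979323846;
    const double Nd   = 1.0e17 * 1.0e6;    // assumed dot concentration: 1e17 cm^-3 in m^-3
    const double a    = 5.0e-9;            // assumed dot radius [m]
    const double tauE = 1.0e-12;           // assumed intradot relaxation time [s]
    const double vRatio = 1.0;             // ratio of the thermal velocities, taken as 1
    const double Vm   = 0.1;               // assumed barrier height [eV]
    const double kT   = 0.0259;            // thermal energy at 300 K [eV]

    double rate = PI * Nd * a * a * a * (vRatio / tauE) * std::exp(-Vm / kT);
    std::cout << "1/tau_capt = " << rate << " 1/s, tau_capt = " << 1.0 / rate << " s\n";
}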
5
Conclusion
Novel properties of QDIPs may be realized through the manipulation of dots and potential barriers created by the selective doping in quantum-dot structures. The structure with the local potential barriers around dots and lateral photoconductivity transport may be used to separate the localized intradot electron states from the conducting states in the matrix and, in this way, to control all electron processes. The controllable kinetics may provide significant increase in the photoconductive gain, device detectivity, and responsivity.
References 1. Chien, L.H., Sergeev, A., Vagidov, N., Mitin, V.: Hot-electron transport in quantumdot photodetectors. International Journal of High Speed Electronics and Systems 18, 255–264 (2008) 2. Ferreira, R., Bastard, G.: Phonon-assisted capture and intradot Auger relaxation in quantum dots. Appl. Phys. Lett. 74, 2818–2820 (1999) 3. Hirakawa, K., Lee, S.-W., Lelong, P., Fujimoto, S., Hirotanu, K., Sakaki, H.: Highsensitivity modulation-doped quantum-dot infrared photodetectors. Microelectronic Engineering 63, 185–192 (2002) 4. Lee, S.-W., Hirakawa, K.: Lateral conduction quantum-dot infrared photodetectors using photoionization of holes in InAs quantum dots. Nanotechnology 17, 3866–3868 (2006) 5. Lim, H., Movaghar, B., Tsao, S., Taguchi, M., Zhang, W., Quivy, A.A., Razeghi, M.: Gain and recombination dynamics of quantum-dot infrared photodetectors. Phys. Rev. B 74, 205321–1–8 (2006) 6. Mitin, V.V., Pipa, V.I., Sergeev, A.V., Dutta, M., Stroscio, M.: High-gain quantumdot infrared photodetector. Infrared Physics and Technol. 42, 467–472 (2001) 7. Vagidov, N., Sergeev, A., Mitin, V.: Infrared quantum-dot detectors with diffusionlimited capture. International Journal of High Speed Electronics and Systems 17, 585–591 (2007)
Particle Model of the Scattering-Induced Wigner Function Correction M. Nedjalkov1,2 , P. Schwaha1 , O. Baumgartner1, and S. Selberherr1 1
2
Institute for Microelectronics, TU Wien Gußhausstraße 27-29/E360, A-1040 Vienna, Austria Institute for Parallel Processing, Bulgarian Academy of Sciences Acad. G.Bontchev, Bl 25A, 1113 Sofia, Bulgaria
Abstract. The ability to account for quantum-coherent and phase-breaking processes is a major feature of the Wigner transport formalism. However, the coherent case is only obtained at significant numerical cost. Therefore, a scheme which uses coherent data obtained with a Green's function formalism has been developed. This scheme calculates the necessary corrections due to scattering using the Wigner approach and the associated Boltzmann collision models. The resulting evolution problem is not only derived theoretically, but simulation results are presented as well.
1
Introduction
Modeling and simulation of electronic devices is a part of science where knowledge of mathematics, physics, and electrical engineering is required simultaneously to design, analyze, and optimize these core components of integrated circuits. For the semiconductor industry it appears as the only alternative to the enormously expensive trial-and-error manufacturing approach. By means of device modeling and simulation the physical characteristics of semiconductor devices are explored in terms of charge transport and electrical behavior [2]. The progress in this field depends on the level of complexity of the transport models and on the application of efficient numerical and programming techniques. Physics provides a hierarchy of charge transport models, summarized in Table 1. The ongoing miniaturization of devices forces the use of ever more sophisticated models to be able to capture all relevant effects and correctly calculate device behavior. The sophistication of the models, which increases from the bottom to the top, is also characterized by an increase of the numerical complexity. The analytical models utilized during the infancy of microelectronics for circuit design were replaced by more rigorous drift-diffusion and hydrodynamic formulations, based on the moments of the Boltzmann equation. As these can be solved only numerically, corresponding deterministic methods have been developed. When microelectronic devices enter the sub-micrometer scale, the moment equations fail, while the Boltzmann equation remains relevant. The Boltzmann equation provides a detailed classical picture of carrier evolution, where physical probability functions are associated with the various processes describing
Table 1. Carrier Transport Models
                              MODEL                                 FEATURES
QUANTUM: nanometer and        Green's Functions Method              Accounts for both spatial and temporal correlations
femtosecond scales            Wigner equation; von-Neumann          Quantum-kinetic models of spatial correlations
                              equation for the density matrix
                              Quantum-hydrodynamic equations        Quantum corrections to classical hydrodynamics
CLASSICAL                     Boltzmann transport equation          Comprehensive down to sub-µm and/or ps scales
carrier dynamics. The probabilistic nature along with the fact that the equation deals with a seven-dimensional phase space calls for a stochastic approach. The development of Monte Carlo (MC) methods for solving the equation can be considered as a major step in this field. The nanometer and femtosecond scales of operation of modern devices give rise to a number of phenomena which are beyond purely classical description. These phenomena are classified in the International Technology Road-map for Semiconductors (ITRS, www.itrs.net) according to their importance to the performance of next generation devices. As is recognized in ITRS Table 122, ‘Methods, models and algorithms that contribute to prediction of CMOS limits’ are of utmost interest next to ‘... computationally efficient quantum based simulators’. Quantum models capable of describing not only purely coherent phenomena such as quantization and tunneling but also phase breaking processes such as interactions with phonons are especially relevant. The rising computational requirements resulting from the increasing complexity are a major concern for the development and deployment of these approaches. The harmony between theoretical and numerical aspects of the classical models is disturbed in the upper part of Table 1. At the very top is the rigorous non-equilibrium Green’s function approach (NEGF). It allows the simultaneous consideration of coherent processes of correlations in space and time and of phase breaking processes due to the lattice. However, because of numerical issues the applicability is restricted to stationary structures, basically in the ballistic limit [6]. The computational burden can be reduced by working in a mode space, obtained by separation of the problem into longitudinal and transverse directions. Furthermore, if the transverse potential profile along the transport direction remains uniform, the modes in these directions can be decoupled so that the transport becomes quasimultidimensional. However, this is not the case for devices with a squeezed channel or with abruptly flared out source/drain contacts which require to consider two- and three-dimensional effects [3]. Phonon scattering has been included within a quasi-two-dimensional transport description [5]. Local approximations are commonly utilized in this case, where the phonon self-energy terms are diagonal in the coordinate representation. This is well justified for deformation
potential interaction, but must be adapted for interactions with polar phonons, surface roughness, and ionized impurities. One level below in complexity are the Markovian-in-time (for pure states) approaches based on the density matrix or the unitarily equivalent Wigner function. The recent interest of the semiconductor device community in simulation methods which rely on the Wigner transport picture is due to the ability of the latter to account for quantum-coherent processes and for phase-breaking processes of de-coherence due to scattering of the carriers with phonons and other crystal lattice imperfections [4]. In this picture scattering can be accounted for in a straightforward way by using the Boltzmann collision models, while the coherent counterpart gives rise to a heavy numerical burden. This is in contrast to the Green's function approach, which is numerically efficient in the case of coherent transport.
2
Scattering-Induced Wigner Function Correction
We propose an approach [1] which combines the advantages of the two pictures. Green's function calculations of the coherent transport determined by the boundary conditions deliver the coherent Wigner function fwc:

ρ(x, x′) = −2i ∫ dE/(2π) G<(x, x′, E);    fwc(x, kx) = (1/2π) ∫ ds e^(−i kx s) ρ(x + s/2, x − s/2).    (1)

The lesser Green's function G< depends on the coordinates x, x′ and the energy E. The coherent Wigner function fwc(x, kx) is obtained from the density matrix ρ(x1, x2) with the help of the center-of-mass transformation x = (x1 + x2)/2, s = x1 − x2. Furthermore, fwc is a solution of the coherent part of the Wigner-Boltzmann equation:

(ħkx/m) ∂fw(x, k)/∂x = ∫ dkx′ Vw(x, kx − kx′) fw(x, kx′, kyz) + ∫ dk′ fw(x, k′) S(k′, k) − fw(x, k) λ(k).    (2)

Vw is the Wigner potential. Phase-breaking processes are accounted for by the Boltzmann scattering operator with S(k, k′), the scattering rate for a transition from k to k′; λ(k) = ∫ dk′ S(k, k′) is the total out-scattering rate. The coherent problem is obtained from (2) by setting the scattering rate S (and thus λ) to zero. In this case the kyz dependence remains arbitrary and can be specified via the boundary conditions. Formally the extrapolation must be such that fwc(x, kx) is recovered by the integral over kyz. Moreover, we want to cancel the Boltzmann scattering operator at the boundaries, where standard equilibrium conditions are assumed. Hence, a Maxwell-Boltzmann distribution fMB(kyz) is assumed in the yz directions, and the functions

fwc(x, k) = fwc(x, kx) (ħ²/2πmkT) e^(−ħ²(ky² + kz²)/2mkT);    fwΔ(x, k) = fw(x, k) − fwc(x, k)
can be introduced. The equation for the correction fwΔ is obtained by subtracting the coherent counterpart from (2). The correction is zero at the device boundaries, since the same boundary conditions are assumed for both cases.
3
Classical Limit
The obtained equation is approximated with the help of the classical limit:

∫ dkx′ Vw(x, kx − kx′) fwΔ(x, kx′, kyz) = −(eE(x)/ħ) ∂fwΔ(x, kx, kyz)/∂kx.    (3)

This approximation is valid for slowly varying potentials, so that the force F(x) = eE(x) can only be a linear function within the spatial support of fwΔ. The force gives rise to Newton's trajectories

X(t) = x − ∫_t^0 (ħKx(τ)/m) dτ,    Kx(t) = kx − ∫_t^0 (F(X(τ))/ħ) dτ,    (4)

initialized by x, kx, 0. In this definition, if t > 0 the trajectory is called forward, otherwise it is a backward one. A backward trajectory crosses the boundary of the device at a certain time tb, so that fwΔ(X(tb), k(tb)) = 0. The approximated equation can be transformed with the help of (4)

fwΔ(x, k) = ∫_{tb}^0 dt ∫ dk′ fwΔ(X(t), k′) S(k′, k(t)) e^(−∫_t^0 λ(k(τ))dτ)
          + ∫_{tb}^0 dt [ ∫ dk′ fwc(X(t), k′) S(k′, k(t)) − fwc(X(t), k(t)) λ(k(t)) ] e^(−∫_t^0 λ(k(τ))dτ)    (5)
into a Fredholm integral equation of the second kind with a free term given by the second row of (5) determined by fwc . The solution can be presented as a Neumann series with terms obtained by iterative application of the kernel to the free term. The series corresponds to a Boltzmann kind of evolution process, where the initial condition is given by the free term. The genuine mixed mode problem posed by the boundary conditions is transformed into a classical evolution of the quantum-coherent solution fwc . The latter, however, allows negative values and thus cannot be interpreted as an initial distribution of classical electrons: rather positive and negative particles initiate the evolution process. In this way the quantum information remains in (5) by the sign of the evolving particles. The boundary is still presented by tb , however, it has a different physical meaning: it only absorbs particles, since trajectories with evolution time t < tb < 0 do not contribute to the solution. In very small devices the carrier dwelling time can be so small that the probability for multiple scattering events tends to zero. In such cases the initial condition itself presents the correction fwΔ . In all other cases the evaluation of the initial condition is a necessary step for finding the solution. A particle approach derived for this purpose with the help of the numerical Monte Carlo theory is presented next.
4
Monte Carlo Approach
The general Monte Carlo task is to compute the averaged value I(Ω) of fwΔ in a given domain Ω of the phase space:

I = ∫ dx ∫ dkx θΩ(x, kx) fwΔ(x, kx) = ∫ dx ∫ dkx ∫ dky ∫ dkz θΩ(x, kx) fwΔ(x, k).

The domain indicator θΩ(x, kx) is unity if its arguments belong to Ω, and 0 otherwise. We first consider the contribution of the second component f0B(x, kx) of the initial condition, which is the last term in (5):

I0B(Ω) = ∫_{−∞}^0 dt ∫ dx ∫ dkx ∫ dky ∫ dkz (ħ²/2πmkT) e^(−ħ²(ky²+kz²)/2mkT) fwc(X(t), Kx(t)) θΩ(x, kx) λ(Kx(t), ·) e^(−∫_t^0 λ(Kx(τ),·)dτ) θD(X(t)).
The lower bound of the time integral has been extended to −∞, since the introduced device domain indicator θD takes care of its correct value at tb. The backward parametrization of the trajectories will be changed to a forward one, aiming to achieve a more heuristic picture of the evolution of the real carriers. Two important properties of the trajectories will be utilized. (I) Any phase space point reached by the trajectory at any given time can be used for initialization, since it obeys a system of first order differential equations. A full notation of a trajectory X(t), Kx(t) contains the initialization point: X(t) = X(t; x, kx, 0) = x^t, Kx(t) = Kx(t; x, kx, 0) = kx^t. According to (I), the initialization can be changed from x, kx, 0 to x^t, kx^t, t so that x = X(0; x^t, kx^t, t), kx = Kx(0; x^t, kx^t, t). The second property (II) replaces for stationary transport the absolute clock by a relative one: trajectories are invariant with respect to a shift of both initialization and parametrization time. Applying these properties consecutively gives:

X(τ) = X(τ; x, kx, 0) = X(t − τ; x^t, kx^t, t) = X(−τ; x^t, kx^t, 0) = X^t(−τ),
Kx(τ) = Kx(τ; x, kx, 0) = Kx(t − τ; x^t, kx^t, t) = Kx(−τ; x^t, kx^t, 0) = Kx^t(−τ).

With the introduced short notations X^t, Kx^t for the novel initialization x^t, kx^t, 0 (of the same trajectory!) it holds that x = X^t(−t), kx = Kx^t(−t) (recall that t < 0). Finally, according to the Liouville theorem dx dkx = dx^t dkx^t. Here we need to discuss the involved domains in the phase space: the initially specified domain Ω is mapped backwards in time onto some domain Ω(t), so that the integration over x^t, kx^t must be confined to Ω(t). However, we can augment the integration domain to the whole space, since by default trajectories defined by points outside Ω(t) will not enter Ω after time −t. For such trajectories θΩ = 0, so that the value of I0B remains unchanged by this step. With these relations the expression for I0B becomes

I0B = ∫_{tb}^0 dt ∫ dx^t ∫ dkx^t ∫ dky ∫ dkz (ħ²/2πmkT) e^(−ħ²(ky²+kz²)/2mkT) fwc(x^t, kx^t) θΩ(X^t(−t), Kx^t(−t)) λ(kx^t, ky, kz) e^(−∫_t^0 λ(Kx^t(−τ),ky,kz)dτ) θD(x^t).
We change the sign of τ in the integral of the exponent, then replace t by −t, reorder the terms according to their appearance in time, and augment the expression by completing the probability densities in the curly brackets:

I0B = ∫_0^∞ dt ∫ dx^t ∫ dkx^t ∫ dky ∫ dkz θD(x^t) { (ħ²/2πmkT) e^(−ħ²(ky²+kz²)/2mkT) } fwc(x^t, kx^t) λ(kx^t, ky, kz)/λ(Kx^t(t), ky, kz) { λ(Kx^t(t), ky, kz) e^(−∫_0^t λ(Kx^t(τ),ky,kz)dτ) } θΩ(X^t(t), Kx^t(t)).    (6)

These conditional probabilities give rise to a MC algorithm for the evaluation of I0B(Ω). The computational task is now specified as the calculation of the value f0B^{i,j} = f0B(xi, kxj) at a given point (xi, kxj). Then Ω = Ω^{i,j} can be determined by the phase space area with a small volume Δ = Δkx Δx around (xi, kxj), so that f0B^{i,j} = I0B(Ω^{i,j})/Δ. Another peculiarity is the point-wise evaluation of fwc, giving rise to the approximation ∫ dx^t dkx^t fwc(x^t, kx^t) ≈ fwc(i, j)Δ
in (6). The following algorithm for the evaluation of f0B in the points of the mesh (i, j) is suggested:

1. Associate to each node i, j of the mesh in the phase space an estimator ξ^{i,j}, initialized to zero.
2. The kx^t, x^t integrals in (6) correspond to a loop over the i, j nodes; initiate l = 1, 2, ..., N trajectories from each node: the initial points can be fixed on the node or randomized within the cell. For each trajectory:
3. Select the ky^l, kz^l values according to the term in the first curly brackets; algorithms for the generation of Gaussian random numbers are well established. This step accounts for the ky, kz integrals.
4. The point x^t, kx^t initializes the trajectory Kx^t(t), X^t(t) at time t = 0. Along the trajectory λ(Kx^t(t), ky^l, kz^l) becomes a function of the time t. A value of the time t = tl, the free flight time, is generated by the term in the last curly brackets by applying some of the well known algorithms utilized for device Monte Carlo simulation.
5. Add to the estimator ξ^{i,j} at the mesh node i, j nearest to Kx^t(tl), X^t(tl) the weight wl = fwc(x^t, kx^t) λ(kx^t, ky^l, kz^l)/λ(Kx^t(tl), ky^l, kz^l).
6. At the end of the i, j loop divide ξ^{i,j} by N.

This algorithm can be further enhanced by a selection of the contribution term in the i, j sum according to |fwc(i, j)|. The current version, however, is flexible in providing information about the significance of particular device regions. In a similar way the first term gives rise to the multiple integral

I0A = ∫_0^∞ dt ∫ dx^t ∫ dkx^t ∫ dky ∫ dkz ∫ dk′ θD(x^t) { (ħ²/2πmkT) e^(−ħ²(ky²+kz²)/2mkT) } fwc(x^t, kx′) { S(k′, kx^t, ky, kz)/λ(k′) } { λ(Kx^t(t), ·) e^(−∫_0^t λ(Kx^t(τ),·)dτ) } λ(k′)/λ(Kx^t(t), ·) θΩ(X^t(t), Kx^t(t)).
When compared to the previous case, this algorithm accounts for an additional k′ integration related to the extra scattering event in I0A:

1. Associate to each node i, j an estimator ξ^{i,j}.
2. Loop over the i, j nodes corresponding to the kx and x^t integrals, and initiate l = 1, 2, ..., N trajectories from each node.
3. Select the ky^l, kz^l values according to the term in the first curly brackets, thus accounting for the ky, kz integrals.
4. Select a wave vector according to the term in the second curly brackets. Input parameters are kx, ky^l, kz^l; the particular value of the after-scattering wave vector is denoted by k = (kxl^t, ky^l, kz^l).
5. The point x^t, kxl^t initializes the trajectory Kxl^t(t), Xl^t(t) at time t = 0. Generate a free-flight time value tl according to the term in the last curly brackets.
6. Add to the estimator ξ^{i,j} at the mesh node i, j nearest to Kxl^t(tl), Xl^t(tl) the weight wl = fwc(x^t, kx) λ(kx, ky^l, kz^l)/λ(Kxl^t(tl), ky^l, kz^l).
7. At the end of the i, j loop divide ξ^{i,j} by N.
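The sketch below illustrates the structure of the first estimator (steps 1-6) in strongly simplified form: the coherent Wigner function, the out-scattering rate, the force and the phase-space mesh are all placeholders of our own choosing, so the numbers produced are physically meaningless; only the loop structure, the Gaussian kyz sampling, the free-flight generation and the weight deposition follow the algorithm above.

#include <random>
#include <cmath>
#include <vector>
#include <algorithm>
#include <iostream>

int main() {
    const int NX = 20, NK = 20, N = 100;         // mesh nodes and trajectories per node
    const double dx = 1.0 / NX, dk = 1.0 / NK;   // cell sizes of a toy phase-space mesh
    const double force = 0.5;                    // constant force of the Newton trajectory
    const double lambda0 = 1.0;                  // constant out-scattering rate (placeholder)

    auto fwc = [](double x, double kx) {         // placeholder coherent Wigner function
        return std::exp(-x * x) * std::cos(3.0 * kx);
    };

    std::mt19937 gen(11);
    std::uniform_real_distribution<double> U(0.0, 1.0);
    std::normal_distribution<double> G(0.0, 1.0);    // Maxwell-Boltzmann ky, kz sampling

    std::vector<double> xi(NX * NK, 0.0);            // estimators at the mesh nodes (step 1)

    for (int i = 0; i < NX; ++i)                     // step 2: loop over the mesh nodes
        for (int j = 0; j < NK; ++j) {
            double x0 = (i + 0.5) * dx, k0 = -0.5 + (j + 0.5) * dk;
            for (int l = 0; l < N; ++l) {
                double ky = G(gen), kz = G(gen);              // step 3 (unused by the toy rate)
                double t = -std::log(1.0 - U(gen)) / lambda0; // step 4: free-flight time
                double kt = k0 + force * t;                   // follow the forward trajectory
                double xt = x0 + k0 * t + 0.5 * force * t * t;
                int ii = std::min(NX - 1, std::max(0, (int)(xt / dx)));
                int jj = std::min(NK - 1, std::max(0, (int)((kt + 0.5) / dk)));
                xi[ii * NK + jj] += fwc(x0, k0) * lambda0 / lambda0;  // step 5: weight
                (void)ky; (void)kz;
            }
        }
    for (double& v : xi) v /= N;                     // step 6
    std::cout << "estimator at the central node: " << xi[(NX / 2) * NK + NK / 2] << "\n";
}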
[Plot omitted: electron concentration (10^16 to 10^19 cm^-3) versus position (10 to 90 nm); curves: initial density, first correction integral, second correction integral, corrected density.]
Fig. 1. Initial carrier density, densities due to first and second integrals, and corrected density calculated with N = 10^4. As expected theoretically the two integrals overlap deep in the equilibrium contacts. The scattering gives rise to an increase of the electron density in the well as compared to the coherent case.
5
Simulation Results
The developed algorithm has been applied to a GaAs resonant tunneling diode. The 4 nm wide quantum well is surrounded by 1 nm barriers with a height of 0.3 eV, and a bias of 0.3 V is applied. For such a small device the initial condition itself represents the Wigner function correction. The results shown in Fig. 1 demonstrate that the physical effect of the scattering processes cannot be neglected, as it would be in a purely coherent approximation.
Acknowledgment This work has been partially supported by the Österreichische Forschungsgemeinschaft (ÖFG), Project MOEL273, and by the Austrian Science Fund, special research program IR-ON (F2509).
References 1. Baumgartner, O., Schwaha, P., Karner, M., Nedjalkov, M., Selberherr, S.: Coupling of non-equilibrium Green’s function and Wigner function approaches. In: Proc. Simulation of Semiconductor Processes and Devices, Hakone, Japan, pp. 931–934 (2008) ISBN: 978-1-4244-1753-7 2. Goodnick, S.M., Vasileska, D.: Computational electronics. In: Buschow, K.H.J., Cahn, R.W., Flemings, M.C., Kramer, E.J., Mahajan, S. (eds.) Encyclopedia of Materials: Science and Technology, vol. 2, pp. 1456–1471. Elsevier, New York (2001) 3. Luisier, M., Schenk, A., Fichtner, W.: Quantum transport in two- and three- dimensional nanoscale transistors: Coupled mode effects in the non-equilibrium Green’s function formalism. Journal of Applied Physics 100, 043713–1–043713–12 (2006) 4. Querlioz, D., Saint-Martin, J., Do, V., Bournel, A., Dolfus, P.: Fully quantum self-consistent study of ultimate DG-MOSFETs including realistic scattering using Wigner Monte Carlo approach. In: Intl. Electron Devices Meeting, pp. 1–4 (2006) 5. Svizhenko, A., Antram, M.P.: Role of scattering in nanotransistors. IEEE Trans. on Electron Devices 50, 1459–1466 (2003) 6. Venugopal, R., Ren, Z., Datta, S., Lundstrom, M.S.: Simulating quantum transport in nanoscale transistors: real versus mode space approach. Journal of Applied Physics 92, 3730–3739 (2002)
Analysis of the Monte Carlo Image Creation by Uniform Separation A.A. Penzov1, I.T. Dimov1,2 , L. Szirmay-Kalos3, and V.N. Koylazov4 1
4
Institute for Parallel Processing - Bulgarian Academy of Sciences Acad. G. Bonchev, bl. 25A, 1113 Sofia, Bulgaria 2 ACET Centre, University of Reading, Whiteknights, PO Box 217, Reading, RG6 6AH, UK 3 Budapest University of Technology and Economics, Magyar Tudósok krt. 2, H-1117 Budapest, Hungary Chaos Software, Alexander Malinov Blvd., bl. 33 entr. B., 1729 Sofia, Bulgaria
Abstract. This paper considers the uniform separation technique of the integration domain for Monte Carlo image creation. The Uniform Separation technique is based on symmetrical separation into small equal-size sub-domains to solve the rendering equation, which describes the photon propagation in a virtual scene. It tries to reduce the variance when solving the rendering equation pixel by pixel in the image pixel matrix. Our goal is to analyze and study the Monte Carlo estimators by Uniform Separation for the numerical treatment of the rendering equation, as well as their influence on the total process of image creation. We develop our considerations on the basis of the mathematical theory of Monte Carlo estimation and the experience of image creation. New results for the symmetric separation of the integration domain are obtained and presented.
1
Introduction
The solution of the rendering equation is equivalent to the estimation of the global illumination (the light transport problem in a virtual scene) for photo-realistic image creation [6]. It is a Fredholm integral equation of the second kind, describing the light propagation in a closed domain (see Figure 1). The radiance L, leaving a point x on the surface of the scene in direction ω ∈ Ωx, where Ωx is the hemisphere at point x, is the sum of the self-radiating light source radiance Le and all reflected radiance:

L(x, ω) = Le(x, ω) + ∫_{Ωx} L(h(x, ω′), −ω′) fr(−ω′, x, ω) cos θ′ dω′,    (1)

where h(x, ω′) is the first point that is hit when shooting a ray from x into direction ω′. The radiance Le has a non-zero value if the considered point x is a point of a solid light source. Therefore, the reflected radiance in direction ω is an integral of the radiance incoming from all points which can be seen through the hemisphere Ωx at point x, attenuated by the surface BRDF (Bidirectional
Fig. 1. The geometry for the rendering equation
Reflectance Distribution Function) $f_r(-\omega', x, \omega)$ and the projection factor $\cos\theta'$. The angle $\theta'$ is the angle between the surface normal $n_x$ at $x$ and the direction $\omega'$. The law of energy conservation holds, i.e.
$$\int_{\Omega_x} f_r(-\omega', x, \omega)\, \cos\theta'\, d\omega' < 1,$$
because a real scene always reflects less light than it receives from the light sources, due to light absorption by the objects. If the point $x$ belongs to a transparent object, the transmitted light component must be added to Equation (1), and the integration domain becomes the full sphere $\Omega(x) = \Omega_x \cup \overline{\Omega}_x$ at $x$. According to the Neumann series [12], the global illumination can be modeled as a stationary linear iterative process
$$L(x, \omega) = L_e + \mathcal{T}L_e + \mathcal{T}^2 L_e + \ldots + \mathcal{T}^i L_e + \ldots = \sum_{i=0}^{\infty} \mathcal{T}^i L_e \approx \sum_{i=0}^{s} \mathcal{T}^i L_e + \varepsilon_s , \qquad (2)$$
where $\mathcal{T}$ denotes the integral operator of (1); it is a contraction, reflecting the law of energy conservation, and $\varepsilon_s$ is the truncation error. Since multi-dimensional integrals are involved, it is very difficult or even impossible to solve the rendering equation analytically. Monte Carlo (MC) methods have proved to be the only effective methods for solving the rendering equation. In order to improve the Monte Carlo method and speed up the computation, much effort is directed towards variance reduction techniques (see [3]). Separation of the integration domain is a widely used Monte Carlo variance reduction method. In this paper we consider variance reduction in image creation by a stratified Monte Carlo technique with uniform separation of the integration domain. To compare different Monte Carlo algorithms it is important to know how to estimate the convergence rate for the creation of the total image and what this rate is in the worst case. We analyze the convergence rate in different situations in order to obtain an estimate for the considered approach.
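To make the truncation in (2) concrete, the following minimal sketch (not part of the original paper) applies the truncated Neumann series to a small discretized contraction operator; the matrix T and emission vector Le are arbitrary stand-ins for the real transport operator.

import numpy as np

# Toy discretised transport operator T (contraction, ||T|| < 1) and emission Le.
rng = np.random.default_rng(0)
T = rng.random((50, 50))
T *= 0.9 / np.linalg.norm(T, 2)            # enforce ||T|| < 1 (energy conservation)
Le = rng.random(50)

L_exact = np.linalg.solve(np.eye(50) - T, Le)   # L = (I - T)^{-1} Le

L_s = np.zeros(50)
term = Le.copy()
for s in range(10):
    L_s += term                             # partial sum  sum_{i=0..s} T^i Le
    err = np.linalg.norm(L_exact - L_s)     # truncation error eps_s
    print(f"s = {s:2d}   ||eps_s|| = {err:.3e}")
    term = T @ term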
2 Uniform Separation of Integration Domain
The Uniform Separation technique of the integration domain for the rendering equation was introduced by us in [4] and developed in [8]. It utilizes the symmetry property
Fig. 2. Uniform Triangle Separation - UTS
for hemisphere or sphere separation as well as symmetric sample generation. Based on it, the unit hemisphere centered at the origin of a Cartesian coordinate system can be partitioned into symmetric sub-domains of equal size:
– the coordinate planes partition the hemisphere into 4 equal areas,
– the bisector planes of the dihedral angles (X, Y), (X, Z) and (Z, Y) partition each area into 6 equal orthogonal spherical triangles, as shown in Figure 2 for the area with positive coordinate values of X, Y, and Z.
In this way we can separate the whole hemisphere $\Omega_x$ into 24 symmetric parts $\Omega_i$, called Uniform Triangle Separation (UTS). Then, through a bijection $F(u, v): [0,1]^2 \to \Omega_{ABC}$, we generate samples in $\Omega_{ABC}$ and, applying the symmetric sampling scheme [4], we generate sampling points over the whole $\Omega_x$. Since the symmetry mappings are isometries, the generation of uniformly distributed random samples in one sub-domain leads to a uniform distribution of all samples over the hemisphere. Because of the form of $F(u, v): [0,1]^2 \to \Omega_{ABC}$, more sampling points are grouped around the vertices coincident with the coordinate axes, as shown in [9]. This could affect the uniformity of the sampling point distribution over $\Omega_x$ partitioned into 24 spherical triangles, which might be a drawback for numerical integration. To avoid this and to strengthen the uniformity of the sampling point distribution over $\Omega_x$, we can apply Uniform Quadrangle Separation (UQS), as shown in Figure 3. The hemisphere $\Omega_x$ is partitioned into 12 spherical quadrangles of equal size, each constructed as the union of two neighboring spherical triangles. The symmetry property still holds, and all quadrangles are symmetric to each other.
Fig. 3. Uniform Quadrangle Separation - UQS
Another possible partition of $\Omega_x$, obtained by combining UTS and UQS, is shown in Figure 4. The hemispherical integration domain is separated into 16 parts. The first 8 sub-domains are orthogonal spherical triangles of equal size; they are symmetric to each other and grouped with a common vertex around the normal vector to the surface. The hemispherical integration domain is completed by 8 more sub-domains, spherical quadrangles of equal size, also symmetric to each other. The symmetry property allows us to calculate the coordinates of the symmetric points in parallel. Once a sampling point $P_0(x_0, y_0, z_0)$ is generated in $\Omega_{ABCD}$ or $\Omega_{CED}$, the coordinates of all symmetric points in the other sub-domains are obtained by simply changing the signs and/or places of $x_0$ and $y_0$, while $z_0$ is not affected; for example, $P_1(y_0, x_0, z_0)$, $P_2(y_0, -x_0, z_0)$, ..., $P_7(x_0, -y_0, z_0)$. A minimal sketch of this symmetric point generation is given below. Useful bijections $F_1(u, v): [0,1]^2 \to \Omega_{CED}$ and $F_2(u, v): [0,1]^2 \to \Omega_{ABCD}$, mapping the uniformly distributed random variables $u, v \in [0,1]$ into sampling points on $\Omega_{CED}$ and $\Omega_{ABCD}$, are derived in [1,4,8].
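The sketch below (an illustration, not the authors' code) generates the symmetric points from one sample by swapping and negating the first two coordinates, exactly as described for P1, ..., P7 above.

import numpy as np

def symmetric_points(p0):
    """Return the 8 sampling points obtained from p0 = (x0, y0, z0) by changing
    the signs and/or places of x0 and y0 while keeping z0 fixed."""
    x0, y0, z0 = p0
    pts = []
    for a, b in ((x0, y0), (y0, x0)):        # swap places of x0 and y0
        for sa in (+1, -1):
            for sb in (+1, -1):              # change signs
                pts.append((sa * a, sb * b, z0))
    return pts

# Example: one random direction on the upper hemisphere mapped to 8 symmetric ones.
rng = np.random.default_rng(1)
v = rng.normal(size=3)
v[2] = abs(v[2])
v /= np.linalg.norm(v)
for p in symmetric_points(v):
    print(np.round(p, 3))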
3 Error Analysis
Let us consider the iterative Monte Carlo solution of (2) following the Uniform Separation technique. On each iteration we solve multidimensional integrals, which is equivalent to the problem of computing the integral
$$J(L) = \int_{\Omega} f(x)\, p(x)\, dx,$$
Fig. 4. Combined Uniform Separation - CUS
where $\Omega \equiv E_s = \{\,0 \le x_i < 1,\; i = 1, 2, \ldots, s\,\}$ is the $s$-dimensional unit cube and $p$ is a probability density function, i.e. $p(x) \ge 0$ and $\int_{\Omega} p(x)\, dx = 1$. Let us apply a uniform separation of $\Omega$ into $m$ equal cubes $\Omega_j$ ($j = 1, 2, \ldots, m$), so that
$$\Omega = \bigcup_{j=1}^{m} \Omega_j, \qquad \Omega_i \cap \Omega_j = \emptyset, \quad i \ne j.$$
Then $J = \sum_{j=1}^{m} J_j$, where $J_j = \int_{\Omega_j} f(x)\, p(x)\, dx$ is the mean of the random variable $p_j f(x_j)$, $x_j$ is a random point in $\Omega_j$ with probability density function $p(x)/p_j$, and obviously $p_j = \int_{\Omega_j} p(x)\, dx$. It is possible to estimate $J_j$ by the average of $N_j$ samples, $\bar{\xi}_{N_j} = \frac{p_j}{N_j} \sum_{i=1}^{N_j} f(x_j^{(i)})$, and $J$ by $\xi_N^{*} = \sum_{j=1}^{m} \bar{\xi}_{N_j}$, where $\sum_{j=1}^{m} N_j = N$ (see [10]).
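A minimal numerical sketch of this estimator (not from the paper) is given below for the unit-density case $p \equiv 1$, so that $p_j = 1/m^s$ and one sample is drawn in each sub-cube; the integrand f is a hypothetical smooth test function.

import numpy as np

rng = np.random.default_rng(2)
s, m = 2, 32
N = m ** s
f = lambda x: np.cos(2 * np.pi * x[..., 0]) ** 2 + x[..., 1] ** 2   # smooth integrand

# crude Monte Carlo
crude = f(rng.random((N, s))).mean()

# uniform separation: one uniform sample inside each of the m^s equal cubes
grid = np.stack(np.meshgrid(*([np.arange(m)] * s), indexing="ij"), axis=-1).reshape(-1, s)
x = (grid + rng.random((N, s))) / m           # random point in cube Omega_j
stratified = f(x).mean()                      # sum_j p_j f(x_j), p_j = 1/N

exact = 0.5 + 1.0 / 3.0
print(f"crude MC error      : {abs(crude - exact):.2e}")
print(f"uniform separation  : {abs(stratified - exact):.2e}")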
Theorem 1 (Dupach (1956); Haber (1966)). Let $N_j = 1$ for $j = 1, \ldots, m$ (so $m = N$). Let the function $f(x)$ have continuous and bounded derivatives, $\left|\frac{\partial f}{\partial x_j}\right| \le \alpha$ for every $j = 1, 2, \ldots, s$, and let there exist constants $c_1$ and $c_2$ such that the conditions
$$p_j \le \frac{c_1}{N} \qquad \text{and} \qquad d_j \le \frac{c_2}{N^{1/s}}$$
hold, where $p_j = \int_{\Omega_j} p(x)\, dx$ is the probability of $\Omega_j$ and $d_j = \sup_{x_1, x_2 \in \Omega_j} |x_1 - x_2|$ is the diameter of the domain $\Omega_j$ for each $j = 1, 2, \ldots, m$.
Then for the variance of the MC estimator $\xi_N^{*(s)}$ the following relation holds:
$$\mathrm{Var}\,\xi_N^{*(s)} \le (s\, c_1 c_2\, \alpha)^2\, N^{-1-\frac{2}{s}}.$$
Based on the considerations in [5], for the multidimensional integrals in (2) we have
$$\Omega_x^{s} = \Omega_x^{1} \times \Omega_x^{2} \times \ldots \times \Omega_x^{s} = \bigcup_{j=1}^{m^s} \Omega_j,$$
where $m^s$ is the number of sub-domains $\Omega_j$ and $s$ is the dimension of the domain $\Omega_x^{s}$. One can see that with Uniform Separation $p_j = \frac{1}{m^s}$ and $d_j = \frac{\sqrt{s}}{m}$ for $j = 1, \ldots, m^s$, and the constants can be estimated as $1 \le c_1$ and $\frac{\sqrt{s}}{m} N^{\frac{1}{s}} \le c_2$. According to Theorem 1 above, the variance can then be estimated as $\mathrm{Var}\,\xi_N^{*(s)} \le s^3 m^{-2} \alpha^2 N^{-1}$. Therefore the convergence rate of the MC solution with Uniform Separation for a continuous integrand is $O(N^{-1})$.
The main difficulties of Monte Carlo methods for numerical integration of the rendering equation are related to the smoothness of the integrand. Due to the scene geometry, in the general case the integrand is discontinuous. Because the integration domain is partitioned into sub-domains that are uniformly small both in probability and in size, the solution of the rendering equation by Monte Carlo with Uniform Separation is very similar to Quasi Monte Carlo approaches. Therefore, the same error analysis can be applied to Monte Carlo with Uniform Separation as to the Quasi Monte Carlo approaches. Since the integrands in computer graphics are discontinuous, only very pessimistic upper error bounds can be expected for the integral approximation with Quasi Monte Carlo. Taking this into account, an analytic generalization of the Quasi Monte Carlo convergence rate (see [11,12]) for realistic image creation was derived by Szirmay-Kalos:
– the Quasi Monte Carlo integration of the rendering equation is analyzed by decomposing the discontinuous function into smooth and discontinuous parts;
– the error bound for Quasi Monte Carlo integration of a discontinuous function is still better than the Monte Carlo bound for small dimensions.
The observed convergence rate for discontinuous functions with the Quasi Monte Carlo approach is $O\!\left(N^{-\frac{s+1}{2s}}\right)$, where $s$ is the dimension. We therefore expect to obtain the same convergence rate for Monte Carlo with Uniform Separation. Another way to analyze the convergence rate was suggested by Mitchell in [7]. Mitchell investigates the variance of image pixels with stratified Monte Carlo image creation in different situations. The convergence rate of a pixel depends on whether the integrand is smooth or only piece-wise continuous. Moreover, stratified sampling can increase the convergence rate noticeably in low-dimensional domains, but has little effect in high-dimensional ones. Let us continue in the same direction and analyze the rendering of all pixels in the image pixel matrix by the Monte Carlo approach with Uniform
Separation. The convergence rate can be derived by applying the theorem of Emelyanov and Ilin (see [2]) to the error $r(A_{Eq})$ associated with a numerical algorithm $A_{Eq}$.
Theorem 2 (Emelyanov–Ilin). For the problem of solving $s$-dimensional Fredholm integral equations of the second kind with $p$-smooth data, the error satisfies
$$r(A_{Eq}) \le c\, N^{-\frac{p}{2s}}$$
for deterministic algorithms $A_{Eq}$, as well as
$$r(A_{Eq}^{R}) \le c\, N^{-\frac{p}{2s} - \frac{1}{2}}$$
for randomized algorithms $A_{Eq}^{R}$.
One can see that a realistic image is calculated by applying a certain randomized algorithm $A_{Eq}^{R}$ consecutively, pixel by pixel, for all pixels in the image pixel matrix. In the case when the data are $s$-smooth ($p \equiv s$) for each pixel, we obtain the convergence rate $O(N^{-1})$. In the general case, however, the data behind each pixel are different and $p$-smooth with $1 \le p \le s$. Following Theorem 2, we can write for the average pixel error
$$r_{avg}(A_{Eq}^{R}) = \frac{1}{P_N} \sum_{i=1}^{P_N} r(A_{Eq,i}^{R}) \le \frac{1}{P_N} \sum_{p=1}^{s} c_p\, P_p\, N^{-\frac{p}{2s} - \frac{1}{2}},$$
where $P_N$ is the total number of pixels in the image pixel matrix and $P_p$ is the number of processed pixels with $p$-smooth data ($1 \le p \le s$). If $c = \max(c_p P_p)$, $1 \le p \le s$, then
$$r_{avg}(A_{Eq}^{R}) \le \frac{c}{P_N} \sum_{p=1}^{s} N^{-\frac{p}{2s} - \frac{1}{2}} = \frac{c}{P_N}\, N^{-\frac{1}{2}}\, N^{-\frac{1}{2s}}\, \frac{1 - N^{-\frac{1}{2}}}{1 - N^{-\frac{1}{2s}}},$$
and obviously $\frac{1 - N^{-1/2}}{1 - N^{-1/(2s)}} \le 1$. Therefore, the average error for Monte Carlo image creation with Uniform Separation is
$$r_{avg}(A_{Eq}^{R}) \le \frac{c}{P_N}\, N^{-\frac{s+1}{2s}}.$$
4 Conclusions
The data to be rendered behind each pixel are different. Almost always there exist large areas of pixels in the image pixel matrix where the integrand is continuous. The analysis shows that those pixels are rendered with an $O(N^{-1})$ rate of convergence. When the integrand is piece-wise continuous, the pixels are solved
with a Quasi Monte Carlo convergence rate of $O\!\left(N^{-\frac{s+1}{2s}}\right)$. Therefore, the convergence rate of the considered creation of images by Monte Carlo with uniform separation lies between $O\!\left(N^{-\frac{s+1}{2s}}\right)$ and $O(N^{-1})$. One can see that in the worst case the estimated rate of convergence is the same as for Quasi Monte Carlo methods, which suggests the use of low-discrepancy sequences. The obtained convergence rate characterizes Monte Carlo image creation by Uniform Separation as a super-convergent Monte Carlo method.
Acknowledgments. This work was supported by the International Scientific Cooperation between the Bulgarian Academy of Sciences and the Hungarian Academy of Sciences under Joint Research Project Grant No. 7, and by OTKA ref. No. T042735.
References
1. Arvo, J.: Stratified sampling of spherical triangles. In: Proceedings of SIGGRAPH 1995, ACM SIGGRAPH, pp. 437–438 (1995)
2. Dimov, I.T.: Optimal Monte Carlo Algorithms. In: ISMC 2006, Proceedings of IEEE — John Vincent Atanasoff, pp. 125–131 (2006)
3. Dimov, I.T.: Monte-Carlo Methods for Applied Scientists. World Scientific Publishing, Singapore (2008)
4. Dimov, I.T., Penzov, A.A., Stoilova, S.S.: Parallel Monte Carlo Sampling Scheme for Sphere and Hemisphere. In: Boyanov, T., Dimova, S., Georgiev, K., Nikolov, G. (eds.) NMA 2006. LNCS, vol. 4310, pp. 148–155. Springer, Heidelberg (2007)
5. Dimov, I.T., Penzov, A.A., Stoilova, S.S.: Parallel Monte Carlo Approach for Integration of the Rendering Equation. In: Boyanov, T., Dimova, S., Georgiev, K., Nikolov, G. (eds.) NMA 2006. LNCS, vol. 4310, pp. 140–147. Springer, Heidelberg (2007)
6. Kajiya, J.T.: The Rendering Equation. In: Proceedings of SIGGRAPH 1986; ACM SIGGRAPH Computer Graphics 20(4), 143–150 (1986)
7. Mitchell, D.P.: Consequences of stratified sampling in computer graphics. In: Proceedings of SIGGRAPH 1996, ACM SIGGRAPH, pp. 277–280 (1996)
8. Penzov, A.A., Dimov, I.T., Koylazov, V.N.: A New Solution of the Rendering Equation with Stratified Monte Carlo Approach. In: ICNAAM 2008, AIP Conference Proceedings, vol. 1048, pp. 432–435 (2008)
9. Penzov, A.A., Stoilova, S.S., Dimov, I.T., Mitev, N.M.: Uniform Separation for Parallel Monte Carlo Image Creation. In: Fourth Hungarian Conference on Computer Graphics and Geometry, SZTAKI, Budapest, Hungary, pp. 117–124 (2007)
10. Sobol, I.M.: Monte Carlo Numerical Methods. Nauka, Moscow (1975) (in Russian)
11. Szirmay-Kalos, L., Purgathofer, W.: Analysis of the Quasi-Monte Carlo Integration of the Rendering Equation. In: Proceedings of WSCG 1999, pp. 281–288. University of West Bohemia, Plzen (1999)
12. Szirmay-Kalos, L.: Monte-Carlo Methods in Global Illumination — Photo-realistic Rendering with Randomization. VDM Verlag Dr. Mueller e.K., Saarbrücken (2008)
The Role of the Boundary Conditions on the Current Degradation in FD-SOI Devices

K. Raleva1, D. Vasileska2, and S.M. Goodnick2

1 Faculty of Electrical Engineering and ITs, UKIM, Skopje, Macedonia
[email protected]
2 Department of Electrical Engineering and Center for Solid State Electronics Research, Arizona State University, Tempe, AZ 85287-5706, USA
[email protected], [email protected]
Abstract. In this paper we demonstrate the convergence properties of our electro-thermal Monte Carlo device simulator, which self-consistently solves the Boltzmann transport equation for the electrons and the energy balance equations for the acoustic and optical phonon baths. We also illustrate that the amount of current degradation in devices of different technology generations depends upon the channel length and on the boundary conditions imposed on the gate electrode and on the artificial boundaries. Finally, we address the importance of including non-stationary transport for both electrons and phonons. We show, via comparison with standard Silvaco simulations, that any simulator which does not take into account the non-stationary nature of the carrier transport will give wrong predictions of the current degradation and of the position of the hot spot, which arises because of the different time scales involved in the electron–optical phonon interaction and the optical-to-acoustic phonon decay.
1 Introduction
Over the last 30 years, silicon CMOS has emerged as the predominant technology of the microelectronic industry. The concept of device scaling has been consistently applied over many technology generations, resulting in consistent improvement in both device density and performance. But sometime within the next five years, traditional CMOS technology is expected to reach the limits of scaling. As channel lengths shrink below 22 nm, complex channel profiles are required to achieve the desired threshold voltages and to alleviate short-channel effects. To fabricate devices beyond current scaling limits, IC companies are pursuing fully-depleted (FD) Silicon On Insulator (SOI), dual gate (DG) structures, and FinFETs. One of the major problems in the operation of nano-scale SOI devices is self-heating. These effects arise because SOI devices are thermally isolated from the substrate by the buried oxide layer (BOX). Self-heating leads to a substantial elevation of the local device temperature, which consequently
modifies the device output characteristics. As transistor dimensions and the silicon film thickness are scaled to the order of tens of nanometers, i.e. much less than the phonon mean free path (λ = 300 nm in bulk silicon at room temperature), sub-continuum effects are expected to become important. Phonon boundary scattering [1] and phonon confinement effects [2] are known to decrease the effective thermal conductivity of thin silicon films. Nanoscale heat conduction measurements are very difficult to perform; therefore simulations play an increasingly important role in understanding heat generation and transfer at such small length and time scales. In modeling nanoscale heat transfer, continuum heat theory (Fourier law, Joule heating) is not valid and must be replaced by a more sophisticated formulation which takes into account the energy transport via phonons and the non-equilibrium between the acoustic and optical phonon baths. In previous work, we reported on a new device simulator based on the self-consistent solution of the Boltzmann transport equation for the electrons with a balance equation solution of the phonon Boltzmann equation [6,5]. In that work, we demonstrated that the non-locality of the electron transport and velocity overshoot effects significantly reduce self-heating and the associated current degradation in nanoscale devices, because ballistic carriers do not dissipate their energy in the active region of the device but rather in the drain contact [6]. It was also shown that in nanoscale devices the hot spot corresponding to the peak phonon temperature moves towards the drain end of the channel, where removal of heat is more effective and where it has less effect on the transport dynamics underneath the gate. However, very little detail was given about the numerics of the self-consistent approach, including the overall convergence of the scheme and the role of the boundary conditions.
Fig. 1. Flow chart of the electro-thermal simulator
Fig. 2. Cross-section of the simulated device with applied thermal boundary conditions
2 Convergence of the Scheme
As illustrated in Fig. 1, we self-consistently couple the Monte Carlo solution of the electron Boltzmann transport equation with the energy balance equations for both optical and acoustic phonons. This is itself a difficult problem, as we are coupling a particle picture for the electrons (which is inherently noisy) with a continuum model for the phonons. To achieve convergence of the coupled scheme, both temporal and spatial averaging of the variables extracted from the Monte Carlo (MC) solver (e.g. the electron density, drift velocity and temperature) must be performed. The number of simulated particles in the model contributes significantly to the smoothness of the variables being transferred to the energy balance solver. Within each 'outer iteration', we solve the Boltzmann transport equation for the electrons using the Ensemble Monte Carlo (EMC) method for a time period of 10 ps to ensure that steady-state conditions have been reached. The required variables are then passed to the thermal solver, which returns the updated optical and acoustic phonon temperatures. This constitutes one Gummel cycle or 'outer iteration' [5]. To test the overall convergence of the coupled EMC and thermal codes, we recorded the variation of the drain current with the number of thermal iterations for a given bias condition. For this study, we simulated the 25 nm fully-depleted SOI structure shown in Fig. 2, with the corresponding thermal boundary conditions indicated, discussed in more detail in Section 3. The device dimensions are: channel length 25 nm, source/drain length 25 nm, silicon layer thickness 10 nm, and BOX layer thickness 50 nm. The simulation results are given in Fig. 3, in which we plot the percentage current decrease compared to the result of an isothermal simulation. It is evident that only 3–5 iterations are necessary to obtain the steady-state solution for the current. To further investigate the accuracy of the results and the convergence of the outer current iterative scheme, we increase the device width in order to increase
Fig. 3. Current decrease relative to the isothermal steady state current as a function of the number of thermal iterations for different number of simulated electrons for 25 nm fully-depleted SOI MOSFET. For the thermal part of simulations, the bottom of the BOX and the gate electrode are assumed to be isothermal boundaries, set to 300K. Substrate region is not modeled.
the number of simulated electrons. Namely, the statistical error of the MC method decreases as $1/\sqrt{N_{sim}}$ [3], where $N_{sim}$ is the number of simulated carriers. The statistical uncertainty of the results is 1.14% for a device width of 1 μm and 0.65% for a device width of 3 μm, so smoother convergence of the results is obtained with the larger number of simulated carriers (see Fig. 3). A toy illustration of the coupled outer iteration and of the $1/\sqrt{N_{sim}}$ smoothing is sketched below.
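The following sketch is not the actual simulator: it is a scalar toy model in which a noisy "Monte Carlo" current that degrades with temperature is alternated with a "thermal solve" mapping dissipated power back to a lattice temperature. All functions and constants are hypothetical placeholders; only the Gummel-style outer loop and the 1/sqrt(N_sim) noise scaling are illustrated.

import numpy as np

rng = np.random.default_rng(3)

def mc_current(T, n_sim):
    ideal = 1.0 / (1.0 + 2e-3 * (T - 300.0))           # current degrades with T
    return ideal * (1.0 + rng.normal(0.0, 1.0) / np.sqrt(n_sim))  # MC noise

def thermal_solve(I):
    return 300.0 + 120.0 * I                            # power -> temperature rise

for n_sim in (1_000, 100_000):
    T, I_iso = 300.0, mc_current(300.0, n_sim)
    print(f"\nN_sim = {n_sim}")
    for it in range(1, 6):                              # outer thermal iterations
        I = mc_current(T, n_sim)
        T = thermal_solve(I)
        print(f" iter {it}: I/I_iso = {I / I_iso:6.3f}, T = {T:6.1f} K")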
3 The Role of the Thermal Boundary Conditions
To properly solve the phonon energy balance equations, the device has to be attached to a heat sink somewhere along its boundary. The substrate is typically treated as a thermal contact in commercial simulation packages such as the Silvaco ATLAS package (THERMAL3D module) [7], as shown in Fig. 2. In the case of SOI technology, the boundary condition should be applied to the bottom of the Si substrate. The thermal boundary conditions need to reflect the physics of the individual device as well as of the surrounding environment. We have shown [6,5] that, due to the relatively high thermal conductivity of Si, assuming 300 K at the back of the BOX layer is actually a good approximation. In obtaining the results presented in Section 2 we assumed that the neighboring device is either on, or that the device is near a circuit hot spot, so that Neumann boundary conditions at the artificial boundaries (Fig. 2) were applied
Fig. 4. Current degradation vs. technology generation for FD-SOI devices with channel lengths ranging from 25 nm to 180 nm. An isothermal boundary condition of 300 K is set on the bottom of the BOX. The parameter is the temperature of the gate electrode. Neumann boundary conditions are applied at the vertical sides.
to estimate the worst-case scenario for the current degradation. A summary of the current degradation for different technology generations is presented in Fig. 4. As discussed previously, the degradation of performance increases with increasing gate length, due to the increasing role of ballistic transport at short gate lengths. The degradation is also more severe for a higher fixed gate temperature. With regard to the role of the thermal boundary conditions on the gate and side electrodes, we now look at different combinations of Dirichlet and Neumann conditions (see Fig. 5). The simulation results show that when the gate electrode is used as an ideal heat sink (Tgate = 300 K), there is no difference in the current degradation due to self-heating between thermal Neumann and Dirichlet boundary conditions applied at the vertical sides. However, when heat transfer through the gate electrode is not allowed (Neumann boundary condition), the choice of boundary conditions at the vertical sides has an impact on the current degradation only for the smaller devices. The use of Dirichlet boundary conditions at both vertical sides and at the gate electrode is more adequate for digital devices, where the operation mode of the transistors changes in certain intervals from on to off, to on, etc. The worst-case scenario for self-heating is obtained when Neumann boundary conditions are imposed on all outer boundaries except the bottom of the BOX.
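A minimal 1-D steady-state heat conduction sketch, assuming a hypothetical geometry, conductivity and heat source, shows how the choice of Dirichlet (ideal heat sink) versus Neumann (adiabatic) boundary condition at one contact changes the peak temperature; it is only an illustration of the boundary-condition effect discussed above, not the device model.

import numpy as np

# -k T'' = q(x) on [0, L]; left contact is always an isothermal heat sink (300 K).
n, L, k = 200, 100e-9, 15.0                       # grid points, length [m], W/(m K)
dx = L / (n - 1)
x = np.linspace(0.0, L, n)
q = 1e17 * np.exp(-((x - 0.6 * L) / (0.1 * L)) ** 2)   # localized heat source [W/m^3]

def solve(right_bc):
    A = np.zeros((n, n)); b = -q * dx * dx / k
    A[0, 0] = 1.0; b[0] = 300.0                   # Dirichlet heat sink on the left
    for i in range(1, n - 1):
        A[i, i - 1], A[i, i], A[i, i + 1] = 1.0, -2.0, 1.0
    if right_bc == "dirichlet":
        A[-1, -1] = 1.0; b[-1] = 300.0            # ideal heat sink
    else:                                         # Neumann: dT/dx = 0
        A[-1, -2], A[-1, -1] = -1.0, 1.0; b[-1] = 0.0
    return np.linalg.solve(A, b)

for bc in ("dirichlet", "neumann"):
    T = solve(bc)
    print(f"{bc:9s}: peak T = {T.max():7.1f} K")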
Fig. 5. Current degradation vs. technology generation ranging from 25 nm to 180 nm channel length FD-SOI devices for different combinations of Dirichlet and Neumann conditions at the gate electrode and vertical sides.
Fig. 6. Lattice temperature in the active silicon layer for 25 nm channel length FD-SOI device when using MC simulations (top two panels) and Silvaco ATLAS simulations (bottom panel). Different boundary conditions are applied at vertical sides. Temperature on the gate electrode is set to 300K.
4 Non-stationary Electron and Phonon Transport
In commercial device simulators, one couples the heat-conduction equation through the Joule heating term with either the drift-diffusion or energy balance
Fig. 7. Lattice temperature in the active silicon layer for 180 nm channel length FD-SOI device when using MC simulations (top two panels) and Silvaco ATLAS simulations (bottom panel). Different boundary conditions are applied at vertical sides. Temperature on the gate electrode is set to 300K.
equations of the corresponding carriers that participate in transport, thus arriving at the so-called non-isothermal drift-diffusion or energy balance models [7]. The Joule heating model is a local model and cannot properly predict heating effects in nanoscale devices in which non-stationary transport dominates. To demonstrate this, we consider the 25 nm and 180 nm channel length devices. The temperature profiles and the corresponding current degradations for the 25 nm and the 180 nm channel length devices are shown in Figs. 6 and 7, respectively. The simulation results for the 180 nm channel length device show that both Silvaco and MC predict almost the same position of the hot spot. Even though the MC simulations predict a higher temperature at the hot spot, the predicted current degradation is slightly smaller, because in the Silvaco simulations the temperature along the whole channel section of the device is higher: once heat is generated it is immediately transported, in contrast to our thermal model, in which there is a bottleneck in the heat transfer between optical and acoustic phonons. We may, however, say that the simple Joule heating model is not such a bad approximation here, since velocity overshoot does not dominate carrier transport. The situation is completely different for the 25 nm channel length device, in which we have non-stationary transport throughout the whole length of the channel. Here, the MC simulations with Dirichlet boundary conditions predict that the hot spot is at the boundary between the drain end of the channel and the drain contact (again due to the bottleneck in heat transfer between optical and acoustic phonons), whereas in the Silvaco ATLAS simulations the hot spot extends throughout the whole active region of the device; here as well, due to
the larger size of the hot spot, more current degradation is obtained. The difference in the predicted current degradation between the MC simulations with Dirichlet boundary conditions and the Silvaco ATLAS simulations is here on the order of 60%. From the above analysis we may conclude that Silvaco ATLAS, which utilizes the Joule heating model, completely misses the physics of lattice heating in deca-nanometer devices, and the non-local model proposed in [6,5] must be used.
5 Conclusions
We have investigated the convergence properties of the electro-thermal device simulator that has been developed as part of our research work. We have also investigated the influence of the boundary conditions imposed on the gate on the current degradation for different technology nodes. Finally, we have shown that the Joule heating model is insufficient to capture the non-stationary nature of the thermal transport.
References
1. Ashegi, M., Leung, Y., Wong, S., Goodson, K.E.: Phonon-boundary scattering in thin silicon layers. Applied Physics Letters 71, 1798 (1997)
2. Balandin, A., Wang, K.L.: Significant decrease of the lattice thermal conductivity due to phonon confinement in a free-standing semiconductor quantum well. Physical Review B 58, 1544 (1998)
3. Ferry, D.K.: Semiconductor Transport. Taylor and Francis, New York (2000)
4. Raleva, K., Vasileska, D., Goodnick, S.M.: The Role of the Substrate and the Material Properties of the BOX in Fully-Depleted SOI, SOD and SOAlN Devices. IEEE Transactions on Electron Devices (submitted)
5. Raleva, K., Vasileska, D., Goodnick, S.M., Dzekov, T.: Modeling thermal effects in nano-devices. J. Computational Electronics 7, 226 (2008)
6. Raleva, K., Vasileska, D., Goodnick, S.M., Nedjalkov, M.: Modeling Thermal Effects in Nanodevices. IEEE Trans. Electron Devices 55(6), 1306 (2008)
7. http://www.silvaco.org
Gamma Photon Transport on the GPU for PET

L. Szirmay-Kalos1, B. Tóth1, M. Magdics1, D. Légrády1, and A. Penzov2

1 Budapest University of Technology, Hungary
2 Institute for Parallel Processing, Bulgarian Academy of Sciences, Bulgaria
Abstract. This paper proposes a Monte Carlo algorithm for gamma-photon transport that partially reuses random paths and is suitable for parallel GPU implementation. Since the simulation results are used in reconstruction algorithms, the method aims at similar relative, rather than absolute, errors of the detectors. The resulting algorithm is SIMD-like, which is a requirement for an efficient GPU implementation: all random paths are built with the same sequence of instructions and can therefore be simulated on parallel threads that practically have no conditional branches. The algorithm is a combined method: it separates the low-dimensional part that cannot be well mimicked by importance sampling and computes it with a deterministic quadrature, while the high-dimensional part, which is made low-variance by importance sampling, is handled by the Monte Carlo method. The deterministic quadrature is based on a geometric interpretation of the direct, i.e. non-scattered, effect of a photon on all detectors.
1 Introduction
The simulation of gamma photon transport in scattering media is important in engineering simulations, nuclear technology, radiotherapy, PET/SPECT reconstruction, etc. In Positron Emission Tomography (PET) a pair of gamma photons is born from each positron–electron collision [Gea07]. Due to the special character of PET, several simplifying assumptions can be made. Assuming that the electron and the positron are "not moving" before the collision, the energy $E$ of the photons can be obtained from the rest mass $m_e$ of the colliding particles and the speed of light $c$: $E = m_e c^2 = 0.511$ MeV. As these photons fly in the medium, they might collide with the electrons of the material. The probability that such a collision happens in unit distance is the cross section $\sigma$. During such a collision the photon may get scattered or absorbed according to the photoelectric effect, and a new photon pair may be generated, but in our energy range only scattering is relevant. When scattering happens, there is a unique correspondence between the relative scattered energy and the cosine of the scattering angle, as defined by the Compton formula:
$$\epsilon = \frac{1}{1 + \epsilon_0 (1 - \cos\theta)} \quad \Longrightarrow \quad \cos\theta = 1 - \frac{1 - \epsilon}{\epsilon_0\, \epsilon},$$
where $\epsilon = E_1/E_0$ expresses the ratio of the scattered energy $E_1$ and the incident energy $E_0$, and $\epsilon_0 = E_0/(m_e c^2)$ is the incident energy relative to the rest energy of
the electron. The differential of the scattering cross section at point $x$, i.e. the probability density that the photon is scattered from direction $\omega'$ into a differential solid angle $d\omega$ around direction $\omega$, is given by the Klein–Nishina formula:
$$\frac{d\sigma_s(x, \cos\theta, \epsilon_0)}{d\omega} = C(x)\left(\epsilon + \epsilon^3 - \epsilon^2 \sin^2\theta\right), \qquad \text{where } \cos\theta = \omega' \cdot \omega.$$
In this equation $C(x)$ is a material property at point $x$ that is proportional to the number of electrons in a unit volume of material. Note that the Klein–Nishina formula depends on the incident energy $\epsilon_0$ only indirectly, through $\epsilon$. The probability density of the scattering direction can be expressed as the product of the total scattering cross section, which is the probability of scattering in a unit distance,
$$\sigma_s(x, \epsilon_0) = \int_{\Omega} \frac{d\sigma_s(x, \cos\theta, \epsilon_0)}{d\omega}\, d\omega = 2\pi C(x) \int_{-1}^{1} \left(\epsilon + \epsilon^3 - \epsilon^2 \sin^2\theta\right) d(\cos\theta),$$
and of the material-invariant phase function describing the conditional probability density of the scattering direction given that scattering happened:
$$P(\cos\theta, \epsilon_0) = \frac{1}{\sigma_s(x, \epsilon_0)} \frac{d\sigma_s(x, \cos\theta, \epsilon_0)}{d\omega} = \frac{\epsilon + \epsilon^3 - \epsilon^2 \sin^2\theta}{\sigma_s(x, \epsilon_0)/C(x)}.$$
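The sketch below (an illustration, not the authors' GPU code) evaluates the Compton relation, the Klein–Nishina angular shape, and the material-independent quantities sigma_s/C and P(cos theta, eps0) by simple numerical integration.

import numpy as np

def eps(cos_t, eps0):
    # Compton formula: relative scattered energy for incident energy eps0
    return 1.0 / (1.0 + eps0 * (1.0 - cos_t))

def kn_shape(cos_t, eps0):
    # angular shape of the Klein-Nishina differential cross section (per C(x))
    e = eps(cos_t, eps0)
    return e + e ** 3 - e ** 2 * (1.0 - cos_t ** 2)

def sigma_over_C(eps0, n=2001):
    # sigma_s / C = 2*pi * int_{-1}^{1} (e + e^3 - e^2 sin^2 theta) d(cos theta)
    ct = np.linspace(-1.0, 1.0, n)
    return 2.0 * np.pi * np.trapz(kn_shape(ct, eps0), ct)

def phase(cos_t, eps0):
    # material-invariant phase function P(cos theta, eps0)
    return kn_shape(cos_t, eps0) / sigma_over_C(eps0)

eps0 = 1.0                                   # 511 keV photon: E0 = m_e c^2
ct = np.linspace(-1.0, 1.0, 5)
print("sigma_s/C at eps0 = 1:", sigma_over_C(eps0))
print("P(cos theta)         :", np.round(phase(ct, eps0), 4))
# sanity check: the phase function integrates to 1 over the sphere
ctf = np.linspace(-1.0, 1.0, 2001)
print("2*pi*int P d(cos)    :", 2.0 * np.pi * np.trapz(phase(ctf, eps0), ctf))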
As photons travel in the considered volume, they may get scattered several times before they leave the volume or are captured by a detector. We need to compute the expected energy landing on a detector of area $A_D$:
$$D = \int_{\Omega_H} \int_{A_D} \int_{0}^{1} F(x, \omega, \epsilon_0)\, \epsilon_0\, d\epsilon_0\, dx\, d\omega,$$
where $\Omega_H$ is the hemisphere above the detector plane. The probability density $F(x, \omega, \epsilon_0)$ that a photon of energy $\epsilon_0$ is at point $x$ traveling in direction $\omega$ satisfies a Fredholm-type integral equation:
$$\omega \cdot \nabla F(x, \omega, \epsilon_0) = -\sigma_s(x, \epsilon_0)\, F(x, \omega, \epsilon_0) + \int_{\Omega} F(x, \omega', \epsilon_0')\, \frac{d\sigma_s(x, \omega' \cdot \omega, \epsilon_0')}{d\omega}\, d\omega', \qquad (1)$$
where $\Omega$ is the directional sphere, and $\epsilon_0$, $\epsilon_0'$, and $\omega' \cdot \omega$ are related by the Compton formula. If $\epsilon_0 = 1$, then a source term $E(x)$ should also be added to the right-hand side, expressing the probability density of newly generated gamma photons at point $x$ with a uniformly distributed direction. The system contains detectors forming a grid of typical resolution $256 \times 256$. Thus, for every initial sample point $x_0$ we have to compute roughly $10^5$ functionals of the density $F$, each of which can be expressed as a Neumann series of integrals of ever increasing dimension [SK08]. Equation (1) is usually solved by Monte Carlo simulation that directly mimics the physical process. A typical algorithm is the following:
437
for each sample do terminated = FALSE; Generate initial sample with energy 0 = me c2 and direction ω; while not terminated do Traverse line segment of direction ω and of the sampled length; if detector d is hit then Add photon energy to detector d; terminated = TRUE; else if examined volume is left then terminated = TRUE; else Sample new direction ω and relative energy change ; 0 = 0 ; endif endwhile endfor
There are several problems with this approach. Due to the high number of voxels and detectors, such simulation may take days or even weeks of computation on conventional computers. As we compute not a single integral but a high number of functionals, the integrands contain factors representing scattering inside the volume and a factor representing the detector response, which is typically an indicator function. Volume scattering and free path can be well mimicked by importance sampling, but the detector sensitivity is not, which causes high variance sampled estimates. Mimicking the physical process, the absolute errors of the detectors will be similar. However, in the application of the computed result, we rather need similar relative errors since reconstruction algorithms take the ratio of measured and simulated detector responses [SV82]. In this paper we propose an approach that simultaneously solves all mentioned problems. The high computational power is provided by a Graphics Processing Unit (GPU) programmed under CUDA [NVI07]. GPUs have special parallel architecture, and are effective only for Single Instruction Multiple Data (SIMD) like algorithms. It means that we may run hundreds of parallel threads, but — to get high performance — all threads should execute the very same instruction at a time on different data. Thus, we eliminate conditionals from the Monte Carlo algorithm and ensure that all threads always execute the same sequence of instructions.
2
New Algorithm
In order to re-use a single random sample for all detectors and guarantee similar relative error everywhere, the path simulation is decomposed to a random path building part and to a deterministic splitting, called connection part. During the path building part, certain number of random scattering points, called virtual sources [SKSS08] are generated in the volume. Then, the deterministic splitting part connects all virtual soruces to all detectors and computes the impact of this random path to each of the detectors (Fig. 1). As the deterministic connection
438
L. Szirmay-Kalos et al.
Fig. 1. The simulated physical process and the decomposition of the computation to Monte Carlo simulation and to classical quadrature
does not consider additional scattering events, only the accumulated extinction needs to be calculated between the scattering points and the detectors. The path building part involves free-path sampling, termination handling, and scattering angle sampling. The deterministic part computes accumulated extinction. In the following subsections we discuss how these elementary tasks can be attacked by programs having minimal dynamic loops and branching instructions. 2.1
Free Path Sampling
The cumulative probability density of the length along a ray of origin x and S direction ω is CDF (S) = 1 − exp − 0 σs (x + ωs)ds , which can be sampled by transforming a uniformly distributed random variable r1 and finding S = nΔs where a running sum exceeds the threshold of the transformed variable: n−1
σs (x + ωiΔs)Δs ≤ − log r1 <
i=0
n
σs (x + ωiΔs)Δs.
i=0
In order to get the energy dependent total scattering cross section, we build a one-dimensional texture (Fig. 2) in the preprocessing phase, that stores normalized values 1 T1 (0 ) = 2π −1
+ 3 − 2 (1 − cos2 θ) d cos θ,
where =
1 1 + 0 (1 − cos θ)
for 0 = 1/128, 2/128, . . . , 1. During sampling, σs (y) is obtained as the product of the normalized values and the density of electrons C, that is σs (y) = T1 (0 )C(y).
Gamma Photon Transport on the GPU for PET
439
17 texture T_1(eps0) texture T_2(eps0,r) 16 1 15 0.5 14
0 -0.5
13
-1 12 11
0 0.2
10
0.4 9
0.6 0.8
8
1 0
0.2
0.4
0.6
0.8
1
7 0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Fig. 2. Contents of textures T1 (0 ) = σs /C (left) and T2 (0 , r2 ) = cos θ (right)
2.2
Termination Handling
The random path is terminated when the photon leaves the examined volume, which means that we have paths of random length, which are difficult to simulate on a SIMD machine (in fact, the longest path will determine the time of computation, which is unacceptable). Thus, in order not to waste time for the computation of very long paths, the maximum path lengths are set deterministically. We generate N1 paths of length 1, N2 paths of length 2, etc., finally NL paths of length L. Simultaneously, we assign weight w = 1/(N1 + . . . + NL ) to the first scattering points, weight w = 1/(N2 + . . .+ NL ) to the second scattering points, etc. It can happen that a path of lengths N leaves the volume before the N th scattering point. Such cases are handled by assigning energy 0 = 0 to such virtual sources falling outside the volume. Path lengths are set to ensure that the number of samples generating first, second, etc. virtual sources will be proportional to a geometric series 1, λ, λ2 , etc. where λ approximates the probability of leaving the volume. 2.3
Scattering Direction
The cumulative probability distribution of the cosine of the scattering angle is 2π CDF (cos θ, 0 ) = T1 (0 )
cos θ
+ 3 − 2 (1 − cos2 Θ) d cos Θ.
−1
The cumulative distribution is just a one-variate integral for a given incident energy 0 . Let us compute these integrals for cos θ ∈ [−1, 1] in a pre-processing phase for regularly sampled values. We set up a 128 × 128 resolution twodimensional array, called texture T2 (u, v) (Fig. 2), addressed by texture coordinates u, v in [0,1] and in the texel addressed by u, v we store cos θ obtained as the solution of v = CDF (cos θ, u). Note that this texture is independent of the material properties and should be computed only once during pre-processing.
440
L. Szirmay-Kalos et al.
During particle tracing the direction sampling is executed in the following way. Random or quasi-random sample r2 ∈ [0, 1) is obtained and we look up texture T2 (u, v) with it and with the incident energy 0 resulting in scattering angle cos θ = T2 (0 , r2 ), and consequently in the relative scattered energy. Note that the texture lookup automatically involves bi-linear interpolation of the precomputed data at no additional cost. The other spherical coordinate φ is sampled from uniform distribution, i.e. φ = 2πr3 where r3 is a uniformly distributed random value in the unit interval. Let us establish a Cartesian coordinate system i, j, k where k = ω is the incident direction, i = k × v/|k × v|, j = i × k. Here v is an arbitrary vector that is not parallel with ω . Using these unit vectors, the scattering direction is ω = sin θ cos φi + sin θ sin φj + cos θk. 2.4
Deterministic Detector Response Calculation
Having the random scattering points in the volume, each of them is treated as an individual virtual source and their contributions to all detectors are obtained. Solving equation 1 for a single virtual source of position x, incident photon direction ω , incident energy 0 , and weight w when additional scattering is ignored, we get the following detector response: D ≈ w exp − σs (i)Δs 0 P (ω · ω, 0 )Δω, where Δω is the solid angle in which the detector is visible from the scattering point, which can be cheaply approximated or can even be analytically computed [Eri90]. Attenuation exp (− σs (i)Δs) is obtained by ray-marching between the detector and the virtual source. Relative energy change is computed from 0 and cos θ = ω · ω using the Compton formula.
3
Results
The proposed method has been implemented in CUDA and run on nVidia GeForce 8800 GFX graphics hardware. We took a 1283 voxel array of a human head to describe the electron density C(x). The detector module has 256 × 256 detectors. The primary source is placed at the center of the head. The distribution of the virtual sources is shown by Fig. 3. The detector images of the scattered contribution (the direct contribution is computed separately and is not included in the error analysis) and the respective error and time graphs are in Fig. 4 and Fig. 5, respectively. Note that we could obtain the result of the scattered contribution within 10% relative L1 error using as few as 1000 photon paths. The explanation is that due to deterministic connection, these photon paths correspond to about 2000 virtual sources, which translate to 2000 samples in each detector (the real effective sample number is smaller since photons may fly out the examined volume). The simulation of 1000 photon paths of length 2, including the deterministic connection of each scattering point with all 2562 detectors, requires just 3.5 seconds on the GPU. This allows interactive placement of the source inside the volume.
Gamma Photon Transport on the GPU for PET
128 paths
512 paths
441
1024 paths
Fig. 3. Distribution of the scattering points in the volume
256 paths 1.44 secs
Fig. 4. Detector images after different numbers of photon paths: 256 paths (1.44 s), 512 paths (2.26 s), 1024 paths (3.62 s), and 4096 paths (14 s)
time
relative L1 error
3.5 0.2
3
2.5
0.15
2 0.1
1.5
1 0.05
0.5
0 1000
0 100
200
300
400
500
600
700
800
900
1000
Fig. 5. The relative L1 error and the computation time in seconds with respect to the number of photon paths. The length of the paths is 2.
4
Conclusions
This paper proposed a Monte Carlo gamma photon transport solver running on the massively parallel GPU hardware. The algorithm provides the same relative accuracy at each detectors since PET reconstruction algorithms will use ratios of measured and computed detector responses. In order to meet the requirements of SIMD processing concepts of the GPU, we eliminated the conditional loops
442
L. Szirmay-Kalos et al.
from the algorithm. As a result, the GPU is able to solve the transport problem interactively, while such simulations typically take hours on a single CPUs.
Acknowledgement This work has been supported by the Terratomo project of NKTH, Hungary, the National Scientific Research Fund (OTKA ref. No.: T042735), the Bulgarian Academy of Sciences, and by the Hungarian Academy of Sciences.
References [Eri90]
Eriksson, F.: On the measure of solid angles. Mathematics Magazine 63(3), 184–187 (1990) [Gea07] Geant: Physics reference manual, Geant4 9.1. Technical report, CERN (2007) [NVI07] CUDA (2007), http://developer.nvidia.com/cuda [SV82] Shepp, L., Vardi, Y.: Maximum likelihood reconstruction for emission tomography. IEEE Trans. Med. Imaging 1, 113–122 (1982) [SK08] Szirmay-Kalos, L.: Monte-Carlo Methods in Global Illumination — Photo-realistic Rendering with Randomization, VDM. Verlag Dr. M¨ uller, Saarbr¨ ucken (2008) [SKSS08] Szirmay-Kalos, L., Sz´ecsi, L., Sbert, M.: GPU-Based Techniques for Global Illumination Effects. Morgan and Claypool Publishers, San Rafael (2008)
Transport in Nanostructures: A Comparative Analysis Using Monte Carlo Simulation, the Spherical Harmonic Method, and Higher Moments Models M. Vasicek, V. Sverdlov, J. Cervenka, T. Grasser, H. Kosina, and S. Selberherr Institute for Microelectronics, TU Wien, Gußhausstr. 27-29, A 1040 Vienna, Austria
[email protected]
Abstract. With the modern transistor size shrinking below 45 nm the classical drift-diffusion model to describe transport in the conducting channel is loosing its validity. In short-channel devices carriers get accelerated by the driving field and do not thermalize before they reach the drain contact. Thus, the assumption underlying the classical transport model, that the driving electric field produces a weak perturbation of the local equilibrium distribution function, is violated. Several generalizations of the classical drift-diffusion model are possible. The most common approach in the TCAD community is to introduce higher moments of the distribution function. Another approach is to use a spherical harmonic expansion of the distribution function. We perform a comprehensive analysis of the validity of the highermoments transport models with the model based on spherical harmonic expansion by rigorously comparing their results with results of the Monte Carlo solution of the Boltzmann transport equation.
1
Introduction
The success of microelectronics technology has been partly enabled and supported by sophisticated Technology Computer-Aided Design (TCAD) tools which are used to assist in IC development and engineering at practically all stages from process definition to circuit optimization. At this moment, the TCAD related research and development cost reduction amounts to 35% and is expected to increase to 40% in the near future [1]. Most TCAD tools are based on semi-classical macroscopic transport models. From an engineering point of view, semi-classical models, such as the drift-diffusion transport model, have enjoyed an amazing success due to their relative simplicity, numerical robustness, and the ability to perform two- and threedimensional simulations on large unstructured meshes [9]. However, with device size dramatically reduced the TCAD tools based on a semi-classical transport description begin to show shortcomings. With downscaling the devices’ channel length the driving field and its gradient increase dramatically in the short I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 443–450, 2010. c Springer-Verlag Berlin Heidelberg 2010
444
M. Vasicek et al.
channel. As a result the carrier distribution along the channel can no longer be described by the shifted and heated Maxwellian distribution, and the solution of the Boltzmann transport equation is needed to determine sufficiently accurate the distribution function. The Monte Carlo method is a well-established numerical technique to solve the Boltzmann transport equation. Traditionally, the so called forward Monte Carlo method [7] is used to find the distribution function. For realistic structures, however, a direct numerical solution of this equation by discretization of the phase space is computationally too expensive. TCAD tools, however, do not usually solve the Boltzmann equation and are based on simplified transport models. Approximate solutions can be obtained by the method of moments. Defining the moments of the distribution function f (r, k, t), one consecutively obtains the drift-diffusion model [4], the hydrodynamic model [2], the energy-transport models [10], or the six moments model [3]. Transport models based on the moments of the Boltzmann equation are well accepted in TCAD. Another way of generalizing the classical drift-diffusion model is to use a spherical harmonic expansion of the distribution function. In this work we perform a comprehensive analysis of the validity of the highermoments transport models with the model based on spherical harmonic expansion by rigorously comparing their results with results of the Monte Carlo solution of the Boltzmann transport equation. The analytical conduction band model with the non-parabolicity parameter α =0.5 eV−1 is assumed in all the methods.
2 2.1
Simulation Methods Monte Carlo Solution of the Boltzmann Equation
The Monte Carlo methods have been applied to a variety of materials [6,11]. Within this approach, particles are moving on classical trajectories interrupted by scattering processes with phonons and impurities. Although solving the Boltzmann equation with a Monte Carlo technique is computationally very expensive, it is necessary to obtain accurate closure relations, expressing higher moments via the moments of lower order, to introduce and control scattering mechanisms at the microscopic level and, most importantly, to incorporate the peculiarities of the semiconductor band structure. Thus, different Monte Carlo algorithms are conveniently used for the calibration of TCAD tools [7]. 2.2
Higher Moments Models
From an engineering point of view, the advantages of the drift-diffusion model are its efficiency and numerical robustness. These properties make two- and three-dimensional numerical studies of fairly complex device structures feasible. However, hot-carrier effects are difficult to estimate correctly and non-local effects such as velocity overshoot are completely neglected. Higher-order transport models such as the hydrodynamic transport [2] and the energy transport [10]
Transport in Nanostructures: A Comparative Analysis
445
models are designed to overcome some of the shortcomings of the drift-diffusion model. The energy-transport model additionally takes into account the carrier energy balance. However, it typically tends to overestimate the non-local effects and thus the on-current of a device motivating the development of transport models including higher order moments. Some time ago a six moments transport model has been proposed [3]. Such a model, while computationally more efficient than the Monte Carlo method, provides additional information on the shape of the distribution function. 2.3
Spherical Harmonic Expansion
The steady state Boltzmann equation can conveniently be solved by expanding the angular dependence of the distribution function f (r, k) on k using a complete set of spherical harmonics Ylm (θ, φ): flm (r, k)Ylm (θ, φ), (1) f (r, k) = lm
where the θ and φ are the polar angles between the electric field E and k. In the low-field limit one can truncate the expansion (1) after the terms with l = 1, which in case of parabolic isotropic bands and randomizing elastic scattering results in a drift-diffusion transport model with low-field mobility. As is shown in [5], in case of elastic scattering this approximation gives good results for silicon where the valleys are not isotropic. For general scattering processes and realistic band structures as well as at higher driving fields more terms in the expansion (1) are needed [8].
3
Results
In order to investigate the validity of macroscopic transport models we have carried out extensive Monte Carlo simulations of transport through a silicon nin structure. The length of the two heavily doped contacts (ND = 1020 cm−3 ) is kept constant, while the length of the intrinsic channel (NA = 1016 cm−3 ) varies. The band structure of silicon is approximated by six valleys with parabolic dispersion relation. Electron scattering with acoustic and optical phonons as well as with ionized impurities is taken into account. The average velocity determines carrier transport in the structure. The average velocity as a moment of the distribution function can be computed along the device and is thus a more sensitive measure of validity for a particular transport model than any integral characteristic of a device like the total current. A typical result for the average velocity of carriers along an nin structure with an intrinsic region length of 40 nm computed with the Monte Carlo method is shown in Fig. 1. The solution of the Boltzmann equation by spherical harmonic expansion, with different number of terms in the series (1) is also displayed in Fig. 1. In such a short device the series (1) truncated at l = 1 gives only a poor approximation for the velocity profile. At the same time it is demonstrated that
446
M. Vasicek et al.
Monte Carlo
2
l=1 l=5 l=9 l = 15
7
ve [10 cm/s]
1.5
1
0.5
0 20
40
60
80 x [nm]
100
120
140
Fig. 1. Average velocity along the nin structure computed by solving the Boltzmann transport equation with the Monte Carlo method and using the spherical harmonic expansion method, with different numbers of terms included in the series (1). It is demonstrated that even in a short-channel structure the series expansion with terms only up to l = 9 included provides excellent results as compared to the more time consuming Monte Carlo data.
the spherical harmonic expansion (1) including terms up to l = 9 gives a perfect result. The inclusion of terms with l > 9 into (1) does not change the velocity profile confirming the rapid convergence of the series (1). Therefore, the spherical harmonic expansion method gives excellent results with only a few terms included into (1) even for short-channel devices. For this reason the method is much less time consuming than the Monte Carlo solution of the Boltzmann equation. However, Monte Carlo data are needed in order to validate the applicability of the spherical expansion method to describe transport in short-channel devices. Current-voltage characteristics computed with the Monte Carlo (identical to those computed with spherical harmonic expansion method) and using the macroscopic transport models based on the moments of the distribution function are shown in Fig. 2 and Fig. 3, for several channel lengths. It is demonstrated that for long devices (1000 nm) the drift-diffusion (DD), the energy transport (ET) model, and the six moments (SM) model give almost equivalent results which are in perfect agreement with the results of the spherical harmonic expansion method. For a device with Lch = 250 nm the drift-diffusion model underestimates the current. Since the carrier temperature is constant, the driftdiffusion model does not account for any non-local effects and cannot capture the non-local transport inside short-channel devices. This causes the accuracy of the drift-diffusion model to decrease for gate lengths shorter than 250 nm, where the restriction of constant carrier temperature must be relaxed. Due to the temperature gradient, heat flow and thermal diffusion appear. The drift-diffusion transport model must be augmented with the energy flow, or the third moment equation. The energy-transport model takes into account the energy flux equation in addition to the carrier energy balance equation. The energy-transport
Transport in Nanostructures: A Comparative Analysis
447
2.5 DD ET SM SHE
1.5
-3
2
Id [10 A/μm ]
2
100nm
1
0.5
250nm 1000nm
0 0
0.2
0.4 0.6 Vd/LCh [V]
0.8
1
Fig. 2. Current-voltage characteristics for devices with different channel length computed with the spherical harmonic expansion method and using macroscopic transport models based on moments of the distribution function. Here DD stands for the driftdiffusion model, ET for energy-transport model, SM for the six moments model, and SHE for spherical harmonic expansion.
DD ET SM SHE
0.04
2
Id [A/μm ]
0.03
0.02
40nm 0.01 80nm 0 0
0.2
0.4 0.6 Vd/LCh [V]
0.8
1
Fig. 3. The same as in Fig. 2 for shorter devices. The six moments model gives results closest to the results of Monte Carlo and the spherical Harmonic expansion methods.
model requires modeling of two mobilities for the current density and the energy flux for each carrier type, one relaxation time, and the non-parabilicity factor for non-parabolic bands. The simplified four-moments model is implemented in standard TCAD simulation tools and can be applied for a large set of semiconductor materials. The energy-transport model overestimates the drive current. Fig. 4 illustrates the average velocity profile in a device with a 40 nm long channel. The driftdiffusion model underestimates the average velocity while the energy-transport model overestimates it. In order to reduce this spurious velocity overshoot effect the next moments should be included for devices with Lch shorter than 100 nm, where the energy-transport model starts failing to predict the current accurately. Going one step further in the model hierarchy one obtains a transport model of sixth order. A balance equation for the average squared energy and the related flux equation are added. To close the equation system, the moment of sixth
Fig. 4. Average velocity along the device with 40 nm channel length computed with the macroscopic transport models and with the spherical harmonic expansion method (equivalent to the Monte Carlo results). While the drift-diffusion model underestimates the velocity and current, the energy-transport model overestimates the velocity in short-channel devices. The six moments model improves the situation.
Fig. 5. Relative error of the current computed with macroscopic transport models. While the drift-diffusion and the energy-transport models are gradually losing their validity with the channel length reduced, the six moments model maintains its accuracy down to the channel length of 40 nm.
order has to be approximated using lower order moments. From Monte Carlo simulations serving as an accurate reference, an empirical closure relation has been proposed taking into account also the second order temperature. Compared to the energy-transport models, the six moments model requires two additional relaxation times for the second order temperature and the kurtosis flux. Having too many adjustable parameters is a particular inconvenience of the six moments model. A solution to this problem is based on tabulating some parameters of the model using Monte Carlo simulations. The parameter dependences on temperature, doping, and driving field are determined from the condition that the six moments transport model reproduces exactly all the six moments obtained from the Monte Carlo simulator under homogeneous conditions.
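As a minimal illustration of how such Monte Carlo-calibrated parameters might be stored and retrieved at run time, consider the sketch below. The table layout, the grid values, and the piecewise-linear interpolation are illustrative assumptions only; the real tables described above also depend on lattice temperature and doping, and this is not the authors' code.

```cpp
#include <algorithm>
#include <vector>

// Illustrative lookup table for one Monte Carlo-calibrated parameter
// (e.g. a relaxation time) as a function of the driving field only.
struct ParameterTable {
    std::vector<double> field;   // driving field grid, ascending
    std::vector<double> value;   // tabulated parameter, e.g. a relaxation time

    // Piecewise-linear interpolation with constant extrapolation outside the grid.
    double operator()(double f) const {
        if (f <= field.front()) return value.front();
        if (f >= field.back())  return value.back();
        auto it = std::upper_bound(field.begin(), field.end(), f);
        std::size_t i = static_cast<std::size_t>(it - field.begin());
        double w = (f - field[i - 1]) / (field[i] - field[i - 1]);
        return (1.0 - w) * value[i - 1] + w * value[i];
    }
};

int main() {
    // Hypothetical calibration points extracted from homogeneous Monte Carlo runs.
    ParameterTable tau{{1e2, 1e3, 1e4, 1e5}, {0.35e-12, 0.33e-12, 0.28e-12, 0.20e-12}};
    double t = tau(5e3);   // value used when assembling the moment equations
    (void)t;
    return 0;
}
```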
The inclusion of higher moments improves the transport model significantly. The current-voltage characteristics are reproduced fairly well even in devices as short as 40 nm, as demonstrated in Fig. 3, because of the more accurate results for the velocity (Fig. 4). The relative error of the current computed by the macroscopic transport models, presented in Fig. 5 as a function of the channel length, clearly demonstrates the applicability of the six moments model to describe the output characteristics of devices with a channel as short as 40 nm.
4 Conclusion
Transport modeling for scientific computing applications has grown into a mature field of research. Models of different complexity, precision, and accuracy are offered. Monte Carlo techniques are used to obtain solutions of the Boltzmann transport equation with arbitrary scattering mechanisms and general band structure. These methods require significant computing resources and are, therefore, relatively rarely used for industrial TCAD applications, where timely, but perhaps less accurate, results are of primary importance. For example, a typical time to obtain a single point on the output characteristic of a 100 nm long device on a 3 GHz AMD Opteron processor with a Monte Carlo method is 24 hours, while it is only 30 minutes with the spherical harmonic expansion method, in which terms up to ninth order are included. It is demonstrated that the spherical harmonic expansion method provides accurate results for structures with channel lengths as short as 50 nm. At the same time, as the channel length decreases, the drift-diffusion and even the energy-transport models gradually lose their validity, and higher moments must be included into the model. A comprehensive comparison of the simulation results with the data obtained with the spherical harmonics expansion method and with Monte Carlo simulations demonstrates that the six moments transport model, which needs only 60 seconds of CPU time for a single I-V point, provides accurate results for devices with gates as short as 40 nm.
Acknowledgments We gratefully acknowledge financial support from the Austrian Science Fund FWF, project P19997-N14.
Thermal Modeling of GaN HEMTs D. Vasileska1 , A. Ashok1 , O. Hartin2 , and S.M. Goodnick1 1
Department of Electrical Engineering and Center for Solid State Electronics Research, Arizona State University, Tempe, AZ 85287-5706, USA 2 Freescale Inc., Tempe, AZ, USA
[email protected],
[email protected],
[email protected],
[email protected]
Abstract. Thermal effects were investigated to get a better understanding of the role of self-heating in the electrical characteristics of AlGaN/GaN HEMTs. This is implemented by simultaneously solving the acoustic and optical phonon energy balance equations, taking into account the coupling of the two subsystems. The electro-thermal device simulator was used to observe the temperature profiles across the device. Hot spots, or regions of higher temperature, were found along the channel in the gate-drain spacing. These preliminary results from the electro-thermal simulations suggest that the thermal effects do not have a drastic impact on the electrical characteristics: the current reduction falls between 5% and 10% over the simulated range of voltages. However, non-equilibrium phonon effects might play an important role in determining the thermal distribution in these HEMTs, resulting in reliability issues such as current collapse.
1 Theoretical Model
The solutions of the heat problem in semiconductor devices have focused on several issues: (1) independent analysis of the phonon heat bath problem via direct solutions of the phonon Boltzmann Transport Equations (BTE), and (2) inclusion of lattice heating in device simulators such as Silvaco by solving the Joule heating (Fourier law) equation, which does not take into consideration the microscopic nature of the heat flow and the non-equilibrium between the acoustic and optical phonon heat baths. Briefly, three different models are most commonly used: (1) Joule heating, (2) electron-lattice scattering, and (3) the phonon model. Although these three models yield identical results in equilibrium, under non-equilibrium conditions the results of the three models can vary significantly. Lai and Majumdar [2] developed a coupled electro-thermal model for studying thermal non-equilibrium in submicron silicon MOSFETs. Their results showed that the highest electron and lattice temperatures occur under the drain side of the gate electrode, which also corresponds to the region where non-equilibrium effects such as impact ionization and velocity overshoot are maximum. Majumdar et al. [3] have analyzed the variation of hot electrons and associated hot phonon effects in GaAs MESFETs. These hot carriers were observed to decrease the output drain current by as much as 15%. Recently,
there have been numerous studies describing thermal effects in devices that couple the Monte Carlo/Poisson approach to electro-thermal modeling in SOI and nitride devices [4]. In these studies, most of the approaches use a simplistic model for non-equilibrium phonons, which does not distinguish the acoustic and the optical phonons as separate subsystems. In addition to that, none of the approaches include all the thermal effects that come into play in GaN HEMTs operating at large drain biases: hot electrons, self-heating and non-equilibrium phonon effects. Raleva et al. [5] here at ASU, using a similar approach to Majumdar et al., solved the Boltzmann transport equation (BTE) for electrons using the ensemble Monte Carlo (EMC) method coupled with moment expansion equations for the phonons, both acoustic and optical. In their work, they solved simultaneously the acoustic and optical phonon energy balance equations and also took into account the coupling of the two subsystems in fully depleted SOI devices [5]. In our work, we have utilized the same approach to include thermal effects in GaN HEMTs. In GaN, recent experimental findings have shown that the optical phonon (LO) lifetime has a carrier density dependence [7]. The lifetime was found to decrease from 2.5 ps at low densities to 0.35 ps at higher densities in Raman spectroscopy measurements. We have formulated an analytical expression for the optical phonon (LO) lifetime that incorporates this density dependence. Figure 1 shows the experimental data and the analytical form of the phonon lifetime used in our simulations. To simulate the steady-state behavior of a device, the system is started in some initial condition, with the desired potential applied to the contacts, and then the simulation proceeds in a time stepping manner until steady state is reached. When the system is driven into a steady-state regime and the MC simulation time has elapsed, the steady-state current through a specified terminal is calculated. To continue with the thermal part of the simulation, the average electron density, drift velocity and electron temperature must be calculated on the device grid. These are used in the phonon balance equations to compute the acoustic and optical phonon temperature distribution all over the device. During the simulation, the gate contact and the bottom of the substrate are set to 300 K, while Neumann boundary conditions for the heat transfer are used on all other outer surfaces. When the simulation starts, all variables obtained from the first iteration of the EMC solver are calculated using a uniform distribution for the acoustic and optical phonon temperatures. This means that only one scattering table is used for all electrons, no matter where they are located in the device. When the phonon temperatures are computed from the phonon energy balance equations, they are "returned" at the beginning of the MC free-flight/scattering phase. Now, for each mesh point, we have a scattering table which corresponds to the acoustic and optical phonon temperatures at that point. In this case, the electron position defines which scattering table is valid and then, by generating a random number, the scattering mechanism is chosen for the given electron energy.
Thermal Modeling of GaN HEMTs
453
Fig. 1. Carrier density dependent phonon lifetime, τLO
The outer Gummel loop between the MC solver and the phonon energy balance solver ends when the steady-state conditions for the phonon temperatures and the device current are reached. To test the overall convergence of the coupled thermal and EMC codes, the variations of the drain current with the number of thermal iterations are registered for every bias condition. The results of our simulations show that only 5–10 thermal iterations are necessary to obtain the steady-state solution of the current. A flowchart depicting the sequence of steps involved in the coupled electro-thermal Monte Carlo device simulator is shown in Figure 2.
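A schematic of this outer loop is sketched below. Every type and function in the sketch is a hypothetical stub standing in for the real ensemble Monte Carlo solver and the phonon energy-balance solver; only the coupling and convergence logic described above is illustrated.

```cpp
#include <cmath>
#include <cstdio>
#include <vector>

// Per-cell fields exchanged between the two solvers.
struct MeshField { std::vector<double> acousticT, opticalT, density, velocity, energy; };

MeshField initialUniformTemperatures(std::size_t cells) {
    MeshField m;
    m.acousticT.assign(cells, 300.0);
    m.opticalT.assign(cells, 300.0);
    m.density.assign(cells, 0.0);
    m.velocity.assign(cells, 0.0);
    m.energy.assign(cells, 0.0);
    return m;
}

// Stub standing in for the EMC solver with per-cell scattering tables built from
// the phonon temperatures; a real implementation would also update the averaged
// density, velocity and energy fields.
double runEnsembleMonteCarlo(const MeshField& m) {
    return 1.0e-3 * (1.0 - 0.01 * m.opticalT[0] / 300.0);
}

// Stub standing in for the coupled acoustic/optical phonon balance solver.
void solvePhononBalance(MeshField& m) {
    for (std::size_t i = 0; i < m.opticalT.size(); ++i) { m.opticalT[i] += 5.0; m.acousticT[i] += 2.0; }
}

int main() {
    MeshField mesh = initialUniformTemperatures(1000);
    double previousCurrent = 0.0;
    for (int iteration = 1; iteration <= 10; ++iteration) {    // 5-10 iterations suffice per the text
        double current = runEnsembleMonteCarlo(mesh);          // EMC with temperature-dependent tables
        solvePhononBalance(mesh);                              // feed the moments back into the phonon equations
        std::printf("thermal iteration %d: Id = %g\n", iteration, current);
        if (iteration > 1 && std::fabs(current - previousCurrent) < 1e-6 * std::fabs(current)) break;
        previousCurrent = current;
    }
    return 0;
}
```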
2 Simulation Results
Simulations were run to investigate the effect of self-heating on the device characteristics of GaN HEMTs. The coupled-formulation polarization model was utilized in these simulations and the Monte Carlo routine was run for a steady-state time of 10 ps. After steady state, the time-averaged electron density, drift velocity and energy are used to compute the lattice and phonon temperature distributions by solving the phonon equations. This procedure is repeated for several thermal iterations until current convergence is obtained. Short-range electron-electron
Fig. 2. Flowchart of the electro-thermal Monte Carlo Device Simulator
interactions were not included in these simulations in order to reduce the huge strain placed on the computing resources (memory and computational time). The current convergence is achieved after around 5–10 iterations in our simulations for various gate and drain biases. Figure 3 shows the observed current convergence for zero gate voltage and 10 V applied on the drain. There is a reduction of about 11% in the drain current after 10 thermal iterations. The electron temperature profile shows a very different temperature distribution across the device. This is better seen in the temperature contour shown in Figure 4. The hot spot occurs in the gate-drain spacing, right where the gate terminates, but is restricted closer to the AlGaN/GaN interface. This means that most of the hot electrons are close to the AlGaN/GaN interface. The profile also shows that there might be some high energy electrons in the AlGaN barrier layer on the drain end. However, in the Silvaco simulations the same temperature distribution is not observed. The acoustic phonon (lattice) temperature and optical phonon temperature profiles are shown in Figures 5 and 6 respectively. Both profiles show that the peak temperatures are experienced along the channel in the gate-drain spacing. The phonon temperature profiles look fairly similar except that the optical phonons have a slightly larger spread around the middle of the channel. This “phonon hot spot” has been observed in a recent theoretical study by Yoder et al. done on GaN HEMTs using a dispersionless optical phonon model coupled
Fig. 3. Drain Current vs. # of thermal iterations for VGS = 0V and VDS = 10V
to the Monte Carlo transport routine. This study reveals that the hot spot is localized at the drain end of the channel at low drain-source bias, but expands towards the middle portion of the channel at higher bias, significantly degrading channel mobility [6]. It is important to mention that they have used polarization charges as fixed source terms in the Poisson equation and also used a constant optical phonon lifetime of 5 ps. Figure 7 shows the ID-VD characteristics of the device. In order to quantify the impact of these non-equilibrium thermal effects on the device performance, we started with isothermal simulations run at 300 K and 400 K. These I-V characteristics were then compared with the results obtained from our electro-thermal simulations. The ID-VD characteristics show that including the thermal effects reduces the drain current at larger drain voltages. They also show that this reduction in drain current increases as the drain voltage increases, thereby producing a negative roll-off, an effect typically attributed to lattice heating by many researchers. The drain current reduction ranges between 5% and 10% over the simulated range of drain voltages. The isothermal 400 K simulation brings about a reduction of around 8% at VD = 10 V in comparison to the thermal simulations. The influence of the thermal effects becomes smaller as the gate bias is made more negative, since fewer electrons are present in the channel as we approach the threshold voltage.
Fig. 4. Electron Temperature Contour for VGS = 0V and VDS = 10V
Fig. 5. Lattice Temperature Contour for VGS = 0V and VDS = 10V
Fig. 6. Phonon Temperature Contour for VGS = 0V and VDS = 10V
Fig. 7. ID-VD characteristics with and without thermal effects
3 Conclusions
In summary, the thermal effects arising due to contributions from the lattice, phonons and hot electrons were included in our simulations, and current degradation is observed in the device characteristics. The temperature profiles show that hot spots in the device are confined close to the channel, suggesting a strong dependence of the thermal distribution on the average electron density and energy. The high thermal conductivity of GaN and its alloys greatly helps in the faster heat dissipation seen in these devices. The presence of some electrons in the AlGaN barrier layer introduces the possibility of these electrons being lost to surface states and buffer trap sites. Several studies on GaN HEMT reliability have focused on the current collapse observed in short-term DC stress tests. Most researchers have proposed that the role of these trap sites in trapping/detrapping electrons can lead to a permanent reduction in the polarization charge density available at the AlGaN/GaN interface [1]. Our simulation results also tend to point in the same direction, but extensive electro-thermal simulation studies for different device structures are needed to clearly identify the mechanism responsible for the current reliability issues that have plagued GaN HEMT technology.
Acknowledgements This work was supported in part by Freescale, Tempe, AZ and the Army Research Lab (ARL).
References 1. Bouya, M., Malbert, N., Labat, N., Carisetti, D., Perdu, P., Clément, J.C., Lambert, B., Bonnet, M.: Analysis of traps effect on AlGaN/GaN HEMT by luminescence techniques. Microelectronics Reliability 48, 1366 (2008) 2. Lai, J., Majumdar, A.: Concurrent thermal and electrical modeling of submicrometer silicon devices. Journ. of Appl. Phys. 79, 7353 (1996) 3. Majumdar, A., Fushinobu, K., Hijikata, K.: Effect of gate voltage on hot electron and hot phonon interaction and transport in a submicrometer transistor. Journ. of Appl. Phys. 77, 6686 (1995) 4. Matulionis, A.: Hot phonons in GaN channels for HEMTs. Phys. Stat. Sol. A 203(10), 2313 (2006) 5. Raleva, K., Vasileska, D., Goodnick, S.M., Nedjalkov, M.: Modeling Thermal Effects in Nanodevices. IEEE Trans. on Elec. Devices 55(6), 1306 (2008) 6. Sridharan, S., Venkatachalam, A., Yoder, P.D.: Electrothermal analysis of AlGaN/GaN high electron mobility transistors. Journ. of Compt. Electronics 7, 236 (2008) 7. Tsen, K.T., Kiang, J.G., Ferry, D.K., Morkoç, H.: Subpicosecond time-resolved Raman studies of LO phonons in GaN: Dependence on photoexcited carrier density. Appl. Phys. Lett. 89, 112111 (2006)
Tuning the Generation of Sobol Sequence with Owen Scrambling Emanouil Atanassov, Aneta Karaivanova, and Sofiya Ivanovska Institute for Parallel Processing, Bulgarian Academy of Sciences, Acad. G. Bonchev, Bl. 25A, 1113 Sofia, Bulgaria {emanouil,anet,sofia}@parallel.bas.bg
Abstract. Sobol sequence is the most widely used low discrepancy sequence for numerical solving of multiple integrals and other quasi-Monte Carlo computations. Owen first proposed scrambling of this sequence through permutation in a manner that maintained its low discrepancy. Scrambling is necessary not only for error analysis but for parallel implementations. Good scrambling is especially important for GRID applications. However, scrambling is often difficult to implement and time consuming. In this paper we propose fast generation of Sobol sequence with Owen scrambling, tuned to specific hardware. Numerical and timing results, demonstrating the advantages of our approach are presented and discussed.
1 Introduction
Quasi-Monte Carlo methods are deterministic methods based on the use of highly uniform sequences (called quasirandom sequences). They can be interpreted as variants of Monte Carlo methods with a change of the underlying pseudorandom number generator [14]. Usually, quasi-Monte Carlo methods outperform Monte Carlo in terms of a faster convergence rate [3]. This fact is due to the famous Koksma-Hlawka inequality for numerical integration, later extended to other problems. This inequality gives a theoretical error bound, and in order to have practical error estimates from quasi-Monte Carlo samples one can use randomized quasi-Monte Carlo methods. The core of randomized quasi-Monte Carlo is to find an effective and fast algorithm to scramble (randomize) the underlying quasirandom sequences. This approach to error estimation is based on treating each scrambled sequence as a different and independent random sample from a family of randomly scrambled quasirandom numbers. Thus, randomized quasi-Monte Carlo overcomes the main disadvantage of quasi-Monte Carlo while maintaining its favorable convergence rate. Secondarily, scrambling gives us a simple and unified way to generate quasirandom numbers for parallel, distributed, and Grid-based computational environments [6]. The idea of scrambling quasirandom sequences was first proposed by Cranley and Patterson, who took lattice points and randomized them by adding random shifts to the sequences [7]. Later, Owen [15] and Tezuka [21] independently
developed two powerful scrambling methods for (t, s)-sequences. Owen also explicitly pointed out that scrambling can be used to provide error estimates for quasi-Monte Carlo (QMC). Although many other methods for scrambling (t, s)-sequences [13] have been proposed, most of them are modified or simplified Owen or Tezuka schemes. Owen's scheme is theoretically powerful for (t, s)-sequences. The problem with Owen scrambling is its computational complexity. In this paper we propose and study effective generation of the most popular quasirandom sequence, the Sobol sequence, with Owen scrambling using GPU computing. The model for GPU computing is to use a CPU and GPU together in a heterogeneous computing model. The sequential part of the application runs on the CPU and the computationally intensive part runs on the GPU. From the user's perspective, the application just runs faster because it is using the high performance of the GPU to boost performance. The paper is organized as follows. In the next section we give some preliminaries: a description of the Sobol sequence and of Owen scrambling. The third section presents our algorithm for the GPU implementation and some numerical results. The last section gives some conclusions and directions for future work.
2 Preliminaries
2.1 Sobol Sequence
The Sobol sequence, [19], is the most popular quasirandom sequence because of its simplicity and efficiency in implementation. Many methods have been proposed for scrambling quasirandom sequences. Some scrambling methods [1,4,11,16,17] were designed specifically for the Sobol sequence. The construction of the Sobol sequence uses linear recurrence relations over the finite field $\mathbb{F}_2 = \{0, 1\}$, that is why pure digital permutation, where zero is not changed, is not suitable for the Sobol sequence [4]. The linear permutation is also not a proper method for scrambling the Sobol sequence, [4]. We use the Definition below, which covers most digital (t, m, s)-nets in base 2. The Sobol sequence is a (t, s)-sequence in base 2 and is a particular case of this definition.

Definition. Let $A_1, \ldots, A_s$ be infinite matrices $A_k = \{a_{ij}^{(k)}\}$, $i, j = 0, 1, \ldots$, with $a_{ij}^{(k)} \in \{0, 1\}$, such that $a_{ii}^{(k)} = 1$ for all $i$ and $k$, and $a_{ij}^{(k)} = 0$ if $i < j$. The $\tau^{(1)}, \ldots, \tau^{(s)}$ are sequences of permutations of the set $\{0, 1\}$. Each non-negative integer $n$ may be represented in the binary number system as

$$ n = \sum_{i=0}^{r} b_i 2^i. $$

Then the $n$th term of the low-discrepancy sequence $\sigma$ is defined by

$$ x_n^{(k)} = \sum_{j=0}^{r} 2^{-j-1}\, \tau_j^{(k)}\!\left( \bigoplus_{i=0}^{j} b_i\, a_{ji}^{(k)} \right), $$
where by "⊕" we denote the operation "bit-wise addition modulo 2". The next lemma explains how we generate consecutive terms of the sequence.

Lemma. Let $\sigma$ be a sequence or net satisfying the above Definition, and let the non-negative integers $n$, $p$, $m$ be given. Suppose that we desire the first $p$ binary digits of elements in $\sigma$ with indices of the form $2^m j + n < 2^p$; this implicitly defines a set of compatible $j$'s. Thus the only numbers we need to compute are

$$ y_j^{(k)} = 2^p\, x_{2^m j + n}^{(k)}. $$

The integers $\{v_r^{(k)}\}_{r=0}^{\infty}$, which we call "twisted direction numbers", are defined by

$$ v_r^{(k)} = \sum_{t=0}^{p-1} 2^{p-1-t} \left( \bigoplus_{j=m}^{r} a_{tj}^{(k)} \right). $$

Suppose that the largest power of two that divides $2^m (j + 1) + n$ is $2^i$, i.e. $2^m (j + 1) + n = 2^i (2K + 1)$. Then the following equality holds:

$$ y_{j+1}^{(k)} = y_j^{(k)} \oplus v_i^{(k)}. $$
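As an illustration of the Lemma, the sketch below advances a subsequence one index at a time with a single XOR per coordinate. The precomputation of the twisted direction numbers and of the first scaled term from the generating matrices is assumed to have been done elsewhere; the class name and layout are illustrative and not the authors' generator.

```cpp
#include <cstdint>
#include <vector>

// Gray-code-free generation of consecutive Sobol terms with indices 2^m*j + n.
// Assumes p < 64 and that v[k][i] has been precomputed for every i that can occur.
struct SobolStream {
    unsigned p;                                   // number of binary digits kept
    unsigned m;                                   // stride exponent
    std::uint64_t n;                              // offset of the subsequence
    std::vector<std::vector<std::uint64_t>> v;    // v[k][i]: twisted direction numbers
    std::vector<std::uint64_t> y;                 // current scaled term, one entry per coordinate

    // Advance from index 2^m*j + n to 2^m*(j+1) + n (j counted from 0).
    void next(std::uint64_t j) {
        std::uint64_t index = ((j + 1) << m) + n;
        unsigned i = 0;
        while (((index >> i) & 1u) == 0u) ++i;    // 2^i is the largest power of two dividing the index
        for (std::size_t k = 0; k < y.size(); ++k) y[k] ^= v[k][i];
    }

    // k-th coordinate of the current term as a number in [0, 1).
    double coordinate(std::size_t k) const {
        return static_cast<double>(y[k]) / static_cast<double>(std::uint64_t(1) << p);
    }
};
```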
2.2 Owen Scrambling
The purpose of Owen scrambling is not only to improve the quality of the Sobol sequence but also to make it suitable for parallel applications. The parallelization of quasi-Monte Carlo applications has been under investigation during the last two decades due to the computationally intensive nature of the problems solved. The successful parallel implementation of a quasi-Monte Carlo application depends crucially on the parallelization of the quasirandom sequences used [5,6]. Much of the recent work on parallelizing quasi-Monte Carlo methods has been aimed at splitting a quasirandom sequence into many subsequences which are then used independently on various parallel processes, for example in [1,2]. This method works well for the parallelization of pseudorandom numbers, but due to the nature of quality in quasirandom numbers, this technique has some difficulties. Scrambling proved to be a very efficient way of parallelization, especially suitable for heterogeneous computing environments.

Owen scrambling. Let $x_n = (x_n^{(1)}, x_n^{(2)}, \ldots, x_n^{(s)})$ be a quasirandom number in $[0, 1)^s$, and let $z_n = (z_n^{(1)}, z_n^{(2)}, \ldots, z_n^{(s)})$ be the scrambled version of the point $x_n$. Suppose each $x_n^{(j)}$ can be represented in base $b$ as $x_n^{(j)} = (0.x_{n1}^{(j)} x_{n2}^{(j)} \ldots x_{nK}^{(j)} \ldots)_b$, with $K$ being the number of digits to be scrambled. Then nested scrambling proposed by Owen [15,18] can be defined as follows:

$$ z_n^{(j)} = \sigma(x_n^{(j)}), \quad j = 1, 2, \ldots, s, $$

where $\sigma = \{\pi_1, \pi_2, \ldots, \pi_K\}$ and $z_{ni}^{(j)} = \pi_i(x_{ni}^{(j)})$ for $i = 1, 2, \ldots, K$. Here $\pi_i$ is a uniformly chosen permutation of the digits $\{0, \ldots, b-1\}$. Of course, a (t, m, s)-net remains a (t, m, s)-net under nested scrambling. However, nested scrambling
requires $b^{i-1}$ permutations to scramble the $i$th digit. Owen scrambling (nested scrambling), which can be applied to all (t, s)-sequences, is powerful; however, from the implementation point of view, nested scrambling, or so-called path-dependent permutations, requires a considerable amount of bookkeeping and leads to a more problematic implementation. Owen [15] shows that a scrambled (t, m, s)-net is a (t, m, s)-net with probability one, and a scrambled (t, s)-sequence is a (t, s)-sequence with probability one. Owen [15,16,17] shows that the variance of a scrambled net estimator converges to zero faster than the variance of an ordinary Monte Carlo estimator does, while cautioning that the faster rate may not set in until the number of points becomes very large. For sufficiently smooth integrands, the variance is $O(1/n^{3-\varepsilon})$ in the sample size $n$.
3 Algorithm for GPU Implementation and Computational Tests
3.1 The Graphics Processing Unit
The recent advances in hardware functionality and programmability of graphics processors (GPUs) have greatly increased their appeal as add-on co-processors for general-purpose computing. The programming is based on CUDA (Compute Unified Device Architecture) [23], which is a hardware and software architecture that issues and manages data-parallel computations on the GPU. The GPU is a Single Instruction Multiple Data (SIMD) parallel device. The GPU has to be considered as a coprocessor, while the CPU is the host. The model for GPU computing is to use a CPU and GPU together in a heterogeneous computing model. The sequential part of the application runs on the CPU and the computationally intensive part runs on the GPU. From the user's perspective, the application just runs faster because it is using the high performance of the GPU to boost performance. The application developer has to modify the application to take the compute-intensive kernels and map them to the GPU. The rest of the application remains on the CPU (Intel(R) Core 2 Quad CPU Q6600 @ 2.40GHz). Mapping a function to the GPU involves rewriting the function to expose its parallelism and adding calls to I/O functions to move data to and from the GPU. Below we present our algorithm for the GPU/CPU implementation, where we use an NVIDIA GeForce 9800 GT GPU, which allows vectorized operations only in single precision. Nevertheless, generation of the Sobol sequence in double precision can also be done efficiently on this GPU, because only a simple operation (subtraction of the constant 1) must be carried out on the CPU, while the more expensive operations related to the scrambling are done in the same way as in the single-precision case on the GPU.
3.2 The Algorithm
The algorithm has two phases. The first phase involves generation of the terms of the sequence without scrambling as described in [1], which requires several bitwise XOR operations for generating the first term of the sequence and only one bitwise operation per coordinate for generating the subsequent terms, plus one floating point operation (subtraction of the constant 1) per coordinate. Using the Gray code is not necessary because the preprocessing eliminates the need to obtain the sequence in a different order. This algorithm is described in detail in [1]; in this work we only make use of the high number of cores in the GPU to speed up the implementation. The most demanding part of the generation is the implementation of scrambling that follows the requirements for statistical independence imposed by Owen's construction. Instead of keeping in memory all the terms of the sequence and performing the random permutations, we utilize a family of pseudorandom functions to generate random trees on the fly. Using the GPU and porting the publicly available implementation of the SALSA cipher [24], this method becomes feasible and usable in real-world situations. The algorithm for scrambling the nth coordinate of the N-th term of the sequence follows:
1. Input: the coordinate n, the term of the sequence without scrambling, τ = 0.t1 t2 t3 . . ., and the initial seed s for the pseudorandom function family.
2. Scramble 9 consecutive bits of τ using a sequence of 512 random bits, generated from the initial seed s, the coordinate n, and the sequence of bits scrambled so far. The 512 random bits are interpreted as a random binary tree, where the root of the tree is bit number 0 and, if the j-th random bit corresponds to a node of the binary tree, bit number 2j + 1 corresponds to the left child and bit number 2j + 2 to the right child. The first bit of the 9 consecutive bits is changed if and only if the root of the tree is 1; in such a case we move to the right child, otherwise we move to the left child. We continue until we have scrambled all 9 bits.
3. Continue with the next 9 bits until all the bits of τ have been scrambled.
We used the SALSA function with 20 rounds. It is amenable to implementation on a GPU since it uses only a small number of variables, uses integer operations, and does not involve a high number of logical operations and conditional statements. The cost of scrambling 9 bits of a term of the sequence comes mainly from the invocation of the SALSA pseudorandom function. Scrambling one term is achieved with 3 invocations in single precision and with 6 invocations in double precision. We used in our tests the 20-round version of SALSA, with 512-bit output.
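A minimal sketch of step 2 is given below, assuming the 512 random bits have already been produced by the pseudorandom function. The descent rule shown here branches on the data bits, which is one common way of realizing the path-dependent permutations; the exact traversal rule of the actual generator may differ in detail.

```cpp
#include <bitset>
#include <cstdint>

// Scramble one 9-bit block using 512 random bits viewed as an implicit
// binary tree (node j has children 2j+1 and 2j+2). The random bit at the
// current node decides whether the data bit is flipped; the data bit
// decides which child is visited next.
std::uint32_t scramble9(std::uint32_t block, const std::bitset<512>& randomBits) {
    std::uint32_t result = 0;
    std::size_t node = 0;                                        // start at the root of the random tree
    for (int level = 0; level < 9; ++level) {
        std::uint32_t bit = (block >> (8 - level)) & 1u;         // most significant bit first
        std::uint32_t flipped = bit ^ static_cast<std::uint32_t>(randomBits[node]);
        result = (result << 1) | flipped;
        node = 2 * node + 1 + bit;                               // left child for 0, right child for 1
    }
    return result;
}
```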
3.3 Numerical Tests
The computational tests are done with high-dimensional integral problems which always are a good way to test the quality of quasirandom sequences. We use a published set of test integrands [8,9,12,20,22]. Consider a class of test functions:
$$ I_1(f) = \int_0^1 \cdots \int_0^1 \prod_{i=1}^{s} \frac{\pi}{2}\,\sin(\pi x_i)\; dx_1 \ldots dx_s = 1, $$

$$ I_2(f) = \int_0^1 \cdots \int_0^1 \prod_{i=1}^{s} \frac{|4x_i - 2| + a_i}{1 + a_i}\; dx_1 \ldots dx_s = 1, $$

where the a_i are parameters. First, we present the accuracy achieved using the Sobol sequence with Owen scrambling (Table 1). For comparison, Table 2 shows the accuracy when solving I1 and I2 using the Sobol sequence without scrambling. Our GPU/CPU implementation fully supports the theoretical estimates that the variance of the scrambled estimator converges to zero faster than the variance of the ordinary Monte Carlo estimator does. Fig. 1 compares the elapsed times of the two implementations, CPU and GPU/CPU (i.e., the implementation using the GPU as a co-processor). The GPU/CPU implementation is very efficient.

Table 1. Accuracy for I1 and I2 with Owen scrambling (for dimension s = 10)

  N       I1         I2, ai = 0   I2, ai = i   I2, ai = i^2
  2^10    8.32E-03   1.23E-01     1.26E-03     7.70E-05
  2^11    9.34E-03   7.23E-02     6.50E-04     3.24E-05
  2^12    4.53E-03   6.26E-03     1.70E-04     2.54E-05
  2^13    1.34E-03   7.15E-03     1.47E-04     1.69E-05
  2^14    8.05E-03   6.51E-03     4.14E-05     2.02E-06
  2^15    3.01E-03   4.98E-05     1.02E-05     2.51E-07
  2^16    7.53E-04   2.68E-03     1.18E-06     3.27E-08
  2^17    3.58E-05   1.17E-03     6.20E-07     2.38E-08
  2^18    7.91E-04   5.64E-04     9.76E-07     1.19E-08
  2^19    4.47E-04   6.33E-04     1.85E-07     1.25E-08
  2^20    1.22E-04   5.39E-04     6.18E-07     1.27E-07

Table 2. Accuracy for I1 and I2 without scrambling (for dimension s = 10)

  N       I1         I2, ai = 0   I2, ai = i   I2, ai = i^2
  2^10    2.56E-02   4.51E-02     7.34E-04     3.44E-05
  2^11    2.05E-02   4.42E-02     6.02E-04     4.55E-05
  2^12    7.56E-03   1.92E-02     3.02E-04     2.91E-05
  2^13    4.61E-03   8.20E-03     1.69E-04     1.50E-05
  2^14    4.23E-03   6.40E-03     3.16E-05     2.33E-07
  2^15    2.06E-03   3.94E-03     1.30E-05     2.20E-07
  2^16    1.56E-03   3.80E-03     2.49E-06     5.97E-09
  2^17    9.13E-04   8.33E-04     2.76E-07     5.98E-08
  2^18    7.94E-04   5.56E-04     1.77E-06     5.35E-08
  2^19    6.90E-04   5.53E-04     5.08E-07     3.42E-08
  2^20    1.69E-04   4.73E-04     5.08E-07     3.98E-08
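For illustration, a minimal sketch of how the test integral I2 is estimated from any generated point set (scrambled Sobol, plain Sobol, or pseudorandom) is given below; the pseudorandom points in the usage example merely stand in for the quasirandom sequence produced by the generator described above.

```cpp
#include <cmath>
#include <random>
#include <vector>

// Value of the I2 integrand at a point x in [0,1)^s for parameters a.
double integrandI2(const std::vector<double>& x, const std::vector<double>& a) {
    double f = 1.0;
    for (std::size_t i = 0; i < x.size(); ++i)
        f *= (std::fabs(4.0 * x[i] - 2.0) + a[i]) / (1.0 + a[i]);
    return f;
}

// Quasi-Monte Carlo (or Monte Carlo) estimate: average the integrand over the point set.
double estimateI2(const std::vector<std::vector<double>>& points, const std::vector<double>& a) {
    double sum = 0.0;
    for (const auto& x : points) sum += integrandI2(x, a);
    return sum / static_cast<double>(points.size());
}

int main() {
    std::mt19937 gen(1);
    std::uniform_real_distribution<double> uni(0.0, 1.0);
    const std::size_t s = 10, N = 1 << 14;
    std::vector<double> a(s);
    for (std::size_t i = 0; i < s; ++i) a[i] = static_cast<double>(i);   // the a_i = i case
    std::vector<std::vector<double>> pts(N, std::vector<double>(s));
    for (auto& x : pts) for (double& c : x) c = uni(gen);                // plain Monte Carlo stand-in
    double err = std::fabs(estimateI2(pts, a) - 1.0);                    // exact value is 1
    (void)err;
    return 0;
}
```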
Fig. 1. Numerical results for I1 (for dimension s = 10): elapsed time in milliseconds (logarithmic scale) versus the number of integration points 2^N, for the CPU and GPU implementations
4 Conclusions
Our computational results demonstrate that the GPU offers a low-cost and high-performance computing solution for Owen scrambling, which is theoretically better but has been avoided due to its computational burden. Various improvements to this algorithm are possible, for instance by using a more efficient implementation of the SALSA pseudorandom function on this GPU or by choosing a different family of pseudorandom functions. Continued improvements will only make the GPU more attractive as a high-performance computing device for quasirandom applications. Future research and algorithm design in this area will include a Grid/GPU implementation.
Acknowledgement Supported by the Ministry of Education and Science of Bulgaria under Grant No. DO02-146/2008.
References 1. Atanassov, E.: A New Efficient Algorithm for Generating the Scrambled Sobol Sequence. In: Dimov, I.T., Lirkov, I., Margenov, S., Zlatev, Z. (eds.) NMA 2002. LNCS, vol. 2542, pp. 83–90. Springer, Heidelberg (2003) 2. Bromley, B.C.: Quasirandom Number Generation for Parallel Monte Carlo Algorithms. Journal of Parallel Distributed Computing 38(1), 101–104 (1996) 3. Caflisch, R.: Monte Carlo and quasi-Monte Carlo methods. Acta Numerica 7, 1–49 (1998) 4. Chi, H.: Scrambled Quasirandom Sequences and Their Applications, PhD dissertation, FSU (2004)
5. Chi, H., Jones, E.: Generating Parallel Quasirandom Sequences by using Randomization. Journal of Distributed and Parallel Computing 67(7), 876 (2007) 6. Chi, H., Mascagni, M.: Efficient Generation of Parallel Quasirandom Sequences via Scrambling. In: Shi, Y., van Albada, G.D., Dongarra, J., Sloot, P.M.A. (eds.) ICCS 2007. LNCS, vol. 4487, pp. 723–730. Springer, Heidelberg (2007) 7. Cranley, R., Patterson, T.: Randomization of number theoretic methods for multiple integration. SIAM Journal of Numerical Analysis 13(6), 904–914 (1976) 8. Davis, P., Rabinowitz, P.: Methods of Numerical Integration. Academic Press, New York (1984) 9. Genz, A.: The numerical evaluation of multiple integrals on parallel computers. In: Keast, P., Fairweather, G. (eds.) Numerical Integration, pp. 219–230. D. Reidel, Dordrecht (1987) 10. Goeddeke, D., Strzodka, R., Turek, S.: Performance and accuracy of hardwareoriented native-, emulated- and mixed-precision solvers in FEM simulations. International Journal of Parallel, Emergent and Distributed Systems 22(4), 221–256 (2007) 11. Hong, H., Hickernell, F.: Algorithm 823: Implementing scrambled digital sequences. ACM Transactions on Mathematical Software 29(2), 95–109 (2003) 12. Joe, S., Kuo, F.: Remark on Algorithm 659: Implementing Sobol’s quasirandom sequence generator. ACM Transactions on Mathematical Software 29(1), 49–57 (2003) 13. Niederreiter, H.: Low-discrepancy and low-dispersion sequences. Journal of Number Theory 30(1), 51–70 (1988) 14. Niederreiter, H.: Random Number Generations and Quasi-Monte Carlo Methods. SIAM, Philadelphia (1992) 15. Owen, A.: Randomly permuted (t, m, s)-nets and (t, s)-sequences. In: Monte Carlo and Quasi-Monte Carlo Methods in Scientific Computing. Lecture Notes in Statistics, vol. 106, pp. 299–317 (1995) 16. Owen, A.: Scrambled net variance for integrals of smooth functions. Annals of Statistics 25, 1541–1562 (1997) 17. Owen, A.: Scrambling Sobo’l and Niederreiter-Xing points. Journal of Complexity 14, 466–489 (1998) 18. Owen, A.: Variance with alternative scramblings of digital nets. ACM Transactions on Modeling and Computer Simulation 13(4), 363–378 (2003) 19. Sobol, I.M.: Uniformly distributed sequences with additional uniformity properties. USSR Comput. Math. and Math. Phy. 16, 236–242 (1976) 20. Radovic, I., Sobol, I.M., Tichy, R.: Quasi-Monte Carlo methods for numerical integration: Comparison of different low discrepancy sequences. Monte Carlo Methods and Appl. 2, 1–14 (1996) 21. Tezuka, S.: Quasi-Monte Carlo discrepancy between theory and practice. In: Fang, K.T., Hickernell, F., Niederreiter, H. (eds.) MC&QMC Methods, pp. 124–140. Springer, Berlin (2002) 22. Wang, X., Fang, K.: The effective dimension and quasi-Monte Carlo. Journal of Complexity 19(2), 101–124 (2003) 23. http://www.nvidia.com/cuda 24. http://en.wikipedia.org/wiki/Salsa20
Applying the Improved Saleve Framework for Modeling Abrasion of Pebbles Péter Dóbé, Richárd Kápolnai, András Sipos, and Imre Szeberényi Budapest University of Technology and Economics, Hungary {dobe,kapolnai,szebi}@iit.bme.hu,
[email protected]
Abstract. Saleve is a generic framework for making the development of Parameter Study tasks easy for scientists and engineers not familiar with distributed technologies. In this paper we present our lightweight authentication procedure for Saleve to delegate user credentials towards the grid. Then we present a detailed statistics of abrasion of pebbles gained with Saleve. Finally we make remarks on the makespan of the application and look for ways to reduce it.
1 Introduction
A significant number of scientific and engineering problems could be regarded as a Parameter Study (PS) problem, i.e. a parameter space is to be traversed which can be partitioned into subdomains, and these subdomains can be processed independently. Consequently, these problems can be parallelized trivially. Although there are many computational infrastructures to execute a PS problem, these systems usually need deep knowledge for optimal utilization; thus a researcher could benefit from a tool which liberates him/her from the task of managing jobs in a distributed system. The Saleve framework [14] aims to fulfil this mission. Using the services of Saleve is not challenging: the researcher needs to make some straightforward modifications in the source code of the PS program and recompile it, and the binary can be executed either on the local desktop machine, or the same binary can submit itself to a grid or a cluster. The architecture of the Saleve component connecting to the EGEE grid [11] was presented in [10] and [7]. In this paper we first wish to introduce our architectural development in the method of authentication, which integrates the authentication between Saleve and the EGEE grid, and between the Saleve services and the researcher (user). The user does not have to be aware whether the jobs were submitted to a grid or to a cluster; however, these systems need different authentication. Our method is based on the proxy certificate mechanism. If the user holds a grid-enabled certificate, a common proxy will be generated with the appropriate grid-specific extension. Otherwise, a dummy proxy certificate will be generated for authentication purposes. Compared to other solutions, the main advantage of our approach is the lightweight client side, i.e. no heavy software dependencies on the user's desktop machine. Second, we apply the Saleve framework to a computational model for a particular problem in modeling abrasion of pebbles, which clearly belongs to the
domain of PS. Our final goal is to treat the general, three-dimensional case and investigate the evolution of the shapes emerging in the abrasion process. Here we present a two-dimensional model, which requires much less computational effort; however, a detailed investigation cannot be carried out rapidly without a parallel implementation. Finally, we report observations on the performance of this Saleve application and present ideas for improvement which may be generalized to other PS applications.
2 Grid Authentication Sequence in Saleve
This section describes how Saleve mediates the authentication of the user towards a grid. More specifically, we use the EGEE infrastructure and its middleware, gLite [11], as the grid, but this method could probably be generalized with minor modifications to other middlewares built on the proxy and attribute certificate mechanisms (RFC 3820 and 3281). In this section we distinguish between a user, who is a researcher (and thus a Saleve user), and a grid user, who submits jobs to the grid. To present the authentication procedure, we need to outline the basics of Saleve. In a general usage scenario the user has a C/C++ scientific application to submit to a distributed environment. The first task the user has to accomplish in order to exploit the framework is making the Saleve client. This means some straightforward modifications need to be done in the original source code, i.e. separating the parameter space partitioning logic from the main task, and the new source code is to be compiled and linked against the Saleve client library. We call the created application the Saleve client. The client can be executed on the local computer for testing and development, but in the case of a computationally intensive task the client can submit itself to a Saleve server. The Saleve server receives the client's task already cut into pieces and dispatches these pieces as jobs to a selected adapter (plug-in) which handles the jobs. Currently the available adapters are for Condor, gLite, and local execution. Here we discuss the gLite plugin. A detailed presentation of Saleve can be found in [10,7,14].
2.1 Authentication towards the gLite Middleware
When the Saleve server submits jobs to the grid, it has to authenticate itself as a grid user. Consequently, the researcher using Saleve (the user) needs to be a grid user at the same time, and has to delegate his/her right of accessing grid resources to the Saleve server. This supersedes our previous approach where the Saleve server itself was the grid user. Usually when a grid user wishes to submit a job to the grid, a short-term proxy certificate is created [12], signed by the grid user with the long-term key and extended by the VO. This short-term certificate and private key can be used by the job for delegation purposes with much less security risk. We itemize these steps as we need to alter the roles in the following subsection.
Step 1. The grid user generates a proxy certification request used only for authenticating to the VO.
Step 2. The grid user signs the proxy request with the long-term key.¹
Step 3. The grid user asks the VO to issue an attribute certificate (AC) based on his/her membership.
Step 4. The grid user creates a new proxy request extended by the returned AC, and signs it with the old proxy generated in steps 1 and 2.
The steps above are integrated in the tool voms-proxy-init in gLite.
2.2 Involving the Client in the Delegation Process
Although the four steps above can be carried out by voms-proxy-init, it would impose expensive assumptions on the client side. If we used voms-proxy-init in the Saleve client, many software dependencies would need to be installed, but our intention is to keep the client as lightweight as possible. On the other hand, in order to let the server authenticate the user, the client has to delegate the user credentials. This is achieved by distributing the four steps between the client and the server after a mutual authentication:
Step 1. The server generates a proxy request with the subject set to the distinguished name of the user, and sends it to the client.
Step 2. The client signs the request with the long-term key and sends it back.
Step 3. The server asks the VO to issue an AC for the subject of the proxy.
Step 4. The server creates a new proxy request extended by the returned AC, and signs it with the old proxy signed by the client (the user).
The mutual authentication is performed in the SSL handshake when the client-server connection is built up, maintaining a list of certificate authorities to be accepted. Alternatively, the server could verify the identity of the client by verifying the signature on the proxy. This delegation procedure is favourable from many aspects: the client stays lightweight as it only needs to sign a standard certificate request, and the long-term credentials are safely stored only by the client. If the server is going to submit the jobs to a batch system where there is no need to delegate user credentials (e.g. a local Condor pool), steps 3 and 4 are simply omitted as they belong to the Saleve gLite plugin. Moreover, steps 3 and 4 can be executed by calling voms-proxy-init.
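The following schematic sketches the sequencing of these four steps between client and server. All types and functions in it are illustrative stubs: the real implementation exchanges X.509 structures over the gSOAP-based client-server channel and contacts the VOMS server for the attribute certificate.

```cpp
#include <string>

struct ProxyRequest  { std::string subject; };
struct SignedProxy   { std::string pem; };
struct AttributeCert { std::string attributes; };

// Server side, step 1: build a proxy request whose subject is derived from the
// user's distinguished name (known from the mutual SSL authentication).
ProxyRequest serverCreateProxyRequest(const std::string& userDN) {
    return ProxyRequest{userDN + "/CN=proxy"};
}

// Client side, step 2: sign the request with the user's long-term key.
SignedProxy clientSignRequest(const ProxyRequest& request, const std::string& longTermKey) {
    return SignedProxy{"signed(" + request.subject + ") with " + longTermKey};
}

// Server side, steps 3 and 4: obtain the VOMS attribute certificate and wrap
// everything into the proxy that the grid jobs will carry.
AttributeCert serverRequestAttributeCert(const std::string& subject) {
    return AttributeCert{"VO attributes for " + subject};
}
SignedProxy serverExtendProxy(const SignedProxy& proxy, const AttributeCert& ac) {
    return SignedProxy{proxy.pem + " + " + ac.attributes};
}

int main() {
    ProxyRequest req = serverCreateProxyRequest("/DC=org/DC=example/CN=Researcher"); // step 1
    SignedProxy proxy = clientSignRequest(req, "user-long-term-key");                // step 2
    AttributeCert ac = serverRequestAttributeCert(req.subject);                      // step 3
    SignedProxy gridProxy = serverExtendProxy(proxy, ac);                            // step 4
    (void)gridProxy;
    return 0;
}
```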
¹ Steps 1 and 2 can be repeated in a chain by replacing the "long-term key" with the newly created proxy, or it is even possible to omit them and use the long-term key directly to contact the VO, but these scenarios are not relevant in our case.
Related Work. There are other toolkits that have already implemented delegation mechanisms. The Globus Toolkit provides these functionalities [12], which can be utilized even via the GSI plugin in the gSOAP toolkit [4,15].
Another approach to implement credential delegation is extending the web service interface of the grid service with delegation-related methods, which is more similar to our approach. One notable example is CREAM, the new computing element implementation of gLite [2]; another one is the Delegation Interface offered by services of the ARC middleware [1]. The developers of GridSite also proposed a Delegation portType [13]. However, we could not afford to rely on these tools in order to keep the Saleve client lightweight, which now only depends on the gSOAP [15] and OpenSSL libraries.
3 Simulating the Abrasion of Pebbles
A wide range of possible applications of the Saleve framework could be mentioned here; as an example we take a new model of abrasion of pebbles [8,9]. The final goal is to develop a three-dimensional discrete model of the abrasion process. Even the experience with the planar, two-dimensional model showed that a parallel implementation of the code is needed to obtain fine results. A detailed description is given in [8]. The model captures the abrasion process of one pebble by collisions with several impactors over time. The shape of the pebble is a convex polygon. We distinguish between two possible events:
1. the impactor is much larger than the pebble (a randomly chosen vertex of the polygon is chopped and replaced by a small edge);
2. the impactor is much smaller than the pebble (a randomly chosen edge of the polygon is retreated parallel to itself).
One of the key questions is the shape of a pebble after a long procedure of abrasion, i.e. a long sequence of events 1 or 2 occurring with probability p and (1 - p), respectively. The initial shape of the pebbles is obtained by discretising an ellipse into a polygon with 40 vertices. The elongation of the shape is given by $r^0 = R^0_{max}/R^0_{min}$, where r denotes the flatness, and $R_{max}$ and $R_{min}$ are the maximal and minimal distances between the centroid of the polygon and its perimeter. The superscript denotes the number of the numerical step, i.e. $r^0$ is the initial flatness. Given the initial shape and the parameter p, we aim to investigate the outcome of the abrasion process. After a threshold in the number of collisions (in the presented simulation this number was 1000) we regard a pebble as a needle if at any step $R^i_{max}/R^i_{min} > 8.0$ and $R^i_{max}/R^i_{min} > 1.5 \cdot R^0_{max}/R^0_{min}$. On the other hand, for $R^i_{max}/R^i_{min} < 1.5$ and $R^i_{max}/R^i_{min} < R^0_{max}/R^0_{min} + \varepsilon$ the final shape is considered to be a circle ($\varepsilon$ is an arbitrarily small number). To avoid endless runs, the number of collisions is capped at 90000 steps; the pebbles in this group have reached none of the mentioned limit states. Fig. 1(a) shows the bifurcation diagram and 1(b) contains the number of steps required for the decision about the type of the abrasion process.
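Below is a simplified, self-contained sketch of a single abrasion run. The geometric treatment (vertex averaging for the centroid, vertex distances for R_min, no handling of degenerate or vanishing edges) and all numerical constants are illustrative assumptions and do not reproduce the authors' implementation.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdio>
#include <random>
#include <vector>

struct Pt { double x, y; };

// Intersection of the infinite lines through (a1,a2) and (b1,b2); assumed non-parallel.
static Pt lineIntersection(Pt a1, Pt a2, Pt b1, Pt b2) {
    double d1x = a2.x - a1.x, d1y = a2.y - a1.y;
    double d2x = b2.x - b1.x, d2y = b2.y - b1.y;
    double det = d1x * d2y - d1y * d2x;
    double t = ((b1.x - a1.x) * d2y - (b1.y - a1.y) * d2x) / det;
    return Pt{a1.x + t * d1x, a1.y + t * d1y};
}

// Flatness R_max/R_min with the centroid approximated by the vertex average
// and R_min approximated by the minimal vertex distance.
static double flatness(const std::vector<Pt>& v) {
    Pt c{0.0, 0.0};
    for (const Pt& q : v) { c.x += q.x; c.y += q.y; }
    c.x /= static_cast<double>(v.size()); c.y /= static_cast<double>(v.size());
    double rmax = 0.0, rmin = 1e300;
    for (const Pt& q : v) {
        double r = std::hypot(q.x - c.x, q.y - c.y);
        rmax = std::max(rmax, r); rmin = std::min(rmin, r);
    }
    return rmax / rmin;
}

int main() {
    const double lambda = 0.05, delta = 1e-3, p = 0.5, eps = 1e-2;
    const double PI = std::acos(-1.0);
    std::mt19937 gen(42);
    std::uniform_real_distribution<double> uni(0.0, 1.0);

    std::vector<Pt> v;                                 // ellipse discretised into 40 vertices (CCW)
    for (int i = 0; i < 40; ++i) {
        double t = 2.0 * PI * i / 40.0;
        v.push_back(Pt{3.0 * std::cos(t), 1.0 * std::sin(t)});
    }
    const double r0 = flatness(v);

    for (int step = 1; step <= 90000; ++step) {
        std::size_t n = v.size();
        std::size_t i = std::uniform_int_distribution<std::size_t>(0, n - 1)(gen);
        std::size_t prev = (i + n - 1) % n, next = (i + 1) % n, next2 = (i + 2) % n;
        if (uni(gen) < p) {                            // event 1: chop vertex i, creating a small edge
            Pt p1{v[i].x + lambda * (v[prev].x - v[i].x), v[i].y + lambda * (v[prev].y - v[i].y)};
            Pt p2{v[i].x + lambda * (v[next].x - v[i].x), v[i].y + lambda * (v[next].y - v[i].y)};
            v[i] = p1;
            v.insert(v.begin() + static_cast<std::ptrdiff_t>(i) + 1, p2);
        } else {                                       // event 2: retreat edge (i, next) parallel to itself
            double ex = v[next].x - v[i].x, ey = v[next].y - v[i].y, len = std::hypot(ex, ey);
            Pt nrm{-ey / len, ex / len};               // inward normal for CCW ordering
            Pt s1{v[i].x + delta * nrm.x, v[i].y + delta * nrm.y};
            Pt s2{v[next].x + delta * nrm.x, v[next].y + delta * nrm.y};
            Pt newI = lineIntersection(v[prev], v[i], s1, s2);
            Pt newN = lineIntersection(s1, s2, v[next], v[next2]);
            v[i] = newI; v[next] = newN;
        }
        double r = flatness(v);
        if (step > 1000) {                             // classification thresholds from the text
            if (r > 8.0 && r > 1.5 * r0) { std::printf("needle after %d steps\n", step); return 0; }
            if (r < 1.5 && r < r0 + eps) { std::printf("circle after %d steps\n", step); return 0; }
        }
    }
    std::printf("no limit state reached within 90000 steps\n");
    return 0;
}
```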
Fig. 1. Bifurcation diagrams for the abrasion of pebbles, plotted over the parameter p and the initial flatness r0: (a) final distribution of pebbles, (b) steps needed for classification. The black points forming a thick curve in 1(a) denote those runs where none of the limit states has been reached in 90000 steps of abrasion. The points above this curve denote pebbles that became a needle; the points below the curve correspond to circle final shapes.
4 Performance of the Application
Although the simulation presented in Section 3 is an ordinary PS task, executing it efficiently on the grid raises non-trivial problems. We define efficiency in terms of the maximum completion time or makespan, because it is the primary indicator of the satisfaction of a Saleve user. Below we give an illustration of our observations on the performance of a Saleve task, and suggest some ways for further improvement. The completion times of the jobs, and thus the makespan as well, basically depend on two weakly correlated factors: the processing time and the queueing time (or latency) of each job submitted to the grid. (For simplicity, we consider the waiting time after completion negligible compared to the queueing time. Note that while the two types of time are independent for a single job, the maximum values of the two sets may be correlated.) The impact of these factors is illustrated in Fig. 2, where the parameter space was partitioned into 700 pieces. Since both the maximum queueing time and the maximum processing time of an arbitrary job are difficult to foresee, we propose a posteriori approaches to reduce them, i.e. we intervene in the execution after submitting the jobs.
Reducing the Queueing Time. Handling latency a priori is assigned to the grid schedulers; however, it is inevitable that a small percentage of jobs spend an extremely long time in a queue. To avoid this, which is crucial to reduce the makespan, we have tried to resubmit these jobs after a certain timeout, and cancel the old ones. The resulting completion times are shown in Fig. 3. This method guarantees that sooner or later every job ends up in a queue of limited length. Since the grid currently has no usage cost for a researcher, our objective function (the efficiency) does not directly depend on the load of the worker nodes or on the latency experienced by other users. Consequently it may be reasonable
Fig. 2. Job completion times ordered by queueing time
to experiment with a greedier procedure. When a job is considered to be in an extremely long queue, instead of cancelling and resubmitting it, the Saleve server could resubmit it several times at once. This could decrease the expected completion time of the resubmitted single job. Additionally, if we resubmitted only a limited percentage of the jobs, it would not significantly increase the load of the grid.
Reducing the Processing Time. The most intuitive idea to reduce the maximum processing time is selecting a finer partitioning. Although the average processing time would then be decreased, the maximum processing time would not necessarily be decreased: as the sizes of the jobs are usually not identical (the width of the black stripe in Fig. 2), in rare cases the maximum can even increase. Furthermore, massively increasing the number of jobs may also increase the expected maximum latency because, first, the probability of catching a very long queue is increased and, second, the available grid resources are limited. This is why we plan to design a more sophisticated, iterative partitioning method. It starts from an initial partitioning, and if the processing time of a job exceeds a given threshold (a timeout), it cuts the job into smaller pieces and resubmits these pieces as new jobs. A sketch of both policies is given at the end of this section.
Related Work. A number of efforts have been made to support the deployment of parameter study tasks. The iterative partitioning of the parameter space has also been introduced in [16], but with the interactivity of the user, not for efficiency purposes. The APST framework also has an adaptive scheduling algorithm with task duplication for generic middlewares [5,6]. A possible extension
Fig. 3. Job completion times with a timeout of 3 hours
of our approach could be taking costs into account, for which a cost-time optimisation algorithm is discussed in e.g. [3].
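A compact sketch of the two a posteriori policies discussed in this section (resubmission after a queueing timeout and splitting of long-running pieces) is shown below. The grid interface is a stub standing in for the Saleve gLite plugin, and the thresholds are illustrative values.

```cpp
#include <cstdio>
#include <vector>

struct Job { int id; double paramLo, paramHi; double queued = 0.0, running = 0.0; bool done = false; };

// -- stub grid interface; a real server would call the gLite plugin here --
static int nextId = 0;
Job submit(double lo, double hi) { return Job{nextId++, lo, hi}; }
void cancel(const Job&) {}

int main() {
    const double queueTimeout = 3.0 * 3600.0;      // resubmit if queued longer than 3 hours
    const double cpuThreshold = 6.0 * 3600.0;      // split if a piece runs longer than this
    std::vector<Job> active;
    for (int i = 0; i < 700; ++i)                  // initial partitioning into 700 pieces
        active.push_back(submit(i / 700.0, (i + 1) / 700.0));

    // One monitoring sweep; in the real server this runs periodically.
    std::vector<Job> stillActive;
    for (Job& job : active) {
        if (job.done) continue;
        if (job.running == 0.0 && job.queued > queueTimeout) {
            cancel(job);                           // queued too long: cancel and resubmit
            stillActive.push_back(submit(job.paramLo, job.paramHi));
        } else if (job.running > cpuThreshold) {
            cancel(job);                           // piece too large: split it in two
            double mid = 0.5 * (job.paramLo + job.paramHi);
            stillActive.push_back(submit(job.paramLo, mid));
            stillActive.push_back(submit(mid, job.paramHi));
        } else {
            stillActive.push_back(job);
        }
    }
    std::printf("%zu pieces active after the sweep\n", stillActive.size());
    return 0;
}
```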
5 Concluding Remarks
We presented our authentication improvement for Saleve, which involves the user credentials in authenticating towards the grid. The most important feature of our solution is a practical one: the client can stay lightweight, resulting in better portability of the scientific application due to its minimal requirements. We applied Saleve to the simulation of abrasion of pebbles and presented empirical high-resolution statistics. We also gave some feedback on the performance of the distributed application and outlined possibilities to improve it. Acknowledgements. The authors are grateful to Ákos Nagy for his contribution to the implementation. Part of this work was funded by the Péter Pázmány program (RET-06/2005) of the National Office for Research and Technology, and by the Hungarian Scientific Research Fund (OTKA 72146). The authors would like to thank the Enabling Grids for E-sciencE project (INFSO-RI-222667).
References 1. D1.2-1 Extensive description of functionality and definition of interfaces (WSDLs) for the core components, KnowARC public deliverable edition (August 2007) 2. Aiftimiei, C., et al.: Job submission and management through web services: the experience with the CREAM service. Journal of Physics: Conference Series 119(6), 062004, 10 p. (2008)
3. Buyya, R., Murshed, M., Abramson, D., Venugopal, S.: Scheduling parameter sweep applications on global grids: a deadline and budget constrained cost-time optimization algorithm. Softw. Pract. Exper. 35(5), 491–512 (2005) 4. Cafaro, M., Lezzi, D., Fiore, S., Aloisio, G., van Engelen, R.: The GSI plug-in for gSOAP: Building cross-grid interoperable secure grid services. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Wasniewski, J. (eds.) PPAM 2007. LNCS, vol. 4967, pp. 894–901. Springer, Heidelberg (2008) 5. Casanova, H., Berman, F.: Parameter Sweeps on the Grid with APST. In: Grid Computing: Making the Global Infrastructure a Reality. John Wiley & Sons, Inc., New York (2003) 6. Casanova, H., Zagorodnov, D., Berman, F., Legrand, A.: Heuristics for scheduling parameter sweep applications in grid environments. In: Proceedings of the 9th Heterogeneous Computing Workshop, p. 349. IEEE Computer Society, Washington (2000) 7. D´ ob´e, P., K´ apolnai, R., Szeber´enyi, I.: Saleve: Supporting the deployment of parameter study tasks in the grid. In: Cracow Grid Workshop, Krakow, Poland, pp. 276–282. Academic Computer Centre CYFRONET AGH (2007) 8. Domokos, G., Sipos, A.A., V´ arkonyi, P.L.: Continuous and discrete models for abrasion processes. Periodica Polytechnica 40(1) (2009) (in Press) 9. Domokos, G., V´ arkonyi, P.: Static equilibria of rigid bodies: dice, pebbles and the poincare-hopf theorem. Journal of Nonlinear Science 16, 255–281 (2006) 10. D´ ob´e, P., K´ apolnai, R., Szeber´enyi, I.: Simple grid access for parameter study applications. In: Lirkov, I., Margenov, S., Wa´sniewski, J. (eds.) LSSC 2007. LNCS, vol. 4818, pp. 470–475. Springer, Heidelberg (2008) 11. EGEE Information Sheets, http://www.eu-egee.org/ 12. Foster, I., Kesselman, C., Tsudik, G., Tuecke, S.: A security architecture for computational grids. In: 5th ACM conference on Computer and Communications Security, pp. 83–92. ACM, New York (1998) 13. McNab, A., Kaushal, S.: The GridSite security framework. UK e-Science All Hands Meeting (2005) ISBN 1-904425-53-4 14. Moln´ ar, Z., Szeber´enyi, I.: Saleve: simple web-services based environment for parameter study applications. In: The 6th IEEE/ACM International Workshop on Grid Computing. IEEE Computer Society, Los Alamitos (2005) 15. Van Engelen, R.A., Gallivan, K.A.: The gSOAP toolkit for web services and peerto-peer computing networks. In: 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid, pp. 128–135. IEEE Computer Society, Los Alamitos (2002) 16. Wibisono, A., Zhao, Z., Belloum, A., Bubak, M.: A framework for interactive parameter sweep applications. In: 8th IEEE International Symposium on Cluster Computing and the Grid, p. 703. IEEE Computer Society, Washington (2008)
Information Flow and Mirroring in an Agent-Based Grid Resource Brokering System
Maria Ganzha1, Marcin Paprzycki1, Michal Drozdowicz1, Mehrdad Senobari2, Ivan Lirkov3, Sofiya Ivanovska3, Richard Olejnik4, and Pavel Telegin5
1 Systems Research Institute, Polish Academy of Science, Warsaw, Poland
2 Tarbiat Modares University, Tehran, Iran
3 Institute for Parallel Processing, Bulgarian Academy of Sciences, Sofia, Bulgaria
4 University of Sciences and Technologies of Lille, Lille, France
5 SuperComputing Center, Russian Academy of Sciences, Moscow, Russia
Abstract. We are developing an agent-team-based Grid resource brokering and management system. One of the issues that has to be addressed is team preservation through mirroring of key information. We focus our attention on information generated within the agent team. In this paper we discuss the sources of information generated in the system and consider which information should be mirrored to increase the long-term survival of the team.
1
Introduction
In our work, to develop an agent-team-based high-level intelligent Grid middleware, we have established when access to information, generated and stored in the team, is needed. For instance, when a team leader (the LMaster agent) receives a Call for Proposals (CFP ) message asking about conditions of executing a job, its response (an offer, or a rejection) is to be based on knowledge of the client and of market conditions (see [7, 11]). Such knowledge is to be based on data collected during past interactions with (potential) client(s). Since we assume that a Global Grid is a highly dynamic structure [12], in which nodes can crash, it is important to assure that the team knowledge will not be lost if its leader crashes. Therefore, in [8, 13] we have suggested utilization of an LMirror agent, which should keep copy of information necessary to prevent team disintegration, in the case of the LMaster crash. The aim of this paper is to point to sources of information useful for an agent team, and to discuss which information should be mirrored and when. To this effect, in the next section, we present a brief overview of the proposed system, followed by arguments for the need of information mirroring. We follow with description of sources of information to be mirrored. In each case we discuss when and where this information should be persisted.
2
System Overview
To discuss the bird's-eye view of the system and the two main processes taking place in it (a Worker joining a team and a team accepting a job to be executed) we
Fig. 1. AML social diagram of the proposed system
will utilize an AML Social Diagram [5] in Figure 1. In our approach, agents work in teams. Each team is supervised by the LMaster agent. Agent teams utilize services of the Client Information Center (represented by the CIC agent) to advertise their resources and the Workers they are seeking. Teams consist of the LMaster, Workers, and the LMirror agent (which stores a copy of the information necessary to persist the team when the LMaster crashes). When the User is seeking a team to execute its job, it specifies job constraints to the LAgent. The LAgent contacts the CIC and obtains the list of teams that can execute its job. Next, it utilizes trust information and the FIPA Contract Net protocol [15] to either find a team, or establish that no team fulfills the User-specified conditions. When the User would like its computer(s) to become Worker(s) in a team, it specifies conditions of joining. The LAgent interacts with the CIC to obtain a list of teams that are seeking Workers. Next, it utilizes trust information and the FIPA Contract Net to establish if a team to join can be found. In both cases, trust information is a part of the LAgent knowledge. Similarly, when responding to the CFP from an LAgent, the LMaster utilizes Team knowledge (see [11,13]).
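To make this flow concrete, the following is a minimal, illustrative sketch (in Python, not the actual agent code) of the single-round Contract-Net-style selection performed by the LAgent. The helper callables (query_teams, ask_team, is_trusted) stand in for the CIC query, the CFP exchange and the LAgent's trust knowledge; all of them are assumptions made only for this example.

```python
# Illustrative sketch only: a single round of CFP / propose / accept, as used
# both for finding a team to execute a job and for finding a team to join.
def select_team(job_constraints, query_teams, ask_team, is_trusted):
    """Return (team_id, price) for the chosen team, or None if no team qualifies."""
    offers = []
    for team_id in query_teams(job_constraints):       # candidate teams from the CIC
        proposal = ask_team(team_id, job_constraints)   # CFP -> propose / refuse
        if proposal is not None and is_trusted(team_id):
            offers.append((team_id, proposal["price"]))
    if not offers:
        return None                                     # no team fulfills the conditions
    return min(offers, key=lambda offer: offer[1])      # accept the cheapest trusted offer
```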
3
Need for Information Mirroring
While collecting knowledge by the LAgent, and utilizing it in decision making, are interesting, we focus on processes taking place within the agent-team. Recall our assumption that in the Global Grid any node can disappear at any time and for an unspecified time. Obviously, this applies also to the LMaster agent. Let us now see what happens when the LMaster becomes unavailable (e.g. due to an Internet access failure), and there is no information mirroring. Observe that the LMaster is the only agent that Users (their LAgents) contact. Note that in the case of contract negotiations, there would be no long-term consequences in the case of a short-term disappearance of the LMaster (while the team would lose a potential contract, it should be able to manage). More serious is the fact that the LMaster is the only gateway through which jobs are received and results sent back. The LMaster is also the only agent that Workers know about. In the case of a more permanent disappearance of the LMaster, damage could be irreparable; e.g. Workers would not be able to send results of completed
tasks and/or receive new tasks and would leave the team and not come back (see also [11]). Finally, observe that in this situation all knowledge collected by the LMaster becomes unavailable (and may be potentially lost forever). In response to these challenges we have decided to use data mirroring. In [6, 12, 13] we have proposed that an LMirror agent should store information needed to keep the team alive and competitive. Furthermore, in [13] we have described restoring the LMaster and the LMirror in the case when either one of them crashes. Let us now, consider three issues: (1) what information should be mirrored, (2) when should it be mirrored, and (3) where should it be mirrored.
4
What Should Be Mirrored, When and Where?
4.1
Team Member Data
Let us start with information about members of the team. When a Worker joins the team, the LMaster obtains the following information:
1. ID of the LAgent joining the team. We have assumed that each agent utilizing our system has to be registered within it (a function of the CIC). It is thus possible to check the registration of the potential Worker before accepting it into the team. This can be treated as a mini-security measure that can be extended as the system develops.
2. Available resources. In the CFP the LAgent includes (ontologically demarcated; [10]) information about resources offered for the team. This information about each Worker is to be, among others, used for job scheduling [14].
3. Details of the contract. In [13], we have suggested that the contract proposal should contain: (a) specification of the time of availability, and (b) length of contract. However, results of the MiG project (see papers available at [16]) indicate that more complex data-sets can be involved in contract negotiations and thus in the contract.
Out of these three items, the ID of the Worker has to be mirrored immediately. Without this information it will be impossible to contact this LAgent in the future. The remaining two could be recovered from the Worker. However, since information about item (1) is to be sent to the LMirror regardless, information about items (2) and (3) may be included as well. In this way a crash of the LMaster will not disturb team Workers with unnecessary exchanges of messages with the new LMaster. Note that the total amount of information mirrored here is proportional to the number of Workers in the team and thus should not be very large.
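As an illustration of the amount of state involved, a record of the kind discussed above could look as follows; the class and field names are hypothetical and serve only to show that one small message per joining Worker is enough to mirror items (1)-(3).

```python
# Hypothetical sketch of the per-Worker record forwarded to the LMirror.
from dataclasses import dataclass, field
from typing import Dict

@dataclass
class WorkerRecord:
    lagent_id: str                                            # (1) must be mirrored immediately
    resources: Dict[str, str] = field(default_factory=dict)   # (2) offered resources
    contract: Dict[str, str] = field(default_factory=dict)    # (3) availability, contract length

def mirror_worker_join(mirror_store: Dict[str, WorkerRecord], record: WorkerRecord) -> None:
    # One entry per Worker keeps the mirrored volume proportional to the team size.
    mirror_store[record.lagent_id] = record
```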
4.2
Job Contracts and Their Execution
In [8, 7, 6] we have specified a limited set of parameters describing the job contract. In the CFP User could specify: (a) job start time, (b) job end time, and (c) resource requirements, while the response contained the price proposed by
the LMaster. Obviously, additional parameters may be added, but this does not affect the material presented here. As soon as the contract is signed, all data concerning it has to be mirrored. Furthermore, losing information about a contract would mean that it would either not be completed, or results would have no User to be sent to (the latter would happen in the case when information was lost while the task was already being executed by one of the Workers). In both cases, the team would rapidly lose its reputation and potential future contracts [11]. Signing a contract means that files pertinent to the job will be sent to the LMaster and have to be kept and mirrored until the job is completed. As soon as the job is completed, its results have to be sent back to the User. However, they should also be kept by the Worker until it receives a confirmation from the LMaster that they were successfully delivered to the customer. At the same time the LMaster keeps a second copy of the results. In this way we, again, have two copies of sensitive material available in the system. Information about job completion and successful delivery of results has to be stored. It is to be useful, first, for job scheduling (see, for instance, [14] and references collected there). Second, for trust management (see [11] and section 4.3), and third, for contract disputes. Interestingly, only the fact that a job has been completed and its results successfully delivered to the customer has to be mirrored immediately (it describes the current status of the team). Information needed for future job scheduling may be mirrored "later". Losing data about execution times of some jobs (due to a crash of the LMaster) does not pose a threat to the existence of the team. Similarly, some information needed for trust management may be lost without damaging the team. Finally, note that scheduling and trust related information may be sent to the team data warehouse instead of the LMirror (see section 4.4).
4.3
Trust Information
Thus far we have dealt mostly with information that has to be mirrored immediately. Now we devote our attention to the remaining data generated in the system. In [11] we have considered trust management in the system. As noted above, here we are interested only in relationships between the LMaster and its Workers. Following the discussion presented in [11] we assume that the contract between a Worker and the team is based on paying the Worker for availability (not for actual work done). It was also stipulated that a Worker will have the right not to be available for a limited number of times during its contract. Here, we can use the "pinging-mechanism" described in [6] to monitor Worker availability. Under these assumptions, for each Worker, we will collect information on how many times (a) it fulfilled the contract (#fc) — was available when it was expected to be available; (b) violated the contract (#vc) — was not available when it was supposed to be; and (c) did more than the contract required (#ac) — was available even though it could have been gone. This combined data will be stored, for each Worker, as a triple (#fc, #vc, #ac). While this information is in a cumulative
form, we also have access to detailed information about the results of each "pinging-procedure," and each assigned and completed job. Since the latter data is not proportional to the number of Workers, but to the number of processed jobs, we consider it separately in section 4.4. Obviously, while the trust-related information has to be mirrored, some losses (e.g. missing the fact that Worker 31B successfully fulfilled contract 324 55B 3) are not a big problem. Therefore, it is enough to update information about trust-related issues in a digested format at predefined time-intervals, e.g. once a day. The frequency of updates depends on the nature of the jobs that a given team is executing and on the type of Worker contracts. If a team is executing time-consuming jobs and its members have long-term contracts, then updates of trust information can occur much less frequently than in the case when a large number of short jobs are completed by a team consisting of Workers with short-term contracts. We believe that a good approximation of the right time to send trust information to the LMirror is when, on average, each Worker has completed one contract. Note that the LMaster can adaptively adjust the time between updates (responding to changing jobs and contracts of its team).
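A small sketch of this bookkeeping is given below; the exact digest trigger is only an approximation of the heuristic just described, and all names are illustrative rather than part of the actual system.

```python
# Illustrative sketch of the cumulative trust triple kept per Worker and of the
# "digest roughly once every Worker completed one contract" update rule.
from collections import defaultdict

trust = defaultdict(lambda: {"fc": 0, "vc": 0, "ac": 0})   # worker_id -> (#fc, #vc, #ac)

def record_ping(worker_id: str, expected: bool, available: bool) -> None:
    if expected and available:
        trust[worker_id]["fc"] += 1        # fulfilled the contract
    elif expected and not available:
        trust[worker_id]["vc"] += 1        # violated the contract
    elif available:
        trust[worker_id]["ac"] += 1        # available beyond what the contract required

def digest_due(contracts_since_last_digest: int, team_size: int) -> bool:
    # Send the digested triples to the LMirror once, on average,
    # every Worker has completed one contract since the last digest.
    return team_size > 0 and contracts_since_last_digest >= team_size
```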
4.4
Volume Data Collection and Storage
Most information considered thus far was relatively small in volume — of order of the size of the team or of the number of currently contracted jobs. There are however important data sets that grow over time and thus can become very large. First, data generated during contract negotiations (either job execution, or team joining), needed to establish market conditions and results in team behavior adaptivity (see also [6,11]). Second, information about execution time of completed jobs, used for advanced scheduling techniques (see [14] and references collected there for more details). Third, detailed information about behavior of each worker (e.g. history of execution of each job, results of pinging sessions, etc.). This information is crucial not only for trust management, but also to build a realistic and comprehensive economy-based jobs scheduling model. Let us consider LAgent –LMaster negotiations (the remaining cases follow the same general pattern and are omitted for lack of space). Even though in [1,2,3,4] a large variety of contract negotiation mechanisms have been listed, in our work, for both job execution, and joining a team negotiations, we utilize the FIPA Contract Net Protocol [15]. The primary reason is that while allowing for calls for proposals with practically unlimited variety of constraints, it generates the contract (or establishes impossibility of an agreement) within a single round of negotiations. This also reduces the total amount of information that needs to be stored for further use (compare, for instance, with the amount of data generated during any of the multi-round auction mechanisms; [9]). Now, information about a single job contract negotiation consists (in the minimal case) of: – content of the CFP — LAgent ID, hardware and/or software sought; “start time”; time to be contracted / or was it an open-ended contract? – content of the response — the proposed price; etc.; – result of the negotiation — success/failure.
Similarly, in the case of negotiation concerning a Worker joining the team, at least the following information can be stored:
– content of the CFP — including information about available hardware and software; length of contract and availability,
– content of the response — the proposed price,
– result of the negotiation — did the Worker join the team, or not.
Interestingly, in all cases losing some of the data may decrease the competitiveness of the team, but the process is characterized by graceful degradation. Therefore it is possible to perform updates in a digested fashion at times of reduced utilization of the system. However, observe that, as time passes, the total volume of collected information increases rapidly. While it would be possible to store only a specific amount of the most current data, this would reduce the utility of data mining techniques (long-term trends may be lost). Therefore, it is important to preserve all historical data, and a data warehouse may be a solution. It should be obvious that in these cases mirroring utilizing the LMaster and the LMirror pair can be very costly. First, an increasing volume of data would have to be stored by each one of them. Second, large amounts of data would have to be passed from the LMaster to the LMirror. Naturally, since we utilize single-round negotiations, passing updates in a compressed and digested form would reduce the burden somewhat, but this would help only in the short term. Finally, recovery of the crashed LMaster (or LMirror) would be extremely resource consuming. In this case, as a part of the recovery, an entire data warehouse would have to be transferred. For a large team that exists for a long time period (and our assumption is that good teams will be large and will exist for a long time), this would mean the necessity to transfer hundreds of gigabytes of data. Such a transfer would have to happen each time during recovery of the crashed managerial agent. Since, as suggested in [13], recovery of one crashed managerial agent stops the other one from doing anything else, it also stops all managerial activities in the team. Therefore, such an approach seems infeasible in the long run. Considering this, based on the current trend of IT infrastructure outsourcing and specialization, we propose the following solution. The team data warehouse will be outsourced for storage to a team within the system that specializes in data warehousing. This is similar to a popular e-commerce scenario, where merchants contract software companies to design, implement, and run the infrastructure of their e-stores, while software companies contract storage companies to actually store the data (this solution facilitates quality assurance for data preservation). Note that utilization of a data warehouse allows us to completely redefine the roles of the LMaster and the LMirror. Here, the LMaster stores information in the contracted "storage facility". Only data needed for team operation, e.g. the list of team members, is kept locally (and "mirrored" in the data warehouse, not with the LMirror). The role of the LMirror is not to store team data, but to store procedures for: (a) checking the existence of the LMaster, and (b) replacing an LMaster in the case when it crashes. In this way the LMirror can work like all other Workers (it will not be burdened by the mirroring procedures), which may also simplify the overall management and economic structure of the team.
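The following sketch illustrates the kind of digested, low-priority batching this implies; the record fields follow the minimal lists above, while the buffer and the warehouse client are assumptions introduced only for the example.

```python
# Illustrative sketch: negotiation records are buffered locally and flushed to the
# contracted data-warehouse team in digested batches at times of reduced utilization.
from dataclasses import dataclass, asdict
from typing import Callable, List, Optional

@dataclass
class NegotiationRecord:
    cfp: dict                   # LAgent ID, resources sought, start time, contract length
    response: Optional[dict]    # proposed price, etc. (None if the CFP was refused)
    success: bool               # outcome of the single negotiation round

class WarehouseBuffer:
    def __init__(self, flush_threshold: int = 100):
        self.pending: List[NegotiationRecord] = []
        self.flush_threshold = flush_threshold

    def add(self, record: NegotiationRecord) -> None:
        self.pending.append(record)

    def maybe_flush(self, warehouse_append: Callable[[list], None], system_idle: bool) -> None:
        # Flush only when the system is idle or the buffer has grown large enough.
        if self.pending and (system_idle or len(self.pending) >= self.flush_threshold):
            warehouse_append([asdict(r) for r in self.pending])
            self.pending.clear()
```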
5
Concluding Remarks
The aim of this paper was to discuss issues involved in information mirroring in an agent-based Grid resource management system. We have focused our attention on information generated within the agent team and considered four important cases: (1) team data, (2) job contracts and their execution, (3) trust-related information, and (4) other sources of large-volume information. We have established that we deal with two main situations: (a) small-volume data that has to be mirrored immediately, and (b) large-volume data that may be mirrored infrequently. Further analysis indicated that collection of large-volume data may be best achieved through utilization of a contracted data storage facility. This latter approach is our solution of choice and we plan to utilize it in our system.
Acknowledgments Work of Maria Ganzha and Michal Drozdowicz was supported from the “Funds for Science” of the Polish Ministry for Science and Higher Education for years 2008-2011, as a research project (contract number N516 382434). Collaboration of the Polish and Bulgarian teams is partially supported by the Parallel and Distributed Computing Practices grant. Collaboration of Polish and French teams is partially supported by the PICS grant New Methods for Balancing Loads and Scheduling Jobs in the Grid and Dedicated Systems. Collaboration of the Polish and Russian teams is partially supported by the Efficient use of Computational Grids grant. Work of Marcin Paprzycki and Sofiya Ivanovska was supported in part by the National Science Fund of Bulgaria under Grants No. D002-146 and D002-115.
References 1. Abraham, A., Buyya, R., Nath, B.: Nature’s heuristics for scheduling jobs on computational grids. In: 8th IEEE International Conference on Advanced Computing and Communications (ADCOM 2000), pp. 45–52 (2000) 2. Buyya, R., Abramson, D., Giddy, J., Stockinger, H.: Economic models for resource management and scheduling in grid computing. Concurrency and Computation: Practice and Experience 14(13-15), 1507–1542 (2002) 3. Buyya, R., Abramson, D., Venugopal, S.: The grid economy. Proceedings of the IEEE 93(3), 698–714 (2005) 4. Buyya, R., Giddy, J., Abramson, D.: An evaluation of economy-based resource trading and scheduling on computational power grids for parameter sweep applications. In: Second Workshop on Active Middleware Services (AMS 2000), p. 221. Kluwer Academic Press, Pittsburgh (2000) 5. Cervenka, R., Trencansky, I.: Agent Modeling Language (AML): A Comprehensive Approach to Modeling MAS. Whitestein Series in Software Agent Technologies and Autonomic Computing. Birkhauser, Basel (2007) 6. Dominiak, M., Ganzha, M., Gawinecki, M., Kuranowski, W., Paprzycki, M., Margenov, S., Lirkov, I.: Utilizing agent teams in grid resource brokering. International Transactions on Systems Science and Applications 3(4), 296–306 (2008)
7. Dominiak, M., Ganzha, M., Paprzycki, M.: Selecting grid-agent-team to execute user-job — initial solution. In: Proc. of the Conference on Complex, Intelligent and Software Intensive Systems, pp. 249–256. IEEE CS Press, Los Alamitos (2007) 8. Dominiak, M., Kuranowski, W., Gawinecki, M., Ganzha, M., Paprzycki, M.: Utilizing agent teams in grid resource management — preliminary considerations. In: IEEE J. V. Atanasoff Conference, pp. 46–51. IEEE CS Press, Los Alamitos (2006) 9. Drozdowicz, M., Ganzha, M., Paprzycki, M., Gawinecki, M., Legalov, A.: Information Flow and Usage in an E-Shop operating within an Agent-Based E-commerce system. Journal of Sibirian Federal University, SFU, Krasnoyarsk (in press) 10. Drozdowicz, M., Ganzha, M., Paprzycki, M., Olejnik, R., Lirkov, I., Telegin, P., Senobari, M.: Ontologies, agents and the grid: An overview. In: Topping, B., Ivanyi, P. (eds.) Parallel, Distributed, and Grid Computing for Engineering. Computational Scientce, Engineering and Technology Series, vol. 21, pp. 117–140. SaxeCoburg Publications, Stirligshire (2009) 11. Ganzha, M., Paprzycki, M., Lirkov, I.: Trust management in an agent-based grid resource brokering system — preliminary considerations. In: Todorov, M. (ed.) Applications of Mathematics in Engineering and Economics’33. AIP Conf. Proc., vol. 946, pp. 35–46. American Institute of Physics, College Park (2007) 12. Kuranowski, W., Ganzha, M., Gawinecki, M., Paprzycki, M., Lirkov, I., Margenov, S.: Forming and managing agent teams acting as resource brokers in the grid — preliminary considerations. International Journal of Computational Intelligence Research 4(1), 9–16 (2008) 13. Kuranowski, W., Ganzha, M., Paprzycki, M., Lirkov, I.: Supervising agent team an agent-based grid resource brokering system — initial solution. In: Xhafa, F., Barolli, L. (eds.) Proceedings of the Conference on Complex, Intelligent and Software Intensive Systems, pp. 321–326. IEEE CS Press, Los Alamitos (2008) 14. Senobari, M., Drozdowicz, M., Ganzha, M., Paprzycki, M., Olejnik, R., Lirkov, I., Telegin, P., Charkari, N.: Resource management in grids: Overview and a discussion of a possible approach for a agent-based middleware. In: Topping, B., Ivanyi, P. (eds.) Parallel, Distributed, and Grid Computing for Engineering. Computational Scientce, Engineering and Technology Series, vol. 21, pp. 141–164. Saxe-Coburg Publications, Stirligshire (2009) 15. Welcome to the FIPA, http://www.fipa.org/ 16. Projekt Minimum intrusion Grid, http://www.migrid.org/MiG/Mig/published_papers.html
Scatter Search and Grid Computing to Improve Nuclear Fusion Devices
Antonio Gómez-Iglesias1, Miguel A. Vega-Rodríguez2, Francisco Castejón-Magaña1, Miguel Cárdenas-Montes3, and Enrique Morales-Ramos4
1 National Fusion Laboratory, CIEMAT, Madrid, Spain
{antonio.gomez,francisco.castejon}@ciemat.es
http://www.ciemat.es
2 Dep. of Technologies of Computers and Communications, University of Extremadura, Cáceres, Spain
[email protected]
3 Dep. of Basic Research, CIEMAT, Madrid, Spain
[email protected]
4 ARCO Research Group, Cáceres, Spain
[email protected]
Abstract. Even though nuclear fusion is the next generation of energy, there are still many problems that are present in current nuclear fusion devices. Several of these problems can be solved by means of modelling tools. These tools are extremely demanding in terms of computational resources and they also use a large number of parameters to represent the behaviour of nuclear fusion devices. The possibility to introduce Evolutionary Algorithms (EAs) like Scatter Search (SS) to look for approximate configurations offers a great solution for some of these problems. However, as long as these applications require high computational costs to perform their operations, the use of the grid is an ideal environment to carry out these tests. The distributed paradigm of the grid, as well as the high number of computational resources, represents an excellent alternative to execute these tools. Within this paper, we explain our work in these three fields, looking for optimized configurations of a nuclear fusion device.
1
Introduction
Scatter Search is a metaheuristic process that has been applied to many different optimisation areas with optimal results. SS works over a set of solutions, combining them to obtain new ones that improve the original set. Unlike other evolutionary methods, such as genetic algorithms (GA), SS is not based on large random populations. Rather, it is based on strategic selections among small populations: while GAs usually work with populations of hundreds or thousands of individuals, SS uses a small set of different solutions. In the case of optimisation problems related to many scientific areas, the required time to get a fitness value is so high that traditional computation cannot be used to perform an exploration of the entire solution space. Modern
paradigms, like grid computing, offer the computational resources and capabilities to carry out these optimisation problems, but they are not easy to use [3,5,9,10]. Until now, many investigations have been carried out in order to use parallel architectures and GAs [2], but the coupling of grid capabilities and complex EAs is still in an immature state. The development of the grid has created a way that could potentially lead to an increase in the performance of these kinds of algorithms in terms of execution time and problem size. Nevertheless, a high level of expertise is required to develop and execute grid applications because many problems can arise due to the specifics of the grid infrastructure. Our goal consists of improving the equilibrium of plasma in a nuclear fusion device (TJ-II, a stellarator [4] located in Madrid) by means of the SS algorithm and grid computing. The rest of this paper is organised as follows: section 2 introduces some basic concepts about nuclear fusion while section 3 gives a description of the elements and methods of SS algorithm based on the implementation proposed in [11]. Section 4 shows the required workflow to optimise nuclear fusion devices, whereas section 5 is devoted to the distributed fitness calculation using the grid. In section 6, we collect the results obtained in our experiments. Finally, in section 7, we conclude the paper and summarise a variety of perspectives of the explained work.
2
Nuclear Fusion
Nuclear fusion is the process by which multiple atomic particles join together to form a heavier nucleus. It is accompanied by the release or absorption of energy. The fusion of two light nuclei generally releases energy while the fusion of heavy nuclei absorbs energy [6]. Among the different fusion reactions, the most promising one is shown in (1). Most current fusion devices work with these elements in order to obtain the reaction. In the equation, D denotes a deuteron, T a triton, ⁴He an isotope of helium (an α-particle), n a neutron and MeV megaelectron-volt.

D + T = {}^{4}\mathrm{He} + n + 17.6\,\mathrm{MeV}    (1)

The quality of magnetic confinement is characterized by different criteria [12]. Equilibrium means the characteristics of the plasma are varying slowly, so we have sustainable characteristics of the particles inside the device. By improving the equilibrium of the plasma, we improve the efficiency of the device, thereby increasing the probability of achieving fusion reactions.
3
Scatter Search Algorithm
This algorithm, as well as all the stages involved, is explained in great detail in a variety of related works [7,11]. The design of these stages is deliberately broad and general, so that other techniques can be implemented for any of them.
3.1
Concurrent Implementation
For many problems we can get a new generation after a few seconds, but when each fitness value requires several minutes or even hours to get its evaluation, this template does not offer an optimal result. Furthermore, we could have a scenario where the researchers involved in a specific problem do not have access to all the computational power required, but do have access to the grid, something that many people can easily do. For these reasons, we have modified this template by adding functionalities that allow us to handle grid jobs and proxies without human supervision.
4
Workflow to Measure the Equilibrium
4.1
Measuring the Equilibrium
What we have to measure in order to get a realistic idea of the equilibrium of our device is the value of \vec{v}_d, or drift velocity of particles within confined plasma. This means to calculate the value given by (2), that is our target function (a deeper explanation of this expression can be found in the related work [8]).

F_{\mathrm{target\,function}} = \sum_{i=1}^{N} \left( \frac{\vec{B} \times \nabla |B|}{B^{3}} \right)_{i}    (2)
In this equation, i represents the different magnetic surfaces existing inside TJ-II and B the intensity of the magnetic field. The target is to minimise the value given by this function. This is the fitness function implemented in our SS algorithm. Once we have the data to solve this function, it takes nearly ten minutes, on average, to obtain the final value. For current configurations of TJ-II we get values greater than 1.0E+16 for this expression. Our goal will be to reduce this value which, in fact, means we are improving the confinement of particles inside the device. To get all of the values involved in this function, we need to execute a workflow application to calculate the magnetic surfaces existing in a magnetic confinement device. These applications are Fortran codes running on specific computers, often with deprecated compilers which are no longer available. The effort needed in order to run these applications in heterogeneous environments, such as grid computing, is also high.
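A minimal numerical sketch of this fitness evaluation is given below, assuming the magnetic field vector and the gradient of |B| have already been extracted for each of the N magnetic surfaces (in the actual system these values come from the Fortran workflow described next). The reduction to a single scalar by summing magnitudes is our reading of (2), not a statement of the original code.

```python
# Sketch of the target function (2); B_vec and grad_modB hold one 3-component
# row per magnetic surface, as produced by the equilibrium workflow.
import numpy as np

def target_function(B_vec: np.ndarray, grad_modB: np.ndarray) -> float:
    modB = np.linalg.norm(B_vec, axis=1)                       # |B| on each surface
    drift = np.cross(B_vec, grad_modB) / modB[:, None] ** 3    # (B x grad|B|) / B^3
    return float(np.sum(np.linalg.norm(drift, axis=1)))        # smaller value = better equilibrium
```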
4.2
Workflow Application
The workflow we need to execute to measure the equilibrium of the TJ-II, by solving (2), is widely explained in the related work [8]. The entire workflow requires one hour and a half on average to finish its computation and less than one hour for optimal configurations.
5
Grid-Oriented SS Implementation
The grid cannot be compared with supercomputers or special parallel machines, which can offer excellent results in terms of efficiency. Nevertheless, the distributed paradigm of the grid, along with the number of computational resources available, makes using SS with complex optimisation problems a very promising alternative. In these problems, the different fitness functions can be calculated on different Worker Nodes (WNs) of the grid, reducing the overall execution time. We have developed a set of python scripts to interact with the command line in the User Interface (UI) in order to perform complex operations, such as proxy management, allowing the user to cancel any job or resubmit failed jobs. New tests will only require the modification of the selection and combination methods to get a new implementation of the algorithm, but the entire source code (SS algorithm and python scripts) will remain the same.
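A simplified sketch of how one generation of fitness evaluations can be farmed out as independent grid jobs is shown below; submit_job and collect_result stand in for the metascheduler calls (GridWay or WMS) and are assumptions, not the project's actual interface.

```python
# Illustrative sketch: each individual of the current population becomes one grid
# job that runs the equilibrium workflow; results are gathered when the jobs finish.
def evaluate_generation(population, submit_job, collect_result):
    job_ids = {index: submit_job(individual)              # one job per fitness evaluation
               for index, individual in enumerate(population)}
    return {index: collect_result(job_id)                 # blocks until the job is done
            for index, job_id in job_ids.items()}
```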
5.1
Python Scripts
To get a non-supervised system, we have developed a set of python scripts that interact with the metascheduler and the proxy to properly manage all of the required processes. We have developed the interaction with two metaschedulers: GridWay and WMS (Workload Management System). With GridWay, we can access the grid environment by using both gLite and Globus, while WMS is the default metascheduler in the gLite middleware. As a result, we get a highly customizable system that can interact with different kinds of grid middleware. Not only do these scripts provide functionalities related to job management, they also interact with the whole grid environment: loading the metascheduler, detecting any failures in the grid infrastructure, collecting usage statistics, and performing proxy management.
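As an example of the kind of unattended proxy handling these scripts perform, the sketch below checks the remaining proxy lifetime and renews it before further submissions. The commands shown (voms-proxy-info, voms-proxy-init) are common gLite client tools given only for illustration; the exact flags and the renewal mechanism (e.g. a MyProxy server for truly unattended renewal) depend on the deployment.

```python
# Hedged sketch of proxy supervision: renew the VOMS proxy when it is about to expire.
import subprocess

def proxy_seconds_left() -> int:
    result = subprocess.run(["voms-proxy-info", "--timeleft"],
                            capture_output=True, text=True, check=False)
    try:
        return int(result.stdout.strip())
    except ValueError:
        return 0                       # no valid proxy found

def ensure_proxy(vo: str, min_seconds: int = 3600) -> None:
    if proxy_seconds_left() < min_seconds:
        # In a fully unattended setup the credential would come from a MyProxy
        # server or a long-lived proxy rather than an interactive passphrase.
        subprocess.run(["voms-proxy-init", "--voms", vo], check=True)
```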
5.2
SS Implementation
Diversification Method. We are using a diversification method based on Greedy Randomized Adaptive Search Procedure (GRASP) constructions [13]. This method uses a local search procedure [1] to improve the generated solutions. The evaluation function given for this GRASP method is the fitness value obtained for the solution when a value is added to one of the elements within the solution. The resulting solution is the best overall solution found with this local search.
Combination Method. We have chosen a mutation-based procedure to mix individuals and obtain a new one. This mutation-based procedure uses the sample standard deviation of each chromosome in the whole population to perform the mutation. This function assures convergence within the values of the chromosomes, even though this convergence is only noticed after a large number of generations.
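A short sketch of this combination step follows. How the two parents are merged before the mutation is applied is an assumption (here, their mean), since the text only fixes the scale of the mutation to the per-chromosome sample standard deviation.

```python
# Sketch of the mutation-based combination: the spread of each chromosome across
# the whole population sets the standard deviation of the Gaussian perturbation.
import random
import statistics

def combine(parent_a, parent_b, population):
    child = []
    for i, (a, b) in enumerate(zip(parent_a, parent_b)):
        column = [individual[i] for individual in population]
        sigma = statistics.stdev(column) if len(column) > 1 else 0.0
        child.append((a + b) / 2.0 + random.gauss(0.0, sigma))
    return child
```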
Improvement Method. The improvement method we have to use cannot perform many evaluations because of the high execution time required for each of these evaluations. For this reason, we have developed a method that introduces small variations in the values of the solution and checks if the solution is better than the original one.
Diversity Measure. In our case, to measure the diversity among the elements in the population, we use the normalised value of each chromosome of any individual. This normalisation is calculated using (3). In other cases, this normalisation is not necessary; however, for this problem the difference between the possible values of the chromosomes makes it impossible to use the current value of each chromosome.

\mathrm{Norm\_value}_{i} = \frac{\mathrm{value}_{i} - \mathrm{Min}_{i}}{\mathrm{Max}_{i} - \mathrm{Min}_{i}}    (3)

Finally, with the expression (4) we can obtain the distance between two individuals, p and q by:

d(p, q) = \sum_{i=1}^{n} |p_{i} - q_{i}| ,    (4)
where n is the number of chromosomes in the individuals p and q. Using this expression, we can get different levels of diversity within the elements in the population, which is also one of the objectives of the SS algorithm.
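For completeness, a direct transcription of (3) and (4) into code (with (4) applied to the normalised chromosome values, as the text describes) could look like this; the per-chromosome bounds Min_i and Max_i are passed in explicitly.

```python
# Normalisation (3) and Manhattan-style distance (4) between two individuals.
def normalise(value, lo, hi):
    return (value - lo) / (hi - lo)

def distance(p, q, mins, maxs):
    return sum(abs(normalise(pi, lo, hi) - normalise(qi, lo, hi))
               for pi, qi, lo, hi in zip(p, q, mins, maxs))
```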
6
Results
Here we present several results obtained by running our system on the grid. First, we show results related to the time required to evaluate 50 iterations in the SS. Next, we present results focused on the evolution of the fitness function within the population over the generations. As mentioned before, the computational time required for the local search process is very high, so different tests with this process activated and deactivated have been carried out and are also described in this section. The number of chromosomes of each solution is set to 100 for all the tests to get comparable values for the fitness function. We have run our tests using two different VOs (Virtual Organizations) and accessing two different grid infrastructures. In the first infrastructure we can access 5,088 computational resources in a gLite infrastructure using Scientific Linux 4.7. The second infrastructure consists of 11,321 computational resources, also running the gLite middleware and Scientific Linux 4.7.
6.1
Computational Results
First, we tried two tests with the same configuration and without local search, as shown in Table 1. The size of the subsets in the Reference Set is indicated in the table with RefSet Size. The results are also shown in this table.
Table 1. Execution times for two different configurations without local search and with local search. All the times are in the hh:mm:ss format.

Test                        First Test   Second Test   Third Test    Fourth Test
Number of Iterations        2            50            2             50
Number of Chromosomes       100          100           100           100
Population Size             100          100           100           50
RefSet Size                 10           10            10            5
Number of Jobs              574          12,361        11,427        6,781
Execution Time              36:10:19     535:36:25     495:12:51     310:12:53
Aggregated Execution Time   783:07:21    9,095:41:07   8,727:29:07   5,321:03:57
In this table, Execution Time represents the time from the SS start until it finishes, using the grid. Aggregated Execution Time is the sum of the execution times of all the jobs sent to the grid which, in fact, is the time that the algorithm would take to perform all the calculations on a single machine. If we focus on the results obtained for 50 iterations, we can see that the required time to obtain these results was 535:36:25 (more than 22 days). Considering that the aggregated execution time is more than 9,095 hours, which is almost 379 days, the advantage of using grid computing becomes clear. The required time to get these results depends on the number of computational resources available, as well as the number of jobs submitted by other users. In the ideal case of simultaneously having the same number of free WNs as the maximum number of jobs, with one single user, this time would be lower, but this is just an ideal case which is difficult to reach in real grid environments. For the next tests, we activated the local search and used the configurations in the last columns of Table 1. With these configurations the required execution times are as shown in the same table, which also shows the aggregated execution time required by all the processes. The main problem with the configuration that has 2 iterations is the long execution time required. The local search process takes a long time as a result of the high number of configurations to be processed. For this reason, when we tried the fourth test, we reduced the population, as well as the reference set size, which increased the number of iterations.

Table 2. Best values for the different tests performed

Test                                                          Best Value
Initial Random Individuals                                    1.2E+22
Current Configurations                                        1.0E+10
2 Iterations - Local Search Deactivated                       1.76781E+05
40 Iterations - Local Search Deactivated                      7.056280E+03
2 Iterations - Local Search Activated                         1.47256E+04
50 Iterations - Local Search Activated - Smaller Population   1.02314E+04
6.2
Evaluation Results
In Table 2 we show the best results obtained for the previous tests, as well as the fitness value (on average) for the random individuals used at the beginning of the process and the value for existing configurations of the device. There are big differences depending on the different configurations used. The resulting design allows for better confinement of particles within the device, which increases the probability of obtaining fusion reactions and improves the efficiency of the device.
7
Conclusions and Future Work
In this paper we have shown an implementation of the SS algorithm using the capabilities offered by grid computing to improve the equilibrium of confined plasma in a nuclear fusion device. The computational cost of the existing modelling tools and the time required to run different tests make it almost impossible to use traditional environments. In this work, grid computing has been used and the results are very promising. Even though execution times remain long, thanks to the large number of computational resources that the grid offers, this time has been reduced from several months to just a few weeks. In the future, we plan on performing more tests with different configurations for the algorithm, as well as changing the combination method. One of the characteristics that can be measured with the designed workflow is the equilibrium, but many other characteristics can be measured with VMEC. This will lead us to a multi-objective implementation of the SS using grid computing. Acknowledgments. This work has been funded in part through the Research Infrastructures initiatives of the 7th Framework Programme of the European Commission, grant agreement Number 211804 (EUFORIA project) and grant agreement Number 222667 (EGEE III). The authors would like to thank Ashley Bucholz for her interest and comments about the work.
References 1. Aarts, E., Lenstra, J.K.: Local Search in Combinatorial Optimization. Princeton University Press, Princeton (2003) 2. Alba, E., Tomassini, M.: Parallelism and Evolutionary Algorithms. IEEE Transactions on Evolutionary Computation 6(5), 443–462 (2002) 3. Berman, F., Hey, A., Fox, G.C.: Grid Computing. Making the Global Infrastructure a Reality. John Wiley & Sons, Chichester (2003) 4. Castej´ on, F., et al.: Ion Orbits and Ion Confinement Studies on ECRH Plasmas in TJ-II Stellarator. Fusion Science and Technology 50(3), 412–418 (2006) 5. Foster, I., Kesselman, C.: The Grid: Blueprint for a New Computing Infrastructure. Morgan Kaufmann, San Francisco (1999)
6. Freidberg, J.: Plasma Physics and Fusion Energy. Cambridge University Press, Cambridge (2007) 7. Glover, F.: A Template for Scatter Search and Path Relinking. In: Hao, J.-K., Lutton, E., Ronald, E., Schoenauer, M., Snyers, D. (eds.) AE 1997. LNCS, vol. 1363, pp. 13–54. Springer, Heidelberg (1998) 8. G´ omez-Iglesias, A., et al.: Grid Computing in Order to Implement a ThreeDimensional Magnetohydrodynamic Equilibrium Solver for Plasma Confinement. In: 16th Euromicro International Conference on Parallel, Distributed and networkbased Processing, pp. 435–439. IEEE Computer Society, Los Alamitos (2008) 9. Joseph, J., Fellenstein, C.: Grid Computing. Prentice Hall, Englewood Cliffs (2003) 10. Juh´ asz, Z., Kacsuk, P., Kranzlm¨ uller, D.: Distributed and Parallel Systems: Cluster and Grid Computing. Springer Science, Heidelberg (2005) 11. Laguna, M., Mart´ı, R.: Scatter Search — Methodology and Implementations. Kluwer Academic Publishers, Boston (2003) 12. Miyamoto, K.: Plasma Physics and Controlled Nuclear Fusion. Springer, Heidelberg (2005) 13. Pitsolulis, L.S., Resende, M.G.C.: Greedy Randomized Adaptive Search Procedures. In: Handbook of Applied Optimization. Oxford University Press, Oxford (2002)
Service-Oriented Integration of Grid Applications in Heterogeneous Grids Radoslava D. Goranova University of Sofia “St. Kl. Ohridski”, FMI, 5 James Bourchier Blvd., 1164 Sofia, Bulgaria
Abstract. Open Grid Service Architecture (OGSA) is an architectural model for service-oriented Grids. Over the years, OGSA has dominated as the Grid architecture for building Grid systems, and it tightly holds to SOA principles. However, we cannot claim that OGSA is supported by every Grid environment. On the contrary, the development of OGSA as a Grid standard is not finished yet, and many Grid environments do not implement it. The heterogeneous Grids use different technologies and protocols to provide services for access to Grid resources. Some of them are not service-oriented, others are fully or partially service-oriented. This causes problems for developers of Grid applications, who have to use different approaches to access the different Grids. Moreover, from the users' point of view, it is very difficult to use resources from these different types of Grids simultaneously. Even if we do not take into account the different security policies that every Grid implements, there is no unified way to access Grid services or to integrate them. A solution to this problem is SOA. In this article we present our service-oriented approach for integration. For testing purposes we use the Grid middleware Globus Toolkit 4 and g-Lite.
1
Introduction
The Grid [1] is a hardware and software infrastructure that provides coordinated resource sharing and high-end computational capabilities, based on standard open protocols, for delivering qualities of service. The idea of the grid is to provide inexpensive and consistent access to unlimited computational and storage resources. From an architectural point of view the Grid is five-layered (Figure 1). The fabric layer provides resources: computational and storage systems, network resources, specific devices, sensors, even clusters or pools. The connectivity layer defines communication and authentication protocols for exchanging data with the fabric layer and a secure mechanism for resource access. The collective layer contains protocols and services for resource interaction: monitoring, diagnostics, scheduling, brokering, service discovery, etc. These services expose specific grid functionalities and are not bound to any particular technology. The application layer provides software tools: application toolkits, portals and libraries for access to the services defined and implemented in the collective layer. The user application layer covers user-defined applications and user-defined grid services. Independently of the fact that every grid project designs and builds its own architecture to realize its requirements, we can say that every one of these projects' architectures in its base
Fig. 1. Grid architecture
follows the grid architectural model described above. Based on project requirements, the different Grid projects realize this model using different approaches. Some of them, for example, follow the principles of service-oriented architecture (SOA), others a client-server architecture. Based on architectural design, Grid environments can be classified as fully service-oriented, partially service-oriented and not service-oriented. Fully service-oriented are grid environments that provide well-defined and self-described services for all grid functionalities and capabilities. An example architecture for that is OGSA [2]. The Open Grid Service Architecture (OGSA) is a Grid architecture which tries to impose itself as a standard for building service-oriented Grid systems. The OGSA specification is being developed by the Grid community with leadership from the Globus Project. OGSA is an architectural model for service-oriented Grids that elaborates the traditional Grid architecture. It tightly holds to SOA principles. The aim of OGSA is to define a common, standard, and open architecture for grid-based applications and to standardize all the services in grid systems by specifying a set of standard interfaces for them. Although OGSA dominates as a standard, we cannot claim that it is supported by every Grid environment. On the contrary, the development of OGSA as a Grid standard is not finalized yet, and many Grid environments do not implement it. Typical examples are middleware such as g-Lite and BOINC. The first one is partially service-oriented and the second is not service-oriented at all. OGSA itself is implemented by the Globus Toolkit middleware. In this article we generalize the main problems which arise during the development of grid applications for heterogeneous grid environments and we propose an approach for solving these problems. Our research is based on the examples of the g-Lite middleware and the Globus Toolkit.
2
Problem Definition
The Grid middleware is a software environment that implements the connectivity, collective and application layers of the grid architecture by coordinating and integrating Grid resources and services. The heterogeneous Grids use different technologies and protocols in their middleware for the design and implementation of the
environment's services. The development of a grid application, however, is associated with access to and invocation of the middleware's services. This causes problems for developers of Grid applications, who have to use different approaches to access the different Grids. Globus Toolkit 4 (GT4), for example, is a service-oriented toolkit for the development of Grid infrastructures. It realizes the OGSA requirements for stateful services by implementing the Web Services Resource Framework (WSRF) specification. GT4 provides high-level services such as a discovery service, job submission, and a security service, as well as common runtime libraries and tools. The technology used in GT4 for the realization of all high-level services is Axis2. From the developers' point of view, the advantage of this middleware is that the high-level services cover all the functionality necessary for a grid application, and one technology is used for the realization of all of them. Another advantage is that the end user can define, register and deploy his or her own user-defined service. A disadvantage is the service description; the high-level services cannot be invoked by a standard Axis2 tools client. Another example of commonly used middleware is g-Lite. The environment is designed and implemented for the EGEE Grid infrastructure and is tightly specified for the needs of this project. G-Lite provides grid services (components) such as the Workflow Management System, computing services such as the Computing Element, information services such as BDII and R-GMA, storage services such as the Storage Element, and security services. Some of them are service-oriented high-level services and others are not. The technology used for the realization of the service-oriented high-level services is Axis 1.4. For other grid functionality, the middleware provides different APIs (Java, Python, C) for access. From the developers' point of view, the advantage of this middleware is that the service-oriented high-level services are well defined and described. A disadvantage is that the end user cannot define his or her own service and that there is no unified access to all high-level services. Integration of services provided by heterogeneous grid environments requires unification and standardization of the provided services. This requirement, of course, cannot be satisfied for Grid environments which use different technologies for service realization. On the other hand, this is a problem for the development of those grid applications which use services and resources from heterogeneous grid environments. On the basis of the two given examples, we can generalize the following types of problems, concerning partially the g-Lite environment and partially integration problems.
1. No unified way to access high-level services — the developer has to use different technologies to access services from one middleware.
2. Lack of service-oriented high-level services — not all important high-level services are accessible as actual (Grid) services.
3. Lack of opportunity to define and register user-defined services — a user can develop his or her own grid service, but cannot register it in the grid environment.
4. Difficulties in integrating high-level services from more than one grid environment — the difficulty of using services from different types of Grids simultaneously, excluding the different security policies.
The solution to these problems lies in service-oriented integration.
3
Service-Oriented Approach for Integration
Service-oriented integration (SOI) [3] is an approach to integrating computing entities using only service interactions in a service-oriented architecture. As we mentioned above, most grid environments provide services and APIs that give applications access to grid functionalities. Through the use of APIs, end users or applications can access the functionality of the underlying grid middleware. However, the APIs exposed by different environments differ both in the way they are accessed and in the technology with which they are accessed. The objective of heterogeneous grid integration is to understand and use the APIs for accessing the required functionalities and to mask the differences between the technologies used for the APIs and their access. The service-oriented approach for integration achieves that by using services which expose interfaces. The interfaces, as they are defined by SOA, provide the contract between different applications. This contract assures that the interface of the service stays unchanged, regardless of the interface's implementation. The main accent in the service-oriented approach for integration is on well-defined interfaces which are loosely coupled. How this approach can be applied as a solution to the problems defined above is a matter of more extensive research. In the rest of this article we describe the initial steps and conclusions that were made during the process of investigation and the future work that remains to be done. All results and conclusions are made on the basis of the example of the two middlewares GT4 and g-Lite. As we mentioned above, not all services in legacy Grid environments are service-oriented. We cannot change this, but we can define a set of well-defined service interfaces (service adaptors, service proxies and service mediators) which provide a mechanism for legacy service access. Such a service layer has to be developed or adapted for every grid environment which is used by applications. If we reconsider the problems described above, the problem of unified access to the services and the lack of service-oriented high-level services can easily be solved by creating service adaptors and service mediators. A service adaptor is a service which wraps a legacy function and makes it callable through web services. In this way functionality that is not service-enabled can be exposed as a service in the meaning of SOA. Service mediators hide the complexity of an underlying legacy function by providing service-enabled access to this functionality. It is important to use only one technology for the implementation of all the services. That provides unification and standardization of the way these services are accessed. The possibility for a user to register a set of services (chosen or developed by him/her) is of great benefit for every service-oriented environment. Using the service-oriented approach, this problem can be solved by defining a registry, a service which stores user-defined service descriptions, service endpoints and service security policies in a database. To implement this service, the UDDI specification with the standard functions browse, get, and publish can be used. The integration of high-level services on more than one grid environment, however, is not an easy task. The different security policies and mechanisms which the heterogeneous Grids use require the creation of proxy services for every Grid which will be integrated.
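As a language-agnostic illustration of the adaptor idea (sketched here in Python purely for brevity; in our prototype the adaptors are Axis2 services), a legacy, non-service-enabled function is hidden behind a stable interface so that callers depend only on the published contract. All names in the sketch are hypothetical.

```python
# Conceptual sketch of a service adaptor: the published interface stays fixed,
# whatever legacy mechanism actually performs the job submission.
from typing import Callable, Protocol

class JobSubmissionService(Protocol):
    def submit(self, job_description: str) -> str: ...   # the contract seen by clients

class LegacyToolAdaptor:
    """Wraps a legacy (non-service) submission routine behind the service contract."""
    def __init__(self, run_legacy_tool: Callable[[str], str]):
        self._run = run_legacy_tool

    def submit(self, job_description: str) -> str:
        # Translate the service-level request into whatever the legacy tool expects.
        return self._run(job_description)
```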
Fig. 2. Services
4
Applying Approach Experience
In order to solve the above problems, we start with the definition and development of registries. The function of the registries in our layer is very important, because they contain information about users, certification authorities, virtual organizations and service descriptions. The technology that we used for the development of the layer is based on the Tomcat Server, the Axis SOAP engine and MySQL as the database management system. First we investigate the g-Lite and GT4 services and classify them according to several criteria: level, origin, and description. The level of the service specifies whether the service is used for access to a grid resource (low level) or is accessed by another service or by a grid application (high level). According to that criterion, g-Lite has both low-level and high-level services. The origin of the service defines whether the service is a part of the existing environment or is a part of an external application. In this case, services are system and user-defined services. G-Lite has only system services. The description of the service is the most important characteristic, since it defines its functionality. According to that criterion, services are computational — providing functionality for job submission, security — providing functionality for user authentication, authorization and proxy creation, storage — providing functionality for file transfer and storage, informational — providing functionality for service discovery, and composite — providing functionality for service composition. All registered user-defined services and system services are stored in a database in the services registry (Figure 2). The information which we store about the services is based on the criteria defined above. We store the service type (origin and description), for example JSB for job submission, USR for user-defined, INF for information services. We also store the grid type, containing information about the environment in which this service is defined. High-level services for adding a service to the registry, browsing the registry and finding a service remain to be developed. For every grid user who has a valid certificate and is registered in a virtual organization, we store information in the users registry (Figure 3). We store information about the user certificate, the generated user proxy and the distinguished name. We also store information about the user key and the key password; however, this raises many security issues. The problems are mainly related to the fact that if an application can access this registry, then it can create more and more
Fig. 3. Users
The main problem is that if an application can access this registry, it can create user proxies on the user's behalf, simply because the application has access to the user key and the user password. This is against all security policies in the Grid based on X.509 PKI, which require that only the user has access to his private key. A solution that can partially avoid this problem is for the application to use a dynamically built connection to the registry, based on the DN of the user who invokes the service. Access to a row in the users table can then be restricted based on the data contained in that row (the user DN). In that way the application is able to see only those rows where the value in the DN column is equal to the DN passed in the dynamic connection. Still, more security issues remain to be solved, such as how the user will securely upload his private key, and whether the two fields for the user key and the key password need to be doubly encrypted for better security. In g-Lite, this kind of user information (user DN, user rights and the corresponding CA) is stored in a file in a directory of every component (grid service) that can be accessed by the user. The same is true for the public certificates of the certification authorities. Even though every valid certification authority has a URL from which its public certificate can be downloaded, these URLs are also stored in a file on the same grid service. That makes the development of grid applications dependent on the operating system and tightly bound to the deployment of the application on a machine where all the mentioned user and certification authority files exist. This disadvantage can be avoided exactly by using the two types of registries: users and certification authorities.
For certification authorities (Figure 4), we store information about the CA URL, status, version, PEM certificate, etc. Currently this information can be loaded and updated through a shell script (both for g-Lite and for GT4) that scans and reads the files from a specified directory where all certificates are located. The registry is updated every time a new CA is added, updated or removed. The registry is also applicable to GT4 certification authorities, which use the same principles of user authorization. Interesting problems can occur when certification authorities from different grid environments use the same alias but differ in their meaning. Such problems can be solved by identifying every certification authority in the registry not only by its alias, but by the type of the grid as well.
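As a minimal sketch of the DN-based row restriction on the users registry described above, the following JDBC fragment assumes a hypothetical registry_users table with dn and proxy columns; the actual schema and connection handling of the layer are not specified here.

import java.sql.*;

// Sketch: the application may only read the registry row whose DN matches the
// distinguished name passed in when the dynamic connection is established.
class UserRegistryDao {
    private final Connection conn;

    UserRegistryDao(String jdbcUrl, String dbUser, String dbPass) throws SQLException {
        this.conn = DriverManager.getConnection(jdbcUrl, dbUser, dbPass);
    }

    // Returns the stored proxy certificate for the given DN, or null if none exists.
    String loadProxyForDn(String callerDn) throws SQLException {
        String sql = "SELECT proxy FROM registry_users WHERE dn = ?";
        try (PreparedStatement ps = conn.prepareStatement(sql)) {
            ps.setString(1, callerDn);   // only rows matching the caller's DN are visible
            try (ResultSet rs = ps.executeQuery()) {
                return rs.next() ? rs.getString("proxy") : null;
            }
        }
    }
}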
Fig. 4. Certification Authorities
The advantage of using a database, especially for services whose security is based on X.509 certificates and grid user proxies, is that all the necessary grid security information is stored in one place which can be accessed from everywhere. This allows grid applications to be developed independently of the operating system. Using this feature we developed an example WMS mediator, which loads the user's proxy and the CA certificates from the registries and exposes the main WMS functionality, requiring as input only the address of the WMS Proxy Server. The problem with the WMS service is that its security depends on the proxy certificate and its verification. The WMS service provides a Java-based client API for access to the service functionality. To use this API, the developer has to have access to a directory containing the valid certificates of all certification authorities, in order to verify the user certificate. This makes the WMS usable only from a place (computer) where such a directory exists. The technology we use for the development of the mediator service is Axis2. For test purposes, and in order to unify the way services in GT4 and g-Lite are accessed, we also started the development of a service adaptor for job submission in the g-Lite and GT4 environments. The technology we use for this service is again Axis2. For the moment the service provides functionality for job submission only in the g-Lite environment (Figure 5). We are still working on an Axis2 proxy service, which will provide functionality for grid proxy and VOMS proxy creation and will store the generated proxy certificate in the database.
Fig. 5. Job Submission service
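The following sketch only illustrates the mediator idea under the assumptions above: the caller supplies the WMS address, its DN and a JDL description, while the proxy certificate comes from the users registry. WmsClient is a placeholder standing in for the real WMS client API, which is not reproduced here.

// Placeholder for the real WMS client; it would build an SSL context from the
// user proxy and the CA bundle and submit the job to the given host.
class WmsClient {
    WmsClient(String host, String userProxyPem, String caBundlePem) { }
    String submitJob(String jdl) { return "wms-job-" + Math.abs(jdl.hashCode()); }
}

// The mediator fetches the security material, so the caller needs no local
// certificate directories and no knowledge of the underlying client API.
class WmsMediator {
    private final UserRegistryDao users;   // user proxies keyed by DN (see the registry sketch above)

    WmsMediator(UserRegistryDao users) { this.users = users; }

    String submit(String wmsHost, String callerDn, String jdl, String caBundlePem) throws Exception {
        String proxy = users.loadProxyForDn(callerDn);   // proxy certificate from the registry
        WmsClient client = new WmsClient(wmsHost, proxy, caBundlePem);
        return client.submitJob(jdl);                    // job identifier returned by the WMS
    }
}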
5 Conclusion and Future Work
In the g-Lite environment there are many components which are not service-enabled. The aim of this research is not to develop services for every non-service-enabled component in g-Lite, but to provide a conceptual way in which access to the services of g-Lite and GT4, two heterogeneous Grids, can be unified. However, many things remain to be done. More service adaptors have to be defined for the information services in the two types of grids. In g-Lite, for example, the information services are realized by two components: the BDII and the MON box. G-Lite provides client service discovery functionality (API and CLI), which searches for the requested information in both components; this service discovery functionality is not service-enabled. GT4 provides the MDS4 index service for component information and discovery. In this case, the service adaptor has to provide service discovery as a service, using the API or CLI provided by g-Lite and a mechanism for invoking GT4 services based on the user credential. Moreover, a well-defined common description of all services has to be provided for the service registry. Last but not least, a mechanism for how and where user-defined services can be deployed also has to be provided.
References
1. Foster, I., Kesselman, C.: The Grid 2: Blueprint for a New Computing Infrastructure. Morgan Kaufmann, San Francisco (2004)
2. Foster, I., Kishimoto, H.: The Open Grid Services Architecture, Version 1.5 (2006), http://forge.gridforum.org/projects/ogsa-wg
3. Juric, M., Loganathan, R.: SOA Approach to Integration. Packt Publishing (2007)
Grid Based Environment Application Development Methodology Dorian Gorgan, Teodor Stefanut, Victor Bacu, Danut Mihon, and Denisa Rodila Computer Science Department, Technical University of Cluj-Napoca {dorian.gorgan,teodor.stefanut,victor.bacu}@cs.utcluj.ro, {danut mihon,denisa rodila}@yahoo.com
Abstract. Satellite image oriented processing in Grid based environment applications needs flexible tools for process description and execution in order to reveal significant information. The main processing consists of imagery classification, which is essentially a search for information through combinations of multispectral bands of satellite data. The Grid based application development process should assist the developer in implementing the required functionality, while making the Grid complexity and the mapping of the conceptual application onto the physical Grid resources transparent to the user. The paper exemplifies the gProcess based Grid application development methodology throughout the development phases of the GreenView environment application, and analyzes the efficiency of such an approach. The primary aim of the GreenView application is the refinement of surface and vegetation parameters in SEE and CE regions based on satellite images. The GreenView application is based on the ESIP platform, which relies on the gProcess toolset and its satellite image processing operators and services. Keywords: Environment application, development methodology, Grid, workflow, process description.
1 Introduction
The Environment oriented Satellite Data Processing Platform (ESIP) has been developed through the SEE-GRID-SCI (SEE-GRID eInfrastructure for regional eScience) [11] two-year project (2008–2010), which is funded by the European Commission. The ESIP platform supports the development and execution of Grid based applications concerned particularly with the processing of satellite images and more generally with environment-related processing and studies. ESIP is based on the gProcess platform [10] developed within the MedioGrid [8] national research project. It provides the user with the possibility to explore optimal solutions for Grid processing and for information searching in the multispectral bands of satellite images. The gProcess platform is an interactive toolset supporting the flexible description, instantiation, scheduling and execution of Grid processing.
The gProcess platform is a collection of Web and Grid services and tools providing the following basic functionality: visual manipulation based interactive description of Grid based satellite image processing by pattern workflows; development of hypergraphs as a composition of basic operators, services, and subgraphs; pattern workflow instantiation for a particular satellite image; satellite data management, access and visualization; and workflow based Grid execution. The applications based on the ESIP platform aim to benefit from Grid processing and storage capabilities and, at the same time, to provide the final users with functionalities that make the data processing and acquisition complexity transparent to them. This paper presents a methodology for creating environmental applications (e.g. GreenView [11]) using the resources provided by the ESIP platform. The methodology is concerned mainly with algorithm identification and analysis, data model definition, identification of the atomic parts of the algorithms, parallel or serial processing design, description of the processing using gProcess, graph description and instantiated graph specification, process execution, and graphical user interface development.
2 Related Works
The large amount of information received from the satellites — about 1.5 Terabytes of raw data per day from the next generation of HES [6,12] geostationary satellites — requires considerable processing resources and massive data storage. This is one of the main reasons for the need for parallel distributed computing in GIS information extraction and real-time processing. Over the years, the scientific community has developed large data repositories in the Earth Observation domain, but these are usually addressed only to specific organizations and are not available for public access. Currently, projects like GPOD [4] and GENESI-DR [5] aim to offer open access to repositories that provide functionality for satellite image management, indexing and retrieval. These projects allow both manual and automatic data access, through a Web interface or Web services. Simple and complex Grid operators based on workflow descriptions have been developed for data processing as well. Workflow systems use graphs as a solution for process modeling: UNICORE [3] and Symphony [7] use directed acyclic graph (DAG) based descriptions, while Kepler [1] and Triana [2] use directed cyclic graphs. Another available solution is based on Petri nets [9]. Grid workflow description languages are mainly XML based, for example the Grid Service Flow Language (GSFL) and Grid Workflow. The GreenView application relies on the gProcess workflow based description and execution, which has been developed through the MedioGrid project.
3 Grid Application Development Methodology
As mentioned above, currently most of the components needed to create an environmental application are scattered among different platforms that have been developed independently.
Many times, creating a reliable bridge that connects all these heterogeneous tools is a very complex task. Moreover, the complexity of an environmental application requires that specialists from different domains, such as mathematics, informatics, geography, and earth observation, work together in most of the development phases. As a result, in order to efficiently develop a reliable environmental application, a development methodology must be considered. This paper presents such a methodology based on the ESIP and gProcess workflow description and execution platforms, with the following development phases:
1. Algorithm identification and analysis;
2. Data model definition;
3. Identify atomic parts of the computation;
4. Algorithm implementation;
5. gProcess based process description;
6. User interface development.
4 Algorithm Identification and Analysis
At the beginning of the application development it must be established what types of processing algorithms (e.g. interpolation, random number generators, etc.) will be implemented. After choosing the type of algorithms, where applicable, a thorough analysis must be conducted in order to choose the best algorithm for each processing type (e.g. for interpolation we could use: neighbor interpolation, bicubic interpolation, bilinear interpolation, etc.). There are many criteria that can be used in algorithm selection, such as: precision, computational resources needed on a specific data set, parallelization possibilities, etc. The selection process must be conducted by experts from mathematics, informatics and the application domain together, in order to ensure the best result in the algorithm selection. At the end of this phase, the algorithms must be described as mathematical formulas that will later be implemented as operators in a Grid processing structure. For example, in our application we will implement a coarse to fine interpolation aiming to create an image with a better resolution than the original. The missing points will be computed using a distance weighted nonlinear interpolation that computes the value of a point V_m from four surrounding known values V_1, V_2, V_3, and V_4. The mathematical formulas that will be implemented are:

V_m = \sum_{i=1}^{4} \frac{\cos^4\left(\frac{\pi}{2}\cdot\frac{d_i}{d_{\max}}\right)}{\sum_{j=1}^{4}\cos^4\left(\frac{\pi}{2}\cdot\frac{d_j}{d_{\max}}\right)} \cdot V_i \qquad (1)

d_i = 2R \arcsin\sqrt{\sin^2\frac{\varphi_a-\varphi_b}{2} + \cos\varphi_a \cos\varphi_b \sin^2\frac{\lambda_a-\lambda_b}{2}} \qquad (2)

where: V_i — one of the four surrounding pixels, V_i(φ_i, λ_i); d_i — the great circle distance between two points; d_max — the great circle distance between the two furthest of the four surrounding pixels; R — the average radius of a spherical approximation of the Earth (≈ 6371.01 km).
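A compact Java sketch of formulas (1) and (2) is given below. It assumes coordinates expressed in radians and is only an illustration of the interpolation scheme, not the GreenView implementation.

// Distance-weighted interpolation (1) and great-circle distance (2);
// latitudes and longitudes are in radians, distances in kilometres.
class CoarseToFineInterpolation {
    static final double R = 6371.01;   // mean Earth radius [km]

    // Great-circle distance between points a and b, formula (2).
    static double greatCircle(double latA, double lonA, double latB, double lonB) {
        double s1 = Math.sin((latA - latB) / 2);
        double s2 = Math.sin((lonA - lonB) / 2);
        double h  = s1 * s1 + Math.cos(latA) * Math.cos(latB) * s2 * s2;
        return 2 * R * Math.asin(Math.sqrt(h));
    }

    // Value at the target point from the four surrounding values, formula (1);
    // d[i] is the distance from the target to point i, dMax the great-circle
    // distance between the two furthest of the four surrounding pixels.
    static double interpolate(double[] v, double[] d, double dMax) {
        double num = 0, den = 0;
        for (int i = 0; i < 4; i++) {
            double w = Math.pow(Math.cos(Math.PI / 2 * d[i] / dMax), 4);
            num += w * v[i];
            den += w;
        }
        return num / den;
    }
}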
5 Data Model Definition
This second phase of the development process aims to establish what kinds of data types (e.g. satellite images) and sources can be used and processed in the application. There are several satellite image types (MODIS, Landsat, Aster, etc.) that provide different information in specific data formats and structures. The selection of image types is conditioned mainly by the information required for the functionalities included in the application and, where multiple image types are compatible, by the data structures (e.g. coordinate systems) and information quality (e.g. image resolution). The next step in data model definition is to identify the available data sources (repositories) for the data types established. These sources should provide functionalities for automatic data management, indexing and retrieval, as these databases should provide the end user with the possibility to search in real time for the needed information. For our example application the data model is defined by a MODIS satellite image representing a certain area of the Earth and temperature values recorded at specific points inside the same area. The resolution of the MODIS image is 1 km² and the resolution of the temperature measurements is about 150 km². The aim of the application is to compute the temperature in every point of the MODIS image using the interpolation algorithm described above and the recorded temperature values. As output, an image in .tiff format will be obtained, representing the temperature values for the same area but with the higher resolution of 1 km².
6 Identify Atomic Parts of the Computation
In order to take advantage of the Grid processing capabilities, the execution of the algorithms needed for the application should be parallelized. The parts of the processing algorithms that can be executed independently, named atomic parts, should be identified and analyzed for potential parallel execution. This analysis should also consider the time needed for data transfer between the nodes of the Grid network, as there are situations when this operation could take longer than the overall serialized execution of the algorithms' atomic parts on the same Grid node. The implementation of the algorithms as atomic parts significantly increases the degree of re-use, as these parts can be used in the same application for different functionalities or even in different applications, speeding up the development process. The interpolation algorithm used in our example is a simple one and does not involve separate processing steps, so it can be considered as a single atomic part. Nevertheless, the conversion of the temperature file to the metric coordinate system and the pseudo-coloring algorithm are separate atomic parts of the application and will be implemented as separate Grid operators. As the data involved in the processing can be quite large, all three operators are serialized and executed on the same node in the Grid.
In these conditions, the processing time for every month is about 32 seconds and grows linearly with the number of months. Therefore we group the computation related to one month on the same node and parallelize the computations for different months, using the Grid parallel processing capabilities. This way, two or more processings can take place at the same time and the overall computing duration will be significantly shorter.
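The grouping can be sketched as one self-contained work unit per month, as in the following illustrative Java fragment; the three operator calls are placeholders for the actual Grid operators, and job submission itself is middleware specific and not shown.

import java.util.ArrayList;
import java.util.List;

// One job per month: the three operators run serialized inside a job,
// while different months can be executed in parallel on different nodes.
class MonthlyJobs {
    static List<Runnable> buildJobs(List<String> months) {
        List<Runnable> jobs = new ArrayList<>();
        for (String month : months) {
            jobs.add(() -> {
                convertCoordinates(month);   // operator 1: coordinate system conversion
                interpolate(month);          // operator 2: temperature interpolation
                pseudoColor(month);          // operator 3: pseudo-coloring
            });
        }
        return jobs;
    }
    static void convertCoordinates(String month) { /* placeholder */ }
    static void interpolate(String month)        { /* placeholder */ }
    static void pseudoColor(String month)        { /* placeholder */ }
}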
7 Algorithm Implementation
This phase decides the implementation form of the algorithms’ atomic parts. Grid or web services, procedures or separate applications can be used to implement a specific part of the algorithm, while the other parts could be implemented in the same or in different forms. Some of the most important criteria that could influence the choice are: the source of the needed information (local saved file, resource from a distributed database, remote file that can be acquired through HTTP or FTP), Grid platform specific requirements or the possibility of this operator being used in other applications over Web or Grid. As a result of the analysis carried out in the previous step of the methodology, we will need three operators in our example application: system coordinate conversion, interpolation algorithm computation, and the pseudo-coloring. As an example, we consider that the algorithm for conversion from geographical to metric coordinates is available already implemented as a Grid or Web service.
8 Graph Based Process Description
Conceptually the algorithm can be described as an acyclic graph (Fig. 1, left) that has arcs representing the execution dependencies between nodes and four different types of nodes:
– Input data or resource. This type of node can represent a satellite image or a data value (int, float, string data types) used for special operations (e.g. threshold).
– Operator. Represents any atomic operation identified in the 3rd phase of the methodology and implemented in the 4th phase as a procedure or a standalone application.
– Service. A Web or Grid service that may be included in the processing graph in a similar way as an ordinary operator. The main difference comes from the way in which these nodes are executed over the Grid.
– Sub-graph. Used for developing complex processing description graphs, this type of node allows the integration of a sub-graph in another graph. Virtually any acyclic processing graph can be included as a sub-graph in another one.
8.1 Pattern Process Description
Fig. 1. PDG examples: left — workflow describing the Grid based processing, by satellite image bandwidths (Input), operators (OP), Grid and Web services (S), subgraphs (SG), and processed images (Output); right — the non-linear interpolation algorithm

Making use of the graph elements described above, and through the interface provided by the gProcess platform, the processing graph for each algorithm can easily be described by a pattern process description graph (PDG) [10], combining the atomic parts and the data defined in the 1st and 2nd phases of the methodology (Fig. 1, right). High attention should be paid to the compatibility of two connected nodes, in terms of input and output types and formats. The PDG description of the algorithm specifies the data models used, the operators (i.e. the atomic parts) involved and how all these nodes are connected, but does not make any reference to specific data resources (e.g. a specific image file, int or float value, etc.). This way, the same PDG can be used to process any specific data that meets the data model requirements, avoiding remaking the graph based description for every data set.
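As a minimal illustration, a PDG can be represented by typed nodes connected through dependency arcs, as in the following sketch; the type and field names are hypothetical and do not mirror the actual gProcess data model.

import java.util.ArrayList;
import java.util.List;

// A processing graph node: its kind, a name, and the nodes it depends on.
class PdgNode {
    enum Kind { INPUT, OPERATOR, SERVICE, SUBGRAPH, OUTPUT }

    final Kind kind;
    final String name;                                  // e.g. "MODIS image", "interpolate"
    final List<PdgNode> inputs = new ArrayList<>();     // execution dependencies

    PdgNode(Kind kind, String name) { this.kind = kind; this.name = name; }

    PdgNode dependsOn(PdgNode n) { inputs.add(n); return this; }
}

// Example wiring: an operator node consuming two input nodes.
// PdgNode image = new PdgNode(PdgNode.Kind.INPUT, "MODIS image");
// PdgNode temps = new PdgNode(PdgNode.Kind.INPUT, "temperature measurements");
// PdgNode op = new PdgNode(PdgNode.Kind.OPERATOR, "interpolate").dependsOn(image).dependsOn(temps);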
8.2 Instantiated Process Description Graph
In order to apply an algorithm over a compatible data set, the PDG that describes the algorithm must be mapped onto the specific data by an instantiated PDG (iPDG). This operation requires specifying, for every resource node defined in the PDG, a data resource that meets the format and type requirements (e.g. satellite image, int, float, string, etc.). All the data files should meet the description of the data models defined in the 2nd phase of the development methodology.
8.3 Process Execution
The execution of an algorithm over a data set can be done using the iPDG file described by the user. When the execution is requested, the application verifies whether the specified iPDG is correctly formed and, if the file meets the requirements, it is passed to the Executor service provided by the gProcess platform. The service parses the iPDG description and generates the appropriate internal data structure, expanding the sub-graph nodes and considering for execution only the atomic nodes such as operators, services and data resources.
Fig. 2. Sample of temperature computation for coarse to fine resolution of satellite image: left — initial temperature measurements of coarse resolution satellite image; right — computed temperature of fine resolution satellite image
At this point, before launching an operator, the Executor service performs a final consistency check, similar to the one performed at the PDG description phase, but extended with verifications concerning the availability of the operators (procedures, applications, services, etc.) and of the data sources that are to be used in the execution.
9 User Interface Development
The application interface must provide the user with access to all the application functionalities while hiding the complexity of the processing that is carried out in the background. The development of the interface may start earlier, just after the data model definition phase, when all the required information for the user interface design is already available. The graphical user interface development should be user-centered in order to maximize the interface usability and efficiency. As most of the functionalities needed by the environmental application are already implemented in ESIP platform, the interface development will be constrained only by the very specific information related to the project (e.g. operators, data model, workflow, and parallelization).
10 Conclusions
The development of environmental applications based on Grid infrastructures and dedicated to non-technical experts from different domains is a challenging task. The complex computing and data acquisition processes must be covered by a simple and straightforward interface that allows specialists with average computer skills and no Grid related skills to process massive data, distributed or available through the HTTP or FTP protocols, in order to conduct research or observation studies based on satellite image analysis. The development process requires a thorough analysis of the algorithms that are to be implemented, of the required data and its availability, of the technologies used to develop the application, and of the interface design and implementation.
All these studies must be performed by specialists from very different domains, such as mathematics, informatics, and Earth Observation related domains, who must work together on a single, unified product. The methodology presented in this paper offers guiding recommendations that allow a better collaboration between the experts participating in the development process and a more efficient management of resources. Acknowledgments. This research is supported by the SEE-GRID-SCI (SEE-GRID eInfrastructure for regional eScience) project, funded by the European Commission through contract no. RI-211338.
References 1. Altintas, I., Berkley, C., Jaeger, E., Jones, M., Ludaescher, B., Mock, M.: Kepler: Towards a Grid-Enabled System for Scientific Workflows. In: Proceedings of Workflow in Grid Systems Workshop in GGF10, Berlin, Germany (2004) 2. Churches, D., Gombas, G., Harrison, A., Maassen, J., Robinson, C., Shields, M., Taylor, I., Wang, I.: Programming scientific and distributed workflow with Triana services. Journal of Concurrency and Computation: Practice and Experience 18(10), 1021–1037 (2005) 3. Erwin, D.W., Snelling, D.F.: UNICORE: A Grid Computing Environment. In: Sakellariou, R., Keane, J.A., Gurd, J.R., Freeman, L. (eds.) Euro-Par 2001. LNCS, vol. 2150, pp. 825–834. Springer, Heidelberg (2001) 4. Fusco, L., Cossu, R., Retscher, C.: Open Grid Services for Envisat and Earth observation applications. In: Plaza, A., Chang, C. (eds.) High Performance Computing in Remote Sensing, pp. 237–280. Chapman & Hall/CRC/Taylor & Francis Group (2008) 5. GENESI-DR project consortium (2009), http://www.genesi-dr.eu 6. Hawick, K.A., Coddington, P.D., James, H.A.: Distributed frameworks and parallel algorithms for processing large-scale geographic data. Parallel Computing 29(10), 1297–1333 (2003) 7. Lorch, M., Kafura, D.: Symphony — A Java-based Composition and Manipulation Framework for Computational Grids. In: Proceedings of 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2002), Berlin, Germany, pp. 136–144. IEEE Computer Society, Washington (2002) 8. Petcu, D., Zaharie, D., Gorgan, D., Pop, F., Tudor, D.: MedioGrid: a Grid-based Plat-form for Satellite Images. In: Procs. IDAACS 2007, pp. 137–142. IEEE Press, Los Alamitos (2007) 9. Peterson, J.L.: Petri Nets. ACM Computing Surveys 9(3), 223–252 (1977) 10. Radu, A., Bacu, V., Gorgan, D.: Diagrammatic Description of Satellite Image Processing Workflow. In: Proc. Int. Symp. Symbolic and Numeric Algorithms for Scientific Computing (SYNASC 2007), pp. 341–348. IEEE Press, Los Alamitos (2007) 11. SEE-GRID-SCI project consortium, SEE-GRID eInfrastructure for regional eScience (2009), http://www.see-grid-sci.eu/ 12. Schmit, T.J., Li, J., Gurka, J.: Introduction of the Hyperspectral Environmental Suite (HES) on GOES-R and beyond. Presented at the International (A)TOVS Science Conference (ITSC-13) in Sainte Adele, Quebec, Canada (2003)
User Level Grid Quality of Service
Anastas Misev1 and Emanouil Atanassov2
1 University Sts Cyril and Methodius, Faculty of Natural Sciences & Mathematics, Institute of Informatics, Skopje, Macedonia
[email protected]
2 Bulgarian Academy of Sciences, Institute for Parallel Processing, Sofia, Bulgaria
[email protected]
Abstract. Improving the Quality of Service of the Grid infrastructure is one of the most important ongoing issues in the Grid community. It has many implications, from broader users’ acceptance of the technology to the shift of the Grid usage from the scientific world toward the businesses. While best effort can be acceptable in the scientific community, the business applications must have clear and well defined service levels based on deterministic QoS metrics. In this paper we introduce a different type of Grid QoS, one under users’ control. Using it, the users can have better control over the level of services they are using from the Grid infrastructure.
1 Introduction
For a technology to gain wide acceptance, it has to be user oriented and user controlled as much as possible. Even traditionally one-way services like audio or video broadcasting gain a whole new dimension when the users can control them, in the form of on-demand services, personalized streams, etc. We expect the same from Grid technology. Since its beginnings, it has been a technology adopted mostly by the scientific community. But having in mind its potential and possible directions of development like cloud computing, network computing, etc., something has to be done to bring it closer to the users and, at the same time, make it attractive to businesses. Improving the Quality of Service of the Grid infrastructure is one of the most important ongoing issues in the Grid community. It has many implications, from broader user acceptance of the technology to the shift of Grid usage from the scientific world toward businesses. While best effort can be acceptable in the scientific community, business applications must have clear and well defined service levels based on deterministic QoS metrics. Using techniques like process mining [1] we have realized that there are many cases of jobs with very high queue waiting times [9]. The long waiting time is one of the most common reasons for job failures, usually due to expired proxies. Similar results can be found in [7]. Also, many new Grid users note that even though their jobs are short, they can get stuck in the waiting queues for a very long time. This makes them quite skeptical toward accepting this powerful infrastructure.
The paper is structured as follows: in the second part we discuss the general need and justification for QoS in Grid computing, in the third part we review the current work in the area of Grid QoS, in the fourth part we introduce our proposed system, in the fifth part we explain its operation in more detail, in the sixth we give our initial experimental results, and in the final two parts we give guidelines for future work and the conclusion.
2 The Need for Grid QoS
The issue of Grid QoS has been addressed by many authors, from various perspectives [11]. Foster and Kesselman [5] refer to the Grid QoS as one of the greatest challenges. They identify it as an end-to-end resource management system, capable of orchestrating multiple elements in order to meet the growing needs of the users, including businesses. A common approach is the categorization of the QoS attributes into two major categories: quantitative and qualitative. Qualitative attributes are usually very hard to measure and refer to elements including user satisfaction and service reliability. On the other hand, quantitative attributes can be measured exactly. They include network attributes (throughput, delay, jitter, bandwidth), processing attributes (exclusive or shared usage, time intervals of dedicated processing) or storage attributes (capacity, bandwidth). Although qualitative attributes are hard to measure, they are very important, especially from the users' perspective. Maintaining Grid QoS can be crucial for some types of applications. Interactive jobs, jobs involving strict deadlines, visualization and rendering are only a few examples where a certain level of QoS is needed to support them. Without it, most of them will produce unusable results. According to the work of Al-Ali et al. [2], Grid resource management systems are expected to fulfill several requirements related to QoS:
– Advance Resource Reservation
– Reservation Policy
– Agreement Protocol
– Security
– Simplicity
– Scalability
QoS has been extensively used in the areas of multimedia and networking. Although the ultimate goal is similar, satisfying user requirements, the QoS in grid computing usually requires centralized control [4]. This central system will then be queried to determine the resources to be allocated for the execution of specific tasks. One of the main tasks of this system will be enforcement of the Service Level Agreements governing the QoS. The Grid QoS should be observed at several layers: networking, middleware and applications layer [13]. Maintaining QoS has different requirements and implementation techniques at each of these layers. To identify the possible bottlenecks in the jobs lifetime, thus point the elements where QoS is crucial to be applied, we first need to deeply understand the
jobs' workflow. As a Grid middleware platform for the analysis, we have chosen the gLite middleware, used in most European Grid installations. By applying the process mining tool ProM [17] to the data collected by the Logging and Bookkeeping [12] service of the gLite middleware, we obtained an objective overview of the job workflow. The most important conclusions are elaborated in [9]. Two of the plug-ins used lead us to a similar solution. Both the performance sequence diagram and the Petri net performance analysis pointed to a relatively high percentage (up to 20% for some users) of jobs spending between 57 and 65 hours waiting in the queues before failing. Even the average waiting time over all jobs is more than 6 hours [8]. This conclusion is in line with other researchers. Khalili et al. [7] identify waiting times at batch schedulers as "the most relevant and prevalent source of performance variability".
3 Current Grid QoS
Since it has been identified as one of the most important aspects of Grid computing, QoS has been addressed by many authors. Most important work in this area is presented in this chapter. The General-purpose Architecture for Reservation and Allocation (GARA) is the most commonly known framework for supporting QoS in the context of computational Grids. GARA [6] provides programmers with the capability to specify end-to-end QoS requirements. It provides advance reservations, with uniform treatment of various types of resources such as network, computation, and storage. GARA’s reservation is aimed to provide a guarantee that the client or application initiating the reservation will receive a specific QoS from the resource manager. GARA also provides an application programming interface to manipulate reservation requests, such as create, modify, bind, and cancel. GARA uses the Dynamic Soft Real-Time (DSRT) scheduler. The most important drawback of this framework is that it lacks the support for web services, meaning that it is not OGSA-compliant. An OGSA based programmable framework must export APIs for managing allocation of CPU and storage resources to the Grid Services. GT3 provides a set of management mechanisms enabling job submission on distributed resources, namely the Globus Resource Allocation Manager (GRAM). GRAM defines and implements APIs and protocols allowing Grid clients to securely instantiate jobs, according to the functionality of remote schedulers. Members of the Global Grid Forum (GGF) have identified the issues of concern related to Grid QoS. The GGF Grid Resource Agreement and Allocation Protocol (GRAAP) Working Group (WG), has produced a WS-Agreement[15]. The specification consists of three parts which may be used in a composable manner: a schema for specifying an agreement, a schema for specifying an agreement template, and a set of port types and operations for managing agreement life-cycle, including creation, expiration, and monitoring of agreement states This agreement defines the required behavior (QoS) of a delivered service, with reference to a particular service consumer.
Grid Quality of Service Management (G-QoSm) [2] is a framework to support QoS management in computational Grids in the context of the Open Grid Service Architecture (OGSA) [16]. G-QoSm consists of three main operational phases: establishment, activity, and termination. During the establishment phase, a client application states the desired service and QoS requirements. G-QoSm then undertakes a service discovery, based on the specified QoS properties, and negotiates an agreement for the client application. During the activity phase additional operations such as QoS monitoring, adaptation, accounting and possibly re-negotiation may take place. During the termination phase the QoS session is ended due to resource reservation expiration, agreement violation, or service completion; resources are then freed for use by other clients. The framework supports these three phases using a number of specialist components.
4 User Level QoS System
Our proposed system of job prioritization is intended to supplement the current gLite based grid infrastructure in as non-invasive a manner as possible. We propose an implementation in which it is optional for the sites to support it, giving the users some control over the QoS level of the middleware. Using this model of QoS gives users with short running jobs the possibility to have them wait less time in the queues, while maintaining fair resource sharing with other users. To avoid misuse of the prioritization, the system constantly monitors the actual job performance and compares it with the user-reported estimations. Hence, if a user tries to run a long running job and gives a wrong estimation of its running time, he will be penalized in his further usage. Since the system is integrated with the ULMON tool [10], the deployment of the prioritization part on the Computing Elements is optional. The implementation of the system on more CEs can be done gradually. If a job is matched to a CE that does not support prioritization, the user's credits will be returned to his pool for future usage. On the other hand, using the requirements part of the JDL, users can pick the sites that support this system and have their jobs matched only to these sites. In the future, this feature might become part of the GLUE schema, making the selection of the sites much easier. The proposed tool satisfies most of the requirements mentioned earlier, defined by Al-Ali et al., with some modifications.
5 ULQoS Operation
The operation of our QoS system is illustrated in Figure 1. The user submits a job to the WMS, using either command line tools or other means (portal, IDE, etc.). Then, using the web interface of the User Level MONitoring tool, he gets a list of his jobs, including the newly submitted one. He selects the job that he wants to be prioritized and assigns it the expected running time and some amount of credits.
Fig. 1. Sequence diagram of the operation of the tool
The system, using the allocated amount of credits c and the expected execution time et, calculates the priority level p of the job. The priority is calculated using (1):

p = k_1 \cdot \frac{c}{et} \qquad (1)

To ensure proper operation and fair usage, we have limited the execution time et to 100 minutes and the maximum number of credits per job c to 100. We note that the constants k_1, k_2 and k_3 used in the calculation will be corrected using empirical results. The system then sends a message containing the priority to the computing element that the job has been matched to, so that it can prioritize it. After the job finishes, the system (again using the data from the ULMON tool), having the actual running and waiting times, calculates the correction of the credits. If the infrastructure obeyed the user's QoS level request, there is no correction. If the infrastructure partially obeyed the prioritization, c_ret credits are returned to the user. Finally, if the infrastructure ignored the QoS request, all the credits are returned to the user. The correction of the credits is done using (2), where wt is the actual waiting time and ext is the actual running time:

c_{ret} = k_2 \cdot \frac{wt}{\max(et, ext)} \qquad (2)

If the user tried to misuse the system by specifying a short execution time while the actual running time was much longer, he is penalized by taking c_pen credits from his pool. The penalty is calculated using (3):

c_{pen} = k_2 \cdot c \cdot \frac{\max(et, ext)}{et} \qquad (3)
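The three formulas can be summarized in a small sketch such as the following; the capping of et and c and the empirical tuning of the constants are handled outside of it.

// Sketch of formulas (1)-(3); k1 and k2 are tunable constants,
// et, ext and wt are expressed in minutes.
class UlqosCredits {
    static double priority(double k1, double credits, double et) {
        return k1 * credits / et;                          // (1)
    }
    static double creditsReturned(double k2, double wt, double et, double ext) {
        return k2 * wt / Math.max(et, ext);                // (2)
    }
    static double creditsPenalty(double k2, double credits, double et, double ext) {
        return k2 * credits * Math.max(et, ext) / et;      // (3)
    }
}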
The idea to prioritize short running jobs comes from the user experience and user perception of the infrastructure. Users having long running jobs rely on the fact that they will get the results in a longer period. On the other hand, users that submit shorter jobs are much more affected by the long waiting times. The proposed system also allows the users to give priority to longer running jobs, but for that purpose they will have to allocate more credits to acquire the same priority level. The limited number of available credits per time period also ensures that the resources will be shared fairly, preventing the users which only submit short running jobs to be in a privileged position over the ones that run longer ones.
6 Usage Results
To verify the proposed QoS mechanism, we have used simulation. The simulation was done using the SimWiz simulator [3]. The input data was generated using the guidelines obtained from the real life L&B data [8].
Fig. 2. Waiting times in the queues with and without prioritization
Fig. 3. Comparison of the ratio of the waiting and execution time
The most important results are shown in Figures 2 and 3. Figure 2 shows the comparison of the waiting times for a mixture of jobs with different priorities. We note here that the average waiting time in both cases is the same. Only the jobs with lower priority, usually the longer running ones, will experience longer waiting times, which is expected and is not considered a disadvantage. Users that submit long running jobs count on having the results back much later. The effects of prioritization are most obvious for the short running jobs with high priority. Figure 3 shows that the ratio of the waiting time over the job execution time is much lower when prioritization is used.
7 Future Work
Future versions will include priority aging: the system will correct the priority periodically, using the time spent in the queues wt as an additional parameter. This way, jobs with some initial priority and with some waiting time in the queues will not be suppressed by new jobs with equal or slightly higher priority. The next step is production testing on real life systems. We will start with a small number of sites, using only one WMS and one ULMON instance. Further on, more sites and more WMS instances will be included in the system. To enable easier identification of the sites that potentially use ULQoS, we also propose future integration into the middleware, meaning that site administrators can advertise their sites that use ULQoS through the GLUE [14] schema, so the users can pick them in their JDLs.
8 Conclusion
Giving the users greater monitoring and QoS control over the Grid middleware will be one of the most important factors toward greater user acceptance of the technology. Also, defining clear QoS possibilities will make the infrastructure more appealing to the businesses too. Our proposed system only focuses on the user perspective of the Grid QoS, giving some advantages to the users submitting short running jobs, while maintaining the average waiting times the same. We expect that by its successful implementation on more sites, the users will get more comfortable to use this technology and to rely on it for their daily operations. Further research of this matter may produce industry level QoS standards that when applied will provide overall QoS mechanisms and metrics, giving the possibility to define clear and sound usage relations between all the participants, including service providers and business and scientific users. Acknowledgement. This paper is based on the work done in the framework of the SEE-GRID-SCI FP7 EC funded project, with partial support from NSFB grant D002–146/2008.
References 1. van der Aalst, W.M.P., Weijters, A.J.M.M. (eds.): Process Mining. Special Issue of Computers in Industry, vol. 53. Elsevier Science Publishers, Amsterdam (2004) 2. Al-Ali, R., von Laszewski, G., Amin, K., Hategan, M., Rana, O., Walker, D., Zaluzec, N.: QoS support for high-performance scientific Grid applications. In: Cluster IEEE International Symposium on Computing and the Grid, CCGrid 2004, pp. 134–143. IEEE Computer Society, Los Alamitos (2004) 3. Cohen, M.: SimWiz — a Self-Learning Tool for Learning Discrete Event System Simulation, Innovative Techniques in Instruction Technology, E-learning, Eassessment, and Education, pp. 370–374. Springer, Netherlands (2008) 4. Czajkowski, K., Fitzgerald, S., Foster, I., Kesselman, C.: Grid Information Services for Distributed Resource Sharing. In: Proceedings of the Tenth IEEE International Symposium on High-Performance Distributed Computing, HPDC-10 (2001) 5. Foster, I., Kesselman, C.: The grid: blueprint for a new computing infrastructure, 2nd edn., pp. 31–32. Morgan Kaufmann, San Francisco (2004) 6. Foster, I., Kesselman, C., Lee, C., Lindell, R., Nahrstedt, K., Roy, A.: A distributed resource management architecture that supports advance reservation and coallocation. In: Proceedings of the International Workshop on Quality of Service, pp. 27–36. IEEE Communications Society (1999) 7. Khalili, O., He, J., Olschanowsky, C., Snavely, A., Casanova, H.: Measuring the Performance and Reliability of Production Computational Grids. In: Proceedings of the 7th IEEE/ACM International Conference on Grid Computing, pp. 293–300. IEEE Computer Society, Los Alamitos (2006) 8. Misev, A.: Performance analysis of GRID middleware using process mining, Technical report for the Scientific Seminar, IPP BAS, August 28 (2007) 9. Misev, A., Atanassov, E.: Performance Analysis of GRID Middleware Using Process Mining. In: Bubak, M., van Albada, G.D., Dongarra, J., Sloot, P.M.A. (eds.) ICCS 2008, Part I. LNCS, vol. 5101, pp. 203–212. Springer, Heidelberg (2008) 10. Misev, A., Atanassov, E.: User Level Monitoring Tool. In: 4th EGEE User Forum, Catanina, Italy, EGEE (2009) 11. Oguz, A., Campbell, A.T., Kounavis, M.E., Liao, R.F.: The Mobiware Toolkit: Programmable Support for Adaptive Mobile Networking. IEEE Personal Communications Magazine 5(4), 32–43 (1998) 12. Sitera, Z., Skrabal, J., Vocu, M.: Advances in the L&B Grid Job Monitoring Service (2009), http://lindir.ics.muni.cz/dg_public/lb2.pdf 13. Soldatos, J., Polymenakos, L., Kormentzas, G.: Programmable Grids Framework Enabling QoS in an OGSA Context. In: Bubak, M., van Albada, G.D., Sloot, P.M.A., Dongarra, J. (eds.) ICCS 2004, Part III. LNCS, vol. 3038, pp. 195–201. Springer, Heidelberg (2004) 14. GLUE Specification v. 2.0 (2009), http://forge.gridforum.org/sf/go/doc15023?nav=1 15. Grid Resource Allocation Agreement Protocol (GRAAP) WG: Web Services Agreement Specification (WS-Agreement) (2007), http://forge.gridforum.org/sf/projects/graap-wg 16. OGSA-WG: Defining the Grid: A Roadmap for OGSA Standards (2005), http://www.ogf.org/documents/GFD.53.pdf 17. ProM tool for Process mining (2009), http://www.processmining.org/
Dictionary Compression and Information Source Correction
Dénes Németh, Máté Lakat, and Imre Szeberényi
Budapest University of Technology, Magyar Tudósok körútja 2, H-1117 Budapest, Hungary
Abstract. This paper introduces a method to compress, store, and search a dictionary of a natural language. The dictionary can be represented as groups of words derived from a stem. We describe how to represent and store a word group in a way that is compact and efficiently searchable. The compression efficiency of the algorithm used depends highly on the quality of the information source. The currently available tools and data sources contain several mistakes, which can be cleaned by the introduced method. The paper also analyzes the efficiency of XML and two binary formats, and proposes two methods, directed acyclic graph transformation and word group regrouping, that can be used to increase efficiency.
1 Introduction
With the development of computer technologies, human interaction with machines has also moved from punching and reading holes on a card to more user friendly forms. For humans, one of the most natural ways of conveying thoughts would be to express one's wishes to the machine in one's mother tongue. Taking into account the current state of technology, we are far from achieving this. The first step would be to correct the input data (typed or voice recognized). The purpose of this paper is to show how a dictionary of a language can be created, stored and searched in an effective way. The first section explains very briefly how a group of words derived from a single stem can be compressed and stored. The second section introduces the developed storage formats with their metrics and properties. The third section evaluates the conclusions that can be drawn from the statistical analysis of the compressed data, highlights the problems and proposes enhancements to the compression. The fourth section proposes an algorithm that can be used to determine the words which were incorrectly inserted into a single group of words derived from a stem, while the last section defines the data compression problem which would have to be solved to move the compression to the global level.
2 Word Group Compression
Let W denote a set of words derived from one stem. Let us assume that the size of this set is N, that W = {ω_1, ω_2, . . . , ω_N}, and that it is a set of distinct elements.
Let ω_{i,j} be the i-th word's j-th character, and let |ω_i| be the length of the i-th word. Let us define ∗ as the operation of concatenation, so ω_i = ω_{i,1} ∗ ω_{i,2} ∗ · · · ∗ ω_{i,|ω_i|}. Let us define the base word (B) to be the longest character sequence which is included in all ω_i words. Let us use the word decomposition algorithm described in [2] to create the affixums. The affixums consist of a minimal number of prefix and suffix set pairs (P_1, S_1), . . . , (P_m, S_m), which fulfill the following requirements:

1. \bigcup_{i=1}^{m} \{\, p ∗ B ∗ s : p \in P_i,\ s \in S_i \,\} = W \qquad (1)

2. There exist no i, j such that P'_j = P_j \setminus \{p_i \in P_j\} and (P_1, S_1), . . . , (P'_j, S_j), . . . , (P_m, S_m) still fulfills (1).

3. There exist no i, j such that S'_j = S_j \setminus \{s_i \in S_j\} and (P_1, S_1), . . . , (P_j, S'_j), . . . , (P_m, S_m) still fulfills (1).

The affixums and base words for each word group are stored in a database. The database consists of prefixes (P_i), suffixes (S_i), and base words (B_i), which are represented with variable branching trees. An affix is a pair of a prefix set and a suffix set. The database also holds the array of affix elements (AFF_i) and the array of affixums (AFS_i). If one type of object contains another type of element (for example, an affix contains a prefix), only a reference is stored. This means that if either part of two components is exactly the same, only one instance and a reference are stored.
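For illustration only, the base word B (the longest character sequence contained in every word of the group) can be found by a naive brute-force search such as the following Java sketch; this is not the decomposition algorithm of [2], only an illustration of the definition of B.

// Longest substring common to all words of a group, found by brute force
// over the substrings of the first word; adequate for illustration only.
class BaseWord {
    static String findBaseWord(String[] group) {
        String first = group[0];
        String best = "";
        for (int i = 0; i < first.length(); i++) {
            for (int j = first.length(); j > i + best.length(); j--) {
                String cand = first.substring(i, j);
                boolean inAll = true;
                for (String w : group) {
                    if (!w.contains(cand)) { inAll = false; break; }
                }
                if (inAll) { best = cand; break; }   // longest candidate starting at i
            }
        }
        return best;
    }
}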
3 Storage Formats
This section describes the different memory and disk based database storage formats that we have developed. The database can be stored in XML, in an indexed binary format, and in several non-indexed binary formats. Each of these formats has its advantages and disadvantages. XML is on one side the most compatible format, but on the other side it consumes significantly more space than the binary formats. The indexed property of the binary format allows much faster pattern matching, but consumes some extra space compared to the non-indexed binary format. In all of these formats some parts of the database are represented by variable branching trees. It is hard to store variable branching trees in computer systems due to the fact that the size and the number of connections to children may differ at each node. This makes indexing complicated, so we use a method which transforms a variable branching tree into a binary tree. The transformation is the following: every node has three connected nodes, its parent, its first child, and its next sibling. Figure 1 illustrates how a variable branching tree can be transformed into an identical binary tree. The horizontal arrows in the right figure represent the sibling relation, while the vertical arrows represent the first child relation. From this point onward let us always use this transformed form of the variable branching tree.
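A minimal Java sketch of the target representation is a generic node holding only its data, its first child and its next sibling; the class is illustrative and not part of the described implementation.

// Left-child/right-sibling node: a variable branching tree becomes a binary
// tree without losing any structural information.
class LcrsNode<T> {
    T data;                   // information stored on the node
    LcrsNode<T> firstChild;
    LcrsNode<T> nextSibling;

    LcrsNode(T data) { this.data = data; }

    // Attach a child: it becomes the first child, the previous first child
    // becomes its next sibling.
    LcrsNode<T> addChild(T childData) {
        LcrsNode<T> child = new LcrsNode<>(childData);
        child.nextSibling = this.firstChild;
        this.firstChild = child;
        return child;
    }
}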
Fig. 1. Variable branching tree transformation
The information to be stored on a node is the data related to the node. If the amount of this information is the same, then there are three types of nodes with respect to the amount of space required to store a single node with its connections: nodes with 0, 1, or 2 connected nodes (first child and/or next sibling). This type of tree can easily be represented in XML due to the recursive capabilities of the XML format, so we only introduce the developed binary indexed and non-indexed formats.
3.1 Binary Non-indexed Tree Representation
In this format each node is represented in the following way, recursively, starting with the root node. The first bit is used to represent whether the node has a first child, then a second bit is used to represent whether the node has a next sibling, and then the information residing on the node is serialized. The only requirement on this data is that it is stored in a way that allows it to reconstruct itself if, at the start of the reconstruction phase, the binary stream handler is positioned at the first bit of the data of the node; after serialization the binary stream handler should be placed right after the serialized binary data. Each node is followed by its child, and the next sibling is placed after the complete subtree of its child, if it exists. In this way no links representing the connections are needed. This format is the most compact, but it has the disadvantage of being unable to process the next sibling node without processing the complete subtree of the first child. Due to this property this format can mainly be used for data transport or persistent data serialization, but not as a cost-efficiently searchable database.
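The following Java sketch illustrates this pre-order serialization, reusing the node type sketched in the previous section; BitSink is a placeholder for an arbitrary bit-level output stream and is not part of the described implementation.

// For each node: one bit for "has first child", one bit for "has next sibling",
// then the node data, then the child's subtree, then the sibling's subtree.
interface BitSink {
    void writeBit(boolean b);
    void writeData(Object nodeData);   // the node data must be self-delimiting
}

class NonIndexedWriter {
    static <T> void write(LcrsNode<T> node, BitSink out) {
        out.writeBit(node.firstChild != null);
        out.writeBit(node.nextSibling != null);
        out.writeData(node.data);
        if (node.firstChild != null) write(node.firstChild, out);     // complete subtree of the child...
        if (node.nextSibling != null) write(node.nextSibling, out);   // ...then the next sibling
    }
}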
3.2 Binary Indexed Tree Representation
This format stores the tree in an indexed way, which eliminates the drawback of the non-indexed representation of being unable to process the next sibling of the node without pre processing the complete sub tree of the first child if it exists. This serialization consists of two parts: the nodes in the tree and a header, which stores the metadata of the serialization and indexes. The header consists of the following items, in which the number in the brackets represents the amount
of bits the data element is stored on. In the header let K = log2(sizeof(int)). Every number in the header is represented in the following way: first, on K bits, a length is stored, which specifies on how many bits the number itself is stored after these K bits. The header consists of the following elements:
– (2 bit) The type of the root
– (K bit) The size of the leaf index size
– (The size of the leaf index size): The number of leaf nodes
– (K bit) The size of the single index size
– (The size of the single index size): The number of single nodes
– (K bit) The size of the double index size
– (The size of the double index size): The number of double nodes
– (K bit) The size of the single offset size
– (The size of the single offset size): The offset of the first single node relative to the end of the header
– (K bit) The size of the double offset size
– (The size of the double offset size): The offset of the first double node relative to the end of the header

The second part of the serialization is the serialized nodes contained in the tree. The nodes are grouped into three categories: nodes with 0, 1 or 2 connected nodes. Each node in all categories begins with 2 bits, which represent the category of the node. The first bit is 1 if the node has a first child, and the second bit is 1 if it has a next sibling node. After these two bits the data portion of the node is serialized, which is followed by the node type dependent indexes of the directly descended nodes. Let us define INDEX as the number of bits required to represent any index offset within one single tree in any of the categories.
– Leaf node: no index is stored
– Single node:
  • (2): The category of the connected node
  • (1): 1 = the descended node is a first child
  • (INDEX): The index of the descended node (the location of the descended node is typeoffset + nodesize * index bits after the header)
– Double node:
  • (2): The category of the first child
  • (2): The category of the next sibling
  • (INDEX): The index of the first child (the location of the descended node is typeoffset + nodesize * index bits after the header)
  • (INDEX): The index of the next sibling (the location of the descended node is typeoffset + nodesize * index bits after the header)

Figure 2 shows the achieved compression ratio. The x axis represents the number of word groups, while the y axis represents the compression ratio relative to the uncompressed size of the data. The lines in the figure were created by fitting a power law to the different data sets using the least squares method. These lines are nearly parallel, which means that the efficiencies of the developed storage formats differ only by a constant factor.
Fig. 2. The compression ratio (x axis: number of basewords; y axis: compression ratio; series: Uncompressed XML.raw, bin.indexed, bin.noindex, XML.bz2, bin.indexed.bz2, bin.noindex.bz2)
Fig. 3. The size of the database (x axis: number of basewords; y axis: the size of the DB [MB]; series: bin.indexed, bin.noindex, bz2(xml), bz2(bin.noindex), bz2(bin.indexed))
Figure 3 shows the size of the database for the different storage formats. If bzip2 is applied to the different forms of the database, a higher compression ratio can be achieved. Comparing each format with its bzip2-compressed counterpart, the biggest difference is between the XML and its compressed form. The XML format has no indexing capabilities, so the bzip2-compressed XML should be compared with the bzip2-compressed non-indexed binary format, which provides better compression. Comparing the binary indexed format with its bzip2-compressed form shows that bzip2 cannot compress the indexed data as effectively as it compresses the non-indexed data.
4
Statistical Analysis
This section elaborates on the conclusions that can be drawn from a statistical analysis of the data in the database. Let us call a word group valid if all of its words share at least one common letter. In the compression 50000 word groups were processed and only 39000 were valid. A word group can become invalid if the stem is mutilated by the derivation process or by the faulty creation of the word group. The first is a natural but very rare event: in Hungarian it happens mainly when the stem consists of one or two letters, which occurs in less than 0.1 percent of the words. In the compression, however, the share of faulty word groups was around 17 percent. They were mainly caused by errors in the dictionary and bugs in the hunspell and aspell spellchecker programs [3,4]. These errors may result in words derived from different stems being grouped into the same group.
4.1 Word Group Cutting
This section proposes an algorithm that can be used to divide a word group into smaller groups. These smaller groups try to couple words for which the word group compression provides a better compression ratio. The subgroups are then divided into two categories: (1) groups of correct words, and (2) words which do not belong to the group (faulty stem determination) or misspelled words (accepted as correct by the spellchecker). The type 2 groups can be manually analyzed or automatically dropped from the database. The algorithm that can be used to determine these types is the following. Let the original word group be G = {ω_1, ω_2, ..., ω_N}. For every word ω_i let us select the subword S = ω_{i,j} ω_{i,j+1} ... ω_{i,j+l} in every possible way. Let M be the number of words which include this subword S. Let the table T consist of the different (l, M) pairs, and let us store one index i for each (l, M) pair. For every subword S let us add a line to the table if every line in the table would still be distinct after the addition. This table contains at most |ω_i| * N entries and can be used to select the best base word.
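One possible reading of this construction is sketched below. It is an illustrative interpretation only, not the authors' implementation; the container choices and the brute-force substring counting are assumptions made to keep the example short.

#include <map>
#include <string>
#include <utility>
#include <vector>

// For every subword S of every word, count how many words of the group contain S,
// and keep one representative word index per distinct (length, count) pair.
std::map<std::pair<std::size_t, std::size_t>, std::size_t>
buildSubwordTable(const std::vector<std::string>& group) {
    std::map<std::pair<std::size_t, std::size_t>, std::size_t> table; // (l, M) -> i
    for (std::size_t i = 0; i < group.size(); ++i) {
        const std::string& w = group[i];
        for (std::size_t j = 0; j < w.size(); ++j) {
            for (std::size_t l = 1; j + l <= w.size(); ++l) {
                const std::string s = w.substr(j, l);
                std::size_t m = 0;                      // M: words containing s
                for (const std::string& other : group)
                    if (other.find(s) != std::string::npos) ++m;
                table.emplace(std::make_pair(l, m), i); // keep first index per pair
            }
        }
    }
    return table;                                       // at most |w_i| * N distinct rows
}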
4.2 DAG Transformation
We analyzed why bzip2 was able to further compress the non-indexed binary database, and we came to the conclusion that this is mainly caused by two properties of the database. First, the variable-branching tree representing a prefix or suffix has subtrees which are alike; without the indexes bzip2 recognizes these as identical character sequences and is able to compress them. Second, since prefixes and suffixes are not connected further to any other elements, each element can be compressed independently. Our binary representation and pattern matching functionalities do not depend on the parent relation between any of the tree elements. Without the requirement of the parent relation, the tree can easily be transformed into a DAG [1], which will enhance the compression.
The non-indexed tree representation requires the ancestor nodes of an element to always be present, so of our binary formats only the indexed version is capable of handling DAGs. In this case no modification of the storage format is needed.
5
Towards Global Optimality
The algorithm described in the article [2] provides an efficient solution for compressing a single group of words. If the affix groups created from groups of words are regrouped, violating the local optimality within the affix (prefix or suffix) group, a better overall compression can be achieved. This section describes the mathematical problem that has to be solved to provide a better compression. Let M = {ω_1, ω_2, ..., ω_n} be the set of all affix words (prefixes or suffixes), let ω_i be the i-th affix word, and let G_i denote the set of the i-th affix. Let us assume that we have m affix sets, so G_1 ⊆ M, G_2 ⊆ M, ..., G_m ⊆ M. Let k be the storage cost of a single word ω_i, and let c_i be the number of references to the i-th affix. Let us find the minimal number of sets G_1 ⊆ M, G_2 ⊆ M, ..., G_m ⊆ M and the n × m matrix T and m × m matrix S which fulfil the following requirements:

– T[i, j] = 1 if ω_i ∈ G_j, and T[i, j] = 0 if ω_i ∉ G_j
– S[i, j] = 1 if G_j ⊆ G_i and G_j ≠ G_i, and S[i, j] = 0 otherwise
– ∀i ∈ [1, m]:
– k · Σ_{∀i,j: S[i,j]=1} T[i, j] + Σ_{i=1...m} c_i · Σ_{j=1...m} S[i, j] is minimal

6
Conclusions
The single word group compression algorithm can only be used efficiently if the information source to be compressed contains very few errors. The currently available spellchecker tools and databases unfortunately cannot produce high quality groups of words derived from a single stem. As a consequence the data has to be cleaned beforehand, and only those algorithms which take multiple word groups into account simultaneously will be able to compress the data efficiently. The indexed binary form of the database is efficiently searchable and makes it possible to use DAGs as the variable-branching trees. On standard computers the database easily fits into memory, which makes the searching most efficient. However, if the database needs to be stored and used on systems with very limited memory, it can be searched by partially importing the required parts of the database.
Acknowledgments Part of this work was funded by the Pázmány Péter program (RET-06/2005) of the National Office for Research and Technology, and the authors would like to thank the Enabling Grids for E-sciencE project (INFSO-RI-222667).
References

1. Buneman, P., Grohe, M., Koch, C.: Path queries on compressed XML. In: Freytag, J.C., et al. (eds.) Proc. VLDB 2003, pp. 141–152. Morgan Kaufmann, San Francisco (2003)
2. Németh, D.: Parallel dictionary compression using grid technologies. In: Lirkov, I., Margenov, S., Waśniewski, J. (eds.) LSSC 2007. LNCS, vol. 4818, pp. 492–499. Springer, Heidelberg (2008)
3. Németh, L., Trón, V., Halácsy, P., Kornai, A., Rung, A., Szakadát, I.: Leveraging the open-source ispell codebase for minority language analysis. In: Proceedings of SALTMIL 2004, pp. 56–59. European Language Resources Association (2004)
4. Viktor, T., György, G., Péter, H., András, K., László, N., Dániel, V.: Hunmorph: open source word analysis. In: Proceedings of the ACL Workshop on Software, Association for Computational Linguistics, Ann Arbor, Michigan, pp. 77–85 (2005)
A Parallelization of Finite Volume Method for Calculation of Gas Microflows by Domain Decomposition Methods

Kiril S. Shterev and Stefan K. Stefanov

Institute of Mechanics, Bulgarian Academy of Sciences, Acad. G. Bonchev, Block 4, Sofia 1113, Bulgaria
{kshterev,stefanov}@imbm.bas.bg
http://www.imbm.bas.bg
Abstract. In this paper a parallel organization of a finite volume algorithm for calculation of two-dimensional compressible, viscous gas flows is presented. The problem belongs to the newly emerging area of micro gas flows taking place in Micro-Electro-Mechanical Systems (MEMS). Technically, the parallel algorithm is organized using standard MPI with non-blocking communications, and the latency of the communications is overlapped with useful work. The speedup was estimated on two clusters. The first cluster uses Myrinet interconnections (BG04-ACAD). The second uses conventional cards for interconnections (BG03-NGCC). Both clusters are a part of the GRID infrastructure of the European Research Area (ERA). An ideal speedup is obtained on BG04-ACAD for up to 20 CPUs. The speedup obtained on BG03-NGCC is very good as well, although it depends on the mesh refinement. Keywords: Finite volume method, gas microflows, parallel algorithms, GRID.
1
Introduction
The computational analysis of fluid dynamics problems depends strongly on the available computational resources [11]. The computational demands are related mainly to the CPU performance and the memory size. The example considered in this paper, the calculation of a two-dimensional gas flow past a particle moving at supersonic speed in a planar microchannel, is a typical problem demonstrating this kind of computational requirement. We consider a supersonic flow with Mach number equal to 2.43. The problem is described in detail in [9]. The supersonic speed leads to the existence of a bow shock wave, which reflects from the channel walls. As a result, past the particle we obtain a complex picture of interacting reflected shock waves (see Fig. 1). The shock waves have significant gradients of velocity, pressure, and temperature. Thus, an accurate calculation of the flow requires a very fine or adaptive grid. Calculations of this problem have been carried out for a set of gradually refined meshes. Finally, a mesh with 8000x1600 cells was found to give stable and accurate enough results. It is easy
Fig. 1. Horizontal velocity (upper part) and pressure (lower part) fields calculated by parallel FVM [9]
to predict that a calculation of this problem on a single PC will take years. Obviously, a parallel organization of the computational process is required in order to overcome this difficulty. In the next section we present how this can be accomplished.
2
Parallel Organization
Parallel organizations of FVM algorithms are presented in many papers (for examples, see [12,6]). We present here the main elements of the parallel organization of the finite volume method (FVM) [8,11] for calculation of the considered gas flow. The corresponding serial algorithm is presented in detail in [10], and the same notations are used here. A domain decomposition (data partitioning) approach is used. The idea is to use a single instruction stream and multiple data streams (SIMD) in accordance with Flynn's taxonomy. The realization was accomplished using standard MPI (Message Passing Interface) [3] instructions. This approach makes it possible to run the code on symmetric multiprocessing (SMP) systems and computer clusters. Most modern supercomputers are now highly-tuned computer clusters using commodity processors combined with custom interconnects [5]. Here the sent/received packages are of small size: the size of a package is less than 50 KB, which is negligible for a bandwidth of 1 Gbps. Therefore the bottleneck of the communications is the latency. There are a lot of communications in one iteration of the FVM. To reach a high parallel efficiency, non-blocking communications are used and the latency is overlapped with calculations. Figure 2 presents an example of decomposition of a domain into two processes. The halo region for process 0 contains a column of variables, some of them positioned in the centre and others on the face of the cells. The halo region for process 1 contains two columns of variables, placed in the centre and on the face
Fig. 2. Domain decomposition of FVM computational domain
of the cells. When the halo regions are set in this way, the minimum number of sent/received messages between neighbours is reached. The steps in the parallel FVM are the same as in the serial FVM [10].

Parallel FVM

1. Initial guess of u, v, p, T and calculation of ρ using the equation of state.
Start loop 1
• 2. Initial conditions for the current time step: t = t + Δt, u^(n−1) = u, v^(n−1) = v, p^(n−1) = p, T^(n−1) = T, ρ^(n−1) = ρ.
• Start loop 2 (current time step calculations):
•• 3. Evaluate flux and diffusion terms: F^x, F^y, D^ux, D^uy, D^vx, D^vy, D^Tx, D^Ty.
•• 4.1. Evaluate the pseudo velocity û, the coefficient d^u and the coefficients a^px, b^px of the pressure equation in the boundary region.
•• 4.2. Initiate sending of boundary values of a^px and b^px to neighbours and initiate receipt of halo values of a^px and b^px from neighbours.
•• 4.3. Calculate boundary values of the pseudo velocity v̂, the coefficient d^v and the coefficients a^py, b^py of the pressure equation.
•• 4.4. Initiate sending of boundary values of a^py and b^py to neighbours and initiate receipt of halo values of a^py and b^py from neighbours.
•• 4.5. Calculate non-boundary values of the pseudo velocity û, the coefficient d^u and the coefficients a^px and b^px of the pressure equation.
•• 4.6. Calculate non-boundary values of the pseudo velocity v̂, the coefficient d^v and the coefficients a^py and b^py of the pressure equation.
•• 4.7. Wait for completion of the sending of boundary values of a^px, b^px, a^py, and b^py and wait for completion of the receipt of halo values of a^px, b^px, a^py, and b^py.
•• Start loop 3:
••• 5.1. Calculate temperature and pressure in the boundary cells, using the energy and pressure equations.
••• 5.2. Initiate sending of the boundary values of p and T to neighbours and initiate receipt of halo values of p and T from the neighbours.
••• 5.3. Calculate temperature and pressure in the non-boundary cells.
••• 5.4. Wait for completion of the sending of boundary values of p and T and wait for completion of the receipt of halo values of p and T.
••• 5.5. Initiate sending of maximum residuals for the equations of pressure and energy from each process to process 0 and initiate receipt of maximum residuals for the equations of pressure and energy from all processes in process 0.
••• 5.6. Initiate array pold for the next iteration in loop 3: pold = p.
••• 5.7. Wait for completion of the sending of maximum residuals for the equations of pressure and energy and wait for completion of the receipt of maximum residuals for the equations of pressure and energy in process 0.
••• 5.8. Loop 3 continues until the convergence criteria are satisfied or the maximum number of iterations is reached. Initiate sending and receipt of the information to stop or continue loop 3.
••• 5.9. Initiate array Told for the next iteration in loop 3: Told = T.
••• 5.10. Wait for completion of the sending and receipt of the information to stop or continue loop 3.
•• End loop 3. At least 2 iterations have to be performed to ensure convergence. The detailed analysis of the two-dimensional unsteady pressure-driven channel flow [10] showed that 2 iterations were enough to achieve good accuracy.
•• 6.1. Calculate u in the boundary cells.
•• 6.2. Initiate sending of boundary values of u to neighbours and initiate receipt of halo values of u from neighbours.
•• 6.3. Calculate v in the boundary cells.
•• 6.4. Initiate sending of boundary values of v to neighbours and initiate receipt of halo values of v from neighbours.
•• 6.5. Calculate u in the non-boundary cells.
•• 6.6. Initiate sending of maximum residuals for u from each process to process 0 and initiate the receipt of the maximum residuals for u from all processes in process 0.
•• 6.7. Calculate v in the non-boundary cells.
•• 6.8. Initiate sending of maximum residuals for v from each process to process 0 and initiate receipt of maximum residuals for v from all processes in process 0.
•• 6.9. Calculate the array containing information for √T. (This is useful as √T is often used in the algorithm.)
•• 6.10. Wait for completion of the sending of boundary values of u and v and wait for completion of the receipt of halo values of u and v.
•• 6.11. Wait for completion of the sending of maximum residuals for u and v and wait for completion of the receipt of maximum residuals for u and v in process 0.
•• 6.12. Check the conditions to stop loop 2.
•• 6.13. Initiate sending of the information to continue or to stop loop 2 from process 0 to all processes and initiate receipt of the decision to stop or to continue loop 2 in all processes.
•• 6.14. Calculate the density ρ, using the equation of state and the p and T calculated in loop 3.
•• 6.15. Wait for completion of the sending and receipt of the information to stop or continue loop 2.
• End loop 2.
End loop 1: The calculation is stopped when the end time or the stationary state is reached. If these criteria are not reached, go to step 2.
Note: The equations for p and T in loop 3 are solved using the Jacobi iterative method (see [7]).
The organization of sending and receiving messages is done fully manually in order to make the use of non-blocking communications possible. To this aim a structure DomainDecomposition is created. It contains information for every process. The following functions are used for the calculation of tags and indices of requests of messages:
• CalculateTagForSendVariableToNeighbour and CalculateIndexRequestForSendOrRecvVariableNeighbour calculate the tag and the index of request of each message sent to a neighbour. Both functions receive a unique index for each variable (i_variable) that must be sent (for example, the index for pressure is 1, for temperature 2, etc.) and the index of the neighbour (i_neighbour). The variable zero_tag_for_this_IJ is the maximum tag of all processes on the left (Fig. 2).
• CalculateTagForSendVariableToIJ0 and CalculateIndexRequestForSendVariableToIJ0 calculate the tags and indices of request of the messages sent to process 0. Both functions use the following variables: i_variable; N_exchanged_variables_with_neighbours, the number of all variables which are sent to/received from neighbours; N_neighbours, the number of neighbours (4 for the 2D case); N_actions, which is equal to 2 (two possible actions: send and receive).
• CalculateTagForSendVariableFromIJ0 and CalculateIndexRequestForRecvVariableFromIJ0 calculate the tags and indices of request of the messages sent from process 0 to the other processes, where N_send_variables_to_IJ0 is the number of variables sent to process 0.
A part of the source code containing the functions for calculation of tags and indices of requests of messages:
inline int CalculateTagForSendVariableToNeighbour(
    const int& i_variable, const int& i_neighbour) {
  return (zero_tag_for_this_IJ + i_variable * N_neighbours + i_neighbour);
};

inline int CalculateIndexRequestForSendOrRecvVariableNeighbour(
    const int& i_variable, const int& i_neighbour,
    const int& send_or_recv_data) {
  return (i_variable * N_actions * N_neighbours
          + send_or_recv_data * N_neighbours + i_neighbour);
};

inline int CalculateTagForSendVariableToIJ0(const int& i_variable) {
  return (zero_tag_for_this_IJ
          + N_exchanged_variables_with_neighbours * N_neighbours + i_variable);
};

inline int CalculateIndexRequestForSendVariableToIJ0(const int& i_variable) {
  return (N_exchanged_variables_with_neighbours * N_actions * N_neighbours
          + i_variable);
};

inline int CalculateTagForSendVariableFromIJ0(const int& i_variable) {
  return (zero_tag_for_this_IJ
          + N_exchanged_variables_with_neighbours * N_neighbours
          + N_send_variables_to_IJ0 + i_variable);
};

inline int CalculateIndexRequestForRecvVariableFromIJ0(const int& i_variable) {
  return (N_exchanged_variables_with_neighbours * N_actions * N_neighbours
          + N_send_variables_to_IJ0 + i_variable);
};
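For context, the sketch below shows the generic non-blocking MPI pattern that such tags and request indices serve: boundary values are posted with MPI_Isend/MPI_Irecv, interior cells are computed while the messages are in flight, and MPI_Waitall is called before the halo values are used. It is an illustrative outline, not the authors' code; the array names and the one-dimensional two-neighbour layout are assumptions.

#include <mpi.h>
#include <vector>

// Generic halo exchange with communication/computation overlap (illustration only).
void exchange_and_compute(std::vector<double>& send_left,  std::vector<double>& send_right,
                          std::vector<double>& halo_left,  std::vector<double>& halo_right,
                          int left, int right, int tag, MPI_Comm comm) {
    MPI_Request req[4];
    int n = 0;
    if (left != MPI_PROC_NULL) {
        MPI_Irecv(halo_left.data(),  (int)halo_left.size(),  MPI_DOUBLE, left,  tag, comm, &req[n++]);
        MPI_Isend(send_left.data(),  (int)send_left.size(),  MPI_DOUBLE, left,  tag, comm, &req[n++]);
    }
    if (right != MPI_PROC_NULL) {
        MPI_Irecv(halo_right.data(), (int)halo_right.size(), MPI_DOUBLE, right, tag, comm, &req[n++]);
        MPI_Isend(send_right.data(), (int)send_right.size(), MPI_DOUBLE, right, tag, comm, &req[n++]);
    }

    // ... compute the non-boundary (interior) cells here, overlapping the latency ...

    MPI_Waitall(n, req, MPI_STATUSES_IGNORE);   // after this, the halo values are valid
    // ... compute the cells that depend on the halo values ...
}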
3
Speedup Analysis
The speedup of the parallel FVM for calculations of gas microflows was estimated on two clusters. The first cluster uses a Myrinet [4] interconnection (BG04-ACAD). The second cluster uses conventional cards for the interconnection (BG03-NGCC). The characteristics of the clusters are shown in Table 1. They are part of the GRID infrastructure [2] of the European Research Area (ERA). The tests were submitted using a user certificate [1]. The speedup is calculated as Sn = Ts/Tpar and the efficiency as En = Sn/n, where n is the number of cores (CPUs), Sn is the speedup when n cores are used, Ts is the time for a sequential run of the program (run on a single core), Tpar is the run time of the code on n cores, and En is the efficiency when n cores are used. The test problem mentioned in the introduction is a calculation of a supersonic flow past a small square-shaped particle moving in a microchannel, Fig. 1. The test calculations were carried out on two meshes: 500x100 and 1000x200 cells. The calculation of the 500x100-cell case took around 2.5 hours on a single core.
Table 1. The characteristics of the clusters BG04-ACAD and BG03-NGCC

Cluster:            BG04-ACAD                      BG03-NGCC
Numbers of cores:   80                             200
CPU model:          AMD Opteron(tm) Processor 250  Intel(R) Xeon(R) CPU E5430
CPU GHz:            2.4 GHz                        2.66 GHz
Cache size:         1024 KB                        6144 KB
RAM per core:       2 GB                           2 GB
Interconnection:    Myrinet                        Conventional cards
Fig. 3. The speedup of the clusters BG04-ACAD and BG03-NGCC, for meshes: a) 500x100 cells and b) 1000x200 cells
The results concerning the speedup are shown in Fig. 3. The speedup for BG04-ACAD was calculated by taking the minimal run time obtained from 6 runs of each case. The speedup for BG03-NGCC was calculated by taking the minimal time obtained from 11 runs of each case. The tests on BG03-NGCC needed more runs than those performed on BG04-ACAD, because there was a larger variation in the run times obtained on BG03-NGCC due to the use of conventional cards for the interconnection. A super-linear speedup was reached on the cluster BG04-ACAD for the case with the 500x100-cell mesh, Fig. 3a). The reason for this is the cache effect. The program, run on a single core, needs around 14.5 MB of memory. Increasing the number of processes decreases the memory used per process and, in the case with 20 processes, all data can be placed in the cache memory of the processors (Table 1). The case with 4 times more cells (1000x200) needs around 58 MB on a single core, Fig. 3b), so the available cache memory is not enough to hold all the data and a super-linear speedup cannot be reached. The speedup for BG03-NGCC is very good, Fig. 3a), even for the small problem on the 500x100-cell mesh. For the finer mesh (1000x200 cells) the speedup, Fig. 3b), is excellent, even though BG03-NGCC uses conventional cards
for the interconnection. The speedup line is not very smooth (Fig. 3) because of the slight interference with other runs performed simultaneously by other users on the clusters.
4
Conclusions
The non-blocking communications make it possible to reach an excellent speedup. A super-linear speedup was reached on the cluster with Myrinet interconnection (BG04-ACAD). A very good speedup was reached on the cluster BG03-NGCC for the case of the mesh with 500x100 cells, and an excellent speedup for the mesh with 1000x200 cells. These results show that a good parallel organization makes the algorithm efficient even when run on clusters with conventional cards for the interconnection. Acknowledgments. The authors appreciate the financial support of NSFB DO 02-115, Module 1. The author K. Shterev also appreciates the financial support of THE EUROPEAN SOCIAL FUND, OPERATIONAL PROGRAMME HUMAN RESOURCES DEVELOPMENT, GRANT No: BG051PO001/07/3.3-0255/17.06.2008. The results were obtained using the grid sites of the South East European Regional Operating Centre of the EGEE project (http://public.eu-egee.org).
References

1. Bulgarian Academic Certification Authority (BG.ACAD|CA), http://ca.acad.bg
2. Enabling Grids for E-sciencE (EGEE), http://www.eu-egee.org
3. MPI Forum, http://www.mpi-forum.org
4. Myricom Inc., http://www.myri.com
5. Supercomputers, www.top500.org
6. Bui, T.T.: A parallel finite-volume algorithm for large-eddy simulation of turbulent flows. Computers & Fluids 29, 877–915 (2000)
7. Em, K.G., Kirby II, R.M.: Parallel Scientific Computing in C++ and MPI. A Seamless Approach to Parallel Algorithms and their Implementation. Cambridge University Press, Cambridge (2003)
8. Patankar, S.V.: Numerical heat transfer and fluid flow. Hemisphere Publishing Corporation, New York (1980)
9. Shterev, K.S., Stefanov, S.K.: Finite Volume Calculations of Gaseous Slip Flow Past a Confined Square in a Two-Dimensional Microchannel. In: Proceedings of the 1st European Conference on Microfluidics, La Societe Hydrotechnique de France (2008)
10. Shterev, K.S., Stefanov, S.K.: Pressure based finite volume method for calculation of compressible viscous gas flows. Journal of Computational Physics 229, 461–480 (2010), doi:10.1016/j.jcp.2009.09.042
11. Versteeg, H.K., Malalasekra, W.: An Introduction to Computational Fluid Dynamics: The Finite Volume Method, 2nd edn. Prentice Hall, Pearson (2007)
12. Wang, P., Ferraro, R.D.: Parallel multigrid finite volume computation of three-dimensional thermal convection. Computers & Mathematics with Applications 37, 49–60 (1999)
Background Pollution Forecast over Bulgaria

D. Syrakov1, K. Ganev2, M. Prodanova1, N. Miloshev2, G. Jordanov2, E. Katragkou3, D. Melas3, A. Poupkou3, and K. Markakis3

1
National Institute of Meteorology and Hydrology, Bulgarian Academy of Sciences, Sofia, Bulgaria
[email protected]
2 Geophysical Institute, Bulgarian Academy of Sciences, Sofia, Bulgaria
3 Laboratory of Atmospheric Physics, Aristotle University of Thessaloniki, Greece
Abstract. Both the current level of air pollution studies and the social needs in the country are mature enough for creating a Bulgarian Chemical Weather Forecasting and Information System. The system is foreseen to provide, in real time, forecasts of the spatial/temporal air quality behaviour for the country and (with higher resolution) for selected sub-regions and cities, on the basis of the weather forecast and the national emission inventory. The country-scale part of the system is designed. It is based on the US EPA Models-3 System. The meteorological input is the ALADIN output, ALADIN being the national numerical weather forecast tool. The boundary conditions are prepared by a similar system running operationally at the Aristotle University of Thessaloniki, Greece (AUTH). A special interface is created to retrieve the AUTH-system forecasts in real time, producing boundary files that are uploaded to a dedicated server in Bulgaria. In the paper, a detailed description of the system is given together with the first results of its testing.
1
Introduction
Air quality forecasting is an extremely challenging scientific problem, which has recently emerged as an important priority in many urbanized and industrialized countries due to the increasing harmful effects on health and the environment caused by airborne pollution constituents like ozone and particulate matter (PM). One of the requirements of the Ozone Directive 2002/3/EC [5] is the availability of a trustworthy air pollution forecast system. The goals of reliable air quality forecasts are the efficient control and protection of population exposure as well as possible emission abatement measures. In recent years the concept of "chemical weather" has arisen, and in many countries respective forecast systems are being developed along with the usual meteorological weather forecasts (see, for instance, [7,9,10,11,12], and many others). As air pollution crosses national borders, it would be cost-effective and beneficial for citizens, society and decision-makers that national chemical weather forecast and information systems be networked across Europe. For the
purpose, a new COST Action ES0602 "Towards a European Network on Chemical Weather Forecasting and Information Systems (ENCWF)" [15] was launched, aiming at providing a forum for harmonizing, standardizing and benchmarking approaches and practices in data exchange for real-time air quality forecast information systems in Europe. Bulgaria was one of the first countries that joined COST Action ES0602, but so far it does not have its own Chemical Weather Forecast System, even though air pollution causes serious damage to human health and constructions. The harmful effects of the background pollution on natural and agricultural ecosystems are noticeable, too. All this motivated a project, financially supported by the National Science Fund, aiming at creating a Bulgarian Chemical Weather Forecast and Information System (BGCW) intended to provide timely, informative and reliable forecast products tailored to the needs of various users and decision-makers. The system is planned to have a nesting structure, starting from the region of Bulgaria and the nearest territories as a whole (background pollution) and zooming to smaller areas of interest. In the paper, the structure of the country part of the system is described. Some preliminary results addressed to model validation and verification are presented as well.
2
Operational Design of BGCW
The country (background) part of BGCW is designed in a way to fit the real-time constraints and to deliver forecasts twice a day (00 and 12 UTC) for the next 48 hours. The US EPA Models-3 air quality modelling system is used here, consisting of: (i) CMAQ — the Community Multi-scale Air Quality model [16], being the chemical-transport model (CTM) of the system; (ii) MM5 — the 5th generation PSU/NCAR Meso-meteorological Model [14], used as meteorological pre-processor to CMAQ; and (iii) SMOKE — the Sparse Matrix Operator Kernel Emissions Modelling System [17], being the emission pre-processor to CMAQ. Meteorological forecasts are obtained at the main synoptic hours using the MM5 mesoscale meteorological model forced by the ALADIN output, ALADIN being the national weather forecast tool. Three nested modelling domains are set: the ALADIN domain is the outer and biggest one, the MM5 domain is nested in it, and the CMAQ domain, covering Bulgaria, is nested in the MM5 one. The MM5 (respectively CMAQ) domain resolution is 10 km. The MM5 vertical structure consists of 23 σ-levels with varying thickness, extending up to 100 hPa height. Proper physical options are set. MM5 starts its calculation 12 hours earlier for spin-up reasons (a 60-hour run). In Fig. 1, the data flow diagram for one 48-hour cycle is displayed. In the boxes, together with the names of the system's elements, the format of the respective output is given. The white boxes present Models-3 components, the dark gray ones — newly created interface modules (Fortran codes). The light grey boxes present the data input to the system. First of all, this is the meteorological forecast created by ALADIN, whose calculations drive MM5. MCIP,
Fig. 1. 48-hour data flow of CW forecast calculations
the Meteorology-Chemistry Interface Processor, is part of CMAQ and, together with the needed meteorological parameters, prepares some other data (fluxes, dry deposition velocities, etc.) to be used by CMAQ and SMOKE. The Area Source (AS) gridded inventory feeds AEmis (the AS emission processor). The Large Point Source (LPS) inventory is input to SMOKE, together with the ambient meteorological data, so as to produce LPS emissions (LPS processor). The met-data together with the gridded land-use are used by SMOKE to produce biogenic emissions (BgS processor). SMOKE is used once more to merge these 3 emission files into a model-ready emission input. Apart from these two main inputs (meteorology and emissions), CMAQ needs initial and boundary conditions. The initial conditions are taken from the previous CMAQ run. The case of the boundary conditions (BC) is much more complicated. BCs are of great importance for small regions like Bulgaria. For such small areas, multiple nesting is usually applied. In BGCW, the boundary conditions are provided by the chemical weather forecast system running operationally at the Aristotle University of Thessaloniki (AUTH), Greece [9,10]. The AUTH system exploits the nested domain approach. There, the air quality forecast is carried out for Europe (50 km spatial resolution), the Balkans (10 km), and Athens (2 km) using a modelling system which consists of the prognostic meteorological model MM5 and the photochemical air quality model CAMx [4]. Over the European region MM5 is forced by the AVN/NCEP global forecast with default (climatic) boundary conditions. The AUTH system is run once a day, producing a 3-day pollution forecast. The boundary conditions for CMAQ are prepared in two steps. First, the CAMx data is interpolated to the CMAQ boundary points, an operational procedure that takes place at AUTH, and the results are uploaded via the Internet to a dedicated server in Sofia. Here, this data is processed so as to produce a 3-day CMAQ-ready boundary condition file. The last box in Fig. 1 tags activities that are not done yet. So far, the main efforts concern system verification and validation against measurement data.
3
Emission Modelling
CMAQ demands its emission input in a specific format reflecting the time evolution of all pollutants accounted for by the chemical mechanism used. The emission inventory is usually made on an annual basis for, as a rule, big territories (municipalities, counties, countries, etc.), and many pollutants are estimated as groups like VOC and PM2.5. In preparing the CMAQ emission file, a number of specific estimates must be made. First, all this information must be gridded. Secondly, time variation profiles must be superimposed on these annual values to account for seasonal, weekly, and daily variations. Finally, the VOC and PM2.5 emission estimates must be split into better defined compounds (speciation) in order to be properly modelled for chemical transformations and deposition. Obviously, models are needed as emission pre-processors to CTMs. Such a component in the Models-3 system is SMOKE. Unfortunately, it is highly adapted to the US conditions and is only partly used here, for calculating LPS and BgS emissions and for merging the three types of emission files into a unique CMAQ input. The emission input to BGCW is foreseen to be the Bulgarian National Emission Inventory for 2007. It is elaborated for AS and LPS separately, distributed over 10 SNAPs (Selected Nomenclature for Air Pollution) classifying pollution sources according to the processes leading to harmful material release to the atmosphere [3]. The temporal allocation is made on the basis of daily, weekly, and monthly profiles provided by Builtjes et al. [1]. The speciation procedure depends on the Chemical Mechanism (CM) used. Here, the Carbon Bond mechanism, version 4 (CB4), is exploited [6], upgraded with the ISORROPIA aerosol model [8]. It requires splitting of VOC into 10 lumped pollutants and of PM2.5 into 5 groups of aerosol. A specific approach for obtaining speciation profiles is used here. The US EPA data base [18] is exploited. A Bulgarian emission expert matched the main Bulgarian sources for every SNAP to similar source types from the US EPA nomenclature. The weighted averages of the respective speciation profiles are accepted as SNAP-specific splitting factors, the weights being the percentage contribution of every source type to the emission of the particular SNAP. As shown in Fig. 1, the AS emission file is prepared by the interface program AEmis. Gridded emission inventory data is input to AEmis. The program performs speciation and time allocation for every grid cell and for every SNAP. SMOKE produces the LPS emission file. For the purpose, the LPS inventory is transformed into the required IDA format. A serious update of the SMOKE profiles and reference files had to be made so as to make the model produce Europe-specific results.
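To make the time-allocation/speciation chain concrete, the sketch below shows how an annual, SNAP-classified emission value could be turned into an hourly, speciated value using monthly, weekly and daily profiles and a splitting factor. The function, its parameters and the profile handling are illustrative assumptions and do not reproduce the AEmis code.

#include <array>

// Illustration only: hourly emission of one lumped species for one grid cell and SNAP.
// annual             : annual emission of the group (e.g. VOC) in the cell
// month, wday, hour  : 0-based month, weekday and hour of the forecast time
// monthly/weekly/daily : dimensionless time-variation profiles (mean value 1)
// split              : SNAP-specific splitting factor of the lumped species (0..1)
double hourlyEmission(double annual, int month, int wday, int hour,
                      const std::array<double, 12>& monthly,
                      const std::array<double, 7>&  weekly,
                      const std::array<double, 24>& daily,
                      double split) {
    const double hoursPerYear = 8760.0;
    const double base = annual / hoursPerYear;            // flat hourly rate
    return base * monthly[month] * weekly[wday] * daily[hour] * split;
}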
4
CMAQ Calculations and Validation
Fourteen σ-levels with varying thickness determine the vertical structure of CMAQ. The Planetary Boundary Layer is represented by 8 of these levels. The CMAQ calculations are run twice a day, the previous forecast for the respective hour of the day serving as the initial concentration for the next run.
Before starting to use BGCW, validation of the model results is needed in order to assure the quality and credibility of the delivered output. Here, a validation for ozone for the year 2000 is performed. The up-to-date national emission inventory is still in a stage of completion; for the validation, the TNO inventory [13] is exploited. Its resolution is on average 15 × 15 km. GIS technology is applied so as to produce the AEmis input from this data base. The background measurements in Bulgaria are quite scanty. Fortunately, in the frame of a research project [2], hourly measurements of ozone for the whole year 2000 were made at two background points — peak Rojen (Rodopi mountains, south Bulgaria) and Ahtopol (a small Black Sea town, southeastern Bulgaria). The respective meteorology data for 2000 is taken from the ALADIN archive. The validation calculations are made in off-line mode. Four scenarios are set so as to distinguish the importance of different processes for ozone formation. Scenario A is the basic one — all kinds of emissions and boundary conditions are accounted for. In Scenario B, the boundary values of all pollutants are set to zero. Scenarios C and D are as A and B, respectively, but the biogenic emissions are excluded. Further on, only the results for Rojen will be presented and discussed; Ahtopol's results are quite similar. In the left graph of Fig. 2, one can notice that almost all scatter points are within the FA2 boundaries (dashed lines). The fitting lines are quite close to the ideal fitting line, showing a slight underestimation of the measured data. The correlation coefficients of both fits are quite high, which reflects the good quality of the simulation. The comparison of scenarios A and B (the graph in the middle-left of Fig. 2) shows that it is quite important to account correctly for the boundary conditions, especially in air quality calculations over such a small domain: the BCs form almost 90% of the pollution levels in the region. The middle-right graph of Fig. 2 shows that the Scenario D ozone values are at least three times lower than in Scenario B, pointing out the importance of biogenic emissions for ozone formation. As the best way to compare ozone data is the use of daily maxima, the simulation quality of the BGCW system is demonstrated once more in the right graph of Fig. 2, where the scatter diagram of measured vs. calculated daily maxima (for Rojen, again) is displayed. One can notice that the fitting lines are close to the ideal one, the correlation coefficients being over 0.95, and the measured daily maxima are somewhat underestimated. The simulation results are rather good. This means that the BGCW system is able to reproduce quite realistically the everyday ozone variations and can serve as an air quality forecast and information system.
Fig. 2. Scatter diagrams: Measurements vs. Scenario A (left); A vs. B (middle–left), B vs. D (middle–right), daily maxima measured vs. daily maxima Sc. A (right)
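The FA2 (factor-of-two) fraction and the correlation coefficient used in such scatter-plot comparisons can be computed as in the sketch below; this is a generic illustration of the two metrics, not code from the BGCW system, and the handling of zero observations is an assumption.

#include <cmath>
#include <vector>

// Fraction of model/observation pairs within a factor of two (FA2).
double fa2(const std::vector<double>& obs, const std::vector<double>& mod) {
    std::size_t hits = 0, valid = 0;
    for (std::size_t i = 0; i < obs.size(); ++i) {
        if (obs[i] <= 0.0) continue;                 // skip undefined ratios
        ++valid;
        const double r = mod[i] / obs[i];
        if (r >= 0.5 && r <= 2.0) ++hits;
    }
    return valid ? static_cast<double>(hits) / valid : 0.0;
}

// Pearson correlation coefficient between observed and modelled values.
double correlation(const std::vector<double>& x, const std::vector<double>& y) {
    const std::size_t n = x.size();
    double sx = 0, sy = 0, sxx = 0, syy = 0, sxy = 0;
    for (std::size_t i = 0; i < n; ++i) {
        sx += x[i]; sy += y[i];
        sxx += x[i] * x[i]; syy += y[i] * y[i]; sxy += x[i] * y[i];
    }
    const double cov = sxy - sx * sy / n;
    const double vx = sxx - sx * sx / n, vy = syy - sy * sy / n;
    return cov / std::sqrt(vx * vy);
}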
5
Conclusion
Evaluation of the BGCW simulations showed that the modelling system has a satisfactory performance with respect to O3, as shown by the different plots discussed. Despite using boundary conditions from another modelling system, the basic spatial and temporal O3 patterns are captured by the model. The best simulation quality refers to the summertime daily maxima. The reasonable performance of the BGCW system for the past-time simulations justifies its use for future forecast and information purposes. Due to the considerable volume of calculations and the operational needs, it is worth checking the ability to run the BGCW system on the GRID. Efforts in this direction are planned.
Acknowledgments The presented results were not possible without the experience obtained during the participation in the fifth FP project BULAIR, the sixth FP Network of Excellence ACCENT and the sixth FP Integrated Project QUANTIFY. The recent funding by the National Science Fund (Grants DO02-161/16.12.2008 and DO02-115/2008) must be specially acknowledged.
References

1. Builtjes, P.J.H., van Loon, M., Schaap, M., Teeuwisse, S., Visschedijk, A.J.H., Bloos, J.P.: Project on the modelling and verification of ozone reduction strategies: contribution of TNO-MEP, TNO-report, MEP-R2003/166, Apeldoorn, The Netherlands (2003)
2. Donev, E., Zeller, K., Avramov, A.: Preliminary background ozone concentrations in the mountain and coastal areas of Bulgaria. Environmental Pollution 17, 281–286 (2002)
3. EMEP/CORINAIR: Atmospheric emission inventory guidebook, 3rd edn., Copenhagen, European Environmental Agency (2002), http://reports.eea.europa.eu/EMEPCORINAIR3/en/page002.html
4. ENVIRON: User's Guide to the Comprehensive Air Quality Model with Extensions (CAMx), Version 4.40, ENVIRON International Corporation, Novato, CA (2006)
5. European Parliament: DIRECTIVE 2002/3/EC of 12 February 2002 relating to ozone in ambient air. Official Journal of the European Communities L67, 14–30 (March 9, 2002)
6. Gery, M.W., Whitten, G.Z., Killus, J.P., Dodge, M.C.: A Photochemical Kinetics Mechanism for Urban and Regional Scale Computer Modeling. Journal of Geophysical Research 94, 12925–12956 (1989)
7. Monteiro, A., Lopes, M., Miranda, A., Borrego, C., Vautard, R.: Air pollution forecast in Portugal: a demand from the new air quality framework directive. Int. J. Environment and Pollution 25(1-4), 4–15 (2005)
8. Nenes, A., Pandis, S.N., Pilinis, C.: ISORROPIA: A new thermodynamic equilibrium model for multiphase multicomponent inorganic aerosols. Aquat. Geoch. 4, 123–152 (1998)
9. Poupkou, A., Kioutsioukis, I., Lisaridis, I., Markakis, K., Giannaros, T., Katragkou, E., Melas, D., Zerefos, C., Viras, L.: Evaluation in the Greater Athens Area of an air quality forecast system. In: Proc. of the IX EMTE National-International Conference of Meteorology-Climatology and Atmospheric Physics, Thessaloniki, Greece, pp. 759–766 (2008)
10. Poupkou, A., Kioutsioukis, I., Lisaridis, I., Markakis, K., Melas, D., Zerefos, C., Giannaros, T.: Air quality forecasting for Europe, the Balkans and Athens. In: 3rd Environmental Conference of Macedonia, Thessaloniki, Greece (2008b)
11. San Jose, R., Perez, J., Gonzalez, R.: An Operational Real-Time Air Quality Modelling System for Industrial Plants. Environmental Modelling & Software 22, 297–307 (2007)
12. Sofiev, M., Siljamo, P., Valkama, I., Ilvonen, M., Kukkonen, J.: A dispersion modelling system SILAM and its evaluation against ETEX data. Atmospheric Environment 40, 674–685 (2005/2006)
13. Visschedijk, A.J.H., Zandveld, P.Y.J., Denier van der Gon, H.A.C.: A High Resolution Gridded European Emission Database for the EU Integrated Project GEMS, TNO-report 2007-A-R0233/B, Apeldoorn, The Netherlands (2007)
14. http://box.mmm.ucar.edu/mm5/
15. http://www.chemicalweather.eu/
16. http://www.cmaq-model.org/
17. http://www.smoke-model.org/
18. http://www.epa.gov/ttn/chief/emch/speciation/
Climate Change Impact Assessment of Air Pollution Levels in Bulgaria

D. Syrakov1, M. Prodanova1, N. Miloshev2, K. Ganev2, G. Jordanov2, V. Spiridonov1, A. Bogatchev1, E. Katragkou3, D. Melas3, A. Poupkou3, and K. Markakis3

1
National Institute of Meteorology and Hydrology, Bulgarian Academy of Sciences, Sofia, Bulgaria
[email protected]
2 Geophysical Institute, Bulgarian Academy of Sciences, Sofia, Bulgaria
[email protected]
3 Laboratory of Atmospheric Physics, Aristotle University of Thessaloniki, Greece
Abstract. The presented work aims at climate change impact and vulnerability assessment for Bulgaria. Climate change may affect exposure to air pollutants by affecting weather and thereby local and regional pollution concentrations. Local weather patterns influence atmospheric chemical reactions and can also affect atmospheric transport and deposition processes. The US EPA Models-3 System is exploited here for a region covering Bulgaria with a resolution of 10 km. The meteorological background is produced by the climatic version of the ALADIN weather forecast system. The TNO emission inventory for 2000 is used. The chemical boundary conditions are extracted from 50-km resolution runs over Europe made at the Aristotle University of Thessaloniki, Greece. Calculations for the period 1991–2000 are performed and the results are presented in the study. For the year 2000, several scenarios are run and the results are compared with measured data.
1
Introduction
A great many scientific projects and related publications are aimed at assessing the impact of climate change on various areas of human activity and the environment. The impact of climate change on pollution levels has been the object of various publications. Special attention is due to the NATO Project CLG 980505 final report [2]. There, long-term runs are performed over Europe with a resolution of 10 km, exploiting the UNI-DEM AQM [11,12] and meteorology for a period of 16 years (1989–2004). Different kinds of scenarios (traditional, climatic, emission) are combined in 14 different runs. The meteorology for the prospective climatic runs is prepared by suitably changing the available meteorological variables so as to fit the means from the IPCC climate scenario SRES A2 [6]. The discussion of the results is quite interesting. The EC FP6 project CECILIA [15] is devoted to climate change impacts. CECILIA's Work Package 7 aims at long-term simulations of chemistry air quality
models (AQM) driven by Regional Climate Models (RCMs) for the present climate and for future projections, with a fine resolution of 10 km for some target regions of Central and Eastern Europe. The boundary conditions for these regions are prepared by calculations covering the whole of Europe with a grid resolution of 50 × 50 km. The CECILIA approach differs from the one just described. The meteorological fields are determined as the output of RCMs forced by available forecast/reanalysis or global scale climate model runs. Although the investigated periods are quite long, the emission scenario is fixed — only the EMEP 50 km data base for 2000 [18] is used. For the fine scale simulations a proper disaggregation of the EMEP data base is exploited [10]. The aim is to estimate only the influence of meteorology changes on pollution levels in the regional and local aspect. In this paper, first results of applying the CMAQ chemical transport model for long-term simulation over the region of Bulgaria are presented, with emphasis on model validation.
2
Meteorological Data Base
The meteorological data base for this study is created off-line by the ALADIN Regional Climatic Model (ALADIN-RCM). It is a modification of the current operational weather forecast system of the National Institute of Meteorology and Hydrology (NIMH) of Bulgaria. A description of the small size, high resolution RCM with the ALADIN dynamics (ALADIN-RCM) is given in [9] and [3]. The ALADIN dynamics was entirely built on the notion of compatibility with its "mother" system, IFS/ARPEGE. The ALADIN/CMAQ meteorological data base consists of 6-hour GRIB files containing the main meteorological parameters on 31 standard p-levels going up to 50 hPa.
3
Air Quality Models Used, Computational Domains and Data Flow
The US EPA Models-3 air quality modelling system is used here, consisting of: (i) CMAQ (Community Multi-scale Air Quality model, [16]), being the chemical-transport model; (ii) MM5 (the 5th generation PSU/NCAR Meso-Meteorological Model, [13]), used as meteorological pre-processor to CMAQ; and (iii) SMOKE (Sparse Matrix Operator Kernel Emissions Modelling System, [17]), being the emission pre-processor to CMAQ. The calculations are performed for a region containing mainly Bulgaria with a 10 × 10 km resolution. For the purpose, three nested computational domains are set: the ALADIN, MM5, and CMAQ ones. A Lambert conformal projection is used with parameters Lat1 = 42.5°, Lat2 = 42.5°, λc = 25°, ϕc = 42.5°. A number of interfaces (Linux scripts and Fortran codes) are created so as to link these models and different types of input information into a system capable of performing long-term calculations. The 1-day data flow is shown in Fig. 1. The white boxes present Models-3 components, MCIP being the Meteorology-Chemistry Interface Processor of CMAQ.
Fig. 1. Data flow of calculations for estimating climate change impact on air quality
The light grey boxes present the data input to the system and the dark grey ones — Fortran codes. First of all, this is the ALADIN data that drives MM5. Its output feeds MCIP, which produces the meteorological input to both CMAQ and SMOKE. The CAMx data base [7,8] contains air pollution calculations for Europe on a 50-km grid. It is used for off-line interpolation to the CMAQ boundary points (module BG.BC1). The uploaded results are the basis for the on-line elaboration of the CMAQ boundary conditions input file (module BG.BC2).
4
Modelling
CMAQ demands its emission input in a specific format reflecting the time evolution of all pollutants accounted for by the chemical mechanism used. As a rule, the emission inventory is made on an annual basis for big territories (municipalities, counties, countries, etc.). Many pollutants are estimated as groups, like VOC (Volatile Organic Compounds) and PM2.5 (Particulate Matter with d < 2.5 μm). In preparing model-ready emission data, a number of specific estimates must be made (gridding, time allocation, speciation) for the main types of sources: Area (AS), Large Point (LPS), and Biogenic (BgS) sources. Obviously, emission models are needed as pre-processors to the chemical transport models. Such a component in the EPA Models-3 system is SMOKE. Unfortunately, it is highly adapted to the US conditions and is only partly used here. Fed by gridded land-use data and the MCIP output, it calculates the biogenic sources and merges the AS, LPS and BgS files into a single CMAQ emission input file. As shown in Fig. 1, the anthropogenic emission files (AS and LPS) are prepared by the interface programs AEmis and PEmis. Input to them are the gridded emission inventories for the year 2000. Here, the TNO high resolution inventory [10] is exploited. GIS technology is applied so as to produce the 10 × 10 km AEmis and PEmis input from this data base.
The temporal allocation is made by superimposing daily, weekly, and monthly profiles (see [1]). The speciation profiles are elaborated following the recommendations given on the US EPA site [14]. AEmis and PEmis perform speciation and time allocation for every grid cell and every pollutant for the current Julian date. The obtained hourly values are output in NetCDF format and enter SMOKE to be merged with the BgS emissions. Fourteen σ-levels with varying thickness determine the vertical structure of CMAQ. The Planetary Boundary Layer is represented by 8 of them. The daily CMAQ output is a NetCDF file with 3D hourly data for 78 pollutants. Besides meteorology, boundary conditions and emissions, CMAQ also needs Initial Condition (IC) input. As the CMAQ calculations proceed day by day, the last-hour output for a day is the IC for the next day. Finally, a number of post-processing programs (XtrMET, XtrCON) are created, aimed at saving the important part of the output information.
5
Validation of Model Results
Before starting the assessment of the climate change impact on air quality, validation of the model results is needed in order to assure the quality and credibility of the delivered output. There are quite few usable measurements in the territory of Bulgaria (no EMEP station in the domain). Fortunately, during the whole year 2000, in the frame of the research project [4], hourly ozone measurements were performed at two points in Bulgaria — peak Rojen, Rhodopi mountain (41N41, 24E44, altitude 1700 m), and Ahtopol, a small town at the Black Sea coast (42N05, 27E57, altitude 10 m). This data is the main source for evaluating the model results. For the year 2000 four scenarios are calculated (see Fig. 2). The aim is to estimate the importance of the biogenic sources and of the boundary conditions. The scatter diagrams of Scenario A vs. measurements are displayed in Fig. 3. One can notice that almost all scatter points are within the FA2 boundaries (dashed lines). The fitting lines are quite close to the ideal fitting, showing a slight underestimation of the measurement data at Rojen and an overestimation at Ahtopol. The correlation coefficients of both fits are quite high, which reflects the good quality of the simulation.
Fig. 2. Measurement points (left) and scenario’s description (right)
Fig. 3. Scatter diagrams of measurement and modelled (Scenario A) 1-hour ozone data
Fig. 4. Scenarios comparison: A vs. B (left), A vs. C (middle), and B vs. D (right), Ahtopol
The main insights from the sensitivity analysis will be shown by the results from the different scenarios for Ahtopol; the Rojen diagrams are quite similar. The left graph of Fig. 4 demonstrates the exclusive importance of the boundary conditions for such a small area as the Bulgarian territory. The lack of pollution at the boundaries, i.e. the action of the Bulgarian pollution sources only, produces ozone concentrations several times lower than in the Basic scenario. The other two graphs demonstrate the importance of the biogenic sources in ozone formation. Together with the direct comparisons, some indexes recommended by the European Ozone Directive [5] are calculated for both the measured data and the Scenario A simulations, mainly AOT40 (ozone Accumulated Over a Threshold of 40 ppb during the day-time hours in the period from May 1 to July 31), NOD60 (Number Of Days in which the day-time 8-hour running averages of the ozone concentration exceed the critical value of 60 ppb), and ADM (Averaged Daily Maxima). The comparison of the AOT40 and NOD60 indexes is not satisfactory, which is not strange keeping in mind the high sensitivity of these indexes to measurement and calculation errors as well as to the ozone daily variations. ADM, which is mostly used for validation purposes, shows quite close results: 59.3 and 54.4 ppb for Rojen and 54.1 and 53.6 ppb for Ahtopol for the measured and calculated ozone, respectively. These comparisons support the conclusion that the simulation results are rather
good and can be used with certainty for investigation of the climate change impact on pollution levels.
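A compact illustration of how these three indexes could be computed from hourly ozone data is given below. It is an assumption made for illustration, not part of the ALADIN/CMAQ post-processing: the May 1 to July 31 window is left to the caller, and the 8-hour running mean here scans all hours of the day rather than only the day-time hours.

#include <algorithm>
#include <vector>

// ozone    : hourly values [ppb] for consecutive days, 24 values per day
// dayHours : indices of the "day-time" hours within a day (e.g. 8..19)
struct OzoneIndexes { double aot40; int nod60; double adm; };

OzoneIndexes computeIndexes(const std::vector<double>& ozone,
                            const std::vector<int>& dayHours) {
    const std::size_t nDays = ozone.size() / 24;
    OzoneIndexes r{0.0, 0, 0.0};
    for (std::size_t d = 0; d < nDays; ++d) {
        const double* day = &ozone[d * 24];
        // AOT40: accumulate exceedances over 40 ppb during the day-time hours
        for (int h : dayHours)
            if (day[h] > 40.0) r.aot40 += day[h] - 40.0;
        // NOD60: the day counts if any 8-hour running average exceeds 60 ppb
        bool exceeded = false;
        for (int h = 0; h + 8 <= 24; ++h) {
            double s = 0.0;
            for (int k = 0; k < 8; ++k) s += day[h + k];
            if (s / 8.0 > 60.0) { exceeded = true; break; }
        }
        if (exceeded) ++r.nod60;
        // ADM: average of the daily maxima
        r.adm += *std::max_element(day, day + 24);
    }
    if (nDays > 0) r.adm /= static_cast<double>(nDays);
    return r;
}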
6
Long Term Simulations: Results for 1991–2000 Time–Slice
Following the technology just described, the 10-year period of the CECILIA present-climate time-slice was simulated. No other comparisons with measurement data are performed; here, mainly demonstrations will be presented. In the left part of Fig. 5, the "climatic" ozone field is presented, compared with the field of the 10-year mean ADM. One can notice some resemblance between the spatial patterns, but the values are quite different — the maximum of the climatic ozone field is close to the minimum of the ADM field. The inter-annual variability of the ADM is demonstrated in Fig. 6. In spite of the similar spatial distribution, the ADM for 2000 are by several ppb higher than for 1997. This can be explained by the prevailing meteorological conditions — the cold and wet summer of 1997 and the dry and warm year 2000. The last illustration of the ALADIN/CMAQ output, presented in Fig. 7, is the "climatic" fields of SO2 and PSO4 (particulate sulfate). The left graph shows two main pollution spots in the Bulgarian territory — the TPP "Maritza-Iztok" in
Fig. 5. 10-year mean ozone concentration (left) and Averaged Daily Maxima (right)
Fig. 6. Averaged Daily Maxima for years 1997 (left) and 2000 (right)
Fig. 7. 10-year averaged fields of SO2 (gas, left) and PSO4 (aerosol, right)
the centre (3 neighbouring TPPs that used to be the most powerful SO2 polluter in the Balkans) and the industrial area "Sofia-Pernik" on the left. The lignite-burning TPPs are quite well expressed on the PSO4 map as well (the graph on the right). Many plots with the inter-annual variations of these and all other saved pollutants, as well as of related meteorological parameters, can be presented and interpreted; this will be the object of further work.
7
Conclusion
Evaluation of the ALADIN/CMAQ simulations showed that the modelling system has a satisfactory performance with respect to O3, as shown by the plots discussed in this study. Despite the use of boundary conditions from another modelling system, the basic spatial and temporal O3 patterns are captured by the model. The best simulation quality refers to the summertime daily maxima. The reasonable performance of the ALADIN/CMAQ system for the present-time simulations justifies its use for future-time projections. The enormous volume of calculations related to climate change impact assessment on air quality requires powerful computational platforms like the GRID. Since two new 10-year time slices must be simulated in the frame of the CECILIA project, some effort was put into producing a version of the above-described data stream suitable for GRID application. The results are very promising: the performance is faster than on a single worker and the obtained results are practically identical. Many jobs, reflecting different time pieces, can be prepared and run almost simultaneously, exploiting to a great extent the parallel computational abilities of the GRID.
Acknowledgments. This study was made with the financial support of the European Commission — 6th FP Integrated Project CECILIA [15], as well as by the Bulgarian National Science Fund (grants No. DO02-161/16.12.2008 and DO02-115/2008). The presented results would not have been possible without the experience obtained during the participation in the 5th FP project BULAIR, the 6th FP Network of Excellence ACCENT and the 6th FP Integrated Project QUANTIFY.
References
1. Builtjes, P.J.H., van Loon, M., Schaap, M., Teeuwisse, S., Visschedijk, A.J.H., Bloos, J.P.: Project on the modelling and verification of ozone reduction strategies: contribution of TNO-MEP. TNO-report, MEP-R2003/166, Apeldoorn, The Netherlands (2003)
2. Csomos, P., Cuciureanu, R., Dimitriu, G., Dimov, I., Doroshenko, A., Farago, I., Georgiev, K., Havasi, A., Horvath, R., Margenov, S., Moseholm, L., Ostromsky, T., Prusov, V., Syrakov, D., Zlatev, Z.: Impact of Climate Changes on Pollution Levels in Europe. NATO Project CLG 980505 Final Report, NATO HQ, Brussels, Belgium (2006)
3. Déqué, M., Piedelievre, J.-P.: High-Resolution climate simulation over Europe. Climate Dynamics 11, 321–339 (1995)
4. Donev, E., Zeller, K., Avramov, A.: Preliminary background ozone concentrations in the mountain and coastal areas of Bulgaria. Environmental Pollution 17, 281–286 (2002)
5. European Parliament: Directive 2002/3/EC of 12 February 2002 relating to ozone in ambient air. Official Journal of the European Communities L67, 14–30 (March 9, 2002)
6. Houghton, J.T., Ding, Y., Griggs, D.J., Noguer, M., van der Linden, P.J., Dai, X., Maskell, K., Johnson, C.A. (eds.): Climate Change 2001: The Scientific Basis. Cambridge University Press, Cambridge (2001)
7. Katragkou, E., Zanis, P., Tegoulias, I., Melas, D., Krueger, B.C., Huszar, P., Halenka, T.: Tropospheric Ozone over Europe: An air quality model evaluation for the period 1990–2001. In: Proceedings of the IX EMTE National-International Conference of Meteorology-Climatology and Atmospheric Physics, p. 649. Publishing House of the Aristotle University of Thessaloniki, Thessaloniki (2008)
8. Krueger, B.C., Katragkou, E., Tegoulias, I., Zanis, P., Melas, D., Coppola, E., Rauscher, S.A., Huszar, P., Halenka, T.: Regional decadal photochemical model calculations for Europe concerning ozone levels in a changing climate. Idojaras — Quarterly Journal of the Hungarian Meteorological Service (in press)
9. Spiridonov, V., Déqué, M., Somot, S.: ALADIN-CLIMATE: from the origins to present date. ALADIN Newsletter 29 (2005)
10. Visschedijk, A.J.H., Zandveld, P.Y.J., Denier van der Gon, H.A.C.: A High Resolution Gridded European Emission Database for the EU Integrated Project GEMS. TNO-report 2007-A-R0233/B, Apeldoorn, The Netherlands (2007)
11. Zlatev, Z.: Computer treatment of large air pollution models. Kluwer Academic Publishers, Dordrecht (1995)
12. Zlatev, Z., Moseholm, L.: Impact of climate changes on pollution levels in Denmark. Environmental Modelling 217, 305–319 (2008)
13. http://box.mmm.ucar.edu/mm5/
14. http://www.epa.gov/ttn/chief/emch/speciation/
15. http://www.cecilia-eu.org/
16. http://www.cmaq-model.org/
17. http://www.smoke-model.org/
18. http://webdab.emep.int/
Effective Algorithm for Calculating Spatial Deformations of Pre-stressed Concrete Beams
János Török, Dénes Németh, and Máté Lakat
Centre of Information Technology, Budapest University of Technology, Magyar tudósok krt. 2, H-1117 Budapest, Hungary
Abstract. An effective algorithm is presented in this paper to solve boundary value problems describing the deformations of pre-stressed concrete beams. The solution presented here aims to deliver a speedup of several orders of magnitude compared to existing algorithms, in order to be able to handle more complex problems that were inaccessible so far. Parallelization issues are investigated and a solution is proposed which allows this algorithm to run in various environments, from desktop computers to clusters and Grids.
1 Introduction
There are many fields where the solution of more complicated problems requires exponentially more computation time [5]. In such cases accessing more computation power does not really help. In order to make a considerable step forward, one has to renew the tools and achieve an orders-of-magnitude speed gain. The problem we address here is the algorithms used for calculating the deformation of simple architectural elements, including prestressed concrete beams, arches, beam assemblies, etc. It is unnecessary to describe the importance of providing physically correct results in a field where the general engineering procedures consist of heuristics and, in some cases, finite element methods with mainly empirical models [1]. Physically exact calculations would be priceless, but they require the solution of a large number of non-linear equations, with solutions being one-dimensional objects in a d-dimensional domain [2]. An important requirement makes these problems even more difficult, namely that it is not enough to calculate the solutions only once and reuse them for different cases, because by changing the parameters of the structures the topology of the solutions may change. Therefore one must come up with an algorithm that can be run by the architects and does not require access to supercomputers. There are two different approaches used in the literature [8]: 1) find some “trivial” solution and try to follow it by slowly changing the parameters, or 2) take the parameter space and scan it for solutions [7]. The drawback of the first is that it is unable to find solutions that are not connected to the “trivial” one and thus may predict false stability; on the other hand, the second approach requires an immense amount of CPU time.
A delicate combination of the two approaches has so far led to the best algorithm [4]. However, as usual, engineers want more. In previous papers [3,6] we described the basics of a new idea that searches for the solutions in an artificial potential by a gradient algorithm. This solution was integrated into a hybrid algorithm that combined approaches 1) and 2). It gave promising results, but due to the limitations of the original approach (see Sec. 3) a new algorithm had to be designed. In this paper we describe the analyses that gave birth to an algorithm which is optimized for both memory and CPU consumption and is able to attack problems too difficult for previous algorithms.
2 Description
2.1 The Problem
The non-linear differential equations of the boundary value problems describing the problems can be transformed to a set of non-linear equations [4]. A problem having d degrees of freedom can thus be represented as:

$$0 = f_1(x_1, x_2, \ldots, x_d)$$
$$0 = f_2(x_1, x_2, \ldots, x_d)$$
$$\vdots$$
$$0 = f_{d-1}(x_1, x_2, \ldots, x_d), \qquad (1)$$
where the $f_j$ are non-linear functions. It is important to note that we have one equation fewer than the number of unknowns (meaning one physically independent parameter). Thus we will have one-dimensional solutions (lines, curves) in the d-dimensional parameter space ($x_1, x_2, \ldots, x_d \in X$). One important remark is that in general these functions are very complicated to calculate, which takes a lot of CPU time. On the other hand, due to their complexity, we assume that in general no analytic information is available about the functions. From now on we use the letters x (any point) and y (solution candidate) to designate points in the parameter space with index $i = 1, 2, \ldots, d$, and the letter f for functions with index $j = 1, 2, \ldots, d-1$.
2.2 Solution Testing
It is easy to see that trivial solution testing methods based on bounding $|f_j(\{y_i\})|$ are useless in our case, as no information is available on the functions. Therefore we chose the following method for solution testing. Let us consider a simplex in the d-dimensional space. If a solution line goes through the simplex, all functions must cross zero and thus change sign, which can be easily detected. This assumption is true only if the simplices are small enough to contain only one zero line, which is checked by repeating the calculation with different simplex sizes.
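The sign-change criterion described above can be stated very compactly; the following sketch (our illustration, not the authors' code) assumes that the functions can be evaluated at the simplex vertices.

```python
import numpy as np

def may_contain_solution(simplex_vertices, fs):
    """Sign-change test: a solution line can pass through the simplex only if
    every function f_j changes sign (or hits zero) over its vertices.

    simplex_vertices : (d+1) x d array of vertex coordinates
    fs               : list of the d-1 functions f_j
    """
    for f in fs:
        values = np.array([f(v) for v in simplex_vertices])
        if values.min() > 0.0 or values.max() < 0.0:
            return False   # this f_j keeps one sign, no zero crossing
    return True
```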
The next task is to create the simplices. Working directly with simplices in d dimensions is a nightmare, so we chose to handle hypercubes and, if necessary, decompose them into simplices. The simplex decomposition of a hypercube is non-trivial; even the number of simplices may vary. Since there is no algorithm providing the decomposition with the minimum number of simplices, we used the easiest method, with d! simplices. One can fix two points of each simplex, which can be chosen to be the origin (0, 0, ..., 0) and the opposite corner (1, 1, ..., 1). The other corners of the simplices are chosen by permuting the unit vectors without repetition. On the faces of the simplices a linear interpolation can be carried out to approximate the solution points, which requires the solution of a set of linear equations. The solutions of these equations will be points on the surfaces of the simplices, more accurate than the corners. If a point also lies on the surface of the containing hypercube, we can follow the solution line by choosing the touching hypercube next.
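The permutation-based d! decomposition described above can be sketched as follows (a minimal illustration under the stated convention: origin and opposite corner fixed, intermediate vertices built by adding unit vectors in permuted order).

```python
from itertools import permutations
import numpy as np

def hypercube_simplices(d):
    """Decompose the unit d-cube into d! simplices via permutations.

    Each simplex is a (d+1) x d array of vertices: the first vertex is the
    origin, the last is the opposite corner (1, ..., 1), and each intermediate
    vertex adds one unit vector in the order given by the permutation.
    """
    for perm in permutations(range(d)):
        verts = [np.zeros(d)]
        for axis in perm:
            v = verts[-1].copy()
            v[axis] = 1.0          # step along one coordinate direction
            verts.append(v)
        yield np.array(verts)

# Example: the 3-cube splits into 3! = 6 tetrahedra
print(sum(1 for _ in hypercube_simplices(3)))   # -> 6
```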
2.3 Potential
In general there is no hint about the position of the solutions, so up to now mainly parameter scanning was used. Here we propose a method to find a solution faster. We create an artificial potential which has minima at the solutions, so that gradient algorithms can be used to find them. A simple choice of the potential was presented in [6]:

$$U(\{x_i\}) \equiv \sum_{j=1}^{d-1} c_j\, f_j^2(\{x_i\}), \qquad (2)$$

where $c_j > 0$ are coefficients, taken from random sampling in X:

$$c_j = \sum_{k=1}^{m} f_j^{-2}(\{x_i^{(k)}\}), \qquad (3)$$

where $\{x_i^{(k)}\}$ is a random point in X. Obviously, if $\{y_i\}$ is a solution of (1), then $U(\{y_i\}) = 0$, otherwise $U > 0$.
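The construction of Eqs. (2)-(3) translates directly into code; the sketch below is our illustration (function and variable names are not from the paper) and assumes the sample points are not themselves solutions, so that the inverse squares in (3) are finite.

```python
import numpy as np

def make_potential(fs, samples):
    """Build the artificial potential U of Eq. (2) with weights from Eq. (3).

    fs      : list of the d-1 functions f_j, each mapping an x-vector to a float
    samples : iterable of random sample points x^(k) used for the weights c_j
    """
    c = np.array([sum(f(x) ** (-2) for x in samples) for f in fs])

    def U(x):
        return float(sum(cj * f(x) ** 2 for cj, f in zip(c, fs)))

    return U
```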
2.4 Algorithm
The new algorithm presented here uses a mixed approach. The ability to find solutions by a gradient method in the artificial potential U makes the scanning of the parameter space useless. Therefore the strategy adopted here is the following (a sketch of this loop is given after the list):
1. Generate a solution candidate point using one of the following methods:
   (a) gradient method (starting from random points, a regular lattice, etc.);
   (b) read from file;
   (c) from an algorithm (if the previous two methods cannot be used).
2. Test for solutions the hypercube which contains the point.
3. If a solution was found, follow it.
4. Return to 1.
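A minimal sketch of this loop is given below; all helper callables (candidate generation, cube lookup, simplex test, solution following) are assumed to be provided by the surrounding code and their names are ours.

```python
def explore(candidate_points, cube_of, test_cube, follow_solution):
    """Outer loop of the mixed strategy sketched in the list above."""
    for point in candidate_points:      # step 1: next solution candidate
        cube = cube_of(point)           # locate the containing hypercube
        if test_cube(cube):             # step 2: simplex sign-change test
            follow_solution(cube)       # step 3: follow the solution line
        # step 4: continue with the next candidate point
```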
3 Analysis
3.1 Concept
The algorithm has two distinct parts: the solution candidate point generation and the solution following. The solution following is done similarly to previous realizations: If solution was found in a hypercube, the simplex algorithm is used to determine on which faces it exits from the cube (can be more than two if there are branches). The neighbouring hypercubes in these directions are considered as candidates for solutions and put in a list to be tested. This procedure is repeated until the list gets empty. This list is expected to remain small, if we do not insert hypercubes which were tested: So as a solution is followed a cube is taken from the list and a new is inserted. The only increase in size of the queue may come from branches. As we said, we have to keep track the tested hypercubes (or the ones with solutions) to avoid back and forth walking on a solution line. Due to the fine resolution, large number of hypercubes should be stored and searched effectively. An optimized solution is presented in Sec. 3.4. The gradient method was found to be the most effective solution candidate point generation method. One can choose the starting point of the gradient in many different ways (random, regular, hierarchical, etc.) but we found no significant difference in efficiency. In some cases the gradient method can not be used, because, for example, parts of X are not in the domain of fj , or the landscape of U is too noisy. In these cases we have to generate the solution candidate points in conventional ways. Either in some special cases analytic solutions exist and this can be supplied or a kind of sweeping algorithm can be used. The first choice is simple — one has to provide points to the code which are checked for solutions, the second is more complicated and implemented in hierarchically cutting X into half and scanning the cutting surface. This requires the scan of a d − 1 dimensional space, which is one dimension less than the original, but still very large. This latter option is useful for two purposes: 1) if everything else fails this should always work. This can even be used for small domains where the gradient algorithm seems to fail. 2) To ensure that there are no solutions above a given length scale (distance between two cuts). This can be useful when the mapping of X is done for a new problem. 3.2
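The queue-based solution following described above can be sketched as a breadth-first walk over hypercubes; this is our illustration (the `exit_faces` helper, which runs the simplex test and returns the neighbouring cubes the line enters, is an assumed name).

```python
from collections import deque

def follow_solution(start_cube, exit_faces, visited):
    """Follow a solution line starting from a hypercube known to contain it.

    Tested cubes are recorded in `visited` so the walk does not oscillate back
    and forth along the line; the queue only grows at branches.
    """
    queue = deque([start_cube])
    while queue:
        cube = queue.popleft()
        if cube in visited:
            continue
        visited.add(cube)
        for neighbour in exit_faces(cube):   # cubes the solution line enters
            if neighbour not in visited:
                queue.append(neighbour)
```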
Computation Need of Gradient Method
In general, it is impossible to generate analytically the derivative of the fj functions, so finite difference must be used, which makes the gradient method time consuming. We made several (∼ 100 − 1000) runs of the gradient method with real test problems to study the computation need of the gradient method. The result is summarized in Table 1. The important thing we can learn from the above analysis is that the cost (number of function calculations) of a gradient run grows with the dimension as:
Table 1. The average number of gradient steps and the required function calls, shown as functions of the space dimension. An average of a few hundred runs is presented.

dim. d   grad. steps ng   f calls nf   normalized calls nf/d^4.92
  3          18.6             178            0.80
  5         133.6            2223            0.81
  6         270.5            5355            0.80
$n_f \sim d^5$, which is an algebraic (power-law) increase and should be compared to the exponential increase of any sweeping algorithm ($n_f \sim N^d$, where N is the linear resolution). Thus the gradient algorithm has two major advantages over previous approaches: 1) its scale-free nature, and 2) the power-law growth of the computation time with d, compared to the exponential one.
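A quick sanity check using only the numbers in Table 1 (our illustration): the pairwise slopes of log n_f versus log d come out between roughly 4.8 and 4.95, consistent with the d^4.92 normalization of the table and with the n_f ~ d^5 scaling stated above.

```python
import math

# (d, n_f) pairs taken from Table 1
data = [(3, 178), (5, 2223), (6, 5355)]

for (d1, n1), (d2, n2) in [(data[0], data[1]), (data[0], data[2]), (data[1], data[2])]:
    slope = math.log(n2 / n1) / math.log(d2 / d1)
    print(d1, d2, round(slope, 2))   # approx. 4.94, 4.91, 4.82
```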
3.3 Computation Need of Simplex Decomposition
The simplex decomposition [8] can be very expensive since we have d! simplices in a d dimensional cube (see Sec. 2.2). Furthermore we have to solve a set of linear equations which requires d2 operations. Even if the simplex decomposition does not require the calculation of the expensive function values, due to its factorial increase in computation need necessarily, it will become the most time consuming part of the code with increasing d. However, current measurements in parameter space dimensions show only 0.1 − 2% computation overhead, which is negligible. However this might become the bottleneck in higher dimensions. 3.4
Memory Need
Keeping track of all hypercubes in X is impossible: for example, in d = 5 a resolution of 100 hypercubes per dimension would require 10 gigabytes of memory. Therefore we keep track only of the hypercubes with solution. The one-dimensional nature of the solutions may suggest that the list of sites with solution is small; however, there are cases which resemble more a bowl of spaghetti, and the number of hypercubes with solution may be several hundred times the linear resolution. In order to keep memory usage small, we used the feature that solutions are curves. Thus in each step we only move to a neighboring cube. The algorithm used for storing hypercubes with solutions is the following:
1. Define a box-size parameter s.
2. We consider a continuous section of the solutions which is not longer than s in any dimension. It is defined by its starting point, its path, and its bounding box.
3. The starting point is a d-dimensional vector.
4. The bounding box consists of two d-dimensional vectors pointing to the two opposite corners of the box which contains the entire section.
5. The path is a bit-coded array in which one element indicates the next hypercube. Each element is composed of a header and a data part which are put together in a continuous bit array (see the sketch after this paragraph):
   (a) header: d bits indicating (with bit 1) in which directions the coordinates of the next hypercube differ from the previous one; let c be the number of bits set to 1; then
   (b) the data part consists of c bits, indicating a difference of +1 (bit set, 1) or −1 (bit not set, 0) in the respective direction between the coordinates of the successive hypercubes.
The parameter s must be chosen carefully, because both extreme values of it are unfavorable: if s is too small, the number of sections gets too high; if s is too large, the linear search inside the box gets too long. Of course the optimum is a function of the resolution, the number of solution lines, etc. In the studied cases the optimum was around 2–10% of the linear size of the system.
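The encoding of one path element (item 5 above) can be illustrated as follows; this is a sketch of the idea, not the authors' implementation, and it assumes successive hypercubes differ by at most one unit in any coordinate.

```python
def encode_step(prev_coords, next_coords):
    """Encode one step between neighbouring hypercubes as header + data bits.

    header: d bits, 1 where the coordinate changes
    data  : one bit per changed coordinate, 1 for a +1 step, 0 for a -1 step
    """
    d = len(prev_coords)
    header = [0] * d
    data = []
    for i, (a, b) in enumerate(zip(prev_coords, next_coords)):
        if b != a:                      # this coordinate changes by +/- 1
            header[i] = 1
            data.append(1 if b - a == 1 else 0)
    return header + data

# Example in d = 3: moving from (4, 7, 2) to (4, 8, 1)
print(encode_step((4, 7, 2), (4, 8, 1)))   # -> [0, 1, 1, 1, 0]
```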
3.5 Parallelization
The algorithm presented here can easily be parallelized, since both parts of the algorithm (solution candidate point generation, solution following) take their work from a kind of list which would be ideal for a Master-Slave type scenario. In our case we chose to avoid direct parallelization to be able to run the code on Grids, where inter process communication is not supported. We chose to simply parallelize the 1st part of the algorithm, namely the solution candidate point generation and leave the rest as it is. In this way we may end up calculating certain solutions multiple times. In Section 4 we investigate the price of this solution.
4 Results
We chose two problems: the 3-dimensional euler and the 5-dimensional sp [9] for test runs of the presented algorithm. The latter was run with two different parameter space X. We recorded the time used by the codes as well as the number of solutions. Let us note by N the number of hypercubes in each dimension, by ng the number of gradient runs, by τ the running time of the tests, by ns the number of hypercubes with solution. The solution lines are visualized on Fig. 1. Table 2 details the time needs, and efficiency of the algorithm with different parameters in different dimensions. This linear increase can be observed mainly in the case of the euler model where the solution is a big connected line set thus most of the time the algorithm follows solutions. The time need of the gradient can even be approximated from the data to be ∼ 4s for 900 gradient runs. In the other cases, when the gradient part is more pronounced τ increases even more slowly with N . Thus a
Fig. 1. Visualization of the solution lines of the euler (left) and the sp (right) problem. The different colors indicate solutions found with different numbers of gradient runs. As the same random seed was used for all runs, the solutions are always subsets of another one with higher ng.

Table 2. Time and efficiency measurements of two algorithms called euler (d = 3) and sp (d = 5). spa and spb differ in the size of X.

   N   Code  ng      τ     ns/N | Code  ng      τ     ns/N | Code   ng    τ     ns/N
 200   spa   100     31    29   | spb   100     52    27   | euler  10    0.36  23
 400   spa   100     39    20   | spb   100     66    24   | euler  10    0.69  24
 800   spa   100     96    31   | spb   100     95    22   | euler  10    1.46  25
1600   spa   100     198   29   | spb   100     159   20   | euler  10    3.49  26
 200   spa   1000    189   36   | spb   1000    342   27   | euler  100   0.86  33
 400   spa   1000    214   40   | spb   1000    374   49   | euler  100   1.29  33
 800   spa   1000    270   39   | spb   1000    408   46   | euler  100   2.02  29
1600   spa   1000    396   37   | spb   1000    528   35   | euler  100   4.41  29
 200   spa   10000   1782  38   | spb   10000   3266  54   | euler  1000  4.68  37
 400   spa   10000   1812  40   | spb   10000   3326  87   | euler  1000  5.24  38
 800   spa   10000   1879  39   | spb   10000   3377  78   | euler  1000  6.48  39
1600   spa   10000   1997  37   | spb   10000   3553  59   | euler  1000  9.67  38
much higher resolution solution can be obtained with a marginal increase in the computational time. Let us note that the increase of N should be followed by a strengthening of the convergence conditions of the gradient algorithm, because the solution candidate point found may not lie in the right hypercube. This effect can be observed for the model spb, where the largest resolution gives fewer solutions. One can also read from the table that it is indeed extremely difficult to tell how many initial gradient runs should be used to find all solutions. Sometimes a 10% increase in the number of solutions requires 10 times more gradient runs, because the domain of convergence of certain branches is too small. This effect is difficult to overcome, but two pieces of advice can be given: 1) branches close to the boundary are difficult to detect → increase the size of X well beyond the domain of interest; 2) branches are too close to each other → either restrict the domain of gradient start generation to that part, or use a parameter sweep in those congested areas.
The sp model shows that in high dimensions the chosen parallelization scheme, namely running independent jobs with pre-divided solution candidate point sets, is optimal. If we compare runs with N = 200 and N = 1600 for a large number of ng, we can see that the difference in computation time is about 10%, which is spent on solution following. Thus the gradient part (the solution candidate point search) accounts for 88–98% of the computation time. So the part that might be calculated multiple times accounts only for 2–12%, which is much better than the typical gain of direct parallelization.
5 Conclusion
In this paper we introduced a new algorithm capable of solving non-linear boundary value problems used in engineering fields. The computation time gain compared to previous algorithms can be as large as 10^4 in d = 5. The same (or even larger) gain is achieved in memory usage, which makes the code fit on clusters of virtual computers having little memory. The independent parallelization scheme enables the code to be run in Grid and stand-alone environments. All these factors make the new concept much more flexible.
Acknowledgment. Part of this work was funded by the Péter Pázmány program RET-06/2005 of the National Office for Research and Technology and the EGEE-III project (INFSO-RI-222667).
References 1. Bangash, M.Y.H.: Manual of Numerical Methods in Concrete. Thomas Telford, London (2001) 2. Domokos, G., G´ asp´ ar, Z., Szeber´enyi, I.: CAMES 4(1), 55–68 (1997) 3. Domokos, G., Sipos, A., V´ arkonyi, P.: Szerkezet-tervez´es az interneten: egy g´epi ´ ıt´es-Ep´ ´ ıt´eszettudom´ algoritmus gyors´ıt´ asi lehet˝ os´egei. Ep´ any 34(3-4), 271–291 (2006) 4. Domokos, G., Szeber´enyi, I.: A Hybrid Parallel Approach to One-Parameter Nonlinear Boundary Value Problems. Computer Assisted Mechnics and Engineering Sciences 11, 15–34 (2004) 5. Garey, M.R., Johnson, D.S.: Computers and Intractability: A Guide to the Theory of NP-Completeness. W.H. Freeman, New York (1979) 6. Pasztuhov, D., T¨ or¨ ok, J.: A Gradient Hybrid Parallel Algorithm to One-Parameter Nonlinear Boundary Value Problems. In: Lirkov, I., Margenov, S., Wa´sniewski, J. (eds.) LSSC 2007. LNCS, vol. 4818, pp. 500–507. Springer, Heidelberg (2008) 7. Sipos, A., Domokos, G.: Assymetrical, spatial deformations of reinforced concrete columns and prestressed beams. In: Bal´ azs, G.L., Borosny´ oi, A. (eds.) Proc. FIB Symposium on “Keep concrete active”, vol. II, pp. 693–699. Publishing Co. of BUTE, Budapest (2005) 8. Szeber´enyi, I.: PVM Implementation of Boundary Value Problems on Heterogeneous Parallel Systems. Periodica Polytechnica Electrical Engineering 44(3-4), 317 (2000) 9. Szeber´enyi, I.: Parallel Algorithms for Solving Nonlinear Boundary Value Problems, PhD thesis, BME (2003)
Structurally Stable Numerical Schemes for Applied Dynamical Models Roumen Anguelov Department of Mathematics and Applied Mathematics, University of Pretoria Institute of Mathematics and Informatics, Bulgarian Academy of Sciences
[email protected]
Abstract. The paper deals with the construction of reliable numerical discretizations of continuous dynamical systems arising as models for different natural phenomena, with a focus on schemes which correctly replicate the properties of the original dynamical systems. The work is based on the new concept of topological dynamic consistency, which describes in precise terms the alignment of the properties of the discrete dynamical system and the approximated continuous dynamical system. The derivation of structurally stable numerical schemes via the nonstandard finite difference method is also demonstrated.
1
Introduction
Let W be an open subset of $\mathbb{R}^d$, $d \ge 1$. Consider the initial value problem

$$\frac{dy}{dt} = f(y), \qquad (1)$$
$$y(0) = x, \qquad (2)$$
where x ∈ W and f ∈ C 0 (W, W ). We assume that (1) defines a (positive) dynamical system on D ⊂ W . This means that for every x ∈ D the problem (1)–(2) has a unique solution y = y(x, t) ∈ D for all t ∈ [0, ∞). In a typical setting D is a compact subset of Rd , but we do not need to make upfront such an assumption. For a given t ∈ (0, ∞), the mapping S(t) : D → D given by S(t)(x) → y(x, t) is called the evolution operator and the set {S(t) : t ∈ (0, ∞)}
(3)
is the well-known evolution semi-group. For every x ∈ D the set {S(t)(x) : t ∈ (0, ∞)} is called the (positive) orbit of x. Suppose that the solution of (1)–(2) is approximated on the time grid {tk = kh : k = 0, 1, ...}, where h is the time step, by a difference equation of the form yk+1 = F (h)(yk ), y0 = x,
(4) (5)
where the maps F (h) : D → D are defined for every h > 0. Hence, for every given h > 0, equation (4) defines a discrete dynamical system with an evolution
semi-group {(F (h))k : k = 1, 2, ...}. The orbit of a point x ∈ D is the sequence {(F (h))k (x) : k = 0, 1, 2, ...}. The main aim of our investigation is the alignment of the properties of the systems (1)–(2) and (4)–(5). In this conceptual setting the structural stability of the evolution operators (3) of the original system and the evolution maps of its discretization, that is, {F (h) : h ∈ (0, ∞)} (6) plays an important role. More precisely, it is a typical setting in Numerical Analysis to solve a well-posed problem by a stable numerical method. This, in particular, means that small changes of the data change neither the solution of the original problem nor the approximate solution by the numerical method in significant ways. Therefore, looking at the problem and its discretization as dynamical systems one would require by analogy that small changes of the data do not affect in a significant way the properties of these dynamical systems. This, in essence, means that they both need to be structurally stable. Definition 1. Let V be a topological space of maps from X to X. A map f ∈ V is called V structurally stable if there exists a neighborhood U of f in the topology of V such that every map g ∈ U is topologically equivalent to f , that is, there exists a homeomorphism μ : X → X such that f ◦ μ = μ ◦ g.
(7)
In the general theory of topological dynamics the space V is typically $C^1(X, X)$. For models in practical applications this space is not always applicable. Hence the generalization given in Definition 1 is essential. Structural stability for flows is defined in a similar way, where topological equivalence is replaced by orbit equivalence. It is important to note the different nature of the dynamical systems defined by (1) and (4). The first is a continuous dynamical system (a flow), while the second one is a discrete dynamical system. Most of the concepts of dynamical systems are defined for both continuous and discrete systems but very often, as is the case with structural stability as well, the meaning is different. In particular, let us note that even if a flow is structurally stable, the maps S(t1) and S(t2) are not necessarily topologically equivalent when t1 ≠ t2. To simplify the matters and enhance the main ideas, we assume here that equation (1) is such that the maps S(t), t > 0, are all topologically equivalent to each other. This is for example the case of Morse-Smale flows with fixed points only. In this setting the requirement that the flow defined by (1) is structurally stable can be replaced with the stronger requirement that the maps S(t), t > 0, are structurally stable.
2
Topological Dynamic Consistency
The concept of topological dynamic consistency, introduced in [4], describes in precise terms the alignment of the properties of the discrete dynamical system and the approximated continuous dynamical system. This development is
556
R. Anguelov
based on the important observation that the “dynamics” of dynamical systems, e.g. fixed points, periodic orbits, non-wandering sets, and the way these final states are approached, are topological properties. The similarity of a continuous dynamical system with the associated numerical methods, viewed as discrete dynamical systems, is in the sense of topological equivalence between the corresponding evolution operators. Definition 2. The difference scheme (4) is called topologically dynamically consistent with the dynamical system (1), whenever all the maps in the set (3) are topologically equivalent to each other and every map in the set (6) is topologically equivalent to them. (Thus, the maps S(t) and F (h) are topologically the same for every t > 0 and h > 0). The next theorem provides a straightforward way of proving topological dynamic consistency via the structural stability of the evolution operators of the original system and its numerical method. Theorem 1. Let V be a topological space of maps from D to D such that it contains the sets (3) and (6) and the mappings S : (0, ∞) → V and F : (0, ∞) → V are both continuous. Let also the following conditions hold: (i) for each t > 0 the map S(t) is V structurally stable; (ii) for each h > 0 the map F (h) is V structurally stable; (iii) there exists h > 0 such that S(h) and F (h) are topologically equivalent. Then the numerical method (4) is topologically dynamically consistent with the dynamical system (1). Proof. The topological equivalence of maps is an equivalence relation in the space V. Thus, we need to prove that maps in the two sets (3) and (6) belong to the same equivalence class. Let us see first that condition (i) implies that the maps S(t), t > 0, are all topologically equivalent. Let 0 < t1 < t2 . From the structural stability of the maps S(t), t > 0, it follows that for every t ∈ [t1 , t2 ] there exists a neighborhood Wt of S(t) in V such that S(t) ∼ g for every g ∈ Wt . Since the map S : (0, ∞) → V is continuous, for every t ∈ (0, ∞) there exists δt such that S(τ ) ∈ Wt whenever |τ − t| < δt . Then t∈[t1 ,t2 ] (t − δt , t + δt ) is an open cover of [t1 , t2 ]. By the compactness of [t1 , t2 ] it follows that there exists a finite set {τ1 , τ2 , ..., τk } such that [t1 , t2 ] ⊂
k
(τi − δτi , τi + δτi ).
i=1
Without loss of generality we may assume that τ1 = t1 , τk = t2 and that the set is arranged in increasing order, that is, τi < τi+1 . For an arbitrary pair τi , τi + 1 there exists τi+ 12 such that τi+ 12 ∈ (τi − δτi , τi + δτi ) ∩ (τi+1 − δτi+1 , ti+1 + δτi+1 ). Therefore S(τi+ 12 ) ∈ Wτi ∩ Wτi+1 , which implies that Sτi ∼ Sτi+1 . Then using induction we obtain that Sτ1 ∼ Sτk or equivalently St1 ∼ St2 . Hence all maps S(t), t > 0, are topologically equivalent to each other.
Structurally Stable Numerical Schemes for Applied Dynamical Models
557
Using a similar argument we obtain from condition (ii) that all maps F (h), h > 0 are topologically equivalent to each other. The statement of the theorem then follows from (iii). Remark 1. Properties (i) and (ii) in Theorem 1 are attributes of the problem and the method respectively. Property (iii) provides a link between the problem and the method which may be considered an analogue of the consistency of the scheme since the value of h in (iii) is typically unknown and is described as sufficiently small. This property is established for a variety of numerical methods in the works of B.M. Garay and M-C Li, see [5,8]. In all cases the topological equivalence is established for sufficiently small h. We should emphasize the fact that Theorem 1 provides for topological equivalence of the respective evolution operators for all values of h and t, thus implying the topological dynamic consistency of the scheme. The main ingredient in Theorem 1 is the structural stability of the involved maps. Deriving sufficient conditions for structural stability of flows and diffeomorphisms is one of the greatest achievements of Topological Dynamics with main contributions from Anosov, Moser, Palis, Smale, Robin, and Robinson, [7]. The current form of the structural stability theorem, proved by Robinson, [12,13], states that flows and diffeomorphisms on a compact manifold without boundary which satisfy Axiom A and the strong transversality condition are structurally stable. In particular, Morse-Smale flows and diffeomorphisms on a compact manifold are structurally stable, see [14, Theorem 12.2]. In all these theorems the structural stability is considered with respect to the space C 1 (X, X). Hence, and as mentioned in the Introduction, their practical application could be rather limited. For example, the evolution operator of the epidemiological model considered in the next section is not C 1 structurally stable since it has an invariant set on the boundary and a fixed point which is not necessarily hyperbolic. Our approach in such cases is to design for the given model an appropriate space V so that the respective evolution operator is structurally stable. Then we construct a discretization such that the evolution operator of the numerical scheme is in V and is also V structurally stable. Then the topological dynamic consistency of the scheme follows from Theorem 1.
3
SEIR(→S) Model
As a model example for the application of the theory derived so far we consider a basic compartmental model for the spread of an infectious disease in a given population. The course of the disease is schematically represented as S −→ E −→ I −→ R ( −→ S), where S denotes the number of susceptible individuals, E — the number of exposed (carriers which are not yet infective), I — the number of infectives, and R — the number of recovered with immunity. The following mathematical
558
R. Anguelov
model is derived in [16, Chapter 21] as a system of differential equations for the S fractions of the respective classes in the total population N , that is, u = N , E I R x = N, y = N, z = N ⎧ du ⎪ ⎪ ⎪ ⎪ dt ⎪ ⎪ ⎪ ⎪ dx ⎪ ⎨ dt dy ⎪ ⎪ ⎪ ⎪ ⎪ dt ⎪ ⎪ ⎪ ⎪ ⎩ dz dt
= ν(1 − u) − uy + ηz = uy − (ξ + ν)x (8) = ξx − (θ + ν)y = θy − (η + ν)z
The time is scaled in such a way that the coefficient of the nonlinear term uy representing the mass action principle for the spread of the infection equals one. The nonnegative constants ξ, θ, and η model the transfer rates between the respective compartments, while ν is linked to the life expectancy under the assumption of constant population. The system of ODEs (8) defines a dynamical system on the three dimensional closed simplex G = {(u, x, y, z) : u ≥ 0, x ≥ 0, y ≥ 0, z ≥ 0, u + x + y + z = 1}. Hence, one can eliminate one of the variables, e.g. u, and obtain the system in the following equivalent form ⎧ dx ⎪ ⎪ = (1 − x − y − z)y − (ξ + ν)x ⎪ ⎪ ⎪ ⎨ dt dy (9) = ξx − (θ + ν)y ⎪ dt ⎪ ⎪ ⎪ ⎪ ⎩ dz = θy − (η + ν)z dt where (9) defines a dynamical system on the compact domain D = {(x, y, z) : x ≥ 0, y ≥ 0, z ≥ 0, x + y + z ≤ 1}. The point (0, 0, 0) is always an equilibrium of (9). This is the Disease Free Equilibrium (DFE). The system may have another equilibrium, namely, ze =
θν(R0 − 1) ν +η ν +θ ze , xe = ye , , ye = ν + η − ηθR0 θ ξ
(10)
ξ where R0 = (ν+ξ)(ν+θ) is the basic replacement ratio. The point in (10) is an equilibrium of the dynamical system (9) whenever it belongs to its domain D, that is, when R0 > 1. It is called an Endemic Equilibrium (EE) since it describes a permanent presence of the disease. Our concern here are the properties of the
Structurally Stable Numerical Schemes for Applied Dynamical Models
559
dynamical system (9). It was proved in [16, Theorem 21.2] that: • If R0 ≤ 1 then DFE is globally asymptotically stable on D.
(11)
• If R0 > 1 then DFE is a hyperbolic saddle point with stable manifold Γ = {(x, y, z) ∈ D : x = y = 0}, EE is stable and attracting with basin of attraction D \ Γ .
(12)
It is easy to see that the maps S(t) for the dynamical system (9) are not C 1 structurally stable due to the fact that they have a fixed point and an orbit on the boundary of the domain D. In order to obtain structural stability for S(t) we consider the following smaller space ⎫ ⎧ 1) g : D → g(D) is a diffeomorphism ⎬ ⎨ V = g : D → D : 2) DFE is a fixed point of g ⎩ 3) Γ is invariant and in the stable manifold of DFE ⎭ Note that all the maps in V have the “unstable” features of S(t), which leads to the following theorem. Theorem 2. For every t > 0 the evolution operator S(t) is V structurally stable provided (ν + ξ + θ)(ν + η) > θξ. (13) The proof is carried out essentially by the same method as the proof of [6, Theorem 16.3.1], where the structural stability of flows with a single globally attractive fixed point is established. Condition (13) ensures that whenever DFE∈ D \ Γ , its basin of attraction is D \ Γ , see [16, Theorem 21.11]. Hence it can be replaced by any other condition implying the said property, e.g. η = 0, see [16, Theorem 21.12]
4
V-Structurally Stable Scheme by the Nonstandard Finite Difference Method
Traditionally the main concerns of Numerical Analysis are the stability, convergence and rate of convergence of numerical methods. While the importance of convergence is not in doubt, it often happens that essential properties of the approximated models are lost in the discretization of the differential equations. These may include the physical laws (e.g. conservation of mass, energy or momentum) used to construct the differential models. The main contribution of the Non-Standard Finite Difference Method to the field of Numerical Analysis is the conceptual base and the tools for preserving essential physical properties of the models, [9,10,3]. There has been a considerable effort in recent years to construct numerical procedures which correctly replicate the properties of the original dynamical system by using the non-standard finite difference method. In fact the concept of dynamic consistency, which was made precise in the recent works of the author and his collaborators, originally appeared in the context of
560
R. Anguelov
this method, [1,11]. The following scheme, which uses nonlocal approximation of the nonlinear term, is crafted in such a way that the operator F (h) is in V and it is also V structurally stable. ⎧ uk+1 − uk ⎪ ⎪ = −uk+1 yk + νxk+1 + νyk+1 + (η + ν)zk+1 ⎪ ⎪ h ⎪ ⎪ ⎪ ⎪ xk+1 − xk ⎪ ⎪ = uk+1 yk − (ξ + ν)xk+1 ⎨ h (14) yk+1 − yk ⎪ ⎪ ⎪ = ξx − (θ + ν)y k+1 k+1 ⎪ ⎪ h ⎪ ⎪ ⎪ ⎪ z − zk k+1 ⎪ ⎩ = θyk+1 − (η + ν)zk+1 h Let us note that we discretize the four equation form (8) of the dynamical system since in this way the method is more convenient for both implementation and theoretical analysis. This does not increase the computational complexity since the values of uk+1 need to be calculated anyway. The method is implicit but any time step involves only the solution of a linear system ⎛ ⎞ ⎛ ⎞ uk+1 uk ⎜ xk+1 ⎟ ⎜ xk ⎟ ⎟ ⎜ ⎟ C(yk ) ⎜ (15) ⎝ yk+1 ⎠ = ⎝ yk ⎠ , zk+1 zk where
⎛
⎞ 1+hyk −hν −hν −h(η+ν) ⎜ −hyk 1+h(ξ+ν) ⎟ 0 0 ⎟ C(yk ) = ⎜ ⎝ 0 ⎠ −hξ 1+h(θ+ν) 0 0 0 −hθ 1+h(η+ν)
From this formulation it is easy to see that starting from an initial condition in G the approximations remain nonnegative for all k = 1, 2, ... and that uk+1 + xk+1 + yk+1 + zk+1 = uk + xk + yk + zk = 1.
0.5
R =0.93 0
0.25
R =3.9
0.4
0
0.2 0.3
x y z
0.15 x y z x
0.2
0.1
y z 0 0
5
10
15
20
0.1 e
0.05
e e
25
t
Fig. 1. Approximate solution, R0 = 3.9
0 0
5
10
15
20
25
30
35
t
Fig. 2. Approximate solution, R0 = 0.93
Structurally Stable Numerical Schemes for Applied Dynamical Models
561
Hence the scheme (14) defines a discrete dynamical system on G. Therefore, the respective difference operator F (h) for the system (9) maps D onto D. Further one can see that DEF and EE are preserved together with their stability. Therefore, F (h) ∈ V. The proof of the V structural stability of F (h) is rather technical but it uses similar arguments as in Theorem 2.The approximate solutions for two sets of constants are presented in Figs. 1 and 2. In Fig. 1 we have R0 = 3.9 > 1 while in Fig. 2 R0 = 0.93 < 1 and one may observe that the properties of the exact solutions in (11) and (12) are correctly replicated. We note that it may happen that standard methods also preserve the stated properties. However, in general, this cannot be guaranteed or at least cannot be guaranteed for all step sizes. Examples to that effect for similar systems can be found in [2,4]. See also [3,9,10] for a general discussion on the issue.
References 1. Al-Kahby, H., Dannan, F., Elaydi, S.: Non-standard discretization methods for some biological models. In: Mickens, R.E. (ed.) Applications of nonstandard finite difference schemes, pp. 155–180. World Scientific, Singapore (2000) 2. Anguelov, R.: Dynamically Consistent Schemes for Epidemiological Models. In: Proceedings of BGSIAM 2008, Demetra, Sofia, pp. 11–14 (2009) 3. Anguelov, R., Lubuma, J.M.-S.: Contributions to the mathematics of the nonstandard finite difference method and applications. Numerical Methods for Partial Differential Equations 17, 518–543 (2001) 4. Anguelov, R., Lubuma, J.M.-S., Shillor, M.: Dynamically consistent nonstandard finite difference schemes for continuous dynamical systems. In: AIMS Proceedings (to appear) 5. Garay, B.M.: On structural stability of ordinary differential equations with respect to discretization methods. Numerische Mathematik 72, 449–479 (1996) 6. Hirsch, M.W., Smale, S.: Differential equations, dynamical systems and linear algebra. Academic Press, New York (1974) 7. Katok, A., Hasselblatt, B.: Introduction to the Modern Theory of Dynamical Systems. Cambridge University Press, New York (1995) 8. Li, M.-C.: Structural stability for the Euler method. SIAM Journal of Mathematical Analysis 30, 747–755 (1999) 9. Mickens, R.E. (ed.): Applications of Nonstandard Finite Difference Schemes. World Scientific, Singapore (2000) 10. Mickens, R.E. (ed.): Advances in the applications of nonstandard finite difference schemes, pp. 1–9. World Scientific, Singapore (2005) 11. Mickens, R.E.: Discrete models of differential equations: the roles of dynamic consistency and positivity. In: Allen, L.J.S., Aulbach, B., Elaydi, S., Sacker, R. (eds.) Difference equations and discrete dynamical systems, pp. 51–70. World Scientific, Singapore (2005) 12. Robinson, C.: Structural stability of C 1 diffeomorphisms. Journal of Differentrial Equations 22, 238–265 (1976) 13. Robinson, C.: Structural stability on manifolds with boundary. Journal of Differentrial Equations 37, 1–11 (1980)
562
R. Anguelov
14. Robinson, C.: Dynamical systems: stability symbolic dynamics and chaos, 2nd edn. CRC Press, Boca Raton (1999) 15. Stuart, A.M., Humphries, A.R.: Dynamical systems and numerical analysis. Cambridge University Press, New York (1998) 16. Thieme, H.R.: Mathematics of population biology. Princeton University Press, Princeton (2003)
Matrix and Discrete Maximum Principles Istv´an Farag´ o E¨ otv¨ os Lor´ and University, P´ azm´ any P. s. 1/c, 1117 Budapest, Hungary
Abstract. Qualitative properties play central role in constructing reliable numerical models for parabolic problems. One of such basic properties is the discrete maximum principle. In this paper we analyze its relation to the so-called matrix maximum principles. We analyze the different matrix maximum principles (Ciarlet, Stoyan and Ciarlet-Stoyan maximum principles) and their relation. Introducing the iterative algebraic problem (IAP) we show that the discrete maximum principles for discrete parabolic problems are more general than the algebraic maximum principles. We also analyze and compare the conditions which ensure the above qualitative properties. Keywords: Heat equation, qualitative properties, discrete maximum principle, linear finite element, finite difference method. Subject Classifications: 65N06, 65N30.
1
Introduction
When we construct mathematical and/or numerical models in order to describe or solve real-life problems, these models should have certain qualitative properties which typically arise from some basic principles of the modelled phenomena. In other words, it is important to preserve the characteristic properties of the original process, i.e., the models have to possess the equivalents of these properties. In this paper we investigate the parabolic problem C
∂u − ∇(κ∇u) = f, on QT = Ω × [0, T ], ∂t u|ΓT = g,
(1)
(2)
where C : Ω → IR is a given bounded function with the property 0 < Cmin ≤ C ≡ C(x) ≤ Cmax , the known bounded function κ : Ω → IR has continuous first derivatives and fulfills the property 0 < κmin ≤ κ ≡ κ(x) ≤ κmax , the function g : ΓT → IR is continuous on ΓT , the function f : QT → IR is bounded in QT , ∇
The work was supported by the Hungarian Research Grant OTKA K 67819 and by Project HS-MI-106/2005 of NSF of Bulgaria.
I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 563–570, 2010. c Springer-Verlag Berlin Heidelberg 2010
564
I. Farag´ o
denotes the usual nabla operator, and the solution u is sought in C 2,1 (QT ∪ ΓT ). (Later we use the notation u∂ (x, t) := g(x, t) for x ∈ ∂Ω and u0 (x) := g(x, 0) for x ∈ Ω.) The problem (1)-(2) is generally applied for the description of heat conduction processes. The variables x and t play the role of the space and time variables, respectively. Problem (1)-(2) is suitable to describe diffusion and transport phenomena.
2
The Discrete Heat Problem and the Iterative Algebraic Problems
In order to construct the numerical model, we introduce a mesh where we denote the interior mesh points by P1 , . . . , PN ∈ Ω, and the points on the boundary by PN +1 , . . . , PN +N∂ ∈ ∂Ω, respectively. Using the finite difference or finite element method for the space discretization and the θ-method with the step-size Δt for the time discretization, we arrive at the one-step iterative method of the form (M + θΔtK)u(n) = (M − (1 − θ)ΔtK)u(n−1) + Δt f (n,θ)
(3)
where M and K are known matrices and θ ∈ [0, 1] is a fixed given parameter. In the sequel, we denote M + θΔtK and M − (1 − θ)ΔtK by A and B, respectively, and we use the following partitions of the matrices and vectors: (n) u0 (n) A = [A0 |A∂ ], B = [B0 |B∂ ], u = (n) , u∂ where A0 and B0 are square matrices from IRN ×N ; A∂ , B∂ ∈ IRN ×N∂ , u0 ∈ (n) IRN denotes the unknown approximation and u∂ ∈ IRN∂ is known from the boundary condition, i.e., from the given function g. Then, iteration (3) can be written as (n) (n−1) u0 u0 (n) [A0 |A∂ ] . (4) (n) = [B0 |B∂ ] (n−1) + f u∂ u∂ (n)
The relation (4) defines a one-step iterative process (n)
A0 u0
(n−1)
= B0 u0
(n−1)
+ B∂ u∂
(n)
− A∂ u∂ + f (n) ,
(5)
which is defined on the set of the given vectors (n−1)
H = {u0
(n−1)
, f (n) ∈ IRN , u∂
(n)
, u∂
(n)
∈ IRN∂ }
(6)
and we want to find the only unknown vector u0 . We will refer to the problem (5) as iterative algebraic problem (IAP). Similarly to the continuous problem, the preservation of the qualitative properties in the IAP is a crucial task too. To this aim, we introduce the following definitions. (For more details, we refer to [2].)
Matrix and Discrete Maximum Principles
565
Definition 1. We say that the IAP (5) is discrete non-negativity preserving (DNP) if the implication (n−1)
f (n) ≥ 0; u∂
(n)
≥ 0; u∂
(n−1)
≥ 0 and u0
(n)
≥ 0 ⇒ u0
≥0
(7)
is true. Definition 2. We say that the IAP (5) satisfies the discrete weak boundary maximum principle (DWBMP) when the implication (n)
f (n) ≤ 0 ⇒ max u0
(n−1)
≤ max{0, u0
(n−1)
, u∂
(n)
, u∂ }
(8)
is true. Definition 3. We say that the IAP (5) satisfies the discrete strong boundary maximum principle (DSBMP), when the implication (n)
f (n) ≤ 0 ⇒ max u0
(n−1)
≤ max{u0
(n−1)
, u∂
(n)
, u∂ }
(9)
is true. In the following we analyze the relation of these qualitative properties to the properties defined for the partitioned matrices.
3
Matrix Maximum Principles and Their Relations
In the literature, for partitioned matrices with a certain structure some qualitative properties have been introduced (e.g., [1,3,6]). We consider the block-matrix H ∈ IRk×k and the block-vector y ∈ IRk in the form H1 H2 y1 H= , y= , (10) 0 I y2 where submatrices H1 ∈ IRk1 ×k1 , I ∈ IRk2 ×k2 , H2 ∈ IRk1 ×k2 , 0 ∈ IRk2 ×k1 , y1 ∈ IRk1 and y2 ∈ IRk2 with k = k1 + k2 . In the sequel, for arbitrary vectors v, w ∈ IRk , we will use the following notations: max{v} := max{v1 , v2 , . . . , vk }, max{0, v} := max{0, max{v}}.
(11)
According to the papers [1,5,6], we introduce some definitions. Definition 4. We say that the matrix H satisfies the Ciarlet matrix maximum principle (CMMP) if for arbitrary vectors y1 ∈ IRk1 and y2 ∈ IRk2 , such that H1 y1 + H2 y2 ≤ 0, the inequality max{y1 } ≤ max{0, y2 } holds. Definition 5. We say that the matrix H satisfies the Stoyan matrix maximum principle (SMMP) if for arbitrary vectors y1 ∈ IRk1 and y2 ∈ IRk2 , such that H1 y1 + H2 y2 = 0 and y2 ≥ 0, the inequality max{y1 } ≤ max{y2 } holds.
566
I. Farag´ o
The above definitions give information about the location of the maximum components of the unknown vector y ∈ IRk , using some a priori information: for the CMMP the non-negative maximum, while for the SMMP the maximum is taken over the last indices i = k1 + 1, k1 + 2, . . . , k, i.e., on the sub-vector y2 . Remark 1. If a matrix H satisfies the CMMP, then it is necessarily regular. To show this, it is enough to prove that the relation H1 y1 = 0 implies that y1 = 0. By the choice y2 = 0, the application of the CMMP implies the inequality y1 ≤ 0. Repeating this argument for −y1 , we obtain −y1 ≤ 0, which shows the validity of the required equality. The similar statement holds for the SMMP, too. The CMMP and SMMP properties can be guaranteed in the following way. Theorem 1 ([1]). The matrix H satisfies the CMMP if and only if the following two matrix conditions hold: (C1): H is monotone, i.e., H−1 =
−1 H−1 1 −H1 H2 0 I
≥ 0;
(12)
(C2): using the notation ekm ∈ IRkm (m = 1, 2) for the vectors with all coordinates equal to one, we have −H−1 1 H2 ek2 ≤ ek1 .
(13)
The condition (C2) can be relaxed by the following sufficient condition (C2’): the row sums of the matrix H are all non-negative, i.e., H1 ek1 + H2 ek2 ≥ 0.
(14)
Remark 2. The CMMP obviously implies the SMMP. Therefore, the above conditions also guarantee the SMMP property. It is worth mentioning that for the un-partitioned matrix H the monotonicity and the condition He ≥ 0 (which is, in fact, the analogue of the condition (14)) are necessary conditions for validity of the SMMP. We can combine the CMMP and the SMPP as follows: we require that under the CMMP condition the implication in the SMMP is true, i.e., we introduce Definition 6. We say that the matrix H satisfies the Ciarlet-Stoyan matrix maximum principle (CSMMP) if for arbitrary vectors y1 ∈ IRk1 and y2 ∈ IRk2 , such that H1 y1 + H2 y2 ≤ 0, the relation max{y1 } ≤ max{y2 } holds. Obviously, the CSMMP implies both the CMMP and SMMP properties. This property can be guaranteed by the following statement (cf. [4]). Lemma 1. A matrix H has the CSMMP property if and only if it is monotone and the condition (15) H1 ek1 + H2 ek2 = 0 holds.
Matrix and Discrete Maximum Principles
567
Proof. First we assume that H is monotone and (15) holds. Let y1 ∈ IRk1 and y2 ∈ IRk2 be arbitrary vectors with the property H1 y1 + H2 y2 ≤ 0. Since H is −1 monotone, therefore H−1 1 ≥ 0 and −H1 H2 ≥ 0 (cf. (12)). Therefore we have −1 −1 y1 ≤ −H−1 1 H2 y2 ≤ −H1 H2 (max{y2 }ek2 ) = −(max{y2 })H1 H2 ek2 .
(16)
Due to the assumption (15), the relation (16) implies that y1 ≤ (max{y2 })ek1 ,
(17)
which proves the first part of the statement. Now we assume that H has the CSMMP property. Since the CSMMP property implies the CMMP property, therefore, due to the Theorem 1, H is monotone. In order to show (15), first, let us put y1 = −H−1 1 H2 ek2 and y2 = ek2 . Since for this choice the CSMMP is applicable, we get the estimation max{−H−1 1 H2 ek2 } ≤ max{ek2 } = 1. Let us put now y1 = H−1 1 H2 ek2 and y2 = −ek2 . The CSMMP is again applicable and we get the estimation max{H−1 1 H2 ek2 } ≤ max{−ek2 } = −1. The above two estimations clearly result in the equality −H−1 1 H2 ek2 = ek1 , which yields the required relation.
4
MMP to Linear Algebraic Systems of Special Form
We use the notation b for the vector Hy ∈ IRk , i.e., we consider the system Hy = b. Hence, b has the partitioning
b=
b1 b2
(18)
,
(19)
where b1 ∈ IRk1 and b2 ∈ IRk2 , respectively. Moreover, let (n) (n) ¯f A −B u H= , y= , b= , (n−1) 0 I u u(n−1) with A=
A0 A∂ 0 I
, B=
B0 B∂ 0 I
, u
(n)
=
(n)
u0 (n) u∂
, ¯f (n) =
(20)
f (n) (n) f∂
. (21)
Here A0 , B0 ∈ IRN ×N , A∂ , B∂ ∈ IRN ×N∂ , u0 , f (n) ∈ IRN and u∂ , f∂ ∈ IRN∂ . ¯ ¯ ¯ Then in (18) H ∈ IR2N ×2N , y, b ∈ IR2N . The problem (18),(20),(21) is equivalent to the system of linear algebraic equations of the form (n)
Au(n) = Bu(n−1) + ¯f (n) . (n)
(n)
(n−1)
(n)
(n)
(22)
, then (22) becomes equivalent to (5). If we use the notation f∂ = u∂ − u∂ We will refer to this problem as the canonical algebraic problem (CAP). The qualitative properties of this problem can be defined as follows.
568
I. Farag´ o
Definition 7. We say that the CAP is non-negativity preserving, when b ≥ 0 results in the relation y ≥ 0, that is, the implication (n−1)
f (n) ≥ 0; u0
(n)
≥ 0 and u∂
(n−1)
≥ u∂
(n)
≥ 0 ⇒ u0
≥0
(23)
is true. Definition 8. We say that the CAP satisfies the Ciarlet maximum principle, when the matrix H has the CMMP property, i.e., the implication (n)
f (n) ≤ 0, u∂
(n−1)
≤ u∂
(n)
(n−1)
⇒ max{u0 } ≤ max{0, u0
(n−1)
, u∂
(n)
, u∂ } (24)
is true. Analogically, we introduce the following Definition 9. We say that the CAP satisfies the Stoyan maximum principle, when the correspondig matrix H by (20),(21), has the SMMP property, which yields that the implication (n−1)
f (n) = 0, u0 ⇒
(n) max{u0 }
≤
(n)
≥ 0, u∂
(n−1)
= u∂
≥0
(n−1) (n−1) (n) max{u0 , u∂ , u∂ }
(25)
is true. It follows from the definitions that the validity of the Ciarlet maximum principle implies the validity of the Stoyan maximum principle. First we investigate the non-negativity preservation property of the CAP. Since, according to (12), −1 −1 A A B H−1 = , (26) 0 I we need the monotonicity of the matrix H, which is valid if and only if the following conditions hold: A−1 ≥ 0;
A−1 B ≥ 0.
(27)
(This yields that the matrices A and B must form a weak regular splitting of the matrix A − B.) Using (21), we get that (27) is valid if and only if the relations A−1 0 ≥ 0,
−A−1 0 A∂ ≥ 0,
A−1 0 B0 ≥ 0,
A−1 0 (B∂ − A∂ ) ≥ 0
(28)
are true. Hence, the following statement is true. Lemma 2. The CAP is non-negativity preserving if and only if (28) is true. We pass to the investigation of the Ciarlet maximum principle property of the CAP. Due to Theorem 1, it is sufficient to require the monotonicity of the matrix H and the relation (A − B)e ≥ 0. (29)
Matrix and Discrete Maximum Principles
569
Substituting (21), the inequality (29) yields the condition (A0 − B0 )e0 + (A∂ − B∂ )e∂ ≥ 0.
(30)
Hence we get Lemma 3. The CAP satisfies both the Ciarlet and Stoyan maximum principle properties if the conditions (28) and (30) are satisfied.
5
Qualitative Properties of IAP
The discretization (by using finite element/difference method) results in a CAP (n) (n) (n−1) with notation f∂ = u∂ − u∂ . (This means that f∂ is not a free parameter in the CAP.) In the sequel, we analyze the relation between the qualitative properties of the CAP and the IAP. We define some subsets of H in (6) (n−1)
H+ = {f (n) ≥ 0, u0 M H+
= {f
(n)
≥ 0,
(n−1)
≥ 0, u∂
(n)
(n) u∂
(n−1) u∂
≥ 0, u∂
(n−1) u0
≥ 0,
≥
C HDBMP = {f (n) ≤ 0}, HDW BMP = HDBMP (n−1)
S HDSBMP = {f (n) = 0, u0
(31) ≥ 0},
(n−1)
{ u∂
(n−1)
≥ 0, u∂
≥ 0},
(n)
= u∂
(n)
≥ u∂ },
(32)
≥ 0}.
(33)
The inclusions M C S H+ ⊃ H+ ; HDBMP ⊃ HDW BMP ; HDBMP ⊃ HDSBMP
(34)
are obvious. For the CAP the different qualitative properties (non-negativity preservation, Ciarlet maximum principle, Stoyan maximum principle) are deM S C , HDSBMP , and HDSBMP , respectively. For the IAP, fined on the sub-sets H+ the corresponding qualitative properties (non-negativity preservation, DWBMP, DSBMP) are defined on the wider subsets H+ and HDBMP , respectively. Moreover, if some qualitative property is guaranteed for the IAP, then the corresponding CAP also possesses this qualitative property on the smaller subset, where it is defined. Hence, from the definitions we have Theorem 2. Using the notations f (x, t) for the source (forcing) function, u0 (x) for the initial function and u∂ (x, t) for the boundary function, Table 1 shows the applicability of IAP and CAP to the definition of the different qualitative properties in the initial first boundary value problem for the time-dependent linear PDE. We may observe that the applicability of the Ciarlet and Stoyan maximum principles is rather restrictive: the first one can be applied only for a problem with sign-determined source function and to boundary conditions decreasing in time
570
I. Farag´ o
Table 1. Conditions for the given data of the continuous problem providing different qualitative properties
DNP NPCAP DWBMP Ciarlet DSBMP Stoyan
f u0 u∂ non-negative non-negative non-negative non-negative non-negative non-negative and time-decreasing non-positive any any non-positive any time-decreasing non-positive any any zero non-negative non-negative and time-independent
(e.g., time-independent). The Stoyan maximum principle gives some information about the maximum (minimum) only for a homogeneous equation with non-negative initial and time-independent, non-negative boundary conditions. If one of the above conditions does not hold, we must apply another principle. We can also compare the conditions which guarantee the qualitative properties. Definition 1 yields that the IAP (5) has DNP property if and only if the conditions A−1 0 ≥ 0;
A−1 0 B∂ ≥ 0,
A−1 0 B0 ≥ 0,
−A−1 0 A∂ ≥ 0
(35)
are satisfied. This yields that the conditions (35) imply the conditions (28), i.e., the non-negativity preservation property of the IAP implies the non-negativity preservation property of the CAP. Similar results can be shown for the different discrete maximum principles, too.
References 1. Ciarlet, P.G.: Discrete maximum principle for finite-difference operators. Aequationes Math. 4, 338–352 (1970) 2. Farag´ o, I., Horv´ ath, R.: Discrete maximum principle and adequate discretizations of linear parabolic problems. SIAM Sci. Comput. 28, 2313–2336 (2006) 3. Fujii, H.: Some remarks on finite element analysis of time-dependent field problems. In: Theory and practice in finite element structural analysis, pp. 91–106. Univ. Tokyo Press, Tokyo (1973) 4. Kar´ atson, J., Korotov, S.: Discrete maximum principles in finite element solutions of nonlinear problems with mixed boundary conditions. Numer. Math. 99, 669–698 (2005) 5. Stoyan, G.: On a maximum principle for matrices and on conservation of monotonicity with applications to discretization methods. Z. Angew. Math. Mech. 62, 375–381 (1982) 6. Stoyan, G.: On maximum principles for monotone matrices. Lin. Alg. Appl. 78, 147–161 (1986)
On a Bisection Algorithm That Produces Conforming Locally Refined Simplicial Meshes Antti Hannukainen1 , Sergey Korotov2, and Michal Kˇr´ıˇzek3 1
2
Institute of Mathematics, Helsinki University of Technology P.O. Box 1100, FI–02015 Espoo, Finland
[email protected] Department of Mathematics, Tampere University of Technology P.O. Box 553, FI–33101 Tampere, Finland
[email protected] 3 Institute of Mathematics, Academy of Sciences ˇ a 25, CZ–115 67 Prague 1, Czech Republic Zitn´
[email protected]
Abstract. First we introduce a mesh density function that is used to define a criterion to decide, where a simplicial mesh should be fine (dense) and where it should be coarse. Further, we propose a new bisection algorithm that chooses for bisection an edge in a given mesh associated with the maximum value of the criterion function. Dividing this edge at its midpoint, we correspondingly bisect all simplices sharing this edge. Repeating this process, we construct a sequence of conforming nested simplicial meshes whose shape is determined by the mesh density function. We prove that the corresponding mesh size of the sequence tends to zero for d = 2, 3 as the bisection algorithm proceeds. It is also demonstrated numerically that the algorithm seems to produce only a finite number of similarity-distinct triangles.
1
Introduction
Bisection algorithms which are very convenient for refining simplicial partitions were originally used for solving nonlinear equations [4,18]. Various properties of partitions generated by such algorithms were proved in a number of works in the 70-th [6,17,19,20]. Later, in the eighties, mainly due to efforts of M. C. Rivara, bisection-type algorithms became popular also in the FEM (finite element method) community for mesh refinement/adaptation purposes [12,13,15,16]. Several another variants of the algorithm suitable for standard FEMs were also proposed, analysed and numerically tested in [1,2,3,7,10,11] (see also references therein). A practical realization of bisection algorithms is much simpler than red, red-green, and green refinements of simplices to provide mesh conformity, especially in the case of local mesh refinements and in three or higher dimensions (see [9]). The guiding rules for mesh refinements/adaptivity often come from a posteriori error estimation which generally delivers estimates in the form of integrals I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 571–579, 2010. c Springer-Verlag Berlin Heidelberg 2010
572
A. Hannukainen, S. Korotov, and M. Kˇr´ıˇzek
over the solution domain. Thus, we usually have in hands a certain function over a given domain which dictates the actual mesh reconstruction, see e.g. paper [5]. Its general idea is essentially used in this work, where we propose to apply the longest-edge bisection algorithm, introduced in [12,13] and later analysed in [7], in the following (modified) form. We choose for bisection not the longest edge in the partition, but the edge which has a maximal value of its length multiplied by the value of mesh density function, which is defined a priori. Some properties of such a bisection algorithm are analyzed in this work. Our approach entirely differs from the others, as we do not need any postrefinements of meshes (which can be rather nontrivial task, see e.g. [14, p. 2228]) to provide their conformity. Let Ω ⊂ Rd be a bounded polygonal or polyhedral domain. By T we denote a usual conforming simplicial mesh of Ω, i.e., with no hanging nodes. Let E = E(T ) be the set of all edges of all simplices of T . A set F of meshes is said to be a family of meshes if for every ε > 0 there exists a mesh T ∈ F such that |e| < ε for all edges e ∈ E(T ), where | · | stands for the Euclidean norm.
2
Mesh Density Function
Local simplicial mesh refinements of Ω can be done by means of a priori given positive mesh density function m which is supposed here to be Lipschitz continuous over Ω, i.e., there exists a constant L such that |m(x) − m(y)| ≤ L|x − y|,
x, y ∈ Ω.
(1)
Ideally, it should be large over those parts of Ω, where we need a very fine mesh. On the other hand, m is defined to be small over those parts of Ω, where we do not need a fine mesh (see, e.g. Example 1 and Remark 2). From the positiveness and continuity of m we see that there exists a constant m0 such that 0 < m0 ≤ m(x)
∀x ∈ Ω.
(2)
Denote by Me the midpoint of the edge e and define the criterion function J(e) = |e| m(Me ).
(3)
We shall look for an edge e∗ ∈ E (see Remark 1 below) at which J attains its maximal value, i.e., (4) J(e∗ ) = max J(e). e∈E
Further, we bisect all simplices from T sharing e∗ through the midpoint Me∗ (see Figs. 1 and 2). For d = 3 each bisection plane contains the edge opposite to e∗ in a particular simplex. This refinement strategy will be called the generalized conforming bisection (GCB-) algorithm. It is used repeatedly to produce a family of conforming nested meshes. If m ≡ 1 (or if m is constant) then we get the standard conforming longest-edge bisection algorithm which has been recently analyzed in [7].
Algorithm That Produces Conforming Locally Refined Simplicial Meshes
573
Fig. 1. Generalized bisection when e∗ is inside Ω and at the boundary ∂Ω for d = 2. The dotted lines represent the last bisections.
Fig. 2. Bisection of all tetrahedra sharing the edge e∗
Remark 1. If there exists more than one edge for which J attains its maximum, we may choose e∗ randomly. (This happens, e.g., if all triangles in T are equilateral and m ≡ 1.) Another way would be to modify slightly the definition of J as follows J(ei ) = |ei |m(Mei ) + 10−10 i, ei ∈ E, i = 1, . . . , n, where n is the number of edges in E.
3
Convergence of the Mesh-Size for GCB-Algorithm
In [6], Kearfott proved for the longest-edge bisection algorithm (which however produces hanging nodes, in general, see [7, p. 1688]) that the largest diameter of all simplices tends to zero. In Theorem 2 below we prove the same result for our GCB-algorithm and d = 2, 3. Before that we show that the maximal value of J monotonically tends to zero after the “mesh size becomes sufficiently small”.
574
A. Hannukainen, S. Korotov, and M. Kˇr´ıˇzek
Theorem 1. For each newly generated edge e after one step of the GCBalgorithm applied to T we always have J(e ) ≤ 0.9 J(e∗ ), provided |e| ≤ 0.03
m0 L
∀e ∈ E(T ).
(5) (6)
Proof. Let e∗ be the edge satisfying (4) and (6). Let T ∈ T be a triangle which will be bisected for d = 2. If d = 3, then T will stand for one of the triangular faces containing e∗ of the tetrahedra from T that will be bisected. There will be three new edges in T : two halves of e∗ and the median to e∗ . Let e be the first (or second) half of e∗ . Then from the positiveness and Lipschitz continuity of m we obtain that 0.5 m(Me ) ≤ 0.9 m(Me∗ ). (7) Indeed, from (1), (6), and (2) we find that m(Me ) ≤ m(Me∗ ) + L|Me − Me∗ | 1 = m(Me∗ ) + L|e∗ | ≤ m(Me∗ ) + 0.8 m0 ≤ 1.8 m(Me∗ ). 4 Hence, (7) holds. Multiplying (7) by |e∗ | = 2|e |, we obtain (5), namely J(e ) = |e |m(Me ) ≤ 0.9 |e∗ |m(Me∗ ) = 0.9 J(e∗ ). Now let e ⊂ T be the median to e∗ and let the lengths of edges a, b, c of T satisfy |a| ≤ |b| ≤ |c|. (8) Consider three possible cases: 1) Let c = e∗ (see Fig. 3). Then by (6), (2), and (3) √ 3 0.09 3 2 L|c| ≤ |c|m0 ≤ 0.9 − J(c). 8 8 2
(9)
Let t = e be the median on the edge c. Since the angle opposite to c is greater than or equal to π3 , we have √ 3 |c|. (10) |t| ≤ 2 Applying (3) and also (10) twice, we find by the Lipschitz continuity of m that √ √ √ 3 3 3 J(e ) = |t|m(Mt ) ≤ |c|m(Mt ) = |c|m(Mc ) + |c|(m(Mt ) − m(Mc )) 2 √ √2 √2 √ |t| 3 3 3 3 ≤ J(c) + L|c||Mt − Mc | ≤ J(c) + L|c| 2 2 2 2 2 √ √ √ √ 3 3 1 3 3 3 ≤ J(c) + L|c| |c| ≤ J(c) + L|c|2 ≤ 0.9 J(c) = 0.9 J(e∗ ), 2 2 2 2 2 8
Algorithm That Produces Conforming Locally Refined Simplicial Meshes
575
C
b
a
t
A
c
c
2
2
B
Fig. 3. Bisection of a triangle T ∈ T for which c = e∗
where the last inequality follows from (9). Thus, (5) holds. 2) Assume now that b = e∗ and let u = e be the median on b. From (6) and (2) we see that √ √ 3 |e∗ | 3 L ≤ 0.03m0 ≤ 0.9 − m(Mc ). 2 4 2 From this, the Lipschitz continuity of m, and the fact that |Mu − Mc | = 4b we obtain √ √ √ 3 3 3 |e∗ | m(Mu ) ≤ m(Mc ) + L ≤ 0.9m(Mc ). 2 2 2 4 By (8) we can find that (see Fig. 4) √ 3 |u| ≤ |c|. (11) 2 The equality in (11) is attained when the triangle ABC is equilateral as marked in Fig. 4. Thus, √ 3 J(e ) = J(u) = |u|m(Mu ) ≤ |c|m(Mu ) ≤ 0.9|c|m(Mc ) 2 = 0.9 J(c) ≤ 0.9 J(b) = 0.9 J(e∗ ) and (5) holds again. 3) Finally, let a = e∗ . We shall distinguish two cases: a) Let |b| 2 ≤ |a|. Then |b| m0 ∗ ≤ |e | = |a| ≤ due to (6). From this, the Lipschitz continuity of m and 2 9L (2) we find further that 9m(Ma ) ≤ 9m(Mc ) + 9L
|b| ≤ 9m(Mc ) + m0 ≤ 10m(Mc ). 2
From above and the inequality J(c) ≤ J(a) we obtain that |c| ≤
m(Ma ) 10 |a| ≤ |a|. m(Mc ) 9
576
A. Hannukainen, S. Korotov, and M. Kˇr´ıˇzek C
B
a 2
c
a
b
u
v
a 2
A
b
b
2
2
C
A
Fig. 4. Admissible region for the vertex B. The position of B in its left corner yields the maximal value of the ratio |u| . |c|
This implies that (see Fig. 5) |v| ≤
1−
B
c
Fig. 5. A very small admissible region for the vertex C. The position of C in its right corner yields the . The maximal value of the ratio |v| |c| triangle ABC is almost equilateral.
√ 9 2 319 |c| = |c|. 20 20
(12)
The equality on the left-hand side of (12) is attained when the vertex C is in the 9 right corner (where |a| = 10 |c| and |b| = |c|) of the admissible region marked in Fig. 5. From (12), the positiveness and Lipschitz continuity of m, (4), and (6) we obtain √ √ 319 319 |a| J(e ) = J(v) = |v|m(Mv ) ≤ |c|m(Mv ) ≤ |c|(m(Mc ) + L ) 20 20 4 √ 319 0.03 ≤ |c|(m(Mc ) + m0 ) ≤ 0.9 |c|m(Mc ) 20 4 = 0.9 J(c) ≤ 0.9 J(a) = 0.9 J(e∗ ). b) Now, let |a| <
|b| 2 .
Then by (6) we have
1 1 |b| |b| |c| m(Ma ) ≤ (m(Mb ) + L ) = J(b) + L|b||c| 2 2 2 2 4 1 0.03 0.03 1 ≤ J(b) + m0 |b| ≤ J(b) + J(b) < J(b). 2 4 2 4
J(a) = |a|m(Ma ) <
However, this contradicts the relation J(e∗ ) = J(a) ≥ J(b). Hence, the case 3b) cannot happen. Theorem 2. The GCB-algorithm yields a family of nested conforming meshes whose longest edges tend to zero if the initial mesh satisfies (6).
Algorithm That Produces Conforming Locally Refined Simplicial Meshes
577
1 0.8 0.6 0.4 0.2 0 −0.2 −0.4 −0.6 −0.8 −1 −1
−0.5
0
0.5
1
Fig. 6. The initial mesh of Ω
Proof. Let T be a triangle (possibly also a face of a tetrahedron) that will be bisected. Then all three newly generated edges will be shorter than the longest edge c of T . Therefore, the length of the longest edge of the whole mesh is a nonincreasing function. Thus its limit exists as the bisection proceeds. We prove that this limit is zero. Due to Theorem 1, the value J(e∗ ) tends monotonically to zero, since for all newly generated edges we have J(e ) ≤ 0.9J(e∗ ) provided (6) holds. Therefore, J(e∗ ) = |e∗ |m(Me∗ ) ≥ |e|m(Me ) ≥ |e|m0 . Since m0 in (2) is positive, we find that also |e| → 0
4
Numerical Results
In this section we demonstrate the performance of GCB-algorithm, which illustrates its convergence even though condition (6) is not satisfied for the initial mesh. We also present numerical evidence that the number of similarity-distinct triangles generated by our method seems to be finite. Example 1. Consider the L-shaped domain Ω = (−1, 1) × (−1, 1) \ [0, 1) × [0, 1). Its initial mesh is illustrated in Fig. 6. Set m(x) =
1 . 1 + 4|x|
(13)
This function and the corresponding meshes from two different iterations are plotted in Fig. 7. Figures 6 and 7 nicely illustrate that the longest edge tends to zero as refinements proceed. Example 2. In [19,20] Martin Stynes assumes that all triangles are bisected at the same time. He showed that this repeated process yields only a finite number of similarity-distinct subtriangles, but also hanging nodes may appear. Our algorithm does not produce hanging nodes. Moreover, from Fig. 8 we observe
578
A. Hannukainen, S. Korotov, and M. Kˇr´ıˇzek 1
1
1
0.8
0.8
0.8
0.6
0.6
0.6
0.4
0.4
0.4
0.2
0.2
0.2
0
0
0
−0.2
−0.2
−0.2
−0.4
−0.4
−0.4
−0.6
−0.6
−0.6
−0.8
−0.8
−1 −1
−0.5
0
0.5
1
−0.8
−1 −1
−0.5
0
0.5
1
−1 −1
−0.5
0
0.5
1
Fig. 7. Mesh after 100 and 500 refinements for the mesh density function (13) (left and centre). The right picture shows the behaviour of function (13). 90
2.5
80 70
2
60 50 1.5 40 30 1
20 10 0
0.5 0
10
20
30
40
50
60
Fig. 8. Various colours indicate the value of the decimal logarithm of the number of similarity-distinct subtriangles for α ≤ β ≤ γ, α ∈ (0, π/3], β ∈ (0, π/2) and m ≡ 1
that the number of similarity-distinct subtriangles is also finite if Ω is a triangle that changes over all possible shapes. This result is proved in [7, p. 1691] under some angle conditions and m ≡ 1. Remark 2. In practice, m need not be Lipschitz continuous, but it can have jumps or singularities. The function m can also be modified during computational process. For instance, we may put m(x) = |˜ u(x) − uh (x)| to control adaptive mesh refinement for a posteriori error estimates. Here u˜ is a higher order approximation of the exact solution of some boundary value problem and uh is a numerical approximation of the true solution. The function u ˜ can be obtained, e.g., by some averaging superconvergence technique [8].
Algorithm That Produces Conforming Locally Refined Simplicial Meshes
579
Acknowledgement. This paper was supported by Institutional Research Plan nr. AV0Z 10190503 and Grant nr. IAA 100190803 of the Academy of Sciences of the Czech Republic. Also, the support of Projects 211512 and 124619 from the Academy of Finland is acknowledged. The authors are indebted to reviewers for very useful comments.
References 1. Adler, A.: On the bisection method for triangles. Math. Comp. 40, 571–574 (1983) 2. Arnold, D.N., Mukherjee, A., Pouly, L.: Locally adapted tetrahedral meshes using bisection. SIAM J. Sci. Comput. 22, 431–448 (2000) 3. B¨ ansch, E.: Local mesh refinement in 2 and 3 dimensions. IMPACT Comp. Sci. Engrg. 3, 181–191 (1991) 4. Eiger, A., Sikorski, K., Stenger, F.: A bisection method for systems of nonlinear equations. ACM Trans. Math. Software 10, 367–377 (1984) 5. Eriksson, K.: An adaptive finite element method with efficient maximum norm error control for elliptic problems. Math. Models Methods Appl. Sci. 4, 313–329 (1994) 6. Kearfott, R.B.: A proof of convergence and an error bound for the method of bisection in Rn . Math. Comp. 32, 1147–1153 (1978) 7. Korotov, S., Kˇr´ıˇzek, M., Krop´ aˇc, A.: Strong regularity of a family of face-to-face partitions generated by the longest-edge bisection algorithm. Comput. Math. Math. Phys. 48, 1687–1698 (2008) 8. Kˇr´ıˇzek, M., Neittaanm¨ aki, P.: On superconvergence techniques. Acta Appl. Math. 9, 175–198 (1987) 9. Kˇr´ıˇzek, M., Strouboulis, T.: How to generate local refinements of unstructured tetrahedral meshes satisfying a regularity ball condition. Numer. Methods Partial Differential Equations 13, 201–214 (1997) 10. Liu, A., Joe, B.: On the shape of tetrahedra from bisection. Math. Comp. 63, 141–154 (1994) 11. Liu, A., Joe, B.: Quality of local refinement of tetrahedral meshes based on bisection. SIAM J. Sci. Comput. 16, 1269–1291 (1995) 12. Rivara, M.-C.: Algorithms for refining triangular grids suitable for adaptive and multigrid techniques. Internat. J. Numer. Methods Engrg. 20, 745–756 (1984) 13. Rivara, M.-C.: Selective refinement/derefinement algorithms for sequences of nested triangulations. Internat. J. Numer. Methods Engrg. 28, 2889–2906 (1989) 14. Rivara, M.-C.: Lepp-algorithms, applications and mathematical properties. Appl. Numer. Math. 59, 2218–2235 (2009) 15. Rivara, M.-C., Iribarren, G.: The 4-triangles longest-side partition and linear refinement algorithm. Math. Comp. 65, 1485–1502 (1996) 16. Rivara, M.-C., Levin, C.: A 3D refinement algorithm suitable for adaptive and multigrid techniques. Comm. Appl. Numer. Methods Engrg. 8, 281–290 (1992) 17. Rosenberg, I.G., Stenger, F.: A lower bound on the angles of triangles constructed by bisection of the longest side. Math. Comp. 29, 390–395 (1975) 18. Sikorski, K.: A three dimensional analogue to the method of bisections for solving nonlinear equations. Math. Comp. 33, 722–738 (1979) 19. Stynes, M.: On faster convergence of the bisection method for certain triangles. Math. Comp. 33, 717–721 (1979) 20. Stynes, M.: On faster convergence of the bisection method for all triangles. Math. Comp. 35, 1195–1201 (1980)
A Discrete Maximum Principle for Nonlinear Elliptic Systems with Interface Conditions J´ anos Kar´ atson Department of Applied Analysis and Computational Mathematics ELTE University, H-1117 Budapest, Hungary
[email protected]
Abstract. A discrete maximum principle is proved for some nonlinear elliptic systems. A recent result is extended to problems with interface conditions. The discrete maximum principle holds on meshes with suitably small mesh size and under proper acuteness type conditions. In particular, this implies the discrete nonnegativity property under nonnegative data. As an example, reaction-diffusion systems in chemistry are considered, using meshes with the above properties; one can derive that the numerically computed concentrations are nonnegative as required.
1
Introduction
The maximum principle is an important qualitative property of second order elliptic equations, hence its discrete analogues, the so-called discrete maximum principles (DMPs) have drawn much attention. Various DMPs have been given for FD and FE discretizations, including geometric conditions on the computational meshes, see e.g. [1,2,3,14,17,19] for linear and [10,11,15] for nonlinear equations. For elliptic operators including lower order terms as well, the DMP has the form (1) max uh ≤ max{0, max uh }, ∂Ω
Ω
and it can be ensured for sufficiently fine mesh under geometric conditions expressing uniform acuteness or non-narrowness in the case of simplicial or rectangular FEM meshes, respectively. (Similar conditions also appear for prismatic FEM.) The above results concern a single elliptic equation. The DMP has first been extended to certain systems in [12]. The considered class has cooperative and weakly diagonally dominant coupling, which conditions also appear in the underlying continuous maximum principle [4,16]. In the case of mixed boundary conditions and nonpositive right-hand sides, we have max max uhk ≤ max max{0, max gkh },
k=1,...,s
Ω
k=1,...,s
ΓD
where ΓD is the Dirichlet boundary. The acuteness type conditions for simplicial FE meshes have also been suitably weakened in [12]. The goal of this paper is to extend the above DMP to systems with interface conditions. I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 580–587, 2010. c Springer-Verlag Berlin Heidelberg 2010
A Discrete Maximum Principle for Nonlinear Elliptic Systems
2
581
Some Algebraic Background
Let us consider a system of equations of order (k+m)×(k+m) with the structure ˜ d c AA ¯, ¯ (2) A¯ c := = ˜ =: d ˜ c d 0 I where I is the m × m identity matrix and 0 is the m × k zero matrix. Many known results on discrete maximum principles for FEM or FDM discretizations ¯ is of generalized are based on the ‘matrix maximum principle’, proved in [2]: if A ¯ nonnegative type and d is (elementwise) nonnegative, then max
i=1,...,k+m
ci ≤ max{0,
max
i=k+1,...,k+m
ci }.
(3)
¯ to be of generalized nonnegative type is the irreducibilA key assumption for A ity of A i.e., for any i = j there must exist a sequence of nonzero entries {ai,i1 , ai1 ,i2 , . . . , ais ,j } of A. This is a technical condition which is sometimes difficult to check in applications, see e.g. [6], therefore a weakened property has been introduced in [12]. This requires two definitions. Definition 1. Let A be an arbitrary k × k matrix. The irreducible blocks of A are the matrices A(l) (l = 1, . . . , q) defined as follows. Let us call the indices i, j ∈ {1, . . . , k} connectible if there exists a sequence of nonzero entries {ai,i1 , ai1 ,i2 , . . . , ais ,j } of A, where i, i1 , i2 , . . . , is , j ∈ {1, . . . , k} are distinct indices. Further, let us call the indices i, j mutually connectible if both i, j and j, i are connectible in the above sense. Let N1 , . . . , Nq be the equivalence classes, i.e. the maximal sets of mutually connectible indices. Letting (l) (l) Nl = {s1 , . . . , skl } for l = 1, . . . , q, we have k1 + · · ·+ kq = k. Then we define for (l)
all l = 1, . . . , q the kl × kl matrix A(l) by Ap q := as(l) ,s(l) p
q
(p, q = 1, . . . , kl ).
¯ with the structure (2) is said to be Definition 2. A (k + m) × (k + m) matrix A of generalized nonnegative type with irreducible blocks if the following properties hold: (i) aii > 0, i = 1, . . . , k, (ii) aij ≤ 0, i = 1, . . . , k, j = 1, . . . , k + m k+m aij ≥ 0, i = 1, . . . , k, (iii)
(i = j),
j=1
(iv’) For each irreducible block of A there exists an index i0 = i0 (l) ∈ Nl = (l) (l) {s1 , . . . , skl } for which k ai0 ,j > 0. (4) j=1
¯ be a (k + m) × (k + m) matrix with the structure (2), Theorem 1. [12]. Let A ¯ is of generalized nonnegative type with irreducible blocks in and assume that A the sense of Definition 2. ¯ c)i ≤ 0, i = If the vector ¯ c = (c1 , . . . , ck+m )T ∈ Rk+m is such that di ≡ (A¯ 1, . . . , k, then (3) holds.
582
3
J. Kar´ atson
The Problem
Let us consider the system ⎫ a.e. in Ω \ Γ, ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎬ = γk (x) a.e. on ΓN , ⎪
−div bk (x, ∇uk ) ∇uk + qk (x, u1 , . . . , us ) = fk (x) k bk (x, ∇uk ) ∂u ∂ν
[ uk ]Γ = 0
and
k bk (x, ∇u) ∂u ∂ν
uk = gk (x)
Γ
= k (x)
⎪ a.e. on ΓD , ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎭ on Γ,
(k = 1, . . . , s)
(5) where [ u]Γ and b(x, ∇u) ∂u denote the jump (i.e., the difference of the limits ∂ν Γ from the two sides of the interface Γ ) of the solution u and the normal flux b(x, ∇u) ∂u ∂ν , respectively. We impose the following
Assumptions 3.1 (i) Ω ⊂ Rd is a bounded domain with a piecewice C 1 boundary; ΓD , ΓN are disjoint open measurable subsets of ∂Ω such that ∂Ω = Γ D ∪ Γ N , and Γ is a piecewise C 1 surface lying in Ω. (ii) (Smoothness and growth.) For all k = 1, . . . , s we have bk ∈ (C 1 ∩L∞ )(Ω× Rd ) and qk ∈ C 1 (Ω × Rs ). Further, let 2 ≤ p < p∗ , where p∗ :=
2d d−2
if d ≥ 3 and p∗ := +∞ if d = 2,
and let there exist constants β1 , β2 ≥ 0 such that ∂qk p−2 (k, l = 1, . . . , s; x ∈ Ω, ξ ∈ Rs ). ∂ξl (x, ξ) ≤ β1 + β2 |ξ|
(6)
(7)
(iii) (Ellipticity.) There exists m > 0 such that bk ≥ m holds for all k = 1, . . . , s. Further, defining ak (x, η) := bk (x, η)η for all k, the Jacobian ∂ matrices ∂η ak (x, η) are uniformly spectrally bounded from both below and above. ∂qk (iv) (Cooperativity.) We have (x, ξ) ≤ 0 for all k, l = 1, . . . , s, k = l, ∂ξl x ∈ Ω, ξ ∈ Rs . (v) (Weak diagonal dominance for the Jacobians w.r.t. rows and columns.) s ∂qk l=1
∂ξl
(x, ξ) ≥ 0,
s ∂ql (x, ξ) ≥ 0 ∂ξk
(k = 1, . . . , s; x ∈ Ω, ξ ∈ Rs ).
l=1
(vi) For all k = 1, . . . , s we have fk ∈ L2 (Ω), γk ∈ L2 (ΓN ), k ∈ L2 (Γ ), and ∗ gk = gk|Γ with g ∗ ∈ H 1 (Ω). D Remark 1. Assumptions (iv)-(v) imply ξ ∈ Rs ).
∂qk ∂ξk (x, ξ)
≥ 0 (k = 1, . . . , s; x ∈ Ω,
A Discrete Maximum Principle for Nonlinear Elliptic Systems
583
For the weak formulation of such problems, we define the Sobolev space 1 HD (Ω) := {z ∈ H 1 (Ω) : z|ΓD = 0} . The weak formulation of problem (5) then reads as follows: find u ∈ H 1 (Ω)s such that A(u), v = ψ, v ∗
and u − g ∈ where A(u), v =
s
1 (∀v ∈ HD (Ω)s )
(8)
1 HD (Ω)s ,
bk (x, ∇u) ∇uk · ∇vk +
Ω k=1
(9) s
qk (x, u) vk
(10)
k=1
1 for given u = (u1 , . . . , us ) ∈ H 1 (Ω)s and v = (v1 , . . . , vs ) ∈ HD (Ω)s , further,
ψ, v =
s
fk vk +
Ω k=1
s
ΓN k=1
γk vk +
s
k vk
(11)
Γ k=1
1 (Ω)s , and finally, g ∗ := (g1∗ , . . . , gs∗ ). Existence for given v = (v1 , . . . , vs ) ∈ HD and uniqueness of the weak solution can be derived following the general framework of monotone elliptic operators, see e.g. [5]. For a single equation, this was achieved and a discrete maximum principle was considered in [11].
4
Finite Element Discretization
We define the finite element discretization of problem (5) in the following way. First, let n ¯0 ≤ n ¯ be positive integers and let us choose basis functions 1 (Ω), ϕ1 , . . . , ϕn¯ 0 ∈ HD
1 ϕn¯ 0 +1 , . . . , ϕn¯ ∈ H 1 (Ω) \ HD (Ω),
which correspond to homogeneous and inhomogeneous boundary conditions on ΓD , respectively. These basis functions are assumed to be continuous and to satisfy n ¯ ¯ ), ϕp ≡ 1, ϕp (Bq ) = δpq (12) ϕp ≥ 0 (p = 1, . . . , n p=1
for given node points Bp ∈ Ω (p = 1, . . . , n ¯ 0 ) and Bp ∈ ΓD (p = n ¯ 0 + 1, . . . , n ¯ ), where δpq is the Kronecker symbol. (These conditions hold e.g. for standard linear, bilinear or prismatic finite elements.) Finally, we assume that any two interior basis functions can be connected with a chain of interior basis functions with overlapping support. This assumption is natural by its geometric meaning, counterexamples only occur in quasi-degenerate cases as mentioned in [7]. We in fact need a basis in the corresponding product spaces, which we define by repeating the above functions in each of the s coordinates and setting zero in the other coordinates. That is, let n0 := s¯ n0 and n := s¯ n. First, for any 1 ≤ i ≤ n0 , if i = (k − 1)¯ n0 + p
for some 1 ≤ k ≤ s and 1 ≤ p ≤ n ¯ 0 , then
584
J. Kar´ atson
φi := (0, . . . , 0, ϕp , 0, . . . , 0)
where ϕp stands at the k-th entry,
that is, (φi )m = ϕp if m = k and (φi )m = 0 if m = k. From these, we let 1 (Ω)s . Vh0 := span{φ1 , . . . , φn0 } ⊂ HD
(13)
Similarly, for any n0 + 1 ≤ i ≤ n, if i = n0 + (k − 1)(¯ n−n ¯0) + p − n ¯0
for some 1 ≤ k ≤ s and n ¯0 + 1 ≤ p ≤ n ¯,
φi := (0, . . . , 0, ϕp , 0, . . . , 0)
where ϕp stands at the k-th entry,
T
then
that is, (φi )m = ϕp if m = k and (φi )m = 0 if m = k. From this and (13), we let Vh := span{φ1 , . . . , φn } ⊂ H 1 (Ω)s . Using the above FEM subspaces, the finite element discretization of problem (5) leads to the task of finding uh ∈ Vh such that A(uh ), v = ψ, v (∀v ∈ Vh0 ) and uh − g h ∈ Vh0 , i.e., uh = g h on ΓD (where g h = n
uh =
n
(14) (15)
gj φj ∈ Vh is the approximation of g ∗ on ΓD ). Then, setting
j=n0 +1
cj φj and v = φi (i = 1, . . . , n0 ) in (8), we obtain the n0 × n system
j=1
of algebraic equations n
aij (¯ c) cj = di
(i = 1, . . . , n0 ),
(16)
j=1
where for any ¯ c = (c1 , . . . , cn )T ∈ Rn (i = 1, . . . , n0 , j = 1, . . . , n), the entries ¯ c) are of A(¯ s s c) = bk (x, ∇uh ) (∇φj )k · (∇φi )k + Vkl (x, uh ) (φj )l (φi )k , (17) aij (¯ Ω k=1
k,l=1
where
1
Vkl (x, uh (x)) = 0
and
di :=
s Ω k=1
∂qk (x, tuh (x)) dt ∂ξl fˆk (φi )k +
s
ΓN k=1
(k, l = 1, . . . , s; x ∈ Ω), γk (φi )k +
s Γ k=1
k (φi )k
(18)
where fˆk (x) := fk (x) − qk (x, 0). We enlarge system (16) to a square one with the structure (2) by adding a proper zero and an identity block, and write it briefly as ¯ c)¯ A(¯ c = d. ¯ c) has the entry aij (¯ I.e. for i = 1, . . . , n0 and j = 1, . . . , n, matrix A(¯ c) from (17).
A Discrete Maximum Principle for Nonlinear Elliptic Systems
5
585
The Discrete Maximum Principle
We will consider a family of finite element discretizations, and thereby we need a regularity notion for the mesh. Definition 3. Let us consider a family of FEM subspaces V = {Vh }h→0 constructed as above. The corresponding mesh will be called quasi-regular w.r.t. problem (5) if c1 hγ ≤ meas(supp ϕp ) ≤ c2 hd , where the positive real number γ satisfies d ≤ γ < γd∗ (p) := 2d − from Assumption 3.1 (ii).
(d−2)p 2
with p
First we verify the desired generalized nonnegativity for the stiffness matrix. Theorem 2. Let problem (5) satisfy Assumptions 3.1. Let us consider a family of finite element subspaces Vh (h → 0), such that the corresponding family of meshes is regular from above, according to Definition 3, for some proper γ. Assume further that for any indices p = 1, ..., n ¯ 0 , t = 1, ..., n ¯ (p = t), if meas(supp ϕp ∩ supp ϕt ) > 0 then ∇ϕt · ∇ϕp ≤ 0 on Ω and ∇ϕt · ∇ϕp ≤ −K0 hγ−2 (19) Ω
with some constant K0 > 0 independent of p, t and h. ¯ c) defined in (17) is of generalized Then for sufficiently small h, the matrix A(¯ nonnegative type with irreducible blocks in the sense of Definition 2. Proof. In [12, Theorem 4.10], the statement was proved for problems like (5) without interface conditions. Note that the presence of the interface conditions c). only affects the r.h.s. (18) of the algebraic system, but not the entries aij (¯ Hence, we can apply the quoted theorem to (17). A classical way to satisfy the integral condition in (19) is a pointwise inequality on the support, which expresses uniform acuteness for a family of simplicial meshes. However, one can ensure it with less strong conditions as well, see discussed in [12]. Now we can derive the discrete maximum principle for problem (5): Corollary 1. Let the assumptions of Theorem 2 hold. If fk ≤ qk (x, 0),
γk ≤ 0
and
k ≤ 0
(k = 1, . . . , s),
then for sufficiently small h, the FEM solution uh = (uh1 , . . . , uhs ) of system (5) satisfies max max uhk ≤ max max{0, max gkh }. (20) k=1,...,s
Ω
k=1,...,s
ΓD
Proof. Using (18) and that (φi )k ≥ 0 by (12), the coordinate vector ¯ c = ¯ c)i ≤ 0, i = 1, ..., k. Then Theorem (c1 , ..., ck+m )T ∈ Rk+m is such that di ≡ (A¯ 1 is valid. From (3), using (12) and the same derivation as in [12, Theorem 4.5], we obtain (20).
586
6
J. Kar´ atson
Some Related Results
Analogously to Corollary 1, a change of signs yields the discrete minimum principle for problem (5): under the assumptions of Theorem 2, if fk ≥ qk (x, 0),
γk ≥ 0
and k ≥ 0
(k = 1, . . . , s),
then for sufficiently small h, the FEM solution uh = (uh1 , . . . , uhs ) of system (5) satisfies min min uhk ≥ min min{0, min gkh }. k=1,...,s Ω
ΓD
k=1,...,s
In particular, if in addition gk ≥ 0 for all k, then we obtain the discrete nonnegativity principle: (k = 1, . . . , s). uhk ≥ 0 As an important example, let us consider reaction-diffusion systems in chemistry. The steady states of such processes can sometimes be described by systems ⎫ −bk Δuk + Pk (x, u1 , . . . , us ) = fk (x) in Ω, ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ∂uk bk ∂ν = γk (x) on ΓN , ⎬ (k = 1, . . . , s). ⎪ uk = gk (x) on ΓD , ⎪ ⎪ ⎪ ⎪ ⎪ ∂uk ⎭ [ uk ]Γ = 0 and bk ∂ν Γ = k (x) on Γ (21) Here, for all k, the quantity uk describes the concentration of the kth species, and Pk is a polynomial which characterizes the rate of the reactions involving the k-th species. A common way to describe such reactions is the so-called mass action type kinetics [8], which implies that Pk (x, 0) ≡ 0 on Ω for all k. The function fk ≥ 0 describes a source independent of concentrations. System (21) was considered in [12] without the interface conditions. Now the latter express that a part of the process is localized on the interface. We consider system (21) as a special case of (5) under Assumptions 3.1. As pointed out in [12], such chemical models describe processes with cross-catalysis and strong autoinhibiton. Since the concentrations uk are nonnegative, a proper numerical model must produce such numerical solutions. Based on the abovementioned discrete nonnegativity principle, we obtain the required property: if the FEM discretization of system (21) satisfies the conditions of Theorem 2, and fk ≥ 0,
γk ≥ 0,
gk ≥ 0,
k ≥ 0
(k = 1, . . . , s),
then for sufficiently small h, the FEM solution u = (uh 1 , . . . , uh s )T of system (21) satisfies (k = 1, . . . , s). uhk ≥ 0 on Ω h
Remark 2. Both in the general problem (5) and the example (21), one may include additional terms sk (x, u1 , . . . , us ) on ΓN and/or Γ , which we omit here for technical simplicity. Then sk must satisfy similar properties as assumed for qk or Pk , respectively. Such localized reactions for a single equation (i.e. one chemical agent) have been considered e.g. in [9].
A Discrete Maximum Principle for Nonlinear Elliptic Systems
587
References 1. Brandts, J., Korotov, S., Kˇr´ıˇzek, M.: The discrete maximum principle for linear simplicial finite element approximations of a reaction-diffusion problem. Linear Algebra Appl. 429, 2344–2357 (2008) 2. Ciarlet, P.G.: Discrete maximum principle for finite-difference operators. Aequationes Math. 4, 338–352 (1970) 3. Ciarlet, P.G., Raviart, P.-A.: Maximum principle and uniform convergence for the finite element method. Comput. Methods Appl. Mech. Engrg. 2, 17–31 (1973) 4. de Figueiredo, D.G., Mitidieri, E.: Maximum principles for cooperative elliptic systems. C. R. Acad. Sci. Paris S´er. I Math. 310(2), 49–52 (1990) 5. Farag´ o, I., Kar´ atson, J.: Numerical solution of nonlinear elliptic problems via preconditioning operators. Theory and applications. In: Advances in Computation, vol. 11. NOVA Science Publishers, New York (2002) 6. Draganescu, A., Dupont, T.F., Scott, L.R.: Failure of the discrete maximum principle for an elliptic finite element problem. Math. Comp. 74(249), 1–23 (2005) 7. Hannukainen, A., Korotov, S., Vejchodsk´ y, T.: On Weakening Conditions for Discrete Maximum Principles for Linear Finite Element Schemes. In: Margenov, S., Vulkov, L.G., Wasniewski, J. (eds.) Numerical Analysis and Its Applications. LNCS, vol. 5434, pp. 297–304. Springer, Heidelberg (2009) 8. H´ ars, V., T´ oth, J.: On the inverse problem of reaction kinetics. In: Farkas, M. (ed.) Qualitative Theory of Differential Equations, Szeged, Hungary (1979); Coll. Math. Soc. J´ anos Bolyai, vol. 30, pp. 363–379. North-Holland–MSJB, Budapest (1981) 9. Kandilarov, J.D., Vulkov, L.G.: Analysis of immersed interface difference schemes for reaction-diffusion problems with singular own sources. Comput. Methods Appl. Math. 3(2), 253–273 (2003) 10. Kar´ atson, J., Korotov, S.: Discrete maximum principles for finite element solutions of nonlinear elliptic problems with mixed boundary conditions. Numer. Math. 99, 669–698 (2005) 11. Kar´ atson, J., Korotov, S.: Discrete maximum principles for FEM solutions of some nonlinear elliptic interface problems. Int. J. Numer. Anal. Modelling 6(1), 1–16 (2009) 12. Kar´ atson, J., Korotov, S.: A discrete maximum principle in Hilbert space with applications to nonlinear cooperative elliptic systems. SIAM J. Numer. Anal. 47(4), 2518–2549 (2009) 13. Korotov, S., Kˇr´ıˇzek, M.: Acute type refinements of tetrahedral partitions of polyhedral domains. SIAM J. Numer. Anal. 39, 724–733 (2001) 14. Korotov, S., Kˇr´ıˇzek, M., Neittaanm¨ aki, P.: Weakened acute type condition for tetrahedral triangulations and the discrete maximum principle. Math. Comp. 70, 107–119 (2001) 15. Kˇr´ıˇzek, M., Qun, L.: On diagonal dominance of stiffness matrices in 3D. East-West J. Numer. Math. 3, 59–69 (1995) 16. Mitidieri, E., Sweers, G.: Weakly coupled elliptic systems and positivity. Math. Nachr. 173, 259–286 (1995) ˇ ın, P.: Discrete maximum principle for higher-order finite ele17. Vejchodsk´ y, T., Sol´ ments in 1D. Math. Comp. 76(260), 1833–1846 (2007) 18. Varga, R.: Matrix iterative analysis. Prentice Hall, New Jersey (1962) 19. Xu, J., Zikatanov, L.: A monotone finite element scheme for convection-diffusion equations. Math. Comp. 68, 1429–1446 (1999)
Inverse Problem for Coefficient Identification in Euler-Bernoulli Equation by Linear Spline Approximation Tchavdar T. Marinov1 and Rossitza Marinova2 1
2
Southern University at New Orleans, Department of Natural Sciences, 6801 Press Drive, New Orleans, LA 70126
[email protected] Department of Mathematical and Computing Sciences, Concordia University College of Alberta, Edmonton, AB, Canada T5B 4E4, Adjunct professor at the Department of Computer Science, University of Saskatchewan, SK, Canada
[email protected]
Abstract. We display the performance of the technique called Method of Variational Imbedding for solving the inverse problem of coefficient identification in Euler-Bernoulli equation from over-posed data. The original inverse problem is replaced by a minimization problem. The EulerLagrange equations comprise an eight-order equation for the solution of the original equation and an explicit system of equations for the coefficients of the spline approximation of the unknown coefficient. Featuring examples are elaborated numerically. The numerical results confirm that the solution of the imbedded problem coincides with the exact solution of the original problem within the order of approximation error.
1
Introduction
Consider the simplest form of Euler-Bernoulli equation d2 d2 u σ(x) = f (x), 0 ≤ x ≤ 1. dx2 dx2
(1)
The function f (x) represents the transversely distributed load. The coefficient σ(x), called flexural rigidity, is the product of the modulus of elasticity E and the moment of inertia I of the cross-section of the beam about an axis through its centroid at right angles to the cross-section. If the coefficient σ(x) > 0 and the right-hand side function f (x) ≥ 0 are given, under proper initial and/or boundary conditions, the problem possesses a unique solution, usually referred as a direct solution. In practice, there exist lots of interesting problems, in which the coefficient σ(x) is not exactly known. In reality, Euler-Bernoulli equation models a tensioned “beam”. Under environmental loads, caused by environmental phenomena such as wind, waves, current, tides, earthquakes, temperature, ice, the structure of the I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 588–595, 2010. c Springer-Verlag Berlin Heidelberg 2010
Inverse Problem for Coefficient Identification in Euler-Bernoulli Equation
589
ingredients of the “beam” is changing. Usually it is expensive, even not possible, to measure the changes of the properties of the materials directly. On the other hand, the changes in the physical properties of the materials cause changes in the coefficient σ in equation (1) and, respectively, changes in the solution. Thus, a new, so called inverse, problem appears: to find simultaneously the solution u and the coefficient σ of the Euler-Bernoulli equations. A method for transforming an inverse problem into a correct direct problem, but for a higher-order equation, was proposed in [1] and called Method of Variational Imbedding (MVI). The idea of MVI is to replace the incorrect problem with a well-posed problem for minimization of a quadratic functional of the original equations, i.e. we “embed” the original incorrect problem in a higher-order boundary value problem which is well-posed (see [1,7,8]). For the latter, a difference scheme and a numerical algorithm for its implementation can easily be constructed. The advantage of MVI comparing to a regularization method (see, for example, [10,4]) is that there are no “boundary layers” at the two ends of the interval as it was observed for a similar problem in [5]. Recently, in [9], the MVI was applied to the problem for identifying the coefficient σ in (1) in the case when the coefficient is a piecewise constant function. In the present work we are considering the case when the coefficient is a piecewise linear function. Although this paper is focused on a fourth order ordinary differential equation, the proposed method can be generalized for identification of coefficient in partial differential equations. Similar to the procedure proposed here, the approach for the identification of a coefficient in parabolic partial differential equation is given in [2]. In [8] MVI was successfully applied to the problem for identification of a coefficient in elliptic partial differential equation. The paper is organized as follows. In Section 2 the inverse problem for identification of the unknown coefficient is formulated. The application of the MVI to the inverse problem is described in Section 3. The numerical scheme is given in Section 4. Illustration of the constructed numerical scheme is given in Section 5.
2
Inverse Problem Formulation
Consider the Euler-Bernoulli equation (1) where the function f (x) is given. We expect that the functions under consideration are as many time differentiable as necessary. If the coefficient σ is not given, in order to identify it one needs additional information. Suppose the solution satisfies the conditions u(0) = α0,0 ,
u(1) = α1,0 ,
(2)
u (0) = α0,1 ,
u (1) = α1,1 ,
(3)
u (0) = α0,2 , u(ξi ) = γi ,
u (1) = α1,2 , i = 1, 2, . . . , n − 1,
(4) (5)
where the points 0 < ξi < 1 are given. We suppose that the coefficient σ is a piecewise linear function σ(x) = σi (x) = ai + bi (x − ξi−1 ) for ξi−1 < x < ξi , i = 1, 2, . . . , n,
(6)
590
T.T. Marinov and R. Marinova
where the constants ai , bi , are unknown (ξ0 = 0, ξn = 1). The number of unknown constants ai , bi is 2n. On the other hand, the additional number of conditions in (2)-(5) for the fourth order ordinary differential equation is n + 1 = (n + 5) − 4. Therefore, if we add the condition for continuity of the function σ, i.e., σi (ξi ) = σi+1 (ξi ),
(7)
i = 1, 2, . . . , n − 1 the number of the conditions is exactly equal to the number of unknown constants. There may be no solution (u, σ), satisfying all of the conditions (2)–(5), for arbitrary αk,l , (for k = 0, 1 and l = 0, 1, 2), and γi , (for i = 1, 2, . . . , n − 1). For this reason, we suppose that the problem is posed correctly after Tikhonov, [10] i.e., it is known a-priori that the solution of problem exists. In other words, we assume that the data in the boundary conditions (2)–(5) have “physical meaning” and, therefore, the solution exists. The problem is how to convert this additional information to the missing information on the coefficients. The solution approach proposed here is a generalization of the implementation of MVI to a similar problem given in [6] for identification of a piecewise coefficient of two parts (see also [7] and [8]), and continuation of the idea proposed in [9].
3
Variational Imbedding
Following the idea of MVI, we replace the original problem with the problem of minimization of the functional 1
1 I(u, σ) =
A (u, σ)dx = 2
0
d2 dx2
2 d2 u σ(x) 2 − f (x) −→ min , dx
(8)
0
where u satisfies the conditions (2)–(5), σ is an unknown piecewise function, defined with equation (6). The functional I(u, σ) is a quadratic and homogeneous function of A(u, σ) and, hence, it attains its absolute minimum if and only if A(u, σ) ≡ 0. In this sense there is an one-to-one correspondence between the original equation (1) and the minimization problem (8). Since σ(x) is a piecewise linear function, we can rewrite the functional I as ξ 2 n i 2 d d2 u σi − f (x) −→ min . I(u, σ) = dx2 dx2 i=1
(9)
ξi−1
The necessary condition for minimization of the functional I is expressed by the Euler-Lagrange equations for the functions u(x) and σ(x). 3.1
Equation for u
The Euler-Lagrange equation with respect to the function u reads 2 d2 d2 d d2 d2 d2 u σ A= σ σ − f (x) = 0. dx2 dx2 dx2 dx2 dx2 dx2
(10)
Inverse Problem for Coefficient Identification in Euler-Bernoulli Equation
591
Therefore, in each interval ξi−1 < x < ξi , the function u satisfies the equation d4 d2 u d2 d2 d2 σi 4 σi 2 = σi 2 f (x), 2 2 dx dx dx dx dx
(11)
under the boundary conditions (2)–(5). Since each equation (11) is of eight order we need some additional boundary conditions. From the original problem we have d2 d2 u σi − = f (ξi ), dx2 dx2 ξi
d2 u d2 σ + = f (ξi ), i+1 dx2 dx2 ξi
(12)
where ξi− and ξi+ stand for the left-hand and right-hand derivatives and d4 d2 u d2 σ f (ξi ), − = i dx4 dx2 ξi dx2 2 2 d u d u σi 2 ξ− = σi+1 2 ξ+ , dx i dx i
d4 d2 d2 u σ f (ξi ), + = i+1 dx4 dx2 ξi dx2 2 2 d d u d d u σi 2 ξ− = σi+1 2 ξ+ , dx dx i dx dx i
(13) (14)
where i = 1, 2, . . . , n − 1. 3.2
Equation for σ
The problem is coupled by the equation for σ. Since σ is a piecewise function, for the functional I one arrives at the problem for minimization of the function q(a1 , . . . , an , b1 , . . . , bn ) =
n
(Ai11 a2i +2Ai12 ai bi +Ai22 b2i +Ai1 ai +Ai2 bi +Ai0 ), (15)
i=1
with respect to a1 , . . . , an , b1 , . . . , bn under the continuity conditions (7) which we rewrite in the form ai + bi (ξi − ξi−1 ) − ai+1 = 0.
(16)
Minimizing the function q under the constraints (16) we introduce Lagrange multipliers μi and introduce the function: Q=
n i 2
A11 ai + 2Ai12 ai bi + Ai22 b2i + Ai1 ai + Ai2 bi + Ai0
(17)
i=1
+
n−1
μi (ai + bi (ξi − ξi−1 ) − ai+1 ) .
i=1
We obtain the following five-diagonal system of linear equations for ai , bi , μi : ∂Q = 2A111 a1 + 2A112 b1 + A11 + μ1 = 0, ∂a1 ∂Q = 2A112 a1 + 2A122 b1 + A12 + μ1 ξ1 = 0, ∂b1 ∂Q = a1 + b1 ξ1 − a2 = 0, ∂μ1
(18) (19) (20)
592
T.T. Marinov and R. Marinova
and ∂Q = 2Ai11 ai + 2Ai12 bi + Ai1 + μi − μi−1 = 0, ∂ai ∂Q = 2Ai12 ai + 2Ai22 bi + Ai2 + μi (ξi − ξi−1 ) = 0, ∂bi ∂Q = ai + bi (ξi − ξi−1 ) − ai+1 = 0, ∂μi
(21) (22) (23)
for i = 2, . . . , n − 1, and ∂Q = 2An11 an + 2An12 bn + An1 − μn−1 = 0, ∂an ∂Q = 2An12 an + 2An22 bn + An2 = 0. ∂bn
4
(24) (25)
Difference Scheme
We solve the formulated eight-order boundary value problem using finite differences. It is convenient for the numerical treatment to rewrite the eight order equation (11) as a system of four second order equations. In each of the subintervals [ξi−1 , ξi ], i = 1, 2, . . . , n we solve the following system of four equations u = v, 4.1
(σv) = w,
w = z,
(σz) = (σf ) .
(26)
Grid Pattern and Approximations
We introduce a regular mesh with step hi (see Fig. 1) in each of the subintervals [ξi−1 , ξi ], i = 1, 2, . . . , n, allowing to approximate all operators with standard central differences with second order of approximation. i−1 For the grid spacing in the interval [ξi−1 , ξi ] we have hi ≡ ξin−ξ , where ni i −2 is the total number of grid points for the i-th interval. Then, the grid points are defined as follows: xij = ξi−1 + (j − 1.5)hi for j = 1, 2, . . . , ni . Let us introduce the notation uij = u(xij ) for i = 1, 2, . . . , n, and j = 1, . . . , ni . We employ symmetric central differences for approximating the differential operators. The differential operators in the boundary conditions are approximated with second order formulae using central differences and half sums.
•
ξi−1 | •
•
•
xi1
xi2
xi3
xi4
...
•
•
•
xini −3
xini −2
xini −1
Fig. 1. The mesh used in our numerical experiments
ξi |
• xini
Inverse Problem for Coefficient Identification in Euler-Bernoulli Equation
4.2
593
General Construction of the Algorithm
(I) With the obtained “experimentally observed” values of αk,l , (for k = 0, 1 and l = 0, 1, 2, 3), and γi , (for i = 1, 2, . . . , n − 1) the eight-order boundary value problem (10), (2)–(5), (12)–(14) is solved for the function u with an initial guess for the function σ. (II) The approximation of the function σ for the current iteration is calculated from the system (18)-(25). If the l2 -norm of the difference between the new and the old field for σ is less than ε0 then the calculations are terminated. Otherwise, the algorithm returns to (I) with the new calculated σ.
5
Numerical Experiments
The accuracy of the difference scheme is validated with tests involving various grid spacing h. We have run a number of calculations with different values of the mesh parameters and verified the practical convergence and the O(h2 ) approximation of the difference scheme. For all calculations presented here, ε0 = 10−12 and the initail values for the coefficients in σ are ai = 0.5 and bi = 1.5. Here we illustrate the developed difference scheme using two coefficient identification problems. 5.1
Linear Coefficient
Consider the case when σ(x) = 1 + x and f (x) = (3 + x) exp(x) for which under proper boundary conditions the exact solution is u(x) = exp(x).
(27)
For this test we let the number of intervals n in the definition (6) of σ equal to 1, i.e., n = 1. In other words, we know a-priori that the coefficient is a linear function. The goal of this test is to confirm second order of approximation of the proposed scheme. The values of the identified coefficient σ = a + bx with four different steps h are given in Table 1. The rates of convergence, calculated as a2h − aexact b2h − bexact , , rate = log (28) rate = log2 2 ah − aexact bh − bexact Table 1. Obtained values of the coefficients a and b, and the rate of convergence for four different values of the mesh spacing h exact 0.1 0.05 0.025 0.0125
a 1.0 0.996671245 0.999166953 0.999791684 0.999947911
|a − aexact | rate b |b − bexact | rate — — 1.0 — — 3.328754087E-03 — 0.998334860 1.665139917E-03 — 8.330468464E-04 1.99851 0.999583428 4.165712337E-04 1.99901 2.083158588E-04 1.99962 0.999895839 1.041605270E-04 1.99975 5.208818084E-05 1.99974 0.999973961 2.603891276E-05 2.00007
594
T.T. Marinov and R. Marinova
Table 2. l2 norm of the difference u − uexact and the rate of convergence for four different values of the mesh spacing h 0.1 0.05 0.025 0.0125
||u − uexact ||l2 rate 1.018490804573E-04 — 2.367271717866E-05 2.10514 5.686319062981E-06 2.05766 1.392183281145E-06 2.03015
Table 3. l2 norm of the differences u−uexact and σ −σexact and the rate of convergence for four different values of the mesh spacing h 0.01 0.005 0.0025 0.00125
||σ − σexact ||l2 rate ||u − uexact ||l2 rate 1.069187970901E-08 — 4.552581248664E-05 — 2.584956511555E-09 2.0483 1.138599436962E-05 1.99942 6.379806228978E-10 2.01856 9.614682942106E-07 3.56588 1.586000035587E-10 2.00812 1.624871713448E-07 2.56491
are also shown in Table 1. Similar results for the l2 norm of the difference between the exact and the numerical values of the function u are presented in Table 2. This test clearly confirms the second order of convergence of the numerical solution to the exact one. 5.2
Linear Coefficient as a Piecewise Linear Function
Consider again the solution (27) but now we do not assume a-priori that the coefficient is the same function in the whole interval. We identify the coefficient as a piecewise linear function, as defined in (6), for n = 10. In each subinterval, the expected values of the coefficient σ are ai = 1 and bi = 1. For this test we performed a number of calculations with different spacings h. The l2 norm of the difference between the exact and the numerical values of the functions u and σ, and the rate of convergence, calculated using the norm of the difference, for four different steps h, are given in Table 3. The fact that the numerical solution approximates the analytical one with O(h2 ) is clearly seen from the Table 3.
Acknowledgment This work was partially supported by MITACS and NSERC.
References 1. Christov, C.I.: A method for identification of homoclinic trajectories. In: Proc. 14th Spring Conf., Sunny Beach, Sofia, Bulgaria. Union of Bulg. Mathematicians, Sofia (1985)
Inverse Problem for Coefficient Identification in Euler-Bernoulli Equation
595
2. Christov, C.I., Marinov, T.T.: Identification of heat-conduction coefficient via method of variational imbedding. Mathematics and Computer Modeling 27(3), 109–116 (1998) 3. Hadamard, J.: Le Probleme de Cauchy et les Equations aux Derivatives Partielles Lineares Hyperboliques, Hermann, Paris (1932) 4. Latt`es, R., Lions, J.L.: M`ethode de quasi-reversibilite et applications, Dunod, Paris (1967) 5. Lesnic, D., Elliott, L., Ingham, D.B.: Analysis of coefficient identification problems associated to the inverse Euler-Bernoulli beam theory. IMA J. of Applied Math. 62, 101–116 (1999) 6. Marinov, T.T., Christov, C.I.: Identification the unknown coefficient in ordinary differential equations via method of variational imbedding. In: Deville, M., Owens, R. (eds.) 16th IMACS World Congress 2000 Proceedings, paper 134–2 (2000) ISBN 3-9522075-1-9 7. Marinov, T.T., Christov, C.I., Marinova, R.S.: Novel numerical approach to solitary-wave solutions identification of Boussinesq and Korteweg-de Vries equations. Int. J. of Bifurcation and Chaos 15(2), 557–565 (2005) 8. Marinov, T.T., Marinova, R.S., Christov, C.I.: Coefficient identification in elliptic partial differential equation. In: Lirkov, I., Margenov, S., Wa´sniewski, J. (eds.) LSSC 2005. LNCS, vol. 3743, pp. 372–379. Springer, Heidelberg (2006) 9. Marinov, T.T., Vatsala, A.: Inverse Problem for Coefficient Identification in EulerBernoulli Equation. Computers and Mathematics with Applications 56(2), 400–410 (2008) 10. Tikhonov, A.N., Arsenin, V.: Methods for Solving Incorrect Problems, Nauka, Moscow (1974)
A Fully Implicit Method for Fluid Flow Based on Vectorial Operator Splitting Rossitza S. Marinova1, , Raymond Spiteri2 , and Eddy Essien1 1
Department of Mathematical and Computing Sciences, Concordia University College of Alberta, 7128 Ada Boulevard, Edmonton, AB T5B 4E4, Canada
[email protected] http://www.math.concordia.ab.ca/marinova 2 Department of Computer Science, University of Saskatchewan 176 Thorvaldson Bldg, 110 Science Place, Saskatoon, SK S7N 5C9 Canada
Abstract. A fully implicit finite difference method for the incompressible Navier–Stokes equations is presented that conserves mass, momentum, the square of velocity components, and, in the inviscid limit, kinetic energy. The method uses a staggered arrangement of velocity and pressure on a structured Cartesian grid and retains its conservation properties for both uniform and non-uniform grids. The convergence properties of the method are confirmed via numerical experiments.
1
Introduction
Flows with high Reynolds numbers or complex geometries are of great interest to industry; hence there is significant demand for robust and stable algorithms and software even at the expense of increased computational cost. Fully implicit time-stepping methods are generally more robust and stable than explicit and semi-explicit methods. Therefore, as mentioned in [7], fully implicit methods should be further investigated and developed. The most popular time-stepping methods for the Navier–Stokes equations are the so-called projection or operator-splitting methods (e.g., fractional step or pressure correction methods) are not fully implicit; see [7] and [6]. Decoupling velocity and pressure reduces the system into simpler sub-problems, but the choice of boundary conditions for the pressure in these procedures is problematic. Moreover, the explicit element introduced by this decoupling requires small time steps. Although operator-splitting methods can work well, they must be used with care in terms of how well the overall solution algorithm behaves. They are usually not suitable for high Reynolds number flows and long time calculations because a small time step is required for accuracy and stability. Because a fully implicit approach leads to a system of nonlinear equations that may be singular, special spatial discretization or stabilization techniques are needed. On the other hand, strongly coupled solution strategies need to deal with
Adjunct professor at the Department of Computer Science, University of Saskatchewan, SK, Canada.
I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 596–603, 2010. c Springer-Verlag Berlin Heidelberg 2010
A Fully Implicit Method for Fluid Flow
597
large nonlinear algebraic systems that must be solved iteratively. Although fully implicit schemes may lead to unconditionally stable solvers, these approaches can be too expensive for practical computations. In this paper, we employ coordinate splitting, a generalization of the Douglas– Rachford scheme [4], leaving the system coupled at each fractional step to allow the satisfaction of the boundary conditions and avoid the introduction of artificial boundary conditions for the pressure. For the temporal discretization, we use the backward Euler scheme.
2 Problem Statement
2.1 Incompressible Navier–Stokes Equations
Consider the incompressible Navier–Stokes equations in dimensionless form

∂u/∂t + u · ∇u = ν∇²u − ∇p + g   (1)

coupled by the continuity equation, also called the incompressibility constraint,

div u = ∇ · u = 0   (2)
on Ω × (0, T), where Ω is a bounded compact domain with a piece-wise smooth boundary ∂Ω. Here u = u(x, t) = (u, v, w) is the fluid velocity at position x ∈ Ω and time t ∈ (0, T), T is given, p = p(x, t) is the fluid kinematic pressure, ν = 1/Re is the kinematic viscosity, Re is the Reynolds number, g is an external force, ∇ is the gradient operator, and ∇² is the Laplacian operator.

2.2 Initial and Boundary Conditions
We assume the initial condition

u|t=0 = u0(x),   (3)

where ∇ · u0 = 0. Remark: In order to avoid singularities, initial and boundary conditions should agree at t = 0 and x ∈ ∂Ω. Navier–Stokes equations can be classified as partial differential-algebraic equations (PDAEs), e.g., [1]. The challenges in the numerical solution are well known; they are connected with the fact that the Navier–Stokes equations are not an evolutionary system of Cauchy–Kovalevskaya type and that the pressure is an implicit function responsible for the satisfaction of the continuity equation. Furthermore, no boundary conditions on the pressure can be imposed on the rigid boundaries. This creates formidable obstacles for the construction of fully implicit schemes.
3 Singularity of Direct Fully Implicit Schemes
We first write the system (1), (2) in the following form,

∂u/∂t + (C + L)u + ∇p = g,   ∇ · u = 0,   (4)

where C is the nonlinear convection operator and L = −(1/Re)∇² is the linear viscosity operator. The use of a fully implicit approach such as the backward Euler method for time stepping leads to the solution of the following nonlinear stationary problem at each time step

[u(x, t + Δt) − u(x, t)]/Δt + (C + L) u(x, t + Δt) + ∇p(x, t + Δt) = g(x, t + Δt),   ∇ · u(x, t + Δt) = 0,   (5)

or

u(x, t + Δt)/Δt + (C + L) u(x, t + Δt) + ∇p(x, t + Δt) = u(x, t)/Δt + g(x, t + Δt),   ∇ · u(x, t + Δt) = 0.   (6)
The choice of discretization in space is crucial for the stability of the scheme. Because the equations to be solved are conservation laws, it is highly desirable that the numerical scheme also preserve these laws [9]. We choose approximations of the differential operators for which the numerical scheme preserves the integral properties of the respective differential problem. Constructing such schemes, especially in the case of operator splitting, is a challenging task. After discretizing the system of equations (6) in space, one arrives at a linear system, which can be written in matrix form as

[ N   Q ] [ u ]   [ f ]
[ Qᵀ  0 ] [ p ] = [ 0 ] .   (7)

In the case of a fully implicit approach, the matrix N is not symmetric positive definite in contrast to systems arising from an explicit treatment of the convective term. Stabilization techniques are usually based on perturbed versions of the equation of continuity. There exist many variations of pressure stabilization techniques [2]; see [7] and [6] for review. Although not originally derived as stabilization methods, the artificial incompressibility method [3] and the penalty method [10] can be also placed into this category. All these methods aim at stabilizing pressure oscillations and allowing standard grids and elements. They are usually used with finite element discretizations. Most popular time-stepping methods, including fully implicit schemes such as (5), typically do not solve the resulting system in a fully coupled manner. Velocity and pressure are usually decoupled, and this requires imposition of
pressure boundary conditions. Solving the system (7) in a fully coupled approach is preferred because it preserves the implicitness of the scheme, but such solvers require further development. Direct linear solvers, such as Gaussian elimination, are not efficient for 3D problems. Iterative strategies, such as BiCGStab and GMRES, combined with suitable preconditioners, can be effectively used for solving such systems of equations.
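As an illustration of how (7) can be treated in fully coupled form, the following Python sketch assembles a small synthetic saddle-point system with a nonsymmetric velocity block and solves it without decoupling velocity and pressure. The matrices are stand-ins, not the discretization of this paper, and for realistic 3D sizes the direct solve would be replaced by preconditioned GMRES or BiCGStab as noted above.

import numpy as np
import scipy.sparse as sp
import scipy.sparse.linalg as spla

rng = np.random.default_rng(0)
n, m = 200, 50                                   # velocity / pressure unknowns (toy sizes)
A = sp.diags([-1.0, 2.0, -1.0], [-1, 0, 1], shape=(n, n))
C = sp.random(n, n, density=0.02, random_state=1)
N = (A + 0.1 * (C - C.T)).tocsr()                # diffusion plus a skew, convection-like part
Q = sp.random(n, m, density=0.05, random_state=2).tocsr()   # discrete gradient

K = sp.bmat([[N, Q], [Q.T, None]], format="csc")             # coupled saddle-point matrix (7)
rhs = np.concatenate([rng.standard_normal(n), np.zeros(m)])

sol = spla.spsolve(K, rhs)                       # direct solve is fine at this toy size;
u, p = sol[:n], sol[n:]                          # large 3D problems would use GMRES/BiCGStab
print("residual:", np.linalg.norm(K @ sol - rhs))  # with a suitable preconditioner instead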
4 Balanced Pressure Equation
A formulation with a pressure equation is preferable compared to the continuity equation because we can construct a solver for the resulting non-linear stationary problem that is not only robust with respect to the physical and numerical parameters but also computationally efficient. For this reason, we use a special pressure equation, which is equivalent to the standard Poisson equation for pressure on a differential level. A similar form of pressure equation is presented in [5]. In this paper, the Laplacian of the pressure in a pressure equation is balanced using the divergence of the momentum equations:

∇ · u = [∇²p + ∇ · (Cu − g)].   (8)
On the discrete level, the right-hand side of equation (8) does not necessarily vanish. Here, the factor multiplying the bracket in (8) is a balancing parameter that can be varied. As noted in [5], this parameter is not related to the time step Δt. In fact, the formulation of the problem (1), (2) is equivalent to the formulation with the pressure equation (1), (8) (or (9)) if and only if the continuity equation is satisfied on the boundary. The equation we use is similar to (8). The balancing coefficient in our pressure equation is equal to the viscosity in the momentum equation ν = 1/Re, and we also use a balancing coefficient γ for the term ∇ · u, so that the modified pressure equation becomes

γ(∇ · u) = ν[∇²p + ∇ · (Cu − g)].   (9)

We find that using a balanced pressure equation such as (9) in combination with conservative difference approximations (see next section) improves the convergence of (7) considerably. Conservative discretizations are not considered in [5], nor is a splitting procedure used to improve the efficiency of the solver for linear systems of equations.
5 Difference Problem
5.1 Analytical Requirements
We want a difference scheme that satisfies the following analytical requirements: 1. Conservation properties Following [9], we call T (ϕ) conservative if it can be written in divergence form T [·] = ∇ · (S[·]), where S is an operator that can be used to write the
system of equations in an equivalent form on the continuous level provided the continuity equation is satisfied. In general, however, these forms are not equivalent on the discrete level. It is known that
(a) The mass is conserved a priori because the continuity equation (2) appears in divergence form.
(b) The momentum is conserved a priori if the continuity equation (2) is satisfied: the pressure and viscous terms are conservative a priori; the convective term is also conservative a priori if ∇ · u = 0.
(c) The square of a velocity component ϕ² is of importance in case of coordinate splitting. If the convective term is written in a skew-symmetric form C[u] = ½[∇ · (uu) + u · ∇u], then it conserves ϕ². For instance, in direction x

ϕ Cx[ϕ] = ½ ϕ (∂(ϕu)/∂x + u ∂ϕ/∂x) = ½ ∂(ϕ²u)/∂x.   (10)

The convective term in a skew-symmetric form is conservative a priori, whereas the pressure and viscous term are not conservative.
(d) The kinetic energy K := ½(u² + v² + w²): The skew-symmetric convective term is energy conservative, the pressure term is energy conservative if the continuity equation is satisfied, and the viscous term is not energy conservative.
In addition to conservation we also ensure that the scheme satisfies the following properties:
2. Compatibility for Poisson's equation for pressure.
3. Commutativity of the Laplacian and divergence operators.
4. Consistency between gradient and divergence operators.
5. A velocity field that is solenoidal at each time step; i.e., ∇ · u = 0.
Satisfaction of properties 1–5 leads to strong L2 stability of the scheme [8].

5.2 Coordinate Splitting and Spatial Discretization
We consider a flow in a region with rectilinear boundaries in Cartesian coordinates. The boundary conditions derived from the continuity equation ∇ · u = 0 in the three-dimensional case read

∂u/∂x|(x=c1,y,z) = ψ1(y, z),   ∂v/∂y|(x,y=c2,z) = ψ2(x, z),   ∂w/∂z|(x,y,z=c3) = ψ3(x, y),   (11)

where (x = c1, y, z), (x, y = c2, z), and (x, y, z = c3) are boundary points, ci are constants, and ψi, i = 1, 2, 3 are given functions. The stationary equation in general form can be written as

A[ϕ] = G,   A = L + P + N,   (12)

where P[u] = ∇p and P[p] = γ∇ · u, C = Cx + Cy + Cz, and G contains the source term and terms coming from the time discretization. Equation (12) is converted to an evolution system by adding derivatives with respect to an artificial time s

∂ϕ/∂s + A[ϕ] = G.   (13)
We split the operator A = A1 + · · · + Al in (13) as follows:

(ϕ^{n+1/l} − ϕ^n)/Δs = A1 ϕ^{n+1/l} + A1 ϕ^n + Σ_{i=2}^{l} Ai ϕ^n + G^n,
(ϕ^{n+i/l} − ϕ^{n+(i−1)/l})/Δs = Ai (ϕ^{n+i/l} − ϕ^n),   i = 2, . . . , l,
where l = 2, 3 in 2D and 3D, respectively. The splitting procedure is a generalization of the scheme of Douglas and Rachford [4]. Standard three-point differences are used for the second derivatives, which inherit the negative definiteness of the respective differential operators. The first derivatives for pressure are discretized with second-order differences. The grid is staggered in each direction. Also, we keep the coupling between the pressure and the respective velocity component through the boundary conditions at each fractional step. This allows us to construct a robust implicit splitting scheme with strong L2 -stability.
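As a point of reference for this construction, the following Python sketch implements the classical Douglas–Rachford (stabilizing correction) step of [4] for a two-operator split A = A1 + A2 in artificial time. The Poisson-type operators, grid, and source term are illustrative choices only, and the velocity–pressure coupling through the boundary conditions described above is not reproduced here.

import numpy as np
import scipy.sparse as sp
import scipy.sparse.linalg as spla

# Pseudo-time Douglas-Rachford iteration driving d(phi)/ds + (A1 + A2) phi = G
# to the steady state A[phi] = G.
nx = ny = 32
h = 1.0 / (nx + 1)
D = sp.diags([-1.0, 2.0, -1.0], [-1, 0, 1], shape=(nx, nx)) / h**2
I = sp.identity(nx)
A1 = sp.kron(I, D).tocsc()      # -d^2/dx^2 on the 2D grid (homogeneous Dirichlet BC)
A2 = sp.kron(D, I).tocsc()      # -d^2/dy^2
G = np.ones(nx * ny)            # source term of the stationary problem

ds = 0.05                       # artificial time step
M1 = spla.factorized(sp.identity(nx * ny, format="csc") + ds * A1)
M2 = spla.factorized(sp.identity(nx * ny, format="csc") + ds * A2)

phi = np.zeros(nx * ny)
for n in range(2000):
    # first fractional step: implicit in A1, explicit in A2
    phi_star = M1(phi - ds * (A2 @ phi) + ds * G)
    # stabilizing correction: implicit in A2
    phi_new = M2(phi_star + ds * (A2 @ phi))
    if np.linalg.norm(phi_new - phi, np.inf) < 1e-10:
        phi = phi_new
        break
    phi = phi_new

print("pseudo-time steps:", n + 1,
      "residual:", np.linalg.norm((A1 + A2) @ phi - G, np.inf))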
6 Numerical Results
We now verify the convergence properties of the method in space and time. To confirm the discretization in space, we perform calculations on a uniform mesh h = hx = hy = hz and the following analytic solution

u = (u, v, w),  u = √2 exp(−√2 x) cos(y + z),  v = exp(−√2 x) sin(y + z),  w = exp(−√2 x) sin(y + z),  p = −exp(−2√2 x),  Re = 100,

in the unit square. No boundary condition is imposed on p. We only present results for the velocity component u because they are representative of the three components. We use maximum and average errors as accuracy measures:

Error_f,max. = max_{i,j,k} |f^num._{i,j,k} − f^anal._{i,j,k}|,   Error_f,aver. = Σ_{i,j,k} |f^num._{i,j,k} − f^anal._{i,j,k}| / (Nx Ny Nz),   (14)

where f can be any of the functions u, v, w, or p. The discretization errors for u are presented in Table 1 and Figure 1. In the last column of Table 1, the convergence rate = log2(Error_f,max(2h)/Error_f,max(h)) is reported. It
Table 1. Discretization error: h = hx = hy = hz

 1/h        h²            Error_u,max.    Error_u,aver.    rate
  8     1.56250E-02     3.20504E-04     4.64364E-05      –
 16     3.90625E-03     1.14828E-04     2.06725E-05      1.480867308
 32     9.76563E-04     3.36453E-05     6.70046E-06      1.770999827
 64     2.44141E-04     8.95454E-06     1.87366E-06      1.909714065
Fig. 1. Discretization error as a function of the mesh spacing h = hx = hy = hz (• – h²; maximum error; average error)
Fig. 2. Divergence for Δt = 0.1, 0.2, 0.4, 0.8
can be seen that the rate of convergence approaches the second order; i.e., the discretization error is of order of O(h2 ) in space.
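The observed order reported in the last column of Table 1 can be recomputed directly from the tabulated maximum errors; a short Python check, using the values of Table 1:

import numpy as np

inv_h = np.array([8, 16, 32, 64])
err_max = np.array([3.20504e-4, 1.14828e-4, 3.36453e-5, 8.95454e-6])
rates = np.log2(err_max[:-1] / err_max[1:])        # rate = log2(E_max(2h) / E_max(h))
for n, r in zip(inv_h[1:], rates):
    print(f"1/h = {n:3d}   rate = {r:.3f}")        # approaches 2, i.e. O(h^2) in space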
In order to validate the 3D unsteady algorithm, we perform tests by using the following 3D analytical solution of the incompressible Navier–Stokes equations:

u = v = w = e^t,   p = (x + y + z) e^t.   (15)
The results, summarized in Figure 2, confirm the first-order convergence in time.
7 Conclusion
A fully implicit method for the numerical solution of the incompressible Navier– Stokes equations is presented. The most important characteristic of the method is its stability due to the implicit treatment of the boundary conditions and conservative spatial discretization. Future work will focus on implementation of higher-order time-stepping and comparison with existing approaches.
Acknowledgment This work was partially supported by MITACS and NSERC.
References 1. Ascher, U.M., Petzold, L.R.: Computer Methods for Ordinary Differential Equations and Differential-Algebraic Equations. SIAM, Philadelphia (1998) 2. Brezzi, F., Fortin, M.: Mixed and Hybrid Finite Element Methods. Springer, Berlin (1991) 3. Chorin, A.J.: Numerical solution of the Navier-Stokes equations. Mathematics of Computation 22(104), 745–762 (1968) 4. Douglas, J., Rachford, H.H.: On the numerical solution of heat conduction problems in two and three space variables. Trans. Amer. Math. Soc. 82, 421–439 (1956) 5. Hafez, M., Soliman, M.: Numerical solutions of the incompressible Navier-Stokes equations in primitive variables, pp. 183–201 (1993) 6. Kwak, D., Kiris, C., Kim, C.S.: Computational challenges of viscous incompressible flows. Computers & Fluids 34, 283–299 (2005) 7. Langtangen, H.P., Mardal, K.-A., Winther, R.: Numerical methods for incompressible viscous flow. Advances in Water Resources 25, 1125–1146 (2002) 8. Marinova, R.S., Takahashi, T., Aiso, H., Christov, C.I., Marinov, T.T.: Conservation properties of vectorial operator splitting. Journal of Computational and Applied Mathematics 152(1-2), 289–303 (2003) 9. Morinishi, Y., Lund, T.S., Vasiliev, O.V., Moin, P.: Fully conservative higher order finite difference schemes for incompressible flow. Journal of Computational Physics 143, 90–124 (1998) 10. Reddy, J.N.: On penalty function methods in the finite element analysis of flow problems. International Journal for Numerical Methods in Fluids 18, 853–870 (1982)
Discrete Maximum Principle for Finite Element Parabolic Operators Miklos E. Mincsovics Department of Applied Analysis and Computational Mathematics, ELTE, P´ azm´ any P´eter s´et´ any I/C, Budapest H-1117, Hungary
[email protected]
Abstract. When we construct continuous and/or discrete mathematical models in order to describe a real-life problem, these models should possess various qualitative properties, which typically arise from some basic principles of the modelled phenomenon. In this paper we investigate this question for the numerical solution of initial-boundary value problems for parabolic equations with nonzero convection and reaction terms with function coefficients in higher dimensions. The Dirichlet boundary condition will be imposed, and we will solve the problem by using linear finite elements and the θ-method. The principally important qualitative properties for this problem are the non-negativity preservation and different maximum principles. We give the conditions for the geometry of the mesh and for the choice of the discretization parameters, i.e., for θ and the time-step sizes, under which these discrete qualitative properties hold. Finally, we give numerical examples to investigate how sharp our conditions are.
1 Introduction
Let Ω ⊂ IR^d denote a bounded simply connected domain with a Lipschitz boundary ∂Ω. We use the notations QT = Ω × (0, T), Q̄T = Ω̄ × [0, T] and ΓT = (∂Ω × [0, T]) ∪ (Ω × {0}) for the parabolic boundary. In this paper we consider the parabolic operator which is defined for the functions v(x, t) ∈ C^{2,1}(QT) ∩ C(Q̄T) and which can be described as

La,b,c v = ∂v/∂t − div(a grad v) + ⟨b, grad v⟩ + c v,   (1)
where a, c : Ω → IR, b : Ω → IR^d and a ∈ C¹(Ω), b, c ∈ C(Ω). The symbol ⟨·, ·⟩ is the usual scalar product in IR^d. In the sequel we assume that 0 < am ≤ a ≤ aM, ‖b‖ ≤ bM and 0 ≤ c ≤ cM holds with the constants am, aM, bM, cM. ‖·‖ denotes the norm of IR^d induced by the scalar product ⟨·, ·⟩. Then, the operator La,b,c satisfies the so-called weak boundary maximum principle (WBMP), which means the following: for any v ∈ dom La,b,c the relation

max{0, max_{ΓT} v} = max_{Q̄T} v,   for the functions La,b,c v ≤ 0,   (2)
holds, i.e., the continuous function v on the bounded and closed Q̄T takes its non-negative maximum on the parabolic boundary ΓT. For the operator La,b,0 a more strict relation holds, the strong boundary maximum principle (SBMP):

max_{ΓT} v = max_{Q̄T} v,   for the functions La,b,0 v ≤ 0.   (3)
The operator La,b,c also has the so-called non-negativity preservation (NP) property, which says that the conditions v|ΓT ≥ 0 and La,b,c v ≥ 0 imply v ≥ 0 for any function v ∈ dom La,b,c. These results can be found in [4]. The weak/strong boundary maximum principle and the non-negativity preservation property are physically based, and their violation contradicts the nature of the modelled process. Therefore, their preservation in the discrete models is very important, and it is quite a natural requirement for really reliable and meaningful numerical modelling of various real-life phenomena, like the above formulated parabolic process. This problem was already investigated for constant coefficients and absent convection terms, see [3,6,7,8]. In this work we analyze the discrete boundary maximum principles and the NP for the discrete parabolic operator corresponding to (1), where the discretization is done by the linear finite element method with respect to the space variable and we apply the one-step θ-method for the time integration. We give the conditions under which these qualitative properties hold.
2 Mesh Operators, the Discrete Boundary Maximum Principles and the Non-negativity Preservation Property
The sets P = {x1, x2, . . . , xN} and P∂ = {xN+1, xN+2, . . . , xN+N∂} consist of different vertices in Ω and on ∂Ω, respectively. We set N̄ = N + N∂ and P̄ = P ∪ P∂. Let T be as before and M any given integer. We define the time discretization step-size Δt as Δt = T /M. We introduce the notations RM = {t = nΔt, n = 1, 2, . . . , M − 1}, R̄M = {t = nΔt, n = 0, 1, . . . , M} and the sets QM = P × RM, Q̄M = P̄ × R̄M, GM = (P∂ × R̄M) ∪ (P × {0}). (The set GM is the discrete parabolic boundary.) The values ν(xi, nΔt) of the function ν defined in Q̄M will be denoted by ν_i^n. Similar notation is applied to the function Lν. We introduce the vectors ν^n = [ν_1^n, . . . , ν_{N̄}^n] ∈ IR^{N̄}, e = [1, . . . , 1] ∈ IR^{N̄}.

Definition 1. Linear mappings that map from the space of real-valued functions defined on Q̄M to the space of real-valued functions defined on QM are called discrete (linear) mesh operators.

The discrete version of the boundary maximum principles and the non-negativity preservation property can be formulated as follows.
Definition 2. We say that the discrete mesh operator L satisfies the discrete weak/strong boundary maximum principle (DWBMP/DSBMP) if for any function ν ∈ dom L the relation

max{0, max_{GM} ν} = max_{Q̄M} ν   /   max_{GM} ν = max_{Q̄M} ν   (4)

holds for the mesh-functions Lν ≤ 0.

Definition 3. The discrete mesh operator L is called non-negativity preserving (DNP) if for any ν ∈ dom L such that min_{GM} ν ≥ 0 and Lν ≥ 0, the relation ν ≥ 0 holds.

In the sequel we analyze the discretization of the operator La,b,c, defined in (1). We denote the FEM basic functions by φi(x) (i = 1, . . . , N̄), which have the properties for x ∈ Ω̄

Σ_{i=1}^{N̄} φi(x) = 1,   φi(x) ≥ 0.   (5)
Then, introducing the bilinear form on H¹(Ω) as B[u, v] := ∫_Ω (a ⟨grad u, grad v⟩ + ⟨b, grad u⟩ v + c uv) dx, we define the elements Kij and Mij of the stiffness matrix and mass matrix K, M ∈ IR^{N×N̄} as Kij = B[φj, φi], Mij = ∫_Ω φj φi dx. Then L has the form:

(Lν)_i^n = (X1 ν^n − X2 ν^{n−1})_i,   i = 1, . . . , N, n = 1, . . . , M,   (6)

where X1 = (1/Δt) M + θK, X2 = (1/Δt) M − (1 − θ)K with θ ∈ [0, 1].
We denote by La,b,c the two-level discrete mesh operator obtained by finite element space discretization and θ-method time discretization of the operator La,b,c. For such operators the following theorems describe the relation between DWBMP/DSBMP and DNP; the proofs go similarly to those in [7].

Theorem 1. If the discrete mesh operator of type (6) satisfies the DWBMP/DSBMP, then the DNP property is also valid. On the other hand, under the DNP property and the condition K e ≥ 0 / K e = 0 the DWBMP/DSBMP holds.

Theorem 2. Let us assume (5). If c ≥ 0, then K e ≥ 0. In addition, if c ≡ 0, then K e = 0 holds.

Theorems 1 and 2 show that we only need to deal with the DNP of La,b,c, which is defined by the rectangular matrices X1 and X2, respectively. These matrices can be partitioned, according to the inner and boundary points, into a convenient form, namely X1 = [X10 | X1∂], X2 = [X20 | X2∂]. Here X10 and X20 are square matrices from IR^{N×N}, and X1∂, X2∂ ∈ IR^{N×N∂}. Introducing the notation λ_0^n = [(La,b,c ν)_1^n, . . . , (La,b,c ν)_N^n] with an arbitrarily chosen mesh
function ν for the operator La,b,c, and supposing the regularity of the matrix X10, we arrive at the iteration form

ν_0^n = −(X10)⁻¹ X1∂ ν_∂^n + (X10)⁻¹ X2 ν^{n−1} + (X10)⁻¹ λ_0^n,   (n = 1, . . . , M).
Thus, it is easy to see that the next theorem is valid.

Theorem 3. Let us suppose that the matrix X10 is regular. Then La,b,c possesses the discrete non-negativity preservation property if and only if the following relations hold: (P1) (X10)⁻¹ ≥ 0, (P2) −(X10)⁻¹ X1∂ ≥ 0, (P3) (X10)⁻¹ X2 ≥ 0.

The following statement gives a sufficient condition for (P1)–(P3) in terms of the matrices M and K, see [7].

Theorem 4. Under the assumptions
(P1') Kij ≤ 0,   i ≠ j, i = 1, ..., N, j = 1, ..., N̄,
(P2') Δt(X1)ij = Mij + Δt θ Kij ≤ 0,   i ≠ j, i = 1, ..., N, j = 1, ..., N̄,   (7)
(P3') Δt(X2)ii = Mii − Δt(1 − θ)Kii ≥ 0,   i = 1, ..., N,
the non-negativity assumptions (P1)–(P3) in Theorem 3 are satisfied.
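A small Python sketch of how (P1')–(P3') translate into an admissible interval for Δt, given assembled mass and stiffness matrices. For a runnable example the matrices are assembled for a 1D problem with coefficients a = 1, c = 10; this is only an illustration, since the paper works with simplicial meshes in higher dimensions, but the algebraic check uses nothing except M, K, θ and Δt.

import numpy as np

def assemble_1d(n_el, a=1.0, c=10.0, length=1.0):
    # 1D linear FEM matrices for -a u'' + c u on a uniform mesh
    h = length / n_el
    K = np.zeros((n_el + 1, n_el + 1)); M = np.zeros_like(K)
    Ke = a / h * np.array([[1., -1.], [-1., 1.]]) + c * h / 6 * np.array([[2., 1.], [1., 2.]])
    Me = h / 6 * np.array([[2., 1.], [1., 2.]])
    for e in range(n_el):
        K[e:e + 2, e:e + 2] += Ke; M[e:e + 2, e:e + 2] += Me
    return M, K

def dt_interval(M, K, theta):
    nbr = (M > 0) & ~np.eye(len(M), dtype=bool)     # neighbouring vertex pairs
    if not np.all(K[nbr] < 0):
        raise ValueError("(P1') fails: an off-diagonal K_ij is non-negative")
    lower = np.max(M[nbr] / (-theta * K[nbr]))                       # from (P2')
    upper = np.inf if theta == 1.0 else np.min(np.diag(M) / ((1 - theta) * np.diag(K)))  # from (P3')
    return lower, upper

M, K = assemble_1d(20)
for theta in (0.5, 1.0):
    lo, hi = dt_interval(M, K, theta)
    print(f"theta = {theta:.1f}: (P1')-(P3') hold for dt in [{lo:.4e}, {hi:.4e}]"
          if lo <= hi else f"theta = {theta:.1f}: empty dt interval")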
3 Conditions for DWBMP/DSBMP of La,b,c
In this section, based on Theorems 1, 2, and 4, we give conditions under which the assumptions (P1')-(P3') are satisfied, in order to guarantee the DWBMP/DSBMP for the operator La,b,c /La,b,0. We assume that Ω can be covered by a regular simplicial mesh Th, and that this mesh is of nonobtuse type, i.e., all the angles made by any faces of each simplex T ∈ Th are not greater than π/2. Further we assume that the FEM basic functions φi(x), (i = 1, 2, . . . , N̄) are the so-called hat functions, which are continuous and linear over each simplex T ∈ Th and φi(xj) = δij. We make the following remarks about Theorem 4, in light of the restrictions imposed on the mesh and the basic functions:
– Since Mij = Kij = 0, (i ≠ j) for the index pairs which determine non-neighbouring vertices, we only need to investigate the remaining pairs (we will not repeat this in the following).
– (P1') is one additional restriction for the mesh; (P2') and (P3') give a lower and an upper bound for the time-step Δt. Naturally, the lower bound must be smaller than the upper bound; this can be attained by a corresponding choice of θ.
– In the case θ = 0 the condition (P2') cannot be fulfilled. Thus, we fix that θ ∈ (0, 1]. (However, if we use the lumped mass technique, then θ = 0 is possible, too.) In the case θ = 1, (P3') is automatically fulfilled.
– (P2') implies (P1'). However, we need to require a strict inequality in (P1') for the index pairs which determine neighbouring vertices, to make (P2') possible. Let us denote this modified condition by (P1''). Since we want to get a usable condition for the mesh, we investigate (P1'') instead of (P1') in the following.
– Finally, we remind the reader that conditions (P1')-(P3') are only sufficient but not necessary to guarantee maximum principles.

3.1 Local Conditions for DWBMP/DSBMP of La,b,c
For this case we define/estimate the elements of the local mass and stiffness matrices similarly as in [2,7,9]. The contributions to the mass matrix M over the simplex T ∈ Th are

Mij|T = (measd T)/((d + 1)(d + 2)), (i ≠ j),   Mii|T = 2(measd T)/((d + 1)(d + 2)).   (8)

We estimate the contribution to the stiffness matrix K over the simplex T in the following way. If the simplex T is spanned by the d + 1 vertices xi, and we denote by Si the (d − 1)-dimensional face opposite to the vertex xi, then cos γij is the cosine of the interior angle between faces Si and Sj. Note that (measd T) d = (measd−1 Si) mi, where mi is the (Euclidean) distance between Si and xi. Let us introduce the notations: am(T) = minT a, aM(T) = maxT a, bM(T) = maxT ‖b‖, cM(T) = maxT c. Then

∫T a ⟨grad φj, grad φi⟩ dx = −∫T a ‖grad φj‖ ‖grad φi‖ cos γij dx = −(cos γij/(mi mj)) ∫T a dx ≤ −(am(T)(measd T)/(mi mj)) cos γij (≤ 0) in case i ≠ j, otherwise

∫T a ⟨grad φi, grad φi⟩ dx = ∫T a ‖grad φi‖² dx ≤ aM(T)(measd T)/mi²,   and

∫T ⟨b, grad φj⟩ φi dx ≤ ∫T |⟨b, grad φj⟩| φi dx ≤ ∫T ‖b‖ ‖grad φj‖ φi dx ≤ (bM(T)/mj) ∫T φi dx = bM(T)(measd T)/(mj(d + 1))

hold. Thus we have an estimation for the non-diagonal elements

Kij|T ≤ (measd T) ( −am(T) cos γij/(mi mj) + bM(T)/(mj(d + 1)) + cM(T)/((d + 1)(d + 2)) )   (9)

and for the diagonal elements

Kii|T ≤ (measd T) ( aM(T)/mi² + bM(T)/(mi(d + 1)) + 2cM(T)/((d + 1)(d + 2)) ).   (10)

If we require (P1')-(P3') on every simplex T ∈ Th, then we get a sufficient condition to fulfill (P1')-(P3'). Thus, one can easily check, on the basis of (8)–(10), that the next theorem is valid.
Theorem 5. Let us assume that for the simplicial mesh Th the geometrical condition

cos γij > (bM(T)/am(T)) mi/(d + 1) + (cM(T)/am(T)) mi mj/((d + 1)(d + 2))   (11)

is satisfied. Then, for Δt chosen in accordance with the lower bound

Δt ≥ (1/θ) [ am(T)(d + 1)(d + 2) cos γij/(mi mj) − bM(T)(d + 2)/mj − cM(T) ]⁻¹   (12)

and the upper bound

Δt ≤ (1/(1 − θ)) [ aM(T)(d + 1)(d + 2)/(2mi²) + bM(T)(d + 2)/(2mi) + cM(T) ]⁻¹,   (13)

respectively, the linear finite element discrete mesh operator La,b,c /La,b,0 satisfies the discrete weak/strong boundary maximum principle.
3.2 Global Conditions for DWBMP/DSBMP of La,b,c
Theorem 5 is of little use in practice, since conditions (11)-(13) should be checked for each T ∈ Th; moreover, it does not contain any useful information about the corresponding choice of θ. In the following we deal with getting rid of these problems. To this aim, let us introduce the notations m = min_{Th} mi, M = max_{Th} mi, G = min_{Th} cos γij,

♠ = aM(d + 1)(d + 2)/(2m²) + bM(d + 2)/(2m) + cM   and   ♥ = am G(d + 1)(d + 2)/M² − bM(d + 2)/m − cM.

Then, from Theorem 5 it follows:
Corollary 1. Let us assume that for the simplicial mesh Th the geometrical condition

♥ > 0   (14)

holds. Moreover, assume that for the parameter θ the inequality

θ ≥ ♠/(♠ + ♥)   (15)

holds, too. Then under the condition

1/(θ♥) ≤ Δt ≤ 1/((1 − θ)♠),   (16)

the linear finite element discrete mesh operator La,b,c with c ≥ 0 satisfies the DWBMP, and with c ≡ 0 it satisfies the DSBMP, similarly as in the continuous model.
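The quantities of Corollary 1 are cheap to evaluate once m, M, G and the coefficient bounds are known; the following Python sketch does so with the coefficient bounds of the numerical examples in Section 4 (a ≡ 1, b ≡ (6, 0), c ≡ 10, d = 2) and purely illustrative mesh quantities, which are not the meshes of Table 1.

import math

def corollary1(d, a_m, a_M, b_M, c_M, m, M_, G):
    spade = a_M * (d + 1) * (d + 2) / (2 * m**2) + b_M * (d + 2) / (2 * m) + c_M
    heart = a_m * G * (d + 1) * (d + 2) / M_**2 - b_M * (d + 2) / m - c_M
    if heart <= 0:                      # geometrical condition (14) violated
        return spade, heart, None, None
    theta_min = spade / (spade + heart)                              # (15)
    dt = lambda theta: (1 / (theta * heart),                         # (16), lower
                        math.inf if theta == 1 else 1 / ((1 - theta) * spade))  # upper
    return spade, heart, theta_min, dt

# illustrative mesh data: equilateral triangles, G = 1/d = 1/2, m = M
spade, heart, theta_min, dt = corollary1(d=2, a_m=1, a_M=1, b_M=6, c_M=10,
                                         m=0.02, M_=0.02, G=0.5)
print(f"spade = {spade:.1f}, heart = {heart:.1f}")
if theta_min is not None:
    print(f"theta >= {theta_min:.4f};  for theta = 1: dt >= {dt(1.0)[0]:.4e}")
else:
    print("geometrical condition (14) violated")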
Fig. 1. Mesh and refined mesh on two different domains Ω
We remark that the geometrical condition (14) can be substituted by the less restrictive condition G > bM M²/(am m(d + 1)) + cM M²/(am(d + 1)(d + 2)). However, with this condition we cannot guarantee that the right side of (15) is not greater than one. The condition (14) is an upper bound for the angles, and it depends on how fine the mesh is, i.e., the ratio M²/m cannot be too large. Note that G ≤ 1/d, and here the equality holds only in the case where Th is a uniformly regular simplicial mesh, i.e., one consisting of congruent regular simplices, see [7]. Naturally, this can be attained if Ω is special. This case allows us the widest choice of the parameters θ, Δt. However, even in this case ♠ > ♥ for d > 2, which means that the Crank–Nicolson method is excluded. If Th and θ are such that the conditions (14) and (15) hold, then the lower and upper bounds for Δt determine a nonempty interval; this is condition (16). Note that our bounds contain as a special case the bounds in [7] — in which the operator La,0,c with constant coefficients was investigated — if we set the parameters as aM = am = a, bM = 0, cM = c.
4 Numerical Examples
As one can see, the conditions collected in the last section are sufficient, but not necessary to guarantee the DWBMP/DSBMP. Consequently, we need to investigate how sharp our conditions are. This section is devoted to illustrating this question with several numerical examples. We fix the dimension d = 2 and the parameters a ≡ 1, b ≡ (6, 0), c ≡ 10. We investigate two operators La,b,c with homogeneous Dirichlet boundary conditions, which differ only in their domains, see Figure 1. In the first case the domain is a rhombus, determined by the vertices (0, 0), (1, 0), (3/2, √3/2), (1/2, √3/2), which allows us to use a uniformly regular simplicial mesh. In the second case the domain is a unit square; here we used a mesh which contains right-angled triangles, which is problematical from the point of view of Corollary 1. The question is which bounds we obtain from Corollary 1 – see Table 1 – and how these
Table 1. Bounds of DWBMP obtained from Corollary 1

                         rhombus (l)    rhombus (r)   square (l)     square (r)
geometrical condition    not fulfilled  fulfilled     not fulfilled  not fulfilled
lower bound for θ        –              0.9644        –              –
θ = 1/2, bounds for Δt   –              –             –              –
θ = 1, bound for Δt      –              0.1399        –              –

Table 2. The real bounds of DWBMP

                         rhombus (l)    rhombus (r)   square (l)     square (r)
K0⁻¹ ≥ 0                 true           true          false          true
lower bound for θ        0              0.8525        –              0.9809
θ = 1/2, bounds for Δt   0 and 0.0476   –             –              –
θ = 1, bound for Δt      0              0.0415        –              0.0699
compare with the real bounds of the DWBMP – see Table 2 – insisting on the stability of course. Stability means that ρ(X10⁻¹X20) < 1, where ρ(·) denotes the spectral radius of the matrix. The next theorem – see [1] – shows us that we can lose the stability already with an unsuitable choice of the mesh, namely, it is not repairable with a clever choice of θ and Δt. This explains the column of square (l) of Table 2: for such a mesh, for which K0⁻¹ ≥ 0 does not hold, the DWBMP rules out the stability. However, Theorem 6 has an advantage, namely, for such a mesh for which K0⁻¹ ≥ 0 holds, DWBMP implies stability.

Theorem 6. Assume that K0 = X10 − X20, and X10 is nonsingular with X10⁻¹ ≥ 0 and (X10)⁻¹X20 ≥ 0. Then the following statements are equivalent. (i) K0⁻¹ ≥ 0. (ii) ρ(X10⁻¹X20) < 1.

The conclusion is that our conditions are useful; however, they delimit our mesh generation, and there are a lot of domains (e.g., in 3D the cube) for which no acute angled triangulation is known.
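The stability indicator ρ(X10⁻¹X20) of Theorem 6 is straightforward to evaluate numerically; the Python sketch below does so for the θ-method matrices built from a small 1D interior mass/stiffness pair, which is only an easily reproducible stand-in for the meshes of Table 2.

import numpy as np

n, h = 19, 1.0 / 20
K0 = (np.diag(2.0 * np.ones(n)) + np.diag(-np.ones(n - 1), 1)
      + np.diag(-np.ones(n - 1), -1)) / h                 # interior stiffness for -u''
M0 = (np.diag(4.0 * np.ones(n)) + np.diag(np.ones(n - 1), 1)
      + np.diag(np.ones(n - 1), -1)) * h / 6              # interior mass matrix

theta, dt = 0.5, 1e-3
X10 = M0 / dt + theta * K0
X20 = M0 / dt - (1 - theta) * K0
rho = np.max(np.abs(np.linalg.eigvals(np.linalg.solve(X10, X20))))
print(f"rho(X10^-1 X20) = {rho:.4f}  ->  {'stable' if rho < 1 else 'unstable'}")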
References 1. Berman, A., Plemmons, A.R.J.: Nonnegative matrices in the mathematical sciences. Academic Press, New York (1997) 2. Brandts, J., Korotov, S., Krizek, M.: Simplicial finite elements in higher dimensions. Applications of Mathematics 52, 251–265 (2006) 3. Farag´ o, I., Horv´ ath, R.: Discrete maximum principle and adequate discretizations of linear parabolic problems. SIAM Sci. Comput. 28, 2313–2336 (2006) 4. Farag´ o, I., Horv´ ath, R.: Qualitative properties of monotone linear operators. Electronic Journal of Qualitative Theory of Differential Equations 8, 1–15 (2008)
5. Farag´ o, I., Horv´ ath, R.: Continuous and discrete parabolic operators and their qualitative properties. IMA Journal of Numerical Analysis 29, 606–631 (2009) 6. Farag´ o, I., Horv´ ath, R., Korotov, S.: Discrete maximum principle for linear parabolic problems solved on hybrid meshes. Appl. Num. Math. 53, 249–264 (2005) 7. Farag´ o, I.: Discrete maximum principle for finite element parabolic models in higher dimensions. Math. Comp. Sim (2009), doi:10.1016/j.matcom.2009.01.017 8. Fujii, H.: Some remarks on finite element analysis of time-dependent field problems. Theory and Practice in Finite Element Structural Analysis, 91–106 (1973) 9. Holand, I., Bell, K.: Finite element methods in stress analysis. Tapir, Trondheim (1996)
Climate Change Scenarios for Hungary Based on Numerical Simulations with a Dynamical Climate Model Ildik´ o Pieczka, Judit Bartholy, Rita Pongr´ acz, and Adrienn Hunyady Department of Meteorology, E¨ otv¨ os Lor´ and University, P´ azm´ any st. 1/a, H-1117 Budapest, Hungary
[email protected],
[email protected],
[email protected],
[email protected] http://nimbus.elte.hu/
Abstract. Climate models are systems of partial differential equations based on the basic laws of physics, fluid motion, and chemistry. The use of high resolution model results are essential for the generation of national climate change scenarios. Therefore we have adapted the model PRECIS (Providing REgional Climates for Impacts Studies), which is a hydrostatic regional climate model HadRM3P developed at the UK Met Office, Hadley Centre, and nested in HadCM3 GCM. It uses 25 km horizontal resolution transposed to the Equator and 19 vertical levels with sigma coordinates. First, the validation of the model (with two different sets of boundary conditions for 1961–1990) is accomplished. Results of the different model experiments are compared to the monthly climatological data sets of the Climatic Research Unit (CRU) of the University of East Anglia as a reference. Significance of the seasonal bias fields is checked using Welch’s t-test. Expected future changes — in mean values, distributions and extreme indices — are analysed for the period 2071–2100. The results suggest that the significant temperature increase expected in the Carpathian Basin may considerably exceed the global warming rate. The climate of this region is expected to become wetter in winter and drier in the other seasons. Keywords: Regional climate modeling, model PRECIS, Carpathian Basin, temperature, precipitation, Welch’s t-test.
1 Introduction
Climate models are widely used for estimating the future changes. They are computer-based numerical models of the governing partial differential equations of the atmosphere. Global climate models — due to their coarse horizontal resolution — do not provide enough information about regional scale. For the generation of national climate change scenarios in European subregions the use of fine resolution (10–25 km) regional climate models (RCMs) is crucial [8].
The expected regional climate change in the Carpathian Basin (located in Central-East Europe) is modelled by four different RCMs. For instance, validation studies of models RegCM and ALADIN-Climate are discussed in [17] and [4], respectively, and preliminary results of REMO are published in [16]. The present paper discusses the climate modeling experiments using the model PRECIS. We evaluated the model capability of reconstructing the present climate (1961– 1990) using two different sets of boundary conditions, (i) from the European Centre for Medium Range Weather Forecast (ECMWF) ERA-40 reanalysis database, (ii) from the HadCM3 GCM output data. In order to fulfill the validation task the results of the different model experiments are compared to the monthly climatological data sets of the Climatic Research Unit (CRU) of the University of East Anglia as a reference. For the future (2071–2100) we completed two simulation runs, namely the IPCC A2 and B2 scenario runs. In order to objectively evaluate the results, Welch’s t-test is used for testing the simulated seasonal fields.
2 Regional Climate Model PRECIS
The installation and the adaptation of the regional climate model PRECIS at the Department of Meteorology, Eötvös Loránd University (Budapest, Hungary) started in 2004. PRECIS is a high resolution limited area model with both atmospheric and land surface modules. The model was developed at the Hadley Centre of the UK Met Office [20], and it can be used over any part of the globe (e.g., [14,1]). The PRECIS regional climate model is based on the atmospheric component of HadCM3 [7] with substantial modifications to the model physics [10]. The atmospheric component of PRECIS is a hydrostatic version of the full primitive equations (1–5):

∂v/∂t = −v · ∇v − ω ∂v/∂p + f k × v − ∇Φ + DM   (1)

∂T/∂t = −v · ∇T + ω (κT/p − ∂T/∂p) + Q_rad/cp + Q_con/cp + DH   (2)

∂q/∂t = −v · ∇q − ω ∂q/∂p + E − C + Dq   (3)

∂ω/∂p = −∇ · v   (4)

∂Φ/∂p = −RT/p   (5)
where DM = (Dλ , Dφ ) are dissipation terms for momentum and DH and Dq are diffusion terms for heat and moisture, respectively, q is the specific humidity, E and C are the rates of evaporation and condensation due to cloud processes, and Φ is the geopotential [18].
Table 1. Configuration and properties of PRECIS

Prognostic variables: Surface pressure, zonal and meridional wind components, water vapour, potential temperature [10]
Horizontal coordinate-system: Spherical polar coordinates rotated to the equator [10]
Horizontal resolution: 0.44° × 0.44° or 0.22° × 0.22° [10]
Horizontal grid: Arakawa B grid [2]
Horizontal discretization: Finite differences (split-explicit) [10]
Vertical coordinate-system: Hybrid: terrain-following + pressure [15]
Coupling (LBC treatment): Davies relaxation scheme [10]
Linear term: Explicit scheme [10]
Advection: Heun scheme [11]
Timestep: 5 min [10]
Table 1 summarizes the main properties and the configuration of the model. Splitting techniques are commonly used when large-scale models — like climate models — are treated numerically [5]. In our model during the integration the geostrophic adjustment is separated from the advection part: adjustment is iterated three times per 5 minutes advection timestep. Averaged velocities over the three adjustment timesteps are used for advection, which is integrated in time using the Heun scheme [11]. This finite difference scheme is 4th order accurate except at high wind speeds when it is reduced to 2nd order accuracy for stability. In our studies, we used the finest possible horizontal resolution (0.22◦ ) for modeling the Central European climate. Hence, the target region contains 123 × 96 grid points, with special emphasis on the Carpathian Basin and its Mediterranean vicinity containing 105 × 49 grid points (Fig. 1). In the post-processing of the RCM outputs, daily mean values are used. In case of the control period (1961-1990), the initial conditions and the lateral boundary conditions (IC&LBC) for the regional model are taken from (i) the ERA-40 reanalysis database [6] using 1◦ horizontal resolution, compiled by the ECMWF, and (ii) the HadCM3 ocean-atmosphere coupled GCM using 150 km as a horizontal resolution. For the validation of the PRECIS results CRU TS 1.0 ([12,13]) data sets are used.
3 Design of the Statistical Analysis
In order to improve our knowledge about the reliability of the model results, we have made a hypothesis test. We cannot assume that the variances are equal in our samples (e.g. the variances of the model are equal with the variances of the observational data set, or the variances of the model are equal for the two different periods), so we decided to use Welch’s t-test [19]. It is an adaptation of Student’s t-test intended for use with two samples having possibly unequal variances. It defines the statistic t by the following formula:
Fig. 1. Topography of the selected Central European domain used in model PRECIS
t = (X̄1 − X̄2) / √(s1²/N1 + s2²/N2)   (6)

where X̄i, si², and Ni are the ith sample mean, sample variance, and sample size, respectively. The degrees of freedom can be calculated from (7–8):

C = N2 s1² / (N1 s2² + N2 s1²)   (7)

l = (N1 − 1)(N2 − 1) / (C²(N2 − 1) + (1 − C)²(N1 − 1))   (8)
so the degree of freedom is the closest integer value to l . Once they have been computed, these statistics can be used with the t-distribution to test the null hypothesis that the two population means are equal (using a two-tailed test). In our work we used the seasonal mean values of the 30-year periods. For the period of 1961–1990 we analysed only the results from the PRECIS run where the initial and lateral boundary conditions were provided by a GCM, because in case of the other run (where we had the IC&LBC from the ERA-40 reanalysis) it cannot be assumed that the samples are independent. For that run we plan to calculate the one-tailed t-test of the bias.
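For reference, formulas (6)-(8) amount to a few lines of Python; the two samples below are synthetic stand-ins for 30-year series of seasonal means, and the built-in unequal-variance t-test is used only as a cross-check.

import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
x1 = rng.normal(10.0, 1.0, 30)      # e.g. simulated seasonal means
x2 = rng.normal(10.6, 1.5, 30)      # e.g. observed (CRU) seasonal means

s1, s2 = x1.var(ddof=1), x2.var(ddof=1)
n1, n2 = len(x1), len(x2)
t = (x1.mean() - x2.mean()) / np.sqrt(s1 / n1 + s2 / n2)               # (6)
c = (n2 * s1) / (n1 * s2 + n2 * s1)                                    # (7)
dof = (n1 - 1) * (n2 - 1) / (c**2 * (n2 - 1) + (1 - c)**2 * (n1 - 1))  # (8)
p = 2 * stats.t.sf(abs(t), round(dof))     # two-tailed p-value, nearest-integer dof

print(f"t = {t:.3f}, dof = {dof:.1f}, p = {p:.4f}")
print(stats.ttest_ind(x1, x2, equal_var=False))    # cross-check: Welch's t-test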
4 Results
During the validation process, we analyzed monthly, seasonal, and annual temperature mean values and precipitation amounts for the control period. We found that, in general, the seasonal mean temperature fields are overestimated by the PRECIS simulations. The largest bias values are found in summer, when the average overestimation of PRECIS over Hungary is 2.2◦ C.
After completing the statistical hypothesis tests the results show that in case of Hungary the temperature bias values are not significant at 0.05 level at 93% and at 70% of the gridpoints located within the country in autumn and spring, respectively. In winter significant positive bias is found at 76% of the gridpoints at 0.05 level, while in summer bias values are significant at all the gridpoints of the Hungarian domain. Precipitation is far more variable both in time and in space than temperature. The spatially averaged precipitation is overestimated in the entire model domain, especially, in spring and winter. In case of Hungary, the spring precipitation is overestimated (by 35% on average), while in the other three seasons the precipitation is slightly underestimated (by less than 10% on average) in the country. For the 1961–1990 reference period statistical hypothesis tests suggest that in case of precipitation the spring bias values are significantly large (at 99% of all the gridpoints located inside the Hungarian borders). Interesting to see, that although the bias is only 9% in winter (this means about 10 mm/month) on average, it is large enough to make the bias significantly large at 28% of the gridpoints. Fig. 2 (left column) shows the differences between the simulated (PRECIS outputs) and the observed (CRU data) values for the 1961–1990 reference period in case of temperature and precipitation in the gridpoints where they are significant (non-significant bias values are indicated by white color). Temperature and precipitation bias fields of the PRECIS simulations can be considered acceptable if compared to other European RCM simulations ([9,3]). In our research plan several future scenario runs are scheduled using the PRECIS model for the period 2071–2100. Here, the results of the first accomplished PRECIS experiments are summarized using the SRES A2 and B2 emission scenarios [8]. The two right columns of Fig. 2 show the expected mean seasonal temperature and precipitation change in the gridpoints where it is significant at 0.05 level by the end of the 21st century (middle column — A2 scenario, right column — B2 scenario). The changes are more pronounced in case of A2 than B2, which is due to the higher CO2 concentration level projected for A2 scenario (it is 850 ppm, while about 600 ppm for B2 scenario). The expected temperature changes are significantly large for all seasons by 2071–2100 for both scenarios. The largest warming is projected for summer, 8◦ C and 6◦ C on spatial average for Hungary in case of A2 and B2 scenarios, respectively. For precipitation the projected drying expected in summer is significant at 0.05 level in each gridpoint located in the Carpathian Basin. According to the PRECIS simulations, the expected precipitation decreases are 58% and 43% on spatial average for the entire country for A2 and B2 scenarios, respectively. Seasonal precipitation amounts, in spring and autumn are also projected to decrease by the end of the 21st century, however the expected change is significant only in some of the gridpoints within Hungary (e.g., in the northern part of the country the expected spring drying is significant at 0.05 level in case of A2 scenario). Winter is projected to become wetter in Hungary, especially for A2 scenario in
[Fig. 2 panels, left to right: temperature BIAS (°C), PRECIS/CTL−CRU (1961–1990), IC&LBC: HadCM3 GCM; expected temperature change (°C), PRECIS/SCEN−PRECIS/CTL (2071–2100), IC&LBC: IPCC A2 and B2 scenarios; rows DJF, MAM, JJA, SON. The lower panels repeat this layout for precipitation BIAS and expected precipitation change (mm/month).]
Fig. 2. Figure shows the BIAS (left) and the expected change (right) of the seasonal mean temperature (◦ C) and precipitation (mm/month) predicted by PRECIS in the gridpoints where it is significant at 0.05 level in the Carpathian Basin
the western Transdanubian region, where the expected change exceeds 5–10% and it is significant at 0.05 level.
5 Conclusions
Based on the results presented in this paper, the following conclusions can be drawn.
1. Performance of the model for the past is acceptable compared to other simulations.
2. The sign of the expected changes is the same for A2 and B2 scenarios, but the amplitude of the change is larger in case of A2.
3. In all the four seasons significant warming is projected at 0.05 level, the largest warming can be expected in summer.
4. Precipitation amount is expected to decrease by the end of the century, except in winter. The projected drying is significant at 0.05 level in summer.
MEC Albert Apponyi programme: established by the support of the National Office for Research and Technology.
Acknowledgments. Research leading to this paper has been supported by the following sources: the Hungarian Academy of Sciences under the program 2006/TKI/246 titled Adaptation to climate change, the Hungarian National Research Development Program under grants NKFP-3A/082/2004 and NKFP6/079/2005, the Hungarian National Science Research Foundation under grants T-049824, K-67626, K-69164, and K-678125, the Hungarian Ministry of Environment and Water, and the CECILIA project of the European Union Nr. 6 program (contract no. GOCE-037005).
References 1. Akhtar, M., Ahmad, N., Booij, M.J.: The impact of climate change on the water resources of Hindukush-Karakorum-Himalaya region under different glacier coverage scenarios. Journal of Hydrology (2008), doi:10.1016/j.jhydrol.2008.03.015 2. Arakawa, A., Lamb, V.R.: Computational design of the basic dynamical processes of the UCLA general circulation model. In: Chang, J. (ed.) Methods in Computational Physics, vol. 17, pp. 173–265. Academic Press, New York (1977) 3. Bartholy, J., Pongr´ acz, R., Gelyb´ o, G.: Regional climate change expected in Hungary for 2071–2100. Applied Ecology and Environmental Research 5, 1–17 (2007) 4. Csima, G., Hor´ anyi, A.: Validation of the ALADIN-Climate regional climate model at the Hungarian Meteorological Service. Id˝ oj´ ar´ as 112, 155–177 (2008)
´ Zlatev, Z.: Different splitting techniques with 5. Dimov, I., Farag´ o, I., Havasi, A., application to air pollution models. Int. J. Environment and Pollution 32(2), 174– 199 (2008) 6. Gibson, J.K., Kallberg, P., Uppala, S., Nomura, A., Hernandez, A., Serrano, A.: ERA description, Reading. ECMWF Reanalysis Project Report Series, vol. 1, p. 77 (1997) 7. Gordon, C., Cooper, C., Senior, C.A., Banks, H., Gregory, J.M., Johns, T.C., Mitchell, J.F.B., Wood, R.A.: The simulation of SST, sea ice extents and ocean heat transports in a version of the Hadley Centre coupled model without flux adjustments. Climate Dynamics 16, 147–168 (2000) 8. IPCC: Climate Change 2007: The Physical Science Basis. In: Solomon, S., Qin, D., Manning, M., Chen, Z., Marquis, M., Averyt, K.B., Tignor, M., Miller, H.L. (eds.) Contribution of Working Group I to the Fourth Assessment Report of the IPCC, 996 p. Cambridge University Press, Cambridge (2007) 9. Jacob, D., B¨ arring, L., Christensen, O.B., Christensen, J.H., de Castro, M., D´equ´e, M., Giorgi, F., Hagemann, S., Hirschi, M., Jones, R., Kjellstr¨ om, E., Lenderink, G., Rockel, B., S´ anchez, E., Sch¨ ar, C., Seneviratne, S.I., Somot, S., van Ulden, A., van den Hurk, B.: An inter-comparison of regional climate models for Europe: Model performance in Present-Day Climate. Climatic Change 81, 21–53 (2007), doi:10.1007/s10584-006-9213-4 10. Jones, R.G., Noguer, M., Hassell, D.C., Hudson, D., Wilson, S.S., Jenkins, G.J., Mitchell, J.F.B.: Generating high resolution climate change scenarios using PRECIS, 40 p. UK Met Office Hadley Centre, Exeter (2004) 11. Mesinger, F.: Horizontal Advection Schemes of a Staggered Grid — An Enstrophy and Energy-Conserving Model. Monthly Weather Review 109, 467–478 (1981) 12. New, M., Hulme, M., Jones, P.: Representing twentieth-century space-time climate variability. Part I: Development of a 1961–90 mean monthly terrestrial climatology. Journal of Climate 12, 829–856 (1999) 13. New, M., Hulme, M., Jones, P.: Representing twentieth-century space-time climate variability. Part 2: Development of 1901–96 monthly grids of terrestrial surface climate. Journal of Climate 13, 2217–2238 (2000) 14. Rupa Kumar, K., Sahai, A.K., Krishna Kumar, K., Patwardhan, S.K., Mishra, P.K., Revadekar, J.V., Kamala, K., Pant, G.B.: High-resolution climate change scenarios for India for the 21st century. Current Science 90, 334–345 (2006) 15. Simmons, A.J., Burridge, D.M.: An energy and angular-momentum conserving vertical finite difference scheme and hybrid vertical coordinates. Monthly Weather Review 109, 758–766 (1981) 16. Sz´epsz´ o, G., Hor´ anyi, A.: Transient simulation of the REMO regional climate model and its evaluation over Hungary. Id˝ oj´ ar´ as 112, 203–232 (2008) 17. Torma, C., Bartholy, J., Pongr´ acz, R., Barcza, Z., Coppola, E., Giorgi, F.: Adaptation and validation of the RegCM3 climate model for the Carpathian Basin. Id˝ oj´ ar´ as 112, 233–247 (2008) 18. Trenberth, K.E. (ed.): Climate System Modeling, 788 p. Cambridge University Press, Cambridge (1992) 19. Welch, B.L.: The significance of the difference between two means when the population variances are unequal. Biometrika 29, 350–361 (1938) 20. Wilson, S., Hassell, D., Hein, D., Jones, R., Taylor, R.: Installing and using the Hadley Centre regional climate modelling system, PRECIS. Version 1.5.1, 157 p. UK Met Office Hadley Centre, Exeter (2007)
Numerical Simulations of Reaction-Diffusion Systems Arising in Chemistry Using Exponential Integrators
Răzvan Ştefănescu and Gabriel Dimitriu
“Gr. T. Popa” University of Medicine and Pharmacy, Department of Mathematics and Informatics, 700115 Iaşi, Romania
[email protected],
[email protected]
Abstract. We perform a comparative numerical study of two reactiondiffusion models arising in chemistry by using exponential integrators. Numerical simulations of the reaction kinetics associated with these models, including both the local and global errors as a function of time step and error as a function of computational time are shown.
1 Introduction
Reaction-diffusion models can display a wide variety of spatio-temporal phenomena in response to large amplitude perturbations. The nature of the kinetic terms plays an important role in determining the solution behaviour. The efficient and accurate simulation of such systems, however, represents a difficult task. This is because they couple a stiff diffusion term with a (typically) strongly nonlinear reaction term. When discretised, this leads to large systems of strongly nonlinear, stiff ODEs. In this work we carry out a comparative numerical study of two reaction-diffusion models arising in chemistry by using exponential integrators. The paper is organized as follows. Section 2 briefly describes the exponential integrators and their features. In Section 3 the two reaction kinetics — the chlorite-iodide-malonic acid (CIMA) starch reaction and a glycolysis model governed by a cross kinetics, respectively — are presented, on the basis of which the numerical study is carried out. Section 4 is devoted to a short description of the numerical schemes applied to the models under study, together with results of the numerical simulations. Some concluding remarks are drawn at the end.
2 General Framework of the Exponential Integrators
Exponential integrators represent numerical schemes specifically constructed for solving differential equations (see for details [11]), where it is possible to split the problem into a linear and a nonlinear part

ẏ = Ly + N(y, t),   y(t_{n−1}) = y_{n−1},   (1)
where y ∈ C^d, L ∈ C^{d×d} and N : C^d × R → C^d for all t. In some cases (discretizations of PDEs), depending on the model under study, the matrix L is unbounded as d tends to infinity. The goal of the exponential integrators is to treat the linear term exactly and allow the remaining part of the integration to be integrated numerically using an explicit scheme. An exponential integrator has two main characteristics: (i) If L = 0, then the scheme reduces to a standard general linear scheme. This is often called the underlying general linear scheme; (ii) If N(y, t) = 0 for all y and t, then the scheme reproduces the exact solution of (1). To satisfy (i) the exponential function must be used within the numerical scheme. Despite the fact that L is unbounded, typically the coefficients of the scheme will be bounded. For an s-stage exponential integrator of Runge-Kutta type, we define the internal stages and output approximation:

Yi = h Σ_{j=1}^{s} aij(hL) N(Yj, t_{n−1} + cj h) + ui1(hL) y_{n−1},   i = 1, . . . , s,
yn = h Σ_{i=1}^{s} bi(hL) N(Yi, t_{n−1} + ci h) + v1(hL) y_{n−1}.   (2)
The feature (i) above is satisfied if we require in (2) that ui1(0) = 1, aij(0) = aij, v1(0) = 1, and bi(0) = bi, where the real numbers aij and bj represent the coefficients of the underlying Runge-Kutta scheme. A step of length h in an exponential general linear scheme requires importing r approximations into the step, denoted as yi^{[n−1]}, i = 1, . . . , r. The internal stages (as in the Runge-Kutta case) are written as Yi, i = 1, . . . , s. After the step is completed, r updated approximations are computed. These are then used in the next step. Each step in an exponential general linear scheme can be written as

Yi = h Σ_{j=1}^{s} aij(hL) N(Yj, t_{n−1} + cj h) + Σ_{j=1}^{r} uij(hL) yj^{[n−1]},   i = 1, . . . , s,
yi^{[n]} = h Σ_{j=1}^{s} bij(hL) N(Yj, t_{n−1} + cj h) + Σ_{j=1}^{r} vij(hL) yj^{[n−1]},   i = 1, . . . , r.   (3)
The exponential integrators of Runge-Kutta type are easily seen to be a special case when r = 1 with ui1 (z) = ai0 (z), v11 (z) = b0 (z) and b1j (z) = bj (z).
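A minimal, self-contained illustration of these definitions is the one-stage exponential Runge-Kutta scheme (exponential, or ETD, Euler), y_{n+1} = e^{hL} y_n + h φ1(hL) N(y_n, t_n) with φ1(z) = (e^z − 1)/z: the linear part is propagated exactly (property (ii)) and the scheme reduces to forward Euler for L = 0 (property (i)). The Python sketch below applies it to a small 1D semilinear test problem; the test problem and grid are illustrative choices, not one of the models of Section 3 nor one of the EXPINT schemes discussed later.

import numpy as np
from scipy.linalg import expm, solve
from scipy.integrate import solve_ivp

m = 40
x = np.linspace(0.0, 1.0, m + 2)[1:-1]
h_x = x[1] - x[0]
L = (np.diag(-2.0 * np.ones(m)) + np.diag(np.ones(m - 1), 1)
     + np.diag(np.ones(m - 1), -1)) / h_x**2         # Dirichlet Laplacian (stiff linear part)
N = lambda y, t: y * (1.0 - y)                        # nonlinear reaction term
y0 = np.sin(np.pi * x)

def etd_euler(h, T):
    E = expm(h * L)
    phi1 = solve(h * L, E - np.eye(m))                # phi_1(hL) = (e^{hL} - I)(hL)^{-1}
    y = y0.copy()
    for n in range(int(round(T / h))):
        y = E @ y + h * phi1 @ N(y, n * h)
    return y

T = 0.1
ref = solve_ivp(lambda t, y: L @ y + N(y, t), (0, T), y0,
                method="BDF", rtol=1e-10, atol=1e-12).y[:, -1]
for h in (1e-2, 5e-3, 2.5e-3):
    err = np.linalg.norm(etd_euler(h, T) - ref, np.inf)
    print(f"h = {h:.4e}   error = {err:.3e}")         # error drops roughly like O(h)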
3 Defining the Model Equations
In this section we shortly describe the models governing different reaction kinetics arising in chemistry, which will be solved numerically in Section 4. Following nondimensionalization of the system in space, such that the spatial domain becomes [0, 1], the general system is represented by the following equations: ut = Du uxx + f (u, v) ,
vt = Dv vxx + g(u, v) ,
where u and v are the concentrations of the two morphogens, Du and Dv are the corresponding diffusion coefficients, and f(u, v) and g(u, v) encode the reaction kinetics between u and v.
Model describing the chlorite-iodide-malonic acid starch reaction. The first Turing patterns were noticed in the chlorite-iodide-malonic acid (CIMA) starch reaction [7]. The model proposed by Lengyel and Epstein [10] stresses three processes: the reaction between malonic acid (MA) and iodine to create iodide, and the reactions between chlorite and iodide and chloride and iodide. By imposing the experimentally realistic hypothesis that the concentrations of malonic acid, chlorine dioxide, and iodine are constant, Lengyel and Epstein obtained the following model:

∂u/∂t = k1 − u − 4uv/(1 + u²) + ∇²u,   ∂v/∂t = k2 k3 u − uv/(1 + u²) + c∇²v,

where u, v are the concentrations of iodide and chlorite, respectively, and k1, k2, k3, and c are positive constants.
Glycolysis model with cross kinetics. We also analyze numerically a glycolysis model [1] with cross kinetics (where the activator upregulates itself, but downregulates the inhibitor, and the inhibitor likewise upregulates the activator and downregulates itself). In this model, the functions f and g are given by f(u, v) = ru²v + νv − μu,
g(u, v) = r(1 − u2 v) − νv .
Here, the parameters r, μ, and ν are positive constants.
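Written out as code, the two kinetics and a simple method-of-lines discretization of the general system read as follows. The kinetic terms are transcribed from the equations above, and the parameter defaults are the values quoted with the numerical results in Section 4; the homogeneous Neumann boundary condition, the grid, and the diffusion coefficients passed in the usage lines are illustrative assumptions.

import numpy as np

def cima(u, v, k1=0.095, k2=0.5, k3=3.5):
    # chlorite-iodide-malonic acid (CIMA) kinetics, as displayed above
    f = k1 - u - 4.0 * u * v / (1.0 + u**2)
    g = k2 * k3 * u - u * v / (1.0 + u**2)
    return f, g

def glycolysis(u, v, r=0.025, mu=0.045, nu=0.35):
    # glycolysis model with cross kinetics
    f = r * u**2 * v + nu * v - mu * u
    g = r * (1.0 - u**2 * v) - nu * v
    return f, g

def rhs(t, y, kinetics, Du, Dv, n):
    # method-of-lines right-hand side on [0, 1]; homogeneous Neumann BC (an assumption)
    u, v = y[:n], y[n:]
    h = 1.0 / (n - 1)
    def lap(w):
        return np.concatenate(([2.0 * (w[1] - w[0])],
                               w[:-2] - 2.0 * w[1:-1] + w[2:],
                               [2.0 * (w[-2] - w[-1])])) / h**2
    f, g = kinetics(u, v)
    return np.concatenate((Du * lap(u) + f, Dv * lap(v) + g))

# usage: evaluate the stiff ODE system for the glycolysis model on n = 128 points
# (ND = 128 in Section 4); Du = Dv = 1 here is an illustrative choice.
n = 128
x = np.linspace(0.0, 1.0, n)
y0 = np.concatenate((1.0 + 0.01 * np.cos(2 * np.pi * x),
                     1.0 + 0.01 * np.sin(2 * np.pi * x)))
print(rhs(0.0, y0, glycolysis, 1.0, 1.0, n)[:4])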
4 Numerical Schemes and Computational Issues
Here, we present the numerical schemes defining the exponential integrators that have been used in our simulations. All these integrators belong to the package EXPINT written in Matlab [3]. In this description, we will use two terms of order. The non-stiff order refers to the case when the operator L is bounded, such conditions were derived in [11]. The stiff order refers to the case when L is unbounded [5], for various schemes. The first scheme that has been applied to our models is named Lawson4. The scheme Lawson4 belongs to the Lawson schemes constructed by applying the Lawson transformations [9] to the semilinear problem. It is based on the classical fourth order scheme of Kutta (see [4], Eq. (235i)), and this scheme has stiff order one. The scheme denoted by hochost4 was developed by Hochbruck and Ostermann. It has five-stages and is the only known exponential Runge-Kutta method with stiff order four. Nørsett designed in [13] a class of schemes which reduced to the AdamsBashforth methods when the linear part of the problem is zero. ABLawson4 scheme has stiff order one and is based on the Adams-Bashforth scheme of order four and is represented in this way so that the incoming approximation has the form y [n−1] = [yn−1 , hNn−2 , hNn−3 , hNn−4 ]T .
624
R. S ¸ tef˘ anescu and G. Dimitriu
ABNørsett4 is a stiff order four scheme of Nørsett [13], which is implemented so that the incoming approximation has the same form as in ABLawson4. ETD schemes are based on algebraic approximations to the nonlinear term in the variation of constants formula. ETD means “Exponential Time Differencing” and the name stems from [6]. The scheme ETD4RK due to Cox and Matthews (in [6], Eqs. (26)–(29)) started the recent focus on exponential integrators, unfortunately it has only stiff order two. ETD5RKF is a non-stiff fifth order scheme based on the six stage fifth order scheme of Fehlberg. The scheme RKMK4t uses a convenient truncation of the dexp−1 operator, leading to the method of Munthe-Kaas [12], which again is of stiff order two, but suffers from instabilities when non-periodic boundary conditions are used. Krogstad [8] constructed the generalized Lawson schemes as a means of overcoming some of the undesirable properties of the Lawson schemes. This class of schemes uses approximations of the nonlinear term from previous steps, resulting in an exponential general linear method. The scheme genlawson45 included in the package mentioned above is also used for our numerical study. In what follows, we present some comparative results concerning the quality of the numerical schemes that have been used for CIMA reaction model. There are shown relationships between the global error, the time step h varying from 10−3 to 10−1 , and the computational time. The global error is calculated at the final moment of time (t = 1) by the formula: ||Y (:, 1) − Yref (:, 1)||2 , where Y (:, 1) := (u(:, 1), v(:, 1)) and Yref (:, 1) := (uref (:, 1), vref (:, 1)). All the reference solutions Yref were obtained by using Matlab routine ode15s. We notice for CIMA model, that etd5rkf scheme has the smallest global error but, at the same time, the largest computational time, while the ablawson4 scheme indicated a significant increased rate of the global error in conjunction with the lowest computational time. Practically, we have found the same behaviour for the other numerical schemes that have been implemented for this model. This is the reason why we have just presented the numerical results obtained only for these three schemes.
Table 1. Comparative results concerning the quality of the numerical schemes: the global error as a function of time step h and time used for the CIMA reaction kinetics CIMA, ND=128, IC: Smooth, c = 0.1, k1 = 0.095, k2 = 0.5, k3 = 3.5 Scheme/ Etd5rkf Ablawson4 Genlawson45 Time step Time used Global Err. Time used Global Err. Time used Global Err. 0.0010 1.203 3.940E-08 0.625 8.816E-05 1.297 5.498E-07 0.0020 0.656 8.524E-07 0.313 4.193E-04 0.625 7.032E-06 0.0032 0.422 5.271E-06 0.234 9.923E-04 0.438 3.136E-05 0.0063 0.266 5.428E-05 0.125 2.893E-03 0.219 2.134E-04 0.0127 0.188 3.494E-04 0.063 6.866E-03 0.141 1.007E-03 0.0313 0.094 2.264E-03 0.094 1.701E-02 0.078 4.987E-03 0.0625 0.078 7.078E-03 0.016 2.411E-02 0.063 1.380E-02 0.1000 0.063 1.445E-02 0.031 1.332E-01 0.047 1.001E-01
Numerical Simulations of Reaction-Diffusion Systems Arising
625
Glycolysis, ND=128, IC: Smooth, r=0.025, miu =0.045, niu =0.35
−2
10
lawson4 hochost4 etd4rk rkmk4t abnorsett4 ablawson4 etd5rkf genlawson45 modgenlawson45
−4
Global error
10
−6
10
−8
10
−10
10
−12
10
−2
10
−1
0
10
10
1
10
Time used
Fig. 1. Comparative results concerning the global error as a function of computational time for the glycolysis model with cross reaction kinetics
From theoretical point of view, the genlawson45 scheme, which has a non stiff order 6, and a stiff order 5, should have a global error less than the one obtained by applying the etd5rkf scheme, which has non stiff order 5 and stiff order 1. The results contained in Table 1 are in contradiction with the previous statement, so that we tried to use other model parameters to see whether the situation continues to persist. We noticed that if we increase the regularity of the initial conditions, the global error in genlawson45 case has lower values (see [2] for a similar interpretation related to the nonlinear Schr¨ odinger equation). In the case of the cross reaction kinetics for glycolysis model, the schemes genlawson45 and modgenlawson45 had the best behaviour in comparison with the results of the other 7 schemes involved in our study (see Fig. 1). Figure 2 gives timing results in the sense that it depicts dependencies of the global error as a function of the computational time for glycolysis model. In this case, good results have been obtained with the lawson4, etd4rkf, and rkmk4t schemes. Despite their low stiff order (stiff order 1), the ABLawson4 and etd5rkf schemes (that have both the stiff order 1) behaved as 3.7 and 5.2 order schemes respectively, upon glycolysis problem with smooth initial conditions. Also, the genlawson45 scheme (with stiff order 5) behaved like a 5.6 order scheme (see Fig. 3). We remark that with a proper choice of the initial condition we can give an ascending boost to the scheme order levels, especially for schemes with low stiff order, but further investigations are needed.
626
R. S ¸ tef˘ anescu and G. Dimitriu Glycolysis, ND=128, IC: Smooth, r=0.025, miu =0.045, niu =0.35
−2
10
lawson4 hochost4 etd4rk rkmk4t abnorsett4 ablawson4 etd5rkf genlawson45 modgenlawson45
−4
Global error
10
−6
10
−8
10
−10
10
−12
10
−3
10
−2
−1
10 Timestep h
10
Fig. 2. Comparative results concerning the quality of the numerical schemes: the global error as a function of time step h for the glycolysis model with cross reaction kinetics Glycolysis, ND=128, IC: Smooth, r=0.025, miu =0.045, niu =0.35
−2
10
−4
Global error
10
−6
10
3.7 −8
10
5.6 5.2
−10
10
ablawson4 etd5rkf genlawson45
−12
10
−3
10
−2
10 Timestep h
−1
10
Fig. 3. An order test applied to the ablawson4, etd5rkf, and genlawson45 schemes. The dotted lines are only indicator lines shoing how corresponding orders look like.
Numerical Simulations of Reaction-Diffusion Systems Arising
627
Table 2. Comparative results concerning the quality of the numerical schemes: the local error evaluated as a function of the time step h for the CIMA reaction kinetics CIMA, ND=128, IC: Smooth, c = 0.1, k1 = 0.095, k2 = 0.5, k3 = 3.5 Scheme/ Inner points Lawson4 Hochost4 Etd4rkt Rkmk4t Friedli 0.001 3.864E-06 6.181E-06 6.169E-06 3.467E-05 6.176E-06 0.002 3.993E-05 5.126E-05 5.122E-05 2.134E-04 5.125E-05 0.004 3.391E-04 3.036E-04 3.038E-04 8.104E-04 3.037E-04 0.016 6.391E-03 4.553E-03 4.565E-03 8.255E-03 4.551E-03 0.032 1.693E-02 1.293E-02 1.294E-02 3.911E-02 1.291E-02 0.126 7.011E-02 7.457E-02 7.325E-02 4.028E-01 7.363E-02 0.251 1.569E-01 1.739E-01 1.443E-01 1.024E+00 1.447E-01 Table 3. Comparative results concerning the quality of the numerical schemes: the local error evaluated as a function of the time step h for the glycolysis model with cross reaction kinetics Glycolysis, ND=128, Scheme/ Inner points Lawson4 0.001 5.700E-14 0.002 1.820E-13 0.004 4.216E-12 0.016 2.157E-09 0.032 3.287E-08 0.126 7.105E-06 0.251 9.375E-05
IC: Smooth, r = 0.025, μ = 0.045, ν = 0.35 Hochost4 1.690E-13 5.012E-12 1.524E-10 1.169E-07 2.628E-06 5.068E-04 3.737E-03
Etd4rkt 1.610E-13 4.771E-12 1.450E-10 1.111E-07 2.495E-06 4.753E-04 3.439E-03
Rkmk4t 5.770E-13 1.792E-11 5.453E-10 4.210E-07 9.591E-06 2.038E-03 1.709E-02
Friedli 1.650E-13 4.884E-12 1.485E-10 1.139E-07 2.562E-06 4.925E-04 3.609E-03
Tables 2 and 3 contain the results of a local order test applied to the two models under investigation. Here, another exponential Runge-Kutta method — Friedli scheme with the stiff order 3 — is added. This test does not make any sense for multistep schemes. The vector of time steps is built up, so that the smallest time step must divide all other time steps used. Otherwise, the computation would take an unnecessarily long time. The local error is calculated at every inner time point by ||Y (:, t)−Yref (:, t)||2 , where as before, Y (:, t) := (u(:, t), v(:, t)) and Yref (:, t) := (uref (:, t), vref (:, t)). The plots precise in their title N D = 128 and “IC: Smooth”. This means that we have used 128 Fourier modes in the spatial direction (must be power of 2), and as initial conditions for the model variables, we have chosen a set of values with a Gaussian distribution.
5
Conclusions
In this study we carried out a comparative numerical approach for two models arising in chemistry: the chlorite-iodide-malonic acid (CIMA) starch reaction and
628
R. S ¸ tef˘ anescu and G. Dimitriu
a glycolysis model governed by a cross kinetics. The numerical results have been obtained by using several exponential integrators belonging to Matlab package EXPINT [3]. We computed for the glycolysis model the values of the observed orders belonging to three numerical schemes. We also noticed that lower stiff order schemes depend upon the regularity of the initial conditions. Illustrative plots and numerical results were presented concerning comparative global and local errors as well as timing results of the integration schemes.
Acknowledgments The paper was supported by the project ID 342/2008, CNCSIS, Romania.
References 1. Ashkenazi, M., Othmer, H.G.: Spatial Patterns in Coupled Biochemical Oscillators. J. Math. Biol. 5, 305–350 (1978) 2. Berland, H., Skaflestad, B.: Solving the nonlinear Schr¨ odinger equation using exponential integrators. Modeling, Identification and Control 27(4), 201–217 (2006) 3. Berland, H., Skaflestad, B., Wright, W.: EXPINT — A Matlab package for exponential integrators. Numerics 4 (2005) 4. Butcher, J.C.: Numerical methods for ordinary differential equations. John Wiley & Sons, Chichester (2003) 5. Celledoni, E., Marthinsen, A., Qwren, B.: Commutator-free Lie group methods. FGCS 19(3), 341–352 (2003) 6. Cox, S.M., Matthews, P.C.: Exponential time differencing for stiff systems. J. Comp. Phys. 176(2), 430–455 (2002) 7. Kepper, P., De Castets, V., Dulos, E., Boissonade, J.: Turing-type chemical patterns in the chlorite-iodide-malonic acid reaction. Physica D 49, 161–169 (1991) 8. Krogstad, S.: Generalized integrating factor methods for stiff PDEs. Journal of Computational Physics 203(1), 72–88 (2005) 9. Lawson, D.J.: Generalized Runge-Kutta processes for stable systems with large Lipschitz constants. SIAM J. Numer. Anal. 4, 372–380 (1967) 10. Lengyel, I., Epstein, I.R.: Modeling of Turing structures in the chlorite-iodidemalonic acid-starch reaction system. Science 251, 650–652 (1991) 11. Minchev, B., Wright, W.M.: A review of exponential integrators for semilinear problems. Technical Report 2, The Norwegian University of Science and Technology (2005) 12. Munthe-Kaas, H.: High order Runge-Kutta methods on manifolds. Applied Numerical Mathematics 29, 115–127 (1999) 13. Nørsett, S.P.: An A-stable modification of the Adams-Bashforth methods. In: Conf. on Numerical solution of Differential Equations, Dundee, pp. 214–219. Springer, Berlin (1969)
On the Discretization Time-Step in the Finite Element Theta-Method of the Two-Dimensional Discrete Heat Equation Tam´as Szab´o E¨ otv¨ os Lor´ and University, Institute of Mathematics 1117 Budapest, P´ azm´ any P. S. 1/c, Hungary
[email protected]
Abstract. In this paper the numerical solution of the two-dimensional heat conduction equation is investigated, by applying Dirichlet boundary condition at the upper side and Neumann boundary condition to the left, right, and lower sides. To the discretization in space, we apply the linear finite element method and for the time discretization the well-known theta-method. The aim of the work is to derive an adequate numerical solution for the homogeneous initial condition by this approach. We theoretically analyze the possible choices of the time-discretization step-size and establish the interval where the discrete model can reliably describe the original physical phenomenon. As the discrete model, we arrive at the task of the one-step iterative method. We point out that there is a need to obtain both lower and upper bounds of the time-step size to preserve the qualitative properties of the real physical solution. The main results of the work is determining the interval for the time-step size to be used in this special finite element method and analyzing the main qualitative characterstics of the model. Our theoretical results are verified by different numerical experiments.
1
Preliminaries
The Proton exchange membrane fuel cells (PEMFC) are electrochemical conversion devices that produce electricity from fuel and an oxidant, which react in R the presence of an electrolyte. The most commonly used membrane is Nafion, which relies on liquid water humidification of the membrane. It is not appropriate to operate it above 80-90◦C, since the membrane would dry, which implies the importance of the deeper analysis of the heat conduction modeling of a fuel cell. Since the heat produced in fuel cells appearing on a specified side of cell, we need to apply different boundary conditions on different sides of the two dimensional mathematical model. Minimum time step sizes for different discrete one-dimensional diffusion problems have been analyzed by many researchers [7]. Thomas and Zhou [10] have constructed an approach to develop the minimum time step size that can be used in the finite element method (FEM) of one-dimensional diffusion problems. In a previous work [9], we pointed out its imperfections and extended the analysis to I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 629–636, 2010. c Springer-Verlag Berlin Heidelberg 2010
630
T. Szab´ o
the theta-method as well, and developed an upper bound for the maximum time step size. In this paper, the bounds of the time step size for two-dimensional classical diffusion problem was investigated that can be used in the FEM applying bilinear shape functions [6]. During the analysis the heat conduction equation is considered as a prototypical parabolic partial differential equation. The general form of this equation on Ω × (0, T ), where Ω := [0, 1] × [0, 1], is c
∂u = κ∇2 u, (x, y) ∈ Ω, t ∈ (0, T ), ∂t ∂u = 0, t ∈ [0, T ) u|ΓD = τ, ∂n ΓN
(1.1)
u(x, y, 0) = u0 (x, y), (x, y) ∈ Ω, where c represents the specific heat capacity, u is the temperature of the analyzed domain, t and x, y denote the time and space variables, respectively, κ is the coefficient of the thermal conductivity. Moreover, τ is the given temperature at ΓD , a non-negative real number. The left-hand side of this equation expresses the rate of the temperature change at a point in space over time. The righthand side indicates the spatial thermal conduction in direction x. ΓN denotes a specified part of the boundary of Ω, where Neumann boundary condition holds (x = 0 or x = 1 or y = 1), and ΓD denotes the part of the boundary where Dirichlet boundary condition was applied (y = 0). During the analysis of the problem the space was divided into 2n2 equilateral triangle elements (Fig. 1). The heat capacity and the coefficient of thermal conductivity are assumed to be constants. We seek the spatially discretized temperature ud in the form: ud (x, y, t) =
n
φi,j (t)Ni,j (x, y),
(1.2)
i,j=0
where Ni,j (x, y) are the following shape functions: ⎧ 1− ⎪ ⎪ ⎪ ⎪ ⎪ 1− ⎪ ⎪ ⎪ ⎨1 + Ni,j (x, y) := ⎪ 1+ ⎪ ⎪ ⎪ ⎪ ⎪1 + ⎪ ⎪ ⎩ 1−
1 (xi h 1 h (xi 1 h (yj 1 h (xi 1 h (xi 1 h (yj
− x) − h1 (yj − y), − x), − y), − x) + h1 (yj − y), − x), − y),
if if if if if if
(x, y) ∈ ω1i,j (x, y) ∈ ω2i,j (x, y) ∈ ω3i,j (x, y) ∈ ω4i,j (x, y) ∈ ω5i,j (x, y) ∈ ω6i,j
(1.3)
φi,j are unknown, and n2 is the ordinal number of nodes. The unknown temperature index j starts from 1, hence, due to the boundary condition at y = 0 the temperature is known, namely, φ0,i (t) = τ, ∀i = 0, 1...n.
On the Discretization Time-Step in the Finite Element Theta-Method
631
Fig. 1. The equilateral triangle grid on the analysed domain
By redistributing the indices, the following equations draw up: n i,k=0
φi,k (t)
cNi,k Nj,l dydx+ Ω n
φi,k (t)
κ Ω
i,k=0
∂Ni,k ∂Nj,l ∂Ni,k ∂Nj,l + ∂x ∂x ∂y ∂y
dydx = 0,
(1.4)
j = 1, 2 . . . n, l = 0, 1 . . . n. 3
Let K, M ∈ R(n+1)×n denote the so-called stiffness and mass matrices, respectively, defined by:
∂Ni,k ∂Nj,l ∂Ni,k ∂Nj,l κ + (Kk,l )i,j = dydx, (1.5) ∂x ∂x ∂y ∂y Ω cNi,k Nj,l dydx. (1.6) (Mk,l )i,j = Ω
Then (1.4) can be expressed as:
M Φ + K Φ = 0,
(1.7)
where Φ ∈ Rn×n is a vector function with the components φi,j . For the time discretization of the system of ODEs (1.7) we apply the well-known theta-method, which results in the equation M
Φm+1 − Φm + K ΘΦm+1 + (1 − Θ)Φm = 0. Δt
(1.8)
Clearly, this is a system of linear algebraic equations w.r.t. the unknown vector Φm+1 being the approximation of the temperature at the new time-level which
632
T. Szab´ o
is an array on the discretization of Ω. Here the parameter Θ is related to the applied numerical method and it is an arbitrary parameter on the interval [0, 1]. It is worth emphasizing that for Θ = 0.5 the method yields the Crank-Nicolson implicit method which has higher accuracy for the time discretization [2]. In order to preserve the qualitative characteristics of the solution, the connections between the equations and the real problem must be analyzed. To obtain a lower bound for the time-step size, Lorenz’s theorem was applied [5]. As it is well known, the temperature (in Kelvin) is a non-negative function in physics. In this article the following sufficient condition will be shown for the time-step size of the finite element theta-method to retain the physical characteristics of the solution: √ h2 c h2 c 3 + 14 ≤ Δt ≤ , (1.9) 12Θκ 8(1 − Θ)κ where h is the length of the spatial approximation. This sufficient condition is well known for problems with pure Dirichlet boundary conditions but not for the problems with mixed boundary conditions (Newton and Dirichlet), see, e.g., [3,4].
2
Analysis of FEM Equation
Let Φi denote the transpose of the temperature vector at the i-th row of the T discretization of Ω, namely Φi = (Φi,0 , Φi,1 , . . . , Φi,n ) . First we analyze the case when homogeneous initial condition is given, i.e., u0 (x) = 0. Then Φ0i = 0, (i = 1, 2, ..., n). Since Φ0 = τ > 0, therefore, it is worth emphasizing that, if τ is greater than zero, there is a discontinuity in the initial conditions at y = 0. We investigate the condition under which the first iteration, denoted by Φ=Φ1 , results in non-negative approximation. Considering the fact that the matrices of equation (1.8) are block tridiagonal matrices the following system can be obtained: AΦ0 + BΦ1 + CΦ2 = 0
(2.1(1))
AΦ1 + BΦ2 + CΦ3 = 0 ...
(2.1(2))
AΦn−2 + BΦn−1 + CΦn = 0 AΦn−1 + DΦn = 0
(2.1(n-1)) (2.1(n))
where A, B, C, D are n-by-n tridiagonal matrices that one can get easily after performing the integral in (1.5) and (1.6) for the given shape functions. When A = 0n,n , then the solution of this equation system is equal to zero viz. Φn = 0 follows from (2.1(n)), (2.1(n-1)) results in Φn−1 = 0 and so on, by taking into consideration the regularity of D. This means that the numerical scheme does not change the initial state, which contradicts to the physical process. Therefore, in the sequel we assume that A = 0n,n .
On the Discretization Time-Step in the Finite Element Theta-Method
633
We seek the solution in the form Φi = Zi Φ0 , i = 0, 1, ..., n.
(2.2)
Obviously, Z0 is the n-by-n indentity matrix. Using (2.1(n)), Φn can be expressed as Φn = −D−1 AZn−1 Φ0 , (2.3) which implies that
Zn = −D−1 AZn−1 = Xn−1 Zn−1 ,
(2.4)
Xn−1 = −D−1 A.
(2.5)
where
In the next step, Zn−1 can be expressed from (2.1(n-1)), applying (2.4), and then for the i-th equation the following relation holds: Zi = − (B + CXi ) where
−1
AZi−1 = Xi−1 Zi−1 , i = 1, 2, ..., n − 1, −1
Xi−1 = − (B + CXi )
A, i = n − 1, n − 2, ..., 1.
(2.6)
(2.7)
Hence we obtained the following statement. Theorem 1. The solution of the system of linear algebraic equations (2.5) can be defined by the following algorithm. 1. 2. 3. 4.
We put Z0 = In ; We define Xn−1 , Xn−2 , ..., X0 by the formulas (2.5) and (2.7), respectively; We define Z1 , Z2 , ..., Zn by the formulas (2.4) and (2.6), respectively; By the formula (2.2) we define the values of Φi .
The relation Φi ≥ 0 holds only under the condition Zi ≥ 0. From (2.6) we can see that it is equivalent to the non-negativity of Xi for all i = 0, 1, ..., n − 1. Therefore, based on (2.7), it is hard to get a condition of the non-negativity of the solution, hence it contains the inverse of the sum of two matrices. It is worth emphasizing that D is an M-matrix, therefore, its inverse is positive. Using the one-step iterative method to the discretization of (1.7) the following system of linear algebraic equations is obtained: P1 Φm+1 = P2 Φm , m = 0, 1, . . . ,
(2.8)
where P1 = M + ΔtΘK, P2 = M − Δt(1 − Θ)K. It is clear that for all Φm+1 to be non-negative, the non-negativity of the following matrix is required: P = P1−1 P2
(2.9)
The sufficient conditions of the non-negativity of P are the following: P1−1 ≥ 0 and P2 ≥ 0.
(2.10)
634
T. Szab´ o
For P2 it is easy to give a condition that guarantees its non-negativity. By anaysing the elements of the matrix. The following condition can be obtained: h2 c − 4κ(1 − Θ) ≥ 0, 2Δt
(2.11)
which is equivalent to the condition Δt ≤
h2 c . 8(1 − Θ)κ
(2.12)
For the homogeneous initial condition it is clear that this condtion for the timestep size disappears. It is not possibile to obtain a sufficient condition for the nonnegativity of the matrix P1−1 by the so-called M-Matrix method (similarly for the one-dimensional case [9]). It follows from the fact that P1 contains some positive elements in its offdiagonal. Therefore, a sufficient condition for the inversepositivity of P1 is obtained by Lorenz [5]. Lemma 2. (Lorenz) Let A be an n-by-n matrix, denote Ad and A− the diagonal and the negative offdiagonal part of the matrix A, respectively. Let A− = Az + As = (azij ) + (asij ). If aij ≤
n
s azik a−1 = j, kk akj , ∀aij , i
(2.13)
k=1
then A is a product of two M-matrices, i.e., A is monotone. We could analyse matrix P1 with the theorem if it is decomposed into the diagonal part, the positive offdiagonal part, the upper triangular and lower triangular negative parts. All the conditions of the theorem are satisfied if: 1 ≤ 12
2 1 Δt −κ 2 12 h c , Δt 1 + 4κ 2 2 h c
(2.14)
which implies the lower bound √ h2 c 3 + 14 ≤ Δt 12Θκ
(2.15)
Hence, the following statement is proven. Theorem 3. Let us assume that the conditions (2.12) and (2.15) hold. Then for the problem (1.1) with arbitrary non-negative initial condition the linear finite element method results in a non-negative solution on any time level.
On the Discretization Time-Step in the Finite Element Theta-Method
3
635
Numerical Experiments
For the numerical experiments for the boundary condition at the left hand side of the space domain we choose the value τ = 273. A special type of Gaussian elimination was used for the inversion of the sparse tri-diagonal matrices [8]. The following figures are in three dimensions, in Fig. 2 the first dimension is the estimated time, the second one is the spatial variable y at x = 1, and the third one is temperature at the nodes. First, we apply a relatively long time step, which results in a negative P2 . One can see in Fig. 2 that the numerical method is quite unstable, hence there is an oscillation with decreasing tendency in the results. When we apply a smaller time step than the time step defined by (2.15), then close to the first node there will be small negative peaks. These solutions are unrealistic, since the absolute temperature should be non-negative. For the sake of completeness in Fig. 3 we applied the time-step size from the interval (1.9), and we have got a more stable numerical method. In this figure the first two dimensions are the spatial ones (x, y) and the third is the temperature at the nodes. It is easy to see that, by use of appropriate time steps, the solution becomes much smoother than in Fig. 2.
350
Temperature [K]
300 400
250
300
200
200
150
100
100
0
10
50
20 30
25
20
15
10
5
30
y
Time
Fig. 2. The solution when a small time step is applied
250
200
Temperature [K]
400
300
150
200 100 100
0
10
50
20 30
25
20
15
10
5
30
y
x
Fig. 3. The solution obtained by applying time step from the interval (1.9)
636
4
T. Szab´ o
Conclusions and Further Works
In this article a sufficient condition was given for the time-step size of the finite element theta-method to preserve the physical characteristics of the solution. For the homogeneous initial condition we have shown that only a lower bound exists for the time-step size of the finite element theta-method, in order to preserve the non-negativity at the first time level. Since we were interested in the nonnegativity preservation property on the whole discretized time we showed the existence of bounds from both directions, i.e., there are upper and lower bounds for the time-step, as well. Detailed analysis of other types of shape functions and rectangular domains will be done in the future. This work is important for a better understanding of the chemical and physical phenomena of the fuel cells by way of the mathematical modeling.
References 1. Berman, A., Plemmons, R.J.: Nonnegative matrices in the mathematical sciences. Computer Science and Applied Mathematics. Academic Press (Harcourt Brace Jovanovich, Publishers), New York (1979) 2. Crank, J., Nicolson, P.: A practical method for numerical evaluation of solutions of partial differential equations of the heat conduction type. Proceedings of the Cambridge Philosophical Society 43, 50–64 (1947) 3. Farago, I.: Non-negativity of the difference schemes. Pour Math. Appl. 6, 147–159 (1996) 4. Farkas, H., Farag´ o, I., Simon, P.: Qualitative properties of conductive heat transfer. In: Sienuitycz, S., De Vos, A. (eds.) Thermodynamics of Energy Conversion and Transport, pp. 199–239. Springer, Heidelberg (2000) 5. Lorenz, J.: Zur Inversmonotonie diskreter Probleme. Numer. Math. 27, 227–238 (1977) (in German) 6. Marchuk, G.: Numerical methods. Mˇ uszaki K¨ onyvkiad´ o, Budapest (1976) (in Hungarian) 7. Murti, V., Valliappan, S., Khalili-Naghadeh, N.: Time step constraints in finite element analysis of the Poisson type equation. Comput. Struct. 31, 269–273 (1989) 8. Samarskiy, A.A.: Theory of difference schemes, Moscow, Nauka (1977) (in Russian) 9. Szab´ o, T.: On the Discretization Time-Step in the Finite Element Theta-Method of the Discrete Heat Equation. In: Margenov, S., Vulkov, L., Wa`sniewski, J. (eds.) Numerical Analysis and Its Applications. LNCS, vol. 5434, pp. 564–571. Springer, Berlin (2009) 10. Thomas, H.R., Zhou, Z.: An analysis of factors that govern the minimum time step size to be used in finite element analysis of diffusion problem. Commun. Numer. Meth. Engng. 14, 809–819 (1998)
A Locally Conservative Mimetic Least-Squares Finite Element Method for the Stokes Equations Pavel Bochev1 and Max Gunzburger2 1
Applied Mathematics and Applications, P.O. Box 5800, MS 1320, Sandia National Laboratories , Albuquerque, NM 87185-1320, USA
[email protected] 2 Department of Scientific Computing, Florida State University, Tallahassee FL 32306-4120, USA
[email protected]
Abstract. Least-squares finite element methods (LSFEMs) for the incompressible Stokes equations that use nodal C 0 elements conserve mass approximately. This has led to the common and widespread misconception that LSFEMs are unable to provide the local conservation of their mixed cousins. We develop a mimetic LSFEM for the Stokes equations that yields divergence-free velocity fields and satisfies a discrete momentum equation. The LSFEM uses the velocity-vorticity-pressure Stokes equations, div-conforming elements for the velocity, curl-conforming elements for the vorticity and discontinuous elements for the pressure.
1
Introduction
In this paper we develop a mimetic least-squares finite element method (LSFEM) for the first-order velocity-vorticity-pressure (VVP) form ⎧ ⎪ in Ω ⎨ ∇ × ω + ∇p = f ∇×u−ω = 0 in Ω (1) ⎪ ⎩ ∇·u= 0 in Ω . of the incompressible Stokes equations, augmented with the normal velocity– tangential vorticity boundary condition u·n=0
and ω × n = 0
on Γ ,
and the zero mean pressure constraint p dΩ = 0 .
(2)
(3)
Ω
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy under contract DEAC04-94-AL85000.
I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 637–644, 2010. c Springer-Verlag Berlin Heidelberg 2010
638
P. Bochev and M. Gunzburger
In (1)–(3), u, ω, p, ν, and f are the (non-dimensional) velocity, vorticity, pressure, inverse Reynolds number, and body force per unit mass, respectively. In what follows we restrict attention to simply connected, contractible bounded regions Ω ⊂ R3 with Lipschitz-continuous boundary Γ = ∂Ω. The paper is organized as follows. Section 2 introduces notation and reviews key technical results needed in the sequel. The new method is presented in Section 3 and its conservation properties are established in Section 4.
2
Notation
As usual, L2 (Ω) is the Hilbert space of all square integrable functions with inner product and norm (·, ·) and · 0 , respectively, L20 (Ω) is the subspace of L2 (Ω) consisting of functions with zero mean, bold face denotes vector spaces, and1 H01 (Ω) = {u ∈ L2 (Ω) | ∇u ∈ L2 (Ω);
u = 0 on Γ }
H0 (Ω, curl) = {u ∈ L2 (Ω) | ∇ × u ∈ L2 (Ω); H0 (Ω, div) = {u ∈ L (Ω) | ∇ · u ∈ 2
L20 (Ω);
u×n=0 u·n=0
(4) on Γ }
on Γ } .
(5) (6)
The spaces (4)–(6), equipped with the graph norms u2G = u20 + ∇u20
u2C = u20 + ∇ × u20
and u2D = u20 + ∇ · u20 ,
(7)
are Hilbert spaces, as is H(Ω, curl) ∩ H0 (Ω, div) endowed with norm u2CD = u20 + ∇ × u20 + ∇ · u20 .
(8)
We recall that under the assumptions on Ω the DeRham complex ∇
∇×
∇·
R → H01 (Ω) −→ H0 (Ω, curl) −→ H0 (Ω, div) −→ L20 (Ω) −→ 0
(9)
is exact; see [4,1]. We also recall the Poincar´e–Friedrichs inequalities: u0 ≤ CP ∇u0
∀u ∈ H 1 (Ω) ∩ L20 (Ω)
u0 ≤ CP ∇ × u0 + ∇ · u0 ∀u ∈ H(Ω, curl) ∩ H0 (Ω, div) .
(10)
(11)
Mimetic LSFEMs for the VVP Stokes system (1)–(3) are formulated using finite element approximations of the De Rham complex (9). Such approximations comprise of conforming finite element subspaces Gh0 (Ω) ⊂ H01 (Ω), Ch0 (Ω) ⊂ H0 (Ω, curl), Dh0 (Ω) ⊂ H0 (Ω, div) and S0h (Ω) ⊂ L20 (Ω) which form an exact sequence, and bounded projection operators ΠG : H01 (Ω)
→ Gh0 (Ω), ΠC : 1
We write H 1 (Ω), H(Ω, curl) and H(Ω, div) when (4)–(5) are not constrained by boundary conditions.
A Locally Conservative Mimetic Least-Squares Finite Element Method
639
H0 (Ω, curl)
→ Ch0 (Ω), ΠD : H0 (Ω, div)
→ Dh0 (Ω), and ΠS : L20 (Ω)
→ S0h (Ω) such that the diagram ∇
∇×
∇·
H01 (Ω) −→ H0 (Ω, curl) −→ H0 (Ω, div) −→ L20 (Ω) ΠG ↓
ΠC ↓ ∇
Gh0 (Ω) −→
Ch0 (Ω)
ΠD ↓ ∇×
−→
Dh0 (Ω)
ΠS ↓
(12)
∇·
−→ S0h (Ω)
commutes. The standard L2 (Ω) projections onto compatible finite element spaces are denoted by πG , πC , πD and πS . In what follows we restrict attention to affine partitions Th of Ω into simplicial elements κ because construction of finite element spaces and projection operators such that (12) holds is best understood in that setting; see [1,3]. For brevity LSFEMs are formulated using the lowest-order finite element DeRham complex for which Gh0 (Ω) is the piecewise linear C 0 nodal space, Ch0 (Ω) is the lowestorder N´edelec edge element space of the first-kind [6], Dh0 (Ω) is the lowest-order Raviart-Thomas [7] face element space and S0h (Ω) is the piecewise constant space.
3
Mimetic LSFEM for the Stokes Equations
The key to well-posed LSFEMs for the VVP Stokes boundary value problem (1)–(2) and (3) is the following stability result for that system. Theorem 1. Assume that Ω is a simply connected bounded domain in R3 with a Lipschitz boundary. Then, there exists a positive constant C such that C uCD + ωC + pG ≤ ∇ × ω + ∇p0 + ∇ × u − ω0 + ∇ · u0 (13) for all {u, ω, p} ∈ H0 (Ω, div) ∩ H(Ω, curl) × H0 (Ω, curl) × H 1 (Ω) ∩ L20 (Ω). Proof. Owing to the boundary condition ω × n = 0 on ∂Ω, ∇ × ω + ∇p20 = ∇ × ω20 + ∇p20
(14)
∇ × u − ω20 = ∇ × u20 + ω20 − (∇ × u, ω) − (u, ∇ × ω) .
(15)
and Using this identity and the Cauchy inequality, it follows that ∇ × u − ω20 + ∇ · u20 ≥ ∇ × u20 + ∇ · u20 + ω20 − ∇ × u0 ω0 − u0 ∇ × ω0 . Because u ∈ H0 (Ω, div) ∩ H(Ω, curl), the Poincar´e-Friedrichs inequality (11) implies that 1 ∇ × u20 + ∇ · u20 ≥ 2 u20 . (16) CP
640
P. Bochev and M. Gunzburger
In conjunction with the previous bound, we see that 1 ∇ × u20 + ∇ · u20 2
∇ × u − ω20 + ∇ · u20 ≥ +
1 u20 + ω20 − ∇ × u0 ω0 − u0 ∇ × ω0 . 2CP2
Using the ε-inequality for the last two terms gives β∇ × ω + ∇p20 + ∇ × u − ω20 + ∇ · u20 1 1 ≥ β ∇ × ω20 + ∇p20 + u20 + ω20 ∇ × u20 + ∇ · u20 + 2 2CP2 ε1 1 ε2 1 − ∇ × u20 − ω20 − u20 − ∇ × ω20 2 2ε1 2 2ε2
1 1 2 ≥ β− ∇ × ω0 + 1 − ω20 2ε2 2ε1
1 1 1 1 2 + (1 − ε1 ) ∇ × u0 + − ε2 u20 + β∇p20 + ∇ · u20 , 2 2 CP2 2 where β = 2CP2 . Setting ε1 = 2/3, ε2 = 1/(2CP2 ), and using the Poincar´eFriedrichs inequality (10) yields 2CP2 ∇ × ω + ∇p20 + ∇ × u − ω20 + ∇ · u20 1 1 1 ∇ × u20 + ∇ · u20 + u20 6 2 4CP2 1 2CP2 p2G +CP2 ∇ × ω20 + ω20 + 4 (1 + CP2 ) ≥ min Cu , Cω , 2CP2 (1 + CP2 )−1 u2CD + ω2C + p2G , ≥
with Cω = min
CP2 ,
1 4
and
Cu = min
1 1 1 , , 4CP2 6 2
.
The theorem follows by noting that ∇ × ω + ∇p20 + ∇ × u − ω20 + ∇ · u20 1 ≥ 2CP2 ∇ × ω + ∇p20 + ∇ × u − ω20 + ∇ · u20 . 2 max{1, 2CP } 2 Stability bound (13) asserts that J(u, ω, p; f ) = ∇ × ω + ∇p − f 20 + ∇ × u − ω20 + ∇·u20
(17)
is a norm–equivalent least-squares functional on X = H0 (Ω, div) ∩ H(Ω, curl) × H0 (Ω, curl) × H 1 (Ω) ∩ L20 (Ω) ,
(18)
A Locally Conservative Mimetic Least-Squares Finite Element Method
641
and so, (17) has a unique minimizer out of X; see [2]. Unfortunately, conforming discretization of (17) requires finite element subspaces of H(Ω, curl)∩H0 (Ω, div) which makes this formulation ill-suited2 for problems where the latter is not equivalent to [H01 (Ω)]3 . Instead, we propose to use a non-conforming discretization of (17) in which X is approximated by the non-conforming finite element space X h = Dh0 × Ch0 × S0h .
(19)
In other words, we approximate the first component of X by a div-conforming space. Note that this choice allows us to impose the boundary condition (2) on the approximating space. The third component of X is also approximated in a non-conforming fashion. This enables additional conservation properties in the corresponding least-squares formulation. Because the first and the last component spaces of X h are non-conforming, the original least-squares functional (17) has to be modified to a functional that is well-defined for X h . To this end we replace the curl operator acting on the velocity field by a discrete weak curl defined by ∗ (20) ∇h × vh , uh = vh , ∇ × uh ∀ uh ∈ Ch0 (Ω) , and the gradient operator acting on the pressure by a discrete weak gradient ∇∗h : S0h (Ω)
→ Dh0 (Ω) defined by ∗ h h h ∀ vh ∈ Dh0 (Ω) . (21) ∇h u , v = u , −∇ · vh The weak curl operator defined in (20) provides discrete version of the Poincar´e– Friedrichs inequality (11) for div-conforming elements: ∀ uh ∈ Dh0 (Ω) ; uh 0 ≤ CP ∇∗h × uh 0 + ∇ · uh 0 (22) see [2] for further details and properties of discrete operators. Thus, we are led to the following non-conforming least-squares functional J h (uh , ωh , ph ; f ) = ∇ × ωh + ∇∗h ph − f 20 + ∇∗h × uh − ωh 20 + ∇ · uh20 (23) The associated non-conforming discrete least-squares principle is given by min
{uh ,ω h ,ph }∈X h
J h (uh , ωh , ph ; f ) .
(24)
The following theorem asserts that (24) is well-posed. 2
A finite element subspace CDh of H(Ω, curl) ∩ H0 (Ω, div) must contain piecewise smooth vector fields that are both tangentialy and normally continuous, i.e., fields that are necessarily of class C 0 . It follows that CDh ⊂ [H01 (Ω)]3 . Unless Ω has smooth boundary or is a convex polyhedron, the latter is not equivalent to H(Ω, curl) ∩ H0 (Ω, div) and CDh does not have approximability property with respect to H(Ω, curl) ∩ H0 (Ω, div), see [5, Corollary 3.20, p. 97].
642
P. Bochev and M. Gunzburger
Theorem 2. Define the div-conforming version of (8) by uh 2C ∗ D = uh 20 + ∇∗h × uh 20 + ∇ · uh 20
∀uh ∈ Dh0 .
(25)
There exists a positive constant C, independent of h, such that C uh C ∗ D + ωh C + ∇∗h ph 0 ≤ ∇ × ω h + ∇∗h ph 0 + ∇∗h × uh − ω h 0 + ∇ · uh 0
(26)
for all {uh , ωh , ph } ∈ X h ≡ Dh0 × Ch0 × S0h . Proof. The key junctures in the proof of Theorem 1 are (14), (15) and (16). We show that, thanks to the use of a finite element DeRham complex, discrete versions of these key junctures continue to hold in the discrete setting. From definition (21) of ∇∗h we see that h ∗ h w , ∇h q = − ∇ · wh , q h ∀ q h ∈ S0h
and ∀ wh ∈ Dh0 ,
and because the range of ∇× is in Dh0 it follows that ∇ × ωh , ∇∗h ph = ∇ · ∇ × ω h , ph = 0 . This provides a discrete version ∇ × ω h + ∇∗h ph 20 = ∇ × ω h 20 + ∇∗h ph 20 of the first key juncture (14). To obtain a discrete version of (15) we use (20) which implies that ∗ ∇h × vh , wh = vh , ∇ × wh
∀ vh ∈ Dh0
and ∀ wh ∈ Ch0 .
From this identity it easily follows that ∇∗h × uh − ωh 20 = ∇∗h × uh 20 + ωh 20 − ∇∗h × uh , ωh − uh , ∇ × ω h . (27) Finally, (22) provides a discrete analogue of (16): ∇∗h × uh 20 + ∇ · uh 20 ≥
1 uh 20 . CP2
(28)
The proof of (26) now follows by repeating the remaining steps from the proof 2 of Theorem 1, and taking into consideration the definition of · C ∗ D . Theorem 2 implies that (24) is a well-posed minimization problem, i.e., that the least-squares functional (23) has a unique minimizer out of X h .
A Locally Conservative Mimetic Least-Squares Finite Element Method
4
643
Conservative Properties
In this section we establish the conservative properties of the least-squares principle defined in (24). Specifically, we show that the least-squares solution satisfies the following discrete version of the VVP Stokes system ⎧ ∇ × ωh + ∇∗h ph = πD f ⎪ ⎪ ⎨ ∇∗h × uh − ω h = 0 (29) ⎪ ⎪ ⎩ ∇ · uh = 0 . The bound (26) in Theorem 2 implies that (29) has a unique solution in X h . Theorem 3. Solution of the LSFEM (24) coincides with the solution of (29). Proof. Let {uh , ωh , ph } ∈ X h = Dh0 × Ch0 × S0h denote the solution of (29). We show that {uh , ω h , ph } satisfies the first-order optimality condition of (23). To this end, write the corresponding variational problem as ∗ ⎧ h h ∗ h h h × u − ω , ∇ × v , ∇ · v ∇ + ∇ · u =0 ∀ vh ∈ Dh0 ⎪ h h ⎨ ∇ × ω h + ∇∗h ph , ∇ × ξ h − ∇∗h × uh − ωh , ξ h = f , ∇ × ξh ∀ ξh ∈ Ch0 ⎪ ⎩ ∇ × ωh + ∇∗h ph , ∇∗h q h = f , ∇∗h q h ∀ q h ∈ S0h . The second and third equations in (29) imply that ⎧ ∗ h h h ⎪ ⎨ ∇h × u − ω , ξ = 0 ∇∗h × uh − ωh , ∇∗h × vh = 0 ⎪ ⎩ ∇ · uh , ∇ · vh = 0 for all ξ h ∈ Ch0 and all vh ∈ Dh0 . Thus, it remains to show that ⎧ ⎨ ∇ × ω h + ∇∗h ph , ∇ × ξ h = f , ∇ × ξh ∀ ξh ∈ Ch0 ⎩ ∇ × ωh + ∇∗ ph , ∇∗ q h = f , ∇∗ q h ∀ q h ∈ S0h h h h
(30)
holds for the solution of (29). From the exact sequence property it follows that ∇ × ξh ∈ Dh0 for any ξ h ∈ Ch0 . This, in conjunction with the first equation in (29) and definition of the L2 (Ω)-projection πD , implies that f , ∇ × ξ h = πD f , ∇ × ξ h = ∇ × ω h + ∇∗h ph , ∇ × ξh , i.e., the first equation in (30). Likewise, ∇∗h q h ∈ Dh0 for any q h ∈ S0h and so, f , ∇∗h q h = πD f , ∇∗h q h = ∇ × ω h + ∇∗h ph , ∇∗h q h , i.e., the second equation in (30) also holds.
2
644
P. Bochev and M. Gunzburger
This theorem establishes that LSFEMs based on (23) result in velocity approximations that are divergence-free on every element. Moreover, we see that the least-squares solution satisfies discrete versions of the momentum equation and the second equation in (1). These properties cannot be achieved by using standard nodal finite element spaces and are the main reason to consider compatible finite elements in least-squares methods. Because (24) is a non-conforming LSFEM, its error analysis cannot be accomplished by standard elliptic arguments. This subject, as well as implementation and computation with the mimetic LSFEM will be considered in forthcoming papers.
References 1. Arnold, D.N., Falk, R.S., Winther, R.: Finite element exterior calculus, homological techniques, and applications. Technical Report 2094, Institute for Mathematics and Its Applications (2006) 2. Bochev, P., Gunzburger, M.: Least-Squares finite element methods. Applied Mathematical Sciences, vol. 166. Springer, Heidelberg (2009) 3. Christiansen, S.H., Winther, R.: Smoothed projections in finite element exterior calculus. Math. Comp. 77, 813–829 (2007) 4. Dezin, A.: Multidimensional Analysis and Discrete Models. CRC Presss, Boca Raton (1995) 5. Ern, A., Guermond, J.L.: Theory and Practice of Finite Elements. Applied Mathematical Sciences, vol. 159. Springer, New York (2004) 6. N´ed´elec, J.C.: Mixed finite elements in R3 . Numerische Mathematik 35, 315–341 (1980) 7. Raviart, P.A., Thomas, J.M.: A mixed finite element method for second order elliptic problems. In: Galligani, I., Magenes, E. (eds.) Mathematical Aspects of the Finite Element Method. LNM, vol. 606. Springer, New York (1977)
Additive Operator Decomposition and Optimization–Based Reconnection with Applications Pavel Bochev1 and Denis Ridzal2 1 Applied Mathematics and Applications Optimization and Uncertainty Quantification, Sandia National Laboratories , Albuquerque, NM 87185-1320, USA
[email protected],
[email protected] 2
Abstract. We develop an optimization-based approach for additive decomposition and reconnection of algebraic problems arising from discretizations of partial differential equations (PDEs). Application to a scalar convection–diffusion PDE illustrates the new approach. In particular, we derive a robust iterative solver for convection–dominated problems using standard multilevel solvers for the Poisson equation.
1
Introduction
Decomposition of PDEs into component problems that are easier to solve is at the heart of many numerical procedures: operator split [15] and domain decomposition [17] are two classical examples. The use of optimization and control ideas to this end is another possibility that has yet to receive proper attention despite its many promises and excellent theoretical foundations. For previous work in this direction we refer to [11,4,8] and the references cited therein. In this paper we develop an optimization–based approach for additive decomposition and reconnection of algebraic equations that is appropriate for problems associated with discretized PDEs. Our approach differs from the ideas in [11,4,8] in several important ways. The main focus of [4,8] is on formulation of non–overlapping domain–decomposition via optimization, whereas our approach targets the complementary single domain/multiple physics setting. Our ideas are closer to the decomposition framework in [11]. Nevertheless, there are crucial differences in the definition of the objectives, controls, and the constraints. The approach in [11] retains the original equation, albeit written in a form that includes the controls, and the constraints impose equality of states and controls. In other words, reconnection in [11] is effected through the constraints rather than the objective.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy under contract DEAC04-94-AL85000.
I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 645–652, 2010. c Springer-Verlag Berlin Heidelberg 2010
646
P. Bochev and D. Ridzal
In contrast, we replace the original problem by component problems that, by themselves, are not equivalent to the original problem. These problems define the constraints, while reconnection is effected via the objective functional. Consequently, in our approach the states are different from the controls and the objective functional is critical for “closing” the formulation.
2
Additive Operator Decomposition
For clarity we present the approach in an algebraic setting, i.e., we consider the solution of the linear system Ax = b , (1) where A ∈ IRn×n is a nonsingular matrix and x and b are vectors in IRn . We present a method suitable for the scenario in which the matrix A comprises multiple operators with fundamentally different mathematical properties, e.g. resulting from an all–at–once discretization of a multiphysics problem, in which case the linear system (1) requires nonstandard, highly specialized solution techniques. For a concrete example and discussion, see Section 4. The proposed optimization–based approach for the solution of (1) rests on the assumption that the matrix A can be written as the sum of two component matrices (2) A = A1 + A2 for which robust solvers are readily available. We note that A1 and A2 can represent the original operator components of a multi–operator equation, however, other, nontrivial decompositions are oftentimes more useful, see Section 4. We assume that A1 and A2 are nonsingular. To motivate the optimization formulation, we consider an equivalent formulation of (1) in terms the component matrices, A1 x − u − b + A2 x + u = 0 , where u ∈ IRn is an arbitrary coupling vector. As the overall intent is to make use of robust solvers available for the solution of linear systems involving A1 and A2 , we aim to develop a procedure that would allow us to repeatedly solve linear systems of the type A1 x = u + b
and
A2 x = −u ,
instead of the original problem. Our approach, based on ideas from optimization and control, is presented next.
3
Reconnection via an Optimization Formulation
We propose to replace (1) by the following constrained optimization problem: ⎧ 1 ε 2 2 ⎪ minimize J x (x , x , u) = − x + εu ⎪ 1 2 1 2 2 2 ⎪ 2 ⎨ (3) ⎪ A1 x1 − u = b ⎪ ⎪ subject to ⎩ A2 x2 + u = 0,
Additive Operator Decomposition and Optimization–Based Reconnection
647
where ε > 0 is a regularization parameter and · 2 denotes the Euclidean 2–norm. In the language of optimization and control, x1 and x2 are the state variables and u is the distributed control variable. We first show that (3) is well–posed, i.e., that it has a unique solution {xε1 , xε2 , uε }. Then we examine the connection between {xε1 , xε2 } and the solution x of the original equation (1). 3.1
Existence of Optimal Solutions
The first–order necessary conditions for {xε1 , xε2 , uε } to be a solution of (3) state that there is a Lagrange multiplier vector pair {λε1 , λε2 }, λε1,2 ∈ IRn , such that the KKT system of equations is satisfied, ⎛
⎞⎛ ε ⎞ ⎛ x1 I −I 0 AT1 0 T ⎟ ⎜ xε ⎟ ⎜ −I I 0 0 A2 ⎟ ⎜ 2 ⎟ ⎜ ⎜ ⎜ ⎜ ⎟⎜ ⎟ ⎜ 0 εI −I I ⎟ ⎜ uε ⎟ = ⎜ ⎜ 0 ⎜ ⎟⎜ ⎟ ⎜ ⎝ A1 0 −I 0 0 ⎠ ⎝ λε1 ⎠ ⎝ λε2 0 A2 I 0 0
0 0 0 b 0
⎞ ⎟ ⎟ ⎟ ⎟, ⎟ ⎠
(4)
where I is the n×n identity matrix. In the following, we let H denote the 3n×3n (full) Hessian matrix and Z the 3n × n null–space matrix, respectively, ⎞ ⎛ ⎞ ⎛ I −I 0 A−1 1 ⎟ ⎜ ⎟ ⎜ I 0⎠, H = ⎝ −I Z = ⎝ −A−1 ⎠, 2 0 0 εI I with the property
A1 0 −I 0 A2 I
Z=
0 0
.
= Z T HZ, known as the reduced Hessian, is symLemma 1. The matrix H metric positive definite. = (A−1 +A−1 )T (A−1 +A−1 )+εI. Proof. A straightforward calculation yields H 1 2 1 2 −1 −1 T −1 −1 The matrix (A1 + A2 ) (A1 + A2 ) is (at least) symmetric positive semidefinite, while εI is symmetric positive definite. The claim follows. Corollary 1. Invertibility of A1 and A2 and Lemma 1 directly imply that the KKT matrix defined in (4) is nonsingular, and that there is a unique pair {{xε1 , xε2 , uε }, {λε1 , λε2 }} satisfying (4). Furthermore, the vector triple {xε1 , xε2 , uε } is the unique global solution of the optimization problem (3). For a proof, see [12, p.444–447]. −1 T −1 −1 Remark 1. As we will show below, the matrix (A−1 1 +A2 ) (A1 +A2 ) is in fact 2 symmetric positive definite. Hence the regularization term εu2 in (3), which is typically needed to guarantee existence and uniqueness of optimal solutions, is seemingly superfluous. However, one should not forget that we think of (1) as
648
P. Bochev and D. Ridzal
resulting from the discretization of a PDE, i.e., that we are dealing with a family of linear systems parametrized by some measure h of the mesh size, instead of with a single linear system. In this case the regularization term is needed to guarantee the uniform in h invertibility of A; see [2]. 3.2
Reformulation Error
In general, as ε > 0, the state solutions xε1 and xε2 of (3) will differ from the solution x of the original problem (1). In this section we calculate the error induced by ε. −1 Lemma 2. The matrix (A−1 1 + A2 ) is nonsingular, with the inverse given by −1 T −1 −1 −1 A1 − A1 A A1 . (It follows trivially that the matrix (A−1 1 + A2 ) (A1 + A2 ) is symmetric positive definite.)
Proof. By inspection, −1 −1 −1 −1 A1 ) = I + A−1 A1 − A−1 A1 (A−1 1 + A2 )(A1 − A1 A 2 A1 − A 2 A1 A −1 −1 −1 −1 A1 = I + A2 − (A1 + A2 ) − A2 A1 (A1 + A2 ) −1 −1 −1 = I + A2 (A1 + A2 ) − I − A2 A1 (A1 + A2 ) A1 = I.
Lemma 3. The following statements hold: −1 (I − A1 A−1 )b , xε1 − x = ε A−1 1 H and
−1 A2 A−1 b . xε2 − x = −ε A−1 2 H
(5) (6)
Proof. We present a proof for (6), statement (5) can be verified analogously. The KKT system (4) yields expressions for the states, ε xε1 = A−1 1 (u + b),
ε xε2 = −A−1 2 u ,
(7)
the Lagrange multipliers, ε ε λε1 = −A−T 1 (x1 − x2 ),
ε ε λε2 = A−T 2 (x1 − x2 ),
and the controls, uε = (1/ε)(λε1 − λε2 ) . By substitution, we obtain the so–called reduced system for the controls, ε −1 −1 −1 −1 −1 T −1 T (A1 + A−1 2 ) (A1 + A2 ) + εI u = −(A1 + A2 ) A1 b ,
(8)
implies which using the notation for the reduced Hessian H −1 (A−1 + A−1 )T A−1 − A−1 b xε2 − x = A−1 2 H 1 2 1 −1 −1 −1 −1 T −1 (A1 + A2 ) A1 = A2 H −1 −1 −1 T −1 b − (A1 + A−1 2 ) (A1 + A2 ) + εI A2 A −1 (A−1 + A−1 )T A−1 − (A−1 + A−1 )A2 A−1 − εA2 A−1 b = A−1 2 H 1 2 1 1 2 −1 (A−1 + A−1 )T A−1 (A1 + A2 ) − (A−1 + A−1 )A2 A−1 = A−1 2 H 1 2 1 1 2 −1 A2 A−1 b . −εA2 A−1 b = −ε A−1 H 2
Additive Operator Decomposition and Optimization–Based Reconnection
649
−1 T −1 −1 Lemma 4. Let M = (A−1 1 +A2 ) (A1 +A2 ) and let Λmin denote the smallest eigenvalue of M. Let ε < (Λmin /2). Then,
−1 2 < H
2 Λmin
.
= M I − (−εM −1 ) . Due to ε < (Λmin /2) and e.g. [7, Proof. We have H Lemma 2.3.3], the matrix I − (−εM −1 ) is nonsingular, hence we can write −1 = I − (−εM −1 ) −1 M −1 , which implies H 1
−1 I − (−εM −1 ) ≤
1 Λmin Λmin (1 − εM −1 2 ) 2 1 2 1 = < . = Λmin (1 − (ε/Λmin )) Λmin − ε Λmin
−1 2 ≤ H
Theorem 1. Let the assumptions of Lemma 4 be satisfied. There exists a constant C, independent of ε, such that xε1 − x2 + xε2 − x2 ≤ εCb2 . Proof. Lemmas 3 and 4 yield the claim directly, with C=
2 Λmin
−1 A (I − A1 A−1 ) + A−1 A2 A−1 . 1 2 2 2 2 2 2
Remark 2. For parametrized linear systems (1) corresponding to discretized PDEs the constant C may depend on the mesh size h. 3.3
A Solution Algorithm
We now exploit the structure of the reformulated problem to develop robust solution methods for (1). Our approach is based on the premise that robust solution methods for linear systems involving A1 and A2 are readily available. We focus on a solution method known as the null–space or reduced–space approach, which effectively decouples the component equations in (4). In this approach one typically solves the reduced system (8) iteratively, using a Krylov subspace method. The main computational cost of such an approach is in the repeated application of the reduced Hessian operator, specified in Algorithm 1. A similar procedure can be derived for the (one–time) computation of the right–hand side in (8). Optimal states can be recovered by solving either of the equations given in (7).
4
Numerical Results
In this section we apply the optimization–based reformulation to an SUPG– stabilized [10] discretization of the scalar convection–diffusion elliptic problem −νΔu + c · ∇u = f in Ω,
u = uD on ΓD ,
∇u · n = 0 on ΓN .
(9)
650
P. Bochev and D. Ridzal
to vector u Algorithm 1. Application of reduced Hessian H
1 2 3 4
input : vector u u output: vector H Solve: A1 y1 = u, A2 y2 = u Compute: y3 = y1 + y2 Solve: AT1 y4 = y3 , AT2 y5 = y3 u = y4 + y5 + εu Compute: H
(state equations) (adjoint equations)
We assume that Ω is a bounded open domain in Rd , d = 2, 3, with the Lipschitz continuous boundary ∂Ω = ΓD ∪ ΓN , c is a given velocity field with ∇ · c = 0, uD is a given Dirichlet boundary function, n is the outward unit normal, and ν > 0 is a constant diffusion coefficient. We focus on settings where ν is small compared to |c|, i.e., when (9) is convection–dominated. The linear system resulting from (9) is typically of the form (νD + C) x = b ,
(10)
where D is a (pure) diffusion matrix (discretization of the Laplace operator), C corresponds to the convection contribution (including any terms due to the SUPG stabilization), and b stands for a discretization of the source term f (plus stabilization terms). In order to apply our optimization–based approach to (10), we make the identification A1 = D + C,
and A2 = (ν − 1)D.
(11)
The key property of this decomposition is that, unlike the original operator (νD + C), the matrices A1 and A2 are diffusion–dominated. The scalable iterative solution of linear systems arising from PDEs of the type (9) is a very active area of research. Algebraic multigrid methods (see [16] and references therein) work well for diffusion–dominated problems but their performance degrades in the convection–dominated case. The efforts to extend multigrid methods to such problems [1,3,13,14] have led to improvements, albeit at the expense of increased algorithm complexity. As we show below, widely used “off–the–shelf” multigrid solvers, such as BoomerAMG (hypre library) [9] or ML (Trilinos project) [6], can lack robustness when applied to problems with complex convection fields, particularly in the case of very large P´eclet numbers (defined here as 1/ν). We consider the so–called double–glazing problem, see [5, p.119]. The domain is given by Ω = [−1, 1]2 , subject to a uniform triangular partition, generated by dividing each square of a uniform square partition of Ω into two triangles, diagonally from the bottom left to the top right corner. The convection field is given by c = 2y(1 − x2 ), −2x(1 − y 2 ) , and the diffusion constant is set to ν = 10−8 . Boundary conditions are specified by
ΓN = ∅,
uD = 0 on {[−1, 1) × {−1}} ∪ {{−1} × (−1, 1)} ∪ {[−1, 1) × {1}} , uD = 1 on {{1} × [−1, 1]} .
Additive Operator Decomposition and Optimization–Based Reconnection
651
We compare the solution of linear systems arising from the full convection– diffusion problem (10), denoted the full problem, to the solution of the optimization reformulation (3) with A1 and A2 defined in (11). For the solution of (10) we use the multigrid solvers BoomerAMG and ML as preconditioners for the GMRES method with a maximum Krylov subspace size of 200, denoted by GMRES(200). Solver abbreviations, options, and failure codes used hereafter are summarized below. We note that the stated MLILU and BAMG parameters reflect the best solver settings that we could find for the example problem. MLILU BAMG
ML: incomplete LU smoother (IFPACK, ILU, threshold=1.05), W cycle BoomerAMG: Falgout–CLJP coarsening (6), symmetric Gauss–Seidel / Jacobi hybrid relaxation (6), V cycle (1)
—MX
exceeded maximum number of GMRES iterations (2000)
The optimization reformulation is solved via the reduced–space approach, denoted here by OPT. The outer optimization loop amounts to solving the linear system (8) using unpreconditioned GMRES(200). Within every optimization iteration, we solve four linear systems, see Algorithm 1. The linear systems involving A1 and A2 are solved using GMRES(10), preconditioned with ML with a simple smoother, to a relative stopping tolerance of 10−10 . We remark that in either case only 5–8 iterations are required for the solution of each system. The regularization parameter ε is set to 10−12 , see [2]. Table 1 presents a comparison of the number of outer GMRES(200) iterations for the optimization approach and the total number of GMRES(200) iterations for multigrid solvers applied to the full problem. Relative stopping tolerances are set to 10−6 . For our test problem, MLILU and BAMG show very strong mesh dependence, and eventually fail to converge on the 256 × 256 mesh. OPT, on the other hand, is robust to mesh refinement, and successfully solves the problem for all mesh sizes. In addition, Table 1 clearly demonstrates that while MLILU and BAMG are very sensitive to the size of the P´eclet number, OPT’s performance is affected only mildly, and in fact does not change when the diffusion constant is lowered from ν = 10−4 to ν = 10−8 . Overall, our optimization–based strategy provides a promising alternative for the robust solution of problems on which widely used multigrid solvers struggle. It is best used, and easily implemented, as a robustness–improving wrapper around standard multigrid solvers. To better Table 1. Number of outer GMRES(200) iterations for the optimization approach; total number of GMRES(200) iterations for multigrid applied to the full problem ν = 10−8 64 × 64 128 × 128 256 × 256 OPT ML BAMG
114 71 72
114 196 457
113 —MX —MX
128 × 128 ν = 10−2 ν = 10−4 ν = 10−8 84 9 7
114 96 33
114 196 457
652
P. Bochev and D. Ridzal
assess the general effectiveness of the approach, additional numerical experiments on a wider range of target problems are clearly needed and will be the subject of future studies.
References 1. Bank, R.E., Wan, J.W.L., Qu, Z.: Kernel preserving multigrid methods for convection-diffusion equations. SIAM J. Matrix Anal. Appl. 27(4), 1150–1171 (2006) 2. Bochev, P., Ridzal, D.: An optimization-based approach for the design of robust solution algorithms. SIAM J. Numer. Anal. (submitted) 3. Brezina, M., Falgout, R., MacLachlan, S., Manteuffel, T., McCormick, S., Ruge, J.: Adaptive smoothed aggregation (αsa) multigrid. SIAM Review 47(2), 317–346 (2005) 4. Du, Q., Gunzburger, M.: A gradient method approach to optimization-based multidisciplinary simulations and nonoverlapping domain decomposition algorithms. SIAM J. Numer. Anal. 37, 1513–1541 (2000) 5. Elman, H., Silvester, D., Wathen, A.: Finite Elements and Fast Iterative Solvers. Oxford University Press, Oxford (2005) 6. Gee, M.W., Siefert, C.M., Hu, J.J., Tuminaro, R.S., Sala, M.G.: ML 5.0 Smoothed Aggregation User’s Guide. Technical Report SAND2006-2649, Sandia National Laboratories, Albuquerque, NM (2006) 7. Golub, G.H., van Loan, C.F.: Matrix Computations, 3rd edn. Johns Hopkins University Press, Baltimore (1996) 8. Gunzburger, M., Lee, H.K.: An optimization-based domain decomposition method for the Navier-Stokes equations. SIAM J. Numer. Anal. 37, 1455–1480 (2000) 9. Henson, V.E., Meier Yang, U.: Boomeramg: a parallel algebraic multigrid solver and preconditioner. Appl. Numer. Math. 41(1), 155–177 (2002) 10. Hughes, T.J.R., Brooks, A.: A theoretical framework for Petrov-Galerkin methods with discontinuous weighting functions: Application to the streamline-upwind procedure. In: Rao, R.H.G., et al. (eds.) Finite Elements in Fluids, vol. 4, pp. 47–65. J. Wiley & Sons, New York (1982) 11. Lions, J.L.: Virtual and effective control for distributed systems and decomposition of everything. Journal d’Analyse Math´ematique 80, 257–297 (2000) 12. Nocedal, J., Wright, S.J.: Numerical Optimization, 1st edn. Springer, Heidelberg (1999) 13. Notay, Y.: Aggregation-based algebraic multilevel preconditioning. SIAM J. Matrix Anal. Appl. 27(4), 998–1018 (2006) 14. Sala, M., Tuminaro, R.S.: A new Petrov–Galerkin smoothed aggregation preconditioner for nonsymmetric linear systems. SIAM J. Sci. Comp. 31(1), 143–166 (2008) 15. Strang, G.: On the construction and comparison of difference schemes. SIAM J. Numer. Anal. 5, 506–517 (1968) 16. St¨ uben, K.: A review of algebraic multigrid. J. Comp. Appl. Math. 128(1-2), 281– 309 (2001) 17. Toselli, A., Widlund, O.: Domain Decomposition Methods - Algorithms and Theory. Springer, New York (2005)
Least-Squares Spectral Element Method on a Staggered Grid Marc Gerritsma, Mick Bouman, and Artur Palha TU Delft, Delft, The Netherlands
[email protected]
Abstract. This paper describes a mimetic spectral element formulation for the Poisson equation on quadrilateral elements. Two dual grids are employed to represent the two first order equations. The discrete Hodge operator, which connects variables on these two grids, is the derived Hodge operator obtained from the wedge product and the innerproduct. The gradient operator is not discretized directly, but derived from the symmetry relation between gradient and divergence on dual meshes, as proposed by Hyman et al., [5], thus ensuring a symmetric discrete Laplace operator. The resulting scheme is a staggered spectral element scheme, similar to the staggering proposed by Kopriva and Kolias, [6]. Different integration schemes are used for the various terms involved. This scheme is equivalent to a least-squares formulation which minimizes the difference between the dual velocity representations. This generates the discrete Hodge- operator. The discretization error of this schemes equals the interpolation error.
1
Mimetic Formulation of the Poisson Equation
The Poisson equation is given by Δφ = f ,
(1)
supplemented with appropriate boundary conditions (Dirichlet, Neumann, etc). This equation can be written in vector form as a first order system u = gradφ , divu = f .
(2)
In terms of differential forms this equation is given as u(1) = dφ(0) ,
dq (n−1) = f (n) , q (n−1) = u(1) ,
(3)
where u(1) denotes a 1-form, φ(0) is a 0-form and f (n) is a prescribed n-form, n = dim(Ω). The Hodge -operator associates the 1-form with a corresponding (n − 1)-form. Mimetic discretizations are based on the strong analogy between differential geometry and algebraic topology. The global, metric-free description can be rephrased without error in terms of cochains, while the local description requires I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 653–661, 2010. c Springer-Verlag Berlin Heidelberg 2010
654
M. Gerritsma, M. Bouman, and A. Palha
a differential field reconstruction. The two operators which switch between these two descriptions are the reduction operator, R and the interpolation operator, I. For a thorough discussion of the interplay between differential forms and cochains the reader is referred to [1,3,7,8]. In this paper, we work on two dual meshes, the Gauss-Lobatto-Legendre (GLL) mesh and the extended Gauss (EG) mesh. All variables, functions and points on the EG-mesh are referred to with a ˜ i (ξq ) is a basis function on the EG-mesh evaluated at a point on the twiddle. So h GLL-mesh, while hi (ξ˜q ) is a basis function defined on the GLL-mesh evaluated at a point on the EG-mesh.
2
Dual Spectral Element Meshes
The two equations in (3) can both be represented exactly in discrete form. However, the u in the first equation, u(1) = dφ(0) , is internally oriented along contours (or line segments), whereas the q (n−1) in the second equation is externally oriented with respect to (n − 1)-dimensional hyper-surfaces. This duality between k-forms acting internally on k-dimensional geometric objects and externally (n − k)-forms acting on (n − k)-dimensional geometric objects can be represented by two dual meshes, [7,8]. For the first mesh we choose the a spectral element mesh with a Gauss Lobatto-Legendre (GLL) points. Let ξi , denote the zeros of 1 − ξ 2 LN (ξ) where LN (ξ) is the derivative of the Legendre polynomial of degree N for i = 0, . . . , N . Then the GLL-mesh is given by (ξi , ξj ), i, j = 0 . . . , N , in the case n = 2. The GLL-mesh is depicted in Fig. 1 with the black squares.
1
0
−1 −1
0
1
Fig. 1. Dual spectral element grids, GLL grid (solid lines) and the EG grid (dashed lines)
Least-Squares Spectral Element Method on a Staggered Grid
655
The mesh, which is dual to the GLL-mesh, needs to associate p-cells with (n − p)-cells. Geometrically, this means that given a cell-complex (the GLLmesh in the spectral element) as shown in Fig. 1 by the solid lines, we need to set up a corresponding dual element in which p-cells on the primal element correspond to (n − p)-cells on the dual element. In this paper we use the Gauss points supplemented with nodes on the boundary as dual nodes, see Fig. 1, which form the intersection of the dashed lines in Fig. 1. Gauss points: LN (ξ˜i ) = 0 , i = 1, . . . , N
(4)
So we have (ξ˜i , η˜j ) supplemented with the boundary nodes (ξ˜0 , η˜j ) = (−1, η˜j ) , (ξ˜N +1 , η˜j ) = (1, η˜j ) , j = 1, . . . , N ,
(5)
and (ξ˜i , η˜0 ) = (ξ˜i , −1) , (ξ˜i , η˜N +1 ) = (ξ˜i , 1) ,
i = 1, . . . , N .
(6)
This topological positioning of the dual points follows the location of the scalar variables in the support operator method by Hyman et al., [5]. Representing different quantities on different meshes has also been investigated by Kopriva and Kolias, [6]. The extended Gauss mesh will be referred to as the EG-mesh.
3 3.1
The Reduction Operators R and Interpolation Operators I The Conservation Law dq (n−1) = f (n)
On the GLL-grid, the black grid in Fig. 1, we discretize the conservation law dq (1) = f (2) . Let us associate with each line segment, which connects two GLLpoints, a normal velocity integrated over that line segment, see Fig. 2. x qi,j
ηj
=
q
(1)
(ξi , s) ds and
ηj−1
y qi,j
ξi
=
q (1) (s, ηj ) ds .
(7)
ξi−1
The operation of reducing the flux field to integrated values over faces (for n = 2 line segments) is the reduction operator R. For the reconstruction operator I we need an interpolant which interpolates values, defined in the GLL-nodes, given by 1 − ξ 2 LN (ξ) , i = 0, . . . , N , (8) hi (ξ) = N (N + 1)LN (ξi )(ξi − ξ) and an interpolant which interpolates integral quantities defined on edges, given by i−1 ei (ξ) = − dhk (ξ) , (9) k=0
656
M. Gerritsma, M. Bouman, and A. Palha y q 1,5
1 x q 1,4
y q 2,5 x q 2,4
y q 1,4
x q 1,3
x q 2,3
x q 1,2
x q 1,1
x q 4,3
x q 3,2
x q 2,1
x q 5,3
y q 4,3
x q 4,2
y q 3,2 x q 3,1
y q 2,1
x q 5,4 y q 4,4
y q 3,3
y q 2,2
y q 1,1
x q 4,4
x q 3,3
x q 2,2
y q 4,5
y q 3,4
y q 2,3
y q 1,2
−1 −1
x q 3,4 y q 2,4
y q 1,3
0
y q 3,5
x q 5,2
y q 4,2 x q 4,1
y q 3,1
x q 5,1 y q 4,1
0
1
Fig. 2. Location of the externally oriented flux components on the GLL-mesh as defined by (7)
then the interpolation of these edge values (7) is given by the 1-forms IRq (1) (ξ, η) = −
N N i=1 j=0
y qi,j ei (ξ)hj (η) +
N N
x qi,j hi (ξ)ej (η) .
(10)
i=0 j=1
Note that the basis functions ei have the property that ⎧ ξk ⎨ 1 if i = k ei (ξ) = ⎩ ξk−1 0 if i = k
(11)
which amounts to RI = Id, so this interpolation is the left inverse of the reduction operator R. Since I is only an approximate right inverse of R, (10) is an approximation to the true solution q (1) . If we take the exterior derivative of (10) we obtain dIRq (1) (ξ, η) = −
N N i=1 j=0
y qi,j ei (ξ)dhj (η) +
N N
x qi,j dhi (ξ)ej (η)
i=0 j=1
N N x y y x = qi,j + qi,j ei (ξ)ej (η) − qi−1,j − qi,j−1
(12)
i=1 j=1
= IδRq (1) , where δ is the coboundary operator acting on (n − 1)-cochains. This shows that the interpolation operator satisfies the commutation relation dI = Iδ, the second commuting diagram property, CDP2, by Bochev and Hyman, [3].
Least-Squares Spectral Element Method on a Staggered Grid
657
The reduction operator applied to the 2-form f (2) is defined as the integration of f (2) over the volumes [ξi−1 , ξi ] × [ηj−1 , ηj ], ξi ηj (2) fi,j = Rf := f (2) , i, j = 1 . . . , N . (13) ξi−1
ηj−1
The interpolation of the 2-cochain fi,j is defined as Ifi,j =
N N
fi,j ei (x)ej (y) .
(14)
i=1 j=1
Using (12) and (14) we can write dq (1) = f (2) as N N
y y x x qi,j + qi,j − qi−1,j − qi,j−1 − fi,j ei (x)ej (y) = 0 .
(15)
i=1 j=1
This equation can be interpreted as a spectral interpolation of a finite volume method. An alternative way of writing this equation is in terms of the incidence matrix, F2,1 , which is a matrix representation of the coboundary operator acting on 1-cochains
x qi,j F2,1 − f (16) y i,j ei (x)ej (y) = 0 . qi,j Discretization of u(1) = dφ(0)
3.2
For the discretization of u(1) = dφ(0) we use the EG-mesh as depicted in Fig. 1. The reduction operator for the 0-form φ(0) is defined by sampling the scalar function φ(0) in the extended Gauss points, i.e. φ˜i,j = Rφ(0) := φ(0) (ξ˜i , η˜j ) .
(17)
A nodal interpolation of the φi,j is given by I φ˜i,j =
N N i=1 j=1
+
˜ i (ξ)h ˜ j (η) + φ˜i,j h
N
˜ N +1 (η) h ˜ i (ξ) φ˜i,0 ˜ h0 (η) + φ˜i,N +1 h
i=1
N
˜ N +1 (ξ) h ˜ j (η) , φ˜0,j ˜ h0 (ξ) + φ˜N +1,j h
(18)
j=1
where
⎧ (−1)N ⎪ (1 − ξ) LN (ξ) ⎪ 2 ⎪ ⎪ ⎪ ⎪ ⎨ (1−ξ2 )LN (ξ) ˜ i (ξ) = h ˜2 ˜ ⎪ ⎪ (1−ξi )LN (ξi )(ξ−ξi ) ⎪ ⎪ ⎪ ⎪ ⎩1 2 (1 + ξ) LN (ξ)
if i = 0 if 1 ≤ i ≤ N if i = N + 1
(19)
658
M. Gerritsma, M. Bouman, and A. Palha
The connection between the divergence equation on the GLL-mesh and the gradient equation on the EG-mesh is established by (0) (1) dφ(0) ∧ q (1) (20) dφ , q ωn = Ω Ω = φ(0) ∧ q (1) − φ(0) ∧ dq (1) , (21) ∂Ω
Ω
where in (20) we use the definition of the Hodge operator, see for instance [4], in (21) we use integration by parts for the wedge product. Equations (20) and (21) are used to implicitly define the dφ(0) . By using (21) we ensure that the resulting Laplace operator will be symmetric, due to the similarity to the support operator method proposed by Hyman et al., [5]. Let us discretize the Hodge of the gradient dφ as Gφ = (G x φ, G y φ) as Gφ = −
N N
(G y φ)k,l ek (ξ)hl (η) +
k=1 l=0
N N
(G x φ)k,l hk (ξ)el (η) .
(22)
k=0 l=1
If we take the inner-product between (10) and (22), and integrate over the domain, then we obtain an approximation of the term on the left hand side of (20). For this integration we use a mixture of Gauss and Gauss-Lobatto integration: For the first term we use Gauss integration in the ξ-direction and Gauss-Lobatto integration in the η-direction, while for the second term we use Gauss-Lobatto in the ξ-direction and Gauss integration in the η-direction ⎧ ⎫ N N N N ⎨ ⎬ x (Gφ, q) = qp,j ej (˜ ηq ) (G x φ)p,l el (˜ ηq ) wpGL wqG + (23) ⎩ ⎭ Ω p=0 q=1 j=1 l=1 N N N N y ˜ q ei (ξp ) (G y φ) ek (ξ˜p ) wG wGL (24) i,q
p=1 q=0
i=1
k,q
p
q
k=1
For the boundary integral, the first term in (21), we insert the reconstructed polynomials I φ˜k,l ∧ Iqi,j = ∂Ω
N N y ˜ y ˜ qi,N φp,N +1 − qi,0 φp,0 wpG + i=1 p=1 N N x ˜ x ˜ qN,j φN +1,q − q0,j φ0,q wqG
(25)
j=1 q=1
And finally, we need to discretize the second term in (21). Here we use Gauss integration in both directions
x N N N N q φ(0) ∧ dq (1) = φp,q F2,1 ei (ξ˜p )ej (˜ ηq )wpG wqG . (26) y q Ω i,j p=1 q=1
i=1 j=1
Least-Squares Spectral Element Method on a Staggered Grid
659
After we have evaluated the integrals numerically, we can write the resulting system as
x q ¯ T A¯ Gφ q = φ¯T F2,1 B¯ q , ∀¯ q= (27) qy where we have assembled the contribution of the two terms in (21) on the right hand side. Note also that A is symmetric. This relation is satisfied for all q¯ if ¯ = A−1 BT F2,1 T φ¯ = Gφ
−1 T A E1,0 φ¯ , B Discrete Hodge
(28)
where E1,0 is the incidence matrix relating 0-cells to 1-cells on the EG mesh. Once we have this cochain interpolation of dφ(0) on the GLL mesh, we can apply the coboundary operator in integral form as derived in (26) to get the discrete version of d dφ(0) = f (2) 1,0 T E BA−1 BT E1,0 φ¯ = f¯ ,
(29)
where f¯ is a vector containing the 2-cochains fij . Lemma 1. The same discretization is obtained in terms of a least-squares formulation. If we minimize the difference between Gφ and dφ as Find Gφ ∈ Λn−1 Ω; L2 such that for φ ∈ Λ0 Ω; L2
∗Gφ = arg
min
v∈Λn−1 (Ω;L2 )
1 2 v − dφL2 (Ω) 2
(30)
Proof. A necessary condition for a minimizer is dφ ∧ w (v, w) = (∗dφ, w) = Ω = φ∧w− φ ∧ dw , ∀w ∈ Λn−1 Ω; L2 ∂Ω
Ω
The trivial norm-equivalence C1 vL2 (Ω) ≤ vL2 (Ω) ≤ C2 vL2 (Ω) , for C1 = C2 = 1 , establishes well-posedness. Inserting the conforming discrete function space completes the proof.
This minimization is the same as the least-squares minimization described in [2], equations (4.5) and (4.6), and [3], equation (7.10), with γ = 0 and g = 0, respectively. The lemma shows that mixed mimetic method, as described in [1], section 6.4.2, (161), equals the least-squares minimization as described in [2,3].
660
M. Gerritsma, M. Bouman, and A. Palha 2
10
m=1 (numerical) m=1 (interpolation) m=2 (numerical) m=2 (interpolation)
0
10
−2
10
−4
L2 -error
10
−6
10
−8
10
−10
10
−12
10
−14
10
5
10
15
20
25
N
Fig. 3. Convergence of the L2 -error vs the polynomial degree N for the cases m = 1 (blue dashed line) and m = 2 (blue solid line) and the interpolation error of the exact solution on the EG mesh for m = 1 (red dashed line) and m = 2 (red solid line)
4
Results
As a first test to assess the above described method the following test case is considered. We solve Δφ = f (x, y) , (x, y) ∈ [−1, 1]2 ,
(31)
with the exact solution given by φ(x, y) = sin(πx) sin(mπy). The exact boundary conditions for φ are prescribed along the boundary of the domain. In Fig. 3 the L2 -error vs the polynomial degree is plotted for m = 1 and m = 2 in blue. Norm-equivalence in Lemma 1 implies that the error of the least-squares approximation is equal to interpolation error of the exact solution on the EG mesh. This interpolation error of the exact solution is indicated with the red lines in Fig. 3, which confirms that the discretization error equals the interpolation error. This would imply that the numerical solution is nodally exact.
References 1. Bochev, P.B.: A discourse on variational and geometric aspects of stability of discretizations. In: Deconinck, H. (ed.) VKI LS 2003-2005. 33rd Computational Fluid Dynamics Lecture Series, vol. 72, Von Karman Institute for Fluid Dynamics, Chaussee de Waterloo, Rhode Saint Genese (2005) (B-1640) ISSN0377-8312 2. Bochev, P.B., Gunzburger, M.: On least-squares finite element methods for the Poisson equation and their connection to the Dirichlet and Kelvin Principles. SIAM J. Numer. Anal. 43(1), 340–362 (2005) 3. Bochev, P.B., Hyman, J.M.: Principles of mimetic discretizations of differential equations. In: Arnold, D., Bochev, P., Lehoucq, R., Nicolaides, R., Shashkov, M. (eds.). IMA, vol. 142. Springer, Heidelberg (2006)
Least-Squares Spectral Element Method on a Staggered Grid
661
4. Flanders, H.: Differential forms with applications to the physical sciences. Academic Press, Inc., New York (1963) 5. Hyman, J., Shaskov, M., Steinberg, S.: The numerical solution of diffusion problems in strongly heterogeneous non-isotropic materials. Journal of Computational Physics 132, 130–148 (1997) 6. Kopriva, D.A., Kolias, J.H.: A Conservative Staggered-Grid Chebyshev Multidomain Method for Compressible Flows. Journal of Computational Physics 125, 244– 261 (1996) 7. Mattiussi, C.: The Finite Volume, Finite Difference, and Finite Elements Methods as Numerical Methods for Physical Field Problems. Advances in Imaging and Electron Physics 113, 1–146 (2000) 8. Tonti, E.: On the mathematical structure of a large class of physical theories. Accademia Nazionale dei Lincei, estratto dai Rendiconti della Classe di Scienze fisiche, matematiche e naturali, Serie VIII, Vol. LII, fasc. 1, Gennaio (1972)
Mimetic Least-Squares Spectral/hp Finite Element Method for the Poisson Equation Artur Palha and Marc Gerritsma Delft University of Technology, Delft, The Netherlands
[email protected]
Abstract. Mimetic approaches to the solution of partial differential equations (PDE’s) produce numerical schemes which are compatible with the structural properties – conservation of certain quantities and symmetries, for example – of the systems being modelled. Least Squares (LS) schemes offer many desirable properties, most notably the fact that they lead to symmetric positive definite algebraic systems, which represent an advantage in terms of computational efficiency of the scheme. Nevertheless, LS methods are known to lack proper conservation properties which means that a mimetic formulation of LS, which guarantees the conservation properties, is of great importance. In the present work, the LS approach appears in order to minimise the error between the dual variables, implementing weakly the material laws, obtaining an optimal approximation for both variables. The application to a 2D Poisson problem and a comparison will be made with a standard LS finite element scheme.
1
Introduction
The role of physics lies, essentially, in the translation of a physical phenomena (heat transport, fluid flow, electromagnetism, etc.) into an adequate mathematical statement that yields correct predictions of the phenomena, that is, models it. This translation is usually accomplished by setting up a set of partial differential or integro-differential equations that relate the different physical quantities. In order to do this, these physical quantities have to be represented by scalar fields (e.g., temperature) or vector fields (e.g., velocity). The main drawback of the classical approach which uses scalar and vector fields to represent physical properties is the fact that they obscure the topological and geometrical content of the model. Differential geometry provides a more adequate framework for the analysis of these problems. A specific feature of differential forms is that they possess algebraic as well as geometrical, topological, differential, integral, and many other properties. The generality of this framework allows one, for example, to express the classical ∇, ∇× and ∇· differential operators as well as the gradient theorem, Kelvin-Stokes theorem and Gauss theorem, in a concise manner in terms of differential forms and an operator, called exterior derivative, acting on these objects. I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 662–670, 2010. c Springer-Verlag Berlin Heidelberg 2010
Mimetic Least-Squares Spectral/hp Finite Element Method
663
Apparently the equations that describe different branches of science look completely distinct. However, the concept of “conservation law” ranges from the conservation of specific quantities (energy, mass, etc.), to the conservation of the balance between a change of physical quantities (energy, linear momentum, angular momentum and mass) and relevant external forcing (such conservation laws may be called the balance conservation laws). What do they have in common? Tonti [15], aided by the generality provided by differential geometry, pointed out that there exists a fundamental similarity between these physical theories that can be traced back to common geometric principles from which they stem. For this, Tonti developed the so-called Tonti diagrams. These diagrams are graphical representations of these field equations. With them it becomes clear that physical theories are made up of equations that relate global integral quantities, associated with appropriate geometrical objects (oriented p-dimensional space-time surfaces), and equations that relate local physical quantities with differently oriented geometrical objects. The first, known as conservation or equillibrium laws, have an inherently topological nature in the sense that they are independent of metric. The latter, also known as material contitutive relations, are instrinsically connected to the metric. For more comprehensive information the reader is referred to the works by Tonti [15], Bossavit [4] and Mattiussi [13]. Numerical schemes are an essential tool for solving PDE’s. These schemes, being a model reduction, inherently lead to loss of information of the system being modeled, namely on its structure, e.g. conservation of certain quantities – mass, momentum, energy, etc. – and symmetries, which are embedded into the PDE’s as a result of the geometrical properties of the differential operators. It is known today, that the well-posedness of many PDE problems reflects geometrical, algebraic topological and homological structures underlying the problem. It is, therefore, important for the numerical scheme to be compatible with these structures (the physics), i.e., to mimic them. The goal of mimetic methods is to satisfy exactly, or as good as possible, the structural properties of the continuous model. It is becoming clear that in doing so, one obtains stable schemes. Additionally, a clear separation between the processes of discretization and approximation arises, the latter only take place in the constitutive relations. Least Squares (LS) schemes offer many desirable properties, most notably the fact that they lead to symmetric positive definite algebraic systems, which represent an advantage in terms of computational efficiency of the scheme. Nevertheless, LS methods fail to satisfy the conservation laws, [14]. A mimetic LS formulation will satisfy the conservation law exactly. In the cuurrent paper, the LS approach is used to minimise the error between the dual variables, obtaining an optimal approximation. This LS approximation is known in the literature as the implementation of Weak Material Laws, as proposed by Bochev and Hyman in [1].
2
Mimetic Approaches for the 2D Poisson Equation
For mimetic schemes we introduce basic concepts from differential geometry. The construction of the Tonti diagram for the 2D Poisson equation will be presented
664
A. Palha and M. Gerritsma
and also an analysis of possible routes for mimetic discretizations, in particular, the one taken in this work. 2.1
Differential Geometry, a Refresher
For a brief introduction to differential geometry, see for instance [5,8], one needs essentially to introduce one object (k-differential form, or k-form), four operators on these objects (wedge product, ∧, exterior derivative, d, inner product and the Hodge- operator, ). A k-differential form, or k-form, is a k-linear and antisymmetric mapping: ω k : T Mx × . . . × T Mx →R
(1)
k
where T Mx is the tangent space of a k-differentiable manifold (smooth kdimensional surface) at x. The wedge product, ∧, in a n-dimensional space, is defined as a mapping: ∧ : Λk × Λl → Λk+l ,
k + l < n,
(2)
with the property that: ω k ∧ αl = (−1)kl αl ∧ ω k , where Λk is the space of k-forms. This means that, in Rn , one can write any k-form as: ωk = wi1 ,...,ik (x)dxi1 ∧ . . . ∧ dxik ,
(3)
(4)
i1 <...
where dxi are elemental basis 1-forms. In this way one can define the set of functions {wi1 ,...,ik (x)} as proxies of k-forms, in the sense that there is a oneto-one correspondence between these sets and k-forms. Moreover, one can verify that these sets, in R2 for example, have a correspondence to well known entities: scalar fields as proxies for 0-forms and 2-forms, vector fields as proxies for 1forms. The inner product in Rn and this correspondence between k-forms and their proxies, provides the mapping (·, ·) : Λk × Λk → R: ai1 ,...,ik (x)bi1 ,...,ik (x) (5) (αk , β k ) = i1 <...
The L2 inner product for k-forms, in n-dimensional space, can also be defined by: k k k k n α ,β Ω = α ,β ω , (6) Ω
where the n-form ω n = dx1 ∧ · · · ∧ dxn in an n-dimensional space. The exterior derivative d, in a n-dimensional space, is a mapping: → Λk+1 , d : Λk
k = 0, 1, . . . , n − 1,
(7)
Mimetic Least-Squares Spectral/hp Finite Element Method
which satisfies: d ω k ∧ αl = dω k ∧ αl + (−1)k ω k ∧ dαl ,
k+l
665
(8)
and ∀ω ∈ Λk ,
ddω k = 0,
k
(9)
The Hodge- operator, in an n-dimensional space, is a mapping: : Λk → Λn−k ,
k ≤ n,
(10)
∀α, β ∈ Λk .
(11)
that satisfies: α ∧ β = (α, β) ω n ,
With the above mentioned definitions it is straighforward to verify that in 2D, the usual, ∇f , ∇×v and ∇·v, are easily expressed as df 0 , dv 1 , dv 1 , respectively. Additionally, for a simply connected domain Ω the spaces of differential forms together with the exterior derivative d constitute an exact sequence, called the de Rham complex, which, in 2D is1 : / Λ0
R
d
d
/ Λ2
Λ˜2 o
2.2
/ Λ1
d
Λ˜1 o
(12)
d
Λ˜0 o
R
Tonti Diagram for the 2D Poisson Equation and Its Discretizations
In order to construct the Tonti diagram for the 2D Poisson equation, −∇2 φ = f , one needs first to restate it as a system of first order PDE’s2 : ⎧
⎨ −dφ0 = u1 −∇φ = u ⇔ (13) d˜ v 1 = f˜2 , ∇·u=f ⎩ v˜1 = u1 where one clearly sees that the u1 that appears in the first equation is not equal to the v˜1 (a twisted 1-form) that appears in the second equation, but rather 1
By using the and -operators, one can convert this differential complex to a complex in terms of vector operators. In 2D the associated De Rham complex in vector form is given by R
/ H1
∇
L2 o 2
∇·
∇×
/ H 1 (curl)
/ L2
H 1 (div) o
−∇⊥
H1 o
R
This special form can be found in [3,6]. Here f 2 denotes a 2-form. Not to be confused with the square of f .
666
A. Palha and M. Gerritsma
it is related to it through a material constitutive relation. This is not explicit when using standard vector calculus and, as it will be seen later (sections 3 and 4), it plays an important role in the accurate numerical solution of the Poisson equation. The Tonti diagram for this system of equations is: φ0
f˜O 2 d
d
u1
(14)
/ v˜1
In order to numerically solve this problem, different discretization approaches may be implemented. Three main approaches may be followed: dual grid methods, elimination methods and primal-dual methods. The first approach, albeit not always possible since it depends on the existence of a pair of topologically dual grids, is the one that enables a one-to-one correspondence between dual variables which results in a simple discrete Hodge- operator and enables one to satisfy exactly all equations globally. The second approach sacrifices one of the equillibrium equations while satisfying exactly the other equilibrium equation and the constitutive equation locally. The third approach satisfies exactly the equilibrium equations between unkown physical quantities but relaxes the constitutive equation, being enforced weakly. This third approach is the one followed in this work. Hence, discretization of the problem in appropriate function spaces leads to the following Tonti diagram to be solved (dotted lines represent weakly imposed relations): (15) φ0h f˜h2 O d
u1h
3
d
/ v˜1 h
Weak Material Laws: The Role of Least-Squares
As proposed by Bochev and Hyman [1], a way of defining the Hodge- operator and hence the constitutive equation in a weak sense is by using a least-squares minimization process that penalizes the discrepancy between the dual physical quantities. The exact equillibrium equation appears as a linear constraint that must be satisfied by the minimizers of the functional. Hence the problem is reduced to a constrained minimization problem: Seek (φ0h , u1h , v˜h1 ) in Λ0h × Λ1h × Λ˜1h such that I(φ0h , u1h , v˜h1 ) = 12 v˜h1 + u1h 20 + d˜ vh1 − f˜2 20
(16)
subject to: − dφ0h = u1h
(17)
Mimetic Least-Squares Spectral/hp Finite Element Method
667
If the subspaces Λ0h , Λ1h and Λ2h , and the twisted ones, are chosen in such a way that they constitute a de Rham complex: R
/ Λ0 h
d
d
h
Λ˜2h o
/ Λ1
d
Λ˜1h o
/ Λ2 h
(18)
d
Λ˜0h o
R
then (17) is satisfied exactly and one can substitute u1h by −dφ0h without any approximation involved. In this way the constrained minimization problem is reduced to a simple minimization problem only on two variables, φ0h and v˜h1 : Seek (φ0h , v˜h1 ) in Λ0h × Λ˜1h such that I(φ0h , v˜h1 ) = 12 v˜h1 − dφ0 20 + d˜ vh1 − f˜2 20
(19)
In this way, the Hodge- operator is implemented as L2 projections between the different dual spaces.
4
Application to the 2D Poisson Equation
To apply the above mentioned scheme to the 2D Poisson equation first the appropriate subspaces Λ0h , Λ1h and Λ2h , and the associated twisted form spaces, must be specified. Since one will use a spectral/hp LS method, these spaces are defined as:
Λ1h,p
Λ0h,p = span hpi (x)hpj (y) , i = 0, . . . , p j = 0, . . . , p ˜ p−1 (x)hp (y) ⊗ hp (x)h ˜ p−1 (y) , i, m = 1, . . . , p j, n = 0, . . . , p = span h n m i j ˜ p−1 (x)h ˜ p−1 (y) , i = 1, . . . , p j = 1, . . . , p Λ2h,p = span h i j
where hpi (ξ) is the i-th Lagrange interpolant of order p over Gauss-Lobatto˜ p (ξ) is the i-th Lagrange interpolant of order p over the Legendre points and h i Gauss-Legendre points. We see that with this choice the degrees of freedom associated with these bases of the discrete subspaces are located where they should be: at nodal points (for 0-forms), at edges (for 1-forms) and at volumes (for 2-forms). In this way, the resulting reconstructed physical quantities will have different continuity properties: continuity across elements (0-forms), tangential continuity along edges (1-forms) and no continuity across elements (2-forms). It is possible to show that these subspaces constitute a De Rham complex, as in (18) and hence they are suitable to be used to represent the unknown degrees of freedom. Kopriva, [12], employed a similar use of staggered spectral element grids.
668
A. Palha and M. Gerritsma
Fig. 1. Numerical solution of φ for mimetic LS approach (left) and standard LS (right), with p = 4 and 4 elements
Fig. 2. Numerical solution of uy for mimetic LS approach (left) and standard LS (right), with p = 4 and 4 elements
In order to assess the above described method, it will be applied to the solution of the 2D Poisson equation with a particular right hand side and Dirichlet boundary conditions in order to obtain the following analytical solution: φ(x, y) = cos(x2 ) + y 2 ,
x ∈ [−1, 1] × [−1, 1]
(20)
Results using mimetic least-squares method and a standard least-squares approach are compared. In Fig. 1 the numerical solutions for φ(x, y) for the mimetic LS method and for the standard LS method are presented. In Fig. 2 the numerical solutions for uy (x, y) for the mimetic LS approach and for the standard LS method are presented. One clearly observes the oscillations in the solution of the standard LS solution. These wiggles are responsible for the increase in the L2 error and also on the increased value of the curl of u(x, y). In Fig. 3 the L2 errors for the
Mimetic Least-Squares Spectral/hp Finite Element Method
669
Fig. 3. L2 convergence for mimetic and standard LS (left) and ∇ × u for mimetic LS and standard LS (right)
mimetic and standard LS approaches are presented (for φ, u and primal–dual velocity differences).
5
Concluding Remarks
As can be seen in Fig. 3, the mimetic LS method results in smaller errors, especially for the u variable. Additionally, with regard to ∇ × u one sees a great improvement in the mimetic LS method, where the errors are approximately 2 orders of magnitude smaller compared to conventional LS. The primal–dual differences, v˜h1 + u1h 0 , appears to be a good estimator for the error of the numerical solution, as can be seen in Fig. 3.
Acknowledgments The authors would like to acknowledge Foundation for Science and Technology (Portugal) for the funding given by the PhD grant SFRH/BD/36093/2007.
References 1. Bochev, P., Hyman, J.: Principles of mimetic discretizations of differential operators. IMA 142, 89–119 (2006) 2. Bochev, P.: Discourse on variational and geometric aspects of stability of discretizations. 33rd Computational Fluid Dynamics Lecture Series, VKI LS 20032005 (2003) 3. Bochev, P., Robinson, A.C.: Matching algorithms with physics: exact sequences of finite element spaces. In: Estep, D., Tavener, S. (eds.) Collected lectures on preservation of stability under discretization. SIAM, Philadelphia 4. Bossavit, A.: On the geometry of electromagnetism. J. Japan Soc. Appl. Electromagn. & Mech. 6 (1998)
670
A. Palha and M. Gerritsma
5. Burke, W.L.: Applied differential geometry. Cambridge University Press, Cambridge (1985) 6. Demkowicz, L.: Computing with hp-adaptive finite elements, vol. 1. Chapman and Hall/CRC (2007) 7. Desbrun, M., Kanso, E., Tong, Y.: Discrete differential forms for computational modeling. In: SIGGRAPH 2005: ACM SIGGRAPH 2005 Courses (2005) 8. Flanders, H.: Differential forms with applications to the physical sciences. Academic Press, Inc., New York (1963) 9. Proot, M., Gerritsma, M.: A least-squares spectral element formulation for the Stokes problem. J. Sci. Computing 17, 285–296 (2002) 10. Hiptmair, R.: Discrete Hodge operators. Numer. Math. 90, 265–289 (2001) 11. Jiang, B.: The least squares finite element method: theory and applications in computational fluid dynamics and electromagnetics. Springer, Heidelberg (1998) 12. Kopriva, D.A., Kolias, J.H.: A Conservative Staggered-Grid Chebyshev Multidomain Method for Compressible Flows. Journal of Computational Physics 125, 244– 261 (1996) 13. Mattiussi, C.: An analysis of finite volume, finite element, and finite difference methods using some concepts from algebraic topology. J. Comp. Physics 133, 289– 309 (1997) 14. Proot, M.M.J., Gerritsma, M.I.: Mass- and momentum conservation of the leastsquares spectral element method for the Stokes problem. Journal of Scientific Computing 27(1-3), 389–401 (2007) 15. Tonti, E.: On the formal structure of physical theories. Consiglio Nazionale delle Ricerche, Milano (1975)
Adaptive Least Squares Finite Element Methods in Elasto-Plasticity Gerhard Starke Institut f¨ ur Angewandte Mathematik, Universit¨ at Hannover, 30167 Hannover, Germany
[email protected] http://www.ifam.uni-hannover.de/~ gcs/
Abstract. In computational mechanics applications, one is often interested not only in accurate approximations for the displacements but also for the stress tensor. Least squares finite element methods are perfectly suited for such problems since they approximate both process variables simultaneously in suitable finite element spaces. We consider an H 1 × H(div) least squares formulation for the incremental formulation of elasto-plasticity using a plastic flow rule of von Mises type. The nonlinear least squares functional constitutes an a posteriori error estimator on which an adaptive refinement strategy may be based. The variational formulation under plane strain and plane stress conditions is investigated in detail. Standard conforming elements are used for the displacement approximation while the stress is represented by Raviart-Thomas elements. The algebraic least squares problems arising from the finite element discretization are nonlinear and nonsmooth and may be solved by generalized Newton methods.
1
Introduction
A least-squares mixed finite element method for the incremental formulation of elasto-plastic deformation models is investigated in this contribution. It is based on a first-order system based on the stress tensor and the displacement field as independent process variables. The method was introduced and analyzed in [9] where a benchmark example under the plane strain assumption was used for computational testing, In this contribution we show that the plane stress case can be handled by our approach in much the same way and we discuss the solution of the arising nonlinear and non-smooth algebraic least-squares problems by a generalized Newton method. The monograph by Bochev and Gunzburger [5] gives a survey the many variants and application areas for finite element methods of least-squares type. These methods constitute an alternative to mixed finite element methods of saddle point structure whenever accurate approximations of the stress tensor is desired. Among its advantages is the greater flexibility in combining finite element spaces for the different process variables which are not restricted by an inf-sup condition. Moreover, if the least-squares functional is shown to be equivalent to a I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 671–678, 2010. c Springer-Verlag Berlin Heidelberg 2010
672
G. Starke
norm of the error, then its local evaluation provides an a posteriori error estimator. This may be used in an adaptive refinement technique, see [3] for a detailed study of such strategies in the context of least-squares formulations. The most appropriate combination for the elasto-plasticity models treated in this paper consists of Raviart-Thomas elements for the stresses coupled with conforming finite element spaces of the same polynomial degree for the displacement components. This is due to the fact that the same order of approximation is achieved for the individual variables. In particular, next-to-lowest order Raviart-Thomas spaces are combined with piecewise quadratic conforming finite elements in our computations. Such a least-squares finite element approach was extended to the Signorini contact problem in [1]. The numerical simulation of elasto-plastic deformation processes has been an intensive area of research for several decades. Two monographs which appeared at the end of the last century cover the state of the art from a more engineeringoriented perspective [8] and a more abstract mathematical view [7]. The solution of the nonlinear algebraic systems associated with finite element discretizations of elasto-plastic models was the subject of much recent research acitivities including [2,4,11]. The connection of generalized Newton-type methods with the radial return projection approaches common in the engineering community was investigated in [12].
2
Least-Squares Formulation of Incremental Elasto-Plasticity
The common models for elasto-plastic deformation processes may be written as a first-order system div σ = 0 ,
(1)
σ = C(ε(u) − p) ,
(2)
for the stress tensor σ : Ω → IR3×3 and the displacement field u : Ω → IR3 . In (2), div σ means row-wise application of the divergence operator, and ∇u contains the gradient vectors of the components of u in each row. Similarly to the model of linear elasticity, ε(u) =
1 (∇u + (∇u)T ) 2
(3)
denotes the linearized strain tensor, and Cε = 2με + λ (tr ε)I
(4)
represents a linear material law. The difference to the elastic case lies in the term p which stands for the plastic strains satisfying additional constraints. In particular, for the von Mises plasticity model, the deviatoric stress part dev(σ) = σ −
1 (tr σ) I 3
(5)
Adaptive Least Squares Finite Element Methods in Elasto-Plasticity
673
satisfies the constraint
2 K(α) (6) 3 with a hardening function K(α). The hardening parameter α : Ω → IR constitutes an additional process variable in the case of elasto-plasticity with hardening which is governed by an evolution equation as is the case for the plastic strains. Therefore, elasto-plasticity models become time-dependent with the need to employ an appropriate time-discretization scheme. The model for elasto-plastic deformation processes described above is taken from [8, Chap.2]. Discretization in time by an implicit Euler scheme leads to a first-order system for the increments σ inc and uinc in the representation σ = σ old + σ inc and u = uold + uinc , respectively. The system associated with one time-step in an incremental formulation of elasto-plasticity may be written as |dev(σ)| ≤
div(σ old + σ inc ) = 0 , σ
inc
− R(ε(u
inc
); σ
old
,α
old
) =0.
(7) (8)
The stress operator R(ε; σ old , αold ) in (8) depends, in general, nonlinearly and non-smoothly on ε as soon as plastic deformation occurs. For notational convenience the increments σ inc and uinc are simply denoted by σ and u (which had a different meaning in (2)) throughout the rest of this paper. For simplicity, we will also omit the dependence on σ old and αold in the stress operator and simply write R(ε) instead of R(ε; σ old , αold ). With the Sobolev spaces HΓN (div, Ω) = {s ∈ H(div, Ω) : s · n = 0 on ΓN } , HΓ1D (Ω) = {p ∈ H 1 (Ω) : p = 0 on ΓD } , the solution of the system (7), (8) is then sought in σ N + HΓN (div, Ω)3 for σ : Ω → IR3×3 and in HΓ1D (Ω)3 for u : Ω → IR3 . Here, σ N ∈ H(div, Ω)3 satisfies the boundary conditions σ N · n = g on ΓN where g denotes the increment of the boundary traction. For the case of von Mises plasticity with isotropic hardening, the stress response is given by 1 dev(σ old + C ε) γR (dev(σ old + C ε)) R(ε) = C ε − , (9) 2μ |dev(σ old + C ε)| where the return parameter γR (dev(σ old + C ε)) is implicitly defined as the solution of the nonlinear equation 2 2 γR old old γR = |dev(σ + C ε)| − K α + , (10) 3 3 2μ if |dev(σ old + C ε)| > 2/3K(αold ) and γR (dev(σ old + C ε)) = 0 otherwise. The hardening parameter is updated by 2 γR (dev(σ old + C ε)) old α=α + . (11) 3 2μ
674
G. Starke
The least-squares formulation, associated with (7) and (8), consists in minimizing the functional F (σ, u; σ old , αold ) = div(σ old + σ)2 + C −1/2 (σ − R(ε(u)))2
(12)
among all suitable (σ, u) ∈ H(div, Ω)3 × H 1 (Ω)3 where · abbreviates the L2 (Ω)d norm. More precisely, our aim is to find σ ∈ σ N + HΓN (div, Ω)3 and u ∈ HΓ1D (Ω)3 such that F (σ, u; σ old , αold ) ≤ F(σ N + τ , v; σ old , αold )
(13)
holds for all τ ∈ HΓN (div, Ω)3 and v ∈ HΓ1D (Ω)3 . Due to the fact that R(ε(u)) is not differentiable everywhere, the subdifferential ∂R(ε(u)) needs to be employed in the formulation of the variational problem. The minimum of (12) is given by the solution of the variational problem (div(σ old + σ), div τ ) + (C −1 (σ − R(ε(u)), τ ) = 0 0 ∈ (C −1 (σ − R(ε(u)), ∂R(ε(u))[ε(v)])
(14) (15)
for all τ ∈ HΓN (div, Ω)3 and v ∈ HΓ1D (Ω)3 (see e.g. [7, Section 4] for the use of the subdifferential in the context of plasticity models). If K(α) satisfies K(α) ≥ K0 > 0 , K (α) ≥ K1 > 0
(16)
for all α > 0, then it is known that the system (7), (8) possesses a unique solution (cf. [7, Section 8]). Obviously, this solution is also the unique minimizer of (12). Under the assumption (16), it can also be shown that the least-squares functional (12) is equivalent to the H(div, Ω) × H 1 (Ω) norm of the error in the following sense (see [9, Theorem 3.2]). Let (σ, u) ∈ (σ N +HΓN (div, Ω)3 )×HΓ1D (Ω)3 denote the solution of the system (7), (8). Then, there are positive constants β1 , β2 such that ¯ 2div,Ω + u − u ¯ u ¯ ; σold , αold ) ¯ 21,Ω ≤ F(σ, β1 σ − σ ¯ 2div,Ω + u − u ¯ 21,Ω ≤ β2 σ − σ ¯ u ¯ ) ∈ (σ N + HΓN (div, Ω)3 ) × HΓ1D (Ω)3 (e.g. in hold for any approximation (σ, finite element spaces). This norm equivalence implies that the solution of (14), (15) is unique.
3
Plane Stress Model and Finite Element Approximation
There are certain situations in which the simulation may actually be carried out in a two-dimensional domain. For thin plane-like structures where loads are only applied in tangential directions, all stress components perpendicular to the plane vanish leading to the plane stress model. The plane strain case where the
Adaptive Least Squares Finite Element Methods in Elasto-Plasticity
675
strains are restricted to two dimensions is treated in [9]. For standard finite element approaches, the plane stress model is often avoided due to additional complications associated with the third displacement component. Within the least-squares framework the treatment of the plane-stress situation is just as straightforward as the plane-strain case as we shall describe now. Assuming plane stress conditions means that ⎞ ⎛ σ11 σ12 0 σ = ⎝ σ21 σ22 0 ⎠ , (17) 0 0 0 where σ 1 = (σ11 , σ12 ) ∈ H(div, Ω) and σ 2 = (σ21 , σ22 ) ∈ H(div, Ω) for a twodimensional domain again denoted as Ω. For the strain tensor, this implies ⎛ ⎞ (∂2 u1 + ∂1 u2 )/2 0 ∂1 u1 ⎠. ∂2 u2 0 ε(u) = ⎝ (∂2 u1 + ∂1 u2 )/2 0 0 ∂3 u3 Since we are not interested in the displacement component u3 itself but only require the strain component ε33 (x1 , x2 , 0) for (x1 , x2 ) ∈ Ω to complete our model, we may write ⎛ ⎞ (∂2 u1 + ∂1 u2 )/2 0 ∂1 u1 ∂2 u2 0 ⎠ ε(u) = ⎝ (∂2 u1 + ∂1 u2 )/2 0 0 ε33 with ε33 (x1 , x2 , 0) ∈ L2 (Ω). For the two remaining displacement components we still have u1 , u2 ∈ HΓ1D (Ω). This implies that dev(Cε(u)) ⎛2 ⎞ 1 1 1 0 3 ∂1 u1 − 3 ∂2 u2 − 3 ε33 2 (∂2 u1 + ∂1 u2 ) 1 2 1 1 ⎠ 0 = 2μ ⎝ 2 (∂2 u1 + ∂1 u2 ) 3 ∂2 u2 − 3 ∂1 u1 − 3 ε33 2 1 1 0 0 3 ε33 − 3 ∂1 u1 − 3 ∂2 u2 and the least-squares functional (12) is to be minimized with respect to σ i ∈ 1 2 σN i + HΓN (div, Ω), ui ∈ HΓD (Ω), i = 1, 2 and ε33 ∈ L (Ω). The choice of appropriate finite element spaces Σ h , Uh and Sh for the approximation of (σ 1 , σ 2 ), (u1 , u2 ) and ε33 , respectively, is done with the aim to achieve a certain approximation order with respect to the norm in HΓN (div, Ω)2 ×HΓ1D (Ω)2 × L2 (Ω). Suitable for the stress approximation is a product space of Raviart-Thomas elements (of degree k ≥ 0) for σ 1 and σ 2 . The interpolation estimate for Raviart-Thomas elements (cf. [6, Prop. III.3.9]) yields div (σ i − Π h σ i ) + σ i − Π h σ i ≤ Chk+1 (|div σ i |k+1,Ω + |σ i |k+1,Ω ) (18) for i = 1, 2 with a suitable interpolation operator Π h . H 1 -conforming finite elements which consist of piecewise polynomials of degree k + 1 lead to ∇(u − Ψ h u) ≤ Chk+1 |u|k+2,Ω
(19)
676
G. Starke
for the interpolation error. Discontinuous piecewise polynomial interpolation of degree k for ε33 , separately on each element T ∈ Th , leads to ε33 − Qh ε33 ≤ Chk+1 |ε33 |k+1,Ω ,
(20)
where Qh may be chosen as the L2 (Ω) projection. For the least-squares finite element approach, the functional (12) is minimized with respect to the spaces Σ h × Vh × Sh . Combined with the equivalence of the least-squares functional shown in [9], (18), (19) and (20) imply the error estimate div (σ − σ h ) + C −1/2 (σ − σ h ) + C 1/2 ε(u − uh ) ≤ Chk+1 (|div σ 1 |k+1,Ω + |div σ 2 |k+1,Ω + |σ|k+1,Ω + |u|k+2,Ω ) for the least-squares finite element approximation. The regularity assumptions div σ i ∈ H k+1 (Ω), σ ∈ H k+1 (Ω)2 for i = 1, 2 and u ∈ H k+2 (Ω)2 will rarely be fulfilled in applications of practical relevance even for k = 0. More importantly, the equivalence of the least-squares functional (12) to the error norm implies that the functional itself may be used as an a posteriori error estimator. This allows the least-squares finite element method to be implemented in an adaptive fashion based on the local evaluation of the least-squares functional for a posteriori error estimation.
4
Generalized Gauss-Newton Method
For the solution of the nonlinear least-squares problem (13), a Gauss-Newton type iteration is the natural approach. This consists in minimizing a quadratic least-squares functional of the form F (k) (δσ, δu) = div(σ old + σ (k) + δσ)2 +C −1/2 (σ (k) + δσ − R(ε(u(k) )) − S (k) [ε(δu)]2 in each step and setting (σ (k+1) , u(k+1) ) = (σ (k) , u(k) ) + (δσ, δu). Since R(ε(u(k) )) is not necessarily differentiable, S (k) [ε(δu)] is chosen as an element of the subdifferential, in general. For |dev(σ old + C ε(u(k) ))| > 2/3K(αold ), differentiating (9) yields S (k) [ε(δu)] = Cε(δu) (k) (k) (ξ :ε(δ u))ξ 1 − 2μ γR (ξ(k) )[ε(δu)] + γR (ξ (k) ) ε(δ(k)u) − (k) |ξ | |ξ |3 where ξ (k) = dev(σ old + C ε(u(k) )). Differentiating (10) leads to γR (ξ (k) )[ε(δu)]
=
ξ (k) : ε(δu) |ξ(k) |
2 γR (ξ (k) ) 1 (k) old γ (ξ )[ε(δu)] K α + − 3μ R 3 2μ
Adaptive Least Squares Finite Element Methods in Elasto-Plasticity
677
which implies the explicit formula (ξ (k) )[ε(δu)] = γR
1 1+ K 3μ
αold +
2 γR (ξ (k) ) 3 2μ
−1
ξ(k) : ε(δu) |ξ (k) |
(k) old (k) to be usedoldfor the computation of S [ε(δu)]. If |dev(σ + C ε(u ))| < 2/3K(α ) holds, then we are inside the elastic domain and
S (k) [ε(δu)] = Cε(δu) . We may simply use (21) also in the case |dev(σ old + C ε(u(k) ))| = since this certainly constitutes an element of the subdifferential.
5
(21) 2/3K(αold )
A Computational Example
We present numerical results for a benchmark problem of elasto-plasticity taken from [10]. The problem is given by a quadratic plate of an elasto-plastic isotropic material with a circular hole in the centre. At the upper and lower edges of the plate, traction forces pointing outwards are applied. Because of the symmetry of the domain, it suffices to discretize only a fourth of the total geometry. The computational domain is then given by Ω = {x ∈ IR2 : 0 < x1 < 10, 0 < x2 < 10, x21 + x22 > 1} . The boundary conditions on the top edge of the computational domain (x2 = 10, 0 < x1 < 10) are set to σ · n = (0, t)T , on the right edge (x1 = 10, 0 < x2 < 10) and on the circular arc (x21 + x22 = 1) the boundary conditions are σ · n = (0, 0). Symmetry boundary conditions are prescribed on the rest of the boundary, i.e., (σ11 , σ12 ) · n = 0, u2 = 0 on the bottom (x2 = 0, 1 < x1 < 10), and u1 = 0, (σ21 , σ22 ) · n = 0 on the left (x1 = 0, 1 < x2 < 10). The Poisson ratio is ν = 0.29 which implies for the Lam´e constants λ = 1.381 μ. A combination of linear and exponential isotropic hardening of the form K(α) = K0 + Hα + (K∞ − K0 )(1 − e−ωα ) with K0 = 450, K∞ = 750, H = 129 and ω = 16.93 is used as in [10]. The load is increased starting from t = 0 in steps of Δt = 2.5. For each load step, an initial triangulation consisting of 52 elements is successively refined based on the local evaluation of the least-squares functional. The refinement rule is calibrated in such a way that the number of degrees of freedom approximately doubles with each refinement step. Table 1 shows the size of the least-squares functional, at different stages of the simulation. Shown in brackets is the number of iteration steps required to reduce the square of the energy norm of the nonlinear residual in (14), (15) below 10−6 F (σ h , uh ). For t = 150, the results are still well within the elastic domain which means that the results in Table 1 simply correspond to a linear elasticity
678
G. Starke Table 1. Least-squares functional F(σ h , uh ) and iteration counts l t = 150 t = 175 t = 200 t = 250 t = 300 t = 350 t = 400 t = 450
2.24 2.24 2.24 2.18 2.27 2.40 3.72 1.46
0 e-5 e-5 e-5 e-5 e-5 e-5 e-5 e-4
(1) (1) (1) (1) (2) (2) (2) (2)
3.04 3.04 2.98 3.14 4.02 1.89 5.78 8.45
1 e-6 e-6 e-6 e-6 e-6 e-5 e-5 e-5
(1) (1) (2) (2) (2) (2) (2) (2)
5.77 5.77 6.05 8.05 2.65 2.00 1.87 2.65
2 e-7 e-7 e-7 e-7 e-6 e-5 e-5 e-5
(1) (2) (2) (2) (2) (2) (3) (3)
1.37 1.41 1.47 2.05 1.62 5.44 5.60 7.17
3 e-7 e-7 e-7 e-7 e-6 e-6 e-6 e-6
(1) (2) (2) (2) (2) (3) (3) (3)
4.04 4.23 4.37 5.01 3.19 6.95 8.20 9.38
4 e-8 e-8 e-8 e-8 e-7 e-7 e-7 e-7
(1) (2) (2) (2) (3) (3) (3) (3)
problem which, of course, is solved in one Newton step. Inelastic deformation starts around t = 170 which already becomes apparent from the increase to 2 Newton steps on finer levels for t = 175. The maximum number of Newton steps, however, remains bounded by 3 as the load is further increased which indicates that our generalized Newton approach is very effective in this example.
References 1. Attia, F.S., Cai, Z., Starke, G.: First-order system least squares for the Signorini contact problem in linear elasticity. SIAM J. Numer. Anal. 47, 3027–3043 (2009) 2. Axelsson, O., Blaheta, R., Kohut, R.: Inexact Newton solvers in plasticity: Theory and experiments. Numer. Linear Algebra Appl. 4, 133–152 (1997) 3. Berndt, M., Manteuffel, T.A., McCormick, S.F.: Local error estimates and adaptive refinement for first-order system least squares. Electr. Trans. Numer. Anal. 6, 35–43 (1997) 4. Blaheta, R.: Convergence of Newton-type methods in incremental return mapping analysis of elasto-plastic problem. Comput. Methods Appl. Mech. Engrg. 147, 167– 185 (1997) 5. Bochev, P., Gunzburger, M.: Least-Squares Finite Element Methods. Springer, Berlin (2009) 6. Brezzi, F., Fortin, M.: Mixed and Hybrid Finite Element Methods. Springer, New York (1991) 7. Han, W., Reddy, B.D.: Plasticity: Mathematical Theory and Numerical Analysis. Springer, New York (1999) 8. Simo, J.C., Hughes, T.J.R.: Computational Inelasticity. Springer, New York (1998) 9. Starke, G.: An adaptive least-squares mixed finite element method for elastoplasticity. SIAM J. Numer. Anal. 45, 371–388 (2007) 10. Stein, E., Wriggers, P., Rieger, A., Schmidt, M.: Benchmarks. In: Stein, E. (ed.) Error-controlled Adaptive Finite Elements in Solid Mechanics, ch. 11, pp. 385–404. John Wiley and Sons, Chichester (2002) 11. Wieners, C.: Multigrid methods for Prandtl-Reuss plasticity. Numer. Linear Algebra Appl. 6, 457–478 (1999) 12. Wieners, C.: Nonlinear solution methods for infinitesimal perfect plasticity. Z. Angew. Math. Mech. 87, 643–660 (2007)
The Automatic Construction and Solution of a Partial Differential Equation from the Strong Form Joseph Young Institutt for informatikk Universitetet i Bergen
[email protected]
Abstract. In the last ten years, there has been significant improvement and growth in tools that aid the development of finite element methods for solving partial differential equations. These tools assist the user in transforming a weak form of a differential equation into a computable solution. Despite these advancements, solving a differential equation remains challenging. Not only are there many possible weak forms for a particular problem, but the most accurate or most efficient form depends on the problem’s structure. Requiring a user to generate a weak form by hand creates a significant hurdle for someone who understands a model, but does not know how to solve it. We present a new algorithm that finds the solution of a partial differential equation when modeled in its strong form. We accomplish this by applying a first order system least squares algorithm using triangular B´ezier patches as our shape functions. After describing our algorithm, we validate our results by presenting a numerical example.
1 Introduction The variety of algorithms used to solve a partial differential equation has been both an asset as well as a burden. On one hand, we have an assortment of extremely sophisticated tools that allow us to solve a diverse set of problems. On the other, the heterogeneous nature of these algorithms makes it challenging to design a general modeling tool. Within the realm of finite element methods, there has been considerable progress towards this goal. Modeling tools such as deal.II [2], FEniCS [7,10], FreeFEM [9], GetDP [6], and Sundance [11] allow the user to specify the weak form of a differential equation by hand. Then, given a specific kind of element, these tools either assist in or automate the construction of the linear system that arises from the discretization. In spite of their usefulness, these tools assume that their user possesses the technical expertise to find the weak form of a differential equation. Unfortunately, this can be a difficult task. Ideally, we would like a system that can transform the original strong form of the differential equation into a computable solution. This would allow a user with far less technical knowledge to solve a problem than is currently possible. While it is doubtful that such a perfect mechanism exists for all differential equations, we focus on a system that can achieve this goal for a relatively broad class of problems.
This research was financed by the Research Council of Norway through the SAGA-geo project. Special thanks to Magne Haveraaen for his suggestions and guidance.
I. Lirkov, S. Margenov, and J. Wa´sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 679–686, 2010. c Springer-Verlag Berlin Heidelberg 2010
680
J. Young
Specifically, we automate the straightforward least squares algorithm [3,4] using triangular B´ezier patches as our shape functions. Neither our choice of the straightforward least squares algorithm nor our choice of B´ezier patches is unique [14,1,13]. Nonetheless, we combine these pieces in such way that we can automate the construction and solution of any polynomial differential equation where every function can be adequately approximated by a surface composed of several B´ezier patches. This includes all smooth functions as well as, in a practical sense, some discontinuous functions. We do not intend nor claim that this system will provide the best possible solution in all cases. Simply, it provides a smooth solution given relatively little analytical work by the user. In this way, we view it as a tool that allows an end user to rapidly prototype a problem and then determine whether further investigation into an alternative algorithm is necessary.
2 A Calculus for B´ezier Patches and Surfaces The key to this method is the manipulation of surfaces composed of B´ezier patches. In the following section, we introduce and develop a calculus for these surfaces. 2.1 Basic Definition Let us define the Ith Bernstein polynomial of degree k over the jth simplex within the set t as bkI, j,t : p → k (T (x))I [T (x)]i ≥ 0 for all i bkI, j,t (x) = I j 0 otherwise where I ∈ 0p+1 , |I| = k, and T j (x) denotes the solution y of the (p + 1) × (p + 1) linear system z1 z2 . . . z p+1 x y= 1 1 ... 1 1 where t j ∈ p+1×p = (z1 , . . . , z p+1 ) and zi denotes a corners of the simplex j. Based on these polynomials, we form B´ezier patches by taking the sum over all possible polynomials of degree k. We form a surface by summing over all possible simplices. 2.2 Sums, Negation, Subtraction, and Multiplication Our core manipulations include the sum, negation, subtraction, and multiplication of patches. Assume we have two patches, f and g, defined over the same simplex where αI bkI 1 g= βI bkI 2 . f = |I|=k1
|I|=k2
When k1 = k2 , we find the sum f + g by simply adding the corresponding coefficients. However, when the degrees differ, we must elevate the degree of one of the patches until they match. Without loss of generality, assume k1 > k2 . Then, let k k −k 2 1 2 J I−J β J k γI = |J| = k2 (I − J)i ≥ 0∀i
1
I
The Automatic Construction and Solution of a PDE from the Strong Form
681
where |I| = k1 . Then, we have that [5,8,12] g= βI bkI 2 = γI bkI 1 . |I|=k2
|I|=k1
In a similar manner, in order to negate a patch, we simply negate the coefficients. Then, in order to subtract one patch from another, we negate the second patch and add it to the first. Another useful manipulation is the product between two different B´ezier patches which is equal to [5,12] h= γI bkI 1 +k2 |I|=k1 +k2
where γI =
k
1
I1
|I1 | = k1 , |I2 | = k2 , I = I1 + I2
αI1 kI22 βI2 k +k . 1
2
I1 +I2
2.3 Derivatives Next, we give the directional derivative in the direction h of a B´ezier patch as [5,8,12] ⎞ ⎛ ⎜⎜⎜ ⎟ ⎜⎜⎜ αI bk ⎟⎟⎟⎟⎟ (x)(h) = γI˜bk−1 ˜ j,t (x) I, j,t ⎠ ⎝ I, |I|=k
where γI˜ =
|I|=k
˜ |I|=k−1
αI βI˜
βI˜ =
k[T 0j (h)]i I − I˜ = ei . 0 otherwise
In addition, p describes the length of x, ei denotes the ith canonical vector, bk−1 I−ei , j,t (x) = 0 for all x when I−ei contains negative indices, and T 0j (x) denotes the solution to the linear system z1 z2 . . . z p+1 x y= 1 1 ... 1 0 where t j ∈ p+1×p = (z1 , . . . , z p+1 ). This is the same transformation as T j except that the right hand side includes a 0 in the final element rather than 1. In other words, the derivative of a B´ezier patch is simply a B´ezier patch of degree one lower. 2.4 Smoothness Although a single B´ezier patch is smooth and differentiable, a surface composed of many patches may not be differentiable nor continuous between patches. In order to stitch these patches together, we leverage the blossom of the Bernstein polynomials. This introduces a set of linear constraints on the coefficients where we can tune the amount of smoothness between patches.
682
J. Young
We compute the blossom of the Ith Bernstein polynomial of degree k, bˆ I, j,t : k p i=1 → , through the recurrence relation bˆ kI, j,t (x1 , . . . , xk ) =
p+1 [T (x1 )]i bˆ kI−ei , j,t (x2 , . . . , xk ) i=1
where we define bˆ 0I, j,t (x) = 1 for all x and bˆ kI, j,t (x) = 0 for all x when I contains negative indices. Next, let us consider two simplices, defined by the corners r and s, that share a boundary. The face that defines this boundary requires p points. Thus, without loss of generality let s p+1 denote the single point not needed to define this boundary. For example, given the triangulation r3 r1
s2
s3 r2
s1 ,
the points s1 and s2 define the boundary while s3 remains unneeded. Then, the boundary between the B´ezier patches defined over the simplices r and s is q times continuously differentiable when [8,12] αIs = αIrˆ bˆ kI,r,t 1 , . . . , s1 , s2 , . . . , s2 , . . . , s p+1 , . . . , s p+1 ) ˆ (s ˆ I
I1
I2
I p+1
for all I where I p+1 ≤ q and αIr and αIs denotes the coefficients of the B´ezier patch over the simplices r and s respectively. 2.5 Symbolic Coefficients When we use B´ezier surfaces within a differential equation, some coefficients are constant whereas others are symbolic. As a result, we require a mechanism to track our symbolic variables. Fortunately, each manipulation we define above simply forms a new B´ezier surface where the new coefficients are polynomials of the old. Thus, assume we have n unknown variables xi . We represent each coefficient by a tuple (α, a, A, . . . ) where α is a constant, a is a vector, A is a matrix, and higher order terms are represented as tensors. In this manner, the actual value of this coefficient is α + a, x + Ax, x + . . . Therefore, we must also define operations such as addition, subtraction, and multiplication on symbolic coefficients. We define addition as (α, a, A, . . . ) + (β, b, B, . . . ) = (α + β, a + b, A + B, . . . ) multiplication of linear terms as (α, a)(β, b) = αβ, αb + βa, (abT + baT )/2 and the other operations analogously.
The Automatic Construction and Solution of a PDE from the Strong Form
683
3 Algorithm In the following section, we describe our algorithm used to solve the differential equation. We describe this process in three steps. First, we decompose the problem into a first order system. Second, we replace all functions with their B´ezier surface equivalents. Third, we construct and solve the least-squares problem. This involves combining all functions together using our calculus defined above, integrating the resulting surfaces, and solving the final optimization problem. 3.1 Decomposition into a First Order System As with most least squares approaches, we decompose our problem into a first order system. In the straightforward least squares algorithm, this is not necessary, but yields many benefits such as reducing the condition number of the final optimality system [3,4]. We emphasize that there are an infinite number of ways to decompose most problems and choosing one particular scheme may have benefits over the other. Our decomposition method is very simple and simply demonstrates that this decomposition is possible to automate. Our scheme mirrors the trick used to decompose an ordinary differential equation into a system of first order equations. Let us consider a kth order PDE, F(x, u, Dα1 u, . . . , Dαm u) = 0 where αi is a multiindex in 0p . We construct a graph where we label each node by α, for |α| ≤ k, and connect two nodes αi and α j when αi − α j 1 = 1. Thus, we connect two nodes if they differ by a single index. Further, we only allow nodes α when α αi for some αi where denotes pointwise inequality. By its construction, this graph must be connected and contain a spanning tree. The spanning tree gives us our decomposition F(x, u, Dα1−β1 uβ1 , . . . , Dαm −βm uβm ) = 0
Dδ−γ uγ = uδ .
In the first equality, the constant βi is a label where αi βi and both αi and βi are connected in the spanning tree. The second equality must hold for all labels δ and γ such that βi δ γ and both δ and γ are connected in the spanning tree. Finally, we replace any instance of u0 by u. For example, consider the problem D20 u + D11 u + D02 u = 0. The spanning tree generated by the problem is 2,0
1,1 1,0
0,2 0,1
0,0
.
Thus, according to this decomposition, we would rewrite our problem as D10 u10 + D10 u01 + D01 u01 = 0
D10 u = u10
D01 u = u01 .
684
J. Young
3.2 Approximating Non-B´ezier Surfaces Our algorithm requires that all functions be B´ezier surfaces. Therefore, all non-B´ezier surfaces must be approximated. In order to accomplish this, we force the error in the approximation to be orthogonal to the Bernstein polynomials used to construct our B´ezier patches. This leads to the system of equations generated by |t|
αI j bkI, j,t , bkI,ˆ ˆj,t
j=1 I=|k|
L2 (Ω)
= f, bkI,ˆ ˆj,t
L2 (Ω)
for all |I|ˆ = k and all 1 ≤ ˆj ≤ |t|. The domain Ω is a square domain that encompasses the domain of the differential equation. After solving, the variable α defines the coefficients of a B´ezier surface that approximates f . 3.3 Constructing the Least Squares Problem Given a first order decomposition, we rewrite a system of differential equations Fi (x, u, Dα1 u, . . . , Dαm u) = 0 on Ω
G j (x, u, Dα1 u, . . . , Dαm u) = 0 on ∂Ω
as the least squares problem min Fi (x, u, Dα1 u, . . . , Dαm u)2L2 (Ω) + G j (x, u, Dα1 u, . . . , Dαm u)2L2 (∂Ω) . u
i
j
We have chosen the L2 norm since we implement the straightforward least squares algorithm. From a constructive point of view, we require a norm that uses our algebra on B´ezier surfaces. As long as we restrict ourselves to the real numbers, this includes all W k,p norms for an integer k and even p. As a result, it may be possible to automate a more complicated least-squares algorithm as long as it adheres to these norms. However, we do not explore this idea further within this paper. Next, we replace all non-B´ezier surfaces with B´ezier surface approximations. We also discretize u using B´ezier surfaces as our shape functions. The coefficients for each of these surfaces uses our symbolic scheme from above. In the case of our unknown coefficients, initially all terms are zero except for the linear term which is the ith canonical vector, ei , where i denotes the index of that unknown coefficient. Once we have made this substitution, we use our calculus on B´ezier patches from above to combine all patches into a single patch. In other words, we simplify all subtraction, addition, multiplication, negation, and differentiation operations including the squaring of the problem due to the L2 norm. Since we require this simplification, we focus solely on polynomial differential equations composed of these operations. In the end, we are left with the optimization problem ⎞ ⎞ ⎛ |t| ⎛⎜ ⎜⎜⎜ ⎟⎟⎟ ⎟⎟ ⎜⎜⎜ |t| ˆ ki ⎟ k ⎜ i ⎟ ⎜⎜⎜ ⎟⎟⎟ ⎜⎝⎜ minn pI, j (x)bI, j,t ⎟⎟⎠⎟ + q (x)b I, j I, j,t ⎝ ⎠ x∈ Ω ∂Ω j=1 |I|=ki
j=1 |I|=kˆ i
The Automatic Construction and Solution of a PDE from the Strong Form
685
where pI, j and qI, j are polynomials whose coefficients are still represented using our symbolic scheme. Then, we integrate away all the B´ezier surfaces and obtain a polynomial r based on these symbolic coefficients. Finally, we add stitching constraints based on the blossoming scheme described above. Thus, our complete problem has the form minn r(x) st Ax = 0 x∈
where A is a matrix generated by including all possible stitching constraints. As a note, we require the differentiability between patches to be equal to the highest order derivative in the problem. Even though we decompose our problem into first order form, we can not guarantee convergence unless all functions conform to the original space of functions. For example, if we solve Poisson’s equation, Δu = f , we require two derivatives between each patch. In the end, we have a linearly constrained polynomial program. In the case of a linear differential equation, this yields a linearly constrained quadratic program.
4 Numerical Experiments In order to experimentally verify the convergence of our method, we solve Poisson’s equation in two dimensions on the unit square u xx + uyy = x, y → −8π2 cos(2πx) sin(2πy) on [0, 1] × [0, 1] u = x, y → sin(2πy) when x = 0 or x = 1 u = 0 when y = 0 or y = 1. The solution to this problem is the function u where u(x, y) = cos(2πx) sin(2πy). We solved the problem formulated as both a first order system as well as an undecomposed problem using the original formulation. In both cases, we used fourth order B´ezier surfaces. The table below summarizes the relative error in these solutions as well as the condition number of the KKT conditions First Order System Original Formulation Subdivisions Error Rate κ Error Rate κ 1 6.00e-1 - 6.02e+3 6.43e-1 - 1.84e+3 . 2 4.03e-1 0.574 5.19e+3 2.85e-1 1.17 5.57e+3 4 1.18e-1 1.77 1.79e+4 1.88e-1 .600 8.21e+4 8 4.30e-2 2.13 1.28e+6 We do not have results for the first order system on an eight time subdivided surface since we ran out of memory. Although a better implementation would greatly alleviate the problem, this highlights that the symbolic manipulations are memory intensive. As a result, decomposing the problem into a first order system significantly increases the amount of memory required. In spite of this drawback, the method seems to converge.
686
J. Young
5 Conclusion We have presented an algorithm that takes a differential equation modeled in the strong form and automatically produces a solution. It accomplishes this by applying a first order system least squares finite element method. As our shape functions, we use surfaces composed of triangular B´ezier patches. We use these functions because their special properties allow us to combine all functions in the differential equation into a single B´ezier surface whose coefficients are simply polynomials of our unknown variables. This allows us to produce a linearly constrained polynomial program which, when solved, produces our solution. Certainly, this algorithm will not work in all cases. However, it provides a mechanism that produces a solution with a minimal amount of effort. Even in cases when the true solution contains features that are difficult to model, we can use this method to quickly produce a rough approximation to the true solution.
References 1. Awanou, G., Lai, M.J., Wenston, P.: The multivariate spline method for scattered data fitting and numerical solution of partial differential equations. In: Chen, G., Lai, M.J. (eds.) Wavelets and Splines: Athens 2005, pp. 24–74. Nashboro Press (2006) 2. Bangerth, W., Hartmann, R., Kanschat, G.: deal.II–a general purpose object-oriented finite element library. ACM Transactions on Mathematical Software 33(2) (August 2007) 3. Bochev, P.B., Gunzburger, M.D.: Finite element methods of least-squares type. SIAM Review 40(4), 789–837 (1998) 4. Bochev, P.B., Gunzburger, M.D.: Least-Squares: Finite Element Methods. Springer, Heidelberg (2009) 5. de Boor, C.: B-form basics. In: Farin, G. (ed.) Geometric Modeling: Algorithms and New Trends (1987) 6. Dular, P., Geuzaine, C., Henrotte, F., Legros, W.: A general environment for the treatment of discrete problems and its application to the finite element method. IEEE Transactions on Magnetics 34(5), 3395–3398 (1998) 7. Dupont, T., Hoffman, J., Johnson, C., Kirby, R., Larson, M., Logg, A., Scott, R.: The FEniCS project, PREPRINT 2003-21 (2003) 8. Farin, G.: Triangular Bernstein-B´ezier patches. Computed Aided Geometric Design 3, 83– 127 (1986) 9. Hecht, F., Pironneau, O., Hyaric, A.L., Ohtsuka, K.: Freefem++, 2nd edn., Version 2.24-2-2, www.freefem.org 10. Logg, A.: Automating the finite element method. Sixth Winter School in Computational Mathematics (March 2006) 11. Long, K.: Sundance 2.0 tutorial. Technical Report SAND2004-4793, Sandia National Laboratory (July 2004) 12. Prautzsch, H., Boehm, W., Paluszny, M.: B´ezier and B-Spline Techniques. Springer, Heidelberg (2002) 13. Schumaker, L.L.: Computing bivariate splines in scattered data fitting and the finite-element method. Numerical Algorithms 48, 237–260 (2008) 14. Zheng, J., Sederberg, T.W., Johnson, R.W.: Least squares methods for solving differential equations using B´ezier control points. Applied Numerical Mathematics (48), 237–252 (2004)
Zienkiewicz-Type Finite Element Applied to Fourth-Order Problems Andrey B. Andreev1 and Milena R. Racheva2 1
Department of Informatics Technical University of Gabrovo 5300 Gabrovo, Bulgaria 2 Department of Mathematics Technical University of Gabrovo 5300 Gabrovo, Bulgaria
Abstract. In general, a finite element method (FEM) for fourth-order problems requires trial and test functions belonging to subspaces of the Sobolev space H 2 (Ω), and this would require C 1 −elements, i.e., piecewise polynomials which are C 1 across interelement boundaries. In order to avoid this requirement we will use nonconforming Zienkiewicz-type (Z-type) triangle applied to biharmonic problem. We propose a new approach to prove the order of convergence by comparison to suitable modified Hermite triangular finite element. This method is more natural and it could be also applied to the corresponding fourth-order eigenvalue problem. Some computational aspects are discussed and numerical example is given.
1
Introduction
A motivation for avoiding the use of C 1 − finite elements is their very high dimension. Also, in many cases the feasible C 0 -elements for fourth-order problems give more simple and flexible computational schemes. Clearly, the effective choice of a method is complex depending on many aspects of the underlying problem. Our purpose is to illustrate a new convergence analysis of Z-type nonconforming triangular finite element. Let Ω be a bounded polygonal domain in R2 with boundary ∂Ω. Let also m H (Ω) be the usual m−th order Sobolev space on Ω with a norm · m,Ω and a seminorm |·|m,Ω . Throughout this paper (·, ·) denotes the L2 (Ω)−inner product. Consider the following fourth-order model problem for f ∈ L2 (Ω): Δ2 u = f in Ω, ∂u = 0 on ∂Ω, u= ∂ν
(1)
where ν = (ν1 , ν2 ) is the unit outer normal to ∂Ω and Δ is the standard Laplacian operator. The weak form of the problem (1) is: find u ∈ H02 (Ω) such that a(u, v) = (f, v), ∀ v ∈ H02 (Ω), I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 687–694, 2010. c Springer-Verlag Berlin Heidelberg 2010
(2)
688
A.B. Andreev and M.R. Racheva
Fig. 1.
where
a(u, v) =
2
Ω i,j=1
2 2 ∂ij u ∂ij v dx ∀ u, v ∈ H 2 (Ω).
We approximate the solution of (2) by the finite element method. Consider a family of triangulations τh of Ω which fulfill standard assumptions [6]. The partitions τh consist of triangles K and h is the mesh parameter. We introduce the FE spaces Vh related to the partitions τh . They are defined by means of Z-type triangular elements which will be introduced. It is well-known, that the Zienkiewicz triangle represents a reduced cubic Hermite finite element (see [4,5,9]) for which: – K is a 2-simplex with vertices ai , 1 ≤ i ≤ 3; – one possible set of degrees of freedom is (for any test function p): p(ai ), 1 ≤ i ≤ 3 and Dp(ai )(aj − ai ), 1 ≤ i, j ≤ 3, i = j. – PK ⊂ P3 (K) and dimPK = 9 (Fig. 1). Using directional derivatives, there is a variety of ways to define a finite element using P3 , two of which are shown in Fig. 1. Note that arrows represent directional derivatives along the indicated directions at the points. Some Z-type triangular elements having the same degrees of freedom can be also proposed by means of different ways (see, e.q. [9]). The following properties of Z-type element could be mentioned: (i) it is an incomplete and non-conforming C 0 −element for fourth-order problems; (ii) it uses the degrees of freedom just the same with the Zienkiewicz triangle, but its shape function space is different; (iii) it takes values of function and its derivatives at vertices as degrees of freedom and by this the global number of degrees of freedom is an optimal one; (iv) it is convergent (applied to fourthorder problems) in contrast to Zienkiewicz triangle, which is only convergent in parallel line condition and divergent in general grids.
Zienkiewicz-Type Finite Element Applied to Fourth-Order Problems
689
Then, the corresponding approximate variational problem of (2) is: find uh ∈ Vh ⊂ H02 (Ω) such that ah (uh , vh ) = (f, vh ), ∀ vh ∈ Vh ,
(3)
where
ah (uh , vh ) =
K∈τh
2 ∂ 2 uh ∂ 2 vh dx. K i,j=1 ∂xi ∂xj ∂xi ∂xj
For any 2-simplex K we define P(K) = P3 (K) = P2 (K) + span λ2i λj − λi λ2j , 1 ≤ i < j ≤ 3 , where λi , i = 1, 2, 3 are the barycentric coordinates of K. Then we can define the shape function space by P3 (K). Lemma 1. ([9], Lemma 1) The set of degrees of freedom is PK −unisolvent.
2
Main Result
In order to get convergence analysis of the considered Z-type element we also consider the Hermite triangle with a suitably modified 10-th degree of freedom, namely, integral value on K is taken instead of the value at the barycenter of K (Fig. 1). Let ΠK denote the interpolation operator corresponding to Z-type finite element partition τh and πh be the interpolation operator related to modified Hermite finite element. Our convergence analysis is based on the estimation of Πh v − πh v for any v ∈ H02 ∩ H 3 (Ω) on each element K ∈ τh [1]. For any v ∈ L2 (Ω) and v|K ∈ H m (K), ∀K ∈ τh we define the following mesh-dependent norm and seminorm [9]: vm,h =
K∈τh
1/2 v2m,K
,
|v|m,h =
1/2 |v|2m,K
.
K∈τh
Theorem 1. Let Vh be the FE space corresponding to the nonconforming Z-type element. Then there exists a constant C = C(Ω) > 0, independent of h and such that 2 inf hm |v − vh |m,h ≤ Ch3 v3,Ω , ∀v ∈ H02 ∩ H 3 (Ω). vh ∈Vh
m=0
Proof. First, we shall estimate Πh v − πh v on each finite element K ∈ τh . For this purpose we transform any triangle K to the reference element T = {(t1 , t2 ) : t1 , t2 ≥ 0, t1 + t2 ≤ 1}.
690
A.B. Andreev and M.R. Racheva
The shape functions of Z-type element on the reference element T are: 1 3 1 ϕ1 (t1 , t2 ) = − t1 t2 − t21 + t21 t2 + t1 t22 + t31 ; 2 2 2 ϕ2 (t1 , t2 ) = 2t1 t2 + 3t21 − 2t21 t2 − 2t1 t22 − 2t31 ; 1 1 1 ϕ3 (t1 , t2 ) = − t1 t2 − t21 t2 + t1 t22 ; 2 2 2 1 1 2 1 ϕ4 (t1 , t2 ) = − t1 t2 + t1 t2 − t1 t22 ; 2 2 2 ϕ5 (t1 , t2 ) = 2t1 t2 + 3t22 − 2t21 t2 − 2t1 t22 − 2t32 ; 1 1 3 ϕ6 (t1 , t2 ) = − t1 t2 − t22 + t21 t2 + t1 t22 + t32 ; 2 2 2 3 1 3 ϕ7 (t1 , t2 ) = −t2 + t1 t2 + 2t22 − t21 t2 − t1 t22 − t32 ; 2 2 2 ϕ8 (t1 , t2 ) = 1 − 4t1 t2 − 3t21 − 3t22 + 4t21 t2 + 4t1 t22 + 2t31 + 2t32 ; 3 3 1 ϕ9 (t1 , t2 ) = −t1 + 2t21 − t31 + t1 t2 − t21 t2 − t1 t22 . 2 2 2 By analogy, we obtain the shape functions of the Hermite element on T consecutively from a1 to a3 : ψ1 (t1 , t2 ) = 2t1 t2 − t21 − t21 t2 − 2t1 t22 + t31 ; ψ2 (t1 , t2 ) = −18t1t2 + 3t21 + 18t21 t2 + 18t1 t22 − 2t31 ; ψ3 (t1 , t2 ) = 2t1 t2 − 3t21 t2 − 2t1 x22 ; ψ4 (t1 , t2 ) = 2t1 t2 − 2t21 t2 − 3t1 t22 ; ψ5 (t1 , t2 ) = −18t1t2 + 3t22 + 18t21 t2 + 18t1 t22 − 2t32 ; ψ6 (t1 , t2 ) = 2t1 t2 − t22 − 2t21 t2 − t1 t22 + t32 ; ψ7 (t1 , t2 ) = −t2 + 4t1 t2 + 2t22 − 3t21 t2 − 4t1 t22 − t32 ; ψ8 (t1 , t2 ) = 1 − 24t1 t2 − 3t21 − 3t22 + 24t21 t2 + 24t1 t22 + 2t31 + 2t32 ; ψ9 (t1 , t2 ) = −t1 + 4t1 t2 + 2t21 − 4t21 t2 − 3t1 t22 − t31 ; ψ10 (t1 , t2 ) = 60t1 t2 − 60t21 t2 − 60t1 t22 . Using these shape functions we calculate: (Πh v − πh v)|T = (60t1 t2 − −
60t21 t2
−
60t1 t22 )
v(a1 ) + v(a2 ) + v(a3 ) 3
∂(a1 −a2 ) v(a1 ) + ∂(a1 −a3 ) v(a1 ) ∂(a2 −a1 ) v(a2 ) + ∂(a2 −a3 ) v(a2 ) − 24 24
Zienkiewicz-Type Finite Element Applied to Fourth-Order Problems
∂(a −a ) v(a3 ) + ∂(a3 −a2 ) v(a3 ) 1 − 3 1 − 24 meas T = 60t1 t2 (1 − t1 − t2 )ET (v) ≤ where
691
v(t) dt T
20 |ET (v)|, 9
(4)
⎡
⎤ 3 3 1 1 ⎢1 ⎥ ET (v) = ⎣ v(ai ) − ∂(ai −aj ) v(ai ) − v(t) dt⎦ 3 i=1 24 i,j=1 meas T T i=j
is the error functional of quadrature formula and ET (v) = 0 for any v ∈ P2 (T ). Let FK be the invertible affine mapping which maps the reference finite element T on to the finite element K: FK : T → K, t → x = FK (t) = BK (t) + bK with BK ∈ R2×2 , bK ∈ R2×1 , and where det BK = O(h2 ). Therefore, from the Bramble-Hilbert lemma [6] there exists a constant C such that −1/2 |v|3,K . (5) |ET (v)| ≤ C|v|3,T ≤ Ch3 (det BK ) Thus, using that Πh v − πh v0,K = (det BK )1/2 Πh v − πh v0,T , from (4) and (5) it follows that Πh v − πh v0,h =
1/2 Πh v −
πh v20,K
≤ Ch3 v3,Ω .
(6)
K∈τh
Applying explicit calculations, we obtain: |∂i (Πh v − πh v) ||T ≤ Ch2 |v|3,K , |∂ij (Πh v − πh v) ||T ≤ Ch|v|3,K , i, j = 1, 2. These inequalities and (6) give Πh v − πh vm,h ≤ Ch3−m v3,Ω , m = 0, 1, 2.
(7)
Finally, we use the fact, that the order of v − πh vm,h , m = 0, 1, 2 is an optimal one for the cubic Hermite polynomials [5,6]. From (7) and applying v − Πh vm,h ≤ v − πh vm,h + Πh v − πh vm,h , from the FE interpolation theory we prove the theorem. Remark 1. However, for m = 0, 1 we have Πh v − πh vm,h = Πh v − πh vm,Ω .
692
A.B. Andreev and M.R. Racheva
Theorem 2. Let u ∈ H 3 (Ω) ∩ H02 (Ω) and uh ∈ Vh be the solutions of the problems (2) and (3), respectively. Then there exists a constant C = C(Ω) > 0, independent of h and such that u − uh 2,h ≤ Chu3,Ω .
(8)
Proof. By Theorem 1 for any v ∈ H02 (Ω) lim inf v − vh 2,h = 0.
h→0 vh ∈Vh
Therefore limh→0 u − uh 2,h = 0. Having in mind that Vh is constructed by Z-type nonconforming elements, we have (see also [9], Lemma 3): |ah (v, vh ) − (Δ2 v, vh )| ≤ Ch|v|3,Ω |vh |2,h , ∀v ∈ H 3 (Ω), ∀vh ∈ Vh .
(9)
For the solutions u and uh we apply the second Strang lemma (see [6], Theorem 31.1): ⎛ ⎞ |a (u, v ) − (f, v )| h h h ⎠ u − uh 2,h ≤ C ⎝ inf u − vh 2,h + sup . vh ∈Vh vh 2,h vh ∈Vh vh =0
Then, the main estimate (8) follows from (9) and the result of Theorem 1. We can apply previous results to the corresponding fourth-order eigenvalue problem (EVP) [3]: Δ2 w = λw in Ω ⊂ R2 , subject to homogeneous boundary conditions. The variational EVP takes the form a(w, v) = λ(w, v), ∀v ∈ V ⊂ H02 (Ω).
(10)
The finite dimensional analogue of the problem (10) by means of Z-type nonconforming elements is (see [8]): ah (wh , vh ) = λh (wh , vh ),
∀vh ∈ Vh .
(11)
It is to be noted here that the sesquilinear form ah is uniformly elliptic, i.e. (α > 0): αΔu20,Ω ≤ ah (u, u), ∀u ∈ H 2 (Ω). Theorem 3. (see [2]) Let (λ, w) and (λh , wh ) be eigensolutions of (10) and (11), (k) respectively. Then λh −→ λ(k) (h → 0), k = 1, . . . , Nh and for any sequence of (k) (k) normalized eigenfunctions wh ∈ Vh , wh 0,Ω = 1, there exist eigenfunctions w ∈ H02 (Ω) such that (k)
wh − w2,h −→ 0 (h → 0).
Zienkiewicz-Type Finite Element Applied to Fourth-Order Problems
693
Moreover, if w ∈ H 3 (Ω) ∩ H02 (Ω), then (k)
wh − w2,h ≤ Chw3,Ω , (k)
|λh − λ| ≤ Ch2 w23,Ω .
3
Numerical Results
To illustrate our theoretical results we report in this example on related twodimensional biharmonic eigenvalue problem. Let Ω be a square domain: Ω: −
π π < xi < , 2 2
i = 1, 2.
Consider the following model problem: Δ2 u = λu
in Ω,
∂u = 0 on ∂Ω. ∂ν For this problem the exact eigenvalues are not known. We use their lower and upper bounds obtained by Weinstein and Stenges [10] (see also Ishihara [7]). In Table 1 the results from our numerical experiments for the first four eigenvalue are given. The domain is divided into uniform mesh with 2n2 issosceles triangular Z-type elements and the mesh parameter is h = π/n, n = 5, 6, 7, 8. As it can be seen, the numerical implementation confirm the convergence asserted by Theorem 1 - Theorem 3. The proposed Z-type elements are eligible especially for computing eigenvalues. And, for the approximate eigenvalues and eigenfunctions obtained by means of the proposed elements some postprocessing technique could be easily applied for the sake of improving both the rate of convergence to the exact solution and the properties of the approximate eigenfunctions [2]. u=
Table 1. Eigenvalues computed by means of Z-type finite elements h
λ1
λ2
λ3
λ4
π/5
14.8023
64.1267
70.4634
162.3236
π/6
14.0465
60.3152
65.3925
156.5718
π/7
13.6470
58.6170
62.5207
153.7372
π/8
13.4721
57.7318
61.3971
152.4870
bounds: lower 13.2820 upper 13.3842
55.2400 56.5610
55.2400 56.5610
120.0070 124.0740
694
A.B. Andreev and M.R. Racheva
Acknowledgement This work is partially supported by the Bulgarian Ministry of Science under grant VU-MI 202/2006.
References 1. Andreev, A.B., Racheva, M.R.: Optimal Order FEM for a Coupled Eigenvalue Problem on 2-D Overlapping Domains. In: Margenov, S., Vulkov, L.G., Wa´sniewski, J. (eds.) NAA 2008. LNCS, vol. 5434, pp. 151–158. Springer, Heidelberg (2009) 2. Andreev, A.B., Racheva, M.R.: Acceleration of Convergence for eigenpairs Approximated by means of nonconforming Finite Element Methods. In: Lirkov, I., Margenov, S., Wa´sniewski, J. (eds.) LSSC 2009. LNCS, vol. 5910, pp. 695–702. Springer, Heidelberg (2010) 3. Babuska, I., Osborn, J.: Eigenvalue Problems. In: Ciarlet, P.G., Lions, J.L. (eds.) Handbook of Numerical Analysis, vol. II, pp. 641–787. North-Holland, Amsterdam (1991) 4. Bazaley, G.P., Cheung, Y.K., Irons, B.M., Zienkiewicz, O.C.: Triangular Elements in Plate Bending - Conforming and Non-conforming Solutions. In: Proceedings of the Conference on Matrix Methods in Structural Mechanics, pp. 547–576. Wright Patterson A.F. Base, Ohio (1965) 5. Brenner, S., Scott, L.R.: The Mathematical Theory for Finite Element Methods. Springer, New York (1992) 6. Ciarlet, P.: Basic Error Estimates for the FEM, vol. 2, pp. 17–351. Elsevier, Amsterdam (1991) 7. Ishihara, K.: A mixed finite element method for the biharmonic eigenvalue problem of plate bending. Publ. Res. Institute of Mathematical Sciences 14, 399–414 (1978) 8. Rannacher, R.: Non-conforming Finite Element Methods for eigenvalue Problems in Linear Plate Theory. Numer. Math. 33, 23–42 (1979) 9. Wang, M., Shi, Z., Xu, J.: A New Class of Zienkiewicz-type Non-conforming Element in Any Dimensions. Numer. Math. 106, 335–347 (2007) 10. Weinstein, A., Stenger, W.: Methods of intermediate problems for eigenvalues, theory and applications. Academic Press, London (1972)
Acceleration of Convergence for Eigenpairs Approximated by Means of Non-conforming Finite Element Methods Andrey B. Andreev1 and Milena R. Racheva2 1 Department of Informatics Technical University of Gabrovo 5300 Gabrovo, Bulgaria 2 Department of Mathematics Technical University of Gabrovo 5300 Gabrovo, Bulgaria
Abstract. Non-conforming finite elements are commonly used for approximating fourth-order eigenvalue problems in linear plate theory. We derive error estimates for eigenpairs by nonconforming Zienkiewicz-type triangular elements. We also propose a simple postprocessing method which improves the order of convergence of finite element eigenpairs. Thus, an a posteriori analysis is presented by means of different triangular elements. The technique applied is general. It is illustrated and discussed by numerical results.
1
Introduction
This study deals with a procedure for accelerating the convergence of finite element approximations of the eigenpairs for fourth-order problem. Zienkiewicztype (Z-type) two-dimensional nonconforming triangular element is applied. Then we can use conforming elements for more simple elliptic problem. Let Ω ⊂ R2 be a bounded polygonal domain with boundary ∂Ω. Let also m H (Ω) be the usual m−th order Sobolev space on Ω with a norm · m,Ω and a seminorm | · |m,Ω . Consider a thin elastic plate corresponding to the domain Ω. If the material is homogeneous and isotropic, the question about the possible small vibrations of the plate leads to the basic eigenvalue problem: Δ2 u = λu in Ω, ∂u = 0 on ∂Ω, u= ∂ν
(1)
where ν = (ν1 , ν2 ) is the unit outer normal to ∂Ω and Δ is the standard Laplacian operator. The variational eigenvalue problem (EVP) corresponding to (1) is: find (λ, u) ∈ R × H02 (Ω) such that a(u, v) = λ(u, v), ∀ v ∈ V ⊂ H02 (Ω), I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 695–702, 2010. c Springer-Verlag Berlin Heidelberg 2010
(2)
696
A.B. Andreev and M.R. Racheva
where
a(u, v) =
2
Ω i,j=1
2 2 ∂ij u ∂ij v dx, ∀ u, v ∈ H 2 (Ω)
and (·, ·) denotes the L2 (Ω)−inner product. Obviously, the bilinear form a(·, ·) is symmetric and V −elliptic. Moreover, the inclusion of V in L2 (Ω) is compact. Therefore, problem (2) has a countable infinite set of eigenvalues λj , all being strictly positive and having finite multiplicity, without a finite accumulation point (see, e.g. [9]). The corresponding eigenfunctions uj can be chosen to be orthonormal in L2 (Ω). They constitute a Hilbert basis for V . We shall approximate the solution of (2) bythe finite element method (FEM). Consider a family of regular partitions τh = i Ki of Ω consisting of triangular finite elements Ki which fulfill standard assumptions (see [7], Chapter 3). If hi is diameter of Ki , the h = maxi hi is the finite element parameter corresponding to any partition τh . With a partition τh we associate a finite dimensional subspace Vh of V by means of Z-type triangular elements such that every FE is an incomplete cubic polynomial (see [1,6,10]). For a test function p one possible set of degree of freedom is: = j, p(ai ), 1 ≤ i ≤ 3 and Dp(ai )(aj − ai ), 1 ≤ i, j ≤ 3, i where ai , i = 1, 2, 3 are the vertices of the element K. It is clear that PK ⊂ P3 (K) and dimPK = 9 (Fig. 1). We determine the approximate eigenpairs (λh , uh ) by FEM using nonconforming Z-type element. Then, the EVP corresponding to (2) is: find (λh , uh ) ∈ R × Vh such that ah (uh , vh ) = λh (uh , vh ), ∀ vh ∈ Vh ,
Fig. 1.
(3)
Acceleration of Convergence for Eigenpairs Approximated
where ah (uh , vh ) =
K∈τh
2
2
K i,j=1
697
2 2 ∂ij uh ∂ij vh dx.
Convergence Analysis
First, we introduce the mesh-dependent norm and seminorm [1,10]. For any v ∈ L2 (Ω) and v|K ∈ H m (K), ∀K ∈ τh we define: vm,h =
1/2
v2m,K
,
|v|m,h =
K∈τh
1/2 |v|2m,K
.
K∈τh
We shall use the essential property of the FE space Vh (see [1], Theorem 1): if Vh corresponds to nonconforming Z-type element, then there exists a constant C = C(Ω) > 0, independent of h and such that inf
vh ∈Vh
2
hm |v − vh |m,h ≤ Ch3 v3,Ω , ∀v ∈ H02 ∩ H 3 (Ω).
(4)
m=0
Let us also define the elliptic projection Rh ∈ L(V, Vh ) by ah (Rh u − u, vh ) = 0, ∀ vh ∈ Vh .
(5)
Thus, Rh is an operator of orthonormal projection from V over Vh with respect to the scalar product ah (·, ·). The next theorem gives error estimate of the eigenvalues using nonconforming Z-type finite element. Theorem 1. Let (λ, u) and (λh , uh ) be eigensolutions of (2) and (3), respectively. Then for any simple eigenvalue λm (m ≥ 1), λm,h −→ λm (h → 0). Moreover, if the corresponding eigenfunction um belongs to H02 (Ω)∩H 3 (Ω), then |λm − λm,h | ≤ Ch2 um 23,Ω .
(6)
Proof. From (4) and (5) it is easy to see that for any u ∈ V u − Rh u2,h ≤ C inf u − vh 2,h . vh ∈Vh
(7)
Now, for any integer m ≥ 1 we estimate the difference λm −λm,h . For this purpose we introduce the space Vm which is generated by the first m (exact) eigenfunctions {ui }, 1 ≤ i ≤ m. The approximate eigenvalue λm,h can be characterized as various extrema of the Rayleigh quotient [5]. Then (dim(Rh Vm ) = m): λm,h ≤
max
0 =vh ∈ Rh Vm
ah (vh , vh ) = vh 20,Ω
max
v∈ Vm v0,Ω =1
ah (Rh v, Rh v) . Rh v20,Ω
698
A.B. Andreev and M.R. Racheva
Since Rh v is an orthogonal projection over Vh w.r.t. ah (·, ·), we have: ah (Rh v, Rh v) ≤ ah (v, v), therefore λm,h ≤
sup vh ∈Vm v0,Ω =1
ah (v, v) ≤ λm Rh v20,Ω
sup v∈ Vm v0,Ω =1
1 . Rh v20,Ω
(8)
In the last inequality we suppose that ah (v, v) = a(v, v). This is true if for example Vm ⊂ H 2 (Ω). Let us also emphasize that vh s,h = vh s,Ω , s = 0, 1 for all vh ∈Vm . For this, letus consider a function v ∈ Vm such that v0,Ω = 1. m m Then v = i=1 αi ui with i=1 α2i = 1. Using that v is normalized w.r.t. L2 (Ω) we obtain: 1 − Rh v20,Ω = (v − Rh v, v + Rh v) = 2(v − Rh v, v) − v − Rh v20,Ω , or Rh v20,Ω ≥ 1 − 2(v − Rh v, v).
(9)
On the other hand, from (2) we derive (v − Rh v, v) =
m
αi (v − Rh v, ui ) =
i=1
m αi i=1
λi
ah (v − Rh v, ui ).
Applying equality (5) we get (v − Rh v, v) =
m αi i=1
λi
ah (v − Rh v, ui − Rh v).
Next, as ah (·, ·) is continuous, i.e. ah (u, v) ≤ M u2,hv2,h , we obtain: m α i (v − Rh v, v) ≤ M v − Rh v2,h (ui − Rh ui ) . λ i i=1 2,h
By the Cauchy-Schwarz inequality we estimate: m 1/2 m 1/2 m α2 αi i (ui − Rh ui )2,h ≤ ui − Rh ui 22,h 2 λ λ i i i=1 i=1 i=1 √ m ≤ λ1
sup v∈Vm v0,Ω =1
Combining (10), (9) and (8), we get ⎛ λm,h ≤ ⎝1 + C
v − Rh v2,h .
⎞ sup
v∈Vm v0,Ω =1
v − Rh v22,h ⎠ λm .
(10)
Acceleration of Convergence for Eigenpairs Approximated
699
We rewrite the last result as |λm,h − λm | ≤ C(λ)
v −
sup v∈Vm v0,Ω =1
Rh v22,h
≤C
m
ui − Rh ui 22,h .
i=1
Now, let us suppose that Vm ⊂ H 3 (Ω) ∩ H02 (Ω) and ui ∈ H 3 (Ω), i = 1, . . . , m. Applying (7) and the approximation property (4) to the last inequality we prove the estimate (6). As a corollary of the considerations above, if λm is a simple eigenvalue, for the corresponding eigenfunctions we have: m 1/2 2 ui − Rh ui 2,h . um,h − um 2,h ≤ C sup v − Rh v2,h ≤ C v∈Vm v0,Ω =1
i=1
Under Vm ⊂ H 3 (Ω) ∩ H02 (Ω) we get
um,h − um 0,h ≤ Ch
m
1/2 ui 23,h
.
(11)
i=1
3
Superconvergent Postprocessing Technique
Procedures for accelerating the convergence of FE approximations of the eigenpairs are developed by the authors for different problems (see, e.g. [2,3,4,8]). Herein we prove that these ideas could be applied to biharmonic eigenvalue problem approximated by nonconforming finite elements. Now we present a relatively simple postprocessing method that gives better accuracy for eigen- values. It is based on a postprocessing technique that involves an additional solving of a source problem on an augmented FE space. Let uh be any approximate eigenfunction of (3) with uh 0,Ω = 1. Since the finite element solution uh obtained by nonconforming Z-type element is already known, we consider the following variational elliptic problem: a( u, v) = (uh , v), ∀v ∈ V. We define the number
= λ
(12)
1 . ( u, uh )
Theorem 2. Let the FE Vh be constructed by Z-type nonconforming triangular elements. If (λ, u) is an eigenpair of problem (2), u ∈ H 3 (Ω) and (λh , uh ) is the corresponding solution of (3) and eigenfunctions are being normalized u0,Ω = uh 0,Ω = 1, then
= O(u − uh 2 ). |λ − λ| 0,Ω If in addition the finite element partitions are regular, then the following superconvergent estimate holds:
≤ Ch6 u2 . |λ − λ| 3,Ω
(13)
700
A.B. Andreev and M.R. Racheva
The proof may proceed similarly as the same result obtained for other problems in [2,3,8]. The superconvergent estimate is very useful from a theoretical point of view. Nevertheless, in practice the exact solution of the source problem (12) is hardly ever available. So, the finite element solution, which corresponds to (12) is: ah ( uh , vh ) = (uh , vh ), ∀vh ∈ V h ,
(14)
where V h will be precised. Namely, (1) (i) V h is constructed by modified Hermite element (Fig. 1). It uses integral value over any element K as a degree of freedom instead of the value of shape function at the center of gravity of K. This is also a nonconforming finite element. (2) (ii) V h is a finite element space obtained by using conforming Bell’s triangle (Fig. 1) (see, also [7]). We point out herein, Bell’s triangle is optimal among triangular polygonal finite elements of class C 1 , since dimPK ≥ 18 for such kind of finite elements. We denote 1
h = λ , ( uh , uh )
h are the solutions of (3) and (14), respectively. where uh and u Theorem 3. Let the assumptions of Theorem2 be fulfilled and let us use the (s) finite element subspaces V h , s = 1, 2. Then the following estimate holds:
h | ≤ Ch2(s+1) , s = 1, 2. |λ − λ
(15)
Proof. First we express: 1 1 − = ( u, uh ) − ( uh , uh ) = a( u, u
) − ah ( uh , u
h )
λ λh = ah ( u−u
h , u
) + ah ( uh , u
) − ah ( uh , u
h ) = ah ( u−u
h , u
) − ah ( u−u
h , u
h ), consequently 1 1 − = ah ( u−u
h , u
−u
h ).
λ
h λ The continuity of the ah −form gives:
−λ
h | ≤ C |λ u−u
h 22,h . By similar arguments as in Theorem1 and by standard assumptions for the smoothness of u
we can derive the estimate as (11) (see, also [7]): u−u
h 2,h ≤ Chs+1 u2s+1,Ω , s = 1, 2. The superconvergent estimate (15) follows from the last inequality, (13) and triangle inequality.
Acceleration of Convergence for Eigenpairs Approximated
4
701
Numerical Results
Consider the following model problem: Δ2 u = λu u=
in Ω,
∂u = 0 on ∂Ω, ∂ν
where Ω be a square domain: Ω: −
π π < xi < , 2 2
i = 1, 2.
For this problem the only known concerning the exact eigenvalues are their lower and upper bounds obtained by Weinstein and Stenges [11]. In Table 1 the results from numerical experiments for the first four eigenvalue are given. Regardless of the fact, that the second and the third eigenvalue are equal, the proposed postprocessing technique is put into effect for both of them.Using uniform mesh, the domain is divided into 2n2 issosceles triangular Z-type elements. So that, the mesh parameter is π/n, n = 4, 6, 8. As it is shown in Table 1, the postprocessing takes effect especially on a coarse grid. Postprocessing is implemented putting into use Hermite elements (Fig. 1). It is to be noted here that if one is mainly interested in computing of eigenvalues, he can successfully apply Hermite elements, because they completely ensure an improvement of the approximate values. However, concerning eigenfunctions the use of Bell’s finite elements for the postprocessing procedure gives not only a better improvement compared to the use of Hermite elements; it also improves the smoothness of approximate eigenfunctions. Table 1. Eigenvalues computed by means of Z-type finite elements (FEM) and their improvements as a result of postprocessing procedure (PP) by means of Hermite finite elements h
λ1
λ2
λ3
λ4
π/4
FEM PP
15.0780 14.8992
67.1107 63.2144
75.2068 71.7732
168.1998 153.0853
π/6
FEM PP
14.0465 13.9459
60.3152 58.7350
65.3925 63.3821
156.5718 149.8718
π/8
FEM PP
13.4721 13.4152
57.7318 56.1393
61.3971 58.9924
152.4870 148.6162
bounds: lower 13.2820 upper 13.3842
55.2400 56.5610
55.2400 56.5610
120.0070 124.0740
702
A.B. Andreev and M.R. Racheva
Acknowledgement This work is partially supported by the Bulgarian Ministry of Science under grant D002–147/2008.
References 1. Andreev, A.B., Racheva, M.R.: Zienkiewicz-type Finite Element Applied to Fourthorder Problems. In: Lirkov, I., Margenov, S., Wa´sniewski, J. (eds.) LSSC 2009. LNCS, vol. 5910, pp. 687–694. Springer, Heidelberg (2010) 2. Andreev, A.B., Racheva, M.R.: On the Postprocessing Technique for Eigenvalue Problems. In: Dimov, I.T., Lirkov, I., Margenov, S., Zlatev, Z. (eds.) NMA 2002. LNCS, vol. 2542, pp. 363–371. Springer, Heidelberg (2003) 3. Andreev, A.B., Racheva, M.R.: Superconvergent Finite Element Postprocessing for Eigenvalue Problems with Nonlocal Boundary Conditions. In: Lirkov, I., Margenov, S., Wa´sniewski, J. (eds.) LSSC 2007. LNCS, vol. 4818, pp. 645–653. Springer, Heidelberg (2008) 4. Andreev, A.B., Lazarov, R.D., Racheva, M.R.: Postprocessing and Higher Order Convergence of the Mixed Finite Element Approximations of Biharmonic Eigenvalue Problems. JCAM 182, 333–349 (2005) 5. Babuska, I., Osborn, J.: Eigenvalue Problems. In: Ciarlet, P.G., Lions, J.L. (eds.) Handbook of Numerical Analysis, vol. II, pp. 641–787. North-Holland, Amsterdam (1991) 6. Bazaley, G.P., Cheung, Y.K., Irons, B.M., Zienkiewicz, O.C.: Triangular Elements in Plate Bending - Conforming and Non-conforming Solutions. In: Proceedings of the Conference on Matrix Methods in Structural Mechanics, pp. 547–576. Wright Patterson A.F. Base, Ohio (1965) 7. Ciarlet, P.: Basic Error Estimates for the FEM, vol. 2, pp. 17–351. Elsevier, Amsterdam (1991) 8. Racheva, M.R., Andreev, A.B.: Superconvergence Postprocessing for Eigenvalues. Comp. Meth. in Appl. Math. 2(2), 171–185 (2002) 9. Raviart, P.A., Thomas, J.M.: Introduction a l’Analyse Numerique des Equations aux Derivees Partielles, Masson Paris (1988) 10. Wang, M., Shi, Z., Xu, J.: A New Class of Zienkiewicz-type Non-conforming Element in Any Dimensions. Numer. Math. 106, 335–347 (2007) 11. Weinstein, A., Stenger, W.: Methods of intermediate problems for eigenvalues, theory and applications. Academic Press, London (1972)
A Two-Grid Method on Layer-Adapted Meshes for a Semilinear 2D Reaction-Diffusion Problem Ivanka T. Angelova and Lubin G. Vulkov Faculty of Natural Science and Education University of Rousse, 8 Studentska str., Rousse 7017, Bulgaria
[email protected],
[email protected]
Abstract. A singularly perturbed semilinear reaction-diffusion equation, posed in the unit square, is discetizied by a two-grid scheme on layer-adapted meshes. In the first step, the nonlinear problem is discretized on coarse grid. In the second step, the problem is discretized on a fine grid and linearized around the interpolation of the computed solution on the first step. We show theoretically and numerically that the global error on Shishkin or Bakhvalov mesh is the same as would have been obtained if the nonlinear problem had been solved directly on the fine grid.
1
Introduction
Reaction-convection-diffusion equations (where reaction or convection terms, but not both, may be absent) appear in various applications [11,12]. Since 2000 there has been a surge of interest in numerical methods for such problems. Several papers prove convergence results that are uniform in the singular parameters. Most of the results are based on the fitted (layer adapted) mesh methods (FMM). The layer-adapted meshes allow resolution of the structure of the layer [10,11,12] and provide uniform convergence in the singular perturbation parameter(s) even when they are combined with classical discretizations [10,11,12]. Besides Bakhvalov meshes (B-mesh) [3] and Shishkin’s piecewise equidistant meshes (S-mesh) [10,11,12], there are grids that are based on equidistribution [10,12], Bakhvalov-Shishkin meshes (B&S-mesh) and Vulanovic’s improved Shishkin mesh [10,11,12]. The choice of the mesh plays a key role for the uniform convergence of the numerical methods for nonlinear problems, see [9,15]. Interest in the construction of high-order approximation to singularly perturbed problems is currently at high level. But such constructions often lead to extension of the stencil or to discretizations that are not inverse monotone [12]. Another way to increase the accuracy of the numerical solution to singularly problems is the use of Richardson extrapolation. However the Richardson procedure requires solution of systems of nonlinear algebraic equations on each of the nested meshes, see [13]. For large nonlinear problems (2D, 3D scalar equations or systems of nonlinear ordinary and partial differential equations) the question for effective solution of the nonlinear algebraic systems of equations that arise I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 703–710, 2010. c Springer-Verlag Berlin Heidelberg 2010
704
I.T. Angelova and L.G. Vulkov
after the discretizations is also very important. Two-grid method (TGM) is a discretization technique for nonlinear equations based on two grids of different sizes and in the same time provides fast and economical algorithms for solution oft he discrete nonlinear systems. The idea is to use a coarse-grid space to produce a rough approximation of the solution of nonlinear problems, and then use it as the initial guess for one Newton-like iteration on the fine grid. The method involves a nonlinear solve on the coarse grid of diameter H and a linear solve on the fine grid of diameter h << H. Two-grid method was first introduced in [2] and [18] for nonlinear elliptic equations. Later on, two-grid methods was further investigated by many authors for different problems in physics, mechanics, engineering. In this paper we will consider FMM combined with TGM to solve the model singularly perturbed semilinear elliptic problem: −ε2 u + f (x, y, u) = 0 in Ω = (0, 1) × (0, 1),
(1)
u = g(x, y) on ∂Ω.
(2)
Our main assumption is that f has partial derivative fu (x, y, u) > c20 > 0 for all (x, y, u) ∈ Ω × R1
(3)
for some constant c0 > 0. Under this condition the reduced problem has a unique solution u0 which is sufficiently smooth in Ω. N. Kopteva considered uniform convergence for (1), (2) under weaker assumptions [12]. Numerical experiments show that our algorithms still work for these more general problems. The paper is organized as follows. In Section 2 we recall some layer-adapted meshes and discuss uniform convergent numerical methods for (1)-(3) in the linear case. In Section 3 we prove uniform convergence for (1)-(3) via a Newton’s linearization procedure. Section 4 provides two-grid algorithms while Section 5 presents results of numerical experiments for the proposed algorithms. In one-dimension two-grid algorithms for singularly perturbed problems was constructed and studied in [1,16,17]. The present results can be extended to other adapted meshes, for example the graded meshes of [5].
2
Meshes and Linear Problem
On w introduce mesh w h = whx × w hy whx = {xi , 0 ≤ i ≤ Nx ; x0 = 0, xNx = 1; hxi = xi+1 − xi }, w hy = {yj , 0 ≤ j ≤ Ny ; y0 = 0, yNx = 1; hyj = yj+1 − yj }.
(4)
To discretize problem (1)-(2) in the linear case of f (x, y, u) = c(x, y)u − d(x, y) we use the standard central difference scheme − ε2 (Uxx + Uyy ) + f (x, y, U ) = 0, (x, y) ≡ (xi , yj ) ∈ wh , U = g, on the boundary ∂wh ,
(5)
Two-Grid Methods on Layer-Adapted Meshes
705
where Ux = Ux,i = (U (xi , yj ) − U (xi−1 , yj ))/hxi , Ux = Ux,i = Ux,i+1 , Uy = Uy,j = (U (xi , yj ) − U (xi , yj−1 ))/hyj , Uy = Uy,j = Uy,j+1 , Ux = Ux ,i = (U (xi+1 , yj ) − U (xi , yj ))/xi , Uy = Uy,j = (U (xi , yj+1 ) − U (xi , yj ))/yj , xi = (hxi + hxi+1 )/2, hx0 = hx1 /2, xNx = hxNx /2, yj = (hyj + hyj+1 )/2, hy0 = hy1 /2, yNy = hyNy /2, Uxx = Uxx,i = (Uxi − Uxi )/xi , Uyy = Uyy,j = (Uy,j − Uy,j )/yj . For the reaction-diffusion problem (1)-(2), a layer-adapted mesh from [11,12] is formed in the following manner. We divide each of the interval w x = [0, 1] and w y = [0, 1] into three parts [0, σx ], [σx , 1 − σx ], [1 − σx , 1] and [0, σy ], [σy , 1 − σy ], [1 − σy , 1], respectively. Assuming that Nx , Ny are divisible by 4, in the parts [0, σx ], [1 − σx , 1] and [0, σy ], [1 − σy , 1] we allocate Nx /2 + 1 and Ny /2 + 1 mesh points, respectively. Points σx , (1 − σx ) and σy , (1 − σy ) correspond to transition to the boundary layers. We consider meshes w hx and w xy which are equidistant in [xNx /4 , x3Nx /4 ] and [yNy /4 , y3Ny /4 ] but graded in [0, xNx /4 ], [x3Nx /4 , 1] and [0, yNy /4 ], [y3Ny /4 , 1]. On [0, xNx /4 ], [x3Nx/4 , 1] and [0, yNy /4 ], [y3Ny /4 , 1] let our mesh be given by a mesh generating function φ with φ(0) = 0 and φ(1/4) = 1 which is supposed to be continuous, monotonically increasing, and piecewise continuously differentiable. Then our mesh is defined by ⎧ ξi = i/Nx , i = 0, . . . , Nx /4; ⎨ σx φ(ξi ), i = Nx /4 + 1, . . . , 3Nx /4 − 1; xi = ihh ⎩ 1 − σx (1 − φ(ξi )), ξi = (i − 3Nx /4)/Nx, i = 3Nx /4 + 1, . . . , Nx , ⎧ ξj = j/Ny , j = 0, . . . , Ny /4; ⎨ σy φ(ξj ), j = Ny /4 + 1, . . . , 3Ny /4 − 1; yj = jhy ⎩ 1 − σy (1 − φ(ξj )), ξj = (j − 3Ny /4)/Ny , j = 3Ny /4 + 1, . . . , Ny , hx = 2(1 − 2σx )Nx−1 , hy = 2(1 − 2σy )Ny−1 . We also assume that φ does not decrease. This condition implies that hxi ≤ hxi+1 , i = 1, . . . , Nx /4 − 1, hxi ≥ hxi+1 , i = 3Nx /4 + 1, . . . , Nx − 1, hyj ≤ hyj+1 , j = 1, . . . , Ny /4 − 1, hyj ≥ hyj+1 , j = 3Ny /4 + 1, . . . , Ny − 1, For S-mesh (see Fig.1 a) we choose the transition points σx , 1 − σx and σy , 1 − σy −1 −1 as in [11] σx = min{4−1 , c−1 , c0 ε ln Ny }. If σx = σy = 0 ε ln Nx }, σy = min{4 −1 −1 1/4 , then Nx , Ny are very small relative to ε. In this case, the difference scheme (5) can be analyzed using standard techniques. We therefore assume −1 that σx = c−1 0 ε ln Nx , σy = c0 ε ln Ny .
I.T. Angelova and L.G. Vulkov
1
1
0.9
0.9
0.8
0.8
0.7
0.7
0.6
0.6
0.5
0.5
y
y
706
0.4
0.4
0.3
0.3
0.2
0.2
0.1
0.1
0
0
0.1
0.2
0.3
0.4
0.5 x
0.6
0.7
0.8
0.9
1
0
0
0.1
0.2
0.3
0.4
a)
0.5 x
0.6
0.7
0.8
0.9
1
b) Fig. 1. a) S-mesh. b) B-mesh.
Consider the mesh generating function φ in the form φ(ξ) = 4ξ. In this case the meshes w hx and w hy are piecewise equidistant with step sizes −1 Nx−1 < hx < 2Nx−1 , hx = hx (ε) = 4c−1 0 εNx ln Nx , −1 Ny−1 < hy < 2Ny−1 , hy = hy (ε) = 4c−1 0 εNy ln Ny .
In the linear case at a some requirements to input data c, f that provide sufficient smoothness of the solution u it was proved in [4] that the difference scheme [6] on the S-mesh converges ε - uniformly to the solution of (1) - (3) and the estimate holds (6) U − uwh ,∞ ≤ CN −2 ln2 N, N = min{Nx , Ny }. The constant C is independent of ε and Nx , Ny . For B-mesh (see Fig.1 b) one chooses the transition points with −1 −1 σx = c−1 ), σy = c−1 ) and the mesh generating function φ in the 0 ε ln(ε 0 ε ln(ε form φ(ξ) = ln[1 − 4(1 − ε)ξ]/ ln ε. For the difference scheme (5) on B-mesh the estimate holds [4,12] U − uwh ,∞ ≤ CN −2 , N = min{Nx , Ny }.
3
(7)
Uniform Convergence for the Nonlinear Problem
From the theory of Holder space solutions of nonlinear elliptic problems [7] and results of Han&Kellog [8] for linear problems, we can prove that if f has second order derivatives with respect to u that are functions of C 2,α (Ω) and g ∈ C 4,α (S), where S is any side of the rectangle Ω , then u ∈ C 4,α (Ω). On this base we will develop a constructive proof for uniform convergence of the difference scheme (5) via the Newton’s linearization of problem (1)-(3): −ε2 u(m+1) + fu (x, y, u(m) )u(m+1) = −f (x, y, u(m) ) + fu (x, y, u(m) )u(m) , (8) u(m+1) |∂Ω = g(x, y), m = 0, 1, 2, · · · •
Two-Grid Methods on Layer-Adapted Meshes
707
The initial guess u(0) (x, y) one chooses from mathematical or physical considerations, but in the next section it plays a key role at the construction of the two-grid algorithm. It follows from the maximum principle that f (x, y, 0)L∞ (Ω) + gL∞ (∂Ω) . uL∞(Ω) ≤ l = c−1 0
(9)
Let introduce θ=
max
(x,y)∈Ω, |ξ|≤l+2ρ
fuu (x, y, ξ).
Assume that for the initial guess u0 (x, y) the estimate holds u(0) − uL∞ (Ω) ≤ ρ = const.
(10)
Lemma 1. Suppose that c−2 0 θρ < 1. Then m
2 u(m) − uL∞ (Ω) ≤ c20 θ(c−2 , m = 0, 1, 2 · · · • 0 θρ)
(11)
The proof is by induction applying the maximum principle to the boundary value problem for v = u(m+1) − u and it is similar to the corresponding one in the 1D case [1]. Let us consider the finite-difference analogue of the iterative process (8): (m+1)
Lh U (m+1) = −ε2 (Uxx = −f (xi , yj , U
(m)
)
(m+1)
+ Uyy
) + fu (xi , yj , U (m) )U (m+1)
(12)
+ fu (xi , yj , U (m) )U (m) , (m)
U
(x, y) ∈ wh , m = 0, 1, 2, . . . = g, on the boundary ∂wh .
Lemma ρ0 and h0 , independent of ε such that if h = 2. There exist constants max max hxi , max hyj ≤ h0 and ρ ≤ ρ0 , C independent of ε, h, such 1≤i≤Nx
1≤j≤Ny
that m
2 U (m) − u ≤ C[N −2 lnk N + (c−2 ], m = 0, 1, 2, . . . , 0 θρ)
(13)
where k = 2 for S-mesh and k = 0 for B-mesh. The proof is based on introducing of an axillary finite difference process corresponding to (8) and is similar to the corresponding one in the 1D case [1]. Now combining the estimates (6), (7) with Lemmas 1, 2 one can prove the following assertion. Theorem 1. Let u be the solution of problem (1)-(3) u and U be the solution of (9) on B-mh or on S-mh. Then U − u∞ ≤ CN
−2
ln N, k
k = 0 for B − mesh, k = 2 for S − mesh.
(14)
708
4
I.T. Angelova and L.G. Vulkov
Two-Grid Algorithm
In this section we propose a simple two-grid algorithm based on the estimate (13). If U = U (Nx , Ny ) is the solution of the nonlinear discrete problem (5) we denote its bilinear interpolant
(x, y) = U
Nx ,Ny
Uij φi (x)φj (y),
i,j
where φi (x) is the standard piecewise linear basis function associated with the interval (xi−1 , xi+1 ). If follows from Theorem 1 that
− u∞ ≤ u − uI + CN −2 lnk N, U where uI = uI (x, y) is the interpolant of the continuous solution u. From argu − u∞ ≤ ments in [4,14] we have the global parameter - uniform error bound U CN −2 lnk N . If in the iterative process (8) one takes m = 1 and the initial guess
(x, y), then in (13) we will have u(0) (x, y) = U m
(C0−2 θρ)2 = CN −4 ln2k N.
(15)
Next, if one solves the such linearized problem (12) on a finnier mesh, for example, Nxf = Nx2 , Nyf = Ny2 (f≡fine), then the first term in (13) looks as follows k = 0 for B − mesh, (N f )−2 lnk N f = 2N −4 ln2k N, (16) k = 2 for S − mesh. Therefore, (15) and (16) imply that as a result we obtain an accuracy of up to almost fourth order for S-mesh and fourth order for B-mesh, with respect to the coarse grid. Let us now present our algorithm. We first introduce a fine mesh, for example, the following one: wfh = w fhx × w fhy as (4) but Nxf = Nx2 , Nyf = Ny2 . 1. Solve the discrete problem (5) on the coarse grid wh and then construct the
(x, y) defined on the domain Ω. function U 2. Solve the linear discrete problem
(xi , yj ))U (xi , yj ) −ε2 (Uxx,i + Uyy,j ) + fu (xi , yj , U
(xi , yj ))U (xi , yj ) − f (xi , yj , U
(xi , yj )), = fu (xi , yj , U i = 1, . . . , Nx2 − 1, j = 1, . . . , Ny2 − 1, U = g on the boundary ∂ωhf . to find the fine mesh numerical solution U f (x, y), (x, y) ∈ Ω.
f (x, y), (x, y) ∈ wh . 3. Interpolate U f to obtain U The next theorem is the main result of the present paper.
Two-Grid Methods on Layer-Adapted Meshes
709
Theorem 2. Let the assumptions in Theorem 1 hold. Then for the error of the two-grid method we have k = 4 for S − mesh, f −4 k
U − u∞,wh ≤ CN ln N, (17) k = 0 for B − mesh.
5
Numerical Experiments
In this section we present numerical experiments obtained by applying the twogrid algorithm. The test problem and the exact solution are: −ε2 u +
u−1 = f (x, y), u ∂Ω = 0, (x, y) ∈ Ω, 2−u
exp(−x/ε) + exp(−(1 − x)/ε) exp(−y/ε) + exp(−(1 − y)/ε) uε (x, y) = 1 − 1− , 1 + exp(−1/ε) 1 + exp(−1/ε)
and f (x, y) is calculated from the exact solution. The tables below present the errors EN = U − u∞ , where U is the numerical solution on a mesh with N Table 1. The maximum error and the numerical order of convergence for ε = 1, 10−1 , 10−2 , 10−3 , 10−4 for the scheme (11) and the TGA on S and B-mesh, where Nx = Ny = N and Nxf = Nyf = N f ε 1
10−1
10−2
10−3
10−4
N Nf I st ON II st ON f I st ON II st ON f I st ON II st ON f I st ON II st ON f I st ON II st ON f
4 8 16 64 6,558E-5 1,644E-5 1,9960 1,9990 4,105E-6 2,566E-7 4,0000 4,0001 8,911E-2 3,295E-2 1,4351 1,8486 8,827E-3 1,124E-3 2,9735 3,6891 1,016E-1 6,830E-2 0,5731 0,7524 1,679E-2 3,656E-3 2,1994 2,7158 1,013E-1 6,649E-2 0,6078 0,7145 1,566E-2 2,397E-3 2,7078 2.1115 1,099E-1 6,635E-2 0,7247 0,7157 1,555E-2 2,382E-03 2,7062 2,1026 S-mesh
16 256 4,113E-6
ε 1
1,604E-8 9,150E-3
10−1
8,714E-05 4,055E-2
10−2
5,565E-4 4,052E-2
10−3
5,548E-3 4,049E-2 5,546E-3
10−4
N Nf I st ON II st ON f I st ON II st ON f I st ON II st ON f I st ON II st ON f I st ON II st ON f
4 8 16 64 6,558E-5 1,644E-5 1,9960 1,9990 4,105E-6 2,566E-7 4,0000 4,0001 8,911E-2 2,295E-2 1,9568 1,9001 8,827E-3 1,124E-3 2,9735 3,8649 2,090E-2 5,571E-3 1,9073 2,0056 2,300E-2 2,090E-3 3,4604 3,7377 2,339E-1 5,979E-2 1,9681 2,0082 2,803E-2 1,138E-3 3,9860 3,9967 2,372E-1 5,980E-2 1,9875 2,0062 1,857E-2 1,138E-3 4,0284 4,0008 B-mesh
16 256 4,113E-6 1,604E-8 6,150E-3 7,714E-05 1,388E-3 1,566E-4 1,486E-2 7,129E-5 1,489E-2 7,107E-5
710
I.T. Angelova and L.G. Vulkov
mesh steps. Also, we calculated numerical orders of convergence by the formula: ON = (ln EN − ln E2N ) / ln 2. Table 1 displays the maximum error EN and the numerical orders of convergence ON on the 1st and second steps of the two-grid algorithms on S and B-meshes. These results indicate that the uniform order of convergence is in agrement with Theorems 1, 2.
References 1. Angelova, I.T., Vulkov, L.G.: Comparison of the Two-Grid Method on Different Meshes for a Singularly Perturbed Semilinear Problem. Amer. Inst. of Phys. 1067, 305–312 (2008) 2. Axelsson, O.: On Mesh Independence and Newton Methods. Applications of Math. 4-5, 249–265 (1993) 3. Bakhvalov, N.S.: On the Optimization Methods for Solcing Boundary Value Problems with Bounsary Layers. Zh. Vychisl. Math. Fiz. 24, 841–859 (1969) (in Russian) 4. Clavero, C., Gracia, J., O’Riordan, E.: A Parameter Robust Numerical Method for a Two Dimensionsl Reaction-Diffusion Problem. Math. Comp. 74, 1743–1758 (2005) 5. Duran, A., Lombardi, A.: Finite element approximation of convection-diffusion problems using graded meshes. Appl. Numer. Math. 56, 1314–1325 (2006) 6. Farrell, P.A., Miller, J.J.H., O’Riordan, E., Shishkin, G.I.: On the Non-Existence of ε-Uniform Finite Difference Method on Uniform Meshes for Semilinear Two-Point Boundary Value Problems. Math. Comp. 67(222), 603–617 (1999) 7. Gilbarg, D., Trudinger, N.: Ellipric Partial Differential Equations of Second Order. Springer, Heidelberg (1998) 8. Han, H., Kellog, R.B.: Differentiability Properties of Solutions of the Equation −ε2 u + ru = f (x, y) in a Square. SIAM J. Math. Anal. 21, 394–408 (1990) 9. Herceg, D., Surla, K., Radeka, I., Malicic, I.: Numerical Experiments with Different Schemes for Singularly Perturbed Problem. Novi Sad J. Math. 31, 93–101 (2001) 10. Linss, T.: Layer-Adapted Meshex for Convection-Diffusion Problems. Habilitation, TU Dresden (2007) 11. Miller, J.J.H., O’Riordan, E., Shishkin, G.I.: Fitted Numerical Methods for Sinhular Perturbed Problems. World Scientific, Singapore (1996) 12. Roos, H., Stynes, M., Tobiska, L.: Numerical Methods for Singular Perturbed Differential Equations. In: Convection-Diffusion and Flow Problems. Springer, Berlin (2008) 13. Shishkin, G.I., Shishkina, L.P.: A High-Order Richardson Method for a Quadilinear Singularly Perturbed Elliptic Reaction-Diffusion Equation. Diff. Eqns. 41(7), 1030– 1039 (2005) (in Russian) 14. Stynes, M., O’Riordan, E.: A Uniformly Concergent Galerkin Method on a Shishkin Mesh for a Convection-Diffjsion Problem. J. Math. Anal. Applic. 214, 36–54 (1997) 15. Vulanovic, R.: Finite-Difference Methods for a Class fo Strongly Nonlinear Singular Perturbation Problems. Num. Math. Theor. Appl. 1(2), 235–244 (2008) 16. Vulkov, L., Zadorin, A.: Two-Grid Interpolation Algorithms for Difference Schemes of Exponential Type for Semilinear Diffusion Convection-Dominated Equations. Amer. Inst. of Phys. 1067, 284–292 (2008) 17. Vulkov, L., Zadorin, A.: A Two-Grid Algorithm for Solution of the Difference Equations of a System, of Singularly Perturbed Semilinear Equations. LNCS, vol. 5434, pp. 582–589. Springer, Heidelberg (2009) 18. Xu, J.: A Novel Two-Grid Method for Semilinear Ellipptic Equations. SIAM, J. Sci. Comput. 15(1), 231–237 (1994)
Comparative Analysis of Solution Methods for Delay Differential Equations in Hematology Gergana Bencheva Institute for Parallel Processing, Bulgarian Academy of Sciences, Acad. G. Bontchev, Bl. 25A, 1113 Sofia, Bulgaria
[email protected]
Abstract. Considered model of blood cells production and regulation under the action of growth factors consists of a system of ordinary differential equations with delays. Six methods for solution of such systems are experimentally compared. XPPAUT, a software tool that brings together various algorithms and provides means for visualisation of the solution, is used for this purpouse. The computing times and the quality of the solution are analyzed with the help of test data for red blood cells.
1
Introduction
Recent achievements in the development of high-performance computer facilities turn the computer modelling (CM) into a classical tool for analysis and prediction of various real-life phenomena. Problems in biology and medicine are among the most widespread applications of CM since the approach “trial-error” is not recommended for dealing with questions related to understanding and predicting of human physiological processes in health and disease. The investigations in the current paper are inspired from a problem in hematology, where blood cells (BCs) production and regulation should be modelled in relation with blood pathologies. The complex biological process during which the mature BCs evolve from primitive cells in the bone marrow called hematopoietic pluripotent stem cells (HSCs) is known as hematopoiesis. Each type of BCs is a result of the action of specific proteins, known as Growth Factors or Colony Stimulating Factors (CSF), at specific moments during the hematopoiesis. This is possible due to the high self-renewal and differentiation capacity of the HSCs. The nonlinear system of delay differential equations (DDEs) proposed in [1] is considered as initial model of hematopoiesis. DDEs provide an important way of describing the time evolution of biological systems whose rate of change also depends on their configuration at previous time instances. Introduction of delays (in this case it corresponds to the cell cycle duration) allows to improve the models by taking into account important aspects and to face more complicated phenomena based on feedback control. The applicability to the considered model of various solution methods is analyzed with the help of XPPAUT – a software tool available from the web page of G. B. Ermentrout [7]. XPPAUT brings together a number of algorithms for I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 711–718, 2010. c Springer-Verlag Berlin Heidelberg 2010
712
G. Bencheva
various types of equations, including DDEs, and provides means for visualisation of the solution. The implemented methods for DDEs are divided into groups – explicit/implicit, with fixed/adaptive step, designed for nonstiff/stiff problems. A set of numerical tests for representative test examples are performed and analyzed for solvers from each group in order to tune the software parameters and to compare their performance. The remainder of the paper is organized as follows. Section 2 deals with the mathematical model of hematopoiesis as it is formulated in [1]. The package XPPAUT and the considered methods for numerical solution of DDEs are briefly presented in section 3. Section 4 is devoted to the numerical tests and to the comparative analysis of the methods with respect to both measured cpu time and behaviour of the solution. The latter is investigated for different time steps and its proximity to the steady states is studied.
2
Mathematical Model
In this section we briefly present the system of DDEs proposed in [1] to model the blood cells production and regulation mediated by CSF. In the bone marrow, HSCs are divided into two groups: proliferating cells and nonproliferating (or quiescent) cells. Their populations at time t are denoted by P (t) ≥ 0 and Q(t) ≥ 0 respectively. The population of the circulating mature BCs is denoted by M (t) ≥ 0 and the growth factor concentration is E(t) ≥ 0. The following system of stiff nonlinear ordinary DDEs have to be solved ⎧ dQ ⎪ ⎪ = −δQ(t) − g(Q(t)) − β(Q(t), E(t))Q(t) + 2e−γτ β(Q(t − τ ), E(t − τ ))Q(t − τ ) ⎪ ⎪ dt ⎪ ⎪ ⎪ ⎨ dM = −μM(t) + g(Q(t)) ⎪ dt ⎪ ⎪ ⎪ ⎪ ⎪ dE ⎪ ⎩ = −kE(t) + f (M(t)) dt
(1)
with appropriate initial conditions for t ∈ [−τ, 0]. The delay τ corresponds to the proliferating phase duration, which is assumed to be constant. The rates at which the proliferating and quiescent cells can die are represented in (1) by the parameters γ and δ respectively. The degradation rates μ and k of the mature BCs and of CSF in the blood are assumed to be positive. Quiescent cells can either be introduced in the proliferating phase with a rate β(Q(t), E(t)) or differentiate in mature BCs with a rate g(Q(t)). The negative feedback control f (M (t)) of the bone marrow production on the CSF production acts by the mean of circulating mature BCs: the more circulating BCs are, the less growth factor is produced. The trivial steady-state of (1) is not a biologically interesting equilibrium since it describes a pathological situation that can only lead to death without appropriate treatment. The existence of nontrivial positive steady-state is ensured by the conditions (see [1] for details):
0 < δ + g (0) < β
f (0) 0, k
, 0 ≤ τ < τmax
⎛ ⎞ f (0) 2β 0, k 1 ⎝
⎠. := ln f (0) γ δ + g (0) + β 0, k
(2)
Comparative Analysis of Solution Methods for DDEs in Hematology
713
The parameters and the coefficient functions involved in the model depend on the particular type of blood cells.
3
Solution Methods and Software Tools
Various numerical methods for solution of systems of ordinary differential equations are developed for both stiff and nonstiff problems, as well as for the case of retarded arguments (see e.g. [4,5,6,2] and references therein). They fall in general into two classes: those which use one starting value at each step (“one-step methods”) and those which are based on several values of the solution (“multistep methods”). Each of the methods in these classes can be either explicit or implicit, and in addition they can be with fixed or adaptive step. When dealing with delays, a special attention should be focused on the locating and including into the mesh of the so called breaking points (or primary discontinuities), at which the solution possesses only a limited number of derivatives, and remains piecewise regular between two consecutive such points. The explicit and implicit Euler’s methods are the most popular one-step methods. The s-stage Runge-Kutta methods are one-step methods, where s auxiliary intermediate approximations are computed to obtain a sufficiently high accuracy for the approximations at the main step points. Rosenbrock methods are Runge-Kutta type methods for stiff ODEs which are linearly implicit, i.e. the systems to be solved are linear even for nonlinear ODEs. This is not the case, e.g. for implicit Euler method, where Newton iterations are additionally applied to handle nonlinear problems. Adams methods are the first representatives of the linear multi-step methods. If they are explicit they are known as AdamsBashforth, and the implicit ones are known as Adams-Moulton methods. The predictor-corrector methods are special multi-step methods in which an explicit linear multi-step method (predictor) is combined with an implicit one (corrector). Nordsieck methods are equivalent to the implicit Adams methods. They provide representation of multi-step methods which allows a convenient way of changing the step size. XPPAUT is “a tool for simulating, animating and analyzing dynamical systems”, created by G. B. Ermentrout and freely available through the webpage [7]. Some of the features of XPPAUT are: a) brings together algorithms for solving various types of equations, including DDEs; b) contains the code for bifurcation program AUTO; c) provides means for visualisation of the solution; d) is portable on various systems and its building requires the standard C compiler and Xlib. More details about the features and the usage of XPPAUT may be found in [3] and in the documentation distributed together with the package. The behaviour of the XPPAUT implementations of six widely used ODE solvers is compared in the current paper, namely: a) backward Euler (BE) – the simplest implicit method; b) classical Runge-Kutta (RK) – four stage method that gives O(h4 ) accuracy; c) Dormand-Prince 5 (DP5) – based on Runge-Kutta method of order 5 with adaptive step and automatic step size control; d) Rosenbrock (RB2) – based on Matlab version of the two stage Rosenbrock algorithms;
714
G. Bencheva Table 1. The methods
Runge-Kutta (RK) Adams-Bashforth (AD) Dormand-Prince 5 (DP5) Backward Euler (BE) CVODE (CV) Rosenbrock (RB2)
Explicit + + +
Implicit
Fixed Step + +
+ + +
+
Adaptive Step
Stiff
+ + +
+ +
e) Adams-Bashforth (AD) – fourth order predictor-corrector; f) CVODE (CV) – uses C-version of LSODE, written by S. D. Cohen and A.C. Hindmarsh, LLNL. LSODE is adaptive method based on the Nordsieck representation of the fixed step size Adams formulas. The first four of these methods are one-step methods, while the last two are representatives of the multi-step solvers. The correspondence of the considered methods to the classes of explicit/implicit, fixed/adaptive step or nonstiff/stiff integrators is indicated in Table 1. Delay equations are solved in XPPAUT by storing previous data and using cubic polynomial interpolation to obtain the delayed value. All three types of delays – constant, time dependent and state dependent can be handled with XPPAUT. The author recommends the usage of fixed step integrators for solving DDEs “although adaptive will sometimes work” (see p. 56 of the documentation distributed together with the package). This remark is kept in mind for the analysis of the solvers in the next section.
4
Numerical Tests and Analysis of the Results
The numerical tests are performed with version 5.98 of XPPAUT, which has been slightly modified to allow measurement of the computing time and recompiled under Linux for 64 bit processors. The comparative analysis concerns the test data for erythropoiesis presented in [1]. Test example. The coefficient functions for normalized initial conditions and the values of the parameters involved in the model are taken as follows: Function β(E) = β0
E , 1+E
g(Q) = GQ, f (M ) =
Parameter β0 > 0 G>0
a , a, K, r > 0 1 + KM r
τ ∈ [0, τmax ), τmax = 2.99 days
δ G β0 γ μ k a K r
Value
Range (day −1 )
0.01 day −1 0.04 day −1 0.5 day −1 0.2 day −1 0.02 day −1 2.8 day −1 6570 0.0382 7
0 – 0.09 0 – 0.09 0.08 – 2.24 0 – 0.9 0.001 – 0.1 — — — —
Comparative Analysis of Solution Methods for DDEs in Hematology
715
With the notation α(τ ) = 2e−γτ − 1 for τ ∈ [0, τmax ], the nontrivial steadystates of (1) for this choice of β, g and f are defined by Q∗ =
μ 1 G K 1/r
aβ0 α(τ ) − (δ + G)(a + k) k(δ + G)
1/r ,
M∗ =
G ∗ Q , μ
E∗ =
δ+G . β0 α(τ ) − (δ + G)
(3)
They are stable for τ close to zero and τ ≥ 2.82 days. Hopf bifurcation occurs for τ = 1.4 and the steady states become unstable until the stability switch. The software parameters are first tuned for each solver (the default values are not appropriate for some cases), and the results are compared with respect to both the computing time and the behaviour of the solution for three values of the delay τ : 0.5, 1.4, 2.9 and for three values of the mesh parameter dt. The latter denotes the step size for the fixed step integrators and the step for which the solution is stored in the output data file for the adaptive ones. Time comparison. The measured cpu times in seconds are collected in Table 2. The first column is for the method, where the error tolerance (1.e − 6 or 1.e − 9) for the adaptive solvers is given in brackets. The rest of the columns are in groups of three – one for each value of τ . The columns in each group are for the values of dt: 0.2, 0.1 and 0.05. As a general rule, when dt is reduced twice the computation time is increased approximately twice for all values of τ . Exceptions are CV and RB2, which is expected because they are implicit methods with adaptive step. The methods with the smallest and largest computing times are AD and CV with tolerance 1.e − 9 respectively, which requires special attention on the properties of their solutions. If they are comparable to (for AD) or much better than (for CV) the solution with the rest of the methods, AD and/or CV could become favourites for reliable future computer simulations. Solution comparison. Various plots are used to analyze the solvers, part of which are presented below. First set of plots is to compare the influence of the time step (see Fig. 1). For the case τ = 0.5, the solutions for dt = 0.1 and dt = 0.05 almost coincide for all solvers, while the one for dt = 0.2 is different from the others. This confirms the expectation that better approximations of the solution are obtained for smaller step sizes. The three populations for Runge-Kutta with Table 2. Computing times Method \dt RK AD BE DP5 (1.e − 6) DP5 (1.e − 9) CV (1.e − 6) CV (1.e − 9) RB2 (1.e − 6) RB2 (1.e − 9)
0.2 0.05 0.02 0.13 0.09 0.12 0.12 0.62 0.17 0.49
τ = 0.5 0.1 0.09 0.04 0.25 0.17 0.19 0.1 1.0 0.34 0.57
0.05 0.2 0.1 0.46 0.34 0.35 0.07 1.66 0.65 0.8
0.2 0.04 0.03 0.15 0.08 0.17 0.28 0.75 0.17 1.17
τ = 1.4 0.1 0.09 0.05 0.30 0.18 0.25 0.29 1.28 0.32 1.18
0.05 0.18 0.1 0.56 0.35 0.34 0.14 1.96 0.65 1.27
0.2 0.04 0.02 0.15 0.09 0.13 0.07 0.58 0.17 0.59
τ = 2.9 0.1 0.1 0.05 0.26 0.17 0.18 0.03 0.67 0.32 0.53
0.05 0.19 0.1 0.49 0.34 0.34 0.02 0.96 0.64 0.78
716
G. Bencheva Cell populations Q and M, Runge-Kutta, τ=0.5, dt=0.05
Growth factor E, Runge-Kutta, τ=0.5, dt=0.05
10
1 Quiscent stem cells Q Matured cells M
Growth factor E 0.8 Concentration 3 (x 10 mU/mL)
Cell populations 8 (x 10 cells/kg)
8
6
4
2
0.6
0.4
0.2
0
0 0
200
400
600
800
1000
0
200
400
Time
600
800
1000
Time
a) Solution with Runge-Kutta for τ = 0.5 Cell population M, Adams-Bashforth, τ=1.4
Growth factor E, Adams-Bashforth, τ=1.4
9
1.4 dt=0.2 dt=0.1 dt=0.05
dt=0.2 dt=0.1 dt=0.05
1.2 Concentration 3 (x 10 mU/mL)
Cell population 8 (x 10 cells/kg)
8
7
6
1 0.8 0.6 0.4
5 0.2 4
0 0
200
400
600
800
1000
0
200
400
Time
600
800
1000
Time
b) Solution with Adams-Bashforth for τ = 1.4 Cell population Q, CVODE, τ=2.9, tol=1.e-9
Growth factor E, CVODE, τ=2.9, tol=1.e-9
3.5
45 dt=0.2 dt=0.1 dt=0.05
dt=0.2 dt=0.1 dt=0.05
40 35 Concentration 3 (x 10 mU/mL)
Cell population 8 (x 10 cells/kg)
3 2.5 2 1.5
30 25 20 15 10
1 5 0.5
0 0
200
400
600 Time
800
1000
0
200
400
600
800
1000
Time
c) Solution with CVODE for τ = 2.9, tol = 1.e − 9 Fig. 1. Solution of the DDE system (1) with RK, AD, and CV
dt = 0.05 are presented in Fig. 1 a). After some oscillations at the beginning, they are stabilized around the steady states: Q∗ = 3.19592, M ∗ = 6.39183, E ∗ = 0.14091. The solution obtained with AD for τ = 1.4 and with CV for τ = 2.9 is presented respectively in Fig. 1 b) and in Fig. 1 c) for the three values of dt. The periodic solution for τ = 1.4 is due to the Hopf bifurcation. The behaviour for τ = 2.9 is similar to the case of τ = 0.5, but now the steady states are: Q∗ = 1.91601, M ∗ = 3.83203, E ∗ = 5.05134. The second set of plots aims to compare the behaviour in groups of solvers – explicit/implicit, fixed/adaptive step, for nonstiff/stiff problems. They are prepared for each population separately with the smallest value of dt. Initially, the solution in the whole integration interval t ∈ [0, 1000] is drawn. In the cases
Comparative Analysis of Solution Methods for DDEs in Hematology Growth factor E, Fixed step, τ=2.9, dt=0.05
5.052
DP5 CV RB2 * E
5.054
Concentration 8 (x 10 cells/kg)
Concentration 8 (x 10 cells/kg)
Growth factor E, Adaptive step, τ=2.9, dt=0.05
RK AD BE * E
5.054
717
5.05 5.048 5.046
5.052 5.05 5.048 5.046
5.044 800
850
900
950
5.044 800
1000
850
900
Time
950
1000
Time
a) Comparison fixed/adaptive step methods for τ = 2.9, t ∈ [800, 1000] Growth factor E, Nonstiff solvers, τ=2.9, dt=0.05
Growth factor E, Stiff solvers, τ=2.9, dt=0.05
20
20 RK AD BE DP5
15 Concentration 8 (x 10 cells/kg)
Concentration 8 (x 10 cells/kg)
15
CV RB2
10
5
10
5
0
0 0
200
400
600 Time
800
1000
0
200
400
600
800
1000
Time
b) Comparison nonstiff/stiff solvers for τ = 2.9, t ∈ [0, 1000] Fig. 2. Comparison of the methods in groups
where the figure looks like the solutions for various methods coincide, the plots are “zoomed” for better analysis and the last 150-200 days are plotted (i.e. t ∈ [800, 1000] or t ∈ [850, 1000]). Zoomed plots of the growth factor for fixed step (left) and adaptive step (right) solvers are shown in Fig. 2 a). The additional line in both cases represents the steady state E ∗ . The solutions with the fixed step integrators RK and BE coincide and are closer to E ∗ than the solution with AD. The adaptive integrators behave in a similar way – curves for DP5 and RB2 coincide and are closer to E ∗ than the one for CV. The comparisons of nonstiff versus stiff solvers (Fig. 2 b)) as well as of the explicit versus implicit solvers lead to the same conclusion – the one-step methods BE, RK, DP5 and RB2 have similar quality of the solution for this test example, while the methods AD and CV are not that close to the steady states. This observation requires additional set of plots in order to compare the two representatives of the multi-step methods (Fig. 3). For both values of τ with stable steady states, the solution obtained with AD is between the solutions obtained with CV for tol = 1.e − 6 and tol = 1.e − 9. For τ = 0.5 the closest to the steady state is the curve for CV, tol = 1.e − 9, while for τ = 2.9 this holds for AD. A possible reason for this behaviour of CV is hidden in the remark at the end of section 3: CV is an adaptive method and the way of handling DDEs in XPPAUT is not appropriate for it. Other techniques for handling DDEs in combinations with CV and its latest version SUNDIALS (available for download from [8]) should be analyzed in order to solve efficiently problems in hematology.
718
G. Bencheva Growth factor E, AD/CV, τ=0.5, dt=0.05
Growth factor E, AD/CV, τ=2.9, dt=0.05
0.15
5.08 CV, tol 1.e-9 CV, tol 1.e-6 AD * E
0.148
5.06 Concentration 8 (x 10 cells/kg)
Concentration 8 (x 10 cells/kg)
0.146
CV, tol 1.e-9 CV, tol 1.e-6 AD * E
5.07
0.144 0.142 0.14 0.138 0.136
5.05 5.04 5.03 5.02 5.01
0.134
5
0.132
4.99 860
880
900
920 Time
940
960
980
1000
860
880
900
920
940
960
980
1000
Time
Fig. 3. Comparison of AD and CV for τ = 0.5 (left) and τ = 2.9 (right), t ∈ [850, 1000]
Concluding Remarks. XPPAUT implementations of six methods for DDEs have been experimentally investigated. The solutions with the representatives of one-step methods have comparable quality, while they are slightly worse for the multi-step methods. Taking into account both the computing time and the quality of the solution, one could use XPPAUT implementations of RK, DP5 and RB2 as a toolbox for reliable future computer simulations. The particular choice among them depends on the properties of the problem – RK is the fastest one, DP5 can be applied in the cases where an adaptive step has to be used for a non-stiff problem, while RB2 has better properties for stiff problems. Acknowledgement. Supported in part by the Bulgarian NSF Grant DO02-214.
References 1. Adimy, M., Crauste, F., Ruan, S.: Modelling Hematopoiesis Mediated by Growth Factors with Applications to Periodic Hematological Diseases. Bulletin of Mathematical Biology 68(8), 2321–2351 (2006) 2. Bellen, A., Guglielmi, N., Maset, S.: Numerical methods for delay models in biomathematics. In: Quarteroni, A., Formaggia, L., Veneziani, A. (eds.) Complex Systems in Biomedicine, pp. 147–185. Springer, Milano (2006) 3. Ermentrout, B.: Simulating, analyzing and animating dynamical systems: a guide to XPPAUT for researchers and students. SIAM, Philadelphia (2002) 4. Hairer, E., Norsett, S.P., Wanner, G.: Solving ordinary differential equations I, Nonstiff problems. Springer Series in Computational Mathematics. Springer, Heidelberg (2000) 5. Hairer, E., Wanner, G.: Solving ordinary differential equations II, Stiff and Differential-Algebraic Problems. Springer Series in Computational Mathematics. Springer, Heidelberg (2002) 6. Lambert, J.D.: Numerical methods for ordinary differential systems: the initial value problem. John Wiley & Sons Ltd., Chichester (1991) 7. http://www.math.pitt.edu/~ bard/xpp/xpp.html 8. https://computation.llnl.gov/casc/sundials/main.html
Solving Non-linear Systems of Equations on Graphics Processing Units Lubomir T. Dechevsky, Børre Bang, Joakim Gundersen, Arne Laks˚ a, and Arnt R. Kristoffersen Narvik University College, P.O.B. 385, N-8505 Narvik, Norway
Abstract. In [6], [7] a method for isometric immersion of smooth mvariate n-dimensional vector fields, m = 1, 2, 3, 4, . . ., n = 1, 2, 3, 4, . . . onto fractal curves and surfaces was developed, thereby creating an opportunity to process high-dimensional geometric data on graphics processing units (GPUs) which in this case are used as relatively simple parallel computing architectures (with relatively very low price). For this construction, the structure of multivariate tensor-product orthonormal wavelet bases was of key importance. In the two afore-mentioned papers, one of the topics discussed was the spatial localization of points in high dimensional space and their images in the plane (corresponding to pixels in the images when processed by the GPU). In the present work we show how to compute approximately on the GPU multivariate intersection manifolds, using a new orthonormal-wavelet scaling-function basismatching algorithm which offers considerable simplifications compared to the original proposed in [6], [7]. This algorithm is not as general as the Cantor diagonal type of algorithm considered in [6], [7], but is much simpler to implement and use for global mapping of the wavelet basis indices in one and several dimensions. This new, simpler, approach finds also essential use in the results obtained in [5] which can be considered as continuation of the present paper, extending the range of applications of the present simplified approach to GPU-based computation of multivariate orthogonal wavelet transforms. The new method can be also used to accelerate the initial phase of the so-called Marching Simplex algorithm (or any other ’marching’ algorithm for numerical solution of nonlinear systems of equations).
1
Introduction
In recent years, the performance of graphics hardware has increased more rapidly than that of central processing units (CPUs). CPU designs are optimized for high performance on sequential code, and it is becoming increasingly difficult to use additional transistors to improve performance on this code, [12]. One of the ways to address the persistent need of ever-increasing computational power is, of
This work was supported in part by the 2006, 2007 and 2008 Annual Research Grants of the priority R&D Group for Mathematical Modeling, Numerical Simulation and Computer Visualization at Narvik University College, Norway.
I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 719–729, 2010. c Springer-Verlag Berlin Heidelberg 2010
720
L.T. Dechevsky et al.
course, the increasing use of multiprocessor architectures, ranging from petaflop supercomputers to large-scale heterogeneous networks and to individual multicore personal computers. This approach certainly “does the job”, but it tends to be rather expensive; moreover, the necessity to balance the load of all CPUs in the multiprocessor system throughout the whole time of the computation leads to severe complications in the parallelization of sequential algorithms. On the other hand, commodity graphic processing units (GPUs) are evolving rapidly in terms of processing speed, memory size, and most significantly, programmability. During the last decade the performance of the GPUs has doubled every 9 months, [14]. In addition to the improvement in performance, the features have increased drastically, enhancing the flexibility and programmable functionality. This development has largely been driven by the mass-market demand for faster and more realistic computer games and multi-media entertainment. In the last years graphics hardware design has been particularly sensitive to memory bandwidth problems due to a rapidly increasing data transfer volume. As a result, modern GPUs can access and transfer large data blocks tremendously faster than CPUs, [13]. Moreover, this access and transfer is parallel, thus turning the GPU into an increasingly advanced parallel computing architecture (which is comparatively very cheap and quite affordable for the average user!). Most scientific applications have to handle large amounts of data and thus they suffer from both limited main memory bandwidth and the obsolete repeated command transfer, when the same operation has to be performed on each component of the data block. The GPU will do this operation simultaneously on the entire data block. Before the age of GPU-programming (also referred to as generalpurpose programming on GPUs, or GPGPU programming, see, e.g., [15]), only a minor part of the GPU’s capacity was in use, while now GPU-programming exploits this capacity to its very limits, and provides a framework which is open to dynamical upgrades of the features of the GPU. Let Vj−1 = Vj ⊕ Wj , j ∈ Z, be the nested V-subspaces and W-spaces of a multi-resolution analysis in L2 (Rn , Rm ) (see, e.g., [1], [11]). In [6], [7] we studied an algorithm for isometric mapping between smooth n-variate m-dimensional vector fields and fractal curves and surfaces, by using orthonormal wavelet bases. This algorithm matched the orthonormal bases of scaling-functions (the bases of the “V-spaces” of multiresolution analyses). Here we propose a new alternative to the Cantor diagonal type of algorithm of [6], [7], for matching the “V -space” bases. The new algorithm is applicable only when the definition domain of the n-variate m-dimensional vector fields in the setting of [6], [7] is a bounded hyper-cube (hyper-rectangle) in Rn , but it is much simpler than the general Cantor diagonal type of algorithm. In the present paper we shall consider applications of this new approach for GPU-based solving of nonlinear equations, not only in the case when this system has a unique solution, but also in the much more general situation when the solution is a d-dimensional manifold immersed in Rm , d = 0, 1, . . . , min{m, n} (here the notation for m and n has the same meaning as in [6], [7]). As a continuation of the present study, in [5] we shall propose a new algorithm which matches the orthonormal bases of
Solving Non-linear Systems of Equations on Graphics Processing Units
721
(mother) wavelets (the “W-spaces” of multiresolution analyses) with application to GPU-based computing of multivariate orthogonal wavelet transforms. This work, together with [6], [7] and [5], is part of the research of the R&D Group for Mathematical Modeling, Numerical Simulation and Computer Visualization at Narvik University College within two consecutive Strategic Projects of the Norwegian Research Council – the completed project ’GPGPU – Graphics Hardware as a High-end Computational Resource’ (2004-2007), and the ongoing project ’Heterogeneous Computing’ (2008-2010).
2
Preliminaries
A well-known property of Hilbert spaces with equal dimension/cardinality of the basis is that all such spaces are isometric, i.e., for any such two Hilbert spaces H1 and H2 , there exists a linear bijective operator T : H1 → H2 with T x, T xH2 = x, xH1 , for every x ∈ H1 , where ·, ·Hi is the respective inner product (see, e.g., [9]). In this case, the inverse operator T −1 exists, and T −1 = T ∗ , where T ∗ is the Hilbert adjoint of T ; T ∗ is an isometry between H2 and H1 . This is the fact which permits the study of Hilbert spaces to be reduced to study of only their canonical representatives: l2 or L2 with the respective cardinality of the basis. All spaces in consideration in this study are Hilbert spaces. When A is an orthogonal operator (isometry) acting between finite-dimensional Hilbert spaces H1 and H2 with equal dimensions dim H1 = dim H2 = μ < ∞, then A can be represented by a μ × μ orthogonal matrix A. When appropriate, here and in [5] we shall identify a linear operator with its representation matrix (and sometimes we shall need to distinguish between them). → − Let f ∈ Vj1 be compactly supported (to ensure finiteness of the sums in k). Then, the following representation holds ⎞ ⎛ f1 (x1 , . . . , xn ) →− − ⎟ ⎜ .. f (→ x)=⎝ (1) ⎠= . fm (x1 , . . . , xn ) ⎛
⎞ α1,j1 ,k1 ,...,kn ⎜ ⎟ [0] [0] .. = ··· (2) ⎝ ⎠ϕj1 (x1 ) · · · ϕj1 (xn ) . k1 =−∞ kn =−∞ αm,j1 ,k1 ,...,kn [0] → − α − (3) = →ϕ − → (x1 , ..., xn ) → n j1 k j1 k − k ∈Z n j1 −1 2 →[l] [l] − [0] → − →ψ − (x , ..., x ) + = α − β − →ϕ − 1 n → → (x1 , ..., xn ) , (4) j kj j kj → n j0 k j0 k − → n l=1 j=j0 − k ∈Z kj ∈Z ∞
∞
and if, in particular, n = m = 1, then g ∈ Vj1 is a functional curve, with
722
L.T. Dechevsky et al.
g (t) =
∞
αj1 k ϕ j1 k (t)
(5)
k=−∞
=
α j0 k (t) + j0 k ϕ
k∈Z
β jkj ψjkj (t).
(6)
j=j0 kj ∈Z
In (4), (6) j1 and j1 are independent and j0 ≤ j1 , j0 ≤ j1 ; the scaling functions [0] [l] [0] ϕjk and wavelets ψjk belong to an orthonormal wavelet basis of L2 (Rn ); ϕj1 are the scaling functions belonging to a (generally different) orthonormal wavelet ϕjk , ψ basis of L2 (Rn ), taken at the finest level j1 ; jk belong to an orthonormal wavelet basis of L2 (R). Here high-dimensional orthonormal wavelet bases are of tensor-product type (see e.g. [1], [4]). Typically, in the default case the respective [0] [0] bases would coincide, namely ϕ − jk = ϕjk , ψjk = ψjk for all j, k →=ϕ− →, ϕ jk jk → − and k , where ϕjk and ψjk are the univariate orthonormal scaling functions and [0] [0] wavelets generating the tensor-product basis functions ϕ − →, ψ − → . In the sequel, jk jk we shall always consider only compactly supported (multi)wavelet bases. Biorthonormal (in particular, orthonormal) compactly supported wavelet bases with sufficient regularity are unconditional Riesz bases in a broad variety of function spaces, and induce equivalent weighted l2 wavelet-coefficient sequence norms in these spaces (see [1] and [4]). The norms used in deriving the isometric conversion results in [6], [7]and in the present paper are ∞ ∞
2 →
− 2 → f = · · · − α j1 k1 ···kn Rm (7)
L2(Rn ,Rm )
k1 =−∞
=
− n → k ∈Z
kn =−∞
2 − → α j0 k0 Rm +
j1
j=j0
− → k j ∈Zn
n 2 −1
l=1
2
−
[l]
→ →
,
β j− k j m
(8)
R
where . Rm is the usual Euclidean Hilbert norm in Rm , for compactly supported → − f ∈ Vj1 ⊂ L2 (Rn , Rm ), and 2
g L
2(R1 ,R1 )
=
∞ 2 α j k
(9)
1
k=−∞
=
2 2 α + β jkj , j0 k
k∈Z
(10)
j=j0 kj ∈Z
for compactly supported g ∈ Vj1 ⊂ L2 (R, R).
3 3.1
Main Results Isometric Conversion between Dimension and Resolution: A Simpler Approach
The concept of conversion between dimension and resolution, proposed in [6], [7], is based on the isometry between Hilbert spaces spanned over orthonormal
Solving Non-linear Systems of Equations on Graphics Processing Units
723
wavelet bases with higher number of variables and lower resolution level, and ones spanned over orthonormal wavelet bases with lower number of variables and higher resolution level, so that the dimensions of the respective Hilbert spaces are equal. For functions defined on unbounded domains, e.g., on the whole Rn , the number of nonzero scaling coefficients at a given resolution level depends on the concrete choice of the function, and for a compactly supported function is finite, but may be arbitrarily large, depending on the size of the support. Hence, in the general problem setting of [6], [7], the algorithm for matching of the orthonormal bases had to be of Cantor diagonal type. The complexity of the resulting basis-matching algorithm in [6], [7] is, however, rather high. Here we propose a different new algorithm, valid only for the particular (but practically most important) case of a bounded hyper-rectangle in Rn . First of all, without loss of generality, two reductions can be made: the hyperrectangle can be rescaled to the “unit” hyper-cube [−1, 1]n ⊂ Rn , and the support of the univariate scaling function generating the multivariate tensor-product wavelet basis (see [1], [4], [11]) can be rescaled to [−1, 1]. We shall also make the natural assumption that the scaling-function basis is boundary-corrected [2] or appropriately periodized, with period 2 [3]. Under these assumptions, the range of the indices ki , i = 1, . . . , n, in (2) and k in (5) will be reduced to ki = 0, 1, . . . , 2j1 − 1 and k = 0, 1, . . . , 2j1 − 1, respectively. Furthermore, according to the results in [6], [7], the spans (3) and (5), endowed with the respective Hilbert norms (7) and (9), will be isometric if, and only if, their dimensions are the same, i.e., when j1 = nj1 . Denoting μ = j1 , we define an orthogonal operator An,μ (with matrix An,μ ) acting isometrically between the two Hilbert spaces by establishing 1-1 correspondence between the elements of the bases, as follows. → ν ∈ {0, 1, . . . , 2μn − 1} An,μ : (k1 , . . . , kn ) ∈ {0, 1, . . . , 2μ − 1}n is defined via the 2μ -adic representation of integer numbers ν=
n
ki · 2μ(n−i) .
(11)
i=1
Its inverse (and, in view of its orthogonality, also its adjoint) is ATn,μ : ν ∈ {0, 1, . . . , 2μn − 1} → (k1 , . . . , kn ) ∈ {0, 1, . . . , 2μ − 1}n is given by the Euclid-type algorithm with consecutive steps ν μ(n−1) 2µ(n−1)
= k1 , ν − k1 · 2 μ(n−2)= ν1 , ν1 = k2 , ν1 − k2 · 2 = ν2 , 2µ(n−2) .. .. .
.ν n = k , ν − kn · 20 = 0. n n−1 µ(n−n) 2
(12)
Clearly this new construction cannot be applied in the most general case considered in [6], [7], while the Cantor diagonal type of basis matching proposed in [6], [7] will work, at least in principle, whenever the sums in (2),(3),(5) and the
724
L.T. Dechevsky et al.
respective sums in (7), (9) are all finite. It is also clear, however, that the new construction proposed here is much simpler and geometrically more straightforward. This is confirmed also by a comparison between the planar images of the implicit manifolds which are solutions of the model systems of nonlinear equations provided below, and respective planar images of the same or similar manifolds in parametric form obtained by the Cantor diagonal type basis matching in [6], [7]. 3.2
Computing the Solutions of Nonlinear Operator Equations
The general idea how to use isometric conversion between dimension and resolution for solving nonlinear operator equations was already outlined in the concluding subsection 2.8 of [6] and section 3 of [7] but, due to the complexity of the Cantor diagonal basis-matching algorithm, in [6], [7] it was applied only for the representation of parametric manifolds, but not for manifolds defined implicitly by systems of equations. With An,μ and A−1 n,μ defined via (11) and (12), respectively, it is convenient to apply the ideas of [6], [7] to both parametric and implicit manifolds. Here we shall consider the latter case in detail. In this case, the main steps of the solution of a given nonlinear system of equations are, as follows. 1. Reduce the system of equations to the canonical operator form. → − − → − → x n) = O m, − x n ∈ Ω ⊂ Rn f m (→
2. 3.
4.
5. 6. 7.
(13)
(m equations with n-unknowns, homogeneous RHS and vector-valued LHS m → with coordinates (fi,m (− x n ))i=1 ). ˜ ⊂ [−1, 1]n, so that at least one point of the Scale the domain Ω ⊂ Rn to Ω ˜ boundary of Ω belongs also to the boundary of [−1, 1]n in Rn . → − → − ˜ ⊂ [−1, 1]m (for example, by multiplying Scale the range f m (Ω) to f˜ m (Ω) the i-th equation in the canonical form (13) by appropriately chosen factor → − → − → − → − ˜ is a ci > 0, i = 1, . . . , m). (Note that O m ∈ f i,m (Ω) and O m ∈ f˜ i,m (Ω) necessary condition for the system of equations to have a solution.) → − Depending on the available greyscale resolution, the nature of f and the context of the problem, divide [−1, 1] into M non-overlapping intervals, with M being the number of available greyscale levels, and assign 1-1 to each consecutive interval the respective consecutive greyscale level. In this construction, 0 (zero) should be interior point for “its” interval (and the size of “its” interval will determine the resolution of the method, so in practice this size will be tuned depending on the pixel size). → − − → Expand f˜ m ( x˜ m ) in n-variate expansion (2), (3). (The resolution level j1 depends on n and the pixel resolution).) → − Apply a direct mapping (11) to transform f˜i,m ( x ˜ n ) into a functional curve g˜i,1 (ti ), ti ∈ [−1, 1], i = 1, . . . , m, where g˜i,1 is given via (5). Apply an inverse mapping (12) and the scaling in item 4, to transform the graph of g˜i,1 (ti ) into the i-th planar greyscale pixel image, i = 1, . . . , m.
Solving Non-linear Systems of Equations on Graphics Processing Units
725
8. For the i-th greyscale pixel image, identify the pixels “having the greyscale level of 0”, mark these pixels as “black” and the remaining pixels as “white”, i = 1, . . . , m. 9. Intersect the i-th set of “black” pixels for all i = 1, . . . , m. The intersection of all m sets is a “black pixel” “point cloud” in the plane (or, eventually, an empty set). 10. All mappings invoked in items 1-7 were invertible. Now, invert all these mappings and compose them in reverse order and apply the method of pointwise localization proposed in [6], [7] to transform the planar point cloud in item 9 to a respective point cloud in Ω ⊂ Rn . This point cloud in Rn is accepted as the numerical solution of the nonlinear system of equations (operator equation) at the given resolution level. 11. Depending on the context of the problem, the point cloud obtained in item 10 can be processed further in one of several ways (see the next section for further details).
4
Concluding Remarks and Simple Model Examples
Case d = 0 in item 11 in subsection 3.2. In the case when the dimension d of the solution manifold is equal to 0 (i.e., there is no solution, or there is a unique solution, or there is a finite number of isolated solutions in Ω ⊂ Rn ) the method will provide localization of the solutions up to tolerance which depends on the pixel resolution and the greyscale resolution of the GPU and on the number of variables n; it does not depend on the number of equations m = 1, . . . , n. For LHS in (13) which are not spatially inhomogeneous, efficient (although rough) localization of solutions can still be achieved for n up to 10-20, but for more spatially inhomogeneous LHS this range rapidly drops to 3-8. Nevertheless, the latter range is sufficient at least for intersection problems in computer aided geometric design where n is typically 2, 3, or 4. Case d > 0 in item 11 in subsection 3.2. In the case when d > 0 (d = 1 for curves in Rn , d = 2 for surfaces in Rn , ... , d = n − 1 for hypersurfaces in Rn ) the initial idea proposed in [6], [7] was to generate topology in the intersection manifold starting from a sufficiently dense wireframe in [−1, 1]n , however, due to the very different nature of the planar image that a smooth curve on the manifold may have (ranging from smooth planar curve to area-filling fractal), this approach was later abandoned. Instead, we found out that it is very efficient to use the present algorithm described in items 1-10 as a replacement of the first several local-refinement iterations of the Marching-Simplex algorithm [8], [10]. The criterion for detection of singularities used in the Marching-Simplex algorithm is based on the Morse lemma and computes locally ranks of matrices; using the present algorithm for localization of the solution and keeping the use of the Marching-Simplex algorithm as the end solution-refining algorithm leads to a considerable speed-up. Moreover, for values of n in the range 3-6, the resolution of the GPU is sufficient to achieve reasonable error tolerance in most cases, and for these cases the resulting point cloud in item 10 of subsection 3.2 can be used
726
L.T. Dechevsky et al.
Fig. 1. Example 1 (see section 4)
for the design of a stopping criterion for the local-refinement iterations of the Marching-Simplex algorithm which is simpler and computationally cheaper to verify than the general stopping criterion given in [8]. Future work on this topic includes efficient software implementation of the present algorithm (see items 1-10 of subsection 3.2) and its combination with the Marching-Simplex algorithm for values of n : 3 ≤ n ≤ 6, as well as comprehensive testing versus the “pure” standalone Marching-Simplex algorithm. Another promising direction of research is the use of the new basis-matching algorithm (11), (12), for GPU-based computation of d-variate orthogonal wavelet transforms, d = 3, 4, . . . (see [5]). It should be noticed that the representation of a GPU-computing architecture as a planar array of pixels with greyscale value is, of course, a gross oversimplification. It is possible to enrich the model by
Solving Non-linear Systems of Equations on Graphics Processing Units
727
Fig. 2. Example 2 (see section 4)
considering RGB colours, α-values, number of pipelines, etc., but these additional degrees of freedom only increase the dimensionality of the computing array architecture, thereby improving the maximal possible resolution of the computing architecture. The idea of the algorithm considered in section 3 remains essentially the same. In conclusion of this exposition, let us consider two model examples. Example 1. m = 1, n = 4. Consider the 4-dimensional spherical hypersurface with center at (0,0,0,0) and radius 1, given by the implicit equation x2 + y 2 + z 2 + w2 − 1 = 0,
(14)
the 4-dimensional ellipsoid hypersurface with implicit equation x2 + 2y 2 + 3z 2 + 4w2 − 1 = 0,
(15)
728
L.T. Dechevsky et al.
and the 4-dimensional hyperboloid hypersurface with implicit equation x2 + y 2 − z 2 − w2 − 1 = 0,
(16)
In Figure 1a, b, c, respectively, are given (with the same error tolerance) the graphs of the planar point clouds of item 9 in subsection 3.2 for each one of the three manifolds, including also respective zooms into these planar graphs. (For n > 4 a hierarchy of at least n − 3 consecutive zooms into each of the graphs would be needed to reveal the ellipticity, hyperbolicity, etc., with respect to each coordinate.) Example 2. m = 2, n = 4. Given is a nonlinear system of 2 equations, the first one being (14) and the second being the implicit equations of 2 parallel hyperplanes in 4D-space 2w2 − 1 = 0, (17) The solution is a surface in 4D-space, the points of which have constant w√ component (= ±1/ 2), and its projection onto the 3D-subspace Oxyz is the sphere with implicit equation x2 + y 2 + z 2 = 1/2,
(18)
In Figure 2, subfigure 1a, is given the superposition of the point clouds of item 9 of subsection 3.2 for the two hypersurfaces given by (14) and (17), respectively. In subfigure 1b is given a 3D-space projection of the resulting point cloud in item 10 of subsection 3.2, and this 3D-space point cloud is compared to the sphere (18) corresponding to the exact solution. It is seen that the distance between the point cloud and the sphere (i.e., the error) is relatively large. In Figure 2, subfigures 2a, 2b, are given the respective graphical results with higher resolution. In subfigure 2b it is seen that the error has considerably decreased.
References 1. Cohen, A.: Wavelet methods in numerical analysis. In: Ciarlet, P.G., Lions, J.L. (eds.) Handbook of Numerical Analysis, vol. VII, pp. 417–712. Elsevier, Amsterdam (2000) 2. Cohen, A., Daubechies, I., Vial, P.: Wavelets and fast wavelet transforms on the interval. Appl. Comput. Harmon. Anal. 1, 54–81 (2007) 3. Dahmen, W.: Wavelet and multiscale methods for operator equations. Acta Numerica, 55–228 (1997) 4. Dechevsky, L.T.: Atomic decomposition of function spaces and fractional integral and differential operators. In: Rusev, P., Dimovski, I., Kiryakova, V. (eds.) Transform Methods and Special Functions, part A (1999); Fractional Calculus & Applied Analysis, vol. 2(4), pp. 367–381 (1999) 5. Dechevsky, L.T., Gundersen, J., Bang, B.: Computing n-variate Orthogonal Discrete Wavelet Transforms on Graphics Processing Units. In: Lirkov, I., Margenov, S., Wa´sniewski, J. (eds.) LSSC 2009. LNCS, vol. 5910, pp. 730–737. Springer, Heidelberg (2010)
Solving Non-linear Systems of Equations on Graphics Processing Units
729
6. Dechevsky, L.T., Gundersen, J.: Isometric Conversion Between Dimension and Resolution. In: Dæhlen, M., Mørken, K., Schumaker, L. (eds.) 6th International Conference on Mathematical Methods for Curves and Surfaces, pp. 103–114. Nashboro Press, Tromsø 2004 (2005) 7. Dechevsky, L.T., Gundersen, J., Kristoffersen, A.R.: Wavelet-based isometric conversion between dimension and resolution and some of its applications. In: Proceedings of SPIE: Wavelet applications in industrial processing V, Boston, Massachusetts, USA, vol. 6763 (2007) 8. Dechevsky, L.T., Kristoffersen, A.R., Gundersen, J., Laks˚ a, A., Bang, B., Dokken, T.: A ‘Marching Simplex’ algorithm for approximate solution of non-linear systems of equations and applications to branching of solutions and intersection problems. Int. J. of Pure and Appl. Math. 33(3), 407–431 (2006) 9. Dunford, N., Schwartz, J.T.: Linear Operators. General Theory, vol. 1. Wiley, Chichester (1988) 10. Gundersen, J., Kristoffersen, A.R., Dechevsky, L.T.: Comparing Between the Marching-Cube and the Marching Simplex Methods. In: Lirkov, I., Margenov, S., Wa´sniewski, J. (eds.) LSSC 2009. LNCS, vol. 5910, pp. 756–764. Springer, Heidelberg (2010) 11. Mallat, S.: A Wavelet Tour of Signal Processing, 2nd edn. Acad. Press, New York (1999) 12. Purcell, T.J., Buck, I., Mark, W.R., Hanrahan, P.: Ray Tracing on Programmable Graphics Hardware. ACM Transactions on Graphics 21(3), 703–712 (2002); Proceedings of ACM SIGGRAPH 2002, ISSN 0730-0301 13. Rumpf, M., Strzodka, R.: Using Graphics Cards for Quantized FEM Computations. In: Proceedings VIIP 2001, pp. 193–202 (2001) 14. Rumpf, M., Strzodka, R.: Numerical Solution of Partial Differential Equations on Parallel Computers. In: Graphics Processor Units: New Prospects for Parallel Computing. Springer, Heidelberg (2004) 15. General-Purpose Computation Using Graphics Hardware, http://www.gpgpu.org/
Computing n-Variate Orthogonal Discrete Wavelet Transforms on Graphics Processing Units Lubomir Dechevsky, Joakim Gundersen, and Børre Bang Narvik University College, P.O.B. 385, N-8505 Narvik, Norway
Abstract. In [4,5] an algorithm was proposed for isometric mapping between smooth n-variate m-dimensional vector fields and fractal curves and surfaces, by using orthonormal wavelet bases. This algorithm matched only the orthonormal bases of scaling functions (the “V-spaces” of multiresolution analyses). In the present communication we shall consider a new algorithm which matches the orthonormal bases of wavelets (the “W-spaces” of multiresolution analyses). Being of Cantor diagonal type, it was applicable for both bounded and unbounded domains, but the complexity of its implementation was rather high. In [3] we proposed a simpler algorithm for the case of boundary-corrected wavelet basis on a bounded hyper-rectangle. In combination with the algorithm for the “V-spaces” from [4,5], the new algorithm provides the opportunity to compute multidimensional orthogonal discrete wavelet transform (DWT) in two ways – via the “classical” way for computing multidimensional wavelet transforms, and by using a commutative diagram of mappings of the bases, resulting in an equivalent computation on graphics processing units (GPUs). The orthonormality of the wavelet bases ensures that the direct and inverse transformations of the bases are mutually adjoint (transposed in the case of real entries) orthogonal matrices, which eases the computations of matrix inverses in the algorithm. 1D and 2D orthogonal wavelet transforms have been first implemented for parallel computing on GPUs using C++ and OpenGL shading language around the year 2000; our new algorithm allows to extend general-purpose computing on GPUs (GPGPU) also to higher-dimensional wavelet transforms. If used in combination with the Cantor diagonal type algorithm of [4,5] (the “V-space” basis matching) this algorithm can in principle be applied for computing DWT of n-variate vector fields defined on the whole Rn . However, if boundary-corrected wavelets are considered for vector-fields defined on a bounded hyper-rectangle in Rn , then the present algorithm for GPU-based computation of n-variate orthogonal DWT can be enhanced with the new simple “V-space”-basis matching algorithm of [3]. It is this version that we consider in detail in the present study.
This work was supported in part by the 2007, 2008 and 2009 Annual Research Grants of the priority R&D Group for Mathematical Modeling, Numerical Simulation and Computer Visualization at Narvik University College, Norway.
I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 730–737, 2010. c Springer-Verlag Berlin Heidelberg 2010
Computing n-Variate Orthogonal Discrete Wavelet Transforms on GPUs
1
731
Introduction
Let Vj−1 = Vj ⊕ Wj , j ∈ Z, be the nested V-subspaces and W-spaces of an orthonormal multiresolution analysis in L2 (Rn , Rm ) (see, e.g., [1,6]), “⊕” denoting the direct sum of orthogonal subspaces. In [3] we proposed a new method for matching the bases of scaling functions in the V-spaces of orthonormal multiresolution analyses in L2 -spaces of n-variate m-dimensional vector-fields. In comparison with the Cantor diagonal type method proposed earlier in [4,5], the new method of [3] is less general, but also much simpler. In [3] we apply this new simple method for the solution of systems of nonlinear equations and the solution of geometric intersection problems in n-dimensions, n = 2, 3, 4, . . .. In the present study we expand the range of applications of the new approach proposed in [4,5,3] to the computation of n-variate orthogonal discrete wavelet transforms (DWTs). All needed preliminary notation and preliminary results about isometric Hilbert spaces, orthogonal operators and their adjoints, orthogonal wavelet expansions in one and several variables and sequence norms in terms of wavelet coefficients can be found in section 2 of [3]. This work, together with [4,5,3] is part of the research of the R&D Group for Mathematical Modelling, Numerical Simulation and Computer Visualization at Narvik University College within two consecutive Strategic Projects of the Norwegian Research Council – the completed project ’GPGPU – Graphics Hardware as a High-end Computational Resource’ (2004-2007), and the ongoing project ’Heterogeneous Computing’ (2008-2010).
2
Main Result: A New Basis-Matching Algorithm
In this section we shall consider in detail the case n = d, m = 1 (the d-dimensional DWT, i.e., the DWT for processing d-variate scalar-valued fields). The case of m-dimensional vector fields, m = 2, 3, . . . , follows easily by applying the results of the present section to each coordinate of the vector field.

2.1 Canonical Isometries between Hilbert Spaces
Since all bases in consideration are orthonormal, the linear (transformation) operators/matrices A for change between such bases are unitary. For conciseness of the exposition, here we shall consider only real-valued Hilbert spaces, which means that the unitary operator A is orthogonal, or, in equivalent matrix form, A has real entries and A−1 = AT , where AT is the transposed matrix of A. Consider the Hilbert spaces $V_{d,J} = \underbrace{V_J \otimes V_J \otimes \cdots \otimes V_J}_{d\ \mathrm{times}}$, with dimension d and resolution level J, and $V_{1,dJ}$, with dimension 1 and resolution level dJ, where the operator ⊗ corresponds to the tensor product of spaces spanned over scaling function bases, as in [1,2]. The bases in $V_{d,J}$ and $V_{1,dJ}$ are denoted by $\Phi_{d,J}$ and $\Phi_{1,Jd}$, respectively, with $\dim \Phi_{d,J} = 2^{Jd} = \dim \Phi_{1,Jd}$.
Since the spaces $V_{d,J}$ and $V_{1,dJ}$ have the same dimension, and each one of them has a selected fixed basis in it, there is a "canonical" isometry $A_{d,J}$ between them which maps 1–1 the fixed basis of $V_{d,J}$ onto the fixed basis of $V_{1,dJ}$. This choice of the canonical isometry is unique modulo right composition with an orthogonal operator $\sigma_{d,J}$ and modulo left composition with an orthogonal operator $\sigma_{1,dJ}$ corresponding to a permutation of the basis in $V_{d,J}$ and $V_{1,dJ}$, respectively. Any such permutation operator would be represented by a permutation matrix (an orthogonal matrix with only one non-zero element in every row and every column, the non-zero element being equal to 1). In the commutative diagrams given below, all linear operators will be identified with their matrices.

2.2 Discrete Wavelet Transform

The space $\bar V_{d,J,j} = V_{d,j} \oplus \bigoplus_{\mu=j}^{J-1} W_{d,\mu}$ is isometric to $V_{d,J}$ for every $j \le J-1$; they
are both Hilbert spaces and have the same dimension. Note, however, that while the orthonormal basis in $V_{d,J}$ (the "space of all reconstructed signals") consists entirely of $2^{Jd}$ scaling (father wavelet) functions, the orthonormal basis in $\bar V_{d,J,j}$ (the space of all transformed (decomposed) signals) consists of only $2^{jd}$ scaling (father wavelet) functions, complemented with $2^{jd}(2^{(J-j)d} - 1)$ wavelets (mother wavelet functions).
The orthonormal basis of $\bar V_{d,J,j}$ will be denoted by $\Psi_{d,J,j}$, with $\dim \Psi_{d,J,j} = 2^{Jd}$. The isometry from $V_{d,J}$ to $\bar V_{d,J,j_d}$, $j_d \le J-1$, is given by an orthogonal operator (and orthogonal matrix) $\Omega_{d,J,j_d}$. In this case

$$\Psi_{d,J,j_d} = \Omega_{d,J,j_d} \Phi_{d,J} \qquad (1)$$

is the change of basis between $\Phi_{d,J}$ and $\Psi_{d,J,j_d}$. Indeed, let $f$ be an element of $V_{d,J}$ expanded in the basis $\Phi_{d,J}$ by $f = X^T \Phi_{d,J}$, and an element of $\bar V_{d,J,j}$ expanded in the basis $\Psi_{d,J,j}$ by $f = Y^T \Psi_{d,J,j}$, where $X$ and $Y$ are $2^{Jd} \times 2^{Jd}$ matrices. (We refer to the coordinate matrix $X$ as the reconstructed d-dimensional signal of $f$ and to the coordinate matrix $Y$ as the decomposed d-dimensional signal of $f$.) Changing the basis, we obtain

$$Y = \left(\Omega_{d,J,j_d}^T\right)^{-1} X \qquad (2)$$

and it is seen from (1) that the matrix $(\Omega_{d,J,j_d}^T)^{-1}$ is the matrix of the d-dimensional DWT. But since $\Omega_{d,J,j_d}$ is an orthogonal matrix, $(\Omega_{d,J,j_d}^T)^{-1} = \Omega_{d,J,j_d}$ holds, and $Y = \Omega_{d,J,j_d} X$.
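To make the last identity concrete, here is a minimal numerical sketch (ours, not code from any of the implementations cited in this paper): it builds the orthogonal matrix of a one-level 1D Haar DWT for a signal of length 4 (the Haar filter is chosen only because its entries are simple) and reconstructs the signal by applying the transposed matrix.

#include <array>
#include <cmath>
#include <cstdio>

// One-level Haar DWT on a length-4 signal, written as an orthogonal 4x4 matrix.
// Rows 0-1 produce the scaling (father wavelet) coefficients, rows 2-3 the wavelet ones.
using Mat4 = std::array<std::array<double, 4>, 4>;
using Vec4 = std::array<double, 4>;

Vec4 multiply(const Mat4& M, const Vec4& v) {
    Vec4 r{};
    for (int i = 0; i < 4; ++i)
        for (int j = 0; j < 4; ++j) r[i] += M[i][j] * v[j];
    return r;
}

int main() {
    const double a = 1.0 / std::sqrt(2.0);
    Mat4 Omega = {{{a, a, 0, 0},
                   {0, 0, a, a},
                   {a, -a, 0, 0},
                   {0, 0, a, -a}}};
    Mat4 OmegaT;                        // transpose = inverse, since Omega is orthogonal
    for (int i = 0; i < 4; ++i)
        for (int j = 0; j < 4; ++j) OmegaT[i][j] = Omega[j][i];

    Vec4 X = {4.0, 2.0, 5.0, 7.0};      // "reconstructed" signal (coordinates in the Phi basis)
    Vec4 Y = multiply(Omega, X);        // decomposed signal: Y = Omega X
    Vec4 Xrec = multiply(OmegaT, Y);    // reconstruction via the transposed matrix

    for (int i = 0; i < 4; ++i)
        std::printf("X = %g   Y = %g   Xrec = %g\n", X[i], Y[i], Xrec[i]);
    return 0;
}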
Fig. 1. Commutative diagram and its inverse
So, in this notation the direct DWT corresponds to the orthogonal $2^{Jd} \times 2^{Jd}$ matrix $\Omega_{d,J,j_d}$. The inverse (reconstruction) DWT then corresponds to $\Omega_{d,J,j_d}^T$.

2.3 Problem Setting
The spaces $\bar V_{d,J,j_d}$ and $\bar V_{1,dJ,j_1}$ are both Hilbert spaces with the same dimension $2^{Jd}$ and are, therefore, isometric. There are infinitely many isometries between these spaces, each one being given by one, and only one, orthogonal $2^{Jd} \times 2^{Jd}$ matrix $B_{d,J}$, provided that a fixed basis is considered in each of the two spaces.
The main object of research in the present study is as follows. Given the fixed orthonormal bases $\Phi_{d,J}$, $\Phi_{1,Jd}$, $\Psi_{d,J,j_d}$ and $\Psi_{1,Jd,j_1}$ in the isometric Hilbert spaces $V_{d,J}$, $V_{1,dJ}$, $\bar V_{d,J,j_d}$ and $\bar V_{1,Jd,j_1}$, respectively, compute the matrix $B_{d,J}$ of the orthogonal operator which makes the diagrams on Figure 1 commutative.
Analogously to $A_{d,J}$, for the fixed bases in consideration, $B_{d,J}$ is unique modulo left composition with a permutation $S_{d,J}$ of the basis in $\bar V_{d,J,j_d}$ and right composition with a permutation $S_{1,dJ}$ of the basis in $\bar V_{1,Jd,j_1}$ (see Figure 1).
From the commutative diagram it is clear that $B_{d,J}$ can be computed as $B_{d,J} = \Omega_{1,J,j_1} A_{d,J} \Omega_{d,J,j_d}^T$, provided that $\Omega_{d,J,j_d}$ and $\Omega_{1,J,j_1}$ have been computed. Because of the orthonormality of all bases and the orthogonality of all operators involved in the commutative diagram on Figure 1, it is possible to invert this diagram, with all inverses being easy to compute via transposition (see Figure 1). In particular, the inverse of $B_{d,J}$ needed for the computation of $\Omega_{d,J,j_d}^T$ is equal to $B_{d,J}^T$. Rather than computing $\Omega_{d,J,j_d}$ via the CPU, we shall use the afore-mentioned commutative diagram and its inverse to compute $\Omega_{d,J,j_d}$ for $j = j_d$ by

$$\Omega_{d,J,j_d} = B_{d,J}^T \Omega_{1,J,j_1} A_{d,J}, \qquad (3)$$

assuming that we have a realization of the orthogonal operator $B_{d,J}$ and that $\Omega_{1,J,j_1}$ is computed on the GPU.
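Equation (3) is purely compositional: once the three orthogonal factors act on coefficient vectors, the d-variate DWT is their chained application. The following C++ sketch is our own illustration (the operator names are placeholders; in the intended pipeline the middle factor $\Omega_{1,J,j_1}$ would be evaluated on the GPU) of this composition and of its inverse for coefficients stored as flat arrays.

#include <functional>
#include <vector>

using Vec = std::vector<double>;
using LinOp = std::function<Vec(const Vec&)>;   // action of an orthogonal operator on a coefficient vector

// Action of Omega_{d,J,j_d} according to (3): match the d-variate scaling basis to the
// univariate one (A_{d,J}), run the 1D DWT (Omega_{1,J,j_1}, in practice on the GPU),
// then return to the d-variate wavelet basis via the transposed basis matching B_{d,J}^T.
Vec apply_dDWT(const LinOp& A_dJ, const LinOp& Omega1, const LinOp& Bt_dJ, const Vec& X) {
    return Bt_dJ(Omega1(A_dJ(X)));
}

// The inverse transform needs only the transposed factors, applied in reverse order:
// Omega_{d,J,j_d}^T = A_{d,J}^T Omega_{1,J,j_1}^T B_{d,J}.
Vec apply_inverse_dDWT(const LinOp& At_dJ, const LinOp& Omega1t, const LinOp& B_dJ, const Vec& Y) {
    return At_dJ(Omega1t(B_dJ(Y)));
}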
Therefore, our task from now on is to find a way to compute one possible solution for $B_{d,J}$.
Remark 1. In (3), $\Omega_{1,J,j_1}$ can be replaced by its 2D representation via $\Omega_{1,J,j_1} = B_{2,J} \Omega_{2,J,j_2} A_{2,J}^T$. Then $\Omega_{d,J,j_d} = B_{d,J}^T B_{2,J} \Omega_{2,J,j_2} A_{2,J}^T A_{d,J}$ is a new isometry better adapted to the 2-dimensional architecture of the GPU. For simplicity of presentation, in this exposition we shall continue to consider $\Omega_{1,J,j_1}$. It can also be mentioned that a 1D DWT on GPU is available in the CUDA SDK (CUDA: the computing engine in NVIDIA's GPUs; SDK: Software Development Kit).

2.4 Computing $B_{d,J}$
Let $d \in \mathbb{N}$. Without loss of generality, we may assume that $J \ge 1$ and $0 \le j = j_d \le J-1$. Choose and fix $j_0$: $0 \le j_0 \le J-1$ and set $j = j_d = j_0$, $j_1 = j_0 \cdot d$. Decompose $V_{d,J}$ and $V_{1,Jd}$ into direct sums of V- and W-spaces, as follows:

$$V_{d,J}\ \text{isometric to}\ \bar V_{d,J,j_0} = V_{d,j_0} \oplus \bigoplus_{\mu=j_0}^{J-1} W_{d,\mu},$$
$$V_{1,Jd}\ \text{isometric to}\ \bar V_{1,Jd,j_1} = V_{1,j_0 d} \oplus \bigoplus_{\nu=j_0 d}^{Jd-1} W_{1,\nu}.$$

Now we shall construct an isometry between $\bar V_{d,J,j_0}$ and $\bar V_{1,Jd,j_1}$ by proposing an appropriate 1–1 bijection between their bases. Rewriting the $\bigoplus$-term in the RHS of $\bar V_{1,Jd,j_1}$ to match the RHS in $\bar V_{d,J,j_0}$ in the following way

$$\bigoplus_{\nu=j_0 d}^{Jd-1} W_{1,\nu} = \bigoplus_{\mu=j_0}^{J-1} \underbrace{\bigoplus_{\nu=\mu d}^{(\mu+1)d-1} W_{1,\nu}}_{= W^d_{1,\mu}} = \bigoplus_{\mu=j_0}^{J-1} W^d_{1,\mu},$$

we get

$$V_{d,J}\ \text{isometric to}\ \bar V_{d,J,j_0} = V_{d,j_0} \oplus \bigoplus_{\mu=j_0}^{J-1} W_{d,\mu},$$
$$V_{1,Jd}\ \text{isometric to}\ \bar V_{1,Jd,j_1} = V_{1,j_0 d} \oplus \bigoplus_{\mu=j_0}^{J-1} W^d_{1,\mu}.$$

Therefore it now suffices to design appropriate bijections between the orthonormal bases of $W_{d,\mu}$ and $W^d_{1,\mu}$, $\mu = j_0, \ldots, J-1$, given by

$$W_{d,\mu} = \bigoplus_{l=1}^{2^d-1} Z^{\sigma_{1,l}}_{1,\mu} \otimes Z^{\sigma_{2,l}}_{1,\mu} \otimes \cdots \otimes Z^{\sigma_{d,l}}_{1,\mu}$$

and

$$W^d_{1,\mu} = \bigoplus_{n=0}^{d-1} W_{1,n+\mu d}.$$

Here $\sigma_{k,l} \in \{0,1\}$, $k = 1, \ldots, d$, $l = 1, \ldots, 2^d-1$; the $\sigma_{k,l}$, $k = 1, \ldots, d$, are uniquely determined by the binary representation of $l$: $l = \sum_{k=1}^{d} \sigma_{k,l}\, 2^{d-k}$, and

$$Z^{\sigma_{k,l}}_{1,\mu} = \begin{cases} V_{1,\mu}, & \sigma_{k,l} = 0, \\ W_{1,\mu}, & \sigma_{k,l} = 1, \end{cases} \qquad k = 1, \ldots, d,\ l = 1, \ldots, 2^d-1,\ \mu = j_0, j_0+1, \ldots, J-1.$$

Fig. 2. $2^{\mu d}$-block bijection between $W_{d,\mu}$ and $W^d_{1,\mu}$, $\mu = j_0, \ldots, J-1$. The blocks of $W_{d,\mu}$ are d-dimensional of size $2^\mu$ in each dimension; the blocks of $W^d_{1,\mu}$ are 1-dimensional, of size $2^{\mu d}$.
Now $W_{d,\mu}$ is mapped 1–1 onto $W^d_{1,\mu}$ for every $\mu = j_0, \ldots, J-1$, in terms of blocks of basis functions of dimension $2^{\mu d}$ each, as follows (see Figure 2).
1. $V_{d,j_0}$ maps 1–1 onto $V_{1,j_0 d}$ by the isometric operator $A_{d,j_0}$.
2. For every $\mu = j_0, \ldots, J-1$, $W_{d,\mu}$ will be mapped to the $2^d - 1$ corresponding blocks of basis functions of $W_{1,n+\mu d}$, $n = 0, \ldots, d-1$.
3. Every corresponding block of basis functions in $W_{d,\mu}$ (of dimension d, with size $2^\mu$ in each dimension) is then mapped 1–1 onto a respective block of basis functions of $W^d_{1,\mu}$ (of dimension 1 and size $2^{\mu d}$) by the isometric operator $A_{d,\mu}$.

The selection of the operator $A_{d,\mu}$ can be made in many different ways, for example, by the original Cantor diagonal method developed in [4,5], or by the method proposed in [3]. Here we opt for the latter method, which is simpler to implement. Namely (see subsection 3.1 in [3]), $A_{d,\mu}: (k_1, \ldots, k_d) \in \{0, 1, \ldots, 2^\mu - 1\}^d \to n \in \{0, 1, \ldots, 2^{\mu d} - 1\}$ is defined by

$$n = \sum_{i=1}^{d} k_i \cdot 2^{\mu(d-i)}.$$

Its inverse (and, in view of its orthogonality, also its adjoint) $A^T_{d,\mu}: n \in \{0, 1, \ldots, 2^{\mu d} - 1\} \to (k_1, \ldots, k_d) \in \{0, 1, \ldots, 2^\mu - 1\}^d$ is given by

$$\left[\frac{n}{2^{\mu(d-1)}}\right] = k_1, \quad n - k_1 \cdot 2^{\mu(d-1)} = n_1,$$
$$\left[\frac{n_1}{2^{\mu(d-2)}}\right] = k_2, \quad n_1 - k_2 \cdot 2^{\mu(d-2)} = n_2,$$
$$\vdots$$
$$\left[\frac{n_{d-1}}{2^{\mu(d-d)}}\right] = k_d, \quad n_{d-1} - k_d \cdot 2^0 = 0,$$

where $[a]$ denotes, as customary, the integer part of $a \ge 0$.
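In coefficient-index terms, $A_{d,\mu}$ is simply the row-major flattening of a d-dimensional index into a single radix-$2^\mu$ index, and its adjoint/inverse is the corresponding un-flattening by integer division. The following C++ sketch is our own illustration of the two formulas above; it assumes $\mu d \le 64$ so that the flattened index fits into a 64-bit integer.

#include <cstdint>
#include <vector>

// Forward map A_{d,mu}: (k_1,...,k_d) in {0,...,2^mu-1}^d  ->  n in {0,...,2^{mu d}-1},
//   n = sum_{i=1..d} k_i * 2^{mu(d-i)}   (radix-2^mu digits, most significant first).
// Assumes mu*d <= 64.
std::uint64_t flatten(const std::vector<std::uint64_t>& k, unsigned mu) {
    std::uint64_t n = 0;
    for (std::uint64_t ki : k) n = (n << mu) | ki;   // n = n * 2^mu + k_i
    return n;
}

// Inverse (= adjoint) map A_{d,mu}^T: n -> (k_1,...,k_d), obtained by successive
// integer division by powers of 2^mu, i.e. by reading off the radix-2^mu digits.
std::vector<std::uint64_t> unflatten(std::uint64_t n, unsigned d, unsigned mu) {
    std::vector<std::uint64_t> k(d);
    const std::uint64_t mask = (std::uint64_t{1} << mu) - 1;
    for (unsigned i = d; i-- > 0;) {    // k_d is the least significant digit
        k[i] = n & mask;                // remainder modulo 2^mu
        n >>= mu;                       // integer part of n / 2^mu
    }
    return k;
}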
3 Concluding Remarks
The isometric mapping constructed here can be used as the basis for completion of several tasks, as follows:
– Fast solution of nonlinear operator equations on the GPU. For example, consider finding the manifold obtained by the implicit d-dimensional m-variate equation $\vec f(\vec x) = 0$:
  • developing a pixel-based intersection algorithm in d-dimensional space, for any d = 2, 3, 4, . . .;
  • generating the topology of the intersection (the ordering of the point cloud);
  • first results in this direction are announced in [3].
– GPU-computable multidimensional orthogonal DWT. As an important part of future work on this topic we see a detailed comparison of the performance of the new GPU-computable d-dimensional DWT discussed in this study to a standard CPU-computable d-dimensional DWT. As part of
the research leading to [8], we conducted a comparative study of the execution time for processing a set of model images by a standard CPU-computable 2-dimensional wavelet transform and a respective GPU-computable 2-dimensional wavelet transform based on the OpenGL shading language. On average, the GPU-based version was approximately 30 times faster [7]. More recently, a new generation of GPUs has been introduced, with a vast increase in programmability, computational power and speed, and it can be expected that using such a newer GPU would result in a speed-up of more than 10^2 times. We conjecture that increasing the dimension d of the DWT will lead to an increase of this speed-up.
References
1. Cohen, A.: Wavelet methods in numerical analysis. In: Ciarlet, P.G., Lions, J.L. (eds.) Handbook of Numerical Analysis, vol. VII, pp. 417–712. Elsevier, Amsterdam (2000)
2. Dechevsky, L.T.: Atomic decomposition of function spaces and fractional integral and differential operators. In: Rusev, P., Dimovski, I., Kiryakova, V. (eds.) Transform Methods and Special Functions, Part A (1999); Fractional Calculus & Applied Analysis, vol. 2(4), pp. 367–381 (1999)
3. Dechevsky, L.T., Bang, B., Gundersen, J., Lakså, A., Kristoffersen, A.R.: Solving Non-linear Systems of Equations on Graphics Processing Units. In: Lirkov, I., Margenov, S., Waśniewski, J. (eds.) LSSC 2009. LNCS, vol. 5910, pp. 719–729. Springer, Heidelberg (2010)
4. Dechevsky, L.T., Gundersen, J.: Isometric Conversion Between Dimension and Resolution. In: Dæhlen, M., Mørken, K., Schumaker, L. (eds.) 6th International Conference on Mathematical Methods for Curves and Surfaces, pp. 103–114. Nashboro Press, Tromsø (2005) ISBN 0-0-9728482-4-X
5. Dechevsky, L.T., Gundersen, J., Kristoffersen, A.R.: Wavelet-based isometric conversion between dimension and resolution and some of its applications. In: Proceedings of SPIE: Wavelet Applications in Industrial Processing V, Boston, Massachusetts, USA, vol. 6763 (2007)
6. Mallat, S.: A Wavelet Tour of Signal Processing, 2nd edn. Acad. Press, New York (1999)
7. Moguchaya, T.: GM-Waves: A Wavelet Software Library. MSc thesis, Narvik University College (2004)
8. Moguchaya, T., Grip, N., Dechevsky, L.T., Bang, B., Lakså, A., Tong, B.: Curve and surface fitting by wavelet shrinkage using GM Waves. In: Dæhlen, M., Mørken, K., Schumaker, L. (eds.) Mathematical Methods for Curves and Surfaces, pp. 263–274. Nashboro Press, Brentwood (2005)
Wavelet Compression, Data Fitting and Approximation Based on Adaptive Composition of Lorentz-Type Thresholding and Besov-Type Non-threshold Shrinkage

Lubomir T. Dechevsky¹, Joakim Gundersen¹, and Niklas Grip²

¹ Narvik University College, P.O.B. 385, N-8505 Narvik, Norway
² Department of Mathematics, Luleå University of Technology, SE-971 87 Luleå, Sweden
Abstract. In this study we initiate the investigation of a new advanced technique, proposed in Section 6 of [3], for generating adaptive Besov–Lorentz composite wavelet shrinkage strategies. We discuss some advantages of the Besov–Lorentz approach compared to firm thresholding.
1 Introduction
In [3] we considered 3 types of signals:
– The typical case of quasi-sparseness of the wavelet-coefficient vector is when the signal is sufficiently smooth. In this case, it is usually sufficient to apply nonadaptive threshold shrinkage strategies such as, for example, hard and soft thresholding (in its various global, levelwise or block versions) (see, e.g., [6,7,8,9]).
– Fractal signals and images which are continuous but nonsmooth everywhere (a simple classical example being the Weierstrass function). In this case, the vector of wavelet coefficients looks locally full everywhere. This general case was specifically addressed in [5], where a family of wavelet-shrinkage procedures of nonthreshold type was considered for very general classes of signals that belong to the general scale of Besov spaces and may have a full, non-sparse, vector of wavelet coefficients (see, in particular, [5], Appendix B, item B9).
– The intermediate case of spatially inhomogeneous signals which exhibit both smooth regions and regions with (isolated, or continual fractal) singularities. For this case, most interesting and most important for the applications, nonadaptive thresholding shrinkage tends to oversmooth the signal in a neighbourhood of every singularity, while nonadaptive nonthresholding shrinkage tends
Research supported in part by the 2008 and 2009 Annual Research Grants of the priority R&D Group for Mathematical Modeling, Numerical Simulation and Computer Visualization at Narvik University College, Norway. Research supported in part by the Swedish Research Council (project registration number 2004-3862).
to undersmooth the signal in the regions where the signal has regular behaviour and a locally quasi-sparse wavelet-coefficient vector.
On the basis of the preliminary study of this topic in [5,12], a method was proposed in [3] (Section 6) for developing a next, second generation of composite wavelet shrinkage strategies, having the new and very remarkable property of adapting to the local sparseness or nonsparseness of the vector of wavelet coefficients, with simultaneously improved performance near singularities as well as in smooth regions.
A first attempt at designing an efficient strategy for adaptive wavelet thresholding was the so-called firm thresholding, which consistently outperformed adaptive non-composite wavelet shrinkage techniques such as soft and hard wavelet thresholding, which appear as limiting cases of the newer firm thresholding. For this purpose, in [3] we proposed to upgrade firm thresholding to incorporate all of the afore-mentioned wavelet shrinkage strategies within the very general setting of the so-called K-functional Tikhonov regularization of incorrectly posed inverse deterministic and stochastic problems [5], thereby obtaining a lucid uniform comparative characterization of the above-said approaches and their interrelations.
The new approach proposed in [3] suggests, instead, to apply for the first time a new type of thresholding: the so-called Lorentz-type thresholding (based on the decreasing rearrangement of the wavelet coefficient vector, as outlined for the first time in [5], Appendix B, item B10(b)). Using this idea, in [3] we propose to upgrade firm thresholding to a composite adaptive shrinkage procedure based on a data-dependent composition of the new Lorentz-type thresholding with the nonthreshold shrinkage procedures of [5]. It can be shown that the composition of these new highly adaptive strategies achieves the best possible rate of compression over all signals with prescribed Besov regularity (smooth signals as well as fractals). This is valid for univariate and multivariate signals.
A full theoretical analysis of this construction would be very lengthy and would require considerable additional theoretical and technical effort. We intend to return to this theoretical analysis in the near future. In this short exposition we shall carry out a preliminary comparative graphical analysis on a benchmark image with a local singularity (which has already been used for this purpose in [3,4,5,12]).
2 Preliminaries

2.1 Riesz Unconditional Wavelet Bases in Besov Spaces
For the definition of the inhomogeneous Besov spaces $B^s_{pq}(\mathbb{R}^n)$ (and for the respective range of the parameters p, q, s) we refer to [3, Section 4]. The same section in [3] contains the necessary information about the Riesz wavelet bases of orthogonal scaling functions $\varphi^{[0]}_{j_0 k}$ and wavelets $\psi^{[l]}_{jk}$, as well as their respective scaling and wavelet coefficients $\alpha_{j_0 k}$ and $\beta^{[l]}_{jk}$, $j = j_0, \ldots, j_1 - 1$, where $j_0, j_1 \in \mathbb{N}$, $j_0 < j_1$.
2.2 Non-parametric Regression
For the benchmark model of non-parametric regression with noise variance $\delta^2$ considered in this paper, see [3, Section 4, formula (7)]. The empirical scaling and wavelet coefficients $\hat\alpha_{j_0 k}$ and $\hat\beta^{[l]}_{jk}$ are also defined there, in formula (13).

2.3 Wavelet Shrinkage
The methodology to estimate f is based on the principle of shrinking wavelet coefficients towards zero to remove noise, which means reducing the absolute value of the empirical coefficients. (Mother) wavelet coefficients (β-coefficients) having small absolute value contain mostly noise. The important information at every resolution level is encoded in the coefficients on that level which have large absolute value. One of the most important applications of wavelets, denoising, began after observing that shrinking wavelet coefficients towards zero and then reconstructing the signal has the effect of denoising and smoothing.
To fix terminology, a thresholding shrinkage rule sets to zero all coefficients with absolute values below a certain threshold level $\lambda \ge 0$, whilst a non-thresholding rule shrinks the non-zero wavelet coefficients towards zero, without actually setting to zero any of the nonzero coefficients. The cases when threshold shrinkage should be preferred over non-threshold shrinkage, and vice versa, were briefly discussed in Section 1.
I. Threshold shrinkage. This is the most explored wavelet shrinkage technique.
(i) Hard and soft threshold shrinkage. The hard and soft thresholding rules proposed by Donoho and Johnstone [7,8,9] (see also the seminal paper [6] of Delyon and Juditsky) for smooth functions are given respectively by

$$\delta^{hard}(x; \lambda) = \begin{cases} x, & \text{if } |x| > \lambda \\ 0, & \text{if } |x| \le \lambda \end{cases} \qquad (1)$$

and

$$\delta^{soft}(x; \lambda) = \begin{cases} \operatorname{sgn}(x)\,(|x| - \lambda), & \text{if } |x| > \lambda \\ 0, & \text{if } |x| \le \lambda \end{cases} \qquad (2)$$
where $\lambda \in [0, \infty)$ is the threshold value. Asymptotically, both hard and soft shrinkage estimates achieve within a log n factor of the ideal performance. Soft thresholding (a continuous function) is a 'shrink' or 'kill' rule, while hard thresholding (a discontinuous function) is a 'keep' or 'kill' rule.
(ii) Firm thresholding. After the initial enthusiasm of wavelet practitioners and theorists in the early 1990s generated by the introduction of hard and soft thresholding, a more serene second look at these techniques revealed a number of imperfections, and various attempts were made to design more comprehensive threshold rules (for further details
on this topic, see [5,12,3] and the references therein). A relatively successful attempt was the introduction by Gao and Bruce [10] of the firm thresholding rule

$$\delta^{firm}(x; \lambda_1, \lambda_2) = \begin{cases} \operatorname{sgn}(x)\,\lambda_2\,\dfrac{|x| - \lambda_1}{\lambda_2 - \lambda_1}, & \text{if } |x| \in (\lambda_1, \lambda_2] \\ x, & \text{if } |x| > \lambda_2 \\ 0, & \text{if } |x| \le \lambda_1 \end{cases} \qquad (3)$$

By choosing appropriate thresholds $(\lambda_1, \lambda_2)$, firm thresholding outperforms both hard and soft thresholding, which are its two extreme limiting cases. Indeed, note that

$$\lim_{\lambda_2 \to \infty} \delta^{firm}(x; \lambda_1, \lambda_2) = \delta^{soft}(x; \lambda_1) \qquad (4)$$

and

$$\lim_{\lambda_2 \to \lambda_1} \delta^{firm}(x; \lambda_1, \lambda_2) = \delta^{hard}(x; \lambda_1). \qquad (5)$$
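For reference, the three rules (1)-(3) are straightforward to implement coefficient-wise; the following C++ sketch is ours and is not taken from [10] or [3].

#include <cmath>

// Hard thresholding (1): a 'keep' or 'kill' rule.
double delta_hard(double x, double lambda) {
    return (std::fabs(x) > lambda) ? x : 0.0;
}

// Soft thresholding (2): a 'shrink' or 'kill' rule.
double delta_soft(double x, double lambda) {
    return (std::fabs(x) > lambda) ? std::copysign(std::fabs(x) - lambda, x) : 0.0;
}

// Firm thresholding (3) of Gao and Bruce, with thresholds 0 <= lambda1 < lambda2.
// As lambda2 -> infinity it tends to soft thresholding (4), as lambda2 -> lambda1 to hard thresholding (5).
double delta_firm(double x, double lambda1, double lambda2) {
    const double ax = std::fabs(x);
    if (ax <= lambda1) return 0.0;
    if (ax > lambda2)  return x;
    return std::copysign(lambda2 * (ax - lambda1) / (lambda2 - lambda1), x);
}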
The advantage of firm thresholding compared to its limiting cases comes at a price: two thresholds are required instead of only one, which doubles the dimensionality of optimization problems related to finding data-dependent optimal thresholds (such as cross-validation, entropy minimization, etc.). Nevertheless, with the ever increasing computational power of the new computer generations, the advantages of firm thresholding compared to soft and hard thresholding tend to outweigh the higher computational complexity of firm thresholding algorithms.
(iii) Lorentz-curve thresholding. Vidakovic [13] proposed a thresholding method based on the Lorentz curve for the energy in the wavelet decomposition. A brief outline of the idea of Lorentz-curve thresholding can be found in subsection 5.3 of [3].
(iv) General Lorentz thresholding. This is a far-going generalization of the concept of Lorentz-curve thresholding based on the combined use of two deep function-analytical and operator-theoretical facts: (A) computability of the Peetre K-functional between Lebesgue spaces in terms of the non-increasing rearrangement of a measurable function; (B) isometricity of Besov spaces to vector-valued sequence spaces of Lebesgue type. This construction was introduced in [5], and the details, which we omit here for conciseness, can be found in [5], Appendix B, Item B10(b), or in [3], subsection 6.4. Item (B) is extended in a fairly straightforward way to the more general scale of Nikol'skii-Besov spaces with (quasi)norm (see [5], Appendix B, Item B12 and the references therein).
II. Non-thresholding shrinkage. In the last 10-15 years there has been increasing interest in the study of fractals and singularity points of functions (discontinuities of the function or its derivatives, cusps, chirps, etc.), and
this raised the necessity of studying non-threshold wavelet shrinkage. At this point, while threshold rules can be considered well explored, non-threshold rules, on the contrary, are fairly new, and the corresponding theory is so far at an initial stage. This can be explained by the fact that traditionally only very smooth functions have been estimated.
In [5] a family of wavelet-shrinkage estimators of non-threshold type was proposed, and further studied in [12,3]; these estimators are particularly well adapted for functions belonging to Besov spaces that have a full, non-sparse, vector of wavelet coefficients. The approach, proposed by Dechevsky, Ramsay and Penev in [5], parallels Wahba's spline smoothing technique based on Tikhonov regularization of ill-posed inverse problems, upgraded in [5] to be able to handle in a uniform way also the case of fitting less regular curves, surfaces, volume deformations and, more generally, multivariate vector fields belonging to Besov spaces. The relevant details about the Besov non-threshold wavelet shrinkage proposed in [5] can be found there in Appendix B, Item B9(a,b), as well as in [12], Section 5, or in [3], subsection 5.2.
Similar to the general Lorentz thresholding, the above Besov shrinkage strategies are obtained as a consequence of some deep function-analytical and operator-theoretical facts, namely, as follows.
(C) The metrizability of quasinormed abelian groups via the Method of Powers (see [1], section 3.10).
(D) The Theorem of Powers for the real interpolation method of Peetre–Lions (see [1], section 3.11), leading to explicit computation of the K-functional, which in this case is also called quasilinearization (see [1]).

2.4 Composite Besov-Lorentz Shrinkage
One criterion, which was announced in Section 6 of [3] for the first time, is to use the real interpolation spaces between $B^\sigma_{\pi\pi}$ and $B^s_{pp}$, where the parameters $p, \pi, s$ and $\sigma$ are as in [5], Appendix B, Item B9(a). It is known that the Besov scale is closed under real interpolation, i.e., $(B^\sigma_{\pi\pi}, B^s_{pp})_{\theta, p(\theta)} = B^{s(\theta)}_{p(\theta)p(\theta)}$, where $0 \le \theta \le 1$ and $p(\theta)$ is defined by $\frac{1}{p(\theta)} = \frac{1-\theta}{\pi} + \frac{\theta}{p}$. The parameter θ, which is a coefficient in a convex combination, determines the degree to which the composite estimator is of Lorentz type and the degree to which it is a Besov shrinkage-type estimator. Since general Lorentz shrinkage is a threshold method, while Besov shrinkage is of non-threshold type, the last observation also implies that θ can be used as a control parameter for regulating the compression rate. The parameter θ can be computed via cross-validation, considerations of asymptotic minimax rate, Bayesian techniques, or any other procedure for statistical parametric estimation. The resulting composite estimator is in this case highly adaptive to the local smoothness of the estimated function. We omit a more detailed exposition of this topic here, since it merits a much more detailed study.
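As a purely schematic illustration of the rearrangement-based ingredient of such a composition (our own sketch; the actual Besov–Lorentz construction of [3,5] is considerably more refined, and the function below is hypothetical), the following C++ fragment reads a threshold level off the decreasing rearrangement of the coefficient magnitudes so that the retained coefficients carry a prescribed fraction of the total energy; the retained coefficients would then be passed on to a non-threshold (Besov-type) shrinkage step, which is not reproduced here.

#include <algorithm>
#include <cmath>
#include <functional>
#include <vector>

// Schematic Lorentz-type threshold selection: sort |beta_k| in decreasing order
// (the decreasing rearrangement) and return the smallest magnitude that must be
// retained so that the kept coefficients carry `energyFraction` of the total energy.
// Illustration only; this is not the estimator of [3,5].
double lorentz_threshold_level(const std::vector<double>& beta, double energyFraction) {
    std::vector<double> mags(beta.size());
    double total = 0.0;
    for (std::size_t i = 0; i < beta.size(); ++i) {
        mags[i] = std::fabs(beta[i]);
        total += mags[i] * mags[i];
    }
    std::sort(mags.begin(), mags.end(), std::greater<double>());
    double acc = 0.0;
    for (double m : mags) {
        acc += m * m;
        if (acc >= energyFraction * total) return m;   // smallest retained magnitude
    }
    return 0.0;                                        // keep everything
}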
Fig. 1. Non-parametric regression estimation of “λ-tear”, λ = 0.25. Noise: white, unbiased, δ 2 = 0.01. Sample size N = 1024. Wavelet basis of the estimators: orthonormal Daub 6. Wavelet shrinkage strategies: firm and Lorentz-Besov.
Note that the valuable additional functionality offered by the composite Besov–Lorentz shrinkage is based again on several deep function-analytical and operator-theoretical facts which can be outlined as follows: properties (A-D) stated above, and facts (E), (F) given below.
(E) The reiteration theorem for the real interpolation method of Peetre–Lions (see [1], sections 3.4 and 3.11).
(F) The generalization, via the Holmstedt formula (see [1], section 3.6), of the formula for computation of the Peetre K-functional between Lebesgue spaces in terms of the non-increasing rearrangement of a measurable function (see [1], section 5.2).
3 Besov-Lorentz Shrinkage versus Firm Thresholding
Composite Besov-Lorentz shrinkage has considerably more control parameters than firm thresholding and, therefore, optimization with respect to all parameters of the Besov-Lorentz model would be a considerably more challenging computational problem than optimization related to firm thresholding.
Fig. 2. Under the conditions in Figure 1, the noise variance is δ 2 = 0.1
However, this is outweighed by the several advantages of Besov-Lorentz shrinkage, some of the most important of which are listed below.
1. The firm thresholding rule has been designed as a unification of the hard and soft thresholding scale; it is not related to any function-analytic properties of the estimated signal. On the contrary, Besov-Lorentz shrinkage is derived from the important properties (A-F) stated above. In particular, Besov-Lorentz shrinkage provides a convenient framework for fine control of the optimization which, unlike firm thresholding, can be performed under a rich variety of meaningful constraints. This allows the introduction of bias in the estimation process, whenever information about such bias is available, with drastic improvement in the quality of estimation.
2. The optimization criterion proposed in [10] for optimizing the thresholds of firm thresholding is of entropy type. It is general but, at the same time, it is inflexible with respect to the introduction of meaningful bias information which may be available a priori.
3. For appropriate values of the parameters of Besov shrinkage (even without invoking Lorentz thresholding), one of the two possible limiting cases of firm thresholding, soft thresholding, is attained within the Besov scale (see [5], p. 344). On the other hand, the other limiting case of firm thresholding, that is, hard thresholding, can be attained within the Lorentz thresholding model, provided that this model is based on a Nikol'skii-Besov space scale. (The
standard Besov space scales (with s(x) ≡ const, x ∈ R^n) are insufficient for implementing the hard threshold rule within the Lorentz threshold setting.) It can further be shown that any shrinkage strategy attained via the firm thresholding rule can also be attained within the Besov-Lorentz composite strategy, but not vice versa.
Some more conclusions on the comparison between firm and Besov-Lorentz shrinkage can be drawn after comparative graphical analysis of the performance of the two shrinkage strategies in Figures 1 and 2, for the benchmark image "λ-tear" (see [5], Example 1), with λ = 0.25. In order to make firm thresholding sufficiently competitive, instead of the usual entropy-based optimization criterion we have optimized firm thresholding here with respect to the same criterion as Besov-Lorentz shrinkage: a criterion based on a Nikol'skii-Besov scale which takes into consideration the singularity of "λ-tear" at x = 0. Figure 1 displays the performance of the two estimators for medium-to-large noise (the noise variance δ² = 0.01 is the same as in Example 1 in [5]; the average noise amplitude is about 35% of the amplitude of the original signal); Figure 2 shows the performance of the estimators in the case of extremely large noise amplitude (noise variance δ² = 0.1; the average noise amplitude exceeds 100% of the amplitude of the original signal). The sample size for both Figures 1 and 2 is N = 1024; the compression rate of the Besov-Lorentz shrinkage is very high: 98.6% for Figure 1 and 99.4% for Figure 2.
Note that in both Figures 1 and 2 Besov-Lorentz shrinkage outperforms firm thresholding in a neighbourhood of the singularity at x = 0 (this is especially well distinguishable in the presence of large noise (Figure 2), where firm thresholding exhibits the typical "oversmoothing" behaviour in a neighbourhood of x = 0). Note that the "overfitting" by the Besov-Lorentz estimator in the one-sided neighbourhoods left and right of the singularity at x = 0 is due to the relatively large support of the Daub 6 orthonormal wavelet used (which, on its part, is a consequence of the requirement for sufficient smoothness of the Daubechies wavelet). If in the place of Daubechies orthonormal wavelets one uses biorthonormal wavelets or, even better, multiwavelets or wavelet packets (see, e.g., [11]), the decrease of the support of the wavelets will lead to removal of the above-said overfit while retaining the advantages of better fitting of the singularity.
Summarizing our findings, we can extend the list of comparison items 1-3, as follows.
4. The trade-off between error of approximation and rate of compression can be efficiently controlled with Besov-Lorentz shrinkage, while with firm thresholding no such control is available.
5. Besov-Lorentz shrinkage outperforms firm thresholding in fitting singularities. If conventional orthonormal Daubechies wavelets are used, the good fit of isolated singularities comes at the price of overfitting smooth parts of the signal neighbouring the respective isolated singularity. However, this overfit can be removed by using multiwavelets or wavelet packets which, unlike Daubechies orthogonal wavelets, simultaneously combine sufficient smoothness with narrow support.
References
1. Bergh, J., Löfström, J.: Interpolation Spaces. An Introduction. Grundlehren der Mathematischen Wissenschaften, vol. 223. Springer, Berlin (1976)
2. Dechevsky, L.T.: Atomic decomposition of function spaces and fractional integral and differential operators. In: Rusev, P., Dimovski, I., Kiryakova, V. (eds.) Transform Methods and Special Functions, Part A (1999); Fractional Calculus & Applied Analysis, vol. 2, pp. 367–381 (1999)
3. Dechevsky, L.T., Grip, N., Gundersen, J.: A new generation of wavelet shrinkage: adaptive strategies based on composition of Lorentz-type thresholding and Besov-type non-thresholding shrinkage. In: Proceedings of SPIE: Wavelet Applications in Industrial Processing V, Boston, MA, USA, vol. 6763, article 676308, pp. 1–14 (2007)
4. Dechevsky, L.T., MacGibbon, B., Penev, S.I.: Numerical methods for asymptotically minimax non-parametric function estimation with positivity constraints I. Sankhya, Ser. B 63(2), 149–180 (2001)
5. Dechevsky, L.T., Ramsay, J.O., Penev, S.I.: Penalized wavelet estimation with Besov regularity constraints. Mathematica Balkanica (N. S.) 13(3-4), 257–356 (1999)
6. Delyon, B., Juditsky, A.: On minimax wavelet estimators. Applied and Computational Harmonic Analysis 3, 215–228 (1996)
7. Donoho, D.L., Johnstone, I.M.: Ideal spatial adaptation via wavelet shrinkage. Biometrika 81(3), 425–455 (1994)
8. Donoho, D.L., Johnstone, I.M.: Minimax estimation via wavelet shrinkage. Annals of Statistics 26(3), 879–921 (1998)
9. Donoho, D.L., Johnstone, I.M., Kerkyacharian, G., Picard, D.: Wavelet shrinkage: Asymptopia? Journal of the Royal Statistical Society Series B 57(2), 301–369 (1995)
10. Gao, H.-Y., Bruce, A.G.: WaveShrink with firm shrinkage. Statist. Sinica 7(4), 855–874 (1997)
11. Mallat, S.: A Wavelet Tour of Signal Processing, 2nd edn. Acad. Press, New York (1999)
12. Moguchaya, T., Grip, N., Dechevsky, L.T., Bang, B., Lakså, A., Tong, B.: Curve and surface fitting by wavelet shrinkage using GM Waves. In: Dæhlen, M., Mørken, K., Schumaker, L. (eds.) Mathematical Methods for Curves and Surfaces, pp. 263–274. Nashboro Press, Brentwood (2005)
13. Vidakovic, B.: Statistical Modeling by Wavelets. Wiley, New York (1999)
On Interpolation in the Unit Disk Based on Both Radon Projections and Function Values

Irina Georgieva¹ and Rumen Uluchev²

¹ Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, Acad. G. Bonchev, Bl. 8, Sofia 1113, Bulgaria
[email protected]
² Department of Mathematics and Informatics, Todor Kableshkov HST (VTU), 158 Geo Milev Str., Sofia 1574, Bulgaria
[email protected]
Abstract. We consider interpolation of a function in two variables on the unit disk with bivariate polynomials, based on its Radon projections and function values. This is closely related to surface and image reconstruction. Due to the practical importance of these problems, many mathematicians have recently dealt with interpolation and smoothing of bivariate functions from data consisting of prescribed Radon projections or mixed type of data: Radon projections and function values. Here we present some new results and numerical experiments in this field.
1 Introduction and Preliminaries
There are a lot of practical problems, e.g. in computer tomography, plasma tomography, electronic microscopy, etc., where information about the processes can be derived as line integrals of functions. Due to the great importance of these fields of science, many researchers turn to mathematical methods for processing data of Radon projections type. Such methods in engineering and medicine have been investigated by many mathematicians [5,12,13,15,16] and others. Interpolation, smoothing, and numerical integration of bivariate functions with data consisting of prescribed Radon projections was considered in [1,2,3,4,6,7,8,10,12,14]. In [7] smoothing of mixed type of data, both Radon projections and function values, was studied. Recent efforts of the authors are directed to interpolation by bivariate polynomials of mixed type of data. Here we present a regular scheme of mixed type consisting of chords and points of the unit circle in the plane.
Denote by $\Pi^2_n$ the set of all real algebraic polynomials in two variables of total degree at most n. Then, if $P \in \Pi^2_n$ we have

$$P(x, y) = \sum_{i+j \le n} \alpha_{ij} x^i y^j, \qquad \alpha_{ij} \in \mathbb{R}.$$

The dimension of the linear space $\Pi^2_n$ is $\binom{n+2}{2}$.
Let $B := \{\mathbf{x} = (x, y) \in \mathbb{R}^2 : \|\mathbf{x}\| \le 1\}$ be the unit disk in the plane, where $\|\mathbf{x}\| = \sqrt{x^2 + y^2}$. Given $t \in [-1, 1]$ and an angle of measure $\theta \in [0, \pi)$, the equation $x \cos\theta + y \sin\theta - t = 0$ defines a line perpendicular to the vector $(\cos\theta, \sin\theta)$ and passing through the point $(t\cos\theta, t\sin\theta)$. The set $I(\theta, t)$, defined as the intersection of this line with $B$, is a chord of the unit disk B, where θ determines the direction of $I(\theta, t)$ and t is the distance of the chord from the origin.
Suppose that for a given function $f: \mathbb{R}^2 \to \mathbb{R}$ the integrals of f exist along all line segments in the unit disk B. For fixed $t \in [-1, 1]$ and $0 \le \theta < \pi$ the Radon projection of the function f over the segment $I(\theta, t)$ is defined by

$$R_\theta(f; t) := \int_{I(\theta, t)} f(\mathbf{x})\, d\mathbf{x}.$$
Observe that $R_\theta(\,\cdot\,; t)$ is a linear functional. Since $I(\theta, t) \equiv I(\theta+\pi, -t)$, it follows that $R_\theta(f; t) = R_{\theta+\pi}(f; -t)$. Thus, the assumption above on the direction of the chords, $0 \le \theta < \pi$, is not a loss of generality. It is well known that the set of Radon projections $\{R_\theta(f; t) : -1 \le t \le 1,\ 0 \le \theta < \pi\}$ determines f uniquely (see John [11], Radon [16]). According to a more recent result of Solmon [17], an arbitrary function $f \in L^1(\mathbb{R}^2)$ with compact support in B is uniquely determined by any infinite set of Radon projections. It was shown by Marr [12] that every polynomial $P \in \Pi^2_n$ can be reconstructed uniquely from its projections in only a finite number of directions. Marr's formula is a basic equality in the Chebyshev-Fourier analysis which shows the reduction of the multivariate case to a univariate one.
An important role in the approximation of functions on the unit disk is played by the Chebyshev polynomials of second kind. We denote the Chebyshev polynomial of degree m as usual by

$$U_m(t) := \frac{\sin(m+1)\theta}{\sin\theta}, \qquad t = \cos\theta.$$

They are used for the construction of an orthonormal basis of $\Pi^2_n$, which is a helpful tool in studying functions in the unit disk.
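To get a concrete feel for the functional $R_\theta(f; t)$, the following C++ sketch (our own helper; the authors' experiments below were run in Mathematica) approximates the integral of f over the chord I(θ, t) by parametrizing the chord by arc length and applying a composite midpoint rule.

#include <algorithm>
#include <cmath>
#include <functional>

// Composite midpoint approximation of the Radon projection
//   R_theta(f; t) = integral of f over the chord I(theta, t) of the unit disk.
// The chord is parametrized by arc length s as
//   p(s) = (t cos(theta) - s sin(theta), t sin(theta) + s cos(theta)),
// with s in [-sqrt(1 - t^2), sqrt(1 - t^2)].
double radon_projection(const std::function<double(double, double)>& f,
                        double theta, double t, int nSamples = 1000) {
    const double half = std::sqrt(std::max(0.0, 1.0 - t * t));   // half-length of the chord
    const double h = 2.0 * half / nSamples;
    const double c = std::cos(theta), s = std::sin(theta);
    double sum = 0.0;
    for (int i = 0; i < nSamples; ++i) {
        const double u = -half + (i + 0.5) * h;                  // midpoint of the i-th subinterval
        sum += f(t * c - u * s, t * s + u * c);
    }
    return sum * h;
}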
2 Interpolation Problem for Radon Projections Type of Data
For a given scheme of chords $I_k$, $k = 1, \ldots, \binom{n+2}{2}$, of the unit circle ∂B, find a polynomial $P \in \Pi^2_n$ satisfying the conditions

$$\int_{I_k} P(\mathbf{x})\, d\mathbf{x} = \gamma_k, \qquad k = 1, \ldots, \binom{n+2}{2}. \qquad (1)$$
If (1) has a unique solution for every given set of values {γk } the interpolation problem is called poised and the scheme of chords – regular.
Firstly, Hakopian [10] proved that the set of $\binom{n+2}{2}$ chords joining any given n+2 points on the unit circle is a regular scheme. Later Bojanov and Georgieva [1] provided a new family of regular schemes. The Radon projections are taken along a set of $\binom{n+2}{2}$ chords $\{I(\theta, t)\}$ of the unit circle, partitioned into n+1 subsets, such that the k-th subset consists of k+1 parallel chords. More precisely, consider the scheme (Θ, T), where $\Theta := \{\theta_0, \theta_1, \ldots, \theta_n\}$, $0 \le \theta_0 < \cdots < \theta_n < \pi$, and $T := \{t_{ki}\}$ is a triangular matrix of distances $1 > t_{kk} > \cdots > t_{kn} > -1$, corresponding to the angle measures $\theta_k$, $k = 0, \ldots, n$. The problem is to find a polynomial $P \in \Pi^2_n$ satisfying the interpolation conditions

$$\int_{I(\theta_k, t_{ki})} P(\mathbf{x})\, d\mathbf{x} = \gamma_{ki}, \qquad k = 0, \ldots, n,\ i = k, \ldots, n. \qquad (2)$$
A necessary and sufficient condition for regularity of the schemes of this type was proved by Bojanov and Georgieva [1]. But for a given set of points T, it is often difficult to determine whether the problem (2) has a unique solution. Several regular schemes of this type were suggested by Georgieva and Ismail [6] and by Georgieva and Uluchev [7], where the distances of the segments from the origin correspond to the zeros of orthogonal polynomials. Another kind of regular scheme was proposed by Bojanov and Xu [4]. It consists of $2\lfloor\frac{n+1}{2}\rfloor + 1$ equally spaced directions with $\lfloor\frac{n}{2}\rfloor + 1$ chords, associated with the zeros of the Chebyshev polynomials, in each direction.
3 Interpolation Problem for Mixed Type of Data
Now we consider interpolation using mixed type of data: both Radon projections and function values at points lying on the unit circle. Suppose that
• $\Theta := \{\theta_0, \theta_1, \ldots, \theta_n\}$, $0 \le \theta_0 < \cdots < \theta_n < \pi$;
• $T := \{t_{ki}\}$ is a triangular matrix with $1 > t_{kk} > \cdots > t_{k,n-1} > -1$, $k = 0, \ldots, n-1$;
• $X := \{x_0, \ldots, x_n\}$, where $x_k$ are points on the unit circle.
The problem is to find a polynomial $P \in \Pi^2_n$ satisfying the interpolation conditions

$$\int_{I(\theta_k, t_{ki})} P(\mathbf{x})\, d\mathbf{x} = \gamma_{ki}, \qquad k = 0, \ldots, n-1,\ i = k, \ldots, n-1,$$
$$P(x_k) = f_k, \qquad k = 0, \ldots, n. \qquad (3)$$

Note that the difference with problem (2) is that we replace the interpolation condition on the smallest chord in each direction $\theta_k$ with a function value interpolation condition at a point $x_k$. If (3) has a unique solution for every given set of values $\{\gamma_{ki}\}$ and $\{f_k\}$ we call the interpolation problem (3) poised and the scheme of chords and points (Θ, T, X) regular.
Let us introduce the following matrices:

$$\mathbf{U}^*_k := \begin{pmatrix} U_k(t_{kk}) & U_{k+1}(t_{kk}) & \ldots & U_{n-1}(t_{kk}) & U_n(t_{kk}) \\ U_k(t_{k,k+1}) & U_{k+1}(t_{k,k+1}) & \ldots & U_{n-1}(t_{k,k+1}) & U_n(t_{k,k+1}) \\ \vdots & \vdots & & \vdots & \vdots \\ U_k(t_{k,n-1}) & U_{k+1}(t_{k,n-1}) & \ldots & U_{n-1}(t_{k,n-1}) & U_n(t_{k,n-1}) \\ U_k(-1) & U_{k+1}(-1) & \ldots & U_{n-1}(-1) & U_n(-1) \end{pmatrix}.$$

The following result is a necessary and sufficient condition for the interpolation problem (3) to be poised. The proof will appear in a subsequent article [9].
Theorem 1. For a given set of chords and points (Θ, T, X) of the unit circle in the plane with $0 \le \theta_0 < \cdots < \theta_n < \pi$, $1 > t_{kk} > \cdots > t_{k,n-1} > -1$, and $x_k = (-\cos\theta_k, -\sin\theta_k)$, $k = 0, \ldots, n$, the interpolation problem (3) is poised if and only if $\det \mathbf{U}^*_k \ne 0$, $k = 0, \ldots, n$.
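Theorem 1 reduces the poisedness check to n + 1 small determinants. The following C++ sketch (ours, for illustration only) assembles $\mathbf{U}^*_k$ for a given k from Chebyshev values computed by the three-term recurrence and tests $\det \mathbf{U}^*_k \ne 0$ by Gaussian elimination with partial pivoting.

#include <cmath>
#include <utility>
#include <vector>

// Chebyshev polynomial of the second kind, U_m(t), via the three-term recurrence
// U_0 = 1, U_1 = 2t, U_{m+1} = 2t U_m - U_{m-1}.
double chebU(int m, double t) {
    double u0 = 1.0, u1 = 2.0 * t;
    if (m == 0) return u0;
    for (int i = 2; i <= m; ++i) { const double u2 = 2.0 * t * u1 - u0; u0 = u1; u1 = u2; }
    return u1;
}

// Determinant of U*_k for the scheme (Theta, T, X): rows correspond to the chord
// distances t_{kk},...,t_{k,n-1} and to the point x_k (i.e. to the argument -1),
// columns to U_k,...,U_n.  tk must hold t_{kk},...,t_{k,n-1}.
double detUstar(int k, int n, const std::vector<double>& tk) {
    const int size = n - k + 1;
    std::vector<std::vector<double>> M(size, std::vector<double>(size));
    for (int row = 0; row < size; ++row) {
        const double arg = (row < size - 1) ? tk[row] : -1.0;
        for (int col = 0; col < size; ++col) M[row][col] = chebU(k + col, arg);
    }
    double det = 1.0;   // Gaussian elimination with partial pivoting
    for (int p = 0; p < size; ++p) {
        int piv = p;
        for (int r = p + 1; r < size; ++r)
            if (std::fabs(M[r][p]) > std::fabs(M[piv][p])) piv = r;
        if (std::fabs(M[piv][p]) < 1e-14) return 0.0;            // numerically singular
        if (piv != p) { std::swap(M[piv], M[p]); det = -det; }
        det *= M[p][p];
        for (int r = p + 1; r < size; ++r) {
            const double factor = M[r][p] / M[p][p];
            for (int c = p; c < size; ++c) M[r][c] -= factor * M[p][c];
        }
    }
    return det;   // the scheme passes the test for this k whenever the result is non-zero
}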
4 Regular Schemes for Mixed Type of Data
Here we give a regular scheme for interpolation based on mixed type of data. The following assertion on Chebyshev systems is well known.
Proposition 1. The functions $\{\sin lx\}_{l=1}^{m}$ form a Chebyshev system in $(0, \pi)$.
Now we state and prove the main result in this survey.
Theorem 2. Let n be a given positive integer, and
(i) $\Theta = \{\theta_0, \ldots, \theta_n\}$, $0 \le \theta_0 < \cdots < \theta_n < \pi$;
(ii) $t_{ki} = \eta_i = \cos\frac{(i+1)\pi}{n+1}$, $i = k, \ldots, n-1$, be the zeros of the Chebyshev polynomial of second kind $U_n(x)$;
(iii) $x_k = (-\cos\theta_k, -\sin\theta_k)$, $k = 0, \ldots, n$.
Then the interpolation problem (3) is poised, i.e. the scheme for interpolation (Θ, T, X) is regular.
Proof. According to Theorem 1 it is sufficient to prove that $\det \mathbf{U}^*_k \ne 0$ for all $k = 0, \ldots, n$. Let us fix an integer k, $0 \le k \le n-1$. For each $j = 1, \ldots, n$, we obtain

$$U_{n-j}(t_{k,n-l}) = \frac{(-1)^{n-j+1} \sin\big(l\,\frac{(n-j+1)\pi}{n+1}\big)}{\sin\frac{(n-l+1)\pi}{n+1}}.$$
We shall make use of the notation

$$\Delta_k = \begin{vmatrix} \varphi_{n-k}(x_{n-k}) & \varphi_{n-k}(x_{n-k-1}) & \ldots & \varphi_{n-k}(x_0) \\ \varphi_{n-k-1}(x_{n-k}) & \varphi_{n-k-1}(x_{n-k-1}) & \ldots & \varphi_{n-k-1}(x_0) \\ \vdots & \vdots & & \vdots \\ \varphi_1(x_{n-k}) & \varphi_1(x_{n-k-1}) & \ldots & \varphi_1(x_0) \\ (k+1)(-1) & (k+2)(-1)^2 & \ldots & (n+1)(-1)^{n-k+1} \end{vmatrix},$$

where $\varphi_l(x) = \sin lx$, $l = 1, \ldots, n$, and $x_j = \frac{(n-j+1)\pi}{n+1}$, $j = 1, \ldots, n$.
Now recall the property $U_m(-1) = (m+1)(-1)^m$ of the Chebyshev polynomials of second kind. Factoring out the common multipliers of the rows and columns in the determinant $\det \mathbf{U}^*_k$ we get

$$\det \mathbf{U}^*_k = \Delta_k \prod_{j=1}^{n-k} \frac{(-1)^{n-j+1}}{\sin\frac{(n-j+1)\pi}{n+1}},$$
where obviously the product factor does not vanish. We expand $\Delta_k$ in minors corresponding to the elements of the last row. If we denote by $A_{n-k+1,j}$ the minor of $\Delta_k$, obtained by taking the determinant of $\Delta_k$ with the last row and column j "crossed out", we obtain

$$\Delta_k = \sum_{j=1}^{n-k+1} (-1)^{j+n-k+1} (k+j)(-1)^j A_{n-k+1,j} = (-1)^{n-k+1} \sum_{j=1}^{n-k+1} (k+j) A_{n-k+1,j}.$$
Observe that from Proposition 1 above it follows that

$$\begin{vmatrix} \varphi_{n-k}(\tau_{n-k}) & \varphi_{n-k}(\tau_{n-k-1}) & \ldots & \varphi_{n-k}(\tau_1) \\ \varphi_{n-k-1}(\tau_{n-k}) & \varphi_{n-k-1}(\tau_{n-k-1}) & \ldots & \varphi_{n-k-1}(\tau_1) \\ \vdots & \vdots & & \vdots \\ \varphi_1(\tau_{n-k}) & \varphi_1(\tau_{n-k-1}) & \ldots & \varphi_1(\tau_1) \end{vmatrix} \ne 0$$

for all $0 < \tau_1 < \cdots < \tau_n < \pi$. Then all minors $A_{n-k+1,j}$ do not vanish and have one and the same sign, hence

$$\sum_{j=1}^{n-k+1} (k+j) A_{n-k+1,j} \ne 0.$$
Therefore $\Delta_k \ne 0$ for all $k = 0, 1, \ldots, n-1$. For k = n we have $\Delta_n = U_n(-1) = (n+1)(-1)^n$ and $\Delta_n \ne 0$, too. Applying Theorem 1 we conclude that the interpolation problem (3) is poised and the given data (Θ, T, X) is a regular scheme.
5 Numerical Experiments
Here we present some results from numerical experiments carried out using the Mathematica software by Wolfram Research Inc. We apply the regular scheme of chords and points (Θ, T, X) from Theorem 2 with $\theta_k = \frac{(k+1)\pi}{n+2}$, $k = 0, \ldots, n$, in all examples below.
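The scheme itself is easy to generate. A small C++ sketch (ours; the experiments reported here were actually run in Mathematica) producing the angles θ_k, the chord distances t_ki and the boundary points x_k of Theorem 2 with $\theta_k = \frac{(k+1)\pi}{n+2}$ is given below.

#include <cmath>
#include <utility>
#include <vector>

struct MixedScheme {
    std::vector<double> theta;                  // theta_k = (k+1) pi / (n+2), k = 0..n
    std::vector<std::vector<double>> t;         // t[k] holds t_{ki} = cos((i+1) pi / (n+1)), i = k..n-1
    std::vector<std::pair<double, double>> x;   // boundary points x_k = (-cos theta_k, -sin theta_k)
};

// Regular scheme of chords and points from Theorem 2, with the equally spaced
// directions used in the numerical experiments.
MixedScheme buildScheme(int n) {
    const double pi = 3.14159265358979323846;
    MixedScheme s;
    for (int k = 0; k <= n; ++k) {
        const double th = (k + 1) * pi / (n + 2);
        s.theta.push_back(th);
        s.x.emplace_back(-std::cos(th), -std::sin(th));
        std::vector<double> row;                 // zeros of U_n: eta_i = cos((i+1) pi / (n+1))
        for (int i = k; i <= n - 1; ++i) row.push_back(std::cos((i + 1) * pi / (n + 1)));
        s.t.push_back(row);
    }
    return s;
}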
Example 1. We compare the interpolation based on Radon projections only with the interpolation based on mixed type of data for the function

$$f(x, y) = \frac{\sin\big(3\pi\sqrt{(x-0.3)^2 + y^2 + 10^{-18}}\big)}{3\pi\sqrt{(x-0.3)^2 + y^2 + 10^{-18}}}.$$

In Figure 1 the graphs of f(x, y), the interpolant $P_{16}(x, y)$ and the error $f(x, y) - P_{16}(x, y)$ for the problem (2) are shown. The regular scheme consists of 153 chords only, corresponding to the zeros of $U_{17}(t)$, with $\theta_k = \frac{(k+1)\pi}{18}$, $k = 0, \ldots, 16$, and $t_{ki} = \cos\frac{(i+1)\pi}{18}$, $i = k, \ldots, 16$.
f − Q16 2 = 1.16826 × 10−4 , f 2
f − P16 ∞ = 2.91338 × 10−2 , f ∞
f − Q16 ∞ = 2.59069 × 10−4 . f ∞
1
Fig. 1. Graphs of f(x, y), the interpolant $P_{16}(x, y)$, and the error $f(x, y) - P_{16}(x, y)$
Fig. 2. Graphs of the interpolant $Q_{16}(x, y)$ and the error $f(x, y) - Q_{16}(x, y)$
Example 2. We try our mixed regular scheme for interpolation of the function

$$f(x, y) = \frac{\sin\big(3\pi\sqrt{(x-0.3)^2 + y^2 + 10^{-18}}\big)}{3\pi\sqrt{(x-0.3)^2 + y^2 + 10^{-18}}} + 1.3\,\frac{\sin\big(3\pi\sqrt{(x+0.4)^2 + (y+0.1)^2 + 10^{-18}}\big)}{3\pi\sqrt{(x+0.4)^2 + (y+0.1)^2 + 10^{-18}}}$$

with polynomials $P_n(x, y)$ of degree n = 10, 12, 14, 16. In Figure 3 we present the graphs of the function, its interpolating polynomial of degree n = 10 and the error function. Calculations show that for n ≥ 12 there is no visible difference between the graphs of the function and its interpolating polynomial of degree n. The relative interpolation error is given in Table 1.
Fig. 3. Graphs of f(x, y), the interpolating polynomial $P_{10}(x, y)$, and the error $f(x, y) - P_{10}(x, y)$
Table 1.

n                           10                 12                 14                 16
$\|f - P_n\|_2 / \|f\|_2$    6.15932 × 10^{-2}  8.82093 × 10^{-3}  4.67868 × 10^{-4}  8.09702 × 10^{-5}
Example 3. We interpolate the function f (x, y) = sin 2x + sin 3y with polynomials Pn (x, y) of degree n = 8, 10, 12, 15. The relative error for the interpolation problem (3) is given in Table 2.
Table 2.

n                                      8                  10                 12                 15
$\|f - P_n\|_2 / \|f\|_2$               5.01005 × 10^{-4}  9.77689 × 10^{-6}  1.55919 × 10^{-7}  8.92670 × 10^{-9}
$\|f - P_n\|_\infty / \|f\|_\infty$     1.51321 × 10^{-3}  4.80045 × 10^{-5}  9.79461 × 10^{-6}  1.75293 × 10^{-7}
Acknowledgement This research was supported by the Bulgarian Ministry of Education and Science under Grant No. VU-I-303/07.
References
1. Bojanov, B., Georgieva, I.: Interpolation by bivariate polynomials based on Radon projections. Studia Math. 162, 141–160 (2004)
2. Bojanov, B., Petrova, G.: Numerical integration over a disc. A new Gaussian cubature formula. Numer. Math. 80, 39–59 (1998)
3. Bojanov, B., Petrova, G.: Uniqueness of the Gaussian cubature for a ball. J. Approx. Theory 104, 21–44 (2000)
4. Bojanov, B., Xu, Y.: Reconstruction of a bivariate polynomial from its Radon projections. SIAM J. Math. Anal. 37, 238–250 (2005)
5. Davison, M.E., Grunbaum, F.A.: Tomographic reconstruction with arbitrary directions. Comm. Pure Appl. Math. 34, 77–120 (1981)
6. Georgieva, I., Ismail, S.: On recovering of a bivariate polynomial from its Radon projections. In: Bojanov, B. (ed.) Constructive Theory of Functions, pp. 127–134. Marin Drinov Academic Publishing House, Sofia (2006)
7. Georgieva, I., Uluchev, R.: Smoothing of Radon projections type of data by bivariate polynomials. J. Comput. Appl. Math. 215(1), 167–181 (2008)
8. Georgieva, I., Uluchev, R.: Surface reconstruction and Lagrange basis polynomials. In: Lirkov, I., Margenov, S., Waśniewski, J. (eds.) LSSC 2007. LNCS, vol. 4818, pp. 670–678. Springer, Heidelberg (2008)
9. Georgieva, I., Uluchev, R.: Interpolation of mixed type data by bivariate polynomials (to appear)
10. Hakopian, H.: Multivariate divided differences and multivariate interpolation of Lagrange and Hermite type. J. Approx. Theory 34, 286–305 (1982)
11. John, F.: Abhängigkeiten zwischen den Flächenintegralen einer stetigen Funktion. Math. Anal. 111, 541–559 (1935)
12. Marr, R.: On the reconstruction of a function on a circular domain from a sampling of its line integrals. J. Math. Anal. Appl. 45, 357–374 (1974)
13. Natterer, F.: The Mathematics of Computerized Tomography. Classics in Applied Mathematics, vol. 32. SIAM, Philadelphia (2001)
14. Nikolov, G.: Cubature formulae for the disk using Radon projections. East J. Approx. 14, 401–410 (2008)
15. Pickalov, V., Melnikova, T.: Plasma Tomography. Nauka, Novosibirsk (1995) (in Russian)
16. Radon, J.: Über die Bestimmung von Funktionen durch ihre Integralwerte längs gewisser Mannigfaltigkeiten. Ber. Verch. Sächs. Akad. 69, 262–277 (1917)
17. Solmon, D.C.: The X-ray transform. J. Math. Anal. Appl. 56(1), 61–83 (1976)
Comparison between the Marching-Cube and the Marching-Simplex Methods

Joakim Gundersen, Arnt R. Kristoffersen, and Lubomir T. Dechevsky

Narvik University College, P.O.B. 385, N-8505 Narvik, Norway
Abstract. The marching-cube algorithm is one of the efficient algorithms for computing the solutions of low-dimensional nonlinear systems of equations. It is widely used in industrial applications involving intersection problems in 2, 3 and, possibly, higher dimensions. In 2006, a research team including the authors of this article proposed a new 'marching' approach which differs essentially from the marching-cube approach; we named this new algorithm the Marching Simplex Algorithm. Some of the advantages of the marching simplex algorithm were mentioned already at the time of its introduction. However, a detailed comparison between the two algorithms has not been made so far, and the purpose of this article is to address the issues of such a comparison.
1 Introduction
We consider the following problem about the solution of a system of n nonlinear equations, n = 1, 2, . . ., with n+m variables, m = 0, 1, 2, . . . (or, in equivalent geometric form, the problem of computing the intersection of n hyper-surfaces in $\mathbb{R}^{n+m}$):

$$f_i(x_1, \ldots, x_n, x_{n+1}, \ldots, x_{n+m}) = 0, \qquad i = 1, \ldots, n. \qquad (1)$$

We consider (1) under the following general assumptions:
– The solution manifold has possibly variable rank r, $0 \le r \le n$, at different points of the solution manifold.
– At different points of the solution manifold the system (1) may be solvable with respect to a different (n + m − r)-tuple of variables among $x_1, \ldots, x_{n+m}$, which will be implicit functions of the remaining r independent variables among $x_1, \ldots, x_{n+m}$ in a neighborhood of this point of the solution manifold.
If we assume that the point $P = (x_{1,P}, \ldots, x_{n,P}, x_{n+1,P}, \ldots, x_{n+m,P})^T \in \mathbb{R}^{n+m}$ belongs to the solution manifold of (1), then (1) can be rewritten at P in the following way:

$$f_i(x_1, \ldots, x_{r(P)}, y_{r(P)+1}, \ldots, y_{n+m}) = 0, \qquad i = 1, \ldots, n, \qquad (2)$$
Research supported in part by the 2006, 2007 and 2008 Annual Research Grants of the priority R&D Group for Mathematical Modeling, Numerical Simulation and Computer Visualization at Narvik University College, Norway.
where $0 \le r(P) \le n$ and

$$x_j = x_{j,P},\ j = 1, \ldots, r(P), \qquad y_l = x_{r(P)+l,P},\ l = 1, \ldots, n+m-r(P), \qquad (3)$$
and at P the system (2) can be solved explicitly with respect to the y-variables in (3) as functions of the x-variables in (3). To cover all possible cases when P varies over the solution manifold, it is necessary to extend (2) to all possible permutations of the n+m variables $x_1, \ldots, x_{n+m}$ in (2). Note that, for all possible permutations in (3), and for any choice of the point P on the solution manifold in (1), (2),

$$1 \le r(P) \le n, \qquad n + m - r(P) \ge m. \qquad (4)$$

We shall pay special attention to the cases of planar curves (m = n = 1), surfaces in 3D (n = 1, m = 2) and curves in 3D obtained as intersections of two surfaces (n = 2, m = 1). These most important geometric cases correspond to constant maximal rank of the solution manifold. These are also the main cases of interest in computer-aided design (CAD), computer-aided geometric design (CAGD), scientific visualization in natural sciences, engineering and industrial applications, as well as for the development of simulators and computer games.
For all cases in two and three dimensions (i.e., m + n = 2 or m + n = 3) there are some established "industrial standard" methods for computing and visualizing intersections. One of these methods is ray tracing (see, e.g., [13,7]), which has the advantage of only dealing with 1-dimensional intersection problems, since it is based on collecting information about intersections with geometric objects on the way of a visual ray, and transforming this information into colour corresponding to a given pixel within a given frame (time step). This method produces highly realistic images, but at the cost of extreme computational demands. A high-resolution real-time ray tracing algorithm is very unlikely to appear in the foreseeable future.
An alternative real-time algorithm producing realistic level curves in 2 dimensions and level surfaces in 3 dimensions is the Marching Square/Cube algorithm. We have investigated in detail and used this algorithm for creating a software application for animated scientific visualization of the results obtained via the simulation testbed BedSim of the Swedish company LKAB (http://lkab.com), see [9]. In the process of developing this software, we became aware not only of the computational strongpoints of this algorithm, but also of its weaknesses. Because of these weaknesses, we addressed the issue of upgrading it in [11], where we considered one vertex-based CPU-computational and one pixel-based GPU-computational upgrade of the Marching Square/Cube algorithm. Although these two upgrades proved to enhance considerably the range of applicability and the efficiency of the original Marching Square/Cube algorithm, these upgrades were still quite restrictive in a number of important aspects. In [5] we
Acronyms: CPU and GPU stand for “central” and “graphics” processing units of the computer, respectively.
proposed a "marching" algorithm which is essentially free from all limitations and constraints typical for the Marching Square/Cube algorithm and its various upgrades. This algorithm works in the most general setting (1-4) and remains efficient for values of m and n which are higher than with any of the marching square/cube versions.
The purpose of the present paper is to concisely compare the original Marching Square/Cube algorithm, as used in [9], its CPU/vertex-based and its GPU/pixel-based upgrades from [11], and the Marching Simplex algorithm proposed in [5]. The main new contribution in this paper is the comparative analysis in Section 3. Section 4 contains important information about future work related to the connection between the Marching Simplex algorithm and a new upgrade of the Marching Cube algorithm based on the concept of wavelet-based isometric conversion between dimension and resolution; see [3] and the references therein.
2 A Hierarchy of "Marching" Algorithms for Intersection
Here we give a concise outline of the Marching Square/Cube algorithm, its vertex-based CPU-computable upgrade from [11], its pixel-based GPU-computable upgrade from [11], and the most general Marching Simplex algorithm proposed in [5].

2.1 Marching Square/Cube Algorithm
In the general setting (1-4), the Marching Cube algorithm can be used for finding only hypersurfaces in $\mathbb{R}^{m+1}$ (n = 1, m = 0, 1, . . .). Its practical use is mainly for computing isocurves (n = m = 1) in $\mathbb{R}^2$ and isosurfaces (n = 1, m = 2) in $\mathbb{R}^3$. These algorithms are based on different lookup tables, depending on the dimension. This is a straightforward way which in 2 and 3 dimensions is fairly fast. The lookup table for 2D has only 2^4 = 16 entries and for 3D it has 2^8 = 256 entries. There are several places on the Internet where such tables are available. We have chosen to base our algorithm on the lookup table in [2]. The basic principle behind the marching square/cube algorithm is:
– Subdivide the space into a series of smaller squares/cubes.
– "March" through each of them, testing whether the corner points are above or below the given iso-value (i.e., testing the sign of the LHS in the one and only equation (n = 1) in (1)).
– Depending on the result of the testing, the lookup tables are used to decide what kind of iso-curve/-surface configuration we have inside the square/cube.
– By interpolating the edges between the corners it is possible to find an approximate solution to where the iso-curve/-surface intersects the square/cube.
For more details about the marching square/cube algorithm see, e.g., [2].
(Note: the Marching Cube algorithm has, until recently, been patented by William E. Lorensen and Harvey E. Cline; this 20-year patent expired in June 2005.)
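To make the lookup-table principle above concrete, here is a minimal sketch, in Python, of the per-cell work of the Marching Square algorithm (the 2D case). The function names and the cell handling are our own illustration, not the code of [2] or [9]; a complete implementation would pair the edge crossings into line segments via the 16-entry lookup table.

```python
# A minimal sketch of one "march" step for the iso-curve f(x, y) = c on one cell.
def marching_square_cell(f, x0, y0, h, c=0.0):
    # Corner values, ordered (x0,y0), (x0+h,y0), (x0+h,y0+h), (x0,y0+h).
    corners = [(x0, y0), (x0 + h, y0), (x0 + h, y0 + h), (x0, y0 + h)]
    vals = [f(x, y) - c for (x, y) in corners]
    # 4-bit index into the 2^4 = 16 entry lookup table: sign of each corner.
    index = sum(1 << i for i, v in enumerate(vals) if v > 0.0)
    if index in (0, 15):
        return []            # cell entirely above or below the iso-value

    def interp(p, q, vp, vq):
        # Linear interpolation of the crossing point along one cell edge.
        t = vp / (vp - vq)
        return (p[0] + t * (q[0] - p[0]), p[1] + t * (q[1] - p[1]))

    edges = [(0, 1), (1, 2), (2, 3), (3, 0)]
    points = [interp(corners[i], corners[j], vals[i], vals[j])
              for (i, j) in edges
              if (vals[i] > 0.0) != (vals[j] > 0.0)]
    # A real implementation pairs these points into segments via the lookup
    # table (which also resolves the two ambiguous corner configurations).
    return points

# Example: one cell of the unit circle x^2 + y^2 = 1
# print(marching_square_cell(lambda x, y: x * x + y * y, 0.5, 0.5, 0.5, c=1.0))
```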
2.2 A Vertex-Based CPU-Computable Upgrade
The Marching Square/Cube algorithm, even considered in its greatest possible generality (in which case one should refer to it as the Marching Hypercube algorithm), is still quite restrictive: within the general setting (1-4) it handles only the particular case n = 1 (i.e., when the intersection is a hypersurface in R^{n+m}). In [11] we made an attempt to construct an upgrade of “the Marching Hypercube algorithm” which should be able to handle problem (1-4) in full generality, i.e., for all n = 1, 2, . . ., under the only constraint that either the solution manifold is compact, or only a compact part of the solution manifold will be computed. The main idea can be outlined as follows.

1. Assuming that Ω ⊂ R^{n+m}, where Ω is compact, find a hypercube (or hyper-rectangle) K_{n+m} ⊂ R^{n+m} so that Ω ⊆ K_{n+m} (it may be additionally required that K_{n+m} be minimal with the above-said property; we shall not discuss this topic in detail here). We assume that the sides of K_{n+m} are parallel to the coordinate axes in R^{n+m}.
2. For every one of the n + m sides of K_{n+m} consider a strictly increasing knot-vector (i.e., all knots are simple) with sufficiently small maximal distance between two neighbouring knots, and define the piecewise affine B-spline system for this knot-vector. Then, approximate the LHS of each of the n equations in (1) by (n + m)-variate tensor-product Lagrange interpolation with piecewise affine B-splines in every variable.
3. In the place of (1), solve the approximating system of n equations with n + m unknowns on K_{n+m}, as obtained in item 2. The solution amounts to a “marching hyper-rectangle” algorithm where for each hyper-rectangle a system of n (n+m)-polylinear equations has to be solved (a small illustration is given after this subsection). In [11] we outlined how this system can be solved for n + m = 2 and n + m = 3. The construction of the solution can be extended by induction for any k = 1, 2, 3, 4, . . ., with n + m = k.
4. The solution of the approximating system of item 3 for any given hyper-rectangle in K_{n+m} is an intersection (possibly void) of n hypersurfaces in R^{n+m}. If this solution has a non-void intersection with the considered hyper-rectangle in K_{n+m}, then, and only then, this intersection is accepted as part of the approximate solution manifold. In this way, while “marching” over all hyper-rectangles in the tensor-product grid in K_{n+m}, the whole approximate solution manifold in K_{n+m} is obtained. (The procedure would eventually require also a finalizing intersection with Ω ⊂ K_{n+m}.)

For further details on this upgrade of the Marching Square/Cube algorithm see [11].
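The following sketch illustrates the per-cell computation referred to in items 2-3 above: on one hyper-rectangle the tensor-product piecewise affine interpolant of an equation is polylinear, and its 2^(n+m) coefficients are determined by the corner values. This is our own illustration under the stated setting, not the implementation of [11].

```python
import numpy as np
from itertools import product

def polylinear_coefficients(corner_values, lows, highs):
    # corner_values[(e_1,...,e_k)] = value at the corner with x_j = lows[j]
    # if e_j == 0 and x_j = highs[j] if e_j == 1 (k = n + m).
    # Returns the coefficients A[alpha] of  sum_alpha A[alpha] * prod_j x_j^alpha_j,
    # alpha in {0,1}^k, obtained by solving a 2^k x 2^k linear system.
    k = len(lows)
    exps = list(product((0, 1), repeat=k))
    corners = list(product((0, 1), repeat=k))
    M = np.array([[np.prod([(highs[j] if c[j] else lows[j]) ** a[j]
                            for j in range(k)])
                   for a in exps] for c in corners])
    rhs = np.array([corner_values[c] for c in corners])
    return dict(zip(exps, np.linalg.solve(M, rhs)))

# Example for k = 2 (bilinear case): the four corner values of one cell
# determine A_00 + A_10 x + A_01 y + A_11 x*y on that cell.
# coeffs = polylinear_coefficients({(0, 0): 1.0, (1, 0): -0.5,
#                                   (0, 1): 0.25, (1, 1): 2.0},
#                                  lows=(0.0, 0.0), highs=(1.0, 1.0))
```

Repeating this for each of the n equations on every cell produces the system of n polylinear equations whose zero set is intersected with the hyper-rectangle.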
2.3 A Pixel-Based GPU-Computable Upgrade
This was the second alternative upgrade of the original Marching Square/Cube algorithm proposed in [11], as a GPU-computable analogue of
the CPU-computable construction in subsection 2.2. In its basic version, considered in detail in [11], this method works for the cases

n + m = 1: n = 1, m = 0;
n + m = 2: n = 1, m = 1 or n = 2, m = 0;
n + m = 3: n = 1, m = 2 or n = 2, m = 1 or n = 3, m = 0;   (5)
which essentially cover all cases of 3-dimensional geometric modelling of intersections in CAGD. In [11] we specifically considered the cases n = 1, m = 1 (implicit equation of a curve in the plane) and n = 2, m = 1 (implicit equations of a curve in 3-dimensional space). The key tool for solving both of these problems using the GPU was the 1-1 mapping of the values of a parametric surface in 3D defined on Ω ⊂ R^2 as an RGB-colouring of its parametric domain Ω, where the value of “red”, “green” and “blue” corresponded to the (appropriately scaled) value of the x, y and z coordinate of a point P on the surface, respectively. (This type of bijective colour coding of parametric surfaces in 3D was introduced in [4] as “RGB-colour map (mode 2)”.) For further details and examples about this upgrade of the Marching Square/Cube algorithm, see [11].
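A rough sketch of this colour-coding step is given below: the (scaled) coordinates of the surface points over the parametric domain Ω are packed into the R, G and B channels of a texture. The function name and the scaling by a bounding box are our assumptions; the actual “RGB-colour map (mode 2)” of [4] may differ in details.

```python
import numpy as np

def encode_surface_as_rgb(surface, u, v, bounds):
    # surface(U, V) -> (X, Y, Z); u, v are 1D parameter grids;
    # bounds = (mins, maxs) gives the bounding box used for scaling to [0, 1].
    U, V = np.meshgrid(u, v, indexing="ij")
    P = np.stack(surface(U, V), axis=-1)           # shape (len(u), len(v), 3)
    mins, maxs = (np.asarray(b, dtype=float) for b in bounds)
    return (P - mins) / (maxs - mins)              # RGB texture in [0, 1]^3

# Example: a bilinear patch encoded over [0, 1]^2
# tex = encode_surface_as_rgb(lambda U, V: (U, V, U * V),
#                             np.linspace(0, 1, 256), np.linspace(0, 1, 256),
#                             bounds=((0, 0, 0), (1, 1, 1)))
```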
2.4 The Marching Simplex Algorithm
Testing of the algorithm in subsection 2.2 showed that it has a number of deficiencies (more about this will be said in the next section). The source of these deficiencies was, in the first place, the nonlinearity of the polylinear system of equations to be solved for every hyper-rectangle in the grid in K_{n+m}. In [5] we addressed the problem of eliminating these deficiencies by modifying the tiling over which the “marching” is performed: in the place of hyper-rectangles in R^{n+m}, we considered hyper-simplices in R^{n+m}. The most important consequence of this change of the tiling is that the system of n equations with n + m unknowns which has to be solved for every tile turns out, in the case of hyper-simplices, to be linear (and not just polylinear, as was the case with hyper-rectangular tiles). The Marching Simplex algorithm proved to be a simple and robust general approach to computing intersections, free of the deficiencies of the tensor-product approach of subsection 2.2. For further details, see [5].
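The essential difference from the tensor-product upgrade can be seen in the following minimal sketch (ours, not the code of [5]): on a simplex the interpolants are affine, so each “marching” step reduces to one small linear system whose solution set is an affine subspace, to be clipped against the simplex afterwards.

```python
import numpy as np

def simplex_affine_system(vertices, values):
    # vertices: (k+1, k) array, the simplex vertices in R^k, k = n + m
    # values:   (n, k+1) array, values of the n equations at the vertices
    # Returns (G, c) with G of shape (n, k): the affine interpolants are G x + c.
    k = vertices.shape[1]
    A = np.hstack([vertices, np.ones((vertices.shape[0], 1))])  # rows [v_j, 1]
    coef = np.linalg.solve(A, values.T)       # columns are [g_i; c_i]
    return coef[:k].T, coef[k]

def solution_affine_set(G, c):
    # Particular solution and an orthonormal basis of the null space of G:
    # the (generically m-dimensional) affine set approximating the manifold
    # inside the simplex.
    x0, *_ = np.linalg.lstsq(G, -c, rcond=None)
    _, s, vt = np.linalg.svd(G)
    null = vt[np.sum(s > 1e-12):]
    return x0, null
```

Because the per-tile system is linear, no case analysis over polylinear coefficients is needed, which is the main reason for the robustness observed in the next section.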
3 Comparison of the Considered Four Marching Algorithms
The main advantage of the original Marching Square/Cube algorithm (see subsection 2.1) is that it is a fast and simple method for computing iso-hypersurfaces in R^{m+1}. However, its applicability is limited to essentially only the values m + 1 = 2 and m + 1 = 3, since the number of lookup-table entries for higher dimensions increases fast: for 4D we get 2^16 = 65536 entries and, for 5D, 2^32 = 4294967296 entries! (In [1] an algorithm for automatic generation of a lookup table of the iso-surface and its triangulation for all possible 2^{2^k} entries of the hypercube in a k-dimensional regular grid is proposed, but its efficiency is already questionable for k = m + 1 = 4, 5.)

Fig. 1. Typical artefact arising when using the Marching Square/Cube algorithm of subsection 2.1 or its tensor-product upgrade in subsection 2.2. On the left-hand side the grid size is 22 × 22; in the middle the grid size is 21 × 21. On the right-hand side the grid size is 107 × 107: visibly, the defect in the singularity point has disappeared, but it can still be found after sufficient zooming in. This situation may arise with any of the marching algorithms in subsections 2.1, 2.2 and 2.3. However, for the pixel-based GPU-computable algorithm in subsection 2.3 it is necessary to zoom down to “subpixel level”, which is impossible, and the singularity is correctly resolved at the minimal possible “pixel level”.

Other essential limitations of the original Marching Square/Cube algorithm include the following.
– The algorithm works only when the solution manifold has maximal rank everywhere and this maximal rank is 1.
– If the equation (1-4) has isolated solutions (e.g., a unique solution), in which case the rank is 0, the algorithm can easily “lose” the solution.
– The algorithm has no mechanism for detection of singularities and can easily misrepresent the topology of the solution manifold (see Figure 1).
– The storage of the m-dimensional simplices obtained by the algorithm as part of the solution hypersurface in R^{m+1} becomes increasingly complicated, especially taking into consideration that these simplices will in general be non-uniform (unlike the Marching Simplex algorithm, where the “marching” can be performed over uniform simplices only).

The tensor-product upgrade considered in subsection 2.2 improves upon the original Marching Square/Cube algorithm in several aspects, as follows.
– The algorithm is applicable in the most general setting (1-4) for an arbitrary number of equations n = 1, 2, . . ..
– It can be used not only when the solution has constant maximal rank everywhere, but also in the case of variable rank and in the presence of singularities, including branching of solutions.
– Conceptually, this algorithm has a mechanism for detection of singularities.

Weaknesses of the algorithm are as follows.
– The system of n equations to be solved on every “marching” step is nonlinear, albeit polylinear. The total degree of the equations in this system is n + m.
– With the increase of n, the computations become increasingly ill-conditioned.
– The number of conditional statements that have to be verified is equal to the number of A-coefficients of the monomials in the polylinear expansion of the product prod_{j=1}^{n+m} (a_j + b_j x_j) = A_1 + sum_{j=1}^{n+m} A_{1j} x_j + · · · + A_{n+m,1} prod_{j=1}^{n+m} x_j. It can be seen that this number is 2^{n+m}, i.e., it increases exponentially with n + m. (The ordering of the conditional statements is as follows, see [11]: first whether A_{n+m,1} = 0 is verified, then whether A_{n+m−1,j} = 0, j = 1, . . . , n + m, and so on. At level n + m − j there are (n+m choose n+m−j) = (n+m choose j) conditional statements to verify.)
– Because of the ill-conditioned computations, the method is very sensitive (non-robust) with respect to round-off errors and other systematic and random errors.
– Due to its non-robustness, it is possible to lose part of the solution, or to wrongly detect the presence or absence of a singularity, or the type of singularity, leading to eventual misinterpretation of the topology of the solution manifold (see Figure 1).
– It is possible to ensure correctness of the results obtained by this algorithm by appropriately chosen small tolerances, which is a common practice in the computation of intersections [6,12]; but, due to the intrinsic non-robustness of the algorithm, tuning the tolerances is quite delicate.

The algorithm in subsection 2.3 provides a limited upgrade of the algorithm in subsection 2.1: the efficient range of the latter is n = 1, m = 1 or m = 2, as mentioned earlier, while the former algorithm of subsection 2.3 handles well all cases in (5). This algorithm is not as generally applicable as the one from subsection 2.2, but within its range of application it is highly efficient and quite robust. Because of its robustness it is relatively straightforward to adjust the tolerances [11] for it. A very interesting, and perhaps rather surprising, fact is that within the given resolution this algorithm resolves the singularities correctly and, as a consequence, does not miss parts of the solution. The reason for this feature is explained by the situation shown in Figure 1. As weaknesses of this algorithm we note the following.
– Some important computations, in particular related to sorting, are still done on the CPU, and act as a bottleneck slowing down the overall high performance of the algorithm [11].
– In cases requiring long computations, the GPU, which is oriented towards real-time applications, tends to cut its computations and produce random results instead. This problem begins to be addressed only in the latest generation of GPUs with increased programmability.

The Marching Simplex algorithm of subsection 2.4 works for all n and m in (1), like the algorithm in subsection 2.2, but without the above-listed deficiencies of the latter.
– It outperforms the Marching Square/Cube algorithm of subsection 2.1 in iso-surfacing already for n = 2, 3, and its performance continues to be good for higher values of n, partly because no lookup tables are needed for it.
– It also considerably outperforms the tensor-product approach in subsection 2.2, because the system of equations to be solved for every tile is linear.
– Due to the convex nature of the simplex and the linearity of the related system of equations, for every n the needed computations are convex and, hence, intrinsically well-conditioned.
– As a consequence of the previous items, the algorithm is robust with respect to round-off and random errors.
– The algorithm allows highly efficient and simple kd-tree refinement (kd-tree in k dimensions: quad-tree in 2D, oct-tree in 3D, etc.).
– The kd-tree refinement is coupled with a comprehensive criterion for singularity detection based on the Morse lemma [14]. Moreover, due to the above-said convexity of the computations, this criterion is also quite robust with respect to round-off and random errors.
– As noted in the concluding section of [5], this algorithm can also be used to compute approximately solution manifolds which have a fractal nature. One such example has been considered in [10].
The only shortcoming of the algorithm we are aware of is that it is CPU-computable and is currently run only as a sequential “marching” algorithm.
4 Future Work
Our plans for future work in the directions of research discussed in this paper are related to further development of the algorithms presented in subsections 2.3 and 2.4, and include the following.
1. Removing the bottleneck in the algorithm of subsection 2.3, due to sorting via the CPU, by using instead the GPU-computable Bitonic Sort presented in [8].
2. Upgrading the algorithm from subsection 2.3 by replacing the RGB colour (mode 2) coding [4] by RGB colour (mode 3) coding [4], which is based on the principle of isometric conversion between dimension and resolution [4]. In this context, we intend also to replace the Cantor-diagonal type of basis-matching algorithm in [4] with the simpler basis-matching algorithm proposed in [3].
3. We intend to combine the algorithms in subsections 2.3 and 2.4 after the upgrade in item 2 (see [3]) by using the GPU-computable algorithm as an initial localization stage, with a subsequent refinement stage via the CPU-computable Marching Simplex algorithm of subsection 2.4.
4. Full-scale parallelization of the Marching Simplex algorithm of subsection 2.4 is possible thanks to the kd-tree refinement it uses.
References
1. Bhaniramka, P., Wenger, R., Crawfis, R.: Isosurfacing in Higher Dimensions. In: Proceedings of the 11th IEEE Visualization Conference (VIS 2000). IEEE Computer Society, Los Alamitos (2000)
2. Bourke, P.: Polygonising a Scalar Field (1997), http://astronomy.swin.edu.au/~pbourke/modelling/polygonise/
3. Dechevsky, L.T., Bang, B., Gundersen, J., Lakså, A., Kristoffersen, A.R.: Solving nonlinear systems of equations on graphics processing units. In: Lirkov, I., Margenov, S., Waśniewski, J. (eds.) LSSC 2009. LNCS, vol. 5910. Springer, Heidelberg (2009)
4. Dechevsky, L.T., Gundersen, J.: Isometric Conversion Between Dimension and Resolution. In: Dæhlen, M., Mørken, K., Schumaker, L. (eds.) Mathematical Methods for Curves and Surfaces, pp. 103–114. Nashboro Press (2005)
5. Dechevsky, L.T., Kristoffersen, A.R., Gundersen, J., Lakså, A., Bang, B., Dokken, T.: A “marching simplex” algorithm for approximate solution of non-linear systems of equations and applications to branching of solutions and intersection problems. Int. J. Pure Appl. Math. 33(3), 407–431 (2006)
6. Dokken, T.: Aspects of Intersection Algorithms and Approximation. Ph.D. Thesis, University of Oslo (1996)
7. Glassner, A.S.: An Introduction to Ray Tracing. Morgan Kaufmann, San Francisco (2000)
8. Govindaraju, N.K., Manocha, D., Raghuvanshi, N., Tuft, D.: GPUSort: High Performance Sorting using Graphics Processors. Department of Computer Science, UNC Chapel Hill (2005), http://gamma.cs.unc.edu/GPUSORT/results.html
9. Gundersen, J., Dechevsky, L.T.: Scientific visualization for the ODE-based simulator BedSim of LKAB. Int. J. Pure Appl. Math. 41(9), 1197–1217 (2007)
10. Kristoffersen, A.R.: M.Sc. Diploma Thesis, Narvik University College (2004)
11. Kristoffersen, A.R., Georgiev, D.T., Dechevsky, L.T., Lakså, A., Gundersen, J., Bang, B.: Vertex versus Pixel Based Intersection: Using the GPU for Investigation of Surface Intersection. Int. J. Pure Appl. Math. 33(3), 407–431 (2006)
12. Sederberg, T.W., Chen, F.: Implicitization using moving curves and surfaces. Computer Graphics Annual Conference Series, pp. 301–308 (1995)
13. Shirley, P.: Realistic Ray Tracing. A K Peters, Natick (2000)
14. Zorich, V.A.: Mathematical Analysis, vol. 1. Nauka, Moscow (1981) (in Russian)
Transitions from Static to Dynamic State in Three Stacked Josephson Junctions
Ivan Christov, Stefka Dimova, and Todor Boyadjiev
Faculty of Mathematics and Informatics, University of Sofia, 5 James Bourchier Boulevard, 1164 Sofia, Bulgaria
{ivanh,dimova,todorlb}@fmi.uni-sofia.bg
Abstract. Using the Sakai-Bodin-Pedersen model, a system of three perturbed sine-Gordon equations is numerically studied. Effective numerical algorithms are proposed and realized to investigate the transitions from static to dynamic state in a stack of three Josephson junctions. Critical currents for individual junctions are found for different values of the damping parameter at low magnetic field. We show that the switching from static to dynamic state of the interior junction can trigger the switching of the exterior ones, and this process leads to current locking. We find that the critical current of the individual junction depends on the damping parameter and on the static or dynamic states of the other junctions.
1 Introduction
Multistacked long Josephson Junctions (JJs) form an interesting physical system, where both nonlinearity and interactions between subsystems play an important role. Such systems allow one to study physical effects that do not occur in single JJs. One of the most interesting experimental results for two stacked JJs found in recent years is the so-called current locking (CL). The essence of this phenomenon is as follows: there exists a range of the external magnetic field where the different junctions switch to dynamic state simultaneously when the external current exceeds some critical value. It was shown by means of numerical simulation [3] that the experimentally found CL for two stacked JJs can be obtained and well explained in the framework of the inductive coupling model [5]. The simplest generalizable model of stacked JJs is the stack of three JJs, because it takes into account the different behavior of the interior and exterior junctions. The first and the third junctions are coupled only to one neighboring junction, while the second junction is coupled to its two neighbors below and above. In the next section the mathematical model is described. In the third section effective numerical methods and algorithms are proposed to investigate the transitions of the junctions from static to dynamic state. The numerical results are discussed in the last section. Critical currents for individual junctions are found for different values of the damping parameter at low magnetic field. It is shown that for three stacked symmetric junctions the critical current of the
interior junction strongly depends on the dynamic state of the exterior ones and vice versa. At the same time, there is a range in the magnetic field in which the junctions switch simultaneously to dynamic state, i.e., current locking occurs.
2 Mathematical Model

The dynamics of the Josephson phases ϕ(x, t) = (ϕ_1(x, t), ϕ_2(x, t), ϕ_3(x, t))^T in symmetric three stacked inductively coupled long JJs is described by the following system of perturbed sine-Gordon equations [5]:

ϕ_tt + α ϕ_t + J + Γ = L^{-1} ϕ_xx,   −l < x < l,  0 < t ≤ T,   (1)
where α is the dissipation coefficient (damping parameter), Γ = γ(1, 1, 1)^T is the vector of the external current density, J = (sin ϕ_1, sin ϕ_2, sin ϕ_3)^T is the vector of the Josephson current density, L = tridiag(1, S, 1) with −0.5 < S ≤ 0, and 2l is the length of the stack. L^{-1} is the following symmetric matrix:

L^{-1} = 1/(1 − 2S^2) ×
[ 1 − S^2    −S       S^2     ]
[  −S         1       −S      ]
[  S^2       −S      1 − S^2  ].
In this work we consider stacks of overlap geometry placed in an external magnetic field h_e, therefore the system (1) should be solved together with the boundary conditions

ϕ_x(−l) = ϕ_x(l) = H,   (2)

where H is the vector H = h_e (1, 1, 1)^T. To close the differential problem, appropriate initial conditions must be posed:

ϕ(x, 0) given,  ϕ_t(x, 0) given.   (3)
The existence of Josephson current generates a specific magnetic flux. When the external current is less than some critical value, all the junctions are in some static state, i.e., we have a time-independent solution of (1), (2). In this case the measured voltages in all junctions are zero. Exceeding this critical value leads to switching of the system to dynamic state, and we have nonzero voltage in at least one of the junctions. The voltage in the i-th junction is mathematically given by

V_i = lim_{T→∞} (1/(2lT)) ∫_0^T ∫_{−l}^{l} ϕ_{i,t}(t, x) dx dt.   (4)
We define the critical current of an individual junction as the value at which this junction switches to nonzero voltage. Also, we want to make a correspondence between the critical current values of switching to dynamic state and the bifurcation values of some static solutions.
In the three stacked case we consider static solutions which are combinations of solutions existing in the one-junction case. The important solutions in the case of a single junction are:
– Meissner solutions, denoted by M, for parameters h_e = 0, γ = 0: ϕ(x) = kπ, k = 0, ±1, ±2, . . .;
– fluxon (antifluxon) solutions, for which there are exact analytical expressions in the case of infinite junctions (l = ∞) and h_e = 0, γ = 0. The single fluxon/antifluxon solution has the well-known form ϕ(x) = 4 arctan(exp(±x)) + 2kπ.
Further, for n-fluxon distributions we use the simple notation F^n. In this work we consider the triplets (M, M, M), (M, F^1, M), (M, F^2, M), because, as the numerical experiments show, they play a significant role at low magnetic field h_e. To study the global stability of a possible static solution, the following Sturm-Liouville problem (SLP) is generated:

−L^{-1} u_xx + Q(x) u = λ u,   u_x(±l) = 0,   (5a)
∫_{−l}^{l} ⟨u, u⟩ dx − 1 = 0,   (5b)
where Q(x) = Jz(ϕ(x)). Studying the global stability is equivalent to studying the positive definiteness of the second variation of the potential energy of the system. The minimal eigenvalue λ_min determines the stability of the solution under consideration. A minimal eigenvalue equal to zero means a bifurcation caused by the change of some parameter, in our case the external current γ. We use the calculated bifurcation values for the static solutions, found when solving the dynamic problem, for comparison.
3 Numerical Methods and Algorithms
To solve the dynamic problem (1), (2), (3), we use the finite difference method. The main equation (1) is approximated by the “cross-shaped” scheme. To approximate the boundary conditions (2), three-point one-sided finite differences are used. Let h and τ be the steps in space and time, respectively, and δ = (τ/h)^2. Then the difference scheme is (y, y̌ and ŷ denote the values at the current, previous and new time levels, and y^m_{x̄x,k} is the second difference in x):

ŷ_k^l = (1 + 0.5ατ)^{-1} [ 2y_k^l + (0.5ατ − 1) y̌_k^l − τ^2 (sin y_k^l + γ) + Σ_{m=1}^{3} δ a_{lm} y^m_{x̄x,k} ],
k = 1, . . . , n − 1,  l = 1, 2, 3,  L^{-1} = (a_{lm})_{l,m=1}^{3},

ŷ_0^l = (4ŷ_1^l − ŷ_2^l − 2h h_e)/3,   ŷ_n^l = (4ŷ_{n−1}^l − ŷ_{n−2}^l + 2h h_e)/3.

The approximation error of this scheme is O(τ^2 + h^2).
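For illustration, a direct transcription of one time step of this scheme could look as follows; this is our own sketch, not the authors' code, and y_prev, y_cur hold the phases ϕ at the previous and current time levels on the space grid.

```python
import numpy as np

def step(y_prev, y_cur, L_inv, alpha, gamma, h_e, tau, h):
    # y_prev, y_cur: arrays of shape (3, n+1) (three junctions, n+1 grid nodes)
    delta = (tau / h) ** 2
    y_new = np.empty_like(y_cur)
    # second differences in x at the interior nodes, for all three junctions
    d2 = y_cur[:, 2:] - 2.0 * y_cur[:, 1:-1] + y_cur[:, :-2]
    coupling = delta * (L_inv @ d2)            # sum_m a_lm * second difference
    y_new[:, 1:-1] = (2.0 * y_cur[:, 1:-1]
                      + (0.5 * alpha * tau - 1.0) * y_prev[:, 1:-1]
                      - tau ** 2 * (np.sin(y_cur[:, 1:-1]) + gamma)
                      + coupling) / (1.0 + 0.5 * alpha * tau)
    # three-point one-sided approximation of the boundary conditions (2)
    y_new[:, 0] = (4.0 * y_new[:, 1] - y_new[:, 2] - 2.0 * h * h_e) / 3.0
    y_new[:, -1] = (4.0 * y_new[:, -2] - y_new[:, -3] + 2.0 * h * h_e) / 3.0
    return y_new
```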
To check the numerical stability and the real order of accuracy, we have made computations for fixed time levels and embedded meshes in space. The results show second order of convergence in space and time. The numerical procedure for finding the critical currents of the individual junctions for fixed parameters S, l, α works as follows. We start with h_e = 0, γ = 0, ϕ(x, 0) = 0, ϕ_t(x, 0) = 0. For a given magnetic field h_e, increasing the current γ from zero by a small amount δγ, we calculate the current-voltage dependence (denoted further as I-V characteristics) by using the difference scheme. To find the approximate value of the voltage V_i given by (4), an averaging procedure is proposed. The numerous experiments made show that it is reasonable to neglect the transient stage from one value of γ to the next, taking place in a time interval (0, t_0). The value of t_0 is determined by numerical experiments. Thus, we calculate the average voltages V̄_{i,k}, i = 1, 2, 3, for t ≥ t_0 over embedded intervals in t, T_k = T_0 + kΔ:

V̄_{i,k} = (1/(2lT_k)) ∫_{t_0}^{t_0+T_k} ∫_{−l}^{l} ϕ_{i,t}(t, x) dx dt = (1/(2lT_k)) ∫_{−l}^{l} [ϕ_i(t_0 + T_k, x) − ϕ_i(t_0, x)] dx.
The averaging is finished when the differences |V̄_{i,k+1} − V̄_{i,k}| become less than a given value ε. Thus, the next point of the I-V characteristic is found. In our computations, for the next value of the current γ, the phase distributions of the last two time levels achieved are used as initial time levels. We increase the current until all the junctions switch to nonzero voltage. Then the external field h_e is increased by a small amount and the I-V characteristic is calculated for this new value of h_e. As initial data, the phase distributions for γ = 0 and the previous value of h_e are used. Because of the symmetry of the stack we investigate only the case h_e ≥ 0. To find the critical currents for h_e ∈ [0, 1.5] more than 500 000 000 time steps are needed. In order to find the bifurcation curves of the static distributions of the magnetic flux we solve numerically the static problem corresponding to the dynamic one (1), (2). To solve the boundary-value problem for the nonlinear system of second-order ordinary differential equations we use an iterative algorithm based on the continuous analog of Newton's method (CANM) [4]. CANM gives a linear boundary value problem at each iteration step, which is solved numerically by means of the Galerkin finite element method (FEM) with quadratic finite elements. FEM is used also to reduce the matrix Sturm-Liouville problem (5) to an algebraic eigenvalue problem, whose few smallest eigenvalues and the corresponding eigenfunctions are found by the subspace iteration method [1]. To test the accuracy of the above methods, we have used the method of Runge by computing the solutions on three embedded meshes. The numerous experiments made show a super-convergence of order four. For a more detailed explanation of these methods see [2].
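A small sketch of the averaging step (ours): thanks to the reduction of the double integral to a single integral in x, only the phase distributions at t_0 and t_0 + T_k are needed.

```python
import numpy as np

def average_voltage(phi_t0, phi_t0_plus_Tk, h, l, T_k):
    # phi_*: arrays of shape (3, n+1) with the phases at times t0 and t0 + T_k
    integral = np.trapz(phi_t0_plus_Tk - phi_t0, dx=h, axis=1)
    return integral / (2.0 * l * T_k)          # averaged voltages, i = 1, 2, 3

# The averaging loop would increase T_k and stop once
# abs(V_next - V_prev) < eps holds component-wise.
```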
Fig. 1. Bifurcation diagram in the plane (h_e, γ), α = 0.1 (2l = 10, S = −0.3). [Critical current vs. external magnetic field; bifurcation curves of the (M, M, M), (M, F^1, M) and (M, F^2, M) solutions; R marks JJ_in in the resistive state, FF in the flux-flow state; markers distinguish JJ_ex and JJ_in.]
4 Numerical Results
Let us mention that the stack is symmetric, but it allows asymmetric solutions, for example the asymmetric static solution (F, M, M). Nevertheless, all of the solutions obtained when solving the dynamic problem were symmetric: the solutions in the exterior junctions were of the same type. The results for the critical currents of the individual junctions with stack parameters S = −0.3, α = 0.1, 2l = 10 are shown graphically in Fig. 1. We consider the interval [0, 1.5] in h_e. In this interval the static solutions which play a role are (M, M, M), (M, F^1, M), (M, F^2, M), i.e., the fluxons enter the interior junction (JJ_in) first. The following conclusions can be made. There is a domain in the vicinity of zero field where the junctions switch to nonzero voltage simultaneously, i.e., we have CL. In addition, there are two smaller domains at which we have CL, divided by domains at which JJ_in switches first to the resistive (R) state. At higher fields JJ_in switches to the flux-flow (FF) state (a moving chain of fluxons). In this domain we have no CL at all. The moving fluxons create a time-dependent perturbation of the Josephson phase in the exterior junctions (JJ_ex), which results in switching to nonzero voltage at a lower current than in the case when JJ_in is in the R-state. The I-V characteristics for h_e = 0.67 and h_e = 1.37 are shown in Fig. 2 and Fig. 3. Let us mention that for every h_e the lower critical current point lies on some bifurcation curve of the solutions (M, M, M), (M, F^1, M), (M, F^2, M). As opposed to the case α = 0.1, for α = 0.01 we obtain simultaneous switching to nonzero voltage in the whole interval [0, 1.5]. In this case the critical current points lie only on the bifurcation curves of the (M, M, M) and (M, F^2, M) solutions. The results are shown graphically in Fig. 4.

Fig. 2. I-V characteristic, h_e = 0.67 (2l = 10, S = −0.3, α = 0.1). Fig. 3. I-V characteristic, h_e = 1.37 (2l = 10, S = −0.3, α = 0.1). Fig. 4. Bifurcation diagram in the plane (h_e, γ), α = 0.01 (2l = 10, S = −0.3); the junctions switch simultaneously to dynamic state.

Acknowledgment. This work is supported by the Sofia University Scientific Foundation under Grant No. 39/2009.
References
1. Bathe, K.J., Wilson, E.: Numerical Methods in Finite Element Analysis. Prentice Hall, Englewood Cliffs (1976)
2. Christov, I., Dimova, S., Boyadjiev, T.: Stability and Bifurcation of the Magnetic Flux Bound States in Stacked Josephson Junctions. In: Margenov, S., Vulkov, L.G., Waśniewski, J. (eds.) NAA 2008. LNCS, vol. 5434, pp. 224–232. Springer, Heidelberg (2009)
3. Goldobin, E., Ustinov, A.V.: Current locking in magnetically coupled long Josephson junctions. Phys. Rev. B 59, 11532–11538 (1999)
4. Puzynin, I.V., et al.: Methods of computational physics for investigation of models of complex physical systems. Particles & Nuclei 38 (2007)
5. Sakai, S., Bodin, P., Pedersen, N.F.: Fluxons in thin-film superconductor-insulator superlattices. J. Appl. Phys. 73, 2411–2418 (1993)
Factorization-Based Graph Repartitionings
Kateřina Jurková 1 and Miroslav Tůma 2
1 Technical University of Liberec, Studentská 2, 461 17 Liberec 1, Czech Republic
[email protected]
2 Institute of Computer Science, Czech Academy of Sciences, Pod Vodárenskou věží 2, 18207 Praha 8, Czech Republic
[email protected]
Abstract. The paper deals with the parallel computation of matrix factorization using graph partitioning-based domain decomposition. It is well-known that the partitioned graph may have both a small separator and well-balanced domains but sparse matrix decompositions on domains can be completely unbalanced. In this paper we propose to enhance the iterative strategy for balancing the decompositions from [13] by graph-theoretical tools. We propose the whole framework for the graph repartitioning. In particular, new global and local reordering strategies for domains are discussed in more detail. We present both theoretical results for structured grids and experimental results for unstructured large-scale problems.
1 Introduction
The problem of proper graph partitioning is one of the classical problems of parallel computing. The actual process of obtaining high-quality partitionings of undirected graphs, which arises in many practical situations, is reasonably well understood, and the resulting algorithms are sophisticated enough [5,1]. Such situations are faced, e.g., if standard criteria for partitionings, expressed by balancing sizes of domains and minimizing separator sizes, are considered. However, the situation may be different if one needs to balance the time to perform some specific operations. An example can be the time to compute sparse matrix decompositions, their incomplete counterparts, or the time for some auxiliary numerical transformations. It can happen that a partitioning which is well balanced with respect to the above-mentioned standard criteria may be completely unbalanced with respect to some time-critical operations on the domains. The general framework of multi-constraint graph partitioning may not solve the problem. The graph partitioning problem is closely coupled with the general problem of load balancing. In particular, the partitioning represents a static load balancing. In practice, the load distribution in a computation may be completely different from the original distribution at the beginning of the computation. Further, dynamic load balancing strategies can then redistribute the work dynamically. A lot of interest was devoted to the analysis of the basic possible sources of such problems [4].
Principles for curing such problems can be found, e.g., in [7,14]. In some situations, in order to cover complicated and unpredictably time-consuming operations on the individual domains, one can talk about minimization with respect to complex objectives [13], see also [12]. The strategy proposed in [13] consists in improving the partitioning iteratively during the course of the computation. In some cases much more is known about such critical operations. This paper aims at exploiting this knowledge. The additional information may then be included into the graph partitioner, or used to improve the graph partitioning in one simple step, providing at the same time some guarantees on its quality. Both these strategies have their own pros and cons. While integration of the additional knowledge into the graph partitioner seems to be the most efficient approach, it may not be very flexible. In addition, an analysis of such an approach may not be simple when the typical multilevel character of partitioning algorithms is taken into account. A careful redistribution of the work in one subsequent step which follows the partitioning seems to provide the useful flexibility. Since the time-critical operation performed on the domains is the sparse matrix factorization, the key to our strategy is to exploit graph-theoretic tools and indicators for the repartitioning. Let us concentrate on the complete factorization of a symmetric positive definite (SPD) matrix which is partitioned into two domains. In this case, the underlying graph model of the factorization is the elimination tree. Our first goal is to show the whole framework of theoretical and practical tools which may allow post-processing of a given graph partitioning in one simple step. The repartitioned graph should then be better balanced with respect to the factorization. Further, we will discuss one such tool in more detail. Namely, we will mention that we can directly determine the counts of columns which should be modified in the new factorization once we change the border nodes, which are vertices of the separated domains incident to the separator. We confirm both theoretically and experimentally that by carefully chosen reorderings we can decrease the number of these modifications. Section 2 summarizes some terminology and describes the problem which we would like to solve. Section 3 explains the basic ideas of our new framework. Then we discuss the problem of minimizing modifications in factorized matrices on domains, both theoretically and experimentally.
2 Basic Terminology and Our Restrictions
Let us first introduce some definitions and concepts related to the complete sparse matrix factorizations and reorderings. For simplicity we assume that adjacency graphs of all considered matrices are connected. Also, we will discuss only the standard pointwise graph model. But note that practical strategies for graph repartitioning should be based on blocks or other coarse representations described, e.g., by factorgraphs or hypergraphs. The decomposition of an SPD matrix A is controlled by the elimination tree. This tree and its subtrees provide most of the structural information relevant to the sparse factorization. Just by traversing the elimination tree, sizes of matrix
factors, their sparsity structure, supernodal structure or other useful quantities [3,10] can be quickly determined. The elimination tree T is the rooted tree with the same vertex set as the adjacency graph G of A and with the vertex n as its root. It may be represented by one vector, typically called PARENT[.], defined as follows:

PARENT[j] = min{ i > j | l_ij ≠ 0 } for j < n,   PARENT[j] = 0 for j = n,   (1)

where l_ij are the entries of L. The n-th column is the only column which does not have any offdiagonal entries. When applying Cholesky factorization to a sparse matrix, it often happens that some matrix entries which were originally zeros become nonzeros. These new nonzero entries are called fill-in. High-quality sparse Cholesky factorization strongly minimizes the fill-in. Tools for this minimization are called fill-in minimizing reorderings. Basically, there are two categories of these reordering approaches. Global reorderings, such as nested dissection (ND), consider the graph as one entity and divide it into parts by some predefined, possibly recursive heuristics. Local reorderings are based on subsequent minimization of quantities which represent local estimates of the fill-in. Important cases of such reorderings are the MMD and AMD variations of the basic minimum degree (MD) algorithm. Many quantities related to the sparse factorization of SPD matrices can be efficiently computed only if the matrix is preordered by an additional specific reordering apart from a chosen fill-in minimizing reordering. One such additional reordering useful in practical implementations is the postordering. It is induced by a postordering of the elimination tree of the matrix, being a special case of a topological ordering of the tree. For a given rooted tree, its topological ordering labels the children of any vertex before their parent. Note that the root of a tree is always labeled last. Further note that any reordering of a sparse matrix that labels a vertex earlier than its parent vertex in the elimination tree is equivalent to the original ordering in terms of fill-in and operation count. In particular, postorderings are equivalent reorderings in this sense.
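For illustration, the PARENT vector of (1) can be computed directly from the pattern of A, without forming L, by the classical elimination-tree algorithm (see [10]); the sketch below is our own transcription of that well-known procedure and assumes a symmetric sparse matrix stored in CSC format with both triangles present.

```python
import numpy as np
from scipy.sparse import csc_matrix

def elimination_tree(A):
    # Returns parent[j] as in (1), with -1 marking the root; only the nonzero
    # pattern of A is used and the factor L is never formed.
    n = A.shape[0]
    parent = -np.ones(n, dtype=int)
    ancestor = -np.ones(n, dtype=int)        # path-compressed virtual trees
    for j in range(n):
        for i in A.indices[A.indptr[j]:A.indptr[j + 1]]:
            if i >= j:
                continue                      # use only entries a_ij with i < j
            r = i
            while ancestor[r] != -1 and ancestor[r] != j:
                t = ancestor[r]
                ancestor[r] = j               # path compression
                r = t
            if ancestor[r] == -1:             # attach r's subtree below j
                ancestor[r] = j
                parent[r] = j
    return parent

# Example: a small arrowhead pattern
# A = csc_matrix(np.array([[4., 1, 0, 1], [1, 4, 1, 0],
#                          [0, 1, 4, 1], [1, 0, 1, 4]]))
# print(elimination_tree(A))   # -> [1, 2, 3, -1]
```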
3 Framework for the Graph-Based Repartitioning
In this section we will split the problem of repartitioning into several simpler tasks. Based on this splitting we will propose individual steps of our new approach. As described above, the problem arises if we encounter a lack of balance between sizes of the Cholesky factors on the domains. Using the elimination tree mentioned above, we are able to detect this imbalance without doing any actual factorization. This detection is very fast having its time complexity close to linear [10]. Then, the result of the repartitioning step is the new distribution of the graph vertices into domains which also implicitly defines the graph separator. The repartitioning step can be naturally split into two simpler subproblems. First, one needs to decide which vertices should be removed from one domain. Second, it should be determined where these removed vertices should be placed into the reordering sequence of the other domain. Alternatively, the domains may
be reordered and their factorizations recomputed from scratch. In the following two subsections we will consider these two mentioned subproblems separately. For both of them we present new considerations. The third subsection will present one simpler task in more detail, as well as both theoretical and experimental results.

3.1 Removal of Vertices
Assume that the matrices on the domains were reordered by a fill-in minimizing reordering. Further assume that some vertices should be removed from one domain to decrease the potential fill-in in the factorization. An important task is to determine which vertices should be removed from the domain such that their count would be as small as possible, in addition to the further constraints mentioned below. In other words, the removal of the chosen vertices should decrease the fill-in as fast as possible. The following Algorithm 1 offers a tool to solve this problem. It counts the number of row subtrees of the elimination tree in which each vertex is involved. Note that row subtrees represent the sparsity structures of the rows of the Cholesky factor and they can be found by traversing the elimination tree. The algorithm is new, but it was obtained by modifying the procedure which determines the leaves of the row subtrees in the elimination tree in [11].

Algorithm 1. Count the number of row subtrees in which the vertices are contained.
for column j = 1, n do
  COUNT(j) := n − j + 1
  PREV_ROWNZ(j) := 0
end for
for column j = 1, n do
  for each a_ij ≠ 0, i > j do
    k := PREV_ROWNZ(i)
    if k < j − |T[j]| + 1 then
      for ξ = c_{t−1}, . . . , c_t − 1 do
        COUNT(ξ) := COUNT(ξ) − 1
      end for
    end if
    PREV_ROWNZ(i) := j
  end for
end for

Here T denotes the elimination tree of the matrix A, and T[i] denotes the subtree of T rooted in the vertex i. T[i] also represents the vertex subset associated with the subtree, that is, the vertex i and all its proper descendants in the elimination tree. |T[i]| denotes the number of vertices in the subtree T[i]. Consequently, the number of proper descendants of the vertex i is given by |T[i]| − 1. PREV_ROWNZ is an auxiliary vector for tracking nonzeros in previously traversed rows. The
computed quantity is denoted by COUNT. A critical assumption here is that the elimination tree is postordered. Having computed the counts, our heuristic rule for a fast decrease of the fill-in is to remove the vertices with the largest COUNT. Let us note that the removal of vertices may also change the shape of the elimination tree, and our rule does not take this fact into account. To consider this, the recent theory of sparse exact updates which uses multiindices should be taken into account; see the papers by Davis and Hager quoted in [3]. Further note that the removal should also take into account the distance of the removed vertices from the border vertices. Therefore, we propose to use the counts from Algorithm 1 as a secondary cost for the Fiduccia-Mattheyses improvement of the Kernighan-Lin algorithm [6]. This is an iterative procedure which, in each iteration, looks for a subset of vertices from the two graph domains such that their swapping leads to a partition with a smaller size of the edge separator. Our modification of the cost function then seems to enable more efficient repartitionings.

3.2 Insertion of Vertices
Having a group of vertices to be inserted into the new domain D2, we need to determine where these vertices should appear in the new reordering sequence. Here the objective is to minimize the effect on the fill-in in the corresponding Cholesky factor. Note that in the next subsection we will mention another motivation: to minimize the number of columns to be recomputed in the Cholesky factor, if it was already computed. Briefly, theoretical considerations related to delayed elimination in [8] motivate inserting a vertex at the position of the parent of the least common ancestor of its neighbors in the elimination tree T which we have. Consider a vertex to be inserted, and denote by N the set of its neighbors in D2. Let α be the least common ancestor of N in T. Denote by T_r[α] the unique subtree of T determined by α and N. Then the increase of the fill-in in the new decomposition includes one edge for each vertex of T_r[α] and at most β multiples of the union of the adjacency sets of the vertices from T_r[α], where β is the distance from α to the root of T plus one. In order to minimize the effect of the insertion on the fill-in, we need to minimize this amount. As in the previous subsection, this criterion may represent an additional cost function for a local heuristic like Kernighan-Lin, and we are about to perform an experimental study of its application.

3.3 Repartitioning for Generalized Decompositions
Consider the problem of repartitioning when we construct a factorization for which it may be difficult to obtain a tight prediction of the fill-in. An example can be an incomplete Cholesky decomposition. A similar situation is faced when solving a sequence of systems of linear equations obtained, e.g., from Newton's method applied to solve a nonlinear boundary value problem. In this case, we may need a partial reuse of the partitionings over the sequence. Then
Table 1. Counts of columns which should be recomputed in the Cholesky decomposition if boundary vertices are modified

matrix     | application             | dimension | nonzeros  | standard MD | new approach
bmw7st_1   | structural mechanics    | 141,347   | 3,740,507 | 7,868       | 5,039
bodyy6     | structural mechanics    | 19,366    | 77,057    | 2,354       | 476
cfd1       | CFD pressure matrix     | 70,656    | 949,510   | 10,924      | 7,497
cfd2       | CFD pressure matrix     | 123,440   | 1,605,669 | 15,021      | 10,416
hood       | car hood                | 220,542   | 5,494,489 | 7,099       | 2,192
kohn-sham4 | quantum chemistry       | 90,300    | 270,598   | 3,564       | 2,233
m_t1       | tubular joint           | 97,578    | 4,925,574 | 9,093       | 7,095
pwtk       | pressurized wind tunnel | 217,918   | 5,926,171 | 8,218       | 4,437
x104       | beam joint              | 108,384   | 5,138,004 | 4,656       | 3,842
we face the following two problems: repartitioning as well as the recomputation of the decomposition. In this subsection we propose techniques that minimize in advance the effort needed for recomputing the partition by a careful choice of reorderings. The efficiency of the new strategies is measured by the counts of columns or block columns which should be recomputed in order to get the new decomposition. These counts measure the additional work for the decomposition once the partition was changed. For simplicity, here we restrict ourselves to changes in the domain from which the vertices are removed. The first approach which we propose generalizes the concept of local reorderings with constraints. This concept was introduced in [9] to combine local and global approaches, and recently investigated in [2]. Our procedure exploits the minimum degree reordering and uses the distance of vertices from the main separator as the second criterion which breaks the MD ties. Table 1 summarizes numerical experiments with the new reordering. All matrices except for the discrete Kohn-Sham equation are from the Tim Davis collection. The counts of factor columns to be recomputed (standard and new strategy, respectively) if a group of border nodes of the size fixed to two hundred is removed are in the last two columns. The counts were computed via the elimination tree. For the ND reorderings we present a formalized theoretical result for the structured grids. We will show that the choice of the first separator in the case of a k × k regular grid problem strongly influences the number of columns to be recomputed in case the border is modified by the removal or insertion. The situation is depicted in Figure 1 for k = 7. The figures represent the separator and the subdomain sets after four steps of ND. The border vertices are on the right and they are filled. The following theorem uses the separator tree in which the vertices describe the subdomain sets and separators, and which is a coarsening of the standard elimination tree. Theorem 1. Consider the matrix A from a k × k regular grid problem with ND ordering having l levels of separators. Assume that the matrix entries corresponding to the border vertices are modified. Denote by al and bl , respectively,
maximum number of matrix block columns which may change in the Cholesky decomposition of A for the Type-I-grid and the Type-II-grid, respectively. Then lim_{l→∞} a_l/b_l = 3/2 for odd l and lim_{l→∞} a_l/b_l = 4/3 for even l.

Fig. 1. Grids with the ND separator structure related to Theorem 1: Type-I-grid on the left and Type-II-grid on the right.

Proof. Clearly, a_1 = 3, since the changes influence both domains and, consequently, all block columns which correspond to the entries of the separator tree have to be recomputed. Similarly we get b_1 = 2, since the block factor column which corresponds to the subdomain without the border vertices does not need to be recomputed. Consider the Type-I-grid with k > 1 separators. It is clear that the separator structure of this grid is obtained by doubling the Type-II-grid and separating the two copies by a central separator. Consequently, a_{k+1} = 2 b_k + 1, where the additional 1 corresponds to the central separator. Similarly we get the relation b_{k+1} = a_k + 1, since its separator structure is the same as if we added to the considered Type-II-grid with k > 1 separators another Type-II-grid and separated them by the central separator; the block columns of the added Type-II-grid do not need to be recomputed. Putting the derived formulas together we get a_{k+2} = 2 a_k + 3 and b_{k+2} = 2 b_k + 2. This gives a_l = 3(2^{l+1} − 1) and b_l = 2(2^{l+1} − 1) for k = 2l + 1, and a_l = 4 · 2^l − 3 and b_l = 3 · 2^l − 2 for k = 2l, and we are done.

Clearly, the choice of the first separator of ND plays a decisive role. Further, there exist accompanying results for the generalized ND and for one-way dissection. The counts of modified vertices were obtained from the separator tree [10].
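The recurrences used in the proof are easy to check numerically; the following snippet (our own check, not part of the paper's algorithm) iterates a_{k+2} = 2a_k + 3, b_{k+2} = 2b_k + 2 from a_1 = 3, b_1 = 2, a_2 = 5, b_2 = 4 and shows the ratios approaching 3/2 for odd k and 4/3 for even k.

```python
def ab(k):
    # Counts of block columns to be recomputed for Type-I and Type-II grids
    # with k levels of separators, following the recurrences in the proof.
    a, b = (3, 2) if k % 2 == 1 else (5, 4)
    steps = (k - 1) // 2 if k % 2 == 1 else (k - 2) // 2
    for _ in range(steps):
        a, b = 2 * a + 3, 2 * b + 2
    return a, b

for k in (9, 10, 19, 20):
    a, b = ab(k)
    print(k, a, b, a / b)   # ratio tends to 3/2 (odd k) and 4/3 (even k)
```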
4 Conclusion
We considered new ways to find proper and fast graph repartitioning if our task is to decompose matrices on the domains. In this case it is possible to propose new and efficient methods refining the general-purpose concept of complex objectives. The approach goes beyond a straightforward use of symbolic factorization. After describing a comprehensive framework of the whole approach we
presented theoretical and experimental results for one particular problem. The explained techniques can be generalized to more domains and for general LU decomposition.
Acknowledgement. This work was supported by the project No. IAA100300802 of the Grant Agency of the Academy of Sciences of the Czech Republic.
References
1. Catalyürek, U.V., Aykanat, C.: Hypergraph-partitioning based decomposition for parallel sparse-matrix vector multiplication. IEEE Transactions on Parallel and Distributed Systems 20, 673–693 (1999)
2. Chen, Y., Davis, T.A., Hager, W.W., Rajamanickam, S.: Algorithm 887: CHOLMOD, Supernodal sparse Cholesky factorization and update/downdate. ACM Trans. Math. Softw. 35, 22:1–22:14 (2008)
3. Davis, T.A.: Direct Methods for Sparse Linear Systems. SIAM, Philadelphia (2006)
4. Hendrickson, B.: Graph partitioning and parallel solvers: Has the emperor no clothes? In: Ferreira, A., Rolim, J.D.P., Teng, S.-H. (eds.) IRREGULAR 1998. LNCS, vol. 1457, pp. 218–225. Springer, Heidelberg (1998)
5. Karypis, G., Kumar, V.: A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM J. Sci. Comput. 20, 359–392 (1999)
6. Kernighan, B.W., Lin, S.: An efficient heuristic procedure for partitioning graphs. The Bell System Technical Journal 49, 291–307 (1970)
7. Kumar, V., Grama, A., Gupta, A., Karypis, G.: Introduction to Parallel Computing. Benjamin-Cummings (1994)
8. Liu, J.W.H.: A tree model for sparse symmetric indefinite matrix factorization. SIAM J. Matrix Anal. Appl. 9, 26–39 (1988)
9. Liu, J.W.H.: The minimum degree ordering with constraints. SIAM J. Sci. Comput. 10, 1136–1145 (1989)
10. Liu, J.W.H.: The role of elimination trees in sparse factorization. SIAM J. Matrix Anal. Appl. 11, 134–172 (1990)
11. Liu, J.W.H., Ng, E.G., Peyton, B.W.: On finding supernodes for sparse matrix computations. SIAM J. Matrix Anal. Appl. 14, 242–252 (1993)
12. Pinar, A., Hendrickson, B.: Combinatorial Parallel and Scientific Computing. In: Heroux, M., Raghavan, P., Simon, H. (eds.) Parallel Processing for Scientific Computing, pp. 127–141. SIAM, Philadelphia (2006)
13. Pinar, A., Hendrickson, B.: Partitioning for complex objectives. In: Parallel and Distributed Processing Symposium, vol. 3, pp. 1232–1237 (2001)
14. Schloegel, K., Karypis, G., Kumar, V.: A unified algorithm for load-balancing adaptive scientific simulations. In: Proceedings of the ACM/IEEE Symposium on Supercomputing, vol. 59. ACM, New York (2000)
An Efficient Algorithm for Bilinear Strict Equivalent (BSE)-Matrix Pencils
Grigoris I. Kalogeropoulos 1, Athanasios D. Karageorgos 1, and Athanasios A. Pantelous 2
1 Department of Mathematics, University of Athens, Greece
{gkaloger,athkar}@math.uoa.gr
2 Department of Mathematical Sciences, University of Liverpool, U.K.
[email protected]
Abstract. In this short paper we have two main objectives: first, to present the basic elements of strict bilinear equivalence; secondly, to describe an efficient algorithm for investigating the conditions for two homogeneous matrix pencils sF1 − ŝG1 and sF2 − ŝG2 to be bilinear strict equivalent. The proposed problem is very interesting since the applications are numerous. The algorithm is implemented in a numerically stable manner, giving efficient results.
Keywords: Bilinear-Strict Equivalence, Matrix Pencil Theory, Algorithmic Technique.
1 Introduction
Matrix pencils are naturally associated with differential (difference) systems of the type S(F, G): F ẋ(t) = G x(t) (F x_{k+1} = G x_k), where F, G ∈ F^{m×n}, x ∈ F^n (F is a field, R or C), which in turn describe a variety of problems of descriptor system theory and constitute a more general class than linear state systems. The pencils sF − G and F − ŝG, where s, ŝ ∈ F, are related by the homogeneous pencil sF − ŝG (or equivalently by an ordered pair (F, −G)). In our paper, an efficient algorithm for the investigation of the conditions for two homogeneous matrix pencils sF1 − ŝG1 and sF2 − ŝG2 to be bilinear strict equivalent is presented. The study of the Bilinear-Strict Equivalence (BSE) was initiated by Turnbull and Aitken, see [6]. In their work, the covariance property of invariant polynomials and the invariance of the minimal indices have been established. In [2], the complete set of invariants for the BSE class of a matrix pencil (F, G) is defined, extending further the results proposed by [6]. Analytically, the complete set of invariants of the homogeneous matrix pencil sF − ŝG under the BSE is provided by studying the invariants of the homogeneous binary polynomials f(s, ŝ) under the Projective-Equivalence transformations. Moreover, it is derived that the study of such type of invariants can be provided by the equivalent invariants of matrices under the Extended-Hermite type transformations; see for more details [2]. Reference [2] provides us with the set of column and row minimal indices (i.e., c.m.i. and r.m.i.), the set of all indices for the pencil (F, −G), and the set of Plücker and Grassmann-type vectors. The proposed results construct a solid framework for the study of properties of linear state space systems under space and frequency coordinate transformations. For instance, recently, in [3], through the notion of bilinear equivalence an interesting stabilization criterion for the homogeneous matrix pencil is derived. Now, let (F, G) ∈ F^{m×n} × F^{m×n} and let (s, ŝ) be a pair of indeterminates. The polynomial matrix sF − ŝG ∈ F^{m×n}[s, ŝ] is defined as the homogeneous matrix pencil of the pair (F, G). Clearly, sF − ŝG is a matrix over the ring F[s, ŝ], i.e., polynomials in (s, ŝ) with coefficients from F, but it may also be viewed as a matrix over the rings F(s)[ŝ] or F(ŝ)[s].
Definition 1. Let L = {L : L = (F, −G); F, G ∈ F^{m×n}} be the set of ordered pairs of m × n matrices and Θ = {θ : θ = (s, ŝ)} be the set of ordered pairs of indeterminates. For every L = (F, −G) ∈ L and θ = (s, ŝ) ∈ Θ, the matrix [L] = [F  −G] ∈ F^{m×2n} is called a matrix representation of L, and the homogeneous polynomial matrix

L_θ = L(s, ŝ) = [F  −G] [ sI_n ; ŝI_n ] = sF − ŝG

(where [ sI_n ; ŝI_n ] denotes the 2n × n block column) is referred to as the θ-matrix pencil of L.

Definition 2. Define the following sets of matrix pencils:

L_θ = {L_θ : for a fixed θ = (s, ŝ) ∈ Θ and for every L = (F, −G) ∈ L},
L(Θ) = {L_θ : for every θ = (s, ŝ) ∈ Θ and for every L = (F, −G) ∈ L}.

With the following definition, the BSE is defined on L, or equivalently on L(Θ). This equivalence relation is generated by the action of appropriate transformation groups acting on L, or equivalently on L(Θ).

Definition 3. Consider first the set

K = {k : k = (M, N); M ∈ F^{m×m}, N ∈ F^{2n×2n}; det M, det N ≠ 0}

and a composition rule (∗) defined on K as follows: ∗ : K × K → K: for every k1 = (M1, N1) ∈ K and k2 = (M2, N2) ∈ K,

k1 ∗ k2 = (M1, N1) ∗ (M2, N2) := (M1 M2, N2 N1).

(It may be easily verified that (K, ∗) is a group with identity element (I_m, I_{2n}).)
The action of K on L is defined by ∘ : K × L → L: for every k = (M, N) ∈ K and L = (F, −G) ∈ L,

k ∘ L = k ∘ (F, −G) := L' = (F', −G') ∈ L : [L'] = M [L] N,

or equivalently [F'  −G'] = M [F  −G] N. Such an action defines an equivalence relation E_K on L, and E_K(L) denotes the equivalence class or orbit of L ∈ L under K. Now, an important subgroup of K and the notion of the Bilinear-Strict Equivalence on L, or L(Θ), is introduced.

(Bilinear-Strict Equivalence): The subgroup (H−B, ∗) of (K, ∗), where

H−B = {r : r = h ∗ b; for every h ∈ H and for every b ∈ B},   (1)

with

H = {h : h = (R, P); R ∈ F^{n×n}, P = diag{Q, Q}, Q ∈ F^{n×n}; det R, det P ≠ 0}

and

B = {b : b = (I_m, T_d), T_d = [ αI_n  βI_n ; γI_n  δI_n ], d = [ α  β ; γ  δ ] ∈ F^{2×2}; αδ − βγ ≠ 0},
is called the Bilinear-Strict Equivalence Group (BSEG). The action of H − B on L is defined by ◦ : H − B × L → L : for every r ∈ H − B and for a L = (F, −G) ∈ L, then Δ
r ◦ L = (h ∗ b) ◦ (F, −G) = h ◦ {b ◦ (F, −G)} = b ◦ {h ◦ (F, −G)} = L = (F , −G ) ∈ L : [L ] = Im R [L] F Td or equivalently
Δ αIn βIn F −G = Im R F −G Q = αRF Q − γRGQ βRF Q − δRG Q γIn δIn
The equivalence relation EH−B defined on L is called BSE, see [2] and [4]. Two ˆ 2 ∈ Lθ , θ = (s, sˆ) ∈ Θ and pencils L1θ = sF1 − sˆG1 ∈ Lθ , L2θ = λF2 − λG ˆ θ = (λ, λ) ∈ Θ are said to be bilinearly - strict equivalent, i.e. L1θ EH−B L2θ , if ˆ and thus a b ∈ B and only if there exists a transformation d : (s, sˆ) → (λ, λ) generated by d, and a h ∈ H such that (F2 , −G2 ) = (h ∗ b)◦(F1 , −G1 ). In matrix form the above condition may be expressed by 2 Lθ = (h ∗ b) ◦ L1θ ,
782
G.I. Kalogeropoulos, A.D. Karageorgos, and A.A. Pantelous
or
Δ αQ βQ F2 −G2 = R F1 −G1 γQ δQ = αRF1 Q − γRG1 Q βRF1 Q − δRG1 Q .
By EB (F, G) is denoted the BSE class or orbit of Lθ = sF − sˆG, or equivalently of L = (F, −G).
2
Creating the Conditions for the BSE
In the beginning of this section, we should define the Projective Equivalence (PE). Let Fd {Θ} be the set of homogeneous polynomials of degree d with coefficients from the filed F (i.e. R or C) in all possible indeterminates θ = (s, sˆ) ∈ Θ. ˆ be the projective transformation defined by Let ξ : (s, sˆ) →(λ, λ) λ s αβ ξ: = (2) ˆ . sˆ γ δ λ The action of ξ on f (s, sˆ) ∈ Fd {Θ} may be defined by ˆ = f (αλ + β λ, ˆ γλ + δ λ) ˆ ξ ◦ f (s, sˆ) ≡ f˜(λ, λ)
(3)
ˆ ∈ Fd {Θ} will be said to be projectively equivTwo polynomials f1 (s, sˆ), f2 (λ, λ) ˆ if there exists a ξ ∈ P GL(1, C/R) alent and it is denoted by f1 (s, sˆ)EP f2 (λ, λ), (i.e. the general projective group on the projective straight line of C/R) such ˆ and a c ∈ R − {0} that ξ : (s, sˆ) →(λ, λ) ˆ ξ ◦ f1 (s, sˆ) = c · f2 (λ, λ)
(4)
Clearly, (4) defines an equivalence relation EP on Fd {Θ}, which is called PE. ˆ i ∈ ρ} are two ordered sets of hoLet F1 = {fi (s, sˆ), i ∈ ρ}, F2 = {f˜i (λ, λ), mogeneous polynomials. F1 , F2 are said to be projectively equivalent, F1 EP F2 ˆ for every i ∈ ρ and for the same transportaif and only if fi (s, sˆ)EP f˜i (λ, λ) tion ξ. Moreover, the projective equivalence class of f (s, sˆ)(F ) shall be denoted by EP (f )(EP (F )). Now, in order to find the conditions for BSE, the following Proposition is needed. ˆ = λF2 − λG ˆ 2 , F (F1 , G1 ) = Proposition 1. Let L1 (s, sˆ) = sF1 − sˆG1 , L2 (λ, λ) ˆ i ∈ ρ2 − k2 } be the correspond{fi (s, sˆ), i ∈ ρ1 − k1 }, and F (F2 , G2 ) = {f˜i (λ, λ), ˆ where (ρ1 , k1 ), ing homogeneous invariant polynomial sets of L1 (s, sˆ), L2 (λ, λ), (ρ2 , k2 ) are the respective pairs of rank and power. ˆ for some h ∈ H and some b ∈ B is generated by the If L1 (s, sˆ)EH−B L2 (λ, λ) ξ ˆ then function ξ ∈ P GL(1, C/R) such that (s, sˆ) →(λ, λ),
An Efficient Algorithm for Bilinear Strict Equivalent (BSE)- Matrix Pencils
783
1. ρ1 = ρ2 = ρ and k1 = k2 = k. 2. And F (F1 , G1 )EP F (F2 , G2 ). An interesting result proposed and proved by [2] and [6] is presenting below. ˆ = λF2 −λG ˆ 2 , and Ic (F1 , G1 ), Proposition 2. Let L1 (s, sˆ) = sF1 −ˆ sG1 , L2 (λ, λ) Ic (F2 , G2 ) and Ir (F1 , G1 ), Ir (F2 , G2 ) be the corresponding sets of column and ˆ respectively. If row minimal indices (c.m.i.) and (r.m.i.) of L1 (s, sˆ) and L2 (λ, λ) ˆ L1 (s, sˆ)EH−B L2 (λ, λ), then Ic (F1 , G1 ) = Ic (F2 , G2 ) and Ir (F1 , G1 ) = Ir (F2 , G2 ). Thus, Propositions 1 and 2 express the covariance property of the homogeneous invariant polynomials and the invariance property of the sets of c.m.i. and r.m.i. of L(s, sˆ) under EH−B equivalence, respectively. By combinig the above two results, the following criterion for EH−B equivalence of matrix pencils is obtained. ˆ = λF2 − λG ˆ 2. Theorem 1. Let L1 (s, sˆ) = sF1 − sˆG1 , L2 (λ, λ) ˆ L1 (s, sˆ)EH−B L2 (λ, λ) if and only if the following conditions hold: 1. Ic (F1 , G1 ) = Ic (F2 , G2 ), Ir (F1 , G1 ) = Ir (F2 , G2 ) 2. F (F1 , G1 )EP F (F2 , G2 ) Proof. The necessity of the theorem is actually a straightforward result of Proposition 1 and 2. Next, in order to prove the sufficiency of the theorem, we assume that the conditions (1) and (2) are true. Now, since F (F1 , G1 )EP F (F2 , G2 ), ˆ have the same rank ρ (note that ρ1 = ρ2 = ρ) and then L1 (s, sˆ), L2 (λ, λ) the same power k (also note that k1 = k2 = k). Moreover, there exists a ξ ˆ for which ξ ◦ fi (s, sˆ) = ci · transformation ξ ∈ P GL(1, C/R) : (s, sˆ) →(λ, λ) ˆ for every i = 1, 2, ..., ρ − k. This transformation ξ generates a b ∈ B fi (λ, λ) ˆG ˆ Now, since L1 (s, sˆ)EB L1 (λ, λ), ˆ ˜1 ≡ L ˜ 1 (λ, λ). such that b ◦ L1 (s, sˆ) = λF˜1 − λ ˜ ˜ ˜ ˜ then Ic (F1 , G1 ) = Ic (F1 , G1 ) = Ic (F2 , F2 ), and Ir (F1 , G1 ) = Ir (F1 , G1 ) = Ir (F2 , G2 ). Furthermore, the sets of the homogeneous invariant polynomials of ˜ 1 (s, sˆ) and L2 (s, sˆ) differ only by scalars units of F[s, sˆ] (following the conL ˜ 2 (s, sˆ) have the same Smith form struction of ξ). Consequently L1 (s, sˆ) and L over F[s, sˆ] (or equivalently the same sets of elementary divisors) and the same ˜ 2 (s, sˆ), see for more details [1] and [2]. sets of c.m.i., r.m.i. and L1 (s, sˆ)EH L ˜ 2 (s, sˆ). Now, since Therefore, there exists a h ∈ H such that L1 (s, sˆ) = h ◦ L ˆ ˆ and finally, it ˜ L2 (s, sˆ) = b ◦ L2 (λ, λ), it follows that L1 (s, sˆ) = (h ∗ b) ◦ L2 (λ, λ) ˆ is derived that L1 (s, sˆ)EH−B L2 (λ, λ). As we have already seen in this section, the notion of EP equivalence, which is defined on the set of homogeneous invariant polynomials F (F, G) of the pencil sF − sˆG is the milestone for the characterization of the EH−B . Moreover, the whole problem is reduced into the investigation of the conditions under which ˆ ∈ Fd {Θ} are EP equivalence. Thus, the problem two polynomials f (s, sˆ), f˜(λ, λ) ˆ is equivalent with the of finding the conditions under which f (s, sˆ)EP f˜(λ, λ) determination of the complete and independent set of invariant for the orbit EP (f (s, sˆ)).
784
3
G.I. Kalogeropoulos, A.D. Karageorgos, and A.A. Pantelous
The Algorithm for the BSE
Before we state the main result of this paper, we introduce some notations. We denote by D(F, G) the symmetric set of elementary divisors (e.d.) over C, by B(F, G) = {BR (F, G); BC (F, G)} the unique factorization set, and by J (F, G) = {JR (F, G); JC (F, G)} the list of F (F, G) (and thus for L(s, sˆ)). The set B(F, G) is assumed to be naturally ordered and if π ¯ B(F, G) = (πBR (F, G), π BC (F, G)) ∈< B(F, G) >, then the (π, π )-matrix basis is designed by the Tπ,π . Note that the T (F, G) denotes the family of matrix representations of B(F, G). In order to analyze further the significant theorem 1, we need the following definition, see also [2] and [5]. Definition 4. Let f (s, sˆ) ∈ Rk {Θ}, B(f ) = {BR(f ); BC (f )} and let μ #BR (f ), ν #BC (f ) (# denotes the number of elements of the corresponding set). Let us also assume that π ∈< BR (f ) >, π ∈< BC (f ) > and that π
[B (f )] R
Tπ,π = B π,π (f ) = BCπ (f ) is the (π, π )-matrix basis of f (s, sˆ). We may define: (i) T (f ) {Tπ,π : ∀π ∈< BR (f ) >, ∀π ∈< BC (f ) >} as the family of matrix representation of B(f ) or of f (s, sˆ). (ii) If μ + ν ≥ 3, Tπ,π ∈ T (f ), r ∈< μ + ν >, (i1 , i2 ) ∈ Qr2,μ+ν−1 , then the (r, i1 , i2 ) − R-canonical Grassmann vector (CGV) of Tπ,π , g˜π,π is well r,i1 ,i2 defined and is referred as the (π, π ) − (r, i1 , i2 ) − R-CGV of f (s, sˆ). The set of all such vectors Gf {˜ gπ,π : ∀π ∈< BR (f ) >, ∀π ∈< BC (f ) >, ∀r ∈< μ + ν > r,i1 ,i2 and ∀(i1 , i2 ) ∈ Qr2,μ+ν−1 } is well defined and is called the R-CGV set of f (s, sˆ). Now, as an extension of theorem 1 and in order to be able to develop our algorithm, the following Theorem is very important, see also [2]. Theorem 2. Let L(s, sˆ) = sF − sˆG ∈ LΘ be a general order matrix pencil, B(F, G), J (F, G), G(F, G), Ic (F, G) and Ir (F, G) be the associated sets with the L(s, sˆ). The complete set of invariants for the EH−B equivalence class is defined by: (i) Ic (F, G), Ir (F, G), (ii) J (F, G) = {JR (F, G); JC (F, G)}, ¯ where ¯(E) = diag{E, E, ..., E}, E = [j]R . (iii) G(F, G) or G(F, G) modulo-C2 (E),
Finally, for every Tπ,π ∈ T (F, G), we denote by Prπ,π (F, G),P π,π (F, G) and P ∗ (F, G) the set of (π, π ) − r-prime Pl¨ ucker vectors (r ∈< μ + ν >) of Tπ,π , the set of all r-prime Pl¨ ucker of Tπ,π , and the Pl¨ ucker vector set of T (F, G) on the L(s, sˆ), respectively. Now, in this part of the section, we develop the algorithm for testing the EH−B ˆ = λF2 −λG ˆ 2 . During equivalence of two pencils L1 (s, sˆ) = sF1 −ˆ sG1 and L2 (λ, λ)
An Efficient Algorithm for Bilinear Strict Equivalent (BSE)- Matrix Pencils
785
this process the sets of G(Fi , Gi ) or P ∗ (Fi , Gi ) should be computed. A systematic procedure for testing the EH−B equivalence implies the following steps. The algorithm for the BSE ˆ respectively. Step 1: Find the ranks ρ1 , ρ2 of L1 (s, sˆ), L2 (λ, λ), ˆ If ρ1 = ρ2 , we stop, since we do not have L1 (s, sˆ)EH−B L2 (λ, λ). Step 2: If ρ1 = ρ2 , find the sets of degrees J1 and J2 of F (Fi , Gi ), for i = 1, 2. ˆ If J1 = J2 stop, since since we do not have L1 (s, sˆ)EH−B L2 (λ, λ). = Step 3: If J1 = J2 , compute the J (Fi , Gi ), for i = 1, 2. If J (F1 , G1 ) ˆ J (F2 , G2 ) stop, since we do not have L1 (s, sˆ)EH−B L2 (λ, λ). Step 4: If J (F1 , G1 )=J (F2 , G2 ), compute Ic (Fi , Gi ) and Ir (Fi , Gi ) for i=1, 2. If Ic (F1 , G1 ) = Ic (F2 , G2 ), or (and) Ir (F1 , G1 ) = Ir (F2 , G2 ) stop, ˆ since we do not have L1 (s, sˆ)EH−B L2 (λ, λ). Step 5: Now, if Ic (F1 , G1 ) = Ic (F2 , G2 ) and Ir (F1 , G1 ) = Ir (F2 , G2 ), then compute the unique factorization set B(Fi , Gi )={BR (Fi , Gi ); BC (Fi , Gi )} for i = 1, 2. The sets of BR (Fi , Gi ), BC (Fi , Gi ) characterize the sets of real and complex e.d. DR (Fi , Gi ), DC (Fi , Gi ) respectively. Thus define the set of homogeneous polynomials FR (Fi , Gi ) ∈ SRJa {Θ}, FC (Fi , Gi ) ∈ SRJb {Θ}. If we do not have FR (F1 , G1 )EB FR (F2 , G2 ). or FC (F1 , G1 )EB FC (F2 , G2 ), then stop, since we do not have ˆ F (F1 , G1 )EB F (F2 , G2 ) and thus we do not take L1 (s, sˆ)EH−B L2 (λ, λ). Step 6: Finally, if FR (F1 , G1 )EB FR (F2 , G2 ) and FC (F1 , G1 )EB FC (F2 , G2 ), then proceed to the computation of G(Fi , Gi ), for i = 1, 2 or continuing to the special cases’ tests for checking the EP equivalence of F (Fi , Gi ) for i = 1, 2.
4
Numerical Examples
In order to evaluate further the results of the previous sections, especially the proposed Algorithm for the BSE, which is based on the Theorem 1 and 2, we consider the homogenous pencils ⎡ ⎤ ⎤ ˆ 0 λ + 4λ 0 0 s − 2ˆ s 0 0 0 ⎢ ˆ ⎢ 0 2s − 4ˆ 0 ⎥ s0 0 ⎥ ˆ = ⎢ 0 λ + 4λ 0 ⎥ ⎥ , L2 λ, λ L1 (s, sˆ) = ⎢ ⎣ ⎣ 0 ⎦ ˆ 0 3s 0 0 0 λ + 2λ 0 ⎦ ˆ 0 0 0 −ˆ s 0 0 0 λ + 3λ ⎡
ˆ = 4. Thus, we It is easily verified that ρ1 = det(L1 (s, sˆ)) = ρ2 = det(L2 (λ, λ)) nd continue to the 2 step.
786
G.I. Kalogeropoulos, A.D. Karageorgos, and A.A. Pantelous
ˆ (λ + Now, F (F1 , G1 ) = {s − 2ˆ s, 6(s − 2ˆ s)s(−ˆ s)} and F (F2 , G2 ) = {λ + 4λ, ˆ ˆ ˆ 4λ)(λ + 2λ)(λ + 3λ)}, then it easily derived that J1 = J2 . So, following the 3rd step of the Algorithm, we should compute the J (Fi , Gi ), for i = 1, 2. Since, J (F1 , G1 ) = {(1, 1), (1, 3)}, and J (F2 , G2 ) = {(1, 1), (1, 3)}, we have to compute the Ic (Fi , Gi ) and Ir (Fi , Gi ) for i = 1, 2. However, in our case the Ic (F1 , G1 ) = Ic (F2 , G2 ) = O and Ir (F1 , G1 ) = Ir (F2 , G2 ) = O. Here, i.e. in the Step 5, we compute B(F1 , G1 ) = {BR (F1 , G1 )} = {(1, −2; 1), (1, −2; 1), (1, 0; 1), (0, 1; 1).}, and B(F2 , G2 ) = {BR (F2 , G2 )} = {(1, 4; 1), (1, 4; 1), (1, 2; 1), (1, 3; 1).}. Thus, the sets FR (Fi, Gi ) for i = 1, 2 are bilinearly equivalent, with the trans12 formation β = . Finally, according to the 6th step, we have to proceed 13 to the computation of G(Fi , Gi ), for i = 1, 2 or continuing to the special cases’ tests for checking the EP equivalence of F (Fi , Gi ) for i = 1, 2. In this case, we ˆ and b ◦ (6(s − 2ˆ prefer the second choice, i.e. b ◦ (s − 2ˆ s) = −(λ + 4λ) s)s(−ˆ s)) = ˆ ˆ ˆ Consequently, the two pencils are BSE. 6(λ + 4λ)(λ + 2λ)(λ + 3λ).
5
Conclusion-Further Research
In this paper, we have created an algorithm for testing the EH−B equivalence of two pencils. The EH−B equivalence class has been characterized by a complete set of invariants. However, there exists a number of other functions (also defined for pencils), which are not EH−B invariant and generally speaking, they can ˜ ˜. This kind of problem might give different values for the elements Lθ and L θ be interested in studying different topics of numerical analysis or theoretical applications of system and control theory. Acknowledgments. The authors are very grateful to the anonymous referee for the comments and the significant suggestions.
References 1. Gantmacher, R.F.: The theory of matrices, Chelsea, New York, U.S.A., vol. 1, 2 (1959) 2. Kalogeropoulos, G.I.: Matrix Pencils and linear systems theory, PhD thesis, Control Engineering Centre, City University, London, U.K. (1985) 3. Kalogeropoulos, G.I., Karageorgos, A.D., Pantelous, A.A.: A stabilization criterion for matrix pencils under bilinear transformation. Linear Algebra and its Applications 428(11–12), 2852–2862 (2008) 4. Karcanias, N., Kalogeropoulos, G.I.: Bilinear-strict equivalence of matrix pencils and autonomous singular differential systems. In: Proceedings of 4th I.M.A. Intern. Conf. on Control Theory, Robinson College, Cambridge, U.K. (1984) 5. Marcus, M.: Finite dimensional multilinear algebra (two volumes). Marcel and Deker, New York (1973) 6. Turnbull, H.W., Aitken, A.C.: An introductionto the theory of canonical matrices. Dover Publications, New York (1961)
Two-Grid Decoupling Method for Elliptic Problems on Disjoint Domains Miglena N. Koleva and Lubin G. Vulkov Faculty of Natural Science and Education University of Rousse, 8 Studentska str., Rousse 7017, Bulgaria {mkoleva,lvalkov}@ru.acad.bg
Abstract. We propose a two-grid finite element method for solving elliptic problems on disjoint domains. With this method, the solution of the multi-component domain problem(simple example — two disjoint rectangles) on a fine grid is reduced to the solution of the original problem on a much coarser grid together with solution of several problems (each on a single-component domain) on fine meshes. The advantage is the computational cost although the resulting solution still achieves asymptotically optimal accuracy. Numerical experiments demonstrate the efficiency of the algorithms.
1
Introduction
In this paper, we consider coupling elliptic transmission problem on a multicomponent domain with nonlocal interface conditions on parts of the boundary of the components. The study of such problems could be motivated physically by the occurrence of various nonstandard boundary and coupling conditions in modern physics, biology, engineering [2,3,10]. One dimensional problems was numerically solved in [5-8]. However, here we want to illustrate the two-grid idea in a new direction, namely we use the two-grid discretization method to decouple the multi-component domain problem to several elliptic equations, each of them solved on its own domain. The two-grid method was proposed of Axelsson [1] and J. Xu [12], independently of each other, for a linearization of the nonlinear problems. The two-grid finite element method was also used by J.Xu and many others scientists (see the reference in [4]) for discretizing nonsymmetric indefinite elliptic and parabolic equations. By employing two finite element spaces of different scales, one coarse and one fine space, the method was used for symmetrization of nonsymmetric problems, which reduces the solution of a nonsymmetric problem on a fine grid to the solution of a corresponding (but much smaller) nonsymmetric problem, discretized on the coarse grid and the solution of a symmetric positive definite problem on the fine grid. There are also many applications to nonlinear elasticity problem, fluid mechanics [9,11] etc. The two-grid approach in the present paper is an extension of the idea in [4], where it was used to decouple of a Shr¨ odinger system of differential equations. The system of partial differential equations is first discretized on the coarse I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 787–795, 2010. c Springer-Verlag Berlin Heidelberg 2010
788
M.N. Koleva and L.G. Vulkov
grid, then a decoupled system is discterized on the fine mesh. As a result, the computational complexity of solving the Shr¨ odinger system is comparable with solving two decoupled Poisson equations on the same grid. In this paper we consider a system of two elliptic equations, each of them solved on its own rectangle. The two problems are coupled with nonlocal interface conditions on parts of the rectangles’s boundary. First, the original system is approximated on the coarse mesh and then a decoupled system is discretized on the fine grid. The rest of the paper is organized as follows. In Section 2, we introduce the model problem, used to demonstrate our method. In the next section, we study the finite element solution, while in Section 4 we propose the two grid finite element algorithms and analyze the convergence. Section 5 presents results of numerical experiments, showing the effectiveness of our method. In the next section, by C we denote positive constant, independent of the boundary value solution and mesh sizes.
2
Two-Rectangles Model Problem
As a model example we study the elliptic problem, defined on the disjoint rectangles Ωn = (an , bn ) × (cn , dn ) with boundaries ∂Ωn , n = 1, 2, see Figure 1, (1) − (pn (x, y)unx )x − q n (x, y)uny y + rn (x, y)un = f n (x, y), (x, y) ∈ Ωn , un (x, cn ) = un (x, dn ) = 0, an ≤ x ≤ bn ,
(2)
u (a1 , y) = 0, c1 ≤ y ≤ d1 , u (b2 , y) = 0, c2 ≤ y ≤ d2 , (3) d 2 p1 (b1 , y)u1x (b1 , y)+ s1 (b1 , y)u1 (b1 , y)= ϕ2 (a2 , η)u2 (a2 , η)dη, c1 ≤ y ≤ d1 , (4) 1
2
c2
−p
2
d1 ϕ1 (b1 , η)u1 (b1 , η)dη, c2 ≤ y ≤d2 . (5)
(a2 , y)u2x (a2 , y)+ s2 (a2 , y)u2 (a2 , y)=
c1
Throughout the paper we assume that the input datum satisfy the usual regularity and ellipticity conditions in Ωn , n = 1, 2, pn (x, y), q n (x, y), rn (x, y) ∈ L∞ (Ωn ), f n ∈ L2 (Ωn ),
(6)
0 < pn0 ≤ pn (x, y) < pn1 , 0 < q0n ≤ q n (x, y) < q1n , 0 ≤ rn (x, y) ≤ r1n ,
(7)
for (x, y) ∈ Ωn . In the real physical problems (see [2,3]) we often have ϕ1 , ϕ2 > 0. We introduce the product space L = {v = (v 1 , v 2 ) | v n ∈ L2 (Ωn )}, n = 1, 2, endowed with the inner product and the associated norm 1/2
(u, v)L = (u1 , v 1 )L2 (Ω1 ) + (u2 , v 2 )L2 (Ω2 ) , vL = (v, v)L , where n n (un , v n )L2 (Ωn ) = u v dxdy, n = 1, 2. Ωn
Two-Grid Decoupling Method for Elliptic Problems on Disjoint Domains
789
y d
2
Γ2
d1
Ω
Ω1
2
Γ1 c
c
Eij3
Eij4
1
Eij2
Eij1
Eij6
Eij5
2
0
a1
a2
b
1
b2
x
Fig. 1. Domains Ω1 and Ω2
We also define the spaces Hm = {v = (v 1 , v 2 ) | v n ∈ Hm (Ωn )}, m = 1, 2, . . . endowed with the inner products and norms 1/2
(u, v)Hm = (u1 , v 1 )Hm (Ω1 ) + (u2 , v 2 )Hm (Ω2 ) , vHm = (v, v)Hm , where (un , v n )Hm (Ωn ) =
j k j=0 l=0
∂ j un ∂ j vn , , n = 1, 2, m = 1, 2, . . . ∂xl ∂y j−l ∂xl ∂y j−l L2 (Ωn )
In particular, we set H0m = {v = (v 1 , v 2 ) ∈ H1 | v n = 0 on Γn , n = 1, 2}, m = 1, 2, . . . , where Γ1 = ∂Ω1 \{(b1 , y) | y ∈ (c1 , d1 )} and Γ2 = ∂Ω2 \{(a2 , y) | y ∈ (c2 , d2 )}, see Figure 1. Finally, with u = (u1 , u2 ) and v = (v 1 , v 2 ) we define the following bilinear form:
a(u, v) =
p1 u1x vx1
+
q 1 u1y vy1
1 1 1
+r u v
c1
c2
p2 u2x vx2 + q 2 u2y vy2 + r 2 u2 v 2 dx dy + 2
2
1
ϕ (a2 , η)u (a2 , η)v (b1 , y) dηdy − 1
1
1
2
2
d1 c1
Ω2
d1 d2
dx dy +
Ω1
+
−
2
d2 c2
1
c1
1
s (b1 , y)u (b1 , y)v (b1 , y) dy
(8)
1
1
2
ϕ (b1 , η)u (b1 , η)v (a2 , y) dηdy
= a (u , v ) + a (u , v ) + b (u , v ) + b2 (u1 , v 2 )
and c(v) = c2 (v 2 ) + c2 (v 2 ), where cn (v n ) =
2
1
s2 (a2 , y)u2 (a2 , y)v 2 (a2 , y) dy
d2 d1 c2
1
1
f n v n dxdy, n = 1, 2.
Ωn
Lemma 1. Under the conditions (6) and (7) the bilinear form a(u, v), defined by (8), is bounded on H1 × H1 . If besides it, the conditions sn ∈ L∞ (cn , dn ), ϕn ∈ L∞ ((cn , dn ) × (cn , dn )) a. e. in ∂Ωn , n = 1, 2, (9)
790
M.N. Koleva and L.G. Vulkov
are fulfilled, this form satisfies the G˚ arding’s inequality on H01 , i.e. there exist positive constants m and κ such that a(u, u) + κ u2L ≥ m u2H1 ,
∀ u ∈ H01 .
The equivalent variational problem of (1)-(5) is defined as follows: find (u1 , u2 ) ∈ H1 such that a(u, v) = c(v), ∀v ∈ H01 .
(10)
The proof of the next assertion is similar to this of Theorem 1 in [5]. Theorem 1. Let in additional to (6), (7) and (9), the first derivatives of pn (x, y) and q n (x, y), n = 1, 2 also satisfy (6). Then the variational problem (10) has an unique weak solution u ∈ H2 and uH2 ≤ Cf L2 .
3
Finite Element Method Solution
The discretization of Ω1 ∪ Ω2 is generated by the uniform mesh ωh = ω h1 ∪ ω h2 , ω hn = {(xi , yj ) : xi = an + (i − 1)hn , i = 1, . . . , Nn , xNn = bn , yj = cn + (j − 1)kn , j = 1, . . . , Mn , yMn = dn }, n = 1, 2. Now, we consider a standard piecewise finite element space Vh = Vh1 ∪ Vh2 , associated with ω h . Namely, we use linear triangular elements, see Figure 1. With Esij , s = 1, . . . , 6 we denote the sixth elements, associated with the grid node (xi , yj ). Let Φnh (x, y) is a basis of Vhn , n = 1, 2, Φh = Φ1h ∪ Φ2h . We seek the finite element solution solution u1h (x, y) ∈ Vh1 , u2h (x, y) ∈ Vh2 in the form u1h (x, y) =
N1 M1
1 Ui,j Φ1hi,j (x, y), u2h (x, y) =
i=1 j=1
N2 M2
2 Ui,j Φ2hi,j (x, y).
(11)
i=1 j=1
We lump the mass matrix by nodal-point integration, that is we get nonzero contributions only for diagonal elements of the local (element) mass matrix. For right hand side of (1) we use the product approximation formula. With integrals of known function ϕ and unknown solution U we deal as follows: for n = 1, 2
dn
cn
ϕn (·, η)U n (·, η)dη ≈
yl+1 Mn −1 1 [U n (·, yl ) + U n (·, yl+1 )] ϕn (·, η)dη. (12) 2 yl l=1
The remaining integral can be computed exactly, depending on the function ϕn . Such approximation conserves the second order of convergence rate.
Two-Grid Decoupling Method for Elliptic Problems on Disjoint Domains
791
The resulting discrete approximation of (1)-(5) is as follows
n n ˆn + Q ˇn Pˆi,j Pˇi,j Pˆ n + Pˇ n Q n n n n − 2 Ui+1,j − 2 Ui−1,j + + + ri,j Ui,j hn hn h2n h2n i,j
ˆn ˇn Q Q i,j n i,j n n − 2 Ui,j+1 − 2 Ui,j−1 = fi,j , n = 1, 2, i = 2, . . . , Nn −1, j = 1, . . . , Mn −1, hn hn n 1 2 n Ui,1 = U1,j = UN = Ui,M = 0, i = 1, . . . , Nn , j = 1, . . . , M1 , n = 1, 2, 2 ,j n
1 ˇ ˆ1 + Q ˇ 1 ) h1 r 1 PN1 ,j 1 Pˇ 1 h1 (Q 1 1 UN1 −1,j + + + +s UN − 1 ,j h1 h1 2k12 2 N1 ,j
M2 ˆ1 ˇ1 h1 Q h1 Q 1 h1 1 N1 ,j 1 N1 ,j 1 2 − UN1 ,j+1 − UN1 ,j−1 − U1,l ϕ 2l = f , 2k12 2k12 2 2 N1 ,j
(13)
l=1
j = 2, . . . , M1 − 1,
ˆ2 + Q ˇ 2 ) h2 r 2 Pˇ 2 h2 (Q 2 2 U + + + +s − h2 2,j h2 2k22 2 2 Pˆ1,j
2 U1,j − 1,j
ˆ2 h2 Q 1,j 2 U1,j+1 2k22
M1 ˇ2 h2 Q 1 h2 2 1,j 2 1 f , j = 2, . . . , M2 − 1, − U − UN ϕ 1 = 1,j−1 1 ,l l 2k22 2 2 1,j l=1
n ˆ n, Q ˇ n, where Ui,j = U n (xi , yj ), n = 1, 2 and the coefficients of (13): Pˆ n , Pˇ n , Q n ϕ , n = 1, 2 can be computed exactly or numerically. For example,
Pˆijn =
⎧ ⎪ ⎪ ⎨
1 ( + )pn dxdy, hn kn ij ij E1 E6 pn h1 , ⎪ ⎪ ⎩ 1i+ n2 ,j n (pi,j + pi+1,j ), 2
exact midpont approx. trapezoidal approx.
⎫ 1 ( + )q n dxdy ⎪ hn kn ⎪ ⎬ ij ij E5 E6 n qi,j− k1 ⎪ ⎪ 2 ⎭ n 1 (q n + qi,j−1 ) 2 i,j
ˇn =Q ij
and ϕ nl are obtained from bn , (11) and (12). The error analysis of the above finite element discretization can be achieved by standard techniques, see e.g. [4]. Theorem 2. Under the assumptions in Theorem 1, uh = (u1h , u2h ) has the error estimate u − uh Hsh ≤ Ch2−s uL , s = 0, 1, where · s is the discrete s-Sobolev norm.
4
Two-Grid Decoupling FEM Algorithms
H h Let define a new mesh (coarse mesh) ωH = ωH 1 ∪ ω 2 , analogically to ω with h mesh steps: Hn > hn and Kn > kn , n = 1, 2. The mesh ω we will call fine mesh. As before, we consider a standard piecewise finite element space VH (with basis
792
M.N. Koleva and L.G. Vulkov
ΦH ), associated with ω H . Let Uni = [U n (xi , y1 ), U n (xi , y2 ), . . . , U n (xi , yMn −1 ), U n (xi , Mn )]T , i = 1, . . . , Nn , n = 1, 2. Algorithm 1 step 1. Find U1N1 and U21 from (13) on the coarse mesh ω H . step 2. For all Φh ∈ Vh on the fine mesh ω h : a) Find ˚ u1h ∈ Vh1 such that a1 (˚ u1h , Φ1h ) = c1 (Φ1h ) − b1 (u2H , Φ1h ). 2 2 2 2 b) Find ˚ uh ∈ Vh such that a (˚ uh , Φ2h ) = c2 (Φ2h ) − b2 (˚ u1h , Φ2h ). Note that at step 2 the problems in Ω1 and Ω2 are computed consequently and separately. Now, Algorithm 1 can be improved in a successive fashion, organizing an iteration process, such that at each step to account the last computed values of U1N1 and U21 . Algorithm 2 n(1)
step 1. Find ˚ unh , n = 1, 2 from Algorithm 1, ˚ uh
:= ˚ unh
step 2. For k = 1, 2, . . . and all Φh ∈ Vh on the fine mesh ω h : 1(k+1) 1(k+1) 2(k) a) Find ˚ uh ∈ Vh1 : a1 (˚ uh , Φ1h ) = c1 (Φ1h ) − b1 (˚ uh , Φ1h ). 2(k+1) 2(k+1) 1(k+1) b) Find ˚ uh ∈ Vh2 : a2 (˚ uh , Φ2h ) = c2 (Φ2h ) − b2 (˚ uh , Φ2h ). The proofs of the next theorems are based on the ideas in [4]. Theorem 3. Under the assumptions in Theorem 1, ˚ unh for n = 1, 2, computed with Algorithm 1 has the following error estimate un − ˚ unh 1 0.5(h2n + kn2 ) + 0.5(Hn3 + Kn3 ), n = 1, 2. n(k)
The next theorem shows that ˚ uh , k = 2, 3, . . . can reach the optimal accu2/(k+1) racy in H1 -norm, if the coarse mesh step size Hn (Kn ) is taken to be hn 2/(k+1) (kn ). As the dimension Vh is much smaller than VH , the efficiency of the algorithms is then evident. n(k)
Theorem 4. Under the assumptions in Theorem 1, ˚ uh with Algorithm 2, has the following error estimate n(k)
unh − ˚ uh
for n = 1, 2, computed
1 0.5(Hnk+1 + Knk+1 ), n = 1, 2, k ≥ 1,
Consequently, n(k)
un − ˚ uh n(k)
1 0.5(h2n + kn2 ) + 0.5(Hnk+1 + Knk+1 ), n = 1, 2. 2/(k+1)
Namely, ˚ uh , k ≥ 1 has the same accuracy as ˚ unh in H1 -norm, if Hn = hn 2/(k+1) and Kn = kn , n = 1, 2.
Two-Grid Decoupling Method for Elliptic Problems on Disjoint Domains
5
793
Numerical Examples
The test example is problem (1)-(5), with a1 = 1, b1 = 2, c1 = 0.2, d1 = 0.8, a2 = 3, b2 = 4.5, c2 = 0, d2 = 1 and zero Dirichlet boundary conditions. The coefficients are: p1 (x, y) = ex+y , q 1 (x, y) = sin(x + y), r1 (x, y) = x + y, s1 (x, y) = 2xy, 2 2 2 p (x, y) = x + y , q 2 (x, y) = xy, r2 (x, y) = x − y, s2 (x, y) = 2(x + y), 2x x 1 2 ϕ (x, y) = 2b1 y(1−y) , ϕ (x, y) = a2 y(1−y) . In the right hand side of equations (4),(5) we add functions f 3 (x, y) and f 4 (x, y), respectively and determine f s (x, y), s = 1, . . . , 4, so that u = (u1 , u2 ), u1 (x, y) = 2[cos(10πy − π) + 1](x − a1 )2 y(d1 + 0.2 − y), u2 (x, y) = 2[cos(4πy − π) + 1](x − b2 )2 y(d2 − y), is the exact solution of the problem (1)-(5). In the examples, the coefficients Pˆ , Pˇ , ˆ and Q ˇ are computed using midpoint approximation, while for the right hand Q side we use trapezoidal rule approximation. Mesh parameters are H1 = H2 = H, K1 = K2 = K for the coarse mesh and h1 = h2 = h, k1 = k2 = k for the fine mesh. The results are given in H1 ( · 1 ), L2 and max discrete norms and the convergence rate (CR) is calculated using double mesh principle: E h = uh −u, h CR = log2 [E h /E 2 ]. All computations are performed by MATLAB 7.1. The generated linear systems of algebraic equations are solved by QR decomposition method with pivoting, using MATLAB libraries function ‘mldivide’. Example 1. (Algorithm 1 ) We chose h = H 3/2 and k = K 3/2 , such that to reveal the accuracy of the solution, computed with Algorithm 1. The results are listed in Table 1, which show that ˚ uh − u1 ≈ O(H 3 + K 3 ) = O(H 3 + K 3 + 2 2 h + k ), i.e. the assertion of Theorem 3. Example 2. (Algorithm 2 ) Here we will demonstrate the efficiency of Algon(k) rithm 2. In order to check the convergence rate, each ˚ uh , n = 1, 2 are computed (k+1)/2 (k+1)/2 with fine mesh step size h = H ,k=K for all 1, . . . , k iterations. 3
3
n 2 2 Table 1. Error and CR in different norms ˚ un h − u , h = H , k = K , Algorithm 1
Ω1 mesh H = 1/4, K = 1/5 H = 1/8, K = 1/10 H = 1/16, K = 1/20 H = 1/32, K = 1/40
2
Ω2 max
L2
H1
1.92 2.70e-1
8.82e-2 1.24e-2
3.97e-2 4.85e-3
5.70e-1 7.36e-2
(3.2299 )
(2.8301 )
(2.8304 )
(3.0331 )
(2.9532 )
1.24e-3
3.34e-2
1.73e-3
6.13e-4
9.43e-3
(3.0402 )
(3.0151 )
(2.8415 )
(2.9840 )
(2.9644 )
max
L
H
3.25e-1 4.02e-2
9.57e-2 1.02e-2
(3.0152 )
5.07e-3 (2.9871 )
1
7.64e-4
1.83e-4
4.25e-3
2.80e-4
8.84e-5
1.37e-3
(2.7303 )
(2.7604 )
(2.9743 )
(2.6273 )
(2.7921 )
(2.7831 )
794
M.N. Koleva and L.G. Vulkov k+1 2
Table 2. Error and CR in different norms for h < H
,k
k+1 2
Ω1
Ω2
L2
max
mesh
, Algorithm 2
H1
L2
max
H1
k = 2, h H 5/2 , k K 5/2 — fixed for all iterations 1.00e-2 2.58e-3 7.57e-2 3.17e-3 1.27e-3 2.00e-2 6.96e-4 1.50e-4 2.75e-3 1.63e-4 5.16e-5 7.43e-4
H = 1/4, K = 1/5 H = 1/8, K = 1/10
(3.8448 )
(4.1043 )
(4.7828 )
(4.2815 )
(4.6213 )
(4.7505 )
k = 3, h H 7/2 , k K 7/2 — fixed for all iterations 4.48e-4 1.14e-4 3.40e-3 1.46e-4 5.70e-5 9.96e-4
H = 1/4, K = 1/5
2
0.02 0.015
1.5 0.01 1 U
Error
0.005
0.5
0 −0.005
0 −0.01 −0.5 1
−0.015 1 0.8
4.5 4
0.6
3.5 3
0.4
2.5
4.5 4
0.6
3.5
0
2.5 2
0.2
1.5 1
3
0.4
2
0.2 y
0.8
x
y
0
1.5 1
x
Fig. 2. Numerical solution (left) and error (right) for H = 0.1, K = 0.02, Alg.1, step 1
The results — errors in different discrete norms and convergence rate, are shown n(k) in Table 2. We can see that ˚ uh −un 1 ≈ O(h2 +H k+1 ), which is the statement of Theorem 4. Moreover, we observe that for k ≥ 2 the convergence increases rather faster than the theoretical assertion.
6
Conclusions
The main advantage of Algorithms 1,2 is that we reach a high accuracy of the numerical solution, solving the whole problem (Ω1 ∪ Ω2 ) only once — on the coarse mesh, then the problem is separated into two problems and we solve them separately and consequently on the fine mesh, increasing the convergence. This study can obviously extended to elliptic and parabolic problems with polygonal and curvilinear domains. Also, the nonlinear two-grid decoupling method [1,12] can be developed to nonlinear differential problems on disjoint domains of type [2,3].
Acknowledgement This research is supported by the Bulgarian National Fund of Science under Project Sk-Bg-203.
Two-Grid Decoupling Method for Elliptic Problems on Disjoint Domains
795
References 1. Axelsson, O.: On mesh independence and Newton methods. Appl. Math. 38(4–5), 249–265 (1993) 2. Amosov, A.A.: Global solvability of a nonlinear nonstationary problem with a nonlocal boundary condition of radiative heat transfer type. Diff. Eqns. 41, 96–109 (2005) 3. Druet, P.E.: Weak solutions to a time-dependent heat equation with nonlocal radiation condition and right hand side Lp (p ≥ 1). WIAS Preprint 1253 (2008) 4. Jin, J., Shu, S., Xu, J.: A two-grid discretization method for decoupling systems of partial differential equations. Math. Comp. 75, 1617–1626 (2006) 5. Jovanovi´c, B.S., Vulkov, L.G.: Numerical solution of a parabolic transmission problem. IMA J. Numer. Anal. (2009) (to appear) 6. Jovanovi´c, B.S., Vulkov, L.G.: Numerical solution of a hyperbolic transmission problem. Comp. Meth. in Appl. Math. 4(4), 374–385 (2009) 7. Koleva, M.: Finite element solution of boundary value problems with nonlocal Jump Conditions. J. of Math. Modl. and Anal. 13(3), 383–400 (2008) 8. Koleva, M., Vukov, L.: A two-grid approximation of an interface problem for the nonlinear Poisson-Boltzmann equation. In: Margenov, S., Vulkov, L.G., Wa´sniewski, J. (eds.) NAA 2008. LNCS, vol. 5434, pp. 369–376. Springer, Heidelberg (2009) 9. Mu, M., Xu, J.: A Two-grid Method of a Mixed Stokes-Darcy Model for Coupling Fluid Flow with Porous Media Flow. SIAM J. Numerical Analysis 45, 1801–1813 (2007) 10. Tikhonov, A.N.: On functional equations of the Volterrra type and their applications to someproblems of mathematical physics. Byull. Mosk. Gos. Univ., Ser. Mat. Mekh. 1(8), 1–25 (1938) 11. Xu, J.: Two-grid discretization techniques for linear and nonlinear PDEs. SIAM Journal on Numerical Analysis 33, 1759–1777 (1996) 12. Xu, J.: A novel two-grid method for semilinear elliptic equations. SIAM J. Sci. Comput. 15(1), 231–237 (1994)
A Method for Sparse-Matrix Computation of B-Spline Curves and Surfaces Arne Laks˚ a Narvik University College, P.O.B. 385, N-8505 Narvik, Norway
[email protected]
Abstract. Matrix methods of computing B-spline curves and surfaces have been considered in the work of several authors. Here we propose a new, more general matrix formulation and respective upgraded notation. The new approach is based on non-commutative operator splitting, where the domain and range of every factoring operator differ by one dimension, and the factoring operators are represented by a product of sparse rectangular matrices with expanding dimensions differing by 1, so that these matrices are d × (d + 1)-dimensional (with d increasing with an increment of 1) and have nonzero values only on their two main diagonals (ai,i ) and (ai,i+1 ), i = 1, . . . , d. In this new matrix formulation it is possible to obtain the generation of the B-spline basis and the algorithms of de Casteljau and Cox–de Boor in a very lucid unified form, based on a single matrix product formula. This matrix formula also provides an intuitively clear and straightforward unified approach to corner cutting, degree elevation, knot insertion, computing derivatives and integrals in matrix form, interpolation, and so on. For example, computing the matrix product in the formula from left to right results in the successive iterations of the de Casteljau algorithm, while computing it from right to left is equivalent to the successive iterations in the Cox– de Boor algorithm. Although the new matrix factorization is essentially non-commutative, in Theorem 1 we formulate and prove an important commutativity relation between this matrix factorization and the operator of differentiation. We use this relation further to propose a new, considerably more concise form of matrix notation for B-splines, with respective efficient computation based on sparse-matrix multiplication.
1
Introduction
Modern B-spline theory has several branches. One of them is based on the blossoming method proposed by Ramshaw [8] and, in a different form by de Casteljau [2]. However, here we shall look first at B-splines in (what we can call) the classical way, introduced in 1972 by at least three authors simultaneously. Namely, Cox [7] established a recursive algorithm for simple knots, while deBoor [4] introduced the algorithm for general knots; in his paper deBoor mentioned that
Research supported in part by the 2006, 2007 and 2008 Annual Research Grants of the priority R&D Group for Mathematical Modeling, Numerical Simulation and Computer Visualization at Narvik University College, Norway.
I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 796–804, 2010. c Springer-Verlag Berlin Heidelberg 2010
A Method for Sparse-Matrix Computation of B-Spline Curves and Surfaces
797
Louis Mansfield had also discovered the recursion. It can also be mentioned, that among others, Knut Moerken and Tom Lyche have been using matrix notation on B-splines. We usually denote B-splines in two different ways, using a short and a more complete notation. These notations are Bk,i (t) = B(t; ti , . . . , ti+k ). When referring to about B-splines we mean normalized B-splines, that is, all B-splines defined on an infinite knot sequence sum up to 1 (also called form the partition of unity), i.e., j
B(t; ti , . . . , ti+k ) = 1,
tj ≤ t < tj+1 .
i=j−k
Compared to the original Curry-Schoenberg B-splines M (t; ti , . . . , ti+k ) (see [5]) we thus have the following relationship B(t; ti , . . . , ti+k ) =
ti+k − ti M (t; ti , . . . , ti+k ). k
Note that for integer non-multiple knots, i.e, (. . . , 1, 2, . . .), B(t; ti , . . . , ti+k ) = M (t; ti , . . . , ti+k ). This leads to the following definition of the B-spline that usually is called the Cox-de Boor recursion formula Definition 1. Given are k increasing real numbers {ti , ti+1 , . . . , ti+k }, where ti+k > ti (they do not otherwise need to be strictly increasing). The B-spline of degree d and order k = d + 1 is defined by the recursion formula B(t; ti , . . . , ti+k ) = wd,i (t) B(t; ti , . . . , ti+k−1 )+(1 − wd,i+1 (t)) B(t; ti+1 , . . . , ti+k ) where the termination of the recursion is 1, B(t; tj , tj+1 ) = 0, and where wd,i (t) =
2
tj ≤ t < tj+1 , otherwise,
t − ti . ti+d − ti
Bezier Curves and the De Casteljau Algorithm
The de Casteljau corner-cutting algorithm is basically recursive convex linear interpolation between two points. This can be expressed in matrix form, computing from right to left, as follow; ⎛ ⎞ ⎛ ⎞ c0 1−t t 0 0 ⎜ ⎟ 1−t t 0 ⎝ 0 1 − t t 0 ⎠ ⎜ c1 ⎟ c(t) = 1 − t t (1) ⎝ c2 ⎠ 0 1−t t 0 0 1−t t c3
798
A. Laks˚ a
where the vector on right-hand side (RHS) is a vector with entries which are points, i.e., elements of an affine space determined by a vector basis and a point origin (usually ∈ R2 or R3 ). The consecutive multiplication by matrices from right to left in the formula is reducing the dimension of the vector by 1 for each computation, i.e, the factor matrices are of dimension d × (d + 1); we denote them by Td (t). It follows that equation (1) can be rewritten as c(t) = T1 (t) T2 (t) T3 (t) c where c = (c0 , c1 , c2 , c3 )T . If we only compute the three matrices from the left we get a vector with the four Bernstein polynomials [1] of degree three, (1 − t)3 , 3(1 − t)2 t, 3(1 − t)t2 , t3 . We also define the matrix notation of the set of Bernstein polynomials of degree n to be T n (t) = T1 (t)T2 (t) · · · Tn (t). It follows that equation (1) can be expressed as c(t) = T 3 (t) c
3
The B-Spline Factor Matrix T (t)
A Bezier curve is actually a B-spline curve where the k = d + 1 first knots are 0, and the k last knots are 1, and without internal knots. The B-spline recursion algorithm (definition 1) is, therefore, analogous to equation 1 computing only the matrices from left-hand side (LHS) to the right. We can, therefore, extend the d×(d+1) matrix Td (t) to a more comprehensive version, i.e. the B-spline type of this matrix. Recall the linear translation and scaling function from definition 1, t−ti , if ti ≤ t < ti+d (2) wd,i (t) = ti+d −ti 0, otherwise. As we can see, (2) describes a piecewise straight line. Recall here that in the case of Bernstein/Bezier type we just have k = d + 1 equal knots at 0 and d + 1 equal knots at 1. It then follows that wd,i (t) = t for all relevant d and i. Definition 2. The B-spline factor matrix Td (t) is a d × (d + 1) band-limited matrix with two nonzero elements on each row. The matrix is, as follows ⎛ ⎞ 1− wd,i−d+1 (t) wd,i−d+1 (t) 0 ... ... 0 ⎜ ⎟ ⎜ ⎟ .. ⎜ ⎟ ⎜ ⎟ 0 1− w (t) w (t) 0 · · · . d,i−d+2 d,i−d+2 ⎜ ⎟ Td (t) = ⎜ .. .. ⎟ .. .. .. .. ⎜ ⎟ . . . . . . ⎜ ⎟ ⎜ ⎟ .. .. ⎝ . (t) w (t) 0 ⎠ . 0 1− w 0
···
···
d,i−1
0
d,i−1
1− wd,i (t) wd,i (t)
A Method for Sparse-Matrix Computation of B-Spline Curves and Surfaces
799
Note here that in the matrix Td (t) the first index is d for all w. The last index in the w is denoted i. This index is determined by the knot vector, and it is the index of the knot fixed by the requirement ti ≤ t < ti+1 .
(3)
We observe in the matrix Td (t) that the last index of w is decreased by 1 from one line to the next line up. For the bottom line it is equal to i, for the line above the bottom line it will then equal to i − 1, and so on. For d = 3 we have the following formula for a B-spline curve (note the indices of the coefficients on the RHS), 1 − w2,i−1 (t) w2,i−1 (t) 0 c(t) = 1 − w1,i (t) w1,i (t) 0 1 − w2,i (t) w2,i (t) ⎛ ⎞ ⎛ ⎞ ci−3 1 − w3,i−2 (t) w3,i−2 (t) 0 0 ⎜ ci−2 ⎟ ⎟ ⎝ 0 1 − w3,i−1 (t) w3,i−1 (t) 0 ⎠⎜ ⎝ ci−1 ⎠ , 0 0 1 − w3,i (t) w3,i (t) ci where the index i is determined by the requirement (3) in Definition 2. A general expression for a third degree B-spline curve is thus equivalent to the Bezier case c(t) = T1 (t)T2 (t)T3 (t) c = T 3 (t) c. T
where c = (ci−3 , ci−2 , ci−1 , ci ) and where the index i is determined by the requirement (3) in Definition 2. To investigate the derivative of Td (t) we first look at the derivative of the linear translation and scaling function (2) which we denote 1 , if ti ≤ t < ti+d δd,i = ti+d −ti . (4) 0, otherwise. δd,i is a piecewise constant, independent of the translation, only reflecting the scaling. In the Bezier case, δd,i = 1 for all relevant i and d. The derivative of the matrix T (t) is then defined by the following. Definition 3. The B-spline derivative matrix T is a d × (d + 1) band-limited matrix with two nonzero constant elements on each row (independent of t). The matrix is, as follows ⎞ ⎛ 0 ... ... 0 −δd,i−d+1 δd,i−d+1 ⎜ .. ⎟ ⎜ ⎟ ⎜ ··· . ⎟ 0 −δd,i−d+2 δd,i−d+2 0 ⎜ ⎟ ⎜ ⎟ . . . . . . ⎜ ⎟ . . . . . . Td = ⎜ . . . . . . ⎟ ⎜ ⎟ ⎜ ⎟ .. .. ⎟ ⎜ . . 0 −δd,i−1 δd,i−1 0 ⎠ ⎝ 0
···
···
0
−δd,i δd,i
The index i in the definition follows the rule (3) of the index in Td (t) from definition 2.
800
3.1
A. Laks˚ a
Commutativity Relations between T (t) Matrices and Their Derivatives
Before we can look at derivatives of B-spline curves we have to look at a special property of the matrix T (t) and its derivative T . Lemma 1. Commutativity relation between differentiation and multiplication of two T (t) matrices. For all d > 0 and for a given knot sequence tj−d , tj−d+1 , . . . , tj+d+1 , and ti ≤ t < ti+1 there holds Td (t) Td+1 = Td Td+1 (t).
(5)
Proof. We start by computing the LHS of 5. Taking the two elements that are different from zero in one line of Td (t) and computing them with the sub-matrix of Td+1 yields something different from zero, −δd+1,j−1 δd+1,j−1 0 1 − wd,j (t) wd,j (t) 0 −δd+1,j δd+1,j which gives −(1 − wd,j (t))δd+1,j−1 (1 − wd,j (t))δd+1,j−1 − wd,j (t)δd+1,j
wd,j (t)δd+1,j . (6)
We now apply the same procedure at the RHS of 5, 1 − wd+1,j−1 (t) wd+1,j−1 (t) 0 −δd,j δd,j 0 1 − wd+1,j (t) wd+1,j (t) which gives −(1 − wd+1,j−1 (t))δd,j
(1 − wd+1,j (t))δd,j − wd+1,j−1 (t)δd,j
wd+1,j (t)δd,j . (7)
We compute the first element of (6), (1 − wd,j (t))δd+1,j−1 =
tj+d − t , (tj+d − tj )(tj+d+1 − tj−1 )
(8)
tj+d − t , (tj+d+1 − tj−1 )(tj+d − tj )
(9)
and the first element of (7), (1 − wd+1,j−1 (t))δd,j =
which shows that the first element of (6) is equal to the first element of (7). We now compute the third element of (6), wd,j (t)δd+1,j =
t − tj , (tj+d − tj )(tj+d+1 − tj )
(10)
t − tj , (tj+d+1 − tj )(tj+d − tj )
(11)
and the third element of (7), wd+1,j (t)δd,j =
A Method for Sparse-Matrix Computation of B-Spline Curves and Surfaces
801
which shows that the third element of (6) is equal to the third element of (7). Finally, we compute the second element of (6), which is minus the first element minus the third element of (6), i.e. (1 − wd,j (t))δd+1,j−1 − wd,j (t)δd+1,j = (8) − (10), and if we reorganize the second element of (7), we get minus the first element minus the third element of (7), i.e. (1 − wd+1,j (t))δd,j − wd+1,j−1 (t)δd,j = (1 − wd+1,j−1 (t))δd,j − wd+1,j (t)δd,j = (9) − (11), which also shows that the second element of (6) is equal to the second element of (7). This shows that the respective elements are equal for the whole line (row) and, thus, every line, the whole expressions are equal, which completes the proof. The main commutativity result now follows. Theorem 1. Commutativity relation between differentiation and multiplication of several consecutive of T (t) matrices. For all d > 0 and for a given knot vector, the value of a matrix product of several consecutive T (t)-matrices, one of which is differentiated, does not depend on the choice of the differentiated matrix, i.e., Td (t) Td+1 (t) · · · Td+j−1 (t) Td+j = Td Td+1 (t) · · · Td+j−1 (t) Td+j (t).
(12)
Proof. This follows from the result of Lemma 1 by induction 3.2
B-Splines in Matrix Notation
First, a concrete example: given a third degree B-spline curve c(t) = T1 (t) T2 (t) T3 (t) c = T 3 (t) c. It follows from Theorem 1 that the derivative is c (t) = (T1 T2 (t) T3 (t) + T1 (t) T2 T3 (t) + T1 (t) T2 (t) T3 ) c = 3 T 2 (t) T3 c. We are now ready to give a general expression of a B-spline function/curve and its derivatives in matrix notation. Definition 4. A B-spline function/curve of degree d is given in matrix notation, as follows c(t) = T d(t) c (13) where the set of B-splines of degree d are the T d (t) = T1 (t) T2 (t) · · · Td (t),
802
A. Laks˚ a
where T1 (t) T2 (t) · · · Td (t) are given according to Definition 2, and where c is the coefficient vector with d + 1 elements c = (ci−d , ci−d+1 , . . . , ci )T , and where the index i is given by the requirement ti ≤ t < ti+1 . The j-th derivative, 0 < j ≤ d, of a B-spline function/curve of degree d is of degree d − j and is expressed, as follows c(j) (t) = where
4
d! T d−j (t) T j c (d − j)!
(14)
T j = Td−j+1 Td−j+2 · · · Td .
Knot Insertion
Knot insertion is one of the most important algorithms on B-splines. It was introduced simultaneously in 1980 by Boehm [3] for single knots and Cohen, Lyche and Riesenfield [6] for general knot insertion. Knot insertion can also be expressed in the context of matrix notation. In the case of insertion of single knots we have the following expression c = Td ( t) c
(15)
where t is the value of the new knot and c is a vector of the d new coefficients/points replacing the d − 1 points that are in the interior of the vector c at the RHS (except for the first and the last coefficients). Here, as usual, the index i is determined by ti ≤ t < ti+1 . We shall look at an example, a second degree B-spline function where we get: ⎞ ⎛ ci−2 ci−1 1 − w2,i−i (t) w2,i−i (t) 0 ⎝ ci−1 ⎠ . = (16) ci 0 1 − w2,i ( t) w2,i ( t) ci In Figure 1 there is, at the LHS, a second degree B-spline curve (in R2 ) and its control polygon (c0 , c1 , c2 , c3 , c4 ). At the RHS the knot vector is illustrated (on R, horizontal). There are three equal knots at the beginning and at the end of the curve. In the middle a new knot, t, is to be inserted. It follows from the position of t that i = 3. The two “linear translation and scaling” functions involved in t) in expression (16), w2,2 and w2,3 , are shown at the RHS of the the matrix T2 ( figure. The result of the knot insertion is that the internal coefficient at the RHS c2 and c3 . of (16), c2 , is replaced by two new coefficients at the LHS of (16), This can clearly be seen on the LHS of Figure 1.
A Method for Sparse-Matrix Computation of B-Spline Curves and Surfaces
803
Fig. 1. At the LHS, there is a 2nd degree B-spline curve and its control polygon (in t is inserted. The R2 ). At the RHS, the knot vector is illustrated (in R). One new knot, result is, as we can see from the LHS, two new control points c2 and c3 instead of the old control point c2 .
5
The B-Spline Matrix
The B-spline matrix Bd (t) is a (d + 1) × (d + 1) matrix defined as follows ⎞ ⎛ T d (t) ⎟ ⎜ d T d−1 (t) T d ⎟ ⎜ ⎜ (d − 1)d T d−2 (t) T d−1 ⎟ Bd (t) = ⎜ ⎟. ⎟ ⎜ .. ⎠ ⎝ . d! T 1
It follows that
6
⎛
⎞ ci−d ⎜ ⎟ ⎜ .. ⎟ ⎜ ⎟ ⎟ ⎜ ⎜ ⎟ = Bd (t) ⎜ . ⎟ ⎝ ⎠ ⎝ ci−1 ⎠ ci c(d) (t) c(t) c (t) .. .
⎞
⎛
Complete Hermite Interpolation with B-Splines
Complete Hermite interpolation with B-splines requires that the number of coefficients n = m(d + 1), where m is the number of interpolation points and d is the polynomial degree of the B-spline curve. It follows that if ti ≤ t < ti+1 then Bd (t) is invertible and we get ⎞ ⎛ ⎞ ⎛ c(t) ci−d ⎜ c (t) ⎟ ⎜ .. ⎟ ⎜ ⎟ ⎜ . ⎟ ⎟ = Bd (t)−1 ⎜ .. ⎟ ⎜ ⎝ . ⎠ ⎝ ci−1 ⎠ ci c(d) (t) It thus follows that for each interpolation point where the position and d consecutive derivatives are given, we get d + 1 coefficients using the equation above.
804
A. Laks˚ a
References 1. Bernstein, S.: D´emonstration du theor`eme de Weierstrass fond´ee sur le calcul des probabiliti´es. Comm. Soc. Math. 13(1-2) (1912) 2. de Casteljau, P.: Shape Mathematics and CAD. Kogan Page, London (1986) 3. Boehm, W.: Inserting new knots into b-spline curves. Computer Aided Geometric Design 12(4), 199–201 (1980) 4. de Boor, C.: On calculation with B-splines. Journal of Approximation Theory 6, 50–62 (1972) 5. Curry, B.H., Schoenberg, I.J.: On Polya frequency functions IV: The fundamental spline functions and their limits. J. d’Analyse Math. 17, 71–107 (1966) 6. Cohen, E., Lyche, T., Riesenfeld, R.: Discrete B-splines and subdivision techniques in computer aided geometric design and computer graphics. Comp. Graphics and Image Process 14(2), 87–111 (1980) 7. Cox, M.G.: Curve fitting with piecewice polynomials. J. Inst. Math. Appl. 8, 36–52 (1972) 8. Ramshaw, L.: Blossoms are polar forms. Computer Aided Geometric Design 6(4), 323–359 (1989)
Parallel MIC(0) Preconditioning for Numerical Upscaling of Anisotropic Linear Elastic Materials Svetozar Margenov and Yavor Vutov Institute for Parallel Processing Bulgarian Academy of Sciences, Acad. G. Bontchev, Bl. 25A, 1113 Sofia, Bulgaria {margenov,yavor}@parallel.bas.bg
Abstract. Numerical homogenization is used for upscaling of the linear elasticity tensor of strongly heterogeneous microstructures. The implemented 3D algorithm is described in terms of six auxiliary elastic problems for the reference volume element (RVE). Rotated trilinear RannacherTurek finite elements are used for discretization of the involved subproblems. A parallel PCG method is implemented for efficient solution of the arising large-scale systems with sparse, symmetric, and positive semidefinite matrices. The implemented preconditioner is based on modified incomplete Cholesky factorization MIC(0). The numerical homogenization scheme is derived on the assumption of periodic microstructure. This implies periodic boundary conditions (PBCs) on the RVE. From algorithmic point of view, an important part of this study concerns the incorporation of PBCs in the parallel MIC(0) solver. Numerical upscaling results are shown. The test problem represents a trabecular bone tissue, taking into account the elastic response of the solid phase. The voxel microstructure of the bone is extracted from a high resolution computer tomography image. The presented results evidently demonstrate that the bone tissues could be substantially anisotropic. The computations are performed on IBM Blue Gene/P machine at the Bulgarian Supercomputing Center.
1
Introduction
The Preconditioned Conjugate Gradient (PCG) method is known to be the best solution tool for large systems of linear equations with symmetric and positive definite sparse matrices [3]. The used preconditioning technique is crucial for the PCG performance. It is also know that the PCG method converges for semidefinite matrices in the orthogonal to the kernel subspace. This paper is organized as follows. The applied numerical homogenization scheme is given in section 2. In section 3 the parallel MIC(0) preconditioner is described. Some results from numerical experiments are presented in the last section. I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 805–812, 2010. c Springer-Verlag Berlin Heidelberg 2010
806
2
S. Margenov and Y. Vutov
Homogenization Scheme
Let Ω be a hexahedral domain representing our RVE and u = (u1 , u2 , u3 ) be the displacements in Ω. The components of the small strain tensor are: 1 ∂ui (x) ∂uj (x) εij (u (x)) = + (1) 2 ∂xj ∂xi We assume that the Hooke’s law holds: σij (x) = cijkl (x)εkl (x)
(2)
Here, the Einstein summation convention is assumed. The tensor c is called stiffness tensor and σ is the stress tensor. We can rewrite (2) in matrix notation ⎡ ⎤ ⎡ ⎤⎡ ⎤ σ11 c1111 c1122 c1133 c1123 c1113 c1112 ε11 ⎢ σ22 ⎥ ⎢ c2211 c2222 c2233 c2223 c2213 c2212 ⎥ ⎢ ε22 ⎥ ⎢ ⎥ ⎢ ⎥⎢ ⎥ ⎢ σ33 ⎥ ⎢ c3311 c3322 c3333 c3323 c3313 c3312 ⎥ ⎢ ε33 ⎥ ⎢ ⎥=⎢ ⎥⎢ ⎥ (3) ⎢ σ23 ⎥ ⎢ c2311 c2322 c2333 c2323 c2313 c2312 ⎥ ⎢ 2ε23 ⎥ . ⎢ ⎥ ⎢ ⎥⎢ ⎥ ⎣ σ13 ⎦ ⎣ c1311 c1322 c1333 c1323 c1313 c1312 ⎦ ⎣ 2ε13 ⎦ σ12 c1211 c1222 c1233 c1223 c1213 c1212 2ε12 The symmetric 6 × 6 matrix C is called stiffness matrix. For an isotropic materials C has only 2 independent degrees of freedom. For orthotropic materials (there are three orthogonal planes of symmetry in this case), the matrix C has 9 independent degrees of freedom — 3 Young’s moduli E1 , E2 , E3 , 3 Poisson’s ratios ν23 , ν31 , ν12 and 3 shear moduli μ23 , μ31 , μ12 .
C Ort
⎡ 1−ν ν 23 32 ν21 + ν31 ν23 ν31 + ν21 ν32 E E Δ E2 E3 Δ ⎢ E2 E3 Δ ⎢ ν12 + ν13 ν32 1 −2ν313 ν13 ν32 + ν31 ν12 ⎢ ⎢ E3 E1 Δ E E Δ E E 3 1 3 1Δ ⎢ ν13 + ν12 ν23 ν23 + ν13 ν21 1 − ν12 ν21 =⎢ ⎢ ⎢ E1 E2 Δ E1 E2 Δ E1 E2 Δ ⎢ μ23 ⎢ ⎣ μ31
⎤ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥, ⎥ ⎥ ⎥ ⎥ ⎦
(4)
μ12
1 − ν12 ν21 − ν13 ν31 − ν23 ν32 − 2ν12 ν23 ν31 ν12 ν21 ν23 ν32 where Δ = , = , = , E1 E2 E3 E1 E2 E2 E3 ν13 ν31 = . E3 E1 We follow the numerical upscaling method from [9], see also [5]. The homogenization scheme requires to find the functions ξ kl = (ξ1kl , ξ2kl , ξ3kl ), k, l = 1, 2, 3, satisfying the following problem in a week formulation:
∂ξpkl ∂φi ∂φi dΩ = cijkl (x) dΩ, ∀φ ∈ H1P (Ω), (5) cijpq (x) ∂xq ∂xj ∂xj Ω Ω
Numerical Upscaling of Anisotropic Linear Elastic Materials
807
where φ = {φi }3i=1 and H1P (Ω) = {φ ∈ H1 : φi are Ω − periodic}. After computing the characteristic displacements ξ kl , we find the homogenized elasticity tensor cH using the explicit formula:
∂ξpkl 1 H cijkl = cijkl (x) − cijpq (x) dΩ. (6) |Ω| Ω ∂xq Due to the symmetry of the stiffness tensor c, the following relations ξ kl = ξ lk hold. Therefore, it is enough to solve six problems (5) to get the homogenized elasticity tensor. Rotated trilinear (Rannacher-Turek) finite elements [11] are used for numerical solution of (5). This choice is motivated by the additional stability of the nonconforming FEM discretization in the case of strongly heterogeneous materials [2]. The construction of robust non-conforming FEM methods are generally based on application of mixed formulation leading to a saddle-point system. By the choice of non continuous finite elements for the dual (pressure) variable, it can be eliminated at the (macro)element level, and we get a symmetric positive (semi-)definite FEM system in primal (displacements) variables. We use this approach which is referred as reduced and selective integration (RSI) [10].
3
Parallel MIC(0) Preconditioning
Our preconditioning algorithm is based on a preexisting parallel MIC(0) elasticity solver [13], based on a parallel MIC(0) solver for symmetric and positive definite scalar elliptic problems [1]. The preconditioner uses the isotropic variant of the displacement decomposition (DD) method (see, e.g., [6]). We write the DD auxiliary matrix in the form ⎡ ⎤ A CDD = ⎣ A ⎦ (7) A where A is the stiffness matrix corresponding to the bilinear form
3
∂u ∂v a(u , v ) = E(x) ∂x i ∂xi Ω i=1 h
h
dx,
(8)
where u and v are Ω-periodic functions. The DD splitting is motivated by the second Korn’s inequality, which holds for the RSI FEM discretization under consideration. A brief introduction to the modified incomplete factorization [8] is given below. Let us rewrite the real N × N matrix A = (aij ) in the form A = D − L − LT
808
S. Margenov and Y. Vutov
where D is the diagonal and (−L) is the strictly lower triangular part of A. Then we consider the approximate factorization of A which has the form: CMIC(0) = (X − L)X −1 (X − L)T
(9)
with X = diag(x1 , · · · , xN ) being the diagonal matrix determined by the condition of equal rowsums. We are interested in the case when X > 0 and thus CMIC(0) is positive definite for the purpose of preconditioning. If this holds, we speak about stable MIC(0) factorization. Concerning the stability of MIC(0), the following theorem holds. Theorem 1. Let A = (aij ) be a symmetric real N × N matrix and let A = D − L − LT be the splitting of A. Let us assume that L ≥ 0, Ae ≥ 0, Ae + LT e > 0,
(10) e = (1, · · · , 1)T ∈ RN ,
i.e. that A is a weakly diagonally dominant with nonpositive offdiagonal entries and that A + LT = D − L is strictly diagonally dominant. Then the relation xi = aii −
i−1 N
aik k=1
xk
akj > 0
j=k+1
holds and the diagonal matrix X = diag(x1 , · · · , xN ) defines stable MIC(0) factorization of A. The perturbed version of MIC(0) algorithm is used in our study. This means that ˜ The diagonal the incomplete factorization is applied to the matrix A˜ = A + D. ˜ ˜ ˜ ˜ perturbation D = D(ξ) = diag(d1 , . . . dN ) is defined as follows: ξaii if aii ≥ 2wi d˜i = 1/2 ξ aii if aii < 2wi where 0 < ξ < 1 is a properly chosen parameter, and wi = j>i −aij . In particular, this allows us to satisfy the stability conditions (10) in the case of PBCs. The idea of our parallel algorithm is to apply the MIC(0) factorization on an auxiliary matrix B, which approximates A. The matrix B has a special block structure, which allows a scalable parallel implementation. Following the stan dard FEM assembling procedure we write A in the form A = e∈ωh LTe Ae Le , where Ae is the element stiffness matrix, Le stands for the restriction mapping of the global vector of unknowns to the local one, corresponding to the current element e. Let us consider the following approximation Be of Ae : ⎡ ⎤ ⎡ ⎤ a11 a12 a13 a14 a15 a16 b11 a12 a13 a14 a15 a16 ⎢ a21 a22 a23 a24 a25 a26 ⎥ ⎢ a21 b22 a23 a24 a25 a26 ⎥ ⎢ ⎥ ⎢ ⎥ ⎢ a31 a32 a33 a34 a35 a36 ⎥ ⎢ a31 a32 b33 0 0 0 ⎥ ⎢ ⎥ ⎢ ⎥ Ae = ⎢ ⎥ , Be =⎢ a41 a42 0 b44 0 0 ⎥ . ⎢ a41 a42 a43 a44 a45 a46 ⎥ ⎢ ⎥ ⎣ a51 a52 a53 a54 a55 a56 ⎦ ⎣ a51 a52 0 0 b55 0 ⎦ a61 a62 a63 a64 a65 a66 a61 a62 0 0 0 b66
Numerical Upscaling of Anisotropic Linear Elastic Materials
809
Fig. 1. Structure of the solid phase: 128 × 128 × 128 voxels
The local numbering follows the pairs of the opposite nodes of the reference element. The diagonal entries of Be are modified to hold the rowsum criteria. the locally defined matrices Be we get the global matrix B = Assembling T −1 A) ≤ 3 holds uniformly e∈ωh Le Be Le . The condition number estimate κ(B with respect to mesh parameter and possible coefficient jumps (see for the related analysis in [1]). The modified matrix B has diagonal blocks, corresponding to the (x, y) cross sections. This allows to perform in parallel the solution of linear systems with matrix (9) [1]. It is important no note that the PBCs do not change the diagonal blocks of the stiffness matrix A as well as of the auxiliary matrix B. However, there are changes in the structure of the offdiagonal blocks, which require some principal modifications in the parallel code. Finally, the implemented parallel preconditioner for the considered linear elasticity nonconforming FEM systems has the form: ⎤ ⎡ CMIC(0) (B) ⎦. CMIC(0) (B) CDD MIC(0) = ⎣ CMIC(0) (B)
4
Numerical Experiments
The analyzed test specimen is a part of trabecular bone tissue extracted from a high resolution computer tomography image [4]. The trabecular bone has a strongly expressed heterogeneous microstructure composed of solid and fluid phases. To get a periodic RVE, the specimen is mirrored three times, see Fig. 1. In this article, our goal is to obtain the homogenized elasticity tensor of the trabecular bone tissue, taking into account the elastic response of the solid phase only. To this purpose, numerical experiments with exponentially decreasing Young modulus for the voxels corresponding to the fluid phase are performed. In other words, the empty (fluid) voxels are interpreted as fictitious domain. Homogenized properties of different RVEs with varying size of 32 × 32 × 32, 64 × 64 × 64
810
S. Margenov and Y. Vutov
and 128×128×128 voxels are studied. The Young modulus and the Poisson ratio of the solid phase are taken from [7] as follows: E s = 14.7GPa and ν s = 0.325. We set also ν f = ν s which practically doesn’t influence the numerical upscaling results. In what follows, the fictitious domain Young modulus E f is given in Pascals (see, e.g., the values in the parentheses bellow). ⎤ ⎡ 4.86×108 7.49×107 6.74×107 ⎥ ⎢ 7.49×107 3.03×108 9.45×107 ⎥ ⎢ 7 7 8 ⎥ ⎢ 6.74 ×10 9.45 ×10 7.43 ×10 H 7 ⎥ ⎢ C [1.47×10 ] = ⎢ ⎥ 8.45 ×107 ⎥ ⎢ ⎦ ⎣ 6.22×107 3.37×107 ⎤
⎡
4.35×108 5.65×107 5.03×107 ⎢ 5.65×107 2.38×108 7.69×107 ⎢ ⎢ 5.03×107 7.69×107 7.06×108 H 6 C [1.47×10 ] = ⎢ ⎢ 6.66×107 ⎢ ⎣ 4.89×107
⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦ 1.40×107
⎡
⎤
4.29×108 5.46×107 4.86×107 ⎢ 5.46×107 2.30×108 7.49×107 ⎢ ⎢ 4.86×107 7.49×107 7.01×108 H 5 C [1.47×10 ] = ⎢ ⎢ 6.44×107 ⎢ ⎣ 4.74×107
⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦ 1.18×107
Here we have vanished the entries of C H which tend to zero with the increase of the PCG accuracy. The structure of C H corresponds to the case of orthotropic materials which is due to the enforced triple mirroring procedure. Following (4), the Young moduli Ei in each of the coordinate directions and the Poisson ratios νij = − εj /εi are computed explicitly by the formulas Ei = 1/sii
νij = −Ei sji
where sij stand for the elements of the matrix S = (C H )−1 , see [12]. Tables 1, 2 and 3 contain the computed homogenized Young moduli and Poisson ratios varying the fictitious domain Young modulus E f for the considered three different specimens. A stable behaviour of the implemented numerical homogenization scheme is observed in all cases. From practical point of view, a good accuracy of the computed homogenized Young moduli and Poisson ratios is achieved if the fictitious domain modulus E f = δE s for δ ∈ 10−3 , 10−4 . The next observation is that in all three cases (RVEs composed of 323 , 643 and 1283 voxels) the orthotropy ratio is more than 3. This evidently confirms that the hypothesis that the trabecular bone structure could be interpreted (approximated) as isotropic is not realistic. The last table illustrates the convergence rate of the implemented DD MIC(0) preconditioner. The available theoretical
Numerical Upscaling of Anisotropic Linear Elastic Materials
811
Table 1. Homogenized linear elasticity coefficients: n=32 Ef 1.47×109 1.47×108 1.47×107 1.47×106 1.47×105
E1 4.52×109 2.03×109 1.67×109 1.63×109 1.62×109
E2 6.23×109 4.72×109 4.48×109 4.46×109 4.45×109
E3 6.24×109 4.67×109 4.45×109 4.42×109 4.42×109
ν12 0.208 0.095 0.074 0.072 0.071
ν23 0.300 0.271 0.264 0.263 0.262
ν31 0.286 0.229 0.212 0.210 0.210
μ23 2.29×109 1.73×109 1.66×109 1.65×109 1.65×109
μ31 1.39×109 4.81×108 3.56×108 3.42×108 3.40×108
μ12 1.35×109 3.80×108 2.42×108 2.26×108 2.24×108
Table 2. Homogenized linear elasticity coefficients: n=64 Ef 1.47×109 1.47×108 1.47×107 1.47×106 1.47×105
E1 2.86×109 8.73×108 5.69×108 5.33×108 5.29×108
E2 3.11×109 1.12×109 8.02×108 7.62×108 7.58×108
E3 3.55×109 1.94×109 1.73×109 1.71×109 1.71×109
ν12 0.288 0.191 0.127 0.117 0.116
ν23 0.270 0.164 0.124 0.119 0.118
ν31 0.281 0.185 0.117 0.102 0.101
μ23 1.19×109 4.94×108 3.90×108 3.77×108 3.76×108
μ31 9.07×108 1.62×108 5.22×107 3.88×107 3.74×107
μ12 9.50×108 1.71×108 5.22×107 3.77×107 3.62×107
Table 3. Homogenized linear elasticity coefficients: n=128 Ef 1.47×109 1.47×108 1.47×107 1.47×106 1.47×105
E1 2.66×109 7.90×108 4.65×108 4.20×108 4.15×108
E2 2.47×109 5.97×108 2.81×108 2.24×108 2.16×108
E3 2.67×109 9.51×108 7.10×108 6.78×108 6.75×108
ν12 0.315 0.282 0.228 0.222 0.222
ν23 0.284 0.180 0.114 0.100 0.098
ν31 0.278 0.171 0.094 0.076 0.073
μ23 8.76×108 1.93×108 8.46×107 6.66×107 6.44×107
μ31 8.78×108 1.68×108 6.22×107 4.89×107 4.75×107
μ12 8.87×108 1.64×108 3.37×107 1.40×107 1.18×107
Table 4. Number of iterations n 32 64 128
E f [Pa] 1.47×109 1.47×108 1.47×107 1.47×106 1.47×105 N 2 359 296 222 363 575 711 734 18 874 368 343 577 848 1 236 1 436 150 994 944 481 840 1 505 2 311 2 482
estimates concern some more model problems for homogeneous materials. In such cases, the number of iterations is nit = O(n1/2 ) = O(N 1/6 ). Here the number of iterations has very similar behaviour for coefficient jumps of the range {10 − 102 }. The good news here is that even for very large jumps of up to 105 , the convergence is only slightly deteriorating. Based on the reported results we can conclude that the developed numerical homogenization algorithmis and software tools provide a reliable tool for computer simulation of strongly heterogeneous anisotropic voxel microstructure. As a next important step we plan to incorporate in the upscaling scheme the contribution of the fluid phase of the bone tissues. The implementation of additionally stabilized solvers for PBC FEM problems as well as the incorporation of scalable AMG preconditioners in the composed algorithm are also under development.
812
S. Margenov and Y. Vutov
Acknowledgments The authors gratefully acknowledge the partial support provided via Bulgarian NSF Grant DO 02-147/08.
References 1. Arbenz, P., Margenov, S., Vutov, Y.: Parallel MIC(0) preconditioning of 3D elliptic problems discretized by Rannacher-Turek finite elements. Computers and Mathematics with Applications 55(10), 2197–2211 (2008) 2. Arnold, D.N., Brezzi, F.: Mixed and nonconforming finite element methods: Implementation, postprocessing and error estimates. RAIRO, Model. Math. Anal. Numer. 19, 7–32 (1985) 3. Axelsson, O.: Iterative Solution Methods. Cambridge University Press, Cambridge (1994) 4. Beller, G., Burkhart, M., Felsenberg, D., Gowin, W., Hege, H.-C., Koller, B., Prohaska, S., Saparin, P.I., Thomsen, J.S.: Vertebral Body Data Set ESA29-99-L3, http://bone3d.zib.de/data/2005/ESA29-99-L3/ 5. Bensoussan, A.: Asymptotic Analysis for Periodic Structures. Elsevier, Amsterdam (1978) 6. Blaheta, R.: Displacement decomposition — incomplete factorization preconditioning techniques for linear elasticity problems. Num. Lin. Algebr. Appl. 1, 107–126 (1994) 7. Cowin, S.: Bone poroelasticity. J. Biomechanics 32, 217–238 (1999) 8. Gustafsson, I.: Modified incomplete Cholesky (MIC) factorization. In: Evans, D.J. (ed.) Preconditioning Methods; Theory and Applications, pp. 265–293. Gordon and Breach, New York (1983) 9. Hoppe, R.H.W., Petrova, S.I.: Optimal shape design in biomimetics based on homogenization and adaptivity. Math. Comput. Simul. 65(3), 257–272 (2004) 10. Malkus, D., Hughes, T.: Mixed finite element methods. Reduced and selective integration techniques: an uniform concepts. Comp. Meth. Appl. Mech. Eng. 15, 63–81 (1978) 11. Rannacher, R., Turek, S.: Simple nonconforming quadrilateral Stokes Element. Num. Meth. Part. Diff. Equs. 8(2), 97–112 (1992) 12. Saad, M.H.: Elasticity — Theory, Applications and Numerics. Elsevier, Amsterdam (2005) 13. Vutov, Y.: Parallel DD-MIC(0) Preconditioning of Nonconforming Rotated Trilinear FEM Elasticity Systems. In: Lirkov, I., Margenov, S., Wa´sniewski, J. (eds.) LSSC 2007. LNCS, vol. 4818, pp. 745–752. Springer, Heidelberg (2008)
The Bpmpd Interior Point Solver for Convex Quadratically Constrained Quadratic Programming Problems Csaba M´esz´aros Laboratory of Operations Research and Decision Systems, Hungarian Academy of Sciences
[email protected]
Abstract. The paper describes the convex quadratically constrained quadratic solver Bpmpd which is based on the infeasible–primal–dual algorithm. The discussion includes subjects related to the implemented algorithm and numerical algebra employed. We outline the implementation with emhasis to sparsity and stability issues. Computational results are given on a demonstrative set of convex quadratically constrained quadratic problems.
1
Introduction
During the past two decades perhaps the most dramatic progress in computational optimization has been achieved in the field of implementation of interior point methods. The recent implementation techniques follow basically the major contributions of the developers of commercial and public domain interior–point packages [1]. The success of interior point methods (IPM) in the linear programming (LP) practice resulted in an increased interest in the application of the IPM technology in nonlinear optimization. It is now widely accepted that on large–scale problems interior point methods can significantly outperform the algorithms based on the generalizations of the simplex approach (see the benchmark results in [11]). The paper describes our IPM solver, Bpmpd, which is an efficient implementation of the infeasible–primal–dual algorithm for solving linear (LP), convex quadratic (QP) and convex quadratically constrained quadratic (QCQP) programming problems. The paper is organized as follows: Section 2 presents the infeasible–primal– dual method on which our implementation is based. In section 3 we outline some of the most important implementation techniques used in Bpmpd ensuring efficiency and robustness. Numerical results are shown in Section 4 on large–scale QCQP test problems. I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 813–820, 2010. c Springer-Verlag Berlin Heidelberg 2010
814
2
C. M´esz´ aros
The Infeasible Primal–Dual Method
Without loss of generality, the convex QCQP problem is usually assumed to be in the following form: 1 T x Qi x 2
min 12 xT Q0 x + cT0 x, + cTi x ≥ bi for i = 1, . . . , m, x ≥ 0,
(1)
where x, ci ∈ Rn , bi ∈ R, Qi ∈ Rn×n , furthermore, Q0 and −Qi for i = 1 . . . m are symmetric positive semidefinite. A dual problem in the Wolfe-sense can be associated to (1) as follows: T max bT y − 12 xT Q0 x − m i=1 yi x Qi x , m C T y ≤ c0 + Q0 x − i=1 yi Qi x, (2) y ≥ 0, T
where y ∈ Rm and C = [c1 , . . . , cm ] . Primal–dual interior point algorithms are iterative approaches which seek to satisfy the Karush–Kuhn–Tucker type optimality conditions for the primal–dual problem (1-2). Megiddo, Mizuno and Kojima [4] and others contributed to the theoretical background of this type of algorithms, while the first remarkable implementation results were shown by Lustig, Marsten, and Shanno [5]. The first step to obtain the infeasible–primal–dual algorithm is to replace the nonnegativity constraints with logarithmic penalty terms for strictly positive approximate solutions x > 0 and s > 0, where s ∈ Rm . This results in the following logarithmic barrier problem: n m min 12 xT Q0 x + c0 T x − μ i=1 ln xi − μ i=1 ln si , (3) 1 T T 2 x Qi x + ci x − si = bi for i = 1, . . . , m, where μ > 0 barrier parameter. From the Lagrangian of the above problem, L(x, y, s), we can derive a set of nonlinear equations that describes the first–order optimality conditions: ∇x L(x, y, s) = c0 + Q0 x − μX −1 e − C T y −
m
yi Qi x = 0,
(4)
i=1
1 T x Qi x + cTi x − si − bi = 0, for i = 1, . . . , m, 2 ∇s L(x, y, s) = −μS −1 e + y = 0,
∇y L(x, y, s) =
(5) (6)
T
where X = diag(x1 , . . . xn ), S = diag(s1 , . . . sm ), e = (1, . . . , 1) . Furthermore, we introduce z ∈ Rm as: z = μX −1 e and rewrite (7) and (6) as: z = μX −1 e =⇒ Xz = μe, −μS −1 e + y = 0 =⇒ Sy = μe.
(7)
The Bpmpd Interior Point Solver
815
The main idea of the primal-dual log barrier methods is that the resulted nonlinear system of equations is solved by the Newton method, while μ is decreased to zero. For describing the algorithm, we use the notations bellow: H(y) = Q0 − ai (x) = ci +
m
yi Qi ,
i=1 m
xT Qi , for i = 1, . . . , m,
i=1 T
A(x) = (a1 (x) , . . . , am (x)T )T , 1 ξi = bi − xT Qi x − cTi x + si for i = 1, . . . , m, 2 m ζ = c0 + Q 0 x − z − C T y − yi Qi x. i=1
It is easy to derive that in every iteration, the Newton method results in the following system of equations: ⎡ ⎤⎡ ⎤ ⎡ ⎤ A(x, y) −I 0 0 Δx ξ ⎢ −H(y) 0 A(x, y)T I ⎥ ⎢ Δs ⎥ ⎢ ⎥ ζ ⎢ ⎥⎢ ⎥=⎢ ⎥, (8) ⎣Z 0 0 X ⎦ ⎣ Δy ⎦ ⎣ μe − XZe ⎦ 0 Y S 0 Δz μe − SY e which can be reduced by pivoting on X and Y to
−ZX −1 − H(y) A(x, y)T Δx ζ − μX −1 e + Ze = . A(x, y) Y −1 S Δy ξ + μY −1 e − Se
(9)
Let M denote the matrix of the augmented system (9) and D and F denote: D = ZX −1 + H(y), F = Y −1 S. It is easy to see that the assumptions for (1) provide the positive definiteness of D and F which make the matrix of the augmented system quasidefinite [13]. In every iteration an implementation of the primal–dual interior point method defines the matrix and right–hand side of (1) and solves the new linear equation system. The result is used for deriving the solution of (8) which will be applied as search direction from the current iterate. The implementation of a usual interior point solver consists of the following parts: – Presolve and scaling, sparse matrix ordering. – Choose starting point (x0 , s0 , y 0 , z 0 ) > 0. – Iteration cycle.
816
C. M´esz´ aros
• • • •
Check stopping criteria. Define M and μ. Solve the Newton system. Compute steplengths and update primal and dual solutions.
– Postsolve and descaling. The development of Bpmpd started in 1993 and in 1996 it was released in the Netlib in source code form as an interior-point implementation to solve LP problems. Since then, the functionality of the solver has been extended [6] and the efficiency [7] and robustness [8] were increased significantly.
3
Implementation
The efficient and robust implementation of a numerical algorithm is a highly complex task. The recent version of Bpmpd consists of approximately 25 thousand C source lines. The numeric computations are the steps most involved in any implementation. Making them efficient is of extraordinary importance. Even the symbolical investigations are closely related to the numerical procedures, because they are used for reducing the number of necessary arithmetical operations as possible. Therefore, during the development of Bpmpd a significant effort was concentrated on making the “numerical algebra kernel” as robust and fast as possible. Nowadays, all general–purpose large-scale interior point implementations use direct decomposition to solve the reduced Newton system (9). This can be done by the symmetric factorization of a quasidefinite system as
−D AT (10) P T = LΛLT , P A F where L ∈ R(m+n)×(m+n) lower triangular, Λ ∈ R(m+n)×(m+n) diagonal and P ∈ R(m+n)×(m+n) permutation. Direct factorization approaches have some disadvantages, i.e., L may fill up as the models getting larger, which increases memory and floating point computation requirements. However, due to the quasidefinite property the P permutation can be determined in a pure symbolical step, without numerical considerations. Once the permutation has been determined, the factorization pattern is fixed for all interior point iterations [13]. Then the numerical factorization can take advantage of the fixed factorization pattern by exploiting the computational hardware. 3.1
Improving the Numerical Properties of Factorizations
As shown in [9] one drawback of direct factorization approaches is that in some situations the sufficient numerical accuracy cannot be achieved. One possible solution, suggested in [8], overcomes the numerical difficulties by applying iterative refinement and regularization
(0)
(k+1)
(k) Δx Δx Δx ∗ −1 = I − (M ) M + , (11) Δy Δy Δy
The Bpmpd Interior Point Solver
817
where M ∗ ∈ R(m+n)×(m+n) and differs from M in the diagonal values only:
−D∗ AT M∗ = A F where where D∗ − D is diagonal. One can see that the requirement −1 λ I − (M ∗ ) M < 1 for the convergence of the iterative refinement process (11) is equivalent to the condition. Dii max ∗ < 2, (12) Dii which describes the necessary conditions for the regularization matrix D∗ . This kind of regularized iterative refinement process is implemented in Bpmpd as described in [8]. 3.2
Implementation of the Numerical Kernels
Virtually, all efficient computer hardware have similar features: – Hierarchical memory. – Vector computing capabilities. – Parallel processing capabilities. The exploitation of hardware features like these is difficult in sparse computations, therefore efficient implementations try to identify parts of the sparse factorization which can be done in dense mode. The ideas in [7] were based on the efficient exploitation of supernodal partitions [2] of the factors. A supernode is a set of continuous columns of L, which share the same sparsity structure. In order to speed up numerical updates by columns that do not form supernodes as described above, we have generalized the supernodes in the following manner: a set of columns is considered as a generalized supernode if the corresponding diagonal block in L is a dense lower triangular matrix. Since each column of a supernode like this will contain the sparsity pattern of the preceding columns as a subset, one can reorder the row indices and form a submatrix with dense blocks inside. The following example shows a generalized supernode in the original and permuted forms. ⎡ ⎤ ⎡ ⎤ ∗ 1 1 ∗ ⎥ ⎢∗ ∗ ⎥ 2 2⎢ ∗ ∗ ⎢ ⎥ ⎢ ⎥ ⎢ ⎥ ⎥ 3⎢ ⎢ ∗ ∗ ∗ ⎥ Permute rows ⎢ ∗ ∗ ∗ ⎥ 3 ⎢ ⎥ ⎢ ⎥ 4 ⎢ ∗ ∗⎥ =⇒ ⎢∗ ∗ ∗⎥ 5 ⎣ ⎣ ⎦ ∗ ∗ ∗⎦ 6 5 ∗∗∗ ∗∗ 4 6 ∗∗∗ Whereas the permuted index ordering will make the sparse update process a bit more costly, the computation of the update with a generalized supernode can be easily organized by dense block updates where the number of necessary
818
C. M´esz´ aros
dense block operations is at most the number of columns in the supernode. The generalization of the supernodes in the above manner has an additional benefit, namely, this makes it possible to store the sparsity pattern of the columns of the generalized supernodes in a compressed form. This can result in significant savings in the required storage space and make data localization better, which is also beneficial for the hardware. By using the supernode decompositions above, a significant part of the floating point operations can be performed in dense mode which can be implemented efficiently by using loop unrolling and blocking techniques [12,10]. The computational kernel process of our implementation can be described as follows: after removing the empty rows, the supernodes and generalized supernodes are decomposed into blocks: ⎤ ⎡ B0 ⎢ B1 ⎥ ⎢ ⎥ ⎣...⎦, Bk where B0 is dense lower triangular matrix and, Bi>0 is rectangular. Bi>0 is either a dense matrix or it has a dense block structure. A natural idea is to take the size of the cache memory into consideration when determining the supernode partitioning, i.e., splitting large supernodes into smaller ones such that each Bi fits entirely to the cache memory. This approach balances well the benefit of the faster cache memory and the overhead presented by the multiple updates of split supernode partitions. The Bi blocks are stored row–wise, allowing the use of inner product computations during the factorization process, which is efficient for vector processing. In our push factorization approach the supernodes are processed one by one. The inner cycle of this factorization process computes the numerical values of a supernode and its numerical contribution on the remaining matrix as follows: 1. Compute the symmetric decomposition B0 = F0 Λ0 F0t . (parallel) for i = 1, . . . , k. 2. Update Bi ←− Bi F0−T Λ−1 0 for i = 1, i ≤ k 3. Compute the update matrices Uj = Bi Λ0 BjT (parallel) for j = 1, . . . , i−1. 4. Compute the lower triangular update matrix Ui = Bi Λ0 BiT . 5. Update the remaining matrix by Uj (parallel) for j = 1, . . . , i. end for In step 1 we store F0 in the storage place of B0 , thus the algorithm exploits the cache memory by re-using the data of B0 in steps 1 and 2 and the data of Bi in steps 3 and 4. This is a key point in parallel computations, since the access to the main memory is often a limiting factor. As noted in [7], the above process exploits vector processing very efficiently. 3.3
Sparse Matrix Ordering
The practical success in large–scale optimization highly depends on the exploitation of sparsity structures. In case of interior point methods, the pivot order of
The Bpmpd Interior Point Solver
819
the symmetric factorization has a very strong influence on the number of fill–in and, therefore, on the storage requirement and the speed of factorizations. In the symbolic phase we try to find a good row permutation P for (10) that minimizes the number of nonzeros in L. Since finding the “optimal” permutation is an NP complete problem [14], heuristics are used in practice. Bpmpd applies a variant of the multilevel bisection algorithm [3] combined with local heuristics [2].
4
Numerical Results and Conclusions
In this section we shall compare the efficiency of Bpmpd with Cplex version 11. Our tests were performed on the convex QCQP test problems published by Mittelmann at http://plato.asu.edu/ftp/ampl_files/qpdata_ampl/. The experiments were performed on a desktop machine equipped with one 3 Ghz Core 2 Duo Intel processor. From the test problems we omitted the small ones which were solved by both solvers in less than 2.5 seconds. Table 1 contains the statistics of the test problems and the solution times in seconds. Table 1. Numerical results on QCQP problems Problem Problem size Solution time name m n nz(A) CPLEX BPMPD boyd1 20 93262 652258 19.73 2.3 boyd2 186533 93264 423788 29.75 1.12 cont-100 9803 10198 58808 3.02 0.73 cont-200 39603 40398 237608 14.88 4.21 cont-201 40200 40398 209402 12.42 5.19 cont-300 90300 90598 471602 20.97 16.86 stadat1 4001 2002 11999 5.58 0.06 aug2dc 10002 20201 60202 2.56 0.28 cvxqp1-l 5002 10001 15000 424.98 5.78 cvxqp2-l 2502 10001 7501 210.58 3.78 cvxqp3-l 7502 10001 22499 638.61 7.04 ksip 1003 21 17393 4.67 0.03 liswet11 10002 10003 40004 8.77 0.36 liswet12 10002 10003 40004 8.02 1.33 liswet1 10002 10003 40004 10.64 0.36 liswet7 10002 10003 40004 12.7 0.26 liswet8 10002 10003 40004 15.2 0.78 liswet9 10002 10003 40004 11.17 0.95 stcqp2 2054 4098 413340 3.3 0.06 yao 2002 2003 8004 3.45 0.05
Our numerical results indicate that nonseparable QCQPs can be solved efficiently in the standard form, and the performance of our implementation compares favorably to that of Cplex. Furthermore, our experiences show that interior point methods with adequate numerical and sparsity techniques are robust and efficient tools in large–scale nonlinear optimization.
820
C. M´esz´ aros
Acknowledgments This work was supported in part by Hungarian Research Fund OTKA K-77420 and K-60480.
References 1. Andersen, E., Gondzio, J., M´esz´ aros, C., Xu, X.: Implementation of interior point methods for large scale linear programs. In: Terlaky, T. (ed.) Interior Point Methods of Mathematical Programming, pp. 189–252. Kluwer Academic Publishers, Dordrecht (1996) 2. George, A., Liu, J.: Computer Solution of Large Sparse Positive Definite Systems. Prentice-Hall, Englewood Cliffs (1981) 3. Karypis, G., Kumar, V.: A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM Journal on Scientific Computing 20(1), 359–392 (1999) 4. Kojima, M., Megiddo, N., Mizuno, S.: Theoretical convergence of large-step primaldual interor-point algorithms for linear programming. Mathematical Programming 59(1) (1993) 5. Lustig, I., Marsten, R., Shanno, D.: Computational experience with a primal-dual interior point method for linear programming. Linear Algebra Appl. 20, 191–222 (1991) 6. M´esz´ aros, C.: The BPMPD interior-point solver for convex quadratic problems. Optimization Methods and Software 11-12, 431–449 (1999) 7. M´esz´ aros, C.: On the performance of the Cholesky factorization in interior point methods on Pentium 4 processors. CEJOR 13(4), 289–298 (2005) 8. M´esz´ aros, C.: On numerical issues of interior point methods. SIAM J. on Matrix Analysis 30(1), 223–235 (2008) 9. M´esz´ aros, C.: On the Cholesky factorization in interior point methods. Computers & Mathematics with Applications 50, 1157–1166 (2005) 10. M´esz´ aros, C.: Fast Cholesky factorization for interior point methods of linear programming. Computers & Mathematics with Applications 31(4/5), 49–54 (1996) 11. Mittelmann, H., Spellucci, P.: Decision tree for optimization software. World Wide Web (1998), http://plato.la.asu.edu/guide.html 12. Rothberg, E., Gupta, A.: Efficient Sparse Matrix Factorization on High-Performance Workstations-Exploiting the Memory Hierarchy. ACM Trans. Mathematical Software 17(3), 313–334 (1991) 13. Vanderbei, R.: Symmetric quasi-definite matrices. SIAM J. on Optimization 5(1), 100–113 (1995) 14. Yannakakis, M.: Computing the minimum fill–in is NP–complete. SIAM J. Algebraic Discrete Methods 2, 77–79 (1981)
On Shape Optimization of Acoustically Driven Microfluidic Biochips Svetozara I. Petrova1,2 1
2
Department of Mathematics, University of Applied Sciences, Am Stadtholz 24, 33609 Bielefeld, Germany Institute for Parallel Processing, Bulgarian Academy of Sciences, Acad. G. Bontchev, Block 25A, 1113 Sofia, Bulgaria
Abstract. The paper deals with the shape optimization of acoustically driven microfluidic biochips. This is a new type of nanotechnological devices with a wide range of applications in physics, pharmacology, clinical diagnostics, and medicine. The complex system of channels inside the chip can be optimized with regard to different objectives and various optimal criteria. To simplify the model, we solve Partial Differential Equations (PDEs) constraint optimization problem based on a stationary Stokes-like system as equality constraints and bilateral inequality constraints with upper and lower bounds for the design parameters coming from geometrical and technological restrictions. The shape of the domain is designed by control parameters determining certain parts of the boundary. Preliminary fixed segments of the lateral walls of the channel are approximated by B´ezier-curves with design parameters taken as the B´ezier control points. For the solution of the problem we use a pathfollowing interior-point method with inexact Newton solvers. Keywords: Surface acoustic waves, biochips, shape optimization, pathfollowing interior-point method, Newton solver. Mathematics subject classification: 65K10, 74F10, 90C30.
1
Introduction
In the last years, the use of acoustically driven microfluidic biochips as a novel type of nanotechnological devices has attracted a lot of attention (cf., e.g. [2,4,5,7,9]). These miniaturized chips with a surface approximately one quadrat centimeter or less contain a complex system of channels with appropriately located interdigital transducers, pumps, reservoirs, capillary barriers, mixers, valves, etc. and serve as biochemical labs, also called labs-on-a-chip. In particular, they can transport droplets or fluid volumes of nanoliters and perform biochemical analysis of tiny samples. The small size of the devices in a scale of micrometers is of a significant importance for reducing the costs in many practical applications. I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 821–828, 2010. c Springer-Verlag Berlin Heidelberg 2010
822
S.I. Petrova
Fig. 1. Transportation of a droplet containing probe in SAWs driven biochip
The basic working principle and the operating mode of the microfluidic biochips is based on piezoelectrically actuated Surface Acoustic Waves (SAWs) on the surface of a chip [9]. The acoustic streaming effects excited by SAWs have been used for a long time in high frequency applications (cf., [2,7]). The piezoelectric substrate material on the chip is equipped with the so-called interdigital transducers on the surface. They convert electromagnetic input signals into elastic waves on the surface of the substrate and produce SAWs. The interaction of the SAWs generated by electric pulses of high frequency together with fluid volumes of amounts of nanoliters leads to streaming patterns in the fluid and serves as a driving force for a motion of the fluid as whole. The acoustic waves propagate like a miniaturized earthquake and in this way transport liquids along the surface of the chip. A transportation of a droplet containing probe along a lithographically produced network of channels to marker molecules placed at prespecified surface locations is presented in Fig. 1. On the fluidic network a small portion of titrate solution (middle) is separated from a larger volume (right). Surface acoustic waves transport this quantity towards the analyte (left) at the reaction site. Once a critical concentration is attained, it can be either detected by a change of the color of the analyte or a change of the conductivity. In the latter case, this can be easily measured by a sensor that is integrated on the same chip. In the recent years, the described acoustically driven microfluidic biochips have been actively used in physics, pharmacology, molecular biology, and clinical diagnostics. In most of these applications one needs a precise handling of very small amounts of tiny samples and costly reagents, cost-effectiveness, better sensitivity, and significant speed-up of biochemical analysis. We are interested in modeling and numerical simulation of microfluidic flows on biochips and optimization techniques for an improved design of these devices (channel structure, location of the capillary barriers, optimal positioning of the interdigital transducers). More precisely, we focus on shape optimization of domains with channel geometries filled with a flow and consider an objective functional depending on specific physical applications. The paper is organized as follows. Section 2 describes the physical model using the classical Navier-Stokes equations. Due to different time scales, the model consists of two systems of Partial Differential Equations (PDEs). Our PDEs constrained optimization problem based on a stationary Stokes-like system and numerical results are presented in Section 3.
On Shape Optimization of Acoustically Driven Microfluidic Biochips
2
823
The Physical Model
The physical model relies on the compressible Navier-Stokes equations with velocity v, pressure p, and density ρ of the fluid in a time-dependent computational domain Ω(t) ∂v η + [∇v]v = −∇p + ηΔv + ζ + ∇(∇ · v) , ρ ∂t 3 ∂ρ + ∇ · (ρv) = 0 , (1) ∂t p = p(ρ) = aργ in Ω(t), t > 0. Here, η and ζ are the shear and bulk viscosity coefficients and a, γ > 0. Usually, a linear dependence (γ = 1) of fluid (such as water) is assumed. Note that the right-hand side of the first equation in (1) is the divergence of the Newtonian stress tensor Σ = Σ(v, p) defined componentwise as follows ∂vi ∂vj 2 Σij (v, p) = −p δij + η + + ζ − η (∇ · v)δij . ∂xj ∂xi 3 The second row in (1) represents the continuity equation. The last (constitutive) equation is considered to close the system. The problem (1) is coupled by the boundary conditions v(t, x + u) =
∂u (t, x) ∂t
on ∂Ω(t), t > 0,
where u = u(t, x) is the SAW displacement (from the equilibrium state) of the substrate at the surface, which is typically a harmonic function of time. Suitable initial conditions for the unknowns v = 0 and p = p0 are prescribed at t = 0 in Ω0 := Ω(0). The system (1) is a multi-scale problem due to the extremely different time scales. The acoustics (elastic waves on the chip surface interact with acoustic waves of fluid) is a process with a time scale of nanoseconds /10−8s/, while the acoustic streaming is in milliseconds /10−3 s − 100 s/. We use the idea behind the approximation theory, assuming an expansion of the unknowns v, p, ρ with a scale parameter ε v = 0 + εv + ε2 v + O(ε3 ) , p = p0 + εp + ε2 p + O(ε3 ) , ρ = ρ0 + ερ + ε2 ρ + O(ε3 ) , where ε, 0 < ε 1, is proportional to the maximal SAWs displacement of the domain boundary. We define v 1 = εv , v 2 = ε2 v and analogously p1 = εp , p2 = ε2 p and ρ1 = ερ , ρ2 = ε2 ρ . Assume all quantities with subscript 0 to be known and constant in time and space. Collecting all terms of order O(ε) we get the linear system for damped propagation of acoustic waves ∂v 1 η − ηΔv 1 − ζ + ∇(∇ · v 1 ) + ∇p1 = 0 , ρ0 ∂t 3
824
S.I. Petrova
∂ρ1 + ρ0 ∇ · v 1 = 0 , ∂t p1 − c20 ρ1 = 0
(2) in Ω(t), t > 0
∂u (t, x) on ∂Ω(t), t > 0, coming from the ∂t observation u = O(ε) and the Taylor expansion with boundary condition v 1 (t, x) =
v(t, x + u) = v(t, x) + [∇v] u + O( u 2 ) =
∂u (t, x) ∂t
and initial conditions v 1 = 0 and p1 = 0 in Ω0 . The constant c0 stands for the small signal speed of the fluid with c20 := aγργ−1 . The problem (2), with 0 dominated acoustic effects, is our microscopic time-scale problem. After having solved the acoustic system (2) for the variables v 1 , p1 , and ρ1 , the collection of all second order terms O(ε2 ) results in ∂v 2 ∂v 1 η ρ0 − ηΔv 2 − ζ + ∇(∇ · v 2 ) + ∇p2 = −ρ1 − ρ0 [∇v 1 ]v 1 , ∂t 3 ∂t ∂ρ2 + ρ0 ∇ · v 2 = −∇ · (ρ1 v 1 ) , (3) ∂t in Ω(t), t > 0, p2 − c20 ρ2 = 0 v 2 = −[∇v 1 ] u p2 = 0 v 2 = 0,
on ∂Ω(t), t > 0, on Ω0 .
The system (3) can be interpreted as an instationary Stokes problem and refers to our macroscopic time-scale problem. We are interested in the acoustic streaming effects observable on the larger (macroscopic) time scale. For that reason we consider the time-averaging operator 1 t0 +T < a >:= a(t) dt, T t0 where T := 2πω is the period of the harmonic SAW oscillation. To simplify the model and perform a numerical simulation, we neglect the time dependence of the domain which is possible due to the very small SAW displacements u = O(ε) and the scale parameter ε 1. Thus, we arrive at the following stationary Stokes system for the acoustic streaming: η ∂v 1 ∇(∇ · v 2 ) + ∇p2 = − ρ1 + ρ0 [∇v 1 ]v 1 , − ηΔv 2 − ζ + 3 ∂t ρ0 ∇ · v 2 = −∇ · (ρ1 v 1 ) in Ω , (4) 2 in Ω , p 2 − c0 ρ 2 = 0 v 2 = − [∇v 1 ] u
on ∂Ω.
The linear system (4) describes the stationary average flow field, resulting after relaxation of SAWs of high frequency. For more details of the model description see [2,7,9].
On Shape Optimization of Acoustically Driven Microfluidic Biochips
3
825
Shape Optimization Problem
In this section, we focus on the solution of the Stokes-like problem (4) and consider it as a state system in our PDEs constrained optimization. The standard algorithms for the physical model requires first a solution of subproblem (2) for the damped propagation of acoustic waves to find v 1 , p1 , ρ1 with a given periodic function u as the SAW displacement on the boundary of the domain. For this subproblem we rely on the numerical simulation presented in [5] and concentrate further on the solution of the second subproblem (4). For convenience, in what follows, we set v = v 2 , p = p2 , and ρ = ρ2 and denote ∂v 1 f v := − ρ1 + ρ0 [∇v 1 ]v 1 , fp := −∇ · (ρ1 v 1 ) . ∂t Our purpose is to find the optimal structural design of a channel located on the chip surface. We suppose that the computational domain Ω ⊂ lR2 is occupied by a part of a channel with a capillary barrier and a reservoir, as illustrated in Fig. 2. The capillary barrier is placed between the channel and the reservoir to insure a proper filling of the reservoir with a precise amount of liquid probe. The additional outlet valves on upper and lower walls of the boundary are passive when the capillary barrier is opened and activate when it is in stopping mode. We consider an optimization problem with an objective functional J(w, α) with respect to the state variables w = (v, p, ρ)T and the design parameters α in the domain Ω(α) with a part of the boundary depending on α. Our aim is to compute the maximal pumping rate for the surface acoustic wave driven fluidic transport. For that purpose, we consider a cross section Ω(α) of the channel and define the objective functional as follows J(w, α) := max ρv ·ds (5) α Ω(α) Suppose that α = (α1 , . . . , αM )T is the vector of the B´ezier control points of order M on both sides of the lateral walls of the channel. More precisely, we consider Mu points (α1 , . . . , αMu )T on the upper part of the lateral wall and Ml points on the lower part of the wall, so that M = Mu + Ml . Typically, the B´ezier control points are the prespecified coefficients in the linear combination for the B´ezier-curve by means of Bernstein polynomials. Taylor-Hood P2/P1 (quadratic with respect to the velocity components and linear for the pressure and density) finite elements are used for the spatial discretization of the Stokes-like system with shape regular triangulation of the domain. Denote by v ∈ lRN1 , p ∈ lRN2 , and ρ ∈ lRN3 respectively, the velocity, the pressure, and the density components in the discretized model, so that w = (v, p, ρ)T ∈ lRN with N = N1 + N2 + N3 . In the sequel, we consider the discretized PDEs constrained shape optimization problem to optimize the objective (5) subject to the PDEs equality constraints η ∇(∇ · v) + ∇p − f v = 0 (6) C1 (w, α) := −ηΔv − ζ + 3
826
S.I. Petrova
C2 (w, α) := ρ0 ∇ · v C3 (w, α) := p −
c20
− fp = 0
ρ
in Ω(α) ,
=0
and the inequality constraints on α given entrywise as follows (i)
αmin < αi < α(i) max ,
1 ≤ i ≤ M.
(7)
The third constraint in (6) relates the third equation in (4) and is included only for clarity in presentation. Practically, this constraint is not taken into account in the numerical experiments due to its simplicity. Coupling the constraints by Lagrange multipliers we define the Lagrangian function associated with problem (5)-(7) L(w, α, λ, z) := J(w, α) + λT1 C1 (w, α) + λT2 C2 (w, α) + λT3 C3 (w, α)
(8)
− z T1 (α − αmin ) − z T2 (αmax − α). Here, λ := (λ1 , λ2 , λ3 )T ∈ lRN with λi ∈ lRNi , i = 1, 2, 3 refers to the Lagrange multipliers for the equality constraints. The vector z := (z 1 , z 2 )T with z 1 ≥ 0, z 2 ≥ 0 stands for the Lagrange multipliers corresponding to the inequality constraints. The limits vectors for the design parameters are denoted by T T (1) (M) (M) αmin = αmin , . . . , αmin , αmax = α(1) , . . . , α . max max Thus, we are led to the saddle-point problem inf sup L(w, α, λ, z) w , α λ,z with primal variables (w, α)T ∈ lRN × lRM and dual variables (λ, z)T ∈ lRN × lR2M . The inequality constraints (7) are treated by logarithmic barrier terms resulting in the perturbed objective functional β (μ) (w, α) := J(w, α) − μ
M
(i) log(αi − αmin ) + log(α(i) max − αi ) ,
(9)
i=1
where μ > 0, μ → 0, is the barrier parameter. The introduction of logarithmic terms leads to the following parameterized family of equality constrained optimization subproblems (10) inf β (μ) (w, α) w ,α subject to C1 (w, α) = 0 , C2 (w, α) = 0 , C3 (w, α) = 0 .
(11)
The parameterized Lagrangian function corresponding to (10)-(11) is defined L(μ) (w, α, λ) := β (μ) (w, α) +
3 i=1
λTi Ci (w, α) .
(12)
On Shape Optimization of Acoustically Driven Microfluidic Biochips
827
Denote the unknown solution by ψ := ψ(μ) = (w(μ), α(μ), λ(μ))T . The firstorder necessary Karush-Kuhn-Tucker (KKT)-conditions for optimality read F (μ) (ψ) := ∇L(μ) = 0,
(13)
T (μ) (μ) (μ) (μ) (μ) where ∇L(μ) := Lv , Lp , Lρ , Lα , L is the gradient of the Lagrangian λ (12) with respect to the unknown variables. Then, (13) results in
(μ)
3
(μ)
3
(μ)
3
(μ)
i=1 3
Lv = ∇v J +
∂v (λTi Ci (w, α)) = 0 ,
i=1
Lp = ∇p J +
∂p (λTi Ci (w, α)) = 0 ,
i=1
Lρ = ∇ρ J + Lα = ∇α J + (μ)
∂ρ (λTi Ci (w, α)) = 0 , ∂α (λTi Ci (w, α)) − μ D1−1 e + μ D2−1 e = 0 ,
i=1
T
L = ( C1 (w, α), C2 (w, α), C3 (w, α) ) λ
= 0.
Here, we have denoted by e = (1, . . . , 1)T the vector of all ones and consider the so-called perturbed complementarity conditions D1 z 1 = μ e and D2 z 2 = μ e with (i) (i) diagonal matrices D1 = diag(αi −αmin ) and D2 = diag(αmax −αi ). The sequence Table 1. Convergence results of the path-following method N 1680 3424 6852
Mu Ml iter µ itern 8 8 4 2.1e-5 2 8 8 7 3.5e-5 3 8 8 10 7.4e-5 4
Fig. 2. Domain with capillary barrier and reservoir (initial and optimal shape)
828
S.I. Petrova
of solutions ψ(μ), μ → 0 of the nonlinear system of equations (13) define the central path (referred also as barrier trajectory). For the solution procedure we rely on path-following interior-point method with an inexact Newton solver. For description of the method and recent applications, see [1,3]. The optimal shape is obtained by using M = 16 B´ezier points on both sides of the boundary as design variables and is presented by dotted line in Fig.2. In Table 1 we report the convergence results of the path-following method. Here, N is the number of all degrees of freedom, Mu and Ml are the number of B´ezier points on upper and lower boundary, iter is the number of global iterations with a tolerance tol = 10−5 , μ is the final value of the barrier parameter (starting value μ = 1), and itern is the number of iterations for the Newton solver with a tolerance toln = 10−3 .
Acknowledgements The main research in this paper has been supported by the German NSF Grant HO8779/1 within DFG SPP 1253 and the Bulgarian NSF Grant DO02–115/08. The author is thankful to all collaborators involved in the first project for the fruitful discussions. The work has been also partially granted by the internal Research Funding Program 2010 and FSP AMMO at the University of Applied Sciences Bielefeld, Germany.
References 1. Antil, H., Hoppe, R.H.W., Linsenmann, C.: Optimal design of stationary flow problems by path-following interior-point methods. Control and Cybernetics 37(4) (2008) 2. Bradley, C.E.: Acoustic streaming field structure: The influence of the radiator. J. Acoust. Soc. Am. 100, 1399–1408 (1996) 3. Hoppe, R.H.W., Petrova, S.I.: Path-following methods for shape optimal design of periodic microstructural materials. Optimization Methods and Software 24(2), 205– 218 (2009) 4. Karniadakis, G., Beskok, A., Aluru, N.: Microflows and Nanoflows. Fundamentals and Simulation. Springer, Berlin (2005) 5. K¨ oster, D.: Numerical simulation of acoustic streaming on surface acoustic wavedriven biochips. SIAM J. Sci. Comput. 29(6), 2352–2380 (2007) 6. Mohammadi, B., Pironneau, O.: Applied Shape Optimization for Fluids. Oxford University Press, Oxford (2001) 7. Uchida, T., Suzuki, T., Shiokawa, S.: Investigation of acoustic streaming excited by surface acoustic waves. In: Proc. of the 1995 IEEE Ultrasonics Symposium, pp. 1081–1084. IEEE, Los Alamitos (1995) 8. Ulbrich, S., Ulbrich, M.: Primal-dual interior-point methods for PDE-constrained optimization. Math. Program. 117, 435–485 (2009) 9. Wixforth, A.: Acoustically driven programmable microfluidics for biological and chemical applications. JALA 11(6), 399–405 (2006)
Block Preconditioners for the Incompressible Stokes Problem M. ur Rehman, C. Vuik, and G. Segal Faculty of Electrical Engineering, Mathematics and Computer Science, Delft Institute of Applied Mathematics, Delft University of Technology Mekelweg 4, 2628 CD Delft, The Netherlands
[email protected]
Abstract. This paper discusses the solution of the Stokes problem using block preconditioned iterative methods. Block preconditioners are based on the block factorization of the discretized problem. We focus on two specific types: SIMPLE-type preconditioners and the LSC preconditioner. Both methods use scaling to improve their performance. We test convergence of GCR in combination with these preconditioners both for a constant and a non-constant viscosity Stokes problem.
1
Introduction
Solution of the Stokes problem is a hot topic in the research community nowadays. Discretization of Stokes results in a so-called saddle point type problem. Saddle point problems appear not only in fluid dynamics but also in elasticity problems and some other fields. An iterative method that is developed for one type of saddle point can be applied in other areas as well [1]. Our work is focused on solving the saddle point problem that arises from the finite element discretization of the Stokes problem. In case of stable discretization we can formulate the problem as: f F BT u = . (1) B 0 p g F corresponds to the viscos part, B T is the discretized gradient operator, and B the divergence operator. We define n as the number of velocity unknowns and m the number of pressure unknowns (n ≥ m). The system is symmetric positive semi-indefinite. A Krylov subspace method is employed to solve the incompressible Stokes problem. Convergence of Krylov methods depends on the eigenvalue spectrum. A preconditioning technique is used to improve the convergence. Instead of solving Ax = b, one solves a system P −1 Ax = P −1 b, where P is the preconditioner. A good preconditioner should lead to fast convergence and the system of the form P z = r should be easy to solve. Since system (1) is symmetric indefinite , in the literature [2], preconditioned MINRES [3] is frequently used to solve the Stokes problem. However, one of the requirement of MINRES is that the preconditioner should be symmetric positive definite. This restricts the choice to a I. Lirkov, S. Margenov, and J. Wa´ sniewski (Eds.): LSSC 2009, LNCS 5910, pp. 829–836, 2010. c Springer-Verlag Berlin Heidelberg 2010
830
M. ur Rehman, C. Vuik, and G. Segal
block diagonal preconditioner. Since we are using block triangular preconditioners which are not SPD, it is impossible to use MINRES. Therefore, we use GCR [4,7] to solve the Stokes problem. GCR also allows variable preconditioners so we can use inaccurate solvers for the subsystems. We compare preconditioners that use scaling based on the velocity matrix and the velocity mass matrix. The preconditioners that uses scaling with the velocity mass matrix perform better than the rest of the preconditioners even in the variable viscosity Stokes problem. 1.1
Block Preconditioners
Block preconditioners are based on factorization of formulation (1) and can be written as: As = Lb Db Ub , (2) where As is the system matrix, Lb the lower block, Db the block diagonal and Ub the upper block matrices represented as: I 0 F 0 I Mu−1 B T , Db = and Ub = , (3) Lb = 0 S 0 I Ml−1 B T I where Ml = Mu = F and S = −BF −1 B T is known as the Schur complement matrix. Since the computation of F −1 is not practical for large problem it is necessary to approximate the Schur complement. This is the basis for all block preconditioners. The generalized form of the Schur complement matrix can be written as Sˆ = BM −1 B T , where M −1 is a suitable approximation of F −1 which is cheap to compute. We define two extra block matrices F BT , (4) Lbt = Lb Db = 0 Sˆ
and Ubt = Db Ub =
F 0 . B Sˆ
(5)
We will discuss preconditioners based on these definitions in the next section. 1.2
SIMPLE-Type Preconditioners
In this section, we discuss variants of the SIMPLE preconditioner [8]. The SIMPLE preconditioner can be written as PS = Lbt Ub ,
(6)
where M = Mu = D with D the diagonal of the velocity matrix. The preconditioner consists of one velocity solve and one pressure solve. It appears from our experiments that the number of iterations increases with an increase in problem
Block Preconditioners for the Incompressible Stokes Problem
831
size (number of unknowns). However, this preconditioner shows robust convergence with rough accuracy for the subsystems. A variant of SIMPLE, SIMPLER, uses the same approximation for the Schur complement. To solve PSR Zup = rup , where PSR is the SIMPLER preconditioner and rup the residual, SIMPLER performs the following steps: −1 −1 zup = Ubt Lb rup , Ml = D,
(7)
Zup = zup + Ub−1 L−1 bt rup (rup − As zup ), Mu = D.
(8)
The above formulation shows that SIMPLER consists of two velocity solves and two pressure solves. Usually SIMPLER is used with one velocity solve and two pressure solves, it appears that the second velocity solve in (8) can be skipped without any effect on the convergence. Lemma 1. In the SIMPLER preconditioner/algorithm, both variants (one or two velocity solves) are identical when the subsystems are solved by a direct solution method. For a proof see [9]. In this paper, we will use hSIMPLER instead of SIMPLER because we have observed that for Stokes, hSIMPLER performs better than SIMPLER. In hSIMPLER (PhSR ) we use SIMPLE in the first iteration, whereas SIMPLER is employed afterwards. Another variant of SIMPLE known as MSIMPLER (Modified SIMPLER) uses the same formulation. However, instead of using D, the diagonal of the velocity mass matrix Q is used as scaling matrix. Q is defined by: φi φj dΩ, (9) Qi,j = Ω
where φj and φi are the velocity basis functions. MSIMPLER (PMSR ) uses Ml = M = Mu = diag(Q) as scaling. This preconditioner gives better convergence than other variants of SIMPLE. Convergence of MSIMPLER is almost independent of the inner accuracy. 1.3
LSC Preconditioner
Next we discuss the diagonally scaled LSC preconditioner. A block triangular preconditioner is used with a special approximation of the Schur complement, defined by: PLSC = Lbt , (10) where Sˆ is approximated by a least squares commutator approach [10]. The approximation is based on commutator of the convection diffusion operator (on the velocity and pressure space) with the gradient operator. The Schur complement approximation is given as: Sˆ−1 ≈ −Fp (BM1−1 B T )−1 ,
(11)
832
and
M. ur Rehman, C. Vuik, and G. Segal
Fp = (BM2−1 B T )−1 (BM2−1 F M1−1 B T ),
where M1 and M2 are scaling matrices. With M1 = M2 = diag(Q). The preconditioner shows nice convergence in solving Stokes and Navier-Stokes. Since it also involves two Poisson and one velocity solve, it construction and cost per iteration is comparable to MSIMPLER. In general the convergence of LSC depends both on the mesh size and the Reynolds number. According to [2] sometimes there is no h-dependency. For LSC some results with stabilized elements have been published. In this paper, we use scaling based on the diagonal of the velocity matrix (M1 = M2 = D), which we call LSCD . The same type of scaling is also used in [11]. In the next section, we compare all these preconditioners with two types of scaling for an isoviscos and a varying viscosity problem. For the subsystems solves, we use both ICCG(0) [5] and AMG preconditioned CG from PETSc (Portable, Extensible Toolkit for Scientific Computation) [6].
2
Numerical Experiments
We divide this section into two parts. In the first part, we solve the Stokes problem with constant viscosity and in the second part, a variable viscosity Stokes problem is solved. The Stokes problem is solved up to an accuracy of k 2 10−6 . The iteration is stopped if the linear systems satisfy r b2 ≤ tol, where rk is the residual at the kth step of Krylov subspace method, b is the right hand side, and tol is the desired tolerance value. In all experiments, the Stokes equation discretized with Taylor-Hood Q2-Q1 elements. 2.1
Isoviscos Problem
In this section, we solve the driven cavity Stokes problem by employing preconditioners discussed in this paper. The definition of region and boundary conditions is given on the left-hand side of Fig. 1. The velocity subsystem is solved with AMG preconditioned CG and the pressure subsystem with ICCG(0). One V(1,1) cycle of AMG is used per preconditioning step in CG. In SIMPLE, MSIMPLER and LSC we compute the residual with accuracy 10−1 for velocity and 10−2 for the pressure subsystems. In hSIMPLER and LSCD we keep the inner accuracy 10−6 for both subsystems. This is done since the convergence of both preconditioners depends on the inner accuracy. A general convergence profile of block preconditioners discussed in this paper is given in Fig. 2 that shows that the pairs of LSC and MSIMPLER and LSCD and hSIMPLER have the same convergence pattern. Compared to other preconditioners, SIMPLE shows slow convergence. From Table 1, it is clear that MSIMPLER and LSC outperform other preconditioners. MSIMPLER and LSC show less dependence on the grid size compared
Block Preconditioners for the Incompressible Stokes Problem
833
Fig. 1. Solution of the Stokes problem: (Left) Driven cavity, (Right) Die problem with varying viscosity
3
10
2
LSCD
10
SIMPLE hSIMPLER LSC MSIMPLER
1
GCR Relative Residual
10
0
10
−1
10
−2
10
−3
10
−4
10
−5
10
0
50
100
150 No. of iterations
200
250
300
Fig. 2. Convergence pattern of various block preconditioners Table 1. GCR iterations: Solution of the constant viscosity Stokes problem Grid 32 × 32 64 × 64 128 × 128 256 × 256
PS 84 162 310 705
PhSR PM SR PLSCD PLSCD 25 13 19 11 43 16 26 17 80 21 40 21 161 28 70 27
to other preconditioners in which the number of outer iterations is almost proportional to the number of elements in each direction. If we look at the inner iterations for the velocity and the pressure in Fig. (3), we see that MSIMPLER and LSC are the best choices in terms of consumption of inner and outer iterations. SIMPLE shows robust behavior with small inner accuracy. Per iteration, hSIMPLER and LScD are expensive due to the demand for high inner accuracy. Therefore, even with a large number of outer iterations in SIMPLE, its inner iterations are comparable to hSIMPLER.
834
M. ur Rehman, C. Vuik, and G. Segal
5
4
10
10
SIMPLE SIMPLER MSIMPLER LSC
SIMPLE hSIMPLER MSIMPLER LSC
D
4
10 ICCG(0) iterations
AMG PCG iterations
D
LSC
3
10
2
LSC
3
10
10
2
1
10 32x32
64x64
128x128
256x256
Grid size
10 32x32
64x64
128x128
256x256
Grid Size
Fig. 3. Inner iterations: Solution of the driven cavity Stokes problem: Velocity subsystem(Left), Pressure subsystem (Right)
2.2
Variable Viscosity Problem
The right-hand side of Fig. 1 gives the configuration of the variable viscosity Stokes problem. The viscosity levels are plotted in color. The problem we consider is that of an aluminum round rod, which is heated and pressed through a die. In this way a prescribed shape can be constructed. In this specific example we consider the simple case of a small round rod to be constructed. The viscosity model used describes the viscosity as function of shear stress and temperature. The temperature and shear stress are the highest at the die where the aluminum is forced to flow into a much smaller region. The end rod is cut which is modelled as a free surface. Boundary conditions are prescribed velocity at the inlet and stress free at the outlet. At the container surface (boundary of thick rod), we have a no-slip condition. At the die we have friction, which is modeled as slip condition along the tangential direction and a no-flow condition in normal direction. The round boundary of the small rod satisfies free slip in tangential direction and no-flow in normal direction. We observed that in constant viscosity problem, MSIMPLER and LSC show better convergence than the other preconditioners discussed. The main improving factor in these two preconditioners is scaling with the velocity mass matrix. Here, we solve the variable viscosity Stokes problem and we expect that since the velocity mass matrix does not contain any information about the viscosity variations, other preconditioners may perform better than these two. From Table 2, it seems that even for the variable viscosity problem, MSIMPLER and LSC perform much better than the other preconditioners. In this problem, instead of viscosity variation, grid size plays a dominant role. Although LSCD and SIMPLE are scaled with the velocity matrix, which depends on the viscosity, their performance is poor compared to MSIMPLER and LSC. For this problem hSIMPLER is not reported since it does not show convergence even with high inner
Block Preconditioners for the Incompressible Stokes Problem
835
Table 2. GCR iterations: Solution of the variable viscosity Stokes problem using block preconditioned GCR Unknowns PS PM SR PLSCD PLSC 33k 257 14 52 15 66k 419 16 64 17 110k 491 17 92 15 195k 706 17 110 19
Fig. 4. Inner iterations: Solution of the variable viscosity Stokes problem: velocity subsystem (left), pressure subsystem (right); both panels plot the number of ICCG(0) iterations versus the number of unknowns
The inner accuracy of the preconditioners is the same as that used for the constant viscosity problem, except for LSC, whose velocity tolerance is tightened to 10^-2 because the iteration stagnates at 10^-1. The numbers of iterations consumed by the subsystems, shown in Fig. 4, follow the same pattern as in the constant viscosity Stokes problem.
3 Conclusions
In this paper, we solve the constant and variable viscosity Stokes problems using block-preconditioned GCR. SIMPLE-type and LSC-type preconditioners are used, each with two different scalings. Table 3 summarizes the dependence of the preconditioners on the mesh size and on the accuracy of the inner subsystem solves.

Table 3. Dependence of various block preconditioners on mesh size and subsystem accuracies

Dependence on      P_S   P_hSR   P_MSR   P_LSCD   P_LSC
Mesh size          Yes   Yes     Mild    Yes      Mild
Inner accuracy     No    Yes     No      Yes      Small
It is observed that MSIMPLER and LSC perform better than the other preconditioners for both types of problems. Both use a scaling with the velocity mass matrix, which contains no information about the variation of the viscosity. In further research, both preconditioners will be tested on problems with large viscosity contrasts.