Queue.type<...> Q5;
...
Q5 := Queue.create<...>(MaxCapacity.QUEUE);
CAD.bindDispatchingQueue<...>(Q5, 5);
...
thread threadgroup5() {
  Pair.type<EventHandlerEnum, CAD.Event> pair;
  EventHandlerEnum handler;
  CAD.Event event;

  loc loc0: live { handler, event }
    when Queue.size<...>(Q5) > 0 do invisible {
      pair := Queue.getFront<...>(Q5);
      Queue.dequeue<...>(Q5);
      handler := Pair.first<...>(pair);
      event := Pair.second<...>(pair);
    } goto loc1;

  loc loc1: live {}
    invisible invokevirtual f(handler, event) goto loc0;
}
Fig. 8. Bogor dispatch queue and thread model for ModalSP (excerpts).
In the Bogor model, queues are modeled using Queue and Pair extensions. Figure 8 illustrates the 5 Hz rate queue of pending event dispatches and the thread, threadgroup5, that cyclically dequeues dispatch pairs and invokes the component event handler encoded in each pair (note that pair type declarations are elided, i.e., <...>, for improved readability). Each correlator is represented as a deterministic finite-state automaton whose transition function is encoded as a static transition table. For each correlator, there is a single state variable that holds the current correlator state. Since the structure of correlators is fixed for a given system, the transition tables are not held in the state vector.
5.4 Summary of Data Portion of State-Vector
To summarize the modeling strategy discussed above, we present the state-vector components related to the data state of Cadena systems. The observable state of a Cadena assembly is comprised of all non-fixed system data. As we have noted above, correlator transition tables, subscriber lists, and component connection information are all fixed and are not considered part of the observable state.

Definition 1. Cadena Data States are tuples (c, r, a, t, p) where:
– c = (c1, ..., ck) stores the data states of component instances, each of which is comprised of a (possibly empty) set of mode attributes as defined by ci's component type.
– r = (qr1, ..., qrn) are rate-specific queues of pairs (c, e), recording the dispatch of event e to port c.
– a = (a1, ..., al) stores the current states of each of the event correlation recognition automata.
– t records an abstraction of time used to trigger timeouts.
– p records the priority of the current thread being executed.
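To make Definition 1 concrete, the following Python sketch renders the tuple (c, r, a, t, p) as a record; all class, field, and type names here are illustrative stand-ins, not Cadena's actual types.

from dataclasses import dataclass
from collections import deque
from typing import Deque, Dict, List, Tuple

Event = str   # hypothetical stand-ins for Cadena's event and port types
Port = str

@dataclass
class CadenaDataState:
    # c: mode attributes of each component instance (possibly empty per component)
    components: List[Dict[str, str]]
    # r: one queue of (port, event) dispatch pairs per rate group (in Hz)
    rate_queues: Dict[int, Deque[Tuple[Port, Event]]]
    # a: current state of each event-correlation automaton
    correlator_states: List[int]
    # t: abstraction of time used to trigger timeouts
    time: int = 0
    # p: priority of the thread currently being executed
    priority: int = 0

# The initial state described below: modes initial, automata at their start
# states, queues empty, t = 0, and the priority set to the highest value.
initial = CadenaDataState(
    components=[{"mode": "init"}],
    rate_queues={1: deque(), 5: deque(), 20: deque()},
    correlator_states=[0],
    time=0,
    priority=20,  # assuming higher number means higher priority (20 Hz group)
)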
The initial state is defined to have instance modes set to their initial values, correlation automata set to their start states, rate-specific queues empty, t = 0, and the priority variable set to the highest priority. The values of local variables in component handlers and methods and in the implementation of push methods and rate-specific threads cannot be observed outside their method activation by other threads or by property observables, and are also not considered part of the observable state. Local variables are held in the state vector, but only during the corresponding method activations. In addition to the data state described above, Bogor maintains control information in the form of a program counter and a method call-stack frame for each thread.
5.5 Strategies for Modeling Scheduling and Time
The behavior of Cadena systems is driven by the triggering of middleware timeouts as described in Section 4 and is controlled by the scheduling policies of the thread-pool in the real-time event channel. Finding an effective strategy for modeling these timeouts and thread-scheduling is a central issue in the construction of Cadena models. When analyzing concurrent systems, most model-checkers do not attempt to exploit knowledge of specific timing or scheduling strategies but instead explore all possible interleavings of concurrent actions. If we followed this approach, we would allow timeout events to occur non-deterministically between every system transition and we would allow actions from different threads to be interleaved non-deterministically without consideration of priorities or other scheduling constraints. While such a strategy is sound in that it covers all possible system behaviors, the number of states generated makes it impractical for all but the smallest systems. In the subsections below, we describe several strategies that we use to reduce infeasible interleavings. Each strategy incorporates constraints based on observations about priority scheduling and timeout policies implemented by the real-time middleware.

Priority-based scheduling: Having the model-checker non-deterministically explore interleavings without considering thread priorities obviously introduces schedules that are infeasible in the actual system, e.g., a schedule that continues to execute transitions from a lower-priority thread even though a higher-priority thread is enabled.

Inter-rate-group timeout constraints: Having the model-checker non-deterministically generate timeout events introduces schedules that are infeasible in the actual system, e.g., a 5 Hz timeout event should not occur more frequently than a 20 Hz timeout event. We present strategies that reduce infeasible interleavings by taking into account the appropriate relative frequency of timeout events, i.e., by taking into account constraints that exist between timeouts of different rate groups.

Intra-rate-group timeout constraints: Having the model-checker non-deterministically generate timeout events introduces infeasible schedules
where a timeout for a rate group r occurs before all events in the current frame for r are dispatched or before the previous timeout from group r is even dispatched. We constrain the generation of timeout events to ensure that timeouts from the same rate group are not triggered “too quickly”. This strategy constrains the occurrence of timeouts by considering the relative lengths of the real-time frames and constrains scheduling by considering priority information.

Lazy-time with priority scheduling: In addition to the techniques used in the strategy above, this strategy also considers timing estimates for system transitions, which allows additional infeasible schedules to be removed from consideration.
5.6 Representing Priority-Based Scheduling Information
Bold Stroke systems are priority scheduled based on the results of rate monotonic analysis of a set of harmonic rate groups. The CAD call connectEvent(), illustrated in Figure 7, assigns a rate, and hence a priority, to each component handler for a given event. The default non-deterministic scheduling policy in Bogor is implemented by a module that calculates the set of enabled transitions in a given state and passes that set to the state exploration module, which explores each possible outgoing transition. When reporting our experiments, we refer to models that use this strategy as priority-unaware. For Cadena models, a Bogor plugin is used that intercepts the set of enabled transitions in a given state, selects the transitions with the highest priority, and passes these transitions on to the state exploration module. As expected, this yields dramatic reductions in the state space, as shown in Section 6, and also improves the precision of the state space since only infeasible schedules are eliminated (i.e., ones on which a lower-priority transition executes when a higher-priority transition is enabled). We refer to models that use this strategy as priority-aware. Variations of this plugin are used in the following models to allow for interleaving of timeouts with the highest-priority enabled transition.
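The effect of this plugin can be sketched in a few lines of Python; the Transition class and its fields are assumptions for illustration, not Bogor's actual plugin API.

class Transition:
    def __init__(self, thread_id, priority, action):
        self.thread_id = thread_id   # owning thread group
        self.priority = priority     # derived from the handler's rate
        self.action = action         # the transition's effect

def priority_aware(enabled):
    # Keep only the enabled transitions of maximal priority; these are the
    # only ones passed on to the state exploration module.
    enabled = list(enabled)
    if not enabled:
        return []
    top = max(t.priority for t in enabled)
    return [t for t in enabled if t.priority == top]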
5.7 Representing Intra-rate-group Timing Constraints
The treatment of time, t, determines, in part, the fidelity of the model with respect to the real system's behaviors. If detailed timing information is available, one can keep track of time as component actions are executed and use that time value to trigger periodic events. However, even when timing information is not available, one can still reduce the occurrence of timeout events based on both intra- and inter-rate-group constraints. Intra-rate-group constraints that we consider involve the notion of frame overrun. A frame overrun occurs when a timeout event er for rate group r occurs before all events e triggered directly or indirectly by the previous timeout for r are processed by the rate group's thread tr. In normal situations, a timeout er occurs and is dispatched, other events arrive in the event channel's dispatch queues (including those associated with r), and thread tr becomes idle after all
events associated with r have been dispatched. The time that tr remains idle waiting for the next r timeout is called slack time. If a system has a frame overrun error, a thread tr has no slack time – it is unable to finish all of its work before the next timeout er arrives. Note that exploring the state-space of systems where arbitrary frame overruns are modeled results in a huge number of additional system behaviors that would very likely be infeasible if actual timing data were considered (timing data would allow us to conclude that in most cases frame overruns do not occur). While frame overruns are a real source of bugs in Bold Stroke systems, engineers have other tools and methods for detecting these types of errors. Accordingly, we reduce the state space that we explore using two strategies. The first strategy, which we call no overruns, assumes that no frame overruns occur at all. This is implemented by having the model-checker scheduler only emit a timeout event for rate group r if there are no enabled transitions associated with rate group r – which models the situation where tr has become idle because its associated dispatch queue is empty. The second strategy, which we call limited overruns, is implemented by having the model-checker scheduler only emit a timeout event er if there is no other timeout event remaining in the r dispatch queue (but other non-timeout events may still be waiting in the queue for dispatch). Intuitively, this model includes overruns that only spill over into the very next frame but does not include overruns where processing is ‘late’ by more than one additional frame.
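The two policies amount to simple guards on when the scheduler may emit a timeout; the following Python sketch assumes a hypothetical string-tagged dispatch queue rather than Bogor's actual data structures.

def may_emit_timeout_no_overruns(rate_group_enabled: bool) -> bool:
    # "no overruns": emit a timeout for rate group r only when no transition
    # of r is enabled, i.e. its thread is idle and its dispatch queue is empty.
    return not rate_group_enabled

def may_emit_timeout_limited_overruns(dispatch_queue) -> bool:
    # "limited overruns": emit a timeout for r only if no timeout event for r
    # is still pending; other (non-timeout) events may remain in the queue.
    return all(event != "timeout" for event in dispatch_queue)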
5.8 Representing Inter-rate-group Timing Constraints
The strategies related to frame overrun in the previous section constrain timeout events by considering when they should occur relative to other timeouts from the same rate group. We now describe a strategy, which we call the relative-time (RT) strategy, that constrains the issuing of timeout events by considering when a timeout for r should occur relative to a timeout for a different rate group r'. Specifically, we take advantage of the fact that in rate-monotonic scheduling theory (which is used in Bold Stroke systems), the frame associated with a rate r can be evenly divided into some whole number of r'-frames for each rate r' that is higher than r. In the example system of Figure 3, the frame of the slowest rate (1 Hz) can be divided into five 5 Hz frames, and each 5 Hz frame can be divided into four 20 Hz frames. The longest frame/period (the frame associated with the lowest rate) is called the hyper-period. The relative-time model enforces the following constraints related to the issuing of timeouts:
– a single timeout is issued for the slowest rate group in the hyper-period,
– timeouts for rate groups ri and rj, where ri > rj, are issued such that ri/rj timeouts of rate ri are issued in an rj frame.
These constraints determine the total number and relative ordering of instances of timeouts that may occur in the hyper-period.
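For harmonic rate groups these constraints reduce to integer ratios, as the following Python sketch (with hypothetical helper names) illustrates for the ModalSP rates.

def timeouts_in_frame(ri: int, rj: int) -> int:
    # Number of ri timeouts issued in one rj frame, for harmonic rates ri > rj.
    assert ri > rj and ri % rj == 0
    return ri // rj

def timeouts_per_hyperperiod(rates_hz):
    # Total number of timeouts per rate group in one hyper-period
    # (the frame of the slowest rate group).
    slowest = min(rates_hz)
    assert all(r % slowest == 0 for r in rates_hz), "rates must be harmonic"
    return {r: r // slowest for r in rates_hz}

# ModalSP: one 1 Hz, five 5 Hz, and twenty 20 Hz timeouts per hyper-period,
# with four 20 Hz timeouts in every 5 Hz frame.
assert timeouts_per_hyperperiod([1, 5, 20]) == {1: 1, 5: 5, 20: 20}
assert timeouts_in_frame(20, 5) == 4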
CAD.Component Timer;
...
Timer := CAD.createComponent("Timer");
CAD.declareEventSourcePort<EventType>(Timer, "timeOut5", EventType.TimeOut);
...
thread timerThread() {
  loc loc0: live {}
    when true do { time := (time + 1) % 20; } goto loc0;
}
...
thread timeOutSenderThread() {
  ...
  loc loc1: // 5 Hz timeout case
    when time % (20 / 5) == 0 do invisible {} goto locInvoke5;
    when time % (20 / 5) != 0 do invisible {} goto loc2;
  ...
  loc locInvoke5: live { localTime }
    invisible invoke pushOfProxy(Timer, "timeOut5",
      CAD.createEvent<EventType>(EventType.TimeOut)) goto loc2;
  ...
  loc loc2: // 1 Hz timeout case
  ...
}
Fig. 9. Timer and TimeOutSender thread models for ModalSP (excerpts).
Figure 9 shows the Bogor code for two threads that are used to model this strategy. Thread timerThread increments an abstraction of time where each ‘tick’ (i.e., each increment of the time variable) represents the passing of time corresponding to the shortest frame in the system (e.g., in the ModalSP, each tick represents a 20 Hz frame). The time variable wraps around every 20 ticks, which corresponds to the fact that there are twenty 20 Hz frames in the 1 Hz hyper-period. Thread timeOutSenderThread models the behavior of the rate-specific timer threads in the middleware discussed in Section 4. This thread monitors time and when it observes a change in the time value, it passes through a case statement to see which timeout events should be dispatched at that point. Since a time tick represents the period of the shortest frame, a new timeout event for the fastest rate is issued on each pass through the case statement. In our example system, the 5 Hz timeout happens every fourth tick. To represent the occurrence of a timeout, the thread enqueues the timeout event through the standard push call. From the explanation above, it is clear that the RT model only establishes the occurrence of timeouts relative to each other – it does not relate timeout occurrences to the time required by component event handlers and method execution. Thus, it is now important to understand when timeout actions may occur with respect to actions that occur inside of component handlers, i.e., when these actions can be interrupted by timeouts. To see that the model safely approximates all interleavings of timeouts and component actions (given the constraint of no frame overruns), consider Figure 10. This figure illustrates four points during a system execution which contains 5 Hz and 10 Hz rate processing. The 10 Hz and 5 Hz timeouts are queued together (e.g., at the point 1) since they both have frames that begin at the
[Figure 10 depicts a timeline of 10 Hz and 5 Hz processing, marking 10 Hz and 5 Hz timeouts, execution steps, priority delays, and interleavings at four points in time.]
Fig. 10. Relative-time Environment.
same point. However, the 10 Hz timeout event is dispatched first due to its higher priority. Once all the actions associated with 10 Hz component processing complete (e.g., at point 2), the model-checker scheduler begins consideration of lower-priority actions and the 5 Hz timeout is dispatched, leading to 5 Hz component processing. Our no overruns assumption entails that processing the 10 Hz component actions does not require more time than the period of the 10 Hz frame – thus, the next 10 Hz timeout cannot occur before point 2. Since we are not modeling the actual time required for carrying out component actions, it is impossible to determine the relationship between the time required for 5 Hz component action processing (e.g., the duration from point 2 to 4) and the time until the next 10 Hz timeout (e.g., the duration from point 2 to 3). To safely cover all possibilities, we must allow for any relationship between these durations. To model all such relationships, we adapt Bogor's standard scheduling module to consider all interleavings of enabled timeouts with the enabled transitions. On the right in Figure 10, the interleavings of the 10 Hz timeout and the enabled transitions performed during 5 Hz component processing are illustrated. The dark grey double circle represents the dispatching of the leftmost 10 Hz timeout event. This is followed by three dark grey circles representing transitions in 10 Hz component processing; the first branch point represents the choice between the next 10 Hz timeout (on the left) and dispatching the already queued 5 Hz timeout event (on the right). If 5 Hz processing is selected, then the choice between the 10 Hz timeout and 5 Hz processing repeats for the next enabled 5 Hz transition, illustrated as a light grey circle.
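A minimal sketch of this adapted scheduling module, assuming list-valued transition sets and a priority function (both hypothetical):

def rt_scheduler_choices(enabled_timeouts, enabled_transitions, priority):
    # Offer every enabled timeout as a branch alongside the highest-priority
    # enabled component transitions, so all timeout interleavings are explored.
    top = max((priority(t) for t in enabled_transitions), default=None)
    best = [t for t in enabled_transitions if priority(t) == top]
    return list(enabled_timeouts) + best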
5.9 Lazily-Timed Components
In the relative-time model, timeouts are arranged in a proper order and ratio with respect to each other, but there are no constraints that guarantee that, e.g., the interval between timeouts is appropriate for the corresponding period. This means that the model may have interleavings in which a timeout, e.g., for ri, occurs prematurely with respect to an action sequence whose duration is less than period(ri). For example, if the 5 Hz component processing (i.e., from point 2 to 4) in Figure 10 is guaranteed to be less than the time to the next
[Figure 11 depicts a timeline of 10 Hz and 5 Hz processing with a priority delay, marked at three points in time.]
Fig. 11. Lazily-timed Environment.
10 Hz timeout (i.e., point 4 comes before point 3), then the interleavings of 10 Hz timeouts with 5 Hz processing in the RT model will be infeasible. The lazily-timed (LT) component model addresses this by leveraging worst-case estimates of the running time of components; these will be available for Cadena systems to support rate monotonic analysis. This model can be configured for whatever granularity of timing information is available. Here we consider worst-case timing estimates for event handlers. Conceptually, the estimates are used to determine whether a handler can run without interruption before the next timeout occurs and, if not, the model non-deterministically interleaves action sequences from the handler with timeouts and higher-priority actions that follow from timeouts. This model modifies the data associated with time to record the intra-hyper-period (IHP) time normalized by the least common factor of all handler durations and timeout periods, the guards in timeOutSenderThread from Figure 9 are adjusted accordingly, and each component handler is modified to include an increment of time. Figure 11 illustrates how these increments are performed. It shows the execution of 5 Hz component processing subsequent to completion of 10 Hz processing in a frame. There are two cases: (1) the worst-case time estimate of the 5 Hz processing (which runs up to point 2) ends at or before the next timeout (which occurs at point 3), or (2) it does not (the timeout occurs at point 1 and interrupts the 5 Hz actions). In case (1), the IHP time is incremented by the worst-case timing estimate of the currently running 5 Hz event handler and the state space exploration algorithm proceeds; note that there is no branching in the state space for this case. In case (2), the IHP time is incremented to the next timeout (i.e., point 1), a non-deterministically chosen prefix of the currently running 5 Hz handler is executed, and then the 10 Hz timeout is performed. By choosing a prefix of the handler actions, we are modeling all possible distributions of timing across the actions of the handler. The remaining portion of the handler is left for the state-space exploration algorithm after the 10 Hz timeout and subsequent 10 Hz processing are performed. The difference between point 2 and point 1 (i.e., the worst-case execution time of the handler minus the remaining time of the 10 Hz frame) is assigned to that remaining portion as its duration. This model can be seen as a refinement of the RT model. It eliminates interleavings when the timing estimates guarantee that a group of highest-priority enabled transitions are guaranteed to complete before the next timeout. In the example in Figure 10, if the right-most three light-grey circles correspond to a
5 Hz component handler body whose worst-case execution bound is less than the time to the next 10 Hz timeout, then there would be no branching in that portion of the state space (i.e., the lower two left outgoing arcs to 10 Hz timeouts are eliminated).
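The case split of the LT model can be sketched as follows; the integer time representation and the function name are illustrative assumptions.

def run_handler_lazily(ihp_time: int, wcet: int, next_timeout: int):
    """Return the new IHP time and the duration of any interrupted remainder."""
    if ihp_time + wcet <= next_timeout:
        # Case (1): the handler is guaranteed to complete before the next
        # timeout; advance time by its worst-case estimate (no branching).
        return ihp_time + wcet, 0
    # Case (2): advance time to the timeout; a non-deterministically chosen
    # prefix of the handler runs now, and the remainder, with duration
    # wcet - (next_timeout - ihp_time), is explored after the timeout.
    return next_timeout, wcet - (next_timeout - ihp_time)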
6 Experimental Results
Table 1 shows the results of evaluating our strategies using four example systems provided by Boeing engineers. As an example of how to read a system description, the ModalSP scenario that we have used as an example has three threads (for rate groups 1 Hz, 5 Hz, and 20 Hz), 8 components, an event correlation (e/c), and 125 events being generated per one-second hyper-period (hp). For each scenario, we give data for four models that incorporate the modeling strategies presented in the previous section.
– (R) is the reference model. There is no scheduling policy for the thread groups in the scenario (it is priority-unaware and has no intra-rate-group timing constraints). Since a completely interleaved execution is infeasible to check, the relative-time constraints are used, though.
– (RT-1) uses two policies: priority-aware scheduling and the relative-time environment, where we implement the no frame overruns strategy for the highest-priority thread only.
– (RT-2) is like (RT-1), but also assumes there are no frame overruns for all threads.
– (LT) is like (RT-2) but uses the lazy-time environment model.
For each example, we collect the number of transitions (trans), the number of states (states), the running time (time), and the memory consumption (mem) at the end of the search. The numbers of transitions and states are both listed because some of the steps in the model are marked as invisible (atomic), for which Bogor will not save the states. The experiments were run on a 2.53 GHz Pentium 4 with 1.5 GB RAM using the Java 2 Platform. Bogor's collapse compression [17], heap symmetry [18], and process symmetry [2] reductions are used in all of the experiments. Each of the experiments represents a complete exploration of the state-space of the system. From the table, the state space generally decreases from model (R) through (RT-1) and (RT-2) to (LT). This shows that by incorporating more knowledge (e.g., the scheduling policy) of the model that is being checked, fewer states need to be explored. For example, Medium, the largest scenario that we have, cannot be model checked using Bogor or our previous dSpin implementation [14] without employing the reduction strategies used in (RT-2) and (LT). For Basic, the states are the same for models (RT-1) and (RT-2) because it only has a single thread (thus, there is no interleaving). Model (R) has a larger number of states because the lack of constraints allows the timeout to occur even when events associated with the current frame are still being dispatched. Model (LT) has two more states than (RT-2) due to the overhead introduced by the timing transitions. Bogor runs out of memory checking ModalSP (R) (at 3 million states) and Medium (RT-1) (at 13 million states).
Table 1. Experiment Data.

Basic Scenario (Threads: 20Hz; Components: 3; Events: 2 per .05sec hp)
            (R)        (RT-1)     (RT-2)     (LT)
  trans     111        42         42         44
  states    20         12         12         14
  time      .16 sec    .11 sec    .09 sec    .11 sec
  mem       .51Mb      .5Mb       .5Mb       .51Mb

Multi-Rate Scenario (Threads: 20Hz, 40Hz; Components: 6; Events: 6 per .05sec hp)
            (R)        (RT-1)     (RT-2)     (LT)
  trans     1.36M      7.5K       .98K       .15K
  states    .12M       1.5K       .1K        33
  time      5 min      1.9 sec    .38 sec    .19 sec
  mem       16Mb       .77Mb      .61Mb      .61Mb

ModalSP Scenario (Threads: 1Hz, 5Hz, 20Hz; Components: 8 (e/c); Events: 125 per 1sec hp)
            (R)        (RT-1)     (RT-2)     (LT)
  trans     o.m.       .92M       38.2K      6.27K
  states    3M+        20.9K      9.1K       1.56K
  time      o.m.       20 sec     8.59 sec   2.11 sec
  mem       o.m.       4.1Mb      1.61Mb     1.45Mb

Medium Scenario (Threads: 1Hz, 20Hz; Components: 50; Events: 820 per 1sec hp)
            (R)        (RT-1)     (RT-2)     (LT)
  trans     o.m.       o.m.       3.79M      .36M
  states    —          13M+       .74M       74.5K
  time      o.m.       o.m.       29 min     3 min
  mem       o.m.       o.m.       71.8Mb     21.5Mb

(o.m. = out of memory)
It is interesting that the states for ModalSP (R) require more memory than the states for Medium (RT-1). This is an effect of the collapse compression that is used. Specifically, there are three threads in ModalSP (R), but only two threads in Medium (RT-1). In addition, ModalSP (R), which has fewer scheduling constraints, allows more interleaving than Medium (RT-1). Thus, the collapse compression can save more in Medium (RT-1) than in ModalSP (R), because there are more similar state bit patterns in Medium (RT-1) than in ModalSP (R).
7 Related Work
Garlan and Khersonsky [11] describe an approach for checking publish-subscribe systems (which they refer to as implicit invocation systems) using SMV. We build on their key insights of factoring system models into two parts: (1) a reusable model of run-time event-delivery infrastructure, and (2) application dependent, user-specified component models. However, we support much more directly the forms of component structure and connections (i.e., CCM structures and object references) and event-delivery mechanisms (i.e., RT CORBA middleware) found in real systems. This advance is achieved by leveraging Bogor’s direct modeling of OO concepts (as compared to the SMV input language which provides very little support for modeling programming language features) and Bogor’s extension mechanism (which allows complex middleware behavior to be captured internally to the model-checker). In addition, [11] does not provide any performance data, nor does that work consider any state-space reductions based on priorities, scheduling, or timing constraints that seem critical for scaling to realistic applications.
There has been a large body of work on timing and schedulability analysis for component-based systems (see also [12] in this volume). As these techniques have matured, they have been integrated into environments that support the development of real-time systems. For example, MetaH [25] and Geodesic [6] are frameworks that support the reuse of components written in Ada and Java, respectively, in real-time systems. These frameworks include a range of timing analyses and automatically generate infrastructure code that coordinates the execution of component code in a way that achieves the system's timing requirements. Cadena is complementary to this work in that it targets logical properties of a system using both light-weight and heavy-weight analysis techniques.

Ptolemy [20] is a framework that allows a wide variety of formal descriptions of components and their behavior to be integrated into a single system. Users provide sufficient detail in these descriptions to allow implementations to be automatically generated. Ptolemy provides a run-time infrastructure to mediate between components that have different execution models. In contrast, Cadena models intentionally leave out detail in order to provide more abstract system descriptions that are amenable to analysis for large systems. While Cadena provides some code generation capabilities, we do not attempt to generate component method implementations.

Model checking [4] has become extremely popular as a technology for analyzing behavioral models of software artifacts. Researchers have extracted such models from source code (e.g., [15,5]), UML design artifacts (e.g., [21,19]), and architectural descriptions (e.g., [1]). The difficulty with all applications of model checking is scaling it up to apply to realistically large and complex systems. Recent years have seen an enormous amount of research on the systematic abstraction of models to enable more tractable reasoning. We take a different approach in Cadena by exploiting the natural abstractions that arise when developing high-level design models of systems.
8 Conclusions
We believe the idea of a flexible model-checking framework that allows domain-specific extensions (such as the ones that we have used for encoding models of CORBA communication layers) can be a very effective approach for model-checking modern distributed system designs and implementations. We are currently working with Boeing engineers to incorporate other forms of domain information (e.g., common specification idioms) and other forms of light-weight checking (e.g., interface protocol checking, refinement checking) and static analysis into Cadena.
Acknowledgements

Thanks to various members of the Boeing Bold Stroke team including David Sharp, Carol Sanders, Wendy Roll, and Dennis Noll for their comments on the work described in this paper.
References

1. R. Allen and D. Garlan. A formal basis for architectural connection. ACM Transactions on Software Engineering and Methodology, July 1997.
2. D. Bosnacki, D. Dams, and L. Holenderski. Symmetric Spin. International Journal on Software Tools for Technology Transfer. Springer-Verlag, 2002.
3. G. Brat, K. Havelund, S. Park, and W. Visser. Java PathFinder – a second generation of a Java model-checker. In Proceedings of the Workshop on Advances in Verification, July 2000.
4. E. Clarke, O. Grumberg, and D. Peled. Model Checking. MIT Press, 2000.
5. J. C. Corbett, M. B. Dwyer, J. Hatcliff, S. Laubach, C. S. Păsăreanu, Robby, and H. Zheng. Bandera: Extracting finite-state models from Java source code. In Proceedings of the 22nd International Conference on Software Engineering, June 2000.
6. D. de Niz and R. Rajkumar. Geodesic – a reusable component framework for embedded real-time systems. Technical report, Carnegie Mellon University, 2002.
7. C. Demartini, R. Iosif, and R. Sisto. dSpin: A dynamic extension of SPIN. In Theoretical and Applied Aspects of SPIN Model Checking (LNCS 1680), Sept. 1999.
8. W. Deng, M. Dwyer, J. Hatcliff, G. Jung, Robby, and G. Singh. Model-checking middleware-based event-driven real-time embedded software (extended version). Forthcoming – April 2003.
9. B. Doerr and D. Sharp. Freeing product line architectures from execution dependencies. In Proceedings of the Software Technology Conference, May 1999.
10. Eclipse Consortium. Eclipse website. http://www.eclipse.org, 2001.
11. D. Garlan and S. Khersonsky. Model checking implicit-invocation systems. In Proceedings of the 10th International Workshop on Software Specification and Design, Nov. 2000.
12. G. Goessler and J. Sifakis. Composition for component-based modeling. In Proceedings of the First International Symposium on Formal Methods for Components and Objects, volume (this volume) of Lecture Notes in Computer Science. Springer, 2002.
13. T. H. Harrison, D. L. Levine, and D. C. Schmidt. The design and performance of a real-time CORBA event service. In Proceedings of the 1997 ACM SIGPLAN Conference on Object-Oriented Programming Systems, Languages and Applications, pages 184–200. ACM Press, 1997.
14. J. Hatcliff, W. Deng, M. Dwyer, G. Jung, and V. Prasad. Cadena: An integrated development, analysis, and verification environment for component-based systems. In Proceedings of the 25th International Conference on Software Engineering (to appear), 2003.
15. K. Havelund and T. Pressburger. Model checking Java programs using Java PathFinder. International Journal on Software Tools for Technology Transfer, 1999.
16. G. J. Holzmann. The model checker SPIN. IEEE Transactions on Software Engineering, 23(5):279–294, May 1997.
17. G. J. Holzmann. State compression in SPIN: Recursive indexing and compression training runs. In Proceedings of the Third International SPIN Workshop, Apr. 1997.
18. R. Iosif. Symmetry reduction criteria for software model checking. In Proceedings of the Ninth International SPIN Workshop, volume 2318 of Lecture Notes in Computer Science, pages 22–41. Springer-Verlag, Apr. 2002.
19. D. Latella, I. Majzik, and M. Massink. Automatic verification of a behavioural subset of UML statechart diagrams using the SPIN model-checker. Formal Aspects of Computing, 11(6):637–664, 1999.
20. E. A. Lee. Overview of the Ptolemy project. Technical Report UCB/ERL M01/11, University of California, Berkeley, Mar. 2001.
21. J. Lilius and I. P. Paltor. vUML: A tool for verifying UML models. In Proceedings of the 14th IEEE International Conference on Automated Software Engineering, 1999.
22. Robby, M. B. Dwyer, and J. Hatcliff. Bogor: An extensible and highly-modular model checking framework. In Proceedings of the 2003 ACM Symposium on Foundations of Software Engineering (FSE 2003), 2003.
23. Robby, M. B. Dwyer, and J. Hatcliff. Bogor website. http://www.cis.ksu.edu/bandera/bogor, 2003.
24. H. Sipma. Event correlation: A formal approach. Technical Report Draft, Stanford University, July 2002.
25. S. Vestal. MetaH user's manual. http://www.htc.honeywell.com/metah, 1998.
Equivalent Semantic Models for a Distributed Dataspace Architecture

Jozef Hooman (University of Nijmegen, Nijmegen, The Netherlands, and CWI, Amsterdam, The Netherlands; [email protected]; http://www.cs.kun.nl/∼hooman/)
Jaco van de Pol (CWI, Amsterdam, The Netherlands; [email protected]; http://www.cwi.nl/∼vdpol/)
Abstract. The general aim of our work is to support formal reasoning about components on top of the distributed dataspace architecture Splice. To investigate the basic properties of Splice and to support compositional verification, we have defined a denotational semantics for a basic Splice-like language. To increase the confidence in this semantics, an operational semantics has also been defined and shown to be equivalent to the denotational one using the theorem prover PVS. A verification framework based on the denotational semantics is applied to an example of top-down development and transparent replication.
1 Introduction
The general aim of our work is to support the development of complex applications on top of industrial software architectures by means of formal methods. As a particular example, we consider in this paper components on top of the software architecture Splice [Boa93,BdJ97], which has been devised at Thales Nederland (previously called Hollandse Signaalapparaten). It is used to build large and complex embedded systems such as command and control systems, process control systems, and air traffic management systems. The main goal of Splice is to provide a coordination mechanism between loosely-coupled heterogeneous components. In typical Splice applications it is essential to deal with large data streams from sensors, such as radars. Hence Splice should support real-time and high-bandwidth distribution of data. Another aim is to support fault-tolerance, e.g., it should be possible to replicate components transparently, i.e. without affecting the overall system behaviour. Splice is data-oriented, with distributed local databases based on keys. It uses the publish-subscribe paradigm to get loosely-coupled data producers and consumers. An important design decision is to have minimal overhead for data management, allowing a fast and cheap implementation that indeed allows huge data streams. For instance, Splice has no standard built-in mechanisms to ensure global consistency or global synchronization. If needed, this can be constructed for particular data types on top of the Splice primitives. Note that this is quite
different from Linda [Gel85] and JavaSpaces [FHA99], which have a central data storage. The latter also has a transaction mechanism to handle distributed access. Our aim is to reason about components of the distributed dataspace architecture Splice in a compositional way. This means that we want to deduce properties of the parallel composition of Splice components using only the specifications of the externally visible behaviour of these components. Compositionality supports verification of development steps during the development process. Many examples in the literature show that it is convenient to specify components using explicit assumptions about the environment. Concerning Splice, in [HH02] we propose a framework with an explicit assumption about the quality of data streams published by the environment and a similar commitment of the component about its produced data. When putting components in parallel, assumptions can be discharged if they are guaranteed by other components. Reasoning with assumption/commitment [MC81] or rely/guarantee [Jon83] pairs, however, easily leads to unsound reasoning. There is a danger of circular reasoning: two components which mutually discharge each other's assumptions, leading to incorrect conclusions. Hence it is important to prove the soundness of the verification techniques. Correctness of compositional verification rules is usually based on a denotational semantics which assigns a meaning to compound constructs based on the meaning of its constituents. Earlier work on the verification of Splice systems [HvdP02] was based on a complex semantics with environment actions and its correctness was not obvious. In this paper we define a denotational semantics for a simple Splice-like language which is more convenient as a basis for compositional reasoning using assumptions about the environment. It is, however, far from trivial that this semantics captures the intuitive understanding of the Splice architecture. Hence, we also provide a more intuitive operational semantics and prove formally that it is equivalent to the denotational one. Moreover, we show by a small example of transparent replication how the denotational framework can be used to support verification. Both versions of the semantics have been formulated in the language of the interactive theorem prover PVS [OSRSC01]. The equivalence result has been checked completely using PVS. Also the example application (transparent replication) has been verified in detail using PVS. Related to our semantic study is earlier work on the semantics of Splice-like languages, such as a transition system semantics for a basic language of write and read statements (without query) [BKBdJ98] and a comparison of semantic choices using an operational semantics [BKBdJ98,BKZ99]. In previous work on a denotational semantics for Splice [BHdJ00] the semantics of local storages is not very convenient for compositional verification; it is based on process identifiers and a partial order of read and write events with complex global conditions. New in this paper is an operational semantics that deals with local time stamps and their use for updating local databases. We define an equivalent denotational semantics which includes assumptions about the environment of a
[Figure 1 depicts several application processes, each reading from and writing to its own local database, with agents that send and receive data between the local databases.]
Fig. 1. Splice applications.
component. Moreover, we show that this denotational semantics forms the basis of a formal framework for specifying and verifying Splice applications. This paper is structured as follows. Section 2 contains a brief informal explanation of Splice. In Sect. 3 we present a formal syntax of the Splice primitives considered in this paper. The operational and denotational semantics of this language are defined in Sect. 4 and Sect. 5, respectively. The main outline of the equivalence proof can be found in Sect. 6. A framework for specification and verification is described in Sect. 7 and applied to an example with top-down design and transparent replication in Sect. 8. Finally, Sect. 9 contains a number of concluding remarks.
2 Informal Splice Introduction
The Splice architecture provides a coordination mechanism for concurrent components. Producers and consumers of data are decoupled. They need not know each other, and communicate indirectly via the Splice primitives; basically read and write operations on a distributed dataspace. This type of anonymous communication between components is strongly related to coordination languages such as Linda [Gel85] and JavaSpaces [FHA99]. These languages, however, have a single shared dataspace, whereas in Splice each component has its own dataspace, see Fig. 1. Communication between components takes place by means of local agents. A data producer writes data records to the other dataspaces via its agent. A data consumer uses its agent to subscribe to the required types of data; only data which matches this subscription is stored. Data items may be delayed and re-ordered and sometimes may even get lost. It is possible to associate certain quality-of-service policies with data delivery and data storage. For instance, for a particular data type, delivery may be guaranteed (each item is delivered at least once) or best effort (zero or more times). Data storage can be volatile, transient, or persistent.
Each data item within Splice has a unique sort, specifying the fields the sort consists of and defining the key fields [BdJ97]. In each local dataspace, at most one data item is present for each key. Basically, a newly received data item overwrites the current item with the same key (if any). To prevent old data items from overwriting newer information (recall that data may be delayed and re-ordered), data records include a time stamp field. A time stamp of a data item is obtained from the local clock of the data producer when the item is published. At the local storage of the consumer, data items are only overwritten if their time stamp is smaller than that of a newly arrived item (with the same key). This overwriting technique reduces memory requirements and allows a decoupling of frequencies between producers and consumers. It also reduces the number of updates to be performed on the dataspace, as not all received records get stored. The time stamps improve the quality of the data stored, as no record can be overwritten by older data. To program components on top of Splice, a Splice API can be called within conventional programming languages such as C and Java. Splice provides, for instance, constructs for retrieving (reading) data from the local dataspace, and for publishing (writing) data. Read actions contain a query on the dataspace, selecting data items that satisfy certain criteria. Data has a life-cycle attribute, such as “read”, “unread”, “new”, and “update”, which may be used for additional filtering.
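The per-key overwrite rule can be rendered in a few lines of Python; the Record type and its fields are illustrative, not part of the actual Splice API.

from typing import Dict, NamedTuple

class Record(NamedTuple):
    key: str      # e.g. a flight number
    value: str    # the remaining (non-key) fields of the sort
    ts: int       # time stamp from the producer's local clock

def store(db: Dict[str, Record], rec: Record) -> None:
    # A received record replaces the stored one for its key only when its
    # time stamp is strictly newer, so delayed old data never wins.
    old = db.get(rec.key)
    if old is None or old.ts < rec.ts:
        db[rec.key] = rec

db: Dict[str, Record] = {}
store(db, Record("KL567", "pos=2", ts=3))
store(db, Record("KL567", "pos=1", ts=1))  # delayed, re-ordered delivery
assert db["KL567"].value == "pos=2"        # the newer position is kept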
3 Syntax of a Simple Splice-Like Language
In this section we define the formal syntax of a very simple Splice-like language. We have embedded the basic Splice primitives in a minimal programming language to be able to highlight the essential features and to prove equivalences between various semantic definitions in a formal way. In Sect. 7.1 we show that it is easy to extend the language with, e.g., if-then-else and loop constructs. Life-cycle attributes are not included in the current version, because we detected some problems with the informal definition (as mentioned in Sect. 9). We consider only one sort. Let Data be some data domain, with a set KeyData of key data and a function key: Data → KeyData. Assume a given type LocalTime to represent values of local clocks (in our PVS representation we choose the natural numbers). The type DataItems of time-stamped data items consists of records with two fields: dat of type Data and ts of type LocalTime. A record of type DataItems can be written as (#dat := ..., ts := ...#), following the PVS notation. Hence, for di ∈ DataItems, we have dat(di) ∈ Data and ts(di) ∈ LocalTime. The function key is extended to DataItems by key(di) = key(dat(di)). Let SVars be a set of variables ranging over sets of elements from DataItems. A data expression e yields an element from Data (dependent on the current value of the program variables). Henceforth we typically use the following variables ranging over the types mentioned above:
– d, d0, d1, ... over Data
– lt, lt0, lt1, ... over LocalTime
– dati, dati0, dati1, ... over DataItems
– diset, diset0, diset1, ... over sets of DataItems
– x, x0, x1, ..., y, y0, y1, ... over SVars
A query q ⊆ ℘(DataItems) specifies sets of (time-stamped) data items (it may also depend on program variables). A few examples:
– q1 = {diset | for all dati ∈ diset : ts(dati) > 100}
– q2 = {diset | for all dati ∈ diset : dat(dati) ≠ 0}
– q3 = {diset | diset ≠ ø and diset ≠ x}
For simplicity, we do not give the syntax of data expressions and queries here. The syntax of our programming language is given in Table 1.
Table 1. Syntax of the Programming Language.

Sequential program   S ::= Write(e) | Read(x, q) | S1 ; S2
Process              P ::= S | P1 ∥ P2
Informally, the statements of this language have the following meaning:
– Write(e) publishes the data item with value e (in the current state) and the current time stamp (from the local clock). We model best effort delivery; a data item arrives zero or more times at each process, where it might be used to update the local storage if there is no current value with the same key which has a larger or equal time stamp.
– Read(x, q) assigns to x some set of data items from local storage that satisfies query q (if there are several sets satisfying q, the choice is non-deterministic). For instance, for query q1 above, we assign to x a set of data items from local storage such that each data item has time stamp greater than 100. If there are no such items in local storage, x becomes the empty set. Note that query q3 above does not allow the empty set; in that case the read statement blocks until the local storage contains a set of items satisfying the query. Hence a read statement may be blocking, depending on the query.
– S1 ; S2 : sequential composition of sequential programs S1 and S2.
– P1 ∥ P2 : parallel composition of processes. A process is a sequential program or a parallel composition of processes.
As a very simple example, consider a few producers and consumers of flight data. Let Data be a record with two fields: flightnr (a string, e.g. KL309) and pos, a position in some form, here a natural number for simplicity. The flight number is the key, that is, key(dati) = flightnr(dati). Consider a producer of flight data
P1 = Write((#flightnr := KL567, pos := 1#)) ;
     Write((#flightnr := LU321, pos := 6#)) ;
     Write((#flightnr := KL567, pos := 2#)) ;
     Write((#flightnr := KL567, pos := 3#))

and two consumers:

C1 = Read(x1, true) ; Read(y1, q1) ; Read(z1, q1)
C2 = Read(x2, q1) ; Read(y2, q2)

whose queries are specified as follows:

q1 = {diset | diset ≠ ø and for all dati ∈ diset : flightnr(dati) = KL567}
q2 = {diset | diset ≠ ø and for all dati ∈ diset : flightnr(dati) = KL567 and
              for all dati1 ∈ x2 : ts(dati) > ts(dati1)}

Consider the process P1 ∥ C1 ∥ C2 and assume there are no other producers of data. Note that the producer does not specify the local time stamp explicitly; this is added implicitly. Recall that the items produced by P1 may arrive in a different order at the consumers, and they may arrive several times. Variable x1 may be empty (if no data item has been delivered yet – note that this read is not blocking) or a set with one or two elements, at most one for each flight number. For instance, it may contain position number 3 for KL567. Variable y1 will be a singleton, since the local storage contains at most one item with flight number KL567 and the second read is blocking (the query requires a non-empty set). If there is a position for KL567 in x1, then the position in y1 will be greater or equal (lower values are produced earlier, hence have a smaller local clock value, and thus they cannot overwrite greater values). Similarly for z1, where the position is greater or equal to the one in y1. It is possible that z1 = y1. For consumer C2 the second read action requires a newer time stamp, hence we always have y2 ≠ x2 and the position in y2 is at least 2.
4 Operational Semantics
We define an operational semantics for a process S1 ∥ ... ∥ Sn of the syntax of Sect. 3, where the Si are sequential programs. We first define an operational status of a sequential process (Def. 1) and a configuration (Def. 3) which represents the state of affairs during operational execution of a process. Next computation steps are defined (Def. 5), leading to the operational semantics (Def. 6). Let DataBases be the type consisting of sets of data items with at most one item for each key, i.e.

DataBases = {diset ⊆ DataItems | for all dati1, dati2 ∈ diset : key(dati1) = key(dati2) → dati1 = dati2}

Definition 1 (Operational Status). An operational status of a sequential process, denoted os, os0, os1, ..., is a record with three fields, st, clock and db:
– st : SVars → ℘(DataItems), represents the local state, assigning to each variable a set of data items;
– clock ∈ LocalTime, the value of the local clock;
– db ∈ DataBases, the local database, a set of data items (at most one for each key) representing local storage.

Definition 2 (Variant). The variant of local state st with respect to variable x ∈ SVars and value diset ⊆ DataItems, denoted st[x := diset], is defined as
(st[x := diset])(y) = diset if y = x, and (st[x := diset])(y) = st(y) if y ≠ x.
Similarly, the variant of a record r with fields f1 ... fm is defined by
r[f := v](fi) = v if fi = f, and r[f := v](fi) = fi(r) if fi ≠ f.

Produced data items are sent to an underlying network. Here this is represented by N, a set of data items, i.e. N ⊆ DataItems. Note that we do not use a multiset, although a particular item might be produced several times. The use of a set is justified by the fact that data items may always be delivered more than once and hence are never deleted from N.

Definition 3 (Configuration). The state of affairs of a process S1 ∥ ... ∥ Sn during execution is represented by a configuration:

(S1, os1), ..., (Sn, osn), N

It denotes for each sequential process Si the status osi and the remaining part Si that still has to be executed, and the current contents N of the network. For convenience, we introduce the empty statement E indicating that the process has terminated. An execution of S1 ∥ ... ∥ Sn is represented by a sequence of configurations C0 −→ C1 −→ C2 −→ ... where C0 = (S1 ; E, os1), ..., (Sn ; E, osn), ø with, for all i, db(osi) = ø. The idea is that each step in the sequence C0 −→ C1 −→ C2 −→ ... represents the execution of an atomic action by some process i. We define the update of a database.

Definition 4 (Update Database). The update of database db using a new database db1, denoted UpdateDb(db, db1), is defined by: dati ∈ UpdateDb(db, db1) iff
– either dati ∈ db and for all dati1 ∈ db1 with key(dati1) = key(dati) we have ts(dati1) ≤ ts(dati),
– or dati ∈ db1 and for all dati0 ∈ db with key(dati0) = key(dati) we have ts(dati0) < ts(dati).

Computation steps are defined formally as follows.

Definition 5 (Computation Step). We have a step of process i, i.e.,

(S1, os1), ..., (Si, osi), ..., (Sn, osn), N −→ (S1, os1), ..., (Si′, osi′), ..., (Sn, osn), N′

iff one of the following clauses holds:

(Update) Si ≠ E and Si′ = Si, N′ = N, st(osi′) = st(osi), clock(osi′) = clock(osi), and there exists a database db1 ⊆ N that is used to update
db(osi) such that an element of db1 is added if its key is not yet present and, otherwise, it replaces an element of db(osi) with the same key if its local time stamp is strictly greater. Formally, using Def. 4, there exists a database db1 ⊆ N such that db(osi′) = UpdateDb(db(osi), db1). For simplicity, we did not change the local clock (it is not needed), but alternatively we might require clock(osi′) ≥ clock(osi). Note that the network has not been changed, since data items might be used several times for an update (modeling the fact that an item might be delivered by the network several times).

(Write) Si = Write(e) ; Si′, st(osi′) = st(osi), db(osi′) = db(osi), clock(osi′) > clock(osi), and N′ = N ∪ {(v, clock(osi))} where v is the value of e in the current state. Note that, given a syntax of expressions, it would be easy to define the values of expressions and queries in the current status (see, e.g. [dRdBH+01]).

(Read) Si = Read(x, q) ; Si′, N′ = N, db(osi′) = db(osi), clock(osi′) = clock(osi), and there exists a set of data items diset ⊆ db(osi) satisfying query q and such that st(osi′) = st(osi)[x := diset], i.e. assigning diset to x.

For two configurations C1 and C2, define C1 −→^k C2, for k ∈ IN, by C1 −→^0 C1 and C1 −→^(k+1) C2 iff there exists a configuration C such that C1 −→^k C and C −→ C2. Define C1 −→* C2 iff there exists a k ∈ IN such that C1 −→^k C2. Typically, the operational semantics yields some abstraction of the execution sequence, depending on what is observable. Here we postulate that only the set of produced data items in the last configuration of an execution sequence is (externally) observable.

Definition 6 (Operational Semantics). The operational semantics of a program S1 ∥ ... ∥ Sn, given an initial operational status os0, is defined by

O(S1 ∥ ... ∥ Sn)(os0) = {N ⊆ DataItems | (S1 ; E, os0), ..., (Sn ; E, os0), ø −→* (E, os1), ..., (E, osn), N, with db(os0) = ø}

Thus the operational semantics of a program yields a set of sets of produced data items, where each set of produced data items represents a possible execution of the program.

Example 1. Let 0 be the query specifying that the value 0 should be read, and similarly for 1. Observe that, for any os0, O((Read(x, 0) ; Write(1)) ∥ (Read(x, 1) ; Write(0)))(os0) = ø.
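Definition 4 translates directly into executable form. The following Python sketch represents data items as (key, dat, ts) triples and assumes that both arguments are databases, i.e. contain at most one item per key.

from typing import Set, Tuple

DataItem = Tuple[str, str, int]  # (key, dat, ts)

def update_db(db: Set[DataItem], db1: Set[DataItem]) -> Set[DataItem]:
    result: Set[DataItem] = set()
    for (k, d, ts) in db:
        # keep a stored item unless db1 has a strictly newer item for its key
        if all(ts1 <= ts for (k1, _, ts1) in db1 if k1 == k):
            result.add((k, d, ts))
    for (k, d, ts) in db1:
        # add an incoming item if it is strictly newer than any stored item
        # with the same key (or if the key is not yet present)
        if all(ts0 < ts for (k0, _, ts0) in db if k0 == k):
            result.add((k, d, ts))
    return result

assert update_db({("a", "old", 2)}, {("a", "new", 3), ("b", "x", 1)}) == \
       {("a", "new", 3), ("b", "x", 1)}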
5 Denotational Semantics
We define the denotational semantics of a program, given an initial status, i.e. the state of affairs when execution starts. To support our aim to reason with assumptions about the items produced by the environment, such assumptions are
included in the status. The semantics yields a set of statuses, each representing a possible execution of the program. To achieve compositionality and to describe a process in isolation, it is quite common that information has to be added to the status to express relations with the environment explicitly. Here we add, e.g., the set of written data items and the set of items that are assumed to be produced by the environment. Moreover, we need a way to represent causality between the written items. As shown below, this is needed to assign a correct meaning to the program (Read(x, 0) ; Write(1)) ∥ (Read(x, 1) ; Write(0)). Here this is achieved by the use of conceptual logical clocks that are added to the status of each process. They are updated similarly to Lamport's logical clocks [Lam78], which ensures that there exists a global, total order on the produced items. Let Time be the domain of logical clock values; here we use the natural numbers. We add a logical clock value to the data items produced. Type ExtDataItems, with typical variables edi, edi0, edi1, ..., consists of records with two fields:
– ownw(s) equals the union of ownw(s0) and the items written by prog;
– envw(s) = envw(s0); the field envw is only used by prog to update its local storage.

Although all items are available initially, constraints on logical clocks prevent the use of items “too early”. Next we define M(prog) by induction on the structure of prog.

Write. In the semantics of the write statement the published item, which carries the current value of the local clock as its local time-stamp, is added to the ownw field together with a logical clock value. The local clock is increased to ensure that subsequently written items get a later time stamp, and similarly the logical clock is increased.

M(Write(e))(s0) = {s | clock(s) > clock(s0), time(s) > time(s0),
  ownw(s) = ownw(s0) ∪ {(#di := dati, tm := time(s)#)},
  where dati = (#dat := v, ts := clock(s0)#), with v the value of e in s0,
  and s equals s0 for the other fields (st, db and envw)}

Read. To define the meaning of a read statement, we first introduce an auxiliary Update function which may update the local database with data items written (by the process itself or by its environment), using UpdateDb of Def. 4.

Update(s0) = {s | there exists an ediset ⊆ ownw(s0) ∪ envw(s0) and a database db1 ⊆ di(ediset), such that db(s) = UpdateDb(db(s0), db1), time(s) ≥ time(s0), for all edi ∈ ediset we have tm(edi) < time(s), and s equals s0 for the other fields (st, clock, ownw and envw)}

Then the read statement Read(x, q) first updates the local storage and next assigns to x a set of data items that satisfies the query q.

M(Read(x, q))(s0) = {s | there exist s1 ∈ Update(s0) and diset ⊆ db(s1) such that diset satisfies query q and st(s) = st(s1)[x := diset], and s equals s1 for the other fields (clock, db, time, ownw and envw)}

Note that we only represent terminating executions; blocking has not been modeled explicitly.

Sequential Composition. Since we only model terminating executions, the meaning of the sequential composition S1; S2 is defined by applying the meaning of S2 to any status that results from executing S1. In Sect. 7.1 we show how this can be extended to deal with non-terminating programs.

M(S1; S2)(s0) = {s | there exists s1 with s1 ∈ M(S1)(s0) ∧ s ∈ M(S2)(s1)}

Parallel Composition. To define parallel composition, let init(s0) be the condition db(s0) = ø ∧ ownw(s0) = ø. Moreover, we use s + ediset to add a set
ediset ⊆ ExtDataItems to the environment writes of s, i.e. envw(s + ediset) = envw(s) ∪ ediset, and all other fields of s remain the same. In the semantics of P1 ∥ P2, starting in initial status s0, the main observation is that envw(s0) contains only the data items produced outside P1 ∥ P2. Hence the semantic function for P1 is applied to s0 where we add the items written by P2 to the environment writes; similarly for P2. Then parallel composition is defined as follows:

M(P1 ∥ P2)(s0) = {s | init(s0) and there exist s1 and s2 with
  s1 ∈ M(P1)(s0 + ownw(s2)),
  s2 ∈ M(P2)(s0 + ownw(s1)),
  ownw(s) = ownw(s1) ∪ ownw(s2), envw(s) = envw(s0)}

Example 2. Consider again the program of Example 1: (Read(x, 0); Write(1)) ∥ (Read(x, 1); Write(0)). Without using logical clocks, the semantics of parallel composition would allow for this program a status where envw = ø and ownw contains 0 and 1 (each component produces the item required by the other one). This, however, does not correspond to the operational semantics, which yields the empty set. In the current version of the semantics, we can indeed show that, for any s0, M((Read(x, 0); Write(1)) ∥ (Read(x, 1); Write(0)))(s0) = ø.
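To see informally why the logical clocks rule out the circular behaviour of Example 2 (this is our own sketch of the argument, not the PVS proof): in P1, the Update step preceding the read of item 0 requires tm(0) to be strictly below the logical time of the read, and the subsequent Write(1) stamps item 1 with a strictly later logical time. This gives

\[
tm(0) < tm(1) \quad (\text{from } P_1), \qquad tm(1) < tm(0) \quad (\text{from } P_2),
\]

which is contradictory, so no status can satisfy the defining equations of M(P1 ∥ P2) for this program.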
6 Equivalence of Denotational and Operational Semantics
In this section we first define what it means for the operational and the denotational semantics of Sect. 4 and 5, respectively, to be equivalent. Next we give an outline of how we proved this equivalence formally. Note that equivalence is far from trivial, since there are a number of prominent differences:

– The operational semantics allows updates of the local database at any point in time, whereas in the denotational semantics we minimized the number of updates to keep verification simple, allowing an update only once, immediately before reading items.
– The parallel composition of the denotational semantics is defined by a few recursive equations, and it is not clear whether this indeed corresponds to operational execution.
– The denotational semantics has additional fields in a status, and the read statement contains an additional check on logical clock values.
– The underlying network is modeled in different ways.

Equivalence is based on what is externally observable, i.e. two semantic functions are equivalent if they assign the same observable behaviour to any program. For the denotational semantics, we choose the same notion of observable behaviour as has been used in the operational semantics, namely the set of published data items. For a set D of denotational statuses, define the observations
by Obs(D) = {di(ownw(s)) | s ∈ D}. Define for an n-tuple (s1, ..., sn) of statuses Obs(s1, ..., sn) = di(∪1≤i≤n ownw(si)), and for a set T of such tuples Obs(T) = {Obs(s1, ..., sn) | (s1, ..., sn) ∈ T}.

To relate the operational and the denotational semantics, we use a function Ext to extend an operational status to a status of the denotational semantics; Ext(os) copies the fields st, clock, and db of os, and it sets the field time to 0 and the fields ownw and envw to ø. This leads to the main theorem.

Theorem 1. O(S1 ∥ ... ∥ Sn)(os) = Obs(M(S1 ∥ (S2 ∥ (... ∥ Sn)...))(Ext(os)))

We give an outline of the proof, which has been checked completely using the interactive theorem prover PVS. Below we only give the main steps, for instance ignoring details about initial conditions. The proof uses three intermediate versions of the semantics and corresponding lemmas:

– M′, which is the same as M except that also the write-statement is preceded by an update action.

Lemma 1. Obs(M(S1 ∥ (S2 ∥ (... ∥ Sn)...))(Ext(os))) = Obs(M′(S1 ∥ (S2 ∥ (... ∥ Sn)...))(Ext(os)))

Note that M and M′ are defined for the parallel composition of two processes, but we derive a similar formulation for n processes:

Lemma 2. Obs(M′(S1 ∥ (S2 ∥ (... ∥ Sn)...))(Ext(os))) = Obs({(s1, ..., sn) | init(Ext(os)), envw(Ext(os)) = ø, and for all i, 1 ≤ i ≤ n, si ∈ M′(Si)(Ext(os)[envw := ∪j≠i ownw(sj)])})

– OD, which extends the operational semantics O to the status of the denotational semantics (adding time, ownw and envw). Moreover, the network N is removed. This is achieved by defining the atomic steps of a single sequential process as (S, s) −→ediset (S′, s′), where ediset represents the set of items written in the step (a singleton if S starts with a write statement, the empty set otherwise). This also includes update steps, similar to the update inside a read statement of the denotational semantics, so including a condition on the values of logical clocks. Based on this step relation for one process, we have a step of n parallel processes, denoted (S1, s1), ..., (Sn, sn) −→ (S′1, s′1), ..., (S′n, s′n), if there exists an i with (Si, si) −→ediset (S′i, s′i), and for all j ≠ i, S′j = Sj and s′j = sj + ediset. This leads to the following semantics for n processes:

OD(S1 ∥ ... ∥ Sn)(s0) = {(s1, ..., sn) | init(s0) ∧ (S1; E, s0), ..., (Sn; E, s0) −→∗ (E, s1), ..., (E, sn)}

Lemma 3. O(S1 ∥ ... ∥ Sn)(os) = Obs(OD(S1 ∥ ... ∥ Sn)(Ext(os)))

– OS, which extends the operational semantics of a sequential process to the status of the denotational semantics, using the relation (S, s) −→ediset (S′, s′) mentioned above.

Lemma 4. OS(S) = M′(S), for sequential programs S.
Now, by Lemmas 1, 2, 3, and 4, Theorem 1 reduces to the following lemma.

Lemma 5. Obs(OD(S1 ∥ ... ∥ Sn)(Ext(os))) = Obs({(s1, ..., sn) | init(Ext(os)), envw(Ext(os)) = ø, and for all i, 1 ≤ i ≤ n, si ∈ OS(Si)(Ext(os)[envw := ∪j≠i ownw(sj)])})

Proof. We give the main ideas of the proof, showing that the sets are contained in each other.

⊆ Suppose (s1, ..., sn) ∈ OD(S1 ∥ ... ∥ Sn)(Ext(os)), that is, (S1; E, Ext(os)), ..., (Sn; E, Ext(os)) −→∗ (E, s1), ..., (E, sn). We have proved by induction on the number of steps in the execution sequence that this implies, for all i, 1 ≤ i ≤ n, (Si; E, Ext(os)[envw := ∪j≠i ownw(sj)]) −→ediset1 ... −→edisetk (E, si), that is, si ∈ OS(Si)(Ext(os)[envw := ∪j≠i ownw(sj)]).

⊇ Assume, for all i, 1 ≤ i ≤ n, that si ∈ OS(Si)(Ext(os)[envw := ∪j≠i ownw(sj)]). Thus we have an operational execution (Si; E, Ext(os)[envw := ∪j≠i ownw(sj)]) −→ediset1 ... −→edisetk (E, si) for each of the sequential processes. We have to show (S1; E, Ext(os)), ..., (Sn; E, Ext(os)) −→∗ (E, s1), ..., (E, sn), i.e., we have to show that these sequential executions can be merged into a global execution sequence for the parallel program. Basically, this is done by induction on the total number of steps in all sequential executions. The global execution sequence is constructed by selecting for each step the process with the lowest logical time after its next possible step. This construction is far from trivial, since the local, sequential executions start with all available environment writes, whereas in the global execution a process can only use what has been produced up to the current moment. However, the constraints on the logical clocks (which have been included in this extended operational semantics) ensure that only items produced before its current logical time are used. Formally, this is captured by the property that if (S, s + ediset) −→ediset1 (S′, s′) (representing a step of a local process) and for all edi ∈ ediset, tm(edi) ≥ time(s′), then (S, s) −→ediset1 (S′, s′[envw := envw(s)]) (i.e. it can be used in the global sequence). Since we execute the process with the earliest logical time, we can also show that all items produced later cannot have a smaller logical time stamp.
7 Verification Framework
In this section we provide a framework that can be used to specify and verify processes, as shown in Sect. 8. First, in Sect. 7.1, the programming language is extended with a number of useful constructs. Section 7.2 contains the main specification and verification constructs.
7.1 Language Extensions

To deal with some more interesting examples, the simple programming language of Sect. 3 is extended with an assignment, an if-then-else construct, and infinite loops. Accordingly, the denotational semantics of Sect. 5 is extended. Since infinite loops introduce non-terminating computations, we add one field to the status:

– term ∈ {true, false}: indicates termination of the process; if it is false, all subsequent statements are ignored.

Henceforth, we assume that s0 is such that term(s0) = true, i.e. after s0 we can still execute subsequent statements. The definition of sequential composition has to be adapted, since it is possible that the first process does not terminate and thus prohibits execution of the second process.

M(S1; S2)(s0) = {s | s ∈ M(S1)(s0) ∧ ¬term(s)} ∪
                {s | there exists an s1 with s1 ∈ M(S1)(s0) ∧ term(s1) ∧ s ∈ M(S2)(s1)}

The meaning of the skip statement can be defined easily: M(Skip)(s0) = {s0}. Similarly we can easily define x := e, where x ∈ SVars and e is an expression yielding a set of data items. We also add variables that range over data items and define assignments for such variables. Next we define an if-then-else statement, where b is a boolean expression.

M(If b Then S1 Else S2 Fi)(s0) = {s | if b is true in st(s0) then s ∈ M(S1)(s0) else s ∈ M(S2)(s0)}

Finally, we define an infinite loop by means of an infinite sequence of statuses s0, s1, s2, ..., where si is the result of executing the loop body i times, provided all these executions terminate; otherwise the term-field of si is false. The written items are collected by taking the union of the produced items in each execution of the body. Note that only the own and environment writes are relevant.

M(Do S Od)(s0) = {s | there exists a sequence s0, s1, s2, ... such that for all i, if term(si) then si+1 ∈ M(S)(si) else term(si+1) = false, and ownw(s) = ∪{i | term(si)} ownw(si+1) and envw(s) = envw(s0)}

7.2 Specification and Verification
To obtain a convenient specification and verification framework, we define a mixed formalism in which one can freely mix programs and specifications, based on earlier work [Hoo94]. Specifications are part of the program syntax; let p, p0, p1, ..., q, q0, q1, ... be assertions, that is, predicates over statuses. A specification is a “program” of the form Spec(p, q) with the following meaning.
M(Spec(p, q))(s0) = {s | (p(s0) implies q(s)) and envw(s) = envw(s0)}

Next we define a refinement relation ⇒ between programs (which now may include specifications).

Definition 7 (Refinement). For any two programs P1, P2, we define P1 ⇒ P2 iff for all s0 we have M(P1)(s0) ⊆ M(P2)(s0).

Note that it is easy to prove that the refinement relation is reflexive and transitive. We have the usual consequence rule.

Lemma 6 (Consequence). If p → p0 and q0 → q then Spec(p0, q0) ⇒ Spec(p, q).
Based on the denotational semantics for Splice, we checked in PVS the soundness of a number of proof rules for programming constructs. For instance, for sequential composition we have a composition rule and a monotonicity rule which allows refinements in a sequential context.

Lemma 7 (Sequential Composition). (Spec(p, r); Spec(r, q)) ⇒ Spec(p, q).

Lemma 8 (Monotonicity of Sequential Composition). If P3 ⇒ P1 and P4 ⇒ P2 then (P3; P4) ⇒ (P1; P2).

The reasoning about parallel composition in PVS mainly uses the semantics directly. We only have a monotonicity rule, which forms the basis of stepwise refinement of components.

Lemma 9 (Monotonicity of Parallel Composition). If P3 ⇒ P1 and P4 ⇒ P2 then (P3 ∥ P4) ⇒ (P1 ∥ P2).
8 Verification Example
To illustrate our reasoning framework, we first show top-down development of a typical application consisting of a producer, a transformer, and a consumer in Sect. 8.1. Next, in Sect. 8.2, we consider transparent replication. The general question is whether we can replace a single process by two, or more, copies of the same process without having to change the environment. So, given the monotonicity rules mentioned above, we look for sufficient conditions for having P ∥ P ⇒ P. We investigate this for the concrete case study developed in Sect. 8.1.

8.1 Top-Down Development
Here we consider a concrete and simple case study, which is prototypical for the application area, with the following processes:

– Producer: provides monotonically increasing data (here simply natural numbers) with name SensData.
– Transformer: assuming it gets increasing SensData items, it provides monotonically increasing data with name Intern.
– Consumer: assuming the environment provides increasing data with name Intern, it produces monotonically increasing Display items.

To formalize this, we define the following types and functions:

– DataName = {SensData, Intern, Display}, with typical variable dn.
– DataVal = IN.
– Data is a type of records with two fields: name of type DataName and val of type DataVal. Let dvar be a variable of type Data.
– KeyData = DataName and key(dvar) = name(dvar).

To formulate the specifications, first a few preliminary definitions are needed, where wset is a set of extended data items:

– NameOwnw(dn)(s) = ∀edi ∈ ownw(s) : name(edi) = dn
– SensData(wset) = {edi | edi ∈ wset ∧ name(edi) = SensData}; similarly, we have Intern(wset) and Display(wset).
– Increasing(wset) = ∀edi1 ∈ wset, edi2 ∈ wset : (val(edi1) < val(edi2) ↔ ts(edi1) < ts(edi2))

Let pre be an assertion expressing that db = ø, ownw = ø, and all variables ranging over sets of data items are initialized to the empty set. It is used as a precondition for the processes. The top-level specification of the overall system is defined as follows.

postTopLevel(s) = (envw(s) = ø → Increasing(Display(ownw(s))))
TopLevel = Spec(pre, postTopLevel)

We implement this top-level specification by the parallel composition of the three processes mentioned above: Producer ∥ Transformer ∥ Consumer. We first specify the processes, without considering their implementation, and show that these specifications lead to the top-level specification. The producer writes an increasing sequence of sensor data.

postProd(s) = NameOwnw(SensData)(s) ∧ Increasing(ownw(s))
Prod = Spec(pre, postProd)

The consumer should produce an increasing sequence of Display data, assuming the environment provides increasing internal values.

postCons = NameOwnw(Display)(s) ∧ (Increasing(Intern(envw(s))) → Increasing(Display(ownw(s))))
Cons = Spec(pre, postCons)

To connect producer and consumer, we specify the transformer as follows.
postTrans = NameOwnw(Intern)(s) ∧ (Increasing(SensData(envw(s))) → Increasing(Intern(ownw(s))))
Trans = Spec(pre, postTrans)

We have proved in PVS that this leads to a correct refinement of the top-level specification.

Theorem 2. (Prod ∥ (Trans ∥ Cons)) ⇒ TopLevel

Next we consider the implementation of the three components. Let d be a variable ranging over Data, whereas dset and dold are variables ranging over sets of data items.

Producer = d := (#name := SensData, val := 0#) ;
           Do Write(d) ; d := (#name := SensData, val := val(d) + 1#) Od

Note that the producer writes all natural numbers, which is not required by the specification. Also note that subscribers to data with name SensData will usually read only an (increasing) subsequence of these items. We have proved in PVS that this is indeed a correct refinement.

Lemma 10. Producer ⇒ Prod

Similarly we provide a program for the transformer and show that it is a correct implementation. Let q(dn, old) be the query that specifies a set of data items with name dn and local time stamp different from those in variable old (which is initially empty). The set is allowed to be empty, to avoid blocking computations; otherwise it will be a singleton with a new, unread item. The item that has been read is transformed using a function Tr(dn, dset) which, for simplicity, just changes the name of the record of the item in dset into dn.

Transformer = Do Read(dset, q(SensData, old)) ;
              If dset ≠ ø Then Write(Tr(Intern, dset)) ; old := dset Else Skip Fi Od

Lemma 11. Transformer ⇒ Trans

Similarly for the consumer. For simplicity, we used the same name transformation, which makes it possible to derive the correctness of transformer and consumer from a single proof that was parameterized by the name of the data sort.

Consumer = Do Read(dset, q(Intern, old)) ;
           If dset ≠ ø Then Write(Tr(Display, dset)) ; old := dset Else Skip Fi Od

Lemma 12. Consumer ⇒ Cons

Finally observe that top-level Theorem 2 and Lemmas 10, 11, and 12 for the components lead by the monotonicity property (Lemma 9) to (Producer ∥ (Transformer ∥ Consumer)) ⇒ TopLevel.
8.2 Transparent Replication
While investigating transparent replication, we observed that the current version of the transformer specification is not suitable. The next lemma expresses that we cannot replicate the transformer in a transparent way.

Lemma 13. ¬((Trans ∥ Trans) ⇒ Trans)

The lemma has been proved by a concrete counterexample in PVS. The basic idea is that the two transformers each have increasing output, but, because they may write this output at different moments, the local time stamps in these sequences might differ, and hence merging these output streams need not lead to an increasing sequence. The main problem is that the items produced get their local time stamp from the local clock when the item is written. This need not be related to the temporal validity of the data. Hence we propose an alternative specification for the transformer where the local time stamp is just copied from the incoming data item (as we will see later, this also requires a modified write statement).

MaintainTs(s) = ∀edi ∈ ownw(s) : ∃edi1 ∈ envw(s) : ts(edi) = ts(edi1) ∧ name(edi1) = SensData ∧ val(edi) = val(edi1)
postTransNew = NameOwnw(Intern) ∧ MaintainTs
TransNew = Spec(pre, postTransNew)

This indeed refines the transformer specification:

Lemma 14. TransNew ⇒ Trans

Moreover, this specification can indeed be replicated.

Lemma 15 (Transformer Replication). (TransNew ∥ TransNew) ⇒ TransNew

Hence we have (by Theorem 2, Lemma 14 and monotonicity): (Prod ∥ (TransNew ∥ Cons)) ⇒ TopLevel, and by Lemma 15 and monotonicity: (Prod ∥ ((TransNew ∥ TransNew) ∥ Cons)) ⇒ TopLevel.

It remains to implement TransNew. An important part of this specification is that it just copies the local time stamp (the ts-field). This, however, cannot be implemented with the current write statement, since it always uses the current value of the local clock as its time stamp (the ts-field is set to clock(s0)). Therefore we introduce a new write statement Write(e, texp) which has an additional parameter, a time expression texp, which specifies the ts value, i.e. in the semantics of this extended write statement the ts-field in the data item gets the value of texp. Query Qtrans specifies a set of data items that is either empty or a singleton containing an item with name SensData. Expression MkIntern(dset) changes the name of the item in dset to Intern and copies its value. Now the write statement has a time expression Ts(dset) which just yields the ts-field of the data item in dset.
NewTransformer = Do Read(dset, Qtrans) ;
                 If dset ≠ ø Then Write(MkIntern(dset), Ts(dset)) Else Skip Fi Od

Lemma 16. NewTransformer ⇒ TransNew

So, finally, we have (Producer ∥ (NewTransformer ∥ Consumer)) ⇒ TopLevel and (Producer ∥ ((NewTransformer ∥ NewTransformer) ∥ Consumer)) ⇒ TopLevel.
9 Concluding Remarks
We have defined a new denotational semantics for the main primitives of the industrial software architecture Splice. This architecture is data-oriented and based on local storages of data items. Communication between components is anonymous, based on the publish-subscribe paradigm. Especially new is the modeling of time stamps, based on local clocks, and the update mechanism of local storages based on these time stamps. Moreover, the denotational semantics supports convenient compositional verification, based on assumptions about the data items produced by the environment of a component. Causality between data items is represented by logical time stamps. To simplify verification, we minimized the number of updates of the local storages and tried a short, but non-trivial, formulation of parallel composition.

To increase the confidence in this denotational semantics, we also formulated a rather straightforward operational semantics. Using the interactive theorem prover PVS, we formally showed that the operational semantics is equivalent to the denotational one. The proof revealed a number of errors in earlier versions of the semantics, e.g. concerning the precise interpretation of logical time stamps representing causality. As a side-effect, the proof resulted in a number of useful properties: classical ones, such as associativity of sequential and parallel composition, but also more Splice-oriented properties such as idempotence of updates. We also observed that the current definition of the notion of a lifecycle in Splice breaks idempotence of updates. Since it indicates a conceptual error in the definition, we have removed the lifecycle from the current formalization.

A topic of current work concerns the equivalence classes induced by the semantics, i.e. which programs obtain the same semantics. The question is whether the denotational semantics is fully abstract with respect to the operational one, that is, does it only distinguish those programs that are observably different in some context?

We have shown how the semantics can be used as a basis for a formal framework for specification and compositional verification. The study of transparent replication clarified the use of local time stamps and led to an improvement of the write primitive. In future work we intend to investigate the use of clock synchronization.
References

[BdJ97] M. Boasson and E. de Jong. Software architecture for large embedded systems. IEEE Workshop on Middleware for Distributed Real-Time Systems and Services, 1997.
[BHdJ00] R. Bloo, J. Hooman, and E. de Jong. Semantical aspects of an architecture for distributed embedded systems. In Proc. of the 2000 ACM Symposium on Applied Computing (SAC 2000), volume 1, pages 149–155. ACM Press, 2000.
[BKBdJ98] M.M. Bonsangue, J.N. Kok, M. Boasson, and E. de Jong. A software architecture for distributed control systems and its transition system semantics. In Proc. of the 1998 ACM Symposium on Applied Computing (SAC '98), pages 159–168. ACM Press, 1998.
[BKZ99] M.M. Bonsangue, J.N. Kok, and G. Zavattaro. Comparing coordination models based on shared distributed replicated data. In Proc. of the 1999 ACM Symposium on Applied Computing (SAC '99). ACM Press, 1999.
[Boa93] M. Boasson. Control systems software. IEEE Transactions on Automatic Control, 38(7):1094–1106, July 1993.
[dRdBH+01] W.P. de Roever, F. de Boer, U. Hannemann, J. Hooman, Y. Lakhnech, M. Poel, and J. Zwiers. Concurrency Verification: Introduction to Compositional and Noncompositional Methods. Cambridge Tracts in Theoretical Computer Science. Cambridge University Press, 2001.
[FHA99] E. Freeman, S. Hupfer, and K. Arnold. JavaSpaces: Principles, Patterns, and Practice. Addison-Wesley, Reading, MA, USA, 1999.
[Gel85] D. Gelernter. Generative communication in Linda. Transactions on Programming Languages and Systems, 7(1):80–112, 1985.
[HH02] U. Hannemann and J. Hooman. Formal reasoning about real-time components on a data-oriented architecture. In Proc. of the 6th World Multiconference on Systemics, Cybernetics and Informatics (SCI 2002), volume XI, pages 313–318, 2002.
[Hoo94] J. Hooman. Correctness of real time systems by construction. In Formal Techniques in Real-Time and Fault-Tolerant Systems, pages 19–40. LNCS 863, Springer-Verlag, 1994.
[HvdP02] J. Hooman and J. van de Pol. Formal verification of replication on a distributed data space architecture. In Proc. of the 2002 ACM Symposium on Applied Computing (SAC 2002), pages 351–358, 2002.
[Jon83] C.B. Jones. Tentative steps towards a development method for interfering programs. ACM Transactions on Programming Languages and Systems, 5(4):596–619, 1983.
[Lam78] L. Lamport. Time, clocks, and the ordering of events in a distributed system. Communications of the ACM, 21(7):558–565, 1978.
[MC81] J. Misra and K.M. Chandy. Proofs of networks of processes. IEEE Transactions on Software Engineering, 7(7):417–426, 1981.
[OSRSC01] S. Owre, N. Shankar, J.M. Rushby, and D.W.J. Stringer-Calvert. PVS System Guide. SRI International, Computer Science Laboratory, Menlo Park, CA, version 2.4 edition, December 2001. http://pvs.csl.sri.com.
Java Program Verification Challenges

Bart Jacobs, Joseph Kiniry, and Martijn Warnier

Dep. Comp. Sci., Univ. Nijmegen, P.O. Box 9010, 6500 GL Nijmegen, The Netherlands
{bart,kiniry,warnier}@cs.kun.nl
http://www.cs.kun.nl/∼{bart,kiniry,warnier}
Abstract. This paper aims to raise the level of verification challenges by presenting a collection of sequential Java programs with correctness annotations formulated in JML. The emphasis lies more on the underlying semantical issues than on verification.
1 Introduction
In the study of (sequential) program verification one usually encounters the same examples over and over again (e.g., stacks, lists, alternating bit protocol, sorting functions, etc.), often going back to classic texts like [11,14,15]. These examples typically use an abstract programming language with only a few constructs, and the logic for expressing the program properties (or specifications) is some variation on first order logic. While these abstract formalisms were useful at the time for explaining the main ideas in this field, they are not very helpful today when it comes to actual program verification for modern programming and specification languages.

The aim of this paper is to give an updated collection of program verification examples. They are formulated in Java [13], and use the specification language JML [19,21]. The examples presented below are based on our experience with the LOOP tool [3] over the past five years. Although the examples have actually been verified, the particular verification technology based on the LOOP tool does not play an important rôle: we shall focus on the underlying semantical issues. The examples may in principle also be verified via another approach, like those of Jack [6], Jive [23], Krakatoa [10], and KeY [2]. We shall indicate when static checking with the ESC/Java [12] tool brings up interesting semantical issues in our examples¹.

When ESC/Java checks an annotated program, it returns one of three results. The result “passed” indicates that ESC/Java believes the implementation of a method fulfills its specification; in that case we'll say ESC/Java accepts the input. A result of “warning” indicates that an error potentially exists in the program but ESC/Java was unable to prove its existence definitively (e.g., via a counter-example or instruction trace). Finally, the message “error” indicates a specific bug in the program, typically a potential run-time violation like a NullPointerException or an ArrayIndexOutOfBoundsException.
¹ We use the final binary release (v. 1.2.4, 27 September 2001) of ESC/Java.
Of course the examples below do not cover all possible topics. For instance, floating point computations and multi-threading² are not covered. In general, our work has focused on Java for smart cards, and several examples stem from that area. Other examples are taken from work of colleagues [4], from earlier publications, or from test sets that we have been using internally. They cover a reasonable part of the semantics of Java and JML. Most of the examples can be translated easily into other programming languages. The examples do not involve semantical controversies, as discussed for instance in [5]. These examples are not meant to be canonical or proscriptive; they simply give a flavor of the level of completeness (and thus, complexity) necessary to cover modern programming language semantics.

The verification examples are organised in the following categories:

– Control flow and side-effects
– Overflow and bitwise operations
– Static initialization
– Inheritance
– Non-termination
– Specification issues
The organization of the paper roughly follows this categorization. Each example is accompanied by a short explanation of why the example is interesting and the challenges involved. But first we briefly describe the specification language JML.
2 JML
The Java Modeling Language, JML [19], is a behavioral interface specification language designed to specify Java modules. It can be used for classes as a whole, via class invariants and constraints, and for the individual methods of a class, via method specifications consisting of pre-, post- and frame-conditions (assignable clauses). In particular, it is also possible within a method specification to describe if a particular exception may occur and which post-condition results in that case. JML annotations are to be understood as predicates, which should hold for the associated Java code. These annotations are included in the Java source files as special comments indicated by //@, or enclosed between /*@ and */. They are recognised by special tools like the JML run-time checker [9], the LOOP compiler, and the Krakatoa verification condition generator.

Class invariants and constraints are described as follows:

/*@ invariant <predicate>;
  @*/

/*@ constraint <predicate>;
  @*/
² For a meta-theory on multithreaded Java programs see [1] in this volume.
An invariant should hold after termination (normal or exceptional) of methods, provided it holds before. Thus, invariants are implicitly added to postconditions of methods and constructors, and to preconditions of normal (non-constructor) methods. A constraint is a relation between two states, expressing what should hold about the pre-state and post-state of all methods. In this paper we use no constraints, and only one invariant (in Subsection 3.6). However, in practice they contain important information.

Next we give an example JML method specification of some method m():

/*@ behavior
  @ requires <precondition>;
  @ assignable <assignable clause>;
  @ ensures <postcondition>;
  @ signals (Exception) <postcondition>;
  @*/
Such method specifications may be understood as an extension of correctness triples {P} m {Q} used in Hoare logic, because they allow both normal and exceptional termination. Moreover, the postconditions in JML are relations, because the pre-state, indicated by \old(--), may occur. We shall see many method specifications below.

JML is intended to be usable by Java programmers. Its syntax is therefore very much like Java. However, it has a few additional keywords, such as ==> (for implication), \old (for evaluation in the pre-state), \result (for the return value of a method, if any), and \forall and \exists (for quantification). This paper will not pay much attention to the semantics of JML (interested readers should see [21] for such). Hopefully, most of the JML assertions are self-explanatory. However, there are three points that we would like to mention.

– In principle, expressions within assertions (such as an array access a[i]) may throw exceptions. The JML approach, see [21], is to turn such exceptions into arbitrary values. Of course, one should try to avoid such exceptions by including appropriate requirements. For instance, for the expression a[i] one should add a != null && i >= 0 && i < a.length, in case this is not already clear from the context. This is what we shall always do.
– JML uses the subtype semantics for inheritance, see [22]. This means that overriding methods in subclasses should still satisfy the specifications of the overridden ancestors in superclasses. This is a non-trivial restriction, but one which is essential in reasoning about methods in an object-oriented setting. However, it does not hold for all our examples (see for instance Subsection 3.8). In that case we simply write no specification at all for the relevant methods.
– JML method specifications form proof obligations. But also, once proved, they can be used in correctness proofs of other methods. In that case one first has to establish the precondition and invariant of the method that is called, and subsequently one can use the postcondition in the remainder of
the verification (which will rely heavily on the called method's assignable clause). An alternative approach is to reason not with the specification, but with the implementation of the method that is called³. Basically, this means that the body of the called method gets substituted at the appropriate place. However, this may lead to duplication of verification work, and makes proofs more vulnerable to implementation changes. But if no specification is available (see the previous point), one may be forced to reason with the implementation. In the examples below we shall see illustrations of method calls which are used both by specification and by implementation.

For readers unfamiliar with JML, this paper may hopefully serve as an introduction via examples. More advanced use of JML in specifying API components may be found for instance in [24]⁴.
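As a small illustration of reasoning with a specification instead of an implementation (a hypothetical example of our own, not part of the test sets discussed below), the proof obligation of ten() below only needs the ensures clause of five(), so the proof survives any re-implementation of five() that still meets its specification:

/*@ normal_behavior
  @ requires true;
  @ assignable \nothing;
  @ ensures \result == 5;
  @*/
int five() { return 5; }

/*@ normal_behavior
  @ requires true;
  @ assignable \nothing;
  @ ensures \result == 10;
  @*/
int ten() {
  // provable from five()'s postcondition alone: 5 + 5 == 10
  return five() + five();
}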
3 Verification Challenges
This section describes our Java+JML examples in several subsections. Our explanations focus on the (semantic) issues involved, and not so much on the actual code snippets, which should be relatively self-explanatory.

3.1 Aliasing and Field Access
Our first example, seen in Figure 1, might seem trivial to some readers. The return expression of the method Alias.m() references the value of the field i of the object c via an aliased reference to itself in the field a. We present this example because it represents (in our view) the bare minimum necessary to model a language like Java. ESC/Java has no problem verifying this program. Either the implementation or the specification of the constructor of class C can be used to verify method Alias.m().

3.2 Side-Effects in Expressions
One of the most common abstractions in program verification is to omit side-effects from expressions in the programming language. This is a serious restriction. Figure 2 contains a nice and simple example from [4] where such side-effects play a crucial rôle, in combination with the logical operators. Recall that in Java there are two disjunctions (| and ||) and two conjunctions (& and &&). The double versions (|| and &&) are the so-called conditional operators: their second argument is only evaluated if the first one is false (for ||) or true (for &&).
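The difference is easy to observe once an operand has a side-effect. A minimal sketch of our own (not one of the verified examples) contrasting | with ||:

class ShortCircuit {
  static int calls = 0;

  static boolean g() { calls++; return true; }

  static void demo() {
    calls = 0;
    boolean r1 = true || g();  // conditional: g() is not evaluated, calls stays 0
    boolean r2 = true | g();   // non-conditional: g() is evaluated, calls becomes 1
  }
}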
³ This only works if one actually knows the run-time type of the object on which the method is called.
⁴ See also on the web at www.cs.kun.nl/∼erikpoll/publications/jc211_specs.html for a specification of the Java API for smart cards.
class C {
  C a;
  int i;

  /*@ normal_behavior
    @ requires true;
    @ assignable a, i;
    @ ensures a == null && i == 1;
    @*/
  C() { a = null; i = 1; }
}

class Alias {
  /*@ normal_behavior
    @ requires true;
    @ assignable \nothing;
    @ ensures \result == 4;
    @*/
  int m() {
    C c = new C();
    c.a = c;
    c.i = 2;
    return c.i + c.a.i;
  }
}
Fig. 1. Aliasing via Field References.
In case the field b in Figure 2 is true, method m() yields f() ∨ ¬f() = false and ¬f() ∧ f() = true, going against standard logical rules. The verification of the specification for method m() may use either the implementation or the specification of f().
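The following evaluation trace (our own annotation of Figure 2) spells this out for the initial value b == true:

// result1 = f() || !f();
//   f()  : b becomes false, and f() returns false
//   !f() : f() flips b back to true and returns true, so !f() is false
//   thus result1 == (false || false) == false
//
// result2 = !f() && f();
//   !f() : f() flips b to false and returns false, so !f() is true
//   f()  : b becomes true, and f() returns true
//   thus result2 == (true && true) == true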
3.3 Breaking out of a Loop
While and for loops are typically used for going through an enumeration, for instance to find or modify an entry meeting a specific condition. Upon hitting this entry, the loop may be aborted via a break statement. This presents a challenge for the underlying control flow semantics. Figure 3 presents a simple example of a for loop that goes through an array of integers in order to change the sign of the first negative entry. The two lines of Java code are annotated with the loop invariant (JML keyword maintaining), stating what holds while going through the loop, and the loop variant (JML keyword decreasing). The loop variant is a mapping to the natural numbers which decreases with every loop cycle. It is used in verifications to show that the repetition terminates.
boolean b = true;
boolean result1, result2;
/*@ normal_behavior
  @ requires true;
  @ assignable b;
  @ ensures b == !\old(b) && \result == b;
  @*/
boolean f() { b = !b; return b; }
/*@ normal_behavior
  @ requires true;
  @ assignable b, result1, result2;
  @ ensures (\old(b) ==> !result1 && result2) &&
  @         (!\old(b) ==> result1 && result2);
  @*/
void m() { result1 = f() || !f(); result2 = !f() && f(); }
Fig. 2. Side-effects and Conditional Logical Operators.
The result of ESC/Java on this program is not very interesting because of its limited handling of loops: they are executed (symbolically) only once by default. In general, this may indicate a basic problem with the invariant, but the coverage is far from complete⁵.
3.4 Catching Exceptions
Typical of Java is its systematic use of exceptions, via its statements for throwing and catching. They require a suitable control flow semantics. Special care is needed for the ‘finally’ part of a try-catch-finally construction. Figure 4 contains a simple example (adapted from [17]) that combines many aspects. The subtle point is that the assignment m += 10 in the finally block will still be executed, despite the earlier return statements, but has no effect on the value that is returned. The reason is that this value is bound earlier.
3.5 Bitwise Operations
Our next example in Figure 5 is not of the sort one finds in textbooks on program verification. But it is a good example of the ugly code that verification tools have to deal with in practice, specifically in Java Card applets⁶. It involves a “command” byte cmd which is split in two parts: the first three, and the last five bits. Depending on these parts, a mode field is given an appropriate value. This
6
As an aside: ESC/Java has difficulty with this example due to limitations in its parser as quantified expressions cannot be used in ternary operations. If we rewrite the specification of negatefirst as a conjuction of disjoint implications, ESC/Java accepts the program. ESC/Java does not handle a bitwise operator like signed right shift (>>) correctly.
208
B. Jacobs, J. Kiniry, and M. Warnier
int [] ia ; 2
/∗@ normal behavior @ requires ia != null ; @ assignable ia [∗]; 6 @ ensures \ forall int i; 0 <= i && i < ia.length ==> @ (\old(ia [ i]) < 0 && 8 @ (// i is the first position with negative value @ \ forall int j; 0 <= j && j < i ==> \old(ia[j]) >= 0)) 10 @ ? ( ia [ i] == −\old(ia[i ])) @ : ( ia [ i] == \old(ia[ i ])); 12 @∗/ void negatefirst () { 14 /∗@ maintaining i >= 0 && i <= ia.length && @ (\ forall int j; 0 <= j && j < i ==> 16 @ (ia [ j] >= 0 && ia[j] == \old(ia[j ]))); @ decreasing ia .length − i ; 18 @∗/ for(int i = 0; i < ia .length ; i++) { 20 if ( ia [ i ] < 0) { ia [ i] = −ia[ i ]; break; } } } 4
Fig. 3. Breaking out of a Repetition. int m; 2
/∗@ normal behavior @ requires true; @ assignable m; 6 @ ensures \ result == ((d == 0) ? \old(m) : \old(m) / d) @ && m == \old(m) + 10; 8 @∗/ int returnfinally (int d) { 10 try { return m / d; } catch(Exception e) { return m / (d+1); } 12 finally { m += 10; } } 4
Fig. 4. Return within try-catch-finally.
happens in a nested switch. The specification is helpful because it tells in decimal notation what is going on. 3.6
Class Invariants and Callbacks
Class invariants are extremely useful in specification, because they often make explicit what programmers have in the back of their mind while writing their code. A typical example is: “integer i is always non-zero” (so that one can safely divide by i).
Java Program Verification Challenges
2
static final byte ACTION ONE = 1, ACTION TWO = 2, ACTION THREE = 3, ACTION FOUR = 4; private /∗@ spec public @∗/ byte mode;
4
/∗@ behavior @ requires true; @ assignable mode; 8 @ ensures (cmd == 0 && mode == ACTION ONE) || @ (cmd == 16 && mode == ACTION TWO) || 10 @ (cmd == 4 && mode == ACTION THREE) || @ (cmd == 20 && mode == ACTION FOUR); 12 @ signals (Exception) @ ((cmd & 0x07) != 0 || (cmd != 0 && cmd != 16)) 14 @ && @ ((cmd & 0x07) != 4 || (cmd != 4 && cmd != 20)); 16 @∗/ void selectmode(byte cmd) throws Exception { 18 byte cmd1 = (byte)(cmd & 0x07), cmd2 = (byte)(cmd >> 3); switch (cmd1) { 20 case 0x00: switch (cmd2) { case 0x00: mode = ACTION ONE; break; 22 case 0x02: mode = ACTION TWO; break; default: throw new Exception(); } 24 break; case 0x04: switch (cmd2) { 26 case 0x00: mode = ACTION THREE; break; case 0x02: mode = ACTION FOUR; break; 28 default: throw new Exception(); } break; 30 default: throw new Exception(); } // ... more code 32 } 6
Fig. 5. Typical Mode Selection Based on Command Byte.
The standard semantics for class invariants is: when an invariant holds in the pre-state of a (non-constructor) method, it must also hold in the post-state. Note that this post-state can result from either normal or exceptional termination. An invariant may thus be temporarily broken within a method body, as long as it is re-established at the end. A simple example is method decrementk in Figure 6. Things become more complicated when inside such a method body the class invariant is broken and another method is called. The current object this is then left in an inconsistent state. This is especially problematic if control returns at some later stage to the current object. This re-entrance or callback phenomenon is discussed for instance in [25, Sections 5.4 and 5.5]. The commonly adopted solution to this problem is to require that the invariant of this is established before a method call. Hence the proof obligation in a method call a.m() involves the invariants of both the caller (this) and the callee (a).
210
2
B. Jacobs, J. Kiniry, and M. Warnier
class A { private /∗@ spec public @∗/ int k, m; B b;
4
/∗@ invariant k + m == 0; @∗/ 6
/∗@ normal behavior @ requires true; @ assignable k , m; @ ensures k == \old(k) − 1 && m == \old(m) + 1; @∗/ void decrementk () { k−−; m++; }
8
10
12
14
16
18
20
}
/∗@ normal behavior @ requires b != null ; @ assignable k , m; @ ensures true; @∗/ void incrementk () { k++; b.go(this); m−−; }
class B { /∗@ normal behavior 24 @ requires arg != null ; @ assignable arg.k , arg.m; 26 @ ensures arg.k == \old(arg.k) − 1 && @ arg.m == \old(arg.m) + 1; 28 @∗/ void go(A arg) { arg.decrementk(); } 30 } 22
Fig. 6. Callback with Broken Invariant.
This semantics is incorporated in the translation performed by the LOOP tool. Therefore we can not prove the specification for the method incrementk in Figure 6. However, a proof using the implementations of method go and decrementk is possible, if we make the additional assumptions that the run-time type of the field b is actually B, and that the method incrementk is executed on an object of class A. These restrictions are needed because if, for instance, field b has a subclass of B as run-time type, a different implementation will have to be used if the method go is overridden in the subclass. ESC/Java warns about the potential for invariant violation during the callback. Another issue related to class invariant is whether or not they should be maintained by private methods. JML does require this, but allows a special category of so-called ‘helper’ methods which need not maintain invariants. We don’t discuss this matter further.
Java Program Verification Challenges
2
class C { static boolean result1, result2 , result3 , result4 ;
4
6
8
10
12
14
16
}
/∗@ normal behavior @ requires !\ is initialized (C) && @ !\ is initialized (C1) && @ !\ is initialized (C2); @ assignable \ static fields of (C), @ \ static fields of (C1), @ \ static fields of (C2); @ ensures result1 && !result2 && result3 && result4; @∗/ static void m(){ result1 = C1.b1; result2 = C2.b2; result3 = C1.d1; result4 = C2.d2; }
18
class C1 { static boolean b1 = C2.d2; static boolean d1 = true; 22 } 20
24
class C2 { static boolean d2 = true; static boolean b2 = C1.d1; 28 } 26
Fig. 7. Static Initialization.
3.7
Static Initialization
Figure 7 shows an example of static initialization in Java (due to Jan Bergstra). In Java a class is initialized at its first active use (see [13]). This means that class initialization in Java is lazy, so that the result of initialization depends on the order in which classes are initialized. The rather sick example in Figure 7 shows what happens when two classes, which are not yet initialized, have static fields referring to each other. In the specification we use a new keyword \static_fields_of in the assignable clause. It is syntactic sugar for all static fields of the class. The first assignment in the body of method m() triggers the initialization of class C1, which in turn triggers the initialization of class C2. The result of the whole initialization is, for instance, that static field C2.b2 gets value false assigned to it. This can be seen when one realizes that the boolean static fields from class C1 initially get the default value false. Subsequently, class C2 becomes initialized and its fields also get the default value false. Now the assignments
212
B. Jacobs, J. Kiniry, and M. Warnier
in class C2 are carried out: d2 is set to true and b2 is set to false. Note that d1 is still false at this stage. Finally the assignments to fields in class C1 take place, both resulting in value true. One can see that the order of initializations is important. When the first two assignments in the method body of m() are switched, class C2 will be initialized before class C1 resulting in all fields getting value true. ESC/Java cannot handle this example as it cannot reason about static initialization. It provides no warnings for potential run-time errors in static initializers or in initializers for static fields. 3.8
Overriding and Dynamic Method Invocation
The example in Figure 8 is usually attributed to Kim Bruce. It addresses an issue which is often thought of as confusing in programming with languages which support inheritance. The overriding of the method equal makes it hard to tell which implementation is called: the one in the subclass or in the superclass. When a method is overridden, the run-time type of an object decides which implementation is called. This phenomena is also called late-binding. In the example three different objects are created, and the question is which equal method will be used. Notice that the equal methods in Figure 8 have no specifications. According to the behavioural subtyping semantics used in JML, the equal method in the subclass ColorPoint should also satisfy the specification of the equal method in superclass Point. This makes it impossible to prove a precise specification of the equal method in class ColorPoint. Therefore we proved the specification of method m() by using the implementations of the equal methods. 3.9
Inheritance
The program in Figure 9 is from [16] and was originally suggested by Joachim van den Berg. On first inspection it looks like the method test() will loop forever. The method test() calls method m() from class C, which calls method m() from class Inheritance, since ‘this’ has runtime-type Inheritance. Due to the subtype semantics used in JML for inheritance, we cannot write specifications for both of the m() methods with which we can reason. Therefore we can only prove the specification of method test() by using the method implementations. 3.10
Non-termination
The example in Figure 10 (due to Cees-Bart Breunesse) shows a program that does not terminate. The specification asserts that the program does not terminate normally or with an exception. The JML keyword diverges followed by the predicate true indicates that the program fails to terminate. The reader can easily see that this
Java Program Verification Challenges class Point { 2
int equal(Point x) { return 1; }
4
}
6
class ColorPoint extends Point {
8
}
int equal(Point x) { return 2; }
10
int r1,r2,r3,r4,r5,r6,r7,r8,r9; 12
/∗@ normal behavior @ requires true; @ assignable r1 , r2 , r3 , r4 , r5 , r6 , r7 , r8 , r9; @ ensures r1 == 1 && r2 == 1 && r3 == 2 && @ r4 == 2 && r5 == 2 && r6 == 2 && @ r7 == 1 && r8 == 2 && r9 == 2; @∗/ void m() { Point p1 = new Point(); Point p2 = new ColorPoint(); ColorPoint cp = new ColorPoint(); r1 = p1.equal(p1);r2 = p1.equal(p2);r3 = p2.equal(p1); r4 = p2.equal(p2);r5 = cp.equal(p1);r6 = cp.equal(p2); r7 = p1.equal(cp);r8 = p2.equal(cp);r9 = cp.equal(cp); }
14
16
18
20
22
24
26
Fig. 8. Overriding and dynamic method invocation.
2
class C { void m() throws Exception { m(); } }
4
6
class Inheritance extends C { void m() throws Exception { throw new Exception(); }
8
10
12
14
}
/∗@ exceptional behavior @ requires true; @ assignable \nothing; @ signals (Exception) true; @∗/ void test () throws Exception { super.m(); }
Fig. 9. Overriding and Dynamic Types.
213
214
B. Jacobs, J. Kiniry, and M. Warnier
class Diverges{ 2
4
6
8
10
12
14
}
/∗@behavior @ requires true; @ assignable \nothing; @ ensures false ; @ signals (Exception e) false ; @ diverges true; @∗/ public void m(){ for (byte b = Byte.MIN VALUE;b <= Byte.MAX VALUE;b++) ; }
Fig. 10. A Program that does not Terminate.
program does not terminate. Since Byte.MAX_VALUE + 1 == Byte.MIN_VALUE (byte arithmetic overflows silently), the guard in the for loop will never fail. Note that in order to verify this program both overflowing and non-termination have to be modeled appropriately. ESC/Java does not handle non-termination.
3.11 Specification
The final example exemplifies two commonplace complications in reasoning about “real world” Java. Representations of integral types (e.g., int, short, byte, etc.) in Java are finite. A review of annotated programs that use integral types indicates that specifications are often written with infinite numeric models in mind [7]. Programmers seem to think about the issues of overflow (and underflow, in the case of floating point numbers) in program code, but not in specifications.

Additionally, it is often the case that specifications use functional method invocations. Methods which have no side-effects in the program are called “pure” methods in JML and “queries” in the Eiffel and UML communities⁷. The example in Figure 11 highlights both complications, as it uses method invocations in a specification and integral values that can potentially overflow. The method isqrt(), which computes an integer square root of its input, is inspired by a specification (see Figure 12) of a similar function included with early JML releases [20]. Note that the iabs() method in Figure 11 is used in the specification of isqrt() to stipulate that both the negative and the positive square root are acceptable return values, as all we care about is the magnitude. Actually, our implementation of isqrt() only computes the positive integer square root.
⁷ There is still debate in the community about the meaning of “pure”. Many Java methods, for example, which claim to have no side-effects, and thus should be pure, actually do modify the state due to caching, lazy evaluation, etc.
Java Program Verification Challenges
215
/*@ normal_behavior
  @ requires true;
  @ assignable \nothing;
  @ ensures \result == ((x >= 0 ||
  @                      x == Integer.MIN_VALUE) ? x : -x);
  @*/
/*@ pure @*/ int iabs(int x) {
  if (x < 0) return -x; else return x;
}
/*@ normal_behavior
  @ requires x >= 0 && x <= 2147390966;
  @ assignable \nothing;
  @ ensures iabs(\result) < 46340 &&
  @         \result * \result <= x &&
  @         x < (iabs(\result) + 1) * (iabs(\result) + 1);
  @*/
int isqrt(int x) {
  int count = 0, sum = 1;
  /*@ maintaining 0 <= count &&
    @   count < 46340 &&
    @   count * count <= x &&
    @   sum == (count + 1) * (count + 1);
    @ decreasing x - count;
    @*/
  while (sum <= x) { count++; sum += 2 * count + 1; }
  return count;
}
Fig. 11. Dependent Specifications and Integral Types.
2
4
6
/*@ normal_behavior
  @ requires x >= 0;
  @ ensures Math.abs(\result) <= x &&
  @         \result * \result <= x &&
  @         x < (Math.abs(\result) + 1) *
  @             (Math.abs(\result) + 1);
  @*/

Fig. 12. JML Specification of Integer Square Root from [20].
The original specification included in early JML releases is provided in Figure 12. This specification uses the Math.abs() method instead of our iabs() method. We use our own equivalent implementation of square root out of convenience, not necessity. The definition of JML states that expressions in the requires and ensures clauses are to be interpreted in the semantics of Java. Consequently, a valid
implementation of the specification in Figure 12 is permitted to return Integer.MIN_VALUE when x is 0 (as already noted in [20]). This surprising situation exists because Java's integral types are bounded and because the definition of unary negation in the Java Language Specification is somewhat unexpected. Because integral types are bounded, expressions in the postcondition, specifically the multiplication operations, can overflow. Additionally, the two implementations of integer absolute value both return a value of Integer.MIN_VALUE when passed Integer.MIN_VALUE. While this is the documented behavior of java.lang.Math.abs(int), it is often overlooked by programmers because they presume that a function as mathematically uncomplicated as absolute value will produce unsurprising, mathematically correct results for all input. The absolute value of Integer.MIN_VALUE is equal to itself because Math.abs() is implemented with Java's unary negation operator “-”. This operator, defined in Section 15.15.4 of the Java Language Specification, silently overflows when applied to the maximal negative integer or long [13]⁸.

The precondition of isqrt() in Figure 11 is explained in Figure 13. We wish to ensure that no operation, either in the implementation of isqrt() or in its specification, overflows. The critical situation that causes an overflow is when we attempt to take an integer square root of a very large number. In particular, if we attempt to evaluate the postcondition of isqrt() for values of x larger than 2,147,390,966 an overflow takes place. The small but critical interval between the precondition's bound 2,147,390,966 and Integer.MAX_VALUE = 2,147,483,647 is indicated by the dark interval on the right of Figure 13: to check the postcondition, the prospective root (the arrow labeled 1) must be determined, one is added to its value (arrow 2), and the result is squared (arrow 3). This final result will thus overflow. Indeed, (46,340 + 1)² > 2,147,483,647.
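The boundary arithmetic can be checked directly (a worked calculation we added for illustration):

\[
46340^2 = 2{,}147{,}395{,}600 \;\le\; 2{,}147{,}483{,}647 \;<\; 46341^2 = 2{,}147{,}488{,}281,
\]

so 46340 is the largest int whose square is still representable, and squaring any prospective root of 46341 or more silently overflows in the evaluation of the postcondition.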
Fig. 13. The Positive Integers.
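Both surprises discussed above are easy to demonstrate concretely. The following small Java program (an illustrative sketch of ours, not code from the paper) shows the absolute-value anomaly and the squaring overflow at the boundary:

public class OverflowDemo {
    public static void main(String[] args) {
        // Math.abs overflows silently on the most negative int:
        System.out.println(Math.abs(Integer.MIN_VALUE));  // prints -2147483648
        // (46340 + 1)^2 exceeds Integer.MAX_VALUE = 2147483647, so the
        // multiplication in the postcondition wraps around:
        int root = 46340;
        System.out.println((root + 1) * (root + 1));      // prints -2147479015
    }
}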
The erroneous nature of a specification involving potential overflows should become clear when one verifies the method using an appropriate bit-level representation of integral types [18]. Unfortunately, such errors are not at all apparent, even when performing extensive unit testing, because the boundary conditions for arithmetic expressions, like the third term of the postcondition of isqrt() in Figure 11, are rarely automatically derivable, and full state-space coverage is simply too computationally expensive.
8. Interestingly, the documentation for java.lang.Math.abs() did not reflect this fact until the 1.1 release of Java.
Specifications involving integral types, and thus potential overflows, are frequently seen in application domains that involve numeric computation, both complex (e.g., scientific computation, computer graphics, embedded devices, etc.) and relatively simple (e.g., currency and banking). The former category is obviously challenging due to the complexity of the related data-structures, algorithms, and their specifications; the latter is problematic because it is there that implementation violations have egregious (financial) consequences. This specification raises the question: what is the appropriate model for arithmetic specifications (see footnote 9)?

Another challenge highlighted by this example might be called "formalism completeness". Many semantic formalisms of modern programming languages like Java do not attempt to specify the semantics of complicated features like method invocation. Even fewer attempt to incorporate such semantics in method specifications, as used here. For example, ESC/Java is unable to deal with this example because of such interdependencies. This is a significant limitation, as many, if not most, method specifications rely upon pure or JML helper methods. This is also a weakness of the LOOP tool, as dealing with (pure) method calls in specifications is tedious in verifications (see footnote 10). In general, the semantics of method invocation in specifications is still an unclear issue at this time.
4 Conclusions
The starting point of this paper is the observation that the classical examples in (sequential) program verification are no longer very relevant in today’s context. They need to be updated in two dimensions: language complexity and size. This paper focuses on complexity, by presenting a new series of challenges, written in Java, with correctness assertions expressed in JML. The examples incorporate some of the ugly details that one encounters in real-world programs, and that any reasonable semantics should be able to handle. The fact that these example programs are small does not mean that we think size is unimportant. On the contrary, once a reasonably broad semantic spectrum is covered, the next challenge is to scale up one’s program verification techniques to larger programs. With our tools we are currently verifying programs with hundreds of lines of code.
Acknowledgments

We want to thank Patrice Chalin for insightful comments on an earlier version of this paper.
9. We do not have answers for these questions, though investigations are underway [8].
10. The verification of the method isqrt() from Figure 11 uses the implementation of the absolute value method iabs().
References

1. Erika Ábrahám-Mumm, Frank S. de Boer, Willem-Paul de Roever, and Martin Steffen. A Tool-supported Proof System for Multithreaded Java. In FMCO 2002 proceedings, Lect. Notes Comp. Sci. Springer, Berlin, 2003.
2. W. Ahrendt, Th. Baar, B. Beckert, R. Bubel, M. Giese, R. Hähnle, W. Menzel, W. Mostowski, A. Roth, S. Schlager, and P.H. Schmitt. The KeY tool. Technical Report 2003-5, Chalmers and Göteborg University, 2003. http://i12www.ira.uka.de/~key.
3. J. van den Berg and B. Jacobs. The LOOP compiler for Java and JML. In T. Margaria and W. Yi, editors, Tools and Algorithms for the Construction and Analysis of Systems, number 2031 in Lect. Notes Comp. Sci., pages 299–312. Springer, Berlin, 2001.
4. J. Bergstra and M. Loots. Empirical semantics for object-oriented programs. Artificial Intelligence Preprint Series nr. 007, Dep. Philosophy, Utrecht Univ. http://preprints.phil.uu.nl/aips/, 1999.
5. E. Börger and W. Schulte. Initialization problems in Java. Software—Concepts and Tools, 20(4), 1999.
6. Lilian Burdy, Jean-Louis Lanet, and Antoine Requet. JACK (Java Applet Correctness Kit), 2002. www.gemplus.com/smart/r_d/trends/jack.html.
7. P. Chalin. Back to basics: Language support and semantics of basic infinite integer types in JML and Larch. Technical Report 2002.003.1, Computer Science Department, Concordia University, 2002. www.cs.concordia.ca/~faculty/chalin/.
8. P. Chalin. Improving JML: For a safer and more effective language. Technical Report 2003-001.1, Computer Science Department, Concordia University, March 2003.
9. Y. Cheon and G.T. Leavens. A runtime assertion checker for the Java Modeling Language (JML). In H.R. Arabnia and Y. Mun, editors, International Conference on Software Engineering Research and Practice (SERP '02), pages 322–328. CSREA Press, Las Vegas, 2002.
10. E. Contejean, J. Duprat, J.-C. Filliâtre, C. Marché, C. Paulin-Mohring, and X. Urbain. The Krakatoa tool for JML/Java program certification. Available via the Krakatoa home page at www.lri.fr/~marche/krakatoa/, October 2002.
11. E.W. Dijkstra. A Discipline of Programming. Prentice Hall, 1976.
12. C. Flanagan, K.R.M. Leino, M. Lillibridge, G. Nelson, J.B. Saxe, and R. Stata. Extended static checking for Java. In Proceedings of the 2002 ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), volume 37(5) of SIGPLAN Notices, pages 234–245. ACM, 2002.
13. J. Gosling, B. Joy, G. Steele, and G. Bracha. The Java Language Specification, Second Edition. The Java Series. Addison-Wesley, 2000. http://java.sun.com/docs/books/jls/second_edition/html/j.title.doc.html.
14. D. Gries. The Science of Programming. Springer, 1981.
15. C.A.R. Hoare. An axiomatic basis for computer programming. Commun. ACM, 12:576–580, 583, 1969.
16. M. Huisman and B. Jacobs. Inheritance in higher order logic: Modeling and reasoning. In M. Aagaard and J. Harrison, editors, Theorem Proving in Higher Order Logics, number 1869 in Lect. Notes Comp. Sci., pages 301–319. Springer, Berlin, 2000.
17. B. Jacobs. A formalisation of Java's exception mechanism. In D. Sands, editor, Programming Languages and Systems (ESOP), number 2028 in Lect. Notes Comp. Sci., pages 284–301. Springer, Berlin, 2001.
18. B. Jacobs. Java's integral types in PVS. Manuscript, http://www.cs.kun.nl/~bart/PAPERS/, 2003.
19. G.T. Leavens, A.L. Baker, and C. Ruby. JML: A notation for detailed design. In H. Kilov and B. Rumpe, editors, Behavioral Specifications of Business and Systems, pages 175–188. Kluwer, 1999.
20. G.T. Leavens, A.L. Baker, and C. Ruby. Preliminary design of JML: A behavioral interface specification language for Java. Techn. Rep. 98-06, Dep. of Comp. Sci., Iowa State Univ. http://www.cs.iastate.edu/~leavens/JML.html, 1999.
21. G.T. Leavens, E. Poll, C. Clifton, Y. Cheon, and C. Ruby. JML reference manual (draft). www.jmlspecs.org, 2002.
22. B. Liskov and J.M. Wing. A behavioral notion of subtyping. ACM Trans. on Prog. Lang. & Syst., 7(16(6)):1811–1841, 1994.
23. J. Meyer and A. Poetzsch-Heffter. An architecture for interactive program provers. In S. Graf and M. Schwartzbach, editors, Tools and Algorithms for the Construction and Analysis of Systems, number 1785 in Lect. Notes Comp. Sci., pages 63–77. Springer, Berlin, 2000.
24. E. Poll, J. van den Berg, and B. Jacobs. Specification of the JavaCard API in JML. In J. Domingo-Ferrer, D. Chan, and A. Watson, editors, Smart Card Research and Advanced Application, pages 135–154. Kluwer Acad. Publ., 2000.
25. C. Szyperski. Component Software. Addison-Wesley, 1998.
ToolBus: The Next Generation

Hayco de Jong¹ and Paul Klint¹,²

¹ Centrum voor Wiskunde en Informatica, Kruislaan 413, P.O. Box 94079, 1090 GB Amsterdam, The Netherlands
² Informatics Institute, University of Amsterdam, The Netherlands
{Hayco.de.Jong,Paul.Klint}@cwi.nl
Abstract. The ToolBus is a software coordination architecture for the integration of components written in different languages running on different computers. It has been used since 1994 in a variety of projects, most notably in the complete renovation of the Asf+Sdf Meta-Environment. In this paper we summarize the experience that has been gained in these projects and sketch preliminary ideas on how the ToolBus can be further improved. Topics to be discussed include the structuring of message exchanges, crash recovery in distributed applications, and call-by-value versus call-by-reference.
1 Generic Language Technology
Our primary interest is generic language technology that aims at the rapid construction of tools for a wide variety of programming and application languages. Its central notion is a language definition of some programming or application language. The common methodology is that a language is identified in a given domain, that relevant aspects of that language are formally defined and that desired tools are generated on the basis of this language definition. This generative approach is illustrated in Fig. 1. Using a definition for some language L as starting point, a generator can produce a range of tools for editing, manipulating, checking or executing L programs. Language aspects have to be defined, analyzed, and used to generate appropriate tooling such as compilers, interpreters, type checkers, syntax-directed editors, debuggers, partial evaluators, test case generators, documentation generators, and more. Language definitions are used, on a daily basis, in application areas as disparate as Cobol renovation, Java refactoring, smart card verification and in application generation for domains including finance, industrial automation and software engineering. In the case of Cobol renovation, the language in question is Cobol and those aspects that are relevant for renovation have to be formalized. In the case of application generation, the language in question is probably new and has to be designed from scratch.
1.1 One Realization: The Asf+Sdf Meta-Environment
The Asf+Sdf Meta-Environment [1,2] is an incarnation of the approach just described and covers both the interactive development of language definitions and the generation of tools based on these language definitions.
Fig. 1. From language definition to generated programming environment.
In this paper we are primarily interested in the software engineering aspects of building such a system. Starting point is the Asf+Sdf Meta-Environment as we had completed it in the beginning of the 1990's. This was a monolithic 200 KLOC Lisp program that was hard to maintain. It had all the traits of a legacy system and was the primary motivation to enter the area of system and software renovation.

1.2 Towards a Component Based Architecture

We give a brief time line of the efforts to transform the old, monolithic implementation of the Meta-Environment into a well-structured, component-based implementation.

In 1992, first, unsuccessful, experiments were carried out to decompose the system into separate parts [3]. The idea was to separate the user-interface and the text editor from the rest of the system. The user-interface was completely re-implemented as a separate component and as text editor we re-used Emacs. In hindsight, we were unaware of the fact that we made the transition from a completely sequential system to a system with several concurrent components. Unavoidably, we encountered hard to explain deadlocks and race conditions.

In 1993, a next step was to write a formal specification of the desired system behavior [4] using PSF, a specification language based on process algebra and algebraic specifications [5]. Simulation of this specification unveiled other, not yet observed, deadlocks. Although this was clearly an improvement over the existing situation, this specification approach also had its limitations and drawbacks:

– The specification lacked generality. It would, for instance, have been a major change to add the description of a new component.
– The effort to write the PSF specification was significant and there was no way to derive an actual implementation from it.

In 1994, the first version of the ToolBus was completed [6,7]. The key idea was to organize a system along the lines of a software bus and to make this bus programmable by way of a scripting language (Tscript) that was based on ACP (Algebra of Communicating Processes, [8]). Another idea was to use a uniform data format (called
ToolBus terms) to exchange data between ToolBus and tools. At the implementation level, Tscripts were executed by an interpreter and communication between tools and ToolBus took place using TCP/IP sockets. In this way, multi-language, distributed applications could be built with significantly less effort than using plain C and sockets.

Based on various experiments [9,10,11,12], in 1995 a new version of the ToolBus was designed and implemented: the Discrete Time ToolBus [13,14,15]. Its main innovations were primitives for expressing timing considerations (delay, timeout) and for operating on a limited set of built-in data-types (booleans, integers, reals, lists). The Discrete Time ToolBus has been used for the restructuring of the Asf+Sdf Meta-Environment [16]. A first version was released in 2001 [2]. In the meantime, the exchange format has also evolved from the ToolBus terms mentioned above to ATerms [17]: a term format that supports maximal subterm sharing and a very concise, sharing preserving, binary exchange format. ATerms decrease memory usage thanks to sharing and they permit a very fast equality test, since structural equality can be replaced by pointer equality thanks to the maximal subterm sharing (a minimal illustration of this idea follows at the end of this subsection). Another line of development is the ToolBus Integrated Debugging Environment (Tide) described in [18].

Today, beginning 2003, it turns out that the original software engineering goals that triggered the development of the ToolBus have been achieved and that the Meta-Environment can now be even further stretched than anticipated [19]. Therefore, it is time for some reflection. What have we learned from this major renovation project and what are the implications for the ToolBus design and implementation?
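As an aside, the maximal-sharing idea behind ATerms can be made concrete with a minimal Java sketch of hash-consing; the simplified Term shape and all names below are our own illustration, not the actual ATerm library API:

import java.util.*;

// Illustrative sketch of maximal subterm sharing (hash-consing).
// Structurally equal terms are represented by one object, so structural
// equality can be decided with a pointer comparison (==).
final class Term {
    final String fun;
    final List<Term> args;

    private Term(String fun, List<Term> args) {
        this.fun = fun;
        this.args = args;
    }

    // Table of all terms built so far, keyed by (fun, children). Children are
    // already maximally shared, so their identity-based equals/hashCode is
    // exactly the right notion of equality for the key.
    private static final Map<List<Object>, Term> table = new HashMap<>();

    static Term make(String fun, Term... args) {
        List<Object> key = new ArrayList<>();
        key.add(fun);
        key.addAll(Arrays.asList(args));
        return table.computeIfAbsent(key, k -> new Term(fun, List.of(args)));
    }

    public static void main(String[] s) {
        Term a = make("f", make("x"), make("y"));
        Term b = make("f", make("x"), make("y"));
        System.out.println(a == b);  // true: equality is pointer equality
    }
}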
1.3 Plan of this Paper
In Sect. 2 we discuss component coordination, representation and computation and introduce the ToolBus: our component coordination architecture. Next, in Sect. 3, we demonstrate some of the ToolBus features by means of an example. In Sect. 4 we show how we used the ToolBus in the Asf+Sdf Meta-Environment to migrate from a monolithic to a distributed architecture. Then, in Sect. 5 we elaborate on the various issues that we would like to tackle in a next generation of the ToolBus. We conclude the paper with an overview of the status of our current implementation of this next generation ToolBus (Sect. 6) and some concluding remarks (Sect. 7).
2 The ToolBus Architecture

In [20] it was advocated that the overall architecture of a software system can be improved by separating coordination from computation. In addition to this, we also distinguish representation and use the following definitions:

– Coordination: the way in which program and system parts interact (using procedure calls, remote method invocation, middleware, and others).
– Representation: language and machine neutral data exchanged between components.
– Computation: program code that carries out a specialized task.
Fig. 2. Separating coordination from computation.
Fig. 3. The ToolBus architecture.
The assumption is now that a rigorous separation of coordination, representation and computation leads to flexible and reusable systems. This subdivision is sketched in Fig. 2. Our ToolBus approach follows this paradigm and is illustrated in Fig. 3. The goal of the ToolBus is to integrate tools written in different languages running on different machines. This is achieved by means of a programmable software bus. The ToolBus coordinates the cooperation of a number of tools. This cooperation is described by a Tscript that runs inside the ToolBus. The result is a set of concurrent processes inside the ToolBus that can communicate with each other and with the tools. Tools can be written in any language and can run on different machines. They exchange data by way of ATerms. A typical cooperation scenario is illustrated in Fig. 4. A user-interface (UI) and a database (DB) are combined in an application. Pushing a button in the user-interface leads to a database action and the result is displayed in the user-interface. In a traditional approach, the database action is directly connected to the user-interface button by means of a call-back function. This implies that the user-interface needs some knowledge about
Fig. 4. A typical cooperation scenario.
the database tool and vice versa. In the ToolBus approach the two components are completely decoupled: pushing the button only leads to an event that is handled by some process in the ToolBus. This process routes the event to the database tool (likely via some intermediary process) and gets the answer back via the inverse route. This implies that the configuration knowledge is now completely localized in the Tscript and that UI and DB do not even know about each other's existence. The primitives that can be used in Tscripts are listed in Table 1.

Table 1. Overview of ToolBus primitives.

Primitive                      Description
delta                          inaction ("deadlock")
+                              choice between two alternatives (P1 or P2)
.                              sequential composition (P1 followed by P2)
*                              iteration (zero or more times P1 followed by P2)
create                         process creation
snd-msg                        send a message (binary, synchronous)
rec-msg                        receive a message (binary, synchronous)
snd-note                       send a note (broadcast, asynchronous)
rec-note                       receive a note (asynchronous)
no-note                        no notes available for process
subscribe                      subscribe to notes
unsubscribe                    unsubscribe from notes
snd-eval                       send evaluation request to tool
rec-value                      receive a value from a tool
snd-do                         send request to tool (no return value)
rec-event                      receive event from tool
snd-ack-event                  acknowledge a previous event from a tool
if ... then ... fi             guarded command
if ... then ... else ... fi    conditional expressions
||                             communication-free merge (parallel composition)
let ... in ... endlet          local variables
:=                             assignment
delay                          relative time delay
abs-delay                      absolute time delay
timeout                        relative timeout
abs-timeout                    absolute timeout
rec-connect                    receive a connection request from a tool
rec-disconnect                 receive a disconnection request from a tool
execute                        execute a tool
snd-terminate                  terminate the execution of a tool
shutdown                       terminate ToolBus
attach-monitor                 attach a monitoring tool to a process
detach-monitor                 detach a monitoring tool from a process
3 An Example: The Address Book Service

To make the scenario from Fig. 4 more concrete, we describe the construction of an address book holding (name, address) pairs. Typical uses include creating a new address, finding an address based on the name, etc. First we consider some aspects of the User Interface. An instance of the UI connects to the ToolBus and during the subsequent session, the user can: create a new entry in the address book database; delete an existing entry from the database; search for an entry in the database; update an existing entry in the database. Each of these use cases can be described as a ToolBus process which, together with a process that explains how these use cases interact, forms the ToolBus script describing our Address Book Service.

3.1 ToolBus Processes for the Address Book Service

The ADDRESSBOOK process tells the ToolBus that an instance of our address-book tool is to be executed, followed by a loop which invokes one of the processes CREATE,
DELETE, SEARCH or UPDATE in each iteration. This construction, using the + operator, ensures that at this level the sub-processes can be regarded as atomic. This means that, for example, no DELETE will happen during an UPDATE.

process ADDRESSBOOK is
  let AB : address-book
  in
    execute(address-book, AB?) .
    ( CREATE(AB) + DELETE(AB) + SEARCH(AB) + UPDATE(AB) ) * delta
  endlet
The operating system level details of starting the tool are defined in a separate section (one for each tool if multiple tools are involved):

tool address-book is {
  command = "java-adapter -class AddressBookService"
}
In this case, the ToolBus is told that our tool is written in Java, and that the main class to be started is called AddressBookService. The CREATE process can be described as a ToolBus process as follows:

process CREATE(AB : address-book) is
  let AID : int
  in
    rec-msg(create-address) .
    snd-eval(AB, create-entry) .
    rec-value(AB, new-entry(AID?)) .
    snd-msg(address-created(AID))
  endlet
The request to create a new address book entry is received and delegated to the tool, so it can update its state. In this case, our tool yields a unique id for reference to the new entry, which is returned as the result of the creation message. Note that communication between processes involves the matching of the arguments of snd-msg and rec-msg. The same holds for the communication between a process and a tool using snd-eval and rec-value. In all these cases, a result variable of the form V? gets a value assigned as the result of a successful match. The DELETE process differs from the CREATE process only in that it does not need a return value:

...
rec-msg(delete-address(AID?)) .
snd-do(AB, delete-entry(AID)) .
snd-msg(address-deleted(AID))
...
The SEARCH process in our example implements but a single query: finding an address book entry by name. It shows how different results from a tool-evaluation request can be processed in much the same way that different messages are handled. Upon receiving a find-by-name message from another process, this request is delegated to the tool. Depending on whether or not the entry exists in the database, the tool replies with a found or a not-found message, respectively. This result is then propagated to the process that sent the initial find-by-name message.
process SEARCH(AB : address-book) is
  let Aid : int, Name : str
  in
    rec-msg(find-by-name(Name?)) .
    snd-eval(AB, find-by-name(Name)) .
    ( rec-value(AB, found(Aid?)) .
      snd-msg(found(Aid))
    + rec-value(AB, not-found) .
      snd-msg(not-found)
    )
  endlet
The UPDATE process is more interesting. It shows that each update of an address entry is guarded. A process wanting to update an entry first has to announce this fact by sending an update-entry message, before it can do one or more updates to the entry. It then finishes the update by sending an update-entry-done message. Because matching snd-msg and rec-msg messages are connected synchronously, it is not possible for one update transaction to interfere with another. After a sender and receiver of the update-entry message are connected, all other processes that want to send an update-entry message have to wait until the receiving process is ready to receive an update-entry message again. This message pair thus acts as a very primitive locking scheme. More elaborate schemes are very well possible, but are not discussed in this paper. Summarizing, the UPDATE process shows that outside the implementation of the address book service, we can enforce the order in which certain parts of the service are invoked, as well as mutual exclusion of some of its sections.

process UPDATE(AB : address-book) is
  let AID : int, Name : str, Address : str
  in
    rec-msg(update-entry(AID?)) .
    ( rec-msg(set-name(Name?)) .
      snd-do(AB, set-name(AID, Name))
    + rec-msg(set-address(Address?)) .
      snd-do(AB, set-address(AID, Address))
    ) *
    rec-msg(update-entry-done(AID))
  endlet
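The Tscript above only coordinates; the state itself lives in the tool. As an illustration, here is a hypothetical Java sketch of the tool-side state that a class like AddressBookService might maintain to answer the create-entry, delete-entry, find-by-name and set-name requests. The adapter plumbing that turns ToolBus terms into these method calls is omitted, and all names here are our own assumptions, not the actual tool code:

import java.util.*;

// Hypothetical tool-side state for the address book service. The ToolBus
// adapter (not shown) would decode incoming terms such as create-entry
// and dispatch them to these methods.
public class AddressBookService {
    private static final class Entry {
        String name = "";
        String address = "";
    }

    private final Map<Integer, Entry> entries = new HashMap<>();
    private int nextId = 0;

    // create-entry: allocate a fresh entry and return its unique id.
    public int createEntry() {
        int id = nextId++;
        entries.put(id, new Entry());
        return id;
    }

    // delete-entry: no return value, matching the snd-do in DELETE.
    public void deleteEntry(int id) {
        entries.remove(id);
    }

    // find-by-name: return the id, or -1 so the adapter can reply not-found.
    public int findByName(String name) {
        for (Map.Entry<Integer, Entry> e : entries.entrySet())
            if (e.getValue().name.equals(name)) return e.getKey();
        return -1;
    }

    // set-name / set-address: updates guarded by the UPDATE process.
    public void setName(int id, String name) { entries.get(id).name = name; }
    public void setAddress(int id, String address) { entries.get(id).address = address; }
}

Note that the tool can stay single-threaded and lock-free: mutual exclusion between concurrent updates is enforced entirely by the UPDATE process in the Tscript.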
3.2 ToolBus Process for the User Interface

Because users can connect at any time to the ToolBus to start a session with the Address Book Service, the ToolBus itself does not execute instances of the UI (as it did with the
address book tool). Instead, UITool instances can connect, make zero or more requests to the service, and disconnect at their convenience. A ui tool is declared to exist, but no operating system level details are provided. The following definition of the UI process shows how UI requests for the creation of a new entry and a name-change can be realized:

tool ui is { /* the ToolBus does not execute ui-instances */ }

process UI is
  let UITool : ui, AID : int, Name : str
  in
    rec-connect(UITool?) .
    ( rec-event(UITool, create-address) .
      snd-msg(create-address) .
      rec-msg(address-created) .
      snd-ack-event(UITool, create-address)
    + rec-event(UITool, update-name(AID?, Name?)) .
      snd-msg(update-entry(AID)) .
      snd-msg(set-name(Name)) .
      snd-msg(update-entry-done(AID)) .
      snd-ack-event(UITool, update-name(AID, Name))
    + ... /* more UI requests */
    ) *
    rec-disconnect(UITool)
  endlet
4 Application to the Asf+Sdf Meta-Environment

As explained in Sect. 1.2, the ToolBus has been used to restructure the Asf+Sdf Meta-Environment. It consists of a cooperation of 27 tools ranging from a user-interface, graph browser, various editors, compiler and interpreter, to a parser generator and a repository for parse trees. A simplified view is shown in Fig. 5. Our insight can be further increased by considering some statistics. Table 2 shows the relative sizes of the various implementation languages used in the Meta-Environment. In the column Language the various languages are listed. In column KLOC the size (in Kilo Lines Of Code) is given for each language. The result is 107 KLOC for the whole system, of which 4.6% are Tscripts. If we consider the fact that Asf+Sdf specifications are compiled to C code, another view is possible as well: 12 KLOC of Asf+Sdf generates 170 KLOC of C code. Taking this generated code into account, the total size of the whole system amounts to 277 KLOC, of which 1.8% are Tscripts. This is compatible with the expectation that Tscripts are relatively small and form high-level "glue" to connect much larger components.
Fig. 5. Architecture of the Asf+Sdf Meta-Environment.

Table 2. Facts concerning implementation languages.

Language        KLOC†   Generated KLOC   Total KLOC
ASF+SDF         12      170 (C)
C               80††
Java, Tcl/Tk    5
Makefiles, etc  5
Tscript         5
Total LOC:      107     170              277
Tscript         4.6%                     1.8%

† Kilo Lines of Code, excluding third party code such as emacs, dot, and the like.
†† This includes 10 KLOC (C code) for the ToolBus implementation itself.
Part of the generated C code is currently produced by ApiGen [21]. This is an API generator which takes an Sdf grammar as input and generates a C library which gives type-safe access to the underlying ATerm representation of the parse trees over this grammar. Another conclusion from these facts is that the low-level information for building the software (makefiles and configuration scripts) is of the same size as the high-level Tscripts. This suggests that the level of these build scripts should be raised. This conclusion will, however, not be further explored in this paper.

Another view is given in Table 3, where the frequency of occurrence of Tscript primitives is shown. Clearly, sequential composition (.) is the dominant operator and sending/receiving messages (snd-msg, rec-msg) is the dominant communication mechanism, followed by communication with tools (snd-do, snd-eval). It may be surprising that parallel composition (||) is used so infrequently. However, one should be aware that at the top level all ToolBus processes run concurrently and that || is only used for explicit concurrency inside a process. The level of concurrency is therefore approximately 100 (104 process definitions and 3 explicit || operators).
Table 3. Facts concerning Tscript primitives.

Primitive                    Number of occurrences
process definitions          104
tool definitions             27
. (sequential composition)   4343
+ (choice)                   341
* (iteration)                243
|| (parallel composition)    3
snd-msg                      555
rec-msg                      541
snd-note                     100
rec-note                     24
snd-do/snd-eval              220
rec-event                    56
create                       58

Empirical evidence shows that:

– The ToolBus-based version of the Asf+Sdf Meta-Environment is more flexible, as illustrated by the fact that clones of the Meta-Environment have started to appear for other languages than Asf+Sdf. Examples are Action Semantics [22] and Elan [23].
– Various components of the Asf+Sdf Meta-Environment are being reused in other projects [10,24].
5 Issues in a Next-Generation ToolBus
The ToolBus has been used in various applications, of which the Meta-Environment is by far the largest. Some of the questions posed by our users and ourselves are:

– I find it difficult to see which messages are requests and which are replies; can you provide support for this? See Sect. 5.1.
– If a tool crashes, what is the best way to describe the recovery in the Tscript? See Sect. 5.2.
– I have huge data values that are exchanged between tools and the ToolBus becomes a data bottleneck; can you improve this? See Sect. 5.3.
– The ToolBus and tools are running as separate tasks of the operating system. Would it not be more efficient to run ToolBus and tools in a single task? See Sect. 6.
5.1 Undisciplined Message Patterns
The classical pattern of a remote procedure call is shown in Fig. 6: a caller performs a call to a callee. During the call the caller suspends execution and the callee executes until it has computed a reply. At that point in time, the caller continues its execution. Compare this simple situation with general message communication as shown in Fig. 7: the caller continues execution after sending a message msg1 to Callee1 and may even send a message msg2 to Callee2. At a certain point in time Callee2 may send message msg3 back to Caller. In this case, the three parties involved continue their execution while messages are being exchanged and there is no obvious pairing of calls and replies.
Fig. 6. Communication pattern for remote procedure call.
Fig. 7. Communication pattern for general messages.
In the ToolBus case, a snd-msg and a rec-msg can interact with each other if their arguments match. A typical sequence is:

Process A:
  snd-msg(calculate(E)) .
  ... other actions ...
  rec-msg(value(E, V?))

Process B:
  rec-msg(calculate(E?)) .
  ... actions that compute value V ...
  snd-msg(value(E, V))
What we see here is that a form of call/reply regime is encoded in the messages: process B returns the value V that it has computed as snd-msg(value(E, V)). The E is completely redundant, but serves as an identification telling process A which message this is an answer to. The call/reply regime is thus implicitly encoded in messages. This makes error handling harder (which reply did not come?) and makes the Tscripts harder to understand. This is particularly so, since unstructured combinations of snd-msg/rec-msg and sequential composition, choice, iteration and parallel composition are allowed.
The only solution for the above problems is to limit the occurrences of snd-msg or rec-msg in such a way that a form of very general call/reply regime is enforced. Our approach is to syntactically enforce that snd-msg/rec-msg or rec-msg/snd-msg may only occur in (possibly nested) pairs and that in between arbitrary operations are allowed. In fact, the matching snd-msg or rec-msg may be an arbitrary expression provided that all its alternatives begin with a matching snd-msg or rec-msg. We thus replace

  snd-msg(req(E)) . arbitrary process expression . rec-msg(ans(A?))

by the syntactic construct

  snd-msg(req(E)) { arbitrary process expression } rec-msg(ans(A?))

and also allow more general cases like:

  snd-msg(req(E)) { arbitrary process expression } ( rec-msg(ans(A?)) + rec-msg(error(M?)) )

It is an interesting property of Process Algebra that every process expression can be normalized to a so-called action prefix form: a list of choices where each choice starts with an atomic action. An action prefix form has the following structure: a1.P1 + a2.P2 + ... + an.Pn. Using this property we can formulate the most general constraint that we impose on occurrences of snd-msg and rec-msg. Consider P1 { Q } P2 and let P1' and P2' be the action prefix forms of, respectively, P1 and P2. Our requirement is now that each choice in P1' starts with a snd-msg and each choice in P2' with a rec-msg, or vice versa. Note that this constraint can be checked statically.
5.2 Exception Handling
Exception handling is notorious for its complexity and impact on the structure of program code. The mainstream exception handling approach as used in, for instance, Java associates one or more exception handlers with a specific method call. If the call completes successfully, the handlers are ignored. If the call raises an exception, it is checked whether this exception can be handled locally by one of the given handlers. If not, the exception is propagated to the caller of the current code. This model does not, however, work well in a setting where multiple processes are active and the occurrence of an exception may require recovery in several processes.

Local Exception Handling. We start with the simpler case of local error handling and introduce the disrupt operator (>>) proposed in LOTOS [25]. A process algebra variant of this operator is described in [26]. It has the form P >> E, where P describes the normal processing and E the exceptional processing. It adds the exception E as an alternative to each atomic action in P. If the action prefix form of P is a1.P1 + a2.P2 + ... + an.Pn, then P >> E ≡ (a1 + E).(P1 >> E) + ... + (an + E).(Pn >> E).
Global Exception Handling. Global exception handling in distributed systems is a very well-studied subject from the perspective of crash recovery and transaction management in distributed databases. An overview of rollback-recovery protocols in message-passing systems is, for instance, given in [27]. In the context of system reliability, the notion of a recovery block has been introduced by Randell [28]. Its purpose was to provide several alternative algorithms for doing the same computation. Upon completion of one algorithm, an acceptance test is made. If the test succeeds, the program proceeds normally, but if it fails a rollback is made to the system state before the algorithm was started and one of the alternative algorithms is tried. In [29] this idea is applied to backtracking in string processing languages. It turns out that the preservation of the system state can be done efficiently by only saving updates to the state after the last recovery point. Recovery blocks also form the basis for Coordinated Atomic Actions described in [30].

Recovery blocks are intended for the error recovery in a single process. They can be generalized to conversations between more than one process: several processes can enter a conversation at different times but they can only leave it simultaneously, when all participating processes satisfy their acceptance test. In case one participant fails to pass its test, each participant is rolled back to the state when it entered the conversation. We are currently studying this model since it can be fit easily in the ToolBus framework and seems to solve our problem of global exception handling. It is helpful that a backtrack operator similar to the one described in [29] has also been described for Process Algebra [31]. What remains to be studied is how the recovery of tools has to be organized. Most likely, we will add a limited undo request to the tool interface to recover from the last few operations carried out by a tool.
5.3 Call-by-Value Versus Call-by-Reference
Background. The concepts of call-by-reference and call-by-value are well-known in programming languages. They describe how an actual parameter value is transmitted from a procedure call to the body of the called procedure. In the case of call-by-reference, a pointer to the parameter is transmitted to the body. Call-by-reference is efficient (only a pointer has to be transmitted) and the parameter value can be changed during execution of the procedure body (via the pointer). In the case of call-by-value, a physical copy of the parameter is transmitted to the procedure body. Call-by-value is less efficient for large values and does not allow the called procedure to make changes to the parameter value in the calling procedure.

These considerations also apply to value transmissions in a distributed setting, with the added complication that values can be accessed or modified by more than one party. Call-by-reference (Fig. 8) is efficient for infrequent access or update. It is the prevalent mechanism in, for instance, CORBA [32]. However, uncontrolled modifications by different parties can lead to disaster. Call-by-value (Fig. 9) is inefficient for large values and any sharing between calls is lost. To us, this is of particular interest, because we need to preserve sharing in huge parse trees. In the case of Java RMI [33], value transmission is achieved via serialization and works only for communication with other Java components. Using IIOP [34], communication with non-Java components is possible.
Fig. 8. Call-by-reference in a distributed application.
Fig. 9. Call-by-value in a (Java-based) distributed application.
Current ToolBus approach. Currently, the ToolBus provides a transport mechanism based on call-by-value as shown in Fig. 10(a). It is transparent, since the transmitted values are ATerms (see Sect. 1.2) that can be exchanged with components written in any language. Since pure values are exchanged, there is no need for distributed garbage collection. Note that the call-by-reference model can easily be mimicked in the ToolBus. For instance, one tool can maintain a shared database and can communicate with other tools using record keys and field names, so that only the values of record fields have to be exchanged (as opposed to complete records or even the complete database). In this way the access control to the shared database can be spelled out in detail and concurrency conflicts can be avoided. This solves one of the major disadvantages of the pure call-by-reference model in a distributed environment.

The downside is, however, that the ToolBus becomes a data bottleneck when huge values really have to be transmitted between tools. Currently, two workarounds are used. The first workaround is to get temporary relief by sending compressed values rather than the values themselves. The second workaround is to store the large value in the filesystem and to send a file name rather than the file itself. This does scale, but it also creates an additional inter-tool dependency and assumes that both tools have access to the same shared file system. We will now first discuss how related frameworks handle call-by-reference, and then we come back to implications for the ToolBus design. In particular, we will discuss channel-based transmission, as already shown in Fig. 10(b).
5.4 Related Frameworks: Java RMI, RMI-IIOP and Java IDL
Given our needs and desires for a next generation ToolBus it is interesting to see what other solutions are applied in similar projects. In this section, we briefly look at three related mechanisms:
Fig. 10. Value-based (a) versus channel-based (b) transmission in the ToolBus.
– Java Remote Method Invocation (RMI), which connects distributed objects written in Java;
– Java RMI over the Internet Inter-ORB Protocol (IIOP), which is like RMI, but uses IIOP as the underlying protocol;
– Java IDL, which connects Java implementations of CORBA interfaces.

Java RMI. Java Remote Method Invocation is similar to the ToolBus architecture in the sense that it connects different tools, possibly running on different machines. It differs from the ToolBus setting because it is strictly Java based: only components written in Java can communicate via RMI. For components to work together in RMI, first a remote interface is established. This is a Java interface that has a "real" implementation in the tool (or server) and a "stub" implementation on the client sides (Fig. 11). The interface is written by the programmer, as opposed to the generated interfaces in a ToolBus setting, where they are derived from the communication patterns found in the ToolBus script. The stubs in the RMI setting are then generated from this Java interface using rmic, the RMI compiler. Stubs act as a client-side proxy, delegating the method call via the RMI system to the server object. In RMI, any object that implements a remote interface is called a remote object.

In RMI, arguments to or return values from remote methods can be primitive data (e.g. int), remote objects, or serializable objects. In Java, an object is said to be serializable if it implements the java.io.Serializable interface. Both primitive data and serializable objects are passed by value using Java's object serialization. Remote objects are essentially passed by reference. This means that changes to them are actually performed on the server, and updates become available to all clients. Only the behavior that was defined in the remote interface is available to the clients. RMI programmers should be aware of the fact that any parameters, return values and exceptions that are not remote objects are passed by value. This makes it hard to understand, when looking at a system of RMI objects, exactly which method calls will result in a local (i.e. client side) state change, and which will have global (server side) effect.
Fig. 11. Client-server model in RMI framework.
Consider, again, our address book example. If the AddressBookService is implemented as a remote object in RMI, then client-side invocations of the setAddress method will cause a global update. If, on the other hand, the AddressBookEntries are made serializable and instances of this class are returned as the result of a query to the AddressBookService, then updates on these instances will have a local state change only (a sketch of this distinction follows below). Finally, before two RMI components can connect, the server side needs to register itself with an rmiregistry, after which the client needs to explicitly obtain a reference to the (remote) server object.

Java RMI over IIOP. By making RMI programs conform to some restrictions, they can be made available over the Internet Inter-ORB Protocol (IIOP). This means that functionality offered by the RMI program can be made available to CORBA clients written in any (CORBA supported) language. The restrictions are mostly namespace oriented: programmers need to take special care not to use certain names that might collide with CORBA generated names, but some reservations should also be made regarding sharing preservation of object references. References to objects that are equal according to the == operator in one component need not necessarily be equal in a remote component. Instead the equals method should be used to discern equality. RMI over IIOP is best used when combining several Java tools for which the programmer would like to use RMI, and some tools written in another CORBA-supported language need to use (some of) the services provided by the Java tools. The component's interface is established by writing a Java interface, just as in plain RMI.

Java IDL. Apart from Java RMI, which is optimized for connecting components that are all written in Java, there is also a connection from Java to CORBA using the Java Interface Definition Language (IDL). This alternative to Java RMI is for Java programmers who want to program in the Java programming language, based on interfaces defined in the CORBA Interface Definition Language. Using this bridge, it becomes possible to let Java components communicate with CORBA objects written in any language that has Interface Definition Language (IDL) mappings.
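Returning for a moment to RMI's parameter-passing rules: the following illustrative Java sketch (our own hypothetical rendering of the address book example, not code from the systems discussed) shows the by-reference service next to the by-value entries:

import java.io.Serializable;
import java.rmi.Remote;
import java.rmi.RemoteException;

// Remote interface: calls on it execute on the server, so the service
// object itself is effectively passed by reference.
interface AddressBookService extends Remote {
    int createEntry() throws RemoteException;
    void setAddress(int id, String address) throws RemoteException;  // global update
    AddressBookEntry findByName(String name) throws RemoteException; // returns a copy
}

// Serializable entry: returned to clients by value. Mutating the copy on
// the client changes local state only; the server's entry is untouched.
class AddressBookEntry implements Serializable {
    int id;
    String name;
    String address;
}

A client that calls setAddress() therefore updates the view of all clients, whereas assigning to the address field of a returned AddressBookEntry changes nothing on the server.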
ToolBus: The Next Generation ToolBus Architecture Component coordination Interface Tscript GC yes parameters / by-value return values language any with TB adapter component yes coordination
RMI Client Server Java Interface yes local: by-value remote: by-ref only Java no
RMI-IIOP Client Client Java Interface no local: by-value remote: by-ref CORBA objects if interface in Java no
237
Java IDL Client Server IDL no depends on signature any with IDL binding no
Fig. 12. Related architectures: a feature overview.
Instead of writing a Java interface as is done in RMI, in Java IDL the definition is written in IDL: a special purpose interface language used as the base for CORBA implementations. This IDL definition is then used to generate the necessary stubs (client side proxies that delegate method invocations to the server) and skeletons, holder and helper classes (server side classes that hide low-level CORBA details).

Feature summary. Fig. 12 shows some of the similarities and differences between ToolBus, RMI, RMI-IIOP and Java IDL.

                          ToolBus                 RMI               RMI-IIOP            Java IDL
Architecture              Component coordination  Client Server     Client Client      Client Server
Interface                 Tscript                 Java Interface    Java Interface     IDL
GC                        yes                     yes               no                  no
parameters /              by-value                local: by-value   local: by-value    depends on signature
return values                                     remote: by-ref    remote: by-ref
language                  any with TB adapter     only Java         CORBA objects if   any with IDL binding
                                                                    interface in Java
component coordination    yes                     no                no                  no

Fig. 12. Related architectures: a feature overview.

– RMI, RMI-IIOP and Java IDL make an explicit distinction between client and server sides of a set of cooperating components. In the ToolBus setting all components are considered equal (and none are more equal than others).
– In RMI and RMI-IIOP, the programmer writes a Java interface which describes the component's incoming and outgoing method signatures, from which stubs and skeletons are generated. In Java IDL a CORBA interface is written. In the ToolBus setting, these signatures are generated from the ToolBus script, which describes much more of the component's behavior in terms of method call interaction, rather than just method signatures.
– The ToolBus takes care of garbage collection of the ATerms that are used to represent data as it is sent from one component to another. RMI allows programmers access to Java's Distributed Garbage Collection API. In RMI-IIOP and Java IDL, however, this is not possible, because the underlying CORBA architecture is used, which does not support (distributed) GC, but places this burden entirely on the developer.
– In the ToolBus all data is sent by-value. RMI and RMI-IIOP use both pass-by-value and pass-by-reference, depending on whether the relevant data is serializable (it is a primitive type, or it implements Serializable) or is a remote object. In Java IDL the components abide by IDL prescribed interfaces. Whether a parameter is passed by-value or by-reference is determined by examining the parameter's formal type (i.e. in the IDL signature of the method it is being passed to). If it is a CORBA value type, it is passed by-value. If it is an ordinary CORBA interface type (the "normal" case for all CORBA objects), it is passed by-reference.
– The ToolBus allows components in any language for which a ToolBus adapter exists. Programming languages such as C and Java are supported, but adapters also exist for a wide range of languages and applications, including e.g. Perl, Prolog, MySQL, Tcl and Asf+Sdf. In RMI, only Java components can be connected; in RMI-IIOP the service is implemented in Java, and its functionality is available (client-side) to CORBA clients. The Java IDL framework is fully CORBA compliant.
– Only the ToolBus has coordination support for component interaction. In the three other cases any undesired sequence of incoming and outgoing method calls will have to be prohibited by adding code to the component's internals. Whereas RMI, RMI-IIOP and Java IDL just perform the wiring that connects the components, the ToolBus also provides workflow support. In relation to this workflow support, it would be interesting to compare the ToolBus to related workflow description languages such as the Business Process Modeling Language [35] and the Web Services Description Language [36].

Implications for the ToolBus Approach. To overcome the problems of value-based transmission, we envisage the introduction of channels as sketched in Fig. 10(b). This model is inspired by the second workaround mentioned at the end of Sect. 5.3 and is completely transparent for the user. The idea is to stick to the strict call-by-value transmission model, but to implement the actual value transmission by direct data communication between the sending tool and the receiving tool, thus offloading the ToolBus itself. Via the ToolBus, only an identification of the data value is transmitted between sender and receiver. The downside of this model is that it introduces the need for distributed garbage collection, since a value may be distributed to more than one receiving tool and the sender does not know when all receivers have retrieved their copy. Adding expiration times to values or reference counting at the ToolBus level may solve this problem; a hypothetical sketch of the latter follows.
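As an illustration of the reference-counting option, here is a small hypothetical Java sketch of the bookkeeping the ToolBus could do when it forwards a value identification to several receivers; all names below are our own assumptions, not part of the ToolBus implementation:

import java.util.*;

// Hypothetical ToolBus-side bookkeeping for channel-based transmission:
// the bus forwards only a handle; the sender may discard the value once
// every receiver has fetched its copy directly from the sender.
class HandleTable {
    private final Map<Long, Integer> pending = new HashMap<>();
    private long nextHandle = 0;

    // Called when a value is announced to 'receivers' tools.
    synchronized long announce(int receivers) {
        long h = nextHandle++;
        pending.put(h, receivers);
        return h;
    }

    // Called when one receiver reports that it has fetched the value.
    // Returns true when the last receiver is done, so the sender can be
    // told to release the value.
    synchronized boolean fetched(long handle) {
        int left = pending.get(handle) - 1;
        if (left == 0) {
            pending.remove(handle);
            return true;
        }
        pending.put(handle, left);
        return false;
    }
}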
6 Current Status
The current ToolBus was first specified in Asf+Sdf and was then implemented manually in C. Its primary target was the renovation of the Asf+Sdf Meta-Environment. The next generation ToolBus is being implemented in Java and aims at supporting larger applications such as, for instance, a multi-user game site like www.gamesquare.nl with thousands of users. High performance and recovery of crashed game clients are then of paramount importance. The Java implementation is organized in such a way that the actual implementation of tools is hidden as much as possible. This is achieved by introducing the interface ToolInterface that describes the required ToolBus/tool interaction. This interface can be implemented by a variety of classes (a hypothetical sketch of the interface itself is given at the end of this section):

ClassicToolBusTool: This implements the ToolBus/tool communication as used in current applications. The tool is executed as a separate, operating system level, process and the ToolBus/tool communication is achieved using sockets.

JavaTool: This implements a new model that addresses one of the issues mentioned in Sect. 5: when ToolBus and tool run on the same computer and the tool is written
in Java, then the tool can be loaded dynamically in the executing ToolBus, e.g. using Java threads. In this way, the overhead of interprocess communication can be eliminated.

JavaRMITool: This is a special case where a Java tool runs on another computer.

SOAPTool: This implements communication with a tool that has a SOAP interface.

A prototype implementation is under development that allows experimentation with the features mentioned in this paper.
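The paper does not spell out the shape of ToolInterface; purely as an illustration of the design, a minimal hypothetical version might look as follows (method names and signatures are our guesses, not the actual API):

// Hypothetical sketch of the tool abstraction described above: one
// interface, several transport strategies behind it.
interface ToolInterface {
    void connect() throws Exception;  // start the tool or accept its connection
    void send(String term);           // forward a request (e.g. snd-eval)
    String receive();                 // collect a reply (e.g. rec-value)
    void terminate();                 // end the tool's execution
}

// Classic model: the tool is a separate OS-level process, reached via sockets.
class ClassicToolBusTool implements ToolInterface {
    public void connect() { /* open a socket to the external process */ }
    public void send(String term) { /* write the term on the socket */ }
    public String receive() { /* read a term from the socket */ return ""; }
    public void terminate() { /* close the socket, stop the process */ }
}

// In-process model: the tool runs as a thread inside the ToolBus JVM,
// eliminating interprocess communication.
class JavaTool implements ToolInterface {
    private Thread worker;
    public void connect() { worker = new Thread(() -> { /* run the tool */ }); worker.start(); }
    public void send(String term) { /* hand the term to the tool object directly */ }
    public String receive() { return ""; }
    public void terminate() { worker.interrupt(); }
}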
7 Concluding Remarks
In this paper we have reflected on our experiences over the past years with the use of the ToolBus as a means to refactor a previously monolithic system: the Asf+Sdf Meta-Environment. This real test case of the ToolBus has taught us some of its shortcomings: the data bottleneck that arises when very large data items are sent using pass-by-value, maintenance issues related to undisciplined message passing, and questions such as how to deal with exceptions caused by, e.g., crashing tools. Some of the ideas we presented in this paper could be implemented by changing or extending Tscript (e.g. to implement a call/reply regime as discussed in Sect. 5.1); others will also require extending the ToolBus and the tool adapters (e.g. to detect crashed tools in combination with exception handling as discussed in Sect. 5.2). We have also studied some related ideas and frameworks, and we are now in a position where we have a new prototype of the ToolBus in Java, with a very open structure which allows for all sorts of experiments and case studies based on the experience we have with the existing ToolBus and the ideas presented in this paper.
Acknowledgments We thank Pieter Olivier for his contribution and input to the many interesting and fruitful discussions we have had about ToolBus related issues, and his efforts to get www.gamesquare.nl ToolBus enabled.
References

1. Klint, P.: A meta-environment for generating programming environments. ACM Transactions on Software Engineering and Methodology 2 (1993) 176–201
2. van den Brand, M.G.J., van Deursen, A., Heering, J., de Jong, H.A., de Jonge, M., Kuipers, T., Klint, P., Moonen, L., Olivier, P.A., Scheerder, J., Vinju, J.J., Visser, E., Visser, J.: The ASF+SDF Meta-Environment: a Component-Based Language Development Environment. In Wilhelm, R., ed.: Compiler Construction (CC '01). Volume 2027 of Lecture Notes in Computer Science, Springer-Verlag (2001) 365–370
3. Bakker, H.C.N., Koorn, J.W.C.: Building an editor from existing components: an exercise in software re-use. Technical Report P9312, Programming Research Group, University of Amsterdam (1993)
4. van Vlijmen, S.F.M., Vriend, P.N., van Waveren, A.: Control and data transfer in the distributed editor of the ASF+SDF meta-environment. Technical Report P9415, University of Amsterdam, Programming Research Group (1994)
5. Mauw, S., Veltink, G.J.: A process specification formalism. Fundamenta Informaticae XIII (1990) 85–139
6. Bergstra, J.A., Klint, P.: The ToolBus: a component interconnection architecture. Technical Report P9408, University of Amsterdam, Programming Research Group (1994)
7. Bergstra, J.A., Klint, P.: The ToolBus coordination architecture. In Ciancarini, P., Hankin, C., eds.: Coordination Languages and Models. Volume 1061 of Lecture Notes in Computer Science (1996) 75–88
8. Bergstra, J.A., Klop, J.W.: Process algebra: specification and verification in bisimulation semantics. In Hazewinkel, M., Lenstra, J.K., Meertens, L.G.L.T., eds.: Mathematics & Computer Science II. Volume 4 of CWI Monograph, North-Holland (1986)
9. Olivier, P.A.: Embedded system simulation: testdriving the ToolBus. Technical Report P9601, University of Amsterdam, Programming Research Group (1996)
10. Dams, D., Groote, J.F.: Specification and Implementation of Components of a muCRL toolbox. Logic Group Preprint Series 152, Utrecht University, Dept. of Philosophy (1995)
11. Lisser, B., van Wamel, J.J.: Specification of components in a proposition solver. Technical Report SEN-R9720, Centrum voor Wiskunde en Informatica (CWI) (1997)
12. Diertens, B.: Simulation and animation of process algebra specifications. Technical Report P9713, Programming Research Group, University of Amsterdam (1997)
13. Bergstra, J.A., Klint, P.: The discrete time ToolBus. Technical Report P9502, University of Amsterdam, Programming Research Group (1995)
14. Bergstra, J.A., Klint, P.: The discrete time ToolBus. In Wirsing, M., Nivat, M., eds.: Algebraic Methodology and Software Technology. Volume 1101 of Lecture Notes in Computer Science, Springer-Verlag (1996) 286–305
15. Bergstra, J.A., Klint, P.: The discrete time ToolBus—a software coordination architecture. Science of Computer Programming 31 (1998) 205–229
16. van den Brand, M.G.J., Heering, J., Klint, P.: Renovation of the ASF+SDF meta-environment: current state of affairs. In Sellink, M.P.A., ed.: Proceedings of the 2nd International Workshop on the Theory and Practice of Algebraic Specifications. Electronic Workshops in Computing, Springer-Verlag (1997)
17. van den Brand, M.G.J., de Jong, H.A., Klint, P., Olivier, P.A.: Efficient Annotated Terms. Software, Practice & Experience 30 (2000) 259–291
18. Olivier, P.A.: A Framework for Debugging Heterogeneous Applications. PhD thesis, University of Amsterdam (2000)
19. van den Brand, M.G.J., Moreau, P.E., Vinju, J.J.: Environments for Term Rewriting Engines for Free. In: Rewriting Techniques and Applications. Lecture Notes in Computer Science, Springer-Verlag (2003) To appear.
20. Gelernter, D., Carriero, N.: Coordination languages and their significance. Communications of the ACM 35 (1992) 96
21. de Jong, H.A., Olivier, P.A.: Generation of abstract programming interfaces from syntax definitions. Technical Report SEN-R0212, Centrum voor Wiskunde en Informatica (CWI) (2002). Submitted to Journal of Logic and Algebraic Programming.
22. Mosses, P.D.: System demonstration: Action semantics tools. In van den Brand, M.G.J., Lämmel, R., eds.: Proceedings of the Second Workshop on Language Descriptions, Tools and Applications (LDTA 2002). Volume 65.3 of Electronic Notes in Theoretical Computer Science (2002)
23. van den Brand, M.G.J., Ringeissen, C.: ASF+SDF parsing tools applied to ELAN. In Futatsugi, K., ed.: Third International Workshop on Rewriting Logic and its Applications (WRLA'2000). Volume 36 of Electronic Notes in Theoretical Computer Science, Elsevier Science Publishers (2001)
24. Blom, S.C.C., Fokkink, W.J., Groote, J.F., van Langevelde, I., Lisser, B., van de Pol, J.C.: µCRL: A toolset for analysing algebraic specifications. In Berry, G., Comon, H., Finkel, A., eds.: Proc. of CAV 2001. Volume 2102 of LNCS, Springer (2001) 250–254
25. Brinksma, E.: On the Design of Extended LOTOS—A Specification Language for Open Distributed Systems. PhD thesis, University of Twente (1988)
26. Diertens, B.: New features in PSF I—Interrupts, Disrupts, and Priorities. Technical Report P9417, Programming Research Group, University of Amsterdam (1994)
27. Elnozahy, E.N.M., Alvisi, L., Wang, Y., Johnson, D.B.: A survey of rollback-recovery protocols in message-passing systems. ACM Computing Surveys (CSUR) 34 (2002) 375–408
28. Randell, B.: System structures for software fault tolerance. IEEE Transactions on Software Engineering SE–1 (1975) 220–232
29. Klint, P.: A Study in String Processing Languages. Volume 205 of Lecture Notes in Computer Science. Springer-Verlag (1985)
30. Zorzo, A., Romanovsky, A., Xu, J., Randell, B., Stroud, R., Welch, I.: Using coordinated atomic actions to design dependable distributed object systems. In: OOPSLA'97 Workshop on Dependable Distributed Object Systems (1997)
31. Bergstra, J.A., Ponse, A., van Wamel, J.: Process algebra with backtracking. In: REX School/Symposium (1993) 46–91
32. Object Management Group (OMG): The Common Object Request Broker: Architecture and Specification (1999). http://www.omg.org/technology/documents/formal/corba_2.htm.
33. Sun Microsystems Inc.: Java Remote Method Invocation Specification (2003). http://java.sun.com/j2se/1.4/docs/guide/rmi.
34. Object Management Group: CORBA IIOP Specification (2003). www.omg.org/technology/documents/formal/corba_iiop.htm.
35. Business Process Management Initiative: Business Process Modeling Language (2002). http://www.bpmi.org/bpmi-downloads/BPML1.0.zip.
36. W3C: World Wide Web Consortium: Web Services Description Language (2001). http://www.w3.org/TR/wsdl.
High-Level Specifications: Lessons from Industry

Brannon Batson (Intel Corporation) and Leslie Lamport (Microsoft Research)
Abstract. We explain the rationale behind the design of the TLA+ specification language, and we describe our experience using it and the TLC model checker in industrial applications, including the verification of multiprocessor memory designs at Intel. Based on this experience, we challenge some conventional wisdom about high-level specifications.
1 Introduction
The first author is a computer architect with a master’s degree in electrical engineering. His work focuses on designing, implementing, and validating multiprocessor cache-coherence protocols. He has worked on TLA+ formal specifications for the cache-coherence protocols of two Digital/Compaq Alpha multiprocessors, and he is currently using TLA+ to model protocols on future Intel products. The second author is a computer science researcher who began verifying concurrent algorithms over 25 years ago [12]. About ten years ago, he devised TLA, a logic for reasoning about concurrent algorithms [15]. He later designed TLA+, a complete high-level specification language based on TLA [17]. The two authors view formal verification and TLA+ from two different, complementary vantage points. In this paper, we try to synthesize our two views to explore the rationale behind TLA+, describe our experience using it in industry, and derive some lessons from this experience. When discussing our individual experiences, we refer to the first and second authors as BB and LL, respectively. We begin by describing TLA+ and TLC, the TLA+ model checker. We then describe how TLA+ has been used at Digital/Compaq and at Intel. We next explore how our experience contradicts some conventional wisdom about specification, and we end with some simple conclusions.
2 TLA+

2.1 Desiderata
TLA+ is a high-level language for describing systems – especially asynchronous concurrent and distributed systems. It was designed to be simple, to be very expressive, and to permit a direct formalization of traditional assertional reasoning – the style of reasoning begun by Floyd [5] and Hoare [9] and extended to concurrent programs by Ashcroft [2], Owicki and Gries [21], Pnueli [24], and
others [3,11,12,22]. Making it easy, or even possible, to build tools was not a design criterion for the language. The desire to formalize assertional reasoning, especially for liveness properties, led LL to base TLA+ on TLA (the Temporal Logic of Actions) [15], a simple variant of linear-time temporal logic [24]. To be practical, a temporal logic must be based on an expressive language for writing elementary, non-temporal expressions. The desire for simplicity and expressiveness led to the use of ordinary first-order logic and set theory for this underlying language of expressions.
2.2 From Math to TLA+
First-order logic and set theory provide a formalization of ordinary mathematics. TLA adds to them modalities for expressing temporal properties. Temporal modalities are useful for describing liveness (eventuality) properties. However, temporal logic, like any modal logic, is more complicated than ordinary math. TLA was therefore designed to put most of the complexity, both in describing a system and in reasoning about it, into the realm of ordinary math rather than into temporal logic.

LL originally assumed that first-order logic and set theory extended with the temporal operators of TLA would provide the semantic basis for a language, but that a practical language would require conventional programming-language constructs such as assignment statements. However, not wanting to introduce unnecessary constructs, he decided to begin writing formal specifications using only mathematics, and to add other constructs as needed. To his surprise, he discovered that he did not need those conventional constructs. Instead, he added to TLA+ only the following extensions to ordinary mathematics:

Unambiguous Syntax. A formal language must be unambiguous, meaning that it must be possible for a program to parse it. This led to eliminating from TLA+ two common practices of mathematicians: using juxtaposition as an operator and the overloading of operators. Mathematicians write the product of x and y as xy; in TLA+ it is written x ∗ y. (One could require a space between x and y to distinguish this product from the single variable xy, but that would make parsing difficult.) Mathematicians frequently overload operators—for example, f⁻¹ could mean either the inverse of f or f raised to the power −1. There is no overloading of operators in TLA+. (The use of types can make some instances of overloading unambiguous; but for reasons explained below, TLA+ is untyped.)

New Constructs. TLA+ borrows a few useful constructs from computer science—for example, allowing if/then expressions like

  if x ≠ 0 then 1/x else 0

Also, mathematicians have no notation for explicitly describing a function—for example, the function whose domain is the set of reals and that maps every number to its negative. In TLA+, this function is written

  [x ∈ Real ↦ −x]
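In the ASCII source syntax of TLA+, these constructs might be typed as follows. This is an illustrative sketch, not an excerpt from any specification discussed here; the name Recip is invented, and Real is assumed to come from the standard Reals module:

  Recip == [x \in Real |-> IF x # 0 THEN 1/x ELSE 0]

Here == introduces a definition, \in and |-> are the ASCII forms of ∈ and ↦, and # is one of the ASCII forms of ≠.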
(A computer scientist might write this as a λ expression, but TLA+ avoids the keyword lambda because of its potentially confusing connotations.)

Definitions. Mathematicians have no standard convention for defining operators. They typically write something like “let ◦ be defined by letting a ◦ b equal . . . , for any a and b.” In TLA+, one writes:

  a ◦ b ≜ . . .

Support for Large Specifications. Mathematicians typically introduce new variables informally as needed. This casual introduction of variables could lead to errors in large specifications, so TLA+ requires that variables be declared before they are used. Moreover, mathematicians will write “let x be an element of S” even though two pages earlier they had defined x to have some other meaning. Formalizing this requires some method of restricting the scope of a declaration. TLA+ does this through the use of modules, which provide a mechanism for structuring large specifications. The mathematical operation of substituting expressions for variables is expressed in TLA+ by instantiating a module with expressions substituted for its declared variables.

Support for Large Formulas. For a mathematician, a 20-line formula is large. In a specification, a 200-line formula is not unusual. To aid in writing long formulas, TLA+ allows bulleted lists of conjuncts and disjuncts, using indentation to eliminate parentheses [14]. For example,

  ∧ reqQ[p][i].type = “MB”
  ∧ ∨ DirOpInProgress(p, reqQ[p][i].adr)
    ∨ reqQ[p][j].adr ≠ reqQ[p][i].adr

means

  (reqQ[p][i].type = “MB”) ∧ ((DirOpInProgress(p, reqQ[p][i].adr)) ∨ (reqQ[p][j].adr ≠ reqQ[p][i].adr))

TLA+ also has a let/in construct for making definitions local to an expression. This permits structuring an expression for easier reading as well as combining multiple instances of the same subexpression.

TLA. TLA+ extends ordinary math by adding the modal operators of TLA. The most important of these is prime (′), where priming an expression makes it refer to the value of the expression in the next state. For example, x′ = x + 1 is an action, a predicate on a pair of states (called the current and next state), that is true iff the value of x in the next state is one greater than its value in the current state. Although formally a modal operator, expressions with primes obey the rules of ordinary mathematics, where x′ is treated like a new variable unrelated to the variable x. TLA also has a few simple temporal operators, used mostly for expressing liveness properties. As we will see, these operators appear in only a small part of a specification.

We have listed the ways in which TLA+ differs from math as used by ordinary mathematicians. Where practical, TLA+ maintains the richness and economy of ordinary mathematical notation.
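As a further hedged sketch (the names v, w, and UpdatePair are invented, and Append is assumed to come from the standard Sequences module), an action combining primes with a let/in expression might be typed in ASCII as:

  UpdatePair ==
    LET newV == v + 1            \* local definitions
        newW == Append(w, v)
    IN  /\ v' = newV             \* new value of v
        /\ w' = newW             \* new value of w

where /\ is the ASCII form of ∧, and a list of conjuncts aligned on /\ is read as a bulleted list.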
inline AppendNum(n) {
  i = 0;
  do
  :: i < MaxSeqLen && seq[i] != 0 && seq[i] != n -> i++
  :: else -> break
  od;
  if
  :: i >= MaxSeqLen || seq[i] != 0
  :: else -> seq[i] = n
  fi
}

Fig. 1. Part of a toy specification written in Promela.

AppendNum(n) ≜ ∧ ∀ i ∈ 1 . . Len(seq) : n ≠ seq[i]
               ∧ seq′ = Append(seq, n)
               ∧ num′ = num

Fig. 2. The TLA+ version of the piece of specification in Figure 1.
For example, while a textbook on first-order logic typically defines only a simple quantified formula such as ∃ x : exp, mathematicians typically write formulas like:

  ∃ x, y ∈ S, z, w ∈ T : exp

TLA+ allows this kind of richer syntax. On the other hand, mathematicians do not use extraneous keywords or punctuation. TLA+ maintains this simplicity of syntax; for example, successive statements are not separated by punctuation. This syntactic economy makes TLA+ specifications easy for people to read, but surprisingly hard for a program to parse.
2.3 A Brief Taste of TLA+
To provide a sense of how a specification written in TLA+ compares to one written in a typical specification language used in industry, Figures 1 and 2 give small parts of two versions of a toy specification. The first version is written in Promela, the input language of the Spin model checker [10]. (It was written by Gerard Holzmann, the designer of the language.) The second is written in TLA+. The figures contain corresponding parts of the two specifications, although those parts are not completely equivalent. (One significant difference is mentioned in Section 3 below.) Figure 3 shows a small part of a TLA+ specification of a real cache-coherence protocol for a multiprocessor computer. Observe that it looks very much like the piece of toy specification of Figure 2; it is just a little more complicated. In both examples, the TLA+ specification uses only simple mathematics.
∧ req.type = “MB”
∧ ∀ i ∈ 1 . . (idx − 1) :
    ∧ reqQ[p][i].type ≠ “MB”
    ∧ DirOpInProgress(p, reqQ[p][i].adr)
    ∧ ∀ j ∈ 1 . . (i − 1) : reqQ[p][j].adr ≠ reqQ[p][i].adr
∧ ¬ ∃ m ∈ msgsInTransit :
    ∧ m.type ∈ {“Comsig”, “GetShared”, “GetExclusive”, “ChangeToExclusive”}
    ∧ m.cmdr = p

Fig. 3. A small piece of a real TLA+ specification.
Crucial to the simplicity of TLA+ is that it is based on the ordinary mathematics used by ordinary mathematicians. Computer scientists have devised many forms of weird mathematics. They have introduced bizarre concepts such as “nondeterministic functions”, leading to strange formalisms in which the formula A = A is not necessarily true. Ordinary mathematics was formalized about a century ago in terms of first-order logic and (untyped) set theory. This is the formal mathematics on which TLA+ is based. The use of ordinary mathematics in TLA+ led BB to remark: if I want to find out what an expression in a TLA+ specification means, I can just look it up in a math book.

Computer scientists and engineers, accustomed to computer languages, are likely to be surprised by the expressiveness of TLA+. BB describes the power of TLA+ in this way:

  A single line of TLA+ can do powerful manipulations of complex data structures. This allows me to focus on the algorithm without getting lost in bookkeeping tasks. TLA+ tops even perl in this regard. Unlike perl, however, TLA+ is unambiguous.

The simplicity and expressiveness of TLA+ is not the result of any cleverness in the design of the language; it comes from two thousand years of mathematical development, as formalized by great mathematicians like Hilbert. TLA+ is described in a recent book by LL, which is available on the Web [17].
2.4 The Structure of a TLA+ Specification
TLA+ does not enforce any particular way of structuring a specification, allowing declarations and definitions to appear in any order as long as every identifier is declared or defined before it is used. Moreover, by partitioning it into separate modules, one can structure a specification to allow reading higher-level definitions before reading the lower-level definitions on which they depend. Logically, most TLA+ specifications consist of the following sections:

Declarations. The declarations of the constant parameters and the variables that describe the system. There are typically a dozen or so declared identifiers.
Definitions of Operations on Data Structures. These define mathematical operators used to describe operations specific to the particular system. For example, a specification of a system that can perform a masked store to a register might define MaskedStoreResult(curr, val, msk) to be the new value of a register, whose current value is curr, after storing to it a value val through a mask msk. A specification usually has only a few such operator definitions, each just a few lines long. The operator MaskedStoreResult is more likely to be used only in the definition that describes the masked store action, in which case it would probably be defined locally in a let/in expression.

The Initial Predicate. The definition of a formula that describes the possible initial values of the variables. It is a conjunction of formulas x = . . . or x ∈ . . . for each variable x, where the “. . .” is usually a simple constant expression.

The Next-State Action. The definition of a formula containing primed and unprimed variables that describes the system’s possible next state as a function of its current state. It is generally defined as a disjunction of subactions, each describing one type of system step. For example, in the specification of a mutual-exclusion algorithm, the next-state action might have a disjunct ∃ p ∈ Proc : EnterCS(p), where EnterCS(p) describes a step in which a process p enters its critical section. The definition of the next-state action comprises the bulk of the specification.

Liveness. The definition of a temporal formula specifying the liveness properties of the system, usually in terms of fairness conditions on subactions of the next-state action. It typically consists of about a dozen lines.

The Specification. This is the one-line definition

  Spec ≜ Init ∧ □[Next]_v ∧ Liveness

that defines Spec to be the actual specification, where Init is the initial predicate, Next is the next-state action, Liveness is the liveness formula, and v is the tuple of all variables. For a high-level specification that describes a system’s correctness properties, Spec is the inner specification in which internal variables are visible. The true specification would be obtained by hiding those internal variables. If h is the tuple of all internal variables, then Spec with those variables hidden is represented by the TLA formula ∃∃ h : Spec. For technical reasons, TLA+ requires that this formula be defined in a separate module from the one defining Spec. However, hiding the internal variables is done for philosophical correctness only. The TLC model checker cannot handle the TLA hiding operator ∃∃, and in practice, one uses the inner specification Spec.

The only temporal operators that appear in the entire specification are the ones in the liveness formula and the single □ in the final specification. The rest of the specification—usually about 99% of it—consists entirely of ordinary math,
with no temporal operators. Moreover, engineers generally do not use the liveness property. The complexity of model checking liveness properties is inherently much greater than that of checking safety properties, which means that liveness can be checked only for extremely small models of real systems. Engineers therefore usually do not even write the liveness property; instead their specification is

  Spec ≜ Init ∧ □[Next]_v

The only temporal operator in this specification is the single □.
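To make the shape of such a specification concrete, here is a minimal sketch of a complete safety-only module in ASCII syntax. It is an invented toy, not one of the specifications discussed in this paper:

  ---------------------- MODULE ToyCounter ----------------------
  EXTENDS Naturals
  VARIABLE x
  v == <<x>>                     \* the tuple of all variables

  Init == x = 0                  \* the initial predicate

  Incr == x' = x + 1             \* one disjunct of the next-state action
  Reset == /\ x > 5
           /\ x' = 0
  Next == Incr \/ Reset          \* the next-state action

  Spec == Init /\ [][Next]_v     \* the safety-only specification
  ================================================================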
2.5 The Size of Specifications
In principle, one could write specifications of any size, from tiny toy examples to million-line monsters. But tiny toys are of no practical interest, and the module structure of TLA+ is probably inadequate for handling the complexity of specifications longer than ten or twenty thousand lines. We have identified the following three classes of applications for which TLA+ is useful, each with a surprisingly narrow range of specification sizes:

Abstract Algorithms. These are the types of concurrent algorithms that are published in journals—for example, the Disk Paxos algorithm [6]. Their specifications seem to require a few hundred lines of TLA+. Interesting algorithms simple enough to have much shorter specifications seem to be rare, while an algorithm with a much longer specification is probably not abstract enough for journal publication.

Correctness Properties. These are descriptions of the properties that protocols or systems should satisfy. One example is a description of the memory model that a cache-coherence protocol is supposed to implement [20]. Their specifications also seem to require a few hundred lines of TLA+. The requirements of a real system are seldom simple enough to have a shorter specification, while a statement of correctness requiring a much longer specification would probably be too complicated to be useful.

High-Level Protocol or System Designs. These describe the high-level designs of actual systems or protocols. We know of no published example; such designs are usually proprietary. We have found that these specifications are about two thousand lines of TLA+. Any such specification is an abstraction of the actual lower-level implementation. Engineers want to describe their design in as much detail as they can. However, if the specification takes much more than two thousand lines, then the design is too complicated to understand in its entirety, and a higher-level abstraction is needed.
3 TLC: The TLA+ Model Checker
TLA+ was not designed with tools in mind; LL believed that a practical model checker for it was impossible and advised against trying to write one. Fortunately, Yuan Yu ignored him and wrote TLC, an explicit-state model checker for TLA+ programmed in Java [26].
TLA+ is an extremely expressive language—for example, it can easily be used to specify a program that accepts an arbitrary Turing machine as input and tells whether or not it will halt. No model checker can handle all TLA+ specifications. TLC handles a subset of TLA+ that seems to include most specifications of algorithms and correctness properties, as well as all the specifications of protocol and system designs that engineers actually write. Those few specifications arising in practice that TLC does not handle can be easily modified, usually by changing only a few lines, so they can be checked by TLC.

Explicit-state model checking is possible only for bounded-state specifications. Most high-level specifications are not bounded-state because the state contains data structures such as unbounded queues. We want engineers to use the TLA+ specification as the official high-level specification of their system, and a major goal of TLC is that the specification should not have to be changed to allow it to be checked. So TLC accepts as input a TLA+ specification and a configuration file that defines a finite model. The configuration file instantiates the constant parameters of the specification—for example, instructing TLC to replace the parameter Proc that represents the set of processors with a set containing three elements. The configuration file can also specify a constraint on the state space, instructing TLC to explore only states satisfying the constraint.

As an example, we return to the toy specification, part of which is specified in Figures 1 and 2. In that specification, the variable seq represents a queue that can grow arbitrarily large. To model check it with TLC, we write a simple module that imports the original specification, declares a constant MaxSeqLen, and defines a constraint asserting that the length of seq is at most MaxSeqLen. We then instruct TLC to check the specification using that constraint, substituting a specific value for MaxSeqLen. We do this for increasing values of MaxSeqLen until we are confident enough that the specification is correct, or until the space of reachable states becomes so large that it takes TLC too long to explore it. In contrast, note in Figure 1 that, to model check the specification with Spin, the parameter MaxSeqLen had to be made part of the actual Promela specification.

Operations on many common data types are not built into TLA+, but are instead defined in standard modules. For example, the natural numbers are defined in the Naturals module to be an arbitrary set satisfying Peano’s axioms, and arithmetic operations are defined in the usual way in terms of the next-number function. A specification that uses operators like + on natural numbers imports the Naturals module. It would be rather difficult for TLC to compute 2 + 2 from the definition of + in that module. Instead, such arithmetic operations are programmed in Java using TLC’s general module overriding mechanism. When a specification imports a module named M, TLC looks for a Java class file named M.class. If it finds one, it replaces operators defined in M with their Java implementations in M.class. There are Java implementations for common operators on numbers, sequences (lists), finite sets, and bags (multisets) that are defined in the standard modules. Ordinary users can also write their own Java class files to provide more efficient implementations of the operators that they define.
However, we know of no case where this was necessary, and we know of only one user (a researcher) who has done it.
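As a hedged sketch of how such a bounding module and configuration file might look (the module and identifier names here are invented, not taken from the toy example's actual source), one might write:

  ---------------------- MODULE MCToySpec ----------------------
  EXTENDS ToySpec, Sequences   \* ToySpec is the unbounded specification
  CONSTANT MaxSeqLen
  SeqBound == Len(seq) <= MaxSeqLen   \* state constraint for TLC
  ===============================================================

  \* MCToySpec.cfg, the TLC configuration file:
  SPECIFICATION Spec
  CONSTANT MaxSeqLen = 3
  CONSTRAINT SeqBound
  INVARIANT TypeOK

TLC then explores only states in which seq has length at most 3; rerunning with larger values of MaxSeqLen increases coverage. (TypeOK is assumed to be a type-correctness invariant defined in ToySpec.)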
TLC has a primitive command-line interface. Debugging is done by adding print statements to the specification. Although this violates the principle of not having to modify the specification to check it, the print statements are usually removed as soon as initial debugging is completed and simple “coding” errors corrected. Design errors are generally found by examining error traces. We hope eventually to add a graphical user interface.

TLC is coded in Java. It is a multithreaded program and there is a version that can use multiple machines. For checking safety properties, it obtains close to an n-fold speedup with n processors when run on Alphas using a high quality Java runtime. However, we have found that the poor implementation of multithreading in many Java runtimes can significantly reduce the speedup. The largest case we know of was one with 900 million reachable states that took about two weeks on a four-processor machine.

The expressiveness of TLA+ makes it essentially impossible to compile TLA+ specifications into efficient code. Therefore, TLC must interpret specifications. We guess that this makes TLC about ten times slower than explicit-state model checkers that require specifications to be written in a low-level, compilable language. Because TLC maintains its data structures on disk, it has essentially no space limitations for checking safety properties.

The goal of TLC is to help engineers find bugs in their designs. Experience tells an engineer what kind of bugs a particular finite model is and is not likely to find. For example, if a cache-coherence protocol handles different addresses independently, then it may suffice to check models with only a single memory address. A specification is an abstraction, and checking it can find only errors that are present in that abstraction. Lower levels of detail introduce many other sources of error not reflected in the specification. In most industrial applications, engineers need cost-effective methods of finding bugs; they are not seeking perfection.
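For instance, a print statement can be conjoined to an action while debugging. This is an invented sketch using the Print operator from the standard TLC module; Incr and x are hypothetical names:

  EXTENDS TLC
  Incr == /\ Print(<<"firing Incr, x =", x>>, TRUE)   \* temporary debug output
          /\ x' = x + 1

Print(out, val) prints out and then evaluates to val, so conjoining Print(. . ., TRUE) leaves the meaning of the action unchanged.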
4 TLA+ Use in Industry
4.1 Digital/Compaq/HP

TLA+ and TLC were conceived at the Digital (later Compaq) Systems Research Center. The first serious industrial use of TLA+ was for specifying and writing part of a hand proof of the cache-coherence protocol of a multiprocessor codenamed Wildfire, based on the Alpha EV6 processor [7]. The Wildfire experience inspired Yuan Yu to write TLC. It also persuaded the verification team to write a TLA+ specification of the cache-coherence protocol of the next generation Alpha, the EV7. The initial specification viewed an entire processor chip as a single component; the level of abstraction was later lowered to model protocol interactions between on-chip components as well. TLC checked important properties of the protocol and helped find several bugs. TLC and the EV7 protocol specification were also used as the basis of a project to improve test coverage for hardware simulation [25]. The processor design team for the next Alpha processor, the EV8, began using TLA+ to write the official specification of its cache-coherence protocol. However, development of that processor was cancelled.
TLA+ and TLC were also applied by Compaq engineers to the cache-coherence protocols of two Itanium-based processors. Researchers used TLC to debug a bus protocol proposal and to help develop database recovery and cache management protocols. TLC is now used routinely by some researchers to check the concurrent algorithms that they develop. The use of TLA+ and TLC at Digital and Compaq, some of which continued at HP, is described in [18].
4.2 Intel
We now describe the use of TLA+ and TLC by BB and his colleagues at Intel. The actual specifications are proprietary and have not been viewed by anyone outside Intel.

Overview of the Problem. Designing a complex system starts with a problem statement and an appropriate set of boundary conditions. A component of a computer system is initially represented abstractly as a black box, with assertions about its functionality and with some guidelines on performance, cost, and complexity. The engineering process involves iteratively refining this abstract model into lower-level models. Each lower-level model is a representation of the design at a certain level of abstraction, and it has a specific purpose. Some models are meant to evaluate tradeoffs between scope, performance, cost, and complexity. Others carry the design down to the low level of detail needed to manufacture the component.

The engineering process therefore creates multiple representations of a design. Validation entails checking these multiple representations against one another. Designers of digital systems have good tools and methods for validating mid-level functional models, written in a hardware description language (HDL) like VHDL or RTL, against lower-level models such as circuit net-lists. However, they have not had as much success checking the higher-level functional representations of the design against one another, and against the initial problem statement and functional assertions.

For some components, there is an intuitive correlation between the high-level notions of correctness and the HDL model; such components tend not to be difficult to validate. Other components, like multiprocessor cache-coherence protocols, are sufficiently complex that checking the HDL model against the problem statement is quite challenging. We need formal techniques from the world of mathematics to perform this high-level validation.

Although formal methods are based on mathematics, engineers view them differently from the way mathematicians do. To engineers, formal verification is simply another imperfect validation tool (albeit a powerful one). A TLA+ specification is only an abstraction of the actual system, and model checking can usually validate the specification only for a highly restricted set of system parameters. Validating the specification therefore cannot guarantee that there are no errors in the system. For engineers, formal verification is a way of finding bugs, not of proving correctness.
The main benefit of applying TLA+ to engineering problems comes from the efficiency of the TLC model checker in reaching high levels of coverage and finding bugs. A secondary benefit we have encountered is the ability of TLA+ and TLC to provide good metrics for the complexity of a design. Complexity is a major consideration in evaluating design tradeoffs. However, unlike performance or cost, engineers have not historically had a good way to quantify algorithmic complexity before attempting to validate a design. TLA+ encourages designers to specify the design abstractly, suppressing lower-level details, so the length of the specification provides a measure of a design’s complexity. TLC reports the size of the reachable state space, providing another measure of complexity. Experience and intuition will always have a place in evaluating complexity, but TLA+ and TLC provide robust and impartial input to the evaluation. Having this input early in the design process is of considerable value.
Designing with TLA+. The core group at Intel started using TLA+ at Compaq while working on the Alpha EV7 and EV8 multiprocessor projects described above. From that experience, the Alpha engineers learned that multiprocessor cache-coherence protocols are an ideal candidate for formal methods because most of the protocol bugs can be found at a high level of abstraction. They also learned that the true value of TLA+ and TLC would be realized when (a) they were applied early enough in the design to provide implementation feedback, and (b) the implementation was based directly on the specification that had been verified. On the EV8 project, the TLA+ specification was completed before the design was stable, and it provided important feedback to the designers. When the engineers from the Alpha group joined Intel, they began applying their experience in writing TLA+ specifications when collaborating with other Intel engineers on cache-coherence protocols for future Intel products. Intel engineers are now using TLA+ as an integral part of the design process for the protocols that are under study.

Whiteboard Phase. Designing one cache-coherence protocol from scratch provided the engineers with the opportunity to evaluate TLA+ as a prototyping platform for complex algorithms. Work on this protocol started by exploring the design space on a whiteboard for about two months. In this phase, basic message sequencing was determined, as were some coarse notions of what state had to be recorded at the protocol endpoints. A basic direction was set, based on the guidelines for functionality, performance, and cost.

Because of their background, engineers tend to visualize an algorithm in terms of a particular implementation. They are better at gauging implementation complexity than at measuring algorithmic complexity. One benefit of having engineers write formal specifications is that it helps them learn how to think about a protocol abstractly, independent of implementation details. We found that, even in the whiteboard phase of the protocol design, the Intel engineers were able to make some judgments on complexity by asking themselves, “How would I code this in TLA+?”.
The whiteboard phase produced a general understanding of the protocol philosophy, an understanding of the constraints placed on the communication medium, the basic message flows, and coarse ideas on what state needed to be maintained. The next step was to introduce the rigor of a formal specification.

TLA+ Scratchpad Phase. The TLA+ scratchpad phase of the project involved formally describing the abstract system, with appropriate state variables representing high-level components. This phase took about two months, starting with the initial design of the protocol. The difficulty lay not in the use of TLA+—engineers frequently learn new programming languages—but rather in (a) determining the layer of abstraction and (b) exploring the protocol’s corner cases. Task (a) is where TLA+ forces engineers to think about the protocol abstractly, which they often find unnatural. Their ability to think abstractly improves with experience writing TLA+ specifications. Task (b) is inevitable when documenting a protocol formally, as it forces the designers to explore the corner cases. During the scratchpad phase, the designers had to return to the whiteboard a few times when they encountered new race cases while writing the specification.

The actions that formed the major blocks of the specification were chosen early; very few changes were made later. The Intel engineers adopted a methodology used in the earlier Alpha specifications, in which the decomposition of high-level named actions is based on classifying the protocol messages that they process. This methodology has led to fairly readable specifications, since it means that each action changes only a few local state variables. It encouraged the protocol specifications to be designed in a modular way, which also enabled the inter-module interfaces in the specification to be similar to their low-level counterparts in the implementation.

Running TLC. The initial week or so of running TLC was spent finding and fixing typographical errors and type mismatch problems. This time could probably have been shortened by doing more syntax checking when writing the specification, which is what one often does when programming. The next four weeks saw a continuous process of running TLC, finding bugs, fixing them, and re-running TLC. During this phase of the project, many assumptions and assertions about the protocol were brought into question. This had the effect of educating the engineers about the protocol they had designed. We have found that TLC can be a useful learning tool if we use in-line assertions and global invariants to check everything we think is true. The Intel engineers were able to develop an intuitive understanding of the correctness of the protocol by developing meaningful global invariants and having TLC check them. If an assertion or invariant fails, TLC generates a counterexample that is useful for visualizing a difficult race case. These counterexamples are such a powerful teaching aid that the Intel engineers have developed tools to translate the TLC output into nicely formatted protocol flow diagrams that are easier to read.

Another useful feature of the TLC model checker is its coverage checking. TLC can print the number of times each action was “executed”. This provides a simple way to identify holes in coverage. Much of the effort expended by the engineers in debugging the specification was spent eliminating each of these holes, or convincing themselves that it represented an action that could never happen.
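As an illustration of this style of checking (a hedged sketch with invented names, not the Intel specification), a global invariant and an in-line assertion might look like:

  EXTENDS TLC   \* the standard TLC module defines Assert

  CoherenceInv ==   \* a global invariant, checked by TLC in every reachable state
    \A p, q \in Proc, a \in Adr :
        (cache[p][a] = "Exclusive" /\ cache[q][a] = "Exclusive") => (p = q)

  GrantExclusive(p, a) ==
    /\ Assert(dir[a].pending, "grant issued with no pending request")
    /\ dir' = [dir EXCEPT ![a].owner = p]
    /\ UNCHANGED cache

One lists CoherenceInv as an INVARIANT in the TLC configuration file; if it fails, TLC prints a counterexample trace leading to the violating state.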
The performance of the model checker was sufficient to debug a large protocol specification. The engineers determined a base configuration that would “execute” all the actions and that displayed all interesting known cases. This configuration could be run on a four-processor machine in about a day, enabling fast turn-around on bug fixes. Larger configurations were periodically run as sanity checks on the smaller ones. The engineers would also run TLC in simulation mode, which randomly and non-exhaustively explores the state space, allowing them to check much larger configurations. Such random simulations are similar to the ones engineers typically perform on lower-level models, but they have the advantage of being several orders of magnitude faster, because they are based on the abstract TLA+ model, and they provide a robust metric for coverage.

Optimizing with TLC. Once the initial protocol specification was successfully checked by TLC, the Intel engineers were able to use it as a test bed for exploring optimizations. TLA+ is an ideal language to explore changes because its expressiveness usually allows the new version to be written quickly. Model checking the modified specification with TLC not only checks functional correctness, but it also measures any increase in the state space. Such an increase implies additional algorithmic complexity. The engineers spent several months exploring additions to the protocol, testing them with TLC. As a general rule, they would consider adopting only those optimizations that did not appreciably expand the state space. The insight that TLA+ and TLC gave into the complexity of modifications to the protocol was invaluable in iterating towards an optimal solution that adequately weighed algorithmic complexity along with factors like cost and performance.

A significant optimization was later made to the protocol. This optimization followed the normal design cycle described above, though on a compressed schedule. With the original design yielding a good starting point, the entire cycle (whiteboard phase, TLA+ coding, and verification with TLC) was done within six weeks. This modification was accomplished by a recent college graduate with an undergraduate degree in engineering. He was able to learn TLA+ well enough within a matter of weeks to do this work.

Feedback on TLA+ Syntax. The feedback we have received from engineers about the TLA+ language has been mostly positive. Engineers are usually able to pick up and understand a specification within a couple of days. One mistake we made was to present TLA+ to hardware designers as similar to a programming language. This led to some frustration. A better approach seems to be to describe TLA+ as being like a hardware description language. Engineers who design digital systems are well acquainted with methods for specifying finite-state machines, with the associated restrictions of allowing a primed variable to be assigned a value only once within a conjunction, not allowing a primed variable to appear in a conjunction before the assignment of its value, etc. To an engineer, TLA+ looks like a language for specifying finite-state machines.
While writing the protocol specification at Intel, BB was impressed by the ease of specifying complex data structures in TLA+ as sets and tuples. The part of the specification that described and manipulated data structures was a small part of the complete protocol specification. This compact specification of “bookkeeping tasks”, along with the overall expressiveness of TLA+, won over the engineers who were accustomed to using more clumsy functional languages for specifying complex algorithms.

For the algorithmic specification, TLA+ naturally encourages nested disjunctions of conjunctions (known to engineers as sums of products of expressions). This method for specifying Boolean formulas has both advantages and disadvantages. One advantage is that it allows expressive comment blocks and assertions to be inserted in-line with a nested conjunct. A disadvantage is that this tends to lead to large specifications. The engineers are experimenting with the use of TLA+ operators to encode large blocks of regular Boolean disjunctions as truth tables, which engineers find more natural to work with.
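As a hedged sketch of that idea (the table and names are invented for illustration, not Intel's actual encoding), a Boolean control function can be written as an explicit set of rows:

  GrantRows ==                       \* rows of the form <<state, requestPending, output>>
    { <<"Idle",    FALSE, TRUE >>,
      <<"Pending", TRUE,  TRUE >>,
      <<"Pending", FALSE, FALSE>>,
      <<"Busy",    TRUE,  FALSE>> }

  Grant(state, requestPending) ==
    \E row \in GrantRows : /\ row[1] = state
                           /\ row[2] = requestPending
                           /\ row[3]

Grant(state, requestPending) is true exactly when some row matches the inputs and its output column is TRUE, so the definition reads like a truth table rather than a nested disjunction of conjunctions.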
5 Some Common Wisdom Examined
Based on our experience using TLA+, we now examine the following popular concepts from the world of programming: types, information hiding, object-oriented languages, component-based/compositional specifications, hierarchical description/decomposition, and hierarchical verification. Most of these concepts were mentioned in this symposium’s call for papers. We do not question the usefulness of these concepts for writing programs. But high-level specifications are not programs. We find that in the realm of high-level specifications, these ideas are not as wonderful as they appear.
5.1 Types
Very simple type systems are very restrictive. Anyone who has programmed in Pascal has written programs that were obviously type-correct, but which were not allowed by Pascal’s simple type system. Moderately complicated type systems are moderately restrictive. A popular type system is that of higher-order logic [8]. However, it does not allow subtyping. With such a type system, an integer cannot be a real number. When writing Fortran programs, one gets used to 1.0 being unequal to 1. One should not have to put up with that kind of complication in a specification language.

Subtyping is provided by predicate subtyping, perhaps best known through its use in the PVS verification system [23]. We will see below a problem with PVS’s predicate subtyping. Moreover, predicate subtyping is not simple. It has led to several bugs that caused PVS to be unsound. For a typed language to be as expressive as TLA+, it will need an extremely complicated type system, such as that of Nuprl [4]. Engineers have a hard enough task dealing with the complexity of the systems that they design; they don’t want to have to master a complicated type system too.
A specification consists of a large number of definitions, including many local ones in let/in expressions. Although an operator may have a simple type, it is often hard or impossible to declare the types of the operators defined locally within its definition. Even when those type declarations are possible, LL has found that they clutter a specification and make it harder to read. (Early precursors of TLA+ did include type declarations.) Any information contained in a type declaration that is helpful to the reader can be put in a comment.

The main virtue of types, which makes us happy to bear the inconvenience they cause when writing programs, is that they catch errors automatically. (That advantage disappears in type systems with predicate subtyping, in which type checking can require manually guided theorem proving.) However, we have found that the errors in a TLA+ specification that could have been found by type checking are generally caught quite quickly by running TLC with very small models.

The problems with types are discussed at length in [19]. We want to emphasize that we do not dispute the usefulness of types in programming languages. We prefer to program in strongly typed languages. We are questioning the use of types only in a high-level specification language.
5.2 Information Hiding
We have learned that programmers should hide irrelevant implementation details. However, a high-level specification should not contain implementation details. Such details will appear in a well-written specification only if an inexpressive language requires high-level concepts to be described by low-level implementations. TLA+ provides users with powerful mathematical objects like sets and functions; they don’t have to encode them in arrays of bits and bytes. Such “bookkeeping details” do not occur in specifications written in a truly high-level language like TLA+, so there is no need to hide them.
5.3 Object-Oriented Languages
The mathematical concept underlying object-oriented programming languages can be described as follows. A program maintains identifiers of (references to) objects. There is a function Obj that maps the set ObjectId of object identifiers to a set Object of objects. An object-oriented language simply hides the function Obj, allowing the programmer to write o.field instead of Obj[o].field, where o is an object identifier.

Eliminating explicit mention of Obj can make a specification look a little simpler. But it can also make it hard to express some things. For example, suppose we want to assert that a property P(obj) holds for every object obj. (Usually, P(obj) will assert that, if obj is a non-null object of a certain type, then it satisfies some property.) This is naturally expressed by the formula

  ∀ o ∈ ObjectId : P(Obj[o])

It can be difficult or impossible to express in a language that hides Obj.
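A hedged sketch of how this might be written explicitly in ASCII TLA+ (ObjectId, Object, and P are taken from the discussion above; the variable declaration and operator names are invented):

  VARIABLE Obj                       \* maps object identifiers to objects
  HeapTypeOK == Obj \in [ObjectId -> Object]
  AllObjectsOK == \A o \in ObjectId : P(Obj[o])

Because the function Obj is an ordinary mathematical value, quantifying over all objects is a one-line formula.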
Object-orientation introduces complexity. It raises the problem of aliasing. It leads to the confusing difference between equality of object identifiers and equality of objects—the difference between o1 = o2 and o1.equals(o2). You can’t find out what o1.equals(o2) means by looking it up in a math book. Object-oriented programming languages were developed for writing large programs. They are not helpful for two-thousand-line programs. Object orientation is not helpful for two-thousand-line specifications.
5.4 Component-Based/Compositional Specifications
A high-level specification describes how the entire system works. In a TLA+ specification, a component is represented by a subexpression of the next-state relation—usually by a disjunct. We can’t understand a formula by studying its subexpressions in isolation. And we can’t understand a system by studying its components in isolation. We have known for 20 years that the way to reason about a distributed system is in terms of a global invariant, not by analyzing each component separately [13]. Many tools have been developed for debugging the low-level designs of individual hardware components. Engineers need a high-level specification to catch bugs that can’t be found by looking at individual components.
5.5 Hierarchical Description/Decomposition
Hierarchical description or decomposition means specifying a system in terms of its pieces, specifying each of those pieces in terms of lower-level pieces, and so on. Mathematics provides a very simple, powerful mechanism for doing this: the definition. For example, one might define A by

  A ≜ B ∨ C ∨ D

and then define B, C, and D. (TLA+ requires that the definitions appear in the opposite order, but one can present them in a top-down order by splitting the specification into modules.) Building up a specification by a hierarchy of definitions seems simple enough. But a specification language can make it difficult in at least two ways:

– It can restrict the kinds of pieces into which a definition can be broken. For example, it might require the pieces to be separate processes. There is no reason to expect that splitting the system into separate processes will be the best way to describe it.
– It can use a strict type system. For example, suppose x is a variable of type real number, and we want to define an expression A by

  A ≜ if x ≠ 0 then B else C

where B is defined by

  B ≜ 1/x
This is a perfectly reasonable definition, but PVS’s type system forbids it. PVS allows the expression 1/x only in a context in which x is different from 0. This particular example is contrived, but TLA+ specifications often contain local definitions in let/in expressions that are type-correct only in the context in which they are used, not in the context in which they are defined.

How Intel engineers use and don’t use hierarchical decomposition is somewhat surprising. As we observed above, the major part of a TLA+ specification is the definition of the next-state action. Intel engineers use the common approach of decomposing this definition as a disjunction such as

  Next ≜ ∃ p ∈ Proc : A1(p) ∨ . . . ∨ An(p)

where each Ai(p) describes a particular operation performed by process p. They also use local definitions to make a definition easier to read. For example, an action Ai(p) might be defined to equal

  let newV ≜ . . .
      newW ≜ . . .
  in  . . .
      ∧ v′ = newV
      ∧ w′ = newW
      . . .

This allows a reader to scan the in clause to see what variables the action changes, and then read the complex definitions of newV and newW to see what the new values of v and w are.

What the Intel engineers do not do is use hierarchical decomposition to hide complexity. For example, they would not eliminate common subexpressions by writing

  SubAction(p) ≜ . . .
  A1(p) ≜ . . . ∧ SubAction(p) ∧ . . .
  A2(p) ≜ . . . ∧ SubAction(p) ∧ . . .

if the two instances of SubAction(p) represent physically distinct components. The Intel engineers rely on the TLA+ specification to gauge the complexity of their designs, using the number of lines in the specification as a measure of a design’s complexity. This is possible because TLA+ does not introduce the extraneous details needed by lower-level languages to encode higher-level concepts.
5.6 Hierarchical Verification
Hierarchical verification works as follows: To show that a system described by the specification Sys implements the correctness properties Spec, we write an intermediate-level spec Mid and show that Sys implements Mid and Mid implements Spec. In TLA+, implementation is implication. To show that a specification ∃∃ x : F implements a specification ∃∃ y : G, we must show
  (∃∃ x : F) ⇒ (∃∃ y : G)

Here, x and y are tuples of variables, which we assume for simplicity to be distinct. By simple logic, we show this by showing

  F ⇒ (G with y ← exp)

for some tuple of expressions exp, where with denotes substitution (which is expressed in TLA+ by module instantiation). The tuple exp is called a refinement mapping [1]. To show that Sys implies Spec, we must show

  Sys ⇒ (ISpec with h ← exp)

where ISpec is the inner specification, with internal variables h visible. To use hierarchical verification, we find an intermediate-level inner specification IMid, with internal variables m visible, and show:

  Sys ⇒ (IMid with m ← exp1)
  IMid ⇒ (ISpec with h ← exp2)

When the verification is done by TLC, there is no reason for such a decomposition; TLC can verify directly that Sys implements ISpec under the refinement mapping. When the verification is done by mathematical proof, this decomposition seems reasonable. However, as LL has argued elsewhere [16], it is just one way to decompose the proof; it is not necessarily the best way.

Unfortunately, the whole problem of verifying that a high-level design meets its specification is not yet one that is being addressed in the hardware community. Thus far, engineers are checking only that their TLA+ design specifications satisfy an incomplete set of invariants. Checking that they satisfy a complete specification is the next step. Engineers want to do it, but they have to learn how—which means learning how to find a correct refinement mapping. TLC has the requisite functionality to do the checking, but only doing it for real systems will tell if it works in practice. The reader can get a feeling for the nature of the task by trying to solve the Wildfire Challenge Problem [20].
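A hedged sketch of how such a refinement check might be set up for TLC (all module and identifier names are invented; the low-level module is assumed to be called Sys and to define its specification as Spec, and SomeStateFunction is a placeholder state function of the low-level variables):

  ---------------------- MODULE SysRefinement ----------------------
  EXTENDS Sys
  Abs == INSTANCE ISpec WITH h <- SomeStateFunction   \* the refinement mapping
  HighSpec == Abs!Spec
  ===================================================================

  \* In the TLC configuration file:
  SPECIFICATION Spec
  PROPERTY HighSpec

TLC then checks that every behavior of the low-level specification satisfies the high-level one under the refinement mapping.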
6 Conclusions
Our industrial experience with TLA+ has led us to some simple, common-sense conclusions:

– Buzzwords like hierarchical and object-oriented are to be viewed with suspicion.
– A language for writing high-level specifications should be simple and have effective debugging tools.
– Only proofs and model checking can catch concurrency bugs in systems. For the vast majority of applications, proofs are not a practical option; engineers have neither the training nor the time to write them.
– A specification method cannot be deemed a success until engineers are using it by themselves.

TLA+ and TLC are practical tools for catching errors in concurrent systems. They can be used very early in the design phase to catch bugs when it is relatively easy and cheap to fix them. Writing a formal specification of a design also catches conceptual errors and omissions that might otherwise not become evident until the implementation phase. TLA+ is not just for industrial use. Anyone who writes concurrent or distributed algorithms can use it. We invite the reader to give it a try.
References

1. Martín Abadi and Leslie Lamport. The existence of refinement mappings. Theoretical Computer Science, 82(2):253–284, May 1991.
2. E. A. Ashcroft and Z. Manna. Formalization of properties of parallel programs. In Machine Intelligence, volume 6. Edinburgh University Press, 1970.
3. K. Mani Chandy and Jayadev Misra. Parallel Program Design. Addison-Wesley, Reading, Massachusetts, 1988.
4. R. L. Constable, S. F. Allen, H. M. Bromley, W. R. Cleaveland, J. F. Cremer, R. W. Harper, D. J. Howe, T. B. Knoblock, N. P. Mendler, P. Panagaden, J. T. Sasaki, and S. F. Smith. Implementing Mathematics with the Nuprl Proof Development System. Prentice-Hall, 1986.
5. R. W. Floyd. Assigning meanings to programs. In Proceedings of the Symposium on Applied Math., Vol. 19, pages 19–32. American Mathematical Society, 1967.
6. Eli Gafni and Leslie Lamport. Disk Paxos. To appear in Distributed Computing, 2002.
7. Kourosh Gharachorloo, Madhu Sharma, Simon Steely, and Stephen Van Doren. Architecture and design of AlphaServer GS320. In Anoop Gupta, editor, Proceedings of the Ninth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS IX), pages 13–24, November 2000.
8. M. J. C. Gordon and T. F. Melham. Introduction to HOL: A Theorem Proving Environment for Higher Order Logic. Cambridge University Press, 1993.
9. C. A. R. Hoare. An axiomatic basis for computer programming. Communications of the ACM, 12(10):576–583, October 1969.
10. Gerard Holzmann. The model checker Spin. IEEE Transactions on Software Engineering, 23(5):279–295, May 1997.
11. Simon S. Lam and A. Udaya Shankar. Protocol verification via projections. IEEE Transactions on Software Engineering, SE-10(4):325–342, July 1984.
12. Leslie Lamport. Proving the correctness of multiprocess programs. IEEE Transactions on Software Engineering, SE-3(2):125–143, March 1977.
13. Leslie Lamport. An assertional correctness proof of a distributed algorithm. Science of Computer Programming, 2(3):175–206, December 1982.
14. Leslie Lamport. How to write a long formula. Formal Aspects of Computing, 6:580–584, 1994. First appeared as Research Report 119, Digital Equipment Corporation, Systems Research Center.
15. Leslie Lamport. The temporal logic of actions. ACM Transactions on Programming Languages and Systems, 16(3):872–923, May 1994.
16. Leslie Lamport. Composition: A way to make proofs harder. In Willem-Paul de Roever, Hans Langmaack, and Amir Pnueli, editors, Compositionality: The Significant Difference (Proceedings of the COMPOS’97 Symposium), volume 1536 of Lecture Notes in Computer Science, pages 402–423. Springer-Verlag, 1998.
17. Leslie Lamport. Specifying Systems. Addison-Wesley, Boston, 2002. A link to an electronic copy can be found at http://lamport.org.
18. Leslie Lamport, John Matthews, Mark Tuttle, and Yuan Yu. Specifying and verifying systems with TLA+. In Proceedings of the Tenth ACM SIGOPS European Workshop, pages 45–48, Saint-Emilion, France, September 2002. INRIA (Institut National de Recherche en Informatique et en Automatique).
19. Leslie Lamport and Lawrence C. Paulson. Should your specification language be typed? ACM Transactions on Programming Languages and Systems, 21(3):502–526, May 1999.
20. Leslie Lamport, Madhu Sharma, Mark Tuttle, and Yuan Yu. The Wildfire verification challenge problem. At URL http://research.microsoft.com/users/lamport/tla/wildfire-challenge.html on the World Wide Web. It can also be found by searching the Web for the 24-letter string wildfirechallengeproblem.
21. Susan Owicki and David Gries. Verifying properties of parallel programs: An axiomatic approach. Communications of the ACM, 19(5):279–284, May 1976.
22. Susan Owicki and Leslie Lamport. Proving liveness properties of concurrent programs. ACM Transactions on Programming Languages and Systems, 4(3):455–495, July 1982.
23. Sam Owre, John Rushby, Natarajan Shankar, and Friedrich von Henke. Formal verification for fault-tolerant architectures: Prolegomena to the design of PVS. IEEE Transactions on Software Engineering, 21(2):107–125, February 1995.
24. Amir Pnueli. The temporal logic of programs. In Proceedings of the 18th Annual Symposium on the Foundations of Computer Science, pages 46–57. IEEE, November 1977.
25. Serdar Tasiran, Yuan Yu, Brannon Batson, and Scott Kreider. Using formal specifications to monitor and guide simulation: Verifying the cache coherence engine of the Alpha 21364 microprocessor. In Proceedings of the 3rd IEEE Workshop on Microprocessor Test and Verification, Common Challenges and Solutions. IEEE Computer Society, 2002.
26. Yuan Yu, Panagiotis Manolios, and Leslie Lamport. Model checking TLA+ specifications. In Laurence Pierre and Thomas Kropf, editors, Correct Hardware Design and Verification Methods, volume 1703 of Lecture Notes in Computer Science, pages 54–66, Berlin, Heidelberg, New York, September 1999. Springer-Verlag. 10th IFIP WG 10.5 Advanced Research Working Conference, CHARME ’99.
How the Design of JML Accommodates Both Runtime Assertion Checking and Formal Verification

Gary T. Leavens¹, Yoonsik Cheon¹, Curtis Clifton¹, Clyde Ruby¹, and David R. Cok²

¹ Department of Computer Science, Iowa State University, 226 Atanasoff Hall, Ames, Iowa 50011-1041 USA; {leavens,cheon,cclifton,ruby}@cs.iastate.edu; phone: +1 515 294 1580, fax: +1 515 294 1580
² Eastman Kodak Company, Research & Development Laboratories, 1700 Dewey Avenue, Building 65, Rochester, New York 14650-1816 USA; [email protected]; phone: +1 585 588 3107, fax: +1 585 588 3269
Abstract. Specifications that are used in detailed design and in the documentation of existing code are primarily written and read by programmers. However, most formal specification languages either make heavy use of symbolic mathematical operators, which discourages use by programmers, or limit assertions to expressions of the underlying programming language, which makes it difficult to write complete specifications. Moreover, using assertions that are expressions in the underlying programming language can cause problems both in runtime assertion checking and in formal verification, because such expressions can potentially contain side effects. The Java Modeling Language, JML, avoids these problems. It uses a side-effect free subset of Java’s expressions to which are added a few mathematical operators (such as the quantifiers \forall and \exists). JML also hides mathematical abstractions, such as sets and sequences, within a library of Java classes. The goal is to allow JML to serve as a common notation for both formal verification and runtime assertion checking; this gives users the benefit of several tools without the cost of changing notations.
1 Introduction
The Java Modeling Language, JML [55, 54], is the result of a cooperative, international effort aimed at providing a common notation and semantics for the specification of Java code at the detailed-design level [58]. JML is being designed cooperatively so that many different tools can use a common notation for Hoare-style behavioral interface specifications. In this paper we explain the features of JML's design that make its assertions easily understandable by programmers and suitable for both runtime assertion checking and formal verification.
1.1 Background
By a Hoare-style specification we mean one that uses pre- and postconditions to specify the behavior of methods [34, 43, 44]. A behavioral interface specification language (BISL) is a specification language that specifies both the syntactic interface of a module and its behavior [33, 48, 52, 85]. JML, the interface specification languages in the Larch family [33, 48, 52, 85], and RESOLVE/C++ [22, 73] are BISLs. Most design by contract languages and tools, such as Eiffel [70, 71] and APP [77], are also BISLs, because they place specifications inside programming language code. By contrast, neither Z [80, 79, 87] nor VDM [6, 27, 74, 43] is a BISL; they have no way to specify interface details for a particular programming language. OCL [82, 83] is a BISL for the UML, but the UML itself is language-independent; this poses problems for a Java programmer, because the UML does not have standard notations for all details of Java method signatures. For example, the UML's syntax for specifying the signatures of operations has no standard notation for declaring that a Java method is strictfp or for declaring the exceptions that a method may throw [7, pp. 128-129] [49, p. 516]. (Larman notes that the UML has some nonstandard ways to specify the exceptions that a method may throw, by either using Java's own syntax directly or by using a "property string".) Also the OCL has no standard constraints that correspond to JML's exceptional postconditions. Because BISLs like JML specify both interface and behavior, they are good at specifying detailed designs that include such Java details. This makes JML well suited to the task of documenting reusable components, libraries, and frameworks written in Java.
1.2 Tool Support
Because BISLs are easily integrated with code, they lend themselves to tool support for activities related to detailed design, coding, testing, and maintenance. An important goal of JML is to enable a wide spectrum of such tools. Besides tools that enforce JML's semantics (e.g., type checking), the most important JML tools help with the following tasks.

Runtime checking and testing. The Iowa State group provides (from www.jmlspecs.org):
– the jmlc runtime assertion checking compiler [13], which generates class files from JML-annotated Java sources (besides this runtime assertion checking work at Iowa State, which relies on adding instrumentation to compiled code, Steven Edwards's group at Virginia Tech is working on a wrapper-class based approach to runtime assertion checking that will allow instrumentation of programs for which source code is not available), and
– the jmlunit tool [14], which uses the runtime assertion checker to generate test oracle code for JUnit tests.

Documentation. David Cok provides the jmldoc tool, also available through www.jmlspecs.org, which generates HTML documentation similar to that
produced by javadoc [29], but containing specifications as well. The generated documentation is useful for browsing specifications or posting to the web.

Static analysis and verification. The following tools are prepared by our partners at Compaq and the University of Nijmegen:
– The ESC/Java tool [28, 65, 66] statically checks Java code for likely errors. ESC/Java understands a subset of JML annotations.
– The LOOP tool [37, 38, 40, 42] assists in the formal verification of the correctness of implementations from JML specifications, using the theorem prover PVS.

In addition, the Daikon dynamic invariant detector [23, 72] outputs invariants for Java programs in a subset of JML, and the Korat automated testing tool [8] uses the jmlunit tool to exercise the test data it derives.

In this paper, we discuss how JML meets the needs of tools for runtime assertion checking, documentation, static analysis, and verification. We focus on runtime assertion checking and formal verification, which we consider to be the extremes of the spectrum of tools that a BISL might support. The tasks of runtime assertion checking and formal verification have widely differing needs:
– Runtime assertion checking places a high premium on executability. Many specification languages intended for runtime assertion checking, such as Eiffel [70, 71] and APP [77], only allow assertions that are completely executable. This is sensible for a language that is intended only to support runtime assertion checking and not formal verification.
– On the other hand, formal theorem proving and reasoning place a high premium on the use of standard mathematical notations. Thus, most specification languages intended for formal reasoning or verification, such as VDM, the members of the Larch family, and especially Z, feature a variety of symbolic mathematical notations. Many expressive mathematical notations, such as quantifiers, are impossible, in general, to execute at runtime. Again, including such notations is sensible for a language intended only to support formal theorem proving and reasoning and not runtime assertion checking.
1.3 Problems
We begin by describing the problems that arise when addressing the needs of the range of tools exemplified by runtime assertion checking and formal verification.

1.3.1 Notational Problem. It is often said that syntax does not matter; however, our experience with Larch/Smalltalk [11] and Larch/C++ [12, 50, 51, 53, 56] showed that programmers object to learning a specialized mathematical notation (the Larch Shared Language). This is similar to the problems found by Finney [26], who did a preliminary experiment demonstrating that the symbolic notation in Z specifications may make them hard to read. Conversely, in executable languages like Eiffel and APP, programmers feel comfortable with the use of the programming language's expressions in assertions. Such an assertion
language is therefore more appealing for purposes of documentation than highly symbolic mathematical notations. To summarize, the first problem that we address in this paper is how to provide a good syntax for specification expressions. Specification expressions are the syntactic forms that are used to denote values in assertions. By a good syntax we mean one that is close enough to programming language expressions that programmers feel comfortable with it and yet has all of the features necessary to support both runtime assertion checking and formal verification.

1.3.2 Undefinedness Problem. Expressions in a programming language may abruptly terminate (e.g., throw exceptions) and may go into infinite loops; consequently, they may have undefined values from a strictly mathematical point of view. Programming languages typically provide features to deal explicitly with such undefinedness. For example, Java provides short-circuit versions of boolean operators (such as && and ||) that allow programmers to suppress evaluation of some subexpressions. We want both programmers and mathematicians to use JML's notations; hence, JML's specification expressions should not only look like Java's expressions and use Java's semantics, but should also validate the standard laws of logic. However, because of a potential for undefinedness, Java expressions do not satisfy all the standard rules of logic; for example, in Java the conjunction E1 && E2 is not equal to E2 && E1, although in logic they would be equal. To resolve this conflict, we are willing to accept a slightly different semantics for assertion evaluation as long as programmers are not too surprised by it. Thus, the second problem we address in this paper is how to find a semantics for expressions used in assertions that validates standard laws of logic and yet does not surprise programmers and is still useful for runtime assertion checking.

1.3.3 Side Effects Problem. Another important semantic issue is that expressions in a programming language like Java (and most others, including Eiffel) can contain side effects. Side effects pose a very practical problem for runtime assertion checking. It is generally assumed that assertions may be evaluated or skipped with no change in the outcome of a computation, but an assertion with side effects has the potential to alter the computation's outcome. For example, an assertion with side effects might mask the presence of a bug that would otherwise be revealed or cause bugs that are not otherwise present. Because one of the principal uses of runtime assertion checking is debugging and isolating bugs, it is unacceptable for side effects from assertion checking to alter the outcome of a computation. Thus, the third problem that we address in this paper is how to prevent side effects in assertions while still retaining as much of the syntax of normal programming language expressions as possible.

1.3.4 Mathematical Library Problem. Most specification languages come with a library of mathematical concepts such as sets and sequences. Such concepts are especially helpful in specifying collection types. For example, to specify
a Stack type, one would use a mathematical sequence to describe, abstractly, the states that a stack object may take [35]. VDM, OCL, Z, and the interface specification languages of the Larch family all have libraries of such mathematical concepts. They are also standard in theorem provers such as PVS. However, as discussed in Section 1.3.1, we want to limit the barriers that Java programmers must overcome to use JML. Thus, the fourth problem that we address in this paper is how to provide a library of mathematical concepts in a way that does not overwhelm programmers, and yet is useful for formal verification.
1.4 Other Goals of JML
In addition to providing solutions to the preceding four problems, the design of JML is guided and constrained by several other goals. One of the most important of these goals is to allow users to write specifications that document detailed designs of existing code. This motivates the choice of making JML a BISL, as described above. Moreover, we would like JML to be useful for documenting code regardless of whether it was designed according to any particular design method or discipline. This is important because the cost of specification is high enough that it is not always justified until one knows that the design and the code have stabilized enough to make the documentation potentially useful to other people.

In general, JML's design adheres to the goal of being able to document existing designs; however, there is one significant aspect of JML's design that departs from this goal: JML imposes the specifications of supertypes on subtypes, a property termed specification inheritance, in order to achieve behavioral subtyping [19].

JML's use of specification inheritance is justified by another of our goals: we want JML to support modular reasoning, that is, reasoning about the behavior of a compilation unit using just the specifications of the compilation units that it references (as opposed to the details of their implementations). Modular reasoning is important because without it, the difficulty of understanding an object-oriented program increases much more rapidly than the size of the program, and thus the benefits of the abstraction mechanisms in object-oriented languages are lost. Consequently, modular reasoning is also important for formal verification, because then the scope of the verification problem is limited.

Specification inheritance, and the resulting behavioral subtyping, allows modular reasoning to be sound, by allowing one to reason based on the static types of references. Subsumption in Java allows a reference to a subtype object to be substituted for a supertype reference. The requirements of behavioral subtyping [2, 3, 19, 57, 59, 63, 69] guarantee that all such substituted objects will obey the specifications inherited from the static type of the reference [19, 60, 61]. Because of the benefits of modular reasoning to programmers and verifiers, we favor specification inheritance over the conflicting goal of being able to document existing designs that do not follow behavioral subtyping. In any case, it is possible to work around the requirements of behavioral subtyping for cases in which a subtype does not obey the inherited specifications of its supertype(s). One simply underspecifies each supertype enough to allow all of the subtypes that
are desired [63, 69]. Note that this work-around does not involve changing the code or the design, but only the specification, so it does not interfere with the goal of documenting existing code.
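To make specification inheritance concrete, here is a minimal hedged sketch; the Account and BonusAccount classes and their members are our own illustration, not taken from the paper:

    public class Account {
      protected int balance;

      /*@ public normal_behavior
        @   requires amount > 0;
        @   assignable balance;
        @   ensures balance == \old(balance) + amount;
        @*/
      public void deposit(int amount) { balance += amount; }
    }

    public class BonusAccount extends Account {
      // This override inherits the specification case above; behavioral
      // subtyping requires it to satisfy that case, so clients reasoning
      // from the static type Account remain sound.
      public void deposit(int amount) { balance += amount; }
    }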
1.5 Outline
The remainder of this paper is organized as follows. The next section discusses our solution to the notational problem described above. Having described the notation in general terms, Section 3 provides more background on JML. The subsequent three sections treat the remaining problems discussed above. The paper ends with a discussion of related work and some conclusions.
2 Solving the Notational Problem
To solve the notational problem described in Section 1.3.1, JML generally follows Eiffel, basing the syntax of specification expressions on Java's expression syntax. However, because side effects are not desired in specification expressions, JML's specification expressions do not include Java expressions that can cause obvious side effects, i.e., assignment expressions and Java's increment and decrement operators (++ and --). Furthermore, to make JML suitable for formal verification efforts, JML includes a number of operators that are not present in Java [55, Section 3]. The syntax of these operators comes in two flavors: those that are symbolic and those that are textual.

We did not want to introduce excess notation that would cause difficulties for programmers when reading specifications, so JML adds just five symbolic operators. Four of these are logical operators: forward and reverse implication, written ==> and <==, respectively, and logical equivalence and inequivalence, written <==> and <=!=>, respectively. The inclusion of symbols for logical operators is inspired by the calculational approach to formal methods [17, 20, 31]. The other symbolic operator is <:, which is used to compare types to see if they are in a subtype relationship [65].

All the other operators added to Java and available in JML's specification expressions use a textual notation consisting of a backslash (\) followed by an English word or phrase. For example, the logical quantifiers in JML are written as \forall and \exists [55]. Besides these quantifiers, JML also has several other operators using this backslash syntax. One of the most important is \old(), which is used in method postconditions to indicate an expression whose value is taken from the pre-state of a method call. For example, \old(i-1) denotes the value of i-1 evaluated in the pre-state of a method call. This notation is borrowed from the old operator in Eiffel. Other JML expressions using the backslash syntax include \fresh(o), which says that o was not allocated in the pre-state of a method call, but is allocated (and not null) in the post-state, and \result, which denotes the normal result returned by a method.
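To give a feel for these operators together, the following is a small illustrative sketch; the indexOf method and its contract are hypothetical, written for this discussion rather than drawn from the paper:

    /*@ public normal_behavior
      @   requires a != null;
      @   // the result is non-negative iff the key occurs in the array
      @   ensures \result >= 0 <==>
      @             (\exists int i; 0 <= i && i < a.length; a[i] == key);
      @   // if found, the returned index holds the key
      @   ensures \result >= 0 ==> a[\result] == key;
      @   // no earlier index holds the key (vacuously true if \result is -1)
      @   ensures (\forall int i; 0 <= i && i < \result; a[i] != key);
      @*/
    public /*@ pure @*/ int indexOf(int[] a, int key);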
268
G.T. Leavens et al.
The backslashes in the syntax of these operators serve a very important purpose—they prevent the rest of the operator’s name from being interpreted as a Java identifier. This allows JML to avoid reserving Java identifiers in specification expressions. For example, result can be used as a program variable and is distinguished from \result. This trick is useful in allowing JML to specify arbitrary Java programs. Indeed, because a goal of JML is to document existing code, it cannot add new reserved words to Java.
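For instance, in the following hedged sketch (the method is our own illustration, not from the paper), result is an ordinary parameter name while \result denotes the value the method returns:

    //@ ensures \result == result * 2;
    public int doubled(int result) {
      return result * 2;  // `result` here is just the parameter
    }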
3 Background on JML
In this section we provide additional background on JML that will be useful in understanding our solutions to the remaining problems.
3.1 Semantics of Specification Expressions
Just as JML adopts much of Java's expression syntax, it attempts to keep JML's semantics similar to Java's. In particular, the semantics of specification expressions is a reference semantics. That is, when the name of a variable or field is used in an expression, it denotes either a primitive value (such as an integer) or a reference to an object. References themselves are values in the semantics, which allows one to directly express aliasing or the lack of it. For example, the expression arg != fieldVal says that arg and fieldVal are not aliased.

Java also allows one to compare the states of objects using the equals method. For example, in the postcondition of a clone method, one might write the following to say that the result returned by clone is a newly allocated object that has the same state as the receiver (this): \fresh(\result) && this.equals(\result); Note that the exact meaning of the equals method for a given type is left to the designer of that type, as in Java. Thus, if one only knows that o is an Object, it is hard to conclude much about x from o.equals(x). Because JML uses this reference semantics, specifiers must show the same care as Java programmers when choosing between the == and equals equality tests.

And like Eiffel, but unlike Larch-style interface specification languages, JML does not need "state functions" to be applied to extract the value of an expression from a reference. Values are implicitly extracted as needed by methods and operators. Besides being easier for programmers, this lends some succinctness to the notation.

Currently, JML adopts all of the Java semantics for integer arithmetic. Thus types such as int use two's complement arithmetic and are finite. Although Java programmers are, in theory, aware of the nature of integer arithmetic, it seems that JML's adoption of Java's semantics causes some misunderstandings; for example, some published JML specifications are inconsistent because of this semantics [10]. Chalin has suggested adding new primitive value types for infinite precision arithmetic to JML; in particular, he suggests a type \bigint for infinite precision integers [9, 10]. He is currently implementing and experimenting with this idea.
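Putting the clone postcondition above into a full method specification gives the following hedged sketch; only the \fresh/equals postcondition itself comes from the text, while the surrounding specification case is our own illustration:

    /*@ public normal_behavior
      @   assignable \nothing;
      @   // a newly allocated object whose state equals the receiver's
      @   ensures \fresh(\result) && this.equals(\result);
      @*/
    public Object clone();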
3.2 Method and Type Specifications
To explain JML's semantics for method specifications, we use the example in Figure 1. JML uses special comments, called annotations, to hold the specification of behavior; these are added to the interface information contained in the Java code. A specifier writes these annotation comments by inserting an at-sign (@) following the usual characters that signify the start of a comment. In multi-line annotation comments, at-signs at the beginnings of lines are ignored.

Figure 1 starts with a "model import" directive, which says that JML will consider all types in the named package, org.jmlspecs.models, to be imported for purposes of the specification. This allows the JML tools to find the type JMLObjectSequence (see the third line) in that package. The type JMLObjectSequence is used as the type of the model instance field, named absVal. In this declaration, the model keyword says that the field is not part of the Java code, but is used solely for purposes of specification. The instance keyword says that the field is imagined, for purposes of specification, to be a non-static field in every class that implements this interface. (Omitting instance makes fields static and final, which is Java's default for fields declared in interfaces.) Following the declaration of the two model instance fields is an invariant. It says that the field absVal is never null.

Following the invariant are the declarations and specifications of three methods. In JML, a method's specifications are typically written, as they are in Figure 1, before the header of the method they specify. This makes the scope of the formal parameters of a method a bit strange, because it extends backward into the method's specification. However, it works best with Java tools, which expect comments related to a method, such as javadoc comments, to precede the method's header.

Consider the specification of the first method, push. This shows the general form of a "normal behavior" specification case. A specification case includes a precondition, indicated by the keyword requires, and some other specification clauses. A specification case is satisfied if, whenever the precondition is satisfied, the other clauses are also satisfied. Additionally, in a normal behavior specification case, the method must not throw an exception when the precondition is satisfied. The specification case given for push includes, besides the requires clause, a frame axiom, introduced by the keyword assignable, and a normal postcondition, following the keyword ensures.

As with specification languages in the Larch family, a precondition that is just true can be omitted. In the Larch family, an omitted frame axiom means "assignable \nothing;", which is a very strong specification that says that the method has no side effects. Following a suggestion of Erik Poll, we decided that such a specification was too strong for a default. So in JML, an omitted frame axiom allows assignment to all locations. This agrees with most of the defaults for omitted clauses in JML, which impose no restrictions.
//@ model import org.jmlspecs.models.*;

public interface Stack {
  //@ public model instance JMLObjectSequence absVal;

  //@ public instance invariant absVal != null;

  /*@ public normal_behavior
    @   requires true;
    @   assignable absVal;
    @   ensures absVal.equals(\old(absVal.insertFront(x)));
    @*/
  void push(Object x);

  /*@ public normal_behavior
    @   requires !absVal.isEmpty();
    @   assignable absVal;
    @   ensures absVal.equals(\old(absVal.trailer()))
    @        && \result == \old(absVal.first());
    @ also
    @ public exceptional_behavior
    @   requires absVal.isEmpty();
    @   assignable \nothing;
    @   signals (Exception e) e instanceof IllegalStateException;
    @*/
  Object pop();

  //@ ensures \result <==> absVal.isEmpty();
  /*@ pure @*/ boolean isEmpty();
}

Fig. 1. The specification and code for the interface Stack.
JML also allows specifiers to write "exceptional behavior" specification cases, which say that, when the precondition is satisfied, the method must not return normally but must instead throw an exception. An example appears in the specification of the pop method. This specification has two specification cases connected with also. The meaning of the also is that the method must satisfy both of these specification cases [84, 86]. Thus, when the value of the model instance field absVal is not empty, a call to pop must return normally and must satisfy the given ensures clause. But when the value of the model instance field absVal is empty, a call to pop must throw an IllegalStateException. This kind of case analysis can be desugared into a single specification case, which can be given a semantics in the usual way [38, 41, 53, 76].

The specification cases given for push and pop are heavyweight specification cases [55, Section 1]. Such specification cases are useful when one wants to give a relatively complete specification, especially for purposes of formal verification. For runtime assertion checking or documentation, one may want to specify only part of the behavior of a method. This can be done using JML's lightweight specification cases, which are indicated by the absence of a behavior keyword (like normal_behavior). Figure 1 gives an example of a lightweight specification case in the specification of the method isEmpty.
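As another hedged sketch of a lightweight specification case (the method below is our own illustration, not from the paper), note that no behavior keyword appears and only the clauses of interest are given:

    //@ requires y != 0;
    //@ ensures \result == x / y;
    public int div(int x, int y) {
      return x / y;
    }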
4 Dealing with Undefinedness
As discussed in Section 1.3.2, a fundamental problem in using the underlying language for specification expressions is dealing with expressions whose value is undefined. In Java, undefinedness in expressions is typically signaled by the expression throwing an exception. For example, when one divides an integer by 0, the expression throws an ArithmeticException. Exceptions may also be thrown by methods called from within specification expressions.

Specification languages have adopted several different approaches to dealing with undefinedness in expressions [4, 32]. We wanted a semantics that would not be surprising to either Java programmers or to those doing formal verification. Typically, a Java programmer would try to write the specification in a way that "protects" the meaning of the expression against any source of undefinedness [62]. This can be accomplished by using the short-circuit boolean operators; for example, a specifier might write denom > 0 && num/denom > 1 to be sure that the division would be defined whenever it was carried out. However, we would like specifications to be meaningful even if they are not protective. Hence, the semantics of JML does not rely on the programmer writing protective specifications but, instead, ensures that every expression has some value. To do this, we adopted the "underspecified total functions" approach favored in the calculational style of formal methods [31, 32]. That is, an expression that would not have a value in Java is given an arbitrary, but unspecified, value. For example, num/0 has some integer value, although this approach does not say what the value is, only that it must be uniformly substituted in any surrounding expression. In JML all expressions have an implicit argument of the program's state; thus, the uniform substitution of values need only be carried out within a given assertion.

An advantage of this substitution approach is that it validates the rules for standard logic. For example, in JML, E1 && E2 is equivalent to E2 && E1. Consider what happens if E1 throws an exception; in that case, one may choose some unspecified boolean value for E1, say b. This means that E1 && E2 equals b && E2, which is equal to E2 && b, as can be seen by a simple case analysis on E2's value. The case where E2 throws an exception is similar. Furthermore, if programmers write protective specifications, they will never be surprised by the details of this semantics.

The JML assertion checking compiler takes advantage of the semantics of undefinedness to attempt, as much as possible, to detect possible assertion violations [13]. That is, assertion checking attempts to use a value that will make the overall assertion false, whenever the undefinedness of some subexpression allows it to do so. In this way, the assertion checker can both follow the rules of standard logic and detect places where specifications are not sufficiently protective. This is a good example of how JML caters to the needs of both runtime assertion checking and formal verification.
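To illustrate, here is a hedged sketch contrasting a protective precondition with a non-protective one; the scale methods are our own illustration, and only the denom > 0 && num/denom > 1 conjunct comes from the text:

    // Protective: short-circuit && guards the division, so num/denom is
    // evaluated only when denom > 0, and the expression is always defined.
    //@ requires denom > 0 && num/denom > 1;
    public int scale(int num, int denom);

    // Non-protective: if denom == 0, then num/denom denotes an arbitrary
    // but fixed int value under JML's "underspecified total functions"
    // semantics; the runtime checker may pick a value that falsifies the
    // precondition, flagging the insufficiently protective specification.
    //@ requires num/denom > 1;
    public int scale2(int num, int denom);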
5 Preventing Side Effects in Assertions
As discussed in Section 1.3.3, it is important to prevent side effects in assertions, for both practical and theoretical reasons. JML is designed to prevent such side effects statically. It does this using an effect-checking type system [30, 81]. This type system is designed to be as simple as possible. Although it allows specification expressions to call Java methods and constructors, it only allows such calls if the called method or constructor is declared with the modifier pure. The semantics of JML must thus assure that pure methods and constructors are side-effect free.
JML’s Purity Restrictions
JML’s semantic restrictions on pure methods and constructors are as follows: – A pure method implicitly has a specification that includes the following specification case [55, Section 2.3.1]: assignable \nothing; This ensures that a correct implementation of the method has no side effects. – “A pure constructor implicitly has a specification that only allows it to assign to the instance fields of the class in which it appears” (including inherited instance fields) [55, Section 2.3.1]. This ensures that, if the constructor is correctly implemented, then a new expression that calls it has no side effects. – Pure methods and pure constructors may only invoke other methods and constructors that are pure. This makes the type system modular, as it allows the purity of a method or a constructor to be checked based only on its code and the specifications of the other methods and constructors that it calls. – All methods and constructors that override a pure method or constructor must also be pure. This inheritance of purity is a consequence of specification inheritance and is necessary to make the type system modular in the presence of subtyping. The first restriction implies that a pure method may not perform any input or output, nor may it assign to any non-local variables. Similarly, by the second restriction, a pure constructor may not do any I/O and may not assign to non-local storage other than the instance fields of the object the constructor is initializing. Note that, in JML, saying that a method may not assign to non-local storage means precisely that—even benevolent side effects are prohibited [55, Section 2.1.3.1]. This seems necessary for sound modular reasoning [64]. It is also a useful restriction for reasoning about supertypes from their specifications [78] and for reasoning about concurrent programs. The last two restrictions are also motivated by modularity considerations. Inheritance of purity has as a consequence that a method cannot be pure if any overriding method has side effects. In particular, a method in Object can be specified as pure only if every override of that method, in any Java class, obeys JML’s purity restrictions.
The type system of JML is an important advance over languages like Eiffel, which trust programmers to avoid side effects in assertions rather than statically checking this property. However, as we will see in the following subsection, JML's purity restrictions give rise to some practical problems.
5.2 Practical Problems with JML's Purity Restrictions
An initial practical problem is how to decide which methods in Java's libraries should be specified as pure. One way to start to answer this question is to use a static analysis to conservatively estimate which methods in Java's libraries have side effects. A conservative analysis could count a method as having side effects if it assigns to non-local storage or calls native methods (which may do I/O), either directly or indirectly. All other methods can safely be specified as pure, provided they are not overridden by methods that the analysis says have side effects. Researchers from Purdue have provided a list of such methods to us, using their tools from the Open Virtual Machine project (see http://www.ovmj.org/). We hope to integrate this technology into the JML tools eventually.

Declaring a method to be pure entails a very strong specification, namely that the method and all possible overriding methods have no side effects. Thus, finding that a method, and all known methods that override it, obey JML's purity restrictions is not the same as deciding that the method should be specified as pure. Such a decision affects not just all existing overrides of the method, but all future implementations and overrides. How is one to make such a decision?

This problem is particularly vexing because there are many methods that seem intuitively to be side-effect free, but that do not obey JML's purity restrictions. Methods with benevolent side effects are common examples. A benevolent side effect is a change in the internal state of an object in a way that is not externally visible. Two examples from the protocol of Object will illustrate the importance of this problem.

First, consider computing a hash code for an instance of a class. Because this may be computationally costly, an implementation may desire to compute the hash code the first time it is asked for and then cache the result in a private field of the object. When the hash code is requested on subsequent occasions, the cached result is returned without further computation. For example, this is done in the hashCode method of Java's String class. However, in JML, storing the computed hash code into the cache is considered to be a side effect. So String's hashCode method cannot be specified as pure. (A sketch of this caching pattern appears at the end of this subsection.)

Second, consider computing object equality. In some implementations, an object's fields might be lazily initialized or computed only on first access. If the equals method happens to be the first such method to be called on such an object, it will trigger the delayed computation. We found such an example in our work on the MultiJava compiler [15, 16]; in this compiler, the class CClassType has such delayed computations, and its override of Object's equals method can trigger a previously delayed computation with side effects. It seems very difficult to rewrite this method to be side-effect free, because to do so one would probably
need to change the compiler's architecture. (Similar kinds of lazy initialization of fields occur in implementations of the Singleton pattern, although these usually do not affect the equals method.)

We have shown two cases where methods in the protocol of Object are overridden by methods that cannot be pure. By purity and specification inheritance, these examples imply that neither hashCode nor equals can be specified as pure in Object. Object is typically used in Java as the type of the elements in a collection. Hence, in the specification of a collection type, such as a hash table, one cannot use the hashCode or equals methods on elements. Without changes, this would make JML unsuitable for specifying collection types. (This problem is mostly a problem for collection types, because one can specify many subclasses of Object with pure hashCode and equals methods. Specifications operating on instances of such subclasses can use these methods without violating JML's type system.)
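Here is the caching pattern from the first example above, as a hedged sketch; the Name class and its fields are our own illustration, not the actual code of java.lang.String:

    public class Name {
      private final String value;
      private int hash;  // 0 means "not yet computed"

      public Name(String value) { this.value = value; }

      public int hashCode() {
        if (hash == 0) {
          hash = value.hashCode();  // caches the result: a benevolent side
        }                           // effect, invisible to clients, yet still
        return hash;                // an assignment, so hashCode cannot be pure
      }
    }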
5.3 Solving the Problems
The desire to use intuitively side-effect free methods in specifications, even if they are not pure according to JML's semantics, is strong enough that we considered changing the semantics of the assignable clause in order to allow benevolent side effects. However, we do not know how to do that and still retain sound modular reasoning [64]. In any case, the use of such methods in runtime assertion checking would still be problematic because of the side effects they might cause. In addition, we would like to prevent problems when a programmer wrongly believes that side effects are benevolent; it is not clear whether an automatic static analysis could prevent such problems, and even if so, whether such a tool could be modular.

Thus far, the only viable solution we have identified is to refactor specifications by adding pure model (i.e., specification-only) methods that are to be used in specifications in place of program methods that cannot be pure. That is, whenever one has an intuitively side-effect free program method, m, that is not pure according to JML's semantics, one should create a pure model method m′, which returns the same result as m but without its side effects. Then one replaces calls to m by calls to m′ in assertions.

We are currently experimenting with this solution. The most important part of this experiment is to replace uses of Object's equals method, which cannot be pure, with calls to a new pure model method in Object, called isEqualTo. The specifications of these methods are shown in Figure 2. The assignable clause in the specification of the equals method permits benevolent side effects; it is also specified to return the same result as would a call to isEqualTo. Thus, whenever someone overrides equals, they should also override the isEqualTo method.

When an override of equals is specified as pure, then an override of isEqualTo in the same class can be specified in terms of this pure equals method, and the implementation of the model isEqualTo method can simply call equals as well. However, an implementation of equals can never call isEqualTo, because program code cannot call model methods (since model methods can only be used in specifications).
/*@ public normal_behavior
  @   assignable objectState;
  @   ensures \result <==> this.isEqualTo(obj);
  @*/
public boolean equals(Object obj);

/*@ public normal_behavior
  @   requires obj != null;
  @   assignable \nothing;
  @   ensures (* \result is true when obj is equal to this object *);
  @ also
  @ public normal_behavior
  @   requires obj != null && \typeof(this) == \type(Object);
  @   assignable \nothing;
  @   ensures \result <==> this == obj;
  @ also
  @ public normal_behavior
  @   requires obj == null;
  @   assignable \nothing;
  @   ensures \result <==> false;
  @
  @ public pure model boolean isEqualTo(Object obj) {
  @   return this == obj;
  @ }
  @*/

Fig. 2. The refactored specification for Object's equals method and the pure model method isEqualTo. The text between (* and *) in the first specification case of isEqualTo's specification is an "informal description", which formally is equivalent to writing true [53].
Therefore, to avoid code duplication when equals is not declared to be pure but the two methods share some common implementation code, one can introduce a (non-model) pure, private method that both equals and isEqualTo can call.

We have also applied this refactoring to all the collection classes in java.util (and in other packages) that we had previously specified, in order to check that the solution is viable. So far the results seem satisfactory. However, as of May 2003, this restructuring is not part of the JML release, because the JML tools are not yet able to handle some of the details of this approach. In particular, the runtime assertion checker is not yet able to compile the model methods added to Object without having all of Object's source code available. (And we cannot legally ship Sun's source code for Object in the JML release.) However, we are working on solutions to this problem that will allow us to obtain more experience with this approach and to do more case studies.
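The shared-helper refactoring just described might look like the following hedged sketch; the Money class and the sameValue helper are our own illustration:

    public class Money {
      private final long cents;

      public Money(long cents) { this.cents = cents; }

      // A non-model, pure, private helper that both methods can call.
      private /*@ pure @*/ boolean sameValue(Object obj) {
        return obj instanceof Money && ((Money) obj).cents == cents;
      }

      public boolean equals(Object obj) {
        // in the problematic cases, equals might also perform benevolent
        // side effects (e.g., lazy initialization) before this call
        return sameValue(obj);
      }

      /*@ public pure model boolean isEqualTo(Object obj) {
        @   return sameValue(obj);
        @ } @*/
    }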
5.4 Future Work on Synchronized Methods and Purity
JML currently permits synchronized methods to be declared pure if they meet all the criteria described in Section 5.1. Given that obtaining a lock is a side effect that can affect control flow in a program, does allowing synchronized methods to be pure violate the intent of JML’s purity restrictions? On the surface it would seem so, because when a synchronized method gains a lock, it may change the outcome of other concurrent threads. Furthermore, execution of such a method might block, conceivably even causing a deadlock between concurrent threads that would not occur if one was not doing assertion checking. However, since we have largely ignored concurrency thus far in JML’s design, we leave resolution of this issue for future work.
6 Mathematical Libraries
As described in Section 1.3.4, we need to provide a library of mathematical concepts with JML in a way that does not overwhelm programmers, and yet is useful for formal verification.
6.1 Hiding the Mathematics
It is sometimes convenient to use mathematical concepts such as sets and sequences in specification, particularly for collection classes [36, 68, 85]. For example, the specification of Stack in Figure 1 uses the type JMLObjectSequence, which is part of JML's org.jmlspecs.models package. This package contains types that are intended for such mathematical modeling. Besides sequences, these include sets, bags, relations, and maps, and a few other convenience types.

Most types in org.jmlspecs.models have only pure methods and constructors. (The package does have some types with non-pure methods: various kinds of iterators and enumerators, whose side-effecting methods cannot be used in specification expressions.) For example, JMLObjectSequence's insertFront method returns a sequence object that is like the receiver, but with its argument placed at the front; the receiver is not changed in any way. JMLObjectSequence's trailer method similarly returns a sequence containing all but the first element of the receiver, without changing the receiver. Because such methods are pure, they can be used during runtime assertion checking without changing the underlying computation.

JML gains two advantages from having these mathematical modeling types in a Java package, as opposed to having them be purely mathematical concepts. First, these types all have Java implementations and thus can be used during runtime assertion checking. Second, using these types in assertions avoids the introduction of special mathematical notation; instead, normal Java expressions (method calls) are used to do things like concatenating sequences or intersecting sets. This is an advantage for our main audience, which consists of programmers and not mathematicians.
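The value-style, side-effect free flavor of these types can be sketched as follows; this is our own illustration, and we assume that JMLObjectSequence's no-argument constructor yields the empty sequence (insertFront, trailer, first, and isEmpty are the methods named above and in Figure 1):

    import org.jmlspecs.models.JMLObjectSequence;

    public class SeqDemo {
      public static void main(String[] args) {
        JMLObjectSequence s = new JMLObjectSequence();  // assumed empty
        JMLObjectSequence s2 = s.insertFront("a");      // s is unchanged
        JMLObjectSequence s3 = s2.insertFront("b");     // the sequence ("b", "a")
        System.out.println(s3.first());                 // prints b; s3 unchanged
        System.out.println(s3.trailer().isEmpty());     // prints false: trailer is ("a")
      }
    }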
6.2 Use by Theorem Provers
The second part of the mathematical libraries problem described in Section 1.3.4 is that the library of mathematical modeling types should be useful for formal verification. The types in the org.jmlspecs.models package are intended to correspond (loosely) to the libraries of mathematical concepts found in theorem provers, such as PVS. As we gain experience, we can add additional methods to these types to improve their correspondence to these mathematical concepts. It is also possible to add new packages of such types tailored to specific theorem provers or to other notations, such as OCL.

When translating specification expressions into theorem prover input, the LOOP tool currently treats all methods in the same way: it does not make a special case for pure methods in the org.jmlspecs.models package. This makes the resulting proof obligations more complex than is desirable. Since the types in the models package are known, it seems that one should be able, as a special case, to replace the general semantics of such a method call with a call to some specific function from the theorem prover's library of mathematical concepts. To facilitate this, it may be that these model types should all be declared to be final, which is currently not the case.
7 Related Work
We have already discussed how JML differs from conventional formal specification languages, such as Z [80, 79, 87], VDM [6, 27, 74, 43], the Larch family [33, 48, 52, 85], and RESOLVE [22, 73]. To summarize, the main difference is that JML's specification expressions are based on a subset of the Java programming language, a design that is more congenial to Java programmers.

The Alloy Annotation Language (AAL) offers a syntax similar to JML for annotating Java programs [46]. AAL supports extensive compile-time checking based on static analysis techniques. Unlike similar static analysis tools such as ESC/Java [18], AAL also supports method calls and relational expressions in assertions. However, AAL's assertion language is based on a simple first-order logic with relational operators [39] and not on a subset of Java expressions. We believe that a Java-based syntax is more likely to gain acceptance among Java programmers. However, JML could adopt some of AAL's features for specifying sets of objects using regular expressions. These would be helpful in using JML's frame axioms, where they would allow JML to more precisely describe locations that can be assigned to in the method. (Another option that would have similar benefits would be to use the approach taken in DemeterJ [67].)

We have also discussed how JML differs from design by contract languages, such as Eiffel [70, 71], and tools, such as APP [77]. Summarizing, JML provides better support for complete specifications and formal verification by
– extending the set of specification expressions with more expressive mathematical constructs, such as quantifiers,
– ensuring that specification expressions do not contain side effects, and
– providing a library of types corresponding to mathematical concepts.
JML’s specification-only (model) declarations and frame axioms also contribute to its ability to specify types more completely than is easily done with design by contract tools. We know of several other design by contract tools for Java [5,21,24,45,47,75]. The approaches vary from a simple assertion mechanism similar to the assert macros of C and C++ to full-fledged contract enforcement capabilities. Jass [5], iContract [47], and JContract [75] focus on the practical use of design by contract in Java. Handshake and jContractor focus on implementation techniques such as library-based on-the-fly instrumentation of contracts [21, 45]. Contract Java focuses on properly blaming contract violations [24, 25]. These notations and tools suffer from the same problems as Eiffel. That is, none of them guarantee the lack of side effects in assertions, handle undefinedness in a way that would facilitate formal verification and reasoning, support more expressive mathematical notations such as quantifiers, or provide a set of immutable types designed for use in specifications. In sum, they all focus on runtime checking, and thus it is difficult to write complete specifications for formal verification and reasoning.
8 Conclusion
JML synthesizes the best from the worlds of design by contract and more mathematical specification languages. Because of its expressive mathematical notations, its specification-only (model) declarations, and library of mathematical modeling types, one can more easily write complete specifications in JML than in a design by contract language, such as Eiffel. These more complete specifications, along with JML’s purity checking, allow JML to be useful for formal verification. Thus, JML’s synthesis of features allows it to serve many roles in the Java formal methods community. Our experience so far is that this approach has had a modest impact. Release 3.7 of JML has been downloaded almost 400 times. JML has been used in at least 5 universities for teaching some aspects of formal methods. It is used somewhat extensively in the Java Smart Card industry and has been used in at least one company outside of that industry (Fulcrum). In the future, we would like to extend the range of tools that JML supports to include tools for model checking and specification of concurrent Java programs [1]. We invite others to join us in this effort to furnish Java programmers with a single notation that can be used by many tools.
Acknowledgments

The work of Leavens, Cheon, Clifton, and Ruby was supported in part by the US National Science Foundation, under grants CCR-0097907 and CCR-0113181. Thanks to Robyn Lutz, Sharon Ryan, and Janet Leavens for comments on earlier drafts of this paper. Thanks to all who have contributed to the design and implementation of JML, including Al Baker, Erik Poll, Bart Jacobs, Joe Kiniry, Rustan Leino, Raymie Stata, Michael Ernst, Gary Daugherty, Arnd Poetzsch-Heffter, Peter Müller, and others acknowledged in [55].
References

1. E. Abraham-Mumm, F. S. de Boer, W.-P. de Roever, and M. Steffen. A tool-supported proof system for multithreaded Java. In Frank de Boer, Marcello Bonsangue, Susanne Graf, and Willem-Paul de Roever, editors, FMCO 2002: Formal Methods for Component Objects, Proceedings, Lecture Notes in Computer Science. Springer-Verlag, 2003.
2. Pierre America. Inheritance and subtyping in a parallel object-oriented language. In Jean Bezivin et al., editors, ECOOP '87, European Conference on Object-Oriented Programming, Paris, France, pages 234–242, New York, NY, June 1987. Springer-Verlag. Lecture Notes in Computer Science, volume 276.
3. Pierre America. Designing an object-oriented programming language with behavioural subtyping. In J. W. de Bakker, W. P. de Roever, and G. Rozenberg, editors, Foundations of Object-Oriented Languages, REX School/Workshop, Noordwijkerhout, The Netherlands, May/June 1990, volume 489 of Lecture Notes in Computer Science, pages 60–90. Springer-Verlag, New York, NY, 1991.
4. H. Barringer, J. H. Cheng, and C. B. Jones. A logic covering undefinedness in program proofs. Acta Informatica, 21(3):251–269, October 1984.
5. D. Bartetzko, C. Fischer, M. Möller, and H. Wehrheim. Jass - Java with assertions. In Workshop on Runtime Verification held in conjunction with the 13th Conference on Computer Aided Verification, CAV'01, 2001. Published in Electronic Notes in Theoretical Computer Science, K. Havelund and G. Rosu (eds.), 55(2), 2001. Available from www.elsevier.nl.
6. Juan Bicarregui, John S. Fitzgerald, Peter A. Lindsay, Richard Moore, and Brian Ritchie. Proof in VDM: A Practitioner's Guide. Springer-Verlag, New York, NY, 1994.
7. Grady Booch, James Rumbaugh, and Ivar Jacobson. The Unified Modeling Language User Guide. Object Technology Series. Addison Wesley Longman, Reading, Mass., 1999.
8. Chandrasekhar Boyapati, Sarfraz Khurshid, and Darko Marinov. Korat: Automated testing based on Java predicates. In Proceedings International Symposium on Software Testing and Analysis (ISSTA), pages 123–133. ACM, July 2002.
9. Patrice Chalin. Back to basics: Language support and semantics of basic infinite integer types in JML and Larch. Technical Report CU-CS 2002-003.1, Computer Science Department, Concordia University, October 2002.
10. Patrice Chalin. Improving JML: For a safer and more effective language. Technical Report 2003-001.1, Computer Science Department, Concordia University, March 2003.
11. Yoonsik Cheon and Gary T. Leavens. The Larch/Smalltalk interface specification language. ACM Transactions on Software Engineering and Methodology, 3(3):221–253, July 1994.
12. Yoonsik Cheon and Gary T. Leavens. A quick overview of Larch/C++. Journal of Object-Oriented Programming, 7(6):39–49, October 1994.
13. Yoonsik Cheon and Gary T. Leavens. A runtime assertion checker for the Java Modeling Language (JML). In Hamid R. Arabnia and Youngsong Mun, editors, Proceedings of the International Conference on Software Engineering Research and Practice (SERP '02), Las Vegas, Nevada, USA, June 24-27, 2002, pages 322–328. CSREA Press, June 2002.
14. Yoonsik Cheon and Gary T. Leavens. A simple and practical approach to unit testing: The JML and JUnit way. In Boris Magnusson, editor, ECOOP 2002 — Object-Oriented Programming, 16th European Conference, Málaga, Spain, Proceedings, volume 2374 of Lecture Notes in Computer Science, pages 231–255, Berlin, June 2002. Springer-Verlag.
15. Curtis Clifton. MultiJava: Design, implementation, and evaluation of a Java-compatible language supporting modular open classes and symmetric multiple dispatch. Technical Report 01-10, Department of Computer Science, Iowa State University, Ames, Iowa, 50011, November 2001. Available from www.multijava.org.
16. Curtis Clifton, Gary T. Leavens, Craig Chambers, and Todd Millstein. MultiJava: Modular open classes and symmetric multiple dispatch for Java. In OOPSLA 2000 Conference on Object-Oriented Programming, Systems, Languages, and Applications, volume 35(10) of ACM SIGPLAN Notices, pages 130–145, New York, October 2000. ACM.
17. Edward Cohen. Programming in the 1990s: An Introduction to the Calculation of Programs. Springer-Verlag, New York, NY, 1990.
18. David L. Detlefs, K. Rustan M. Leino, Greg Nelson, and James B. Saxe. Extended static checking. SRC Research Report 159, Compaq Systems Research Center, 130 Lytton Ave., Palo Alto, Dec 1998.
19. Krishna Kishore Dhara and Gary T. Leavens. Forcing behavioral subtyping through specification inheritance. In Proceedings of the 18th International Conference on Software Engineering, Berlin, Germany, pages 258–267. IEEE Computer Society Press, March 1996. A corrected version is Iowa State University, Dept. of Computer Science TR #95-20c.
20. Edsger W. Dijkstra and Carel S. Scholten. Predicate Calculus and Program Semantics. Springer-Verlag, NY, 1990.
21. Andrew Duncan and Urs Hölzle. Adding contracts to Java with Handshake. Technical Report TRCS98-32, Department of Computer Science, University of California, Santa Barbara, CA, December 1998.
22. Stephen H. Edwards, Wayne D. Heym, Timothy J. Long, Murali Sitaraman, and Bruce W. Weide. Part II: Specifying components in RESOLVE. ACM SIGSOFT Software Engineering Notes, 19(4):29–39, Oct 1994.
23. Michael Ernst, Jake Cockrell, William G. Griswold, and David Notkin. Dynamically discovering likely program invariants to support program evolution. IEEE Transactions on Software Engineering, 27(2):1–25, February 2001.
24. Robert Bruce Findler and Matthias Felleisen. Contract soundness for object-oriented languages. In OOPSLA '01 Conference Proceedings, Object-Oriented Programming, Systems, Languages, and Applications, October 14-18, 2001, Tampa Bay, Florida, USA, pages 1–15, October 2001.
25. Robert Bruce Findler, Mario Latendresse, and Matthias Felleisen. Behavioral contracts and behavioral subtyping. In Proceedings of Joint 8th European Software Engineering Conference (ESEC) and 9th ACM SIGSOFT International Symposium on the Foundations of Software Engineering (FSE), September 10-14, 2001, Vienna, Austria, September 2001.
26. Kate Finney. Mathematical notation in formal specification: Too difficult for the masses? IEEE Transactions on Software Engineering, 22(2):158–159, February 1996.
27. John Fitzgerald and Peter Gorm Larsen. Modelling Systems: Practical Tools in Software Development. Cambridge, Cambridge, UK, 1998.
28. Cormac Flanagan, K. Rustan M. Leino, Mark Lillibridge, Greg Nelson, James B. Saxe, and Raymie Stata. Extended static checking for Java. In Cindy Norris and James B. Fenwick, Jr., editors, Proceedings of the ACM SIGPLAN 2002 Conference on Programming Language Design and Implementation (PLDI-02), volume 37, 5 of SIGPLAN, pages 234–245, New York, June 17–19 2002. ACM Press.
29. Lisa Friendly. The design of distributed hyperlinked programming documentation. In S. Fraïssé, F. Garzotto, T. Isakowitz, J. Nanard, and M. Nanard, editors, Proceedings of the International Workshop on Hypermedia Design (IWHD'95), Montpellier, France, 1–2 June 1995, pages 151–173. Springer, 1995.
30. David K. Gifford and John M. Lucassen. Integrating functional and imperative programming. In ACM Conference on LISP and Functional Programming, pages 28–38. ACM, August 1986.
31. David Gries and Fred B. Schneider. A Logical Approach to Discrete Math. Texts and Monographs in Computer Science. Springer-Verlag, New York, NY, 1994.
32. David Gries and Fred B. Schneider. Avoiding the undefined by underspecification. In Jan van Leeuwen, editor, Computer Science Today: Recent Trends and Developments, number 1000 in Lecture Notes in Computer Science, pages 366–373. Springer-Verlag, New York, NY, 1995.
33. John V. Guttag, James J. Horning, S. J. Garland, K. D. Jones, A. Modet, and J. M. Wing. Larch: Languages and Tools for Formal Specification. Springer-Verlag, New York, NY, 1993.
34. C. A. R. Hoare. An axiomatic basis for computer programming. Communications of the ACM, 12(10):576–583, October 1969.
35. C. A. R. Hoare. Notes on data structuring. In Ole-J. Dahl, E. Dijkstra, and C. A. R. Hoare, editors, Structured Programming, pages 83–174. Academic Press, Inc., New York, NY, 1972.
36. C. A. R. Hoare. Proof of correctness of data representations. Acta Informatica, 1(4):271–281, 1972.
37. Marieke Huisman. Reasoning about Java Programs in Higher Order Logic with PVS and Isabelle. IPA dissertation series, 2001-03, University of Nijmegen, Holland, February 2001.
38. Marieke Huisman and Bart Jacobs. Java program verification via a Hoare logic with abrupt termination. In T. Maibaum, editor, Fundamental Approaches to Software Engineering (FASE 2000), volume 1783 of LNCS, pages 284–303. Springer-Verlag, 2000. An earlier version is technical report CSI-R9912.
39. Daniel Jackson. Alloy: A lightweight object modeling notation. ACM Transactions on Software Engineering and Methodology, 11(2):256–290, April 2002.
40. Bart Jacobs, Joseph Kiniry, and M. Warnier. Java program verification challenges. In Frank de Boer, Marcello Bonsangue, Susanne Graf, and Willem-Paul de Roever, editors, FMCO 2002: Formal Methods for Component Objects, Proceedings, Lecture Notes in Computer Science. Springer-Verlag, 2003.
41. Bart Jacobs and Erik Poll. A logic for the Java Modeling Language JML. In Fundamental Approaches to Software Engineering (FASE 2001), Genova, Italy, 2001, volume 2029 of Lecture Notes in Computer Science, pages 284–299. Springer-Verlag, 2001.
42. Bart Jacobs, Joachim van den Berg, Marieke Huisman, Martijn van Berkum, Ulrich Hensel, and Hendrik Tews. Reasoning about Java classes (preliminary report). In OOPSLA '98 Conference Proceedings, volume 33(10) of ACM SIGPLAN Notices, pages 329–340. ACM, October 1998.
43. Cliff B. Jones. Systematic Software Development Using VDM. International Series in Computer Science. Prentice Hall, Englewood Cliffs, N.J., second edition, 1990.
282
G.T. Leavens et al.
44. H. B. M. Jonkers. Upgrading the pre- and postcondition technique. In S. Prehn and W. J. Toetenel, editors, VDM ’91 Formal Software Development Methods 4th International Symposium of VDM Europe Noordwijkerhout, The Netherlands, Volume 1: Conference Contributions, volume 551 of Lecture Notes in Computer Science, pages 428–456. Springer-Verlag, New York, NY, October 1991. 45. Murat Karaorman, Urs Holzle, and John Bruno. jContractor: A reflective Java library to support design by contract. In Pierre Cointe, editor, Meta-Level Architectures and Reflection, Second International Conference on Reflection ’99, SaintMalo, France, July 19–21, 1999, Proceedings, volume 1616 of Lecture Notes in Computer Science, pages 175–196. Springer-Verlag, July 1999. 46. Sarfraz Khurshid, Darko Marinov, and Daniel Jackson. An analyzable annotation language. In Proceedings of OOPSLA ’02 Conference on Object-Oriented Programming, Languages, Systems, and Applications, volume 37(11) of SIGPLAN Notices, pages 231–245, New York, NY, November 2002. ACM. 47. Reto Kramer. iContract – the Java design by contract tool. TOOLS 26: Technology of Object-Oriented Languages and Systems, Los Alamitos, California, pages 295– 307, 1998. 48. Leslie Lamport. A simple approach to specifying concurrent systems. Communications of the ACM, 32(1):32–45, January 1989. 49. Craig Larman. Applying UML and Patterns: An Introduction to Object-Oriented Analysis and Design and the Unified Process. Prentice Hall PTR, Upper Saddle River, NJ, second edition edition, 2002. 50. Gary T. Leavens. An overview of Larch/C++: Behavioral specifications for C++ modules. In Haim Kilov and William Harvey, editors, Specification of Behavioral Semantics in Object-Oriented Information Modeling, chapter 8, pages 121–142. Kluwer Academic Publishers, Boston, 1996. An extended version is TR #96-01d, Department of Computer Science, Iowa State University, Ames, Iowa, 50011. 51. Gary T. Leavens. Larch/C++ Reference Manual. Version 5.41. Available in ftp: //ftp.cs.iastate.edu/pub/larchc++/lcpp.ps.gz or on the World Wide Web at the URL http://www.cs.iastate.edu/˜leavens/larchc++.html, April 1999. 52. Gary T. Leavens. Larch frequently asked questions. Version 1.110. Available in http://www.cs.iastate.edu/˜leavens/larch-faq.html, May 2000. 53. Gary T. Leavens and Albert L. Baker. Enhancing the pre- and postcondition technique for more expressive specifications. In Jeannette M. Wing, Jim Woodcock, and Jim Davies, editors, FM’99 — Formal Methods: World Congress on Formal Methods in the Development of Computing Systems, Toulouse, France, September 1999, Proceedings, volume 1709 of Lecture Notes in Computer Science, pages 1087– 1106. Springer-Verlag, 1999. 54. Gary T. Leavens, Albert L. Baker, and Clyde Ruby. JML: A notation for detailed design. In Haim Kilov, Bernhard Rumpe, and Ian Simmonds, editors, Behavioral Specifications of Businesses and Systems, pages 175–188. Kluwer Academic Publishers, Boston, 1999. 55. Gary T. Leavens, Albert L. Baker, and Clyde Ruby. Preliminary design of JML: A behavioral interface specification language for Java. Technical Report 98-06v, Iowa State University, Department of Computer Science, May 2003. See www.jmlspecs.org. 56. Gary T. Leavens and Yoonsik Cheon. Preliminary design of Larch/C++. In U. Martin and J. Wing, editors, Proceedings of the First International Workshop on Larch, July, 1992, Workshops in Computing, pages 159–184. Springer-Verlag, New York, NY, 1993.
How the Design of JML Accommodates
283
57. Gary T. Leavens and Krishna Kishore Dhara. Concepts of behavioral subtyping and a sketch of their extension to component-based systems. In Gary T. Leavens and Murali Sitaraman, editors, Foundations of Component-Based Systems, chapter 6, pages 113–135. Cambridge University Press, 2000. 58. Gary T. Leavens, K. Rustan M. Leino, Erik Poll, Clyde Ruby, and Bart Jacobs. JML: notations and tools supporting detailed design in Java. In OOPSLA 2000 Companion, Minneapolis, Minnesota, pages 105–106. ACM, October 2000. 59. Gary T. Leavens and Don Pigozzi. A complete algebraic characterization of behavioral subtyping. Acta Informatica, 36:617–663, 2000. 60. Gary T. Leavens and William E. Weihl. Reasoning about object-oriented programs that use subtypes (extended abstract). In N. Meyrowitz, editor, OOPSLA ECOOP ’90 Proceedings, volume 25(10) of ACM SIGPLAN Notices, pages 212–223. ACM, October 1990. 61. Gary T. Leavens and William E. Weihl. Specification and verification of objectoriented programs using supertype abstraction. Acta Informatica, 32(8):705–778, November 1995. 62. Gary T. Leavens and Jeannette M. Wing. Protective interface specifications. Formal Aspects of Computing, 10:59–75, 1998. 63. Gary Todd Leavens. Verifying object-oriented programs that use subtypes. Technical Report 439, Massachusetts Institute of Technology, Laboratory for Computer Science, February 1989. The author’s Ph.D. thesis. 64. K. Rustan M. Leino. A myth in the modular specification of programs. Technical Report KRML 63, Digital Equipment Corporation, Systems Research Center, 130 Lytton Avenue Palo Alto, CA 94301, November 1995. Obtain from the author, at [email protected]. 65. K. Rustan M. Leino, Greg Nelson, and James B. Saxe. ESC/Java user’s manual. Technical note, Compaq Systems Research Center, October 2000. 66. K. Rustan M. Leino, James B. Saxe, and Raymie Stata. Checking Java programs via guarded commands. Technical Note 1999-002, Compaq Systems Research Center, Palo Alto, CA, May 1999. 67. Karl Lieberherr, Doug Orleans, and Johan Ovlinger. Aspect-oriented programming with adaptive methods. Communications of the ACM, 44(10):39–41, October 2001. 68. Barbara Liskov and John Guttag. Abstraction and Specification in Program Development. The MIT Press, Cambridge, Mass., 1986. 69. Barbara Liskov and Jeannette Wing. A behavioral notion of subtyping. ACM Transactions on Programming Languages and Systems, 16(6):1811–1841, November 1994. 70. Bertrand Meyer. Eiffel: The Language. Object-Oriented Series. Prentice Hall, New York, NY, 1992. 71. Bertrand Meyer. Object-oriented Software Construction. Prentice Hall, New York, NY, second edition, 1997. 72. Jeremy W. Nimmer and Michael D. Ernst. Static verification of dynamically detected program invariants: Integrating Daikon and ESC/Java. In Proceedings of RV’01, First Workshop on Runtime Verification. Elsevier, July 2001. To appear in Electronic Notes in Theoretical Computer Science. 73. William F. Ogden, Murali Sitaraman, Bruce W. Weide, and Stuart H. Zweben. Part I: The RESOLVE framework and discipline — a research synopsis. ACM SIGSOFT Software Engineering Notes, 19(4):23–28, October 1994.
284
G.T. Leavens et al.
74. International Standards Organization. Information technology – programming languages, their environments and system software interfaces – Vienna Development Method – specification language – part 1: Base language. ISO/IEC 13817-1, December 1996. 75. Parasoft Corporation. Using design by contractTM to automate JavaTM software and component testing. Available from http://www.parasoft.com/jsp/ products/tech papers.jsp?product=Jcontract, as of Feb. 2003. 76. Arun D. Raghavan and Gary T. Leavens. Desugaring JML method specifications. Technical Report 00-03c, Iowa State University, Department of Computer Science, August 2001. 77. D. S. Rosenblum. Towards a method of programming with assertions. In Proceedings of the 14th International Conference on Software Engineering, pages 92–104, May 1992. 78. Clyde Ruby and Gary T. Leavens. Safely creating correct subclasses without seeing superclass code. In OOPSLA 2000 Conference on Object-Oriented Programming, Systems, Languages, and Applications, Minneapolis, Minnesota, volume 35(10) of ACM SIGPLAN Notices, pages 208–228, October 2000. 79. J. Spivey. An introduction to Z and formal specifications. Software Engineering Journal, January 1989. 80. J. Michael Spivey. The Z Notation: A Reference Manual. International Series in Computer Science. Prentice-Hall, New York, NY, 1989. ISBN 013983768X. 81. Jean-Pierre Talpin and Pierre Jouvelot. The type and effect discipline. Information and Computation, 111(2):245–296, June 1994. 82. Jos Warmer and Anneke Kleppe. The Object Constraint Language: Precise Modeling with UML. Addison Wesley Longman, Reading, Mass., 1999. 83. Jos Warmer and Anneke Kleppe. OCL: The constraint language of the UML. Journal of Object-Oriented Programming, 12(1):10–13,28, March 1999. 84. Alan Wills. Capsules and types in Fresco: Program validation in Smalltalk. In P. America, editor, ECOOP ’91: European Conference on Object Oriented Programming, volume 512 of Lecture Notes in Computer Science, pages 59–76. Springer-Verlag, New York, NY, 1991. 85. Jeannette M. Wing. Writing Larch interface language specifications. ACM Transactions on Programming Languages and Systems, 9(1):1–24, January 1987. 86. Jeannette Marie Wing. A two-tiered approach to specifying programs. Technical Report TR-299, Massachusetts Institute of Technology, Laboratory for Computer Science, 1983. 87. Jim Woodcock and Jim Davies. Using Z: Specification, Refinement, and Proof. Prentice Hall International Series in Computer Science, 1996.
Finding Implicit Contracts in .NET Components

Karine Arnout 1 and Bertrand Meyer 1,2

1 Chair of Software Engineering, Swiss Federal Institute of Technology (ETH), CH-8092 Zurich, Switzerland
2 Eiffel Software, 356 Storke Road, Santa Barbara CA 93117, USA
[email protected]
http://se.inf.ethz.ch, http://www.eiffel.com
Abstract. Are contracts inherent in reusable libraries, or just one design technique among others? To help answer this question, we performed an empirical study of library classes from the .NET Collections library, which doesn’t use Design by Contract™, to look for unexpressed contracts. This article reports on the buried contracts we have found, and discusses improvements to the architecture – especially to the libraries’ ease of learning and ease of use – that may result from making the contracts explicit. It extends previous reports [3,4,5,6] with an analysis of the benefits of an a posteriori addition of contracts for the library users. Keywords: Design by Contract™, Library design, Reuse, Implicit contracts, .NET, Metadata, Contract Wizard, Eiffel.
1 Introduction

Equipping libraries with contracts has become second nature to designers working with Eiffel. Many commonly used libraries, however, don't show any contracts at all. The resulting style is very different, and, to someone used to Design by Contract [21,23,25,31], deficient. Because the benefits of contracts are so clear to those who use them, it's natural to suspect that non-Eiffel programmers omit contracts because they have no good way to express them, or haven't even been taught the concepts, but that conceptually contracts are there all the same: that inside every contract-less specification there is a contract wildly signaling to be let out. For an Eiffel programmer this is the natural interpretation. But when you are doing something different from the rest of the world, it's good to check your own sanity. Are we wrong in seeing contracts around libraries, and the rest of the world – including the most recent general-purpose development frameworks – right in continuing to act as if contracts had never been invented? This article is such a sanity check. The basic conjecture that it explores may be stated more precisely:

Box 1. The Closet Contract Conjecture
Eiffel libraries have contracts; most others don't. Which is the right explanation?
− The contract-rich style of Eiffel libraries is but an artefact of the support for contracts in the Eiffel method, language and tools. Remove contract mechanisms, and the contracts just go away.
− Contracts are inherent in library design; if not explicitly stated, as in C++/Java/.NET libraries, they are lurking anyway under the cover, either suppressed or replaced by comments in the program, explanations in the documentation, exceptions and other ersatz techniques.

Resolving the Closet Contract Conjecture is interesting for several reasons:
− The answer can shed light on important issues of reusable component design, one of the keys to progress in software engineering.
− An answer can help library users (application programmers) choose between competing libraries.
− On a more specific point, the answer would help ascertain the potential usefulness of a "Contract Wizard". Such a tool, of which a first version has been implemented by Eiffel Software [1], takes advantage of the reflection facilities of .NET — "Metadata" — to let its users work interactively on compiled classes, coming from any non-contracted language such as C#, C++, Visual Basic, Cobol or Java, and add contracts to them a posteriori. But this is only interesting if the second answer holds for the Closet Contract Conjecture. If not, the Wizard's user wouldn't find any interesting contracts to add.
− If that second answer indeed holds, we may use it to improve our understanding of the libraries, and even to improve the libraries themselves by turning the implicit contracts that we have elicited into explicit elements of the software.

To help answer the conjecture, we have started a study of non-contracted libraries to see if we could spot implicit contracts. The .NET collection library [27], a comprehensive set of data structure implementations, has been one of the first targets. We examined some commonly used .NET collection classes, sleuthing around for hidden contracts, and trying to uncover language or documentation techniques used to make up for the absence of proper contract mechanisms such as precondition clauses, postcondition clauses and class invariants. Where we spotted closet contracts, we proceeded to out them, by producing class variants that retain the original APIs but make the contracts explicit. We compared the result both with the originals and with some of the classes' closest counterparts in the EiffelBase library. The rest of this presentation describes the analysis and its outcomes:
− Section 2 provides more details on why we have engaged in this study and explains the method of analysis.
− Section 3 recalls the principles of Design by Contract and their application to library design.
− Section 4 summarizes the contributions of .NET and its Collections library.
− Section 5 presents the results of analyzing an important .NET collection class: ArrayList [28].
− Section 6 introduces a variant of ArrayList where the implicit contracts detected in the official version have been made explicit, and gives measurements of properties of the contracted class.
− Section 7 extends the analysis to some other classes and interfaces to assess the consistency of the results across the library.
− Section 8 compares the effect of the two design styles, with and without contracts, on the development of applications using a library: ease of use, ease of learning, bug avoidance.
− Section 9 presents related work about contract extraction and evaluates the possibility of automating the extraction of hidden preconditions by analyzing the CIL code.
− Section 10 concludes with an assessment of the lessons learned.

The focus of this work is on design and programming methodology, in particular design methodology for the construction of libraries of reusable components; we are looking for techniques that help component producers turn out better components, help component consumers learn to use the components, and reduce the potential for errors arising from incorrect, incomplete or misunderstood specifications. We were surprised, when presenting early versions, that some listeners were mostly interested in possibilities of extracting the contracts automatically from non-contracted components. Although section 9 indeed describes possibilities in this direction, building on earlier work on contract extraction, we must warn the reader not to expect any miracles; no computer tool can divine the intent behind a programmer's work without some help from the programmer. The potential for automatic contract inference should not obscure the concrete and immediate benefits that good design methodology can bring to both the producers and consumers of reusable software.
2 The Context

2.1 A Distinctive Design Style

Applying to reusable libraries the ideas of Design by Contract [21,23,25,31] means equipping each library class with precise specifications, or "contracts", governing its interaction with the library's clients. Contracts include the class invariant, stating general consistency conditions to be maintained by every exported routine of the class, and, for every routine, preconditions stating the clients' obligations, and postconditions stating guarantees to the clients. Systematic application of these principles leads to a distinctive design style, immediately visible in Eiffel frameworks such as EiffelBase [10] [22] covering fundamental data structures and algorithms, EiffelVision for portable graphics, and others. These Eiffel libraries have been in wide use for many years and are seemingly appreciated by their users as easy to learn, convenient to use, and beneficial to the reliability of applications built with them. A recent report by the Software Engineering Institute [37] confirms that for components in general — not just classes — the use of contracts appears to be a key condition of any effort to improve "composability" and scale up the application of component-based technology. Design by Contract as it has been applied to libraries so far, mostly in Eiffel, is not an a posteriori addition to the design of a library; it is an integral part of the design process. The resulting contract-rich library APIs are markedly different from more traditional, contract-less designs. One might argue that criticism of current libraries [38] becomes partly unjustified when libraries are built according to this style. The
difference is clear, for example, in a comparison of two libraries that cover some of the same ground: EiffelBase [10] [22], which is based on Design by Contract, and the .NET Collections library [27], which is not. Most non-Eiffel libraries, such as the .NET framework's libraries, have indeed been built without explicit consideration of the notion of contract. Three possible explanations come to mind:
− The library authors do not know about Design by Contract.
− They know about the concepts, but don't find them particularly useful.
− They know about the concepts and find them interesting, but too cumbersome to apply without built-in Eiffel-style support in the method, language and supporting tools.

Regardless of the reason, the difference in styles is so stark that we must ask what happened, in these contract-less libraries, to the properties that the Eiffel designer would have expressed in preconditions, postconditions and class invariants. It's this question that leads to the Closet Contract Conjecture: are the contracts of Eiffel libraries a figment of the Eiffel programmer's obsession with this mechanism? Or are they present anyway, hidden, in non-Eiffel libraries as well? The only way to find out is to search contract-less libraries for closet contracts. In performing this search we have been rummaging through interface specifications, source code when available, documentation, even – since any detective knows not to overlook the household's final output – generated code, which in .NET and Java still retains significant high-level information.

2.2 .NET Libraries and the Contract Wizard

A property of the .NET libraries that makes them particularly interesting for such a study is the flexibility of the .NET component model, which has enabled the development of a "Contract Wizard" [1], a tool that enables a user to examine a compiled module ("assembly" in .NET), typically coming from a contract-less language such as C#, Visual Basic, C++, Cobol etc., and interactively add contracts to its classes and routines, producing a proxy assembly that is contracted as if it had been written in Eiffel, but calls the original. The Contract Wizard relies on the reflection capabilities provided in .NET by the metadata that every assembly includes, providing interface information, such as the signature of each routine, retained from the source code in the compiling process. By nature, however, the Contract Wizard is only interesting if the Closet Contract Conjecture holds. This observation provides one of the incentives for the present study: as we consider further developments of the Contract Wizard, we must first gather empirical evidence confirming or denying its usefulness. If we found that .NET and other non-contracted libraries do very well without contracts, thank you very much, and that there are no useful closet contracts to be added, it would be a waste of time to continue working on the Contract Wizard.

2.3 Method of Work

Our library analyses have so far not relied on any automatic tools. Because we are looking for something that officially isn't there, we have to exercise our own interpretation to claim and authenticate our finds. It's incumbent on us to state why we think a
particular class characteristic, such as an exception, is representative of an underlying contract. Having to rely on a manual extraction process puts a natural limit on future extensions of this article’s analysis to other libraries. Beyond facilitating the analysis, automated extraction tools could help users of the Contract Wizard by suggesting possible contract additions. The results of this article indeed suggest certain patterns, in code or documentation, that point to possible contracts, as certain geological patterns point to possible oil deposits. However, the final process of contract elicitation, starting from non-contracted libraries, requires subjective decisions.
3 Building Libraries with Design by Contract

The ideas of Design by Contract are inspired by commercial relationships and business contracts, which formally express the rights and obligations binding a client and a supplier. Likewise, software contracts are a way to specify the roles and constraints applying to a class as a whole (class invariants) or to the routines of the class (preconditions and postconditions).

3.1 Why Use Contracts?

Many programmers who have heard of contracts think they are just a way to help test and debug programs through conditionally compiled instructions of the form

if not "Some condition I expect to hold here" then
   "Scream"
end

where "Scream" might involve triggering an exception, or stopping execution altogether. Such a use — similar to the "assert" of C — is only a small part of the application of contracts, and wouldn't by itself justify special language constructs. Contracts address a wider range of issues in the software process, for general application development as well as library design:
− Correctness: Contracts help build software right in the first place by avoiding bugs rather than correcting them once they are there. Designing with contracts encourages the designer to think about the abstract properties of each software element, and build the observance of these properties into the software.
− Documentation: From contracted software, automatic tools can extract documentation that is both abstract and precise. Because the information comes from the software text, this approach saves the effort of writing documentation as a separate product, and lowers the risk of divergence between software and documentation. It underlies the basic form of documentation for Eiffel software: the contract form, produced by tools of the Eiffel environment and retaining interface information only.
− Debugging and testing: Run-time monitoring of contracts permits a coherent, focused form of quality assurance based on verifying that the run-time state of the software satisfies the properties expected by the designers.
− Inheritance control: Design by Contract principles provide a coherent approach to inheritance, limiting the extent to which new routine definitions may affect the original semantics (preconditions may only be weakened, postconditions strengthened).
− Management: Contracts allow project managers and decision makers to understand the global purpose of a program without going into the depth of the code.

The principles are particularly relevant to library design. Eiffel libraries are thoroughly equipped with contracts stating their abstract properties, as relevant to clients.

3.2 Kinds of Contract Elements

Contracts express the semantic specifications of classes and routines. They are made of assertions: boolean expressions stating individual semantic properties, such as the property, in a class representing lists stored in a container of bounded capacity, that the number count of elements in a list must not exceed the maximum permitted, capacity. Uses of contracts include:
− Preconditions: Requirements under which a routine will function properly. A precondition is binding on clients (callers); the supplier (the routine) can turn it to its advantage to simplify its algorithm by assuming the precondition.
− Postconditions: Properties guaranteed by the supplier to the client on routine exit.
− Class invariants: Semantic constraints characterizing the integrity of instances of a class; they must be ensured by each constructor (creation procedure) and maintained by every exported routine.
− Check instructions: "Assert"-like constructs, often used on the client side, before a call, to check that a precondition is satisfied as expected.
− Loop variants and invariants: Correctness conditions for a loop.

Check instructions, loop variants and loop invariants address implementation correctness rather than properties of library interfaces and will not be considered further here. Although preconditions and postconditions are the best known forms of library contracts, class invariants are particularly important in an object-oriented context since they express fundamental properties of the abstract data type (ADT) underlying a class, and the correctness of the ADT implementation chosen for the class (representation invariant [9]). We must make sure that our contract elicitation process doesn't overlook them.

3.3 Contracts in Libraries

Even a very simple example shows the usefulness of contracts in library design. Consider a square root function specified, in a first approach, as

sqrt (x: REAL): REAL

This specification tells us that the function takes a REAL argument and returns a REAL result. That is already a form of contract, specifying the type signature of the
function. We can call it a signature contract. (Regrettably, some of the Java and .NET reference documentation uses the term "contract", without qualification, for such signature contracts, creating confusion with the well established use of the term in the rest of this article.) A more complete contract — a semantic contract if we need to distinguish it from mere signature contracts — should also specify properties of the argument and result that can't be captured by type information alone, but are just as important to the library client. The most obvious example is what happens for a negative argument, with at least four possible answers:
− The function might silently return a default value, such as zero. (Not very good!)
− It might return a default value, and set a special flag that it is the caller's responsibility to examine after a call.
− It might trigger an exception, which it is the caller's responsibility to handle (otherwise execution will probably terminate abnormally).
− It might produce aberrant behavior, such as entering an infinite loop or crashing the execution. (Not good in the absence of contracts.)

A fifth would be to return a COMPLEX result, but that is not permitted by statically typed languages if the specification, as above, declares the type of the result as REAL. A contract — here a precondition and a postcondition — will express which of these specifications the function implements. In Eiffel the function would appear as

sqrt (x: REAL): REAL is
      -- Mathematical square root of x, within epsilon
   require
      non_negative: x >= 0
   do
      ... Square root algorithm here ...
   ensure
      good_approximation: abs (Result ^ 2 - x) <= 2 * x * epsilon
   end
where epsilon is some appropriate value expressing the requested precision, abs gives the absolute value, and ^ is the power operator. The assertion tags non_negative and good_approximation are there for documentation purposes and will also appear in error messages if the contracts are checked at run time during debugging and testing. The contract form of the enclosing class, as produced by the environment tools, will show the above without the do… part and the is keyword (but with the header comment). This is the basic documentation that any application programmer wishing to use an Eiffel class will receive. That documentation is, essentially, the set of contracts associated with the class. Here we find direct support for the contract clauses — require, ensure, and the yet to be encountered invariant — in the language and the associated documentation standard, but the contract is inherent to the routine, regardless of its language of implementation. If a library is to provide a usable square root routine it must be based on such a contract. Unless you know under what conditions a square root
function will operate and what properties you may expect of its result, you couldn't use it properly. The general questions are:
− If there is no explicit contract discipline, comparable to the Eiffel practice of documenting all libraries through their contract form, where will we find the implicit contract?
− Will, as in this example, a contract always exist, whether expressed or not?

The rest of this study provides material for answering these questions.
4 Why .NET Libraries?

.NET libraries suggested themselves for our study not only because they are one of the most recent and widely publicized collections of general-purpose reusable components, but also because the innovative .NET concept of metadata equips them with specification information that appears directly useful to contract elicitation.

4.1 The Scope of .NET

The .NET libraries are part of the .NET framework [24], a major Microsoft endeavor. Overall, .NET addresses the needs of companies and the general public through advances in Web services infrastructure, new tools for business-to-business and business-to-consumer interactions (Universal Description, Discovery and Integration, MyServices, Biztalk), advanced Web mechanisms (through the ASP.NET framework), new security mechanisms and other innovations. In support of these goals, .NET includes new techniques and products of direct interest to developers. The most significant advance here is extensive support for multi-language programming with full interoperability between languages. Beyond Microsoft's own languages, C# and Visual Basic .NET, implementations exist for third-party languages, in particular Eiffel. (The Eiffel implementation on .NET includes the full language as on other platforms; in particular it supports Design by Contract, genericity and multiple inheritance without restrictions [2] [36].) All such language implementations benefit from common mechanisms including a versioning scheme, an extensive security framework, a component model considerably simpler to use than Microsoft's previous COM technology, facilities for memory management, debugging and exception handling, and a multi-language development environment (Visual Studio .NET), all supported by a Common Language Runtime. They also benefit from a set of libraries covering many areas of application, which the interoperability infrastructure makes available to programs written in all languages supported on .NET.

4.2 The Role of Metadata

The basic compilation unit in .NET, covering for example a library or a section of a library, is the assembly. The key to the framework's support for component-based development is the presence, in every assembly, of documentary information known as metadata, making the assembly self-describing in accordance with the self-documentation principle [23].
The metadata of an assembly — accessible to programs through the Reflection library, to users through various tools, and to the world at large in XML — provides information on the assembly's contents, including:
− A "manifest" describing the assembly name, version, culture and public key (if the assembly is signed).
− The list of dependencies on other .NET assemblies.
− For each class, the list of its parents (interfaces, and at most one class) and of its features (routines, attributes, properties and events).
− For each class member, the signature (including arguments and return type).

In addition to these predefined categories, developers can define, assuming proper source language support, their own specific kinds of metadata in the form of custom attributes. Metadata of both kinds – predefined and custom – opens attractive new possibilities. The Contract Wizard is one of them: by relying on the metadata, it enables users to examine a class and its features interactively, and add any appropriate contracts – all without having access to the source code.
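As a rough illustration of what this metadata gives a tool to work with, the following C# sketch (our own illustrative code, not part of the Contract Wizard) uses the Reflection library to walk the types of an assembly and print each method signature:

using System;
using System.Reflection;

public class MetadataDump {
   public static void Main(string[] args) {
      // Load a compiled assembly by file name, e.g. mscorlib.dll.
      Assembly asm = Assembly.LoadFrom(args[0]);
      foreach (Type t in asm.GetTypes()) {
         Console.WriteLine(t.FullName);
         foreach (MethodInfo m in t.GetMethods()) {
            // Signature information retained by the compiler:
            // return type, name, and parameter types.
            Console.Write("   {0} {1} (", m.ReturnType.Name, m.Name);
            ParameterInfo[] ps = m.GetParameters();
            for (int i = 0; i < ps.Length; i++) {
               if (i > 0) Console.Write(", ");
               Console.Write(ps[i].ParameterType.Name);
            }
            Console.WriteLine(")");
         }
      }
   }
}

This is exactly the information — and no more — that a contract elicitation tool can count on: signatures are present in the metadata, semantic properties are not.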
5 Analysis of a Collection Class

Our first search for closet contracts will target the class ArrayList [28], part of the core .NET library (mscorlib.dll). This choice of class is almost arbitrary; in particular it was not based on any a priori guess that the class would suggest more (or fewer) contracts than any other. Rather, the informal criteria were that the class, describing lists implemented through arrays:
− Is of obvious practical use.
− Seems typical, in its style, of the Collections library.
− Has a direct counterpart, ARRAYED_LIST, in the EiffelBase library, opening the possibility of comparisons once we've completed the contract elicitation process.

5.1 Implicit Class Invariants

Documentation comments first reveal properties of ArrayList that fall into the category of class invariants. We find our first leads in the specification of class constructors, which states that

"The default initial capacity for an ArrayList is 16"

This comment implies that the capacity of the created object is greater than zero. Taking up this lead, we notice that all three constructors of ArrayList set the initial list's capacity to a positive value. This suggests an invariant, since it is part of the Design by Contract rules that an invariant property must be guaranteed by all the creation procedures of a class. To test our intuition, we examine the other key property of an invariant: that it must be preserved by every exported routine of the class. Examining all such routines
confirms this and suggests that we indeed have the germ of an invariant, which in Eiffel would be expressed by the clause

invariant
   positive_capacity: capacity >= 0
Continuing our exploration of the documentation, we note that two of the three constructors of ArrayList "initialize a new instance of the ArrayList class that is empty". The count of elements of an array list created in such a way must then be zero. The third constructor, which takes a collection c as parameter, "initializes a new instance of the ArrayList class that contains elements copied from the specified collection". So the number of elements of the new object equals the number of elements in the collection received as parameter, expressed by the assertion count = c.count (which in Eiffel would normally appear in a postcondition). Can then c.count be negative? Most likely not. Checking the documentation further reveals that the argument c passed to the constructor may denote any non-void collection, represented through one of the many classes inheriting from the ICollection interface [29]: arrayed list, sorted list, queue etc. Without performing an exhaustive examination, we note a hint in ArrayList itself, in the specification of routine Remove:

"The average execution time is proportional to Count. That is, this method is an O(n) operation, where n is Count"

which implies that count must always be non-negative. This evidence is enough to let us add a clause to the above invariant:

positive_count: count >= 0

These first two properties are simple but already useful. For our next insights we examine the specification of class members. Documentation on the Count property reveals interesting information: "Count is always less than or equal to Capacity". The self-assurance of this statement indicates that this property of the class always holds, suggesting that it is a class invariant. Hence a third invariant property for class ArrayList, yielding the accumulated clause

invariant
   positive_capacity: capacity >= 0
   positive_count: count >= 0
   valid_count: count <= capacity
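Until these invariants are made explicit in the library itself, a .NET client can at least monitor them with conditionally compiled assertions. A minimal sketch (the CheckInvariant helper is ours, not part of the library):

using System.Collections;
using System.Diagnostics;

public class ArrayListInvariantCheck {
   // Asserts the three elicited invariant clauses; the checks are
   // active only in builds compiled with the DEBUG symbol.
   public static void CheckInvariant(ArrayList al) {
      Debug.Assert(al.Capacity >= 0, "positive_capacity");
      Debug.Assert(al.Count >= 0, "positive_count");
      Debug.Assert(al.Count <= al.Capacity, "valid_count");
   }
}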
5.2 Implicit Routine Preconditions

Aside from implicit class invariants, the documentation also suggests preconditions. To get our clues we may look at documented exception cases. The specification of the routine Add of class ArrayList states that Add throws an exception of type NotSupportedException if the arrayed list on which it is called is read-only or has a fixed size. This suggests that the underlying implementation of Add first checks that the call target is writable (not read-only) and extendible (does not have a fixed size) before actually adding elements to the list. Such a requirement for having the method do what it is expected to do is the definition of a routine precondition in terms of Design by Contract. An Eiffel specification of Add would then include the following two preconditions:

require
   writable: not is_read_only
   extendible: not is_fixed_size
is_read_only and is_fixed_size are the Eiffel counterparts of the .NET properties IsReadOnly and IsFixedSize of class ArrayList. This example — one of many to be found in the reference documentation of the .NET Framework — suggests a scheme for extracting preconditions, applicable systematically, with some possibility of tool support:
− Read the exception condition; e.g. the array list is read-only.
− Take the opposite; for ArrayList the condition would be

not is_read_only
− Infer the underlying routine precondition; here:

writable: not is_read_only
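Applied to Add, the scheme yields a concrete picture of the guard that the library implementation presumably performs before inserting. Since the source is not available, the following C# sketch expresses it as a hypothetical wrapper subclass rather than as the actual library code:

using System;
using System.Collections;

// Hypothetical wrapper making the elicited preconditions of Add
// visible as an explicit guard (a reconstruction, not library code).
public class GuardedArrayList : ArrayList {
   public override int Add(object value) {
      if (IsReadOnly || IsFixedSize)
         throw new NotSupportedException(
            "List is read-only or has a fixed size.");
      return base.Add(value);
   }
}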
5.3 Implicit Routine Postconditions
Does the .NET documentation also reveal closet postconditions? For an answer we consider the example of the query IndexOf. More precisely, since it is an overloaded method, we choose a specific version identified by its signature:

public virtual int IndexOf (Object value);
The documentation explains that the return value is "the zero-based index of the first occurrence of value within the entire ArrayList, if found; otherwise, -1". We may rephrase this specification more explicitly:
− If value appears in the list, the result is the index of the first occurrence, hence greater than or equal to zero (.NET lists are indexed starting at zero) and less than Count, the number of elements in the list.
− If value is not found, the result is -1.
Such a property is a guarantee on routine exit, a condition incumbent on the supplier on completion of the task — the definition of a postcondition. In Eiffel we would add the corresponding clause to the routine:

ensure
   valid_index_if_found:
      contains (value) implies Result >= 0 and Result < count
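To see the elicited postcondition at work from the client side, one can monitor it with assertions; a hedged sketch (the checks mirror the contract, they are not part of the library):

using System.Collections;
using System.Diagnostics;

public class IndexOfPostconditionCheck {
   public static void Main() {
      ArrayList al = new ArrayList();
      al.Add("Hello");
      al.Add("World");
      int i = al.IndexOf("World");
      // valid_index_if_found: a value present in the list
      // yields an index in the interval [0, Count).
      Debug.Assert(!al.Contains("World") || (i >= 0 && i < al.Count));
      // The documented "otherwise, -1" case.
      Debug.Assert(al.IndexOf("absent") == -1);
   }
}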
This simple analysis suggests that routine postconditions do exist in .NET libraries, although not explicitly expressed because of the lack of support from the underlying environment. Unlike preconditions — for which it may be possible to devise supporting tools — postconditions are likely to require case-by-case human examination since they are scattered across the reference documentation.

5.4 Contracts in Interfaces

Class ArrayList implements three interfaces (completely abstract specification modules) of the .NET Collections library: IList, ICollection, and IEnumerable. It is interesting to subject such interfaces to the same analysis as we have applied to the class. This analysis (see sections 6 and 7 for general statistics about the contract rates of .NET collection classes and interfaces) indicates that IList, ICollection, IEnumerable, and IEnumerator (of which IEnumerable is a client), do have routine preconditions and postconditions similar to those of ArrayList. We have not, however, found class invariants in these interfaces. This is probably because of the more limited scope of interfaces in .NET (coming from Java) as compared to "deferred classes", their closest counterpart in the object-oriented model embodied by Eiffel. Deferred classes may have a mix of abstract and concrete features; in particular, they may include attributes. Interfaces, for their part, are purely abstract and may not contain attributes. The Eiffel policy provides a continuous spectrum from totally deferred classes, the equivalent of .NET and Java interfaces, to fully implemented classes, supporting the aims of object-oriented development with a seamless process from analysis (which typically uses deferred classes) to design and implementation (which make the classes progressively more concrete). Class invariants in the Eiffel libraries [22] often express consistency properties binding various attributes together. One can imagine, however, finding properties that hold for all the classes implementing the interface and that would be relevant candidates for "interface invariants". But our non-exhaustive analysis of the .NET Collections library did not reveal such a case.
6 Adding Contracts a Posteriori

The discovery of closet contracts in the .NET arrayed list class suggests that we should build a "contracted variant" of this class, ARRAY_LIST, that has the same interface as the original ArrayList plus the elicited contracts. We now present a sketch of this class and compare it with its EiffelBase counterpart: ARRAYED_LIST. Rather than modifying the original class we may produce the contracted variant — here in Eiffel — as a new class whose routines call those of the original. This is the only solution anyway when one doesn't have access to the source code. The Contract Wizard is intended to support such a process, although for this discussion we have produced the result manually.

6.1 A Contracted Form of the .NET Arrayed List Class

The original class, ArrayList, has 57 features (members in the .NET terminology). In presenting the contracted version we limit ourselves to 12 features, discussed in the preceding analysis of the class. The notation

require -- from MY_CLASS
or

ensure -- from MY_CLASS
shows that the assertion clauses that follow (respectively preconditions or postconditions) are inherited from the parent class MY_CLASS. This is the convention applied by EiffelStudio documentation tools when displaying the assertions of a class, for the parts inherited from ancestors. The difference is that in Eiffel assertion clauses are automatically inherited from parents; in .NET there is no such convention, so the .NET documentation has to repeat the same comments and exception conditions for an ancestor class or interface and all its descendants. The feature keyword introduces a "feature clause", which groups a set of features (members) that have a common purpose. For example, the feature clause

feature -- Initialization

lists all the creation procedures of class ARRAY_LIST: make, make_from_capacity and make_from_collection.

indexing
   description: "[
      Implementation of a list using an array, whose size
      is dynamically increased as required.
      ]"

class interface
   ARRAY_LIST

create
   -- Note for non-Eiffelists: This is the list of
   -- creation procedures (constructors) for the
   -- class; the procedures' definitions appear below.
   make, make_from_capacity, make_from_collection

feature -- Initialization

   make
         -- Create empty list with capacity Default_capacity.
      ensure
         empty: count = 0
         default_capacity_set: capacity = Default_capacity
         writable: not is_read_only
         extendible: not is_fixed_size

   make_from_capacity (a_capacity: INTEGER)
         -- Create empty list with capacity a_capacity.
      require
         positive_capacity: a_capacity >= 0
      ensure
         empty: count = 0
         positive_capacity_implies_capacity_set:
            a_capacity > 0 implies capacity = a_capacity
         capacity_is_zero_implies_default_capacity_set:
            a_capacity = 0 implies capacity = Default_capacity
         writable: not is_read_only
         extendible: not is_fixed_size

   make_from_collection (c: ICOLLECTION)
         -- Create list containing elements copied from c
         -- and the corresponding capacity.
      require
         collection_not_void: c /= Void
      ensure
         capacity_set: capacity = c.count
         count_set: count = c.count
         writable: not is_read_only
         extendible: not is_fixed_size

feature -- Access

   capacity: INTEGER
         -- Number of elements the list can store

   count: INTEGER
         -- Number of elements in the list

   Default_capacity: INTEGER is 16
         -- Default list capacity

   index_of (value: ANY): INTEGER
         -- Zero-based index of the first occurrence of value
      ensure -- from ILIST
         not_found_implies_minus_one:
            not contains (value) implies Result = -1
         found_implies_valid_index:
            contains (value) implies Result >= 0 and Result < count
         found_implies_correct_index:
            contains (value) implies item (Result) = value

   item (index: INTEGER): ANY
         -- Entry at index
      require -- from ILIST
         valid_index: index >= 0 and index < count

feature -- Status report

   contains (an_item: ANY): BOOLEAN
         -- Does list contain an_item?

   is_fixed_size: BOOLEAN
         -- Has list a fixed size?

   is_read_only: BOOLEAN
         -- Is list read-only?

feature -- Status setting

   set_capacity (value: like capacity)
         -- Set list capacity to value.
      require
         valid_capacity: value >= count
      ensure
         capacity_set: value > 0 implies capacity = value
         default_capacity_set:
            value = 0 implies capacity = Default_capacity

feature -- Element change

   add (value: ANY): INTEGER
         -- Add value to the end of the list (double list
         -- capacity if the list is full) and return the
         -- index at which value has been added.
      require -- from ILIST
         writable: not is_read_only
         extendible: not is_fixed_size
      ensure -- from ILIST
         value_added: contains (value)
         updated_count: count = old count + 1
         valid_index_returned: Result = count - 1
      ensure then
         capacity_doubled: (old count = old capacity)
            implies (capacity = 2 * (old capacity))

invariant
   positive_capacity: capacity >= 0
   positive_count: count >= 0
   valid_count: count <= capacity

end
6.2 Metrics

Fig. 1 shows measurements of properties of the contracted class ARRAY_LIST, produced by the Metrics Tool of EiffelStudio. The measurements apply to the full class, with all 57 features, not to the abbreviated form shown above. A feature is "immediate" if it is new in the class, as opposed to a feature inherited from a parent (and possibly redefined in the class). The Eiffel metric tool's output uses the following terminology:
− Feature is the general name for class members. Features include attributes ("fields") and routines ("methods"). A routine is a computation (algorithm) applicable to instances of the class; an attribute is stored in memory. If a routine returns a result, it is called a function; otherwise, it is a procedure.
− Eiffel also distinguishes between commands and queries: A command returns no result; a query returns a result. If a query is computed at run time, it is a function; if it is stored in memory, it is an attribute.

We note the following conclusions from these measurements:
− 62% of the routines now have a contract (a precondition or a postcondition, usually both): 33 out of 52.
− The 33 routines with preconditions tend to have more than one precondition clause: 2.5 on the average (82 total).
− The 33 routines with postconditions tend to have more than one postcondition clause: 2 on average (67 total).
Fig. 1. Metrics about the contracted arrayed list class
7 Extending to Other Classes and Interfaces

Equipped with our first results on ArrayList and its contracted Eiffel counterpart ARRAY_LIST, we now perform similar transformations and measurements on a few other classes and interfaces, to probe how uniform the results appear to be across the .NET Collections library. Since the assumptions and techniques are the same, we won't repeat the details but go directly to results and interpretations.

7.1 Interfaces

First, consider the Eiffel deferred classes obtained by contracting .NET interfaces:
− ILIST, ICOLLECTION, IENUMERABLE, from which ARRAY_LIST inherits.
− IENUMERATOR, of which IENUMERABLE is a client.

Table 1 shows some resulting measurements. The statistics highlight three trends:
− Absence of class invariants in these .NET interfaces, as already noted.
− Presence of routine contracts: both preconditions and postconditions. The figures about IENUMERABLE and ICOLLECTION involve too few routines to bring valuable information — only one for class IENUMERABLE and four for ICOLLECTION. The figures about ILIST and IENUMERATOR are more significant in that respect — class ILIST has eleven routines and IENUMERATOR has six. Both classes have at least one half of their routines equipped with contracts.
− Presence of multiple routine contracts: most routines have several preconditions and postconditions.
Table 1. "Contract rate" of some .NET collection interfaces

                               ILIST   ICOLLECTION   IENUMERABLE   IENUMERATOR
Routines                          11             4             1             6
Routines with preconditions        7             1             0             3
Routines with postconditions       7             1             1             3
Number of preconditions           14             7             0             4
Number of postconditions          11             1             2             3
Precondition rate               64 %          25 %           0 %          50 %
Postcondition rate              64 %          25 %         100 %          50 %
Class invariants                   0             0             0             0
The last two points are consistent with the properties observed for class ARRAY_LIST.

7.2 Other Classes: Stack and Queue

To test the generality of our first results on ArrayList, we consider two other classes of the Collections library. We choose Stack and Queue because they:
− Are concrete collection classes.
− Have no relation to ArrayList, except that all three implement the .NET interfaces ICollection and IEnumerable.
− Have a direct counterpart in the EiffelBase library.

Three classes are still only a small sample of the library. Any absolute conclusion would require exhaustive analysis, and hence a larger effort than the present study since the analysis is manual. So we have to be careful with any generalization of the results. We may note, however, that none of the three choices has been influenced by any a priori information or guess about the classes' likelihood of including contracts. The same approach was applied to these classes as to ArrayList and its parents. Fig. 2 shows some of the resulting measurements for classes Stack and Queue. The figures confirm the trends previously identified:
− Preconditions and postconditions are present. Class STACK has a 29% precondition rate (17 routines, of which 5 have preconditions) and a 59% postcondition rate (10 postcondition-equipped out of 17); class QUEUE has 42% and 58%.
Fig. 2. Metrics about the contracted stack and queue classes
− Preconditions and postconditions usually include several assertions. For example, STACK has 16 postcondition assertions for 10 contract-equipped routines.
− Concrete classes have class invariants. For example, all three classes ArrayList, Stack, and Queue have an invariant clause

positive_count: count >= 0
involving one attribute: count. This case-by-case analysis of 3 concrete classes and 4 interfaces of the .NET Collections library (out of 13 concrete classes and 8 interfaces) supports the second answer of the “Closet Contract Conjecture” – that contracts are inherent. We will now explore the benefits and limitations of such an a posteriori addition of contracts.
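The same exception-to-precondition reading used for ArrayList applies to these classes. For instance, the documentation of Stack.Pop specifies an InvalidOperationException on an empty stack, which a client can treat as the implicit precondition count > 0. A small illustrative C# sketch:

using System;
using System.Collections;

public class StackPreconditionDemo {
   public static void Main() {
      Stack s = new Stack();
      s.Push("item");
      // Respecting the implicit precondition: pop only when non-empty.
      if (s.Count > 0)
         Console.WriteLine(s.Pop());
      // Violating it triggers the documented exception.
      try {
         s.Pop();
      } catch (InvalidOperationException e) {
         Console.WriteLine(e.Message);
      }
   }
}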
8 Effect on Library Users

To appreciate the value of the results of the preceding analysis, we should assess their effect on the only constituency that matters in the end: library users – application
developers who take advantage of library classes to build their own systems. This issue is at the core of the Closet Contract Conjecture, since it determines whether we are doing any good at all by uncovering implicit contracts in contract-less libraries. By producing new versions of the library that make the contracts explicit, are we actually helping the users? To answer this question, we may examine the effect of the different styles on the library user (in terms of ease of learning and ease of use) and on the likely quality of the applications they develop. We take arrayed lists as an example and consider three variants:
− The original, non-contracted class ArrayList from the .NET Collections library.
− The contracted version ARRAY_LIST discussed above.
− Finally, the corresponding class in the EiffelBase library, called ARRAYED_LIST, which was built with Design by Contract right from the start, rather than contracted a posteriori, and uses some other design ideas as well.

8.1 Dealing with Abnormal Cases in a Contract-Less Style

The chapter in the .NET documentation devoted to class ArrayList provides a typical example of dealing with arrayed lists in that framework:

using System;
using System.Collections;

public class SamplesArrayList {

   public static void Main() {
      // Creates and initializes a new ArrayList.
      ArrayList myAL = new ArrayList();
      myAL.Add("Hello");
      myAL.Add("World");
      myAL.Add("!");

      // Displays the properties and values of the ArrayList.
      Console.WriteLine("myAL");
      Console.WriteLine("\tCount: {0}", myAL.Count);
      Console.WriteLine("\tCapacity: {0}", myAL.Capacity);
      Console.Write("\tValues:");
      PrintValues(myAL);
   }

   public static void PrintValues(IEnumerable myList) {
      System.Collections.IEnumerator myEnumerator = myList.GetEnumerator();
      while (myEnumerator.MoveNext())
         Console.Write("\t{0}", myEnumerator.Current);
      Console.WriteLine();
   }
}
Running this C# program produces the following output:

myAL
   Count: 3
   Capacity: 16
   Values:  Hello  World  !
One striking point of this example is the absence of any exception handling — not even one if instruction in the class text — although our analysis of class ArrayList (see section 5) has revealed a non-trivial number of implicit contracts. For example, we have seen that the .NET method Add can only work properly if the targeted arrayed list is writable and extendible. But there is no such check in the class text above. This is likely to be on purpose since the property always holds at this point of the method execution: the .NET constructor ensures that the created list is not read-only and does not have a fixed size (see the contracted version of class ArrayList introduced in section 6), which allows calling the method Add on it.

ArrayList myAL = new ArrayList();
/* Implicit check:
   (!myAL.IsFixedSize) && (!myAL.IsReadOnly) */
myAL.Add ("Hello");
myAL.Add ("World");
myAL.Add ("!");
Although harmless in this simple example, such code may become dangerous if made part of a reusable component. As a matter of fact, a novice programmer may overlook such a subtlety and reuse this code to create and add elements to a fixed-size arrayed list, which would cause the program execution to terminate on an unhandled exception of type NotSupportedException. This becomes even clearer if we encapsulate the calls to Add in a separate method FillArrayList that would look like the following:

public void FillArrayList (ArrayList AL) {
   AL.Add ("Hello");
   AL.Add ("World");
   AL.Add ("!");
}
and use FillArrayList in the Main routine:

public static void Main() {
   ArrayList myAL = new ArrayList();
   /* Implicit check:
      (!myAL.IsFixedSize) && (!myAL.IsReadOnly) */
   FillArrayList (myAL);
}
The previous program would work; the following one would not:
public static void Main() {
   ArrayList myAL = new ArrayList();
   // FixedSize returns a fixed-size wrapper around its argument;
   // the result must be kept for the restriction to take effect.
   myAL = ArrayList.FixedSize (myAL);
   // The following call would throw an exception because myAL
   // now denotes a fixed-size arrayed list, to which no element
   // can be added.
   FillArrayList (myAL);
}
Having Design by Contract support would be the right solution here (as discussed in the next sections). But because the .NET Common Language Runtime does not have native knowledge of contracts, .NET users have to rely on other techniques:
− Using a "defensive" style of programming: checking explicitly for the routine requirements even if they can be inferred directly from the previous method statements (relying on the motto: "better check too much than too little"), hence adding redundant checking:

ArrayList myAL = new ArrayList();
if ((!myAL.IsFixedSize) && (!myAL.IsReadOnly))
   FillArrayList (myAL);
with:

public void FillArrayList(ArrayList AL) {
    if ((!AL.IsFixedSize) && (!AL.IsReadOnly)) {
        AL.Add("Hello");
        AL.Add("World");
        AL.Add("!");
    }
}
This style, however, leads to needless complexity by producing duplicate error-checking code. The Design by Contract method goes in the opposite direction by avoiding redundant and needless checking (Non-Redundancy principle, [20] p 343).
− Relying on the exception handling mechanism of the .NET Common Language Runtime (typically, by using try…catch…finally… clauses):

public static void Main() {
    try {
        // Creates and initializes a new ArrayList.
        ArrayList myAL = new ArrayList();
        FillArrayList(myAL);
        // Prints list values.
    }
    catch (NotSupportedException e) {
        Console.WriteLine(e.Message);
    }
}
with:

// May throw a NotSupportedException. (Unlike Java,
// C# has no "throws" clause in method signatures.)
public void FillArrayList(ArrayList AL) {
    AL.Add("Hello");
    AL.Add("World");
    AL.Add("!");
}
− Adding comments in the code to make implicit checks explicit and avoid misleading the library users:

public static void Main() {
    ArrayList myAL = new ArrayList();
    /* Implicit check:
       (!myAL.IsFixedSize) && (!myAL.IsReadOnly) */
    FillArrayList(myAL);
}
with:

/* This method can only be called if AL does not
 * have a fixed size and is not read-only. */
public void FillArrayList(ArrayList AL) {
    AL.Add("Hello");
    AL.Add("World");
    AL.Add("!");
}
Such an approach is efficient in the sense that there is no redundant check, hence no performance penalty, which brings it closer to the ideas of Design by Contract; but it is not enforced at run time, since it just relies on comments. This suggests the next approach: a posteriori contracting of classes.

8.2 Dealing with Abnormal Cases in a Contract-Rich Style

A posteriori addition of contracts to a .NET component is likely to simplify the task of clients: rather than testing for a routine's successful completion, they can just rely on the contracts, yielding a lighter programming style (no redundant checking):

indexing
    description: "[ Typical use of contracted class ARRAY_LIST ]"
class ARRAY_LIST_SAMPLE

create
    make

feature -- Initialization

    make is
            -- Create an arrayed list, fill it with
            -- Hello World!, and print its content.
        local
            my_list: ARRAY_LIST
        do
            create my_list.make
            fill_array_list (my_list)
            print_values (my_list)
        end

feature -- Element change

    fill_array_list (an_array_list: ARRAY_LIST) is
            -- Fill an_array_list with Hello World!.
        require
            an_array_list_not_void: an_array_list /= Void
            is_extendible: not an_array_list.is_fixed_size
            is_writable: not an_array_list.is_read_only
        local
            index: INTEGER
        do
            index := an_array_list.add ("Hello ")
            index := an_array_list.add ("World")
            index := an_array_list.add ("!")
        ensure
            array_list_filled: an_array_list.count = 3
        end

feature -- Output

    print_values (an_array_list: ARRAY_LIST) is
            -- Print content of an_array_list.
        require
            an_array_list_not_void: an_array_list /= Void
        local
            my_enumerator: IENUMERATOR
        do
            from
                my_enumerator := an_array_list.enumerator
            until
                not my_enumerator.move_next
            loop
                print (my_enumerator.current_element)
            end
        end

end
Since we know from the postconditions is_extendible and is_writable of creation procedure make of ARRAY_LIST that the preconditions of fill_array_list will be satisfied at this point of the routine execution, we do not need to add tests before calling the procedure. For readability, or to facilitate debugging — when executing the software with assertion monitoring on — we might want to use an additional check instruction:

create my_list.make
check
    non_void: my_list /= Void
    is_extendible: not my_list.is_fixed_size
    is_writable: not my_list.is_read_only
end
fill_array_list (my_list)
print_values (my_list)
although this is not required. If the creation routine make of class ARRAY_LIST had no such postconditions as is_extendible and is_writable, an explicit if control would have been needed in the client class ARRAY_LIST_SAMPLE to guarantee that the requirements of feature fill_array_list actually hold:

create my_list.make
if not my_list.is_fixed_size and not my_list.is_read_only then
    fill_array_list (my_list)
end
But this is different from the "defensive" programming style used in a contract-less environment, since the test only affects the client side, not both client and supplier; the latter simply has preconditions. We call such a use of routine preconditions the a priori scheme: the client must act beforehand — before calling the routine — and ensure that the contracts are satisfied (either by testing them directly with an if control, or by relying on the postconditions of a previously called routine or on the class invariants). With this approach, any remaining run-time failure signals a design error. Such a design may not always be applicable in practice, for any of three reasons:
− Performance: Testing for a precondition before a routine call may be similar to the task of the routine itself, resulting in an unacceptable performance penalty.
− Lack of expressiveness of the assertion language: The notation for assertions might not be powerful enough.
− Dependency on external events: It is impossible to test for requirements if a routine involves interaction with the outside world, for example with a human user: there is no other choice than attempting to execute it, hence no way to predict abnormal cases.
To address these limitations of the a priori scheme, it is possible to apply an a posteriori scheme — try the operation first and find out how it went — if a failed attempt has no irrecoverable consequences. Performance overhead — the first case above — is not a problem when the test being repeated is checking that a number is positive or a reference is not void. But the inefficiency might be more significant. An example from numerical computation [23] is a matrix equation solver: an equation of the form AX = B, where A is a matrix, and X (the unknown) and B are vectors, has a unique solution of the form X = A⁻¹B only if matrix A is not singular. (A matrix is singular if one of the rows is a linear combination of others.) Applying the a priori scheme would lead the client to write code looking like the following:

if a.is_singular then
    -- Report error.
else
    x := a.inverse (b)
end

using a function inverse with precondition non_singular_matrix:

inverse (b: VECTOR): VECTOR
        -- Solve equation of the form ax = b.
    require
        non_singular_matrix: not is_singular
This code does the job but is inefficient, since determining whether a matrix is singular is essentially the same operation as solving the associated linear equation. Hence the idea of applying the a posteriori scheme; the client code would be of the form:

a.invert (b)
if a.inverted then
    x := a.inverse
else
    -- Process erroneous case.
end
Procedure invert replaces the previous function inverse. A call to this procedure (for which a more accurate name might be attempt_to_invert) sets a boolean attribute inverted to True or False to indicate whether inverting the matrix was possible, and if it was, makes the result available through attribute inverse. (A class invariant may state that inverted = (inverse /= Void).) This technique, which splits any function that may produce errors into a procedure that attempts to perform an operation and two attributes, one reporting whether the
operation was successful and the other giving access to the result of the operation if any, is compliant with the Command-Query Separation principle ([23], p 751). This example highlights one basic engineering principle for dealing with abnormal cases: whenever available, a method for preventing failures from occurring is usually preferable to methods for recovering from failures. The techniques seen so far do not, however, provide a solution in three cases:
− When abnormal events — such as a numerical failure or memory exhaustion — can cause the hardware or the operating system to interrupt the program execution abruptly (which is intolerable for systems with continuous availability requirements).
− When abnormal situations, although not detectable through preconditions, must be diagnosed at the earliest possible time to avoid disastrous consequences — such as destroying the integrity of a database or even endangering human lives, as in an airplane control system. (One must keep in mind that such situations can appear in a contract-rich environment as well, since the support for assertions may not be rich enough to express complex properties.)
− When there is a requirement for software fault tolerance, protecting against the most dramatic consequences of any remaining errors in the software.

8.3 "A Posteriori Contracting" vs. "Contracting from the Start"

We have seen that clients of a .NET library are likely to benefit from an a posteriori addition of contracts: instead of having to test whether a routine successfully ran to completion (with the risk of forgetting to check and getting an exception at run time), they can just rely on the contracts. What about contracting "from the start"? Is the EiffelBase class ARRAYED_LIST more convenient to use than the contracted class ARRAY_LIST? To help answer this question, let's consider a variant of the previous class ARRAY_LIST_SAMPLE, representing a typical client use of arrayed lists, this time using the EiffelBase ARRAYED_LIST:

indexing
    description: "Typical use of EiffelBase ARRAYED_LIST"

class ARRAYED_LIST_SAMPLE

create
    make

feature -- Initialization

    make is
            -- Create a list with two elements and print the
            -- list contents.
        local
            my_list: ARRAYED_LIST [STRING]
        do
            create my_list.make
            my_list.extend ("Hello ")
            my_list.extend ("World")
            my_list.extend ("!")
            from
                my_list.start
            until
                my_list.after
            loop
                io.put_string (my_list.item)
                my_list.forth
            end
        end
end
This example highlights three characteristics of the EiffelBase ARRAYED_LIST:
− A clear separation between commands and queries: the routine extend returns no result (unlike the .NET feature Add, which returns an integer, yielding a useless local variable index in the ARRAY_LIST code example).
− The usefulness of genericity: we know that my_list.item is of type STRING, so we can use a more appropriate I/O feature to print it: put_string, rather than the general print.
− A user-friendly interface to traverse the list through features start, after, item, and forth, relying on an internal cursor stored in class ARRAYED_LIST.
Another interesting property to look at is the ease of switching to other list implementations. As shown in Fig. 3, ARRAYED_LIST inherits from both ARRAY and DYNAMIC_LIST. It suffices to remove the relationship with class ARRAY to obtain a LINKED_LIST. The a-posteriori-contracted class ARRAY_LIST (section 6) just mapped the original .NET hierarchy, which makes extensive use of interfaces to compensate for the lack of multiple inheritance (Fig. 4). Not surprisingly in light of how it was obtained, this class does not fully benefit from the power of Eiffel in matters of design, reusability, and extendibility. Another obvious difference between ARRAY_LIST (the contracted class) and ARRAYED_LIST (the EiffelBase class) is the use of genericity in the Eiffel version. Although this falls beyond the scope of the present discussion, we may note that the lack of genericity in the current .NET object model leads to code duplication as well as to run-time checks (casts) that damage both the performance and the reliability of the software. Future versions of .NET are expected to provide genericity [17] [18]. The next difference is the use of enumerators in ARRAY_LIST, whereas ARRAYED_LIST stores a cursor as an attribute of the class.
− In the first approach, enumerators become irrecoverably invalidated as soon as the corresponding collection changes (addition, modification, or deletion of list elements). This approach, on the other hand, allows multiple concurrent traversals of the same list through multiple enumerators.
− The EiffelBase approach solves the problem of invalid cursors: addition or deletion of list elements changes the cursor position, and queries before and after take care of the cursor position's validity. But the use of internal cursors requires care to avoid endless loops.
Fig. 3. Inheritance hierarchy of the EiffelBase arrayed list class
Fig. 4. Inheritance hierarchy of the .NET arrayed list class
These differences of style should not obscure the fundamental difference between a design style that has contracts from the start and one that adds them to existing contract-less library components. The examples illustrate that the right time to put in the contracts is during design. When that is not possible, for example with a library produced by someone who didn’t trouble himself with contracts, it may still be worthwhile to add them a posteriori, as a way to understand the library better, improve its documentation, and make it safer to use.
9 Automatic Extraction of Closet Contracts

Our analysis of the .NET Collections library has revealed interesting "patterns" in the nature and location of hidden contracts. In particular, routine preconditions tend to be buried under exception conditions. Our goal is to estimate the highest degree of automation we can achieve in extracting closet contracts from .NET libraries. We first report on the technique of dynamic contract detection, and then describe our approach of inferring preconditions from the CIL code [32] of .NET assemblies.
9.1 Dynamic Contract Inference

Dynamic contract inference, working from source code, seeks to deduce assertions from captured variable traces by executing the program with various inputs, relying on a set of candidate assertions to deduce contracts from the execution output. The next step is to determine whether the detected assertions are meaningful and useful to the users, typically by computing a confidence probability. Ernst's Daikon tool discovers class invariants, loop invariants and routine pre- and postconditions. Its first version [11] was limited to finding contracts over scalars and arrays; the next one (Daikon 2) [14] enables contract discovery over collections of data, and computes conditional assertions. Daikon succeeds in finding the assertions of a formally-specified program, and can even find some more, revealing deficiencies in the formal specification. Daikon also succeeds in inferring contracts from a C program, which helps developers perform changes to the C program without introducing errors [12]. It appears that the large majority of reported invariants are correct, and that Daikon extracts more invariants from high-quality programs [11]. Daikon still needs improvement in terms of:
− Performance: "Invariant detection time grows approximately quadratically with the number of variables over which invariants are checked" [12]. Some experiments using incremental processing have shown promising results in improving Daikon's performance [14].
− Relevance of the reported invariants: Daikon still reports irrelevant — meaning useless, but not necessarily incorrect — invariants. Polymorphism can help increase the number of desired invariants reported to the users [14].
− Richness of inferred invariants: Currently, most cover simple properties. Ernst et al. suggest examining the techniques and algorithms used in the research fields of artificial intelligence [12] and information retrieval [13] to improve dynamic inference of invariants for applications in software evolution.
The Daikon detector is not the sole tool available to dynamically extract contracts. Some Java detectors also exist; some of them do not even require the program source code to infer contracts: they can operate directly on bytecode files (*.class).

9.2 Extracting Routine Preconditions from Exception Cases

"Human analysis is sometimes more powerful than either, allowing deep and insightful reasoning that is beyond hope for automation" [16]. When comparing static and dynamic techniques of program analysis, Ernst et al. admit that automatic tools fail in some cases where a human being would succeed. Does extraction of closet contracts fall into the category of processes that cannot be fully automated? If so, can we at least automate part of the effort? The analysis reported in this article has shown some regularity in the form and location of the closet contracts we can find in existing .NET components. In particular, preconditions tend to be buried in exception cases. Since method exception cases are not kept in the assembly metadata, we are currently exploring another approach: inferring routine preconditions from a systematic analysis of the CIL (Common Intermediate Language) code [32] of the .NET assemblies provided as input. More precisely, we are parsing the CIL code of .NET libraries — using Gobo Eiffel Lex and
Gobo Eiffel Yacc [9] — to list the exceptions a method or a property may throw, and to infer the corresponding routine preconditions. The first results are promising.
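To make the inference step concrete, here is a rough sketch in OCaml of the exception-to-precondition mapping (the implementation language is incidental). Every name in it, from cil_exception to candidate_preconditions, is hypothetical: the real analysis works on the full CIL instruction stream, not on a pre-digested summary like this one.

(* Hypothetical sketch: turning exception cases spotted in a parsed
   CIL method body into candidate precondition clauses, following the
   pattern observed in the .NET Collections library. *)
type cil_exception =
  | Not_supported of string        (* e.g. "read-only", "fixed-size" *)
  | Argument_null of string        (* offending parameter name        *)
  | Argument_out_of_range of string

let candidate_precondition = function
  | Not_supported "read-only" -> "not is_read_only"
  | Not_supported "fixed-size" -> "not is_fixed_size"
  | Not_supported reason -> "not (" ^ reason ^ ")"
  | Argument_null p -> p ^ " /= Void"
  | Argument_out_of_range p -> "valid_index (" ^ p ^ ")"

let candidate_preconditions exns =
  List.map candidate_precondition exns

(* For method Add of ArrayList, the parser would report: *)
let () =
  candidate_preconditions
    [Not_supported "read-only"; Not_supported "fixed-size"]
  |> List.iter print_endline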
10 Conclusion

This discussion has examined some evidence from the .NET libraries relevant to our basic conjecture: do existing libraries designed without a clear notion of contract contain some "contracts" anyway? This analysis provides initial support for the conjecture. The contracts are there, expressed in other forms. Preconditions find their way into exceptions; postconditions and class invariants into remarks scattered across the documentation, hence more difficult to extract automatically. The analysis reported here provides a first step in a broader research plan, which we expect to expand in the following directions:
− Applying the same approach to other .NET and non-.NET libraries, such as the C++ STL (a first informal look at [26] suggests that there are contracts lurking there too).
− Investigating more closely the patterns that help discover each type of contract — class invariants, routine preconditions and postconditions — to facilitate the work of programmers interested in adding contracts a posteriori to existing libraries, with a view to providing an interactive tool that would support this process.
− Turning the Eiffel Contract Wizard into a Web service to allow any programmer to contribute contracts to .NET components.
This area of research opens up the possibility of various generalizations of this work in a broad investigation of applications of Design by Contract. (We are looking forward to seeing the evolution of the project conducted by Kevin McFarlane, aiming at providing a Design by Contract framework for use in .NET projects [19], of a current project at Microsoft Research about adding contracts to the C# language — see the "Assertions" section of [30] — and also of the new eXtensible C#© [35]; the outcomes of these projects are likely to influence our research direction.)
Acknowledgements This paper takes advantage of extremely valuable comments and insights from Éric Bezault (Axa Rosenberg), Michael D. Ernst (MIT), Tony Hoare (Microsoft) and Emmanuel Stapf (Eiffel Software). Opinions expressed are of course our own. References [3] to [6] are previous versions of this work. Reference [7] is a summary version.
References

1. Karine Arnout and Raphaël Simon. "The .NET Contract Wizard: Adding Design by Contract to languages other than Eiffel". TOOLS 39 (39th International Conference and Exhibition on Technology of Object-Oriented Languages and Systems). IEEE Computer Society, July 2001, p 14-23.
2. Karine Arnout. "Eiffel for .NET: An Introduction". Component Developer Magazine, September-October 2002. Available from http://www.devx.com/codemag/Article/8500. Accessed October 2002.
3. Karine Arnout and Bertrand Meyer. "Extracting implicit contracts from .NET components". Microsoft Research Summer Workshop 2002, Cambridge, UK, 9-11 September 2002. Available from http://se.inf.ethz.ch/publications/arnout/workshops/microsoft_summer_research_workshop_2002/contract_extraction.pdf. Accessed September 2002.
4. Karine Arnout. "Extracting Implicit Contracts from .NET Libraries". 4th European GCSE Young Researchers Workshop 2002, in conjunction with NET.OBJECT DAYS 2002. Erfurt, Germany, 7-10 October 2002. IESE-Report No. 053.02/E, 21 October 2002, p 20-24. Available from http://www.cs.uni-essen.de/dawis/conferences/Node_YRW2002/papers/karine_arnout_gcse_final_copy.pdf. Accessed October 2002.
5. Karine Arnout. "Extracting Implicit Contracts from .NET Libraries". OOPSLA 2002 (17th ACM Conference on Object-Oriented Programming, Systems, Languages, and Applications), Posters. Seattle, USA, 4-8 November 2002. OOPSLA'02 Companion, ACM, p 104-105.
6. Karine Arnout and Bertrand Meyer. "Contrats cachés en .NET: Mise au jour et ajout de contrats a posteriori". LMO 2003 (Langages et Modèles à Objets). Vannes, France, 3-5 February 2003.
7. Karine Arnout and Bertrand Meyer. "Spotting hidden contracts: the .NET example". Submitted for publication.
8. Mike Barnett and Wolfram Schulte. "Contracts, Components, and their Runtime Verification on the .NET Platform". Microsoft Research Technical Report TR 2002-38, April 2002. Available from ftp://ftp.research.microsoft.com/pub/tr/tr-2002-38.pdf. Accessed April 2002.
9. Éric Bezault. Gobo Eiffel Lex and Gobo Eiffel Yacc. Retrieved September 2002 from http://www.gobosoft.com.
10. Eiffel Software Inc. EiffelBase. Retrieved October 2002 from http://docs.eiffel.com/libraries/base/index.html.
11. Michael D. Ernst. "Dynamically Detecting Likely Program Invariants". Ph.D. dissertation, University of Washington, 2000. Available from http://pag.lcs.mit.edu/~mernst/pubs/invariants-thesis.pdf. Accessed August 2002.
12. Michael D. Ernst, Jake Cockrell, William G. Griswold, and David Notkin. "Dynamically Discovering Likely Program Invariants to Support Program Evolution". IEEE TSE (Transactions on Software Engineering), Vol. 27, No. 2, February 2001, p 1-25. Available from http://pag.lcs.mit.edu/~mernst/pubs/invariants-tse.pdf. Accessed August 2002.
13. Michael D. Ernst, Adam Czeisler, William G. Griswold, and David Notkin. "Quickly Detecting Relevant Program Invariants". ICSE 2000 (International Conference on Software Engineering), Limerick, Ireland, 4-11 June 2000. Available from http://pag.lcs.mit.edu/~mernst/pubs/invariants-icse2000.pdf. Accessed August 2002.
14. Michael D. Ernst, William G. Griswold, Yoshio Kataoka, and David Notkin. "Dynamically Discovering Program Invariants Involving Collections". Technical Report, University of Washington, 2000. Available from http://pag.lcs.mit.edu/~mernst/pubs/invariants-pointers.pdf. Accessed August 2002.
15. C.A.R. Hoare. "Proof of Correctness of Data Representations". Acta Informatica, Vol. 1, 1973, p 271-281.
16. Yoshio Kataoka, Michael D. Ernst, William G. Griswold, and David Notkin. "Automated Support for Program Refactoring using Invariants". ICSM 2001 (International Conference on Software Maintenance), Florence, Italy, 6-10 November 2001. Available from http://pag.lcs.mit.edu/~mernst/pubs/invariants-refactor.pdf. Accessed August 2002.
17. Andrew Kennedy and Don Syme. "Design and Implementation of Generics for the .NET Common Language Runtime". PLDI 2001 (Conference on Programming Language Design and Implementation). Snowbird, Utah, USA, 20-22 June 2001. Available from http://research.microsoft.com/projects/clrgen/generics.pdf. Accessed September 2002.
18. Andrew Kennedy and Don Syme. Generics for C# and .NET CLR, September 2002. Retrieved September 2002 from http://research.microsoft.com/projects/clrgen/.
19. Kevin McFarlane. Design by Contract Framework for .NET. February 2002. Retrieved October 2002 from http://www.codeproject.com/csharp/designbycontract.asp and http://www.codeguru.com/net_general/designbycontract.html.
20. Bertrand Meyer: Object-Oriented Software Construction (1st edition). Prentice Hall International, 1988.
21. Bertrand Meyer. "Applying 'Design by Contract'". Technical Report TR-EI-12/CO, Interactive Software Engineering Inc., 1986. Published in IEEE Computer, Vol. 25, No. 10, October 1992, p 40-51. Also published as "Design by Contract" in Advances in Object-Oriented Software Engineering, eds. D. Mandrioli and B. Meyer, Prentice Hall, 1991, p 1-50. Available from http://www.inf.ethz.ch/personal/meyer/publications/computer/contract.pdf. Accessed April 2002.
22. Bertrand Meyer: Reusable Software: The Base Object-Oriented Component Libraries. Prentice Hall, 1994.
23. Bertrand Meyer: Object-Oriented Software Construction, second edition. Prentice Hall, 1997.
24. Bertrand Meyer, Raphaël Simon, and Emmanuel Stapf: Instant .NET. Prentice Hall (in preparation).
25. Bertrand Meyer: Design by Contract. Prentice Hall (in preparation).
26. Scott Meyers: Effective STL. Addison Wesley, July 2001.
27. Microsoft. .NET Collections library. Retrieved June 2002 from http://msdn.microsoft.com/library/default.asp?url=/library/en-us/cpref/html/frlrfsystemcollections.asp.
28. Microsoft. .NET ArrayList class. Retrieved June 2002 from http://msdn.microsoft.com/library/default.asp?url=/library/en-us/cpref/html/frlrfsystemcollectionsarraylistclasstopic.asp.
29. Microsoft. .NET ICollection interface. Retrieved October 2002 from http://msdn.microsoft.com/library/default.asp?url=/library/en-us/cpref/html/frlrfsystemcollectionsicollectionclasstopic.asp.
30. Microsoft Research. Current research, Programming Principles and Tools. Retrieved November 2002 from http://research.microsoft.com/research/ppt/.
31. Richard Mitchell and Jim McKim: Design by Contract, by Example. Addison-Wesley, 2002.
32. .NET Experts. ECMA TC39 TG2 and TG3 working documents. Retrieved September 2002 from http://www.dotnetexperts.com/ecma/index.html.
33. Jeremy W. Nimmer and Michael D. Ernst. "Invariant Inference for Static Checking: An Empirical Evaluation". FSE '02 (10th International Symposium on the Foundations of Software Engineering). Charleston, SC, USA, November 20-22, 2002. Proceedings of the ACM SIGSOFT. Available from http://pag.lcs.mit.edu/~mernst/pubs/esc-annotate.pdf. Accessed October 2002.
34. Jeremy W. Nimmer and Michael D. Ernst. "Automatic generation of program specifications". ISSTA 2002 (International Symposium on Software Testing and Analysis). Rome, Italy, 22-24 July 2002. Available from http://pag.lcs.mit.edu/~mernst/pubs/invariants-specs.pdf. Accessed October 2002.
35. ResolveCorp. eXtensible C#© is here! Retrieved May 2003 from http://www.resolvecorp.com/products.htm.
36. Raphaël Simon, Emmanuel Stapf, and Bertrand Meyer. "Full Eiffel on .NET". MSDN, July 2002. Available from http://msdn.microsoft.com/library/default.asp?url=/library/en-us/dndotnet/html/pdc_eiffel.asp. Accessed October 2002.
37. Software Engineering Institute. "Volume II: Technical Concepts of Component-Based Software Engineering". CMU/SEI-2000-TR-008, 2000. Available from http://www.sei.cmu.edu/publications/documents/00.reports/00tr008.html. Accessed June 2002.
38. Dave Thomas. "The Deplorable State of Class Libraries". Journal of Object Technology (JOT), Vol. 1, No. 1, May-June 2002. Available from http://www.jot.fm/issues/issue_2002_05/column2. Accessed June 2002.
From Co-algebraic Specifications to Implementation: The Mihda Toolkit

Gianluigi Ferrari, Ugo Montanari, Roberto Raggi, and Emilio Tuosto

Dipartimento di Informatica, Università di Pisa, Italy.
{giangi,ugo,raggi,etuosto}@di.unipi.it
Abstract. This paper describes the architecture of a toolkit, called Mihda, providing facilities to minimize labelled transition systems for name passing calculi. The structure of the toolkit is derived from the co-algebraic formulation of the partition-refinement minimization algorithm for HD-automata. HD-automata have been specifically designed to allocate and garbage collect names and they provide faithful finite state representations of the behaviours of π-calculus processes. The direct correspondence between the coalgebraic specification and the implementation structure facilitates the proof of correctness of the implementation. We evaluate the usefulness of Mihda in practice by performing finite state verification of π-calculus specifications.
1 Introduction
Finite state automata (e.g. labelled transition systems) provide a foundational model underlying effective verification techniques for concurrent and distributed systems. From a theoretical point of view, many behavioural properties of concurrent and distributed systems can be naturally defined directly as properties over automata. From a practical point of view, efficient algorithms and verification techniques have been developed and widely applied in practice to case studies of substantial complexity in several areas of computing, such as hardware, compilers, and communication protocols. We refer to [2] for a review. A fundamental property of automata is the possibility, given an automaton, of constructing its canonical form: the minimal automaton. The theoretical foundations guarantee that the minimal automaton is indistinguishable from the original one with respect to many behavioural properties (e.g., bisimilarity of automata and behavioural properties expressed in suitable modal or temporal logics). Minimal automata are also very important in practice. For instance, the problem of deciding bisimilarity is reduced to the problem of computing the minimal transition system [8]. Moreover, it is often convenient, from a computational point of view, to verify properties on the minimal automaton rather than on the original one. Indeed, minimization algorithms can be used to attack state explosion: they yield a small state space, but still retain all the relevant information for the verification.
This work has been supported by EU-FET project PROFUNDIS IST-2001-33100 and by MIUR project NAPOLI.
Global computing systems consist of networks of stationary and mobile components. The primary features of global computing systems are that components are autonomous, software versioning is highly dynamic, network coverage is variable, components often reside over the nodes of the network (WEB services), and membership is dynamic and often ad hoc, without a centralized authority. Global computing systems must be made very robust, since they are intended to operate in potentially hostile environments. Moreover, they are hard to construct correctly and very difficult to test in a controlled way. Although significant progress has been made in providing formal models and effective verification techniques to support verification of global computing systems, current software engineering technologies provide limited solutions to some of the issues discussed above. The problem of formal verification of global computing systems still requires considerable research and dissemination efforts.
History Dependent automata (HD-automata for short) have been proposed in [14, 11, 4] as a new effective model for name passing calculi. Name passing calculi (e.g. the π-calculus [10, 9, 16]) are basically the best known and probably the most acknowledged models of mobility. Moreover, they provide a rich set of techniques for reasoning about mobile systems. Similarly to ordinary automata, HD-automata are made out of states and labelled transitions; their peculiarity resides in the fact that states and transitions are equipped with names, which are no longer dealt with as syntactic components of labels, but become an explicit part of the operational model. This allows one to model explicitly name creation/deallocation and name extrusion: these are the distinguishing mechanisms of name passing calculi.
HD-automata have been abstractly understood as automata over a permutation model, whose ingredients are a set of names and an action of its group of permutations (renaming substitutions) on an abstract set. This framework is sufficient to describe and reason about formalisms with name-binding operations. It has been incorporated into various kinds of transition systems that aim at providing syntax-free models of name-passing calculi [5, 6, 12, 15].
It is important to emphasize that names of states of HD-automata have local meaning. For instance, assume that A(x, y, z) denotes an agent having three (free) names x, y and z. Then agent A(y, x, z), obtained through the transformation which swaps names x and y, is syntactically different from A(x, y, z). However, these two agents can be semantically represented by means of a single state q of a HD-automaton, simply by considering a "swapping" operation on the local names corresponding to names x and y. More generally, states that differ only by a renaming of their local names are identified in HD-automata. This property allows for a very compact representation of name passing calculi.
The local meaning of names requires a mechanism for describing how names correspond to each other along state transitions. Graphically, we can represent such correspondences using "wires" that connect names of labels, source and target states of transitions. For instance, Figure 1 depicts a transition from source state s to destination state d. The transition exposes two names: name 2 of s and a fresh name 0. State s has three names, 1, 2 and 3, while state d has two names,
4 and 5, which correspond to the old name 1 of s and to the fresh name 0, respectively. Notice that name 3 is discarded along such a state transition.

[Fig. 1. A HD-automaton transition: a transition from s (names 1, 2, 3) to d (names 4, 5) whose label exposes name 2 of s and the fresh name 0; the correspondence σ maps name 4 of d to name 1 of s and name 5 of d to the fresh name 0.]

HD-automata have a natural representation as coalgebras on a category of named sets and named functions. Elements of named sets are equipped with names which are defined up to groups of name permutations called symmetries [12]. General results concerning coalgebras guarantee the existence of the minimal HD-automaton up to bisimilarity. In [4] two of the authors describe a declarative coalgebraic procedure to perform minimization of finite state HD-automata.
In this paper, we review the coalgebraic description of the minimization algorithm for HD-automata, and we illustrate its prototype implementation. This yields a toolkit, called Mihda, providing general facilities to minimize labelled transition systems for name passing calculi. The usefulness of the Mihda toolkit will be shown by performing finite state verification of π-calculus specifications. The minimization algorithm has been specified by exploiting a type-theoretic notation rather than standard coalgebraic notations. The implementation data structures have been obtained by refining the type-theoretic formulation with efficiency considerations. The direct correspondence between the semantical structures and the implementation structures facilitates the design and the implementation of the toolkit. Moreover, it provides the formal machinery to perform the proof of correctness of the implementation.
Recently, several software engineering technologies have been introduced to support a programming paradigm where the web is exploited as a service distributor. By service we do not mean a monolithic web server but rather a component available over the web that others might use to develop other services. Conceptually, web services are stand-alone components that reside over the nodes of the network. Each web service has an interface which is network accessible through standard network protocols and describes the interaction capabilities of the service. The Mihda toolkit has been designed and made available as a WEB service. By a few clicks in a browser at the URL http://jordie.di.unipi.it:8080/pweb/ the Mihda toolkit can be accessed remotely and its facilities can be evaluated directly over the WEB.
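The identification of states up to permutations of local names can be illustrated with a small OCaml toy of ours (not part of Mihda): local names are represented as integers and a renaming as an association list, so that A(y, x, z) and A(x, y, z) coincide up to the swap of names 1 and 2.

(* Toy illustration: states identified up to name permutations. *)
type state = { ctor : string; args : int list }   (* local name indices *)

let apply perm q =
  { q with
    args = List.map (fun v -> try List.assoc v perm with Not_found -> v)
             q.args }

let equivalent q1 q2 perms =
  List.exists (fun p -> apply p q1 = q2) perms

let () =
  let a_xyz = { ctor = "A"; args = [1; 2; 3] } in
  let a_yxz = { ctor = "A"; args = [2; 1; 3] } in
  let swap12 = [(1, 2); (2, 1)] in
  assert (equivalent a_xyz a_yxz [swap12])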
2 Preliminaries
This section sketches the main concepts on the coalgebraic representation of automata as a basis for finite state verification by semantic minimization. We
illustrate the approach by providing the coalgebraic specification of the minimization algorithm for ordinary labelled transition systems. Hereafter, we will use the terms 'automaton' and 'labelled transition system' interchangeably.
An automaton A is a triple (S, L, →) where S is the set of states, L is the set of actions or labels, and → ⊆ S × L × S is the transition relation. Usually, one writes s −ℓ→ d to indicate (s, ℓ, d) ∈ →; s is the source state and d is the destination or target state.
Let idA denote the identity function on set A and f ; g the composition of functions f and g (when it is defined). An endo-functor F (over the category Set) is a mapping from sets to sets and from functions to functions that preserves identity functions and function composition. Figure 2 gives a graphical representation of how a functor acts on sets and functions. If A is mapped on F(A) then idA is associated to idF(A) and, if f and g can be composed, then F(f) and F(g) can be composed as well. Moreover, the image through F of the function composition f ; g is obtained by composing the images of functions f and g:

    F(idA) = idF(A)        F(f ; g) = F(f) ; F(g)

[Fig. 2. Functor over Set: F maps the sets A, B, C to F(A), F(B), F(C) and the functions f : A → B, g : B → C to F(f), F(g), preserving identities and composition.]

Definition 1 (F-coalgebra). Let F be an endo-functor on the category Set. An F-coalgebra consists of a pair (A, α) such that α : A → F(A).

The duality between F-coalgebras and F-algebras (a function F(A) → A) consists in the fact that domain and codomain are "reversed", namely, they are arrows between the same objects but with opposite directions. The different directions can be interpreted as "construction" (induction) and "observation" (coinduction). The interested reader is referred to [7, 1].
Before specifying the coalgebraic description of the minimization algorithm we introduce some notation.
– Expression Q : Set denotes a set, and q : Q is a synonym of q ∈ Q;
– Fun is the collection of functions among sets (the arrows of the category Set). The function space over sets has the following structure: Fun = {H | H = ⟨S : Set, D : Set, h : S −→ D⟩}.
– h : A −bij→ B (h : A −inj→ B) explicitly states that function h is bijective (injective).
We shall use SH, DH and hH to denote the domain, codomain and mapping of an element of Fun, respectively. A similar convention will be used throughout the paper to denote the components of tuples. Let H, K ∈ Fun be two functions; then the composition of H and K (written H; K) is defined provided that SK = DH, and it is the function such that SH;K = SH, DH;K = DK, and hH;K = hK ∘ hH. Sometimes we shall need to work with surjective functions. Hence we let Ĥ be the function given by SĤ = SH, DĤ = {q′ : DH | ∃q : SH . hH(q) = q′} and hĤ = hH.
Finite-state transition systems have been coalgebraically described by employing two ingredients: a set Q, which represents the state space, together with a function K : Q −→ ℘fin(L × Q) giving the transition relation; K(q) is the set of
pairs (ℓ, q′) such that q −ℓ→ q′.
In this paper, we shall work on a more concrete representation. In particular, we introduce a mathematical structure, called a bundle, whose rôle is to provide a declarative specification of the concrete data structure storing all the transitions out of a given state. Indeed, each bundle details which states are reachable by performing certain actions.

Definition 2 (Bundles). Let L be the set of labels; then a bundle β over L is a structure ⟨D : Set, Step : ℘fin(L × D)⟩. Set D is the support of β. Given a fixed set of labels L, B_L denotes the collection of bundles, and β : B_L indicates that β is a bundle over L.

We now introduce the functor A over the universe of sets and functions. The following clauses define A:
– A(Q) = {β : B_L | Dβ = Q}, for each Q : Set;
– For each H : Fun, A(H) is defined as follows:
  • S_A(H) = A(SH) and D_A(H) = A(DH);
  • h_A(H) : β ↦ ⟨DH, {⟨ℓ, hH(q)⟩ | ⟨ℓ, q⟩ : Stepβ}⟩.

Definition 3 (Transition systems as coalgebras). Let L be a set of labels. Then a labelled transition system over L is a coalgebra for the functor A, namely it is a function K such that DK = A(SK).

Example 1. A coalgebra K for the functor A represents a transition system where SK is the set of states, and hK(q) = β, with Dβ = SK. Let us consider a finite-state automaton and its coalgebraic formulation via the mapping hK.

[diagram: states 0, ..., 5 with transitions 0 −a→ 1, 0 −b→ 3, 1 −a→ 2, 1 −b→ 3, 1 −b→ 4, 2 −a→ 1, 2 −b→ 4, 3 −c→ 5, 4 −c→ 5]

hK(0) = ⟨SK, {⟨a, 1⟩, ⟨b, 3⟩}⟩
hK(1) = ⟨SK, {⟨a, 2⟩, ⟨b, 3⟩, ⟨b, 4⟩}⟩
hK(2) = ⟨SK, {⟨a, 1⟩, ⟨b, 4⟩}⟩
hK(3) = ⟨SK, {⟨c, 5⟩}⟩
hK(4) = ⟨SK, {⟨c, 5⟩}⟩
hK(5) = ⟨SK, ∅⟩
Note how, for each state q ∈ {0, ..., 5}, hK(q) yields all the immediate successor states of q and the corresponding labels. In other words, ⟨ℓ, q′⟩ ∈ Step_hK(q) if, and only if, q −ℓ→ q′.
General results on coalgebras ensure the existence of the final coalgebra for a large class of functors. These results apply to our formulation of labelled transition systems. In particular, it is interesting to see the result of the iteration along the terminal sequence of the functor A. Let K be a transition system, and let H0, H1, ..., Hi+1, ... be the sequence of functions computed by Hi+1 = K; A(Hi), where H0 is the unique function from SK to the one-element set {∗}, given by SH0 = SK; DH0 = {∗}; and hH0(q : SH0) = ∗. Finiteness of ℘fin ensures convergence of the iteration along the terminal sequence. We can say much more if the transition system is finite state. Indeed, if K is a finite-state transition system, then
– the iteration along the terminal sequence converges in a finite number of steps, i.e. DHn+1 ≡ DHn (for some natural number n),
– the isomorphism mapping F : DHn −→ DHn+1 yields the minimal realization of the transition system K.
Comparing the co-algebraic construction with the standard algorithm [8, 3] which constructs the minimal labelled transition system, we can observe:
– at each iteration i the elements of DHi are the blocks of the minimization algorithm (i.e. the i-th partition). Notice that the initial approximation DH0 contains a single block: in fact H0 maps all the states of the transition system into {∗};
– at each step the algorithm creates a new partition by identifying the splitters for states q and q′. This corresponds in our co-algebraic setting to the fact that Hi(q) = Hi(q′) but Hi+1(q) ≠ Hi+1(q′);
– the iteration proceeds until a stable partition of blocks is reached: then the iteration along the terminal sequence converges.
We now apply the iteration along the terminal sequence to the coalgebraic formulation of the transition system of Example 1. The initial approximation is the function H0 defined as H0 = ⟨SH0 = SK, DH0 = {∗}, hH0(q) = ∗⟩, and the first approximation H1 is the map hH1 : q ↦ ⟨DH0, {⟨ℓ, hH0(q′)⟩ : q −ℓ→ q′}⟩; for instance, for state 1 it is obtained by A(H0)(⟨SK, {⟨a, 2⟩, ⟨b, 3⟩, ⟨b, 4⟩}⟩) = ⟨{∗}, {⟨a, ∗⟩, ⟨b, ∗⟩}⟩.
We obtain the function hH1 and the destination DH1 = {β1, β2, β3} as detailed below.

hH1(0) = ⟨{∗}, {⟨a, ∗⟩, ⟨b, ∗⟩}⟩          β1 = ⟨{∗}, {⟨a, ∗⟩, ⟨b, ∗⟩}⟩
hH1(1) = ⟨{∗}, {⟨a, ∗⟩, ⟨b, ∗⟩}⟩          β2 = ⟨{∗}, {⟨c, ∗⟩}⟩
hH1(2) = ⟨{∗}, {⟨a, ∗⟩, ⟨b, ∗⟩}⟩          β3 = ⟨{∗}, ∅⟩
hH1(3) = ⟨{∗}, {⟨c, ∗⟩}⟩
hH1(4) = ⟨{∗}, {⟨c, ∗⟩}⟩
hH1(5) = ⟨{∗}, ∅⟩
A further iteration yields:

hH2(0) = ⟨DH1, {⟨a, β1⟩, ⟨b, β2⟩}⟩
hH2(1) = ⟨DH1, {⟨a, β1⟩, ⟨b, β2⟩}⟩
hH2(2) = ⟨DH1, {⟨a, β1⟩, ⟨b, β2⟩}⟩
hH2(3) = ⟨DH1, {⟨c, β3⟩}⟩
hH2(4) = ⟨DH1, {⟨c, β3⟩}⟩
hH2(5) = ⟨DH1, ∅⟩
Since DH2 ≡ DH1 the iterative construction converges, thus providing the minimal labelled transition system depicted as
•1 (with an a-loop) −b→ •2 −c→ •3
where •1 = {0, 1, 2}, •2 = {3, 4} and •3 = {5}.
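The iteration along the terminal sequence has a direct operational reading. The following OCaml sketch is ours (Mihda's actual data structures are introduced in Section 4): it encodes the automaton of Example 1 as a map from states to bundles of (label, target) pairs, refines the partition by computing Hi+1 = K; A(Hi) until it is stable, and prints the blocks {0, 1, 2}, {3, 4} and {5} found above.

(* The LTS of Example 1 as a coalgebra: each state is mapped to its
   bundle, i.e. the list of (label, target) pairs. *)
let k = function
  | 0 -> [('a', 1); ('b', 3)]
  | 1 -> [('a', 2); ('b', 3); ('b', 4)]
  | 2 -> [('a', 1); ('b', 4)]
  | 3 -> [('c', 5)]
  | 4 -> [('c', 5)]
  | _ -> []

let states = [0; 1; 2; 3; 4; 5]

(* One refinement step h_{i+1} = K; A(h_i): the observable of q is
   the set of (label, block of target) pairs. *)
let step h q =
  List.sort_uniq compare (List.map (fun (l, q') -> (l, h q')) (k q))

(* Iterate until the induced partition is stable. *)
let rec iterate h =
  let h' q =
    let obs = step h q in
    let blocks = List.sort_uniq compare (List.map (step h) states) in
    let rec index i = function
      | b :: rest -> if b = obs then i else index (i + 1) rest
      | [] -> assert false
    in
    index 0 blocks
  in
  let same_partition f g =
    List.for_all (fun q1 -> List.for_all (fun q2 ->
      (f q1 = f q2) = (g q1 = g q2)) states) states
  in
  if same_partition h h' then h' else iterate h'

let () =
  let h = iterate (fun _ -> 0) in      (* H0: one block for everything *)
  List.iter
    (fun q -> Printf.printf "state %d -> block %d\n" q (h q))
    states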
3 HD-Automata for π-Agents
This section outlines the representation of HD-automata as coalgebras over the concrete permutation algebra of named sets. Let N be an infinite countable set of names, ranged over by v, and let N⋆ be the set N ∪ {∗}, where ∗ ∉ N is a distinguished name. The distinguished name ∗ will be used to model name creation. We also assume a total order < on N⋆ (for instance, < can be the lexicographic order on N with ∀v ∈ N : ∗ < v). Table 1 displays the definitions of named sets, named functions, and composition of named functions. In Table 1, the general product ∏ is employed (as usual in type theory) to type functions f such that the type of f(q) depends on q.
Intuitively, a named set represents a set of states equipped with a mechanism to give local meaning to the names occurring in each state. In particular, function | | yields the number of local names of states. Moreover, the permutation group GA(q) allows one to describe directly the renamings that do not affect the behaviour of q, i.e., the symmetries on the local names of q. For technical reasons, we assume that states are totally ordered. By convention we write {q : QA}A to indicate the set {v1, ..., v|q|A} and we use NSet to denote the universe of named sets.
As in the case of standard transition systems, named functions are used to determine the possible transitions of a given state.
Table 1. Named sets, named functions and composition of named functions

Named set. A named set A is a structure

    A = ⟨Q : Set, | | : Q −→ ω, ≤ : ℘(Q × Q), G : ∏_{q∈Q} ℘({v1..v|q|} −bij→ {v1..v|q|})⟩

where ∀q : QA, GA(q) is a permutation group and ≤A is a total ordering.

Named function. A named function H is a structure

    H = ⟨S : NSet, D : NSet, h : QS −→ QD, Σ : QS −→ ℘({h(q)}D −inj→ {q}S)⟩

where ∀q : QSH, ∀σ : ΣH(q),
1. GDH(hH(q)); σ = ΣH(q), and
2. σ; GSH(q) ⊆ ΣH(q).

Composition of named functions. Named functions can be composed in the obvious way. Let H and K be named functions. Then H; K is defined only if DH = SK, and

    SH;K = SH,    DH;K = DK,    hH;K : QSH −→ QDK = hH; hK,    ΣH;K(q : QSH) = ΣK(hH(q)); ΣH(q)

Ĥ denotes the surjective component of H: let H be a named function, then
– SĤ = SH and QDĤ = {q′ : QDH | ∃q : QSH . hH(q) = q′},
– |q|DĤ = |q|DH, GDĤ(q) = GDH(q), hĤ(q) = hH(q) and ΣĤ(q) = ΣH(q).
From Co-algebraic Specifications to Implementation: The Mihda Toolkit
327
Table 2. Bundles: the π-calculus case Bundles A bundle β consists of the structure β = D : NSet, Step : ℘(qd D) where qd D is the set of quadruples of the form , π, σ, q given by inj
qd D = { : Lπ , π : N −→ {v1 ..}, σ :
inj
{q}D → Q, q : QD }.
∈Lπ
and Q =
N if ∈ {BOU T, BIN } N if ∈ {BOU T, BIN }
under the constraint that GDβ (q); Sq = Sq , where Sq = {, π, σ, q ∈ Stepβ } and ρ; , π, σ, q = , π, ρ; σ, q.
Bundle names Let β be a bundle. Function {| |} : B → N , mapping each bundle to the set of its names, is defined by {| β |} =
rng(π) ∪ rng(σ) \ {∗}
,π,σ,q∈Stepβ
where rng yields the range of functions. We only consider bundles β such that {| β |} is finite and we let β to indicate the number of names which occur in the bundle β (i.e. β = |{| β |}|).
3.1
Bundles over π-Calculus Actions
To represent the minimization algorithm for the early semantics of π-calculus [10], the notion of bundle must be enriched. Labels of transitions must distinguish among the different meanings of names occurring in π-calculus actions, namely synchronization, bound/free output and bound/free input. The set of π-calculus labels Lπ is {T AU, BOU T, OU T, BIN, IN }. We specify two different labels for input actions: Label BIN is used when the input transition exposes a fresh name, while label IN handles the case of an input transition that exposes a name of the source state of the transition. Labels in Lπ have weights. The weight map | | : Lπ → {∅, {1}, {1, 2}} is defined as: |T AU | = ∅
|BOU T | = |BIN | = {1}
|OU T | = |IN | = {1, 2}
and associates the set of indexes of distinct names the label refers to. A bundle on π-labels is defined as in Table 2. Table 2 illustrates definitions of bundles and names of bundles. As it is the case for ordinary automata, the Step component of a bundle specifies the data structure that contains the set of successor states for a given source state. More precisely, if , π, σ, q ∈ qd D,
328
G. Ferrari et al.
then q is the destination state; is the label of the transition; π associates to the label the names observed in the transition; and σ states how names in the destination state are related to the names in the source state. According to the definition of σ in Table 2, a name in a destination state of a quadruple is mapped on the distinguished name ∗ only on transitions where a new name is created (i.e. transitions labelled by BOU T or BIN ). In order to exploit named functions for representing HD-automata it is necessary to equip the set of bundles B with a named set structure. In other words we must define – a total order on bundles, – a function that maps a bundle to its number of names, – a group of permutations over those names. The names of a bundle are the names (different from ∗) that appear either in the labels or in the range of σ’s of the quadruples of the bundle. Without loss a generality, we can assume that a total order on states and labels exist. Hence, quadruples are totally ordered1 . The order over quadruples yields an ordering over bundles. bij The group of β : B Lπ is the set of permutations θ : {| β |} −→ {| β |} such that β; θ = β where β; θ is defined as Dβ , {, π; θ, σ; θ, q | , π, σ, q : β}. 3.2
Normalizing Bundles
In the minimization algorithm two states belong to the same block (partition) whenever they have the “same” bundles. Hence, the most crucial construction on bundles is the normalization operation. This operation is necessary for two different reasons. The first reason is that there are different equivalent ways for picking up the step components (i.e. quadruples , π, σ, q) of a bundle. The second (more important) reason consists of removing from the step component of a bundle all the redundant input transitions. Indeed, redundant transitions occur when a HD-automaton is built from a π-calculus agent. During this phase, it is not possible to decide which free input transitions are required, and which transitions are irredundant2 . The solution to this problem consists of adding a superset of the required free input transitions and to exploit a reduction function to remove the unnecessary ones during the minimization phase. Consider for instance the case of a state q having only one name v1 and assume that the following two tuples appear in a bundle: IN, xy, {v1 → y}, q
and
BIN, x, {v1 → ∗}, q.
Then, the IN transition is redundant if y is not active in q as it expresses exactly the same behaviour of the second tuple, except that a “free” input transition 1 2
For instance, we can assume the lexicographic order of labels, states and names. In the general case, to decide whether a free input transition is required it is as difficult as to decide the bisimilarity of two π-calculus agents.
From Co-algebraic Specifications to Implementation: The Mihda Toolkit
329
is used rather than a “bound” one. Hence, the transformation removes the first tuple from the bundle. During the iterative execution of the minimization algorithm, bundles are split; hence the set of redundant transitions of bundles decreases. Hence, when the iterative construction terminates, only those free inputs that are really redundant have been removed from the bundles. The normalization of a bundle β is done in different steps. First, the bundle is reduced by removing all the possibly redundant input transitions. Reduction function red(β) on bundles is defined as follows: – Dred(β) = Dβ , – Stepred(β) = Stepβ \ {IN, xy, σ, q | BIN, x, σ , q : Stepβ ∧ σ = σ; {y → ∗}}.
where σ; {y → ∗} is the function equal to σ on any name different from y and that assigns ∗ to y. Second, the normalization function norm(β) is defined as follows: – Dnorm(β) = Dβ – Stepnorm(β) = min Stepβ \ {IN, xy, σ, q | y ∈ anβ } ,
where anβ = {| red(β) |} is the active names of β and min is the function that, when applied to Stepβ , returns the step of the minimal bundle (with respect to order ) among those obtained by permuting names of β in all possible ways. More precisely, given a bundle β, min β is the minimal bundle in the set bij
{β; θ | θ : {| β |} −→ {| β |}}, with respect to the total ordering of bundles over D. The order relation is used to define the canonical representatives of bundles and relies on the order of quadruples. Hereafter, we use perm(β) to denote the canonical permutation that associates Stepnorm(β) and Stepβ \ {IN, xy, σ, q | y ∈ anβ }. We remark that, while all IN transitions covered by BIN transitions are removed in the definition of red(β), only those corresponding to the reception of non-active names are removed in the definition of norm(β). In fact, even if an input transition is redundant, it might be the case that it corresponds to the reception of a name that is active due to some other transitions. Finally, we need a construction which extracts in a canonical way a group of permutations out of a bundle. Let β be a bundle, define Gr β to be the set {ρ | Stepβ ; (ρ[∗ /∗ ]) = Stepβ }. It can be proved that Gr β is a group of permutations. 3.3
The Minimization Algorithm
We are now ready to give the definition of the functor T that states the coalgebras for HD-automata. The action of functor T over named sets is given by: – – – –
QT (A) = {β : Bundle | Dβ = A, β normalized}, |β|T (A) = β, GT (A) (β) = Gr β, β1 ≤T (A) β2 iff Stepβ1 Stepβ2 ,
while the action of functor T over named functions is given by:
330
G. Ferrari et al.
– ST (H) = T (SH ), DT (H) = T (DH ), – hT (H) (β : QT (SH ) ) : QT (DH ) = norm(β ), : → {| norm(β ) |}{β}T (SH ) – ΣT (H)(β : QT (SH ) ) = Gr(norm(β ));(perm(β ))−1;inj − where β = DH , {, π, σ ; σ, hH (q) | , π, σ, q : Stepβ , σ : ΣH (q)}.
Notice that functor T maps every named set A into the named set T (A) of its normalized bundles. A named function H is mapped into a named function T (H) in such a way that every corresponding pair (q, hH (q)) in hH is mapped into a set of corresponding pairs (β, norm(β )) of bundles in hT (H) . The quadruples of bundle β are obtained from those of β by replacing q with hH (q) and by saturating with respect to the set of name mappings in ΣH (q). The name mappings in ΣT (H) (β) are obtained by transforming the permutation group of bundle norm(β ) with the inverse of the canonical permutation of β and with a fixed injective function inj mapping the set of names of norm(β ) into the set of names of β, defined as i < j, inj(vi ) = vi and inj(vj ) = vj implies i < j . Without bundle normalization, the choice of β among those in β ; θ would have been arbitrary and not canonical with the consequence of mapping together fewer bundles than needed. Definition 4 (Transition systems for π-agents). A transition system over named sets and π-actions is a named function K such that DK = T (SK ). HD-automata are particular transition systems over named sets. An HD- automaton A is given by: – – – –
the elements of QA are π-agents and ≤A is the lexicographic order on QA ; |p(v1 , ..., vn )|A = n; GA (q) = {id : {q}A −→ {q}A }, where id denotes the identity function, h : QA −→ {β | Dβ = A} is such that , π, σ, q ∈ Steph(q) represent the π-calculus transitions from agent q. ,π,σ
We will often use the notation q −−→ q to denote the “representative” transitions from agent q that are used in the construction of the HD-automaton. We can now define the function K. – SK = A, – hK (q) = norm(h(q)), – ΣK (q) = Gr(hK (q)); (perm(h(q)))−1 ; inj : {| h(q) |} −→ {q}A The minimal HD-automata is built by an iterative procedure on K: the iteration along the terminal sequence. The formula which details the iterative construction is given by Hi+1 = K;T (Hi ). If K is a finite state HD-automaton. Then The initial approximation, H0 , is defined as follows:
Fig. 3. Mihda Software Architecture (modules: States, Labels, Transitions, Automaton, Bundle, Domination and Block, connected by typing dependencies).
– S_{H₀} = S_K, D_{H₀} = unit, where Q_unit = {∗}, |∗|_unit = 0 (and hence {∗} = ∅), G_unit(∗) = ∅, and ∗ ≤_unit ∗,
– h_{H₀}(q : Q_{S_{H₀}}) = ∗,
– Σ_{H₀}(q) = {∅}.

We recall that the iteration along the terminal sequence converges in a finite number of steps: there exists an i such that D_{H_{i+1}} ≡ D_{H_i}, and the isomorphism mapping F : D_{H_i} → D_{H_{i+1}} yields the minimal realization of the transition system K up to strong early bisimilarity.
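The skeleton of this fixed-point iteration can be rendered in OCaml as follows (a minimal sketch of our own, not Mihda's code): step plays the role of H_i ↦ K; T(H_i), and equiv the test D_{H_{i+1}} ≡ D_{H_i}.

    (* iterate along the terminal sequence until two consecutive
       approximations have isomorphic domains *)
    let minimize ~(step : 'h -> 'h) ~(equiv : 'h -> 'h -> bool) (h0 : 'h) : 'h =
      let rec go h =
        let h' = step h in
        if equiv h h' then h' else go h'
      in
      go h0

For a finite-state K the recursion terminates, mirroring the convergence result quoted above.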
4 The Mihda Toolkit
The previous sections outlined the coalgebraic foundation for the finite state verification of name-passing process calculi. It remains to show that this theory can be effectively used as a basis for the design and development of effective and usable verification toolkits. This section and the one following explore this issue by describing our experience in designing, implementing and experimenting with a minimization toolkit, called Mihda, for verifying finite state mobile systems represented in the π-calculus. The Mihda toolkit cleanly separates facilities that are language-specific (parsing, transition system calculation) from those that are independent of the calculus notation (bisimulation), in order to ease modifications. The toolkit has been implemented in ocaml. Indeed, the partition refinement algorithm has been specified in a "type-theoretic" style, and the underlying type system makes use of parametric polymorphism. The type system of ocaml offers all the necessary features for handling these kinds of types. Figure 3 illustrates the modules of Mihda and their dependencies.
For instance, State is the module which provides all the structures for handling states, and its main type defines the type of the states of the automata. Domination is the module containing the structures underlying bundle normalization. The connections express typing relationships among the modules. For instance, since states in bundles and transitions must have the same type, a connection exists between modules Bundle and Transitions. Notice that the iterative construction of the minimal automaton is parameterized with respect to the modules of Figure 3. Indeed, the same algorithm can be applied to different kinds of automata and bisimulations, provided that these automata match the constraints on types imposed by the software architecture. For instance, the architecture of Mihda has been exploited to provide minimization of both HD-automata and ordinary automata (up to strong bisimilarity).
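In OCaml this kind of parameterization is naturally expressed with functors; the following sketch shows the shape of such a construction (the signatures are our own guesses at the typing constraints suggested by Figure 3, not Mihda's actual interfaces).

    module type STATE = sig
      type t
      val compare : t -> t -> int
    end

    module type AUTOMATON = sig
      module State : STATE
      type t
      val states : t -> State.t list
    end

    (* The minimization algorithm, parameterized by the automaton modules:
       instantiating it with different modules yields minimization for
       different kinds of automata, as described above. *)
    module Minimizer (A : AUTOMATON) = struct
      (* the initial partition puts all states in a single block *)
      let initial_partition (a : A.t) : A.State.t list list = [ A.states a ]
    end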
4.1 The Main Cycle
We have already pointed out that the iterative step of the minimization algorithm can be represented in a functional style as follows:

    h_{H_{i+1}}(q) = norm⟨D_{H_i}, {⟨ℓ, π, σ′; σ, h_{H_i}(q′)⟩ | q −ℓ,π,σ→ q′, σ′ : Σ_{H_i}(q′)}⟩.    (1)
We compute h_{H_{i+1}}(q) through the following steps: (a) determine the bundle of state q; (b) for each quadruple ⟨ℓ, π, σ, q′⟩ in this bundle, apply h_{H_i} to q′, the target state of the quadruple (yielding the bundle of q′ in the previous iteration of the algorithm); (c) left-compose each symmetry σ′ ∈ Σ(q′) with σ; (d) normalize the resulting bundle.

In the Mihda implementation the value of the i-th iteration (i.e., h_{H_i}) is stored in a list of blocks, which are the crucial data structures of Mihda. Blocks implement the action of the functor on the states of the automata and contain all the information needed to compute the iteration steps of the algorithm expressed in a set-theoretic framework. Blocks represent both (finite) named functions and partitions of an automaton (at each iteration of the algorithm). When the algorithm terminates, each block will correspond to a state of the minimal automaton. A block has the following structure:

    type Block_t = Block of
        id     : string        *
        states : State_t list  *
        norm   : Bundle_t      *
        names  : int list      *
        group  : int list list *
        Σ      : (State_t → (int * int) list list) *
        Θ⁻¹    : (State_t → (int * int) list)
Fig. 4. Graphical representation of a block: the states q of the block above, the representative state x below, with a function θ_q mapping names of x into the names of each q.
Field id is the name of the block and is used to identify the block in order to construct the minimal automaton at the end of the algorithm. Field states contains the states which are considered equivalent. The remaining fields represent:

– the normalized bundle with respect to the block considered as a state (norm),
– names, the list of names of the bundle in norm,
– group, its group of permutations,
– the function Θ⁻¹ which, given a state q, maps the names appearing in norm into the names of q. Basically, Θ⁻¹(q) is the function which establishes a correspondence between the bundle of q and the bundle of the corresponding representative element in the equivalence class of the minimal automaton.
We pictorially represent (some components of) a block as in Figure 4: the upper elements are the states in the block, while the element x is the "representative state", namely a graphical representation of the block as a state. For each state q a function θ_q maps names of x into the names of q. Function θ_q describes "how" the block approximates the state q at a given iteration. The circled arrow on x records that a block also has symmetries on its names. Bundle norm of block x is computed by exploiting the ordering relations over names, labels and states.

A graphical representation of steps (a)–(d) above in terms of blocks is illustrated in Figure 5. Step (a) is computed by the facility Automaton.bundle, which filters all transitions of the automaton whose source corresponds to q. Figure 5(a) shows that a state q is taken from a block and its bundle is computed. Step (b) is obtained by applying the facility Block.next to the bundle of q. The operation Block.next substitutes all target states of the quadruples with the corresponding current block and computes the new mappings (see Figure 5(b)). Step (c) does not seem to adhere exactly to the corresponding step of equation (1); however, if we consider that θ functions are computed at each step by composing the symmetries σ, we can easily see that the θ functions play exactly the rôle of the σ's. Finally, step (d) is represented in Figure 5(d) and is obtained via the function Bundle.normalize.
The main step of the minimization algorithm is the function split that computes, at each iteration, the current partition (the list of blocks):

    let split blocks block =
      try
        let minimal =
          Bundle.minimize red
            (Block.next (h_n blocks) (state_of blocks)
               (Automaton.bundle aut (List.hd (Block.states block))))
        in
        Some
          (Block.split minimal
             (fun q ->
                let normal =
                  Bundle.normalize red
                    (Block.next (h_n blocks) (state_of blocks)
                       (Automaton.bundle aut q))
                in
                Bisimulation.bisimilar minimal normal)
             block)
      with Failure e -> None
Fig. 5. Computing h_{H_{i+1}} (steps (a)–(d) represented in terms of blocks).
    let blocks = ref [ Block.from_states states ] in
    let stop = ref false in
    while not !stop do
      begin
        let oldblocks = !blocks in
        let buckets = split_iter (split oldblocks) oldblocks in
        begin
          blocks := List.map (Block.close_block (h_n oldblocks)) buckets;
          stop :=
            List.length !blocks = List.length oldblocks
            && List.for_all2
                 (fun x y -> Block.compare x y == 0)
                 !blocks oldblocks
        end
      end
    done;
    !blocks

Fig. 6. The main cycle of Mihda
Let block be a block in the list blocks; the function split computes minimal by minimizing the reduced bundle of the first state of block. The choice of the state for computing minimal is not important: without loss of generality, given two equivalent states q and q', it is possible to map names of q into names of q' preserving their associated normalized bundle if, and only if, a similar map from names of q' into names of q exists. Once minimal has been computed, split invokes Block.split with parameters minimal and block, while the second argument of Block.split is a function that computes the current normalized bundle of each state in block and checks whether or not it is bisimilar to minimal. This computation is performed by the function Bisimulation.bisimilar. If bisimilarity holds through θ_q, then Some θ_q is returned; otherwise None is returned.

We are now ready to comment on the main cycle of Mihda reported in Figure 6. Let k = (start, states, arrows) be an HD-automaton. When the algorithm starts, blocks is the list that contains a single block collecting all the states of the automaton k. At each iteration, the list of blocks is split as much as possible by split_iter, which returns a list of buckets; buckets have the same fields as blocks, apart from the name, the symmetries and the functions mapping names of destination states into names of source states. Essentially, the split operation checks whether two states in a block are still equivalent. States which are no longer equivalent to the representative element of the block are removed and inserted into a bucket. Then, by means of Block.close_block, all buckets are turned into blocks, which are assigned to blocks. Finally, the termination condition stop is evaluated. This condition is equivalent to saying that a bijection can be established between oldblocks (which corresponds to D_i) and blocks (corresponding to D_{i+1}). This
condition reduces to testing whether blocks and oldblocks have the same length and whether blocks at corresponding positions are equal.
5 Verifying Mobile Systems with Mihda
In this section we discuss some experimental results of Mihda in the analysis of mobile systems. In particular, we consider the π-calculus specification of the Handover Protocol for Mobile Telephones borrowed from [17] (which was in turn derived from [13]). The π-calculus specification of the GSM system is:

    define GSM(in,out) =
      (tca)(ta)(ga)(sa)(aa)(tcp)(tp)(gp)(sp)(ap) |(
        Car(ta,sa,out),
        Base(tca,ta,ga,sa,aa),
        IdleBase(tcp,tp,gp,sp,ap),
        Centre(in,tca,ta,ga,sa,aa,tcp,tp,gp,sp,ap) )
Centre receives messages from the environment on channel in; these input actions are the only observable actions performed by Centre. Module Car sends the messages to the end user along the channel out; these outputs are the only visible actions performed by Car. Modules Centre and Car interact via the base corresponding to the cell in which the car is located. The specification of modules Car, Base, IdleBase and Centre is reported in Table 3. The behaviour of the four modules is briefly summarized below:

– Car carries a MobileStation and travels across two different geographical areas that provide services to end users;
– Base and IdleBase are Base Station modules; they interconnect the MobileStation and the MobileSwitching Centre;
– Centre is a MobileSwitching centre which controls radio communications within the whole area composed of the two cells.

The protocol starts when Car moves from one cell to the other. Indeed, Centre communicates to the MobileStation the name of the base corresponding to the new cell. The communication of the new channel name to the MobileStation is performed via the current base. All the communications between the MobileSwitching centre and the MobileStation are suspended until the MobileStation receives the names of the new transmission channels. Then the base corresponding to the new cell is activated, and the communications between the MobileSwitching centre and the MobileStation continue through the new base.

In Table 4 we report the results of Mihda on two different versions of the protocol. The first row of the table corresponds to the version discussed above. The second row gives the figures for a version of the GSM protocol that models the MobileSwitching and MobileStation modules in a more realistic way. Indeed, the 'full' version exploits a protocol for establishing whether or not the car is crossing the boundary of a cell and entering the other cell.
Table 3. π-calculus specification of GSM modules

    define Car(talk,switch,out) =
        talk?(msg).out!msg.Car(talk,switch,out)
      + switch?(t).switch?(s).Car(t,s,out)

    define Base(talkcentre,talkcar,give,switch,alert) =
        talkcentre?(msg).talkcar!msg.Base(talkcentre,talkcar,give,switch,alert)
      + give?(t).give?(s).switch!t.switch!s.give!give.
          IdleBase(talkcentre,talkcar,give,switch,alert)

    define IdleBase(talkcentre,talkcar,give,switch,alert) =
        alert?(empty).Base(talkcentre,talkcar,give,switch,alert)

    define Centre(in,tca,ta,ga,sa,aa,tcp,tp,gp,sp,ap) =
        in?(msg).tca!msg.Centre(in,tca,ta,ga,sa,aa,tcp,tp,gp,sp,ap)
      + tau.ga!tp.ga!sp.ga?(empty).ap!ap.
          Centre(in,tcp,tp,gp,sp,ap,tca,ta,ga,sa,aa)
Table 4. Mihda at work

    Protocol    Time to compile   States   Transitions   Time to minimize   States   Transitions
    GSM small   0m 0.931s         211      398           0m 4.193s          105      197
    GSM full    0m 8.186s         964      1778          0m 54.690s         137      253
The results were obtained by running Mihda on an AMD Athlon XP 1800+ dual-processor machine with 1 GB of RAM. The time needed to minimize the automata is quite modest, and the results on the GSM seem very promising: the minimal automata have considerably fewer states and transitions than their non-minimized versions. In the case of GSM small, the minimal automaton is half the size of the automaton obtained by compiling the original specification, while in the GSM full version states and transitions are reduced by roughly a factor of seven.
6 Conclusion
This paper has provided an overview of a foundational model for the finite state verification of global computing systems and has shown how efficient tool support can be derived from it. We are currently extending the Mihda toolkit with facilities to handle other notions of equivalence (e.g., open bisimilarity) and other foundational calculi for global computing (e.g., the asynchronous π-calculus, the fusion calculus). To improve efficiency, we plan to incorporate support for symbolic approaches based on Binary Decision Diagrams.
References

1. Peter Aczel. Algebras and coalgebras. In Roy Backhouse, Roland Crole and Jeremy Gibbons, editors, Algebraic and Coalgebraic Methods in the Mathematics of Program Construction, volume 2297 of LNCS, chapter 3, pages 79–88. Springer Verlag, April 2002. Revised Lectures of the Int. Summer School and Workshop.
2. Edmund M. Clarke and Jeanette M. Wing. Formal methods: state of the art and future directions. ACM Computing Surveys, 28(4):626–643, December 1996.
3. Jean Claude Fernandez. An implementation of an efficient algorithm for bisimulation equivalence. Science of Computer Programming, 13:219–236, 1990.
4. GianLuigi Ferrari, Ugo Montanari, and Marco Pistore. Minimizing transition systems for name passing calculi: A co-algebraic formulation. In Mogens Nielsen and Uffe Engberg, editors, FOSSACS 2002, volume 2303 of LNCS, pages 129–143. Springer Verlag, 2002.
5. Marcelo Fiore, Gordon G. Plotkin, and Daniele Turi. Abstract syntax and variable binding. In 14th Annual Symposium on Logic in Computer Science. IEEE Computer Society Press, 1999.
6. Murdoch J. Gabbay and Andrew M. Pitts. A new approach to abstract syntax involving binders. In 14th Annual Symposium on Logic in Computer Science. IEEE Computer Society Press, 1999.
7. Bart Jacobs and Jan Rutten. A tutorial on (co)algebras and (co)induction. Bulletin of the EATCS, 62:222–259, 1996.
8. Paris C. Kanellakis and Scott A. Smolka. CCS expressions, finite state processes and three problems of equivalence. Information and Computation, 86(1):272–302, 1990.
9. Robin Milner. Communicating and Mobile Systems: the π-calculus. Cambridge University Press, 1999.
10. Robin Milner, Joachim Parrow, and David Walker. A calculus of mobile processes, I and II. Information and Computation, 100(1):1–40, 41–77, September 1992.
11. Ugo Montanari and Marco Pistore. History dependent automata. Technical Report TR-11-98, Computer Science Department, Università di Pisa, 1998.
12. Ugo Montanari and Marco Pistore. π-calculus, structured coalgebras and minimal HD-automata. In Mathematical Foundations of Computer Science 2000, volume 1893 of LNCS. Springer, 2000.
13. Fredrik Orava and Joachim Parrow. An algebraic verification of a mobile network. Formal Aspects of Computing, 4(5):497–543, 1992.
14. Marco Pistore. History Dependent Automata. PhD thesis, Computer Science Department, Università di Pisa, 1999.
15. Andrew M. Pitts and Murdoch J. Gabbay. A metalanguage for programming with bound names modulo renaming. In Mathematics of Program Construction, 5th International Conference, MPC 2000, volume 1837 of LNCS. Springer, 2000.
16. Davide Sangiorgi and David Walker. The π-calculus: a Theory of Mobile Processes. Cambridge University Press, 2002.
17. Björn Victor and Faron Moller. The Mobility Workbench — a tool for the π-calculus. In David Dill, editor, Proceedings of CAV '94, volume 818 of LNCS, pages 428–440. Springer-Verlag, 1994.
A Calculus for Modeling Software Components

Oscar Nierstrasz and Franz Achermann
Software Composition Group, University of Bern, Switzerland
http://www.iam.unibe.ch/~scg
Abstract. Many competing definitions of software components have been proposed over the years, but still today there is only partial agreement over such basic issues as granularity (are components bigger or smaller than objects, packages, or applications?), instantiation (do components exist at run-time or only at compile-time?), and state (should we distinguish between components and "instances" of components?). We adopt a minimalist view in which components can be distinguished by composable interfaces. We have identified a number of key features and mechanisms for expressing composable software, and propose a calculus for modeling components, based on the asynchronous π-calculus extended with explicit namespaces, or "forms". This calculus serves as a semantic foundation and an executable abstract machine for Piccola, an experimental composition language. The calculus also enables reasoning about compositional styles and evaluation strategies for Piccola. We present the design rationale for the Piccola calculus, and briefly outline some of the results obtained.
1 Introduction
What is a software component? What are the essential aspects of Component-Based Software Development? What is a suitable foundation for modeling and reasoning about CBSD? To the first question, one of the most robust and appealing answers has been:

"A software component is a unit of independent deployment without state." [43]

This simple definition captures much that is important, though it leaves some very important aspects implicit. First, CBSD attempts to streamline software development and evolution by separating what is stable from what is not. That is, components are not just "independently deployable", but they must encapsulate a stable unit of functionality. This, of course, begs the question, "If components are the stable stuff, what makes up the rest?" Second, "independent deployment" of components actually entails compliance with some well-defined component model in which components present their services as a set of interfaces or "plugs":

"A software component is a static abstraction with plugs." [31]
This leads us to answer the question, "What makes up the rest?" as follows:

Applications = Components + Scripts [6]

that is, component-based applications are (ideally) made up of stable, off-the-shelf components, and scripts that plug them together. Scripts (ideally) make use of high-level connectors that coordinate the services of various components [3,29,42]. Furthermore, complex applications may need services of components that depend on very different architectural assumptions [16]. In these cases, glue code is needed to adapt components to different architectural styles [40,41].

Returning to our original questions, then, we conclude that it is not really possible to define software components without taking these complementary aspects of CBSD into account. At a purely technical level, i.e., ignoring methodological and software process aspects, these aspects include styles (plugs and connectors), scripts, coordination and glue code. A formal foundation for any reasonable notion of software components must address these aspects. We claim that most of these aspects can be adequately addressed by the notion of forms — first-class, extensible namespaces. The missing aspect (coordination) can be addressed by agents and channels. We propose, therefore, a calculus for modeling composable software which is based on the asynchronous π-calculus [25,36] extended with first-class namespaces [5]. This calculus serves both as the semantic target and as an executable abstract machine for Piccola, an experimental composition language for implementing styles, scripts, coordination abstractions and glue code [4,6]. The Piccola calculus is described in greater detail in Achermann's PhD dissertation [2].

In this paper we first motivate the calculus by establishing a set of requirements for modeling the composition of software components in section 2. Next, we address these requirements by presenting the syntax and semantics of the Piccola calculus in section 3. In section 4 we provide a brief overview of Piccola, and summarize how the calculus helps us to define its semantics, reason about composition, and optimize the language bridge by partial evaluation while preserving its semantics. Finally, we conclude with a few remarks about related and ongoing work in sections 5 and 6.
2 Modeling Software Composition
As we have seen, a foundation for modeling software components must also be suitable for expressing compositional styles, scripts, coordination abstractions and glue code. Let us examine each of these in turn to see which requirements they pose. Figure 1 summarizes these requirements, and illustrates how Piccola and the Piccola calculus support them.
Fig. 1. How Piccola supports composition. Piccola's core features (extensible, immutable records; first-class, monadic services; language bridging; introspection; explicit namespaces; services as operators; dynamic scoping on demand; agents and channels) support Styles (neutral object model, meta-objects, higher-order plugs and connectors, default arguments, encapsulation, component algebras), Glue (generic wrappers, component packaging, generic adaptors), Scripts (sandboxes, composition expressions, context-dependent policies) and Coordination (coordination abstractions).
2.1 Compositional Styles
A compositional style allows us to express the structure of a software application in terms of components, connectors and rules governing their composition (cf. "architectural style" [42]).

– Neutral object model: There exists a wide variety of different object and component models. Components may also be bigger or smaller than objects. As a consequence, a general foundation for modeling components should make as few assumptions about objects, classes and inheritance as possible, namely: objects provide services, they may be instantiated, and their internal structure is hidden.
– Meta-objects: On the other hand, many component models depend on runtime reflection, so it must be possible to express dynamic generation of meta-objects.
– Higher-order plugs and connectors: In general, connectors can be seen as higher-order operators over components and other connectors.
– Default arguments: Flexibility in plugging together components is achieved if interface dependencies are minimized. Keyword-based rather than positional arguments to services enable both flexibility and extensibility.
– Encapsulation: Components are black-box entities that, like objects, provide services, without exposing their structure. At the same time, the components and connectors of a particular style can be encapsulated as a module, or namespace, within which components may be scripted.
Fig. 2. Evaluating the helloButton script.
– Component algebras: Compositional styles are most expressive when compositions of components and connectors again yield components (or connectors). (The composition of two filters is again a filter.)

Based on these requirements, we conclude that we need (at least) records (to model objects and components), higher-order functions, reflection, and (at some level) overloading of operators. Services may be monadic, taking records as arguments, rather than polyadic. To invoke a service, we just apply it to a record which bundles together all the required arguments, and possibly some optional ones. These same records can serve as first-class namespaces which encapsulate the plugs and connectors of a given style. For this reason we unify records and namespaces, and call them "forms", to emphasize their special role. A "form" is essentially a nested record, which binds labels to values.

Consider, for example, the following JPiccola script [30]:

    makeFrame
        title = "AWT Demo"
        x = 200
        y = 100
        hello = "hello world"
        sayHello: println hello
        component = Button.new(text=hello) ? ActionPerformed sayHello

This script invokes an abstraction makeFrame, passing it a form containing bindings for the labels title, x, and so on. The script makes use of a compositional style in which GUI components (i.e., the Button) can be bound to events (i.e., ActionPerformed) and actions (i.e., sayHello) by means of the ? connector. When we evaluate this code, it generates the button we see in Figure 2. When we click on the button, hello world is printed on the Java console.
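As an aside, the flavour of such keyword-based service invocation can be approximated in OCaml with labeled and optional arguments; the sketch below is our own analogy (the real makeFrame builds a GUI, which we do not model here), not Piccola code.

    (* Labeled and optional parameters play the role of the bindings in the
       form passed to makeFrame: callers may omit optional ones and supply
       the rest in any order. *)
    let make_frame ?(x = 200) ?(y = 100) ~(title : string) () =
      Printf.sprintf "frame %S at (%d, %d)" title x y

    let _ = make_frame ~title:"AWT Demo" ()    (* uses default x, y *)
    let _ = make_frame ~y:50 ~title:"Demo" ()  (* overrides y only  *)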
2.2 Glue
Glue code is needed to package, wrap or adapt code to fit into a compositional style.

– Generic wrappers: Wrappers are often needed to introduce specific policies (such as thread-safe synchronization). Generic wrappers are hard to specify for general, polyadic services, but are relatively straightforward if all services are monadic.
– Component packaging: Glue code is sometimes needed to package existing code to conform to a particular component model or style. For this purpose, a language bridge is needed to map existing language constructs to the formal component model.
– Generic adaptors: Adaptation of interfaces can also be specified generically with the help of reflective or introspective features, which allow components to be inspected before they are adapted.

The JPiccola helloButton script only works because Java GUI components are wrapped to fit into our compositional style. In addition to records and higher-order functions over records, we see that some form of language bridging will be needed, perhaps not at the level of the formal model, but certainly for a practical language or system based on the model.
2.3 Scripts
Scripts configure and compose components using the connectors defined for a style.

– Sandboxes: For various reasons we may wish to instantiate components only in a controlled environment. We do not necessarily trust third-party components. Sometimes we would like to adapt components only within a local context. For these and other reasons it is convenient to be able to instantiate and compose namespaces which serve as sandboxes for executing scripts.
– Composition expressions: Scripts instantiate and connect components. A practical language might conveniently represent connectors as operators. Pipes-and-filters connections are well known, but this idea extends well to other domains.
– Context-dependent policies: Very often, components must be prepared to employ services of the dynamic context. Transaction services, synchronization or communication primitives may depend on the context. For this reason, pure static scoping may not be enough, and dynamic scoping on demand will be needed for certain kinds of component models.

So, we see that explicit, manipulable namespaces become more important.
2.4 Coordination
CBSD is especially relevant in concurrent and distributed contexts. For this reason, a foundation for composition must be able to express coordination of interdependent tasks.

– Coordination abstractions: Both connectors and glue code may need to express coordination of concurrent activities. Consider a readers/writers synchronization policy as a generic wrapper.

We conclude that we not only need higher-order functions over first-class namespaces (with introspection), but also a way of expressing concurrency and communication [40].
Table 1. Syntax of the Piccola Calculus.

    A, B, C ::= ε          empty form
             |  R          current root
             |  A; B       sandbox
             |  x          variable
             |  x↦         bind
             |  hide x     hide
             |  L          inspect
             |  A · B      extension
             |  λx.A       abstraction
             |  A B        application
             |  νc.A       restriction
             |  A | B      parallel
             |  c?         input
             |  c          output

    F, G, H ::= ε          empty form
             |  x↦F        binding
             |  S          service
             |  F · G      extension

    S ::= F; λx.A          closure
       |  x↦               bind
       |  c                output
       |  L                inspect
       |  hide x           hide
3 The Piccola Calculus
As a consequence of the requirements we have identified above, we propose as a foundation a process calculus based on the higher-order asynchronous π-calculus [25,36] in which tuple-based communication is replaced by communication of extensible records, or forms [5]. Furthermore, forms serve as first-class namespaces and support a simple kind of introspection. The design of the Piccola calculus strikes a balance between minimalism and expressiveness. As a calculus it is rather large. In fact, it would be possible to express everything we want with the π-calculus alone, but the semantic gap between the concepts we wish to model and the terms of the calculus would be rather large. With the Piccola calculus we are aiming for the smallest calculus with which we can conveniently express components, connectors and scripts.
3.1 Syntax
The Piccola calculus is given by agents A, B, C that range over the set of agents A defined in Table 1. There are two categories of identifiers: labels and channels. The set of labels L is ranged over by x, y, z. (We use the terms "variables" and "labels" interchangeably.) Specific labels are also written in the italic text font. Channels are denoted by a, b, c, d ∈ N. Labels are bound by bindings and λ-abstractions, and channels are bound by ν-restrictions. The operators have the following precedence:

    application > extension > restriction, abstraction > sandbox > parallel

Agent expressions are reduced to static form values, or simply forms. Forms are ranged over by F, G, H. Notice that the set of forms is a subset of all agents. Forms are the first-class citizens of the Piccola calculus, i.e., they
are the values that get communicated between agents and are used to invoke services. Forms are sets of bindings and services. The set of forms is denoted by F. Certain forms play the role of services. We use S to range over services. User-defined services are closures. Primitive services are inspect, the bind and hide primitives, and the output service. Before considering the formal reduction relation, we first give an informal description of the different agent expressions and how they reduce.

– The empty form, ε, does not reduce further. It denotes a form without any binding.
– The current root agent, R, denotes the current lexical scope.
– A sandbox A; B evaluates the agent B in the root context given by A. A binds all free labels in B. If B is a label x, we say that A; x is a projection on x in A.
– A label, x, denotes the value bound by x in the current root context.
– The primitive service bind creates bindings. If A reduces to F, then x↦A reduces to the binding x↦F.
– The primitive service hide x removes bindings. So, hide x (x↦ε · y↦ε) reduces to y↦ε.
– The inspect service, L, can be used to iterate over the bindings and services of an arbitrary form F. The result of LF is a service that takes as its argument a form that binds the labels isEmpty, isService and isLabel to services. One of these three services will then be selected, depending on whether F is ε, contains some bindings, or is only a service.
– The values of two agents are concatenated by extension. In the value of A · B the bindings of B override those for the same label in A.
– An abstraction λx.A abstracts x in A.
– The application AB denotes the result of applying A to B. Piccola uses a call-by-value reduction order. In order to reduce AB, A must reduce to a service and B to a form.
– The expression νc.A restricts the visibility of the channel name c to the agent expression A, as in the π-calculus.
– A | B spawns off the agent A asynchronously and yields the value of B. Unlike in the π-calculus, the parallel composition operator is not commutative, since we do not wish parallel agents to reduce to non-deterministic values.
– The agent c? inputs a form from channel c and reduces to that value. The reader familiar with the π-calculus will notice a difference with the input prefix. Since we have explicit substitution in our calculus, it is simpler to specify the input by c? and use the context to bind the received value, instead of defining a prefix syntax c(X).A as in the π-calculus.
– The channel c is a primitive output service. If A reduces to F, then cA reduces to the message cF. The value of a message is the empty form ε. (The value F is only obtained by a corresponding input c? in another agent.)
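To make the shape of this syntax concrete, here is a hedged OCaml sketch of an abstract syntax tree for Table 1; the constructor names and the ASCII spellings (Empty for ε, Binding for x↦F) are our own, not part of the calculus.

    type label = string
    type chan = string

    type agent =
      | Empty                      (* the empty form ε *)
      | Root                       (* current root R *)
      | Var of label               (* variable x *)
      | Bind of label              (* bind primitive x↦ *)
      | Binding of label * agent   (* a binding x↦F, produced by bind *)
      | Hide of label              (* hide x *)
      | Inspect                    (* inspect L *)
      | Sandbox of agent * agent   (* A; B *)
      | Ext of agent * agent       (* A · B *)
      | Abs of label * agent       (* λx.A *)
      | App of agent * agent       (* A B *)
      | New of chan * agent        (* νc.A *)
      | Par of agent * agent       (* A | B *)
      | Out of chan                (* output service c *)
      | In of chan                 (* input c? *)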
Table 2. Free Channels.

    fc(ε)     = ∅                fc(R)       = ∅
    fc(x)     = ∅                fc(L)       = ∅
    fc(x↦)    = ∅                fc(hide x)  = ∅
    fc(A; B)  = fc(A) ∪ fc(B)    fc(A · B)   = fc(A) ∪ fc(B)
    fc(λx.A)  = fc(A)            fc(AB)      = fc(A) ∪ fc(B)
    fc(νc.A)  = fc(A)\{c}        fc(A | B)   = fc(A) ∪ fc(B)
    fc(c?)    = {c}              fc(c)       = {c}

3.2 Free Channels and Closed Agents
As in the π-calculus, forms may contain free channel names. An agent may create a new channel, and communicate this new name to another agent in a separate lexical scope. The free channels fc(A) of an agent A are defined inductively in Table 2. α-conversion (of channels) is defined in the usual way. We identify agent expressions up to α-conversion. We omit a definition of free variables. Since Piccola is a calculus with explicit environments, we cannot easily define α-conversion on variables; such a definition would have to take into account the special nature of R. Instead, we define a closed agent as one where all variables, root expressions, and abstractions occur beneath a sandbox:

Definition 1. The following agents A are closed:

– ε, x↦, hide x, L, c and c? are closed.
– If A and B are closed, then so are A · B, AB, A | B and νc.A.
– If A is closed, then A; B is also closed for any agent B.
Observe that any form F is closed by the above definition. An agent is open if it is not closed. Open agents are R, variables x, abstractions λx.A and compositions thereof. Any agent can be closed by putting it into a sandbox with a closed context: sandbox agents are closed if the root context is closed. In Lemma 1 we show that the property of being closed is preserved by reduction.
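Reusing the AST sketched in section 3.1 above, the definition of fc in Table 2 transcribes directly into OCaml (again a sketch of our own, using plain lists as sets):

    (* free channels of an agent, following Table 2; the Binding case is
       our addition, since a binding x↦F carries the free channels of F *)
    let rec fc (a : agent) : chan list =
      let union l1 l2 = l1 @ List.filter (fun c -> not (List.mem c l1)) l2 in
      match a with
      | Empty | Root | Var _ | Bind _ | Hide _ | Inspect -> []
      | Binding (_, a) | Abs (_, a) -> fc a
      | Sandbox (a, b) | Ext (a, b) | App (a, b) | Par (a, b) ->
          union (fc a) (fc b)
      | New (c, a) -> List.filter (( <> ) c) (fc a)
      | Out c | In c -> [ c ]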
3.3 Congruence and Pre-forms
As in the π-calculus, we introduce a structural congruence over agent expressions to simplify the reduction relation. The congruence allows us to rewrite agent expressions so as to bring communicating agents into juxtaposition, as in the Chemical Abstract Machine of Berry and Boudol [8]. The congruence rules fall into three groups (see Table 3). The first group (from ext empty right to single service) deals with congruence over forms. It specifies that extension is idempotent and associative on forms. The rules single service and single binding specify that extension overwrites services and bindings with the same label. We define labels(F) in Definition 2 below.
Table 3. Congruences. ≡ is the smallest congruence satisfying the following axioms:

    F · ε ≡ F                                      (ext empty right)
    ε · F ≡ F                                      (ext empty left)
    (F · G) · H ≡ F · (G · H)                      (ext assoc)
    S · (x↦F) ≡ (x↦F) · S                          (ext service commute)
    x↦F · y↦G ≡ y↦G · x↦F        if x ≠ y          (ext bind commute)
    x↦F · x↦G ≡ x↦G                                (single binding)
    S · S′ ≡ S′                                    (single service)
    F; A · B ≡ (F; A) · (F; B)                     (sandbox ext)
    F; AB ≡ (F; A)(F; B)                           (sandbox app)
    A; (B; C) ≡ (A; B); C                          (sandbox assoc)
    F; G ≡ G                                       (sandbox value)
    F; R ≡ F                                       (sandbox root)
    hide x (F · x↦G) ≡ hide x F                    (hide select)
    hide y (F · x↦G) ≡ hide y F · x↦G  if x ≠ y    (hide over)
    hide x ε ≡ ε                                   (hide empty)
    hide x S ≡ S                                   (hide service)
    (F · S)G ≡ SG                                  (use service)
    (A | B) | C ≡ A | (B | C)                      (par assoc)
    (A | B) | C ≡ (B | A) | C                      (par left commute)
    (A | B) · C ≡ A | B · C                        (par ext left)
    F · (A | B) ≡ A | F · B                        (par ext right)
    (A | B)C ≡ A | BC                              (par app left)
    F(A | B) ≡ A | FB                              (par app right)
    (A | B); C ≡ A | B; C                          (par sandbox left)
    F; (A | B) ≡ F; A | F; B                       (par sandbox right)
    F | A ≡ A                                      (discard zombie)
    cF ≡ cF | ε                                    (emit)
    νcd.A ≡ νdc.A                                  (commute channels)
    A | νc.B ≡ νc.(A | B)        if c ∉ fc(A)      (scope par left)
    (νc.B) | A ≡ νc.(B | A)      if c ∉ fc(A)      (scope par right)
    (νc.B) · A ≡ νc.(B · A)      if c ∉ fc(A)      (scope ext left)
    A · νc.B ≡ νc.(A · B)        if c ∉ fc(A)      (scope ext right)
    A; νc.B ≡ νc.(A; B)          if c ∉ fc(A)      (scope sandbox left)
    (νc.B); A ≡ νc.(B; A)        if c ∉ fc(A)      (scope sandbox right)
    (νc.B)A ≡ νc.(BA)            if c ∉ fc(A)      (scope app left)
    A(νc.B) ≡ νc.(AB)            if c ∉ fc(A)      (scope app right)
Definition 2. For each form F, the set labels(F) ⊂ L is given by:

    labels(ε) = ∅             labels(S) = ∅
    labels(x↦G) = {x}         labels(F · G) = labels(F) ∪ labels(G)
Using the form congruences, we can rewrite any form F into one of the following three cases:

    F ≡ ε
    F ≡ S
    F ≡ F′ · x↦G    where x ∉ labels(F′)

This is proved by structural induction over forms [2]. It formalizes our idea that forms are extensible records unified with services: a form has at most one binding for a given label.

The second group (from sandbox ext to use service) defines preforms. These are agent expressions that are congruent to a form. For instance, the agent hide x ε is equivalent to the empty form ε. The set of all preforms is defined by:

    F≡ = {A | ∃F ∈ F with F ≡ A}

Clearly, all forms are preforms.

The last group (from par assoc to scope app right) defines the semantics of parallel composition and communication for agents. Note how these rules always preserve the position of the rightmost agent in a parallel composition, since this agent, when reduced to a form, will represent the value of the composition. In particular, the rule discard zombie garbage-collects form values appearing to the left of this position. The rule emit, on the other hand, spawns an empty form as the value, thus enabling the message to move around freely. For instance, in

    x↦cε ≡ x↦(cε | ε)    by emit
         ≡ cε | x↦ε      by par ext right

the message cε escapes the binding for x.
3.4 Reduction
We define the reduction relation → on agent expressions to reduce applications, communications and projections (see Table 4). ⇒ is the reflexive and transitive closure of →. Especially noteworthy is the rule reduce beta. This rule does not substitute G for x in the agent A, as in the classical λ-calculus. Instead, it extends the environment in which A is evaluated. This is essentially the beta-reduction rule found in calculi for explicit substitution [1,32]:

    (F; λx.A)G → F · x↦G; A

The application of the closure F; λx.A to the argument G reduces to a sandbox expression in which the agent A is evaluated in the environment F · x↦G. Free occurrences of x in A will therefore be bound to G.

The property of being closed is respected by reduction:

Lemma 1. If A is a closed agent and A → B or A ≡ B, then B is closed as well.

Proof. Easily checked by induction over the formal proof for A → B.
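In terms of the AST sketched earlier, reduce beta is a one-line tree rewrite rather than a substitution; the helper below is our own illustrative rendering, not part of the calculus.

    (* (F; λx.A) G → F · x↦G; A : extend the closure's environment f with a
       binding for x instead of substituting g inside the body a *)
    let reduce_beta (f : agent) (x : label) (a : agent) (g : agent) : agent =
      Sandbox (Ext (f, Binding (x, g)), a)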
Table 4. Reduction rules.

    (F; λx.A) G → F · x↦G; A                       (reduce beta)

    cF | c? → F                                    (reduce comm)

    F · x↦G; x → G                                 (reduce project)

    Lε → ε; λx.(x; isEmpty)                        (reduce inspect empty)

    LS → ε; λx.(x; isService)                      (reduce inspect service)

    L(F · x↦G) → ε; λx.((x; isLabel) label_x)      (reduce inspect label)

    A ≡ A′, A′ → B′, B′ ≡ B  implies  A → B        (reduce struct)

    A → B  implies  E[A] → E[B]                    (reduce propagate)

where label_x = project↦(ε; λx.(x; x)) · hide↦hide x · bind↦(x↦), and E is an evaluation context defined by the grammar:

    E ::= [ ]  |  E · A  |  F · E  |  E; A  |  F; E  |  EA  |  FE  |  A | E  |  E | A  |  νc.E
3.5 Encoding Booleans
The following toy example illustrates many of the principles at stake when we model components with the Piccola calculus. We can encode booleans by services that project on the label true or false, depending on which boolean value they are supposed to model (cf. [13]). (This same idea is used by the primitive service L to reflect over the bindings and services of a form.)

    True  =def  ε; λx.(x; true)     (1)
    False =def  ε; λx.(x; false)    (2)
Consider now:

    True(true↦1 · false↦2)
      = (ε; λx.(x; true))(true↦1 · false↦2)
      → ε · x↦(true↦1 · false↦2); (x; true)     by reduce beta
      ≡ (ε · x↦(true↦1 · false↦2); x); true     by sandbox assoc
      → (true↦1 · false↦2); true                by reduce project
      ≡ (false↦2 · true↦1); true                by ext bind commute
      → 1                                       by reduce project
Note how the bindings are swapped to project on true in the last step. A similar reduction would show False(true↦1 · false↦2) ⇒ 2.

One of the key points of forms is that a client can provide additional bindings, which are simply ignored when they are not used (cf. [13]). The same principle is applied to good effect in various scripting languages, such as Python [22]. For instance, we can use True and provide an additional binding notused↦F for an arbitrary form F:

    True(true↦1 · false↦2 · notused↦F)
      ⇒ (true↦1 · false↦2 · notused↦F); true
      ≡ (false↦2 · true↦1 · notused↦F); true    by ext bind commute
      ≡ (false↦2 · notused↦F · true↦1); true    by ext bind commute
      → 1                                       by reduce project
Extending forms can also be used to overwrite existing bindings. For instance, instead of binding the variable notused, a client may override true:

    True(true↦1 · false↦2 · true↦3) ⇒ 3
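The spirit of this encoding carries over to any language with records; the following OCaml fragment is our own analogy (not the paper's), where a boolean selects one field of a record of alternatives, mirroring projection on the labels true and false.

    type 'a cases = { t : 'a; f : 'a }

    let vtrue (c : 'a cases) : 'a = c.t
    let vfalse (c : 'a cases) : 'a = c.f

    (* vtrue { t = 1; f = 2 } evaluates to 1, mirroring
       True(true↦1 · false↦2) ⇒ 1; note that OCaml's eager evaluation
       evaluates both alternatives, so effectful branches should be
       thunked as unit -> 'a. *)
    let _ = assert (vtrue { t = 1; f = 2 } = 1)
    let _ = assert (vfalse { t = 1; f = 2 } = 2)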
A conditional expression is encoded as a curried service that takes a boolean and a case form. When invoked, it selects and evaluates the appropriate service in the case form:

    if =def ε; λuv.u(true↦(v; then) · false↦(v; else))ε
Now consider:

    if True (then↦(F; λx.A) · else↦(G; λx.B)) ⇒ F · x↦ε; A
The expression if True has triggered the evaluation of agent A in the environment F · x↦ε. The contract supported by if requires that the cases provided bind the labels then and else. We can relax this contract and provide default services when those bindings are not supplied by the client. To do so, we replace in the definition of if the sandbox expression v; else with a default service. This service gets triggered when the case form does not contain an else binding:
    ifd =def ε; λuv.u(true↦(v; then) · false↦((else↦(λx.ε) · v); else))ε

Now ifd False (then↦(F; λx.A)) ⇒ ε.
3.6 Equivalence for Agents
Two agents are equivalent if they exhibit the same behaviour, i.e., they enjoy the same reductions. We adopt Milner and Sangiorgi’s notion of barbed bisimulation
[26]. The idea is that an agent A is barbed similar to B if A can exhibit any reduction that B does, and if B is a barb, then A is a barb too. If A and B are similar to each other, they are bisimilar. The advantage of this bisimulation is that it can readily be given for any calculus that contains barbs or values. For the asynchronous π-calculus, barbs are usually defined as having the capability of doing an output on a channel. A Piccola agent reduces to a barb, i.e., it returns a form. During evaluation the agent may spawn off new subthreads, which could be blocked or still running. We consequently define barbs as follows:

Definition 3. A barb V is an agent expression A that is congruent to an agent generated by the following grammar:

    V ::= F  |  A | V  |  νc.V

We write A ↓ for the fact that A is a barb, and A ⇓ when a barb V exists such that A ⇒ V.

The following lemma relates forms, barbs and agents:

Lemma 2. The following inclusions hold and are strict:

    F ⊂ F≡ ⊂ {A | A ↓} ⊂ A

Proof. The inclusions hold by definition. To see that the inclusions are strict, consider the empty form ε, the agent hide x ε, the barb 0 | hide x ε, and the agent 0 (where 0 = νc.c? is the deadlocked null agent).

The following lemma gives a syntactical characterization of barbs:

Lemma 3. For any form F, agent A, and label x, the following terms are barbs, given that V₁ and V₂ are barbs:

    V₁ · V₂    V₁; V₂    x↦V₁    νc.V₁    A | V₁

Proof. By definition we have V ≡ νc̃.(A | F). The claim follows by induction over F.

We now define barbed bisimulation and the induced congruence:

Definition 4. A relation R is a (weak) barbed bisimulation if A R B, i.e., (A, B) ∈ R, implies:

– If A → A′ then there exists an agent B′ with B ⇒ B′ and A′ R B′.
– If B → B′ then there exists an agent A′ with A ⇒ A′ and A′ R B′.
– If A ↓ then B ⇓.
– If B ↓ then A ⇓.
Two agents are (weakly) barbed bisimilar, written A ≈̇ B, if there is some (weak) barbed bisimulation R with A R B. Two agents are (weakly) barbed congruent, written A ≈ B, if for all contexts C we have C[A] ≈̇ C[B].

We define behavioural equality using the notion of barbed congruence. As usual, we can define strong and weak versions of barbed bisimulation. The strong versions are obtained in the standard way by replacing ⇒ with → and ⇓ with ↓ in Definition 4. We concentrate only on the weak case, since it abstracts from internal computation.
3.7 Erroneous Reductions
Not all agents reduce to forms. Some agents enjoy an infinite reduction [2]. Other agents are stuck. An agent is stuck if it is not a barb and can reduce no further.

Definition 5. An agent A is stuck, written A ↑, if A is not a barb and there is no agent B such that A → B.

Clearly it holds that 0 ↑ and R ↑. The property of being stuck is not compositional. For instance, c? ↑ but, obviously, cε | c? can reduce to ε. We can put R into a context so that it becomes a barb, for instance F; R ≡ F. Note that if an agent is stuck it is not a preform: F≡ ∩ {A | A ↑} = ∅ by definition. Although 0 is arguably stuck by intention, in general a stuck agent can be interpreted as an error. The two typical cases which may lead to errors are (i) projection on an unbound label, e.g., ε; x, and (ii) application of a non-service, e.g., εε.
3.8 π-Calculus Encoding
One may well ask what exactly the Piccola calculus adds over and above the asynchronous π-calculus. In Achermann's thesis it is shown that the Piccola calculus can be faithfully embedded into the localized π-calculus Lπ of Merro and Sangiorgi [23,36]. The mapping [[·]]_a encodes Piccola calculus agents as π-calculus processes. The process [[A]]_a evaluates A in the environment given by the empty form, and sends the resulting value along the channel a. A form (value) is encoded as a 4-tuple of channels representing projection, invocation, hiding and selection. The main result is that the encoding is sound and preserves reductions. We do not require a fully abstract encoding, since that would mean that equivalent Piccola agents translated into the π-calculus could not be distinguished by any π-processes. Our milder requirement means that we consider only π-processes which are translations of Piccola agents themselves, and state that they cannot distinguish similar agents:

Proposition 1 (Soundness). For closed agents A, B and a channel a, the congruence [[A]]_a ≈ [[B]]_a implies A ≈ B.
Although it is comforting to learn that the π-calculus can serve as a foundation for modeling components, it is also clear from the complexity of the encoding that it is very distant from the kinds of abstractions we need to conveniently model software composition. For this reason we find a richer calculus more convenient for expressing components and connectors.
4 From the Piccola Calculus to Piccola
Piccola is a small composition language that supports the requirements summarized in Figure 1, and whose denotational semantics is defined in terms of the Piccola calculus [2]. Piccola is designed in a layered fashion. At the lowest level we have an abstract machine that implements the Piccola calculus. At the next level, we have the Piccola language, which is implemented by translation to the abstract machine, following the specification of the denotational semantics.
    Piccola Layers
    Applications:        Components + Scripts
    Composition styles:  Streams, GUI composition, ...
    Standard libraries:  Coordination abstractions, control structures, basic object model, ...
    Piccola language:    Host components, user-defined operators, dynamic namespaces
    Piccola calculus:    Forms, agents and channels
354
O. Nierstrasz and F. Achermann
try-catch clauses are implemented using agents, channels, and dynamic namespaces [5]. The first three layers constitute the standard Piccola distribution. The fourth layer is provided by the component framework designer. At this level, a domain expert encodes a compositional styles as a library of components, connectors, adaptors, coordination abstractions, and so on. Finally, at the top level, an application programmer may script together components using the abstractions provided by the lower layers [3,29]. 4.1
Reasoning about Styles
We have also explored how to reason about Piccola programs at the language level [2]. We have studied two extended examples. First, we considered synchronization wrappers that express the synchronization constraints assumed by a component. We can use synchronization wrappers to make components safe in a multithreaded environment. The wrappers separate the functionality of the component from their synchronization aspects. If the constraints assumed by the component hold in a particular composition the wrapper is not needed. In particular the wrapper is not necessary when the component is already wrapped. This property is formally expressed by requiring that the wrappers be idempotent. The second study compares push- and pull-flow filters. We demonstrate how to adapt pull-filters so that they work in a push-style. We have constructed a generic adapter for this task in two iterations. The first version contains a racecondition that may lead to data being lost. The formal model of Piccola is used to analyze the traces of an adapted filter and helps to detect the error. To fix the problem we specify the dynamics of a push-style filter, namely that push and close calls be mutually exclusive, that no further push calls may be attempted after a close, and that no “air-bubble” elements (filter slots holding an empty form) may be pushed downstream. Having clarified the interaction protocol as a wrapper, we present an improved version of the generic adapter. We show that the adapter ensures these invariants. 4.2
Partial Evaluation
Another interesting application of the calculus was to enable the efficient implementation of the language bridge. Since Piccola is a pure composition language, evaluating scripts requires intensive upping and downing [24] between the “down” level of the host language and the “up” level of Piccola. If the language bridge were implemented na¨ıvely, it would be hopelessly inefficient. Instead, Piccola achieves acceptable performance by adopting a partial evaluation scheme [2,38,39]. Since the language has a denotational semantics, we can implement it efficiently while proving that we preserve the intended semantics. The partial evaluation algorithm uses the fact that forms are immutable. We replace references
A Calculus for Modeling Software Components
355
to forms by the forms referred to. We can then specialize projections and replace applications of referentially transparent services by their results. However, most services in Piccola are not referentially transparent and cannot be inlined since that would change the order in which side-effects are executed. We need to separate the referentially transparent part from the non-transparent part in order to replace an application with its result and to ensure that the order in which the side-effects are evaluated is preserved. At the heart of the proof is that we can separate form expressions into sideeffects and referentially transparent forms [2].
5
Related Work
The Piccola calculus extends the asynchronous π-calculus with higher-order abstractions and first-class environments. π-calculus. The π-calculus [25] is a calculus of communicating systems in which one can naturally express processes with a changing structure. Its theory has been thoroughly studied and many results relate other formalisms or implementations to it. The affinity between objects and processes, for example, has been treated by various authors in the context of the π-calculus [18,44]. The Pict experiment has shown that the π-calculus is a suitable basis for programming many high-level construct by encodings [33]. For programming and implementation purposes, synchronous communication seems uncommon and can generally be encoded by using explicit acknowledgments (cf. [18]). Moreover, asynchronous communication has a closer correspondence to distributed computing [45]. Furthermore, in the π-calculus the asynchronous variant has the pleasant property that equivalences are simpler than for the synchronous case [14]. Input-guarded choice can be encoded and is fully abstract [27]. For these reasons we adopt asynchronous channels in Piccola. Higher-order abstractions. Programming directly in the π-calculus is often considered like programming a concurrent assembler. When comparing programs written in the π-calculus with the lambda-calculus it seems like lambda abstractions scale up, whereas sending and receiving messages does not scale well. There are two possible solutions proposed to this problem: We can change the metaphor of communication or we can introduce abstractions as first-class values. The first approach is advocated by the Join-calculus [15]. Communication does not happen between a sender and a receiver, instead a join pattern triggers a process on consumption of several pending messages. The Blue calculus of Boudol [9] changes the receive primitive into a definition which is defined for a scope. By that change, the Blue calculus is more closely related to functions and provides a better notion for higher-order abstraction. Boudol calls it a continuation-passing calculus. The other approach is adopted by Sangiorgi in the HOπ-calculus. Instead of communicating channels or tuples of channels, processes can be communicated
356
O. Nierstrasz and F. Achermann
as well. Surprisingly, the higher-order case has the same expressive power as the first-order version [35,36]. In Piccola we take the second approach and reuse existing encodings of functions into the π-calculus as in Pict. The motivation for this comes from the fact that the HOπ-calculus itself can be encoded in the first-order case. Asymmetric parallel composition. The semantics of asynchronous parallel composition is used in the concurrent object calculus of Gordon and Hankin [17] or the (asymmetric) Blue calculus studied by Dal-Zilio [12]. In the higher-order π-calculus the evaluation order is orthogonal to the communication semantics [36]. In Piccola, evaluation strategy interferes with communication, therefore we have to fix one for meaningful terms. For Piccola, we define strict evaluation which seems appropriate and more common for concurrent computing. Record calculus. When modeling components and interfaces, a record-based approach is the obvious choice. We use forms [20,21] as an explicit notion for extensible records. Record calculi are studied in more detail for example in [11,34]. In the λ-calculus with names of Dami [13] arguments to functions are named. The resulting system supports records as arguments instead of tuples as in the classical calculus. The λN -calculus was one of the main inspiration for our work on forms without introspection. An issue omitted in our approach is record typing. It is not clear how far record types with subtyping and the runtime acquisition can be combined. An overview of record typing and the problems involved can be found for example in [11]. Explicit environments. An explicit environment generalizes the concept of explicit substitution [1] by using a record like structure for the environment. In the environment calculus of Nishizaki, there is an operation to get the current environment as a record and an operator to evaluate an expression using a record as environment [32,37]. Projection of a label x in a record R then corresponds to evaluating the script x in an environment denoted by R. The reader may note that explicit environments subsume records. This is the reason why we call them forms in Piccola instead of just records. Handling the environment as a first-class entity allows us to define concepts like modules, interfaces and implementation for programming in the large within the framework. To our knowledge, the language Pebble of Burstall and Lampson was the first to formally show how to build modules, interfaces and implementation, abstract data types and generics on a typed lambda calculus with bindings, declarations and types as first-class values [10]. Other approaches A very different model is offered by ρω (AKA Reo) [7], a calculus of component connectors. Reo is algebraic in flavour, and provides various connectors that coordinate and compose streams of data. Primitives connectors can be composed using the Reo operators to build hiigher-level connectors. In contrast to process calculi, Reo is well-suited to compositional reasoning, since connectors can be composed to yield new connectors, and properties of
connectors can be shown to compose. Data communicated along streams are uninterpreted in Reo, so it would be natural to explore the application of Reo to streams of forms.
6 Concluding Remarks
We have presented the Piccola calculus, a high-level calculus for modeling software components that extends the asynchronous π-calculus with explicit namespaces, or forms. The calculus serves as the semantic target for Piccola, a language for composing software components that conform to a particular compositional style. JPiccola, the Java implementation of Piccola, is realized by translation to an abstract machine that implements the Piccola calculus. The Piccola calculus is not only helpful for modeling components and connectors, but it also helps to reason about the Piccola language implementation and about compositional styles. Efficient language bridging between Piccola and the host language (Java or Squeak) is achieved by means of partial evaluation of language wrappers. The partial evaluation algorithm can be proved correct with the help of the Piccola calculus. Different compositional styles make different assumptions about software components. Mixing incompatible components can lead to compositional mismatches. We have outlined how the Piccola calculus can help to bridge mismatches by supporting reasoning about wrappers that adapt component contracts from one style to another. One shortcoming of our work so far is the lack of a type system. We have been experimenting with a system of contractual types [28] that expresses both the provided as well as the required services of a software component. Contractual types are formalized in the context of the form calculus, which can be seen as the Piccola calculus minus agents and channels. Contractual types have been integrated into the most recent distribution of JPiccola [19].
Acknowledgments We gratefully acknowledge the financial support of the Swiss National Science Foundation for projects No. 20-61655.00, “Meta-models and Tools for Evolution Towards Component Systems”, and 2000-067855.02, “Tools and Techniques for Decomposing and Composing Software”.
References

1. Martín Abadi, Luca Cardelli, Pierre-Louis Curien, and Jean-Jacques Lévy. Explicit substitutions. Journal of Functional Programming, 1(4):375–416, October 1991.
2. Franz Achermann. Forms, Agents and Channels – Defining Composition Abstraction with Style. PhD thesis, University of Berne, January 2002.
3. Franz Achermann, Stefan Kneubuehl, and Oscar Nierstrasz. Scripting coordination styles. In António Porto and Gruia-Catalin Roman, editors, Coordination '2000, volume 1906 of LNCS, pages 19–35, Limassol, Cyprus, September 2000. Springer-Verlag.
4. Franz Achermann, Markus Lumpe, Jean-Guy Schneider, and Oscar Nierstrasz. Piccola – a small composition language. In Howard Bowman and John Derrick, editors, Formal Methods for Distributed Processing – A Survey of Object-Oriented Approaches, pages 403–426. Cambridge University Press, 2001.
5. Franz Achermann and Oscar Nierstrasz. Explicit Namespaces. In Jürg Gutknecht and Wolfgang Weck, editors, Modular Programming Languages, volume 1897 of LNCS, pages 77–89, Zürich, Switzerland, September 2000. Springer-Verlag.
6. Franz Achermann and Oscar Nierstrasz. Applications = Components + Scripts – A Tour of Piccola. In Mehmet Aksit, editor, Software Architectures and Component Technology, pages 261–292. Kluwer, 2001.
7. Farhad Arbab and Farhad Mavaddat. Coordination through channel composition. In F. Arbab and C. Talcott, editors, Coordination Languages and Models: Proc. Coordination 2002, volume 2315 of Lecture Notes in Computer Science, pages 21–38. Springer-Verlag, April 2002.
8. Gérard Berry and Gérard Boudol. The chemical abstract machine. Theoretical Computer Science, 96:217–248, 1992.
9. Gérard Boudol. The pi-calculus in direct style. In Conference Record of POPL '97, pages 228–241, 1997.
10. Rod Burstall and Butler Lampson. A kernel language for abstract data types and modules. Information and Computation, 76(2/3), 1984. Also appeared in Proceedings of the International Symposium on Semantics of Data Types, Springer, LNCS (1984), and as SRC Research Report 1.
11. Luca Cardelli and John C. Mitchell. Operations on records. In Carl A. Gunter and John C. Mitchell, editors, Theoretical Aspects of Object-Oriented Programming. Types, Semantics and Language Design, pages 295–350. MIT Press, 1993.
12. Silvano Dal-Zilio. Le calcul bleu: types et objects. Ph.D. thesis, Université de Nice – Sophia Antipolis, July 1999. In French.
13. Laurent Dami. Software Composition: Towards an Integration of Functional and Object-Oriented Approaches. Ph.D. thesis, University of Geneva, 1994.
14. Cédric Fournet and Georges Gonthier. A hierarchy of equivalences for asynchronous calculi. In Proceedings of ICALP '98, pages 844–855, 1998.
15. Cédric Fournet, Georges Gonthier, Jean-Jacques Lévy, Luc Maranget, and Didier Rémy. A calculus of mobile agents. In Proceedings of the 7th International Conference on Concurrency Theory (CONCUR '96), volume 1119 of LNCS, pages 406–421. Springer-Verlag, August 1996.
16. David Garlan, Robert Allen, and John Ockerbloom. Architectural mismatch: Why reuse is so hard. IEEE Software, 12(6):17–26, November 1995.
17. Andrew D. Gordon and Paul D. Hankin. A concurrent object calculus: Reduction and typing. In Proceedings HLCL '98. Elsevier ENTCS, 1998.
18. Kohei Honda and Mario Tokoro. An object calculus for asynchronous communication. In Pierre America, editor, Proceedings ECOOP '91, volume 512 of LNCS, pages 133–147, Geneva, Switzerland, July 15–19 1991. Springer-Verlag.
19. Stefan Kneubuehl. Typeful compositional styles. Diploma thesis, University of Bern, April 2003.
20. Markus Lumpe. A Pi-Calculus Based Approach to Software Composition. Ph.D. thesis, University of Bern, Institute of Computer Science and Applied Mathematics, January 1999.
21. Markus Lumpe, Franz Achermann, and Oscar Nierstrasz. A Formal Language for Composition. In Gary Leavens and Murali Sitaraman, editors, Foundations of Component Based Systems, pages 69–90. Cambridge University Press, 2000.
22. Mark Lutz. Programming Python. O'Reilly & Associates, Inc., 1996.
23. Massimo Merro and Davide Sangiorgi. On asynchrony in name-passing calculi. In Kim G. Larsen, Sven Skyum, and Glynn Winskel, editors, 25th Colloquium on Automata, Languages and Programming (ICALP) (Aalborg, Denmark), volume 1443 of LNCS, pages 856–867. Springer-Verlag, July 1998.
24. Wolfgang De Meuter. Agora: The story of the simplest MOP in the world — or — the scheme of object-orientation. In J. Noble, I. Moore, and A. Taivalsaari, editors, Prototype-based Programming. Springer-Verlag, 1998.
25. Robin Milner, Joachim Parrow, and David Walker. A calculus of mobile processes, part I/II. Information and Computation, 100:1–77, 1992.
26. Robin Milner and Davide Sangiorgi. Barbed bisimulation. In Proceedings ICALP '92, volume 623 of LNCS, pages 685–695, Vienna, July 1992. Springer-Verlag.
27. Uwe Nestmann and Benjamin C. Pierce. Decoding choice encodings. In Ugo Montanari and Vladimiro Sassone, editors, CONCUR '96: Concurrency Theory, 7th International Conference, volume 1119 of LNCS, pages 179–194, Pisa, Italy, August 1996. Springer-Verlag.
28. Oscar Nierstrasz. Contractual types. Submitted for publication, 2003.
29. Oscar Nierstrasz and Franz Achermann. Supporting Compositional Styles for Software Evolution. In Proceedings International Symposium on Principles of Software Evolution (ISPSE 2000), pages 11–19, Kanazawa, Japan, Nov 1-2 2000. IEEE.
30. Oscar Nierstrasz, Franz Achermann, and Stefan Kneubuehl. A guide to JPiccola. Technical report, Institut für Informatik, Universität Bern, Switzerland, 2003. Available from www.iam.unibe.ch/~scg/Research/Piccola.
31. Oscar Nierstrasz and Laurent Dami. Component-oriented software technology. In Oscar Nierstrasz and Dennis Tsichritzis, editors, Object-Oriented Software Composition, pages 3–28. Prentice-Hall, 1995.
32. Shin-ya Nishizaki. Programmable environment calculus as theory of dynamic software evolution. In Proceedings ISPSE 2000. IEEE Computer Society Press, 2000.
33. Benjamin C. Pierce and David N. Turner. Pict: A programming language based on the pi-calculus. In G. Plotkin, C. Stirling, and M. Tofte, editors, Proof, Language and Interaction: Essays in Honour of Robin Milner. MIT Press, May 2000.
34. Didier Rémy. Typing Record Concatenation for Free, chapter 10, pages 351–372. MIT Press, April 1994.
35. Davide Sangiorgi. Expressing Mobility in Process Algebras: First-Order and Higher-Order Paradigms. Ph.D. thesis, Computer Science Dept., University of Edinburgh, May 1993.
36. Davide Sangiorgi. Asynchronous process calculi: the first-order and higher-order paradigms (tutorial). Theoretical Computer Science, 253, 2001.
37. Masahiko Sato, Takafumi Sakurai, and Rod M. Burstall. Explicit environments. In Jean-Yves Girard, editor, Typed Lambda Calculi and Applications, volume 1581 of LNCS, pages 340–354, L'Aquila, Italy, April 1999. Springer-Verlag.
38. Nathanael Schärli. Supporting pure composition by inter-language bridging on the meta-level. Diploma thesis, University of Bern, September 2001.
39. Nathanael Schärli and Franz Achermann. Partial evaluation of inter-language wrappers. In Workshop on Composition Languages, WCL '01, September 2001.
40. Jean-Guy Schneider. Components, Scripts, and Glue: A conceptual framework for software composition. Ph.D. thesis, University of Bern, Institute of Computer Science and Applied Mathematics, October 1999.
41. Jean-Guy Schneider and Oscar Nierstrasz. Components, scripts and glue. In Leonor Barroca, Jon Hall, and Patrick Hall, editors, Software Architectures – Advances and Applications, pages 13–25. Springer-Verlag, 1999.
42. Mary Shaw and David Garlan. Software Architecture: Perspectives on an Emerging Discipline. Prentice-Hall, 1996.
43. Clemens A. Szyperski. Component Software. Addison Wesley, 1998.
44. David Walker. Objects in the π-calculus. Information and Computation, 116(2):253–271, February 1995.
45. Pawel T. Wojciechowski. Nomadic Pict: Language and Infrastructure Design for Mobile Computation. PhD thesis, Wolfson College, University of Cambridge, March 2000.
Specification and Inheritance in CSP-OZ
Ernst-Rüdiger Olderog and Heike Wehrheim
Department of Computing Science, University of Oldenburg, 26111 Oldenburg, Germany
{olderog,wehrheim}@informatik.uni-oldenburg.de
Abstract. CSP-OZ [16,18] is a combination of Communicating Sequential Processes (CSP) and Object-Z (OZ). It enables the specification of systems having both a state-based and a behaviour-oriented view using the object-oriented concepts of classes, instantiation and inheritance. CSP-OZ has a process semantics in the failures divergence model of CSP. In this paper we explain CSP-OZ and investigate the notion of inheritance. Behavioural subtyping relations between classes introduced in [50] guarantee the inheritance of safety and “liveness” properties. Keywords: CSP, Object-Z, failure divergence semantics, inheritance, safety and “liveness” properties, model-checking, FDR
1 Introduction
In contrast to the wide-spread use of object-oriented programming and specification languages, little is known about the properties enjoyed by systems constructed in the object-oriented style. Research on verification of object-oriented descriptions often takes place in the setting of object-oriented programming languages, for instance Java. The methods range from Hoare-style verification supported by theorem provers [25,36] via static checkers [30] to model-checking techniques [20]. Verification of object-oriented modeling languages focuses on UML (e.g. [28,40]). These approaches check properties of UML state machines by translating them into existing model-checkers. Although UML is an integrated formalism allowing the specification of data and behaviour aspects, existing model-checking techniques most often focus on the behavioural view. For a semantics of UML integrating different types of diagrams (including dynamic object creation and dynamically changing communication topologies) and its verification see [8,9]. Reasoning about object-oriented specifications represents a challenge in its own right. To describe the preservation of behavioural properties of classes under change, the concept of subtyping has been lifted from data types to objects by [1,31]. Whereas these approaches are restricted to state-based specifications (using e.g. Object-Z), [34] proposed definitions suitable for dealing with behaviour-oriented specifications (using e.g. CSP). A first systematic study of subtyping for specifications integrating state-based and behaviour-oriented views is [50].
This research is partially supported by the DFG under grant Ol/98-3.
Given a proof made with respect to one data-model, its reuse in another data-model extended by inheritance represents a major problem that must be overcome in order to build up libraries that support proofs about non-trivial applications. Existing approaches, based on shallow representations of subtyping via parametric polymorphism, are either geared towards abstractions of programs (like Eiffel, e.g. [21]) or specific data-models (like UML/OCL, [5]). So far, these techniques have not been applied to the combination of data-oriented and behavioural specifications. In this paper we study specification and inheritance in the language CSP-OZ [16,18], combining two existing specification languages for processes and data.
Specification of processes. Communicating Sequential Processes (CSP) were introduced by Hoare [22,23]. The central concepts of CSP are synchronous communication via channels between different processes, parallel composition and hiding of internal communication.
[Diagram: a sender performing the output c!e communicates over channel c with a receiver performing the input c?x.]
For CSP a rich mathematical theory comprising operational, denotational and algebraic semantics with consistency proofs has been developed [4,35,38]. Tool support comes through the FDR model-checker [37]. The name stands for Failure Divergence Refinement and refers to the standard semantic model of CSP, the failures divergence model, and its notion of process refinement.
Specification of data. Z was introduced in the early 80s in Oxford by Abrial as a set-theoretic and predicate language for the specification of data, state spaces and state transformations. It comprises the mathematical tool kit, a collection of convenient notations and definitions, and the schema calculus for structuring large state spaces and their transformations. A Z schema has a name, say S, and consists of variable declarations and a predicate constraining the values of these variables. It is denoted as follows:

  S
    declarations
    predicate

The first systematic description of Z is [46]. Since then the language has been published extensively (e.g. [52]) and used in many case studies and industrial projects. Object-Z is an object-oriented extension of Z [11,42,44]. It comprises the concepts of classes, instantiation and inheritance. Z and Object-Z come with the concept of data refinement. For Z there exist proof systems for establishing properties of specifications and refinements, such as Z/EVES [39] or HOL-Z based on Isabelle [27]. For Object-Z, type checkers exist. Verification support is less developed, except for an extension of HOL-Z [41].
Combination. CSP-OZ, developed by C. Fischer at the University of Oldenburg, is a combination of CSP and Object-Z. Object-Z is used to describe all data-dependent aspects of classes, viz. attributes and methods. The dynamic behaviour of classes, i.e. their protocols of interaction, is specified within CSP. CSP-OZ has been used in a number of case studies from the areas of telecommunication systems, production automation and satellite technology [3,49,33]. Verification of CSP-OZ specifications is supported by the FDR model-checker for CSP [14].
Structure of this paper. In Section 2 the combination CSP-OZ is introduced by way of an example. In Section 3 the semantics of CSP-OZ is reviewed. Section 4 is devoted to the inheritance operator in CSP-OZ and its semantics. In Section 5 inheritance of properties is studied in depth, and Section 6 concludes the paper.
2 The Combination CSP-OZ
There are various specialised techniques for describing individual aspects of system behaviour. But complex systems exhibit various behavioural aspects. This observation has led to research into the combination and semantic integration of specification techniques. One such combination is CSP-OZ [16,13,14,18], integrating CSP and Object-Z. Central to CSP-OZ is the notion of a class. The specification of a class C has the following format:

  C
    I    [interface]
    P    [CSP part]
    Z    [OZ part]
The interface I declares channel names and types to be used by the class. The CSP part uses a CSP process P to describe the desired sequencing behaviour on these channels. The OZ part is of the form

  Z
    st : State                      [state space]
    Init(st)                        [initial condition]
    com c(st, in?, out!, st′)       [communication schemas]
where the state space of C and its transformation is specified in the style of Object-Z. The state space itself is given by a schema State, here with the symbolic variable st ranging over State. The initial state of C is described by the Init schema, which restricts the value of st. For the channels c declared in the interface, communication schemas com c describe the state transformation induced by communicating along c with input values in? and output values out!. In Z, and hence Object-Z, the prime decoration is used for denoting the new values after the transformation. Thus com c depends on st, in?, out! and st′.
Example 1. To illustrate the use of CSP-OZ we present here part of the specification of a till [50] for the problem “cash point service” defined in [10]. Informally, the till is used by inserting a card and typing in a PIN, which is compared with the PIN stored on the card. In case of a successful identification, the customer may withdraw money from the account. The till is only one component of a larger system including banks, cards, customers, and money dispensers.
Global definitions. The CSP-OZ class Till makes use of two basic Z types:

  [Pin, CardID]

Pin represents the set of personal identification numbers and CardID the set of card identification numbers.
Interface. The Till is connected to its environment by several typed channels:

  chan getCustPin    : [ p? : Pin ]
  chan getCardPin    : [ p? : Pin ]
  chan pay           : [ a! : N ]
  chan updateBalance : [ a! : N; c! : CardID ]
The channel getCustPin expects a value p? of type Pin as its input, the channel getCardPin inputs a value p? of type Pin and outputs a value c! of type CardID, the channel pay outputs a natural number a! (the amount of money to be paid out to the customer), and the channel updateBalance is intended for communication with the bank and outputs an amount a! of money to be withdrawn from the account belonging to the card with identity c!. The full definition of the till comprises more interface channels (see [50]).
CSP part. This part specifies the order in which communications along the interface channels can occur. To this end, a set of recursive process equations is given. The main equation starts with the process identifier main. The symbol =ᶜ is used instead of an ordinary equals symbol to distinguish CSP process equations from Z equations.

  main    =ᶜ insertCard → Ident
  Ident   =ᶜ getCustPin → getCardPin → (idFail → Eject □ idSucc → Service)
  Eject   =ᶜ eject → main
  Service =ᶜ (stop → Eject □ withdraw → getAmount → pay → updateBalance → Eject)
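For readers who want to experiment with such specifications, the CSP part above translates almost directly into machine-readable CSP (CSPm), the input language of FDR. The following is a hypothetical sketch of ours, not taken from the paper; the data components of events are elided, just as in the class:

  -- CSPm sketch of the Till's CSP part; events carry no data here.
  channel insertCard, getCustPin, getCardPin, idFail, idSucc,
          eject, stop, withdraw, getAmount, pay, updateBalance

  MAIN    = insertCard -> IDENT
  IDENT   = getCustPin -> getCardPin ->
            (idFail -> EJECT [] idSucc -> SERVICE)
  EJECT   = eject -> MAIN
  SERVICE = stop -> EJECT
            [] withdraw -> getAmount -> pay -> updateBalance -> EJECT

Here [] is CSPm's ASCII rendering of the external choice operator □.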
The process main specifies the following behaviour. First the till gets the CardID via a communication along channel insertCard. Then the till checks the customer's identity by getting the customer's PIN via getCustPin, retrieving the PIN stored on the card via getCardPin, and comparing both. The communication idFail signals that this comparison failed; the communication idSucc signals that it was successful. In case of failure the card is ejected. In case of success, the service mode is entered, where the customer has the choice of stopping the interaction or withdrawing money. In the latter case the communication getAmount inputs the amount of money the customer wishes to withdraw, the communication pay initiates the money dispenser to pay out this amount, and the communication updateBalance informs the bank about the change. Note that in this particular CSP process no communication values are specified. They will be dealt with in the OZ part.
OZ part. This part specifies the state of the till and the effect of the communications on this state. The state consists of the following typed variables:
  currCard : CardID
  currPin, typedPin : Pin
  amount : N          [state space]
The effect of communications along the interface channels is specified using communication schemas. Here we give only some examples. The schema

  com insertCard
    ∆(currCard)
    c? : CardID
    currCard′ = c?

specifies that a communication along insertCard may only change the state variable currCard. This is the meaning of the ∆-list in the first line of the declaration part of this schema. The second line specifies that a communication along insertCard has an input value c? of type CardID. The predicate of the schema specifies that the effect of the schema is an assignment of this input value to the variable currCard. The schema

  com idSucc
    ∆()
    currPin = typedPin

specifies that a communication idSucc does not change any variable of the state (empty ∆-list) and is enabled only if the current PIN of the card is identical to the PIN typed in by the customer. The schema
  com updateBalance
    ∆()
    a! : N
    c! : CardID
    a! = amount ∧ c! = currCard

specifies that the amount a! of money to be withdrawn and the identity c! of the card are sent to the bank, where the balance of the account is updated.
Instances. An instance (object) t of the class Till is specified by a declaration t : Till. The instance t behaves like the class Till but with all channels ch renamed to ch.t, carrying its own identity t. A customer using till t might perform the following interaction with it, expressed as a CSP process:

  Customer =ᶜ insertCard.t.765 → getCustPin.t.4711 → withdraw.t → getAmount.t.100 → SKIP

To model the behaviour of several instances t1, ..., tn of the class Till the interleaving operator ||| of CSP can be used:

  t1 ||| ... ||| tn
or
  |||_{i=1,...,n} t_i
To be able to be connected to a finite set of tills, the class Bank has a parameter adr of type F Till for the addresses of the different tills:

  Bank[adr : F Till]
    chan updateBalance : [ t : adr; a! : N; c! : CardID ]
    ...

For example, a system comprising one bank b connected to tills t1, ..., tn can be specified by the process expression

  (b : Bank; t1, ..., tn : Till • b({t1, ..., tn}) ||_{updateBalance} (|||_{i=1,...,n} t_i))

where b is instantiated with the set {t1, ..., tn} and ||_{updateBalance} is the parallel composition enforcing synchronisation on the channel updateBalance between the bank b and the tills t1, ..., tn.
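Again purely for illustration, the interleaving of till instances and their synchronisation with the bank can be mimicked in CSPm. The following toy sketch (ours, not the paper's renaming semantics) models till identities as an index carried by each event:

  -- Two till instances interleaved; the bank synchronises on updateBalance.
  TillId = {1, 2}
  channel withdraw, pay, updateBalance : TillId

  TILL(t) = withdraw.t -> pay.t -> updateBalance.t -> TILL(t)
  BANK    = updateBalance?t -> BANK

  TILLS  = ||| t : TillId @ TILL(t)
  SYSTEM = BANK [| {| updateBalance |} |] TILLS

The sharing operator [| {| updateBalance |} |] forces bank and tills to agree on every updateBalance event, while all other events interleave freely.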
3 Semantics
Each class of a CSP-OZ specification denotes a process obtained by transforming the OZ part into a process that runs in parallel with the CSP part. First we briefly review the semantics of CSP and Object-Z.
3.1 Semantics of CSP
The standard semantics of CSP is the FD-semantics based on failures and divergences [38]. A failure is a pair (s, X) consisting of a finite sequence or trace s ∈ seq Comm over a set Comm of communications and a so-called refusal set X ∈ P Comm. Intuitively, a failure (s, X) describes that after engaging in the trace s the process can refuse to engage in any of the communications in X. Refusal sets allow us to make fine distinctions between different nondeterministic process behaviours; they are essential for obtaining a compositional definition of parallel composition in the CSP setting of synchronous communication when we want to observe deadlocks. Formally, we need the following sets of observations about process behaviour:

  Traces = seq Comm,  Refusals = P Comm,  Failures = Traces × Refusals.

A divergence is a trace after which the process can engage in an infinite sequence of internal actions. The FD-semantics of CSP is then given by two mappings F : CSP → P Failures and D : CSP → P Traces. For a CSP process P we write FD[[P]] = (F[[P]], D[[P]]). Certain well-formedness conditions relate the values of F and D (see [38], p. 192). The FD-semantics induces a notion of process refinement denoted by ⊑FD. For CSP processes P and Q this relation is defined as follows:

  P ⊑FD Q  iff  F[[P]] ⊇ F[[Q]] and D[[P]] ⊇ D[[Q]]

Intuitively, P ⊑FD Q means that Q refines P, i.e. Q is more deterministic and more defined than P.
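In FDR, such refinements are stated as assertions and checked automatically. A small, hypothetical CSPm example (the process names are ours):

  channel a, b

  SPEC = (a -> SPEC) |~| (b -> SPEC)  -- internal choice: may offer only a or only b
  IMPL = a -> IMPL                    -- resolves the choice: more deterministic

  assert SPEC [T=  IMPL   -- trace refinement
  assert SPEC [FD= IMPL   -- failures-divergences refinement

Both checks succeed, since IMPL merely resolves SPEC's internal nondeterminism.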
3.2 Semantics of Z and Object-Z
In Z an operation schema

  Op
    x : Dx
    x′ : Dx
    P(x, x′)

describes a transformation on the state space x specified by the predicate P(x, x′). Z comes with the usual notion of data refinement and operation refinement [52]. Given a relation ρ between an abstract and a concrete data domain, a concrete operation schema C Op refines an abstract operation schema A Op, denoted by

  A Op ⊑ρ C Op,
if C Op is more defined and more deterministic than A Op. For Object-Z, a history semantics comprising sequences of states and events (operation calls), as well as some more abstract semantics, are defined in [42]. We do not need these semantics here because we only use the state transformation view of the communication schemas in the OZ part.

3.3 Semantics of CSP-OZ
The semantics of CSP-OZ is defined in [16,18]. Each CSP-OZ class denotes a process in the failures divergence model of CSP. This is achieved by transforming the OZ part of such a class into a CSP process that runs in parallel and communicates with the CSP part of the class. The OZ part can hence be seen as describing all data-dependent orderings of communications. The process successively chooses one of the enabled methods, executes it (thereby changing the state space) and afterwards starts again. Consider a CSP-OZ class

  C
    I   [interface]
    P   [CSP part]
    Z   [OZ part]

also written horizontally as spec I P Z end, with an OZ part

  Z
    st : State                       [state space]
    Init(st)                         [initial condition]
    ...com c(st, in?, out!, st′)...  [one communication schema for each c in I]
where the notation com c(st, in?, out!, st′) indicates that this communication schema for c relates the state st to the successor state st′ and has input parameters in? and output parameters out!.
Transformation. The OZ part of the class is transformed into a CSP process OZMain defined by the following system of (parametrised) recursive equations for OZPart, using the (indexed) CSP operators for internal nondeterministic choice (⊓) and alternative composition (□):

  OZMain     = ⊓ st • OZPart(st)
  OZPart(st) = □ c, in? • ⊓ out!, st′ • c.in?.out! → OZPart(st′)
where st ranges over all states in State satisfying Init(st). Thus the process OZMain can nondeterministically choose any state st satisfying Init(st) to start with.
Further on, c ranges over all channels declared in I, and in? ranges over the set Inputs(c) such that the precondition of the communication schema for c holds, i.e.

  ∃ out! : Outputs(c); st′ : State • com c(st, in?, out!, st′).

Finally, for any chosen c and in?, the value out! ranges over the set Outputs(c), and st′ ranges over State such that com c(st, in?, out!, st′) holds. So the process OZPart(st) is ready for every communication event c.in?.out! along a channel c in I where, for the input values in?, the communication schema com c(st, in?, out!, st′) is satisfiable for some output values out! and successor state st′. For given input values in?, any such out! and st′ can be nondeterministically chosen to yield c.in?.out! and the next recursive call OZPart(st′). Thus input and output along channels c are modelled by a subtle interplay of the CSP alternative and nondeterministic choice.
Semantics of a class. A CSP-OZ class defines a template for all of its instances. The semantics of an instance is derived from the semantics of its class by filling in an instance name. Using parallel composition we define the semantics of a class in the failures divergence model:

  FD[[C]] = FD[[ P ||_{commonEvents} OZMain ]]
          = FD[[P]] ||FD_{commonEvents} FD[[OZMain]]

where ||FD_A denotes the semantic counterpart in the failures divergence model of the parallel composition ||_A, with synchronisation on the event set A (see e.g. [38]).
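To make the transformation concrete, here is a deliberately tiny, hypothetical instance of it in CSPm (all names and the state space are ours): the state is a single counter, Init fixes it to 0, channel inc is enabled only below a bound (playing the role of a schema precondition), and channel val outputs the current state:

  Max = 3
  channel inc
  channel val : {0..Max}

  OZPart(st) = (st < Max) & inc -> OZPart(st + 1)
               [] val!st -> OZPart(st)
  OZMain = |~| st : {0} @ OZPart(st)   -- Init(st) admits only st = 0

The guard (st < Max) & ... models the schema precondition, and the replicated internal choice |~| mirrors the ⊓ over initial states (trivial here, since Init is deterministic).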
Semantics of an instance. Suppose an instance o of class C is declared by o : C. Then o denotes a process which is obtained from FD[[C]] by applying a renaming operator that renames every channel ch in the interface I of C into ch.o. Thus events ch.d for some data element d now become ch.o.d, as described by the following definition due to [12]:

  FD[[o]] = FD[[ C[{ch : Chans; d : Data | ch.d ∈ I • ch.d → ch.o.d}] ]]

Refinement compositionality. By the above process semantics of CSP-OZ, the refinement notion ⊑FD is immediately available for CSP-OZ. In [18] it has been shown that CSP-OZ satisfies the principle of refinement compositionality, i.e. refinement of the parts implies refinement of the whole. Formally:
– Process refinement P1 ⊑FD P2 implies refinement in CSP-OZ:
  spec I P1 Z end ⊑FD spec I P2 Z end
– Data refinement Z1 ⊑ρ Z2 for a refinement relation ρ implies refinement in CSP-OZ:
  spec I P Z1 end ⊑FD spec I P Z2 end
4 Inheritance
Process refinement P ⊑FD Q in CSP stipulates that P and Q have the same interface. Often one wishes to extend the communication capabilities of a process or the operation capabilities of a class. This can be specified using the notion of inheritance, a syntactic relationship on classes: a superclass (or abstract class) A is extended to a subclass (or concrete class) C. The subclass should inherit the parts of the superclass. In CSP-OZ this is denoted as follows. Given a superclass A of the form

  A
    IA   [interface]
    PA   [CSP part]
    ZA   [OZ part]
we obtain a subclass C of A by referring to A using the inherit clause:

  C
    inherit A   [superclass]
    IC          [interface]
    PC          [CSP part]
    ZC          [OZ part]
The semantics of the inherit operator is defined in a transformational way, i.e. by incorporating the superclass A into C, yielding the following expanded version of C:

  C
    I   [interface]
    P   [CSP part]
    Z   [OZ part]
where I = IA ∪ IC, P is obtained from PA and PC by parallel composition, and Z is obtained from ZA and ZC by schema conjunction. More precisely, to obtain the CSP part P we first replace in PA and PC the process identifier main by new identifiers mainA and mainC respectively, then collect the resulting set of process equations, and add the equation

  main =ᶜ mainA ||_{commonEvents} mainC

modelling parallel composition of PA and PC. To obtain the OZ part Z, the corresponding schemas of ZA and ZC are conjoined:

  State = StateA ∧ StateC,
  Init  = InitA ∧ InitC,
  com c = com cA ∧ com cC   for all channels c in IA ∩ IC,
  com c = com cA            for all channels c in IA \ IC,
  com c = com cC            for all channels c in IC \ IA.
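The effect of composing the two CSP parts in parallel is easy to observe on a toy example; the following hypothetical CSPm sketch is ours, not from the paper:

  channel a, b, c

  MAIN_A = a -> b -> MAIN_A              -- superclass: every a is followed by b
  MAIN_C = a -> c -> MAIN_C              -- subclass part: every a is also followed by c
  MAIN   = MAIN_A [| {| a |} |] MAIN_C   -- synchronise on the shared event a

MAIN performs a, then b and c in either order, then a again: the ordering constraints of both parts are enforced simultaneously.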
Note that a subclass constructed in this way is not necessarily a refinement of its superclass.
5 Inheritance of Properties
In this section we study the preservation of properties under inheritance. The scenario we are interested in is the following: suppose we have a superclass A for which we have already verified that a certain property P holds. Now we extend A to a subclass C and would like to know under which conditions P also holds for C. This would allow us to check the conditions on the subclass and avoid re-verification. In principle we are thus interested in a reasoning which is similar to that used for refinement in CSP. In CSP, properties are preserved under refinement, i.e.

  P ⊑FD A ∧ A ⊑FD C ⇒ P ⊑FD C

holds. If inheritance is employed instead of refinement, this is in general not true. Inheritance allows one to modify essential aspects of a class and thus may destroy every property proven for the superclass. We thus have to require a closer relationship between super- and subclass in order to achieve inheritance of properties. A relationship which guarantees a certain form of property inheritance is behavioural subtyping [31]. Originally studied in state-based contexts, this concept has recently been extended to behaviour-oriented formalisms (see [15]) and is thus adequate for CSP-OZ with its failure divergence semantics. Behavioural subtyping guarantees substitutability while also allowing extension of functionality as introduced by inheritance. In the following we will look at two forms of behavioural subtyping and show that one of them preserves safety and the other also a form of liveness properties.

5.1 Safety: Trace Properties
Since CSP-OZ has a failure-divergence semantics, we use the CSP way of property specification. In CSP, properties are formalised by CSP processes. A property holds for a class A if the class refines the property. Since CSP offers different forms of refinement, there are also different forms of satisfaction. We say that a class satisfies a property with respect to safety issues if it is a trace refinement of the property; when failure-divergence refinement is used, a (limited form of) liveness is checked (see next section).
Definition 2. Let A be a class and P a CSP property (process). A satisfies the trace property P (or: A satisfies P in the trace semantics) iff traces(A) ⊆ traces(P) (or equally P ⊑T A).
We illustrate this by means of the cash point example. Consider the following class A0 with a behaviour as specified in Figure 1. For reasons of readability we only consider a very simple form of till.
Fig. 1. The simple till A0 (in the original, a state-transition diagram over the events idSucc, stop, withdraw and pay).
We want to specify that money is only paid out after the correct PIN code has been entered. As a CSP property process this gives:

  Seq  = idSucc → pay → Seq
  Safe = Seq ||| CHAOS(Σ \ {idSucc, pay})

Here the process CHAOS(S), where S is a set of events, is the chaotic process which can always choose to communicate as well as refuse events of S. It is defined by

  CHAOS(S) = STOP ⊓ (⊓ ev ∈ S • ev → CHAOS(S)).

A0 satisfies the trace property Safe since Safe ⊑T A0. Next we would like to know whether such a trace property P can be inherited by a subclass (or more specifically, a subtype) C. As a first observation we notice that C potentially has traces over a larger alphabet than A, since it can have a larger interface. This might immediately destroy the holding of a trace property. Nevertheless, a trace property might still hold in the sense that, as far as the operations of A are concerned, the ordering of operations as specified in P also holds in C.
Definition 3. Let A, C be classes with α(A) ⊆ α(C). C satisfies a trace property P w.r.t. α(A) iff traces(C) ↓ α(A) ⊆ traces(P) ↓ α(A).
Here α(A) denotes the alphabet of A, i.e. the set of events A may communicate, and ↓ denotes projection. The question we ultimately want to answer can now be precisely formulated: if A satisfies P, does C satisfy P w.r.t. α(A)? This is in fact the case when C is a trace subtype of A.
Definition 4. Let A, C be classes with α(A) ⊆ α(C) and N = α(C) \ α(A). Then C is a trace subtype of A, abbreviated A ⊑tr−st C, iff A ||| CHAOS(N) ⊑T C.
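Such checks run directly in FDR, whose CSPm input language has CHAOS built in. The sketch below is hypothetical: the transition structure of A0 is our guess from Figure 1, chosen to be consistent with the claims made about A0 in the text:

  channel idSucc, stop, withdraw, pay

  A0 = stop -> A0
       [] idSucc -> withdraw -> pay -> A0

  Seq  = idSucc -> pay -> Seq
  Safe = Seq ||| CHAOS(diff(Events, {idSucc, pay}))

  assert Safe [T= A0   -- A0 satisfies the trace property Safe

Here Events is CSPm's set of all declared events, so diff(Events, {idSucc, pay}) plays the role of Σ \ {idSucc, pay}.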
Intuitively, parallel composition with the chaotic process over the set of methods N says that C may have new methods in addition to A, and these can at any time be executed as well as refused, but they have to be independent from (interleaved with) the A-part. Safety properties are inherited by trace subtypes.
Theorem 5. Let A, C be processes with A ⊑tr−st C, and let P be a process formalising a trace property. If A satisfies the trace property P then C satisfies P w.r.t. α(A).
As an example we look at an extension of class A0. The till C0 depicted in Figure 2 extends A0 with a facility for viewing the current balance of the account. C0 is a trace subtype of A0 and thus inherits property Safe, i.e. C0 satisfies Safe w.r.t. α(A0).
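Continuing the hypothetical CSPm sketch above, a plausible reading of C0 and the trace-subtype check of Definition 4 are:

  channel view, display

  C0  = stop -> C0 [] idSucc -> C0'
  C0' = withdraw -> pay -> C0
        [] view -> display -> C0'   -- our guess at the balance-viewing branch

  NewEvents = {| view, display |}
  assert A0 ||| CHAOS(NewEvents) [T= C0   -- C0 is a trace subtype of A0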
Fig. 2. A class and a trace subtype (in the original, A0 next to C0; C0 adds the events view and display).
This completes the section on inheritance of safety properties. Next we study liveness properties.

5.2 “Liveness”: ⊑FD Properties
Liveness properties are checked in CSP by comparing the property process and the class with respect to their failures and divergences, i.e. by checking whether the class is a failure divergence refinement of the property. This yields a form of bounded liveness check: it can be proven that methods can be refused or, conversely, are always enabled after certain traces. Unbounded liveness, as expressible in temporal logic, cannot be specified in CSP.
Definition 6. Let A be a class and P a CSP property (process). A satisfies the liveness property P (or: A satisfies P in the FD semantics) iff FD[[A]] ⊆ FD[[P]] (or equally P ⊑FD A).
We illustrate this again on our till example. The property we would like to prove for class A0 concerns service availability: money can be withdrawn immediately after the PIN code has been verified. Formalised as a CSP property this is:

  Live = idSucc → withdraw → Live
       ⊓ pay → Live
       ⊓ stop → Live

Class A0 satisfies the liveness property Live since Live ⊑FD A0. Analogous to trace properties, we first have to define what preservation of a liveness property by a subclass should mean. Again, we have to face the fact that the subclass has additional functionality and thus its failures and divergences range over a larger alphabet. In principle we apply the same technique as for trace properties: we project the failures and divergences of the subclass down to the alphabet of the superclass and on this projection carry out the comparison with the property P.
Definition 7. Let A, C be classes with α(A) ⊆ α(C). C satisfies a liveness property P w.r.t. α(A) iff

  ∀(s, X) ∈ failures(C) ∃(t, Y) ∈ failures(P) : s ↓ α(A) = t ↓ α(A) ∧ X ∩ α(A) = Y ∩ α(A)

and

  ∀ s ∈ divergences(C) ∃ t ∈ divergences(P) : s ↓ α(A) = t ↓ α(A).

Liveness properties can be shown to be inherited by classes which are optimal subtypes of the superclass.
Definition 8. Let A, C be classes with α(A) ⊆ α(C) and N = α(C) \ α(A). Then C is an optimal subtype of A, abbreviated A ⊑ost C, iff A ||| CHAOS(N) ⊑FD C.
This definition lifts the idea of trace subtyping to the failure divergence semantics: in class C anything “new” is allowed in parallel with the behaviour of A as long as it does not interfere with this “old” part. Looking at class C0 in comparison with A0, we find that C0 is not an optimal subtype of A0. For instance, C0 has the pair (⟨idSucc, view⟩, Σ \ {display}) in its failure set, for which, projected down to the alphabet of A0, no corresponding pair can be found in failures(A0) (the crucial point is the refusal of withdraw). Class C1 as depicted in Figure 3, on the other hand, is an optimal subtype of class A0 and it indeed inherits property Live.
Theorem 9 (Wehrheim). Let A, C be classes with A ⊑ost C, and let P be a process formalising a liveness property. If A satisfies P in the FD semantics, then C satisfies P w.r.t. α(A) in the FD semantics.
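Both claims of this subsection can be rendered as FDR assertions, continuing the hypothetical sketch of A0 and C0 given earlier (⊓ becomes |~| in CSPm):

  Live = (idSucc -> withdraw -> Live)
     |~| (pay -> Live)
     |~| (stop -> Live)

  assert Live [FD= A0                      -- A0 satisfies Live
  assert A0 ||| CHAOS(NewEvents) [FD= C0   -- fails: C0 is not an optimal subtype

The second assertion fails with a counterexample essentially matching the failure (⟨idSucc, view⟩, Σ \ {display}) discussed above.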
Fig. 3. A class and an optimal subtype (in the original, A0 next to C1; C1 adds the event test without disturbing the behaviour of A0).
Fig. 4. Another optimal subtype (in the original, a diagram of C2, which adds the language-switching events german and english).
A more complex extension of the simple till which is also an optimal subtype is shown in Figure 4. This till allows switching between different languages, but switching does not affect the basic functionality. Since C2 is an optimal subtype of A0, we get by the previous theorem that C2 satisfies Live.
6 Concluding Remarks
In this paper we took the combined specification formalism CSP-OZ to define and study the inheritance of properties from superclasses to subclasses. Semantically, classes, instances, and systems in CSP-OZ denote processes in the standard failures divergence model of CSP. This allowed us to make full use of the well-established mathematical theory of CSP. In the case of systems with finite-state CSP parts and finite data types in the OZ parts, the FDR model-checker for CSP can be applied to automatically verify refinement relations between CSP-OZ specifications [14] and subtyping relations [51]. Optimal subtyping is a strong requirement for subclasses to satisfy. As future work it would be interesting to study conditions under which specific properties are inherited.
Related Work. A number of other combinations of process algebra with formalisms for describing data exist today. A comparison of approaches for
combining Z with a process algebra can be found in [17]. Such integrations include Timed CSP and Object-Z (TCOZ) [32], B and CSP [6], and Z and CCS [47,19]. A similar type of integration which was developed much earlier than these approaches is LOTOS [2]. LOTOS adopts a number of operators from CCS and CSP. The first design of LOTOS contained ACT-One, a language for specifying abstract data types, for describing data. This proved to be unacceptable for industrial users, and the newest development, called E-Lotos [26] (standardised by ISO), instead uses elements from functional languages. Closest to the combination CSP-OZ is Object-Z/CSP due to Smith [43]. There, CSP operators serve to combine Object-Z classes and instances. Thus Object-Z classes are assigned a semantics in the failures divergence model of CSP, just as done here for CSP-OZ. This semantics is obtained as an abstraction of the more detailed history semantics of Object-Z [42]. In contrast to CSP-OZ there is no CSP part inside classes. As we have seen in the example, the CSP part is convenient for specifying sequencing constraints on the communication events. Both CSP-OZ and Object-Z/CSP have been extended to deal with real-time [24,45]. The issue of inheritance of properties by subtypes has been treated by van der Aalst and Basten [48]. They deal with net-specific properties like safety (of nets), deadlock freedom and free choice. Leavens and Weihl [29] show how to verify object-oriented programs using a technique called “supertype abstraction”. This technique is based on the idea that subtypes need not be re-verified once a property has been proven for their supertypes. In their study they have to take particular care about aliasing, since in object-oriented programs several references may point to the same object, and thus an object may be manipulated in several ways. Subtyping for object-oriented programs has to avoid references which are local to the supertype but accessible in the subtype. Preservation of properties is also an issue in transformations within the language UNITY proposed by Chandy and Misra [7]. The superposition operator in UNITY is a form of parallel composition which requires that the new part does not make assignments to underlying (old) variables. This is close to the non-modification condition we used in one pattern. Superposition preserves all properties of the original program.
References

1. P. America. Designing an object-oriented programming language with behavioural subtyping. In J.W. de Bakker, W.P. de Roever, and G. Rozenberg, editors, REX Workshop: Foundations of Object-Oriented Languages, number 489 in LNCS. Springer, 1991.
2. T. Bolognesi and E. Brinksma. Introduction to the ISO specification language LOTOS. Computer Networks and ISDN Systems, 14:25–59, 1987.
3. J. Bredereke. Maintaining telephone switching software requirements. IEEE Communications Magazine, 40(11):104–109, 2002.
4. S.D. Brookes, C.A.R. Hoare, and A.W. Roscoe. A theory of communicating sequential processes. Journal of the ACM, 31:560–599, 1984.
5. A. Brucker and B. Wolff. A proposal for a formal OCL semantics in Isabelle/HOL. In Proc. International Conference on Theorem Proving in Higher Order Logics (TPHOLs), LNCS. Springer, 2002.
6. M. Butler. csp2B: A practical approach to combining CSP and B. In J. Wing, J. Woodcock, and J. Davies, editors, FM'99: Formal Methods, number 1708 in Lecture Notes in Computer Science, pages 490–508. Springer, 1999.
7. K.M. Chandy and J. Misra. Parallel Program Design – A Foundation. Addison-Wesley, 1988.
8. W. Damm, B. Josko, A. Pnueli, and A. Votintseva. Understanding UML: A Formal Semantics of Concurrency and Communication in Real-Time UML. In F.S. de Boer, M. Bonsangue, S. Graf, and W.P. de Roever, editors, Formal Methods of Components and Objects (FMCO'02), LNCS. Springer, 2003. (this volume).
9. W. Damm and B. Westphal. Live and Let Die: LSC-based Verification of UML-Models. In F.S. de Boer, M. Bonsangue, S. Graf, and W.P. de Roever, editors, Formal Methods of Components and Objects (FMCO'02), LNCS. Springer, 2003. (this volume).
10. B. T. Denvir, J. Oliveira, and N. Plat. The Cash-Point (ATM) 'Problem'. Formal Aspects of Computing, 12(4):211–215, 2000.
11. R. Duke, G. Rose, and G. Smith. Object-Z: A specification language advocated for the description of standards. Computer Standards and Interfaces, 17:511–533, 1995.
12. C. Fischer, E.-R. Olderog, and H. Wehrheim. A CSP view on UML-RT structure diagrams. In H. Husmann, editor, Fundamental Approaches to Software Engineering, volume 2029 of LNCS, pages 91–108. Springer, 2001.
13. C. Fischer and G. Smith. Combining CSP and Object-Z: Finite or infinite trace-semantics? In T. Mizuno, N. Shiratori, T. Higashino, and A. Togashi, editors, Proceedings of FORTE/PSTV'97, pages 503–518. Chapman & Hall, 1997.
14. C. Fischer and H. Wehrheim. Model-checking CSP-OZ specifications with FDR. In K. Araki, A. Galloway, and K. Taguchi, editors, Integrated Formal Methods, pages 315–334. Springer, 1999.
15. C. Fischer and H. Wehrheim. Behavioural subtyping relations for object-oriented formalisms. In T. Rus, editor, AMAST 2000: International Conference on Algebraic Methodology And Software Technology, number 1816 in Lecture Notes in Computer Science, pages 469–483. Springer, 2000.
16. C. Fischer. CSP-OZ: A combination of Object-Z and CSP. In H. Bowman and J. Derrick, editors, Formal Methods for Open Object-Based Distributed Systems (FMOODS'97), volume 2, pages 423–438. Chapman & Hall, 1997.
17. C. Fischer. How to combine Z with a process algebra. In J. Bowen, A. Fett, and M. Hinchey, editors, ZUM'98: The Z Formal Specification Notation, volume 1493 of LNCS, pages 5–23. Springer, 1998.
18. C. Fischer. Combination and Implementation of Processes and Data: From CSP-OZ to Java. PhD thesis, Bericht Nr. 2/2000, University of Oldenburg, April 2000.
19. A. J. Galloway and W. Stoddart. An operational semantics for ZCCS. In M. Hinchey and Shaoying Liu, editors, Int. Conf. of Formal Engineering Methods (ICFEM). IEEE, 1997.
20. J. Hatcliff and M. Dwyer. Using the Bandera tool set to model-check properties of concurrent Java software. In K.G. Larsen, editor, CONCUR 2001, LNCS. Springer, 2001.
21. S. Helke and T. Santen. Mechanized analysis of behavioral conformance in the Eiffel base libraries. In M. Butler, L. Petre, and K. Sere, editors, Proceedings of the FME 2001, LNCS. Springer, 2001.
22. C.A.R. Hoare. Communicating sequential processes. CACM, 21:666–677, 1978.
23. C.A.R. Hoare. Communicating Sequential Processes. Prentice Hall, 1985.
24. J. Hoenicke and E.-R. Olderog. Combining specification techniques for processes, data and time. In M. Butler, L. Petre, and K. Sere, editors, Integrated Formal Methods (IFM 2002), volume 2335 of LNCS, pages 245–266. Springer, 2002.
25. M. Huisman and B. Jacobs. Java Program Verification via a Hoare Logic with Abrupt Termination. In T. Maibaum, editor, Fundamental Approaches to Software Engineering (FASE 2000), volume 1783 of LNCS, pages 284–303. Springer, 2000.
26. ISO. Final committee draft on enhancements to LOTOS. ISO/IEC JTC1/SC21, WG7 Enhancements to LOTOS, 1998. ftp://ftp.dit.upm.es/pub/lotos/elotos/Working.Docs/.
27. Kolyang. HOL-Z – An Integrated Formal Support Environment for Z in Isabelle/HOL. PhD thesis, Univ. Bremen, 1997. Shaker Verlag, Aachen, 1999.
28. D. Latella, I. Majzik, and M. Massink. Automatic verification of a behavioural subset of UML statechart diagrams using the SPIN model-checker. Formal Aspects of Computing, 11:430–445, 1999.
29. G.T. Leavens and W.E. Weihl. Specification and verification of object-oriented programs using supertype abstraction. Acta Informatica, 32:705–778, 1995.
30. K. R. M. Leino. Extended static checking: A ten-year perspective. In R. Wilhelm, editor, Informatics – 10 Years Back, 10 Years Ahead, volume 2000 of LNCS, pages 157–175. Springer, 2001.
31. B. Liskov and J. Wing. A behavioural notion of subtyping. ACM Transactions on Programming Languages and Systems, 16(6):1811–1841, 1994.
32. B. P. Mahony and J.S. Dong. Blending Object-Z and Timed CSP: An introduction to TCOZ. In The 20th International Conference on Software Engineering (ICSE'98), pages 95–104. IEEE Computer Society Press, April 1998.
33. A. Mota and A. Sampaio. Model-checking CSP-Z: strategy, tool support and industrial application. Science of Computer Programming, 40(1), 2001.
34. O. Nierstrasz. Regular types for active objects. In O. Nierstrasz and D. Tsichritzis, editors, Object-Oriented Software Composition, pages 99–121. Prentice Hall, 1995.
35. E.-R. Olderog and C.A.R. Hoare. Specification-oriented semantics for communicating processes. Acta Inform., 23:9–66, 1986.
36. A. Poetzsch-Heffter and J. Meyer. Interactive verification environments for object-oriented languages. Journal of Universal Computer Science, 5(3):208–225, 1999.
37. A.W. Roscoe. Model-checking CSP. In A.W. Roscoe, editor, A Classical Mind — Essays in Honour of C.A.R. Hoare, pages 353–378. Prentice-Hall, 1994.
38. A.W. Roscoe. The Theory and Practice of Concurrency. Prentice-Hall, 1997.
39. M. Saaltink. The Z/EVES system. In J. Bowen, M. Hinchey, and D. Till, editors, ZUM'97, volume 1212 of LNCS, pages 72–88. Springer, 1997.
40. T. Schäfer, A. Knapp, and S. Merz. Model Checking UML State Machines and Collaborations. In Workshop on Software Model Checking, volume 55 of ENTCS, 2001.
41. G. Smith, F. Kammüller, and T. Santen. Encoding Object-Z in Isabelle/HOL. In D. Bert, J.P. Bowen, M.C. Henson, and K. Robinson, editors, ZB 2002: Formal Specification and Development in Z and B, volume 2272 of LNCS, pages 82–99. Springer, 2002.
42. G. Smith. A fully abstract semantics of classes for Object-Z. Formal Aspects of Computing, 7:289–313, 1995.
43. G. Smith. A semantic integration of Object-Z and CSP for the specification of concurrent systems. In J. Fitzgerald, C.B. Jones, and P. Lucas, editors, Formal Methods Europe (FME'97), volume 1313 of LNCS, pages 62–81. Springer, 1997.
44. G. Smith. The Object-Z Specification Language. Kluwer Academic Publisher, 2000.
45. G. Smith. An integration of real-time Object-Z and CSP for specifying concurrent real-time systems. In M. Butler, L. Petre, and K. Sere, editors, Integrated Formal Methods (IFM 2002), volume 2335 of LNCS, pages 267–285. Springer, 2002.
46. J.M. Spivey. The Z Notation: A Reference Manual. Prentice-Hall International Series in Computer Science, 2nd edition, 1992.
47. K. Taguchi and K. Araki. Specifying concurrent systems by Z + CCS. In International Symposium on Future Software Technology (ISFST), pages 101–108, 1997.
48. W.M.P. van der Aalst and T. Basten. Inheritance of Workflows – An approach to tackling problems related to change. Theoretical Computer Science, 270(1-2):125–203, 2002.
49. H. Wehrheim. Specification of an automatic manufacturing system – a case study in using integrated formal methods. In T. Maibaum, editor, FASE 2000: Fundamental Approaches to Software Engineering, number 1783 in LNCS, pages 334–348. Springer, 2000.
50. H. Wehrheim. Behavioural subtyping in object-oriented specification formalisms. University of Oldenburg, Habilitation Thesis, 2002.
51. H. Wehrheim. Checking behavioural subtypes via refinement. In B. Jacobs and A. Rensink, editors, FMOODS 2002: Formal Methods for Open Object-Based Distributed Systems, pages 79–93. Kluwer, 2002.
52. J. Woodcock and J. Davies. Using Z — Specification, Refinement, and Proof. Prentice-Hall, 1996.
Model-Based Testing of Object-Oriented Systems
Bernhard Rumpe
IRISA-Université de Rennes 1, Campus de Beaulieu, Rennes, France
and Software & Systems Engineering, TU München, Germany
Abstract. This paper discusses a model-based approach to testing as a vital part of software development. It argues that an approach using models as the central development artifact needs to be added to the portfolio of software engineering techniques, to further increase the efficiency and flexibility of development as well as the quality and reusability of results. Test case modeling is then examined in depth and related to an evolutionary approach to model transformation. A number of test patterns are proposed that have proven helpful in the design of testable object-oriented systems. In contrast to other approaches, this approach uses explicit models for test cases instead of trying to derive (many) test cases from a single model.
1 Portfolio of Software Engineering Techniques

Software has become a vital, but often invisible, part of our lives. Embedded forms of software are part of almost any technical device. The average household uses several computers, and the internet and telecommunication world has considerably changed our lives. Software is used for a variety of jobs. It can be as small as a simple script or as complex as an entire operating or enterprise resource planning system. For the near future, we can be rather sure that we will not have a single notation or process that can cover the diversity of today's development projects. Projects are too different in their application domain, size, need for reliability, time-to-market pressure, and the skills and demands of the project participants. Even the UML [OMG02], which is regarded as a de-facto standard, is seen as a family of languages rather than a single notation, and by far doesn't cover all needs. This leads to an ongoing proliferation of methods, notations, principles, techniques and tools in the software engineering domain that is at least partly influenced by practical applications of formal methods. On the one hand, methods like Extreme Programming [Bec99] and Agile Software Development [Coc02] even discourage the long well-known distinction between analysis, design and implementation activities and abandon all documentation activities in favor of rigorous test suites. On the other hand, upcoming development tools allow increasing amounts of code to be generated from UML models, thus supporting the OMG's initiative on “Model Driven Architecture” (MDA) [OMG01]. MDA's primary purpose is to decouple platform-independent models from platform-specific,
technical information. This should increase the reusability of both. Code generation, however, today focuses pretty much on the generation of the productive system. Generation of test code from models is still a side issue. In particular, the question of what a good test model should look like is examined in this paper. In general, we can observe that in the foreseeable future we will have a portfolio of software engineering techniques that enables developers and managers to select appropriate processes and tools for their projects. Today, however, it is not quite clear which elements the portfolio should have, how they relate, when they are applicable, and what their benefits and drawbacks are. The software and systems engineering community therefore must reconsider and extend its portfolio of software engineering techniques, incorporating new ideas and concepts, but also try to scientifically assess the benefits and limits of new approaches. For example:
• Lightweight projects that don't produce requirement and design documentation need intensive communication and can hardly be split into independent subprojects. Thus they don't scale up to large projects. But where are the limits? A guess is around 10 people, but larger projects have reportedly been “successful” [RS02].
• Formal methods have built up a large body of knowledge (see for example [ABRS03,JKW03,LCCRC03]), but how can this knowledge be applied successfully and in a goal-oriented way in today's projects? A reasonable guess is that formal methods apply best if embodied in practical tools, using practical and well-known notations, without exposing the user directly to the formal method.
• Product reliability often need not be 100% for all developments, and not in the first iteration already. But how to predict reliability from project metrics, and how to adapt the project to increase reliability and accuracy to the desired level while minimizing the project/product costs? Thus, in contrast to applying formal methods for verification purposes, the use of formal techniques for test case specification and metrics of test coverage does not give 100% reliability, but in practice has a much better cost/benefit ratio.
Based on this observation we will in the following examine the modeling of test cases using several of the UML-like notations, arguing that this technique should be a new item in the broad portfolio of SE techniques. For this purpose, we develop our interpretation of the UML notation used in the context of test case modeling, which gives us the justification to regard that notation as being backed up by a formal technique (without explicitly referring to that formal method). Section 2 discusses synergies and problems of using models for a variety of activities, including programming. Section 3 establishes the general needs of successful testing strategies. In Section 4 the scenario of a model-based test approach is discussed. Sections 5 and 6 present several test patterns that are useful for making an object-oriented design testable. While Section 5 concentrates on basic test patterns, Section 6 presents more specific test patterns for distributed systems. Section 7 finally discusses the benefits of an evolutionary approach to modeling in combination with an intensive, model-based test approach. In particular, the usability of tests as invariant observations for model transformations is explored. For the sake of conceptual discussion, technical details are omitted, but can be found in [Rum03].
2 Modeling Meets Programming

UML [OMG02] has undoubtedly become the most popular modeling language for software-intensive systems used today. Models can be used for quite a variety of purposes. Besides informal sketches that are used for communication, e.g. by being drawn on paper and posted on a wall, the most common are:
• Semi-precisely defined diagrams are used to document the part of the requirements that is not written in plain English.
• Architecture and designs are captured and documented with models. In practice, these models are increasingly often used for code generation.
More sophisticated and therefore less widespread uses of models are the analysis of certain quality attributes (such as message throughput, responsiveness or failure likelihood) and the development of tests from models. Many UML-based tools today offer functionality to directly simulate models or to generate at least parts of the code. As tool vendors work hard on continuously improving this feature, a sublanguage of UML will become a high-level programming language, and modeling at this level becomes identical to programming. This raises a number of interesting questions:
• Is it critical for a modeling language to also be used as a programming language? For example, analysis and design models may become overloaded with details that are not of interest yet, because modelers are addicted to executability.
• Is a future version of the UML expressive enough to describe systems completely, or will it be accompanied by conventional languages? How well are these integrated?
• What will the toolset of the future look like, and how will it handle round-trip engineering (i.e. mapping code and diagrams in both directions)?
• What implications does an executable UML have on the development process?
In [Rum03,Rum02] we have discussed these issues and have demonstrated how the UML in combination with Java may be used as a high-level programming language. But UML can be used not only for modeling the application, but, more importantly, for modeling tests on various levels (class, integration, and system tests) as well. For this purpose we need executable test models, as testing is in general the process of executing a program with the intent of identifying faults [Mye79,Bin99]. Executable models are usually less abstract than design models, but they are still more compact and abstract than the implementation. The same holds for test models versus manually implemented tests. One advantage of using models for test case description is that application-specific parts are modeled with UML diagrams, while technical issues, such as connection to frameworks, error handling, persistence, or communication, are handled by the parameterized code generator. This basically allows us to develop models that are independent of any technology and platform, as for example proposed in [SD00]. Platform-dependent elements are added only during the generation process. When the technology changes, we only need to update the generator; the application-defining models as well as the test models can be reused directly. This concept also directly
supports the above mentioned MDA approach [Coc02] of the OMG. Another important advantage is that both the production code and the automatically executable tests at any level are modeled by the same UML diagrams. Developers therefore use a single, homogeneous language to describe implementation and tests. This makes tests available already at the beginning of the coding activities and leads to a development method similar to the "test first approach" [Bec01,LF02]. Some of the UML models (mainly class diagrams and statecharts) are used constructively, others are used for test case definition (mainly OCL, sequence and enhanced object diagrams). Fig. 1 illustrates the key mappings.
[Figure: class diagrams, statecharts, object diagrams, sequence diagrams, and OCL are mapped by code and test case generation to Java production code and test code; the thickness of each arrow indicates the importance of that generation path.]
Fig. 1. Mapping of UML-models to code and test code.
As a consequence of the various possible forms of model use, we identify the notions of diagram and model. Thus a model is a coherent piece of information, denoted in a diagrammatic or textual notation, that describes an abstraction of the desired system. Multiple models can describe various aspects of the system. We also allow models to be composed of sub-models, as is the case with test models. This is in slight contrast to approaches where a model is a virtual entity within a tool and the user manipulates it indirectly through (diagrammatic) views. The latter approach has shown some difficulties when using models for test case definition.
3 Testing Strategies

[Bin99,Mye79], among others, show that there is a huge variety of testing strategies and testing goals. While tests in the small usually try to identify faults, test suites and coverage metrics can be used to estimate the quality of the system in terms of the absence of faults. The "success" of a test can therefore be seen in two ways, but we follow the
general line and define a test to fail when an abnormal behavior (failure) shows that there is at least one fault in the system, meaning that the test and the system do not fit together. Testing can be done manually or automated. The widespread use of JUnit [BG99] shows that automated tests have gained considerable attention in recent years, because they allow tests to be "reused" in the form of regression tests on evolving systems without actually knowing what each test does. This allows very small iterations with continuous integration and the use of refactoring techniques [Fow99] to improve the code structure. Automated tests ensure a low defect rate and continuous progress, whereas manual tests would very rapidly lead to exhausted testers. To summarize, the characteristics of the tests we are aiming at are:
• Tests run the system – in contrast to static analyses.
• Tests are automated, to prevent project members from getting bored with tests (or, alternatively, to prevent a system that isn't tested enough).
• Automated tests build the test data, run the test and examine the result automatically. Success or failure of the test is automatically observed during the test run.
• A test suite also defines a system that runs together with the tested production system. The purpose of this extended system is to run the tests in an automated form.
• A test is exemplary. A test uses particular values for the input data, the test data.
• A test is repeatable and deterministic. For the same setup, the same results are produced.
In particular the last point is tricky to achieve when the system under test (SUT) is distributed or has side effects. Specific care has to be taken to deal with these situations. Faults may also occur without being observable, because they are hidden in the SUT or cannot be traced in a distributed system. That is why the TTCN standard [TTCN92] also allows "inconclusive" and "none" as test results. We instead strongly demand that systems be built in a way that allows testing to be achieved properly. At least in the domain of object-oriented software systems this is a realistic demand. After discussing the structure of a model-based test in the following section, we will discuss how object-oriented systems should be designed to assist automated testing.
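To make these characteristics concrete, the following JUnit-style test sketches the build–run–examine cycle. It is only an illustration: the Auction and Bid classes and their methods are hypothetical stand-ins for the online auction example used later, not code from this paper.

// Minimal hypothetical stand-ins for the auction example classes.
class Auction {
    private int bids = 0;
    void handleBid(Bid b) { bids++; }
    int getNumberOfBids() { return bids; }
}
class Bid { }

public class AuctionTest extends junit.framework.TestCase {
    public void testBidIsCounted() {
        Auction auction = new Auction();            // 1. build the test data
        auction.handleBid(new Bid());               // 2. run the test
        assertEquals(1, auction.getNumberOfBids()); // 3. examine the result
    }
}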
4 Model-Based Testing

The use of models for the definition of tests and production code can be manifold:
• Code, or at least code frames, can be generated from a design model.
• Test cases can be derived from an analysis or design model that is not used/usable for constructive generation of production code. For example, behavioral models, such as statecharts, can be used to derive test cases that cover states, transitions or even larger subsets of their paths.
• The models themselves can be used to describe test cases or at least some part thereof.
[Figure: a test consists of the test data, given as an object diagram (OD) over the objects under test (o1..o5); a test driver, given as a sequence diagram (SD) or a method call; and the expected result and/or an OCL contract as test oracle (OD + OCL).]
Fig. 2. Structure of a test modeled with object diagrams (OD), sequence diagram (SD) and the Object Constraint Language (OCL).
The first two uses are already discussed e.g. in [Rum02] and [BL01]. Therefore, in this section we concentrate on the development of models that define tests. A typical test, as shown in Fig. 2, consists of a description of the test data, the test driver and an oracle characterizing the expected test result. In object-oriented environments, test data can usually be described by an object diagram (OD). The object diagram in Fig. 3 shows the necessary objects as well as concrete values for their attributes and the linking structure.
[Figure: object diagram "Ready" with an object a1213:Auction (long auctionIdent = 1213, String title = "Copper 413tons", /int numberOfBids = 0) linked to timePol:ConstantTimingPolicy (int status = TimingPolicy.READY_TO_GO, boolean isInExtension = false, int extensionTime = 180).]
Fig. 3. Object diagram (OD) describing the test data as a particular situation in an online auction that has not yet started (see the full example in [Rum03,Rum03b]).
The test driver can be defined using a simple method call or, if more complex, modeled by a sequence diagram (SD). An SD has the considerable advantage that not only the triggering method calls can be described, but it is also possible to model desired interactions and to check object states during the test run. For this purpose, the Object Constraint Language (OCL, [WK98]) is used. In the sequence diagram in Fig. 4, an OCL constraint at the bottom ensures that the new closing time of the auction is set to the time when the bid was submitted (bid.time) plus the extension time to allow competitors to react (the auction system using this structure is partly described in [Rum03,Rum03b]). It is effective to model the test oracle using a combination of an object diagram and globally valid OCL properties. The object diagram in this case serves as a property description and can therefore be rather incomplete, just focusing on desired effects. The OCL constraints can be taken from the set of general invariants or can be defined as specific properties. In practice, it turns out that a high degree of reuse is possible through the following techniques:
[Figure: sequence diagram (SD) in which the test driver sends «trigger» handleBid(bid) to copper:Auction, which calls validateBid(bid) on :BidPolicy (returning OK) and getNewClosingTime(bid) on :TimingPolicy (returning t); the OCL constraint t.time == bid.time + extensionTime describes a property that holds during the test run.]
Fig. 4. A sequence diagram (SD) describing the trigger of a test driver, predicting some interactions, together with an OCL property that holds at that point in the test.
• Well prepared test data can be reused for many tests. Often only a handful of basic test structures is necessary; from those, the specifically desired test data can be derived by small adaptations, e.g. replacing a single attribute value or adding a certain object. Having an explicit, graphically depicted basic model of the data structure at hand increases its reusability for specific adaptations.
• Test data can be composed of several object diagrams, describing different parts of the data.
• Test oracles can be defined using a combination of an object diagram and globally valid OCL properties. The resulting object diagram can be rather small, describing deltas only, and can be derived from the test data diagram. The OCL properties can be reused as they are usually globally valid invariants.
As already mentioned, being able to use the same, coherent language to model the production system and the tests allows for a good integration of both tasks. It allows the developer to immediately define tests for the constructive model being developed. It is therefore feasible that, in a kind of "test-first modeling approach", the test data in the form of possible object structures is developed before the actual implementation.
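The "delta" style of oracle can be mimicked directly in test code: only the attributes named in the oracle object diagram are checked, and globally valid invariants are reused as additional assertions. The sketch below is our illustration; the AuctionState fields are invented.

// Hypothetical oracle in code form: only the delta described by the
// oracle object diagram is checked, plus one reused global invariant.
class AuctionState {
    int numberOfBids = 1;
    boolean isInExtension = true;
}

class AuctionOracle extends junit.framework.Assert {
    static void check(AuctionState a) {
        assertEquals(1, a.numberOfBids);  // delta from the oracle OD
        assertTrue(a.isInExtension);      // delta from the oracle OD
        assertTrue(a.numberOfBids >= 0);  // reused global invariant (OCL)
    }
}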
5 Test Patterns

In the last few years a number of Agile Methods have been defined that share certain characteristics, described in [AM03]. Among these, Extreme Programming (XP) [Bec99] is the most widely used and discussed method. One of the most important XP characteristics is that it uses automated tests at all stages. Practical experience shows that when this is done properly, the defect rate is considerably low [RS02]. Furthermore, automation allows tests to be repeated continuously in the form of regression tests. Thus the quality of the result is ensured through a strong emphasis on testing activities, ideally on development of the tests before the production code (the "test
first approach" [LF02]). When using UML models for test design, the development project should become even more efficient. However, practical experience shows that there are a number of obstacles that need to be overcome to enable model-based testing. In particular, there are object-oriented architectures that exhibit serious problems that prevent tests. It is therefore important to identify those problems and offer appropriate and effective solutions. In the remainder of this section, we provide several solutions for a number of problems that typically occur and that we also experienced, e.g. in the online auction system. These solutions are defined in the form of applicable test patterns, similar to the design patterns of [GHJV94]. Indeed, some of the test patterns are based on design patterns, such as singleton, adapter or factory, to achieve a testable design. Unlike [Bin99], we only provide the essential structure and a short explanation of the patterns in this article and refer to [Rum03] for a more detailed description. A test pattern description typically consists of several parts, describing the intention, how to apply it, the resulting structure, example implementations and a discussion of the pros and cons. Often the structure itself appears as a simple concept, and it is the method part, describing practical knowledge of its applicability, that makes a pattern useful.
[Figure: class diagram (CD) in which Auction is linked to «interface» BiddingPolicy, «interface» TimingPolicy, and bidder Person (roles auctions, participants); the dummies BidPolDummy, TimePolDummy and PersonDummy replace the ordinary objects through subclassing.]
Fig. 5. Class diagram showing how dummies are added to the system.
Dummies for the Test Context

It has become a primary testing technique to use dummies (also "stubs" or "mocks") to replace parts of the system and thus better expose the tested part to the test driver. Only object-oriented concepts, namely inheritance and dynamic binding (also known as polymorphic replacement of objects), comfortably allow building object structures that are testable with dummies. Fig. 5 shows the principle in one class diagram: an auction object is isolated by replacing its context completely with dummies. Sometimes a dummy just does nothing, but often it is also necessary to feed back specific values to keep the test going in the desired direction. Practical experience shows that this should normally not be achieved through various dummy subclasses,
but through parameterized dummies, whose behavior during the test can be determined via constructor parameters. This, for example, allows predefining and storing the results of queries given back to the calling SUT, just to see how the SUT reacts to that data.

Remembering Interaction and Results

A typical application of a dummy is to prevent side effects that a system otherwise has on the environment. Such side effects may affect files, the database, the graphical user interface, etc. An object responsible for logging activities that provides a method "write" may be replaced by a subclass object (say "LogDummy") whose "write" method simply stores the line to write in a local attribute, where it can be examined after the test. Sequence diagrams, however, already allow access to these values during the test. Fig. 6 describes the effect of a test on the log object directly.
[Figure: sequence diagram (SD) in which the «Testclass» :AuctionTest triggers start(), handleBid(bid1), handleBid(bid2) and finish() on kupfer912:Auction; the resulting calls write("Auction 912 opened at 14:00:00"), write("Bid 55.200 $US in Auction 912 from Theo accepted"), write("Bid 55150 $US in Auction 912 from Joe accepted") and write("Auction 912 closed at 15:30:00") on :ProtocolSimpleDummy show how the effects of the test are captured in the method arguments.]
Fig. 6. Describing the effects for the log in a sequence diagram.
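In code, such a dummy can be sketched as follows; the Log class and its write method are hypothetical stand-ins for the logging object described above, and the recording list plays the role of the captured arguments in the sequence diagram.

// Hypothetical logging class with a side-effecting write method.
class Log {
    void write(String line) {
        // real version: writes to a file, database, GUI, ...
    }
}

// Parameterized dummy: the side effect is replaced by recording the
// call, so the test can examine the interactions after the run.
class LogDummy extends Log {
    java.util.List written = new java.util.ArrayList();

    void write(String line) {
        written.add(line); // remember instead of writing
    }
}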
Static Elements

As explained, two concepts of object-oriented languages, namely inheritance and dynamic binding, allow tests to be set up in a form that was not possible in procedural and functional languages. However, typical OO languages also provide concepts that make testing difficult. These are, in particular, static concepts, such as static attributes, static methods and constructors. Static attributes are mainly used to allow easy sharing of global resources, such as the log object or database access. Static attributes, however, should be avoided anyway. If necessary, e.g. for access to generally known objects, a static attribute can at least be encapsulated by a static method. The problem with a static method results from the inability to redefine it for testing purposes. For example, if the static method "write" has side effects, these cannot be prevented during a test. Unfortunately, there are often at least some static
methods necessary. We have therefore used the technique shown in Fig. 7 to provide a static interface to the client and at the same time to make the effect of the static method adaptable, using an internal delegation to a singleton object that is stored in a static attribute. With proper encapsulation of the initialization of that attribute, this is a safe and still efficient technique to make static methods replaceable by dummies without changing their signature. Thus the side effects of static methods can be prevented.
[Figure: class diagram (CD) of a Singleton class with static attribute #Singleton singleton = null, static methods +initialize(), #initialize(Singleton s) and +method(Arguments), and dynamic method #doMethod(Arguments); SingletonDummy redefines #doMethod(Arguments). The singleton is initialized with the default or with an object of a subclass; the (underlined) static method only calls the dynamic method "doMethod", which contains the real functionality and can be redefined.]
Fig. 7. Singleton object behind a static method.
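A sketch of this pattern in Java follows; the class name Protocol and its methods are illustrative, not the paper's code.

// The static interface offered to clients stays unchanged, while the
// real functionality is dynamically bound and therefore redefinable.
class Protocol {
    private static Protocol singleton = new Protocol();

    // test hook: replace the singleton by an object of a subclass
    static void initialize(Protocol p) { singleton = p; }

    static void write(String line) { singleton.doWrite(line); }

    protected void doWrite(String line) {
        // real side effect (file, database, ...)
    }
}

// Dummy subclass for tests, installed via Protocol.initialize(new ProtocolDummy())
class ProtocolDummy extends Protocol {
    protected void doWrite(String line) {
        // record instead of writing, as in LogDummy above
    }
}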
Constructors

With respect to testing, a constructor shares a number of similarities with static methods, because a constructor is not dynamically bound and can therefore not be redefined. Furthermore, a constructor creates new objects and thus allows the object under test to change its own environment. For example, the tested object may create its own log object and write to the log through it. The standard solution for this problem is to force any object creation to be done by a factory. So instead of "new Class(arg)", a method "getClass(arg)" is called. This method may be static, using the approach above to encapsulate the factory object but still make it replaceable for tests. A factory dummy can then create objects of appropriate subclasses that serve as dummies with certain predefined test behavior. In practice, we found it useful to model the newly created objects using object diagrams. The factory dummy that replaces the factory then doesn't really create new objects, but returns one of the predefined objects each time it is called. Fig. 8 shows the factory part of a test data structure where three different person objects shall be "created" during the test. The data structure and the attribute values can be described using the same object diagram as is used to describe the basic test data structure. Further advantages are (1) that the newly "created" objects are known by name and can thus easily be checked after the test, even if they were disposed of during the test, and (2) that the order of creation can also be checked.
[Figure: object diagram (OD) with a :TestFactoryDummy holding the objects to be "created" in order: index=0 :PersonBidderDummy (personIdent = 1783, name = "Theo Smith", isActive = true); index=1 :PersonGuestDummy (personIdent = 20544, name = "Otto Mair", isActive = false); index=2 :Person (personIdent = 19227, name = "Jo Miller", isActive = false).]
Fig. 8. Replacing a constructor by a factory and modeling the factory behavior through a series of objects to be "created" during a test.
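One possible code shape for such a factory dummy is sketched below (hypothetical names throughout); it hands out predefined objects in a fixed order instead of creating anything.

// Hypothetical domain class and replaceable factory, following the
// singleton technique shown above.
class Person {
    String name;
    Person(String name) { this.name = name; }
}

class PersonFactory {
    private static PersonFactory instance = new PersonFactory();
    static void initialize(PersonFactory f) { instance = f; }
    static Person getPerson(String name) { return instance.doGetPerson(name); }
    protected Person doGetPerson(String name) { return new Person(name); }
}

// The dummy returns the predefined objects one by one, so the test
// knows each "created" object by name and can check the creation order.
class PersonFactoryDummy extends PersonFactory {
    private Person[] predefined;
    private int index = 0;
    PersonFactoryDummy(Person[] predefined) { this.predefined = predefined; }
    protected Person doGetPerson(String name) { return predefined[index++]; }
}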
Frameworks and Components

Frameworks are commonly used in object-oriented systems. The most prominent examples are AWT and Swing from Java, but basic classes, such as the containers, partly belong to that category as well. Software that uses predefined frameworks is particularly hard to test:
• The control flow of the framework makes tests tricky to run.
• The form and class of newly created objects within the framework are predefined through the constructors used.
• Static variables and methods of the framework cannot be controlled, in particular if they are encapsulated.
• Encapsulation prevents checking the test result.
• Sometimes subclasses cannot be built properly, because the class or its methods are declared "final", there is no public constructor, the constructor has side effects, or the internal control flow is unknown.
Today no framework exists that is directly suited for tests. Such a framework should allow framework objects to be replaced by self-defined subclass objects and should provide its own default dummy subclasses. Furthermore, the framework should use factories for object creation and give the test developer a possibility to replace these factories. The white-box adaptation principles that frameworks usually provide through subclassing are indeed helpful and sometimes sufficient; if not, a more general technique, the adapter, is needed to separate application and framework. This is a recommended technique for application developers anyway, to decouple application and framework, and it can be reused to improve testability as well. Fig. 9 shows how a JSP "ServletRequest" class is adapted. A "ServletRequest" basically contains the contents of a web form filled in by the user, in the form of pairs (parameter name, content). Unfortunately, "ServletRequest" objects can only be created by handling actual requests through the web. Therefore, an adapter is used, which is called "OwnServletRequest". In an adapter, normally simple delegation is used. But, as framework classes are strongly interconnected, method calls often require other framework objects as parameters or reveal access to other framework objects. For example, the method "get-
Session()" needs an additional wrapping to return the proper object of class "OwnSession". This adapter technique allows us to completely decouple application and framework, and even to run the application part without the framework, as may be desired in tests. For testing purposes, "OwnServletRequestDummy" may now overwrite all methods and use a "Map" to store a predefined set of "user" inputs. However, it must be noted that this kind of wrapping may need additional infrastructure to ensure that each time "getSession()" is called on the same "ServletRequest", the same corresponding session object is returned. This can be solved through an internal Map from Session to OwnSession that keeps track.
[Figure: class diagram (CD). The «Adapter» OwnServletRequest, with constructors OwnServletRequest() and OwnServletRequest(HttpServletRequest hsr) and methods +Enumeration getParameterNames(), +String getParameter(String name) and +OwnSession getSession(), wraps an HttpServletRequest (+Enumeration getParameterNames(), +String getParameter(String name), +HttpSession getSession()). Results with the type of a framework class are also wrapped in an adapter: «Adapter» OwnSession wraps HttpSession. OwnServletRequestDummy stores a Map/*String,String*/ parameter, is constructed from such a map via OwnServletRequestDummy(Map /*String,String*/ p), and overrides the three methods; OwnSessionDummy plays the corresponding role for sessions.]
Fig. 9. Adapters for framework classes.
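The adapter and its dummy can be sketched as below. The servlet API calls used (getParameterNames, getParameter, getSession) are real; the Own* classes follow the figure, but their exact shape here is an assumption, and the session caching mentioned above is only hinted at in a comment.

import java.util.*;
import javax.servlet.http.HttpServletRequest;

// Adapter: simple delegation; framework-typed results are wrapped again.
class OwnServletRequest {
    private HttpServletRequest hsr;
    protected OwnServletRequest() { }            // used by the dummy
    OwnServletRequest(HttpServletRequest hsr) { this.hsr = hsr; }
    Enumeration getParameterNames() { return hsr.getParameterNames(); }
    String getParameter(String name) { return hsr.getParameter(name); }
    OwnSession getSession() {
        // a full implementation would keep a Map from HttpSession to
        // OwnSession so the same OwnSession is returned each time
        return new OwnSession(hsr.getSession());
    }
}

class OwnSession {
    OwnSession(Object wrappedSession) { /* delegation elided */ }
}

// Dummy: the "user input" comes from a predefined map, no web request needed.
class OwnServletRequestDummy extends OwnServletRequest {
    private Map parameters;
    OwnServletRequestDummy(Map parameters) { this.parameters = parameters; }
    Enumeration getParameterNames() { return Collections.enumeration(parameters.keySet()); }
    String getParameter(String name) { return (String) parameters.get(name); }
    OwnSession getSession() { return new OwnSession(null); }
}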
So far we have discussed a number of easy-to-apply but effective techniques to make object-oriented systems testable. We have used class, sequence and object diagrams to model the test patterns and demonstrated how to use these diagrams to model the test data and dummies. It is an interesting question how to present such methodological knowledge and its individual parts. Basically, the technical principles, such as the pattern structure, can be formally defined. This has the advantage that at least the structural part of such a pattern can be applied automatically using an appropriate tool. However, the most important part of a test pattern, namely the methodical experience, cannot be formalized, but needs to be presented to the user in an understandable way. The user then needs to adapt a pattern to his specific situation. Therefore, we have chosen to largely use examples instead of precise descriptions for patterns.
6 Test Patterns for Distributed Systems

Testing a distributed system may become hard, as distribution naturally involves concurrent processes with interactions and timeouts, thus leading to nondeterministic behavior. Furthermore, it is normally not possible to stop the system and obtain a global system state for consistency checking. There exists a variety of approaches to
deal with these problems, in particular in the hardware and embedded systems area. Through the distribution of web services, in particular in the e-commerce domain, it becomes increasingly important to be able to deal with distributed object systems in this domain as well. In our example application, the online auction system, timing and distribution are very important, as auctions last only a very restricted time (e.g. one hour) and in the final phase bids are submitted within seconds. Therefore, it is necessary that auctions are handled synchronously over the web. The test patterns discussed in this section tackle four recurring problems: (1) simulation of time and progress, (2) handling concurrency through threads, (3) dealing with distribution and (4) communication. As already mentioned, the test patterns concentrate on functional tests. Additional effort is necessary to test quality-of-service attributes, such as throughput, mean uptime, etc. The proposed techniques have already been used in other approaches; the novelty basically comes from combining modeling techniques with these concepts in the form of methodical test patterns.

Handling Time and Progress

An online auction usually takes about one hour. A single test, however, may not take that long, but needs to run in milliseconds, as hundreds of tests shall finish quickly. So it is necessary to simulate time. This becomes even more important when distributed processes come into play that do not agree on a global time, as is usually the case on the internet. Thus, instead of calling the time routine of the operating system directly, an adapter is used. The adapter can be replaced by a parameterized dummy that allows us to set the time freely. For many tests a fixed time is sufficient; for tests of larger series of behaviors, however, it is also necessary that progress happens. Thus, in the time pattern, two more concepts can be established: first, each query of the current time increases time by one tick; second, we use explicit time stamps on sequence diagrams to adapt time during the test. The time stamps shown in Fig. 10 therefore correspond to statements that update the timing dummy. This active use of time stamps contrasts with other approaches, where a passive interpretation regards a time stamp as the maximum duration that a signal may take. The principle used here to simulate time also allows simulating the behavior of timers that trigger certain events regularly or after timeouts.

Concurrency Using Threads

Concurrency within one processing unit is often used to increase reactivity and to delegate regularly occurring tasks to specific units. In web applications, threads deal with polling of TCP/IP data from sockets and with GUI interactions. However, those threads are normally encapsulated in the frameworks and use "callbacks" to the application code to handle a request. For a functional test of this type of concurrency it is necessary to simulate these callbacks. This can be done by defining a fixed scheduling for the callbacks to obtain the necessary determinism and repeatability. Fig. 11 shows a test driver in the form of a sequence diagram, where the driving object
submits several requests to a number of objects that are normally running within different threads.
[Figure: sequence diagram (SD) in which the «Testclass» :AuctionTest triggers start(), handleBid(bid1), handleBid(bid2) and finish() on copper912:Auction, which calls newCurrentClosingTime(copper912, bid1) and newCurrentClosingTime(copper912, bid2) on timePol:TimingPolicy (returning t1 and t2); time stamps such as {time=Feb 12 2000, 13:00:00}, {time=14:42:22}, {time=14:44:18}, {time=14:47:18}, {time+=100msek} and {time+=40msek} define the time at which each interaction takes place and are used to set the time dummy.]
Fig. 10. A sequence diagram with time stamps to describe the progress of time.
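A sketch of the time adapter and its parameterized dummy follows (our illustration; class and method names are invented):

// Adapter for the operating system's time routine, replaceable for tests.
class Time {
    private static Time singleton = new Time();
    static void initialize(Time t) { singleton = t; }
    static long now() { return singleton.doNow(); }
    protected long doNow() { return System.currentTimeMillis(); }
}

// Parameterized dummy: the test driver sets and advances the time;
// in addition, each query advances time by one tick so progress happens.
class TimeDummy extends Time {
    private long current;
    TimeDummy(long start) { current = start; }
    void set(long t) { current = t; }        // corresponds to {time=...}
    void advance(long d) { current += d; }   // corresponds to {time+=...}
    protected long doNow() { return current++; } // one tick per query
}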
[Figure: sequence diagram (SD) in which the «Testclass» :ClientTest schedules calls to :ClockDisplay, :WebBidding and :BiddingPanel, three objects usually active in different threads of the auction client: updateDisplay() and updateMessages() are called directly, foreignBid(Money m1) with m1.full = "553.000,00 $US" and ownBid(Money m2) with m2.full = "552.400,00 $US" propagate bids, and actionPerformed(...) simulates the AWT callback leading to bid("552 400"); {time+=1s} stamps advance the time dummy, and each method call is completed before the next one is issued, so there is no true concurrency.]
Fig. 11. A test driver schedules calls to objects that normally reside in different threads.
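In code, such a driver is plain sequential Java: each simulated thread activity is invoked explicitly in a fixed order. The client classes below are hypothetical placeholders, and Time/TimeDummy refer to the time-pattern sketch above.

// Hypothetical client objects; each "thread body" is reformulated as a
// repeatable activity that the test can invoke explicitly.
class ClockDisplay { void updateDisplay() { /* refresh the clock */ } }
class WebBidding { void updateMessages() { /* poll for messages */ } }

public class ClientTest extends junit.framework.TestCase {
    public void testInterleavedClient() {
        TimeDummy time = new TimeDummy(0);
        Time.initialize(time);              // time pattern from above
        ClockDisplay display = new ClockDisplay();
        WebBidding bidding = new WebBidding();
        // fixed, deterministic schedule instead of true concurrency:
        display.updateDisplay();
        bidding.updateMessages();
        time.advance(1000);                 // {time+=1s}
        display.updateDisplay();
        // ... assertions on the resulting states ...
    }
}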
This approach only works for sequential calls and therefore does not test whether the simulated threads would behave similarly if running in parallel or in a machine-scheduled interleaving. Thus we do purely functional tests. On the one hand, interleavings can be checked through additional stress tests and reviews. On the other hand, Java, for example, provides a synchronization concept that, if properly used, is a powerful technique to make programs thread-safe. In practice, concurrency problems have been considerably reduced since the concept of thread-safety was introduced. In more involved situations, where interaction between two active threads is actually desired and therefore shall be tested, it might be necessary to slice methods into smaller portions and do a more fine-grained scheduling. However, the possibilities of interactions easily
explode, and efficient testing strategies are necessary. It is also possible to set up test drivers that run large sets of generated tests to explore at least a part of the interaction space. In an application where the developers define threads on their own, these threads usually have the form of a loop with a regularly repeated activity and a sleep statement. If not, they usually can be reformulated in such a form, and for the test pattern to be applicable they should be. The repeating activity can then easily be added to the test scheduling, whereas the Thread object itself should be so simple that a review is sufficient to ensure its correctness. In Java, the Thread class provides additional functionality, such as joining or terminating threads, which causes additional effort to simulate. As some of those methods cannot be redefined in subclasses, it might be necessary to use an adapter.

Distributed Systems

Based on the test patterns defined so far, it now becomes feasible to test distributed systems. Real, or at least conceptually distributed, systems have subsystems with separated storage and enforce explicit communication. With CORBA, DCOM, RMI or even plain socket handling, there is a variety of communication techniques available. Of course it is possible to run distributed tests, but it is a lot more efficient to simulate the distribution within one process. Again, this technique only works for tests of the functionality. One cannot expect to get good data on the reactivity and efficiency of the system when several subsystems are mapped into one process. As each object in the distributed system resides in exactly one part, we introduce a new tag, called location, that allows us to model in the test where each object resides. Fig. 12 shows a test driver with an interleaving of activities in distributed locations. To simulate a distributed system it is necessary to ensure that the distributed threads are mapped into one process in such a way that no additional interactions occur. But interactions usually do occur when static state is involved, because static attributes, for example, can be accessed globally. In the distributed system every subsystem had its own static attribute; after the mapping only one attribute exists. Our encapsulation of static attributes in singleton objects, however, can easily be adapted to simulate multiple static attributes: the delegation mechanism explained earlier is extended to use a map from location to attribute content instead of a single attribute. The location is set by the test driver accordingly, thus allowing the respective context of each tested object to be distinguished. This, for example, allows handling multiple logs, etc. The location tag is therefore crucial to set up virtually distributed systems and run them in an interleaved manner in such a way that they believe they run on their own. The virtually distributed system, however, gives us an interesting opportunity: it allows us to stop the system during the run and check invariants and conditions across subsystem borders, by simply adding globally valid OCL constraints to the sequence diagram that drives the test run. To be able to also talk about local invariants, we have extended the OCL in [Rum03] to allow localized constraints.
[Figure: sequence diagram (SD) in which the «Testclass» :CommTest interleaves submitBid(b) and updateMessages() calls on three objects from distinguished locations: {location=Server} s:AuctionServer, {location=C2} w2:WebBidding and {location=C3} w3:WebBidding.]
Fig. 12. A test driver schedules calls to objects in different locations.
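The location-aware extension of the singleton delegation can be sketched as follows (all names are our own; the paper only describes the mechanism):

import java.util.*;

// The test driver sets the current location before each scheduled call.
class Location {
    static String current = "Server";
}

// One "static" log per virtual subsystem: the single static attribute is
// replaced by a map from location to attribute content.
class DistributedLogDummy {
    private static Map logs = new HashMap(); // location -> List of lines

    static void write(String line) {
        List lines = (List) logs.get(Location.current);
        if (lines == null) {
            lines = new ArrayList();
            logs.put(Location.current, lines);
        }
        lines.add(line);
    }

    static List writtenAt(String location) { // examined by the test oracle
        return (List) logs.get(location);
    }
}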
Distributed Communication The remaining problem for tests of distributed systems is to simulate communication in an efficient way. The standard technique here is to build layers of respective communicating objects and use proxies (stubs) on each layer where appropriate. If for example CORBA is used, we build an adapter system around the CORBA API to encapsulate it in the same way as for ordinary frameworks. Thus replacement of the communication part through a dummy becomes feasible. In Fig. 13 we see two object diagrams showing layers of a subset of the communication mechanism that directly deals with sockets in Java. A bid arrives at the AuctionServerProxy in the client. It is transformed into a string and transferred via the MessageHandleProxy, the BufferedWriter and the URLConnection to the socket on the server side. There a thread that resides in the HttpConnection sleeps until a string is received on the socket. The received strings is transferred to the actual MessageHandler that un-marshalls the object into the original bid and gives it to the actual auction server. as:AuctionServer
…
OD Server
submitBid(Bid b)
…
asp:AuctionServerProxy
OD Client
submitBid(Bid b)
mh:MessageHandler
…
send(String s)
…
mhp:MessageHandlerProxy send(String s) send(String s, Writer w)
:HttpConnection
…
handleRequest(Socket s) handleRequest(Reader r)
«java.net» :Socket
/
«java.io» :BufferedReader
«java.net» :URLConnection
/
«java.io» :BufferedWriter
Internet: Browser, Caches, Proxies, Web-Server
Fig. 13. The layers of communication objects in the original system.
[Figure: three object diagrams (OD) showing shortcuts of the communication layer. (1) {location=Client} asp:AuctionServerProxyDummy delegates submitBid(Bid b) directly to {location=Server} as:AuctionServer, with delegation and copying of the arguments. (2) {location=Client} asp:AuctionServerProxy sends to {location=Server} mh:MessageHandler, so the test of marshalling is included. (3) The full layering through mhp:MessageHandlerProxy and :HttpConnection is kept, connected by a java.io :PipedReader / :PipedWriter pair.]
Fig. 14. Three shortcuts for the communication layer.
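The first shortcut relies on proxy and server sharing one interface, as the next paragraph explains; a minimal sketch follows (hypothetical names and signatures):

// Hypothetical interface shared by the real server and every proxy,
// so a test can shortcut the communication structure at any layer.
interface AuctionServerIF {
    void submitBid(Bid b);
}

class Bid {
    Bid copy() { return new Bid(); } // by-value transfer is mimicked by copying
}

class AuctionServer implements AuctionServerIF {
    public void submitBid(Bid b) { /* real business logic */ }
}

// Shortcut (1): the dummy proxy delegates directly to a local server
// object instead of marshalling and sending over a socket.
class AuctionServerProxyDummy implements AuctionServerIF {
    private AuctionServerIF server;
    AuctionServerProxyDummy(AuctionServerIF server) { this.server = server; }
    public void submitBid(Bid b) { server.submitBid(b.copy()); }
}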
The trick is that the proxies on the right-hand side present the same signature, by sharing the same interface as their real counterparts on the left-hand side. Therefore, in a test we may simply shortcut the communication structure. Depending on the objects we want to test, we might shortcut at the AuctionServer layer already or go as deep as the Reader/Writer pair. Fig. 14 shows three variants of possible connections. In the first two configurations it is important to model the location of each object, because the test generator needs to ensure that the location is changed when a client object calls a server object. In the third configuration this is unnecessary, as a transfer of a bid now consists of two parts: first, the bid is submitted at the AuctionServerProxy on the client side and stored at the PipedReader, and then the HttpConnection is activated in a second call on the server side. In the preceding two sections, we have discussed a number of test patterns, using models to describe their basic structure. On the one hand, the test patterns allow us to define functional tests for almost any kind of object-oriented, and in particular distributed, system in a systematic way. On the other hand, the examples show how easy it can be to define and understand test setups that are based on models. These models are a lot more compact and can be developed, read and understood more easily than code. Increased usability of these models for several development stages becomes feasible because of a better understanding of what these models can be used for. Therefore, model-based development as proposed by the MDA approach [OMG01] becomes applicable.
7 Model Evolution Using Automated Tests

Using models for test and application development is only one side of the coin. Automated testing is the primary enabler for an evolutionary approach to developing systems. Therefore, in this section, we sketch how model-based, automated testing and model evolution fit together. In the development approach sketched so far, an explicit architectural design phase is abandoned and the architecture emerges during design. Architectural shortcomings are resolved through the application of refactoring techniques [OJ93,Fow99]. These are transformational techniques to evolve a system in small, systematic steps to enhance its structure. The concept isn't new (see [PR03] for a discussion), but through the availability of tools and its embedding in XP [Bec99], transformational development has now become widely used. Nowadays, the development and maintenance process is expected to be flexible enough to react dynamically to changing requirements. In particular, enhanced business logic or additional functionality should be added rapidly to existing systems, without necessarily undergoing a major re-development or re-engineering phase. This can best be achieved if techniques are available that systematically evolve the system using transformations. To make such an approach manageable, the refactoring techniques for Java [Fow99] have shown that a comprehensible set of small and systematically applicable transformation rules seems optimal. Transformations, however, cannot only be applied to code, but to any kind of model. A number of possible applications are discussed in [PR03]. With a comprehensible set of model transformations at hand, model evolution becomes a crucial step in software development and maintenance. Architectural and design flaws can then be corrected more easily, superfluous functionality and structure removed, and structure for additional functionality or behavioral optimizations adapted, because models are more abstract and exhibit higher-level architectural and design information in a better way.
[Figure: two class diagrams (CD). Before the transformation, Person has subclasses Bidder (with attribute long ident and method checkPasswd()) and Guest (with method checkPasswd()); after the transformation, long ident and checkPasswd() have been moved up into Person.]
Fig. 15. Two transformational steps moving an attribute and a method along the hierarchy.
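In Java source, the two steps of Fig. 15 correspond to the well-known "pull up attribute/method" refactorings [Fow99]. A minimal sketch with invented method bodies (the Before/After suffixes exist only so both versions compile side by side):

// Before the transformation: attribute and method live in the subclasses.
class PersonBefore { }
class BidderBefore extends PersonBefore {
    long ident;
    boolean checkPasswd(String p) { return p.length() > 3; } // illustrative body
}
class GuestBefore extends PersonBefore {
    boolean checkPasswd(String p) { return p.length() > 3; } // identical body
}

// After the transformation: both members moved up. Context condition for
// the attribute: Guest had no attribute named ident. The identical method
// bodies allow a plain move; differing bodies require one of the options
// discussed in the text that follows.
class PersonAfter {
    long ident;
    boolean checkPasswd(String p) { return p.length() > 3; }
}
class BidderAfter extends PersonAfter { }
class GuestAfter extends PersonAfter { }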
Two simple transformation rules for a class diagram are shown in Fig. 15. The figure shows two steps that move a method and an attribute upward in the inheritance hierarchy at once. The upward move of the attribute is accompanied by the only context condition that the other class “Guest” does not have an attribute with the same name
yet. In contrast, moving the method may be more involved. In particular, if the two existing method bodies are different, there are several possibilities: (1) move up one method implementation and have it overridden in the other class; (2) just add the method's signature in the superclass; (3) adapt the method implementations in such a way that distinguishing parts are factored out into other sub-methods and the remainder of the method bodies is identical in both methods. Many of the necessary transformation steps are as simple as the upward move of an attribute. Others, however, are more involved, and their application comes with a larger set of context conditions; these of course need automated assistance. The power of these rather simple and manageable transformation steps comes from the possibility to combine them and evolve complex designs in a systematic and traceable way. Following the definition of refactoring from [Fow99], we use transformational steps for structure enhancement that do not affect "externally visible behavior". For example, both transformations shown in Fig. 15 do not affect the external behavior if made properly. By "externally visible behavior" Fowler in [Fow99] basically refers to behavioral changes visible to the user. This can be generalized by introducing an abstract "system border" that may also act as an interface to other systems. Furthermore, in a hierarchically structured system, we may enforce behavioral equivalence for "subsystem borders" already. It is therefore necessary to explicitly describe which kind of behavior is regarded as externally visible. For this purpose, tests are the appropriate technique to describe behavior, because (1) tests are already available as a result of the development process and (2) tests are automated, which allows us to check the effect of a transformation through inexpensive, automated regression testing. A test case thus acts as an "observer" of the behavior of a system under a certain condition. This condition is also described by the test case, namely through the setup, the test driver and the observations made by the test. Fig. 16 illustrates this situation.
[Figure: along a time axis, a test acts as driver and "observer": it sets up and calls the system, observes object creation and interactions in snapshots of the test run, checks properties, and compares with the expected result.]
Fig. 16. A test case acts as observation.
Fig. 16 also shows that tests do not necessarily constrain their observations to "externally visible behavior", but can make observations on local structure, internal interactions or state properties even during the system run. Therefore, we distinguish
"internal" tests, which evolve together with the transformed system, and "external" tests, which need to remain unchanged because they describe external properties of the system. Unit and integration tests focus on small parts of the system (classes or subsystems) and usually take a deep look into system internals. These tests are usually transformed together with the code models. Unit and integration tests are usually provided by the developer or by test teams that have access to the system's internal details; therefore, these are usually "glass box" tests. Acceptance tests, instead, are "black box" tests that are provided by the user (although again realized by developers) and describe external properties of the system. These tests must be a lot more robust against changes of the internal structure. Fig. 17 illustrates how an observation remains invariant under a transformation. To achieve robustness, acceptance tests should be modeled against the published interfaces of a system. In this context, "published" refers to parts of the system that are explicitly marked as externally visible and are therefore usually rather stable. Only explicit changes of requirements lead to changes of these tests, and indeed the adaptation of requirements can very well be demonstrated through the adaptation of these test models, followed by the transformations necessary to meet these tests afterwards in a "test-first approach".
[Figure: the same test (driver and "observer") is applied to the system run and to the modified system run after the transformation; the observation remains the same in both cases.]
Fig. 17. The transformed system model is invariant under a test observation.
To increase stability of acceptance tests in transformational development, it has proven useful to follow a number of standards for test model development. These are similar to coding standards and have been found useful already before the combination with the transformational approach:
• In general an acceptance test should be abstract, by not trying to determine every detail of the tested part of the system.
• A test oracle should not try to determine every part of the output and the resulting data structure, but concentrate on important details, e.g. by ignoring uninteresting objects and attribute values (e.g. in object diagrams and OCL constraints).
• OCL property descriptions can often be used to model a range of possible results instead of determining one concrete result.
• Query methods can be used instead of direct attribute access. This is more stable when the data structure is changed.
• Internal interactions should not be observed during the system run. This means that sequence diagrams used as drivers for acceptance tests concentrate on triggers and on interactions with the system border.
• Explicitly published interfaces that are regarded as highly stable should be introduced, and acceptance tests should focus on these interfaces.
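The first and last of these standards can be seen in a small test sketch; the interface and classes below are invented for illustration.

// Hypothetical published interface: acceptance tests are written against
// it rather than against the implementation's internal data structure.
interface AuctionIF {
    int getNumberOfBids(); // query method instead of direct attribute access
}

// One possible implementation; the test below does not depend on it,
// so refactorings of the internal bid storage leave the test unchanged.
class SimpleAuction implements AuctionIF {
    private java.util.List bids = new java.util.ArrayList();
    public int getNumberOfBids() { return bids.size(); }
}

public class AcceptanceTest extends junit.framework.TestCase {
    public void testFreshAuctionHasNoBids() {
        AuctionIF auction = new SimpleAuction();
        assertEquals(0, auction.getNumberOfBids());
    }
}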
8 Conclusions

The proposal made in this paper is part of a pragmatic approach to model-based software development. This approach uses models as the primary artifact for requirements and design documentation, code generation and test case development, and includes a transformational technique for model evolution, for efficient adaptation of the system to changing requirements and technology, for optimizing the architectural design, and for fixing bugs. To ensure the quality of such an evolving system, intensive sets of test cases are an important prerequisite. They are modeled in the same language, namely UML, and thus exhibit a good integration and allow us to model system and tests in parallel. The paper demonstrates that it is feasible to use various kinds of models to explicitly define automated tests. For use in object-oriented systems, however, the design of the system has to be adapted to some extent in order to make systems testable. A series of basic and enhanced test patterns leads to a better testable design; in particular, the test patterns for distributed systems are a necessary prerequisite for testability. However, there are some obstacles for the proposed approach. (1) Currently, tool assistance is still in its infancy. (2) More experience is needed to come up with effective testing techniques in the context of model evolution, which must also involve coverage metrics. (3) These new techniques, namely an executable sublanguage of the UML as well as a lightweight methodological use of models in a development process, are both a challenge to current practice in software engineering. They exhibit new possibilities and problems. Using executable UML allows programming in a more abstract and efficient way. This may finally downsize projects and decrease costs. The freed resources can alternatively be used within the project for additional validation activities, such as reviews, additional tests or even a verification of critical parts of the system. Therefore, we can conclude that techniques such as model-based development, model evolution and test-first design will change software engineering and add new elements to its portfolio.
Acknowledgements

I would like to thank Markus Pister, Bernhard Schätz, Tilman Seifert and Guido Wimmel for commenting on an earlier version of this paper as well as for valuable
discussions. This work was partially supported by the Bayerisches Staatsministerium für Wissenschaft, Forschung und Kunst through the Bavarian Habilitation Fellowship, and by the German Bundesministerium für Bildung und Forschung through the Virtual Software Engineering Competence Center (ViSEK).
References

[ABRS03] E. Abraham-Mumm, F.S. de Boer, W.P. de Roever, and M. Steffen. A Tool-supported Proof System for Multithreaded Java. (in this volume) LNCS. Springer, 2003.
[AM03] Agile Manifesto. http://www.agilemanifesto.org/. 2003.
[Bec99] Beck, K. Extreme Programming Explained. Addison-Wesley, 1999.
[Bec01] Beck, K. Aim, Fire (Column on the Test-First Approach). IEEE Software, 18(5):87-89, 2001.
[BG99] Beck, K., Gamma, E. JUnit: A Cook's Tour. Java Report, August 1999.
[Bin99] Binder, R. Testing Object-Oriented Systems: Models, Patterns, and Tools. Addison-Wesley, 1999.
[BL01] Briand, L., Labiche, Y. A UML-based Approach to System Testing. In M. Gogolla and C. Kobryn (eds.): «UML» – The Unified Modeling Language, 4th Intl. Conference, pages 194-208, LNCS 2185. Springer, 2001.
[Coc02] Cockburn, A. Agile Software Development. Addison-Wesley, 2002.
[Fow99] Fowler, M. Refactoring. Addison-Wesley, 1999.
[GHJV94] Gamma, E., Helm, R., Johnson, R., Vlissides, J. Design Patterns. Addison-Wesley, 1994.
[JKW03] B. Jacobs, J. Kiniry, and M. Warnier. Java Program Verification Challenges. (in this volume) LNCS. Springer, 2003.
[LCCRC03] G.T. Leavens, Y. Cheon, C. Clifton, C. Ruby, and D.R. Cok. How the Design of JML Accommodates Both Runtime Assertion Checking and Formal Verification. (in this volume) LNCS. Springer, 2003.
[LF02] Link, J., Fröhlich, P. Unit Tests mit Java. Der Test-First-Ansatz. dpunkt.verlag, Heidelberg, 2002.
[Mye79] Myers, G. The Art of Software Testing. John Wiley & Sons, New York, 1979.
[OJ93] Opdyke, W., Johnson, R. Creating Abstract Superclasses by Refactoring. Technical Report, Dept. of Computer Science, University of Illinois and AT&T Bell Laboratories, 1993.
[OMG01] OMG. Model Driven Architecture (MDA). Technical Report OMG Document ormsc/2001-07-01, Object Management Group, 2001.
[OMG02] OMG – Object Management Group. Unified Modeling Language Specification, V1.5, 2002.
[PR03] Philipps, J., Rumpe, B. Refactoring of Programs and Specifications. In: Practical Foundations of Business and System Specifications, H. Kilov and K. Baclawski (eds.), 281-297, Kluwer Academic Publishers, 2003.
[Rum02] Rumpe, B. Executable Modeling with UML: A Vision or a Nightmare? In: Issues & Trends of Information Technology Management in Contemporary Associations, Seattle. Idea Group Publishing, Hershey, London, pp. 697-701, 2002.
[Rum03] Rumpe, B. Agiles Modellieren mit der UML. Habilitation Thesis. To appear, 2003.
[Rum03b] Rumpe, B. E-Business Experiences with Online Auctions. In: Managing E-Commerce and Mobile Computing Technologies, Julie Mariga (ed.), Idea Group Inc., 2003.
[RS02] Rumpe, B., Schröder, A. Quantitative Survey on Extreme Programming Projects. In: Third International Conference on Extreme Programming and Flexible Processes in Software Engineering, XP2002, May 26-30, Alghero, Italy, pp. 95-100, 2002.
[SD00] Siedersleben, J., Denert, E. Wie baut man Informationssysteme? Überlegungen zur Standardarchitektur. Informatik Spektrum, 8/2000:247-257, 2000.
[TTCN92] ISO/IEC. Information Technology – Open Systems Interconnection – Conformance Testing Methodology and Framework – Part 3: The Tree and Tabular Combined Notation (TTCN). ISO/IEC International Standard 9646, 1992.
[WK98] Warmer, J., Kleppe, A. The Object Constraint Language. Addison-Wesley, 1998.
Concurrent Object-Oriented Programs: From Specification to Code

Emil Sekerinski
McMaster University, Department of Computing and Software, Hamilton, Ontario, Canada
[email protected]
Abstract. In this paper we put forward a concurrent object-oriented programming language in which concurrency is tightly integrated with objects. Concurrency is expressed by extending classes with actions and by allowing methods to be guarded. Concurrency in an object may be hidden from the outside, thus allowing concurrency to be introduced in subclasses of a class hierarchy. A disciplined form of intra-object concurrency is supported. The language is formally defined by translation to action systems. Inheritance and subtyping are also considered. A theory of class refinement is presented, allowing concurrent programs to be developed from sequential specifications. Our goal is to have direct rules for verification and refinement on the one hand and a practical implementation on the other. We briefly sketch our implementation. While the implementation relies on threads, the management of threads is hidden from the programmer.
1 Introduction

The reasons for having concurrency in programs are that concurrency occurs naturally when modeling the problem domain, that it makes programs more responsive, and that it exploits the potential speedup offered by multiple processors. It has been argued that objects can naturally be thought of as evolving independently and thus concurrently; objects are a natural "unit" of concurrency. Yet, current mainstream object-oriented languages treat concurrency independently of objects: typically, concurrency is expressed in terms of threads that have to be created separately from objects. In this paper we put forward a notation for writing truly concurrent object-oriented programs. Sequential object-oriented programs are expressed in terms of classes featuring attributes and methods. We keep this paradigm and extend it by augmenting classes with actions and adding guards to methods. While methods need to be invoked, actions are executed autonomously. Atomicity of attribute access is guaranteed by allowing only one method or action to be active in an object at any time. Concurrency is achieved by having active methods and actions in several objects. We also suggest a theory for developing concurrent object-oriented programs out of sequential ones, recognizing that concurrent programs often arise from sequential specifications. Class hierarchies are commonly used to express specification-implementation relationships. We envisage continuing to do this with concurrent classes by treating concurrency as an implementation issue, in the same way as the choice of a data structure.
Thus we may have a class serving as a specification and subclasses of it being sequential or concurrent implementations.
For a general overview of concurrent object-oriented languages we refer to [10]. Our work shares with the πoβλ approach by Jones et al. [14, 16] the use of synchronous communication between objects and the use of objects for restricting interference. While πoβλ is defined in terms of the π calculus, a process algebra, the definition of our language is in terms of action systems. We do not directly support early return and delegate statements as πoβλ does, but we do support inheritance and subtyping. Earlier related work includes the POOL family of languages [1], where communication between concurrent objects is done by rendezvous. Hoare-style verification rules for a language that includes statements for sending and receiving synchronous messages are given in [2]. Here we consider instead only (synchronous) method calls, where entrance to objects is regulated by method guards. Several approaches have emerged from extending action systems to model concurrent objects, as there is an established theory of data refinement and atomicity refinement of action systems [3, 6]. Action systems with procedures by Back and Sere [5] and Sere and Walden [23] resemble concurrent objects, except that action systems cannot be created dynamically like objects. Bosangue, Kok and Sere [8, 9] apply action systems to model dynamically created objects. Büchi and Sekerinski [11] take this further by defining inheritance and subtyping and justify the refinement rules with respect to observable traces. However, both approaches enforce strict atomicity of actions: if an action (or method) contains several method calls that may block, either all are executed or the whole action is not enabled. Thus, these approaches do not allow direct translation to efficient code. The Seuss approach of Misra [20] is also action-based but additionally considers fairness between actions. Guarded methods are distinguished from unguarded methods, with the syntactic restriction that there can be only one call to a guarded method per action and that this must be the first statement. Other restrictions are that objects cannot be created dynamically and that there is no inheritance.
The goal of the presented work is, on the one hand, to have a simple theory of program development and, on the other hand, to have an efficient and practical implementation. This paper is the result of several iterations towards this goal, starting with [4]. To test our ideas, we have developed a prototypical compiler for our language [17]. A key concept is to weaken the strict atomicity of methods and actions: when a method call transfers control to another object, the lock on the first object is released and a new activity in that object can be initiated.
Section 2 introduces the language and gives some examples. Our approach to making the theory simple is to start with a formal model of concurrent modules and to express all other constructs by translations into this "core". Only the translations that are needed have to be applied, and all formal reasoning is done in the core. The formalization is done within the Simple Theory of Types. Section 3 formalizes that core, on top of which Section 4 defines classes, objects, inheritance, subtyping, and dynamic binding and discusses verification and refinement. Section 5 extends this to concurrent objects. Section 6 sketches the implementation of the language.
We conclude with a discussion of the proposed model and the kind of concurrency it leads and with observations of the limitations of the current work.
2 A Concurrent Object-Oriented Language

We start by giving the (slightly simplified) formal syntax of the language in extended BNF. The construct a | b stands for either a or b, [a] means that a is optional, and {a} means that a can be repeated zero or more times:

class          ::= class identifier [ inherit identifier ] [ extend identifier ]
                     { attribute | initialization | method | action } end
attribute      ::= attr variableList
initialization ::= initialization ( variableList ) statement
method         ::= method identifier ( variableList , res variableList )
                     [ when expression do ] statement
action         ::= action [ identifier ] [ when expression do ] statement
statement      ::= assert expression |
                   identifierList := expressionList |
                   identifierList :∈ expressionList |
                   identifier . identifier ( expressionList , identifierList ) |
                   identifier := new identifier ( expressionList ) |
                   begin statement { ; statement } end |
                   if expression then statement [ else statement ] |
                   while expression do statement |
                   var variableList • statement
variableList   ::= identifierList : type { , identifierList : type }
identifierList ::= identifier { , identifier }
expressionList ::= expression { , expression }

A class is declared by giving it a name, optionally stating the class being inherited or extended, and then listing all the attributes, initializations, methods, and actions. Initializations have only value parameters, methods may have both value and result parameters, and actions don't have parameters. Both methods and actions may optionally have a guard, a boolean expression. Actions may be named, though the name does not carry any meaning. The assertion statement assert b checks whether the boolean expression b holds: if it holds, execution continues, otherwise it aborts. The assignment x := e simultaneously assigns the values of the list e of expressions to the list x of variables. The nondeterministic assignment statement x :∈ s selects an element of the set s and assigns it to the list x of variables. This statement is not part of the programming language, but is included here for use in abstract programs. A method call o.m(e, z) to object o takes the list e as the value parameters and assigns the result to the list z of variables. The object creation o := new C(e) creates a new object of class C and calls its initialization with value parameters e. We do not further define identifier and expression. We illustrate the constructs of the language by a series of examples. Consider the problem of ensuring mutual exclusion of multiple users accessing two shared resources. A user can perform a critical section cs only if that user has exclusive access to both resources. We assume each resource is protected by a semaphore. Semaphores and users are represented by objects, with a semaphore having a guarded method:
class Semaphore
  attr n : integer
  initialization
    n := 1
  method P
    when n > 0 do n := n − 1
  method V
    n := n + 1
end
class User
  attr s, t : Semaphore
  initialization (a, b : Semaphore)
    s, t := a, b
  method doCriticalSection
    begin s.P ; t.P ; cs ; s.V ; t.V end
end
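As a small usage sketch (ours, not from the paper; the variable names a, b, u1, u2 are our choice), two users sharing the same two semaphores can be set up as follows:

var a, b : Semaphore, u1, u2 : User •
begin
  a := new Semaphore ; b := new Semaphore ;
  u1 := new User(a, b) ; u2 := new User(a, b)
end

Concurrent activities may then invoke u1.doCriticalSection and u2.doCriticalSection; since both users acquire the semaphores in the same order, their critical sections exclude each other without risk of deadlock.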
We assume that all statements only access and update attributes of the object itself and local variables, except for method calls, which may access and update the state of other objects. All statements are executed atomically up to method calls. Thus in class Semaphore the method V is always executed atomically, as is the initialization. The method P may block if the guard is not true, but once the method is enabled, it is also executed atomically. The method doCriticalSection may block at the calls s.P and t.P. In this case some other activity must first call the V method of the corresponding semaphore before execution can resume. The next example is about merging the elements of two bounded buffers into a third buffer. Buffers and mergers are represented by objects:

class Buffer
  attr b : array of Object
  attr in, out, n, max : integer
  initialization (m : integer)
    in, out, n, max := 0, 0, 0, m ; b := new Object[m]
  method put(x : Object)
    when n < max do in, b[in], n := (in + 1) mod max, x, n + 1
  method get( res x : Object)
    when n > 0 do out, x, n := (out + 1) mod max, b[out], n − 1
end

class Merger
  attr in1, in2, out : Buffer
  attr a1, a2 : boolean
  attr x1, x2 : Object
  initialization (i1, i2, o : Buffer)
    in1, in2, out, a1, a2 := i1, i2, o, true, true
  action copy1
    when a1 do begin a1 := false ; in1.get(x1) ; out.put(x1) ; a1 := true end
  action copy2
    when a2 do begin a2 := false ; in2.get(x2) ; out.put(x2) ; a2 := true end
end

After creating a new merger object m, the actions of m can execute in parallel with the remaining program, including other Merger objects. Actions cannot be called; they are initiated automatically whenever they are enabled. Action copy1 is enabled if a1 is true. Once copy1 is initiated, it may block at either the call in1.get(x1) or the call
out.put(x1). In this case another activity in the same object may be initiated or may resume (if it was blocked). Initiating an activity here means starting either copy1 or copy2 again. Since a1 is false at these points, copy1 is disabled and cannot be initiated a second time. On the other hand, copy2 may be initiated and come to conclusion or block at the call in2.get(x2) or the call out.put(x2). Hence for example the situation may arise that both actions are blocked at the out.put calls. Thus Merger can buffer two elements. The last example is the observer design pattern, expressed as an abstract program. The pattern allows all observers of one subject to perform their update methods in parallel:

class Observer
  attr sub : Subject
  initialization (s : Subject)
    begin sub := s ; s.attach(this) end
  method update
    . . .
end

class Subject
  attr obs, notifyObs : set of Observer
  initialization
    obs, notifyObs := {}, {}
  method attach(o : Observer)
    obs := obs ∪ {o}
  method notify
    notifyObs := obs
  action notifyOneObserver
    when notifyObs ≠ {} do
      var o : Observer • begin o :∈ notifyObs ; notifyObs := notifyObs − {o} ; o.update end
end

As soon as execution of the action notifyOneObserver in a subject s reaches the call o.update, control is passed to object o and another activity in s may be initiated or may resume. In particular, the action notifyOneObserver may be initiated again, as long as notifyObs is not empty, i.e. some observers have not been notified. Thus at most as many notifyOneObserver actions are initiated as there are observers and all notified observers can proceed concurrently. New observers can be added at any time and will be updated after the next call to notify.
3 Statements, Procedures, Modules, and Concurrency

We introduce the "core" language into which the object-oriented constructs are translated. The definition is done in terms of higher order logic, as the type system of higher order logic is close to that of Pascal-like languages. We assume there are some basic types like boolean and integer. New types can be constructed as functions X → Y and
products X × Y, for given types X and Y. Function application is written as f (x) or simply f x and a pair as (x, y) or simply x, y. For convenience we also assume that further type constructors like set of T and bag of T are available.

Statements. The core statements are as follows. Let X be the type of the program state and p : X → boolean be a state predicate. The assertion {p} does nothing if p is true and aborts otherwise. The guard [p] does nothing if p holds and blocks otherwise. If S and T are statements then S ; T is their sequential composition. The choice S ⊓ T selects either S or T nondeterministically. If Q is a relation, i.e. a function of type X → Y → boolean, then [Q] is a statement that updates the state according to relation Q, choosing one state nondeterministically if several final states are possible and blocking if no final state according to Q exists. All further statements are defined in terms of these five core statements. These five statements can for example be defined by higher order predicate transformers, i.e. functions mapping predicates (the postconditions) to predicates (the preconditions), as done by Back and von Wright [7]. States are typically tuples and program variables are used to select components of the state tuple. For example, if the state space is X = integer × integer and variables x, y are used to refer to the two integer components, then a state predicate p can be defined as p(x, y) = (x > y). We assume the state space is understood from context, allowing us to write boolean expressions instead of state predicates in assertions and guards, for example the assertion {x > y}. We define skip = {true} = [true] to be the statement that does nothing, abort = {false} to be the statement that always aborts, and wait = [false] to be the statement that always blocks. Assume b is a boolean expression. The assertion statement assert b is synonymous to {b}. The guarded statement when b do S and the conditional statements if b then S and if b then S else T are defined as:

when b do S         = [b] ; S
if b then S         = ([b] ; S) ⊓ [¬b]
if b then S else T  = ([b] ; S) ⊓ ([¬b] ; T)

Suppose x : X and y : Y are the only program variables. The assignment statement x := e updates x and leaves y unchanged. The nondeterministic assignment statement x :∈ s assigns x an arbitrary element of the set s and leaves y unchanged. If s is the empty set then the statement blocks. Both are defined in terms of an update statement:

x := e  = [Q]   where   Q(x, y)(x′, y′) = (x′ = e) ∧ (y′ = y)
x :∈ s  = [Q]   where   Q(x, y)(x′, y′) = (x′ ∈ s) ∧ (y′ = y)
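As a small worked instance of these encodings (ours, not from the paper): for an integer variable x, the conditional if x > 0 then x := x − 1 denotes ([x > 0] ; x := x − 1) ⊓ [¬(x > 0)]. In a state with x = 2 the second branch is blocked, so the statement must decrement x; in a state with x = 0 the first branch is blocked and the remaining guard [x ≤ 0] acts as skip. The statement as a whole thus never blocks, which is exactly the behaviour expected of a conditional.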
The declaration of a local variable var x :∈ s • S extends the state space by x, executes S, and reduces the state space again. The initial value of x is chosen nondeterministically from the set s. If s is the empty set then the statement blocks. We write var x : X • S or simply var x • S if an arbitrary element of type X is chosen initially.

var x :∈ s • S = [Q] ; S ; [R]   where   Q y (x′, y′) = (x′ ∈ s) ∧ (y′ = y)
                                         R (x, y) y′ = (y′ = y)
The following theorem gives laws for transforming statements into equivalent ones. We let e[x\f ] stand for simultaneously substituting the variables x by the expressions f in e:
Theorem (Equational Laws). Assume x, y are disjoint lists of variables.

x := e = x :∈ {x′ | x′ = e}                                       (1)
x :∈ {x′ | b} ; y :∈ {y′ | c} = x, y :∈ {x′, y′ | b ∧ c[x\x′]}    (2)
var x • x, y :∈ {x′, y′ | b} = y :∈ {y′ | ∃ x, x′ • b}            (3)
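As a small worked instance (ours): by (1) and (2), the sequential composition x := 1 ; y := x first becomes x :∈ {x′ | x′ = 1} ; y :∈ {y′ | y′ = x}, and then the single update x, y :∈ {x′, y′ | x′ = 1 ∧ y′ = x′}, i.e. the simultaneous assignment x, y := 1, 1.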
For a statement S and predicate b, we let wp(S, b) be the weakest precondition for S to terminate and to establish postcondition b. The enabledness domain or guard of statement S is defined by grd S = ¬wp(S, false) and the termination domain by trm S = wp(S, true). The weakest liberal precondition wlp(S, b) is the weakest precondition for S to establish b provided S terminates. We give selected laws:

Theorem (Weakest Preconditions).

wlp(x := e, b) = b[x\e]                     (4)
wlp(x :∈ s, b) = ∀ x′ ∈ s • b[x\x′]         (5)
wlp(S ; T, b) ⇐ wlp(S, wlp(T, b))           (6)
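As a worked instance (ours, reusing class Semaphore from Section 2 and the standard fact wlp([p], b) = (p ⇒ b)): to see that n ≥ 0 is preserved, compute for method V, by (4), wlp(n := n + 1, n ≥ 0) = (n + 1 ≥ 0), which follows from n ≥ 0; and for method P, by (6) and (4), wlp([n > 0] ; n := n − 1, n ≥ 0) ⇐ (n > 0 ⇒ n − 1 ≥ 0) = true.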
The refinement of statement S by T, written S ⊑ T, means that T terminates whenever S does, T is disabled whenever S is, and T is "more deterministic" than S. In the predicate transformer model, S ⊑ T holds if for any postcondition q, whenever S establishes q so does T. Data refinement S ⊑R T generalizes (algorithmic) refinement by relating the initial and final state of S and T with relation R. We allow R to refine only part of the state, i.e. if the (initial and final) state space of S is X × Z and the state space of T is Y × Z, then it is sufficient for R to relate X to Y. We write Id for the identity relation and × for the parallel composition of relations:

S ⊑R T = (S ; [R × Id] ⊑ [R × Id] ; T)

We give selected laws about data refining statements; they naturally generalize when only a specific component of a larger state space is refined.

Theorem (Refinement Laws). Assume that relation R relates X to Y and the state space includes Z. Variables x, y, z refer to the corresponding state components:

x := e ⊑R y := f   if   R x y ⇒ R e f                                      (7)
{a} ; x := e ⊑R {b} ; y := f   iff   a ∧ R x y ⇒ b  and
    a ∧ R x y ⇒ R e f                                                      (8)
x := e ⊑R y :∈ {y′ | d}   if   R x y ∧ d ⇒ R e y′                          (9)
{a} ; x := e ⊑R {b} ; y :∈ {y′ | d}   iff   a ∧ R x y ⇒ b  and
    a ∧ R x y ∧ d ⇒ R e y′                                                 (10)
x :∈ {x′ | c} ⊑R y :∈ {y′ | d}   if   R x y ∧ d ⇒ ∃ x′ • c ∧ R x′ y′       (11)
{a} ; x :∈ {x′ | c} ⊑R {b} ; y :∈ {y′ | d}   iff   a ∧ R x y ⇒ b  and
    a ∧ R x y ∧ d ⇒ ∃ x′ • c ∧ R x′ y′                                     (12)
z :∈ {z′ | c} ⊑R z :∈ {z′ | d}   if   R x y ∧ d ⇒ c                        (13)
{a} ; z :∈ {z′ | c} ⊑R {b} ; z :∈ {z′ | d}   if   a ∧ R x y ⇒ b  and
    a ∧ R x y ∧ d ⇒ c                                                      (14)
S1 ; S2 ⊑R T1 ; T2   if   S1 ⊑R T1  and  S2 ⊑R T2                          (15)
S1 ⊓ S2 ⊑R T1 ⊓ T2   if   S1 ⊑R T1  and  S2 ⊑R T2                          (16)
The iteration statement Sω repeats S an arbitrary number of times, as long as S is enabled. If S never becomes disabled, then Sω aborts. Iteration Sω is defined as the least fixed point (with respect to the refinement relation) of the equation X = (S ; X) ⊓ skip. The while statement while b do S is defined in terms of iteration, with the additional restriction that upon termination ¬b must hold:

while b do S = ([b] ; S)ω ; [¬b]

Modules. A module declares a number of variables with initial values as well as a number of procedures. The procedures operate on the local variables and possibly on variables declared in other modules, either directly or by calling other procedures. Formally a module is a pair (init, proc) where init is the initial local state and proc is a tuple of statements. The syntax for defining a module with two variables p, q of types P, Q with initial values p0, q0 and a single procedure m is as follows:

module K
  var p : P := p0
  var q : Q := q0
  procedure m(u : U, res v : V)
    M
end

Formally we have K = (init, proc) with init = (p0, q0) and proc = M. The (initial and final) state space of the body M of m is U × V × X, where X is the state space of the whole program, which includes P and Q as components. Again, K.p or simply p is the name used to select the corresponding state component. Procedure names are used for selecting the components of proc: we write K.m or simply m in order to refer to statement M. A procedure call m(e, z) extends the state space by the formal value and result parameters, copies the actual value parameters to the formal parameters, executes the procedure body, and copies the formal result parameters to the actual result parameters:

m(e, z) = var u, v • u := e ; m ; z := v

Within a module, other modules may be referred to. The state space of the whole program is the combined state space of all modules of that program.

Concurrency. Concurrency is introduced by adding actions to modules. These actions may access variables of that module and variables of other modules, either directly or through procedures. Actions that access disjoint sets of variables may be executed in any order or in parallel. Module actions are executed atomically, i.e. either an action is enabled and can be carried to completion or it is not enabled (in contrast to class actions, which are atomic only up to method calls). Formally a concurrent module is a triple (init, proc, act) where in addition act is the combined action of the module. We use the following syntax for defining a module with actions a and b:
module K
  var p : P := p0
  var q : Q := q0
  procedure m(u : U, res v : V)
    M
  action a
    A
  action b
    B
end

We have K = (init, proc, act) with init = (p0, q0), proc = M, and act = A ⊓ B. All actions are combined into a single action and the names of the actions do not carry any meaning. The state space of act is the state space of the whole program, which includes P and Q as components.
Definition (Module Refinement). Module K = (init, proc) with variables p is refined by module K′ = (init′, proc′, act) with variables p′ through relation R, written K ⊑R K′, if:

(a) for the initialization: R init init′
(b) for every procedure m:
    (b.1) procedure refinement: K.m ⊑R K′.m
    (b.2) procedure enabledness: grd K.m ∧ R p p′ ⇒ grd K′.m ∨ grd act
(c) for the action:
    (c.1) action refinement: skip ⊑R act
    (c.2) action termination: R p p′ ⇒ trm( do act od )

The loop do S od repeats S as long as it is enabled. It is defined as Sω ; [¬grd S]. Compared to the while loop, the guard is implicit in the body. Condition (a) requires that the initializations are in the refinement relation. Condition (b.1) requires that each procedure of K is refined by the corresponding procedure of K′. While refinement by itself allows the guard to be weakened, condition (b.2) requires that whenever K.m is enabled, either K′.m or act must be enabled. Condition (c.1) requires that the effect of the action act is not visible when viewed from K. Finally, condition (c.2) requires that act eventually disables itself, hence cannot introduce non-termination. The definition can be applied when K′ has no action by taking act = wait. As grd wait = false, condition (b.2) simplifies to grd K.m ∧ R p p′ ⇒ grd K′.m and condition (c) holds by default. Module refinement can be generalized in several ways: K may also be allowed to have an action, allowing the concurrency of an already concurrent module to be increased [23]. Both K and K′ can exhibit finite stuttering and the generalized rule can be shown to be correct with respect to trace refinement [11]. We have restricted the refinement relation to relate only the local variables. The refinement relation can be generalized to include global variables, at the expense of losing compositionality [11, 23].
4 Objects

We distinguish between the class and the type of an object. The class defines the attributes and the methods of objects. We define a class in terms of a module with one variable for each attribute, one procedure for each method, and an extra variable for the objects populating that class. The variables map each object of the class to the corresponding attribute values. Each procedure takes an additional value parameter, this, for the object to which the procedure is applied. We assume the type Object is infinite and contains the distinguished element nil. All objects are of type Object. We write x :∉ s as a shorthand for the nondeterministic choice of x from the complement of s:

class C
  attr p : P
  initialization (g : G) I
  method l(s : S, res t : T) L
  method m(u : U, res v : V) M
end
    =
module C
  var C : set of Object := {}
  var p : Object → P
  procedure new(g : G, res this : Object)
    this :∉ C ∪ {nil} ; C := C ∪ {this} ; I
  procedure l(this : Object, s : S, res t : T)
    {this ∈ C} ; L
  procedure m(this : Object, u : U, res v : V)
    {this ∈ C} ; M
end

Within a method body attribute p is referred to by this.p. In general, referencing x.p amounts to applying the function p to x. Creating a new object x of class C with initialization parameter e amounts to calling the new procedure of class C. Calling the method m of an object x of class C amounts to calling the procedure m of class C with x as the additional parameter that is bound to this in m:

x.p = p(x)
x := new C(e) = C.new(e, x)
x.m(f, z) = C.m(x, f, z)

We follow the practice of using class names as if they were types in variable declarations, e.g. c : C. While the type of c is Object, the class name C is used to determine the module to which method calls to c go. The class name can also be used by the compiler to forbid certain assignments. We illustrate these concepts by an example of points in a plane.

class Point
  attr x : integer
  attr y : integer
  initialization (x : integer, y : integer)
    this.x, this.y := abs(x), abs(y)
  method distance(p : Point, res d : integer)
    d := abs(this.x − p.x) + abs(this.y − p.y)
  method copy( res p : Point)
    p := new Point(this.x + 2, this.y + 2)
end
Class Point translates to the following module. We write f [a ← b] for modifying function f to return b for argument a. The assignment x.p := e, or equivalently p(x) := e, stands for p := p[x ← e]. For convenience we continue to write x.p instead of p(x):

module Point
  var Point : set of Object := {}
  var x : Object → integer
  var y : Object → integer
  procedure new(x : integer, y : integer, res this : Object)
    this :∉ Point ∪ {nil} ; Point := Point ∪ {this} ; this.x, this.y := abs(x), abs(y)
  procedure distance(this : Object, p : Object, res d : integer)
    {this ∈ Point} ; d := abs(this.x − p.x) + abs(this.y − p.y)
  procedure copy(this : Object, res p : Object)
    {this ∈ Point} ; new(this.x + 2, this.y + 2, p)
end

We sketch how to verify invariance properties of classes. For example, consider showing that (this.x ≥ 0) ∧ (this.y ≥ 0) is an invariant of class Point: this requires proving that I, defined as ∀ this ∈ Point • (x(this) ≥ 0) ∧ (y(this) ≥ 0), is an invariant of the module Point. This holds if the initial values imply the invariant, (Point = {}) ⇒ I, and each procedure preserves the invariant, I ⇒ wlp(Point.new, I), I ⇒ wlp(Point.distance, I), and I ⇒ wlp(Point.copy, I). For new we have by using (4), (5), and (6):

wlp(Point.new, I)
=   wlp(this :∉ Point ∪ {nil} ; Point := Point ∪ {this} ; x(this), y(this) := abs(x), abs(y),
      ∀ p ∈ Point • (x(p) ≥ 0) ∧ (y(p) ≥ 0))
⇐  wlp(this :∉ Point ∪ {nil} ; Point := Point ∪ {this},
      ∀ p ∈ Point • (x[this ← abs(x)](p) ≥ 0) ∧ (y[this ← abs(y)](p) ≥ 0))
⇐  ∀ this ∉ Point ∪ {nil} • ∀ p ∈ Point ∪ {this} •
      (x[this ← abs(x)](p) ≥ 0) ∧ (y[this ← abs(y)](p) ≥ 0)
⇐  I

While we allow references this.a to attributes of the object itself to be abbreviated by a, care has to be taken as this involves a hidden function application, which is the source of aliasing. For example, consider adding method tile to class Point:

method tile(p : Point)
  p.x := x + 2 ; p.y := y

We might be tempted to conclude that the postcondition p.x = x + 2 is always established. Expanding the body to x(p) := x(this) + 2 ; y(p) := y(this) and the postcondition to x(p) = x(this) + 2 makes it evident that this is only true if initially this ≠ p, i.e. the postcondition does not hold for the call p.tile(p), with p ∈ Point. We turn our attention to inheritance. Suppose C is as earlier and class D inherits from C, while adding attributes and methods and redefining the initialization and some methods. We call C the superclass of D and D the subclass of C. This corresponds to defining a module D that uses module C:
class D inherit C
  attr q : Q
  initialization (h : H) J
  method m(u : U, res v : V) M
  method n(w : W, res y : Y) N
end
    =
module D
  var D : set of Object := {}
  var q : Object → Q
  procedure new(h : H, res this : Object)
    this :∉ C ∪ {nil} ; C := C ∪ {this} ; D := D ∪ {this} ; J
  procedure l(this : Object, s : S, res t : T)
    {this ∈ D} ; C.l(this, s, t)
  procedure m(this : Object, u : U, res v : V)
    {this ∈ D} ; M
  procedure n(this : Object, w : W, res y : Y)
    {this ∈ D} ; N
end
Those methods that are not explicitly redefined in D are defined in D as forwarding the call to C. Method bodies may contain calls to other methods of the same class, either to the same object, this.m(e, z), or to another object, x.m(e, z). The call this.m(e, z) is also written m(e, z). A method body in D may also contain a super-call super.m(e, z). In this case the call goes to the inherited class, i.e. the immediate superclass. This also applies to inheritance hierarchies with more than two classes:

super.m(e, z) = C.m(e, z)

We illustrate these issues with classes Point1D and Point2D:

class Point1D
  attr x : integer
  method setX(x : integer)
    this.x := x
  method scale(s : integer)
    this.x := this.x × s
end
class Point2D inherit Point1D
  attr y : integer
  method setY(y : integer)
    this.y := y
  method setXY(x, y : integer)
    this.setX(x) ; this.setY(y)
  method scale(s : integer)
    super.scale(s) ; this.y := this.y × s
end
These classes translate to the following modules:

module Point1D
  var Point1D : set of Object := {}
  var x : Object → integer
  procedure new( res this : Object)
    this :∉ Point1D ∪ {nil} ; Point1D := Point1D ∪ {this}
  procedure setX(this : Object, x : integer)
    {this ∈ Point1D} ; this.x := x
  procedure scale(this : Object, s : integer)
    {this ∈ Point1D} ; this.x := this.x × s
end
module Point2D
  var Point2D : set of Object := {}
  var y : Object → integer
  procedure new( res this : Object)
    this :∉ Point1D ∪ {nil} ; Point1D := Point1D ∪ {this} ; Point2D := Point2D ∪ {this}
  procedure setX(this : Object, x : integer)
    {this ∈ Point2D} ; Point1D.setX(this, x)
  procedure setY(this : Object, y : integer)
    {this ∈ Point2D} ; this.y := y
  procedure setXY(this : Object, x, y : integer)
    {this ∈ Point2D} ; setX(this, x) ; setY(this, y)
  procedure scale(this : Object, s : integer)
    {this ∈ Point2D} ; Point1D.scale(this, s) ; this.y := this.y × s
end
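As a worked instance of these translations (ours): for an object p of class Point2D, the call p.scale(2) goes to Point2D.scale(p, 2), which first performs the super-call Point1D.scale(p, 2), i.e. p.x := p.x × 2, and then p.y := p.y × 2, so both coordinates are scaled as intended.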
Inheritance does not affect the creation of objects, i.e. if D inherits from C then x := new D(e) = D.new(e, x). A key point of the definition of inheritance is that a new object of class D also becomes a member of class C, that is, D is a subtype of C. Subtyping corresponds to a subset relation between the members of the classes, D ⊆ C. Assuming c, d are objects, the type test c is D tests whether c is indeed an object of class D. The type cast d := c as D aborts if c is not an object of class D and assigns c to d otherwise. Assuming D is a subtype of C and c is declared to be of class C, the method call c.m(e, z) is bound dynamically, i.e. the actual class of c rather than the declared class determines the module to which the call goes. This generalizes to class hierarchies involving more than two classes accordingly (a worked dispatch instance follows the extension example below):

c is D = c ∈ D
d := c as D = {c ∈ D} ; d := c
c.m(e, z) = if c ∈ D then D.m(c, e, z) else C.m(c, e, z)

Within the bodies of the methods of class D, attributes of class C may be referred to. The type system would either allow or forbid this according to visibility declarations; we do not explicitly indicate visibility here. However, we note that if modification of C attributes is allowed in D, then an invariant shown to hold for C objects does not necessarily hold for D objects. Such an invariant also has to be shown to be preserved by the D methods. We can also define inheritance without subtyping, which we call extension. When class E extends class C, the methods of E may refer to the attributes of C, but creating an E object does not make it a C object. Methods are not inherited and no super-calls are possible (although one could generalize this). Hence this only allows sharing of attribute declarations. In case of extension the type system would forbid assignments between E and C objects:
class E extend C
  attr q : Q
  initialization (h : H) J
  method m(u : U, res v : V) M
  method n(w : W, res y : Y) N
end
    =
module E
  var E : set of Object := {}
  var q : Object → Q
  procedure new(h : H, res this : Object)
    this :∉ E ∪ {nil} ; E := E ∪ {this} ; J
  procedure m(this : Object, u : U, res v : V)
    {this ∈ E} ; M
  procedure n(this : Object, w : W, res y : Y)
    {this ∈ E} ; N
end
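Returning to dynamic binding: as a worked instance (ours), let p be declared of class Point1D but be bound to a Point2D object, p := new Point2D. By the definition above, the call p.scale(3) unfolds to if p ∈ Point2D then Point2D.scale(p, 3) else Point1D.scale(p, 3); since creation added p to both Point1D and Point2D, the Point2D branch is taken and both coordinates are scaled.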
Now we show how class refinement translates to module refinement. We give an example that involves creation of auxiliary objects and creation of garbage, i.e. objects to which there is no reference. Consider the following class S for defining a store in which we only record whether the store is empty or full:

class S
  attr f : boolean
  initialization f := false
  method full( res r : boolean)
    r := f
  method store
    f := true
end
    =
module S
  var S : set of Object := {}
  var f : Object → boolean
  procedure new( res this : Object)
    this :∉ S ∪ {nil} ; S := S ∪ {this} ; this.f := false
  procedure full(this : Object, res r : boolean)
    {this ∈ S} ; r := this.f
  procedure store(this : Object)
    {this ∈ S} ; this.f := true
end
In the refinement LS the boolean attribute f becomes a link l to another object of class LS. Initially l is nil and is set to some object of class LS in store. Hence, repeated calls to store will generate garbage:

class LS
  attr l : LS
  initialization l := nil
  method full( res r : boolean)
    r := (l ≠ nil)
  method store
    l := new LS
end
    =
module LS
  var LS : set of Object := {}
  var l : Object → Object
  procedure new( res this : Object)
    this :∉ LS ∪ {nil} ; LS := LS ∪ {this} ; this.l := nil
  procedure full(this : Object, res r : boolean)
    {this ∈ LS} ; r := (this.l ≠ nil)
  procedure store(this : Object)
    {this ∈ LS} ; new(this.l)
end
We show refinement between modules S and LS with relation R defined by:

R(S, f )(LS, l) = (S ⊆ LS) ∧ (∀ s ∈ S • f (s) = (l(s) ≠ nil))
Condition (a) of module refinement, R({}, f )({}, l), holds immediately. To show condition (b.1) for new we rewrite the bodies using (1) and (2):

S.new = this, S, f :∈ {this′, S′, f ′ | (this′ ∉ S ∪ {nil}) ∧ (S′ = S ∪ {this′}) ∧ (f ′ = f [this′ ← false])}
LS.new = this, LS, l :∈ {this′, LS′, l′ | (this′ ∉ LS ∪ {nil}) ∧ (LS′ = LS ∪ {this′}) ∧ (l′ = l[this′ ← nil])}

Refinement is now established by first applying (11) and then eliminating LS′, l′, S′, f ′ by the one-point rule:

S.new ⊑R LS.new
=   (S ⊆ LS) ∧ (∀ s ∈ S • f (s) = (l(s) ≠ nil)) ∧
    (this′ ∉ LS ∪ {nil}) ∧ (LS′ = LS ∪ {this′}) ∧ (l′ = l[this′ ← nil])
    ⇒ (∃ S′, f ′ • (this′ ∉ S ∪ {nil}) ∧ (S′ = S ∪ {this′}) ∧ (f ′ = f [this′ ← false]) ∧
       (S′ ⊆ LS′) ∧ (∀ s ∈ S′ • f ′(s) = (l′(s) ≠ nil)))
=   (S ⊆ LS) ∧ (∀ s ∈ S • f (s) = (l(s) ≠ nil)) ∧ (this′ ∉ LS ∪ {nil})
    ⇒ (this′ ∉ S ∪ {nil}) ∧ (S ∪ {this′} ⊆ LS ∪ {this′}) ∧
       (∀ s ∈ S ∪ {this′} • f [this′ ← false](s) = (l[this′ ← nil](s) ≠ nil))
=   true

For procedure full we immediately apply (8):

S.full ⊑R LS.full
=   ((this ∈ S) ∧ (S ⊆ LS) ∧ (∀ s ∈ S • f (s) = (l(s) ≠ nil)) ⇒ (this ∈ LS)) ∧
    ((this ∈ S) ∧ (S ⊆ LS) ∧ (∀ s ∈ S • f (s) = (l(s) ≠ nil)) ⇒ (f (this) = (l(this) ≠ nil)))
=   true

We rewrite procedure S.store using the definitions. For procedure LS.store we expand the call, rename the local variable to t, apply (1) and (2) to merge the assignments, and apply (3) to eliminate the local variable:

S.store = {this ∈ S} ; this, S, f := this, S, f [this ← true]
LS.store = {this ∈ LS} ; this, LS, l :∈ {this′, LS′, l′ | (this′ = this) ∧
    (∃ t • (t ∉ LS ∪ {nil}) ∧ (LS′ = LS ∪ {t}) ∧ (l′ = l[t ← nil][this ← t]))}

Refinement of store is established by applying (10); we leave out the details of the proof. To show condition (b.2) we first observe that grd LS.new = true, grd LS.full = true, and grd LS.store = true, i.e. all procedures are always enabled. Therefore condition (b.2) is immediately satisfied for all procedures. This completes the proof.
5 Concurrent Objects

Classes with actions are translated to modules with actions, such that there is one action for each object of the class. This is formally expressed by nondeterministically assigning any element of C to this before executing the action body. If C is empty, no action is
enabled. For the time being we make the restriction that method calls can appear only as the first statement in methods and actions.

class C
  attr p : P
  initialization (g : G) I
  method l(s : S, res t : T) L
  method m(u : U, res v : V) M
  action a A
  action b B
end
    =
module C
  var C : set of Object := {}
  var p : Object → P
  procedure new(g : G, res this : Object)
    this :∉ C ∪ {nil} ; C := C ∪ {this} ; I
  procedure l(this : Object, s : S, res t : T)
    {this ∈ C} ; L
  procedure m(this : Object, u : U, res v : V)
    {this ∈ C} ; M
  action a
    var this :∈ C • A
  action b
    var this :∈ C • B
end

Inheritance and subtyping work as for classes without actions. Refinement of classes with actions translates to refinement of modules with actions. We give an example that illustrates the concept of delaying a computation by enabling a background action. Class Doubler allows an integer to be stored and its double to be retrieved. Class DelayedDoubler doubles the integer in the background and blocks if the integer to be retrieved is not yet doubled:

class Doubler
  attr x : integer
  method store(u : integer)
    this.x := 2 × u
  method retrieve( res u : integer)
    u := this.x
end

class DelayedDoubler
  attr y : integer
  attr d : boolean
  initialization d := true
  method store(u : integer)
    y, d := u, false
  method retrieve( res u : integer)
    when d do u := y
  action double
    when ¬d do y, d := 2 × y, true
end

These classes translate to modules in the same fashion as the previous examples. We give immediately the refinement relation needed to prove that Doubler is refined by DelayedDoubler:

R(Doubler, x)(DelayedDoubler, y, d) = (Doubler = DelayedDoubler) ∧
    (∀ o ∈ Doubler • (d(o) ∧ (y(o) = x(o))) ∨ (¬d(o) ∧ (2 × y(o) = x(o))))

We conclude this example by noting that we can alternatively express DelayedDoubler as a subtype of Doubler, thus arriving at a class hierarchy in which concurrency is introduced in a subclass:
class DelayedDoubler inherit Doubler
  attr d : boolean
  initialization d := true
  method store(u : integer)
    x, d := u, false
  method retrieve( res u : integer)
    when d do u := x
  action double
    when ¬d do x, d := 2 × x, true
end

Statements in classes are atomic only up to method calls. If method calls appear not only as the first statement in methods and actions, the class has to be normalized first. A method or action body with such a call has to be split in order to model that execution can block at that point. If at the point of the method call there are no local variables, then we introduce an auxiliary integer variable that is initialized to zero and incremented at the point of the method call. For every call we also introduce an action that contains the call and the remainder of the body. This action is enabled if the counter for that call is positive; the action decrements the counter first. We illustrate this by an example of a faulty merger:

class FaultyMerger
  attr in1, in2, out : Buffer
  attr x1, x2 : integer
  initialization (i1, i2, o : Buffer)
    in1, in2, out := i1, i2, o
  action begin in1.get(x1) ; out.put(x1) end
  action begin in2.get(x2) ; out.put(x2) end
end

Class FaultyMerger is normalized as follows:

class FaultyMerger
  attr in1, in2, out : Buffer
  attr x1, x2 : integer
  attr at1, at2 : integer
  initialization (i1, i2, o : Buffer)
    in1, in2, out, at1, at2 := i1, i2, o, 0, 0
  action begin in1.get(x1) ; at1 := at1 + 1 end
  action when at1 > 0 do begin at1 := at1 − 1 ; out.put(x1) end
  action begin in2.get(x2) ; at2 := at2 + 1 end
  action when at2 > 0 do begin at2 := at2 − 1 ; out.put(x2) end
end

If in1 contains sufficiently many elements, the action in1.get(x1) ; at1 := at1 + 1 can be taken several times, overwriting x1 before the action placing x1 in out is taken. The class Merger avoids this problem with the help of an extra variable.
Fig. 1. Illustration of the implementation (thread pool and object pool). Boxes with the inPool attribute represent active objects, the others passive objects. A thin arrow between boxes represents a reference; a thick arrow from a thread to an object represents a reference with a lock.
Suppose there are local variables at the point of the method call. These local variables form the context in which execution may resume, after possible interleaving with other methods or actions. This is modelled by storing the context in an attribute of each object. As multiple activities may create local contexts, but the order of creation is ignored, the contexts are stored in a bag. We illustrate this by normalizing the action notifyOneObserver of class Subject:

attr at1 : bag of Observer
action notifyOneObserver
  when notifyObs ≠ {} do
    var o : Observer • begin o :∈ notifyObs ; notifyObs := notifyObs − {o} ; at1 := at1 + [o] end
action notifyOneObserver
  var o :∈ at1 • begin at1 := at1 − [o] ; o.update end

This normalization step is required before verification and refinement can be carried out by translating classes to modules.
6 Implementation

In order to test our ideas, we have developed a prototypical compiler for our language; see [17] for details. The compiler currently translates to the Java Virtual Machine. We sketch the principles of the implementation; see Fig. 1 for an illustration. The implementation relies on the restriction that method and action guards may refer only to attributes of the object itself and may not contain method calls. An object that has guarded methods is called a guarded object. An object that has actions is called an active object, otherwise a passive object. An active object that has at least one enabled action is called an enabled object, otherwise a disabled object. At runtime a thread pool and an object pool are maintained. The object pool is initially empty. When an active object is created, a pointer to it is placed in the object
pool; only active objects are placed in the object pool. Each active object has an extra boolean attribute inPool indicating whether a pointer to it is in the object pool. Threads request a reference to an active object from the object pool. If the object is disabled, the thread resets the inPool attribute and removes it from the object pool. If the object is enabled, the thread executes an enabled action and leaves the object in the object pool. Each thread obtains a lock to an object when entering one of its methods or actions and releases the lock when exiting the method or action. The lock is also released at a call to another object and obtained again at re-entry from the call. If a guarded method is called, the guard is evaluated and the thread waits if the guard is false. At the exit from a guarded object all waiting threads are notified to reevaluate the guards. Fairness among the actions of an object is ensured by evaluating the guards in a cyclic fashion. This is done with one additional attribute, in every active object, for the index of the last evaluated action guard. The object pool is implemented as a dynamic array. Fairness among the objects is ensured by retrieving active objects in a cyclic fashion. The object pool grows and shrinks like a stack: new objects are added at the end and when an object is retrieved, its position is filled with the last object. Hence adding objects and retrieving objects take constant time. Active objects are garbage collected like passive objects, i.e. when there is no reference from any other object and no reference from the object pool. With this scheme action guards are only evaluated when a thread is searching for an action to execute. Method guards are only re-evaluated when another thread has exited the object and thus possibly affected the guard. The memory overhead is that every active object requires one bit for the inPool attribute, one integer for the index of the last evaluated action guard, and one pointer in the object pool. We are currently experimenting with techniques to control the creation and termination of threads.
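As an illustrative sketch (ours, in the notation of Section 2; the class name ObjectPool, the methods add and remove, and the fixed capacity m are our assumptions, and growing the array as well as the thread and locking logic are elided), the stack-like pool behaviour described above can be rendered as follows:

class ObjectPool
  attr a : array of Object
  attr n : integer
  initialization (m : integer)
    n := 0 ; a := new Object[m]
  method add(o : Object)
    a[n], n := o, n + 1
  method remove(i : integer)
    begin n := n − 1 ; a[i] := a[n] end
end

Adding writes the new object at index n; removing overwrites position i with the last object and shrinks the pool, so both operations take constant time, matching the description above.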
7 Discussion

A number of attempts have been made at formalizing objects with records, initiated by the work of Cardelli [12] and leading to various type systems incorporating object-oriented concepts. Our experience in using one such type system is that in verification and refinement it is more in the way than helpful [22]. Understanding attributes as mappings from object identities to their values emerges naturally in object modeling techniques like [21], and is used in a number of formalizations of object models, e.g. [15]. The approach of viewing a method as a procedure with an additional this parameter is also taken in Modula-3 and Oberon-2. We find that this combination leads to a simple model with a clear distinction between classes and types. A consequence is that objects can only be allocated on the heap, an approach also taken in several mainstream object-oriented languages. One may argue about releasing the lock to an object when a method call in that object goes to another object, hence allowing other methods to be called or actions to be initiated. Indeed, in our first implementation we retained the lock. However, we found programs to be difficult to analyze, as it is necessary to keep track of which objects are locked by which actions. The model of releasing the lock allows some disciplined intra-object concurrency: while several actions or methods can be initiated, only one
can progress, thus still guaranteeing atomicity of attribute updates. In order for a class invariant to be preserved, the class invariant has not only to be established at the end of every method, but also before each call to another object. The need for doing so is already recognized in sequential programs when re-entrance is possible [19]. The model presented does not define (indirectly) recursive method calls; doing so would require taking a fixed point. The model also does not accurately capture how self- and super-calls are resolved when methods are redefined: a super-call will always remain in the superclass, even if it calls other methods that are redefined in the subclass. In order to model this, method calls must not be resolved immediately, but when objects of the classes are created. A model of inheritance that delays resolution of method calls to the time when objects are created was proposed by Cook and Palsberg [13] and applied to studies of class refinement by Mikhajlov and Sekerinski [18]. While our implementation follows this model, the presented theory does not capture this.
Acknowledgement

The author's understanding profited from the interaction with participants of FMCO 02; discussions with Rustan Leino are particularly acknowledged. The comments of the reviewer led to a significant improvement.
References

1. Pierre America. Issues in the design of a parallel object-oriented language. Formal Aspects of Computing, 1(4):366–411, 1989.
2. Pierre America and Frank de Boer. Reasoning about dynamically evolving process structures. Formal Aspects of Computing, 6(3):269–316, 1994.
3. Ralph Back. Refinement calculus, part II: Parallel and reactive programs. In J. W. de Bakker, W.-P. de Roever, and G. Rozenberg, editors, REX Workshop on Stepwise Refinement of Distributed Systems – Models, Formalisms, Correctness, Lecture Notes in Computer Science 430, pages 67–93, Mook, The Netherlands, 1989. Springer-Verlag.
4. Ralph Back, Martin Büchi, and Emil Sekerinski. Action-based concurrency and synchronization for objects. In T. Rus and M. Bertran, editors, Transformation-Based Reactive System Development, Fourth AMAST Workshop on Real-Time Systems, Concurrent, and Distributed Software, Lecture Notes in Computer Science 1231, pages 248–262, Palma, Mallorca, Spain, 1997. Springer-Verlag.
5. Ralph Back and Kaisa Sere. Action systems with synchronous communication. In E.-R. Olderog, editor, IFIP Working Conference on Programming Concepts, Methods, Calculi, pages 107–126, San Miniato, Italy, 1994. North-Holland.
6. Ralph Back and Joakim von Wright. Trace refinement of action systems. In B. Jonsson and J. Parrow, editors, CONCUR '94: Concurrency Theory, Lecture Notes in Computer Science 836. Springer-Verlag, 1994.
7. Ralph Back and Joakim von Wright. Refinement Calculus – A Systematic Introduction. Springer-Verlag, 1998.
8. Marcello M. Bonsangue, Joost N. Kok, and Kaisa Sere. An approach to object-orientation in action systems. In Mathematics of Program Construction, Lecture Notes in Computer Science 1422, Marstrand, Sweden, 1998. Springer-Verlag.
9. Marcello M. Bonsangue, Joost N. Kok, and Kaisa Sere. Developing object-based distributed systems. In P. Ciancarini, A. Fantechi, and R. Gorrieri, editors, 3rd IFIP International Conference on Formal Methods for Open Object-based Distributed Systems (FMOODS'99), pages 19–34. Kluwer, 1999.
10. Jean-Pierre Briot, Rachid Guerraoui, and Klaus-Peter Löhr. Concurrency and distribution in object-oriented programming. ACM Computing Surveys, 30(3):291–329, 1998.
11. Martin Büchi and Emil Sekerinski. A foundation for refining concurrent objects. Fundamenta Informaticae, 44(1):25–61, 2000.
12. Luca Cardelli. A semantics of multiple inheritance. In G. Kahn, D. MacQueen, and G. Plotkin, editors, International Symposium on the Semantics of Data Types, Lecture Notes in Computer Science 173, pages 51–67. Springer-Verlag, 1984.
13. William Cook and Jens Palsberg. A denotational semantics of inheritance and its correctness. In ACM Conference on Object Oriented Programming Systems, Languages and Applications, ACM SIGPLAN Notices, Vol 24, No 10, pages 433–443, 1989.
14. Steve J. Hodges and Cliff B. Jones. Non-interference properties of a concurrent object-based language: Proofs based on an operational semantics. In Burkhard Freitag, Cliff B. Jones, Christian Lengauer, and Hans-Joerg Schek, editors, Object Orientation with Parallelism and Persistence, pages 1–22. Kluwer Academic Publishers, 1996.
15. Daniel Jackson. Alloy: A lightweight object modelling notation. ACM Transactions on Software Engineering and Methodology, 11(2):256–290, 2002.
16. Cliff B. Jones. Accommodating interference in the formal design of concurrent object-based programs. Formal Methods in System Design, 8(2):105–122, March 1996.
17. Kevin Lou. A Compiler for an Action-Based Object-Oriented Programming Language. Master's thesis, McMaster University, 2003.
18. Leonid Mikhajlov and Emil Sekerinski. A study of the fragile base class problem. In Eric Jul, editor, ECOOP'98 – 12th European Conference on Object-Oriented Programming, Lecture Notes in Computer Science 1445, pages 355–382, Brussels, Belgium, 1998. Springer-Verlag.
19. Leonid Mikhajlov, Emil Sekerinski, and Linas Laibinis. Developing components in presence of re-entrance. In J. Wing, J. Woodcock, and J. Davis, editors, World Congress on Formal Methods, FM'99, Lecture Notes in Computer Science 1709, Toulouse, France, 1999. Springer-Verlag.
20. Jayadev Misra. A simple, object-based view of multiprogramming. Formal Methods in System Design, 20(1):23–45, 2002.
21. James Rumbaugh, Michael Blaha, William Premerlani, Frederick Eddy, and William Lorensen. Object-Oriented Modeling and Design. Prentice-Hall, 1991.
22. Emil Sekerinski. A type-theoretic basis for an object-oriented refinement calculus. In S. J. Goldsack and S. J. H. Kent, editors, Formal Methods and Object Technology, pages 317–335. Springer-Verlag, 1996.
23. Kaisa Sere and Marina Waldén. Data refinement of remote procedures. Formal Aspects of Computing, 12(4):278–297, 2000.
Design with Asynchronously Communicating Components

J. Plosila 1, K. Sere 2, and M. Waldén 2,3

1 University of Turku, Turku Centre for Computer Science (TUCS)
2 Åbo Akademi University, Turku Centre for Computer Science (TUCS), FIN-20520 Turku, Finland
3 Financing via the Academy of Finland
Abstract. Software-oriented methods allow a higher level of abstraction than the often quite low-level hardware design methods used today. We propose a component-based method to organise a large system derivation within the B Method, using the facilities provided by its tools. The designer proceeds from an abstract high-level specification of the intended behaviour of the target system via correctness-preserving transformation steps towards an implementable architecture of library components which communicate asynchronously. At each step a pre-defined component is extracted and the correctness of the step is proved using the tool support of the B Method. We use Action Systems as our formal approach to system design.
1 Introduction
When carrying out formal specification and derivation of systems we can apply methods that allow a high-level abstract specification of a system to be stepwise developed into a more concrete version by correctness-preserving transformations. Hence, these methods provide a top-down approach to system design where the initial specification is a very abstract view of the system to be built. Details about the intended functionality and hardware/software components of the system are stepwise added to the specification during the design while preserving the intended behaviour of the original description. While this approach is very appealing from the designer's point of view as it allows the system to be developed and verified in manageable tasks, it still lacks e.g. good tool support. In this paper, we concentrate on formal specification and derivation of asynchronous systems within the Action Systems formalism. Such systems may contain locally synchronous components, but the components interact with each other via asynchronous communication channels. This kind of architecture provides a promising, modular and reliable approach to implement modern large digital systems. Action systems [3] and the associated refinement calculus [4], which provides a mathematical reasoning basis for the stepwise development of action
systems, have shown their value in the design of reliable and correct systems in many ways [18]. The formal design methods supporting action systems and reasoning about them are heavily influenced by approaches to parallel and distributed program design as well as approaches to object-oriented programming [3,6]. Recently, methods to derive systems from an abstract specification all the way down to VLSI circuits within action systems have received considerable attention [15,17]. An action systems-based design process of an asynchronous system starts from an abstract specification which is then decomposed into asynchronously communicating library components. Each component can be implemented as an asynchronous (self-timed) [15] or a synchronous (clocked) [17] hardware module, or developed into an executable piece of software which is run in a standard microprocessor. Utilisation of formal methods is particularly important when designing such complex embedded systems, as it potentially reduces the design costs by eliminating errors and wrong design decisions at an early stage of the design process, before the involved costly hardware modules, e.g. VLSI chips, have been manufactured. The focus in this paper is on the correctness-preserving stepwise decomposition of a system specification into an asynchronous network of action system components pre-defined in a module library available to the designer. This is a very important design phase as it determines the basic structure of the system. The next phases, which include for example the software/hardware partitioning and the decision on the type of the hardware modules (clocked/self-timed), are out of the scope of this paper. In order to get more confidence in the development of asynchronous systems we need mechanical tool support. Recently, Waldén et al. [8,21] have shown how action systems and, hence, parallel and distributed systems can be modelled with the B Method, an approach that with its tool support [1,13,20] has been accepted by many industrial organisations. The B Method, similarly to action systems, supports stepwise derivation of programs. Moreover, the B Method offers automatic proof-support for the verification of the correctness of each step as well as tools to manage and administrate a derivation task. Hence, the very well-developed and rich theory for Action Systems can utilise the mechanical tool support provided by the B Method. In this paper we will further extend the applicability area of the B Method to provide support for formal asynchronous system design. Our goal is component-based design, where we can utilise the component libraries provided by the tools. We propose a method to administrate a large system derivation within the B Method via stepwise decomposition. We start in Section 2 by formalising the concepts of asynchronous systems in the B Method using the Action Systems approach. In Section 3 we develop the refinement and the component extraction ideas and associated tool support needed in the design of asynchronous systems at a quite high level of abstraction. Section 4 is then devoted to a case study on component-based design. We end in Section 5 with comparisons to related work and some concluding remarks.
2 Asynchronous Systems in B
Asynchronous interfacing provides a viable approach to building modern large digital systems, composed of several hardware and software units, in a modular and reliable manner. In asynchronous communication, a data transfer event between two system components consists of two phases: request and acknowledgement (Section 2.4). Depending on the application, the duration of each phase may be either unbounded or bounded. Asynchronously communicating components form an asynchronous system architecture in which a component module, taken separately, can internally be either an asynchronous (self-timed) or synchronous (clocked) hardware block, or a software module running in a standard or application-specific processor. When decomposing an abstract functional system specification stepwise into an asynchronous composition of dedicated modules within the formal framework of Action Systems, a correctness-preserving transformation accompanied with a number of proof obligations is applied at each refinement step. In order to give more confidence in the correctness proof of a refinement step we want to use a mechanical tool. Atelier B [20] and B-Toolkit [13] provide this kind of tool. They both comprise a set of tools which support the B Method [1], a formal framework for stepwise system development. Action Systems and the B Method have essentially the same structure, both are state-based methods as opposed to event-based process calculi, and both support the stepwise refinement paradigm to system construction. This gives a solid background for combining the two methods. The designer supplies the tool, Atelier B or B-Toolkit, with the specification and a number of refinements of this specification. The verification conditions, the proof obligations, needed for proving the correctness of the refinement steps can be automatically generated. Furthermore, these verification conditions can be automatically or interactively proved using these tools. Hence, by using the B Method for designing asynchronous systems we will have tool support with proving facilities for correct design.

2.1 Action Systems in B
We model asynchronous systems in B via abstract machines. The main components of an abstract machine are the state variables, the operations on the state variables and the invariant giving the properties of these variables. For specifying the operations we use substitutions, for example, a skip-substitution, a simple substitution (x := e), a multiple substitution (x := e || y := f ), a preconditioned substitution (PRE P THEN S END), an action, also called guarded substitution, (SELECT P THEN S END), or a non-deterministic substitution (ANY x WHERE P THEN S END), where x and y are variables, e and f are expressions, P is a predicate, and S is a substitution. Each substitution S is defined as a predicate transformer which transforms a postcondition Q into the weakest precondition for S to establish Q, wp(S, Q) [10], i.e. the initial states from which S is guaranteed to terminate and establish Q. The substitutions above are defined as follows:
wp(skip, Q)                       = Q
wp(x := e, Q)                     = Q[x := e]
wp(x := e || y := f, Q)           = Q[x, y := e, f ]
wp(PRE P THEN S END, Q)           = P ∧ wp(S, Q)
wp(SELECT P THEN S END, Q)        = P ⇒ wp(S, Q)
wp(ANY x WHERE P THEN S END, Q)   = (∀x . P ⇒ wp(S, Q))
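To illustrate the difference between a guard and a precondition, consider a small worked instance (ours, not from the paper) for an integer variable x:

wp(SELECT x > 0 THEN x := x − 1 END, x ≥ 0) = (x > 0 ⇒ x − 1 ≥ 0) = true
wp(PRE x > 0 THEN x := x − 1 END, x ≥ 0) = (x > 0) ∧ (x − 1 ≥ 0) = (x > 0)

The guarded substitution simply cannot be executed outside its guard, whereas the preconditioned substitution must be called with its precondition established.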
The abstract machine A given below

MACHINE A
VARIABLES x
INVARIANT I(x)
INITIALISATION x := x0
OPERATIONS
  A1 =ˆ SELECT P1 THEN S1 END;
  ...
  Am =ˆ SELECT Pm THEN Sm END
END
where every operation in the operations-clause is an action, is called an action system. An action system is identified by a unique name, here A. The state variable(s) x of the action system are given in the variables-clause. The invariant I(x) in the invariant-clause gives types and other properties for the variables. The variables are assigned initial values in the initialisation-clause. In the operations-clause each action Ai is given as a named operation. Each operation might have value parameters and/or return a result. An action A with the value parameter a and the result parameter b is denoted b ← A(a). This feature is used later to model communication in the form of message-passing between action systems. The types of the value parameters should be given as a preconditioned substitution in the operation, b ← A(a) =ˆ PRE type(a) THEN SELECT P THEN S END END. For readability of the action systems we have excluded the preconditions from the operations in this paper. Action systems are used as a model for parallel and distributed systems [3,21] with the basic idea that actions are selected for execution in a non-deterministic way. Hence, there is a non-deterministic choice between the actions A1, . . . , Am of A, written A1 [] . . . [] Am. The non-deterministic choice of the actions A and B is defined as wp(A [] B, Q) = wp(A, Q) ∧ wp(B, Q). Only actions that are enabled, i.e., whose guard predicate P holds in a given state, are considered for execution. The execution terminates when there are no enabled actions. If two actions do not share any variables, they can be executed in any order or in parallel. Hence, we have an interleaving semantics for action systems.

Example. The machine E below is a high-level action system specification of a system unit which computes a new value for data dout whenever the action E is enabled and selected for execution. The parameters l and r act as two-directional communication signals between E and its environment. In other words, when E receives values for l and r from the environment, it responds by assigning other values which are then detected by the environment. The values that l and r can be assigned are req and ack, corresponding to the request and acknowledgement phases of asynchronous communication (Section 2.4). The machine E receives
input data din when l = req and sends output data dout to the environment by setting r to req.

MACHINE E
OPERATIONS
  l, r, dout ← E(l, din, r, dout) =ˆ
    SELECT l = req ∧ r = ack THEN
      ANY dout' WHERE F(dout', din, dout) THEN dout := dout' END
      || r := req || l := ack
    END
END
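The execution model of an action system (repeatedly choose any enabled action; terminate when none is enabled) can be prototyped directly. The following Python sketch is our own illustration, not part of the B development: the guard and effect encode the action E of the machine E above, with a hypothetical stand-in function f for the unspecified relation F.

import random

def run(state, actions, max_steps=10):
    # Action-system loop: non-deterministically pick an enabled action;
    # execution terminates when no action is enabled.
    for _ in range(max_steps):
        enabled = [act for act in actions if act["guard"](state)]
        if not enabled:
            break
        random.choice(enabled)["effect"](state)
    return state

def f(din, dout):          # stand-in for the unspecified relation F
    return din + dout

E = {"guard":  lambda s: s["l"] == "req" and s["r"] == "ack",
     "effect": lambda s: s.update(dout=f(s["din"], s["dout"]),
                                  r="req", l="ack")}

# One communication cycle: the environment has set l = req.
print(run({"l": "req", "r": "ack", "din": 1, "dout": 0}, [E]))
# {'l': 'ack', 'r': 'req', 'din': 1, 'dout': 1}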
2.2 Scheduling of Actions
Implicitly there is a non-deterministic choice between the enabled actions in the operations-clause, as explained above. This can be stated explicitly in a scheduling-clause of an action system. In this clause we can also give other, more specific scheduling policies like sequential composition or some parallel, exclusive, prioritised, or probabilistic composition of actions. The clause is optional, but if a scheduling-clause appears in an action system, all the actions of the machine must be included in it. The scheduling has the same name as the abstract machine. If the actions have parameters, these will also be parameters of the scheduling.

MACHINE A
...
SCHEDULING
  A =ˆ A1 * . . . * Am
END
In this paper we focus on the two most common composition operators between actions and consider each occurrence of * to be either a non-deterministic choice, [], or sequential composition, ;. The sequential composition is frequently needed in asynchronous modeling to sequence communication events on the asynchronous communication channels discussed later in Section 2.4. The non-deterministic choice was explained above. The sequential composition of two actions can also be interpreted in terms of the non-deterministic choice. Let us consider the two actions A =ˆ SELECT P THEN S END and B =ˆ SELECT Q THEN T END. Their sequential composition, A ; B, can then be interpreted as the non-deterministic choice, A' [] B', between the two actions A' and B', where a variable pc (initially set to 1) for scheduling the actions has been added:
A' =ˆ SELECT P ∧ pc = 1 THEN S || pc := 2 END
B' =ˆ SELECT Q ∧ pc = 2 THEN T || pc := 1 END
Hence, B' is only enabled after A' has been executed, setting pc to 2. We can note that the scheduling-clause provides us with a convenient way of expressing the scheduling of the actions, which otherwise would have to be coded within the actions themselves. As an example of the scheduling, let the action system A have three actions A1, A2 and A3 and the scheduling clause A =ˆ ((A1 ; A2) [] A3). The execution of
A is then restricted so that A1 and A2 are always executed in sequence, interleaved with A3. In case the actions have parameters, these have to be taken into account in the scheduling. For example, the sequential execution of the two actions B1(b) and d ← B2(c), with associated value and result parameters b, c and d, is given as B(b, c, d) =ˆ B1 ; B2 in the scheduling clause of B.

Example. In the machine Reg below the operations Reg1 and Reg2 are sequentially composed. We return to this machine later.

MACHINE Reg
OPERATIONS
  b, dout ← Reg1(a, din) =ˆ SELECT a = req THEN dout := din || b := req END;
  a ← Reg2(b) =ˆ SELECT b = ack THEN a := ack END
SCHEDULING
  Reg(a, din, b, dout) =ˆ Reg1 ; Reg2
END
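The pc-based interpretation of sequential composition is easy to make concrete. In this sketch of ours (a hypothetical encoding, in the same guarded-action style as the sketch in Section 2.1), Reg1 ; Reg2 becomes two guarded actions strengthened with a scheduling variable pc, exactly as in the interpretation of A ; B above:

# Reg1 ; Reg2 encoded with an explicit scheduling variable pc (initially 1).
Reg1 = {"guard":  lambda s: s["a"] == "req" and s["pc"] == 1,
        "effect": lambda s: s.update(dout=s["din"], b="req", pc=2)}
Reg2 = {"guard":  lambda s: s["b"] == "ack" and s["pc"] == 2,
        "effect": lambda s: s.update(a="ack", pc=1)}

state = {"a": "req", "b": "ack", "din": 5, "dout": 0, "pc": 1}
assert Reg1["guard"](state); Reg1["effect"](state)
assert not Reg2["guard"](state)   # blocked until the slave on b acknowledges
state["b"] = "ack"                # the environment acknowledges on channel b
assert Reg2["guard"](state); Reg2["effect"](state)
print(state)                      # dout = 5, a = 'ack', pc back to 1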
2.3 Modularisation

Action systems can be composed into, and decomposed from, parallel systems [5]. The parallel composition of action systems A and B can be presented in the B Method using the extends-clause. This also provides an efficient way to model system hierarchy, i.e., a system can be presented as a composition of the subsystem modules listed in its extends-clause.

MACHINE A
EXTENDS B
VARIABLES x
INVARIANT I(x)
INITIALISATION x := x0
OPERATIONS
  A(a) =ˆ SELECT P THEN S END
SCHEDULING
  A(a) =ˆ A [] B(a, x)
END
MACHINE B
VARIABLES y
INVARIANT J(y)
INITIALISATION y := y0
OPERATIONS
  B(b, c) =ˆ SELECT Q THEN T END
SCHEDULING
  B(b, c) =ˆ B
END
Here the action system A extends the system B, indicating that A is considered to be composed in parallel with B. We can also say that A contains the component (i.e., subsystem) module B. The scheduling of A is then A [] B(a, x), where a and x are the actual parameters, which should be variables of A and/or formal parameters of the actions in A. The result of composing A and B in parallel is given as the system AB below.

MACHINE AB
VARIABLES x, y
INVARIANT I(x) ∧ J(y)
INITIALISATION x := x0 || y := y0
OPERATIONS
  A(a) =ˆ SELECT P THEN S END
  B(a) =ˆ SELECT Q[a/b, x/c] THEN T[a/b, x/c] END
SCHEDULING
  AB(a) =ˆ A [] B
END
The variables, the invariants and the actions of the two action systems A and B are simply merged in the composed action system AB. The formal parameters, b and c, in the action of B are substituted with the actual parameters, a and x, in A. Since a is a formal parameter of the scheduling A, it should also be a formal parameter of the action B after the substitution.

Example. To exemplify modularisation, let us consider the action system E1 below, which contains the system Reg (Section 2.2) as a component. The scheduling of E1 is then ((E11 ; E13) [] E12) [] Reg(c1, dm, c2, dout), where Reg(c1, dm, c2, dout) stands for (Reg1 ; Reg2), indicating that the parallel composition of the action systems E1 and Reg is actually modelled. The types of the involved variables are given as the sets com ({req, ack}) and data (which represents any data type). We will return to this machine later.

MACHINE E1
EXTENDS Reg
VARIABLES c1, c2, dm
INVARIANT c1 ∈ com ∧ c2 ∈ com ∧ dm ∈ data
INITIALISATION c1 := ack || c2 := ack || dm :∈ data
OPERATIONS
  E11(l, din, dout) =ˆ
    SELECT l = req ∧ r = ack THEN
      ANY dm' WHERE F(dm', din, dout) THEN dm := dm' END || c1 := req
    END;
  r ← E12 =ˆ SELECT c2 = req THEN c2 := ack || r := req END;
  l ← E13 =ˆ SELECT c1 = ack THEN l := ack END
SCHEDULING
  E1(l, din, r, dout) =ˆ (E11 ; E13) [] E12 [] Reg(c1, dm, c2, dout)
END
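The merging performed by parallel composition can also be rendered schematically. The sketch below is our own illustration (not a B tool): it unions the variables, conjoins the invariants, joins the initialisations, and pools the actions of two action-system descriptions, assuming that parameter substitution has already been applied.

def compose(sys_a, sys_b):
    # Parallel composition as a merge of the two descriptions.
    return {
        "variables": sys_a["variables"] | sys_b["variables"],
        "invariant": lambda s: sys_a["invariant"](s) and sys_b["invariant"](s),
        "init":      {**sys_a["init"], **sys_b["init"]},
        "actions":   sys_a["actions"] + sys_b["actions"],
    }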
2.4 Modelling Asynchronous Components
In this paper, we consider systems which are organizations of asynchronously communicating components. Such building blocks with asynchronous interfaces are here collectively called asynchronous components, independently of the intended internal structure of each component. As an example, the abstract system E discussed in Section 2.1 acts as an asynchronous component towards its environment.

Asynchronous Communication Channels. Interaction between asynchronous components is arranged via communication channels composed of the value and result parameters of the actions. In our formal design framework, a communication channel c(d), or a channel c in short, is defined to be a tuple (c, d), where c is a communication variable and d the list of those data variables whose values are transferred from one system module to another by communicating via c. In the case
when the list d is empty, c is called a control channel. Otherwise we have a data channel c(d). Furthermore, when referring to a single party of the communication, we talk about communication ports rather than channels. Generally, a communication variable c is of the enumerated type com_{m,n} defined by

com_{m,n} =ˆ {req1, . . . , reqm, ack1, . . . , ackn}

where req1, . . . , reqm and ack1, . . . , ackn are request and acknowledgement states (values), respectively. A variable c ∈ com_{m,n} is initialized to one of its acknowledgement states ackj. If m = 1 or n = 1, the corresponding value is written just req or ack, respectively. Hence, the simplest and most usual type com_{1,1} is equivalent to {req, ack} by default. We denote com_{1,1} simply by com.

A communication channel connects two action system components, one of which acts as the master and the other as the slave. The master side of a channel is called an active communication port, and the slave side is referred to as a passive communication port. A communication cycle on a channel c(d) includes two main phases. When the active party, the master, initiates the cycle by setting c to a request state reqi, the cycle is said to be in the request phase. Correspondingly, when the passive party, the slave, responds by setting c to an acknowledgement state ackj, the communication cycle on c is said to be in the acknowledgement phase. The data d can be transferred either in the request phase from the master to the slave (push channel), in the acknowledgement phase from the slave to the master (pull channel), or in both phases bidirectionally (biput channel) [14].

Example. The above machine E1 (Section 2.3) and its component Reg (Section 2.2) communicate asynchronously via the push channels c1(dm) and c2(dout), where c1, c2 ∈ com. When the system E1 transfers data dm to Reg by setting c1 to the request state req, it acts as the master towards the machine Reg, which sets c1 to the acknowledgement state ack as a response. On the other hand, when data dout is transferred from Reg to E1 via the channel c2, the system Reg is the active party and E1 acts as the slave.
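A communication cycle on a push channel is small enough to spell out operationally. The following sketch (our own, hypothetical encoding in the guarded-action style used earlier) shows one cycle on a push channel c(d): the master raises the request together with the data, and the slave consumes the data and acknowledges.

master_send = {
    "guard":  lambda s: s["c"] == "ack",                   # previous cycle done
    "effect": lambda s: s.update(d=s["next_d"], c="req"),  # push data, request
}
slave_receive = {
    "guard":  lambda s: s["c"] == "req",
    "effect": lambda s: s.update(received=s["d"], c="ack"),  # consume, ack
}

state = {"c": "ack", "d": None, "next_d": 42, "received": None}
master_send["effect"](state)    # request phase: data travels with the request
slave_receive["effect"](state)  # acknowledgement phase completes the cycle
print(state["received"], state["c"])   # 42 ack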
3 Deriving Asynchronous Systems with B
Top-down design of an asynchronous system starts from a high-level action system specification of the basic functionality of the target system. The initial abstract specification is then stepwise implemented, within the refinement calculus framework, as an asynchronous architecture of action system modules representing pre-defined library components available to the designer. Such a module could be, for example, an arithmetic-logical unit, a computational algorithm, an interface module, a memory block, a controller of a set of other units, or even a very complex microprocessor core. After this component extraction process, the main system does not contain any operations of its own, but all of them come from the component modules listed in the extends-clause of the system. The
components interact via asynchronous communication channels created during the stepwise extraction process. In this section, we first discuss refinement in the B Method generally, and then we formulate a specific transformation rule for component extraction in asynchronous system design.
3.1 Abstract Machine Refinement
Refinement is a viable method for stepwise derivation of systems. Let us consider an abstract machine A and its refinement C given as

MACHINE A
VARIABLES x
INVARIANT I(x)
INITIALISATION x := x0
OPERATIONS
  A1 =ˆ SELECT P1 THEN S1 END;
  ...
  Am =ˆ SELECT Pm THEN Sm END
SCHEDULING
  A =ˆ A1 * . . . * Am
END
REFINEMENT C
REFINES A
VARIABLES y
INVARIANT R(x, y)
INITIALISATION y := y0
OPERATIONS
  C1 =ˆ SELECT Q1 THEN T1 END;
  ...
  Cn =ˆ SELECT Qn THEN Tn END
SCHEDULING
  C =ˆ C1 * . . . * Cn
MAPPINGS
  ...
END
The machine refinement states in its refines-clause what it refines: an abstract machine or another machine refinement. Above, the refinement C refines the abstract machine A. The invariant R(x, y) of the refinement gives the relation between the variable(s) x in the action system A and the variable(s) y in its refinement C for replacing abstract statements with more concrete ones. The refined and more concrete actions Ci are given in the operations-clause, and the scheduling-clause indicates how these actions are composed. The mappings-clause states the refinement relation between the actions in A and C and will be dealt with in more detail below. In order to prove that the refinement C on the variables y is a refinement of the action system A on the variables x using the invariant R(x, y), a number of proof obligations must be satisfied [1]. The invariant R of the refinement should not contradict the invariant I of the specification:

(∃(x, y). I ∧ R)   (1)
Furthermore, the initialisation y := y0 in C establishes a situation where the initialisation x := x0 in A cannot fail to establish the invariant R:

wp(y := y0, ¬wp(x := x0, ¬R))   (2)
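As a tiny illustration of our own (not taken from the paper's running example), let x, y range over NAT, let the gluing invariant be R: y = 2x, and take the initialisations x := 1 in A and y := 2 in C. Then wp(x := 1, ¬R) = (y ≠ 2), so ¬wp(x := 1, ¬R) = (y = 2), and wp(y := 2, y = 2) = (2 = 2) = true; obligation (2) holds for this pair of initialisations.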
Moreover, we need to prove that the actions in A are data refined by the actions in C. In case each action Ai in A corresponds to one action Ci in C,
(n = m), we have an entry Ai ≤ Ci for each action Ai of A, indicating that action Ai is data refined by Ci under invariant R.

MAPPINGS
  A1 ≤ C1,
  ...
  Am ≤ Cm
Hence, it should be proven that for each action Ai in A there is an action Ci in C such that Ci establishes a situation where Ai cannot fail to maintain R:

(∀(x, y). I ∧ R ⇒ wp(Ci, ¬wp(Ai, ¬R)))   (3)

where 1 ≤ i ≤ m. In this case the scheduling of the corresponding actions should be the same in A and C, i.e., if A =ˆ A1 [] A2 then C =ˆ C1 [] C2. In asynchronous system design, especially in the component extraction process, we often rely on atomicity refinement [12,19], where we split an atomic action of a system into several actions in order to increase the degree of parallelism in the system. Hence, we may need to introduce new actions during the refinement process. In case we introduce new actions in C, (n > m), we have the case that an action in A is refined by a composition of actions in C, e.g., A2 ≤ C3 [] C4. Furthermore, we allow a composition of actions in A to be refined by a composition of actions in C.

MAPPINGS
  Ai * . . . * Ak ≤ Cj * . . . * Cl,
  ...
where 1 ≤ i, k ≤ m and 1 ≤ j, l ≤ n, and each * is [] or ; depending on the scheduling of the corresponding actions in A and C. Each action of A and C should appear once and only once in the mappings-clause. If we denote the composed actions Ai * . . . * Ak by Di and Cj * . . . * Cl by Ei, we can write the proof obligation for refinement using composed actions as

(∀(x, y). I ∧ R ⇒ wp(Ei, ¬wp(Di, ¬R)))   (4)
for each entry Di ≤ Ei in the mappings-clause. Hence, for each composed action Di of A, the composed actions Ei of C should establish such a situation that Di cannot fail to maintain R. Notice that in case the refined action system C is composed in parallel with another action system B, i.e., C has B as a component module, and B contains the scheduling B =ˆ B1 ; B2, then either B, or both B1 and B2, should appear within the composed actions Ei in the mappings-clause of C. If action A1 in A is refined by (C1 [] B) in C, then we actually consider A1 to be refined by C1 and the composition of all the actions in B, A1 ≤ (C1 [] (B1 ; B2)). The proof obligations (1), (2) and (3) can be generated automatically and checked using the theorem-proving environments associated with the B Method [1,13]. See Waldén and Sere [21] for further details on refining action systems within B. The B Method supports one-to-one refinement corresponding to proof obligation (3). This is, however, too restrictive for the derivation of asynchronous
systems. Therefore, we have introduced the mappings-clause, providing us with the needed flexibility for the refinement. The mappings-clause also makes automatic generation and checking of proof obligation (4) possible. In Event B [9], for developing distributed systems in B, several operations can refine the same operation, and one operation can in turn refine several operations. However, in Event B sequential composition of these operations is not allowed. In this paper we propose only to add new variables and not to change the abstract variables in each refinement step. We therefore have the case that the parallel composition of action systems is monotonic with respect to this refinement. Due to this and to the transitivity of the refinement relation, if action system A is refined by A' and action system B is refined by B', then the parallel composition of A and B is refined by the parallel composition of A' and B'. This means that the subsystems in a parallel composition may be refined independently.

Example. Studying the examples given in Section 2, we can note that the action system E1, which has Reg as a component, is actually a refinement of the action system E. Since E has fewer actions than E1, the refinement of the actions is given in the mappings-clause as follows.

REFINEMENT E1
REFINES E
EXTENDS Reg
VARIABLES ...
SCHEDULING
  E1(l, din, r, dout) =ˆ (E11 ; E13) [] E12 [] Reg(c1, dm, c2, dout)
MAPPINGS
  E ≤ ((E11 ; E13) [] E12 [] Reg(c1, dm, c2, dout))
END
The tool support for the B Method is then used to generate the proof obligations (1), (2) and (4) for E1, as well as to prove them automatically or interactively. Hence, the tool assists us in proving that E1 is a correct refinement of E.
3.2 Component Extraction
The stepwise component extraction process of an asynchronous system is based on decomposing the system by introducing new asynchronous communication channels. Each decomposition step creates at least one channel through which the extracted component is accessed. Let us now sketch a transformation rule for component extraction within B. We study here one of the most usual cases. More application-specific rules can be derived when required. First, assume that we have a module library at our disposal and that this library contains a component M of the form
MACHINE M
...
OPERATIONS
  x ← M(x, y) =ˆ SELECT x = req THEN S(y) || x := ack END
SCHEDULING
  M(x, y) =ˆ M
END
where S is an arbitrary substitution on the data parameter y. The component has the passive communication port x(y) (x ∈ com) through which it can be accessed. Consider an action system A defined as:

MACHINE A
...
OPERATIONS
  A =ˆ SELECT P THEN S1(d); S(d); S2 END;
  ...
SCHEDULING
  A =ˆ A . . .
END
Here S1 and S2 are two arbitrary substitutions, S is the same substitution as in the library component M given above, and the data variable d, shared by S1 and S, is of the same type as the parameter y of M. Let us now assume that we want to extract the library component M from the machine A, i.e., our goal is to implement a part of A as an instance of the pre-defined module M. For this, a fresh communication channel c(d) (c ∈ com) is created. The result of this refinement is the system C given as

MACHINE C
EXTENDS M
...
OPERATIONS
  C1 =ˆ SELECT P THEN S1(d) || c := req END;
  C2 =ˆ SELECT c = ack THEN S2 END;
  ...
SCHEDULING
  C =ˆ (C1 ; C2) [] M(c, d) . . .
END
The initial value of the communication variable c is ack. Observe how the variables c and d are given to the component module M as actual parameters in the scheduling-clause. The system C acts as a master towards the component M and initiates a communication cycle on the channel c(d) by setting c to the request state req in its operation C1. The acknowledgement ack from the slave M is detected in the operation C2, which is composed sequentially with C1. Notice that the substitutions S1, S, and S2 are executed in exactly the same order as in the initial machine A, but in C and M this execution sequence is non-atomic. The component M takes care of the substitution S, while S1 and S2 belong to C.

Extraction as Refinement. To obtain the above system C from the initial machine A, the communication variable c ∈ com needs to be introduced first. Then the operation A of A is transformed into three separate atomic actions C1, C2, and
M using the variable c and sequential scheduling. The sequence of the involved substitutions S1, S, and S2 is preserved. Hence, the initial machine A is refined into A', which has the form

REFINEMENT A'
REFINES A
...
OPERATIONS
  C1 =ˆ SELECT P THEN S1(d) || c := req END;
  M =ˆ SELECT c = req THEN S(d) || c := ack END;
  C2 =ˆ SELECT c = ack THEN S2 END;
  ...
SCHEDULING
  A' =ˆ (C1 ; C2) [] M . . .
MAPPINGS
  A ≤ (C1 ; C2) [] M,
  ...
END
The transformation is correct provided we can show the correctness of refining A into (C1 ; C2) [] M, as required by the proof obligation (4). For this, we need to find a suitable invariant R, which is application dependent. Intuitively, the machine A' is semantically equivalent to the above machine C, which contains the component module M. The structural difference is that in A' the action M is a local operation of the system, while in C it belongs to the pre-defined library component M mentioned in the extends-clause. This means that the action instance M in the scheduling-clause of A' is turned into the component instance M(c, d) in the scheduling-clause of C. We can write

REFINEMENT C
REFINES A'
EXTENDS M
...
OPERATIONS
  ...
SCHEDULING
  C =ˆ (C1 ; C2) [] M(c, d) . . .
MAPPINGS
  M ≤ M(c, d),
  ...
END
In order to show the correctness of this transformation as a refinement step, we need to verify the proof obligations (1)-(4) from Section 3.1. The proof obligations can be automatically generated from C and proved with the tools supporting the B Method. Note that the above transformation rule applies to the most straightforward extraction step. In practice, the procedure can be more complicated, requiring several refinement steps, depending on the complexity of the library component which is to be extracted. The communication variable c, for instance, might have more values than the two (req, ack) needed above, and the introduction of c might involve more than one action (A) in the original system. Furthermore, an extraction operation can require several distinct communication variables rather than just one, and perhaps a number of fresh data variables as well. However, the basic idea remains the same, i.e., atomic actions are split into two or more separate parts using the introduced channels in order to form a model of a known
library component within the system. This embedded model is then replaced with a reference to the actual library component. Naturally, the idea of component extraction discussed above can also be used for creating new library components which are to be re-used in future design projects.
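The essence of the rule, splitting the atomic sequence S1; S; S2 into three handshaking actions over the fresh channel c, can be sketched operationally. The Python fragment below is our own illustration with hypothetical substitutions s1, s, s2; the sequential scheduling of C1 and C2 is encoded with a pc variable as in Section 2.2, and the guards force the three actions to reproduce the original order non-atomically.

def extracted(s1, s, s2):
    # (C1 ; C2) [] M over the channel c; c is initially 'ack', pc is 1.
    C1 = {"guard":  lambda st: st["P"] and st["pc"] == 1,
          "effect": lambda st: (s1(st), st.update(c="req", pc=2))}
    M  = {"guard":  lambda st: st["c"] == "req",          # library component
          "effect": lambda st: (s(st), st.update(c="ack"))}
    C2 = {"guard":  lambda st: st["c"] == "ack" and st["pc"] == 2,
          "effect": lambda st: (s2(st), st.update(pc=1))}
    return [C1, M, C2]

# The guards admit only the order C1, M, C2: the substitutions are still
# executed as s1; s; s2, but no longer within a single atomic action.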
4 Component Extraction Example
As an example of the component extraction process, consider the action system E of Section 2.1. We assume that E operates within an environment, modeled by another abstract machine, which instantiates E in its scheduling-clause as the component E(l, din, r, dout), where (l ∈ com) ∧ (r ∈ com) ∧ (dout ∈ data) ∧ (din ∈ data), and the communication variables l and r are both initialized to ack. The machine E is an abstract model of an asynchronous system. It has one passive input port l(din) and one active output port r(dout), and its behavior is the following. First, the environment selects a value for the data input din and activates the machine E by setting the channel l to the request state req. Then E computes a new value for the data output dout using the relation F. Observe that F is not explicitly specified in this generic example. The machine E sends the data dout to the environment via the channel r by setting r to the request state req. Simultaneously, an acknowledgement ack is issued through the channel l. This indicates that the environment may send new data din to E via l, but computation in E is blocked until the environment has stored the value of dout and issued an acknowledgement through r. Let us now assume that we have a library of pre-defined asynchronous components available and that we intend to implement the abstract system specification E stepwise as a composition of four components belonging to this library: register Reg, function Func, release R, and suspend Susp. Below we discuss these components and the related extraction steps separately.

4.1 Register Component

The predicate F in the operation E of the machine E refers also to the variable dout itself. In other words, the next value of dout depends on the current value of dout. Furthermore, new input data din can arrive from the environment before communication on the channel r has been completed. This indicates that a storage element, a register, is needed for the variable dout. Hence, the register component Reg, defined in Section 2.2, is extracted as the first decomposition step. As the result we obtain the machine E1, given in Sections 2.3 and 3.1, which is a refinement of the initial abstract machine E, containing Reg as a component.

REFINEMENT E1
REFINES E
EXTENDS Reg
...
SCHEDULING
  E1(l, din, r, dout) =ˆ (E11 ; E13) [] E12 [] Reg(c1, dm, c2, dout)
MAPPINGS
  E ≤ ((E11 ; E13) [] E12 [] Reg(c1, dm, c2, dout))
END
The extraction procedure splits the operation E of E into five separate parts, two of which (Reg1, Reg2) belong to the register Reg and the others (E11, E12, E13) to the refined machine E1. For this, we have introduced two fresh communication variables c1, c2 ∈ com, through which E1 and Reg communicate, and applied the transformation rule discussed in Section 3.2 twice. Furthermore, the refinement step includes the introduction of the intermediate data variable dm, which acts as the data input of the extracted register component, so that Reg carries out the copying assignment dout := dm. The machine E1 activates Reg by computing a new value for the data variable dm and setting the channel c1 to the request state in the operation E11. Then Reg copies the value of dm to the variable dout and performs a communication cycle on the channel c2, which activates E1 to send dout to the environment via the channel r in the operation E12. After the cycle on c2, Reg sets c1 to the acknowledgement state, and E1 finally executes the operation E13, where the channel l is set to the acknowledgement state.
4.2 Function Component
The second extraction step places the computation of the data variable dm, the input of Reg, in the component machine Func defined by

MACHINE Func
OPERATIONS
  b, dout ← Func1(a, din1, din2) =ˆ
    SELECT a = req THEN
      ANY dout' WHERE F(dout', din1, din2) THEN dout := dout' END || b := req
    END;
  a ← Func2(b) =ˆ SELECT b = ack THEN a := ack END
SCHEDULING
  Func(a, din1, din2, b, dout, F) =ˆ Func1 ; Func2
END
This library component has the passive input port a(din1, din2) and the active output port b(dout). Notice that also the predicate F is viewed as a parameter in the scheduling-clause. In the extraction procedure, the formal ports a(din1, din2) and b(dout) are replaced with the actual push channels c3(din, dout) and c1(dm), respectively, where c3 ∈ com is the new communication variable introduced in the transformation step. The resulting machine E2, which is a refinement of E1, is given below. In order to extract the component Func, the operations E11 and E13 of the system E1 are each split into two parts: E11 into E21 and Func1, and E13 into E23 and Func2. The system E2 activates Func by executing the operation E21, where the current values of the variables dout and din are sent to Func via the new channel c3. The function component then carries out the data assignment

ANY dm' WHERE F(dm', din, dout) THEN dm := dm' END
and sends the result data dm to the register component Reg via the channel c1, which was created in the first decomposition step.

REFINEMENT E2
REFINES E1
EXTENDS Reg, Func
VARIABLES c1, c2, c3, dm
INVARIANT c1 ∈ com ∧ c2 ∈ com ∧ dm ∈ data ∧ c3 ∈ com
DEFINITIONS F(· · ·) == . . .
INITIALISATION c1 := ack || c2 := ack || c3 := ack || dm :∈ data
OPERATIONS
  E21(l, r) =ˆ SELECT l = req ∧ r = ack THEN c3 := req END;
  r ← E22 =ˆ SELECT c2 = req THEN c2 := ack || r := req END;
  l ← E23 =ˆ SELECT c3 = ack THEN l := ack END
SCHEDULING
  E2(l, din, r, dout) =ˆ (E21 ; E23) [] E22
    [] Reg(c1, dm, c2, dout) [] Func(c3, din, dout, c1, dm, F)
MAPPINGS
  (E11 ; E13) ≤ ((E21 ; E23) [] Func(c3, din, dout, c1, dm, F)),
  E12 ≤ E22
END
4.3 Final Extraction
We complete our example by extracting two distinct library components based on the three operations of E2. The component machines are called R (release) and Susp (suspend), defined as follows:

MACHINE R
OPERATIONS
  a, b, bsy ← R1(a) =ˆ SELECT a = req THEN b := req || a := ack || bsy := true END;
  bsy ← R2(b) =ˆ SELECT b = ack THEN bsy := false END
SCHEDULING
  R(a, b, bsy) =ˆ R1 ; R2
END
MACHINE Susp
OPERATIONS
  b ← Susp1(a, bsy) =ˆ SELECT a = req ∧ ¬bsy THEN b := req END;
  a ← Susp2(b) =ˆ SELECT b = ack THEN a := ack END
SCHEDULING
  Susp(a, b, bsy) =ˆ Susp1 ; Susp2
END
They both have two communication ports: the passive port a and the active port b. Furthermore, R outputs the boolean signal bsy, setting it to true whenever a communication cycle on the active port b begins, and back to false when a cycle on b ends. The component Susp, in turn, reads the signal bsy, allowing a new communication cycle on its active port b to start only when bsy = false. In the system derivation, the formal parameters a and b of R are replaced with the
actual variables c2 and r, respectively. In the case of Susp, a and b are replaced with l and c3, respectively. The parameter bsy becomes a variable of the same name, shared by the components R and Susp. The final system E3 is then a refinement of E2, given as

REFINEMENT E3
REFINES E2
EXTENDS Reg, Func, R, Susp
VARIABLES c1, c2, c3, bsy, dm
INVARIANT c1 ∈ com ∧ c2 ∈ com ∧ c3 ∈ com ∧ bsy ∈ BOOL ∧ dm ∈ data
DEFINITIONS F(· · ·) == . . .
INITIALISATION c1 := ack || c2 := ack || c3 := ack || bsy := false || dm :∈ data
SCHEDULING
  E3(l, din, r, dout) =ˆ Reg(c1, dm, c2, dout) [] Func(c3, din, dout, c1, dm, F)
    [] R(c2, r, bsy) [] Susp(l, c3, bsy)
MAPPINGS
  ((E21 ; E23) [] E22) ≤ (Susp(l, c3, bsy) [] R(c2, r, bsy))
END
Observe that the resulting system does not have operations of its own; the functionality comes completely from the four component machines. In the final transformation, we do not insert new communication variables of the type com. Instead, a boolean variable bsy ("busy") is introduced in order to move the detection of the condition r = ack to a different location, making the extraction of R and Susp possible. Basically, the component Susp implements the operations E21 and E23 of E2. The component R, in turn, implements the operation E22. However, a new action has to be created for detecting the condition r = ack and setting bsy to false. This action is the operation R2 of R. The component R is enabled by the register component Reg via the channel c2. It initiates a communication cycle with the environment on the channel r and immediately sends an acknowledgement back to Reg, setting the introduced boolean variable bsy to true. Hence, R releases the communication cycles on the channels c1, c2, c3, and l to be completed while the output data dout is transferred from the system E3 to the environment through r. The component Susp reads the variable bsy and suspends computation on new input data din, sent by the environment via the channel l, until the component R has received an acknowledgement on r and set the control signal bsy back to false.
5 Conclusions
We have proposed an approach to component-based asynchronous system design within the B Method and its supporting tools. The main idea is to extract system components in a stepwise manner from the initial specification. At each extraction step new asynchronous communication channels are introduced. There has been a lot of work on composition and refinement in the context of formal component-based design approaches, see e.g. [2]. However, these seldom come with a practical methodology and tool support, which is our main focus here.
The B Method was used almost as is, with minor extensions mainly needed for specifying alternative scheduling strategies for actions in a scheduling-clause and for explicitly giving refinement relations in the mappings-clause. The first clause is mainly syntactic sugaring, as every scheduling could be coded in the actions of the operations-clause. The second clause is a real extension, but it, too, can easily be supported by the tools. The scheduling-clause is inspired by the work of Butler [7], who studies a more general process concept for the B Method to tie it together with the CSP approach [11]. The Refinement Calculator [22] is a tool that supports the development of correct programs within the refinement calculus framework. It is a tool for program refinement based on the HOL theorem prover. Until now, the tool does not have proper support for action systems. However, a method to prove the correctness of the implementations of asynchronous modules has been mechanized within the HOL theorem prover [16], which supports more general verification techniques than the tools studied in this paper. In the future we can envisage a situation where the verification of the low-level implementations is carried out within HOL and the high-level design and component libraries are supported by the tools of the B Method.
References

1. J.-R. Abrial. The B-Book. Cambridge University Press, Cambridge, Great Britain, 1996.
2. L. de Alfaro and T. A. Henzinger. Interface Theories for Component-based Design. In Proc. of the 1st International Workshop on Embedded Software. Springer-Verlag, 2001.
3. R. J. R. Back and R. Kurki-Suonio. Decentralization of process nets with centralized control. In Proc. of the 2nd ACM SIGACT–SIGOPS Symp. on Principles of Distributed Computing, pages 131–142, 1983.
4. R. J. R. Back and K. Sere. Stepwise refinement of action systems. Structured Programming, 12:17–30, 1991.
5. R. J. R. Back and K. Sere. From action systems to modular systems. In Proc. of FME'94: Industrial Benefit of Formal Methods, LNCS 873, pages 1–25, Barcelona, Spain, October 1994. Springer-Verlag.
6. M. M. Bonsangue, J. N. Kok, and K. Sere. Developing object-based distributed systems. In Formal Methods for Open Object-based Distributed Systems (FMOODS'99), Florence, Italy, February 1999. Kluwer Academic Publishers.
7. M. J. Butler. csp2B: A practical approach to combining CSP and B. In J. Wing, J. Woodcock and J. Davies (Eds.), Proc. of FM'99 – Formal Methods, LNCS 1708, pages 490–508, Toulouse, France, September 1999. Springer-Verlag.
8. M. J. Butler and M. Waldén. Distributed System Development in B. In H. Habrias (Ed.), Proc. of the First Conference on the B Method, pages 155–168, IRIN, Nantes, France, November 1996.
9. ClearSy. Event B Reference Manual v1., 2001.
10. E. W. Dijkstra. A Discipline of Programming. Prentice-Hall International, Englewood Cliffs, New Jersey, 1976.
11. C. A. R. Hoare. Communicating Sequential Processes. Series in Computer Science, Prentice-Hall International, 1985.
12. R. J. Lipton. Reduction: A method of proving properties of parallel programs. Communications of the ACM, 18(12):717–721, 1975.
13. D. S. Neilson and I. H. Sorensen. The B-Technologies: A system for computer aided programming. Including the B-Toolkit User's Manual, Release 3.2. B-Core (UK) Ltd., Oxford, U.K., 1996.
14. A. Peeters. Single-Rail Handshake Circuits. PhD thesis, Eindhoven University of Technology, The Netherlands, 1996.
15. J. Plosila. Self-Timed Circuit Design – The Action Systems Approach. PhD thesis, University of Turku, Turku, Finland, 1999.
16. R. Ruksenas. Tool Support for Data Refinement. Ph.D. thesis. Forthcoming.
17. T. Seceleanu. Systematic Design of Synchronous Digital Circuits. PhD thesis, Turku Centre for Computer Science (TUCS), Turku, Finland, 2001.
18. E. Sekerinski and K. Sere (Eds.). Program Development by Refinement. FACIT, Springer-Verlag, 1998.
19. K. Sere and M. Waldén. Data Refinement of Remote Procedures. Formal Aspects of Computing, Volume 12, No 4, pages 278–297, December 2000.
20. Stéria Méditerranée. Atelier B. France, 1996.
21. M. Waldén and K. Sere. Reasoning about action systems using the B-Method. In Formal Methods in System Design, Vol. 13, No 1, pages 5–35, May 1998. Kluwer Academic Publishers.
22. J. von Wright. Program refinement by theorem prover. In Proc. of the Sixth BCS-FACS Refinement Workshop, January 1994.
Composition for Component-Based Modeling

Gregor Gössler¹ and Joseph Sifakis²
¹ INRIA Rhône-Alpes, [email protected]
² VERIMAG, [email protected]
1 Introduction
Component-based engineering is of paramount importance for rigorous system design methodologies. It is founded on a paradigm which is common to all engineering disciplines: complex systems can be obtained by assembling components (building blocks). Components are usually characterized by abstractions that ignore implementation details and describe properties relevant to their composition, e.g., transfer functions, interfaces. Composition is used to build complex components from simpler ones. It can be formalized as an operation that takes in components and their integration constraints. From these, it provides the description of a new, more complex component. Component-based engineering is widely used in VLSI circuit design methodologies, supported by a large number of tools. Software and system component-based techniques have seen significant development, especially due to the use of object technologies supported by languages such as C++ and Java, and standards such as UML and CORBA. However, these techniques have not yet achieved the same level of maturity as has been the case for hardware. The main reason seems to be that software systems are immaterial and are not directly subject to the technological constraints of hardware, such as fine granularity and synchrony of execution. For software components, it is not as easy to establish a precise characterization of the service and functionality offered at their interface. Existing component technologies encompass a restricted number of interaction types and execution models, for instance, interaction by method calls under asynchronous execution. We lack concepts and tools allowing the integration of synchronous and asynchronous components, as well as different interaction mechanisms, such as communication via shared variables, signals, and rendez-vous. This is essential for modern systems engineering, where applications are initially developed as systems of interacting components, from which implementations are derived as the result of a co-design analysis. The development of a general theoretical framework for component-based engineering is one of the few grand challenges in information sciences and technologies. The lack of such a framework is the main obstacle to mastering the complexity of heterogeneous systems. It seriously limits the current state of the practice, as attested by the lack of development platforms consistently integrating design activities and the often prohibitive cost of validation.
The application of component-based design techniques raises two strongly related and hard problems. First, the development of theory for building complex heterogeneous systems. Heterogeneity lies in the different types of component interaction, such as strict (blocking) or non strict, data driven or event driven, atomic or non atomic, and in the different execution models, such as synchronous or asynchronous. Second, the development of theory for building systems which are correct by construction, especially with respect to essential and generic properties such as deadlock-freedom or progress. In practical terms, this means that the theory supplies rules for reasoning on the structure of a system and for ensuring that such properties hold globally under some assumptions about its constituents, e.g., components and connectors. Tractable correctness-by-construction results can provide significant guidance in the design process. Their lack leaves a posteriori verification of the designed system as the only means to ensure its correctness (with the well-known limitations). In this paper, we propose a framework for component-based modeling that brings some answers to the above issues. The framework uses an abstract layered model of components. It integrates and simplifies results about modeling timed systems by using timed automata with dynamic priorities [5,1]. A component is the superposition of three models: a behavioral model, an interaction model, and an execution model.

– Behavioral models describe the dynamic behavior of components.
– Interaction models describe architectural constraints on behavior. They are defined as a set of connectors and their properties. A connector is a maximal set of compatible component actions. The simultaneous occurrence of actions of a connector is an interaction.
– Execution models reduce the non-determinism resulting from parallel execution in the lower layers. They are used to coordinate the execution of threads so as to ensure properties related to the efficiency of computation, such as synchrony and scheduling.

An associative and commutative composition operator is defined on components, which preserves deadlock-freedom. The operator defines a three-layered component by composing separately the corresponding layers of its arguments. As a particular instance of the proposed framework, we consider components where behaviors are transition systems and both interaction and execution models are described by priority relations on actions. Our framework differs from existing ones such as process algebras, semantic frameworks for synchronous languages [4,11,3,17] and Statecharts [12], in two aspects. First, it distinguishes clearly between three different and orthogonal aspects of systems modeling: behavior, interaction (architecture) and execution. This distinction, apart from its methodological interest, allows solving technical problems such as associativity of a unique and powerful composition operator. The
proposed framework has concepts in common with Metropolis [2] and Ptolemy [16], where a similar separation of concerns is advocated. Second, parallel composition preserves deadlock-freedom. That is, if the arguments can perform some action from any state, then their product can do so as well. This is due to the fact that we replace restriction, or other mechanisms used to ensure strong synchronization between components, by dynamic priorities. Nevertheless, our composition is a partial operation: products must be interaction safe, that is, they must not violate strong synchronization assumptions. In that respect, our approach has some similarity to [7]. The paper is organized as follows. Section 2 discusses three requirements for composition in component-based modeling. The first is support for two main types of heterogeneity: heterogeneous interaction and heterogeneous execution. The second is that it provide results for ensuring correctness by construction for a few essential and generic system properties, such as deadlock-freedom. The third is the existence of a composition operator that allows abstraction and incremental description. Section 3 presents a general notion of composition and its properties for components with two layers: behavior and interaction models. Interaction models relate concepts from architecture (connectors) to actions performed by components via the notion of interaction. Interaction models distinguish between complete and incomplete interactions. This distinction induces the concept of interaction safety for models, meaning that only complete or maximal interactions are possible. We show associativity and commutativity of the composition operator. The section ends with a few results on correctness by construction for interaction safety of models and deadlock-freedom. Section 4 presents two examples illustrating the use of execution models. We assume that execution models can be described by priority orders. The first example shows how synchronous execution can be enforced by a priority order on the interactions between reactive components. The order respects the causality flow relation between component actions. The second example shows how scheduling policies can be implemented by an execution model. Section 5 presents concluding remarks about the presented framework.
2 Requirements for Composition

2.1 General
We consider a very simple and abstract concept of components that is sufficient for the purpose of the study. A component can perform actions from a vocabulary of actions. The behavior of a component describes the effect of its actions. A system of interacting components is a set of components integrated through various mechanisms for coordinating their execution. We assume that the overall effect of integration on the components of a system is the restriction of their behavior and it can be abstractly described by integration constraints. The latter describe the environment of a component. A component’s actions may be blocked until the environment offers actions satisfying these constraints.
We distinguish two types of integration constraints: interaction and execution constraints. Interaction constraints characterize mechanisms used in architectures, such as connectors, channels, and synchronization primitives. Interactions are the result of composition between actions. In principle, all the actions of a component are "visible" from its environment; we do not consider any specific notion of interface. Execution constraints restrict the non-determinism arising from concurrent execution, and ensure properties related to the efficiency of computation, such as synchronous execution and scheduling. There exists a variety of formalisms proposing concepts for parallel execution of sequential entities, such as process algebras (CCS [19], CSP [13]), synchronous languages (Esterel, Lustre, Statecharts), hardware description languages (VHDL), system description languages (SystemC [20], Metropolis), and more general modeling languages (SDL [14], UML [10]). In our terminology, we use the term "component" to denote any executable description whose runs can be modeled as sequences of actions. Component actions can be composed to produce interactions. Tasks, processes, threads, functions, and blocks of code can be considered as components provided they meet these requirements. The purpose of this section is to present concept requirements for composition in component-based modeling and to discuss the adequacy of existing formalisms with respect to these requirements.
2.2 Heterogeneity
Heterogeneity of Interaction. It is possible to classify existing interaction types according to the following criteria.

Interactions can be atomic or non atomic. For atomic interactions, the behavior change induced in the participating components cannot be altered through interference with other interactions. Process algebras and synchronous languages assume atomic interactions. In languages with buffered communication (SDL, UML) or in multi-threaded languages (Java), interactions are not atomic, in general. An interaction is initialized by sending a message or by calling a method, and between its initiating action and its termination, components not participating in the interaction can interfere.

Interactions can involve strict or non strict synchronization. For instance, the atomic rendez-vous of CSP is an interaction with strict synchronization in the sense that it can only occur if all the participating actions can occur. Strict synchronization can introduce deadlocks in systems of interacting deadlock-free components, that is, components that can always offer an action. If a component persistently offers an action and its environment is unable to offer a set of actions matching the interaction constraints, then there is a risk of deadlock. In synchronous languages, interactions are atomic and synchronization is non strict in the sense that output actions can occur whether or not they match with some input. Nevertheless, for inputs to be triggered, a matching output is necessary.
Finally, interactions can be binary (point-to-point) or n-ary for n ≥ 3. For instance, interactions in CCS and SDL are binary (point-to-point). The implementation of n-ary interactions by using binary interaction primitives is a non-trivial problem. Clearly, there exists no formalism supporting all these types of interaction.

Heterogeneity of Execution. There exist two well-known execution paradigms.

Synchronous execution is typically adopted in hardware, in the synchronous languages, and in many time-triggered architectures and protocols. It considers that a system run is a sequence of global steps. It assumes synchrony, meaning that the system's environment does not change during a step, or equivalently "that the system is infinitely faster than its environment". In each execution step, all the system components contribute by executing some "quantum" of computation, defined through the use of appropriate mechanisms such as timing mechanisms (clocks, timers) or a notion of stable states. For instance, in synchronous languages, an execution step is a reaction to some external stimulus obtained by propagating the reactions of the components according to a causality flow relation. A component reaction is triggered by a change of its environment and eventually terminates at some stable state for this environment. The synchronous execution paradigm has built in a very strong assumption of fairness: in each step all components execute a quantum of computation defined using either quantitative or logical time.

The asynchronous execution paradigm does not adopt any notion of a global computation step in a system's execution. It is used in languages for the description of distributed systems, such as SDL and UML, and in programming languages such as Ada and Java. The lack of a built-in mechanism for sharing resources between components is often compensated by using scheduling. This paradigm is also common to all execution platforms supporting multiple threads, tasks, etc.

Currently, there is no framework encompassing the diversity of interaction and execution models. Figure 1 classifies different system description languages in a three-dimensional space with coordinates corresponding to execution (synchronous/asynchronous) and to interaction: atomic/non atomic and strict/non-strict. It is worth noting that synchronous languages use non strict and atomic interactions. This choice seems appropriate for synchronous execution. On the contrary, for asynchronous execution there is no language using this kind of interaction.
2.3 Correctness by Construction
It is desirable that frameworks for component-based modeling provide results for establishing correctness by construction for at least a few common and generic properties such as deadlock-freedom or stronger progress properties. In practical terms, this implies the existence of inference rules for deriving system and component properties from properties of lower-level components.
Fig. 1. About composition: heterogeneity. A: atomic, S: strict interaction.
In principle, two types of rules are needed for establishing correctness by construction.

Composability rules allow one to infer that, under some conditions, a component will meet a given property after integration. These rules are essential for preserving previously established component properties across integration, for instance, to guarantee that a deadlock-free component (a component that has no internal deadlocks) will remain deadlock-free after integration. Composability is essential for incremental system construction, as it allows building large systems without disturbing the behavior of their components. It simply means stability of established component properties when the environment changes by adding or removing components. Property instability phenomena are currently poorly understood, e.g., feature interaction in telecommunications, or non-composability of scheduling algorithms. Results in composability are badly needed.

Compositionality rules allow one to infer a system's properties from its components' properties. There exists a rich body of literature for establishing correctness through compositional reasoning [15,9,8]. However, most of the existing results deal with the preservation of safety properties.
2.4 Abstraction and Incrementality
A basic assumption of component-based engineering is that components are characterized by some external specification that abstracts out internal details. However, it is often necessary to modify the components according to the context of their use, at the risk of altering their behavior. Such modifications may be necessary to adapt components to a particular type of composition. For instance, to model non strict synchronization using strict synchronization, a common transformation consists in modifying both the action vocabularies (interfaces) and the behavior of components by adding, for each action a of the interface, a
"complementary" action ā that will be executed from all the states from which a is not possible. To model strict synchronization using non strict synchronization, similar modifications are necessary (see for instance Milner's SCCS [18]). We currently lack sufficiently powerful and abstract composition operators encompassing different kinds of interaction.

Another important requirement for composition is incrementality of description. Consider systems consisting of sets of interacting components, the interaction being represented, as usual, by connectors or architectural constraints of any kind. Incrementality means that such systems can be constructed by adding or removing components and that the result of the construction is independent of the order of integration. Associative and commutative composition operators allow incrementality. Existing theoretical frameworks such as CCS, CSP, and SCCS use parallel composition operators that are associative and commutative. Nevertheless, these operators are not powerful enough. They need to be combined with other operators such as hiding, restriction, and renaming in system descriptions. The lack of a single operator destroys incrementality of description. For instance, some notations use hiding or restriction to enforce interaction between the components of a system. If the system changes by adding a new component, then some hiding or restriction operators should be removed before integrating the new component. Graphical formalisms used in modeling tools such as Statecharts or UML do not allow incremental description, as their semantics are not compositional. They are defined by functions associating with a description its meaning as a global transition system (state machine), i.e., they implicitly use n-ary composition operators (n is equal to the number of the composed components). It is always easy to define commutative composition, even in the case of asymmetric interactions. On the contrary, the definition of a single associative and commutative composition operator which is expressive and abstract enough to support heterogeneous integration remains a grand challenge.
3 Composition
We present an abstract modeling framework based on a unique binary associative and commutative composition operator. Composition operators should allow description of systems built from components that interact by respecting constraints of an interaction model. The latter characterizes a system architecture as a set of connectors and their properties. Given a set of components, composition operations allow to construct new components. We consider that the meaning of composition operations is defined by connectors. Roughly speaking, connectors relate actions of different components and can be abstractly represented as tuples or sets of actions. The related actions can form interactions (composite actions) when some conditions are met. The conditions define the meaning of the connector and say when and how the interaction can take place depending on the occurrence of the related actions.
For instance, interactions can be asymmetric or symmetric. Asymmetric interactions have an initiator (cause), which is a particular action whose occurrence can trigger the other related actions. In symmetric interactions all the related actions play the same role. The proposed composition operator differs from existing ones in automata theory and process algebras in the following respects.

– First, it preserves deadlock-freedom. This is not the case in general for existing composition operators, except in very specific cases, for instance, when from any state of the behavioral model any action offered by the environment can be accepted.
– Second, deadlock-freedom preservation is due to systematic interleaving of all the actions of the composed components, combined with the use of priority rules. The latter give preference to synchronization over interleaving. In existing formalisms allowing action interleaving in the product, such as CCS and SCCS, restriction operators are used instead of priorities to prevent the occurrence of interleaving actions. For instance, if a and ā are two synchronizing actions in CCS, their synchronization gives an invisible action τ = aā. The interleaving actions a and ā are removed from the product system by using restriction. This may introduce deadlocks at product states from which no matching actions are offered. Priority rules implement a kind of dynamic restriction and lead to a concept of "flexible" composition.
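The flavor of this flexible composition can be prototyped for finite action sets. The sketch below is our own illustration, not the paper's formal definition: from the actions currently offered by the components we form all interactions enabled within the connectors, and a maximal-progress priority rule then prunes any interaction that is a proper part of a larger enabled one, so that synchronization is preferred to interleaving without ever blocking.

from itertools import combinations

def enabled_interactions(offered, connectors):
    # All non-empty interactions enabled now: subsets of a connector
    # whose actions are all offered by the components.
    out = set()
    for c in connectors:
        ready = [a for a in c if a in offered]
        for r in range(1, len(ready) + 1):
            out.update(frozenset(s) for s in combinations(ready, r))
    return out

def maximal_progress(interactions):
    # Priority rule: drop an interaction if a strictly larger one is enabled.
    return {i for i in interactions if not any(i < j for j in interactions)}

conns = [{"send", "receive"}]
print(maximal_progress(enabled_interactions({"send", "receive"}, conns)))
# {frozenset({'receive', 'send'})}: synchronization wins over interleaving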
3.1 Interaction Models and Their Properties
Consider a set of components with disjoint vocabularies of actions Ai for i ∈ K, K a set of indices. We put A = ∪_{i∈K} Ai. A connector c is a non-empty subset of A such that ∀i ∈ K . |Ai ∩ c| ≤ 1. A connector defines a maximally compatible set of interacting actions. For the sake of generality, our definition accepts singleton connectors. The use of the connector {a} in a description is interpreted as the fact that action a cannot be involved in interactions with other actions.

Given a connector c, an interaction α of c is any term of the form α = a1|...|an such that {a1, . . . , an} ⊆ c. We assume that | is a binary associative and commutative operator, used to denote some abstract and partial action composition operation. The interaction a1|...|an is the result of the simultaneous occurrence of the actions a1, . . . , an. When α and α′ are interactions we write α|α′ to denote the interaction resulting from their composition (if it is defined). Notice that if α = a1|...|an is an interaction then any term corresponding to a sub-set of {a1, . . . , an} is an interaction. By analogy, we say that α′ is a sub-interaction of α if α = α′|α′′ for some interaction α′′. Clearly, actions are minimal interactions.

The set of the interactions of a connector c = {a1, . . . , an}, denoted by I(c), consists of all the interactions corresponding to sub-sets of c (all the sub-interactions of c). We extend the notation to sets of connectors. If C is a set
of connectors then I(C) is the set of its interactions. Clearly, for C1, C2 sets of connectors, I(C1 ∪ C2) = I(C1) ∪ I(C2).

Definition 1 (Interaction model). The interaction model of a system composed of a set of components K with disjoint vocabularies of actions Ai for i ∈ K is defined by:
– the vocabulary of actions A = ∪_{i∈K} Ai;
– the set of its connectors C such that ∪_{c∈C} c = A, and if c ∈ C then there exists no c′ ∈ C with c ⊂ c′. That is, C contains only maximal connectors;
– the set of the complete interactions I(C)+ ⊆ I(C), such that ∀b, b′ ∈ I(C), b ∈ I(C)+ and b ⊆ b′ implies b′ ∈ I(C)+.
We denote by I(C)− the set of the incomplete (non-complete) interactions.

Notice that all actions appear in some connector. The requirement that C contains only maximal sets ensures a bijective correspondence between the set of connectors C and the corresponding set of interactions I(C). Given I(C), the corresponding set of connectors is uniquely defined and is C. To simplify notation, we write IC instead of I(C).

The distinction complete/incomplete is essential for building correct models. As models are built incrementally, interactions are obtained by composing actions. It is often necessary to express the constraint that some interactions of a sub-system are not interactions of the system. This is typically the case for binary strict synchronization (rendez-vous). For example, send and receive should be considered as incomplete actions but send|receive as complete. The occurrence of send or receive alone in a system model is an error because it violates the assumption about strict synchronization made by the designer. Complete interactions can occur in a system when all the involved components are able to perform the corresponding actions.

The distinction between complete and incomplete encompasses many other distinctions such as input/output, internal/external, observable/controllable used in different formalisms. It is, in our opinion, the most relevant one concerning the ability of components to interact. Clearly, internal component actions should be considered as complete because they can be performed by components independently of the state of their environment. In some formalisms, output actions are complete (synchronous languages, asynchronous buffered communication). In some others, with strict synchronization rules, all actions participating in interactions are incomplete. In that case, it is necessary to specify which interactions are complete. For instance, if a1|a2|a3 is complete and no sub-interaction is complete, this means that a strong synchronization between a1, a2, a3 is required.

A requirement about complete interactions is closedness for containment, that is, if α is a complete interaction then any interaction containing it is complete. This requirement follows from the assumption that the occurrence of complete interactions cannot be prevented by the environment. Very often it is sufficient to consider that the interactions of IC+ are defined from a given set of complete actions A+ ⊆ A. That is, IC+ consists of all the interactions of IC where at least one complete action (element of A+) is involved.
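To make these definitions concrete, here is a small Python sketch (our own illustration, not part of the paper) that encodes actions as strings and interactions as frozensets, and checks the closedness-for-containment requirement of Definition 1 for the send/receive rendez-vous discussed above:

```python
from itertools import chain, combinations

def interactions(connector):
    """I(c): all non-empty sub-interactions of a connector c."""
    c = list(connector)
    return {frozenset(s) for s in
            chain.from_iterable(combinations(c, r) for r in range(1, len(c) + 1))}

def interactions_of(connectors):
    """I(C): the union of the interactions of the connectors in C."""
    return set().union(*(interactions(c) for c in connectors))

def closed_for_containment(complete, all_inter):
    """Definition 1: any interaction containing a complete one is complete."""
    return all(b2 in complete
               for b1 in complete for b2 in all_inter if b1 <= b2)

# Strict synchronization (rendez-vous): send|receive is complete,
# send and receive alone are incomplete.
C = [frozenset({"send", "receive"})]
IC = interactions_of(C)
IC_plus = {frozenset({"send", "receive"})}
assert closed_for_containment(IC_plus, IC)
```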
In the example of figure 2, we give sets of connectors and complete actions to define interaction models. By convention, bullets represent incomplete actions and triangles complete actions. In the partially ordered set of the interactions, full nodes denote complete interactions. The interaction between put and get, represented by the interaction put|get, is a rendez-vous, meaning that synchronization is blocking for both actions. The interaction between out and in is asymmetric, as out can occur alone even if in is not possible. Nevertheless, the occurrence of in requires the occurrence of out. The interactions between out, in1 and in2 are asymmetric. The output out can occur alone or in synchronization with any of the inputs in1, in2.
Fig. 2. Flexible composition: interaction structure.
In general, completeness of interactions need not be the consequence of the completeness of some action. For instance, consider a connector {a1, a2, a3, a4} and suppose that the set of the minimal complete interactions of I({a1, a2, a3, a4}) consists of a1|a2 and a3|a4. That is, the actions a1, a2, a3, a4 are incomplete and only interactions containing a1|a2 or a3|a4 are complete. This specification requires strict synchronization of at least one of the two pairs (a1, a2), (a3, a4).
3.2 Incremental Description of Interaction Models
Consider the interaction model IM = (IC, IC+) of a set of interacting components K with disjoint vocabularies of actions Ai for i ∈ K. IC and IC+ denote the sets of interactions and complete interactions, respectively, on the vocabulary of actions A = ∪_{i∈K} Ai.
For given K′ ⊆ K the interaction model IM[K′] of the set of interacting components K′ is defined as follows:
– A[K′] = ∪_{i∈K′} Ai is the vocabulary of actions of IM[K′];
– C[K′] = {c′ | ∃c ∈ C . c′ = c ∩ A[K′] ∧ ¬∃c′′ ∈ C . c′ ⊂ c′′ ∩ A[K′]} is the set of the connectors of IM[K′];
– IM[K′] = (IC[K′], IC[K′]+) is the interaction model of K′, where IC[K′] is the set of the interactions of C[K′] and IC[K′]+ = IC[K′] ∩ IC+.

Definition 2. Given a family K1, . . . , Kn of disjoint sets of components, subsets of K, denote by C[K1, . . . , Kn] the set of the connectors having at least one action in each set, that is, C[K1, . . . , Kn] = {c = c1 ∪ · · · ∪ cn | ∀i ∈ [1, n] . ci ∈ C[Ki]}.

Clearly, C[K1, . . . , Kn] is the set of the connectors of IM[K1 ∪ · · · ∪ Kn] which are not connectors of IM[K′] for any subset K′ of at most n − 1 elements from {K1, . . . , Kn}.

Proposition 1. Given K1, K2 two disjoint subsets of K:
IC[K1 ∪ K2] = IC[K1] ∪ IC[K2] ∪ IC[K1, K2]
IC[K1 ∪ K2]+ = IC[K1]+ ∪ IC[K2]+ ∪ IC[K1, K2]+
IM[K1 ∪ K2] = (IC[K1 ∪ K2], IC[K1 ∪ K2]+) = IM[K1] ∪ IM[K2] ∪ IM[K1, K2]
where IC[K1, K2]+ = IC[K1, K2] ∩ IC+.

Proof. The first equality comes from the fact that C[K1] ∪ C[K2] ∪ C[K1, K2] contains all the connectors of C[K1 ∪ K2] and other connectors that are not maximal. By definition, IC contains all the sub-sets of C. Thus, IC[K1 ∪ K2] = I(C[K1] ∪ C[K2] ∪ C[K1, K2]) = IC[K1] ∪ IC[K2] ∪ IC[K1, K2].

Remark 1. The second equality says that the same interaction cannot be complete in an interaction model IM[K1] and incomplete in IM[K2], for K1, K2 ⊆ K.

This proposition provides a basis for computing the interaction model IM[K1 ∪ K2] from the interaction models IM[K1] and IM[K2] and from the interaction model of the connectors relating components of K1 and components of K2.

Property 1. For K1, K2, K3 three disjoint subsets of K:
IC[K1 ∪ K2, K3] = IC[K1, K3] ∪ IC[K2, K3] ∪ IC[K1, K2, K3]
IM[K1 ∪ K2, K3] = IM[K1, K3] ∪ IM[K2, K3] ∪ IM[K1, K2, K3]

Proof. The first equality comes from the fact that C[K1, K3] ∪ C[K2, K3] ∪ C[K1, K2, K3] contains all the connectors of C[K1 ∪ K2, K3] and, in addition, other connectors that are not maximal. By definition, IC contains all the sub-sets of C. Thus, IC[K1 ∪ K2, K3] = I(C[K1, K3] ∪ C[K2, K3] ∪ C[K1, K2, K3]), from which we get the result by distributivity of I over union.
This property allows computing the connectors, and thus the interactions, between IM[K1 ∪ K2] and IM[K3] in terms of the interactions between IM[K1], IM[K2], and IM[K3]. By using this property, we get the following expansion formula:

Proposition 2 (Expansion formula). IM[K1 ∪ K2 ∪ K3] = IM[K1] ∪ IM[K2] ∪ IM[K3] ∪ IM[K1, K2] ∪ IM[K1, K3] ∪ IM[K2, K3] ∪ IM[K1, K2, K3].
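The projection C[K′] is also easy to compute; a minimal sketch under the same encoding as before (the action names are hypothetical):

```python
def project_connectors(C, A_sub):
    """C[K']: the maximal non-empty intersections of the connectors in C
    with the action vocabulary A[K'] of the sub-system."""
    cands = {frozenset(c) & A_sub for c in C} - {frozenset()}
    return {c for c in cands if not any(c < other for other in cands)}

# Hypothetical system: one ternary connector and one binary connector.
C = [frozenset({"a1", "a2", "a3"}), frozenset({"a1", "a3"})]
print(project_connectors(C, frozenset({"a1", "a2"})))   # {frozenset({'a1','a2'})}
print(project_connectors(C, frozenset({"a1", "a3"})))   # {frozenset({'a1','a3'})}
```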
3.3 Composition Semantics and Properties
We consider that a system S is a pair S = (B, IM) where B is the behavior of S and IM is its interaction model. As in the previous section, IM is the interaction model of a set of interacting components K with disjoint action vocabularies Ai, i ∈ K. For given K′ ⊆ K, we denote by S[K′] the sub-system of S consisting of the components of K′, S[K′] = (B[K′], IM[K′]), where IM[K′] is defined as before.

We define a composition operator ∥ allowing us to obtain, for disjoint sub-sets K1, K2 of K, the system S[K1 ∪ K2] as the composition of the sub-systems S[K1], S[K2] for a given interaction model IM[K1, K2] connecting the two sub-systems. The operator composes separately the behavior and interaction models of the sub-systems.

Definition 3. The composition of two systems S[K1] and S[K2] is the system S[K1 ∪ K2] = (B[K1], IM[K1]) ∥ (B[K2], IM[K2]) = (B[K1] × B[K2], IM[K1] ∪ IM[K2] ∪ IM[K1, K2]), where × is a binary associative behavior composition operator such that B[K1] × B[K2] = B[K1 ∪ K2].

Due to proposition 1 we have (B[K1], IM[K1]) ∥ (B[K2], IM[K2]) = (B[K1 ∪ K2], IM[K1 ∪ K2]), which means that composition of sub-systems gives the system corresponding to the union of their components. Notice that under these assumptions composition is associative:

(B[K1], IM[K1]) ∥ (B[K2], IM[K2]) ∥ (B[K3], IM[K3])
= (B[K1 ∪ K2], IM[K1 ∪ K2]) ∥ (B[K3], IM[K3])
= (B[K1] × B[K2] × B[K3], IM[K1 ∪ K2] ∪ IM[K3] ∪ IM[K1 ∪ K2, K3])
= (B[K1 ∪ K2 ∪ K3], IM[K1 ∪ K2 ∪ K3])

by application of proposition 2.

Transition Systems with Priorities. As a rule, interaction models constrain the behaviors of integrated components. We consider the particular case where interactions are atomic, component behaviors are transition systems, and the constraints are modeled as priority orders on interactions. Transition systems with dynamic priorities have already been studied and used to model timed systems. The interested reader can refer to [6,1].
Fig. 3. The composition principle.
Definition 4 (Transition system). A transition system B is a triple (Q, I(A), →) where Q is a set of states, I(A) is a set of interactions on the action vocabulary A, and → ⊆ Q × I(A) × Q is a transition relation.

As usual, we write q1 −α→ q2 instead of (q1, α, q2) ∈ →.

Definition 5 (Transition system with priorities). A transition system with priorities is a pair (B, ≺) where B is a transition system with set of interactions I(A), and ≺ is a priority order, that is, a strict partial order on I(A).

Semantics: A transition system with priorities represents a transition system: if B = (Q, I(A), →), then (B, ≺) represents the transition system B′ = (Q, I(A), →′) such that q1 −α→′ q2 if q1 −α→ q2 and there exists no α′ and q3 such that α ≺ α′ and q1 −α′→ q3.

Definition 6 (⊕). The sum ≺1 ⊕ ≺2 of two priority orders ≺1, ≺2 is the least priority order (if it exists) such that ≺1 ∪ ≺2 ⊆ ≺1 ⊕ ≺2.

Lemma 1. ⊕ is a (partial) associative and commutative operator.
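The semantics of Definition 5 is directly executable. In the following minimal sketch (ours), a transition relation is a set of triples and a priority order a set of (lower, higher) pairs:

```python
def restrict(transitions, prec):
    """(B, ≺): keep q1 -α-> q2 only if no interaction with higher priority
    than α is enabled at q1."""
    def enabled(q):
        return {a for (p, a, _) in transitions if p == q}
    return {(q1, a, q2) for (q1, a, q2) in transitions
            if not any((a, b) in prec for b in enabled(q1))}
```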
Definition 7 (∥). Consider a system S[K] with interaction model IM[K] = (IC[K], IC[K]+). Let S[K1] = (B[K1], ≺1) and S[K2] = (B[K2], ≺2) with disjoint K1 and K2 be two sub-systems of S[K] such that their priority orders do not allow domination of complete interactions by incomplete ones, that is, for all α1 ∈ IC[K]+ and α2 ∈ IC[K]−, ¬(α1 ≺ α2). The composition operator ∥ is defined as follows. If Bi = (Qi, IC[Ki], →i) for i = 1, 2, then S[K1] ∥ S[K2] = (B1 × B2, ≺1 ⊕ ≺2 ⊕ ≺12), where B1 × B2 = (Q1 × Q2, IC[K1 ∪ K2], →12) with

q1 −α→1 q1′ implies (q1, q2) −α→12 (q1′, q2)
q2 −α→2 q2′ implies (q1, q2) −α→12 (q1, q2′)
q1 −α1→1 q1′ and q2 −α2→2 q2′ implies (q1, q2) −α1|α2→12 (q1′, q2′) if α1|α2 ∈ IC[K1 ∪ K2].
≺12 is the minimal priority order on IC[K1 ∪ K2] such that
– α1 ≺12 α1|α2 for α1|α2 ∈ IC[K1, K2] (maximal progress priority rule);
– α1 ≺12 α2 for α1 ∈ IC[K1 ∪ K2]−− and α2 ∈ IC[K1 ∪ K2]+ (completeness priority rule), where IC[K1 ∪ K2]−− denotes the elements of IC[K1 ∪ K2]− that are non-maximal in IC[K1 ∪ K2].

The first priority rule favors the largest interaction. The second ensures correctness of the model: it prevents the occurrence of incomplete interactions if they are not maximal. The occurrence of such interactions in a model is a modeling error. If a component can perform a complete action, all non-maximal interactions of the other components are prevented. By executing complete actions the components may reach states from which a maximal incomplete interaction is possible.

Proposition 3. ∥ is a total, commutative and associative operator.

Proof. Total operator: prove that for K1 ∩ K2 = ∅, ≺1 ⊕ ≺2 ⊕ ≺12 is a priority order, that is, the transitive closure of the union of ≺1, ≺2, and ≺12 does not have any circuits. The maximal progress priority rule defines a priority order identical to the set inclusion partial order, and is thus circuit-free. The completeness priority rule relates incomplete and complete interactions and is circuit-free, too. The only source of a priority circuit could be the existence of interactions α1, α2, α3 ∈ IC[K1 ∪ K2] such that α1 = α2|α3, α1 ∈ IC[K1 ∪ K2]−−, and α2 ∈ IC[K1 ∪ K2]+. This is impossible due to the monotonicity requirement of definition 1.

Associativity:
(B[K1], ≺1) ∥ (B[K2], ≺2) ∥ (B[K3], ≺3)
= (B[K1 ∪ K2], ≺1 ⊕ ≺2 ⊕ ≺12) ∥ (B[K3], ≺3)
= (B[K1 ∪ K2 ∪ K3], ≺1 ⊕ ≺2 ⊕ ≺12 ⊕ ≺3 ⊕ ≺[12],3)

where ≺[12],3 is the least priority order defined by
Fig. 4. Composition: producer/consumer.
– α1 ≺[12],3 α1|α2 for α1|α2 ∈ IC[K1 ∪ K2, K3], and
– α1 ≺[12],3 α2 for α1 ∈ IC[K1 ∪ K2 ∪ K3]−− and α2 ∈ IC[K1 ∪ K2 ∪ K3]+.

It can be shown that the order ≺ = ≺12 ⊕ ≺[12],3 is the one defined by
– α1 ≺ α1|α2 for α1|α2 ∈ IC[K1, K2] ∪ IC[K1, K3] ∪ IC[K2, K3] ∪ IC[K1, K2, K3], and
– α1 ≺ α2 for α1 ∈ IC[K1 ∪ K2 ∪ K3]−− and α2 ∈ IC[K1 ∪ K2 ∪ K3]+.

So the resulting priority order is the same independently of the order of composition.

Example 1. Consider the system consisting of a producer and a consumer. The components interact by rendez-vous: the actions put and get are incomplete. We assume that the actions prod and cons are internal and thus complete. Figure 4 gives the interaction model corresponding to these assumptions. The product system consists of the product transition system and the priority order defined from the interaction model. The priority order removes all incomplete actions (crossed transitions).
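The product construction of Definition 7 can be sketched as follows, applied to Example 1 (the state names and the two cyclic behaviors are our guesses at the content of figure 4; the priority layer is then applied with a restriction function like the one sketched after Definition 6):

```python
def states(B):
    return {q for (q, _, _) in B} | {q for (_, _, q) in B}

def flexible_product(B1, B2, IC):
    """B1 x B2: interleave all interactions, and add a synchronized move
    a1|a2 (modeled as a frozenset) whenever it is an interaction of the
    composed system."""
    prod = set()
    for (q1, a, r1) in B1:
        for q2 in states(B2):
            prod.add(((q1, q2), frozenset({a}), (r1, q2)))
    for (q2, a, r2) in B2:
        for q1 in states(B1):
            prod.add(((q1, q2), frozenset({a}), (q1, r2)))
    for (q1, a1, r1) in B1:
        for (q2, a2, r2) in B2:
            if frozenset({a1, a2}) in IC:
                prod.add(((q1, q2), frozenset({a1, a2}), (r1, r2)))
    return prod

# Producer and consumer with internal actions prod/cons and rendez-vous put|get.
producer = {("p0", "prod", "p1"), ("p1", "put", "p0")}
consumer = {("c0", "get", "c1"), ("c1", "cons", "c0")}
product = flexible_product(producer, consumer, {frozenset({"put", "get"})})
# The maximal progress rule (put ≺ put|get, get ≺ put|get) and the
# completeness rule then remove the interleaved put and get transitions.
```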
3.4 Correctness by Construction
We present results that allow checking the correctness of models with respect to two properties: interaction safety and deadlock-freedom.
Interaction Safety of the Model. As explained in section 3.1, the distinction between complete and incomplete interactions is essential for building correct models. In existing formalisms, undesirable incomplete interactions are pruned out by applying restriction operators to the model obtained as the product of components [19]. In our approach, we replace restriction by priorities. This allows deadlock-freedom preservation: if an interaction is prevented from occurring, then some interaction of higher priority takes over. Nevertheless, it is necessary to check that our “flexible” composition operator does not allow illegal incomplete actions in a system model. For this we introduce a notion of correctness called interaction safety. Interaction safety is a property that must be satisfied by system models at any stage of integration.

Notice however that the legality of incomplete interactions depends on the set of integrated components. Sub-systems of a given system may perform incomplete interactions that are not legal interactions of the system. For instance, consider a system consisting of three components with a connector {a1, a2, a3} such that all its interactions are incomplete. The interaction a1|a2 is legal in the sub-system consisting of the first two components, while it is illegal in the system. In the latter, a1|a2 is incomplete and non-maximal: it must synchronize with a3 to produce the maximal incomplete interaction a1|a2|a3. For a given system, only complete and maximal incomplete interactions are considered as legal.

Definition 8 (Interaction safety). Given an interaction model IM = (IC, IC+), define the priority order ≺ on incomplete interactions such that α1 ≺ α2 if α1 ∈ IC−− and α2 ∈ IC− \ IC−−. A system with interaction model IM is interaction safe if its restriction with ≺ can perform only complete or maximal incomplete interactions.

Notice that the rule defining the priority order ≺ is similar to the completeness priority rule of definition 7. For a given system, incomplete interactions that are maximal in IC have the same status as complete interactions with respect to non-maximal incomplete interactions. Nevertheless, the priority order ≺ depends on the considered system, as the legality of incomplete actions depends on the interaction model considered. We give below results for checking whether a model is interaction safe.

Dependency graph: Consider a system S[K] consisting of a set of interacting components K with interaction model IM = (IC, IC+). For c ∈ C (C is the set of the connectors of IC) we denote by I+min(c) the set of the minimal complete interactions of c, and write I+min(C) for ∪_{c∈C} I+min(c).

The dependency graph of S[K] is a labelled bipartite graph with two sets of nodes: the components of K, and nodes labelled with elements of the set {(c, α(c)) | c ∈ C ∧ I+min(c) = ∅} ∪ {(c, α) | c ∈ C ∧ α ∈ I+min(c)}, where α(c) is the maximal interaction of c (involving all the elements of c). The edges are labelled with actions of A as follows. Let (c, α) = ({a1, . . . , an}, α) be a node of the graph and assume that, for an action ai of c, owner(ai) ∈ K is the component which is the owner of action ai. For
all actions ai of c occurring in α, add an edge labelled with ai from owner(ai) to (c, α). For all actions ai of c, add an edge labelled with ai from (c, α) to owner(ai) if ai is offered in some incomplete state of owner(ai), that is, a state in which no complete or maximal action is offered.

The graph encodes the dependency between interacting actions of the components in the following manner. If a component has an input edge labelled ai from a node ({a1, . . . , an}, α), then for ai to occur in some interaction of {a1, . . . , an} containing α it is necessary that all the actions labelling input edges of ({a1, . . . , an}, α) interact. We call a circuit in the dependency graph non-trivial if it encompasses more than one component node.

Example 2 (Producer/consumer). Consider a producer providing data to two consumers. Interaction is by rendez-vous and takes place if at least one of the two consumers can get an item. The interaction model is described by C = {{put, get1, get2}} and IC+ = {put|get1, put|get2, put|get1|get2}. The dependency graph is shown in figure 5.
Fig. 5. Dependency graph for the producer/two consumer example.
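The construction can be prototyped as follows; this is only a skeleton (ours), with the behavioral side-condition for backward edges abstracted into a stub predicate:

```python
def dependency_graph(C, I_min_plus, owner, offered_in_incomplete_state):
    """Bipartite dependency graph: component nodes and interaction nodes
    (c, alpha). C: set of connectors (frozensets); I_min_plus: connector ->
    set of its minimal complete interactions; owner: action -> component."""
    inter_nodes = set()
    for c in C:
        alphas = I_min_plus.get(c) or {frozenset(c)}   # fall back to alpha(c)
        inter_nodes |= {(c, a) for a in alphas}
    edges = set()
    for (c, alpha) in inter_nodes:
        for a in alpha:                      # component --a--> interaction node
            edges.add((owner[a], (c, alpha), a))
        for a in c:                          # interaction node --a--> component
            if offered_in_incomplete_state(owner[a], a):
                edges.add(((c, alpha), owner[a], a))
    return inter_nodes, edges
```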
Definition 9 (Cooperativity). Let a and b be labels of input and output edges of a component k in the dependency graph of S[K]. We say that a component k ∈ K is cooperative with respect to (a, b) if from any state of B[k] with a transition labelled a there exists a transition labelled b. k ∈ K is cooperative in a circuit γ in the dependency graph if it is cooperative wrt. (a, b), where a and b are the labels of the arcs of γ entering and leaving k, respectively.

Theorem 1 (Interaction safety). A system model is interaction safe if its dependency graph contains a non-empty sub-graph G such that (1) G contains all
its predecessors, (2) any component in G is deadlock-free, and in any elementary circuit γ of G, either (3a) there exists a component k that is cooperative in γ and whose successor node in γ is a binary interaction, or (3b) the set of components k in γ whose successor node is not a binary interaction is not empty, and all components in this set are cooperative in γ.

Proof. Assume that the system is in an incomplete state, that is, a state from which only incomplete actions are possible. Then each component in G offers some incomplete action, since it is deadlock-free. We consider the sub-graph G′ of G that represents dependencies in the current state: G′ has an edge from an interaction node (c, α) to a component node k if k is actually waiting for α in the current state; G′ has the same edges from component to interaction nodes as G. G′ has the same set of components as G, since any component of G is awaiting at least one incomplete action.

If according to (3a) one of the components k is cooperative in some non-trivial elementary circuit γ of G′, and the successor node (c, α) of k in γ is a binary interaction, then k and the successor of (c, α) can interact via the complete or maximal interaction α. Otherwise, all non-trivial circuits in G′ satisfy condition (3b). Let k be some component in a strongly connected sub-graph of G′ not having any predecessors. Such a sub-graph exists since any component is a node of some non-trivial circuit. Let γ be a non-trivial circuit in G′ containing k, and consider some non-binary interaction node (c, α) in γ. Let k′ be an arbitrary predecessor node of (c, α) in G′. By the choice of k, k′ and (c, α) are in some non-trivial circuit γ′ of G′. γ′ satisfies (3b), which implies that k′ is cooperative in γ′. That is, all predecessors of (c, α) are cooperative, such that the complete or maximal interaction α is enabled.

In both cases, at least one complete or maximal interaction is enabled, which means that any non-maximal incomplete interaction is disabled in (B, ≺). Intuitively, the hypotheses of Theorem 1 make sure that any circular dependency between the occurrences of strict interactions is broken by some cooperative component.

Notice that by definition components are cooperative with respect to (a, a) for any action a. If the dependency graph has a backwards-closed sub-graph all of whose elementary circuits are self-loops with the same label then the model is interaction safe.

Example 3 (Producer/consumer). For example 2, the only sub-graph G satisfying the backward closure requirement is the whole dependency graph. Let n1 = ({put, get1, get2}, put|get1) and n2 = ({put, get1, get2}, put|get2). ΓG contains two non-trivial elementary circuits γ1 = (producer, n1, consumer2, n2) and γ2 = (producer, n2, consumer1, n1). Since the producer is trivially cooperative wrt. the pair (put, put), condition (3a) is satisfied. If all three components are deadlock-free, the system is interaction safe.
Deadlock-Freedom. We give some results about deadlock-freedom preservation for transition systems with priorities. Similar results have been obtained for timed transition systems with priorities in [5].

Definition 10 (Deadlock-freedom). A transition system is called deadlock-free if it has no sink states. A system is deadlock-free if the transition system with priorities representing it is deadlock-free.

Proposition 4 (Composability). Deadlock-freedom is preserved by priority orders, that is, if B is deadlock-free then (B, ≺) is deadlock-free for any priority order ≺.

Proposition 5 (Compositionality). Deadlock-freedom is preserved by composition, that is, if (B1, ≺1) and (B2, ≺2) are deadlock-free then (B1, ≺1) ∥ (B2, ≺2) is deadlock-free.

Proof. Follows from the fact that composition of behaviors preserves deadlock-freedom and from the previous proposition.

Proposition 6. Any system obtained by composition of deadlock-free components is deadlock-free.
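Deadlock-freedom itself is just the absence of sink states, so the propositions are easy to check experimentally with the earlier sketches:

```python
def deadlock_free(transitions, all_states):
    """Definition 10: every state has at least one outgoing transition."""
    sources = {q for (q, _, _) in transitions}
    return all(q in sources for q in all_states)

# Proposition 4, informally: restrict() never removes *all* transitions
# from a state, because among the finitely many enabled interactions a
# ≺-maximal one always survives.
```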
4 Execution Model
Execution models constitute the third layer. They implement constraints which, superposed on the interaction constraints, further restrict the behavior of a system by reducing non-determinism. They differ from interaction models from a pragmatic point of view. Interaction models restrict behavior so as to meet global functional properties, especially properties ensuring harmonious cooperation of components and integrity of resources. Execution models restrict behavior so as to meet global performance and efficiency properties. They are often timed and specific to execution platforms. In that case, they describe scheduling policies which coordinate system activities by taking into account the dynamics of both the execution platform and the system’s environment. We assume that execution models are also described by priority orders, and discuss two interesting uses of execution models.

Asynchronous vs. Synchronous Execution. As explained in 2.2, synchronous execution adopts a very strong fairness assumption, as in all computation steps components are offered the possibility to execute some quantum of computation. Our thesis is that synchronous execution can be obtained by appropriately restricting the first two layers. Clearly, it is possible to build synchronous systems by using specific interaction models to compose behaviors. This is the case for Statecharts, and for synchronous languages whose semantics
use parallel composition operators combined with unary restriction operators [17]. Nevertheless, their underlying interaction model uses non-strict interaction and specific action composition laws which are not adequate for asynchronous execution.

In the proposed framework, systems consisting of the first two layers are not synchronous, in general. Interactions between components may be loose: components keep running until they reach some state from which they offer a strongly synchronizing action. Thus, executions are rich in non-determinism resulting from the independence of computations performed in the components. This is the case for formalisms with point-to-point interaction, such as SDL and UML. We believe that it is possible to define synchronous execution semantics for appropriate sub-sets of asynchronous languages. Clearly, these sub-sets should include only reactive components, that is, components with distinct input and output actions such that when an input occurs some output(s) eventually occur. The definition of synchronous execution semantics for asynchronous languages is an interesting and challenging problem.

Consider the example of figure 6, a system which is the serial composition of three strongly synchronized components with inputs ij and outputs oj, j = 1, 2, 3. Assume that the components are reactive in the sense that they are triggered from some idle (stable) state when an input arrives and eventually produce an output before reaching some idle state from where a new input can be accepted. For the sake of simplicity, components have simple cyclic behaviors alternating inputs and outputs. The interaction model is specified by {o1, i2, o2, i3} ≺ {i1, o3}, {o1, i2} ≺ o1|i2, {o2, i3} ≺ o2|i3. That is, we assume that i1 and o3 are complete, as the system is not connected to any environment. In the product of the behaviors restricted with the interaction model, each component can perform computation independently of the others provided the constraints resulting from the interaction model are met. This corresponds to asynchronous execution.

The behavior of the two layers can be further constrained by an execution model to become synchronous, in the sense that a run of the system is a sequence of steps, each step corresponding to the treatment of an input i1 until an output o3 is produced. This can be easily enforced by the order i1 ≺ o1|i2 ≺ o2|i3 ≺ o3. This order reflects the causality order between the interactions of the system. In fact, if all the components are at some idle state then all the components are waiting for an input. Clearly, only i1 can occur, making the first component evolve to a state from which o1|i2 can occur. This will trigger successively o2|i3 and finally o3. Notice that i1 cannot be executed as long as a computation takes place in some component.
Scheduling Policies as Execution Models. We have shown in [1] that general scheduling policies can be specified as timed priority orders. The following example illustrates this idea for untimed systems.
Fig. 6. Enforcing synchronous execution.
We model fixed-priority scheduling with pre-emption for n processes sharing a common resource (figure 7). The scheduler gives preference to low-index processes. The states of the i-th process are si (sleeping), wi (waiting), ei (executing), and ei′ (pre-empted). The actions are ai (arrival), bi (begin), fi (finish), pi (preempt), ri (resume). To ensure mutual exclusion between execution states ei, we assume that begin actions bj are complete and synchronize with pi for all 1 ≤ i, j ≤ n, i ≠ j. By the maximal progress priority rule, an action bj cannot occur if some interaction bj|pi is possible. Similarly, we assume that finish actions fj are complete and synchronize with ri for all 1 ≤ i, j ≤ n, i ≠ j. An action fj cannot occur if some interaction fj|ri is possible.

The system is not interaction safe, since the structural properties of theorem 1 cannot exclude the case where the system is in the incomplete state (e1′, . . . , en′), that is, all processes are pre-empted. However, this is the only incomplete state of the system, and it is easy to show that it is not reachable from any other state: as all actions pi are incomplete, they are disabled by the completeness priority rule of definition 7 giving priority to the complete actions. Interactions bi|pj are complete but keep component i, and thus the whole system, in a complete state. Therefore, initialized in a complete state the system always remains in a complete state, where interaction safety is guaranteed.

Scheduling constraints resolve conflicts between processes (bi and ri actions) competing for the acquisition of the common resource. They can be implemented
Fig. 7. Fixed-priority preemptive scheduling of processes.
by adding a third layer with the priority rules bi ≺ bj, bi|pk ≺ bj|pk, and fk|ri ≺ fk|rj for all k, and i > j. It is easy to check that these constraints preserve mutual exclusion, in the sense that if the initial state respects mutual exclusion then mutual exclusion holds at any reachable state. Notice that as the components are deadlock-free and the composition of the interaction and execution priority orders is a priority order, the obtained model is deadlock-free.
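The third-layer rules can be generated mechanically; a sketch with our own string encoding of interactions:

```python
def scheduling_priorities(n):
    """Pairs (lower, higher) encoding bi ≺ bj, bi|pk ≺ bj|pk and
    fk|ri ≺ fk|rj for all k and all i > j (low indices win)."""
    rules = set()
    for i in range(1, n + 1):
        for j in range(1, i):                # j < i, i.e. i > j
            rules.add((f"b{i}", f"b{j}"))
            for k in range(1, n + 1):
                rules.add((f"b{i}|p{k}", f"b{j}|p{k}"))
                rules.add((f"f{k}|r{i}", f"f{k}|r{j}"))
    return rules
```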
5 Discussion
The paper proposes a framework for component composition encompassing heterogeneous interaction and execution. The framework uses a single powerful associative and commutative composition operator for layered components. Component layering seems to be instrumental for defining such an operator. Existing formalisms combine at the same level behavior composition and unary restriction operators to achieve interaction safety. Layered models allow separation of concerns: behaviors and restrictions (represented by priority orders) are composed separately. This makes the definition of a single associative operator technically possible.

Interaction models describe architectural constraints on component behavior. Connectors relate interacting actions of different components; they naturally define the set of interactions of a system. The distinction between complete and incomplete interactions is essential for the unification of existing interaction
mechanisms. It induces the property of interaction safety, characterizing the correctness of a model with respect to modeling assumptions about the possibility for interactions to occur independently of their environment. Such assumptions are implicit in existing formalisms. Their satisfaction is enforced on models at the risk of introducing deadlocks. The proposed composition operator preserves deadlock-freedom, and Theorem 1 can be used to check interaction safety of models.

The distinction between interaction and execution models is an important one from a methodological point of view. Priority orders are a powerful tool for describing the two models. Their use leads to a semantic model consisting of behaviors and priorities which is amenable to correctness by construction. This is due to the fact that priorities are restrictions that do not introduce deadlocks into an initially deadlock-free system. More results about deadlock-freedom and liveness preservation can be found in [5].
References

1. K. Altisen, G. Gössler, and J. Sifakis. Scheduler modeling based on the controller synthesis paradigm. Journal of Real-Time Systems, special issue on "control-theoretical approaches to real-time computing", 23(1/2):55–84, 2002.
2. F. Balarin, L. Lavagno, C. Passerone, A. Sangiovanni-Vincentelli, M. Sgroi, and Y. Watanabe. Modeling and Designing Heterogeneous Systems, volume 2549 of LNCS, pages 228–273. Springer-Verlag, 2002.
3. A. Benveniste, P. LeGuernic, and Ch. Jacquemot. Synchronous programming with events and relations: the SIGNAL language and its semantics. Science of Computer Programming, 16:103–149, 1991.
4. G. Berry and G. Gonthier. The ESTEREL synchronous programming language: Design, semantics, implementation. Science of Computer Programming, 19(2):87–152, 1992.
5. S. Bornot, G. Gössler, and J. Sifakis. On the construction of live timed systems. In S. Graf and M. Schwartzbach, editors, Proc. TACAS'00, volume 1785 of LNCS, pages 109–126. Springer-Verlag, 2000.
6. S. Bornot and J. Sifakis. An algebraic framework for urgency. Information and Computation, 163:172–202, 2000.
7. L. de Alfaro and T.A. Henzinger. Interface theories for component-based design. In T.A. Henzinger and C. M. Kirsch, editors, Proc. EMSOFT'01, volume 2211 of LNCS, pages 148–165. Springer-Verlag, 2001.
8. W.-P. de Roever, F. de Boer, U. Hannemann, J. Hooman, Y. Lakhnech, M. Poel, and J. Zwiers. Concurrency Verification: Introduction to Compositional and Noncompositional Methods. Cambridge University Press, 2001.
9. W.-P. de Roever, H. Langmaack, and A. Pnueli, editors. Compositionality: The Significant Difference, volume 1536 of LNCS. Springer-Verlag, 1997.
10. OMG Working Group. Response to the OMG RFP for schedulability, performance, and time. Technical Report ad/2001-06-14, OMG, June 2001.
11. N. Halbwachs, P. Caspi, P. Raymond, and D. Pilaud. The synchronous dataflow programming language LUSTRE. Proceedings of the IEEE, 79(9):1305–1320, September 1991.
12. D. Harel. Statecharts: A visual formalism for complex systems. Science of Computer Programming, 8:231–274, 1987.
13. C. A. R. Hoare. Communicating Sequential Processes. Prentice Hall, 1985.
14. ITU-T. Recommendation Z.100. Specification and Design Language (SDL). Technical Report Z-100, International Telecommunication Union, Standardization Sector, Geneva, 1999.
15. L. Lamport. Specifying concurrent program modules. ACM Trans. on Programming Languages and Systems, 5:190–222, 1983.
16. E.A. Lee et al. Overview of the Ptolemy project. Technical Report UCB/ERL M01/11, University of California at Berkeley, 2001.
17. F. Maraninchi. Operational and compositional semantics of synchronous automaton compositions. In Proc. CONCUR, volume 630 of LNCS. Springer-Verlag, 1992.
18. R. Milner. Calculi for synchrony and asynchrony. Theoretical Computer Science, 25(3):267–310, 1983.
19. R. Milner. Communication and Concurrency. Prentice Hall, 1989.
20. SystemC. http://www.systemc.org.
Games for UML Software Design

Perdita Stevens and Jennifer Tenzer

Software Engineering Programme and Laboratory for Foundations of Computer Science, School of Informatics, University of Edinburgh
[email protected], Fax: +44 131 667 7209
[email protected], Fax: +44 131 667 7209
Abstract. In this paper we introduce the idea of using games as a driving metaphor for design tools which support designers working in UML. We use as our basis a long strand of work in verification and elsewhere. A key difference from that strand, however, is that we propose the incremental development of the rules of a game as part of the design process. We will argue that this approach may have two main advantages. First, it provides a natural means for tools to interactively help the designer to explore the consequences of design decisions. Second, by providing a smooth progression from informal exploration of decisions to full verification, it has the potential to lower the commitment cost of using formal verification. We discuss a simple example of a possible game development.
1 Introduction
The Unified Modeling Language (UML) [10] is a widely adopted standard for modelling object-oriented software systems. It consists of several diagram types providing different views of the system model. UML is a semi-formal language, defined by a combination of UML class diagrams, natural language, and formal constraints written in the object constraint language (OCL). There are many tools available which support, or claim to support, design with UML. They aid in drawing UML diagrams, generating code fragments in different object-oriented languages, and documenting software systems. Some of them are able to perform consistency checks, for example checking the accessibility of referenced packages. However, these features seem to be useful for the recording and verifying of a chosen design, rather than for the design activity itself. There is nothing available to the mainstream object-oriented business software developer which will actively help him or her to explore different design decisions and work out which is the best option.

It is natural to look to verification to fill this gap. Ideally, a designer would be able to make use of verification technology whenever s/he is faced with a difficult decision, say between two design solutions. S/he might take two models representing the design with each of the solutions applied,
together with some desired properties, and check in each case whether the desired properties are fulfilled. Then s/he would adopt the design that best met the requirements.

Unfortunately this situation is far from the truth at present. The decision to verify cannot be taken so lightly, for several reasons. It generally requires the building of a special-purpose model, which has to be complete in an appropriate sense. (There are a few tools which can work directly with UML models, but most work only with one diagram type, typically state diagrams.) Writing properties which should be fulfilled by a design is a specialist job requiring knowledge of an appropriate logic: even for someone with that knowledge, identifying the properties at an appropriate level of granularity can sometimes be even harder than the design problem itself.

One of the issues is that a good design does not only meet the external, customer requirements. It also does so in a clear, understandable way, so that the software will be maintainable in future. The designer’s choice often involves understanding the implications of each decision and then making essentially aesthetic judgements about them. That is, the desirable characteristics are often a combination of technical, formalisable features and features pertaining to how future human maintainers will most easily think. For this reason, it is unlikely that the process of design will ever be fully automated: verification and synthesis will not replace design. Instead tools should support the human designer in making use of formal technology alongside his/her own skills.

Thus the ideal tool support should do two things: first, it should help make verification in the traditional sense, against explicit requirements, available at any stage of design and even, as far as possible, in the presence of incomplete models; second, it should help the designer to explore consequences of design decisions so that choices can be made even when the criteria for the choices have not been formalised.
2 Games in Verification
Our thesis is that mathematical, formal games may be a suitable basis for improving tool support for software design. In order to support this, we need to introduce formal games before explaining the relevance of games to software design. In this section we introduce games by means of a simple example, the bisimulation game as explained by Stirling in [12]. Such a game is used for checking whether two processes are equivalent under the equivalence relation known as bisimulation. Essentially this captures the idea that two processes can each simulate the other, and that during the simulation their states remain equivalent, so that either process can “lead” at any time.

A bisimulation game is defined over two processes E and F and played by players Refuter (abbreviated R) and Verifier (abbreviated V). The aim of player R is to show that E and F are different from each other, while player V wants to prove that E is equivalent to F. At the beginning of the game player R picks one of the two processes and chooses a transition. After that player V has to
respond by choosing a transition with the same label from the other process. This procedure is repeated and each time player R can choose a transition from either process. Furthermore all moves in the game, no matter by which player they were made, can be seen by both players. If one of the players is stuck and cannot choose a suitable transition, the other player wins the game. In the case that the game is infinite, player V wins.

A player can play the game according to a strategy, which is a set of rules. These rules tell the player how to move and may depend on earlier decisions taken in the game. A strategy is called a winning strategy if a player wins every game in which he or she uses it.

Figure 1 shows the classic example of two vending machines. Imagine a machine E which has only one coin slot, and a machine F which has separate slots for tea and coffee. E and F are not equivalent because player R has a winning strategy consisting of the following rules:

1. Pick transition 20p from E.
2. If player V responds with the left transition 20p in F, choose selCoffee in E. Otherwise select transition selTea in E.

If player R follows this strategy, player V gets stuck and thereby player R wins the game. Notice that playing the game yields a counter-example, i.e. a particular sequence of moves from which we can see that E and F are not equivalent, which is an advantage of using formal games for verification tasks.
Fig. 1. Bisimulation game for two vending machines E and F.
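The outcome of the bisimulation game can be computed by the standard greatest-fixed-point refinement. The sketch below (ours) uses plausible encodings of E and F, since the exact state names in Figure 1 are not given in the text; it returns False exactly when Refuter has a winning strategy:

```python
E = {("e0", "20p", "e1"),
     ("e1", "selTea", "e2"), ("e1", "selCoffee", "e3"),
     ("e2", "getTea", "e0"), ("e3", "getCoffee", "e0")}
F = {("f0", "20p", "f1"), ("f0", "20p", "f2"),
     ("f1", "selTea", "f3"), ("f2", "selCoffee", "f4"),
     ("f3", "getTea", "f0"), ("f4", "getCoffee", "f0")}

def bisimilar(t1, t2, s1, s2):
    """Refine the full relation until every surviving pair of states can
    match each other's moves; Verifier wins iff the initial pair survives."""
    def succ(t, q):
        return {(a, q2) for (p, a, q2) in t if p == q}
    st1 = {q for (q, _, _) in t1} | {q for (_, _, q) in t1}
    st2 = {q for (q, _, _) in t2} | {q for (_, _, q) in t2}
    rel = {(p, q) for p in st1 for q in st2}
    changed = True
    while changed:
        changed = False
        for (p, q) in list(rel):
            fwd = all(any(b == a and (p2, q2) in rel for (b, q2) in succ(t2, q))
                      for (a, p2) in succ(t1, p))
            bwd = all(any(a == b and (p2, q2) in rel for (a, p2) in succ(t1, p))
                      for (b, q2) in succ(t2, q))
            if not (fwd and bwd):
                rel.discard((p, q))
                changed = True
    return (s1, s2) in rel

print(bisimilar(E, F, "e0", "f0"))   # False: Refuter's strategy above wins
```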
In this game a winning strategy for Refuter can be viewed as a proof that no bisimulation could exist. Similarly, a winning strategy for player V is a bisimulation relation; that is, it is a proof that the answer to the question “are these processes bisimilar?” is Yes.

Model-checking can also be expressed as a game: this one is somewhat more complicated, and we will not go into details here (the interested reader is referred to [3]). Two differences are particularly worth noting. First, in a model-checking game it is not necessarily the case that the players alternate turns. The
current game position determines who is to move. Depending on the position and the move chosen by a player, the next game position may be one from which the same player is to move again. Second, the winning conditions are more complex than the simple fact that V wins all infinite plays. Instead, there is a predicate on plays which determines which player wins a given play.

In fact, it seems that all verification questions can be expressed as games, although sometimes the game is “trivial” in the sense that a play will always be over after two moves. Tools such as the Edinburgh Concurrency Workbench (http://www.lfcs.ed.ac.uk/cwb/) exploit this game view of verification questions. The user asks the question; the tool calculates a winning strategy for the game; it then offers to take the winning part in a game against the user. The user finds that, no matter which moves s/he chooses, the tool always has an answer: the user can only lose. This seems to be an effective way of getting an intuition about why the answer to the question is as it is.
3 Beyond Verification Games
When we use verification games, we notice a curious fact. A typical scenario is as follows: we have one process, representing a system, and we want to verify that it is correct. We may choose to do this in any one of several ways. We may develop a second process, which is supposed to stand in some formal relation to our system process; perhaps the two are supposed to be bisimilar or perhaps one is supposed to be a refinement of the other according to one of the many different refinement relations used in process algebra. Alternatively, we may choose to develop a logical formula in a temporal logic such as the modal mu calculus. In either case, the verification problem can be expressed as a game. The positions of the game incorporate some information about the “current state” of the model, and also some information about the “current state” of the specification. If the two specifications, by process and by logic, intuitively succeed in expressing the same notion of correctness, they correspond to (essentially) the same game.

Now, games are rather natural: the idea of defining a game between Verifier and Refuter by describing the valid challenges that Refuter may make, how Verifier may respond, and who wins under which circumstances, is quite easy to grasp. It is easier than understanding the semantics of one’s chosen relation between two processes, or understanding the semantics of the modal mu calculus. (Indeed in many institutions, including Edinburgh, this fact is often used to help students grasp the concepts of bisimulation and the semantics of the mu calculus.)

Thus the central idea of this work is to allow the user to define, directly, a verification game. The game should involve the existing design of the system, together with the information that determines whether the design is correct. It should be the case that the player Verifier has a winning strategy for the game if and only if the design is correct. The rules of the game incorporate
the challenging circumstances in which the design must work as challenges by Refuter; correct and incorrect behaviour is captured by the winning conditions.

Once we have decided to let the user define a game, we may observe that the game can in fact be defined incrementally. For example, suppose that the design is complete, but that there is only limited understanding of what it means for the design to be correct. (We assume that, as usual, there is no formal specification. Perhaps it has not yet been understood how the informal specification of overall system requirements should be translated down into precise requirements; or perhaps the informal specification is itself incomplete or incorrect. In mainstream business software development, which is our main focus of concern, both are likely to be the case.) In this case the game as initially defined by the user will incorporate only a small amount of information about what it is for the design to be correct: it will be “too easy” for Verifier to win the game. The user should be able to explore the game and improve the rules to make it a better reflection of the correctness of the design. This might include, for example, changing the moves of the game to permit Refuter to make new challenges, or changing the winning conditions so that plays which would have been won by Verifier are won by Refuter in the new game.

At the same time, it is likely that the design itself is too incomplete to permit full verification. The user should also be able to change the game by adding more information about the design. In order to work more formally with this basic idea, let us begin by defining what is meant by a game in general.
3.1 Game Terminology and Formal Definition
For the purposes of this paper, a game is always played between two players Verifier (abbreviated V) and Refuter (abbreviated R). We refer to players A and B to mean Verifier and Refuter in either order.

Definition 1. A game G is (Pos, I, moves, λ, WR, WV) where:
– Pos is a set of positions. We use u, v, . . . for positions.
– I ⊆ Pos is a set of starting positions: we insist that λ(i) = λ(j) for all i, j ∈ I.
– moves ⊆ Pos × Pos defines which moves are legal. A play is in the obvious way a finite or infinite sequence of positions starting at some p0 ∈ I where pj+1 ∈ moves(pj) for each j. We write pij for pi . . . pj.
– λ : Pos → {Verifier, Refuter} defines who moves from each position.
– WR, WV ⊆ Pos^ω are disjoint sets of infinite plays, and (for technical reasons to do with working with abstractions of games) WA includes every infinite play p such that there exists some i such that for all k > i, λ(pk) = B.

Player A wins a play p if either p = p0n and λ(pn) = B and moves(pn) = ∅ (you win if your opponent can’t go), or else p is infinite and in WA.

Notice that our general definition does not insist that WR ∪ WV = Pos^ω; that is, it is possible for a play to be a draw. The games we consider in this
paper will have no draws, but when we go on to consider abstractions of games (see e.g. [11]) it is necessary to permit them.

Definition 2. A (nondeterministic) strategy S for player A is a partial function from finite plays pu with λ(u) = A to sets of positions (singletons, for deterministic strategies), such that S(pu) ⊆ moves(u) (that is, a strategy may only prescribe legal moves). A play q follows S if whenever p0n is a proper finite prefix of q with λ(pn) = A then pn+1 ∈ S(p0n). Thus an infinite play follows S whenever every finite prefix of it does. It will be convenient to identify a strategy with the set of plays following the strategy and to write p ∈ S for p follows S. S is a complete strategy for player A if whenever p0n ∈ S and λ(pn) = A then S(p0n) ≠ ∅. It is a winning strategy for A if it is complete and every p ∈ S is either finite and extensible or is won by A. It is non-losing if it is complete and no p ∈ S is won by B. It is history-free (or memoryless) if S(pu) = S(qu) for any plays pu and qu with a common last position.

A game is determined if one player has a winning strategy. All the games we need to consider are determined, and this is an assumption of this work.

In this paper we focus on the informal presentation of the idea of using and modifying games for software design. It should be clear, however, that there is scope for defining relationships between games in which the existence of a winning strategy for one game implies the existence of a winning strategy for a related game. Some early work in this direction was reported in [11] in the context of verification games. The study of these relationships between games will be important in the context of tools to support games for software design. A simple example is that increasing the number of options open to Refuter, e.g. adding requirements that a design should satisfy, should make it harder for Verifier to win the game: if Refuter had a winning strategy for the original game, the same strategy will work in the extended game.
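Definition 1 translates almost directly into a data structure. The following sketch is a hypothetical encoding of ours, with the infinite-play winning sets WR and WV left abstract; it also implements the default rule that a player wins when the opponent is stuck:

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Game:
    """(Pos, I, moves, λ): starting positions, legal moves, and whose turn
    it is; positions are arbitrary hashable values."""
    initial: set
    moves: Callable[[object], set]    # u -> set of legal successor positions
    turn: Callable[[object], str]     # λ: u -> "Verifier" or "Refuter"

def play(game, choose, start, max_steps=1000):
    """Run one play; choose(play_so_far, options) bundles both players'
    strategies. Returns the play and the winner of a finite play, or None
    if the play was truncated before either player got stuck."""
    p = [start]
    for _ in range(max_steps):
        options = game.moves(p[-1])
        if not options:               # the player to move is stuck
            return p, "Verifier" if game.turn(p[-1]) == "Refuter" else "Refuter"
        p.append(choose(p, options))
    return p, None
```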
3.2 How to Manage Games and Their Improvement in a Tool
A tool will always have a notion of “current game”. Part of what the tool should do is to allow the user to play the current game. The tool could operate in two modes:

1. Tool as referee. The user chooses moves both for Refuter and for Verifier (indeed, there might be several users playing against one another). The tool’s role is simply to ensure fair play and declare who wins (in the case of a finite play which is won at all).
2. Tool as opponent. The user chooses whether to play Verifier or Refuter and the tool takes the other player’s part. If it is possible for the tool to calculate a winning strategy for the current game, then the tool might play this winning strategy, or use it to suggest better moves to the user, as appropriate. Otherwise, the tool might use random choices and/or heuristics to play as well as possible.
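In referee mode the tool only enforces legality and declares finite wins. A sketch reusing the Game encoding above (the ask callback is our assumption, standing for whatever interface collects the users' moves):

```python
def referee(game, start, ask):
    """Tool as referee: ask(player, play, options) queries whichever user
    controls the player to move; illegal moves are rejected."""
    p = [start]
    while True:
        options = game.moves(p[-1])
        if not options:
            return p, "Verifier" if game.turn(p[-1]) == "Refuter" else "Refuter"
        move = ask(game.turn(p[-1]), list(p), options)
        if move not in options:
            raise ValueError("illegal move")   # fair play is enforced
        p.append(move)
```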
It is not immediately obvious how to incorporate the improvement of games into the tool use. Should improving a game be thought of as part of the game, or as a separate process? It is undoubtedly easier for formal reasoning to regard the improvement of the game as a separate process: then we do not have to worry about what’s true of strange composite games in which the rules are changed part way through a play. For practical purposes though, if the user plays a significant number of moves and then realises that there is a problem with the rules which affects how play will proceed from then on, but not what has happened up to this point, it would be annoying not to be allowed to alter the rules and carry on. A suitable compromise is probably to say that a play in which the user changed the rules cannot be formally won by either player. (The tool might announce “You won, but you cheated so it doesn’t count”.) It is possible to formalise such rule changes as being the actions of a third player, but we have not so far found this very helpful.
4 Example: Incremental Definition of Simple Game
The overall aim of this work is to show how games akin to verification games can be used for software design, and incrementally designed as part of the software design process. Thus we have two aspects to address: the incremental definition of games, and the use of games for software design. In the interests of clarity we devote this section to demonstrating how a simple game, without any special relation to software design, can be defined incrementally. Our example is deliberately very simple.

We will follow a hypothetical user through the design of a vending machine process. At any stage, there will be both a current system design and a current game, which will include the currently permissible challenges and winning conditions. Our “system design” will be simply a labelled transition system. We initialise this to the LTS shown in Figure 2.

A position in the game may take either of two forms:
1. A state (si say) in the system design: Refuter to move. We notate such a position simply si.
2. A state si in the system design, plus a challenge c from Refuter: Verifier to move. We notate such a position (si, c).

Note that it is immediate from the form of the position who is to move: we do not have to specify λ separately. Initially the winning conditions are that Verifier wins all infinite plays; other than this, we have only the default rules, that a player wins if their opponent is supposed to move but cannot legally do so. Thus in order to define the game we need to specify, for each state of the system design, what the legal challenges that Refuter may make are, and for each state and challenge, what the legal responses from Verifier are.

Initially, we decide simply to record the requirements that from the initial state, s0, it is possible to insert 20p and (maybe many events later) receive tea, respectively coffee. We express this as the challenge to “pick a trace” with certain characteristics.
Fig. 2. Initial system design. [The LTS has states s0–s6 with transitions s0 −20p→ s1, s0 −20p→ s2, s1 −selTea→ s3, s2 −selCoffee→ s4, s3 −getTea→ s5, s4 −getCoffee→ s6.]
State   Challenges
s0      pick a trace s0 −20p→ . . . −getTea→ si
        pick a trace s0 −20p→ . . . −getCoffee→ si

Note that such a challenge is easy to express in a table as above, and it would be easy for a tool to provide an interface for defining such challenges; however, expressing the property required to hold at s0 in a temporal logic is already slightly tricky, because of the need to permit actions other than those named in the traces. For example, the existence of the first trace from state s0 is equivalent to state s0 satisfying the mu-calculus formula ⟨20p⟩µX.(⟨getTea⟩T ∨ ⟨−⟩X). After Verifier successfully picks such a trace, we specify that the new game position will be si (i.e. system state si, with Refuter to move). There are (so far) no legal challenges from any system state other than s0, so if Verifier picks such a trace, which will necessarily not end in s0, she will win because it will be Refuter's turn to pick a move, but there will be no legal moves. This is a boring game so far: Verifier easily wins. Suspending disbelief in the inability of the user to understand this, suppose that the user plays the game, taking either Verifier's or Refuter's part, and finally is satisfied of this. However, the user realises that the system should not simply stop after the tea or coffee is collected. S/he decides to capture the liveness of the system by permitting Refuter a new challenge, valid from any system design state, which is simply "pick a transition". At this point, playing the game confirms that Refuter has a winning strategy. So the user refines the system design by merging s5 and s6 with s0. We get the game described by Figure 3 and the table below.
Fig. 3. Revised system design. [As Figure 2, but with s5 and s6 merged into s0: s0 −20p→ s1, s0 −20p→ s2, s1 −selTea→ s3, s2 −selCoffee→ s4, s3 −getTea→ s0, s4 −getCoffee→ s0.]
State   Challenges
s0      pick a trace s0 −20p→ . . . −getTea→ si
        pick a trace s0 −20p→ . . . −getCoffee→ si
si      pick a transition si −→
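To make the "pick a trace" challenges concrete, the following sketch (ours, not the paper's; all identifiers are invented) encodes the revised design of Figure 3 as a transition list and checks whether Verifier can meet a challenge of the form "pick a trace s −a→ . . . −b→ si", i.e. whether after an initial a-transition some state is reachable in which b is enabled:

-- Sketch of the revised vending machine (Fig. 3) and the
-- "pick a trace" challenge; all identifiers are ours.
type State  = String
type Action = String

trans :: [(State, Action, State)]
trans =
  [ ("s0", "20p", "s1"), ("s0", "20p", "s2")
  , ("s1", "selTea", "s3"), ("s2", "selCoffee", "s4")
  , ("s3", "getTea", "s0"), ("s4", "getCoffee", "s0") ]

step :: State -> Action -> [State]
step s a = [ t | (s', a', t) <- trans, s' == s, a' == a ]

succs :: State -> [State]
succs s = [ t | (s', _, t) <- trans, s' == s ]

-- Can Verifier pick a trace s -first-> ... -final-> si ?
-- This mirrors the mu-calculus property noted above.
meets :: State -> Action -> Action -> Bool
meets s first final = any (search []) (step s first)
  where
    search seen t
      | t `elem` seen             = False   -- cycle: no new states
      | not (null (step t final)) = True    -- `final` enabled here
      | otherwise                 = any (search (t : seen)) (succs t)

main :: IO ()
main = mapM_ print
  [ meets "s0" "20p" "getTea"      -- True
  , meets "s0" "20p" "getCoffee" ] -- True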
We could continue in various ways, with the user gradually improving both the system design and the challenges and winning conditions which (in this example) constitute its specification. So far we have only described adding challenges. Changing the winning conditions is the other main way of altering the game. We could imagine, for example, that instead of Verifier winning every infinite play, we might want to say that Verifier wins every infinite play on which the number of 20p actions so far is infinitely often equal to the number of getTea actions plus the number of getCoffee actions, thus capturing the idea that the machine should not systematically swindle the beverage-buyer, or vice versa. (A technical point is that we cannot specify arbitrary winning conditions without losing known determinacy of the game, that is, the guaranteed existence of a winning strategy for one player; however, in practice, any reasonably specifiable winning condition will fall within the class known to lead to determined games.)
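Such a winning condition can be monitored along finite play prefixes: Verifier wins an infinite play iff the running balance computed below returns to zero infinitely often. A small sketch, with our own identifiers:

-- Running balances of a play prefix: #20p - (#getTea + #getCoffee).
-- Verifier wins an infinite play iff this balance is 0 infinitely
-- often along the play; identifiers are ours.
balances :: [String] -> [Int]
balances = scanl bump 0
  where
    bump n "20p"       = n + 1
    bump n "getTea"    = n - 1
    bump n "getCoffee" = n - 1
    bump n _           = n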
5 Games in the Context of Object Orientation/UML
In this section we address the question of how games for software design using UML may differ from standard verification games; we do not consider the incremental definition of such games. In the next section we consider an example that brings together aspects of the incremental definition considered in the previous section with the games for design issues considered here. The chief question is what constitutes the “system” being designed, the part analogous to the LTS in the previous example. The design artifact being produced by the designer is the UML model, so obviously the UML model is part of the system. Does it suffice? In order to explore the dynamic behaviour which is implied by a UML model, we will need to be able to record a “current state” of a prototypical system described by the model, and here we meet interesting issues. The UML model is
most unlikely to define a unique system, complete in all detail. This is of course the usual challenge of doing any formal work with an incomplete specification. In the game framework, a promising approach is to allow one or both players, when faced with an incompleteness in the specification, to resolve the nondeterminacy. Different choices about exactly how this is done will yield different games with different strengths. For example, if a transition in a UML state diagram is guarded by an informal guard which cannot be formally evaluated, perhaps the player choosing the transition may decide to assume that the guard is satisfied. If the game is to be maximally powerful for detecting flaws, we probably wish to make sure that we record enough information to ensure that subsequent choices about whether that guard holds are consistent. For example, we might insist that the player write a formal specification of the guard, and use this in place of the original informal guard to annotate the model for the rest of the play. On the other hand, a weaker treatment in which the rules of the game did not ensure consistency could still be useful for exploring possibilities. There are many such game-design choices to be made. More concretely, any particular game for design with UML will draw information from particular parts of a UML model – maybe a complete model, maybe just the parts of it considered most relevant. If the focus is to be on exploring the consequences of design decisions, it seems likely, though perhaps not essential, that some dynamic diagrams will be involved. The following section gives an example using a simple form of state diagrams, protocol state machines (PSMs) [10], in which transitions are labelled by events, not by actions. (In earlier work we attempted to use more general state machines with both events and actions labelling transitions. However, we found that in normal sequential systems, where events and actions are the receiving and sending (respectively) of normal synchronous messages, the UML semantics has anomalies in the presence of recursion. This is discussed in [13], where we propose as a solution that PSMs without actions be used for providing (loose) specifications of the overall behaviour of classes, whilst method state machines are used to define the behaviour of operations.) Another issue which may arise is the possibility of using parts of the UML for more than simply describing the current system model. For example, it might be convenient to let the user describe certain kinds of challenges as sequence diagrams. We do not consider this further here.
6 Example: Definition of a Game for UML Software Design
We now show how a game for software design with UML could be defined. For this purpose we assume that a class diagram and state diagrams for some of its classes are given. As an example, consider the class diagram shown in Figure 4, with classes Customer, TravelAgent, Hotel and Flight, which models a (very much simplified and not realistic) software system for a travel agency.
Fig. 4. Example class diagram. [Customer (−startDate: Date, −endDate: Date; +bookHoliday(l:String), +changeHotel(s:int), −setFlight(f:Flight), −setHotel(h:Hotel)) is associated via role ta with TravelAgent (+findFlight(d:String, sd:Date, ed:Date): Flight; +findHotel(l:String, sd:Date, ed:Date, s:int): Hotel), and via private roles flight and hotel with Flight (+destination: String) and Hotel (+location: String, +nearestAirport: String).]
The most interesting class is Customer, which has attributes for the holiday start and end dates of a customer. It contains public methods for booking a holiday and changing a hotel booking, and private methods for linking hotel and flight objects. The parameter l to Customer::bookHoliday and TravelAgent::findHotel represents the desired holiday location, and the parameter s to Customer::changeHotel and TravelAgent::findHotel represents the requested hotel quality, given by the number of stars. The remaining parameters d, sd, ed in TravelAgent provide values for the flight destination (an airport name) and the start and end date of the holiday. A protocol state machine for Customer is given in Figure 5. Only the effects of booking a holiday, changing a hotel and setting hotel and flight on the object state are shown in this diagram. For the other classes of the system it is assumed that their objects can only be in a state default, which is not changed by any of the methods, i.e. the corresponding state machines have only one state and a loop transition for each method of the class.

Fig. 5. State machine for Customer. [States no booking, hotel only, flight only and hotel and flight; setHotel(h) leads from no booking to hotel only and from flight only to hotel and flight; setFlight(f) leads from no booking to flight only and from hotel only to hotel and flight; further transitions are labelled bookHoliday(l) and changeHotel(s).]

For simplicity we only use classes and associations from the class diagram and the (finite) set of abstract states specified in the state machines for the definition of a state in the UML system design. As we will see later, this is restrictive with respect to how the game can be incremented. For a given class diagram CD and a set of state machines SM, where S is the set of all states occurring in SM and Sc the set of states in the state machine for class c, a state in the UML system design consists of

– a collaboration given by a set of objects O and a relation Links ⊆ O × R × O respecting the associations in CD, where R is the set of role names in CD, including the empty string ε. A tuple (o, r, p) means that o can access p via rolename r.
– a function state : O → S such that state(o) ∈ Sc for an object o of class c.
– a callstack cs = a1 : . . . : an whose elements ai are of the form o.m(p1, . . . , pn), where o ∈ O is an object of class c and m is a method of c with parameters p1, . . . , pn according to the method definition in CD. We refer to o in this context as the target object of m.

A position in the game consists of a state in the UML system design and, depending on the callstack, it is either

– a state in the UML system design with an empty callstack: Refuter to move, or
– a state in the UML system design with a non-empty callstack: Verifier to move.

As initial state S0 for our example we choose

– O = {c:Customer, t:TravelAgent, h:Hotel, f:Flight} and Links0 = {(c, ta, t), (t, ε, c)}. For simplicity the choice of objects is not changed during the game, i.e. we do not consider creation and deletion of objects.
– state(c) = no booking and state(t) = state(h) = state(f) = default
– cs is the empty stack.

Refuter challenges by invoking a method that has public visibility on any of the currently available objects. The choice of the method and its parameters has to conform to the specification by the class diagram, i.e. the method has to be defined in the class of the target object and the parameters have to be accessible and of suitable type. Furthermore the object has to be in a state where an invocation is possible. A challenge is independent of the current linking of objects, and the states of t, h, f never change, so Table 1 only shows the mapping of state for c.

Table 1. Challenges by Refuter.

State                                          Challenge
any state                                      t.findFlight(d,sd,ed)
any state                                      t.findHotel(l,sd,ed,s)
any state where state(c) = no booking          c.bookHoliday(l)
any state where state(c) = hotel and flight    c.changeHotel(s)

A challenge by Refuter is pushed on the callstack. Since the callstack is then non-empty, Verifier has to make the next move.
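The state space and Table 1 can be transcribed directly. The sketch below is ours (the paper fixes no concrete syntax for this): it fixes the objects of the example, represents a state in the UML system design, and computes whose turn it is together with the legal challenges:

-- Sketch of a state in the UML system design for the travel agency
-- example; all identifiers are ours.
data Obj = C | T | H | F deriving (Eq, Show)   -- the objects c, t, h, f

type Role = String                             -- "" plays the role of epsilon
type Link = (Obj, Role, Obj)

data CState = NoBooking | HotelOnly | FlightOnly | HotelAndFlight
  deriving (Eq, Show)

type Call = String    -- method calls kept abstract, e.g. "c.bookHoliday(l)"

data GState = GState
  { links     :: [Link]
  , stateOfC  :: CState    -- t, h and f never leave their default state
  , callstack :: [Call]    -- head of the list is the top of the stack
  } deriving Show

initState :: GState
initState = GState [(C, "ta", T), (T, "", C)] NoBooking []

-- Empty callstack: Refuter to move; otherwise Verifier.
refuterToMove :: GState -> Bool
refuterToMove = null . callstack

-- Table 1: the challenges legal in a given state.
legalChallenges :: GState -> [Call]
legalChallenges g =
     [ "t.findFlight(d,sd,ed)", "t.findHotel(l,sd,ed,s)" ]
  ++ [ "c.bookHoliday(l)" | stateOfC g == NoBooking ]
  ++ [ "c.changeHotel(s)" | stateOfC g == HotelAndFlight ]

-- A challenge by Refuter is pushed on the callstack.
challenge :: Call -> GState -> GState
challenge m g = g { callstack = m : callstack g }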
Verifier can respond in two different ways:

– pick a transition in the state machine for the class of the target object of the method on top of the callstack. The call on top of the stack is popped and the state of the target object is updated according to the state machine. The response may also have an effect on how the objects are linked with each other.
– call n other (possibly non-public) methods on objects that are reachable from the target object of the method call on top of the stack. The first new method call is pushed on the callstack.

We assume that Verifier's responses are at some point of the first kind, which leads to the call being popped from the callstack. After that, Verifier pushes the next method call specified in his/her response onto the callstack, and again we assume that it is at some point removed from the callstack. This procedure continues until all n new method calls have been, one by one, pushed onto and then later popped from the callstack. Finally, after the response has been completed, the call on top of the stack, which is the one that caused the response, is popped. Notice that in general Verifier and Refuter do not take alternate turns: Verifier responds to the method invocation that is on top of the callstack, which can come either from Refuter or from Verifier. For our example we could have responses as shown in Table 2. Of particular interest are the responses to bookHoliday and changeHotel, which are of the more complicated kind explained above. Notice that by the choice of parameters for the methods some properties are fixed. The last parameter "3" of findHotel in the response to c.bookHoliday, for instance, has the effect that h is always set to a 3-star hotel. Furthermore, the usage of nearestAirport in findFlight within the same response ensures that the flight destination fits well with the chosen hotel.

Table 2. Responses by Verifier.

State                          Top of callstack         Response
any state                      t.findFlight(d,sd,ed)    pick loop transition findFlight in state machine for TravelAgent
any state                      t.findHotel(l,sd,ed,s)   pick transition findHotel in state machine for TravelAgent
any state where                c.bookHoliday(l)         h = c.ta.findHotel(l, c.startDate, c.endDate, 3);
state(c) = no booking                                   f = c.ta.findFlight(h.nearestAirport, c.startDate, c.endDate);
                                                        c.setFlight(f); c.setHotel(h)
any state where                c.changeHotel(s)         c.hotel = c.ta.findHotel(c.hotel.location, c.startDate, c.endDate, s);
state(c) = hotel and flight
any state where                c.setFlight(f)           pick transition setFlight in state machine for Customer whose
state(c) = no booking or                                source is state(c), and add (c, flight, f) and (f, ε, c) to links
state(c) = hotel only
any state where                c.setHotel(h)            pick transition setHotel in state machine for Customer whose
state(c) = no booking or                                source is state(c), and add (c, hotel, h) and (h, ε, c) to links
state(c) = flight only

The specification of responses can be regarded as a strategy for meeting Refuter's challenges. Since the sequence of method calls in the response is pushed on the stack one by one and Verifier has to memorise which ones s/he has already handled, it is a strategy with a history. Tool support for the creation of a game as described here should allow the user to manipulate and refine the strategy for Verifier in a comfortable way. For each system state the tool could display all reachable objects and let the user pick a target object. The user could then proceed by selecting one of the object's methods, which are again displayed by the tool, and stepwise create a valid response.
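The stack discipline of the second kind of response can be summarised in a few lines. In the sketch below (ours), respond is a hypothetical stand-in for Table 2, and only the shape of the callstack is tracked; a real engine would also thread the system state through each step:

type Call  = String
type Stack = [Call]   -- head = top of the stack

-- A response either picks a transition (popping the call) or lists
-- further method calls.
data Response = PickTransition | Calls [Call]

respond :: Call -> Response
respond = undefined   -- hypothetical oracle standing in for Table 2

-- Play out the call on top of the stack until it is popped: each
-- listed call is pushed, played out and popped in turn, and finally
-- the triggering call itself is popped.
playOut :: Stack -> Stack
playOut []           = []
playOut (top : rest) = case respond top of
  PickTransition -> rest
  Calls ms       ->
    let afterCalls = foldl (\st m -> playOut (m : st)) (top : rest) ms
    in drop 1 afterCalls   -- finally, pop the triggering call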
The tool could also contain functionality to record the chosen strategy in a diagram, such as for example a method state machine (see [13]). In order to complete our definition of a software design game we have to declare the winning conditions. We assume that a game is won by one player if the other cannot make a move, and that all infinite plays are won by Verifier. We can now finally play a game, starting with our initial system state S0. An extract of a play is shown in Table 3. The play will be won by Verifier because it is infinite: after the first two challenges Refuter can still continue to make challenges, but Verifier can always respond by simply picking a loop transition. The table does not record the full system state but only the parts that are relevant for this example (callstack, links, state of c) and the moves of the players. The parameters with which a method is called are sometimes left out in the callstack, to save space and because they are specified in the preceding move. Moreover, Refuter is abbreviated by R and Verifier by V.
7 Example: Incrementing a Software Design Game
There are several ways in which the example game from the previous section could be incremented. One way is to permit Refuter to call a public method from additional states; for instance, we could permit a challenge by changeHotel when the object is in state hotel only.
Table 3. Example play (writing L1 = {(c,ta,t), (t,ε,c)}, L2 = L1 ∪ {(c,flight,f), (f,ε,c)} and L3 = L2 ∪ {(c,hotel,h), (h,ε,c)} for the links; the top of the callstack is written first).

Callstack                             Links  Move                                                                     State of c
empty                                 L1     R: c.bookHoliday(l)                                                      no booking
c.bookHoliday(l)                      L1     V: h = c.ta.findHotel(l, c.startDate, c.endDate, 3)                      no booking
t.findHotel(...) : c.bookHoliday(l)   L1     V: pick loop transition findHotel                                        no booking
c.bookHoliday(l)                      L1     V: f = c.ta.findFlight(h.nearestAirport, c.startDate, c.endDate)         no booking
t.findFlight(...) : c.bookHoliday(l)  L1     V: pick loop transition findFlight                                       no booking
c.bookHoliday(l)                      L1     V: c.setFlight(f)                                                        no booking
c.setFlight(f) : c.bookHoliday(l)     L1     V: pick transition setFlight from no booking to flight only              no booking
c.bookHoliday(l)                      L2     V: c.setHotel(h)                                                         flight only
c.setHotel(h) : c.bookHoliday(l)      L2     V: pick transition setHotel from flight only to hotel and flight         flight only
c.bookHoliday(l)                      L3     response to bookHoliday completed                                        hotel and flight
empty                                 L3     R: c.changeHotel(4)                                                      hotel and flight
c.changeHotel(4)                      L3     V: c.hotel = c.ta.findHotel(c.hotel.location, c.startDate, c.endDate, 4) hotel and flight
t.findHotel(...) : c.changeHotel(4)   L3     V: pick loop transition for findHotel                                    hotel and flight
c.changeHotel(4)                      L3     response to changeHotel completed                                        hotel and flight
empty                                 L3     . . .                                                                    hotel and flight
Another possibility is to add a completely new public method to the class diagram, such as for example a method for the cancellation of a holiday booking. Notice that this would be an incrementation of both the system design and its specification at the same time. That is caused by our decision that challenges are method calls and by the requirement that methods be picked in accordance with the class diagram. The game could be further incremented by adding states and transitions to the protocol state machines, which offer Verifier new possibilities of responding. As soon as we want to increment the game in a more sophisticated manner, it becomes clear that working with abstract states is often not enough. For our example we could require that c.hotel.nearestAirport always equals c.flight.destination when c is in state hotel and flight. In order to express this as an additional winning condition, which makes it more difficult for Verifier to win the game, attribute values have to be part of the system state. A more detailed object state also leads to increased expressiveness in state machines, since we could for instance specify guards whose evaluation depends on the current attribute values. However, introducing a concrete object state has the disadvantage that we in general have to handle an infinite state space.
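With concrete attribute values in the state, the proposed winning condition becomes an ordinary predicate. A sketch with hypothetical record fields (the paper's abstract states do not record attribute values):

-- Hypothetical concrete object states; field names are ours.
data Hotel  = Hotel  { location :: String, nearestAirport :: String }
data Flight = Flight { destination :: String }

-- Required whenever c is in state "hotel and flight".
consistentBooking :: Hotel -> Flight -> Bool
consistentBooking h f = nearestAirport h == destination f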
8 Discussion and Future Work
This paper has described very early steps in a programme of work. Much remains to be done. Most obviously, we are developing tools to support the use of these techniques in practice. In an academic environment these tools will necessarily be limited prototypes, but the experience of building them should help in our explorations and dissemination of the ideas. One possible objection to the proposal is that it is not obvious that there is a specification of the system separate from its design. If the specification is incorporated into the game definition, which also incorporates a particular design, does this not lose clarity and prevent a separate evaluation of the specification? We can answer this objection in a variety of ways. One answer would be that given appropriate prototype tool support we could investigate what games people build in practice, and see whether there is in fact a clear distinction between design and specification. For example, the challenges in our examples so far can be seen to be separable and could be expressed as a separate specification. Is this typical, or in real examples would the design and specification be more intertwined? Another answer would be to say that it is not important to have a separate specification at the design level. A specification that is independent of design is a realistic aim at a high level, where the user requirements are being expressed in the user’s vocabulary. Inevitably, though, the verification of a design is done against a more detailed, technical specification that always incorporates many assumptions about the design, even when it is presented as a separate document. In this paper we have considered incrementing a game as a process outside the game framework. For software design games this process corresponds to incremental software development processes used in practice. Alternatively game improvement could itself be regarded as a game. One way of defining such an incrementation game is to couple it closely to verification. If Refuter wins a play of a verification game, this play can be used as a challenge in the incrementation game. Verifier has to respond by improving
the game in a suitable way. In order to prove that her/his modifications are a valid response, Verifier has to play the (incremented) verification game again with the same challenges by Refuter as in the play that induced the move in the incrementation game. A different way of defining Refuter's challenges in an incrementation game is to regard them as independent proposals for system changes which are not related to plays of verification games. In this more general version, incrementation games can for instance be used to explore and verify the evolvability of a system. In this case the incrementation is hypothetical and serves to show that a game can be extended as desired without breaking functionality that was present in the initial design. However, it is not yet clear how incrementation games could be used in practice. Though they might be a suitable way of thinking, it is not obvious if and how playing this kind of game can reasonably be supported by tools.
9 Related Work
There is, of course, a large amount of work on applying verification techniques to UML models. We do not attempt a representative discussion of this crowded field; since our focus here is on highly interactive tool support for design using games as a basis, we will instead discuss the related games-based work. Two-player games of the kind we consider here have a long history in mathematics (see for example [6]) and informatics (see for example [3,12]). In controller synthesis for open systems, two-player games are of interest in which one player (control) has a strategy which forces the system to behave according to a given specification, independently of what the other player (environment) does. Finding a winning strategy (the controller) for player control is known as the control problem. The control problem has been studied for different kinds of open systems such as, for example, discrete systems [9] and systems in reactive environments [7]. The system specification can be given as a linear or branching time temporal logic formula. Results on the complexity of solving the control problem depend on the chosen logic and the kind of system that is investigated. If a controller exists for a given combination of system and specification, it is also of relevance whether the controller is finite and how big it is [9]. Games in this context are often infinite, and there are some classes of specifications which are of particular interest. An example of such a specification or "game objective" is that eventually an element of a given target set of system states has to be reached. Some frequently occurring game objectives, corresponding winning strategies and complexity results are presented in [14]. The relation between games in control and verification is pointed out in [1], where a translation of a game objective from control into a fixpoint formula written in the mu-calculus, as used in verification, is defined. A closely related area is that of system synthesis. Given a specification, the task of constructing a system that satisfies it is known as the synthesis problem. It can be formulated in terms of a game between environment and system, and a
winning strategy for such a game represents the desired system. If such a strategy exists, we say that the given specification is realisable. Like the control problem, system synthesis and realisability have been examined for different kinds of logics and systems, such as systems in reactive environments [7] and distributed systems [8]. Another kind of two-player game, called combinatorial games, is used to represent, analyse and solve interesting problems in areas such as complexity, logic, graph theory and algorithms [2]. The players move alternately and cannot hide information from each other. The game ends with a tie, or with the win of one player and the loss of the other. An example of a combinatorial game with a large literature within the area of artificial intelligence is chess. A different, and more complex, style of game appears in economics. Here, rather than simply winning or losing, a player receives a variable payoff. A game is played between two or more players with possibly conflicting interests, i.e. a move which leads to a better payoff for one player may have a negative effect on another player's payoff. Thus a strategy which optimises one player's payoff can depend on the strategies followed by other players. The issue of payoff optimisation can be considered under different circumstances: the players can have complete, partial or zero knowledge of each other's moves. It is also possible for players to collaborate in order to gain a better payoff for a group of players rather than for the individual player. These kinds of games reflect situations in economics such as, for example, competition between different companies, and were first introduced within this context in [15]. Since then a large amount of further work has been published in this area. It would be interesting to explore the applicability of this kind of game to software design, but for now we prefer the simpler games described here. The work of Harel et al. on "play-in play-out scenarios" [4,5] has a similar flavour to our work, and is motivated by similar concerns about the interactivity of tools to support design. Play-in scenarios allow the capture of requirements in a user-friendly way. The user specifies what behaviour s/he expects of a system by operating the system's graphical user interface (GUI) – or an abstract version thereof – which does not have any behaviour or implementation assigned to it yet. A tool called the play-engine transforms the play-in of the user into live sequence charts (LSCs), which are used as a formal requirements language. The user does not have to prepare or modify the LSCs directly but only interacts with the GUI. LSCs are a powerful extension of message sequence charts (MSCs). In contrast to sequence diagrams – the variant of MSCs which is part of UML, and which is implicitly existential – they can be either existential or universal. A universal LSC defines restrictions that have to hold over all system runs, while an existential LSC represents a sample interaction which has to be realised by at least one system run. Using play-out scenarios we can verify whether a set of LSCs – created by play-in scenarios or in any other way – meets the system requirements. Thereby the user feeds the GUI with external environment actions rather as though s/he
were testing the final system. For each user input the tool computes the response of the system on the basis of the LSCs, in terms of a sequence of events which are carried out. The system response is called a superstep, and it is correct if no universal LSC is violated during its execution. The task of finding the desired superstep can be formulated as a verification problem. In [5] a translation of LSCs into transition systems is given which allows the use of model checking tools for the computation of the supersteps. Similarly, model checking can answer the question whether an existential LSC can be satisfied. This approach differs from ours in that its focus is on capturing and testing the requirements, while we are mainly interested in helping the user to design a system. Thus play-in play-out scenarios do not aim to help in defining intra-object behaviour, as our games do, but remain on the higher level of interaction between objects and environment. Since our work concentrates on UML we use the diagram types provided by it, i.e. UML sequence diagrams instead of the more expressive LSCs.
10 Conclusion
We have suggested the use of games as a driving metaphor for highly interactive design tools to support designers working in UML. We have proposed that the user of such a tool should define incrementally a game which captures not only the evolving state of the system design but also the properties that the design should satisfy; the tool should support the user both in informal explorations of the resulting game at each stage, and in verification, that is, in the finding of a winning strategy. We have given simple examples of how a design tool based on this idea might operate. We hope that eventually this work may contribute to allowing mainstream business software developers to take advantage of verification technology without giving up their existing incremental development practices.
Acknowledgements We are particularly grateful to a reviewer of this paper for useful comments on incrementation games as mentioned in the discussion, and to the British Engineering and Physical Sciences Research Council for funding (GR/N13999/01, GR/A01756/01).
References

1. L. de Alfaro, T. A. Henzinger, and R. Majumdar. From verification to control: Dynamic programs for omega-regular objectives. In Proceedings of the 16th Annual Symposium on Logic in Computer Science (LICS), pages 279–290. IEEE Computer Society Press, 2001.
2. A. S. Fraenkel. Selected bibliography on combinatorial games and some related material. The Electronic Journal of Combinatorics, (DS2), 2002. Available from http://www.combinatorics.org/Surveys/ds2.ps.
3. E. Grädel. Model checking games. In Proceedings of WOLLIC 02, volume 67 of Electronic Notes in Theoretical Computer Science. Elsevier, 2002.
4. D. Harel. From play-in scenarios to code: An achievable dream. IEEE Computer, 34(1):53–60, January 2001.
5. D. Harel, H. Kugler, R. Marelly, and A. Pnueli. Smart play-out of behavioral requirements. In Proceedings of the 4th International Conference on Formal Methods in Computer-Aided Design (FMCAD 2002), pages 378–398, November 2002.
6. W. Hodges. Model Theory, volume 42 of Encyclopedia of Mathematics. Cambridge University Press, Cambridge, 1993.
7. O. Kupferman, P. Madhusudan, P. S. Thiagarajan, and M. Y. Vardi. Open systems in reactive environments: control and synthesis. In C. Palamidessi, editor, Proceedings of the 11th International Conference on Concurrency Theory (CONCUR 2000), volume 1877 of Lecture Notes in Computer Science, pages 92–107. Springer, August 2000.
8. O. Kupferman and M. Y. Vardi. Synthesising distributed systems. In Proceedings of the 16th Annual IEEE Symposium on Logic in Computer Science (LICS 2001). IEEE Computer Society, June 2001.
9. P. Madhusudan and P. S. Thiagarajan. Branching time controllers for discrete event systems. Theoretical Computer Science, 274(1–2):117–149, 2002.
10. OMG. Unified Modeling Language Specification version 1.4, September 2001. OMG document formal/01-09-67, available from http://www.omg.org/technology/documents/formal/uml.htm.
11. P. Stevens. Abstract interpretations of games. In Proc. 2nd International Workshop on Verification, Model Checking and Abstract Interpretation, VMCAI'98, number CS98-12 in Venezia TR, 1998.
12. C. Stirling. Model checking and other games. Notes for Mathfit Workshop on Finite Model Theory, University of Wales, Swansea, July 1996.
13. J. Tenzer and P. Stevens. Modelling recursive calls with UML state diagrams. In Proc. Fundamental Approaches to Software Engineering, number 2621 in LNCS, pages 135–149. Springer-Verlag, April 2003.
14. W. Thomas. On the synthesis of strategies in infinite games. In E. W. Mayr and C. Puech, editors, Proceedings of the 12th Annual Symposium on Theoretical Aspects of Computer Science, STACS '95, volume 900 of Lecture Notes in Computer Science, pages 1–13, Berlin, 1995. Springer.
15. J. von Neumann and O. Morgenstern. Theory of Games and Economic Behavior. Princeton University Press, Princeton, third edition, 1953.
Making Components Move: A Separation of Concerns Approach

Dirk Pattinson and Martin Wirsing

Institut für Informatik, LMU München
Abstract. We present a new calculus for mobile systems, the main feature of which is the separation between dynamic and topological aspects of distributed computations. Our calculus realises the following basic assumptions: (1) every computation executes in a uniquely determined location; (2) processes modify the distributed structure by means of predefined operations; and (3) the underlying programming language can be changed easily. This paper introduces our calculus and shows that this separation of concerns leads to a perfect match between the logical, syntactical and algebraic theory. On the methodological side, we demonstrate by means of two examples that the strict distinction between topological and computational aspects allows for an easy integration of features which are missing in other calculi.
1 Introduction

With the success of the Internet, mobile systems have been promoted as a new computational paradigm, in which computation can be distributed over the net and is highly dynamic, with the network itself changing continuously. In practice, however, such systems are not well accepted, since users fear security problems or, more generally, problems with controlling the behaviour of mobile systems. As a remedy, process calculi, modal logics and other formal techniques have been proposed and studied which provide theoretical foundations for mobile systems and allow one to analyse and verify properties of such systems. The most well-known example is the π-calculus [8] of Milner, which provides an abstract basis for mobility where communicating systems can dynamically change the topology of the channels. The Ambient calculus [5] of Cardelli and Gordon focuses on the handling of administrative domains, where mobile processes may enter a domain or exit from a domain and in this way may change the topology of the network. Similarly, the Seal calculus [17] of Vitek and Castagna aims at describing secure mobile computations in a network that is hierarchically partitioned by localities. In this paper we continue this line of research by proposing a new basic calculus for mobile processes, called BasicSail, with a focus on explicit localities and dynamic reconfiguration of networks. A configuration is a hierarchy of administrative domains, each of which is controlled by a process and may contain other subconfigurations. Configurations may be dynamically reconfigured by entering another configuration or by exiting from a configuration. This is similar to the Ambient calculus; in contrast
This work has been partially sponsored by the project AGILE, IST-2001-39029.
to other approaches, we aim at a clear separation between processes and configurations: processes show behaviour, whereas configurations provide the topological structure. BasicSail abstracts from a concrete process calculus: we aim at studying the dynamic reconfiguration of configurations independently of the underlying notion of process. Our approach is centred around three assumptions, which we now briefly discuss:

Assumption 1. Every computation takes place in a uniquely determined location.

This assumption in particular forces a two-sorted approach: we need to distinguish between elements which relate to the spatial structure and those which drive the computation process. Since our primary interest is the study of mobile computation, we would like to be as independent as possible from the concrete realisation of processes, and therefore make

Assumption 2. The distributed part of the calculus is independent of the underlying programming language or process calculus.

However, a computation needs some means to change the distributed and spatial structure (otherwise our study would end here). That is, we need a clean mechanism through which the distributed structure can be modified:

Assumption 3. Processes modify the distributed structure of the computation through interfaces only.

Our calculus is modelled after these assumptions. Regarding independence of the underlying programming language, we assume that the processes which control the computations already come with a (fixed) operational semantics, in terms of a labelled transition system; this allows us to realise interfaces as a particular set of distinguished labels. As already mentioned, the separation between processes and locations is taken care of by using a two-sorted approach. The main technical contribution of the paper is the study of the algebraic and logical properties of the basic calculus, and of its extension with local names. We introduce the notion of spatial bisimulation and give an algebraic and a logical characterisation of the induced congruence. Our main result here is that, if one abstracts from the concrete realisation of the computations, we obtain a perfect match between structural congruence, logical equivalence and spatial congruence. Methodologically, we want to advocate the separation between the concepts "mobility" and "computation" on a foundational basis; we try to make this point by giving two extensions of the calculus, which are missing in other calculi and can be smoothly integrated into BasicSail, thanks to the separation between spatial structure and computation. We introduce the basic calculus, that is, the calculus without local names, in Section 2. The algebraic theory of the calculus is investigated in Section 3, and Section 4 transfers these results to a logical setting. We then extend the calculus with local names (Section 5). Further extensions, which demonstrate the versatility of our approach, are discussed in Section 6. Finally, Section 7 compares our approach to other calculi found in the literature.
2 BasicSail: The Basic Calculus

This section introduces BasicSail, our testbed for studying mobile components. In order to ensure independence from the underlying programming language (cf. Assumption 1), BasicSail consists of two layers. The lower layer (which we assume as given) represents the programming language which is used on the component level. The upper level represents the distributed structure, which is manipulated through programs (residing on the lower level) by means of pre-defined interfaces. Technically, we assume that the underlying programming language comes with a labelled transition system semantics, which manipulates the distributed structure (on the upper level) by means of a set of distinguished labels. The distinction between processes (or programs) and the locations in which they execute (and the structure of which they modify) forces us to work in a two-sorted environment, where we assume the programs (and their operational semantics) as given, and concentrate on the distributed structure. Our basic setup is as follows:

Notation 1. Throughout the paper, we fix a set N of names, the set L = {in, out, open} × N of labels, and a transition system (P, −→), where P is a set (of processes) and −→ ⊆ P × L × P. We assume that (P, −→) is image finite, that is, for every (P, l) ∈ P × L, the set {P′ | P −l→ P′} is finite. We write in n for the pair (in, n) ∈ L, and similarly for out and open, and call the elements of L basic labels. The set P is the set of basic processes.

The prototypical examples of transition systems which can be used to instantiate our framework are of course process calculi. We present one such calculus, which will also be used in later examples, next.

Example 1. Take P to be given as the least set according to the following grammar:

P ∋ P, Q ::= 0 | P | Q | α.P | !P

where α ∈ L ranges over the basic labels. The transition relation −→ is generated by the rules

α.P −α→ P        P −α→ P′ implies P | Q −α→ P′ | Q

modulo structural congruence ≡, given by the axioms P | Q ≡ Q | P, P | 0 ≡ P, P | (Q | R) ≡ (P | Q) | R and !P ≡ P | !P. For convenience, we often omit the trailing inert process and write α for α.0. Intuitively, α.P is a process which can perform an α action and continue as P; the term P | Q represents the processes P and Q running concurrently, and !P represents a countable number of copies of P. Note that we use this concrete syntax for processes just in order to illustrate our approach; the general theory is independent of the syntactical presentation and just assumes that processes form a set and come with a transition system over the set L of labels.
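Example 1's transition system can be transcribed directly; the following sketch (ours, not the paper's) computes one-step labelled transitions, building the structural-congruence rules in by also letting the right component of a parallel composition move and by unfolding !P one copy at a time:

-- Sketch of the basic processes of Example 1; identifiers are ours.
data Cap  = In String | Out String | Open String deriving (Eq, Show)
data Proc = Nil | Act Cap Proc | Par Proc Proc | Repl Proc
  deriving (Eq, Show)

-- All transitions P -a-> P', modulo structural congruence.
steps :: Proc -> [(Cap, Proc)]
steps Nil       = []
steps (Act a p) = [(a, p)]
steps (Par p q) = [ (a, Par p' q) | (a, p') <- steps p ]
               ++ [ (a, Par p q') | (a, q') <- steps q ]
steps (Repl p)  = [ (a, Par p' (Repl p)) | (a, p') <- steps p ]  -- !P == P | !P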
Given such a transition system (P, −→), the distributed structure (which is our primary interest) is built on top of (P, −→) as follows:

Definition 1. The set C of basic configurations is the least set according to the grammar

C ∋ A, B ::= 0 | nP[A] | A, B

where P ∈ P is a process and n ∈ N is a name, modulo structural congruence ≡, given by the equations

A, B ≡ B, A        A, 0 ≡ A        A, (B, C) ≡ (A, B), C

We call the configuration building operator "," spatial composition. Here, 0 is the empty configuration, nP[A] is a configuration with name n which is controlled by the process P and has the subconfiguration A, and A, B are two configurations which execute concurrently. The next definition lays down the formal semantics of our calculus, which is given in terms of the reduction semantics −→ of the underlying process calculus:

Definition 2. The operational semantics of BasicSail is the relation given by the following rules

P −in n→ P′   implies   mP[A], nQ[B] =⇒ nQ[mP′[A], B]
P −out n→ P′  implies   nQ[mP[A], B] =⇒ mP′[A], nQ[B]
P −open n→ P′ implies   mP[A], nQ[B] =⇒ mP′[A], B

together with the congruence rules

A =⇒ A′  implies  A, B =⇒ A′, B
A =⇒ A′  implies  nP[A] =⇒ nP[A′]

where we do not distinguish between structurally congruent configurations. The relation =⇒ is called spatial reduction. In the examples, we often omit the empty configuration and write nP[] instead of nP[0].

Using the above definition, we can study phenomena which arise in a distributed setting without making a commitment to any kind of underlying language. In particular, we do not have to take internal actions of processes into account; these are assumed to be incorporated into the reduction relation −→ on the level of processes. We cannot expect to be able to embed the full ambient calculus [5] into our setting, due to the fact that in the (original) ambient calculus there are no sorts available. However, we can nevertheless treat many examples:
Example 2. We use the set of basic processes from Example 1.

1. An agent which has the capability to enter and exit its home location to transport clients inside can be modelled as follows. Put

agent = aP[]        client = cQ[]        home = h0[agent]

where P = !(out h.in h.0) and Q = in a.out a.0. In the configuration home, client we have the following chain of reductions (where P′ = in h.0 | P and Q′ = out a.0):

home, client =⇒ h0[], aP′[], cQ[]
             =⇒ h0[], aP′[cQ′[]]
             =⇒ h0[aP[cQ′[]]]
             =⇒ h0[aP[], c0[]].

This sequence of reductions shows a guarded form of entry into h: the client has to enter the mediating agent a, which then transports it into h, where the client then exits. Note that in the basic calculus, c could enter h directly if c's controlling process were different. This can be made impossible if one adds local names, as we shall do later.

2. We model an agent which repeatedly visits two network nodes as follows:

agent ≡ aP[]  with  P = !(in n1.out n1.0) | !(in n2.out n2.0).

The activity of a once it is at either n1 or n2 is not modelled (but imagine a checks whether a node has been corrupted or is otherwise non-functional). In the presence of two nodes n1 and n2, we have the (spatial) reductions, where we write N1 and N2 for the controlling processes of n1 and n2:

n1N1[], n2N2[], aP[] =⇒ n1N1[aP1[]], n2N2[]
                     =⇒ n1N1[], n2N2[], aP[]
                     =⇒ n1N1[], n2N2[aP2[]]
                     =⇒ . . .

In the above, we have abbreviated P1 = out n1.0 | P and P2 = out n2.0 | P. Here, the program P controlling a does not force a to visit n1 and n2 in any particular order, and a could for example choose to enter and leave n1 continuously, without ever setting foot into n2.
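The reductions of Example 2 can be replayed mechanically. The sketch below (ours) represents a configuration as a list of named locations, lists standing in for spatial composition modulo structural congruence, and enumerates the one-step spatial reductions among the top-level locations; the congruence rule descending into sublocations (needed, e.g., for the last step of Example 2.1) is elided for brevity:

-- Sketch of BasicSail configurations and one-step spatial reduction;
-- identifiers are ours, processes as in the previous sketch.
data Cap  = In String | Out String | Open String deriving (Eq, Show)
data Proc = Nil | Act Cap Proc | Par Proc Proc | Repl Proc
  deriving (Eq, Show)

steps :: Proc -> [(Cap, Proc)]
steps Nil       = []
steps (Act a p) = [(a, p)]
steps (Par p q) = [ (a, Par p' q) | (a, p') <- steps p ]
               ++ [ (a, Par p q') | (a, q') <- steps q ]
steps (Repl p)  = [ (a, Par p' (Repl p)) | (a, p') <- steps p ]

-- A location n<P>[A]; a configuration is a forest of locations.
data Loc = Loc String Proc [Loc] deriving (Eq, Show)
type Config = [Loc]

-- All ways of picking one element out of a list.
picks :: [a] -> [(a, [a])]
picks []       = []
picks (x : xs) = (x, xs) : [ (y, x : ys) | (y, ys) <- picks xs ]

-- One-step spatial reduction among the top-level locations
-- (the congruence rule descending into sublocations is elided).
reduce :: Config -> [Config]
reduce ls = inMoves ++ outMoves ++ openMoves
  where
    inMoves =                                  -- m enters a sibling n
      [ rest' ++ [Loc n q (Loc m p' a : b)]
      | (Loc m p a, rest)  <- picks ls
      , (In n0, p')        <- steps p
      , (Loc n q b, rest') <- picks rest, n == n0 ]
    outMoves =                                 -- m exits its parent n
      [ rest ++ [Loc m p' a, Loc n q b']
      | (Loc n q b, rest)  <- picks ls
      , (Loc m p a, b')    <- picks b
      , (Out n0, p')       <- steps p, n0 == n ]
    openMoves =                                -- m opens a sibling n
      [ rest' ++ [Loc m p' a] ++ b
      | (Loc m p a, rest)  <- picks ls
      , (Open n0, p')      <- steps p
      , (Loc n _ b, rest') <- picks rest, n == n0 ]

Applied to the configuration home, client of Example 2.1, reduce returns exactly one successor, corresponding to the first reduction step shown above (a exiting h).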
3 Algebraic Theory of the Basic Calculus

This section is devoted to the algebraic theory of the basic calculus; extensions of the calculus, in particular with local names, are deferred until Section 5. In this section, we
show that the algebraic and the logical theory of the basic calculus fit together seamlessly. In more detail, we discuss the relationship between three relations on processes: spatial bisimulation (which we introduce shortly), the induced spatial congruence, and structural congruence.

3.1 Basic Definitions and Examples

Spatial bisimulation will be defined as a binary relation on configurations, subject to some closure properties; the precise meaning of these is given as follows:

Terminology 2. Suppose R ⊆ A × A is a binary relation on a set A and S ⊆ A × · · · × A is (n + 1)-ary. We say that R is closed under S if, whenever (a, b) ∈ R and (a, a1, . . . , an) ∈ S, there are b1, . . . , bn ∈ A with (b, b1, . . . , bn) ∈ S and (ai, bi) ∈ R for i = 1, . . . , n.

If R is closed under S, it is often helpful to think of R as an equivalence on processes and of S as a reduction relation. In this setting, R is closed under S if, whenever a and b are equivalent (i.e. (a, b) ∈ R) and a reduces to a′ (i.e. (a, a′) ∈ S), there is some b′ such that a′ and b′ are again equivalent (i.e. (a′, b′) ∈ R) and b reduces to b′ (that is, (b, b′) ∈ S). So if R is closed under S, we think of R as being some bisimulation relation and S as the corresponding notion of reduction.

Definition 3 (Spatial Bisimulation). Consider the following relations on C:

1. Subtree reduction ↓ ⊆ C × C, where C ↓ D iff C ≡ nP[D] for some n ∈ N and P ∈ P.
2. Forest reduction ▷ ⊆ C × C × C, where C ▷ (A, B) iff C ≡ A, B and A is of the form A ≡ nP[D] for some n ∈ N, P ∈ P and D ∈ C.
3. Top-level names @n ⊆ C, where n ∈ N and C ∈ @n iff C ≡ nP[A] for some P ∈ P and A ∈ C.

The largest relation ≈ ⊆ C × C which is closed under spatial reduction =⇒, subtree reduction ↓, forest reduction ▷ and top-level names @n, for all n ∈ N, is called spatial bisimulation. Furthermore, spatial congruence ≅ is the largest spatial bisimulation which is a congruence with respect to the construction of configurations.

Note that in the previous definition we just require the congruence property wrt. the construction of configurations, that is, we require

1. A0 ≅ B0 and A1 ≅ B1 implies A0, A1 ≅ B0, B1, and
2. A ≅ B, n ∈ N, P ∈ P implies nP[A] ≅ nP[B].

This not only justifies the name spatial congruence – it furthermore allows us to study the evolution of the tree structure of (a set of) mobile processes without reference to the underlying process calculus. Note that the spatial congruence is not the largest congruence contained in the spatial bisimulation (corresponding to closure under contexts). Our notion of spatial congruence follows the approach of dynamic bisimulation [9].
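For finite relations, the closure condition of Terminology 2 can be checked directly; a sketch for the binary case (n = 1), with invented identifiers:

-- Is R closed under S (Terminology 2, binary case)? For every
-- (a, b) in R and every S-move a -> a', some S-move b -> b' must
-- exist with (a', b') in R.
closedUnder :: Eq a => [(a, a)] -> [(a, a)] -> Bool
closedUnder r s = and
  [ or [ (a', b') `elem` r | (b0, b') <- s, b0 == b ]
  | (a, b) <- r, (a0, a') <- s, a0 == a ]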
In a nutshell, two configurations are spatially bisimilar if they have bisimilar reducts, bisimilar subtrees, and the same top-level names. If two configurations are spatially congruent, one can furthermore substitute them for one another, obtaining spatially congruent processes. Although spatial bisimulation is a very strong notion of bisimilarity, it is not a congruence:

Example 3. Take n, m ∈ N with n ≠ m and let A ≡ n(in m.0)[] and B ≡ n0[]. Then A ≈ B (since neither A nor B can perform a spatial reduction), but A ≇ B, since A, m0[] does reduce, whereas B, m0[] does not.

Since we clearly want equivalent configurations to be substitutable for one another (which allows us to build large systems in a compositional way), spatial congruence is the notion of equivalence we are interested in. By definition, spatial congruence involves closure under all configuration constructing operators, and is therefore not easy to verify. Our first goal is therefore an alternative characterisation of spatial congruence. As it turns out, we only need to add one closure property to the definition of spatial bisimulation in order to obtain spatial congruence.

3.2 Spatial Congruence and Spatial Bisimulation

We start on our first characterisation of spatial congruence. The approach is as follows: we consider labelled reduction, introduced in the next definition, and show (i) that spatial congruence is closed under labelled reduction, and (ii) that spatial bisimulation plus labelled reduction is a congruence. This immediately entails that spatial congruence is spatial bisimulation plus closure under labelled reductions. We begin with the definition of labelled reduction:
Definition 4. Let l ∈ L. Define the relation =⇒l ⊆ C × C by the rules

P −l→ P′  implies  nP[A] =⇒l nP′[A]
C =⇒l C′  implies  C, D =⇒l C′, D

and call a relation B ⊆ C × C closed under labelled reduction if B is closed under =⇒l for all l ∈ L.

We use the name "labelled bisimulation" for the closure of spatial bisimulation under labelled reductions.

Definition 5. We take labelled bisimulation to be the largest symmetric relation ≃ ⊆ C × C which is closed under forest reduction, spatial reduction, subtree reduction, labelled reduction and top-level names.
In order to be able to compare spatial congruence and labelled bisimulation, we need a proof principle which allows us to reason about labelled bisimulation using induction on reductions. This principle works for finitely branching systems only, and is the content of the following two lemmas:

Lemma 1. Suppose (P, −→) is finitely branching. Then the relations =⇒, ▷, ↓ and =⇒l (for all l ∈ L) are image finite.

Proof. By structural induction using the respective definitions.

Proposition 2. Assume that (P, −→) is image finite and define a sequence of relations ∼i ⊆ C × C inductively as follows:

1. ∼0 = C × C
2. ∼i+1 is the largest symmetric relation such that C ∼i+1 D implies
   – C ∈ @n implies D ∈ @n
   – (C, C′) ∈ R implies ∃D′. (D, D′) ∈ R and C′ ∼i D′, where R is one of =⇒ or ↓
   – C ▷ (C1, C2) implies ∃D1, D2. D ▷ (D1, D2) and C1 ∼i D1, C2 ∼i D2
   – C =⇒l C′ implies ∃D′. D =⇒l D′ and C′ ∼i+1 D′, for l ∈ L

Then, for C and D ∈ C, we have C ≃ D iff C ∼i D for all i ∈ N.

Proof. We abbreviate ∼ = ⋂i∈N ∼i. In order to see that C ≃ D whenever C ∼ D, one shows that ∼ is a spatial bisimulation which is closed under labelled reduction. The converse follows from the fact that all relations used in the definition of ∼i are image finite (Lemma 1).

We note two easy consequences of the above characterisation: in particular, controlling processes which are bisimilar (in the ordinary sense) do not destroy the relations ∼i and therefore preserve labelled bisimulation. That is, if we call the largest symmetric relation B ⊆ P × P which is a (strong) labelled bisimulation in the ordinary sense a process bisimulation, we have the following:

Lemma 3.
1. ∼i+1 ⊆ ∼i for all i ∈ N.
2. Let n ∈ N, A, B ∈ C and P, Q ∈ P. Then for all i ∈ N, nP[A] ∼i+1 nQ[B] iff P, Q are process-bisimilar and A ∼i B.

The relationship between labelled bisimulation and process bisimulation can be formalised as follows:

Corollary 4. Let n ∈ N, A, B ∈ C and P, Q ∈ P. Then nP[A] and nQ[B] are labelled bisimilar iff P, Q are process-bisimilar and A and B are labelled bisimilar.
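For a finite transition system, such stratified relations can be computed by exactly this kind of iteration. The sketch below (ours) specialises to labelled moves only, starting from the full relation and refining until a fixpoint, i.e. the greatest bisimulation, is reached:

-- Greatest strong bisimulation on a finite LTS, computed by
-- iterated refinement in the spirit of Proposition 2 (labelled
-- moves only); identifiers are ours.
type Rel a = [(a, a)]

bisim :: (Eq a, Eq l) => [a] -> [(a, l, a)] -> Rel a
bisim states trans = go [ (x, y) | x <- states, y <- states ]
  where
    post x = [ (l, x') | (x0, l, x') <- trans, x0 == x ]
    ok rel (x, y) =
         all (\(l, x') -> any (\(l', y') -> l' == l && (x', y') `elem` rel)
                              (post y)) (post x)
      && all (\(l, y') -> any (\(l', x') -> l' == l && (x', y') `elem` rel)
                              (post x)) (post y)
    go rel = let rel' = filter (ok rel) rel
             in if rel' == rel then rel else go rel'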
We are now ready to tackle the first step of our comparison between labelled bisimulation and spatial congruence.

Lemma 5. Spatial congruence is closed under labelled reduction.

Proof. Suppose n ∈ N, C, D ∈ C are spatially congruent and C =⇒l C′. Then C is of the form C ≡ C0, C1 with C0 ≡ mP[E] and P −l→ P′ for some P′ ∈ P and E ∈ C. We proceed by case distinction on l ∈ L, where we use a fresh name k ∈ N, i.e. k does not occur as the name of a location either in C or in D, and some arbitrary R ∈ P.

Case l = in n: Consider the context K[·] = nR[kR[]], ·. Then K[C] =⇒ C″ with C″ ≡ C1, nR[mP′[E], kR[]]. Since C ≅ D, we have K[D] =⇒ D″ with C″ ≅ D″. Since spatial congruence is closed under forest reduction and top-level names, we can split D″ ≡ D1, nR′[F] for some R′ ∈ P and F ∈ C, where D1 ≅ C1 and nR′[F] ≅ nR[mP′[E], kR[]]. Using closure under subtree reduction, we obtain F ≅ mQ′[E′], kR[] (since k is fresh) with mQ′[E′] ≅ mP′[E]. Again using that k is fresh, we have D ≡ D1, mQ[E′] for some Q ∈ P with Q −in n→ Q′, where D1 ≅ C1 and mP′[E] ≅ mQ′[E′]; since spatial congruence is a congruence, we finally obtain D =⇒in n D1, mQ′[E′] ≅ C1, mP′[E].

Case l = out n: Similar, using the context nR[·, kR[]].
Case l = open n: Similar, using the context nR[kR[]], ·.

The converse of Lemma 5 needs the proof principle of Proposition 2.

Lemma 6. Labelled bisimulation is a congruence.

Proof. We have to show that labelled bisimulation is a congruence wrt. the construction of configurations, that is, wrt. "putting in a box" and spatial composition.

Congruence wrt. spatial composition: We show that the relation Ri = {((C, E), (D, E)) | C, D, E ∈ C and C ∼i D} is a subset of ∼i for all i ∈ N. The case i = 0 is trivial; for the inductive step we show that any pair ((C, E), (D, E)) ∈ Ri+1 satisfies the properties defined in Proposition 2, with ∼i+1 replaced by Ri. The cases of top-level names, forest reductions and labelled reductions follow directly from the definitions of Ri+1 and the fact that ∼i+1 ⊆ ∼i. For spatial reduction, suppose C, E =⇒ C′. If either C =⇒ C0 and C′ ≡ C0, E, or E =⇒ E0 and C′ ≡ C, E0, the result follows easily from the induction hypothesis. For all other cases we have to show that C, E and D, E have the same spatial reductions, resulting in configurations which are ∼i-equivalent. We only consider the in-rule; the other cases are similar. If C, E =⇒ C′ by virtue of the in-rule, either a component of C enters into a component of E, or vice versa. That is, we have one of the following two cases:
1. C ≡ C0, C1 with C0 ≡ mP[F], P −in n→ P′, and E ≡ E0, E1 with E0 ≡ nQ[G], or
2. E ≡ E0, E1 with E0 ≡ mP[F], P −in n→ P′, and C ≡ C0, C1 with C0 ≡ nQ[G].

We only treat the first case; the second can be treated along similar lines (using Lemma 3). From the assumption C ∼i+1 D we obtain (using forest reduction and preservation of top-level names) that we can split D ≡ D0, D1 with
D0 ≡ mR[H] and Cj ∼i Dj for j = 0, 1. Using closure under labelled reduction, we have R −in n→ R′ with mP′[F] ∼i mR′[H]. Since C, E =⇒ C′ we obtain C′ ≡ nQ[mP′[F], G], C1, E1 and D, E =⇒ D′ with D′ ≡ nQ[mR′[H], G], D1, E1, from which we obtain C′ ∼i D′ using that ∼i is a congruence.

Congruence wrt. putting in a box: Suppose C, D ∈ C with C ∼i+1 D and n ∈ N, P ∈ P. We have to show that nP[C] ∼i+1 nP[D]. As before, the only interesting cases arise through spatial reductions. So suppose nP[C] =⇒ C′. If this is because C =⇒ C″ and C′ ≡ nP[C″], we find D″ ∼i C″ with D =⇒ D″, since C ∼i+1 D. In this case nP[D] =⇒ D′ with D′ ≡ nP[D″], and by induction hypothesis C′ ∼i D′. Now assume nP[C] =⇒ C′ using the out-rule. That is, C ≡ C0, C1 with C0 of the form C0 ≡ mQ[E] and Q −out n→ Q′. With C0′ ≡ mQ′[E] we thus have C0 =⇒out n C0′. Using forest reduction, we can split D ≡ D0, D1 with Dj ∼i Cj for j = 0, 1. In particular, D0 =⇒out n D0′ and D0′ ∼i C0′. By assumption, we have C′ ≡ C0′, nP[C1]. Putting D′ ≡ D0′, nP[D1], we obtain nP[D] =⇒ D′ and D′ ∼i C′.

From the previous lemma, we obtain the desired characterisation of spatial congruence:

Corollary 7. Spatial congruence and labelled bisimulation coincide.

Proof. By Lemma 5, spatial congruence is contained in labelled bisimulation. Lemma 6 proves the other inclusion.

This result is our first characterisation of spatial congruence in the basic calculus. Spatial congruence allows us to observe the dynamic behaviour of controlling processes plus the tree structure of configurations. One therefore suspects that spatial congruence is a very intensional notion of equivalence. In the following, we show that spatial congruence is very intensional indeed, by comparing it to the relation of structural congruence on configurations.

3.3 Spatial Congruence vs Structural Congruence

Depending on the underlying labelled transition system (P, −→) which controls the behaviour of processes (which in turn control the evolution of configurations), it is clear that structural congruence is strictly contained in spatial congruence: if P, Q ∈ P are bisimilar but not identical, we have that nP[] and nQ[] are not structurally congruent, but spatially congruent. This example relies on the existence of equivalent but non-identical processes in P. In this section, we show that this is indeed the only possible way in which we can have configurations which are spatially congruent but not structurally congruent. We now proceed to show that spatial congruence coincides with structural congruence modulo process bisimilarity. We start with the following:

Definition 6. Weak structural congruence is the least relation generated by the rules of Definition 1, plus the rule

A ≡ B and P, Q process bisimilar  implies  nP[A] ≡ nQ[B]

where n ∈ N, A, B ∈ C and P, Q ∈ P.
Thus weak structural congruence not only identifies structurally congruent configurations, but also configurations with bisimilar controlling processes. We think of weak structural congruence as structural congruence up to process bisimilarity. Note that, coming back to the example at the beginning of the section, nP[A] and nQ[A] are weakly congruent for P, Q process bisimilar. We have argued that this is an example of a pair of configurations which are spatially congruent, but not structurally congruent. Extending structural congruence to include those configurations which only differ in the controlling process, structural and spatial congruence can be shown to coincide:

Proposition 8. Weak structural congruence and spatial congruence coincide.

Proof. It follows directly from the definitions that weak structural congruence (which we denote by ≡ for the purpose of this proof) is contained in spatial congruence. We prove the converse inclusion by contradiction: assume that the set F = {(C, D) ∈ C × C | C ≅ D, C ≢ D} of felons is non-empty. For C ∈ C, we define the height of C, ht(C), by induction as follows: ht(0) = 0, ht(C, D) = ht(C) + ht(D), ht(nP[C′]) = 1 + ht(C′). Since the standard ordering on natural numbers is a well-ordering, there is a pair (C, D) of felons such that ht(C) is minimal, that is, for all (C′, D′) ∈ F we have ht(C′) ≥ ht(C). We discuss the different possibilities for C.

Case C ≡ C0, C1 with C0 ≢ 0 ≢ C1: Using forest reduction, we can split D ≡ D0, D1 with Dj ≅ Cj for j = 0, 1. Since ht(C0) < ht(C) and ht(C1) < ht(C), neither (C0, D0) nor (C1, D1) are felons, that is, C0 ≡ D0 and C1 ≡ D1, hence C ≡ C0, C1 ≡ D0, D1 ≡ D, contradicting (C, D) ∈ F.

Case C ≡ nP[C0]: By subtree reduction, D ≡ mQ[D0] with C0 ≅ D0. Since ht(C0) < ht(C), the pair (C0, D0) is not a felon, hence C0 ≡ D0. By closure under top-level names, furthermore n = m, and closure under labelled reduction implies that P and Q are process bisimilar. Hence nP[C0] and mQ[D0] are weakly congruent, contradicting (C, D) ∈ F.

Case C ≡ 0: From C ≅ D we conclude D ≡ 0, contradicting C ≢ D.

This concludes our investigation of the algebraic properties of BasicSail, which we summarise as follows:

Theorem 9. Suppose C, D ∈ C. The following are equivalent:
1. C and D are spatially congruent
2. C and D are labelled bisimilar
3. C and D are weakly structurally congruent
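The well-ordering argument in the proof of Proposition 8 is driven entirely by the height measure ht. As a small concrete companion, here is a Haskell transcription of ht over a hypothetical term representation of configurations; the datatype Conf and the placeholder Proc are our own illustrative encoding, not part of the calculus:

    -- Hypothetical term representation of BasicSail configurations; the
    -- calculus is parametric in processes, so Proc stays abstract here.
    data Proc = Proc
    type Name = String

    data Conf
      = Zero                 -- the empty configuration 0
      | Par Conf Conf        -- spatial composition  C, D
      | Loc Name Proc Conf   -- a named location     nP[C]

    -- ht(0) = 0,  ht(C, D) = ht(C) + ht(D),  ht(nP[C']) = 1 + ht(C')
    ht :: Conf -> Int
    ht Zero        = 0
    ht (Par c d)   = ht c + ht d
    ht (Loc _ _ c) = 1 + ht c

Since every configuration that is not congruent to 0 contains at least one location, the sub-configurations arising in the first two cases of the proof indeed have strictly smaller height, which is exactly what the minimality argument needs.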
4 The Logical Theory of BasicSail

In the previous section, we have looked at spatial congruence from an algebraic viewpoint and have given three different characterisations. This section adopts a logical view and gives a further characterisation of spatial congruence in terms of a modal-style logic. Using our setup from the previous section, this task is not overly difficult:
we just have to make the (standard) assumption that the underlying processes are finitely branching. Making this assumption, we obtain a logic which is completely standard, except for one binary modal operator. This operator plays a role similar to the linear implication used in [4, 2], except that linear implication in loc. cit. is the logical version of parallel composition, whereas the modal operator we are about to introduce is the logical dual of extending a parallel composition with one more process. As before, our definitions and results are parametric in a set N of names and the associated set L of labels (cf. Notation 1). We begin by introducing spatial logic. In essence, this definition is modelled after the characterisation given in Corollary 7.

Definition 7. The language L of spatial logic is the least set of formulas according to the grammar

  φ, ψ ::= 0 | @n | ff | φ → ψ | Rφ | φ ▷ ψ

where n ∈ N, l ∈ L ∪ {τ} and R ranges over the relations ↓, =⇒ and ==l==> for l ∈ L.

Intuitively, the formula 0 allows us to speak about the empty configuration, and @n allows us to observe the names of locations. Formulas of type Rφ allow us (as in standard modal logic) to reason about the behaviour of a process after evolving according to the relation R. In our case, we can specify properties of sub-configurations (using ↓), transitions (using =⇒) and labelled reductions (using ==l==>). The most interesting formula is of type φ ▷ ψ: it asserts that we can split a process into a single node satisfying φ and a remainder satisfying ψ.

Definition 8. The semantics of propositional connectives is as usual. For the modal operators, we put, for C ∈ C:

  C |= 0       iff  C ≡ 0
  C |= @n      iff  C ∈ @n
  C |= Rφ      iff  ∃C′. (C, C′) ∈ R and C′ |= φ
  C |= φ ▷ ψ   iff  ∃C′, C′′. C ↠ (C′, C′′) and C′ |= φ and C′′ |= ψ

where R is as above and ↠ denotes forest reduction. As usual, Th(C) = {φ ∈ L | C |= φ} denotes the logical theory of C ∈ C. Two configurations C, D are logically equivalent if Th(C) = Th(D). Note that we use the expression "@n" above both as an atomic formula of the logic and as a unary relation.

In this section, we show that logical equivalence gives yet another characterisation of spatial congruence, provided the underlying set of processes is finitely branching. This follows from the characterisation of spatial congruence as spatial bisimulation + labelled reduction by appealing to Proposition 2. We then obtain a characterisation of spatial congruence in the sense of Hennessy and Milner [7]. The main result of this section is as follows:
Theorem 10. Suppose (P, −→) is image-finite. Then spatial congruence and logical equivalence coincide.

Proof. We use the characterisation of spatial congruence as labelled bisimulation and Proposition 2. It follows directly from the definition of spatial logic that formulas of spatial logic cannot distinguish states which are labelled bisimilar; hence labelled bisimilarity is contained in logical equivalence. For the converse, we use the method of Hennessy and Milner [7] and a variant of Proposition 2, replacing "i + 1" by "i" in the last clause of the assumption (the meticulous reader is invited to check that the proposition remains valid). Suppose for a contradiction that there is a pair of configurations (C, D) ∈ C × C such that C and D are logically equivalent, but not labelled bisimilar. Let i be minimal with the property that C ≁i D but C ∼k D for all k < i (such an i exists because of Proposition 2). Since C and D are not labelled bisimilar, we have, up to symmetry, one of the following cases:

1. C ∈ @m but D ∉ @m for some m ∈ N. Then C |= @m but D ⊭ @m, contradicting Th(C) = Th(D).
2. There is C′ ∈ C such that (C, C′) ∈ R, but there is no D′ ∈ C with (D, D′) ∈ R and C′ ∼i−1 D′, where R is one of ↓, =⇒ or ==l==> (for l ∈ L). Since i is minimal, this means that for every D′ with (D, D′) ∈ R there is a formula φD′ such that C′ |= φD′ but D′ ⊭ φD′. Take φ = R(⋀ {φD′ | (D, D′) ∈ R}), which is well defined (i.e. the conjunction is finite) by Lemma 1. Then C |= φ but D ⊭ φ, contradicting Th(C) = Th(D).
3. There are C0, C1 with C ↠ (C0, C1), but there is no pair (D0, D1) ∈ C × C with Dj ∼i−1 Cj (j = 0, 1) and D ↠ (D0, D1). The argument is as above, using formulas of the form φ ▷ ψ.

Summing up, we have shown that

  spatial congruence = spatial bisimulation + labelled reduction
                     = structural congruence up to process bisimilarity
                     = logical equivalence.

Before extending these correspondences to a more general setting, we give some examples.

Example 4. We use the same setup as in Example 2.
1. Consider the configuration C ≡ home, client from Example 2. We have C |= (@home ▷ tt), corresponding to the statement that there is a top-level node with the name "home". Also, C |= (↓@agent ▷ tt), which expresses that C has a subtree, one node of which has the name "agent".
2. Consider the configuration C ≡ n1P[], n2Q[], similar to Example 2. Here, C |= (@n1 ▷ tt), i.e. there is a location in C with the name "n1". Also, C |= (@n1 ▷ (@n2 ▷ 0)), which says that all top-level processes contained in C have either the name n1 or n2.
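Definition 8, and with it the formulas of Example 4, is directly executable once the relevant relations are supplied. The following Haskell sketch is our own illustration, not part of the calculus: the relations ↓, =⇒, ==l==> and forest reduction ↠ are passed in as functions enumerating successors, so image-finiteness (as assumed in Theorem 10) is what makes the checker terminate.

    type Name = String

    -- Hypothetical syntax of spatial logic (Definition 7): the constant
    -- for the empty configuration, @n, falsity, implication, a relational
    -- modality R phi, and the split operator phi |> psi.
    data Formula c
      = Empty                          -- satisfied exactly by 0
      | At Name                        -- @n
      | FF
      | Imp (Formula c) (Formula c)
      | Modal (c -> [c]) (Formula c)   -- R phi; R given as successor function
      | Split (Formula c) (Formula c)  -- phi |> psi

    -- What a model must provide: an emptiness test (up to structural
    -- congruence), the top-level-name predicate @n, and forest reduction.
    data Model c = Model
      { isVoid :: c -> Bool
      , atName :: Name -> c -> Bool
      , forest :: c -> [(c, c)]        -- C ->> (C', C'')
      }

    sat :: Model c -> c -> Formula c -> Bool
    sat m c Empty       = isVoid m c
    sat m c (At n)      = atName m n c
    sat _ _ FF          = False
    sat m c (Imp f g)   = not (sat m c f) || sat m c g
    sat m c (Modal r f) = any (\c' -> sat m c' f) (r c)
    sat m c (Split f g) = any (\(c0, c1) -> sat m c0 f && sat m c1 g)
                              (forest m c)

    -- tt abbreviates ff -> ff; the first formula of Example 4 then reads:
    tt :: Formula c
    tt = Imp FF FF

    hasTopLevelHome :: Formula c
    hasTopLevelHome = At "home" `Split` tt    -- (@home |> tt)

Passing the relations as functions keeps the sketch agnostic about how reductions are actually computed, mirroring the way the logic itself is parametric in (P, −→).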
5 Local Names

In the calculus of mobile ambients, local names are essential for many examples. The treatment of local names is derived from the π-calculus, i.e. governed by the structural rule of scope extrusion

  (νn P) | Q ≡ νn(P | Q)

whenever n is not a freely occurring name of Q. In the ambient calculus, local names cut across dynamics and spatial structure, by adopting a second structural rule:

  νn(k[P]) ≡ k[νn P]   if n ≠ k,

which allows one to move the restriction operator up and down the tree structure induced by the nesting of the ambient brackets. If we want to remain independent from the underlying process calculus, we cannot adopt the latter rule. However, we can look at a calculus with local names, where local names obey scope extrusion à la π-calculus. The next definition extends the syntax so as to incorporate local names. In order to deal with scope extrusion, we also have to introduce the concept of free names.

Definition 9. The set C of configurations in the calculus with local names is given by

  C, D ::= 0 | nP[C] | C, D | (νn)C

for n ∈ N and P ∈ P. Given P ∈ P and n ∈ N, we say that n is free in P if there are l1, …, lk and P1, …, Pk such that P --l1--> P1 --l2--> ⋯ --lk--> Pk --l--> Q, where l is one of in n, out n and open n. We let fn(P) = {n ∈ N | n free in P}. For C ∈ C, the set fn(C) is defined by induction on the structure of C as follows:
– fn(0) = ∅
– fn(C, D) = fn(C) ∪ fn(D)
– fn(nP[C]) = {n} ∪ fn(P) ∪ fn(C)
– fn((νn)C) = fn(C) \ {n}
where structural congruence is as in Definition 1, augmented with α-equivalence and the rule

  (νn)(A, B) ≡ (νn A), B

whenever n does not occur freely in B. The operational semantics is given as in Definition 1, augmented with the rule

  C =⇒ C′
  ───────────────────
  (νn)C =⇒ (νn)C′

for C, C′ ∈ C and n ∈ N. Note that, in order to be able to state the rule for α-equivalence, we need a notion of substitution on the underlying processes, which we do not make explicit here.

Before investigating the logical and algebraic theory of the calculus with local names, we give a short example. Recall that in Example 2, we had an agent in a home location, the sole purpose of which was to transport clients inside the home location. However, as we remarked when discussing this example, nothing prevents the client process from entering the home location directly. This shortcoming can now be remedied in the calculus with local names.
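The free-name computation of Definition 9 is a plain structural recursion, as the following Haskell sketch shows. The encoding is again our own; fnP, the free names of a process, is determined by the labelled transitions of the underlying process calculus and is therefore only stubbed out here:

    type Name = String
    data Proc = Proc                -- abstract; supplied by the process calculus

    data Conf
      = Zero                        -- 0
      | Loc Name Proc Conf          -- nP[C]
      | Par Conf Conf               -- C, D
      | Nu Name Conf                -- (nu n)C

    -- fn(P): names occurring in reachable in/out/open capabilities;
    -- stubbed, since processes are kept abstract.
    fnP :: Proc -> [Name]
    fnP _ = []

    -- The four clauses of Definition 9, read off directly.
    fn :: Conf -> [Name]
    fn Zero        = []
    fn (Par c d)   = fn c ++ fn d
    fn (Loc n p c) = n : fnP p ++ fn c
    fn (Nu n c)    = filter (/= n) (fn c)

The result list may contain duplicates, which is harmless for the membership tests the calculus needs.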
Example 5. We can now model an agent which has the capability to enter and exit its home location and to transport clients inside, using local names, as follows: we let "client" and "agent" be as in Example 2 and put

  home = (νh) h0[agent]

Using scope extrusion, we have the same chain of reductions as in Example 2. However, since h is a private name now, the client cannot enter "home" without the help of "agent".

The next issue we are going to discuss is the algebraic and the logical theory of the calculus with local names. In order to obtain a similar characterisation as in the calculus without local names, we have to extend the definition of spatial bisimulation and demand closure under name revelations.

Definition 10. Suppose C ∈ C and n, k ∈ N. We put
  C ==rev n==> C′  iff  C ≡ (νk)C′′ and C′ ≡ C′′[n/k] for some C′′ ∈ C
whenever n ∉ fn(C). The definition of spatial bisimulation is modified as follows: spatial bisimulation is the largest symmetric relation which is closed under spatial reduction =⇒, forest reduction ↠, subtree reduction ↓, top-level names @n and under revelation ==rev n==> (for all n ∈ N). As before, spatial congruence is the largest congruence which is a spatial bisimulation.

We now turn to the impact of local names on the equivalences which we have discussed previously. Since we make revelation an explicit part of spatial bisimulation, everything goes through as before, once the equivalences are transferred (without changes) to the calculus with local names. We obtain:

– labelled bisimulation is the largest spatial bisimulation which is closed under labelled reduction;
– weak structural congruence is the least relation which contains structural congruence and all pairs of the form (nP[C], nQ[C]) for P, Q ∈ P process bisimilar.

Comparing these equivalences, we obtain

Theorem 11. In the calculus with local names, spatial congruence coincides with labelled bisimulation and with weak structural congruence.

Proof. We extend the respective results for the calculus without local names. The arguments used in Lemma 5 remain valid, showing that spatial congruence is closed under labelled reduction, implying that spatial congruence is contained in labelled bisimilarity. In order to see that labelled bisimulation is a congruence, one has to consider revelation reductions, that is, reductions of the form ==rev n==>, on top of the reductions considered in Lemma 6, but they do not pose any problems. The comparison of spatial congruence and weak structural congruence is as in Proposition 8.
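Revelation is equally computable on a term representation. The sketch below, again over our own hypothetical encoding (the Conf datatype and fn are repeated from the previous sketch so that it stands alone), elides renaming inside processes, so it is faithful only when controlling processes contain no free names; the hidden-name quantifier Hn.φ introduced below can then be checked by testing φ on the results of rev:

    type Name = String
    data Proc = Proc                          -- abstract process placeholder

    data Conf
      = Zero | Loc Name Proc Conf | Par Conf Conf | Nu Name Conf

    fnP :: Proc -> [Name]                     -- stub, as in the previous sketch
    fnP _ = []

    fn :: Conf -> [Name]
    fn Zero        = []
    fn (Par c d)   = fn c ++ fn d
    fn (Loc n p c) = n : fnP p ++ fn c
    fn (Nu n c)    = filter (/= n) (fn c)

    -- C''[n/k] on location names; renaming inside processes is elided.
    rename :: Name -> Name -> Conf -> Conf
    rename _ _ Zero        = Zero
    rename n k (Par c d)   = Par (rename n k c) (rename n k d)
    rename n k (Loc m p c) = Loc (if m == k then n else m) p (rename n k c)
    rename n k (Nu m c)
      | m == k    = Nu m c                    -- k is rebound underneath: stop
      | otherwise = Nu m (rename n k c)

    -- C ==rev n==> C'  iff  C = (nu k)C'' and C' = C''[n/k], n not free in C
    rev :: Name -> Conf -> [Conf]
    rev n c@(Nu k body) | n `notElem` fn c = [rename n k body]
    rev _ _                                = []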
In order to transfer the characterisation result to a logical setting, we introduce a hidden name quantifier à la Gabbay/Pitts [6]:

Definition 11. The language of spatial logic with local names is the least set according to the following grammar

  φ, ψ ::= 0 | @n | ff | φ → ψ | Rφ | φ ▷ ψ | Hn.φ

Given C ∈ C and φ ∈ L, satisfaction C |= φ is as in Definition 8, plus the clause
  C |= Hn.φ  iff  C ==rev n==> C′ and C′ |= φ for some C′ ∈ C

for the hidden name quantifier. As before, Th(C) = {φ ∈ L | C |= φ} for C ∈ C, and C, D ∈ C are called logically equivalent if Th(C) = Th(D). Since the relation ==rev n==> (for n ∈ N) is image-finite, Lemma 1 and Proposition 2 remain valid in the calculus with local names. We thus obtain

Theorem 12. In the calculus with local names, spatial congruence and logical equivalence coincide.
6 Further Extensions

This section shows that the separation of dynamic and spatial aspects of mobile components allows for the seamless integration of extensions which are more difficult to model in other calculi. First, we demonstrate that multiple names can easily be handled, since every process runs in precisely one location. It is therefore a straightforward extension to allow the controlling process to change the name of that location. The second extension can be seen as orthogonal: since the behaviour of every location is governed by precisely one process, new controlling processes can easily be substituted into configurations. This section intends to give an idea regarding extensions of the BasicSail calculus; we leave the investigation of the algebraic and logical theory for further work.

6.1 Change of Names and Multiple Names

In the ambient calculus, each ambient has precisely one name, which does not change throughout the reduction process. One can argue that this does not reflect the real world in a faithful manner, since computing devices can also have no names, multiple names, or change their names over time. The explicit reference to the enclosing location allows us to model the change of names elegantly in the Sail calculus by extending the set of labels which influence the spatial reduction relation. Since we want to keep the separation of the dynamical from the spatial structure, we let the controlling processes change the names of locations through an interface (a set of distinguished labels), as before. This necessitates extending the set of labels for the underlying process calculus:
Convention 3. We extend the set L of labels to include primitives for name changing as follows: L = {in, out, open, up, down} × N; as before, (P, −→) is a labelled transition system with −→ ⊆ P × L × P.

Definition 12. In the calculus with multiple names, configurations are given by

  A, B ::= 0 | A, B | νn A | (n1, …, nk)P[B]

where k ∈ ℕ and n1, …, nk ∈ N. The axioms and rules of structural congruence are those of Definition 9, augmented with

  (n1, …, nk)P[B] ≡ (nσ(1), …, nσ(k))P[B]
  (n, n, n1, …, nk)P[B] ≡ (n, n1, …, nk)P[B]

whenever σ : {1, …, k} → {1, …, k} is a bijection. The operational semantics is that of Definition 9, augmented with the rules
  P --up n--> P′
  ──────────────────────────
  n̄P[A] =⇒ (n̄+n)P′[A]

  P --down n--> P′
  ──────────────────────────
  n̄P[A] =⇒ (n̄−n)P′[A]

where n̄−n deletes n from the list n̄ of names and n̄+n adds n to the list n̄ of names. The idea of a term (n, m)P[A] is that of a location with two names, n and m, running the programme P and having A as sub-locations. The additional rules of structural congruence capture the fact that there is no order on the names. The gained expressivity allows us to treat the following:

Example 6.
1. Anonymous locations are modelled by an empty set of names; take for example ()P[A] for P ∈ P and A ∈ C. Note that anonymous locations are anonymous also for processes from within, that is, the same effect cannot be achieved using local names. Indeed, the processes νn (n)P[k out n[]] and ()P[k out n[]] differ in that the former can perform a reduction under the name binder, whereas the latter cannot.
2. Consider the configuration (n)down n.0[A], ()in n.0[B]. First, this shows that unnamed locations can perform movements. Second, this example illustrates that the movement only succeeds if the unnamed agent is lucky enough to enter its partner before the name disappears.
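The effect of the up and down labels on a location's name list is tiny to implement. The following Haskell fragment (names are our own) mirrors the n̄+n and n̄−n operations; the structural-congruence axioms of Definition 12 justify a duplicate-free, unordered representation:

    import Data.List (delete, nub)

    type Name = String

    -- The two name-changing labels added in Convention 3.
    data NameLabel = Up Name | Down Name

    -- n+_n and n-_n: add a name to, or delete a name from, a name list.
    applyNameLabel :: NameLabel -> [Name] -> [Name]
    applyNameLabel (Up n)   ns = nub (n : ns)
    applyNameLabel (Down n) ns = delete n ns

    -- An anonymous location, as in Example 6, carries the empty list.
    anonymous :: [Name]
    anonymous = []

In particular, applyNameLabel (Down n) on the singleton list [n] yields the empty list: exactly the situation of Example 6(2), where the name disappears from under the approaching agent.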
6.2 Dynamic Reconfiguration

We conclude by demonstrating the strength of our approach by discussing dynamic reconfiguration, another extension of the basic calculus. Here, we use the one-to-one relation between locations and controlling processes to model dynamic reconfiguration, i.e. locations which dynamically change the programme they run. Sloppily speaking, this allows for downloading a new programme, which is then run in an already existing location. As with multiple names and the change of names, the explicit reference to the enclosing location allows for a concise and elegant formulation of dynamic reconfiguration. Note that this in particular necessitates the transmission of programmes (processes). The extension of the calculus follows the same scheme as the above extension with multiple names: in order to keep dynamic and spatial structure apart, we introduce new labels, which act as an interface through which the controlling process manipulates the spatial structure.

Convention 4. We extend the set L of labels to include primitives for dynamic reconfiguration as follows: L = {in, out, open} × N ∪ {send, rec, run} × P; as before, (P, −→) is a labelled transition system with −→ ⊆ P × L × P.

Note that this requires the underlying transition system to have processes occurring in the labels, since processes need to be transmitted. Except for the absence of channel names, this is for example realised in the higher-order π-calculus (see [13, 16]). For our purposes, it suffices that processes can be transmitted and received; we leave the concrete (syntactical) mechanism abstract.

Definition 13. In the calculus with dynamic reconfiguration, configurations are given as in the calculus with local names (but recall the extended set of labels). The operational semantics for the calculus with dynamic reconfiguration is given by the rules of Definition 9, augmented with

  P --send R--> P′    Q --rec R--> Q′
  ─────────────────────────────────────
  nP[C], mQ[D] =⇒ nP′[C], mQ′[D]

  P --run R--> P′
  ───────────────────
  nP[C] =⇒ nR[C]

Note that in the action run R, R is a process, and the reduct of P after the run R reduction is forgotten. Using dynamic reconfiguration and communication, we can now model a location which updates the process it executes:

Example 7. We model an electronic device which attempts to update the code it is running (its operating system). That is, it tries to replace the programme which it is running by another (newer) version. In order to model this behaviour, we first have to be more precise about the underlying set of processes: we let

  P, Q ::= 0 | P|Q | α.P | !P | X | run Q.P | send Q.P | rec Q.P
where X ∈ X ranges over a set of (process-valued) variables. The process-level transition relation from Example 1 is augmented with

  rec X.P --rec Q--> P[Q/X]

and the usual rules

  send Q.P --send Q--> P
  run Q.P --run Q--> P

Note that in particular process variables X ∈ X do not generate reductions. Now consider P = (rec X.run X)|O running inside location n, that is, the configuration C = nP[B], where B are n's sub-locations. In the vicinity of a location which sends updates, e.g.

  U = u(!send(rec X.run X|N))[]

where N stands for the "new" firmware, we have

  U, C =⇒ U, n(run(rec X.run X|N)|O)[B]

which, executing the run-operation, reduces to

  U, n(rec X.run X|N)[B]

that is, a process which (again) waits for an update, but now running the new firmware N. As already mentioned in the introductory remark of this section, both extensions, multiple names and dynamic reconfiguration, are to demonstrate the extensibility of the calculus; the study of the algebraic and logical properties is left for further research.
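To see the run-rule of Definition 13 at work, consider a toy interpreter for just the fragment Example 7 uses. The sketch below is our own construction: the synchronisation of send and rec between two locations is elided, and only the replacement of a location's programme is modelled.

    -- Toy process syntax: just enough for the update loop of Example 7.
    data Proc
      = Stop                    -- 0
      | Run Proc Proc           -- run Q.P
      | Par Proc Proc           -- P|Q
      deriving Show

    -- A single location nP[...]; sub-locations are elided for brevity.
    data Loc = Loc String Proc deriving Show

    -- A process offers run R if it, or a parallel component, starts
    -- with a run-prefix.
    runLabel :: Proc -> Maybe Proc
    runLabel (Run r _) = Just r
    runLabel (Par p q) = case runLabel p of
                           Just r  -> Just r
                           Nothing -> runLabel q
    runLabel _         = Nothing

    -- The run-rule: the location's whole programme is discarded and
    -- replaced by R, as in  nP[C] =⇒ nR[C].
    stepRun :: Loc -> Maybe Loc
    stepRun (Loc n p) = fmap (Loc n) (runLabel p)

For instance, after the rec-step of Example 7 the device's location holds run(rec X.run X|N)|O; stepRun then yields a location running rec X.run X|N, with the old firmware O forgotten, exactly as in the example.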
7 Conclusions and Related Work

As discussed above, the first calculus for mobile computation was the π-calculus [8]. Further calculi are the Fusion calculus [12], Nomadic Pict [18] and the distributed coordination language KLAIM [10]. The study of hierarchical, re-configurable administrative domains was introduced by the Ambient [5] and the Seal calculus [17]. BasicSail follows these lines, but distinguishes processes and configurations in an a priori way and concentrates on an even simpler set of operations for reconfiguration. The basic calculus and its variations were inspired by the Seal calculus [17]. However, the Seal calculus is quite involved syntactically; the present calculus is a simplification, intended to study the effect of the separation of dynamics from the underlying topological structure, which is also present in Seal. The second source of inspiration was the calculus of mobile ambients [5]. As we have pointed out before, our principal
design decisions do not allow us to embed the full ambient calculus into our framework. Spatial logics were studied by Cardelli and Caires [2, 3], although, to our knowledge, not wrt. a clear characterisation of the expressive power. Such a characterisation (called "intensional bisimulation") was considered by Sangiorgi for a variant of the ambient calculus [14, 15]. Separation of concerns in models of software architecture has also been addressed, albeit not in the context of mobile code, in [1, 11]. There the authors differentiate between components, which provide certain services, and an additional layer, which describes the composition of components. In the context of explicit code mobility, this approach can be seen as orthogonal to ours, and it would certainly be interesting to have coordination and mobility in a single framework. Of course, there remains a wealth of open problems: most pressingly, we have investigated neither the logical nor the algebraic theory of the calculus with multiple names nor that of the calculus with reconfiguration.
References

1. F. Arbab. Abstract behaviour types: A foundation model for components and their composition. This volume.
2. L. Caires and L. Cardelli. A spatial logic for concurrency (part I). In N. Kobayashi and B. Pierce, editors, Proc. TACS 2001, volume 2215 of Lecture Notes in Computer Science, pages 1–37. Springer, 2001.
3. L. Caires and L. Cardelli. A spatial logic for concurrency (part II). In L. Brim, P. Jančar, M. Křetínský, and A. Kučera, editors, Proc. CONCUR 2002, volume 2421 of Lecture Notes in Computer Science. Springer, 2002.
4. L. Cardelli and A. Gordon. Anytime, anywhere: Modal logics for mobile ambients. In Proc. POPL 2000, pages 365–377. ACM, 2000.
5. L. Cardelli and A. Gordon. Mobile ambients. Theoretical Computer Science, 240(1):177–213, 2000.
6. M. Gabbay and A. Pitts. A new approach to abstract syntax involving binders. In 14th IEEE Symposium on Logic in Computer Science (LICS 1999), pages 214–224. IEEE Computer Society, 1999.
7. M. Hennessy and R. Milner. Algebraic laws for non-determinism and concurrency. Journal of the ACM, 32:137–161, 1985.
8. R. Milner. Communicating and Mobile Systems: the π-Calculus. Cambridge University Press, 1999.
9. U. Montanari and V. Sassone. Dynamic congruence vs. progressing bisimulation for CCS. Fundamenta Informaticae, 16(2):171–199, 1992.
10. R. De Nicola, G. Ferrari, and R. Pugliese. KLAIM: A kernel language for agents interaction and mobility. IEEE Transactions on Software Engineering, 24(5):315–330, 1998.
11. O. Nierstrasz and F. Achermann. A calculus for modelling software components. This volume.
12. J. Parrow and B. Victor. The fusion calculus: Expressiveness and symmetry in mobile processes. In Thirteenth Annual IEEE Symposium on Logic in Computer Science (LICS 1998), pages 176–185. IEEE Computer Society, 1998.
13. D. Sangiorgi. From π-calculus to higher-order π-calculus – and back. In M.-C. Gaudel and J.-P. Jouannaud, editors, Proc. TAPSOFT '93, volume 668 of Lecture Notes in Computer Science, pages 151–166. Springer, 1993.
14. D. Sangiorgi. Extensionality and intensionality of the ambient logics. In Proc. POPL 2001, pages 4–13. ACM, 2001.
15. D. Sangiorgi. Separability, expressiveness, and decidability in the ambient logic. In 17th IEEE Symposium on Logic in Computer Science (LICS 2002). IEEE Computer Society, 2002.
16. D. Sangiorgi and D. Walker. The π-calculus: A Theory of Mobile Processes. Cambridge University Press, 2001.
17. J. Vitek and G. Castagna. Seal: A framework for secure mobile computations. In Internet Programming Languages, 1999.
18. P. Wojciechowski and P. Sewell. Nomadic Pict: Language and infrastructure design for mobile agents. IEEE Concurrency, 8(2):42–52, 2000.