5 Matching Process
We now consider how a request from a user agent is matched with an advertisement by the match-maker. A request is first matched syntactically with the pattern expression and then semantically with the constraints.

The process of syntactic matching is straightforward. An advertisement a matches a request r if it can be made identical to r by substituting the pattern variables in a with the corresponding values in r; in other words, a matches r if a abstracts r. For example, the advertised pattern select(tables([opticalCable]), attrs([specNo]), $cond) can be made to match the request select(tables([opticalCable]), attrs([specNo]), op(=, attr(cableType), 'Unitube')) by substituting the variable $cond with op(=, attr(cableType), 'Unitube'). Our match-maker therefore performs a simple pre-order matching process, recursively matching the request against a simple pattern expression, argument by argument from left to right. The complexity of the matching process is therefore linear in the size of the advertisement base (the number of advertisements). The matching of a request with a composite expression is a trivial extension of this process.

Once the request has been matched with the pattern expression, the match-maker proceeds to attempt a semantic match against the domain and operational constraints. The semantic matching process can potentially be very complex, depending on the form of the constraints and the reasoning power of the match-maker. As our focus in this paper is the specification of advertisements, we have used standard first-order logic specification and inference for domain and operational constraints, as discussed in the previous section. However, there are clearly opportunities for improving the efficiency of matching by using more
advanced algorithms (based on constraint logic programming, for example). A simple match-maker based on the proposal presented in this paper has been implemented within the context of a case study of a distributed information system at the Pirelli Cables Ltd factory in Newport [6]. The matching process is implemented in Prolog, with a Java front-end; the JATlite toolkit was used to provide KQML-based communication between the software components.
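For illustration only: the match-maker itself is written in Prolog, but the following Java sketch shows the kind of recursive, pre-order matching with variable substitution described above. The term representation and all names here are our own assumptions, not part of the actual implementation.

```java
import java.util.List;
import java.util.Map;

/** Minimal term representation: a pattern variable such as $cond, or a compound term such as
 *  select(tables([opticalCable]), attrs([specNo]), $cond); atoms are compounds with no arguments. */
abstract class Term {}

class Var extends Term {
    final String name;                                   // e.g. "$cond"
    Var(String name) { this.name = name; }
}

class Compound extends Term {
    final String functor;                                // e.g. "select", "op", "attr"
    final List<Term> args;
    Compound(String functor, List<Term> args) { this.functor = functor; this.args = args; }
}

class Matcher {
    /** Pre-order match of a ground request against an advertised pattern. Pattern variables are
     *  bound to the corresponding sub-terms of the request; returns true if the pattern abstracts
     *  the request, i.e. if the substitution makes the two terms identical. */
    static boolean match(Term pattern, Term request, Map<String, Term> bindings) {
        if (pattern instanceof Var v) {
            Term bound = bindings.get(v.name);
            if (bound == null) { bindings.put(v.name, request); return true; }
            return match(bound, request, bindings);       // already bound: values must agree
        }
        if (pattern instanceof Compound p && request instanceof Compound r) {
            if (!p.functor.equals(r.functor) || p.args.size() != r.args.size()) return false;
            for (int i = 0; i < p.args.size(); i++)        // argument by argument, left to right
                if (!match(p.args.get(i), r.args.get(i), bindings)) return false;
            return true;
        }
        return false;
    }
}
```

A request is matched in this way against each advertisement in the advertisement base in turn (hence the linear complexity), and the resulting bindings, such as $cond bound to op(=, attr(cableType), 'Unitube'), are passed on to the semantic matching stage.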
6 Conclusions
The ability to advertise database capabilities concisely yet precisely is an important prerequisite for the development of flexible DIS architectures based on "location-independent" cooperation between components. A DIS based on match-making is more resilient to both short- and long-term change in its constituent components, as existing components can automatically make the decision to use services offered by new components. We have described an approach to advertisement that overcomes the limitations of previous proposals for describing database capabilities. The result is a capability description language that combines the following advantages:
– Pattern variables may be used at all levels to allow many different request forms to be advertised using a single pattern expression.
– Further flexibility in describing capabilities is provided by the ability to combine patterns using the OR, AND and NOT operators.
– Additional constraints can be placed on the semantics of the capabilities which are advertised, over and above the syntactic constraints given by pattern expressions. This allows agents to characterise their capabilities much more precisely than would be possible using pattern matching alone.
– Despite our focus on pattern matching as the basis of our advertisement format, the importance of matching on domain concepts as a means of reducing over-advertisement is not neglected. Additional constraints can be specified to ensure that incoming requests are matched only with resources which share their domain of interest.
– Our approach does not require that the entire schema of the resource be embedded explicitly within the advertisement. Thus, we do not place unreasonable demands on the match-maker's ability to cope with very large advertisements, nor do advertisements grow out of date as resources evolve.
Our proposal thus combines the flexibility of the pattern expressions of RQDL (albeit without the rigorous formal underpinnings of that proposal) with the realistic approaches to matching large domains in DISs and the ontological approach to operational constraints hinted at within the InfoSleuth proposal [8].
A number of open questions regarding the advertisement of capabilities for database agents still remain. There is as yet very little understanding of the trade-off between advertisement complexity (to reduce over-advertising) and the time required to match requests with such complex advertisements. It may be that a certain amount of over-advertisement is beneficial to the agent system as
a whole if this means that the match-maker agent is able to operate efficiently. Further investigation is also required into the level of intelligence to be provided by the match-maker agent. For example, it may be useful to endow the match-maker with incremental learning capabilities, allowing it to refine its database of advertisements based on experience. The match-maker can monitor the results of the requests it matches, building up a knowledge base of successful and unsuccessful cases. Initially, the system's performance would be relatively poor, as the match-maker would have very little knowledge of the participating resources, but it would improve over time as its experience grew. A facility for adaptive match-making of this kind could remove the responsibility for monitoring the capabilities of the network from individual components, and optimise the connections between consumer and provider components to improve the overall performance of the DIS.

Acknowledgements

We are grateful to Martin Karlsson for the implementation, to Alun Preece for his helpful comments on a draft of this paper, and to the members of the OKS group at Cardiff University for their feedback on our advertisement format.
References

1. Y. Arens, C.-N. Hsu, and C.A. Knoblock. Query Processing in the SIMS Information Mediator. In Austin Tate, editor, Advanced Planning Technology, pages 61–69. AAAI Press, 1996.
2. P.G. Baker, A. Brass, S. Bechhofer, C. Goble, N.W. Paton, and R. Stevens. TAMBIS – Transparent Access to Multiple Biological Information Sources. In J. Glasgow et al., editors, Proc. of 6th Int. Conf. on Intelligent Systems for Molecular Biology, pages 25–34, Montréal. AAAI Press, 1998.
3. K. Decker et al. Matchmaking and Brokering. In M. Tokoro, editor, Proc. of 2nd Int. Conf. on Multiagent Systems, Kyoto. AAAI Press, 1996.
4. K. Decker, K. Sycara, and M. Williamson. Middle-Agents for the Internet. In Proc. of 15th Int. Joint Conf. on Artificial Intelligence (IJCAI'97), pages 578–583, Nagoya, Japan. Morgan Kaufmann, 1997.
5. T. Finin, R. Fritzson, D. McKay, and R. McEntire. KQML as an Agent Communication Language. In Proc. of 3rd Int. Conf. on Information and Knowledge Management (CIKM'94), pages 456–463, Gaithersburg, USA, 1994. ACM Press.
6. M. Karlsson. A Matchmaker Agent for Database Applications. Master's thesis, Cardiff University, September 1999.
7. D. Kuokka and L. Harada. Supporting Information Retrieval via Matchmaking. In C. Knoblock and A. Levy, editors, Proc. of AAAI Spring Symp. on Information Gathering from Heterogeneous, Distributed Envs., pages 111–115. AAAI, 1995.
8. M. Nodine, W. Bohrer, and A.H.H. Ngu. Semantic Brokering over Dynamic Heterogeneous Data Sources in InfoSleuth. In M. Papazoglou, C. Pu, and M. Kitsuregawa, editors, Proc. of 15th Int. Conf. on Data Engineering (ICDE'99), pages 358–365, Sydney. IEEE Computer Society Press, 1999.
9. N. Singh, M. Genesereth, and M.A. Syed. A Distributed and Anonymous Knowledge Sharing Approach to Software Interoperation. International Journal of Cooperative Information Systems, 4(4):339–367, 1995.
10. K. Sycara, J. Lu, and M. Klusch. Interoperability among Heterogeneous Software Agents on the Internet. Technical Report CMU-RI-TR-98-22, Robotics Institute, Carnegie Mellon University, October 1998.
11. K. Sycara, J. Lu, M. Klusch, and S. Widoff. Matchmaking among Heterogeneous Agents in the Internet. In S. Murugesan and D.E. O'Leary, editors, Proc. of AAAI Spring Symp. on Intelligent Agents in Cyberspace, Stanford, USA, 1999.
12. V. Vassalos and Y. Papakonstantinou. Describing and Using Query Capabilities of Heterogeneous Sources. In M. Jarke et al., editors, Proc. of 23rd Int. Conf. on Very Large Data Bases, pages 256–265, Athens, 1997. Morgan Kaufmann, Inc.
13. V. Vassalos and Y. Papakonstantinou. Expressive Capabilities Description Languages and Query Rewriting Algorithms. Journal of Logic Programming, 43(1):75–122, 2000. Special Issue on Logic-Based Heterogeneous Information Systems.
A Personal Assistant for Web Database Caching

Beat Signer, Antonia Erni, and Moira C. Norrie

Institute for Information Systems, ETH Zurich, CH-8092 Zurich, Switzerland
{signer, erni, norrie}@inf.ethz.ch
Abstract. To improve the performance of web database access for regular users, we have developed a client caching agent, referred to as a personal assistant. In addition to caching strategies based on data characteristics and user specification, the personal assistant dynamically prefetches information based on previously monitored user access patterns. It is part of an overall multi-layered caching scheme where cache coherency is ensured through cooperation with a server-side database caching agent. The personal assistant has been implemented in Java and integrated into the web architecture for the OMS Pro database management system.
1 Introduction
Significant gains in performance for web access to databases can be achieved with a shift away from server-side access architectures towards web-based client-server database architectures. By moving components of the database system to the web client, user access times can be improved by reducing both server load and communication costs. Communication costs can be decreased through a combination of using socket-based communication instead of the slower Hypertext Transfer Protocol (HTTP) and eliminating some of the communication through client-side caching mechanisms and processing. In addition, further reductions in user response times can be achieved by predictive prefetching of data into an active cache based on observed user access patterns.

We have implemented such a web-based architecture for the object-oriented database management system OMS Pro [NW99,KNW98], in which client-side DBMS components include a generic object browser, schema editor, presentation editor and also caching agents [ENK98,EN98]. The client-side caching framework may optionally contain a personal assistant which provides registered users with an adaptive, persistent cache on their local machine. The personal assistant monitors a user's access behaviour in order to dynamically prefetch data in accordance with predicted access patterns. Cache coherency is ensured in cooperation with the server-side database agent, where communication is established through persistent socket connections between the client-side agents and the database agent. Changes to the database can therefore be propagated to all client-side agents, ensuring that only current information is stored in the client agents' caches. The resulting multi-level caching scheme, together with its
prefetching strategies, has enabled us to significantly improve performance while keeping the caching overhead small.

The personal assistant's cache provides benefits not only in terms of performance, but also in terms of mobile computing, in that it allows objects of interest to be explicitly cached and then accessed without an active connection to the server side. We note, however, that this is not the main purpose of the cache; full, disconnected operation of OMS Pro databases with bi-directional, synchronised updates is supported through a general database connectivity mechanism, OMS Connect [NPW98].

Our architecture is specifically designed to cater for the requirements of web databases and the access patterns of their users, in contrast to other proposals for advanced web caching schemes [Wes95,CDF+98,CZB98,BBM+97]. General web caching strategies typically do not cache dynamically generated documents, nor do they deal with the case of client-side applications retrieving query results from a database on the server side. In contrast, we recognise the browsing-style access patterns of many web-based information systems, where users navigate from single entry points and frequently issue the same or similar requests to a database, as is reflected by the heavy use of the "back" button in web browsers.

Clearly, building client-server database systems on the web is similar in concept to any form of client-server database architecture. However, the fact that we access the database via a web browser, with the client components implemented as Java applets, presents some special technical challenges, including problems of security restrictions, the generally dynamic and temporary nature of the client-server relationship, interaction with browsers and non-uniformity across different browsers.

In Sect. 2, we present our general web-based client-server architecture in terms of the various caches and the agents managing them. Section 3 goes on to detail the functionality of the personal assistant and the active cache it maintains. The personal assistant's user interface is outlined in Sect. 4 and some implementation details are given in Sect. 5. Performance issues and preliminary measurement results are presented in Sect. 6, while concluding remarks are given in Sect. 7.
2 Web Client-Server Database Architecture
Currently, most architectures for web access to databases are server-based in that the whole database functionality resides on the server side [ACPT99]. To avoid the processing overheads associated with CGI architectures and provide support for interactive user sessions, Java servlets provide a general means of improving the efficiency and ease of development of server-based application systems [HCF98]. In the case of database systems, several general and DBMS-specific web servers tailored for database system access have been developed. However, with all of these solutions, processing tends to remain on the server side and communication is via HTTP, with results presented as Hypertext Markup Language (HTML) documents.
While Java applets are increasingly being used to move processing to web clients, it tends to be the case in web information systems that there is a separation of application and database logic and all components of the DBMS remain on the server side. For example, an application applet may use a JDBC database server to retrieve information from a database [Ree97]. There are, however, disadvantages to this approach resulting from the fact that the applet and the database are only loosely coupled, meaning that the applet cannot exploit the full functionality of the database system and has to do all the work of interpreting and assembling query results and then generating their presentation.

We are instead interested in making a DBMS web-capable in the sense that components of the DBMS can execute on the client side. Thus, it is not a case of client applications connecting via the web to server database systems, but rather of adapting the DBMS to a dynamic client-server architecture in which the clients run through a web browser. The client-server architecture is dynamically created when the client is downloaded as an applet over the Internet. A major advantage of this approach is that database-specific caching strategies can be implemented on the client machine. Other advantages come in terms of the ease of providing clients with extended database functionality, eliminating unnecessary mappings through intermediate standard interfaces and generally providing the look and feel of a DBMS through the web for both developers and end-users.

To this end, we have developed web components for OMS Pro. In addition to client-side components for browsing and caching, we also have schema and object presentation editors which execute on the client side and enable databases to be designed and prototyped over the web. The general browser and developer components are heavily used in teaching, since they enable students to browse, query, update and create databases via the web. Furthermore, they are valuable in providing on-line demonstrations of our OMS Pro system. The caching components, with their own presentation layer, are part of the Internet OMS framework which can be used by any web application system based on OMS Pro.

In Fig. 1, we show the web architecture of the resulting Internet OMS system [EN98], with respect to the general querying and browsing interface and cache management. OMS Pro has an integrated server, so we can assume that the database on the server side is ready to accept requests. We show that, although the database is on the server side, it may be located on a machine other than that of the web server. Initial access to the system is through a normal web browser using the HTTP protocol, as indicated by step 1. An applet is returned and a front-end agent is started on the client side. This front-end agent is responsible for communication with the server side, the presentation of objects and the maintenance of a short-term session cache. Thus, when query requests are issued, the front-end agent checks whether they were previously requested, in which case the results are already stored in the local session cache. Therefore, if users browse a database, going "back" to previous objects, no communication with the server is required.
Fig. 1. Architecture overview of Internet OMS
A query result contains the required database data together with additional metadata, such that the front-end agent can generate the appropriate view of the object and present it to the user. The front-end agent sends requests to a database agent on the server side, as shown in step 2.

The database agent manages a global cache and notifies client-side caching agents of database updates that invalidate data objects in their caches. This is possible since we use two-way communication over persistent connections instead of HTTP. The global cache is beneficial across the user community in caching query results already requested by other users or by the same user in previous sessions. This proves to be extremely advantageous, since web information systems usually have only a few entry points and the queries corresponding to points at the beginning of navigation paths tend to be repeated frequently and are relatively stable in terms of their results.

To further improve the performance for regular users, it is possible to locally install an adaptive caching agent, referred to as a personal assistant, which manages a client-side persistent cache. The personal assistant registers with the database agent on the server side and its cache is maintained in cooperation with the database agent. It manages a short-term session cache and long-term personal and prefetching caches; separate caches are provided for image files. The personal assistant maintains a user access profile which is used to dynamically prefetch data during user sessions based on access predictions. In addition, users can explicitly specify data of interest and the personal assistant ensures that this data is always stored in the local cache and is current. The caches managed by the personal assistant are persistent in that they are written to disk every time the personal assistant is shut down (for security aspects see Sect. 5). If a personal assistant is running on a client machine, the database agent informs the front-end agent and all communication is redirected through the personal assistant, as indicated by step 3 in Fig. 1.
Prefetching of information on the web is not a new idea. Several approaches have already been proposed, such as proxy servers prefetching and storing popular pages based on the observation of client access patterns, thereby allowing retrieved results to be shared among clients [CY97]. A proxy-side pre-push technique [JC98] relies on proxies sending (pushing) predicted documents from the proxies to the browsers. The term speculative service used by [Bes95] defines prefetching in terms of server-side statistics and actions, as opposed to clients controlling what and when documents are prefetched. Our approach goes more in the direction of cooperative caching, having client and server agents cooperate with each other to prefetch the correct data. A similar prefetching approach for general web caching is described in [SW97].

These approaches all concentrate on prefetching HTML documents from proxies or web servers to web browsers, and neither proxies nor browsers cache dynamically generated pages. Since we are interested in prefetching dynamically generated query results, we provide cooperating agents which prefetch such results, not by using proxy or browser caches, but by providing our own caching hierarchy, as shown in the dynamic client-server architecture in Fig. 1. The partitioning of the local active cache and the combination of various caching and prefetching mechanisms have enabled us to significantly improve performance while keeping the caching overhead in terms of space, processing and communication costs to an acceptable level. In the following sections, we describe the operation of these various cooperative agents and their caches in detail.
3 Personal Assistant and Its Active Cache
Three different "subcaches" make up the active cache managed by the personal assistant: the session cache, the personal cache and the prefetching cache (see Fig. 2). These three caches are disjoint, i.e. the same information will be cached in at most one of them. They differ from the front-end agent's session cache in that their content is made persistent and is available in future sessions. In recognition of the fact that multimedia files, especially images, form a vital part of most web-based information systems, we provide special support for the caching of images. Each of the three subcaches is divided into a part for caching query results and a part for caching images. We have chosen this approach because image sizes are large relative to the query results and often the same image is required by different queries.
Fig. 2. Personal assistant's caching framework
Due to this separation, an image will always be cached at most once, even if it is needed by more than one query. The caching space saved by this strategy, i.e. by eliminating image redundancy, is used to cache additional query results. Every time a query result is removed from a cache, we check whether an associated image is still required by any other query before removing it from the cache. This is achieved by a reference counter indicating the number of queries using the image, together with a garbage collector. Note that the personal assistant will not cache any referenced HTML documents or multimedia files other than images, e.g. sound and movies.
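As an illustration of this reference-counting scheme, consider the following Java sketch; the class and method names are hypothetical choices of ours, not taken from the Internet OMS implementation.

```java
import java.util.HashMap;
import java.util.Map;

/** Image store shared by the subcaches: each image is kept at most once and is
 *  reference-counted by the number of cached query results that use it. */
class ImageStore {
    private static class Counted { final byte[] data; int refCount; Counted(byte[] d) { data = d; } }

    private final Map<String, Counted> images = new HashMap<>();

    /** Called when a query result referencing this image is added to one of the caches. */
    void retain(String imageRef, byte[] data) {
        Counted c = images.computeIfAbsent(imageRef, r -> new Counted(data));
        c.refCount++;
    }

    /** Called when a query result is removed; the image is garbage-collected
     *  once no cached query result refers to it any more. */
    void release(String imageRef) {
        Counted c = images.get(imageRef);
        if (c != null && --c.refCount <= 0) images.remove(imageRef);
    }

    byte[] get(String imageRef) {
        Counted c = images.get(imageRef);
        return c == null ? null : c.data;
    }
}
```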
3.1 Session Cache
The session cache, as its name suggests, acts as a short-term cache and caches only recently used query results. The caching strategy used by the session cache is therefore a simple least recently used (LRU) strategy (for further information about caching strategies see [Tan92,SG94,Fra96]). By using an LRU replacement strategy, the session cache profits from the locality of reference within a single session.
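A minimal Java sketch of such an LRU cache is shown below, using the access-ordered LinkedHashMap idiom; the class name and capacity handling are our own illustrative choices rather than the actual implementation.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Simple LRU cache for query results, keyed for example by the query string. */
class SessionCache<K, V> extends LinkedHashMap<K, V> {
    private final int maxEntries;

    SessionCache(int maxEntries) {
        super(16, 0.75f, true);          // access-order iteration gives LRU behaviour
        this.maxEntries = maxEntries;
    }

    @Override
    protected boolean removeEldestEntry(Map.Entry<K, V> eldest) {
        return size() > maxEntries;      // evict the least recently used entry when full
    }
}
```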
3.2 Personal Cache
A user may explicitly specify data of interest by defining query results to be permanently cached. These query results are then managed by the personal cache. To ensure that the data in the personal cache is always up to date, every time a new entry is added to the cache, the personal assistant registers it with the database agent on the server side. Whenever data changes in the database, the database agent checks whether registered queries are affected. If so, it sends a cache invalidation message to the corresponding personal assistants, forcing them to update their caches to maintain cache consistency. Removal of a query result from the personal cache causes the query to be unregistered with the database agent. Further, every time a personal assistant is started, the entire contents of its persistent caches are registered with the database agent.

The personal cache does not have a replacement strategy, i.e. if there is no more space in the cache, the cache manager will not select a victim and replace it with the new entry to be added. If a user tries to add a new query result to the personal cache and there is no more space, the caching agent informs him that he either has to remove some entries from the cache manually or increase the personal cache size at runtime. This "caching strategy" makes sense because the user wants to be sure that all the query results added to the personal cache will be present in the future and not be deleted by the cache manager without his knowledge.

Of the three subcaches, the personal cache has the highest priority, since the user has explicitly defined its query results to be permanently cached. Therefore, if an entry is added to the personal cache and is already part of either the
session or prefetching cache, it will be removed from these caches to avoid cache redundancy. Further, a query result will not be added to the session or the prefetching cache if it is already present in the personal cache.
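The registration and invalidation protocol described at the start of this subsection might, on the server side, look roughly as follows; this is an illustrative sketch under our own assumptions about the interfaces, not the actual OMS Pro database agent code.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

/** Server-side database agent: tracks which personal assistants registered which queries
 *  and notifies them when a database update invalidates a registered result. */
class DatabaseAgentRegistry {
    // query string -> personal assistants that hold its result in a persistent cache
    private final Map<String, Set<PersonalAssistantHandle>> registrations = new HashMap<>();

    void register(String query, PersonalAssistantHandle pa) {
        registrations.computeIfAbsent(query, q -> new HashSet<>()).add(pa);
    }

    void unregister(String query, PersonalAssistantHandle pa) {
        Set<PersonalAssistantHandle> pas = registrations.get(query);
        if (pas != null) pas.remove(pa);
    }

    /** Called whenever the database changes. */
    void onDatabaseUpdate(DatabaseChange change) {
        for (Map.Entry<String, Set<PersonalAssistantHandle>> e : registrations.entrySet()) {
            if (change.affects(e.getKey())) {                 // does the update touch this query's result?
                for (PersonalAssistantHandle pa : e.getValue())
                    pa.sendInvalidation(e.getKey());          // over the persistent socket connection
            }
        }
    }
}

interface PersonalAssistantHandle { void sendInvalidation(String query); }
interface DatabaseChange { boolean affects(String query); }
```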
3.3 Prefetching Cache
In addition to the short-term session cache and the user-defined personal cache, the prefetching cache analyses user behaviour and tries to predict the information the user will need in the near future. It dynamically adapts to a user's preferences without any interaction.
Fig. 3. Structure of the prefetching statistic
For each query requested by the user (query 1 to query n in Fig. 3), the prefetching cache maintains a statistic of the succeeding queries and the number of times they were requested immediately after the current query (next query 1 to next query n in Fig. 3). The statistic is in turn represented by a cache of limited size, so for each query only a limited number of possible succeeding queries – those most likely to follow the current query – are considered in the statistic. The simplest replacement strategy for these statistic caches would be an LRU strategy. The problem is that these caches should act as long-term caches, i.e. the statistic will grow over a long period, so we also have to consider the number of accesses to each cache entry when selecting a victim for removal. As a result, the LRU strategy is extended by the number of times a cache entry was requested, yielding an LRUB caching strategy in which the number of cache hits acts as a bonus for entries that were often used in the past but have not been accessed recently. We therefore use the following weighting formula for cache entry i:
weight_i = α / T_i + (1 − α) · H_i,   0 ≤ α ≤ 1   (1)
where H_i is the number of hits for entry i and T_i is the time since the last access of entry i. Each time an element has to be added to a full cache, the cache entry with the smallest weight is removed. The influence of earlier cache hits can be adjusted by changing the value of α in (1): the smaller the value of α, the greater the weighting given to the number of cache hits (setting α = 1 gives a simple LRU cache).

The queries, together with their statistics, are themselves maintained in a cache of limited size using the same LRUB caching strategy. Therefore, a statistic of succeeding queries is present only for the queries most likely to be requested in the future. Each time a user requests a new query, the personal assistant checks whether a statistic exists for it. If a corresponding statistic is present, the queries with the highest weights (those most likely to succeed the current query) are selected and the corresponding query results are prefetched into the prefetching cache. The maximum number of query results to be prefetched is defined by the prefetching width. Increasing the prefetching width can slightly improve the cache hit rate but, as a negative effect, the database agent has to handle additional queries.

The prefetching cache profits from the fact that, in web interfaces to databases, users often browse in a similar way, i.e. as users navigate through the database, there exist "chains of queries" often requested in the same order. While a user analyses the result of one query, the personal assistant prefetches the predicted next query results. Over slower network connections, this active prefetching makes better use of the available bandwidth and may exploit otherwise paid-for, but idle, modem connection time. The great advantage of the proposed dynamic prefetching mechanism over a "static" strategy of globally caching the most frequently used query results is that the prefetching cache, like a sliding window, always focuses on the current query. As a result, it only contains the data most likely to be required in the near future and therefore needs much less memory than a global strategy trying to achieve the same hit rate.
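The following Java sketch illustrates one way such an LRUB cache and the selection of queries to prefetch could be realised; it follows formula (1), but the class structure, names and treatment of timestamps are our own assumptions rather than the actual implementation.

```java
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Fixed-size cache whose victim is the entry with the smallest LRUB weight, as in formula (1). */
class LrubCache<K> {
    private static class Entry { int hits = 1; long lastAccess = System.currentTimeMillis(); }

    private final Map<K, Entry> entries = new HashMap<>();
    private final int capacity;
    private final double alpha;              // alpha = 1 degenerates to plain LRU

    LrubCache(int capacity, double alpha) { this.capacity = capacity; this.alpha = alpha; }

    private double weight(Entry e) {
        double t = Math.max(1, System.currentTimeMillis() - e.lastAccess);   // time since last access
        return alpha / t + (1 - alpha) * e.hits;
    }

    /** Record a hit for a key, evicting the smallest-weight entry if the cache is full. */
    void touch(K key) {
        Entry e = entries.get(key);
        if (e != null) { e.hits++; e.lastAccess = System.currentTimeMillis(); return; }
        if (entries.size() >= capacity) {
            K victim = entries.entrySet().stream()
                    .min(Comparator.comparingDouble(en -> weight(en.getValue())))
                    .get().getKey();
            entries.remove(victim);           // evict the entry with the smallest weight
        }
        entries.put(key, new Entry());
    }

    /** The k keys with the highest weights, e.g. the successor queries to prefetch. */
    List<K> topK(int k) {
        return entries.entrySet().stream()
                .sorted(Comparator.comparingDouble((Map.Entry<K, Entry> en) -> weight(en.getValue())).reversed())
                .limit(k).map(Map.Entry::getKey).toList();
    }
}
```

In the scheme described above, two layers of such caches would be kept: one over the queries themselves and, per query, one over its observed successor queries; on each user request, up to prefetching-width successors of the current query are fetched into the prefetching cache.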
4 Agent User Interfaces
As can be seen from the description of the active cache in the previous section and the general client-server architecture in Sect. 2, the overall system comprises many levels of caches and associated agents. It is therefore essential that tools be provided to facilitate system installation, administration and tuning. For each agent – database, front-end and personal assistant – a graphical user interface is provided that enables both the agent and associated cache parameters to be set and the operation of both the agent and the cache to be monitored. An example of such an interface is given in Fig. 4, which shows the graphical user interface of the personal assistant.

The parameters option enables the agent's associated properties, such as the server host name and the port numbers for communication with other agents, to be specified. It also provides methods to start and stop the agent.
Fig. 4. Personal assistant’s prefetching cache
The log option enables a log of agent activities to be inspected; it further shows additional information about processed queries, clients connecting to the agent and other network-specific details. All information presented in the log view is stored in a log file and can therefore also be inspected with a standard text editor after the agent has been terminated. The statistics option displays information such as the number of queries processed, their execution time (with or without images), the average execution time and the percentage cache hit rate.

Options are also available for each of the caches managed by the agent – in the case of the personal assistant these are the personal, prefetching and session caches. The graphical user interface shows information about the usage and size of both query and image caches. As shown in Fig. 4, it is possible to view, resize or delete the contents of the different caches. In Fig. 5, a view of the query part of the prefetching cache is presented: the cache consists of a number of query results with the associated number of hits shown on the right-hand side.
Fig. 5. Prefetching query cache content
The personal cache is the only cache whose content can be directly manipulated by the user. To make it easy for a user to specify query results to be permanently cached, the general object browser of Internet OMS was extended with a facility to add a query result to the personal cache by simple menu selection, as shown in Fig. 6. To remove a query result from the personal cache, the user may either browse the currently cached query result and select the corresponding menu entry or view the personal cache directly and delete the query entry through the personal assistant's graphical user interface.

Fig. 6. Personal cache menu

When resizing the personal cache, the behaviour differs from that of the other caches. If the user tries to resize either the personal query or image cache to a size smaller than the total size of all entries currently in the cache, he is informed that the new size is too small; query results then have to be removed manually from the cache until the space used is less than the desired new cache size. Note that, in the case of the prefetching cache, deletion of either the prefetching query or image cache causes only the cache entries to be deleted and not the statistics on which the actual prefetching is based.
5 Implementation
All web client components of Internet OMS and also the server-side database agent are implemented in Java. The front-end agent is implemented as a Java
applet accessed via a web browser. Optionally, a personal assistant may be downloaded and installed as a local Java application.

By default, Java applets have several security restrictions, including the fact that they can only open network connections to the host machine from which they were downloaded. In the case of our architecture this presents a problem, as the front-end agent applet must be able to communicate with a locally installed personal assistant. Fortunately, recent browsers have provisions to give trusted applets the ability to work outside the usual restrictions. For this power to be granted to an applet, additional code has to be added to the applet requesting permission to perform any "dangerous" actions, and the applet has to be digitally signed with an unforgeable digital ID (private key). Each time a user runs a signed applet requesting some form of access to the local system, the browser checks the privileges granted to the corresponding signer. If the user already granted the requested additional power to the signer during an earlier session, the browser allows the access without interrupting the user's activities; otherwise the browser shows a Java security dialog box asking whether the user trusts the developer who signed the applet.

Every time a personal assistant is started, it automatically sends its IP address and listening port number to the database agent. When a new front-end agent is executed on that client machine, it first connects to the database agent, sending it the IP address of the user's machine and asking for the IP address and port to which it should connect. The database agent checks whether there exists a registered personal assistant with the same IP address as the address received from the front-end agent. If a personal assistant is registered on the client, the database agent sends the IP address and port of the personal assistant to the front-end agent; otherwise it sends its own IP address and port, so that the front-end agent connects directly to the database agent. For a user starting the front-end applet, the process is the same whether or not a personal assistant is installed.
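The connection hand-over just described could be sketched as follows; the class, record and method names are illustrative assumptions of ours, not the real Internet OMS interfaces.

```java
import java.net.InetAddress;
import java.util.Map;
import java.util.Optional;
import java.util.concurrent.ConcurrentHashMap;

/** Server-side lookup used by the database agent to redirect front-end agents. */
class ConnectionDirectory {
    record Endpoint(InetAddress address, int port) {}

    /** client IP -> address/port of the personal assistant registered from that machine */
    private final Map<InetAddress, Endpoint> assistants = new ConcurrentHashMap<>();

    /** Called when a personal assistant starts and announces itself. */
    void registerAssistant(InetAddress clientAddress, int listeningPort) {
        assistants.put(clientAddress, new Endpoint(clientAddress, listeningPort));
    }

    /** Called when a front-end agent connects: returns the endpoint it should talk to —
     *  the local personal assistant if one is registered, otherwise the database agent itself. */
    Endpoint endpointFor(InetAddress clientAddress, Endpoint databaseAgent) {
        return Optional.ofNullable(assistants.get(clientAddress)).orElse(databaseAgent);
    }
}
```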
6 Performance
To demonstrate the efficiency of our agent architecture, we have built predictive performance models for the caching hierarchies of our system, as described in [MA98]. We provide a formal way of dealing with performance problems of our architecture based on the theory of queueing networks. Performance models have been developed for the case where no caches are used at all, where only a global cache on the server side is used, and for the system taking advantage of the personal assistant's and front-end agent's caches [Ern00]. In this paper, we present initial measurement results showing the advantages of using our personal assistant to reduce user response times over the case of accessing the database directly.

For the measurements, we queried the City Database, which was developed to teach object-oriented database concepts in various lecture courses using the OMS Pro system.
The database contains information about cities and their places of interest, such as hotels, museums and sightseeing tours. The public web interface for the City Database allowed us to collect query traces from a large number of users. Since we are interested in testing performance improvements when using a personal assistant on the client side, we needed traces of individual users to simulate their browsing behaviour. To build a realistic test environment for our simulations, we stressed the server and the network with a moderate workload consisting of our own collected traces.

We tested the system for a number of individual user traces. Obviously, the personal assistant achieves the best results when the browsing style of a specific user has been studied over a long period; we refer to this as the best case. If, instead, the personal assistant knows nothing about the browsing behaviour of a user, it is likely that the predicted queries will not be the next ones requested and therefore the wrong data is prefetched into the personal assistant's cache; this is the worst case that can occur. The querying behaviour of every user lies somewhere between these two boundaries.

How can these two cases be analysed? For the best case, we provide the personal assistant with history data of user behaviour studied in previous sessions and run the simulation feeding the system one query after the other as given by the query trace, reproducing exactly the browsing behaviour of an individual user. The worst case can be simulated by providing no history data to the personal assistant and selecting queries from the query trace file at random; with randomly selected queries, the prefetching of the personal assistant is of little or no benefit.

The two scenarios were simulated for several users with and without a personal assistant installed. The average response times are shown in Fig. 7. The left bar chart shows the results for the worst case, where no history data is delivered to the personal assistant and the queries are selected randomly from the query trace. The total time represents the average time to query the database, receiving not only the data but also all images a query result contains; the query time is the average time required to retrieve the data only (without any images); and the image time represents the mean time needed to fetch the images.
Fig. 7. Average response times
As shown, the total response time is reduced by the personal assistant even in the worst case scenario, where absolutely no information about the user's browsing behaviour is available. The reduced response times result from the fact that the front-end agent's session cache size is effectively increased by the overall cache size of the personal assistant. The right part of Fig. 7 shows the results for the best case, where the personal assistant's prefetching cache reduces user response times significantly.

These results were achieved with a relatively small caching overhead in terms of space. For the database agent, a global cache of 20 kBytes is used, while no image cache is necessary since all images are already stored on the web server. The personal assistant has an image cache of 512 kBytes and a data cache of 40 kBytes, counting the personal, prefetching and session caches together. In addition, we also have the session cache of the front-end agent, storing 20 kBytes of data and 512 kBytes of images. The query caches can be kept this small because the results contain only textual data and metadata; for images, only references are listed within the query results, whereas the pictures themselves are stored in the image cache. The number of query results to be prefetched (the prefetching width) is set to three, i.e. for each query at most three successor queries are executed and their results prefetched. Of course, larger caches would provide additional performance advantages since more objects and pictures could be stored, but significant performance gains were already achieved with relatively small caches.

An interesting phenomenon occurs with the average image response times in the worst case scenario (left bar chart of Fig. 7). A more detailed view of the image response times for four different user sessions is shown in the left part of Fig. 8. For all four users, more time is required to download the images with a personal assistant installed: if the front-end agent requests an image and the image is not in the personal assistant's cache, it first has to be downloaded into the personal assistant's image cache and then forwarded to the front-end agent. This overhead becomes significant if the personal assistant's image hit rate is low (the average cache hit rates are shown in Table 1). When the correct data is prefetched, however, as shown in the right part of Fig. 8, the time for accessing images is drastically reduced.
Fig. 8. Image response times
Table 1. Average cache hit rates

                       Worst Case           Best Case
                       queries   images     queries   images
Front-End Agent          13%       4%         33%      29%
Personal Assistant       29%      28%         75%      68%
The results of these preliminary experiments are promising. Further simulations are planned for heavy workloads and larger data and query sets. As a basis for these experiments, we first want to record typical querying patterns for other operational web-based application systems based on the OMS Pro database.
7 Concluding Remarks
In this paper, we presented our web-based agent architecture for accessing databases over the Internet. Cooperating agents on the client and server sides are responsible for managing local and global caches and ensuring cache coherency. In addition, a personal assistant can be installed on the client side to further improve user response times. The personal assistant monitors user behaviour and assists the user by prefetching the most frequently requested data into a local prefetching cache. Users may further explicitly declare data of interest to ensure that this data is permanently stored in the personal assistant’s local cache. Our architecture has been tested by simulating user sessions and comparing the results achieved with and without a personal assistant. First results show that significant gains in performance are achieved through the installation of a personal assistant on the client side. Even in the worst case scenario where the personal assistant cannot take advantage of learnt query access patterns for predictive prefetching of data, faster access to the data was achieved resulting from the increased overall cache size. Further, the multi-level caching scheme, together with its prefetching strategies, has enabled us to achieve significant performance improvements while keeping caching space to an acceptable level.
References

ACPT99. P. Atzeni, S. Ceri, S. Paraboschi, and R. Torlone. Database Systems: Concepts, Languages and Architectures. McGraw-Hill, 1999.
BBM+97. M. Baentsch, L. Baum, G. Molter, S. Rothkugel, and P. Sturm. Enhancing the web infrastructure — from caching to replication. IEEE Internet Computing, 1(2):18–27, March/April 1997.
Bes95. A. Bestavros. Using speculation to reduce server load and service time on the www. In Proceedings of the International Conference on Information and Knowledge Management, CIKM'95, Baltimore, MD, November 1995.
CDF+98. R. Cáceres, F. Douglis, A. Feldmann, G. Glass, and M. Rabinovich. Web proxy caching: the devil is in the details. In Proceedings of Workshop on Internet Server Performance (WISP'98), Madison, WI, June 1998.
CY97. K. Chinen and S. Yamaguchi. An interactive prefetching proxy server for improvement of WWW latency. In Proceedings of the Seventh Annual Conference of the Internet Society (INET'97), Kuala Lumpur, June 1997.
CZB98. P. Cao, J. Zhang, and K. Beach. Active cache: Caching dynamic contents on the web. In Proceedings of IFIP International Conference on Distributed Systems Platforms and Open Distributed Processing (Middleware '98), 1998.
EN98. A. Erni and M. C. Norrie. Approaches to Accessing Databases through Web Browsers. INFORMATIK, Journal of the Swiss Informaticians Society, October 1998.
ENK98. A. Erni, M. C. Norrie, and A. Kobler. Generic Agent Framework for Internet Information Systems. In Proceedings of IFIP WG 8.1 Conference on Information Systems in the WWW Environment, Beijing, China, July 1998.
Ern00. A. Erni. A Generic Agent Framework for Internet Information Systems. PhD thesis, ETH Zurich, to be published 2000.
Fra96. M. J. Franklin. Client Data Caching: A Foundation for High Performance Object Database Systems. Kluwer Academic Publishers, 1996.
HCF98. J. Hunter, W. Crawford, and P. Ferguson. Java Servlet Programming. O'Reilly & Associates, 1998.
JC98. Q. Jacobson and P. Cao. Potential and limits of web prefetching between low-bandwidth clients and proxies. In Proceedings of the 3rd International WWW Caching Workshop, Manchester, England, June 1998.
KNW98. A. Kobler, M. C. Norrie, and A. Würgler. OMS Approach to Database Development through Rapid Prototyping. In Proceedings of the 8th Workshop on Information Technologies and Systems (WITS'98), Helsinki, Finland, December 1998.
MA98. D. A. Menascé and V. A. F. Almeida. Capacity Planning for Web Performance: Metrics, Models and Methods. Prentice Hall, 1998.
NPW98. M. C. Norrie, A. Palinginis, and A. Würgler. OMS Connect: Supporting Multidatabase and Mobile Working through Database Connectivity. In Proceedings of Conference on Cooperative Information Systems, New York, USA, 1998.
NW99. M. C. Norrie and A. Würgler. OMS Pro: Introductory Tutorial. Technical report, Institute for Information Systems, ETH Zurich, CH-8092 Zurich, Switzerland, March 1999.
Ree97. G. Reese. Database Programming with JDBC and Java. O'Reilly & Associates, 1997.
SG94. A. Silberschatz and P. Galvin. Operating System Concepts. Addison-Wesley, 1994.
SW97. J. Sommers and C. E. Wills. Prefetching on the web using client and server profiles. Technical report, Worcester Polytechnic Institute, June 1997.
Tan92. A. S. Tanenbaum. Modern Operating Systems. Prentice-Hall, 1992.
Wes95. D. Wessels. Intelligent caching for world-wide web objects. In Proceedings of INET'95, Hawaii, 1995.
Extending a Conceptual Modelling Approach to Web Application Design

Jaime Gómez1, Cristina Cachero1,*, and Oscar Pastor2

1 Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante, Spain
{jgomez,ccachero}@dlsi.ua.es
2 Departamento de Sistemas Informáticos y Computación, Universidad Politécnica de Valencia, Spain
[email protected]

* This article has been written with the sponsorship of the Conselleria de Cultura, Educació i Ciència de la Comunitat Valenciana.
Abstract. This article presents OO-HMethod, an extension of the OO-Method conceptual modelling approach to address the particulars associated with the design of web interfaces. It is based on the OO-Method class diagram, which captures the statics of the system. The design of the interface appearance and the navigation paths is driven by the user navigation requirements. To achieve its goal, OO-HMethod adds several navigation and interface constructs to the OO-Method conceptual model, which define the semantics suitable for capturing the specific functionality of web application interfaces. A new kind of diagram, the 'Navigation Access Diagram' (NAD), is introduced. All the concepts represented in the NAD are stored in a repository, and from there a functional interface is generated in an automated way. One of the main contributions of this paper is not the proposal of yet another method for web modelling but the extension of an existing conceptual modelling approach.
1 Introduction
In the last few years the scientific community has conducted major research both on the desirable characteristics of hypermedia applications and on the concepts and processes that make up an effective environment for the structured development of such applications. Many of these concepts have proven useful and have therefore been applied in a number of design methods, such as HDM [9], HDM-lite [7], OOHDM [19], RMM [11], ADM [1,12] or Strudel [6]. This article presents OO-HMethod, an extension of a conceptual modelling approach known as OO-Method that supports the conceptual design of web applications. OO-Method [14,15] is a powerful proposal for software production from conceptual models. It is based on the formal object-oriented model OASIS [13], and its main feature is that developers' efforts are focused on the conceptual modelling phase. In this phase, system requirements are captured according to a predefined, finite set of conceptual modelling patterns (representation of relevant concepts at
the problem space level). The full OO implementation is obtained in an automated way following an execution model (including structure and behaviour). This execution model establishes the corresponding mappings between conceptual model constructs and their representation in a particular software development environment. Due to lack of space, a formal description of OO-Method is not included; interested readers are referred to [14]. OO-HMethod extends OO-Method, which implies that it relies on the information and functionality already provided by OO-Method, e.g. the control over pre- and postconditions or the mandatory/optional nature of attribute values. OO-HMethod allows the designer to concentrate on the concepts needed to define a web user interface compatible with previously generated OO-Method applications. A new diagram is defined at the conceptual level. This diagram has specific semantics and notational characteristics to cover the observation, navigation and presentation aspects that have proven useful in the design of web applications. Following the philosophy of OO-Method, OO-HMethod applies two concepts: that of filters, based on dynamic logic [10], and that of navigation patterns. Both the conversion of those patterns to their corresponding default presentation structure and the subsequent mapping of this structure into a set of default interface implementation constructs make it possible to generate the web architecture in an automated way. The generated code constitutes a true web interface for executing existing OO-Method applications over internet/intranet environments.

The remainder of the article is structured as follows: Section 2 provides a brief description of OO-HMethod in the context of the OO-Method approach. Section 3 presents the OO-HMethod conceptual model and describes in detail, by means of an example, the navigation access diagram that is used to capture user requirements semantics. Section 4 introduces the OO-HMethod execution model and shows the web interface that is generated from the information captured in the navigation access diagram. A comparison with related work is presented in Section 5. Section 6 presents the conclusions and further work.
2 OO-HMethod
OO-HMethod is a generic model for the semantic structure of web interfaces, and is therefore centered on global activities (authoring in the large) [9], that is, on classes and structures, and not on the content of the information nodes (authoring in the small). It is integrated in OO-Method and extends the set of graphical elements necessary to obtain the abstract interaction model of the user interface. It also captures the information which each type of user (agent) can access and the navigation paths from one information view to another. An interface execution model is also provided in order to determine the way the conceptual model is implemented in a given development environment; it defines how interface and application modules are to communicate with each other. The navigation model is captured by means of the Navigation Access Diagram (NAD). Each piece of information introduced in the NAD has a corresponding formal counterpart and is stored in a system repository, from where a functional interface can be generated in an automated way.
The interface is coded according to the software environment chosen by the designer, both on the client and on the server side. The rest of the paper discusses both the conceptual and the execution models.
3 Conceptual Model: The Navigation Access Diagram
As stated above, the navigation model is captured by means of one or more NADs. It is necessary to have at least one NAD for each user type (agent type) who is allowed to navigate through the system. The first step in the construction of a NAD is the filtering and enriching of the information (classes, services and attributes) provided by the class diagram previously captured in OO-Method during the conceptual modelling phase. Conceptual modelling in OO-Method collects the system properties using three complementary models: the object model, the dynamic model and the functional model. For the purpose of this paper we are interested in the object model. The object model is a graphical model where system classes, including attributes, services and relationships (aggregation and inheritance), are defined. Additionally, agent relationships are introduced to specify who can activate each class service (client/server relationship). As each type of agent has a different view of the system and can activate different services, each one needs its own NAD.

The filtering of the information is based both on the agent relationships from the object model and on a previous analysis of the navigation requirements, which sets boundaries to the information views required for each agent. This filtering process additionally allows the designer to create simplified prototypes and versions of the system. Furthermore, NADs belonging to different OO-Method models could be combined under a common interface, which gives the method the capability to construct the system as a collection of collaborating web sites.

For a complete perspective of the approach presented here, a small example will be employed: a library system. As a basic explanation (for reasons of brevity), it is assumed that, as is usual in such a system, there are readers, books and loans relating a book to the reader who orders it. A restriction forbids a reader from having more than three books on loan at any time. If a book is not available, or the reader has already reached the three-book limit, the book can be reserved. The librarian can search for books both by title and by author. Readers play the role of unreliable readers in the system if one or more of their return dates have expired; a reader who is considered unreliable cannot borrow any books. The class diagram of this example is shown in Fig. 1.

Classes are represented as rectangles with three areas: the class name, the attributes and the services. Inheritance relationships are represented by arrows linking classes; for instance, the arrow between 'reader' and 'unreliable reader' denotes that 'unreliable reader' is a specialization of 'reader'. Aggregation relationships are represented by a diamond from a given component class to its composite class. The aggregation determines how many components can be attached to a given container and how many containers a component class can be associated with.
[Figure: classes RESERVATION, LOAN, READER, BOOK, LIBRARIAN and UNRELIABLE READER with their attributes, services, aggregation/inheritance relationships and agent relationships.]
Fig. 1. Class diagram of a library system
The aggregation determines how many components can be attached to a given container and how many containers a component class can be associated with. For instance, the 'loan' composite class represents an aggregation between the component classes 'book' and 'reader'. Agent relationships are represented by dotted lines that connect the associated client and server classes. In the example, the objects of the 'librarian' class can activate the services 'loan', 'return', 'reserve' and 'cancel' of the 'book' class.

In order to design the NAD, the designer should know the navigation requirements of each type of user of the system. In the example it is assumed that the navigation requirements for the librarian user are as follows:

1. Lend a book. The librarian, acting on behalf of a reader, can search for a certain book either by author or by title. If the book is not available, or the reader already has three books on loan, a reservation on this book can be made for a future loan (a small sketch of this rule follows the list).
2. List active loans. For a given reader, the librarian can see at any moment the list of books s/he has borrowed from the library, and register the return of any of those books. Also, the librarian might want to look for extended information about each book (title, author, summary).
3. List out-of-date loans. The librarian can see the readers that have books with expired return dates, together with the date and the title of the books that should have already been returned. From this listing the librarian can decide whether to penalize them or not.
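The lend-or-reserve rule of requirement 1, together with the three-book restriction from the class diagram, can be made concrete with a minimal Python sketch. The class and attribute names used here (Book, Reader, on_shelf, unreliable) are hypothetical illustrations of the example domain, not OO-HMethod constructs.

```python
class Book:
    def __init__(self, title, author, on_shelf=True):
        self.title, self.author, self.on_shelf = title, author, on_shelf

class Reader:
    MAX_LOANS = 3                      # restriction stated in the class diagram

    def __init__(self, name, unreliable=False):
        self.name, self.unreliable, self.loans = name, unreliable, []

    def can_borrow(self, book):
        """Unreliable readers cannot borrow; others are limited to three loans."""
        return (not self.unreliable
                and len(self.loans) < self.MAX_LOANS
                and book.on_shelf)

def lend_or_reserve(reader, book):
    """Requirement 1: lend the book if possible, otherwise offer a reservation."""
    if reader.can_borrow(book):
        reader.loans.append(book)
        book.on_shelf = False
        return "loan"
    return "reservation"
```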
The main components of the NAD are navigation classes, navigation targets, navigation links and collections. Each of these constructs addresses the navigation model from a different dimension. These concepts are further developed below, following the example presented above.
3.1 Navigation Classes
Navigation Classes (NC) are grounded in the classes identified during the conceptual modelling phase, and are represented by means of a rectangle with three areas:

1. Head: it contains the name of the class.
2. Attribute area: it contains the names and the scope of the attributes relevant to the considered agent and view.
3. Service area: it gathers the services available to the actual agent of the NAD.

All the attributes shown as part of an NC are accessible by the related agent, but the importance of each of these attributes may vary. OO-HMethod introduces the concept of attribute visibility in order to group the attributes depending on their relevance for the agent. This grouping determines the way in which the user is able to access a given attribute. There are three types of attribute visibility:

1. Always Visible (V): their value is shown in every view of the object.
2. Referenced (R): their value is only referenced, so consulting it requires another step in the navigation path. The way of accessing this kind of attribute may vary depending on the implementation environment. When working with HTML, for example, one possible way would be to show an anchor labelled 'More Information'.
3. Hidden (H): their value is neither referenced nor shown. The only way of accessing them is thus by means of a detailed view of the system.

This further differentiation of attributes reduces the possibility of the user feeling overwhelmed by the information shown on the views s/he has of the system. Another relevant concept is that of Perspectives (P) [9,19]. Perspectives are defined on a set of edges, each one representing a relevant way of presenting the information. The perspective types defined so far in OO-HMethod are:

1. Language: English, Spanish, German, French, Italian.
2. Rhetorical style: abbreviated, simplified, expert.
3. Medium: animation, image, text.

The best way of capturing perspectives in the NC is to define multivalued attributes, which are specified by means of a name and a set of values in brackets. Among the different perspectives, one must be declared as 'default' by means of a + sign.
The perspectives have two possible scopes: global to the attribute type or local to each attribute. A global perspective is inherited by all the attributes of the type specified in the perspective. Global perspectives can be seen as 'visualization modes' [8], because they provide the application with different views of the same type of information for the same type of user. A local perspective, on the other hand, adds possible visualization modes to individual attributes.

After deciding which information to show (navigation classes), the following phase is to define how to navigate through this information. When defining the navigation, the designer must take into account many different aspects such as the order of objects, the filtering process or the cardinality of the access. These features are captured by means of different constructs associated with links, navigation targets and collections, which are presented in the following sections.
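Before moving on to navigation, the following minimal sketch shows one way a navigation class with attribute visibilities and perspectives could be represented in code. It is an illustration only, under the assumption of hypothetical names (Visibility, NavAttribute, NavigationClass); OO-HMethod itself specifies NCs graphically in the NAD. The example mirrors the 'book' NC of the librarian NAD discussed later.

```python
from dataclasses import dataclass, field
from enum import Enum

class Visibility(Enum):
    ALWAYS_VISIBLE = "V"   # shown in every view of the object
    REFERENCED = "R"       # reachable through an extra navigation step
    HIDDEN = "H"           # only available in a detailed view

@dataclass
class NavAttribute:
    name: str
    visibility: Visibility
    # perspectives as a multivalued attribute, e.g. {"medium": ["text+", "image"]};
    # the value marked with '+' is the default perspective
    perspectives: dict = field(default_factory=dict)

@dataclass
class NavigationClass:
    name: str
    attributes: list
    services: list

book_nc = NavigationClass(
    name="BOOK",
    attributes=[
        NavAttribute("book code", Visibility.HIDDEN),
        NavAttribute("title", Visibility.ALWAYS_VISIBLE,
                     {"medium": ["text+", "image"]}),
        NavAttribute("author", Visibility.ALWAYS_VISIBLE),
        NavAttribute("summary", Visibility.ALWAYS_VISIBLE,
                     {"rhetorical style": ["abbreviated+", "expert"]}),
        NavAttribute("status", Visibility.HIDDEN),
    ],
    services=["new_book", "destroy_book", "loan", "return", "reserve", "cancel"],
)
```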
3.2 Navigation Targets
The NCs are grouped into Navigation Targets (NT). An NT is a set of NCs which together provide the agent with a coherent view of the system. The general rule is to associate an NT with each navigation requirement of the user. The NT has an associated scope: local to the actual type of agent (and so to the actual NAD) or global to the system. The graphical representation of an NT (see Fig. 2) is a rectangle that gathers all the classes involved in that view. An NT is defined by its name, which is located on a tab in the upper-left corner of the rectangle.

The concept underlying the NT is not new. Many authors have identified variants of such a concept: nodes [19], derived entities [9], hyperviews [7], macroentities [1] or targets [4]. Nevertheless, OO-HMethod uses a different approach and bases its NTs on user requirements, instead of on the physical presentation of the information (pages), as others do. For OO-HMethod, whether that information is presented in a single web page or divided among several pages is not important at this level of abstraction. In fact, the same NT could have several different materializations (pages), in a way similar to the 'targets' defined in [4]. The definition of the NT implicitly captures the 'navigation context pattern' [17]: the same nodes might appear in several NTs as long as they do not represent the same information requirement, and in each NT their layout and navigation mode might be different. Note in Fig. 2 how, in the definition of 'book' provided in the 'loan book' NT, the principal attributes have been marked as 'always visible (V)' and two local perspectives have been added to the title of the book: text (default) and image (corresponding to a photo of the book cover).
3.3 Navigation Links
A Navigation Link (NL) is defined by:

1. Name.
2. Source navigation class.
3. Target navigation class.
4. Associated navigation patterns.
5. Associated navigation filters.

In OO-HMethod there are four types of NL:

1. Lr (requirement link): it marks the entry point to the NT. Every NT has a requirement link, which is drawn as a black circle with an arrow pointing at either the root navigation class or a collection inside that NT.
2. Ls (service link): it points at a service of a navigation class, and is drawn as a ray-arrow. It may have parameter values associated with it.
3. Li (internal link): both its source and target NC remain inside a given NT. Its main characteristic is that its activation does not change the user context and, usually, it does not produce user disorientation.
4. Lt (traversal link): it is defined between navigation classes belonging to different navigation targets, and thus defines alternative visualization paths to the objects of the target classes.

Navigation links are always directed. This means that, if there is a need to navigate in both directions, two links must be explicitly or implicitly specified. The link label will be used as its identifier (for example, as the anchor text of the generated web page), so it should have a semantic meaning. Furthermore, as shown in Fig. 2, all link types have both 'navigation patterns' and 'navigation filters' associated with them. Both concepts are developed next, and a small code sketch combining them follows at the end of this subsection.

Navigation patterns. A navigation pattern is defined as a mechanism that allows a web user interface to share its knowledge about the way of showing the information objects to the user. Some authors [2] call them 'linking patterns'. OO-HMethod defines four navigation patterns, which can be associated both with navigation links and with collections (see below):

1. Index: access to a list of links to the different objects that form the population of the visualized class. In the view of each object there is always, in addition to the links derived from the semantic relationships among the classes being visualized (and captured in a navigation pattern), a link back to the index page.
2. Guided Tour: it provides access to an object of the target population (depending on the associated filters) and a set of four operations (next, previous, first and last) in order to navigate through this target population.
3. Indexed Guided Tour: it combines the index with the guided tour mode.
4. Showall: it shows all the target objects together. It has an added attribute called 'cardinality' which allows the answer to be paginated, thus limiting the size of the pages and the load time for the user.

These patterns have been inherited from other well-known models such as HDM-lite [7], and enriched by adding a set of attributes that complete the navigation mode they represent. An example of this enrichment is the pattern attribute 'show in origin/show in destination'.
The 'show in destination' attribute means that the information in the target objects will be shown on a different page, while 'show in origin' presents all the information about the target objects together with that of the source object. The selection of the most suitable navigation pattern depends on two variables:

1. The semantic relationships underlying the link.
2. The granularity of the link: how many target objects are likely to be related to each object of the origin class.

Navigation filters. Associated with links and collections, a set of Navigation Filters (NF) can also be defined. A navigation filter restricts the order, the quantity (number) or the quality (characteristics) of the target objects. Formally, a navigation filter is a well-formed formula (expressed in a subset of dynamic logic [10]) that establishes a restriction on the attributes of the target class. There are three types of filters:

1. Attribute filters: they specify values that the corresponding attribute values of the target population must conform to.
2. Condition filters: they can represent either method parameters (if associated with a service link) or additional rules and restrictions on the target population. A $ sign as the value of the filter means that the value has to be given by the user before traversing the corresponding link. This mechanism provides a means to define user-dependent target populations.
3. Order filters: they specify the order in which the target population will be accessed.
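As announced above, the following minimal sketch ties navigation links, patterns and filters together. All names (LinkType, NavPattern, NavigationLink) are hypothetical illustrations, not OO-HMethod notation; the filter is written as a plain string, mirroring the condition filters shown in Fig. 2.

```python
from dataclasses import dataclass, field
from enum import Enum

class LinkType(Enum):
    REQUIREMENT = "Lr"   # entry point to a navigation target
    SERVICE = "Ls"       # points at a service of a navigation class
    INTERNAL = "Li"      # source and target inside the same NT
    TRAVERSAL = "Lt"     # crosses navigation target boundaries

class NavPattern(Enum):
    INDEX = "index"
    GUIDED_TOUR = "guided tour"
    INDEXED_GUIDED_TOUR = "indexed guided tour"
    SHOWALL = "showall"

@dataclass
class NavigationLink:
    name: str
    link_type: LinkType
    source: str                                    # source navigation class (None for Lr)
    target: str                                    # target navigation class or collection
    pattern: NavPattern
    filters: list = field(default_factory=list)    # attribute/condition/order filters

# The 'Loan/Reservation Book' entry of the librarian NAD, roughly as in Fig. 2:
lr_loan = NavigationLink(
    name="Loan/Reservation Book",
    link_type=LinkType.REQUIREMENT,
    source=None,
    target="BOOK",
    pattern=NavPattern.GUIDED_TOUR,
    filters=["title=$ || author=$"],   # '$': value supplied by the user
)
```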
3.4 Collections
Another important decision about the user interface is how the information should be grouped in order to make it more accessible to the final user. In this context, another concept is introduced: the collection (see Fig. 2). A collection, represented by means of an inverted triangle, is a structure, hierarchical or not, which abstracts some concepts regarding both external and internal navigation. Collections have an associated scope (global, local to an NT or local to an NC), as well as an associated set of filters and navigation patterns, and are a useful mechanism for limiting the interaction options between user and application, thus improving the usability of the system. A similar concept appears in different articles [6,4]. Collections can be nested but, as a rule of thumb, more than two nesting levels should be avoided, as they might disorient the user. OO-HMethod defines four basic types of collections (a small sketch of the last type is given at the end of this subsection):

1. Classifiers: they define structures, hierarchical or not, for grouping information or service links which are to appear together.
2. Transactions: they group navigation services that are to work together. They should correspond to a transaction as defined in OO-Method.
3. Selectors: they group the objects that conform to the values gathered from the user. They usually have an associated condition filter.
4. History collections: OO-HMethod defines a history list [20] as a set of the last x links the user has traversed, where x is a parameter of this kind of collection. Activating one of these links drives the user to the current state of the required information, and not to the previous state, that is, the state it was in when it was visited for the first time.

Figure 2 shows a sample NAD of the librarian agent for the user navigation requirements specified above. Note that access to the punish function is permitted inside the navigation target 'view out of date loans', which means that the librarian must check, in this context, the state of the reader's loans before punishing a reader.
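As mentioned in item 4 above, a history collection keeps the last x traversed links and re-targets them to the current state of the information. The following minimal sketch illustrates that behaviour; the names (HistoryCollection, record, activate) are hypothetical and serve only to make the parameter x and the 'current state' semantics concrete.

```python
from collections import deque

class HistoryCollection:
    """Keeps the last x traversed links; activation resolves the *current* state."""

    def __init__(self, x):
        self.entries = deque(maxlen=x)   # older entries are discarded automatically

    def record(self, link_name, resolve_target):
        # resolve_target is a callable, so re-activation always fetches the
        # information as it stands now, not a snapshot of the first visit
        self.entries.append((link_name, resolve_target))

    def activate(self, link_name):
        for name, resolve_target in self.entries:
            if name == link_name:
                return resolve_target()
        raise KeyError(link_name)

# Usage: history of the last 5 links; re-activating "View Lent Books"
# re-queries the loans as they stand at activation time.
history = HistoryCollection(x=5)
history.record("View Lent Books", lambda: ["current", "loan", "list"])
```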
4 Execution Model
The execution model provides the method with the representation details of the interface conceptual model for a target development environment. As the execution strategy is already defined in OO-Method, OO-HMethod is centered on defining how to implement the interface-level information associated with web environments. All the concepts represented in the NAD are stored in an object repository, and from there a default presentation diagram can be generated. Should the interface designer need to refine this presentation and add some advanced features (e.g. multiple active views of the system), s/he can still edit this diagram (which is not discussed here) and change it as needed. The subsequent mapping of this structure onto a set of default interface implementation constructs provides the means to generate the functional interface automatically. The definition of the default navigation semantics (dependent on the semantic relationships existing in the conceptual schema) and that of a default interface layout (application of interface patterns to the elements of the conceptual schema) allow a CASE tool to suggest and/or derive certain application links and to generate functional prototypes in an automated way.

Figures 3 to 6 illustrate the prototype generated for the 'loan book' user navigation requirement. The process is as follows. First, the generator tool looks for the entry point of the application. As nothing is stated in the diagram, it is assumed that the entries to the different NTs have an 'index' navigation pattern (see Fig. 3). Next, it generates a guided tour for the selected books. As the access to this class is made through a selector, the generator tool inserts a preliminary form asking for any of the fields (author or title) by which the search can be performed (see Fig. 4). The different screens of the guided tour (see Fig. 5) show all the information labelled as 'visible'. For each attribute perspective (expert summary, book cover image) a link is added to the form. In addition, the operations that can be performed on each object are shown next to the book-defining attribute (title).
[Figure: the librarian NAD contains the navigation targets 'Loan Book', 'Return Book' and 'View out of date loans', with READER, BOOK and LOAN navigation classes, requirement/internal/traversal links annotated with navigation patterns (index, guided tour, showall) and filters such as [title=$ || author=$] and [reader.book_number<3 && state=onshelf && reader.punished='false'].]
Fig. 2. Simplified NAD of the Librarian Agent
For each book there are two possibilities, depending on the state of the book and the number of books the reader has on loan. When the reader already has three books on loan, or the book has previously been lent to another reader, the service 'Make Reservation' is made available. Otherwise, the available service is 'Lend Book'. As the value of the only parameter of the method 'Lend Book' (id-reader) must be introduced by the librarian, a form appears asking for its value. Fig. 6 shows both this form and the system response.
Fig. 3. Library entry point
Fig. 4. Book search
Fig. 5. Guided tour applied to the search result
Fig. 6. Loan service parameters request and result page
5 Comparison with Related Work
All the methods studied so far share many of the concepts that have proven useful in the design of hypermedia applications. As an example, one could cite the concept of 'collection', whose basic meaning is captured in HDM by means of outlines, in RMM by means of the grouping mechanism, in NCM by union nodes and aggregations, in OOHDM by context classes and in HDM-lite by collections. Another example is that of perspectives, which appear with the same name and meaning in models such as HDM, HDM-lite and OOHDM. OO-HMethod captures these and other concepts gathered in classical hypertext theory, but has a number of features that make it, overall, different from other models.

From the OO-HMethod point of view, one of the main drawbacks of some of these models is that they are focused on the modelling of hypermedia systems, oriented towards information navigation and visualization [3]. In such systems there is no interaction with the user apart from their navigation decisions (or at least it is not taken into account when modelling the system). On the contrary, OO-HMethod extends the applications modelled with OO-Method, and so provides specific mechanisms for the user to interact with the system. The resulting hypermedia applications cover both the static structure and the dynamic behaviour. Another important difference is that these models focus from the very beginning on the structural aspects of the solution space (the data and how it is going to be presented to the user), instead of centering on the structural aspects of the problem space. These solution-driven approaches lead to longer development periods and sometimes more complex design processes. OO-HMethod aims at being simple: it tries to define intuitive diagrams instead of declarative or query languages, which are more likely to overwhelm designers without a strong computer science background. We also try to avoid restructuring information already captured in other models especially suitable for this purpose. That is the case of derived entities in HDM, nodes in OOHDM or macroentities in ADM. We believe that this information rearrangement mixes together the information and the navigation perspectives, which we try to keep as independent as possible. In our approach, navigation is captured by means of links and the structures associated with them (patterns, filters and collections), while the relevant information is captured in the classes inherited from the conceptual level.

OO-HMethod is explicitly a user-driven approach. By introducing the concept of 'navigation target' we group the functionality of the interface into separate modules, possibly designed by different people, that aim at meeting a user requirement, in the sense of a specific functionality asked for by the client. OO-HMethod shares some apparent similarities with the OOHDM model [18]. This fact is partly due to the OO approach both methods take; the advantages of the OO approach are discussed in [19]. Both of them derive from a similar class schema that models the problem domain. Also, both methods clearly separate conceptual design, navigation design, presentation design and implementation. However, OO-HMethod simplifies the process of defining each phase by a mapping mechanism that can be directly applied to the class diagram.
In OOHDM, attributes with multiple perspectives become different attributes of the 'node class'. In our OO model, we exploit the existence of multi-valued attributes, and so maintain the principle of 'one attribute for each concept', independently of how they are stored in the database. Also, making the links (that is, the way a user is going to navigate through the classes) attributes of the nodes is again, from our point of view, closer to the database storage structure than to a navigation requirement.

Recently, the use of UML for the modelling of web applications [5] has also been proposed. We believe that this extension is particularly useful once both a conceptual design and a specific implementation platform are already in place, because, rather than abstract implementation constructs, the UML web extension introduces concepts such as ActiveX controls, components or forms as classes of the model.
6 Conclusions
Conventional object-oriented methods have to provide a well-defined software development process by which the community of software engineers can properly design web-based applications from requirements in a systematic way. Our purpose has been to address these problems in the context of a conceptual modelling approach that has proven successful for software production from conceptual models. OO-HMethod can be seen as an extension of OO-Method to cover the whole software production process. We focus on how to properly capture the particulars associated with the design of web interfaces. In order to achieve this goal, OO-HMethod adds several navigation and interface constructs to the OO-Method conceptual model, which define the semantics suitable for capturing the specific functionality of web application interfaces. A new kind of diagram, the navigation access diagram, has been introduced. Each piece of information introduced in the NAD is stored in a system repository, from which a functional interface can be generated in an automated way. As opposed to other existing web methods, the approach presented in this paper does not intend to be 'yet another method' for web modelling but to extend a consolidated conceptual modelling approach. Summarizing, the most relevant contributions of this paper are the following:

1. The detailed presentation of the OO-HMethod approach as a successful attempt to cover the entire web-based software production process from an OO point of view, in order to get the best from conventional and formal methods.
2. The description of the NAD that has been added to the OO-Method conceptual model to specify the navigation user requirements.

OO-HMethod is still defining and cataloguing a set of both navigation and interface patterns general enough to guarantee code reusability. Navigation patterns also aim at providing the user with a higher level of usability, and the designer with a repository of well-known and useful navigation techniques.
The identification of interface patterns is a much more open process, as new technologies will certainly add more effective ways of displaying the information in a given environment. The usability of the different patterns, once implemented, will be tested and will become a critical factor for their final incorporation in OO-HMethod. Once they are completely categorized, a formal specification of user interfaces [16] will follow directly from these interface patterns.
Acknowledgments. We would like to thank the anonymous referees for their valuable comments on this work.
References

1. P. Atzeni, G. Mecca, and P. Merialdo. Design and Maintenance of Data-Intensive Web Sites. In Advances in Database Technology - EDBT'98, pages 436-449, 03 1998.
2. M. Bernstein. Patterns of Hypertext. In HYPERTEXT '98. Proceedings of the Ninth ACM Conference on Hypertext and Hypermedia: Links, Objects, Time and Space, pages 21-29, 1998.
3. M. Bieber and C. Kacmar. Designing Hypertext Support for Computational Applications. CACM: Communications of the ACM, 38(8):99-107, 1998.
4. S. Ceri, P. Fraternali, and S. Paraboschi. Design Principles for Data-Intensive Web Sites. SIGMOD Record, 28:84-89, 03 1999.
5. J. Conallen. Modeling Web Application Architectures with UML. CACM: Communications of the ACM, 42(10):63-70, 10 1999.
6. M. F. Fernández, D. Florescu, J. Kang, A. Levy, and D. Suciu. Catching the Boat with Strudel: Experiences with a Web-Site Management System. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 414-425, 10 1998.
7. P. Fraternali and P. Paolini. A Conceptual Model and a Tool Environment for Developing more Scalable, Dynamic, and Customizable Web Applications. In Advances in Database Technology - EDBT'98, pages 421-435, 1998.
8. F. Garzotto, L. Mainetti, and P. Paolini. Designing Modal Hypermedia Applications. In Proceedings of the Eighth ACM Conference on HYPERTEXT '97, 1997.
9. F. Garzotto and P. Paolini. HDM - A Model-Based Approach to Hypertext Application Design. ACM Transactions on Information Systems (TOIS), 11(1):1-26, 01 1993.
10. D. Harel. Dynamic Logic. In D. Gabbay and F. Guenthner, editors, Handbook of Philosophical Logic, Volume II: Extensions of Classical Logic, volume 165 of Synthese Library, chapter II.10, pages 497-604. D. Reidel Publishing Co., Dordrecht, 1984.
11. T. Isakowitz, E. A. Stohr, and V. Balasubramanian. RMM: A Methodology for Structured Hypermedia Design. CACM: Communications of the ACM, pages 34-44, 08 1995.
12. G. Mecca, P. Merialdo, P. Atzeni, and V. Crescenzi. The ARANEUS Guide to Web-Site Development. Technical report, Universidad de Roma, 03 1999.
13. O. Pastor, F. Hayes, and S. Bear. OASIS: An Object-Oriented Specification Language. In P. Loucopoulos, editor, Proceedings of CAiSE'92 International Conference, volume 593 of LNCS, pages 348-363. Springer-Verlag, 1992.
14. O. Pastor, E. Insfrán, V. Pelechano, J. Romero, and J. Merseguer. OO-METHOD: An OO Software Production Environment Combining Conventional and Formal Methods. In CAiSE '97. International Conference on Advanced Information Systems, pages 145-158, 1997.
15. O. Pastor, V. Pelechano, E. Insfrán, and J. Gómez. From Object Oriented Conceptual Modeling to Automated Programming in Java. In ER '98. International Conference on the Entity Relationship Approach, pages 183-196, 1998.
16. S. R. Robinson and S. A. Roberts. Formalizing the Informational Content of Database User Interfaces. In ER '98. International Conference on Conceptual Modeling, volume 1507, pages 65-77. Springer, 11 1998.
17. G. Rossi, D. Schwabe, and A. Garrido. Design Reuse in Hypermedia Applications Development. In Proceedings of the Eighth ACM Conference on HYPERTEXT '97, pages 57-66, 1997.
18. D. Schwabe and R. Almeida Pontes. A Method-based Web Application Development Environment. In Position Paper, Web Engineering Workshop, WWW8, 1999.
19. D. Schwabe, G. Rossi, and D. J. Barbosa. Systematic Hypermedia Application Design with OOHDM. In Proceedings of the Seventh ACM Conference on HYPERTEXT '96, page 166, 1996.
20. L. Tauscher and S. Greenberg. Revisitation Patterns in World Wide Web Navigation. In CHI '97. Proceedings of the CHI 97 Conference on Human Factors in Computing Systems, pages 399-406, 1997.
Efficient Distributed Workflow Management Based on Variable Server Assignments

Thomas Bauer and Peter Dadam
University of Ulm, Dept. of Databases and Information Systems
{bauer,dadam}@informatik.uni-ulm.de
http://www.informatik.uni-ulm.de/dbis
Abstract. For enterprise-wide and cross-enterprise workflow (WF) applications, the load of the WF servers and the amount of communication in the subnets may become a bottleneck. This paper shows how a distributed WF control can be realized in a way that the load of the components at run time is minimized. For that purpose, the control of a WF instance may migrate from one WF server to another. The WF servers are assigned to the WF activities in a way that minimizes the communication load. The server assignments are determined at build time by analyzing the WF model with respect to the actor assignments. As these actor assignments may depend on preceding activities, static server assignments are not always reasonable. Hence, so-called variable server assignment expressions are introduced, which allow dynamic server assignment without expensive run time analyses.
1 Introduction
Workflow Management Systems (WfMSs) offer a promising technology for the implementation of process-oriented information and application systems. A significant limitation of current WfMSs, however, is their insufficient scalability, which is caused by the use of one central WF server. Even the processing of a single WF activity may require the transfer of multiple messages between the WF server and its clients, e.g., to transmit input and output data of the activity, to update worklists, or to invoke activity programs. As soon as the number of users increases, e.g., when the WF-based application can be accessed via the Internet, such a central WF server may become overloaded. For this reason, several multi-server approaches have been proposed in the WF literature [1, 9, 16, 21].
1.1 Motivation
A closer look at enterprise-wide and cross-enterprise WF applications shows that not only a central WF server, but also the corresponding subnet may become a bottleneck. As an example, consider the (simplified) loan request WF shown in Fig. 1.
[Figure: the loan request WF consists of the activities scan documents (1, branch office), store loan request (2, central office), verify documents (3), estimate loan risk (4) and accept or reject (5) (branch offices), and store decision (6, central office), annotated with the sizes of their input and output parameter data (up to 2 MB).]
Fig. 1. Loan request WF (↑/↓: average size of the input / output parameter data of the activity)
Let us assume that the input and output data of the activities 1, 3, 4, and 5 contain scanned documents with a total average size of 2 MB¹. Let us further assume that the bank has 30 branch offices, which are connected with the central office by a wide area network (WAN). Each branch office has 10 clerks; i.e., there are 300 clerks in the branch offices altogether. If we assume that a clerk requires 5 min = 300 sec for the execution of one activity instance, then on average one activity is executed per second, each of them requiring 2 MB of parameter data. Hence, a central WF server has to transmit 2 MB/sec = 16 Mbit/sec of parameter data. Obviously, this would overload an Ethernet-based local area network (LAN).

In order to get further insights, a simulation (cf. Section 5) of the described scenario was performed. It leads to the following results: the load² of the subnet of the central WF server is 15.7 Mbit/sec. In this context, it is very attractive to control each WF by the WF server of the corresponding branch office. In this case, no subnet has a load higher than 0.5 Mbit/sec. Another very important aspect in distributed systems (especially in widely distributed systems) is the data volume transferred by the gateways. If a central WF server is used in the sketched scenario, the gateways have to transmit 15.6 Mbit/sec. This corresponds to 243.8 permanently used ISDN channels (each with a capacity of 64 kbit/sec). If distributed WF control is used instead, the gateways only have to transmit 80.6 kbit/sec; i.e., only 1.3 ISDN channels are required! This example shows, in a very impressive way, that an appropriate distributed WF control is not only important to avoid overloading the WF servers, but is also essential to avoid bottlenecks in the communication network.
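The back-of-the-envelope numbers above can be reproduced with a few lines of Python; this is only a check of the stated arithmetic, not part of the simulation mentioned in the text.

```python
# 300 clerks, 5 min (300 s) per activity instance -> about 1 activity per second
clerks, seconds_per_activity = 300, 300
activities_per_second = clerks / seconds_per_activity           # 1.0

# Each activity moves roughly 2 MB of parameter data through a central WF server
mbit_per_second = activities_per_second * 2 * 8                  # 16.0 Mbit/sec

# Gateway traffic expressed in permanently used 64 kbit/sec ISDN channels
isdn_channels = lambda mbit: mbit * 1000 / 64
print(isdn_channels(15.6))    # ~243.8 channels with a central WF server
print(isdn_channels(0.0806))  # ~1.3 channels with distributed WF control
```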
1.2 Distributed WF Execution in the ADEPT-WfMS
Before we explain the basic ideas of this paper, we briefly summarize our previous work on distributed WfMSs. Performance and scalability, especially in the context of enterprise-wide and cross-enterprise WF applications, belong to the central research issues of the ADEPT research project³ [19]. An important goal of the ADEPT approach is the reduction of the total communication load caused by the WfMS. In order to achieve this, a WF instance may be controlled "piecewise" by different WF servers (see Fig. 2). Consequently, the control of a particular WF instance migrates from one WF server to another if this helps to avoid communication across subnet boundaries when executing WF activities [3].
____________________________
¹ Even much larger data volumes are possible in such enterprise-wide application scenarios, e.g., if multimedia data have to be transferred between different activities.
² This load is lower than 16 Mbit/sec because it is not possible to use the whole capacity of the clerks, since this would cause their worklists to overflow. Therefore, a clerk of a branch office performs less than 1 activity/sec.
³ ADEPT stands for Application Development Based on Encapsulated Pre-Modeled Process Templates.
[Figure: a WF graph with activities a-f partitioned into WF partitions 1-3, each controlled by a different WF server (WF server 1-3); domain 3 comprises WF server 3, subnet 3 and its clients; the legend distinguishes normal control flow from control flow with migration.]
Fig. 2. Partitioning of a WF graph and distributed WF execution
The concrete server assignments of the activities are completely computed at build time. They are based on probability distributions (PDs) of activity executions within the individual subnets (which, in turn, are derived from the distribution of users and their roles; see [3] for details). That is, before a WF instance is created, the WF servers of the partitions are completely predetermined. Such static server assignments for activities show good results if the actor assignments (ActorAss) are of the kind "role = physician" or "role = nurse ∧ department = surgery". In these cases, the set of possible actors performing an activity (a precondition for the calculation of the PDs) can already be determined at build time.⁴
1.3 Problem Statement and Contribution of the Paper
Unfortunately, in practice there are many cases in which concrete actor assignments depend on preceding activities (called "dependent actor assignments" for short). For example, it may be required that the same physician who examines a patient (activity x) must also write the corresponding medical report (subsequent activity k' in Fig. 3). As another example, take a patient who must be cared for by a nurse from the same department (activity k) in which he or she was previously examined. In the context of such dependent actor assignments, the following important problem occurs: the organizational unit (called unit for short in the following) to which the actors performing the activities k and k' belong is not fixed until activity x is executed. What does this mean for the assignment of appropriate WF servers to the activities? If dependent actor assignments occur, it would be possible to calculate the best static server assignments by using conditional probabilities. However, since static server assignments cannot respect the dependencies, in too many cases the dependent activities would be controlled by the "wrong" WF server.

The main contribution of this paper is to develop a multi-server WfMS which is based on the concept of variable server assignments. Instead of using only static server assignments, like "activity x is controlled by server S5", logical server assignment expressions can be used as well. Such an expression may determine, for example, that "activity k is controlled by the server which is located in the subnet of the actor performing the preceding activity x". These expressions allow run time data of the WF instance to be taken into account when determining the concrete WF server controlling the activity instance. And, best of all, they create almost no additional overhead at run time compared with the static case.
____________________________
⁴ The calculation relies on the assumption that no massive changes take place between build time and the execution of instances of this WF type.
[Figure: activity x 'examine patient' with actor assignment role = physician; activity k 'care for patient' with role = nurse ∧ unit = Unit(Actor(x)); activity k' 'write medical report' with actor = Actor(x).]
Fig. 3. Examples for dependent actor assignments
In this paper, we present the cost model and the distribution algorithm used to calculate appropriate variable server assignment expressions at build time. In addition, we present some simulation results which show that the communication load can be reduced significantly by using variable server assignments. Although it is commonly accepted that distributed WF execution is essential, ADEPTdistribution is, to our knowledge, the only approach that treats the problem of dependent actor assignments in a distributed WfMS.

In the next section, the conditions for the applicability of our approach are summarized. Section 3 describes possible solutions of the problem. The nucleus of this paper is presented in Section 4: it describes the algorithms for determining the server assignment expressions at build time. The efficiency of our approach is shown in Section 5 by means of simulations. In Section 6, related approaches are discussed. The paper concludes with a summary and an outlook in Section 7.
2 Assumptions and Preconditions
The approach introduced in this paper describes how load distribution is realized by the use of variable server assignments in ADEPTdistribution, the distributed variant of the ADEPT-WfMS. It is based only on the following realistic assumptions:

1. The WfMS uses an organizational model in which actors can be associated with organizational units as well as with roles.
2. The actors are assigned to an activity at run time by means of a selection predicate like "role = physician ∧ unit = radiology". This predicate may also reference preceding activities (dependent actor assignments, cf. Fig. 3). Otherwise, it is assumed that the actor of an activity is determined independently of other activities.
3. The WfMS uses several subnets as well as several WF servers. To simplify the following discussion, it is assumed that each subnet is equipped with one (and only one) WF server. Each of these WF servers may control each WF type and its partitions, respectively. A user may be permanently assigned to one or several subnets.
4. Each WF server can serve all WF clients registered in the WfMS, not only those of its domain (a subnet together with the corresponding WF server and clients is called a domain).
5. The actors who may potentially execute a certain activity are not necessarily located in the same subnet.
6. The number of persons within a subnet who qualify to perform a certain activity does not change significantly between build time and run time.
7. The topology of the communication network also does not change significantly.
3 Possible Solution Approaches
As already mentioned, our aim is to identify server assignments which minimize the total communication costs. Special attention must be paid to dependent actor assignments since they occur very often in practice. In principle, there are several possible approaches for the assignment of WF servers to activities.

The simplest and most frequently applied method is to assign a fixed server (statically) to each activity at build time or at the time the WF instance is started. This solution is not satisfactory for WFs for which dependent actor assignments have to be supported as well (see Section 1.3). Obviously, the best server assignments can be achieved if the WF server for the activity to be executed is determined after the preceding activity has finished. At this point in time, complete and current run time data of the WF instance is available, so the most suitable WF server can be determined. Unfortunately, this solution is in general not applicable in practice due to its high costs. The execution of the necessary analyses would heavily burden the WF servers and thus negatively affect the performance of the WfMS. This is not acceptable for production WfMSs that have to cope with a high load.

Therefore, ADEPTdistribution follows a strategy which combines a static pre-calculation (at build time) with dynamic aspects (i.e., the evaluation of run time data), thereby combining the advantages of both approaches. Instead of a static WF server assignment, logical expressions for server assignments are created at build time, if this is reasonable. They reference run time data of the WF instance, thus making it possible to determine the appropriate WF server simply and efficiently at run time. The main challenge of this approach is how to obtain suitable variable server assignments for the activities. One possibility would be to let the WF designer specify them explicitly at build time (analogous to the actor assignments). Unfortunately, the WF designer, without further support or information, will hardly be in a position to estimate the load in the subnets for a particular distribution. Therefore, it is essential to support him actively by means of a sophisticated WfMS modeling tool, which estimates the load of each system component (server, subnet, gateway) for the server assignments under consideration. This, in turn, requires an appropriate cost model. This cost model and the calculation of appropriate server assignments are described in the next section.
4 Determination of Optimal Server Assignments
In this section we describe how appropriate server assignments for WF activities can already be calculated at build time.
4.1 Cost Model

A cost model for determining the load of the system components which are critical under performance aspects (servers, subnets, gateways) has to consider at least the following costs:
• Costs for the transfer of parameter data (between a WF server and a WF client) when starting and finishing activity programs
• Costs for refreshing worklists at the client site
• Costs for data transfer between activity programs and external data sources
• Costs for migrating the control of a WF instance to another WF server (migration costs for short)

In addition to the information available from the WF template and from the organizational model, the estimated execution frequency and the average amount of data to be communicated (e.g., the size of the parameter data) have to be determined for each activity of the WF template. For already released WF templates, these data may be obtained by analyzing the audit trails of executed WF instances. Otherwise, they have to be estimated.

In the sequel we develop formulas which describe the data volume to be transported for a single system action (e.g., "start activity" or "migrate WF") and an individual system component. Subsequently, these simple formulas are used to construct a comprehensive formula which describes the total load of the components. Thereby, ExProbk(i,j) denotes the probability that activity k is controlled by the WF server in subnet i and is processed by a user in subnet j (Table 1 summarizes the expressions used in this chapter). MigrProbk,l(i,j) denotes the probability that, when moving from activity k to activity l, the control migrates from WF server i to server j. These probabilities depend on the selected server assignments. We will show in Sections 4.4 and 4.5 how they can be determined; for the moment, we consider them as given.

The expected value of the data volume (abbreviated as data volume in the following) for the execution of an activity (transport of input and output parameter data between WF server and client) is calculated as follows. The data volume emerging at server i for the execution of activity k results from the probability that server i controls activity k, multiplied by the average amount of data to be transported:

Vol^{Act}_{Server,i}(k) = \sum_{j} ExProb_k(i,j) \cdot (in\_parameter\_size_k + out\_parameter\_size_k)    (1)
Table 1. Abbreviations used in this paper

Name                     Meaning
ActorAssk                actor assignment of activity k
ActorProbk(D|S)          portion of the actors in the domain (subnet) D, given that activity k is controlled by server S
DepMigrProbx,k(j|i)      probability of a migration to the WF server in subnet j (when moving from activity x to activity k), given that activity x was controlled by the server in subnet i
DepServProbk(S|u)        probability that activity k is controlled by server S, given that actor u performs this activity
ExProbk(i,j)             probability that activity k is controlled by the WF server of subnet i and processed by an actor of subnet j
MigrProbx,k(i,j)         probability of a migration from the WF server in subnet i to the WF server in subnet j (when moving from activity x to activity k)
ServAssk                 server assignment of activity k
ServProbk(S)             probability that activity k is controlled by the WF server S (in subnet S)
The load of the subnets is estimated as follows: in subnet i, communication with respect to the execution of activity k takes place either when the server is located in i or when a user of subnet i executes the activity. If both hold, the communication must be counted only once (therefore j ≠ i):

Vol^{Act}_{Subnet,i}(k) = \Big( \sum_{j} ExProb_k(i,j) + \sum_{j \neq i} ExProb_k(j,i) \Big) \cdot (in\_parameter\_size_k + out\_parameter\_size_k)    (2)
The expected data volume at the gateway from subnet i to j (i ≠ j) is calculated as follows:

Vol^{Act}_{GW,i,j}(k) = ExProb_k(i,j) \cdot in\_parameter\_size_k + ExProb_k(j,i) \cdot out\_parameter\_size_k    (3)

The expected data volumes for refreshing the worklists (Vol^{WL}(k)) as well as for the communication of activity programs with external data sources (Vol^{Ext}(k)) can be calculated in a similar way (a detailed description can be found in [7]). The latter cost factor can be used to calculate a suitable allocation of externally stored application data (which is only referenced by the WF instances), in case this distribution is not predetermined already. The migration costs at the transition from activity k to activity l are, of course, of special interest in the context of this paper. Incoming as well as outgoing migrations have to be considered. To calculate the expected data volume for server i, the average data volume communicated for this migration is multiplied by the probability that server i is involved in the migration. Here, it has to be noted that no communication takes place if the servers of the activities k and l are identical (therefore j ≠ i):

Vol^{Migr}_{Server,i}(k,l) = \sum_{j \neq i} \big( MigrProb_{k,l}(i,j) + MigrProb_{k,l}(j,i) \big) \cdot migration\_size_{k,l}    (4)
The data volume caused by the migration in the subnets affected is the same as for the WF servers:

Vol^{Migr}_{Subnet,i}(k,l) = Vol^{Migr}_{Server,i}(k,l)    (5)

The gateways have to transport the following amounts of data:

Vol^{Migr}_{GW,i,j}(k,l) = MigrProb_{k,l}(i,j) \cdot migration\_size_{k,l}    (6)
For each system component, these amounts of data have to be multiplied by the execution frequencies ExFreq(k) of the activities k or of the migrations MigrFreq(k,l), respectively. Then they have to be added up for all activities of all WF types (WFTypes) in order to determine the corresponding load of this component. For the servers, the load is calculated as follows (Load_{Subnet,i} and Load_{GW,i,j} are determined analogously):

Load_{Server,i} = \sum_{wf \in WFTypes} \sum_{k \in wf} \Big[ ExFreq(k) \cdot Vol^{Act}_{Server,i}(k) + ExFreq(k) \cdot Vol^{WL}_{Server,i}(k) + ExFreq(k) \cdot Vol^{Ext}_{Server,i}(k) + \sum_{l \in wf \wedge l \neq k} MigrFreq(k,l) \cdot Vol^{Migr}_{Server,i}(k,l) \Big]    (7)
These loads are weighted with the component-specific cost factors C_{Server,i}, C_{Subnet,i}, and C_{GW,i,j} and are added up in order to get the total costs. These cost factors specify the costs for transferring one byte over the server, the subnet, and the gateway. Thus, the WF designer can influence the load of each component. A high value of C_{GW,i,j} has the effect that, for example, the WAN connection (gateway) is used little. Hence, the target function to be minimized is as follows:

T = \sum_{i} C_{Server,i} \cdot Load_{Server,i} + \sum_{i} C_{Subnet,i} \cdot Load_{Subnet,i} + \sum_{i} \sum_{j \neq i} C_{GW,i,j} \cdot Load_{GW,i,j}    (8)
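To make the cost model concrete, the sketch below evaluates equations (1), (4), (7) and (8) in Python for given probability distributions. The data layout (dictionaries keyed by activities and subnet pairs) and all names are hypothetical simplifications for illustration; only the activity and migration terms of equation (7) are included, and the input dictionaries are assumed to be complete.

```python
# Hypothetical domains; the paper assumes one WF server per subnet (Section 2).
subnets = ["branch_1", "branch_2", "central"]

def vol_act_server(i, k, ex_prob, in_size, out_size):
    """Equation (1): expected parameter-data volume at the server of subnet i."""
    return sum(ex_prob[k][(i, j)] for j in subnets) * (in_size[k] + out_size[k])

def vol_migr_server(i, k, l, migr_prob, migr_size):
    """Equation (4): expected migration volume at server i for the step k -> l."""
    return sum(migr_prob[(k, l)][(i, j)] + migr_prob[(k, l)][(j, i)]
               for j in subnets if j != i) * migr_size[(k, l)]

def load_server(i, wf_types, ex_freq, migr_freq,
                ex_prob, migr_prob, in_size, out_size, migr_size):
    """Equation (7), restricted to the activity and migration terms for brevity;
    the worklist and external-data terms would be added analogously."""
    load = 0.0
    for wf in wf_types:
        for k in wf:
            load += ex_freq[k] * vol_act_server(i, k, ex_prob, in_size, out_size)
            for l in wf:
                freq = migr_freq.get((k, l), 0.0)
                if l != k and freq > 0.0:
                    load += freq * vol_migr_server(i, k, l, migr_prob, migr_size)
    return load

def total_cost(load_srv, load_sub, load_gw, c_srv, c_sub, c_gw):
    """Equation (8): the target function T to be minimized."""
    return (sum(c_srv[i] * load_srv[i] for i in subnets)
            + sum(c_sub[i] * load_sub[i] for i in subnets)
            + sum(c_gw[(i, j)] * load_gw[(i, j)]
                  for i in subnets for j in subnets if j != i))
```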
The following section describes which server assignment expressions are possible for the activities. Section 4.3 shows how the server assignments can be selected in a way that minimizes T.
4.2 Server Assignment Expressions
For static server assignments, the identifiers of the WF servers are directly used as expressions. For variable server assignments, however, expressions may reference run time data of the WF instance. In most of the practically relevant scenarios, the location of the WF server or the actor of a preceding activity is referenced. In special cases, the usage of additional WF control data (e.g., the start time of an activity), the inclusion of WfMS-external data (e.g., parameter data of activities), or the evaluation of mathematical functions or logical expressions may be reasonable as well. While server assignments of the first kind (types 1-4 below) can be deduced automatically from the WF model, the assignment expressions for the special cases (type 5) must be specified explicitly by the WF designer. The server assignment expressions supported by ADEPTdistribution are as follows (a small sketch of how such expressions could be represented follows the list):

1. ServAssk = Si: Server Si is statically assigned to activity k.
2. ServAssk = Server(x): Activity k shall be controlled by the same server as activity x.
3. ServAssk = Domain(Actor(x)): Activity k shall be assigned to the server that is located in the domain of the user who has executed activity x.
4. ServAssk = f(Server(x)) or ServAssk = f(Domain(Actor(x))): A function f can be applied to all server assignments of type 2 and 3. Assume, e.g., that activity k is assigned to the manager of the actor who works on activity x. This manager may belong to a different domain than the actor of activity x (e.g., if he is assigned to a different department). In such cases, a simple mapping function can be used to perform the desired transformation. In [7] we describe how a suitable function f can be deduced automatically.
5. ServAssk = any given expression: The WF designer may specify his own server assignment expressions, which do not correspond to any of the expressions of type 1-4. Since they cannot be analyzed by the WfMS, the PDs cannot be calculated automatically. Therefore, the designer has to provide this information as well. The PDs are required by the algorithms of Section 4.4 in order to perform the calculations.
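As announced above, the following sketch represents assignment expression types 1-4 as small Python classes together with a resolver evaluated at run time. The classes (StaticServer, SameServerAs, DomainOfActor, Mapped) and the instance-state dictionaries are hypothetical illustrations of the idea, not the ADEPTdistribution implementation; since each subnet has exactly one WF server, a domain identifier is used directly as a server identifier here.

```python
from dataclasses import dataclass

@dataclass
class StaticServer:            # type 1: ServAss_k = S_i
    server: str

@dataclass
class SameServerAs:            # type 2: ServAss_k = Server(x)
    activity: str

@dataclass
class DomainOfActor:           # type 3: ServAss_k = Domain(Actor(x))
    activity: str

@dataclass
class Mapped:                  # type 4: ServAss_k = f(...)
    inner: object
    f: callable

def resolve(expr, instance):
    """Determine the controlling server from run time data of the WF instance.

    'instance' records, per executed activity, which server controlled it and
    in which domain its actor was located (hypothetical structure)."""
    if isinstance(expr, StaticServer):
        return expr.server
    if isinstance(expr, SameServerAs):
        return instance["server_of"][expr.activity]
    if isinstance(expr, DomainOfActor):
        return instance["actor_domain_of"][expr.activity]
    if isinstance(expr, Mapped):
        return expr.f(resolve(expr.inner, instance))
    raise ValueError("designer-defined expression (type 5) needs its own evaluator")

# Example: activity k is controlled by the server in the domain of the actor of x.
instance = {"server_of": {"x": "S2"}, "actor_domain_of": {"x": "S3"}}
print(resolve(DomainOfActor("x"), instance))   # -> "S3"
```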
4.3 Determination of Optimal Server Assignments
In order to determine an optimal distribution of the activities of a WF template, in principle, the costs of all possible server assignments for all activities have to be computed, and the server assignments with the minimal costs can then be selected. Since a WF template may easily comprise more than 100 activities and each server assignment may reference any predecessor, this approach is in general not feasible due to its complexity of O(#Act^{#Act}), where #Act denotes the number of activities of the WF template. Therefore, we suggest an algorithm (see Algorithm 1) which performs this task with polynomial run time complexity. It is based on a greedy approach and calculates the optimal result, or a result that is close to this optimum, for the practically relevant cases. It consists of two phases, which work as follows.

In Phase 1 a legal initial solution is determined. This is achieved by computing the optimal server assignment for each activity under the assumption (for the moment) that migration does not cause any costs. For each activity k, all possible server assignments PotServAssk (using all possible expressions of type 1-4, cf. Section 4.2) are tested. The cost calculation is performed by the function calculate_costs(), which uses the cost model described in Section 4.1. As only single activities are analyzed in Phase 1, the result vector ServAss may contain unprofitable migrations as well.

In Phase 2, Algorithm 1 examines which of these migrations are really reasonable. In order to eliminate undesirable migrations, the partitions P are inspected (all activities of a partition are controlled by the same WF server). It is analyzed whether it is advantageous to combine such a partition P with a direct predecessor partition or a direct successor partition, in order to eliminate the migration between them. The motivation for this analysis is that it may not be worth migrating the (complete) WF instance from one server to another and back for only a few activities. First, the algorithm considers small partitions P. By combining them with adjacent partitions they form, step by step, larger groups of activities among which no migrations take place. This integration process is continued until there are no more unprofitable migrations.
Algorithm 1: Calculation of the Server Assignments for a WF Type (Process Template)

Phase 1:
  for each activity k ∈ ProcessTemplate (in partial order corresponding to the control flow) do
    MinCost = ∞;
    for each ServAssk ∈ PotServAssk do
      Cost = calculate_costs(ServAss, k);              // costs only of the activity k
      if Cost < MinCost then OptServAss = ServAssk; MinCost = Cost;
    ServAssk = OptServAss;

Phase 2:
  MinCost = calculate_costs(ServAss, all);             // costs of the whole WF (incl. migrations)
  for PartSize = 1 to #activities(ProcessTemplate) do
    for each P: |P| = PartSize, P is a maximal subgraph with ∀ l1, l2 ∈ P: Server(l1) = Server(l2) do
      OptServAss = NULL;
      for each a ∉ P: ∃ l ∈ P: a = predecessor(l) ∨ a = successor(l) do
        for each l ∉ P do TestServAssl = ServAssl;
        for each l ∈ P do TestServAssl = ServAssa;
        TestCost = calculate_costs(TestServAss, all);
        if TestCost < MinCost then OptServAss = ServAssa; MinCost = TestCost;
      if OptServAss ≠ NULL then
        for each l ∈ P do ServAssl = OptServAss;
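For readers who prefer an executable form, the condensed Python sketch below mirrors the two phases of Algorithm 1. It assumes hypothetical helper functions (potential_assignments, activity_cost, total_cost, partitions, neighbouring_assignments) standing in for the cost model of Section 4.1 and the WF graph structure, and it collapses the partition-size sweep of Phase 2 into a simple repeat-until-no-improvement loop; it is a sketch of the idea, not the ADEPTdistribution implementation.

```python
def assign_servers(activities_in_control_flow_order):
    # Phase 1: per activity, pick the locally cheapest assignment,
    # ignoring migration costs for the moment.
    serv_ass = {}
    for k in activities_in_control_flow_order:
        serv_ass[k] = min(potential_assignments(k),
                          key=lambda a: activity_cost(k, a, serv_ass))

    # Phase 2: adopt an adjacent partition's assignment for a whole same-server
    # partition whenever this lowers the total cost, i.e. whenever the migration
    # between the two partitions is not worth its price.
    best = total_cost(serv_ass)
    improved = True
    while improved:
        improved = False
        for part in partitions(serv_ass):                 # maximal same-server subgraphs
            for candidate in neighbouring_assignments(part, serv_ass):
                trial = dict(serv_ass)
                for k in part:
                    trial[k] = candidate
                cost = total_cost(trial)
                if cost < best:
                    serv_ass, best, improved = trial, cost, True
    return serv_ass
```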
4.4 Calculation of the Probability Distributions
After having explained which server assignments are suitable for an activity and how the corresponding costs can be calculated, the question remains how the PDs ExProbk(i,j) and MigrProbx,k(i,j) can be determined. As server assignments depend on run time data of the WF instance, different instances of a particular WF activity may be controlled by different WF servers. The probability that activity k is controlled by server S is denoted by ServProbk(S). As different instances of k can be located in different units (and therefore may be controlled by different servers), the actor PD of k may be different for each server S. The portion of the actors of activity k that are located in domain D is called ActorProbk(D|S), for the case that server S controls activity k. In the sequel we describe how ServProbk(S) and ActorProbk(D|S) can be determined. Once they are known, the probability that activity k is controlled by server S and is executed by a user in domain D is given by: ExProbk(S,D) = ServProbk(S) ⋅ ActorProbk(D|S).

4.4.1 Calculation of the Server Probability Distribution ServProbk(S)

ServProbk(S) denotes the probability that server S controls activity k. It results from the server assignment ServAssk and the PDs of the activity x that is referenced in ServAssk. In Algorithm 2, the activities of the WF template are analyzed in the (partial) order defined by the control flow. Since activity x must be a predecessor of k, these PDs are therefore already known when analyzing activity k. In the following we describe how the server PD ServProbk(S) is calculated for the server assignments defined in Section 4.2. Case 5 is not considered here (and further on), as for this server assignment the PD has to be provided by the WF designer.
Algorithm 2: Calculation of the Server Probability Distribution ServProbk(S)
(1) case ServAssk = Si: ServProbk(S) = δS,Si⁵
(2) case ServAssk = Server(x): ServProbk(S) = ServProbx(S)
(3) case ServAssk = Domain(Actor(x)): ServProbk(S) = ActorProbx(S), with ActorProbx(D) = ΣS ActorProbx(D|S) ⋅ ServProbx(S)
(4a) case ServAssk = f(Server(x)): ServProbk(S) = f(ServProbx(S))
(4b) case ServAssk = f(Domain(Actor(x))): ServProbk(S) = f(ActorProbx(S)), with ActorProbx(D) = ΣS ActorProbx(D|S) ⋅ ServProbx(S)

Explanations:
(1) Since activity k is always controlled by the server Si, it follows ServProbk(S) = 1 for S = Si and ServProbk(S) = 0 otherwise.
(2) The same server is used for activity k as for activity x. Therefore the server PDs are identical.⁶
⁵ Kronecker symbol: δi,j = 1, if i = j and δi,j = 0, if i ≠ j.
⁶ Assuming that there are no branches between the activities x and k, which depend on the choice of the server of activity x. Such facts are difficult to detect because they concern data elements. Therefore, for branch and loop conditions, we have generally assumed that their result is independent of the current server.
(3) The server of activity k is located in the domain of the actor of activity x. Therefore the server PD of activity k results from the actor PD of activity x. It does not matter which server has controlled activity x. Hence a server-independent actor PD ActorProbx(D) is created by the weighted sum of the actor PDs for the different servers.
(4a) ServProbk(S) is calculated as described for case 2. Afterwards the function f is applied to the result (cf. [7] for details).
(4b) The same as case 3, but in addition f is applied to the result.
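The case analysis of Algorithm 2 can be written down as the following Python sketch. It is illustrative only and assumes a simplified model in which every domain contains exactly one WF server (so servers and domains share identifiers); the PDs are plain dictionaries, and apply_server_function is a hypothetical helper that pushes a distribution through the mapping f of cases 4a/4b.

def apply_server_function(f, dist):
    # Move the probability mass of each server S to f(S).
    out = {}
    for s, p in dist.items():
        out[f(s)] = out.get(f(s), 0.0) + p
    return out

def serv_prob(serv_ass_k, servers, serv_prob_x=None, actor_prob_x=None, f=None):
    # serv_ass_k: ('static', Si) | ('server_of_x',) | ('domain_of_actor_x',)
    #             | ('f_server_of_x',) | ('f_domain_of_actor_x',)
    # serv_prob_x:  dict S -> ServProbx(S) of the referenced predecessor x
    # actor_prob_x: dict S -> {D -> ActorProbx(D|S)}
    # (Case 5, designer-supplied PDs, is omitted as in the text.)
    kind = serv_ass_k[0]
    if kind == 'static':                               # case 1: Kronecker delta
        return {s: 1.0 if s == serv_ass_k[1] else 0.0 for s in servers}
    if kind == 'server_of_x':                          # case 2: same PD as for x
        return dict(serv_prob_x)
    # Cases 3, 4a, 4b need the server-independent actor PD of x; since one server
    # per domain is assumed, the domain PD is read directly as a server PD.
    actor_prob = {d: sum(actor_prob_x[s].get(d, 0.0) * serv_prob_x[s] for s in servers)
                  for d in servers}
    if kind == 'domain_of_actor_x':                    # case 3
        return actor_prob
    if kind == 'f_server_of_x':                        # case 4a: f applied to case 2
        return apply_server_function(f, dict(serv_prob_x))
    if kind == 'f_domain_of_actor_x':                  # case 4b: f applied to case 3
        return apply_server_function(f, actor_prob)

Combining the result with the actor PD then yields ExProbk(S, D) = ServProbk(S) ⋅ ActorProbk(D|S) as defined in Section 4.4.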
4.4.2 Calculation of the Actor Probability Distribution ActorProbk(D|S)
ActorProbk(D|S) describes the probability that the actor of activity k is located in domain D, if the activity is controlled by server S. As an example, take a hospital with 3 wards, each of which owns its own server. Assuming that only nurses of ward 1 (domain 1) are in charge of the patients of ward 1, it is reasonable that server 1 controls this activity. Hence, the actor PD ActorProbk(D|1) = (1,0,0) results. Analogously, for server 2 we obtain ActorProbk(D|2) = (0,1,0) and for server 3 ActorProbk(D|3) = (0,0,1).

In the following, we present Algorithm 3, which calculates the actor PD ActorProbk(D|S). It considers all possible actors of activity k and determines for each of them the corresponding domain D (from the organizational model). Furthermore, it determines the server S that controls activity k if it is performed by user u. The user is reflected in the actor PD ActorProbk(D|S) for the corresponding server S and domain D. Thus, the calculation of ActorProbk(D|S) is performed according to the definition of the actor PD (see Section 4.4). Different servers may qualify − each with a certain probability − for the control of activity k if it is performed by user u. These probabilities are calculated by the function DepServ(k, "Actor = u") (see below) and are stored in the vector DepServProbk(S|u). DepServProbk(S|u) determines the weight with which the user u is reflected in ActorProbk(D|S).

Algorithm 3: Calculation of the Actor Probability Distribution ActorProbk(D|S)
  Actors = {u | user u qualifies as actor of activity k};
  for each u ∈ Actors do
    D = Domain(u);
    DepServProbk(S|u) = DepServ(k, "Actor = u");
    for each S do ActorProbk(D|S) = ActorProbk(D|S) + DepServProbk(S|u);
  normalize each line of ActorProbk(D|S) such that ∀S: ΣD ActorProbk(D|S) = 1;
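A direct transcription of Algorithm 3 into Python might look as follows; the organizational model (domain_of), the callback dep_serv (standing in for DepServ(k, "Actor = u")) and the optional per-user weights (anticipating the Weight(u) extension discussed below) are assumptions of this sketch.

def actor_prob(k, actors, domain_of, dep_serv, servers, weights=None):
    # ActorProbk(D|S): probability that the actor of k is in domain D,
    # given that server S controls activity k.
    weights = weights or {}
    prob = {s: {} for s in servers}                    # prob[S][D] accumulates mass
    for u in actors:
        d = domain_of[u]
        dep = dep_serv(k, u)                           # DepServProbk(S|u)
        w = weights.get(u, 1.0)                        # Weight(u), default: full-time user
        for s in servers:
            prob[s][d] = prob[s].get(d, 0.0) + w * dep.get(s, 0.0)
    # Normalize each "line": for every server S the domain probabilities sum to 1.
    for s in servers:
        total = sum(prob[s].values())
        if total > 0:
            prob[s] = {d: p / total for d, p in prob[s].items()}
    return prob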
The calculation of the server PD DepServProbk(S|u) for a certain user u is performed similarly to the calculation of the user-independent server PD ServProbk(S) (cf. Section 4.4.1). However, DepServProbk(S|u) depends not only on the server assignment of activity k, but also on its actor assignment. In the following, due to lack of space, only some selected examples are presented. A comprehensive discussion of all possible cases of server and actor assignments can be found in [7].
• If the server assignment is static (ServAssk = Si), the calculation is trivial as the server is always the same: DepServProbk(S|u) = δS,Si.
• If the actor assignment is independent of other activities (e.g., "role = physician"), the server PD is independent of user u. Thus, the user-independent server PD can be adopted: DepServProbk(S|u) = ServProbk(S).
• In Fig. 4a, the same physician who works on activity x also performs activity k. Assume that DepServProbk(S|u) shall be calculated for user u = "Dr. Smith" from domain 3. Because of ActorAssk = Actor(x), Dr. Smith must have performed activity x as well. Since the server of activity k is located in the domain of the actor of activity x, it is located in the domain of Dr. Smith. As this is domain 3, the result is DepServProbk(S|u) = (0,0,1).
• In Fig. 4b, activity k is performed by another actor than activity x, but the actors belong to the same unit. Assume that DepServProbk(S|u) shall be calculated for the nurse Jane from the unit ward 2. In this case, all users with the role physician who belong to the unit ward 2 have to be considered. These are exactly those users who could have executed activity x if activity k is performed by Jane. For each of these physicians the domain D is calculated, because it determines − if this physician has performed activity x − the location of the server of activity k. The server located in this domain D is reflected in DepServProbk(S|u). Finally, DepServProbk(S|u) is normalized such that ΣS DepServProbk(S|u) = 1 holds.
[Fig. 4: a) activity x (actor assignment "role = physician") is followed by activity k with actor assignment "actor = Actor(x)" and server assignment Domain(Actor(x)); b) activity x (actor assignment "role = physician") is followed by activity k with actor assignment "role = nurse ∧ unit = Unit(Actor(x))" and server assignment Domain(Actor(x)).]
Fig. 4. Examples for the calculation of the server probability distribution DepServProbk(S|u)
In addition to these aspects, it has to be noted that there are users who work part-time, who work only part of the working day with the WfMS, or who may work in several domains. These users must not be treated in the same way as users working "full-time" in only one domain, because they produce less load in a single domain. In ADEPTdistribution, this circumstance can be modeled by assigning a weight Weight(u) to each user u. This weight is used in Algorithm 3 to add DepServProbk(S|u) proportionally. Further possible weights are discussed in [7].
4.5 Migration Costs
In Phase 1 of Algorithm 1, the migration costs are ignored. In Phase 2, these costs are included. In order to calculate the migration costs, the migration probability MigrProbx,k(S1,S2) – with which the WF is migrated from server S1 to server S2 – has to be determined. The probabilities for migrations from activity x to activity k can be described by a migration matrix. An entry DepMigrProbx,k(S2|S1) of this matrix describes the conditional probability that the WF migrates to server S2, if the preceding activity x was controlled by server S1 (DepMigrProbx,k(S2|S1) is defined as P(server S2 controls activity k | server S1 controls activity x)). Hence the migration probability results as: MigrProbx,k(S1, S2) = ServProbx(S1) ⋅ DepMigrProbx,k(S2|S1). In the following we describe how the matrix DepMigrProb can be determined. By analyzing the server assignments, it can be deduced which sets of activities of a WF instance are always controlled by the same server. At the transition between two
activities x and k belonging to the same set, the WF never migrates. This results in: ∀S1, S2: DepMigrProbx,k(S2|S1) = δS1,S2. If the activities x and k are controlled by different WF servers, the use of the server PD offers a simple method for estimating DepMigrProbx,k(S2|S1). The server PD ServProbk(S2) describes the probability that the WF instance is located at the server S2 after the migration. So the resulting migration probabilities may be approximated as: ∀S1, S2: DepMigrProbx,k(S2|S1) = ServProbk(S2). More sophisticated algorithms for calculating DepMigrProbx,k(S2|S1), which consider dependencies between the actors and servers of the activities x and k, are presented in [7].
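These two estimates can be combined into an expected migration cost, as in the following illustrative Python sketch; the callback migration_cost is an assumed cost model (e.g., proportional to the transferred WF-instance data) and not one of the paper's formulas.

def dep_migr_prob(same_partition, serv_prob_k, servers):
    # DepMigrProb_{x,k}(S2|S1): Kronecker delta inside a partition,
    # otherwise approximated by the server PD of the target activity k.
    matrix = {}
    for s1 in servers:
        for s2 in servers:
            if same_partition:
                matrix[(s1, s2)] = 1.0 if s1 == s2 else 0.0
            else:
                matrix[(s1, s2)] = serv_prob_k[s2]
    return matrix

def migr_prob(serv_prob_x, dep_matrix, servers):
    # MigrProb_{x,k}(S1,S2) = ServProbx(S1) * DepMigrProb_{x,k}(S2|S1)
    return {(s1, s2): serv_prob_x[s1] * dep_matrix[(s1, s2)]
            for s1 in servers for s2 in servers}

def expected_migration_cost(mig, migration_cost, servers):
    # Expected cost contribution of the migration from x to k.
    return sum(mig[(s1, s2)] * migration_cost(s1, s2)
               for s1 in servers for s2 in servers if s1 != s2)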
5
Efficiency of the ADEPTdistribution Approach
In this section we demonstrate that the overall network load can be significantly reduced by the use of variable server assignments. For this purpose, we simulated the execution of a clinical WF when it is controlled by a central WfMS, by a distributed WfMS that does not use migrations, by a WfMS with static server assignments, and by a WfMS with variable server assignments. For further simulations and details concerning the simulated application scenarios, the simulation environment, and the interpretation of the results we refer to [5, 6]. Since the effects of variable server assignments shall be examined, the simulated WF contains dependent actor assignments. The simulation component realistically simulates the execution of many instances of this WF. At the same time it records all occurring communication. This information is used to compute the total load of all WF servers, the average load per WF server, the load per subnet, and the load of the gateways. For each case considered, the load of the central WF server is used as reference basis; it is defined as 100. Fig. 5 shows the result of the simulation. To avoid confusion: We have been interested in showing the (positive and negative) aspects of distribution and server migration with respect to server load and network load. To provide a reasonable reference basis for comparing the loads, we selected a scenario where a central WF server was not overloaded. It could, therefore,
[Fig. 5 compares, as a bar chart, the total load of all WF servers, the average load per WF server, the average load per subnet, and the total load of the gateways for a WfMS with a central server, a distributed WfMS without migration, one with static server assignments, and one with variable server assignments (central server = 100).]
Fig. 5. Result of the simulation of a clinical WF
serve as reference point (100). This value (server load) cannot be improved in the distributed case because the synchronization overhead and the migration costs increase it. The simulation shows that variable server assignments significantly reduce the load per subnet and the data volume that has to be transferred by the gateways: the average load per subnet is 57.8% of the central case, and the data volume transferred by the gateways amounts to only 4.4%.
6
Discussion
This section summarizes important concepts for scalable WfMSs and outlines some concrete approaches. For a more detailed discussion we refer to [4, 5, 6]. One extreme for the distribution model of a WfMS is a central server, which controls all activities. As it has to manage the whole system load, it obviously represents a potential bottleneck. Some research prototypes which do not primarily deal with scalability issues (e.g., Panta Rhei [13], WASA [22], [10]) and most of the commercial systems belong to this category. The other extreme is a completely distributed system, which does not use any WF servers at all (e.g., Exotica/FMQM [2], INCAS [8]). In such a system, a WF migrates to the machine of the user who wants to perform an activity. With this approach, no server assignments are necessary either.

Many approaches use multiple WF servers. As these systems require a strategy for the assignment of the servers to the activities, they are classified according to this criterion. In [1] identical replicates (clusters) of the WF engine are used; whole WF instances are controlled by a randomly selected cluster, so no server assignments are necessary. In [20] the systems CodAlf and BPAFrame are described, which allocate an activity's WF server on the machine of the corresponding application (e.g., a DBMS). A similar approach has been suggested by METEOR2 [11] and by METUFlow [12]. Like ADEPTdistribution, several approaches try to allocate the servers close to the actors of the activities. MENTOR [16] partitions state/activity charts in a way that allows the equivalence of the distributed executions to the central case to be formally verified. All users of one activity have to belong to the same "processing entity"; the server of this processing entity is selected for that activity. The same distribution strategy is followed by WIDE [9], using CORBA remote object access instead of migrations. In MOBILE [15] the different aspects of a WF are treated by different servers, but there is no migration. This approach was extended in [21] by selecting the server for each (sub-)WF at run time. The TEAM model [17] considers cooperating organizations; in this scenario, the locations of the WF servers are predetermined.

In ADEPTdistribution, the communication costs are considered by the distribution algorithm. That is, it takes into account (and tries to avoid) that the communication system can become a bottleneck. To our knowledge, it is the only approach that analyzes the load in order to minimize the communication costs by means of a suitable distribution. The other systems do not offer variable server assignments, and the influence of dependent actor assignments on the distribution is not considered.
7
Conclusion and Outlook
WfMSs that allow the explicit modeling of the flow of control, the flow of data, and the dynamic assignment of actors to the activities offer a promising technology for the realization of enterprise-wide, process-oriented application systems. The flexibility achievable with these systems, however, is reflected in a relatively high communication load between the control components (the WF servers) and the application components (the WF clients). In WfMSs with hundreds of WF clients and thousands of active WF instances, the resulting load of the WF servers as well as of the communication network plays an essential role for the response times and also for the usability of the WfMS in total. A possibility to reduce the network load is to keep, if possible, the communication between WF servers and their WF clients local within one subnet, i.e., to avoid subnet-spanning communications. This can be achieved if a WF instance is not necessarily completely controlled by the initiating WF server. Instead, its control may "migrate" to other WF servers, if necessary.

In [3] we have presented an approach using static server assignments. In this paper, we have significantly extended this approach and examined how dependent actor assignments can be supported adequately by a WfMS. With dependent actor assignments, the possible actors of an activity depend on previous activities. This is a practically very relevant case, which has not been considered in the WF literature so far. We have shown how suitable load estimations can already be performed at build time. Additionally, we have shown how the WF control can be realized in a way that the WF server of an activity can be chosen dynamically at run time. For this purpose, we have introduced variable server assignment expressions as well as models for probability and cost estimations. The cost model and the distribution algorithms were implemented and measurements were performed in order to show their correctness [14]. In addition, the effectiveness of the techniques presented was confirmed by simulations. Distributed WF execution (including variable server assignments) was realized in the ADEPT-WfMS. Since the whole system is implemented in Java, it can be used in the Internet.

To get a complete picture, several additional aspects have to be considered as well (cf. [18]). Transactional aspects, for example, may influence the communication load. Due to lack of space we could not discuss them in this paper. Our analyses performed so far make us confident that our approach does not suffer much more (if at all) than most of the other distributed approaches.

Acknowledgements: We would like to thank our colleagues Manfred Reichert and Clemens Hensinger for their valuable suggestions.
References
1. G. Alonso, M. Kamath, D. Agrawal, A. El Abbadi, R. Günthör, and C. Mohan: Failure Handling in Large Scale Workflow Management Systems. Technical Report RJ9913, IBM Almaden Research Center, 1994.
2. G. Alonso, C. Mohan, R. Günthör, D. Agrawal, A. El Abbadi, and M. Kamath: Exotica/FMQM: A Persistent Message-Based Architecture for Distributed Workflow Management. Proc. IFIP Working Conf. on Information Systems for Decentralized Organisations, Trondheim, 1995.
3. T. Bauer and P. Dadam: A Distributed Execution Environment for Large-Scale Workflow Management Systems with Subnets and Server Migration. Proc. 2nd IFCIS Conf. on Cooperative Information Systems, pages 99-108, Kiawah Island, SC, 1997.
4. T. Bauer and P. Dadam: Architectures for Scalable Workflow Management Systems – Classification and Analysis. Technical Report UIB 98-02, Universität Ulm, Fakultät für Informatik, 1998 (in German).
5. T. Bauer and P. Dadam: Distribution Models for Workflow Management Systems – Classification and Simulation. Informatik Forschung und Entwicklung, 14(4):203-217, 1999 (in German).
6. T. Bauer and P. Dadam: Efficient Distributed Control of Enterprise-Wide and Cross-Enterprise Workflows. Proc. Workshop Enterprise-wide and Cross-enterprise Workflow Management: Concepts, Systems, Applications, 29. Jahrestagung der GI, pages 25-32, Paderborn, 1999.
7. T. Bauer and P. Dadam: Variable Server Assignments and Complex Actor Assignments in the ADEPT Workflow Management System. Technical Report UIB 2000-02, Universität Ulm, Fakultät für Informatik, 2000 (in German).
8. D. Barbará, S. Mehrotra, and M. Rusinkiewicz: INCAs: Managing Dynamic Workflows in Distributed Environments. Journal of Database Management, Special Issue on Multidatabases, 7(1):5-15, 1996.
9. S. Ceri, P. Grefen, and G. Sánchez: WIDE − A Distributed Architecture for Workflow Management. 7th Int. Workshop on Research Issues in Data Engineering, Birmingham, 1997.
10. U. Dayal, M. Hsu, and R. Ladin: A Transactional Model for Long-Running Activities. Proc. 17th VLDB, pages 113-122, Barcelona, 1991.
11. S. Das, K. Kochut, J. Miller, A. Sheth, and D. Worah: ORBWork: A Reliable Distributed CORBA-based Workflow Enactment System for METEOR2. Technical Report #UGA-CS-TR-97001, Department of Computer Science, University of Georgia, 1997.
12. A. Dogac et al.: Design and Implementation of a Distributed Workflow Management System: METUFlow. Proc. NATO Advanced Study Institute on Workflow Management Systems and Interoperability, pages 61-91, Istanbul, 1997.
13. J. Eder, H. Groiss, and W. Liebhart: Workflow Management and Databases. Proc. 2ème Forum Int. d'Informatique Appliquée, Tunis, 1996.
14. H. Enderlin: Realization of a Distributed Workflow Execution Component on Basis of IBM FlowMark. Master's thesis, Universität Ulm, Fakultät für Informatik, 1998 (in German).
15. P. Heinl and H. Schuster: Towards a Highly Scaleable Architecture for Workflow Management Systems. Proc. 7th Int. Workshop on Database and Expert Systems Applications, pages 439-444, Zürich, 1996.
16. P. Muth, D. Wodtke, J. Weißenfels, A. Kotz-Dittrich, and G. Weikum: From Centralized Workflow Specification to Distributed Workflow Execution. Journal of Intelligent Information Systems, 10(2):159-184, 1998.
17. G. Piccinelli: Distributed Workflow Management: The TEAM Model. Proc. 3rd IFCIS Conf. on Cooperative Information Systems, pages 292-299, New York, 1998.
18. M. Reichert, T. Bauer, and P. Dadam: Enterprise-Wide and Cross-Enterprise Workflow-Management: Challenges and Research Issues for Adaptive Workflows. Proc. Workshop Enterprise-wide and Cross-enterprise Workflow Management: Concepts, Systems, Applications, 29. Jahrestagung der GI, pages 56-64, Paderborn, 1999.
19. M. Reichert and P. Dadam: ADEPTflex − Supporting Dynamic Changes of Workflows Without Losing Control. Journal of Intelligent Information Systems, 10(2):93-129, 1998.
20. A. Schill and C. Mittasch: Workflow Management Systems on Top of OSF DCE and OMG CORBA. Distributed Systems Engineering, 3(4):250-262, 1996.
21. H. Schuster, J. Neeb, and R. Schamburger: A Configuration Management Approach for Large Workflow Management Systems. Proc. Joint Conf. on Work Activities Coordination and Collaboration, San Francisco, 1999.
22. G. Vossen, M. Weske, and G. Wittowski: Dynamic Workflow Management on the Web. Technical Report 24/96-I, Lehrstuhl für Informatik, Universität Münster, 1996.
A Logical Framework for Exception Handling in ADOME Workflow Management System♣ 1
Dickson Chiu¹, Qing Li², and Kamalakar Karlapalem¹

¹ Department of Computer Science, University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
{kwchiu,kamal}@cs.ust.hk
² Department of Computer Science, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong
[email protected]
Abstract. We have been developing ADOME-WFMS as a comprehensive framework in which the problem of workflow exception handling can be adequately addressed. In this paper, we present a detailed design for ADOME-WFMS with procedures for supporting the following: an integrated approach from exception detection to exception resolution, reuse of exception handlers, and automated resolution of expected exceptions.
1
Introduction
Exception handling in workflow management systems (WFMSs) is a very important problem, since it is not possible to specify all possible outcomes and alternatives. Effective reuse of existing exception handlers can greatly help in dealing with workflow exceptions. On the other hand, support for workflow evolution at run time is vital for an adaptive WFMS. Any WFMS that provides a comprehensive solution for exception handling needs to provide a framework for the specification and support of generic exception handlers that can be fine-tuned to handle different run-time exceptions. Reuse of workflow definitions and exception handlers is very important for the smooth operation of a flexible WFMS.

In this paper, we model a business process as a workflow (an activity) executed by a set of problem solving agents. We use the terms activity and workflow interchangeably. A Problem Solving Agent (PSA) is a hardware/software system or a human being with the ability to execute a finite set of tasks in an application domain. Typically an activity is recursively decomposed into sub-activities and eventually down to the unit level called tasks (as illustrated by the example in Fig. 3). A task is usually handled by a single PSA. The WFMS schedules and selects the PSAs for executing the tasks. We match the tasks with PSAs by using a capability-based token/role
♣ This research has been partly funded by HKSAR RGC Grant HKUST 747/96E.
approach [19], where the main criterion is that the set of capability tokens of a chosen PSA should match the requirements of the task. A token embodies certain capabilities of a PSA to execute certain functions / procedures / tasks, e.g., programming, database-administration, Japanese-speaking, while a role represents a set of responsibilities, which usually correspond to a job function in an organization, e.g., project-leader, project-member, programmer, analyst, etc. Each PSA can play a set of PSA-roles and hold a set of extra capabilities. For example, John is a Japanese analyst-programmer who is leading a small project; thus, he may play all the above-mentioned roles (project-leader, project-member, programmer, analyst, etc.), and in addition holds an extra capability (token) of Japanese-speaking.

In this context, we employ an Advanced Object Modeling Environment (ADOME [23]) to not only specify generic exception handlers but also facilitate their reuse and adaptation to handle specific instances of exceptions that occur at run time. In particular, our ADOME-based exception handling facility provides the following features:
• Dynamic binding of exception handlers to classes, objects and roles, and to activities with scoping.
• Addition, deletion and modification of handlers at run time through workflow evolution support.
• Specifying and reusing exception handlers upon unexpected exceptions with the Human Intervention Manager.

In contrast to traditional software systems, workflows usually evolve more frequently, making reuse a vital issue. On the other hand, workflow evolution often takes place at execution time, making it much more difficult to handle. There have been few WFMSs designed to address these two problems (viz., reusing exception handlers and workflow evolution) effectively and adequately. In this regard, we have been developing ADOME-WFMS as a comprehensive framework in which the problem of workflow exception handling can be adequately addressed. We use an integrated, event-driven approach for execution, coordination, and exception handling in our WFMS. Events (such as database events / exceptions, or external inputs) trigger the WFMS Activity Executor to start an activity. The Activity Executor uses events to trigger the execution of tasks, while finished tasks inform the Activity Executor with events. Upon an exception, exception events trigger the WFMS Exception Manager to take control of resolution.

In this paper, we present a logical framework of exception handling (with detailed design and main algorithms) for ADOME-WFMS, covering reuse issues in specifying exceptions and their handlers and a novel solution based on workflow evolution. More details regarding the classification of exceptions and handlers, and modeling aspects for ADOME-WFMS, are given in [8], while ADOME-WFMS exception-driven workflow recovery is presented in [9]. The objectives and contributions of this paper include: (i) the mechanisms of ADOME-WFMS and the resolution of expected exceptions, (ii) support of reuse for workflow definitions, constraints, exception types and handlers in ADOME-WFMS, and (iii) demonstration of the feasibility of ADOME-WFMS for supporting exception handling through effective reuse.

The rest of our paper is organized as follows. Section 2 describes the architecture and basic execution mechanisms of ADOME-WFMS. Section 3 presents how
ADOME-WFMS resolves expected exceptions. Section 4 explains how reuse can be facilitated. Section 5 compares related work. Finally, we conclude the paper with our plans for further research in Section 6.
2
Architecture of ADOME-WFMS
[Fig. 1 depicts the task life cycle and exception handling in ADOME-WFMS: the Activity Executor monitors task execution and invokes the Match Maker to find a PSA; exceptions (including cascaded exceptions, rejects and failures) invoke the Exception Manager, which may trigger exception handlers, compensation activities, human intervention, or the Recovery Manager for rollback, after which execution is resumed / re-executed or the failure is propagated as an exception of the parent activity.]
Fig. 1. ADOME-WFMS Task Execution, Exception and Recovery
The ADOME system was developed to enhance the knowledge-level modeling capabilities of OODBMS models [23], so as to allow them to more adequately deal with the data and knowledge management requirements of advanced information management applications, especially WFMSs. The architecture of ADOME is characterized by the seamless integration of a rule base, an OODBMS and a procedure base. A role extension to the OO model has been employed for accommodating the dynamic nature of object states and for capturing more application semantics at the knowledge level. Roles also act as "mediators" for bridging the gap between database, knowledge base and procedure base semantics, and facilitate dynamic binding of data objects with rules and procedures [22]. Advanced ECA rules are also supported [5,6]. The following components / enabling technologies of ADOME are useful for various aspects of a WFMS, especially for facilitating reuse:
• Object-oriented database (OODB) – Using an OODB for the modeling and processing of complex objects and their relationships is almost a consensus for building an
advanced WFMS and other next-generation information systems. For example, the composition hierarchy for modeling activities and sub-activities down to tasks, and the isa hierarchy for PSAs, not only capture more semantics than traditional relational models, but also help in the reuse of their definitions in the WFMS. It also enables easier maintenance, understandability and extensibility than a large number of inter-related tables [19]. Moreover, the OO paradigm enables flexible passing of different forms of data among agents and tasks, with the OODB providing a convenient general persistent storage for almost everything in the WFMS that needs to be recorded.
• Roles – Roles enable PSA objects to be dynamically associated with one or more different functions, responsibilities and authorities. They also capture attributes, states, methods and knowledge specific to individual positions/agents rather than to the PSA player objects. With roles extended with multiple inheritance, capabilities and roles for PSAs can be much better represented in a hierarchy (cf. [7]).
• Rules – Declarative exception handlers in the form of Event-Condition-Action (ECA) rules enable automatic execution and exception handling based on events, where the exceptions correspond to the event part, while the handlers correspond to the condition-action parts. Rules can also be used for processing declarative knowledge such as organization policies, agent selection criteria, exception handling criteria, etc.
• Flexibility of objects and schema – This facilitates exception handling, since real-time modifications to objects, roles and rules, and even workflow evolution, are required during the execution of workflows.
One of the main objectives of ADOME-WFMS is to provide extensive possibilities for reuse. In addition to the reuse of workflow definitions [7,8], we present in this section the modeling of exceptions, handlers and constraints that facilitates reuse. The ADOME prototype has been built by integrating an OODBMS (ITASCA [16]) and a production inference engine (CLIPS [12]). Therefore, a WFMS can be implemented on top of it with relative ease. The architecture and functional aspects of the resultant ADOME-WFMS are as follows (cf. Fig. 1):
• ADOME active expert OODBMS (not shown in the figure) provides a unified enabling technology for the WFMS, viz., object and role database, event specification and execution, and rule / constraint specification and processing.
• Activity Decomposer facilitates the decomposition of activities into tasks. The user provides the knowledge and related data to decompose activities into tasks via a user interface.
• Organizational Database manages data objects for the organization, as well as PSA classes, instances and their capability token (role) specifications. Besides maintaining user-specified extensional tokens / roles systematically, intensional token/role derivation for a PSA is also supported.
• Activity Executor coordinates execution by user-raised and database-generated events.
• Match Maker selects PSAs for executing the tasks of an activity according to some selection criteria. ([8] describes the details of capability modeling and the mechanisms of the Match Maker of ADOME-WFMS.)
• Exception Manager handles various exceptions by re-executing failed tasks or their alternatives (either resolved by the WFMS or determined by the user) while maintaining forward progress.
• Recovery Manager performs various housekeeping functions, including rollback, to maintain the WFMS in consistent states. (Details can be found in [9].)
2.1 Exception Handling Mechanisms of ADOME-WFMS
In this paper, we shall concentrate on a centralized control and coordination execution model centered on the Activity Executor of the WFMS. The Activity Executor monitors task execution status and enforces deadlines. For the normal task life cycle, it initiates the PSAs selected by the Match Maker to carry out their assigned tasks and gets the response (if any) from the PSA upon task completion. On the other hand, if a task raises exception events or does not respond within the deadline (i.e., times out), the Exception Manager will respond and handle it.
2.1.1 Normal Workflow Execution
An event-driven activity execution model with meta-ECA rules can be found in [8]. Moreover, this provides a unified approach for normal activity execution and exception handling [8,19]. The mechanisms of ADOME-WFMS activity execution (cf. Fig. 1) are explained as follows:
• There is an Activity Decomposer module, which generates ECA rules for automatic coordination during the execution of a workflow and stores them in the database [8,19].
• Users and external applications can trigger the corresponding start-events to start work.
• Upon a start-event, if the activity is a composite one, the Activity Executor will raise a start-event for the first sub-activity. This process continues recursively down the composition hierarchy until a leaf task is reached.¹ The Activity Executor invokes the Match Maker to select the appropriate PSA(s) for the task and then initiates the task. (The algorithm of the Match Maker is detailed in [8].)
• The selected PSA will acknowledge or reject the assignment by raising a corresponding reply event.
• After finishing the assigned task successfully, the PSA replies to the Activity Executor by raising a finish-event. The Activity Executor then carries on with the next step according to the result passed back.
• Upon failures or time out, the PSA or the system will raise an appropriate exception event to invoke the Exception Manager.
¹ This approach can handle dynamic resource allocation, online modification of workflow and exceptions in a rather flexible manner.
2.1.2 Detection of Events and Exceptions
The data dependency, temporal dependency and external input dependency can be expressed by means of a uniform framework of events, such as data operations, workflow, clock time, external notification, and abstract events. Besides primitive events, any (recursive) combination of conjunction, disjunction, or sequence of other events can define a composite event. These events are all detected by the underlying ADOME event facilities as described in [5,6]. Since exceptions are ADOME events, detection of exceptions for ADOME-WFMS is well supported at run time (a small sketch of composite event matching follows this list):
• External exceptions – events raised by external entities can be intercepted by the WFMS. These external events must be a priori characterized as events generated due to exceptions.
• Workflow exceptions raised by WFMS components, e.g.:
  – Match Maker – cannot find a PSA
  – Activity Executor – PSA rejects assignment, not enough resources
  – Organization Database – data constraint violations upon update
  – Exception Manager – (ignored) failure of a task / sub-activity will cause an exception at its parent
• Workflow exceptions detected by automatic ADOME ECA rules and/or constraints, e.g.:
  – activities cannot meet a deadline
  – activity constraint violations (e.g., budget exceeded)
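The composite events mentioned above can be illustrated by the following minimal Python sketch, which evaluates conjunction, disjunction and sequence over a recorded event history; the class names and the simplified matching semantics are assumptions of this sketch and do not reflect the actual ADOME event facilities [5,6].

class Primitive:
    def __init__(self, name):
        self.name = name
    def matches(self, history):
        return self.name in history

class And:                                   # conjunction: all parts occurred
    def __init__(self, *parts):
        self.parts = parts
    def matches(self, history):
        return all(p.matches(history) for p in self.parts)

class Or:                                    # disjunction: any part occurred
    def __init__(self, *parts):
        self.parts = parts
    def matches(self, history):
        return any(p.matches(history) for p in self.parts)

class Seq:                                   # sequence: second occurs after first
    def __init__(self, first, second):
        self.first, self.second = first, second
    def matches(self, history):
        return any(self.first.matches(history[:i + 1]) and self.second.matches(history[i + 1:])
                   for i in range(len(history)))

# e.g. an exception event generalized with 'or' composition (cf. Section 4.1):
program_or_pc_failure = Or(Primitive("program_error"), Primitive("PC_failure"))
print(program_or_pc_failure.matches(["task_started", "PC_failure"]))   # True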
2.1.3 Handling Exceptions
As supported by the underlying ADOME facilities, the following information will be passed to the Exception Manager upon an exception:
• the source and type of the exception,
• for workflow exceptions, state information of the task / activity [8],
• any extra parameters defined by the exception type (e.g., budget value).
The Exception Manager then takes control and carries out the following steps (sketched in code after this list):
1. Perform notification if necessary.
2. Identify the appropriate exception handler(s) and execute them. Handlers are modeled as sub-activities in ADOME-WFMS. One or more handlers will be executed until the problem is solved (cf. Section 3).
3. If no appropriate exception handler is found (i.e., an unexpected exception), or human intervention is specified, the Human Intervention Manager (cf. Section 4) will be invoked. The human can then select the appropriate handler and/or perform workflow evolution (cf. [10]).
4. If rollback is required, the Recovery Manager will be invoked for compensating activities (cf. [9]).
5. Resume / redo execution, jump to the appropriate step as decided by step 2 or 3, or abort the current task / sub-activity so that the exception propagates to its parent for further handling. Though a failure may propagate up the activity composition hierarchy, this approach localizes exceptions and thus reduces the loss of work done.
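The control flow of these five steps can be summarized by the following illustrative Python sketch; all callables and attributes used here (find_handlers, human_intervention, recovery_manager, notify, requires_human, requires_rollback) are hypothetical placeholders for the corresponding ADOME-WFMS components, not their actual interfaces.

def handle_exception(exc, activity, find_handlers, human_intervention,
                     recovery_manager, notify):
    notify(exc, activity)                        # 1. notification if necessary
    resolved = False
    for handler in find_handlers(exc, activity): # 2. handlers are sub-activities (Section 3)
        resolved = handler.execute(exc, activity)
        if resolved:
            break
    if not resolved or exc.requires_human:       # 3. unexpected exception / manual mode
        resolved = human_intervention(exc, activity)
    if exc.requires_rollback:                    # 4. compensation via the Recovery Manager
        recovery_manager.compensate(activity)
    if resolved:                                 # 5. resume, redo or jump as decided above
        return activity.resume()
    return activity.abort_and_propagate()        #    otherwise propagate to the parent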
3
Handling Expected Exceptions in ADOME-WFMS
In this section, we first discuss how exception handlers are identified and executed in ADOME-WFMS, using the example in Fig. 2. Then we discuss in depth how reuse is facilitated with advanced exception modeling based on the ADOME mechanisms.
[Fig. 2 overview: (a) The repeatable activity "Requisition": Purchase Request → Procurement, followed either by the COD branch (Receive and check goods; Payment Arrangement) or the Credit branch (Receive and check goods; Wait till Payment Due; Payment Arrangement); an exception arc "Supplier not found" is attached to Procurement, and the two branches are replaceable. (b) The sub-activity "Purchase Request" (repeatable): Get product information (optional) → Fill in PR form → Budget Check (critical) → PR approval (critical). (c) The sub-activity "Payment Arrangement" (critical, manual): Match PR, PO and invoice → Check available funding → Payment Authorization → Prepare Cheque. (d) Composition hierarchy view with bound handlers: Requisition has mandatory_handler r1=(E: PC_failure, C: True, A: EDP.manager.inform) and handlers r2=(E: financial_controller.na, C: task.budget<2000, A: approval.PSA(department.manager)) and r3=(E: document.na, C: True, A: document.sender.email(lost)); Procurement has mandatory_handler r21=(E: all_exception, C: True, A: purchasing.manager.inform); Payment has reexecution_mode=(critical, manual) and mandatory_handler r31=(E: all_exception, C: True, A: finance.manager.inform).]
Fig. 2. Example Workflow of Requisition Procedures
[Fig. 3 flowchart summary: starting from a detected exception, the WFMS first executes all relevant mandatory ECA handlers bound to the current scope, the parent activity, etc., all the way up the composition hierarchy; if unresolved, it attempts the relevant procedural handlers of the current task / activity one by one, then the relevant ECA handlers up the composition hierarchy, then the built-in handlers of ADOME-WFMS, and finally applies the re-execution criteria (re-execute repeatable failed activities, choose another path for replaceable ones, skip optional failed tasks, or abort); if the exception is resolved or skipped, execution resumes, otherwise human intervention is used or the current activity is aborted and the exception is raised at its parent.]
Fig. 3. Flowchart for Identifying and Executing Exception Handlers
3.1
Identifying and Executing Exception Handlers
One or more exception handlers may be qualified to handle an exception that occurs. As illustrated in Fig. 3, the ADOME-WFMS Exception Manager employs the following priority order for selecting the appropriate exception handler (a code sketch of this selection order follows the list):
1. Mandatory ECA handlers - Since the users specify these as mandatory, all relevant handlers (with event matched and condition fulfilled) bound to the current scope are executed in the order of the task / sub-activity, its parent, and all the way up to the global activity. For example, a mandatory ECA rule specifies that all exceptions should notify the purchasing manager in the procurement activity (r21 in Fig. 2(d)), while a global mandatory ECA rule specifies that all PC failures should be reported to the EDP manager (r1 in Fig. 2(d)). In this case, the failure of a PC in the purchasing department causing an exception will trigger both rules and inform the purchasing manager and the EDP department. These mandatory actions, such as logging and notification, may or may not solve the problem causing the exception. If they cannot solve the problem, other categories of exception handlers will be executed. For example, all exceptions in the 'Payment Arrangement' activity (cf. Fig. 2(c)) are regarded as having severe consequences, so the financial manager will be informed (r31 in Fig. 2(d)) and manual handling of the exception is required. Furthermore, the organization manager may later find it useful for all exceptions in a department to be reported to its manager. So rules r21 and r31 are
combined and generalized as a new global mandatory rule r4=(E: all_exception, C: True, A: task.department.manager.inform).
2. Procedural handlers - These are extra branches for exception handling. Each procedural handler is specific to a certain task or sub-activity under a particular context for handling specific outcomes. Since they are explicit and context sensitive, they are chosen before (3) ECA handlers. For example, the 'Supplier not found' arc (cf. Fig. 2(a)) represents a procedural handler.
3. ECA handlers - These are searched from the current activity up the composition hierarchy to allow special exception handlers to override default exception handlers if necessary. If more than one relevant handler is declared for the same activity, the one(s) for the more specific exception type are chosen over those for the more general exception type (as explained in Section 4). For example:
• The rule "If the financial controller is not available, the department manager can approve any task involving less than $2000" (r2 in Fig. 2(d)) enables the department manager to approve small purchase requests if the financial controller is not available.
• The rule "Send email to the issuer if a document is lost" (r3 in Fig. 2(d)) will cause an email to be sent to the supplier if an invoice is lost in the step 'Match PR, PO and invoice'.
4. Built-in handlers - For generic exceptions, ADOME-WFMS has built-in exception handlers, such as:
• If a PSA rejects a task assignment or the best candidate PSA is not available, the WFMS will find the next available PSA.
• If all PSAs capable of executing the task are busy or the required resources are occupied, the WFMS will either wait or choose alternate execution paths.
ADOME-WFMS supports many exception handling resolutions relating to PSA assignment based on capability matching, such as amending the capabilities of PSAs and changing the capability requirements for a task instance [8]. This is important because a significant portion of internal (workflow) exceptions is due to failures in finding (suitable) PSA(s) for the execution of tasks. Moreover, ADOME supports advanced analysis of PSA capabilities, termed the "capability role/token multiple inheritance hierarchy" and "token derivation network theory" [8]. This increases the chance of finding suitable PSA(s) automatically (thus avoiding a 'PSA not available' exception), and of finding alternate PSAs for repeating a task upon exception. For automatic switching of PSA assignments among tasks, the above-mentioned capability processing features are vital to the success of this resolution scheme. It should be noted that quite a number of traditional WFMSs, like Flowmark [1] and OASIS [24], do not readily support or employ the notion of capability matching for PSA assignment to tasks.
5. Re-execution criteria - ADOME-WFMS can resolve and decide on the correct alternative PSAs or an alternate execution branch automatically if the re-execution pattern for a task has been specified. This feature saves many tedious explicit jumps and aborts, especially with scoping in ADOME-WFMS (cf. Section 4). Moreover, in this way one can resolve many unexpected exceptions if re-execution helps.
The WFMS will automatically re-execute repeatable failed activities, choose another execution path for replaceable failed activities, and skip optional failed tasks, if none of the above handlers is specified. However, critical failed tasks without explicit handlers are unexpected exceptions (Section 3.2 discusses the prevention of cascaded exceptions and loops); they will therefore result in human intervention. Upon re-execution, in order to maintain work continuity and save start-up overhead, the same agent is preferred unless otherwise specified. The next candidate would be the nearest capable sibling or ancestor according to the organization structure (i.e., most probably a member of the team or the supervisor). For example:
• The 'Requisition' activity specified in Fig. 2(a) is repeatable and thus 'Purchase Request', 'Procurement', etc. are all repeatable unless otherwise stated.
• As funding may not be available for cash on delivery (COD), or the supplier may not be willing to deliver the ordered goods if the new company's credit limit is exceeded, the two branches following 'Procurement' (cf. Fig. 2(a)) are replaceable.
• Upon 'Purchase Request', the user may not need to 'Get product information' because he may already know it very well, or he may not even know how to get such information. (E.g., the user may just specify that he wants a chair costing around $500 and let the procurement department take care of the rest.) Hence, this task is optional (cf. Fig. 2(b)).
• As illustrated in Fig. 2(b), the tasks 'Budget Check' and 'PR approval' are flagged critical, so that the sub-activity 'Purchase Request' will fail immediately if these tasks are not executed. However, 'Purchase Request' is repeatable, so that the user can also revise the budget and/or product specification to retry for approval.
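The priority order of this section can be sketched as follows (illustrative Python only; handler_index is an assumed lookup that yields the matching handlers of one category, ordered from the current scope up the composition hierarchy).

HANDLER_PRIORITY = ("mandatory_eca", "procedural", "eca", "built_in", "reexecution")

def resolve(exc, activity, handler_index):
    # Mandatory ECA handlers always run, whether or not they solve the problem.
    resolved = False
    for h in handler_index("mandatory_eca", exc, activity):
        resolved = h.run(exc, activity) or resolved
    if resolved:
        return True
    # The remaining categories are tried one by one until the exception is resolved.
    for kind in HANDLER_PRIORITY[1:]:
        for h in handler_index(kind, exc, activity):
            if h.run(exc, activity):
                return True
    return False   # unexpected exception: hand over to human intervention (Section 2.1.3)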
3.2
Handling Cascaded Exceptions and Loops
In order to handle cascaded exceptions effectively and avoid the infinite generation of cascaded exceptions, the following safety measures are employed in ADOME-WFMS (a sketch of the repetition check appears after this list):
• Notification within the exception handling activity to the human deciding on the handler for an unexpected exception (e.g., to report even expected exceptions).
• If cascaded exceptions occur, the same human should be notified for better management and decisions.
• Tighter constraints, such as deadlines and budgets, can be introduced to prevent the exception handling activity from running indefinitely and to let control pass back for human decisions.
• When a human decides on a jump back to a previously executed step as an exception handler (including simply redoing the currently failed task), there may be a potential danger of looping, and the human will thus be warned.
• The Activity Executor will keep track of the sequence of executed tasks. If the same task in the same context is executed more than a certain number of times (or a specified iteration count), a warning or exception will be raised for human intervention or alternative actions.
120
D. Chiu, Q. Li, and K. Karlapalem
• Backtracking, so that the human can undo some of the decisions taken (if possible) and finalize the resolution once all the different aspects of exception handling are taken care of.
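The repetition check among these measures can be illustrated by a small guard object; the threshold and the data structure are assumptions of this sketch, not the ADOME-WFMS implementation.

from collections import Counter

class LoopGuard:
    # Track how often a task is executed in the same context and flag it once a
    # (configurable, here assumed) limit is exceeded.
    def __init__(self, max_repetitions=3):
        self.max_repetitions = max_repetitions
        self.counts = Counter()

    def record(self, task_id, context_id):
        self.counts[(task_id, context_id)] += 1
        return self.counts[(task_id, context_id)]

    def exceeded(self, task_id, context_id):
        return self.counts[(task_id, context_id)] >= self.max_repetitions

guard = LoopGuard(max_repetitions=3)
for _ in range(3):
    guard.record("PR_approval", "instance-42")
print(guard.exceeded("PR_approval", "instance-42"))   # True: ask for human intervention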
4
Exception Modeling for Reuse in ADOME-WFMS

[Figure 4 depicts activities, exceptions, and handlers as first-class objects: activity classes and instances are bound to handlers (ECA rules whose event part is triggered by the exceptions that activity instances raise), and the exception class hierarchy refines exceptions into external / internal and expected / unexpected subclasses, with expected internal exceptions (e.g., "PSA reject", "no PSA") being trivially, system- or WFMS-handled.]
Figure 4. Activities, Exceptions, and Handlers as first-class objects
Figure 4 illustrates the main entities and relationships in ADOME-WFMS with regard to exception handling. These entities are all modeled as first-class objects. In particular, the class of exceptions is a subclass of the class of events. A taxonomy of exceptions and handlers can be found in [8]. Handlers are modeled as sub-activities so that they can carry out arbitrarily complicated actions, and nested exceptions are supported by recursively invoking the Exception Manager. In ADOME-WFMS, declarative exception handlers in the form of ECA rules can be bound to selected classes, objects and roles. Furthermore, handlers can be specified within the scope of different activity and sub-activity levels, i.e., a handler applies not only to the body of the target activity but also to all its sub-activities and tasks.
Similarly, human intervention requirements for exception handling (automatic, warning, cooperative and manual) and re-execution patterns (optional, critical, repeatable and replaceable) for sub-activities and tasks are specified within the scope of this composition hierarchy, with the lowest level taking priority in specification and thus overriding those of higher levels. The ADOME-WFMS Human Intervention Manager allows the user to modify all the above declarations and associations at run time as described in [8,10]. The power of ADOME-WFMS in reuse over other systems (such as [3,4,11,19,20]) is mainly due to ADOME's ability to dynamically bind rules to different dimensions (objects, roles, sub-activities, etc.) at run time, as explained below. However, a methodology for workflow design that facilitates the reuse of exception handlers is beyond the scope of this paper.
4.1
Reusing Exception Handlers
Since exceptions can be common in a WFMS, reusing exception handlers is vital to the effectiveness, user-friendliness and efficiency of the WFMS. In ADOME-WFMS, the mechanisms for the reuse of exception handlers follow from its structure (a sketch of handler lookup along the exception hierarchy follows this list):
• For procedural exception handlers, arcs from more than one peer task / sub-activity at the same level (siblings inside the same parent activity) can lead to the same exception handler for some degree of sharing.
• Because of scoping, only one declarative exception handler is required for each exception type for each activity composition hierarchy (as explained in the previous section).
• Declarative exception handlers are first-class ECA rule objects. A rule object r is declared and defined once and can then be associated with more than one scope by repeated binding (e.g., Bind r9 to payment, requisition).
• Since exceptions are events (which are first-class objects in ADOME), exception classes are also arranged into an 'isa' hierarchy. Thus, an exception handler for a super-class will also handle an exception of a sub-class. (E.g., an exception handler for program_error will also handle subscript_out_of_range.)
• Extending the event part with 'or' event composition can generalize exception handlers (e.g., E: program_error ∨ PC_failure, A: EDP.manager.inform) and increase their applicability.
• Meta-level rules can be instantiated through parameters and supplied methods to specify rules, such as budget rules instantiated with an actual budget figure.
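The effect of the exception 'isa' hierarchy on handler lookup can be illustrated in Python by walking the method resolution order of exception classes; the classes and the handler table below are illustrative examples only, not ADOME-WFMS code.

class WFException(Exception): pass
class ProgramError(WFException): pass
class SubscriptOutOfRange(ProgramError): pass

# Handlers are registered per exception class (and, via scoping, per activity).
HANDLERS = {
    ProgramError: lambda exc: print("notify EDP manager about", type(exc).__name__),
}

def find_handler(exc):
    # Walk the exception's isa-hierarchy so that a handler bound to a
    # super-class also covers all of its sub-classes.
    for cls in type(exc).__mro__:
        if cls in HANDLERS:
            return HANDLERS[cls]
    return None

handler = find_handler(SubscriptOutOfRange())
handler(SubscriptOutOfRange())   # the ProgramError handler also handles the sub-class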
4.2
Reusing Constraint Definitions
In order for the modification of task instances and workflow evolution to be in accordance with users' requirements, we employ a strategy of associating consistency constraints with appropriate entities at different levels and dimensions, as explained below. When constraints are violated at run time, appropriate exceptions will be raised for handling. Since constraints are also implemented as ECA rules in ADOME-WFMS,
reuse and evolution of consistency constraints are both possible and similar to those of exceptions and exception handlers in general. Associating constraints with different levels and dimensions can be naturally specified with ADOME's facilities:
• Organization Composition Hierarchy – organization, units, sub-units and groups. Reuse is supported through inheritance down the composition hierarchy. The objectives of the organization are also those of its units, sub-units and groups, while individual units can only have more specific objectives that do not violate those of the organization.
• Activity Composition Hierarchy – projects (any collection of activities), activities, sub-activities, tasks. Reuse is supported through inheritance down the composition hierarchy. Deadlines and budgets are typical examples. Furthermore, scoping provides further flexibility for handling violations of constraints, so that failed sub-activities (ignored exceptions) can be remedied by a higher-level activity.
• Role/Token Hierarchy – positions and capabilities. Reuse is supported through multiple inheritance down the hierarchy. For example, programmers (and hence senior programmers) are allowed to log on to the development computer account; however, the minimum productivity index of a senior programmer can be specified to be higher than that of a junior programmer by overriding rules, as supported by ADOME [5,6].
• Any class and sub-classes of objects in ADOME-WFMS (such as PSAs and resources).
• Combinations of the above levels and dimensions.
As supported by ADOME [5,6], constraints and rules are first-class objects and can be bound to objects, classes, and roles repeatedly. They can also be generalized or specialized by the 'or' and 'and' connectors. For example, programmers, engineers and managers should be university graduates.
4.3
Reusing Compensation Transactions for Workflow Recovery
Compensation transactions are considered part of the exception handling procedure and are modeled as sub-activities of the exception handlers in ADOME-WFMS. In fact, the execution and definition of compensation transactions are implemented as ECA rules, which corresponds to the integrated framework of event-driven activity execution in ADOME-WFMS. This allows different compensation transactions to be executed according to the exception type and the general conditions during execution [9]. The reuse of compensation transaction definitions follows from the rich set of features for reusing exception handlers (a sketch of such bindings appears after this list):
• Once a compensation transaction is defined, it can be used in different contexts.
• "Simple rollback" (restoring the previous value or state of the object, like handling a traditional database update in a workflow) and "mark void" (keeping useful documents or partial work done for reference) are implemented as pre-defined compensation transactions.
• The activity decomposition hierarchy also provides scoping for associating a compensation transaction with a sub-activity tree.
• A compensation transaction can be bound to each rollback class and sub-class of objects (also to roles and sub-roles).
• Dynamic use of compensation transactions according to different exception events can be facilitated with the flexibility of event definition: the event isa-hierarchy and event 'or' composition.
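A minimal sketch of how compensation transactions could be bound to exception events and object classes is given below; the binding table and the two pre-defined compensations are illustrative assumptions modeled on the description above, not the actual ADOME-WFMS interfaces.

def simple_rollback(obj, saved_state):
    # Pre-defined compensation: restore the previous state of the object.
    obj.clear()
    obj.update(saved_state)

def mark_void(obj, saved_state):
    # Pre-defined compensation: keep the partial work but flag it as void.
    obj["void"] = True

COMPENSATION = {}                  # (exception event, object class) -> compensation

def bind_compensation(event, obj_class, action):
    COMPENSATION[(event, obj_class)] = action

def compensate(event, obj_class, obj, saved_state):
    action = COMPENSATION.get((event, obj_class), simple_rollback)
    action(obj, saved_state)

# e.g. a lost invoice keeps the partial work for reference instead of deleting it
bind_compensation("document.na", "Invoice", mark_void)
invoice = {"no": 4711, "amount": 500}
compensate("document.na", "Invoice", invoice, saved_state={})
print(invoice)                     # {'no': 4711, 'amount': 500, 'void': True}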
5
Related Work
A classical article on exception handling is [2], which focused on database aspects of exceptions and handling techniques instead of workflow systems. On the other hand, notable advanced WFMSs have been developed in the past years [1, 4, 13, 18, 19, 21, 24, 25]. Among them, TriGSflow [16] and the work of Kumar et al. [18] perhaps have the closest basic design to ours in that they adopt an OO design and utilize rules and roles. However, they did not address a variety of exception conditions or use capability matching for tasks and agents. Ellis et al. [14] is among the earlier work on workflow evolution. WIDE [4] used object and rule modeling techniques and suggested some measures for handling exceptions. Exceptions were classified, but handling approaches were not. They also addressed the reuse and management of exception handlers with implementation details, but did not adequately consider high-level semantics, especially the inter-relationships among entities in a WFMS. Their workflow evolution primitives were not at a semantic level. PROSYT [11] addressed inconsistencies and deviations in general process support systems (where a WFMS is considered a kind of process support system), but the contribution was more on formal modeling than on semantic modeling. [3] presented a framework for tolerating exceptions in WFMSs close to ours [8], but without details at the logical and implementation levels. [20] adopted a knowledge-based approach to handling exceptions in WFMSs, with strong emphasis on agent management. [26] studied the categories of exceptions in WFMSs but did not involve workflow evolution. Flowmark uses Sagas and flexible transactions for modeling workflow exception handling; its extension, Exotica/FMDC [1], handles disconnected agents. Since Flowmark only finds all possible candidates for task execution and then lets them volunteer for the execution, instead of using capability matching, effectiveness and fairness may be impaired. WAMO [13] also uses Sagas and flexible transactions for supporting workflow exception handling. It also offers a preliminary classification of exceptions, which we have extended in our taxonomy of exception categories (cf. [8]). Similarly, other works like [15], ConTract [25] and OPERA [17] focus on transactional aspects and thus on lower-level issues. In summary, other workflow systems either do not address exception-handling problems comprehensively or concentrate only on extended transaction models. Furthermore, few systems have advocated (let alone supported) an extensive meta-modeling approach (based on PSAs, match-making, exception handling, etc.). Compared with the systems closest to ours, ADOME-WFMS has the most features to facilitate reuse.
6 Conclusion
This paper has presented adaptive exception handling in ADOME-WFMS, a flexible WFMS based on ADOME, an active OODBMS extended with role and rule facilities. Compared with other research on this topic, ADOME provides an improved environment for developing a WFMS which can adapt to changing requirements, with extensive support for reuse. In particular, the resultant system (i.e., ADOME-WFMS) supports a rich taxonomy of exception types and their handling approaches, and a novel augmented solution for exception handling based on workflow evolution. Effective reuse of workflow definitions, exceptions, handlers and constraints in ADOME-WFMS has also been presented. This paper has also described in detail how expected exceptions are actually resolved with the ADOME-WFMS Exception Manager. It should be noted that, though exception handling is highly automated in ADOME-WFMS through scoping, binding and reuse, human intervention management must be provided to support (totally) unexpected exceptions and drastic workflow evolutions. ADOME-WFMS is currently being built on top of the ADOME prototype system, with a web-based user interface [10] to accommodate the whole range of activities.
References
1. G. Alonso, et al. Exotica/FMDC: a workflow management system for mobile and disconnected clients. Distributed & Parallel Databases, 4(3):229-247 (1996).
2. A. Borgida, Language Features for Flexible Handling of Exceptions, ACM Trans. on Database Systems (1985).
3. A. Borgida and T. Murata, A Unified Framework for Tolerating Exceptions in Workflow/Process Models - A Persistent Object Approach, International Joint Conference on Work Activities Coordination and Collaboration (WACC '99), San Francisco (1999).
4. F. Casati, G. Pozzi. Modeling and Managing Exceptional Behaviors in Workflow Management Systems, Proceedings of CoopIS'99, Edinburgh, Scotland, September 1999.
5. L. C. Chan and Q. Li. Devising a Flexible Event Model on top of a Common Data / Knowledge Storage Manager. In Proceedings of 6th Intl. Workshop on Information Technologies and Systems (WITS '96), Cleveland, Ohio, pp.182-191 (1996).
6. L. C. Chan and Q. Li. An Extensible Approach to Reactive Processing in an Advanced Object Modeling Environment. In Proceedings of 8th Intl. Conf. on Database and Expert Systems Applications (DEXA '97), LNCS(1308), pp.38-47, Toulouse, France (1997).
7. D. K. W. Chiu, K. Karlapalem and Q. Li. Developing a Workflow Management System in an Integrated Object-Oriented Modeling Environment. In Proceedings of 6th Int'l Conf. on Software Engineering and Knowledge Engineering (SEKE'98), pp.71-78, San Francisco (1998).
8. D.K.W. Chiu, Q. Li and K. Karlapalem, "A Meta Modeling Approach for Workflow Management Systems Supporting Exception Handling", Special Issue on Method Engineering and Metamodeling, Information Systems, Elsevier Science, 24(2):159-184 (1999).
9. D.K.W. Chiu, Q. Li and K. Karlapalem, Facilitating Exception Handling with Recovery Techniques in ADOME Workflow Management System, Journal of Applied Systems
Studies, Cambridge International Science Publishing, Cambridge, England (2000, to appear).
10. D.K.W. Chiu, Q. Li and K. Karlapalem, A Web-based Interface for ADOME Workflow Management System Facilitating Exception Handling, submitted to WISE'2000.
11. G. Cugola, Inconsistencies and Deviations in Process Support Systems, PhD Thesis, Politecnico di Milano (1998).
12. http://www.ghg.net/clips/CLIPS.html
13. J. Eder and W. Liebhart. The Workflow Activity Model WAMO. In Proceedings of CoopIS-95, pp. 97-98 (1995).
14. S. Ellis et al., Dynamic Change within Workflow Systems, Proceedings of the Conference on Organizational Computing Systems (1995).
15. D. Georgakopoulos, M. F. Hornich and F. Manola. Customizing Transaction Models and Mechanisms in a Programmable Environment Supporting Reliable Workflow Automation. IEEE Transactions on Knowledge and Data Engineering, 8(4):630-649 (1996).
16. Ibex Corporation. http://www.ibex.ch/
17. C. Hagen and G. Alonso, Flexible Exception Handling in the OPERA Process Support System, 18th International Conference on Distributed Computing Systems (ICDCS 98), Amsterdam, The Netherlands (1998).
18. G. Kappel, et al. Workflow Management Based on Objects, Rules, and Roles. IEEE Bulletin of the Technical Committee on Data Engineering 18(1):11-18 (1995).
19. K. Karlapalem, H. P. Yeung and P. C. K. Hung. CapBaseED-AMS - A Framework for Capability-Based and Event-Driven Activity Management System. In Proceedings of CoopIS '95, pp. 205-219 (1995).
20. M. Klein and C. Dellarocas, A Knowledge-Based Approach to Handling Exceptions in Workflow Systems, Proceedings of the Third International Conference on Autonomous Agents, Seattle, Washington (1999).
21. A. Kumar, et al. A framework for dynamic routing and operational integrity controls in a workflow management system. In Proceedings of the Twenty-Ninth Hawaii International Conference on System Sciences, 3:492-501 (1996).
22. Q. Li and F. H. Lochovsky. Roles: Extending Object Behaviour to Support Knowledge Semantics. In Proceedings of Int'l Symposium on Advanced Database Technologies and Their Integration, Nara, Japan, pp. 314-322 (1994).
23. Q. Li and F. H. Lochovsky. ADOME: an Advanced Object Modeling Environment. IEEE Transactions on Knowledge and Data Engineering, 10(2):255-276 (1998).
24. C. Martens and C.C. Woo. OASIS: An Integrative Toolkit for Developing Autonomous Applications in Decentralized Environments. Journal of Organizational Computing, New Jersey: Ablex Publishing Corporation, 7(2&3):227-251 (1997).
25. A. Reuter and F. Schwenkreis. ConTracts - A Low-Level Mechanism for Building General-Purpose Workflow Management Systems. IEEE Bulletin of the Technical Committee on Data Engineering 18(1):4-10 (1995).
26. H. T. Saastamoinen, On the Handling of Exceptions in Information Systems, Ph.D. Thesis, University of Jyväskylä (1995).
Controlled Flexibility in Workflow Management
Justus Klingemann
German National Research Center for Information Technology (GMD)
Integrated Publication and Information Systems Institute (GMD-IPSI)
Dolivostraße 15, D-64293 Darmstadt, Germany
[email protected]
Abstract. Traditional workflow models are centered around the structure of a workflow. Other requirements and goals, such as time and cost, can be represented only indirectly. As a result, when the workflow is executed, the WfMS can only enforce the structural constraints. This becomes a problem when the workflow deviates from its normal execution, e.g., due to errors or delays. In this paper we propose an approach to extend the definition of workflows which makes the underlying goals explicit. In addition, we introduce flexible elements into the workflow specification. We make use of these flexible elements in a controlled way to achieve a balanced fulfillment of all goals – structural as well as non-structural – under various runtime conditions.
1 Introduction
Workflow management is a fast evolving technology which is increasingly being exploited by businesses in a variety of industries [1,2,3,4]. Its primary mission is to handle business processes. A workflow is the automation of a business process. Such a business process has certain goals. These are often multidimensional: certain tasks have to be performed, a certain quality has to be achieved, the business process or at least certain activities have to be performed in a certain timeframe, and the process shall not exceed a certain cost. To allow the system to support the goal fulfillment, these goals have to be made explicit. In addition, a sufficient degree of freedom is necessary to actively control the workflow execution towards an optimized goal fulfillment. To create this freedom, we integrate execution alternatives in the form of flexible elements into the workflow structure. This approach to flexibility is especially useful for production workflows as well as cross-organizational workflows, in which manual ad hoc interventions are undesirable as they raise additional organizational problems, such as who is responsible for a new activity or how it affects the autonomy of the involved organizations. System support for adaptation at runtime is also beneficial when the effects of possible execution alternatives on the goals are too complex to allow a user to perform these adaptations interactively.
This work has been partially supported by the ESPRIT project CrossFlow (www.crossflow.org). However, it represents the view of the author.
Fig. 1. A cross-organizational workflow to process a motor damage claim (Process Customer Request, performed by the Call Center and comprising Accept Call, Gather Information, Validate Information and Assign Garage; Estimate Cost, performed by the Garage; Check Cost, by the Coordinator; Independent Inspection, by the Adjuster; Repair, by the Garage; Finalize Claim, by the Insurance)
To motivate our approach we will use an example from the insurance domain. This application has been analyzed in the CrossFlow project [5]. Figure 1 shows a simplified workflow used to process a motor damage claim. The workflow is started with the processing of the customer request, which is performed by a call center. Like each activity in this cross-organizational workflow, this is a subworkflow on its own. The processing of the customer request, for example, consists of four activities. In the first activity the call is accepted. Then, the call center gathers information about the accident, the customer and other involved persons. In the next activity, this information is validated. For example, it is checked whether the customer is properly insured to use this service. In the final activity, the customer is advised about garages which are located nearby. After the processing of the customer request, the garage which has been chosen by the customer becomes active. It has to assess the estimated repair costs. This cost estimate is then checked by an organization which is specialized in the coordination of the processing of motor damage claims. If the estimated costs are above a certain threshold, an independent adjuster is appointed to inspect the damaged car and to re-assess the repair costs. When the independent inspection is finished or it does not need to be performed, the garage repairs the car. As a last step in the workflow, the insurance performs some administrative tasks and thus finalizes the claim. The business goals are not exclusively captured by the sequence of steps represented in Figure 1. In the case of an insurance company an important business goal is to keep the customer satisfied. The processing of an insurance claim is one of the rare occasions where customers are able to judge the performance of their insurance. Therefore, the insurance has to demonstrate that it handles the claim of the customer efficiently. With respect to the workflow execution, this translates to the requirement that the claim processing has to be finished within five days. Statically, this requirement can be taken into account by estimating the execution times of all activities and making sure that on all execution paths the sum of the estimated execution times is less than five days. However, in this approach the deadline is only represented implicitly. As a result, when unexpected
events occur, only the structural constraints can be enforced, i.e., a workflow management system (WfMS) makes sure that all activities are executed in the prescribed order even though the deadline may be violated. Therefore, we call a workflow multidimensional if it allows non-structural goals along other dimensions, such as time or cost, to be represented explicitly. The motor damage claim process is a typical example. On the one hand, the processing of the claim should be cost efficient. On the other hand, it is important for the insurance to execute the process within five days to keep the customer satisfied. A cost-efficient execution implies that in case of a more expensive damage an adjuster is brought into play who inspects the car. Under normal conditions enough time is available for this activity within the desired timeframe of five days. However, if a delay occurs, e.g., due to communication errors among the call center, the insurance and the coordinator, this might not be the case. As a consequence, the runtime priorities can change. To maintain the customer's satisfaction, the insurance might be willing to drop activities like the inspection of the car by an adjuster and instead make sure that the car is repaired in time. This decision also depends on the parameters. For example, it might be desirable to adapt the workflow if the estimated repair cost is only a little in excess of the threshold and, on the other hand, the risk of failing to repair the car in time is high if the repair is delayed until the inspection is finished. However, if the repair cost is very high and the delay only moderate, so that there is a good chance to finish the repair in time, it might be preferable to execute the workflow as planned. To increase the flexibility and allow an adaptation of the workflow to the conditions at runtime, the specification of different execution options is necessary. This involves several aspects. First, alternative execution structures have to be specified. Our approach to do this is described in Section 2. Second, the desired quality of service (QoS) goals have to be provided. They allow the utility of the different execution options to be compared. The specification of QoS goals and their relationship to QoS parameters is described in Section 3. In Section 4 we give a complete example of how our approach is applied to enhance a workflow specification and adapt its execution. Afterwards, we give an overview of the related work and finally some conclusions.
2 Making the Workflow Structure Flexible
To enable different execution options we have to relax the workflow structure. We consider the workflow structure as consisting of two parts. The first part is mandatory. It consists of activities which have to be executed (if the respective enabling conditions evaluate to true) for the workflow to be successful, as well as ordering relationships which have to be obeyed. A second part consists of execution alternatives which can be selected depending on requirements at runtime.
To specify the structure of a workflow, a set of constructors like sequence, or-split, or-join, and-split and and-join has emerged which forms the basis of most workflow models [6]. We use these constructors to specify the overall structure of the workflow and its mandatory parts. To specify execution alternatives, we use so-called flexible elements. A flexible element (FE) is a construct that represents different execution alternatives, i.e., it is a set of different subworkflows. We identify three kinds of flexible elements, namely alternative activities, non-vital activities, and optional execution orders.
Alternative Activities. This flexible element allows different trade-offs with respect to the goals of the workflow to be represented, e.g., a high-quality option which requires a considerable execution time versus a quicker option which produces results with a lower quality. This provides a way to switch among these alternatives depending on the priorities at runtime. It is represented by a list of activities in square brackets. At runtime exactly one of them will be chosen for execution.
Non-vital Activity. This flexible element specifies that an activity can either be executed or omitted, i.e., it is non-vital. Activities are non-vital if they are beneficial under certain conditions but should better be omitted in other situations. Non-vital activities are represented by adding the tag nv to the activity. Non-vital activities can be considered as a special case of alternative activities, if they are modelled as the alternative between the activity itself and an empty activity ∅.
Optional Execution Order. This flexible element can be used if activities should be re-ordered to expedite the execution of urgent activities. It allows a sequence of activities to be specified in a specific order which can be considered as a preferred or default order. However, this order is not mandatory and the activities can alternatively be executed without any restriction on their order, i.e., in parallel. This flexible element is specified by a list of activities in braces.
We call a traditional activity, i.e., a unit of work, an elementary activity. A flexible workflow (FWF) is a workflow in which each activity is either an elementary activity or a flexible element. If no ambiguity occurs, we will use the terms activity and elementary activity interchangeably. Note that execution alternatives in flexible elements are different from or-split/join structures. The decision which successor of an or-split is executed is made based on the evaluation of a predicate and cannot be influenced. In contrast to this, each choice among the alternatives of a flexible element is considered correct, and the decision can be made based on global goals of the workflow. To relate a flexible workflow to the possible traditional workflows that can be derived from it, we define some additional terms. For a flexible element fe, a resolution is a function that maps fe to one specific alternative. For a flexible workflow, a resolution r maps each flexible element to a specific alternative, i.e., each flexible element fe is replaced by r(fe). Let fwf be a FWF and r a resolution function for fwf; then we call rfwf = r(fwf) a resolved flexible workflow (RFWF) of fwf. With R(fwf) we denote the set of all RFWF of fwf.
Fig. 2. Examples for flexible workflows and their resolved flexible workflows: the flexible workflow (S, [A, B], E) resolves to (S, A, E) or (S, B, E); the flexible workflow (S, {A, B}, E) resolves to (S, A, B, E) or to S followed by A and B in parallel followed by E; the flexible workflow (S, Anv, E) resolves to (S, A, E) or (S, E)
Figure 2 shows examples of different flexible workflows and their corresponding RFWFs. In this figure S, E, A, B are elementary activities. The first FWF specifies that all RFWF have to start with S and end with E and in between either A or B is executed. The second example shows an optional execution order, i.e., either A is executed before B or both activities can be executed in parallel. The last example contains the non-vital activity A.
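As an illustration of these definitions (not code from the paper), the following sketch models the three kinds of flexible elements and enumerates R(fwf) for a sequential flexible workflow; all names are assumptions, and the usage example reproduces the first flexible workflow of Fig. 2.

```python
# A minimal sketch (assumed names) of the three kinds of flexible elements and
# of enumerating all resolved flexible workflows R(fwf) of a sequential FWF.

from itertools import product

class Alternative:            # [A, B]: exactly one alternative is chosen
    def __init__(self, *options): self.options = list(options)
    def resolutions(self): return [[o] for o in self.options]

class NonVital:               # A^nv: execute the activity or omit it
    def __init__(self, activity): self.activity = activity
    def resolutions(self): return [[self.activity], []]   # [] stands for the empty activity

class OptionalOrder:          # {A, B}: default sequence, or one unordered (parallel) step
    def __init__(self, *activities): self.activities = list(activities)
    def resolutions(self): return [list(self.activities), [set(self.activities)]]

def resolved_workflows(fwf):
    """Enumerate R(fwf): every combination of choices for the flexible elements."""
    per_element = [e.resolutions() if hasattr(e, "resolutions") else [[e]] for e in fwf]
    return [[a for part in choice for a in part] for choice in product(*per_element)]

fwf = ["S", Alternative("A", "B"), "E"]
print(resolved_workflows(fwf))   # [['S', 'A', 'E'], ['S', 'B', 'E']]
```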
3 Enhancing a Flexible Workflow with QoS Goals
In this section we will explain our approach to model QoS goals and how they are used to enhance a workflow specification. The basic building block of a QoS goal is a QoS parameter. A dimension on which QoS parameters can be defined is called QoS dimension. Examples are time, cost or application specific dimensions like the accuracy of a specific result. A QoS parameter is a specific quantity defined on a QoS dimension. Examples for QoS parameters on the dimension time are the duration of an activity or the duration of the overall workflow. As QoS parameters we consider any property that can be assigned to or derived from a partial or complete workflow instance. A QoS goal is an optimization criterion that is assigned to a QoS parameter. To allow the formulation of QoS goals we will in the following give a classification of QoS parameters.
3.1 Classification of QoS Parameters
Depending on how they are obtained, we distinguish three kinds of QoS parameters.
Elementary QoS Parameters. As elementary QoS parameters we define parameters which can be directly observed, i.e., monitored, from a running workflow instance. Examples are the output parameters of activities, the current time, and the execution time or cost of terminated activities. Elementary QoS parameters form the basis for the measurement of QoS goals.
Derived QoS Parameters. Derived QoS parameters are functions of other QoS parameters, i.e., if qos0 is a derived QoS parameter, then there exists a user-defined function f and a set of QoS parameters qos1, ..., qosn with qos0 = f(qos1, ..., qosn). An example of a derived QoS parameter in the insurance scenario is the expected cost saving by appointing an assessor depending on the repair cost estimated by the garage. A special case of derived QoS parameters are 0-ary functions, i.e., constants. Constants can be considered as external parameters. An example is the cost of the assessor, which is needed to determine the total cost savings by appointing an assessor.
Predicted QoS Parameters. Predicted QoS parameters are quantities which make assertions about the future behavior of a workflow. Predicted QoS parameters require a prediction function. Each prediction is related to a resolved flexible workflow, i.e., a workflow for which decisions for each flexible element have been made.
The dependency relationship among QoS parameters can be described as a graph. We require that this graph does not contain any cycles. For example, there must not be the dependency qos1 = f1(qos2) and at the same time qos2 = f2(qos1). Each of the nodes in this graph can be assigned a QoS goal. Usually, a QoS goal will be assigned to the root of a subgraph that forms a tree.
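The following is a small illustrative sketch (assumed names, not the paper's implementation) of how the three kinds of QoS parameters and their acyclic dependency graph might be represented; the derived-parameter example is the repair saving used later in Section 4.

```python
# Illustrative sketch of elementary, derived and predicted QoS parameters; values
# are obtained bottom-up from monitored data and, for predictions, from a resolved workflow.

class Elementary:                     # monitored directly from the running instance
    def __init__(self, name): self.name = name
    def value(self, monitor, rwf): return monitor[self.name]

class Derived:                        # user-defined function of other QoS parameters
    def __init__(self, fn, *deps): self.fn, self.deps = fn, deps
    def value(self, monitor, rwf):
        return self.fn(*(d.value(monitor, rwf) for d in self.deps))

class Predicted:                      # prediction attached to a resolved flexible workflow
    def __init__(self, fn): self.fn = fn
    def value(self, monitor, rwf): return self.fn(rwf)

# Derived example from the insurance scenario: the expected saving from an inspection.
estimated_repair_cost = Elementary("estimated_repair_cost")
repair_savings = Derived(lambda c: c / 10, estimated_repair_cost)

print(repair_savings.value({"estimated_repair_cost": 3000}, rwf=None))   # 300.0
```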
3.2 Specification of QoS Parameters
The necessary specifications depend on the respective class of QoS parameters. For elementary QoS parameters we have to make sure that the relevant parameters are monitored. This is specified by means of a monitoring condition that defines the parameter that has to be monitored as well as the activity to which it belongs. For the specification of derived QoS parameters we have to provide the necessary function as well as the list of parameters. To allow for predictions, we have to specify a prediction function. The structure, complexity, and specification effort for a prediction function heavily depend on the QoS parameter to be predicted, the complexity of the used prediction model and the available information about the behavior of the workflow. If we want to predict the execution time of a workflow or a part of it, we can model
the statistical behavior of the workflow, i.e., the activities and the control flow conditions. This means that we have to provide probabilities for the activation of control flow edges originating from an OR-split and information about the duration of activities. Depending on the desired accuracy, we can model the duration of activities just by means of the expected duration or provide a complete distribution function. Instead of explicitly specifying the prediction function, we can alternatively derive a prediction function from the observed behavior of earlier workflow executions. In this case, we assume that the future behavior will be similar to the observed behavior in the past. We call the use of existing workflow logs to derive a prediction function offline monitoring. An approach for offline monitoring based on continuous-time Markov chains has been described in [7].
3.3 Specification of QoS Goals
We define a QoS goal as a triple g = (q, f, w) where
1. q is a QoS parameter as defined in the previous section,
2. f is a fulfillment function,
3. w is a weight.
The fulfillment of a goal is specified by means of a function f : QoS parameter → [0, 1]. The fulfillment describes whether the goal is satisfied or not. For a continuous function it describes the expected degree of fulfillment. In the following we focus on two classes of QoS goals called extremity goals and range constraints. A range constraint aims at keeping a QoS parameter within a boundary. For notational simplicity we will interpret a boolean condition containing a QoS parameter as a function of this QoS parameter. In the deterministic case the boolean value true is mapped to 1 whereas the boolean value false is mapped to 0. Thus, an example of a fulfillment function for the QoS parameter total execution time is total execution time < deadline. This boolean condition is a shorthand for the function f(total execution time) = 1 if total execution time < deadline, and 0 otherwise. In the stochastic case the boolean condition represents the expected value of this function. This is equivalent to the probability that the boolean condition is evaluated to true. An extremity goal aims at maximizing or minimizing the QoS parameter. Usually the fulfillment function will be a linear mapping from the range of values which are expected in real-world applications into the interval [0, 1]. For example, consider the minimization of the total execution cost. If we expect that the costs
are always positive and can never exceed 100, then a fulfillment function f could be f(total execution cost) = 1 − total execution cost / 100.
Note that values closer to 1 always indicate a higher fulfillment of the goal. The weight of a goal is used to indicate the relative importance of the goals. It can be an arbitrary positive number.
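A hedged sketch of this goal representation follows (assumed encoding, not the paper's): a goal is a triple of parameter, fulfillment function and weight, shown here with the deterministic range constraint and the linear extremity goal used as examples above; the weight values are likewise assumptions.

```python
# A sketch of one possible encoding of QoS goals g = (q, f, w).

def range_constraint(deadline):
    # deterministic case: the boolean condition is mapped to 1 (true) or 0 (false)
    return lambda total_execution_time: 1.0 if total_execution_time < deadline else 0.0

def minimize_cost(max_cost=100.0):
    # extremity goal: linear mapping of the expected value range into [0, 1]
    return lambda total_execution_cost: 1.0 - total_execution_cost / max_cost

goals = [
    ("total execution time", range_constraint(deadline=5.0), 2),   # weight 2
    ("total execution cost", minimize_cost(), 10),                 # weight 10
]

print(goals[0][1](4.2))    # 1.0  -> a 5-day deadline would be kept
print(goals[1][1](30.0))   # 0.7  -> cheaper executions score closer to 1
```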
3.4 Specification and Optimization of Multidimensional Workflows
We now have the ingredients to define a multidimensional workflow as a flexible workflow which is enhanced with a set of QoS goals. We formally define a multidimensional workflow (MDWF) as an ordered pair (fwf, g) where fwf is a flexible workflow and g = {g1, ..., gn} is a set of QoS goals. We extend definitions for FWFs in a straightforward way to MDWF by applying them to the FWF part of a MDWF. The relevant runtime conditions of a workflow execution are represented in our model by the values of monitored elementary QoS parameters and the values of predicted QoS parameters. For each runtime condition and each choice for the different structural alternatives, i.e., each resolved MDWF rwf, we can calculate the fulfillment of each goal. Using this information we can define the total utility as the weighted sum of the goal fulfillment:
total utility(rwf) := Σgi∈g weight(gi) ∗ fulfillment(gi)
The total utility is calculated for a resolved MDWF, i.e., a workflow for which decisions for each flexible element have been made. To execute a workflow with the maximal total utility possible for specific runtime conditions, we therefore have to make the appropriate choice among the alternatives of the flexible elements. Thus, to optimize the execution of a MDWF mdwf we have to find a resolved MDWF rwf with ∀rwfi ∈ R(mdwf): total utility(rwfi) ≤ total utility(rwf). A straightforward optimization algorithm is therefore to enumerate all possible resolved FWFs which result from a flexible workflow specification, calculate their total utilities and choose the flexibility alternatives according to the resolved flexible workflow with the highest utility. To adapt to possible changes during the workflow execution, this algorithm is executed each time the execution thread encounters a flexible element. More efficient optimization strategies have to depend on the QoS parameters subject to goals and the structure of the workflows. One example for such an optimization algorithm which is used to optimize a temporal and a utility goal is described in [8].
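The enumeration algorithm just described can be sketched as follows (assumed data structures; the more efficient optimization in [8] is not reproduced here):

```python
# A direct sketch of the enumeration algorithm: compute the total utility of every
# resolved workflow and choose the one with the highest value.

def total_utility(rwf, goals, monitor):
    # goals: list of (parameter_fn, fulfillment_fn, weight); parameter_fn evaluates
    # the QoS parameter for this resolved workflow from the monitored values.
    return sum(w * fulfill(param(rwf, monitor)) for param, fulfill, w in goals)

def optimize(resolved_workflows, goals, monitor):
    """Return the resolved workflow that maximizes the weighted goal fulfillment."""
    return max(resolved_workflows, key=lambda rwf: total_utility(rwf, goals, monitor))

# Toy usage: two execution options and a single deadline goal (weight 2).
options = [("with_inspection", 3.2), ("skip_inspection", 2.2)]   # (name, remaining days)
goals = [(lambda rwf, m: m["elapsed"] + rwf[1],          # QoS parameter: total execution time
          lambda t: 1.0 if t < 5.0 else 0.0,             # range constraint: 5-day deadline
          2)]
print(optimize(options, goals, {"elapsed": 2.0})[0])     # -> skip_inspection
```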
4 An Example for the Specification and Enforcement of Multidimensional Workflows
In the following, we consider the simplified workflow for the motor damage claim scenario described in Section 1. Using the primitives presented in Section 2, the flexible workflow structure can be specified as shown in Figure 3.
Fig. 3. The motor damage claim process as a flexible workflow: the sequence Process Customer Request, Estimate Cost, Check Cost, Independent Inspection (non-vital), Repair, Finalize Claim
This workflow has exactly one flexible element, namely the non-vital activity Independent Inspection. Therefore, we have to execute the decision algorithm exactly once, namely immediately after the activity Check Cost has terminated. In the remainder of this section we will describe how the goals of this workflow can be modelled and how these goals influence the execution of the workflow.
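Using the flexible-element sketch from Section 2 (the NonVital class is redefined here so the fragment stands alone), the flexible structure of Fig. 3 could be written as follows; this is an illustration, not the CrossFlow specification syntax.

```python
class NonVital:                      # A^nv: the activity may be executed or omitted
    def __init__(self, activity): self.activity = activity

claim_fwf = [
    "Process Customer Request",
    "Estimate Cost",
    "Check Cost",
    NonVital("Independent Inspection"),   # the only flexible element of Fig. 3
    "Repair",
    "Finalize Claim",
]
# Its two resolved flexible workflows are the sequence with the inspection and the
# sequence without it; the decision is made right after Check Cost terminates.
```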
4.1 Use of Deterministic Predictions
Informally, the first goal of the workflow is to minimize the cost. For the motor damage claim, this translates to the requirement to appoint an assessor to perform an independent inspection if this activity can save cost. The overall cost savings of the independent inspection again depend on the expected reduction of the repair cost as a result of the inspection, and the cost of the inspection itself. The calculation of the total savings is therefore performed as follows. As elementary QoS parameter we need the estimated repair cost provided by the garage as a result of the activity Check Cost. Hence, we have to define an appropriate monitoring condition: Monitor(Check Cost, estimated repair cost). To derive the reduction of the repair cost due to the inspection, we specify an appropriate function. In our example we use repair savings(estimated repair cost) := estimated repair cost / 10.
To calculate the total savings we again specify a derivation function and in addition to this provide an external parameter which describes the cost of the assessment itself: cost assessment := 50
total savings(estimated repair cost) := repair savings(estimated repair cost) − cost assessment, if Independent Inspection is executed, and 0 otherwise.
Since we want to specify a goal on the parameter total savings, we have to define an appropriate fulfillment function. Assume that all repair costs are in [0, 10000]. Then we can define fulfillment1 (total savings) :=
(total savings + cost assessment) / 1000.
Note that total savings could be negative, if the cost of the assessment exceeds the savings in repair cost. To map the fulfillment into the interval [0, 1], we therefore add the cost assessment. We further would like to give this goal a weight of 10. This allows us to define the first goal as goal1 := (total savings, fulfillment1, 10). Our second goal is to execute the workflow within five days. To check this range constraint we need information about the execution time that has already elapsed since the start of the workflow. To obtain this information we specify the following monitoring condition: Monitor(Check Cost, time until end of). To get information about the remaining time until the end of the workflow, we need a prediction function. The signature of such a function for the prediction of the length of the time interval between the end of the activity Check Cost and the end of the workflow would look as follows: remaining time := Predict(resolved workflow, end, Check Cost, end, workflow). Assume that we have provided a prediction function which uses a deterministic model for the activity duration. For this example we further specify the execution times of activities used by the prediction function as duration(Independent Inspection) = 1 day, duration(Repair) = 2 days, and duration(Finalize Claim) = 0.2 days. As a result, the prediction function would return a duration of 1 + 2 + 0.2 days = 3.2 days for the execution option including the inspection activity and 2 + 0.2 days = 2.2 days for the execution option without the inspection activity. The monitored information about the elapsed execution time together with the predicted information about the remaining execution time allows us to specify the derived QoS parameter total execution time as the sum of remaining time and the time until end of the activity Check Cost. We can now specify the fulfillment function as fulfillment2(total execution time) := total execution time < 5 days
and the second goal (using a weight of 2) as goal2 := (total execution time, fulfillment2 , 2).
Fig. 4. The relationship of QoS parameters: total execution time is derived from elapsed time and remaining time; total savings is derived from cost assessment and repair savings, which in turn depends on estimated repair cost
Figure 4 shows the two trees that result from the dependency relationship of the QoS parameters. Now we will consider the different decisions depending on the parameters. First, consider the total savings. For a workflow without an inspection, the fulfillment of this QoS parameter always has the value of 0.05. For a workflow which performs an independent inspection, the value of this QoS parameter linearly increases with the repair cost. For a repair cost below 500, its fulfillment value is below 0.05. In this case, the inspection will always be omitted as its contribution to the total utility is inferior to the option to omit it. For repair costs above 500, we have to make a trade-off with the second goal. Considering the second goal, we see that the goal is always fulfilled if the elapsed execution time at the end of the activity Check Cost is less than 1.8 days. In this case, both execution options make sure that the workflow is finished in time. On the other hand, if the elapsed time is more than 2.8 days, none of the execution options will allow the workflow to be finished in time. As a result, for these two cases, the fulfillment of the second goal is independent of the chosen structural alternative. The interesting case occurs if the elapsed time is between 1.8 and 2.8 days. In this case, the execution of the inspection results in missing the deadline whereas omitting the inspection allows the deadline to be kept and thus increases the total utility by 2. Therefore, the decision depends on the estimated repair cost. From the calculation of goal1 we see that the increase of the contribution to the total utility resulting from the execution of the inspection is equal to 2 for an estimated repair cost of 2500. Hence, if the elapsed time is between 1.8 and 2.8 days we will execute the inspection if the estimated repair cost is above 2500. If it is below, we will skip the inspection. In case the estimated repair cost is exactly 2500, both options have the same utility and a random choice can be made.
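The following sketch re-derives this decision rule numerically under the stated assumptions (weights 10 and 2, the cost parameters from above, remaining times of 3.2 and 2.2 days); the function names are illustrative, not from the paper.

```python
# A runnable re-derivation of the Section 4.1 decision rule.

def total_utility(inspect, elapsed, estimated_repair_cost):
    # goal 1: total savings, fulfillment = (total savings + 50) / 1000, weight 10
    total_savings = estimated_repair_cost / 10 - 50 if inspect else 0.0
    u1 = 10 * (total_savings + 50) / 1000
    # goal 2: 5-day deadline; remaining time 3.2 days with, 2.2 days without inspection
    total_time = elapsed + (3.2 if inspect else 2.2)
    u2 = 2 * (1.0 if total_time < 5 else 0.0)
    return u1 + u2

def decide(elapsed, cost):
    return "inspect" if total_utility(True, elapsed, cost) > total_utility(False, elapsed, cost) else "skip"

print(decide(1.0, 600))    # inspect: both options meet the deadline and cost > 500
print(decide(2.0, 2000))   # skip:    between 1.8 and 2.8 days, cost below 2500
print(decide(2.0, 3000))   # inspect: cost above 2500 outweighs missing the deadline
```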
4.2 Use of Stochastic Predictions
Let us now consider a slightly modified example. The modification consists of the statistical model for the prediction function. In the following, we will examine the case where the prediction function provides us with a probability distribution for the remaining execution time. Therefore, if the elapsed time when the activity Check Cost is terminated is x, the prediction function provides us with the conditional probabilities p1 = P(remaining time < 5 − x | Independent Inspection executed), p2 = P(remaining time < 5 − x | Independent Inspection skipped). These probabilities constitute the fulfillment values of the second goal for the different execution options depending on the time x at the point of decision. As a result, the decision procedure has to be generalized. Due to the use of probabilities, the expected increase in the utility of the second goal by means of skipping the inspection is (p2 − p1) ∗ 2, where (p2 − p1) is the increase in fulfillment and 2 the weight factor. Note that the technique described in Section 4.1 can be considered as a special case. If we have the same fixed execution times for activities as in Section 4.1, we would have p2 = p1 = 0 for x > 2.8 and p2 = p1 = 1 for x < 1.8 and thus a utility of 0. For 1.8 ≤ x ≤ 2.8 we have p2 = 1 and p1 = 0 and thus a utility of 2. However, this generalization allows us to get more precise results in case of non-deterministic activity durations. To make a decision, we have to compare the gain in utility of the second goal with a possible reduction in utility of the first goal if we skip the inspection. As discussed in Section 4.1, the fulfillment value of the first goal is 0.05 if the inspection is skipped and estimated repair cost / 10000 if the inspection is performed. Therefore, the decrease in utility resulting from skipping the inspection is (estimated repair cost / 10000 − 0.05) ∗ 10. If we combine these two results, we see that it is beneficial to skip the inspection if
(p2 − p1) ∗ 2 > (estimated repair cost / 10000 − 0.05) ∗ 10.   (1)
To make the example more concrete, let us assume that the durations of the activities ai follow a normal distribution N(µi, σi²) with mean µi and variance σi². We still use the same expected values for the duration as in Section 4.1, but we add a positive variance. For this example we specify the execution times as duration(Independent Inspection) = N(1, 0.25), duration(Repair) = N(2, 0.25), and duration(Finalize Claim) = N(0.2, 0.01), e.g., the duration of the Independent Inspection has a normal distribution with mean 1 and variance 0.25 and thus a standard deviation of 0.5. Since the remaining workflow is a sequence, we know that the duration of the remaining workflow is the sum of the individual durations of the activities. Therefore, we can make use of the summation property of normal distributions (see for example [9]):
Theorem 1. Let X1, X2, ..., Xn be n independent random variables having normal N(µ1, σ1²), N(µ2, σ2²), ..., N(µn, σn²) distributions, respectively. Then Y = X1 + X2 + ··· + Xn is normally distributed with mean µ1 + µ2 + ··· + µn and variance σ1² + σ2² + ··· + σn².
Therefore, we know that the remaining execution time of the workflow after the activity Check Cost has terminated is distributed N(3.2, 0.51) if the inspection is performed and distributed N(2.2, 0.26) if the inspection is omitted. In the following, we will determine the advantageous execution structure for the situation that exactly two days have elapsed when we have to make the decision whether to execute the activity Independent Inspection or not. By using the cumulative distribution function for normally distributed random variables we can calculate that p1 = P(remaining time < 5 − 2 | Independent Inspection executed) = 0.390, p2 = P(remaining time < 5 − 2 | Independent Inspection skipped) = 0.942. This means that we will keep the deadline with a probability of 39.0% if we perform the inspection and with a probability of 94.2% if we skip the inspection. Therefore, the expected increase in the utility of the second goal by means of skipping the inspection is (0.942 − 0.390) ∗ 2 = 1.104. From (1) we can calculate that in this case it is beneficial to skip the activity Independent Inspection exactly if the estimated repair costs are below 1604.
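Under the stated normality and independence assumptions, the probabilities and the threshold of 1604 can be reproduced with the standard normal CDF; the following sketch (illustrative only, using just the standard library) does exactly that.

```python
# Reproducing the Section 4.2 numbers with the standard normal CDF via math.erf.

from math import erf, sqrt

def normal_cdf(x, mean, var):
    return 0.5 * (1.0 + erf((x - mean) / sqrt(2.0 * var)))

elapsed = 2.0                       # days already spent when Check Cost terminates
budget = 5.0 - elapsed              # time left until the 5-day deadline

p1 = normal_cdf(budget, 3.2, 0.51)  # remaining time with the inspection
p2 = normal_cdf(budget, 2.2, 0.26)  # remaining time without the inspection
print(round(p1, 3), round(p2, 3))   # 0.39 0.942

# Inequality (1): skip the inspection iff (p2 - p1) * 2 > (cost/10000 - 0.05) * 10,
# i.e. for all estimated repair costs below the following threshold:
threshold = 10000 * ((p2 - p1) * 2 / 10 + 0.05)
print(round(threshold))             # ~1604
```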
5 Related Work
The idea to incorporate additional constraints into workflow specifications has especially drawn attention with respect to temporal goals. However, a common feature of all these approaches is that they mainly check whether a temporal constraint is fulfilled or not, but do not actively apply this knowledge to adapt the workflow. In addition to this, all deadlines are considered as hard, i.e., there is no possibility to violate them in certain situations. Closest to our approach is the work of Panagos and Rabinovich [10,11,12]. They impose a deadline on a workflow execution and check at runtime whether the deadline will be kept or not. They transform the overall deadline into individual deadlines for each activity and propose different criteria to decide when a workflow is considered late. Depending on the delay of a workflow and the cost to abort the workflow at different stages, the workflow instance is either allowed to continue or aborted. The approach is similar to our work in the sense that they check the expected goal fulfillment at runtime and react to it. However, the only reaction policy in case of a late workflow is to abort the execution, whereas we aim at a graceful service degradation by adapting the workflow. Marjanovic and Orlowska [13,14] propose an algorithm which is a generalization of a shortest path algorithm and the critical path method to calculate the maximal and minimal duration of a workflow. This algorithm is applied
for verification purposes to find out whether deadlines are kept for all possible workflow instances, which can occur for different decisions in or-splits. They further extend this approach to check whether different temporal constraints are inconsistent. Other approaches for calculating temporal properties and specifying temporal constraints of workflows can be found in [15,16]. In [15] a strategy is proposed to ensure at runtime that a specific kind of temporal constraint, namely upper and lower bounds on the temporal distance of two activities, is fulfilled. In case of an upper bound, the first activity involved in the constraint is delayed and in case of a lower bound, the second activity is delayed. However, this approach does not help in situations where it is necessary to accelerate the execution instead of delaying it. Flexible workflows are a very active research area and since the seminal work of Ellis et al. [17] numerous approaches have been developed. See for example [18,19] for an overview. However, the goal of these approaches is to allow manual changes to the workflow structure. These can be either ad hoc changes to react to problems at runtime or evolutionary changes which are usually the result of a business process reengineering effort. Problems considered in this research area are for example the transition of running workflow instances to the new structure [17] and techniques to ensure the consistency of the resulting workflow, especially with respect to dataflow dependencies [20]. The only other approach known to the author that automatically derives the necessary changes from the runtime conditions is the work of Müller and Rahm [21]. They describe an approach using logic-based rules. These rules decide when an adaptation to running workflow instances is necessary and how the workflow structure has to be adapted. Hence, the resulting specification can be viewed as a rule-based workflow specification in which the possible execution paths and the conditions under which they are taken are represented as rules. In contrast to this, in our approach the workflow is adapted to a set of goals. An approach to optimize and speed up the execution of workflows is described in [22]. The idea of this approach is to separate the preconditions for the start of an activity into data flow and control flow parts. Whereas an activity is enabled to start when the corresponding control flow condition is fulfilled, it is ready to start when the necessary input data is available. This makes it possible to start activities that have no side effects immediately after all input data is available. This can be the case before the control flow condition can be evaluated. If the control flow condition eventually turns out to be false, the activity is simply aborted. Although this is an interesting approach to speed up a workflow execution, it is limited to special cases and in many situations more severe modifications are necessary to ensure that a deadline is kept. Other optimization techniques are more oriented towards the appropriate assignment of resources and tuning the underlying system components to the application requirements. An example can be found in [23].
6 Conclusion
In this paper we have developed an approach towards multidimensional workflows. This approach makes the workflow structure flexible in a controlled way. It further allows the underlying goals of a workflow to be modelled and provides a way to systematically make use of the flexibility to adapt the workflow execution for an optimized goal fulfillment. As an initial step towards optimized decision making we have presented an optimization algorithm which compares all possible execution alternatives. We are currently working on more efficient optimization strategies. These depend on the QoS parameters subject to goals and the structure of the workflows. One example for such an optimization algorithm which is used to optimize a temporal and a utility goal is described in [8]. In addition, we have started to implement our approach for controlled flexibility within the CrossFlow demonstrator. The demonstrator system uses MQSeries Workflow from IBM [24] (the former FlowMark) as its underlying Workflow Management System. An overview of the components necessary to realize multidimensional workflows, as well as of how flexible elements are mapped to the process model offered by MQSeries Workflow, can be found in [8].
Acknowledgements
The author would like to thank Karl Aberer, Thomas Tesch, Ralph Busse, Yigal Hoffner and Paul Grefen for their helpful comments.
References
1. D. Georgakopoulos, M. Hornick, and A. Sheth. An overview of workflow management: From process modeling to workflow automation infrastructure. Distributed and Parallel Databases, 3(2), April 1995.
2. A. Sheth, D. Georgakopoulos, S. Joosten, M. Rusinkiewicz, W. Scacchi, J. Wileden, and A. Wolf. Report from the NSF workshop on workflow and process automation in information systems. ACM SIGMOD Record, 25(4):55–67, 1996.
3. A. Cichocki, S. Helal, M. Rusinkiewicz, and D. Woelk. Workflow and Process Automation – Concepts and Technology. Kluwer Academic Publishers, 1998.
4. F. Leymann and D. Roller. Production Workflow – Concepts and Techniques. Prentice-Hall, 2000.
5. S. Browne and M. Kellet. CrossFlow Deliverable D1a: Insurance (Motor Damage Claims) Scenario. Report, Esprit Project No. 28635, January 1999.
6. WfMC. Workflow standard – Interface 1: Process definition interchange process model. Technical Report WFMC-TC-1016-P, Workflow Management Coalition, November 1998. Version 7.04.
7. J. Klingemann, J. Wäsch, and K. Aberer. Deriving service models in cross-organizational workflows. In Proc. of RIDE – Information Technology for Virtual Enterprises, pages 100–107, Sydney, Australia, March 1999.
8. J. Klingemann. CrossFlow Deliverable D8a: Flexible Change Control. Report, Esprit Project No. 28635, January 2000.
9. A. O. Allen. Probability, Statistics and Queueing Theory. Academic Press Inc., second edition, 1990.
10. E. Panagos and M. Rabinovich. Reducing escalation-related costs in WFMSs. In Dogac et al. [25].
11. E. Panagos and M. Rabinovich. Predictive workflow management. In Proc. of the third Int. Workshop on Next Generation Information Technologies and Systems (NGITS), Neve Ilan, Israel, June 1997.
12. E. Panagos and M. Rabinovich. Escalations in workflow management systems. In Proc. of the Workshop on Databases: Active and Real-Time (DART), Rockville, Maryland, November 1996.
13. O. Marjanovic and M. E. Orlowska. On modeling and verification of temporal constraints in production workflows. Knowledge and Information Systems, 1(2):157–192, May 1999.
14. O. Marjanovic and M. E. Orlowska. Time management in dynamic workflows. In Proc. of the second Int. Symposium on Cooperative Databases and Applications (CODAS'99), Wollongong, Australia, March 1999.
15. J. Eder, E. Panagos, and M. Rabinovich. Time constraints in workflow systems. In Proc. of the 11th Int. Conference on Advanced Information Systems Engineering, pages 286–300, Heidelberg, Germany, June 1999.
16. H. Pozewaunig, J. Eder, and W. Liebhart. ePERT: Extending PERT for workflow management systems. In Proc. of the first East-European Symposium on Advances in Database and Information Systems (ADBIS), pages 217–224, St. Petersburg, Russia, 1997.
17. C. Ellis, K. Keddara, and G. Rozenberg. Dynamic change within workflow systems. In Proc. of the Conf. on Organizational Computing Systems, pages 10–21, Milpitas, California, August 1995.
18. M. Klein, C. Dellarocas, and A. Bernstein, editors. Proceedings of the CSCW-98 Workshop Towards Adaptive Workflow Systems, Seattle, Washington, November 1998. http://ccs.mit.edu/klein/cscw98.
19. A. Sheth. From contemporary workflow process automation to adaptive and dynamic work activity coordination and collaboration. In Proc. of the DEXA Workshop on Workflow Management in Scientific and Engineering Applications, pages 24–27, Toulouse, France, September 1997.
20. M. Reichert and P. Dadam. ADEPTflex – supporting dynamic changes in workflows without losing control. Journal of Intelligent Information Systems, 10(2):93–129, 1998.
21. R. Müller and E. Rahm. Rule-based dynamic modification of workflows in a medical domain. In Proc. of the GI-Fachtagung Datenbanksysteme in Büro, Technik und Wissenschaft, pages 429–448, Freiburg, Germany, March 1999.
22. R. Hull, F. Llirbat, J. Su, G. Dong, B. Kumar, and G. Zhou. Adaptive execution of workflow: Analysis and optimization. Working paper, Bell Labs, 1999.
23. D. Roller. Performance prediction and optimization in workflow-based applications. In Proc. of the sixth Int. Workshop on High Performance Transaction Systems, Asilomar, California, September 1995.
24. IBM. MQSeries Workflow: Concepts and Architecture, 1999. Version 3.2, GH126285.
25. A. Dogac, L. Kalinichenko, T. Özsu, and A. Sheth, editors. Workflow Management Systems and Interoperability. NATO-ASI Series. Springer Verlag, 1998.
A Formal Model for Business Process Modeling and Design
Manolis Koubarakis¹ and Dimitris Plexousakis²
¹ Dept. of Electronic and Computer Engineering, Technical University of Crete, 73100 Chania, Crete, Greece
[email protected]
² Dept. of Computer Science, University of Crete, 71305 Heraklion, Crete, Greece
[email protected]
Abstract. We present a formal framework for representing enterprise knowledge. The concepts of our framework (objectives and goals, roles and actors, actions and processes, responsibilities and constraints) allow business analysts to capture enterprise knowledge in a way that is both intuitive and mathematically formal. We also outline the basic steps of a methodology that allows business analysts to produce detailed, formal specifications of business processes from high-level enterprise objectives. The use of a formal language permits us to verify that the specifications possess certain correctness properties, namely that the responsibilities assigned to roles are fulfilled and that the constraints are maintained.
1 Introduction
The problem of representing, analysing and managing knowledge about an organisation and its processes has always been very important. Recently, management and computer science researchers have debated the use of information technology for tackling this complex problem [10,22,9,11,33]. Ultimately this community is interested in improving the understanding of organisations and their processes, facilitating process design and analysis and supporting process management. The topic is also of great practical importance to industry as an aid to designing organisational structures, processes and IT infrastructure that achieve business goals in an efficient and flexible way. A specific area of interest is in deriving, checking and improving business process definitions used as input to workflow systems. In this paper we present a formalism that can be used to represent knowledge about organisations and their business processes. Motivated by F3 [19] and EKD [12,2] we develop an enterprise model which consists of five interconnected submodels (organisational submodel, objectives and goals submodel, process submodel, concepts submodel and constraints submodel) that can be used to describe formally different aspects of an organisation. However, unlike
these projects, our framework emphasises formality and advocates the use of situation calculus [20,27] and the concurrent logic programming language ConGolog [5] for representing knowledge about organisations and their processes. In this respect, we continue the work of one of us [24,25] (but also [33,15]) who suggested using these formal tools to model organisations. Early steps towards the development of our model were presented in [13]. Creating an enterprise model can be instructive in itself, revealing anomalies, inconsistencies, inefficiencies and opportunities for improvement. Once it exists it is a valuable means for sharing knowledge within the enterprise. It can also be used to formulate and evaluate changes. The knowledge sharing role also extends to the enterprise's IT infrastructure. It is in principle possible, for example, to extract process definitions to be input to a workflow management system. Furthermore, it would be possible for business process support software to query the enterprise model to find out who is fulfilling a given role in a given process. Formal enterprise models, such as ours, are ones in which concepts are defined rigorously and precisely, so that mathematics can be used to analyze, extract knowledge from, and reason about them. An advantage of formal models is that they are self-consistent and have certain properties. For instance, one can prove formally that responsibilities assigned to roles are fulfilled, and constraints are maintained as a result of process execution. A few words about our representational framework are in order here. We will represent enterprise knowledge using an extension of the formalism of situation calculus [20,27]. This formalism has been designed especially for knowledge representation and reasoning in dynamically evolving domains. Technically, our basic tool will be a many-sorted first-order language L which is defined in the following way. The logical symbols of L include parentheses, a countably infinite set of variables, the equality symbol = and the standard sentential connectives. The remaining machinery of L (sort, predicate and function symbols) will be defined in Sections 2, 3 and 4 where intuitive modeling concepts will need to be formalised. The rest of this paper is structured as follows. Sections 2, 3 and 4 present our enterprise model. We then sketch a methodology that enables business analysts to go from high-level enterprise objectives to detailed formal specifications of business processes for realizing these objectives. Finally, section 6 discusses related work and presents our conclusions.
2 Organisational and Goal Modeling
In this section we initiate the presentation of the five submodels making up our enterprise modeling framework. Throughout the paper we will demonstrate the features of our proposal by considering an imaginary Computer Science department DEPT as our enterprise. We assume that this department has so far no postgraduate program, and it is now considering the development of processes for the admission and education of postgraduate students. The first submodel is the organisational submodel with main concepts actor and role. An actor is a person or a software/hardware system in the context of the organisation we are modeling (e.g., an employee, a customer, a printer etc.).
Actors are distinguished into human and automated ones. Actors are capable of executing certain activities, but they might not be capable of executing others. An organisational role involves a set of responsibilities and actions carried out by an actor or a group of actors within an organisation [23,6]. Organisational roles can take many forms [23]: a unique functional group (e.g., Systems Department), a unique functional position (e.g., Managing Director), a rank or job title (e.g., Lecturer Grade A), a replicated functional group (e.g., Department), a replicated functional position (e.g., Director), a class of persons (e.g., Customer) or an abstraction (e.g., Progress Chasing). Role instances are acted out by actors. Different actors can play different roles at different moments of time (e.g., today the Managing Director can be John Smith, tomorrow it can be Tony Bates). Many instances of the same role can be active at any moment in time. The concepts introduced above can be defined formally by introducing appropriate constructs of L (e.g., unary predicates Actor, HumanActor, AutomatedActor and Role, and binary predicate PlaysRole) and writing axioms that capture their semantics. The second component of our enterprise model is the objectives and goals submodel. The central concept in this submodel is an enterprise goal. An enterprise goal is a desired state of affairs [19,14,22,33,12]. Examples of enterprise goals are the following: "all customer enquiries are answered within one day", "profits are maximised" and so on. In our framework goals are associated with the following components of other submodels:
– Roles and actors (organisational submodel). Goals are assigned to roles as a matter of policy by the organisation. Organisational goals become responsibilities of roles and the actors playing these roles.
– Processes (process submodel). The purpose of a process is the achievement of one or more goals. For example, the process of managing project X might have the purpose of achieving the goal "project X is completed successfully".
– Entities (concepts submodel). Every goal refers to certain enterprise entities. For example, the goal "two C++ programmers should be hired by the Systems Department" refers to entities "Systems Department" and "C++ programmer".
Explicit capturing of enterprise goals is important because it allows us to study organisations and their processes from an intentional point of view [32]. For example, this enables us to represent not only "what" information (e.g., what sub-processes form a process) as in standard process representations, but also "why" information (e.g., why a specific activity is done). When goals are combined with other intentional concepts like actors and roles, we are also enabled to represent "who" information (e.g., "who is responsible for bringing about a state of affairs").
2.1 Enterprise Goals
Organisational goals can be reduced into alternative combinations of subgoals [4,31,14,19,12,2] by using AND/OR goal graphs originally introduced in the area
of problem solving [7]. For example, the goal "our sales targets are achieved" can be AND-reduced to the two goals "our sales targets for product A are achieved" and "our sales targets for product B are achieved". We utilise the notion of goal reduction to define the concept of objective. An organisational objective is a goal that does not present itself through goal reduction. In other words, an objective is a top-level goal; it is an end desired in itself, not a means serving some higher-level end [18]. Goals can conflict with each other [4,31,19,29]. In our framework goals G1, . . . , Gn conflict if they cannot be satisfied simultaneously given our knowledge about the enterprise [29]. Goals can also influence other goals positively or negatively [21,31,19]. Such interactions between goals must be noted explicitly to facilitate goal-based reasoning (see Section 5).
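To make the intended use of such a goal graph concrete, the following sketch (in Python, entirely our own illustrative encoding and not part of the formal framework; the goal identifiers G0, G0.1, G0.2 are hypothetical) records AND/OR reductions, derives the objectives as the top-level goals, and keeps explicitly noted conflicts:

from dataclasses import dataclass, field
from typing import List, Set, Tuple

@dataclass
class Goal:
    name: str        # e.g. "G1"
    statement: str   # informal description of the desired state of affairs
    and_subgoals: List["Goal"] = field(default_factory=list)  # all must hold
    or_subgoals: List["Goal"] = field(default_factory=list)   # alternative reductions

def objectives(goals: List[Goal]) -> List[Goal]:
    # An objective is a top-level goal: no other goal reduces to it.
    reduced = {id(s) for g in goals for s in g.and_subgoals + g.or_subgoals}
    return [g for g in goals if id(g) not in reduced]

conflicts: Set[Tuple[str, str]] = set()   # pairs of goals that cannot hold simultaneously

# Example: AND-reduce the sales goal mentioned above into per-product goals.
sales = Goal("G0", "our sales targets are achieved")
g_a = Goal("G0.1", "our sales targets for product A are achieved")
g_b = Goal("G0.2", "our sales targets for product B are achieved")
sales.and_subgoals += [g_a, g_b]
print([g.name for g in objectives([sales, g_a, g_b])])   # -> ['G0']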
2.2 Defining Goals Formally
Organizational goals can be described formally or informally. Organisational objectives and other high-level goals are usually difficult to formalise. These goals should be described only informally, and reduced step by step to more concrete and formal goals. Appropriate formal concepts and tools for assisting goal reduction (in the context of requirements modeling) are discussed in [4]. Because a goal is a desired state of affairs, many concrete and formal goals can be formalised as sentences of L, as demonstrated by the following example.

Example 1. The operational goal "enquiries are answered by a member of staff as soon as they are received" can be formalised by the following sentence of L:

(∀a)(∀e)(∀x)(∀s)(∀s′) (Staff(a) ∧ Enquiry(e) ∧ Action(x) ∧ Situation(s) ∧ Situation(s′) ∧ Received(e, a, s) ∧ s′ = Do(x, s) ⊃ Answered(a, e, s′))

In the above sentence the predicates have the obvious meaning and s′ = Do(x, s) means that s′ is the situation (i.e., state) resulting from the execution of action x in situation s. The sentence can be read as "any situation in which a member of staff receives an enquiry gives rise to an action that causes the enquiry to be answered by that member of staff". Note also that the use of a formal language forces one to be very precise and dispense with informal concepts such as "as soon as".
3
The Process Submodel
A complete process model should allow representation of “what is going to be done, who is going to do it, when and where it will be done, how and why it will be done, and who is dependent on its being done” [3]. Our process model allows one to answer five of these seven questions. We do not include a spatial attribute for processes and we do not consider dependencies explicitly [32]. The main concepts of the process submodel are: action, process, role, actor and goal. The process submodel is connected to the organisational submodel
through the concepts of actor and role. All actions carried out as part of a process are executed in the context of an organisational role by an actor playing that role. In this respect we have been inspired by the Role-Activity diagrams of [23]. The process submodel is also closely related to the objectives and goals submodel: processes are operationalisations of organisational goals [1].
3.1 Primitive and Complex Actions
Our process submodel is built around the concepts of the situation calculus [20,27] and the concurrent logic programming language ConGolog [5]. The situation calculus is a first-order language for representing dynamically evolving domains. A situation is a state of affairs in the world we are modeling. Changes are brought about in situations as results of actions performed by actors. Actions are distinguished into primitive and complex. Usually an action is considered to be primitive if no decomposition will reveal any further information which is of interest.

To deal with these new concepts, we enrich our language L with a sort Action for actions and a sort Situation for situations. Actions are denoted by first-order terms, e.g., SendOfferLetter(act, app). For an action α and a situation s, the term Do(α, s) denotes the situation that results from the execution of action α in situation s. Relations whose truth values may differ from one situation to another are called fluents. They are denoted by predicate symbols having a situation term as their last argument. Primitive actions are introduced formally by expressions of the following form:

action α
  precondition φ1
  effect φ2
endAction
where α is an action, and φ1, φ2 are formulas of L.

Example 2. The following expression defines the action of forwarding an application app by actor act1 to actor act2:

action ForwardApp(act1, act2, app)
  precondition Has(act1, app)
  effect Has(act2, app) ∧ ¬Has(act1, app)
endAction
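To make the intended reading of such a specification concrete, here is a minimal sketch (Python, our own illustrative encoding rather than the formal situation-calculus semantics), in which a situation is represented by the set of ground fluents that hold in it; the actor and application names are hypothetical:

def forward_app(situation, act1, act2, app):
    # Primitive action ForwardApp(act1, act2, app) from Example 2.
    if ("Has", act1, app) not in situation:          # precondition Has(act1, app)
        raise ValueError("precondition violated")
    # effect: Has(act2, app) and not Has(act1, app)
    return (situation - {("Has", act1, app)}) | {("Has", act2, app)}

s0 = {("Has", "secretary1", "app42")}
s1 = forward_app(s0, "secretary1", "tutor1", "app42")   # plays the role of Do(ForwardApp(...), s0)
print(("Has", "tutor1", "app42") in s1)                 # True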
Our framework permits the recursive definition of complex actions (simply actions from now on) by adopting the syntax and semantics of ConGolog [5]:
– Primitive actions are actions.
– The special action of doing nothing is an action and is denoted by noOp.
– Sequencing. If α1, α2 are actions, then α1 ; α2 is the action that consists of α1 followed by α2.
– Waiting for a condition. If φ is a formula of L then φ? is the action of waiting until condition φ becomes true.
– Non-deterministic choice of actions. If α1, α2 are actions, then α1 | α2 is the action consisting of non-deterministically choosing between α1 and α2.
– Non-deterministic choice of action parameters. If α is an action, then Πx(α) denotes the non-deterministic choice of parameter x for α.
– Non-deterministic iteration. If α is an action, then α∗ denotes performing α sequentially zero or more times.
– Conditionals and iteration. If α1, α2 are actions, then if φ then α1 else α2 defines a conditional and while φ do α1 defines iteration.
– Concurrency. If α1, α2 are actions, then α1 ∥ α2 is the action of executing α1 and α2 concurrently.
– Concurrency with different priorities. If α1, α2 are actions, then α1 ⟩⟩ α2 denotes that α1 has higher priority than α2, and α2 may only execute when α1 is done or blocked.
– Non-deterministic concurrent iteration. If α is an action, then α∥ denotes performing α concurrently zero or more times.
– Interrupts. If x is a list of variables, φ is a formula of L and α is an action, then x : φ → α is an interrupt. If control arrives at an interrupt and the condition φ is true for some binding of the variables, then the interrupt triggers and α is executed for this binding of the variables. Interrupts are very useful for writing reactive processes.
– Procedures. Procedures are introduced with the construct proc β(x) endProc. A call to this procedure is denoted by β(x).

Examples of complex actions are given in Figures 1 and 2 (see Section 5).
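The sketch below (Python again, reusing forward_app and s0 from the previous sketch) gives a deliberately simplified executable reading of a few of these constructs: a program is a function from a situation to the situations in which it may terminate. It is our own approximation, not ConGolog's actual transition semantics, and concurrency and interrupts are omitted:

def prim(action):                 # wrap a primitive action
    def run(s):
        try:
            yield action(s)
        except ValueError:        # action not executable in s: no successor situation
            return
    return run

def seq(p1, p2):                  # p1 ; p2
    def run(s):
        for s1 in p1(s):
            yield from p2(s1)
    return run

def choice(p1, p2):               # p1 | p2
    def run(s):
        yield from p1(s)
        yield from p2(s)
    return run

def test(phi):                    # phi?
    def run(s):
        if phi(s):
            yield s
    return run

def star(p, bound=3):             # p*  (bounded here to keep the sketch finite)
    def run(s):
        yield s
        if bound > 0:
            for s1 in p(s):
                yield from star(p, bound - 1)(s1)
    return run

# Forward the application and then test that the tutor has it.
program = seq(prim(lambda s: forward_app(s, "secretary1", "tutor1", "app42")),
              test(lambda s: ("Has", "tutor1", "app42") in s))
print(list(program(s0)))          # one terminating situation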
3.2 Categories of Actions
We distinguish actions into causal and knowledge-producing. Causal actions change the state of affairs in the enterprise we are modeling (e.g., the action of forwarding an application form). Knowledge-producing actions do not change the state of the enterprise but rather the mental state of the enterprise actors (e.g., a perceptual or a communicative action) [28,16]. It is known that knowledge-producing actions can be defined in the situation calculus formalism [28,16]. Finally, actions can be exogenous. This concept corresponds to the notion of external event in other process frameworks. Exogenous actions are necessary in an enterprise modeling framework since they allow us to "scope" our modeling and consider certain parts of the enterprise (or its environment) as being outside of the area we are modeling. Exogenous actions can also be handled by the situation calculus formalism [5].
3.3 Business Processes
A business process can now be informally defined as a network of actions performed in the context of one or more organisational roles in pursuit of some goal. Formally, a business process is defined by an expression of the following form:
process id
  purpose goals
  RoleDefs
endProcess
where id is a process identifier, goals is a list of goals (separated by commas) and RoleDefs is a sequence of statements defining roles and their local ConGolog procedures. The purpose statement in a process definition introduces the purpose of a process, i.e., the organisational goals achieved by the process. The concept of purpose captures why a process is done [3]. Processes are distributed among organisational roles and ConGolog procedures are used to capture the details of a process. Roles and their procedures are defined by expressions of the following form:

role id
  responsibility resps
  ProcedureDefs
endRole
where id is a role identifier, resps is a list of goals (separated by commas) and ProcedureDefs is a set of ConGolog procedures. The responsibility statement declares that role id is responsible for achieving the goals in list resps. Examples of role definitions are given in Figures 1 and 2 (see Section 5). Our formal framework permits the detection of conflicts that may arise due to the presence of multiple roles or the association of multiple procedures with a single role. This and other cases of incomplete or incorrect process specifications can be detected using the machinery presented in Section 5.3.
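In a supporting tool, such process and role definitions could be held as simple nested records; the following Python sketch is our own, and the identifiers in the example (process name, procedure bodies) are hypothetical placeholders:

from dataclasses import dataclass, field
from typing import Dict, List

@dataclass
class RoleDef:
    role_id: str
    responsibilities: List[str]                                # goal identifiers
    procedures: Dict[str, str] = field(default_factory=dict)   # name -> ConGolog text, kept opaque here

@dataclass
class ProcessDef:
    process_id: str
    purpose: List[str]                                         # goals achieved by the process
    role_defs: List[RoleDef] = field(default_factory=list)

admission = ProcessDef(
    process_id="PostgraduateAdmission",
    purpose=["G1"],
    role_defs=[RoleDef("Tutor", ["G1"], {"main": "app : Has(self, app) -> ..."}),
               RoleDef("Secretary", ["G1"])],
)
print([r.role_id for r in admission.role_defs])                # ['Tutor', 'Secretary']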
4
The Concepts and Constraints Submodels
The concepts submodel contains information about enterprise entities, their relationships and attributes. Information in this submodel is formally expressed by sentences of L using appropriate predicate and function symbols (e.g., for our DEPT enterprise a predicate Has(act, app) might be used to denote that actor act has application app). Enterprise data are part of this submodel. The constraints submodel is used to encode restrictions imposed on the enterprise. Constraints can be formally expressed by sentences of L using the machinery of the situation calculus and the symbols defined in the rest of the submodels. Constraints can be static (i.e., referring to a single situation) or dynamic (i.e., referring to more than one situation) [25]. An example of a static constraint is given in Example 3 (Section 5.3).
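For illustration only (these two sentences are our own and are not taken from the case study, and the fluents are written here with an explicit situation argument), a static constraint for DEPT might state that an application is held by at most one actor in any situation, while a dynamic constraint might state that an accepted application remains accepted in every successor situation:

(∀act1)(∀act2)(∀app)(∀s)(Has(act1, app, s) ∧ Has(act2, app, s) ⊃ act1 = act2)

(∀app)(∀x)(∀s)(Accepted(app, s) ⊃ Accepted(app, Do(x, s)))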
5
A Goal-Oriented Methodology for Business Process Design
This section outlines a methodology which can be used by an enterprise that wishes to develop a new business process. The methodology starts with the
objectives of the enterprise concerning this new development and produces a detailed formal specification of a business process which achieves these objectives. The formal specification is developed as a set of submodels (based on the concepts discussed in previous sections) that capture the new process from various viewpoints. The steps of the proposed methodology are the following:
– Identify the organisational objectives and goals. Initiate goal reduction.
– Identify roles and their responsibilities. Match goals with role responsibilities.
– For each role specify its primitive actions, the conditions to be noticed and its interaction with other roles.
– Develop ConGolog procedures local to each role for discharging each role's responsibilities.
– Verify formally that the ConGolog procedures local to each role are sufficient for discharging its responsibilities.

The steps of the methodology are presented above as if strictly ordered, but some of them will in practice need to run concurrently. Also, backtracking to a previous step will often be useful in practice. The final product of an application of the methodology is a complete enterprise model that can be used to study and analyse the proposed business process. The specification can also serve as a guide for the development of an information system implementing the process. This section does not intend to present the methodology and its application in detail (for this the interested reader should go to [13]). We will only discuss some of the issues involved in Steps 1 and 2, and then concentrate our attention on Steps 4 and 5, where our approach significantly improves on related methodologies (e.g., EKD [2,12] or GEM [26]).
5.1 Goal Reduction and Responsibility Assignment
The first step of the proposed methodology is the elicitation of an initial statement of the enterprise objectives and goals concerning the new process. This will involve brainstorming sessions with the enterprise stakeholders, studying documents (e.g., a mission statement) outlining the strategy of the enterprise to be modelled (and possibly other enterprises in the same industry sector), and so on [2]. During this activity the analyst using our methodology must try to uncover not only prescriptive goals, but also descriptive ones [1]. Once we have a preliminary statement in natural language of the enterprise objectives and goals, the process of constructing a corresponding AND/OR goal graph by asking "why" and "how" questions can begin [4]. This process involves reducing goals, identifying conflicts and detecting positive and negative interactions between goals. The process of goal reduction will lead to a better understanding of the organisational goals, and very often to a reformulation of their informal definition. This step of our methodology is identical to the goal reduction steps in goal-oriented requirements modeling frameworks [31,21,4,30] and related goal-oriented enterprise modeling frameworks [19,12,2]. An important issue that needs to be addressed at this stage is the distinction between achievable and unachievable (or ideal) goals. Ideal goals need to be
considered, but in the process of AND/OR-reduction they need to be substituted by weaker goals that are actually achievable [30]. After the AND/OR graph corresponding to the informal goals is sufficiently developed and stable, the process of goal formalisation can start. For example, one of the goals in our postgraduate program example can be the following goal G1: "enquiries are answered by a member of staff as soon as they are received". This goal can be formalized as has already been shown in Example 1.

In parallel with the process of goal reduction, the business analyst should engage in the identification of roles and their responsibilities (Step 2 of the methodology). Role identification is achieved by interacting with the enterprise stakeholders and by considering goals at the lowest level of the developed goal hierarchy. Given one of these goals and the roles currently existing in the organization, the analyst should then decide whether one of these roles (or a new one) can be designated as responsible for achieving the goal. If this is possible then the goal becomes a role responsibility, otherwise it needs to be refined further. This might sound simple, but role identification and responsibility assignment is a rather difficult task and business analysts could benefit from the provision of guidelines for dealing with it. Such guidelines are discussed in [22].

In our example we assume that the following roles are introduced: Postgraduate Tutor (notation: Tutor), Postgraduate Secretary (notation: Secretary) and Member of Academic Staff (notation: Staff). For the purposes of our discussion it is not necessary to consider a role for students enquiring about or applying to the postgraduate program. Students are considered to be outside of the process and interaction with them is captured through the concept of exogenous actions. Let us also assume that the following responsibility assignments are made. The Postgraduate Secretary will be responsible for handling all correspondence with applicants but also for forwarding applications to the Postgraduate Tutor, who will be responsible for doing an initial evaluation of applications and forwarding applications to appropriate members of academic staff. The latter will be responsible for evaluating promptly all applications they receive.

Once roles have been identified and responsibilities assigned, the goal hierarchy should be revisited. Now goal statements can be made more precise by taking into account the introduced roles, and formal definitions of goals can be rewritten. For example, goal G1 can be rephrased as "enquiries are answered by the Postgraduate Secretary as soon as they are received". This is formalized as follows:

(∀a)(∀e)(∀x)(∀s)(∀s′) (Actor(a) ∧ Enquiry(e) ∧ Action(x) ∧ Situation(s) ∧ Situation(s′) ∧ PlaysRole(a, Secretary) ∧ Received(e, a, s) ∧ s′ = Do(x, s) ⊃ Answered(a, e, s′))
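In a tool, the outcome of this step could be kept as a simple mapping from roles to the goals they are responsible for; the sketch below is our own, and all goal identifiers other than G1 are hypothetical labels for the informally stated responsibilities:

responsibilities = {
    "Secretary": ["G1",            # answer enquiries as soon as they are received
                  "G_corresp",     # handle all correspondence with applicants (hypothetical id)
                  "G_fwd_tutor"],  # forward applications to the Postgraduate Tutor
    "Tutor":     ["G_eval",        # initial evaluation of applications
                  "G_fwd_staff"],  # forward applications to appropriate staff members
    "Staff":     ["G_prompt"],     # evaluate promptly all applications received
}

def responsible_roles(goal):
    return [role for role, goals in responsibilities.items() if goal in goals]

print(responsible_roles("G1"))     # ['Secretary']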
5.2 Defining Roles Using ConGolog
The first step in specifying a role is to identify the primitive actions that are available to each role, the conditions to be monitored and the interactions with other roles. Then the detailed specification of the dynamics of each role is given
using the syntax of Section 3. For each role, the business analyst has to specify a ConGolog procedure called main, which gives the details of the behaviour of the role. Of course, main can invoke other local procedures. In the process we have modelled so far, we have found ConGolog very natural and easy to use. In most cases it was straightforward to write a piece of ConGolog code for each responsibility of a role, and then combine those pieces to form a complete specification of the dynamics of the role. We expect to come up with more precise guidelines for using the language as our experience with it increases.

For our example let us first consider the role Tutor. This role can perform the causal action ForwardApp (defined in Example 2) and the knowledge-producing action SendMsg(sender, recipient, msg), which means that actor sender sends message msg to actor recipient. A precise specification of SendMsg and other useful communicative actions in the situation calculus can be found in [17]. Role Tutor also needs to watch for the condition Has(actor, app), where actor is the actor playing the role Tutor and app is an application. The complete specifications of roles Tutor, Secretary and Faculty are shown in Figures 1 and 2, respectively.

The ConGolog code should be easy to understand but the following comments are in order. First, notice that in the interest of brevity we have omitted unary predicates like Actor, Application etc. that are used to type variables. We have also omitted specifying explicitly the responsibilities assigned to each role; only G1 is specified as a responsibility for role Tutor. The symbol self is a pseudo-variable denoting the actor playing the role inside which the variable appears. The reader should notice how natural it is to specify reactive processes in ConGolog using interrupts and concurrency. The specification of the role Secretary is perhaps more involved because a message queue (in the spirit of [17]) is used. The case where more than one member of academic staff wants to supervise the same applicant is omitted. We also omit the specification of exogenous actions that capture the interaction between the role Secretary and the applicants (that are part of the outside environment).

role Tutor
  responsibility G132
  proc main
    app : Has(self, app) →
      if AvgMark(app) < 70 then
        for act : PlaysRole(act, Secretary) do
          SendMsg(self, act, INFORM(Unacceptable(app)))
        endFor
      else
        for act : PlaysRole(act, Lecturer) do
          ForwardApp(self, act, app)
        endFor
      endIf
  endProc
endRole

Fig. 1. Role Postgraduate Tutor
role Secretary
  responsibility G12, G131, G134
  proc main
    infoReq : Received(self, infoReq) → ReplyTo(infoReq)
    app : Has(self, app) →
      for act : PlaysRole(act, Tutor) do
        Forward(self, act, app)
      endFor
    while True do
      SenseMsg;
      if ¬Empty(MsgQ(self)) then
        if First(MsgQ(self)) = (lect, INFORM(WantsToSupervise(lect, app))) then
          SendOfferLetter(self, app)
        else if First(MsgQ(self)) = (tut, INFORM(Unacceptable(app))) then
          SendRejectionLetter(self, app)
        endIf
      endIf
    endWhile
  endProc
endRole

role Faculty
  responsibility G133
  proc Eval(self, app)
    if GoodUniv(Univ(app)) ∧ AvgMark(app) > MinMark(self) ∧ NoOfStud(self) < MaxNoOfStud(self) then
      for act : PlaysRole(act, Secretary) do
        SendMsg(self, act, INFORM(WantsToSupervise(self, app)))
      endFor
    endIf
  endProc
  proc main
    app : Has(self, app) → Eval(self, app)
  endProc
endRole

Fig. 2. Roles Postgraduate Secretary and Faculty
Given the above specifications for roles Secretary, Tutor and Faculty, the specification of the complete business process is straightforward using the syntax of Section 3.
5.3
Formal Verification
In this step we verify formally that each role responsibility is fulfilled and each constraint is maintained by the ConGolog procedures defined for each role. To perform verification we utilize the techniques reported in [24,25], which are based on a systematic solution to the frame and ramification problems [27]. Specifically, we are interested in determining whether: (i) responsibilities of roles can be fulfilled, and (ii) constraints are preserved or violated as a result of process execution. In cases where such a proof or disproof is not possible at process specification time, strengthenings to the specifications of actions that are relevant to the responsibilities/constraints are proposed, so that any process implementation meeting the strengthened specifications provably guarantees that the responsibilities/constraints will be satisfied in the state resulting from action execution. The method derives ramifications of constraints and action preconditions and effects, and uses them to strengthen the action specifications [24,25].

Example 3. Consider the specification of the action SendOfferLetter shown below. The predicate Accepted(app) denotes that application app has been accepted by DEPT. Similarly, WantsToSupervise(lect, app) means that academic lect would like to supervise the student of application app.

action SendOfferLetter(app)
  precondition (∃lect) WantsToSupervise(lect, app)
  effect Accepted(app)
endAction
Assume that we wish to enforce the policy that no applicant can be both accepted and rejected. This constraint may be expressed by the following sentence of L (and belongs to the constraints submodel):

(∀p)(Accepted(p) ⊃ ¬Rejected(p))

It is evident that the action specification given above does not exclude a situation in which both Accepted(app) and Rejected(app) are satisfied. We can easily see that if the constraint is to be preserved in the situation resulting from performing action SendOfferLetter, then ¬Rejected(p) is a logical implication of the constraint, i.e., a ramification of the constraint and the action specification. Our ramification generator proposes that the term ¬Rejected(p) be used to strengthen the action specification (by conjoining the term with the action precondition or effect). The strengthened specification is now guaranteed not to violate the constraint in any possible execution of the action SendOfferLetter.

Albeit short and simple (we have intentionally omitted presenting all the steps in the generation process due to lack of space), the above example conveys the idea behind the derivation of ramifications for strengthening action specifications. More complex examples and details can be found in [24,25]. The same ideas can be used to verify formally that roles fulfill their assigned responsibilities.

The aforementioned work provides results for verifying properties of primitive actions and of processes including sequencing of actions, when the constraints
refer to at most two distinct states. The derivation of similar results for processes synthesized using any of the remaining ConGolog constructs (including concurrency and non-determinism) and for general dynamic constraints is a topic of current research. Our previous work can also accommodate knowledge-producing actions in a single-agent environment. The theoretical basis of ConGolog has been extended to include exogenous and knowledge-producing actions in a multi-agent environment [16]. The adaptation of these ideas to our analysis and verification techniques is an ongoing effort.

We argue that the ability to verify properties of processes is essential for business process design and re-engineering. The process specifier realizes the implications of actions as far as goal achievement is concerned, and the implementor is saved the burden of having to find ways to meet postconditions and maintain invariants. Furthermore, optimized forms of conditions to be verified can be incorporated into process specifications and consistency is guaranteed by the soundness of the verification process [25].
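As a toy illustration of the idea (a Python sketch of our own, far simpler than the ramification machinery of [24,25]), one can execute the original and the strengthened specification of SendOfferLetter on a test situation and check the constraint of Example 3 afterwards:

# A situation is modelled as a set of ground fluents, e.g. ("Accepted", "app42").
def constraint_holds(s):
    accepted = {tuple(args) for (f, *args) in s if f == "Accepted"}
    rejected = {tuple(args) for (f, *args) in s if f == "Rejected"}
    return accepted.isdisjoint(rejected)        # no applicant is both accepted and rejected

def send_offer_letter(s, app, strengthened=True):
    # precondition: some academic wants to supervise app
    if not any(f == "WantsToSupervise" and args[1] == app for (f, *args) in s):
        raise ValueError("precondition violated")
    # the strengthened version also requires the derived ramification: not Rejected(app)
    if strengthened and ("Rejected", app) in s:
        raise ValueError("strengthened precondition violated")
    return s | {("Accepted", app)}              # effect Accepted(app)

s = {("WantsToSupervise", "lect1", "app42"), ("Rejected", "app42")}
print(constraint_holds(send_offer_letter(s, "app42", strengthened=False)))  # False
# send_offer_letter(s, "app42")  # the strengthened specification refuses to execute here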
6
Discussion
The first paper to propose the situation calculus and ConGolog (more precisely, its earlier version Golog) for business process modeling was [24]. Since then similar ideas have appeared in [25,33,15]. So far, however, ConGolog has not been used in conjunction with a more general framework like ours that offers intentional concepts like actors, roles and goals. The situation calculus is also the formalism of choice for the TOVE enterprise modeling project [8]. However, TOVE concentrates mostly on enterprise ontologies rather than process design and verification. The concepts of goals, actors and roles also appear prominently in the i∗ framework [32,33], where the need for intentional concepts in enterprise modeling is emphasized. There is also a clear connection between our work and goal-oriented methodologies for requirements engineering, especially KAOS [4]. This connection has been explained in detail in previous sections of this paper so we will not elaborate on it here. Our work is also related to (and has been inspired by) the enterprise modeling frameworks of F3 [19] and its successor EKD [2]. Lee's Goal-based Process Analysis (GPA) is also related to our research [14]. GPA is a goal-oriented method and can be used to analyse existing processes in order to identify missing goals, ensure implementation of all goals, identify non-functional parts of a process, and explore alternatives to a given process. Finally, our work has many ideas in common with the GEM models and methodology [26]. Detailed comparisons of our work with these related efforts appear in the extended version of this paper (available from the authors).

The vast majority of business process modeling efforts lack formal methods for verifying properties of processes. An exception to this rule is the effort in [15], where the use of ConGolog is also advocated. We share similar long-term research goals with these researchers, and would like to demonstrate that formal languages and methods can offer significant advantages in the design and analysis of business processes.
References

1. A.I. Anton, M.W. McCracken, and C. Potts. Goal decomposition and scenario analysis in business process reengineering. In Proceedings of CAISE'94, pages 94–104, 1994.
2. J. Bubenko, D. Brash, and J. Stirna. EKD user guide, 1998. Available from ftp://ftp.dsv.su.se/users/js/ekd user guide.pdf.
3. B. Curtis, M. Kellner, and J. Over. Process Modelling. Communications of the ACM, 35(9):75–90, 1992.
4. A. Dardenne, A. van Lamsweerde, and S. Fickas. Goal-Directed Requirements Acquisition. Science of Computer Programming, 20:3–50, 1993.
5. G. De Giacomo, Y. Lesperance, and H. Levesque. Reasoning About Concurrent Execution, Prioritised Interrupts and Exogenous Actions in the Situation Calculus. In Proceedings of IJCAI'97, pages 1221–1226, August 1997.
6. J.E. Dobson, A.J.C. Blyth, J. Chudge, and R. Strens. The ORDIT Approach to Organisational Requirements. In M. Jirotka and J. Goguen, editors, Requirements Engineering: Social and Technical Issues, pages 87–106. Academic Press, 1994.
7. R. Fikes and N. Nilsson. STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving. Artificial Intelligence, 2:189–208, 1971.
8. M.S. Fox and M. Gruninger. Enterprise Modelling. The AI Magazine, pages 109–121, Fall 1998.
9. D. Georgakopoulos, M. Hornick, and A. Sheth. An Overview of Workflow Management: From Process Modelling to Workflow Automation Infrastructure. Distributed and Parallel Databases, 3:119–153, 1995.
10. M. Hammer and J. Champy. Reengineering the Corporation: A Manifesto for Business Revolution. Harper Collins, 1993.
11. N.R. Jennings, P. Faratin, M.J. Johnson, P. O'Brien, and M.E. Wiegand. Using Intelligent Agents to Manage Business Processes. In Proceedings of the First International Conference on the Practical Application of Intelligent Agents and Multi-Agent Technology (PAAM96), 1996.
12. V. Kavakli and P. Loucopoulos. Goal-Driven Business Process Analysis - Application in Electricity Deregulation. In Proceedings of CAISE'98, 1998.
13. M. Koubarakis and D. Plexousakis. Business Process Modeling and Design: AI Models and Methodology. In Proceedings of the IJCAI-99 Workshop on Intelligent Workflow and Process Management: the New Frontier for AI in Business, 1999.
14. J. Lee. Goal-Based Process Analysis: A Method for Systematic Process Redesign. In Proceedings of the Conference on Organizational Computing Systems (COOCS'94), 1994.
15. Y. Lesperance, T.G. Kelley, J. Mylopoulos, and E. Yu. Modeling Dynamic Domains with ConGolog. In Proceedings of CAISE'99, 1999.
16. Y. Lesperance, H. Levesque, and R. Reiter. A Situation Calculus Approach to Modeling and Programming Agents, 1999. Available from http://www.cs.toronto.edu/~cogrobo/.
17. Y. Lesperance, H.J. Levesque, F. Lin, D. Marcu, R. Reiter, and R.B. Scherl. Foundations of a Logical Approach to Agent Programming. In M. Wooldridge, J.P. Muller, and M. Tambe, editors, Intelligent Agents Volume II – Proceedings of ATAL-95, Lecture Notes in Artificial Intelligence. Springer Verlag, 1995.
18. P. Loucopoulos and V. Karakostas. System Requirements Engineering. McGraw Hill, 1995.
19. P. Loucopoulos and V. Kavakli. Enterprise Modelling and the Teleological Approach to Requirements Engineering. International Journal of Intelligent and Cooperative Information Systems, 4(1):45–79, 1995.
20. John McCarthy and Patrick J. Hayes. Some Philosophical Problems From the Standpoint of Artificial Intelligence. In B. Meltzer and D. Mitchie, editors, Machine Intelligence, pages 463–502. Edinburgh University Press, 1969.
21. J. Mylopoulos, L. Chung, and B.A. Nixon. Representing and Using Non-Functional Requirements: A Process-Oriented Approach. IEEE Transactions on Software Engineering, 18(6):483–497, 1992.
22. M. Ould. Modelling Business Processes for Understanding, Improvement and Enactment. Tutorial Notes, 13th International Conference on the Entity Relationship Approach (ER'94), Manchester, U.K., 1994.
23. M.A. Ould. Business Processes: Modeling and Analysis for Re-engineering and Improvement. Wiley, 1995.
24. D. Plexousakis. Simulation and Analysis of Business Processes Using GOLOG. In Proceedings of the Conference on Organizational Computing Systems (COOCS'95), pages 311–323, 1995.
25. D. Plexousakis. On the Efficient Maintenance of Temporal Integrity in Knowledge Bases. PhD thesis, Dept. of Computer Science, University of Toronto, 1996.
26. A. Rao. Modeling the Service Assurance Process for Optus Using GEM. Technical Note 69, Australian Artificial Intelligence Institute, 1996.
27. R. Reiter. The Frame Problem in the Situation Calculus: A Simple Solution (Sometimes) and a Completeness Result for Goal Regression. In Artificial Intelligence and Mathematical Theory of Computation: Papers in Honor of John McCarthy, pages 359–380. Academic Press, 1991.
28. R. Scherl and H. Levesque. The Frame Problem and Knowledge Producing Actions. In Proceedings of AAAI-93, 1993.
29. A. van Lamsweerde, R. Darimont, and E. Letier. Managing Conflicts in Goal-Driven Requirements Engineering. IEEE Transactions on Software Engineering, November 1998. Special Issue on Managing Inconsistency in Software Development.
30. A. van Lamsweerde, R. Darimont, and P. Massonet. Goal-Directed Elaboration of Requirements for a Meeting Scheduler: Problems and Lessons Learned. In Proceedings of RE'95, 1995.
31. E. Yu and J. Mylopoulos. Understanding "Why" in Software Process Modelling. In Proceedings of the 16th International Conference on Software Engineering, pages 135–147, Sorrento, Italy, 1994.
32. E. Yu and J. Mylopoulos. Using Goals, Rules and Methods to Support Reasoning in Business Process Reengineering. In Proceedings of the 27th Annual Hawaii International Conference on Systems Sciences, pages 234–243, Hawaii, 1994.
33. E. Yu, J. Mylopoulos, and Y. Lesperance. AI Models for Business Process Reengineering. IEEE Expert, 11(4):16–23, 1996.
Conceptual Patterns for Reuse in Information Systems Analysis

Petia Wohed

Department of Information and Systems Sciences
Stockholm University/Royal Institute of Technology
Electrum 230, 164 40 Kista, Sweden
[email protected]
Abstract: Reuse of already existing resources and solutions has always been a strategy for reducing the costs of the information systems development process. The construction and organisation of small pieces of reusable solutions, also called patterns, in libraries for reuse support has taken a central place within research during recent years. In this paper, a methodology for collecting conceptual patterns is proposed, together with a navigation structure for suggesting the most suitable pattern during the information systems analysis process. The study has, so far, been carried out on one domain only, but it provides a theoretical background for research on other domains as well.
1
Introduction
The concept of patterns is increasingly used in the context of information systems development. A pattern is a piece of a solution for a problem, reusable in different situations. The main purpose of the use and reuse of patterns is to reduce the costs for the design and implementation of information systems. The idea of reusing different parts of a solution is also at the heart of the approaches of object orientation and component-based development. These paradigms have become more and more commonly used and are widely spread in the sphere of implementation, a line of development clearly distinguished by the character of the current environments for standard programming languages providing large libraries of reusable objects. However, the concept of pattern is still not widely used within the initial phases of information systems development, i.e. analysis and design. One of the problems may lie in the lack of standardised patterns collected in libraries and supported by development tools. Another problem may lie in the lack of suitable patterns. Making a pattern applicable in several different situations often results in a pattern that is too general to be easily applied, and such a general pattern is sometimes even difficult to understand.

In this paper, both these problems are addressed for the area of information systems analysis. The paper provides a theoretical foundation for a modelling wizard tool, which collects a number of conceptual patterns and navigates the designers to the most suitable one for a particular situation. Such a tool, supporting the initial phases of the database design process, is an attempt to complement database management systems like Microsoft Access, or to serve as a part of a visual modelling tool
supporting an OO-development process of information systems like Rational Rose. A prototype for such a wizard, collecting more than 360 patterns, has been developed. The difference from earlier approaches is the idea of preserving a number of different patterns within the same domain, in order to keep them concrete and easy to use. In order to identify the necessary pattern, a navigation structure consisting of a number of queries, to be answered by the designer, is suggested. This navigation structure may also be considered as a method for modelling support and is the main contribution of this work.

The paper is organised as follows. The next section surveys related research in the area. In Section 3, the modelling formalism is defined. In Section 4, a number of patterns within the same domain are analysed and a number of questions are distinguished outlining the differences between the patterns. Section 5 gives an example of the behaviour of a domain-dependent modelling wizard tool built from the questions derived in Section 4. Section 6 gives a theoretical background for the identified questions. Finally, Section 7 concludes the paper and gives directions for further work.
2
Related Research
The identification, naming, and classification of different phenomena in a Universe of Discourse (UoD) have long been incorporated in the discipline of taxonomy. One of the oldest and most well-known taxonomies is Linné's taxonomy for species. However, when dealing with information systems it is not sufficient to only identify and classify the relevant phenomena in the UoD for which an information system has to be built; it is also necessary to specify their properties and relations, something characterising the discipline of ontology. Slightly simplified, and inspired by [10] and [4], we consider an ontology as a conceptualisation of the UoD in a certain language (see [10] for a complete definition of the concept). One of the main reasons for building ontologies is the reuse capability they provide. Some examples of ontologies are CYC [6] developed by Cycorp, The Enterprise Ontology [19] developed at the University of Edinburgh, and TOVE (Toronto Virtual Enterprise) [21] developed by the University of Toronto. Common to all of these three ontologies is that they conceptualise the organisational structure of enterprises and are meant to be used for support in the information systems design process. To position our work in the context of the work provided by the ontology engineers, we refer back to Guarino's framework for ontology-driven information systems design [10], where two dimensions are identified. The first one is the temporal dimension, distinguishing between the development-time and run-time of a system, and the second one is the structural dimension, consisting of the three components of an IS, namely application programs, information resources (i.e. databases and knowledge bases), and user interfaces. Hence, the focus of this paper is on the development-time for databases, and the work reported here may be considered as an application of ontology for the purpose of reuse.

In parallel with the work on ontology, work in the field of reusable patterns has been carried out. One of the works relevant here is the one provided by Maiden in his doctoral thesis [16], concerning the question of specification reuse. His paradigm builds on reuse by analogies. A number of domain abstractions are identified and
formalised using a structure called a meta-schema that defines seven knowledge types as necessary for a requirement specification. The source domain is also specified using this meta-schema. A comparison of the knowledge types of the source domain model with those of the domain abstraction is then possible and provides an analogy analysis, meant to identify possibilities for reuse. Even if Maiden does not call his domain abstractions 'patterns', the idea is still reuse of already constructed specifications. However, the work presented in this paper differs from the work provided by Maiden by concentrating only on the conceptual patterns and not on the whole requirement specification. It also differs by suggesting a navigation structure for identifying the most suitable pattern.

Two other works on patterns during recent years have been provided by Fowler [8] and Hay [12], who, after analysing a number of domains, suggest conceptual patterns meant to guide inexperienced systems developers or simply to serve as starting points for those more experienced. However, no navigation support has been discussed by them, so the work reported here may be considered as a continuation of the work done by Fowler and Hay, through the investigation of navigation support. When talking about patterns, two more works have to be mentioned: the work provided by Coad et al [5] and the work provided by Gamma et al [9]. Each of these works provides a library of patterns that are usually referred to. Coad's library consists of analysis patterns, whereas Gamma's library consists of design patterns. Coad's patterns are quite general, often consisting of no more than two or three classes, and they are meant to be combined in varying ways into different object models. Furthermore, Coad suggests a number of strategies that are aimed at guiding a designer in the analysis process. However, these strategies are not formalized in the sense that they can easily be implemented in a computer, which is also where they differ from our approach. Furthermore, the patterns suggested by Gamma are design patterns and they are meant to support the design process, whereas the patterns analyzed in our work are analysis patterns. The difference between analysis and design patterns is the perspective from which the patterns are built; analysis: specifying the requirements for a system; and design: specifying the solutions for a system.

An alternative work on creating libraries of reusable objects, also partly inspired by Fowler [8] and Coad [5], is the work provided by Han, Purao and Storey [11]. The point of departure in their work is so-called design fragments: reusable solutions with a lower granularity than domain models, but higher granularity and specificity than analysis patterns. The main results are (a) a methodology for building such design fragments collected in a repository, and (b) two clustering algorithms for identifying a common pattern set starting from a natural language requirements specification, which may be used as a navigation structure for identifying possible starting points for a designer. The work reported here may be considered as a complement to the work provided by Han et al. The main differences are, however, that: (a) the navigation structure by Han et al starts with a requirement, something not required in our work; and (b) their navigation structure may suggest more than one design fragment as a relevant starting point for a designer, whereas in our approach only one pattern is suggested.
Finally, viewed from a traditional knowledge engineering perspective, this work builds on the work provided by Moulin and Creasy [18], who integrate the earlier work of Chen [3] and Sowa [20] by adding linguistic elements from Fillmore's case grammar [7]. Moulin and Creasy present a simplified picture of the design process, retyped in Figure 1, where the dotted rectangle encircles the phases being subject to
160
P. Wohed
User
P5 Validating data structures
P1 Interviewing user about the real world
Informal representation of real world
Elementary data structures
P3 Creating elementary data structures
Conceptual data schema (CDS)
P4 Joining elem. data structures in a CDS
P2 Naming concepts and selecting relevant propositions
Propositions describing the UoD
Semantic data model rules
Fig. 1. Modelling the designer’s behaviour (retyped from [18])
automation in modern modelling tools. Our approach builds partly on Moulin’s and Creasy’s ideas for using linguistic elements on conceptual modelling. The difference, however, is that we investigate how to automate the processes that have not been automated earlier, i.e. the processes outside the dotted rectangle. Consequently, we focus on the process P1 ‘Interviewing user about the real word’ (from Figure 1) in order to support the users in building conceptual schemas, or, as it also can be considered, in order to guide the users to find the most suitable pattern. This shift of focus changes slightly the design process as represented in Figure 2 where the modelling process is simplified by the use of patterns. A modelling wizard tool is intended to support the interviewing process P1 and build a conceptual schema gradually, by using a pattern library. During this process the user shall be able to rename and complete the proposed solution, i.e. process P2. Furthermore, an integration module (supporting the processes P3 and P4 in Figure 2) has to be incorporated with the modelling wizard tool in order to offer integration facilities and make it possible to consolidate local schemas into a global one.
[Fig. 2. The process model for our approach: the user is interviewed about the real world (P1) using conceptual patterns, concepts are named and the local conceptual schemas validated (P2), the local schemas are integrated into a global conceptual schema (P3) using integration rules, and the global schema is validated (P5).]
3
Conceptual Schemas
In this section we briefly introduce the modelling language which is used. A formal definition of it, based on the definitions represented in [14] and [15] may be found in the first part of the Appendix. The basic construct in conceptual modelling approaches is the object. Objects that are similar to each other are grouped together into object types, such as Person and Country. Objects have different properties, which are modelled by attributes, and they can be related to each other, which is represented by relationships. In our graphical notation (see Figure 3) object types are represented by rectangles, attributes by bulleted lists inside the object types, and relationships by labelled arrows. The object type initiating a relationship is called the domain of that relationship and the object type in its end is called the range. Generalisation constraints are shown by dotted arrows where the head of an arrow points to the super-type. For each relationship, the mapping constraints specify whether it is single-valued, injective, total or surjective. A relationship is single-valued when each instance of its domain is connected to at most one instance of its range. A relationship that is not single-valued is multi-valued. A relationship is total when each instance of its domain is connected to at least one instance in its range. A relationship that is not total is partial. A relationship is injective (surjective) when its inverse is single valued (total). Since most of the relationships in our schemas are single valued, not injective, total and not surjective, only the cardinality constraints that differ from this uniformity will be specified explicitly.
[Fig. 3. Example of a conceptual schema: object types PERSON (ss#: String) with subtypes MAN and WOMAN, and COUNTRY (name: String); relationships citizen_in from PERSON to COUNTRY, married_to between MAN and WOMAN, and cousin on PERSON; mapping constraints: citizen_in surjective, married_to partial, cousin multi-valued and partial.]
The description of the domain represented by the model in Figure 3 is as follows: a PERSON is a citizen_in a COUNTRY and (s)he may have several cousins. The distinction between MAN and WOMAN is kept. A person may only be married_to someone of the opposite sex, and polygamy is only allowed for women.
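A possible machine representation of such schemas is sketched below (Python, purely illustrative and not the formal definition referred to in the Appendix); it records object types, attributes, generalisation and the four mapping constraints, and encodes the schema of Figure 3 (the direction chosen for married_to is our reading of the figure):

from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class ObjectType:
    name: str
    attributes: List[str] = field(default_factory=list)
    supertype: Optional["ObjectType"] = None       # generalisation constraint

@dataclass
class Relationship:
    name: str
    domain: ObjectType
    range: ObjectType
    single_valued: bool = True                     # defaults follow the paper's convention
    injective: bool = False
    total: bool = True
    surjective: bool = False

person  = ObjectType("PERSON", ["ss#"])
man     = ObjectType("MAN", supertype=person)
woman   = ObjectType("WOMAN", supertype=person)
country = ObjectType("COUNTRY", ["name"])

figure3 = [
    Relationship("citizen_in", person, country, surjective=True),
    Relationship("married_to", man, woman, total=False),
    Relationship("cousin", person, person, single_valued=False, total=False),
]
print([r.name for r in figure3 if not r.total])    # the partial relationships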
4
The Booking Domain
In this section a number of different patterns in the booking domain are considered. The similarities and differences between them are analysed and a number of questions
are constructed for gathering information about the specific features within the domain. The analysed patterns are shown in Figure 4, and most of them may even be considered as special cases of Fowler's 'Resource Allocation' pattern [8] pp. 168-172. The conceptual schemas may be used within the corresponding enterprises: a) a dentist cabinet; b) a university; c) a chain of movie theatres; and d) a travel agency.

Even if all these examples are taken from the same domain, the only common thing in their conceptual schemas, at a first glance at least, seems to be the BOOKING object type, which occurs in all of them. The rest of the schemas differ from each other considerably. One main difference is that in schema a) a booking may concern only one object, whereas in schemas b) and d) a booking may concern several different objects, which the object type BOOKING_LINE_ITEM with its relationship to BOOKING shows. With respect to this difference, the following question may be posed in order to identify which of those two alternatives pertains in a particular situation:

Does a booking consist of one object, or may it consist of several objects?

It is notable that even if several tickets may be booked within the same booking in schema c), the construction is more similar to the one in schema a), where only one object is booked at a time, and not to those in b) and d), where several objects are booked. Hence, the construction of schema c) also depends on the fact that exactly the same kind of objects are booked, which leads further to the next identified difference, namely the difference between the booked objects. Schemas a) and b) represent the booking of a dentist and a room, correspondingly. In both these cases the booked thing is a concrete object, the dentist Peter or the room 605 at DSV. In contrast, schemas c) and d) do not book concrete object(s), but rather describe the kind of object that is booked. That is, the booking of a flight ticket for flight SA345 on Friday 7 May '99, economy class, only guarantees that one ticket, but not any particular one, is booked. The question resulting from this difference is:

Does a booking concern a (number of) concrete object(s), or does it rather specify the character of the object(s)?

Analysing further, one of the schemas, schema d), represents two different kinds of bookings: hotel room bookings and flight ticket bookings, which differ from each other by their properties. This is shown by specialising the object type BOOKING_LINE_ITEM into HOTEL_BOOKING and FLIGHT_BOOKING. In the rest of the schemas the bookings belong to the same category. In order to identify whether the booking objects are from the same category or not, the following question may be posed:

Do all the bookings have the same character or may they be divided into several categories?

Continuing the observations, it can also be noted that in schema b) information about the booking orders is kept by the object types BOOKING_ORDER and ORDER_LINE_ITEM. This distinguishes b) from the rest of the schemas and gives rise to the next question:

Is it necessary to keep information about the request for a booking, before making the booking?
[Fig. 4. A number of booking patterns: a) Booking a time for the dentist; b) Rooms booking at a University; c) Booking movie tickets; d) Booking a trip in a Travel Agency. The schemas involve, among others, the object types BOOKING, BOOKING_ORDER, ORDER_LINE_ITEM, BOOKING_LINE_ITEM (specialised into HOTEL_BOOKING and FLIGHT_BOOKING in d), PATIENT, DENTIST, TEACHER, ADMINISTRATOR, COURSE, ROOM_TYPE, ROOM, CUSTOMER, MOVIE, CINEMA, PERSON, HOTEL and FLIGHT.]
Furthermore, in schema b) a booking order has to be motivated by giving the course for which the booking order is placed. Such a motivation for a booking is not necessary in, and has not been modelled by, the rest of the schemas. The relevant question for identifying this difference is:

Does a motivation need to be given for a booking/booking request?

Finally, in schema d) information about the person who made a booking is kept, as well as the information about the person for whom the booking is made. This fact results in our next question:

May a booking/booking request be done on behalf of someone else?
5
A Modeling Wizard Tool for the Booking Domain
A domain-dependent modelling wizard tool for the booking domain can now easily be constructed using the questions identified in the previous section. In this section we exemplify such a tool. The scenario is that the user answers the questions identified in the previous section one after another, and after each answer the tool builds and refines the conceptual schema according to the answer. It is of course desirable that the user shall be able to rename entity types and relationships continuously during the questionnaire process, as shown in the process model in Figure 2, but we leave these details for later work and only exemplify the behaviour of such a tool here. The detailed logic for the tool can be found in the second part of the Appendix.

Suppose that we want to extend a library database for a university with booking facilities in order to keep information about the students in the booking queue for a book. On the first question:

1. Does a booking consist of a) one object, or b) may it consist of several objects?

if we choose alternative a), the wizard tool should suggest a schema like the one in Figure 5a) and ask the next question. If our answer is alternative b), the tool should draw the schema in Figure 5b) and likewise ask the next question.

2. Does a booking concern a (number of) a) concrete object(s), or b) does it rather specify the character of the object(s)?
[Fig. 5. The alternative solutions after the first question: a) a single BOOKING object type; b) a BOOKING_LINE_ITEM object type related to BOOKING by a part_of relationship.]
[Fig. 6. The alternative solutions after the second question: a) BOOKING specifies OBJECT; b) BOOKING_LINE_ITEM (part_of BOOKING) specifies OBJECT; c) BOOKING specifies OBJECT_CATEGORY; d) BOOKING_LINE_ITEM (part_of BOOKING) specifies OBJECT_CATEGORY.]
An answer a) should result in expanding the schema into the one shown in Figure 6a) or b), depending on the point of departure, which is the result of our previous answer. The answer b) should result in the schema from Figure 6c) or d). In this way, the wizard tool should interview the user and, piece by piece, build a schema depending on the answers. So, if the answers give the information that a booking consists of several objects, that the bookings have two different characters, that information about both the order and the motivation for a booking needs to be kept, and that a booking cannot be done on behalf of someone else (see the right part of Figure 7), the resulting schema suggested by the wizard should be the one shown in Figure 7. (When implementing the prototype, the UML notation was used for object types, relationships, and generalization constraints for practical reasons. For the mapping constraints the notation used by Johannesson and explained in the appendix was kept.)
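The questionnaire logic can be sketched as a small decision procedure; the following Python fragment is our own simplification (the question texts follow Section 4, while the produced object types and relationship names are placeholders that a real tool, and the user, would refine):

def booking_wizard(answers):
    # answers: {1: 'a'|'b', 2: 'a'|'b', 3..6: bool}, one entry per question of Section 4
    object_types = {"BOOKING"}
    relationships = []

    booked = "BOOKING"
    if answers[1] == "b":                                  # a booking may consist of several objects
        object_types.add("BOOKING_LINE_ITEM")
        relationships.append(("part_of", "BOOKING_LINE_ITEM", "BOOKING"))
        booked = "BOOKING_LINE_ITEM"

    target = "OBJECT" if answers[2] == "a" else "OBJECT_CATEGORY"
    object_types.add(target)                               # concrete objects vs. object categories
    relationships.append(("specifies", booked, target))

    if answers[3]:                                         # bookings fall into several categories
        object_types |= {"BOOKING_CATEGORY_A", "BOOKING_CATEGORY_B"}   # placeholder subtypes
    if answers[4]:                                         # keep information about booking requests
        object_types |= {"BOOKING_ORDER", "ORDER_LINE_ITEM"}
        relationships.append(("for", "BOOKING", "BOOKING_ORDER"))
    if answers[5]:                                         # a motivation must be given
        object_types.add("MOTIVATION")
        relationships.append(("motivated_by",
                              "BOOKING_ORDER" if answers[4] else "BOOKING", "MOTIVATION"))
    if answers[6]:                                         # bookings on behalf of someone else
        object_types.add("PERSON")
        relationships.append(("made_for", "BOOKING", "PERSON"))
    return object_types, relationships

# The library example of this section: one object per booking, concrete objects, no extras.
print(booking_wizard({1: "a", 2: "a", 3: False, 4: False, 5: False, 6: False}))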
6
Completeness of the Questions
In the previous two sections, we analysed one domain and showed how a tool may support the modelling process within this domain. An important issue is to investigate the completeness of the questions, i.e. whether they can be used for gathering all the information about the analysed domain that is necessary for the construction of a satisfactory conceptual schema. To reason about the completeness of the questions we divide them into different categories and analyse whether each category is covered by the questions belonging to it and whether the categories are suitable for discussing the completeness of the analysis. The following four categories are identified:
• questions with case grammar character;
• questions for cardinality identification;
• questions for power types identification;
• questions for generalisation/specialisation constraints.
Fig. 7. A screen shot of the modelling wizard
Questions with case grammar character: The concept of case here refers to what has usually been called a deep case, i.e. the categorisation of noun phrases according to the conceptual role they play in an action described by a sentence. Some examples of cases are agent: one who performs the action; recipient: one for whom the action is performed; object: one who suffers the action; instrument: an object used to perform the action; location: the place where the action is performed. The set of cases is obviously debatable, which has resulted in a number of case systems; see [2] for a survey. However, the choice of a case system is not critical in this phase, as the focus here is to show how a case system may be used and also further tailored in order to make it suitable for modelling purposes.

Turning now to the way cases are used in conceptual modelling, it may generally be said that they are used for semantic enrichment of the schemas. (See [18] for a discussion about the advantages of using case grammar within conceptual modelling and [13] for a concrete application based on this semantic enrichment of conceptual schemas.) Briefly, applying case grammar to conceptual modelling implies that the object types in a schema represent the verb and noun phrases, and the relationships between the object types represent the cases. One restriction is that the domains of the relationships are only object types representing the verb phrases. The semantic enrichment results from the categorisation of the noun phrases into cases. It should also be noted that such a categorisation of noun phrases requires that an action has been described in some way. Hence, case grammar is only suitable for schemas describing actions and not for schemas describing, for instance, product
structures. With this limitation in mind, we return to one of our conceptual schemas in Figure 4 and apply case grammar theory to it. Figure 4c) describes that: “A number of tickets for a movie at a cinema may be booked by a person”. An example of information stored in an information base for this schema (see Figure 8) is: “Peter has booked two tickets for the movie ‘Lecture on Conceptual Modelling’ at the cinema Sergel for the 2nd of June ’99 at six o’clock”, i.e. {booking(b1), date(b1,990602), time(b1,’18.00’), nr_of_tickets(b1,2), customer(c1), name(c1,‘Peter’), movie(m1), mname(m1,‘Lecture on Conceptual Modelling’), cinema(cn1), cname(cn1,‘Sergel’), made_by(b1,c1), for(b1,m1), to(b1,cn1)}. Analysing now the sentence describing the schema and the schema itself, we
note that the action ‘tickets may be booked’ is represented by the object type BOOKING, and that the remaining noun phrases, i.e. person, movie, and cinema, are represented by the object types CUSTOMER, MOVIE and CINEMA, respectively. The cases for the noun phrases ‘person’, ‘movie’, and ‘cinema’ are ‘agent’, ‘object’, and ‘location’, respectively, and they are captured by the relationships made_by, for and to. In Moulin and Creasy’s notation the cases are explicitly shown in a schema, something we have not done in our modelling language. However, the cases may be used as automatically generated names for the relationships suggested by a tool, which the user can then change afterwards.
Fig. 8. Instance example for the schema from 4c)
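As noted above, the cases may be used as automatically generated names for relationships. The following is a minimal, hypothetical Python sketch of this idea (not part of the implemented prototype); the mapping from cases to the names made_by, for and to is an assumption chosen to match the example of Figure 8.

# Hypothetical sketch: generating default relationship names from deep cases.
# The case-to-name mapping below is an assumption matching the booking example.
DEFAULT_NAMES = {
    "agent": "made_by",     # one who performs the action
    "object": "for",        # one who suffers the action
    "location": "to",       # the place where the action is performed
}

def suggest_relationships(action_type, noun_phrases):
    """noun_phrases maps an object type (e.g. CUSTOMER) to its case."""
    suggestions = []
    for object_type, case in noun_phrases.items():
        name = DEFAULT_NAMES.get(case, case)   # fall back to the case name itself
        suggestions.append((name, action_type, object_type))
    return suggestions

# Applying the sketch to the schema of Figure 4c):
for name, dom, rng in suggest_relationships(
        "BOOKING", {"CUSTOMER": "agent", "MOVIE": "object", "CINEMA": "location"}):
    print(f"{name}: domain={dom}, range={rng}")

The user would, of course, remain free to rename the suggested relationships afterwards.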
The question is then which of the cases are relevant for a domain. The only way to answer this question is to analyse a number of different patterns within a domain and investigate the cases that have been used in those patterns, similarly to the analysis we provided for the booking domain. The booking schemas in Figure 4 were selected to represent a number of booking situations we have encountered that are as distinct from one another as possible. Concluding the analysis for this domain, we can point out that the cases occurring in all schemas are ‘object’, ‘agent’ and the ‘time for which a booking is made’. The ‘recipient’ is not explicitly modelled since it represents the system for which the analysis is provided. Nor does the ‘instrument’ seem to be a relevant case for this domain. Instead, cases for the ‘order’ and ‘motivation’ for a booking may be of importance, as well as cases for the ‘location where the booking is valid’ and the ‘time when a booking is made’. Since these cases occur only in some of the schemas, we will call them optional cases for the booking domain. Looking back at our questions, we identify the last three questions as questions for identifying whether an optional case shall be included in a particular schema or not. We also note that the set of questions is not complete, since it does not cover all optional cases. In order to get a complete set of questions for a domain, questions for all identified optional cases have to be included.
Questions for cardinality identification: Going the other way around and analysing the first question, we can see that it does not belong to the previous category. It does not address the conceptual role of the noun phrases, but is rather meant to elicit information about the cardinality of the modelled object. Hence we classify this question as a question for cardinality identification.

Questions for power type identification: Into this category we classify the second question. The terminology of power types within conceptual modelling was introduced by Martin and Odell [17]. One example of a power type is the object type BOOK with the properties author and copyright_owner, representing the general existence of an object. In contrast, the object type COPY, related to BOOK and with the property exemplar_number, contains the physical objects and represents the materialisation of the BOOK. The distinction of power types has also been made explicitly by Fowler [8], who even divides his schemas into two levels, an operational and a knowledge level, in order to sort each object type into the correct level.

Questions for generalisation/specialisation constraints: Since the third question is posed in order to identify whether a specialisation hierarchy shall be introduced into the schema or not, we classify it into this category. Historically, the first formalisms for conceptual modelling (called entity relationship languages) did not have any generalisation/specialisation facilities. However, the formalisms were soon extended to capture this kind of constraint as well, since it was considered a convenient way of representing some parts of the UoD. Modelling languages with such capabilities were called languages for extended entity relationship diagrams. Since the language we are using has these facilities, it is natural to investigate whether there is any part of our domain that can advantageously be modelled using generalisation/specialisation structures.

Summarising the discussion, the first question category, i.e. questions with case grammar character, comes from the idea of using linguistic instruments for enriching conceptual schemas. We have also pointed out how the completeness of the questions within this category can be improved by taking the relevant cases into account. The three other categories are closely related to specific features of conceptual modelling. A review of five object structures that frequently occur within conceptual modelling is given in [1] (pp. 125-131). Our last three categories cover three of the object structures reviewed in [1]. The other two structures reviewed in [1] are not relevant for this work due to the limitation to domains involving actions, which the use of case grammar introduces.
7
Conclusions and Further Research
In this paper, we have investigated and proposed a method for supporting the process of conceptual modelling. The ambition has been that this support shall be automated, i.e. that a modelling wizard tool, built to ask a number of questions and suggest conceptual schema patterns, shall implement the results of this work. As a first step, an analysis of an example domain is provided. A number of different conceptual patterns for the booking domain are considered in order to identify the similarities and differences between them. The differences are used for
constructing a number of questions intended to support the modelling process within this domain. The logic for a domain dependent wizard tool is proposed in order to verify the possibility of automating such support. We also reason about the nature of the proposed questions, and whether they are complete and suitable for the purpose for which they were created. The main idea has been the use of case grammar in the construction of the questions, but questions covering some other conceptual modelling features have also been necessary. In order to verify the results of this work, a prototype for a modelling wizard based on them has been developed. The prototype implements the booking domain only, but it is considered a necessary step before continuing the work towards a domain independent wizard.

Besides the work on evaluating the prototype, more theoretical work needs to be done before a domain independent wizard can be built, namely an investigation of the cases occurring in different domains. Domain specific questions, as well as domain independent questions, have to be identified and constructed. Also, the different solutions or patterns that shall be proposed by a tool have to be specified, similarly to those proposed here for the booking domain wizard tool. Furthermore, it is also necessary to investigate how a general tool shall be constructed in order to support the modelling of complex UoDs ranging over not only one but several different domains. One possibility would be to equip the modelling wizard with schema integration facilities. Still, a number of questions need to be considered in order to co-ordinate the behaviour of the tool with the behaviour of a user. One such question is to investigate the techniques humans use when modelling complex domains and their way of dividing complex problems into smaller parts, either to make them easier to grasp and solve, or to distribute the responsibilities between different parties. Another direction for further research is work on attributes that are not cases. For instance, name and address for a person are usually not considered as cases, but rather as simple properties. Case grammar will not help us identify these attributes, but it would still be useful if a modelling wizard tool were able to suggest the most common such properties for an object type. Finally, work on eliminating the limitation resulting from the use of case grammar needs to be carried out. As we pointed out earlier, case grammar may only be used for domains where an action is involved. Therefore, it is important to investigate how the modelling process within domains excluded from this study may be supported.

Acknowledgements

I would like to thank my advisor Docent Paul Johannesson for his valuable comments on earlier drafts of this paper.

References
[1] M. Boman, J. A. Bubenko Jr, P. Johannesson and B. Wangler, Conceptual Modelling, Prentice Hall Series in Computer Science, 1997.
[2] B. Bruce, “Case Systems for Natural Language”, Artificial Intelligence, vol. 6, pp. 327-360, 1975.
[3] P. P. Chen, “The Entity-Relationship Model – Toward a Unified View of Data”, ACM Transactions on Database Systems, vol. 1, no. 1, pp. 9-36, 1976.
[4] R. M. Colomb, “Completeness and Quality of an Ontology for an Information System”, in N. Guarino (ed.) Formal Ontology in Information Systems, Frontiers in Artificial Intelligence and Applications, IOS Press, pp. 207-217, 1998.
[5] P. Coad, D. North, M. Mayfield, Object Models: Strategies, Patterns, and Applications, Prentice Hall, 1995.
[6] CYC® Ontology Guide: Table of Contents, url: http://www.cyc.com/cyc-2-1/toc.html, June 10 1999.
[7] C. H. Fillmore, “The case for case”, in Bach and Harms (eds.) Universals in Linguistic Theory, Holt, Rinehart and Winston, New York, 1968.
[8] M. Fowler, Analysis Patterns: Reusable Object Models, Addison-Wesley, 1997.
[9] E. Gamma, R. Helm, R. Johnson, J. Vlissides, Design Patterns: Elements of Reusable Object-Oriented Software, Addison-Wesley, 1995.
[10] N. Guarino, “Formal Ontology and Information Systems”, in N. Guarino (ed.) Formal Ontology in Information Systems, Frontiers in Artificial Intelligence and Applications, IOS Press, pp. 3-15, 1998.
[11] T.-D. Han, S. Purao, V. C. Storey, “A Method for Building a Repository of Object-Oriented Design Fragments”, in J. Akoka, M. Bouzeghoub, I. Comyn-Wattiau, E. Métais (eds.) Conceptual Modeling – ER’99, Lecture Notes in Computer Science, Springer, pp. 203-217, 1999.
[12] D. C. Hay, Data Model Patterns: Conventions of Thought, Dorset House Publishing, 1996.
[13] P. Johannesson, “Using Conceptual Graph Theory to Support Schema Integration”, in 12th International Conference on Entity-Relationship Approach, Ed. R. Elmasri, pp. 280-289, Dallas, Omnipress, 1993.
[14] P. Johannesson, “Schema Transformations as an Aid in View Integration”, in Fifth International Conference on Systems Engineering, Ed. C. Rolland, pp. 144-251, Paris, Springer, 1993.
[15] P. Johannesson, Schema Integration, Schema Translation, and Interoperability in Federated Information Systems, Dissertation at Department of Computer and Systems Sciences, Stockholm University and Royal Institute of Technology, Sweden, 1993.
[16] N. A. Maiden, Analogical Specification Reuse During Requirements Engineering, Dissertation at Department of Business Computing, School of Informatics, City University, London, 1992.
[17] J. Martin and J. Odell, Object-Oriented Methods: A Foundation, Prentice Hall, 1994.
[18] B. Moulin and P. Creasy, “Extending the Conceptual Graph Approach for Data Conceptual Modelling”, Data and Knowledge Engineering, vol. 8, no. 3, pp. 223-248, 1992.
[19] Ontology ENTERPRISE-ONTOLOGY, url: http://www-ksl-svc.stanford.edu:5915/FRAME-EDITOR/UID-21&sid=ANONYMOUS&userid=ALIEN, June 10 1999.
[20] J. F. Sowa, Conceptual Structures: Information Processing in Mind and Machine, Addison-Wesley, 1984.
[21] TOVE Manual, url: http://www.ie.utoronto.ca/EIL/tove/ontoTOC.html, June 10 1999.
Appendix

Formal Definition of a Conceptual Schema

The formalism presented here is based on those proposed by Johannesson in [14] and [15]. Let P and C be two sets of symbols. A (first order) language based on P and C, written L(P,C), is defined on an alphabet consisting of connectives, quantifiers, punctuation symbols, variables, constants C, and predicate symbols P, where each predicate symbol has an arity. A formula in a language L is defined as usual. A term is a constant or a variable. An atom is a formula of the type p(t1,…,tn), where p is a predicate symbol from P and t1, …, tn are terms, and we say that p has arity n. A ground formula is a formula without variables. An integrity constraint is any closed first order formula. Certain special cases of constraints in conceptual modelling are typing constraints, cardinality constraints and generalisation constraints, which we define below.
• A typing constraint is either of the form
  - ∀x∀y(A(x,y) → D(x)), or of the form
  - ∀x∀y(A(x,y) → R(y)),
  where the first one is read “the domain of A is D”, abbreviated “domain(A) = D”, and the second one “the range of A is R”, abbreviated “range(A) = R”.
• Cardinality constraints concern the cardinality of a conceptual relation. The following four types of cardinality constraints are needed to define a relationship.
  - The relationship A is single valued iff ∀x∀y∀z(A(x,y) ∧ A(x,z) → y=z)
  - The relationship A is injective iff ∀x∀y∀z(A(y,x) ∧ A(z,x) → y=z)
  - The relationship A is total iff ∀x(P(x) → ∃yA(x,y)), where P is the domain of A
  - The relationship A is surjective iff ∀x(P(x) → ∃yA(y,x)), where P is the range of A
• A generalisation constraint is a formula of the form ∀x(P(x) → Q(x)), abbreviated “P ⊂ Q”.
A conceptual schema CS is a pair ⟨L, IC⟩, where L is a language and IC is a set of integrity constraints as defined above. L is restricted to contain only unary and binary predicate symbols. The unary predicate symbols are called object types and the binary predicate symbols are called relationships. Furthermore, we distinguish a subset LP of the set P of predicate symbols, i.e. LP ⊂ P, and call the elements of LP lexical predicate symbols. This distinction is necessary in order to keep data values apart from object identifiers. We restrict the domain of a relationship to belong to P-LP. Relationships whose range is from LP are called attributes. In Figure 9 a graphical representation of a schema is shown using a UML-like notation. The graph shows that the schema contains five unary predicate symbols {PERSON, MAN, WOMAN, COUNTRY, String}, where String is a lexical predicate symbol, i.e. String ∈ LP. The schema further contains five binary predicate symbols, three of which are relationships, namely {cousin, citizen_in, married_to}, and the other two, {ss#, name}, are attributes. The figure also shows a number of typing constraints, e.g. ‘domain(citizen_in) = PERSON’ and ‘range(citizen_in) = COUNTRY’. The generalisation constraints ‘MAN ⊂ PERSON’ and ‘WOMAN ⊂ PERSON’ are shown by dotted arrows. Furthermore, the quadruples next to the relationships
show the mapping constraints in the order they are presented above, i.e. ‘single valued, injective, total, surjective’. ‘1’ stands for single valued/injective, ‘m’ otherwise, and ‘t’ stands for total/surjective, ‘p’ otherwise.
Fig. 9. Example of a conceptual schema

We finally give a definition of an information base. An information base for a conceptual schema CS = ⟨L, IC⟩ is a set IBcs of ground atoms whose predicate symbols belong to P. An example of an information base for the schema in Figure 9 is {man(p1), person(p1), woman(p2), person(p2), ss#(p1,’Peter’), ss#(p2,’Mary’), married_to(p1,p2)}.
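To make these definitions concrete, here is a minimal, hypothetical Python sketch (not part of the paper's formalism or prototype) that stores the information base above as ground atoms and checks a typing constraint and the single-valuedness constraint; reading married_to as relating MAN to WOMAN is an assumption based on Figure 9.

# Hypothetical sketch: ground atoms as (predicate, argument-tuple) pairs.
IB = {("man", ("p1",)), ("person", ("p1",)),
      ("woman", ("p2",)), ("person", ("p2",)),
      ("ss#", ("p1", "Peter")), ("ss#", ("p2", "Mary")),
      ("married_to", ("p1", "p2"))}

def holds(pred, args):
    return (pred, args) in IB

def pairs(rel):
    # All (x, y) such that rel(x, y) is in the information base.
    return [args for (pred, args) in IB if pred == rel and len(args) == 2]

def typing_ok(rel, domain_pred, range_pred):
    # forall x, y: rel(x, y) -> domain_pred(x) and range_pred(y)
    return all(holds(domain_pred, (x,)) and holds(range_pred, (y,))
               for (x, y) in pairs(rel))

def single_valued(rel):
    # forall x, y, z: rel(x, y) and rel(x, z) -> y = z
    seen = {}
    for (x, y) in pairs(rel):
        if x in seen and seen[x] != y:
            return False
        seen[x] = y
    return True

print(typing_ok("married_to", "man", "woman"))  # True for this information base
print(single_valued("married_to"))              # True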
Formal Definition for a Domain Dependent Modelling Wizard Tool for the Booking Domain

In this part of the appendix, the questions identified in Section 4 are reformulated so as to be suitable for a user dialogue implemented in a modelling wizard tool. Each question is followed by a formal definition of the logic of the tool, i.e. the schema it will build and suggest to the user. (An illustrative implementation sketch of this dialogue logic is given after question 6 below.)

1. Does a booking consist of a) one object, or b) may it consist of several objects?
a) CS:= {BOOKING}
b) CS:= {BOOKING, BOOKING_LINE, consists_of, domain(consists_of)=BOOKING, range(consists_of)=BOOKING_LINE}
2. Does a booking concern a) a (number of) concrete object(s), or b) does it rather specify the character of the object(s)?
a) CS:= CS ∪ {OBJECT, specifies, range(specifies)=OBJECT}
       ∪ {domain(specifies)=BOOKING_LINE | BOOKING_LINE exists}
       ∪ {domain(specifies)=BOOKING | BOOKING_LINE does not exist}
b) CS:= CS ∪ {OBJECT_CATEGORY, specifies, range(specifies)=OBJECT_CATEGORY}
       ∪ {domain(specifies)=BOOKING_LINE | BOOKING_LINE exists}
       ∪ {domain(specifies)=BOOKING | BOOKING_LINE does not exist}
3. Do all the bookings a) have the same character, or may they be divided into several categories? If yes, is the number of categories b) smaller than or equal to four (give the number X of categories), or c) larger than four?
a) CS does not change
b) CS:= CS ∪ {OBJECT1,…,OBJECTX, specifies1,…,specifiesX, range(specifies1)=OBJECT1,…, range(specifiesX)=OBJECTX | OBJECT exists}
       ∪ {OBJECT_CATEGORY1,…,OBJECT_CATEGORYX, specifies1,…,specifiesX, range(specifies1)=OBJECT_CATEGORY1,…, range(specifiesX)=OBJECT_CATEGORYX | OBJECT_CATEGORY exists}
       - {OBJECT, OBJECT_CATEGORY, specifies, domain(specifies), range(specifies)}
       ∪ {BOOKING_LN_CATEGORY1,…,BOOKING_LN_CATEGORYX, BOOKING_LN_CATEGORY1 isa BOOKING_LINE,…, BOOKING_LN_CATEGORYX isa BOOKING_LINE, domain(specifies1)=BOOKING_LN_CATEGORY1,…, domain(specifiesX)=BOOKING_LN_CATEGORYX | BOOKING_LINE exists}
       ∪ {BOOKING_CATEGORY1,…,BOOKING_CATEGORYX, BOOKING_CATEGORY1 isa BOOKING,…, BOOKING_CATEGORYX isa BOOKING, domain(specifies1)=BOOKING_CATEGORY1,…, domain(specifiesX)=BOOKING_CATEGORYX | BOOKING_LINE does not exist}
c) CS does not change
4. Is it necessary to keep information about the request for a booking? a) No. If yes, does the request concern b) concrete object(s), or c) a specification of object(s)?
a) CS does not change
b) CS:= CS ∪ {BOOKING_ORDER, according_to, domain(according_to)=BOOKING, range(according_to)=BOOKING_ORDER}
       ∪ {ORDER_LINE, consists_of_ln, according_to_ln, domain(consists_of_ln)=BOOKING_ORDER, range(consists_of_ln)=ORDER_LINE, domain(according_to_ln)=BOOKING_LINE, range(according_to_ln)=ORDER_LINE | BOOKING_LINE exists}
       ∪ {ORDER_CATEGORY1,…,ORDER_CATEGORYX, ORDER_CATEGORY1 isa BOOKING_ORDER,…, ORDER_CATEGORYX isa BOOKING_ORDER | BOOKING_CATEGORY1,…,BOOKING_CATEGORYX exist}
       ∪ {ORDER_LN_CATEGORY1,…,ORDER_LN_CATEGORYX, ORDER_LN_CATEGORY1 isa ORDER_LINE,…, ORDER_LN_CATEGORYX isa ORDER_LINE | BOOKING_LN_CATEGORY1,…,BOOKING_LN_CATEGORYX exist}
   CS:= CS ∪ {OBJECT1,…,OBJECTX, o_specifies1,…,o_specifiesX, range(o_specifies1)=OBJECT1,…, range(o_specifiesX)=OBJECTX, domain(o_specifies1)=ORDER_CATEGORY1,…, domain(o_specifiesX)=ORDER_CATEGORYX | ORDER_CATEGORY1,…,ORDER_CATEGORYX exist}
       ∪ {OBJECT1,…,OBJECTX, o_specifies1,…,o_specifiesX, range(o_specifies1)=OBJECT1,…, range(o_specifiesX)=OBJECTX, domain(o_specifies1)=ORDER_LN_CATEGORY1,…, domain(o_specifiesX)=ORDER_LN_CATEGORYX | ORDER_LN_CATEGORY1,…,ORDER_LN_CATEGORYX exist}
       ∪ {OBJECT, o_specifies, range(o_specifies)=OBJECT, domain(o_specifies)=BOOKING_ORDER | ORDER_LINE ∨ ORDER_CATEGORY1 ∨…∨ ORDER_CATEGORYX do not exist}
       ∪ {OBJECT, o_specifies, range(o_specifies)=OBJECT, domain(o_specifies)=ORDER_LINE | ORDER_LINE exists, ORDER_LN_CATEGORY1 ∨…∨ ORDER_LN_CATEGORYX do not exist}
   CS:= CS ∪ {belongs_to1,…,belongs_toX, domain(belongs_to1)=OBJECT1,…, domain(belongs_toX)=OBJECTX, range(belongs_to1)=OBJECT_CATEGORY1,…, range(belongs_toX)=OBJECT_CATEGORYX | OBJECT_CATEGORY1,…,OBJECT_CATEGORYX exist}
       ∪ {belongs_to, domain(belongs_to)=OBJECT, range(belongs_to)=OBJECT_CATEGORY | OBJECT_CATEGORY exists}
c) like b) and exchanging OBJECT and OBJECT_CATEGORY
5. Does a motivation need to be given for
- a booking? (Relevant only if information about a booking order is not necessary, since the booking order is considered a motivation in itself.) a) yes b) no
- a booking order? (Relevant when it is necessary to keep information about the booking orders.) c) yes d) no
a) CS:= CS ∪ {MOTIVATION, motivated_by, domain(motivated_by)=BOOKING, range(motivated_by)=MOTIVATION}
b) CS does not change
c) CS:= CS ∪ {MOTIVATION, motivated_by, domain(motivated_by)=BOOKING_ORDER, range(motivated_by)=MOTIVATION}
d) CS does not change
6. May a booking be done on behalf of someone else? (Relevant only when it is not necessary to keep information about the booking requirement.) a) no. If yes, is it important to keep information about the party who made the booking? b) no c) yes
May a booking request be done on behalf of someone else? (Relevant only when it is necessary to keep information about the booking requirement.) d) no. If yes, is it important to keep information about the party who placed the request? e) no f) yes
a) CS:= CS ∪ {PARTY, booking_for, domain(booking_for)=BOOKING, range(booking_for)=PARTY}
b) like a)
c) CS:= CS ∪ {PARTY, booking_for, booked_by, domain(booking_for)=BOOKING, range(booking_for)=PARTY, domain(booked_by)=BOOKING, range(booked_by)=PARTY}
d) CS:= CS ∪ {PARTY, booking_for, domain(booking_for)=BOOKING_ORDER, range(booking_for)=PARTY}
e) like d)
f) CS:= CS ∪ {PARTY, booking_for, ordered_by, domain(booking_for)=BOOKING_ORDER, range(booking_for)=PARTY, domain(ordered_by)=BOOKING_ORDER, range(ordered_by)=PARTY}
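As a rough illustration of how such dialogue logic could be automated, the following hypothetical Python sketch (the actual prototype is not described at this level of detail here) keeps the schema CS as a set of facts and applies the definitions of questions 1 and 2 above.

# Hypothetical sketch of the wizard logic for questions 1 and 2:
# CS is a set of object types, relationships and constraints,
# extended according to the user's answers.

def question_1(answer):
    # "Does a booking consist of one object (a) or several objects (b)?"
    cs = {"BOOKING"}
    if answer == "b":
        cs |= {"BOOKING_LINE", "consists_of",
               "domain(consists_of)=BOOKING",
               "range(consists_of)=BOOKING_LINE"}
    return cs

def question_2(cs, answer):
    # "Does a booking concern concrete objects (a) or object categories (b)?"
    target = "OBJECT" if answer == "a" else "OBJECT_CATEGORY"
    anchor = "BOOKING_LINE" if "BOOKING_LINE" in cs else "BOOKING"
    cs |= {target, "specifies",
           f"range(specifies)={target}",
           f"domain(specifies)={anchor}"}
    return cs

cs = question_1("b")       # a booking may consist of several objects
cs = question_2(cs, "b")   # ...and it specifies object categories
for fact in sorted(cs):
    print(fact)

The remaining questions would be implemented analogously, each adding to or rewriting parts of CS depending on the answers already given.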
Evaluating a Pattern Approach as an Aid for the Development of Organisational Knowledge: An Empirical Study

Colette Rolland (1), Janis Stirna (3), Nikos Prekas (2), Peri Loucopoulos (2), Anne Persson (4), and Georges Grosz (1)

(1) Centre de Recherche en Informatique, Université Paris 1 – Sorbonne, 90, rue de Tolbiac, 75013 Paris, France
{rolland, grosz}@univ-paris1.fr

(2) Department of Computation, University of Manchester Institute of Science and Technology (UMIST), P.O. Box 88, Manchester M60 1QD, UK
{N.Prekas, pl}@co.umist.ac.uk

(3) Dpt. of Computer and Systems Sciences, Royal Institute of Technology and SU, Electrum 230, S-16440 Kista, Sweden
[email protected]

(4) Department of Computer Science, University of Skövde, P.O. Box 408, S-541 28 Skövde, Sweden
[email protected]
Abstract. Patterns are a powerful paradigm that has emerged in recent years as a mechanism that can help towards the consolidation and dissemination of design experiences. In the context of the European research project ELEKTRA we developed a pattern approach for capturing best business practices of change management in the electricity sector. In this paper we briefly present this approach and concentrate on the issue of validating the pattern approach through evaluation of its different features. In particular, we define three constituent features, namely the knowledge contained in patterns, the language used to construct patterns and the method for developing the patterns. For each of these features we define an evaluation hypothesis and then test this hypothesis against a set of criteria and metrics. The experiments conducted and the results are presented in summary.
1
Introduction
This paper presents the results of the European research project ELEKTRA (ELectrical Enterprise Knowledge for TRansforming Applications [1]) with respect to the creation of a knowledge base for change management in the electricity sector. In particular, one of the objectives of the project was to “create and capture best business practices of change management for re-using them in similar situations in other Electricity Supply Industry (ESI) companies”. To accomplish this task, we used an approach for disseminating best business practices based on the pattern concept [2, 3]. In this paper we present an overview of the pattern development approach, and we then focus on its validation. In the context of ELEKTRA, patterns are viewed as generic and abstract organisational design proposals. Patterns encapsulate organisational knowledge in a way that facilitates its reuse [4]. The main emphasis is
on providing solutions to important and recurring problems within the context of an organisation. The ELEKTRA project has produced a knowledge base that contains patterns of change management for the electricity sector, drawn mainly from knowledge developed during the project. The main goal was to produce generic and reusable organisational solutions in the areas of Electricity Distribution and Human Resource Management. The results are extensively presented in [5]. In order to reach the project goal, we developed a pattern development approach that mainly consists of:
• a language for describing the knowledge embedded in patterns, as well as meta-knowledge to facilitate the reuse of patterns,
• a method for supporting the discovery of potentially re-usable business practices and solutions, and their generalisation in such a way that they are applicable in more than one organisation.
This paper mainly addresses the validation of the pattern approach. The validation process consists of an evaluation of the three features of the pattern approach, namely (a) the ESI knowledge base, (b) the language used to describe the patterns and (c) the method followed to develop them. This evaluation was performed through empirical studies. The evaluators were mainly domain experts from the two electricity supply companies participating in the project. For each feature we defined hypotheses and then tested the hypotheses against a set of criteria using metrics. The paper is organised as follows. Section 2 outlines the pattern concept and its use in the area of business and organisational development. Section 3 presents the ELEKTRA pattern approach, i.e. the pattern language, the method for pattern development and the pattern repository. Section 4 then presents the methodology for pattern evaluation, including hypotheses and experiments, while section 5 discusses the actual evaluation results. Finally, section 6 presents our conclusions and discusses future work.
2
Patterns as Organisational Solutions
Recent years have witnessed an increasing interest in the use of patterns within the software development community and in particular by those advocating and practising object-oriented approaches and re-use. In [6], Alexander defines a pattern as describing “a problem which occurs over and over again in our environment and then describes the core of the solution to that problem, in such a way that you can use this solution a million times over, without ever doing it the same way twice”. A good number of similar definitions of the term “pattern” exist today [7; 8; 9]. All these definitions share two main ideas. First, a pattern relates a recurring problem to its solution. Second, each problem has its unique characteristics that distinguish it from other problems. The ELEKTRA project has further elaborated the pattern concept and applied patterns towards documenting best business practices in organisations. The patterns that we defined and used in ELEKTRA are generic and abstract organisational design proposals that can be easily adapted and reused in different organisational situations. ELEKTRA patterns represent solutions to specific problems within the context of an organisation, problems that are important and recurring in a variety of cases. The emphasis, therefore, has been on the fact that patterns address important and
repeatable problems within the sector of interest. Each pattern couples a problem with a solution and reflects the context of its applicability, as well as the way in which it can be reused. Patterns address both the description of the enterprise in terms of business processes (and the strategic goals that these processes realise) and the description of the way in which organisations evolve by performing change. This led to the definition of two types of patterns:
• Product Patterns, dedicated to representing and modelling the different situations in the area of interest.
• Change Process Patterns, dedicated to modelling the change process in the area of interest.
This typology of patterns was devised with the purpose of ensuring the repeatability of the change process. The roles that process patterns and product patterns play in the task of managing organisational change are shown in Figure 1.
Fig. 1. The role of patterns in managing the change process [2].
A change process pattern constitutes a proposed solution to the problem of designing a new situation by describing the steps necessary for the implementation of the new situation, i.e. by offering a way of achieving the future state of affairs. A product pattern on the other hand describes the situation itself by detailing individual aspects of the business involved. Product patterns and change process patterns can then be viewed as complementary elements, in that they both contribute towards solving the greater design problem.
3
Pattern Development
This section presents the main features of the ELEKTRA pattern approach – the pattern language, the method used for developing the patterns, and the pattern repository, also called the ESI knowledge base.
3.1
The Pattern Language
In ELEKTRA, we placed emphasis both on developing patterns and on providing enough information about these patterns so as to make them effectively reusable. Thus we made the distinction between the knowledge perspective of the pattern and its usage perspective, captured in the body of the pattern and its descriptor, respectively. The former is the part of the knowledge that is effectively reused whereas the latter aims to provide sufficient information on the pattern and to describe
the context in which the body of the pattern can be reused. ELEKTRA patterns therefore consist of four main elements, each of which is now detailed in turn.

The body presents an overview of the proposed solution. In most cases this type of knowledge was represented in a diagrammatic form, i.e. as conceptual models, or in natural language. The typology of models used is that proposed by the Enterprise Knowledge Development methodology (EKD). More about EKD can be found in [10] and [11].

The formal signature describes the pattern in such a way as to facilitate its indexing and retrieval. The formal signature consists of the pattern type, the domain of its applicability, and the usage intention according to which it can be used. Formalised natural language was used to represent this part of pattern knowledge.

The informal signature gives a complete description of the pattern. This description consists of the problem that the pattern intends to solve, the context of its applicability, the prevailing forces that influence the situation, and the proposed solution to the problem. The solution field offers a description of the proposed approach to tackling the problem, complementary to the one given in the body of the pattern. Additional elements of the informal signature (e.g. the rationale behind the solution, the consequences etc.) complete the pattern description.

The guidelines give advice as to how the pattern is to be reused and applied in a real enterprise context. For representing the guidelines, we used natural language.

An example of a change process pattern is given in Figure 2. The pattern concerns the problem of introducing the buying and selling of electricity. In the goal graph of Figure 2, the proposed solution suggests, in an AND/OR graph, different alternatives for organising the market; these concern the introduction of a Pool, of bilateral contracts or of a Central Buying Authority. The ELEKTRA pattern language was organised using a hierarchical indexing mechanism presented in [2]. The hierarchy of patterns was built using the formal signature of the patterns, and specifically by associating usage intentions of patterns. The pattern hierarchy was therefore organised in an intentional manner. This solution permits us to keep atomic patterns in the thesaurus while expressing their possible composition through a hierarchy that can be used for indexing and retrieval purposes.
3.2
The Pattern Development Method
In order to tackle the increased demands of developing patterns that encapsulate knowledge about change management, we defined a method that involves domain experts and method experts (analysts) in close co-operation. The process is iterative; it consists of the following four steps (see [2; 3] for details):
(a) Elicitation of Candidates aims at identifying potential change process patterns and product patterns. The output of the elicitation process is a list of candidate patterns described at a sufficient level of detail in order to proceed to their evaluation.
(b) Suitability Evaluation aims to determine the suitability of a candidate pattern. Domain experts grade the candidate patterns obtained as a result of the previous step so that their further development can be decided upon.
Name: Introduce the buying and selling of electricity
Type: Change process pattern
Domain: Restructuring
Usage Intention: (Introduce)verb (the buying-and-selling of electricity)result
Body:
[AND/OR goal graph with top goal “Introduce the buying and selling of electricity”; sub-goals include enabling or disallowing Supply to be an eligible customer for electricity, purchase through a Pool system, purchase through bilateral contracts, purchase from a Central Buying Authority, organising or outsourcing a spot-market purchasing activity, and organising the purchasing activity with multiple or a single supplier.]
Problem: In an environment with many electricity producers and many suppliers, electricity has to be bought by Supply from the producers and sold to the final customers.
Context: The deregulation directives of the EC set a framework for the trading of electricity, where a number of alternatives are available.
Forces: The way in which Supply trades electricity is defined by the overall legal and institutional framework in which all participants of the electricity market have to operate.
Solution: Depending on the framework chosen for the entire ESI sector, a Supply company will have some options for trading electricity. If a Pool system is adopted, Supply must participate in a spot market for electricity; this can be done directly through a respective department/activity or it can be outsourced to an external agency. If a system of bilateral contracts is adopted (e.g. minimum ISO), then an activity of negotiating and establishing contracts with electricity producers must be organised. Finally, if a single buyer system is adopted, Supply will be purchasing electricity from the Central Buying Authority.
Consequences: According to the selected framework, Supply will behave either as a captive customer (in the case of a CBA) or as an eligible customer (in the cases of a Pool and trading through contracts). This means an entirely different internal structuring and operation. In the latter case the competition in the electricity market is more intense and therefore Supply will also need to behave in a more flexible and adaptive way.
Related patterns: Trading of electricity through CBA system, Trading of electricity through bilateral contracts, Trading of electricity through Pool, Outsourced trading of electricity through Pool
Guidelines: Selecting one of the modes for trading electricity depends on the direction in which the entire ESI sector will be heading. If an increased degree of competitiveness is desired, then the options of trading through a Pool or through bilateral contracts will be selected. The option of bilateral contracts in particular offers maximum freedom with respect to the negotiation of quantities and prices of power with the producers. If, on the other hand, slower steps towards deregulation are desired, the CBA option offers more centralised control of the market and thus fewer possibilities for autonomous behaviour by Supply.
Fig. 2. An example of change process pattern
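As an illustration of how patterns following this template could be stored and retrieved by usage intention, here is a minimal, hypothetical Python sketch; the field names and the retrieval-by-intention filter are assumptions for illustration, not the project's actual repository implementation.

# Hypothetical sketch: a pattern record with the four main elements
# (body, formal signature fields, informal signature, guidelines) and a
# simple retrieval filter over the usage intention verb.
from dataclasses import dataclass

@dataclass
class Pattern:
    name: str
    pattern_type: str          # "Change process pattern" or "Product pattern"
    domain: str
    usage_intention: tuple     # (verb, result), as in the formal signature
    body: str                  # e.g. a reference to an EKD model or diagram
    informal_signature: dict   # problem, context, forces, solution, ...
    guidelines: str

electricity_trading = Pattern(
    name="Introduce the buying and selling of electricity",
    pattern_type="Change process pattern",
    domain="Restructuring",
    usage_intention=("Introduce", "the buying-and-selling of electricity"),
    body="AND/OR goal graph (see Figure 2)",
    informal_signature={"problem": "Electricity has to be bought from producers "
                                   "and sold to final customers."},
    guidelines="Select a trading mode depending on the direction of the ESI sector.")

def retrieve(patterns, verb):
    # Retrieve patterns whose usage intention starts with the given verb.
    return [p for p in patterns if p.usage_intention[0].lower() == verb.lower()]

print([p.name for p in retrieve([electricity_trading], "Introduce")])

The intentional hierarchy described above could then be built over such records by relating their usage intentions.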
(c) Documenting aims at describing the reusable knowledge in the format of the Pattern Template. The domain experts, in co-operation with the analysts, provide the remaining elements of the pattern. (d) Verification aims at determining the adequacy of the knowledge embedded in the pattern. The wording of all elements in the pattern template is carefully studied and modified if necessary, as are the interconnections between related patterns.
3.3
The Resulting Knowledge Base
By applying the aforementioned method, we produced the ESI knowledge base. It consists of two sets of patterns: one for the case of Distribution and one for the case of Human Resource Management (HRM). The total number of patterns developed is 31 for Distribution (12 change process patterns, 19 product patterns) and 31 for HRM (14 change process patterns, 17 product patterns). These patterns represent a number of important and recurring problems that arise when managing change in these two areas of the ESI sector. They are available in [2]. In addition, the ELEKTRA patterns are accessible via the Internet at the following addresses:
Distribution patterns: http://www.co.umist.ac.uk/~prekas/DistributionPatterns/Pattern_Index.html
HRM patterns: http://www.dsv.su.se/~danny/patternlibrary/main.html
4
Experimental Method
The method to evaluate the ELEKTRA pattern approach is structured around the following three questions:
• WHAT should be evaluated? We answered this question by identifying the main features of the ELEKTRA pattern approach and by defining hypotheses associated with each feature that were considered important to evaluate. We identified three features, which we believe fully cover the approach. We identified 21 hypotheses, each of them representing an aspect of pattern design that we expect the ELEKTRA pattern approach to improve. It is by evaluating these hypotheses that we can test whether or not the expected improvements have, in fact, been realised.
• WHEN should the evaluation be performed? The evaluation can be performed only after a fairly complete and coherent set of patterns has been developed. We performed the evaluation of the ELEKTRA patterns after we had worked for more than one year within the project to define the overall framework and populate the ESI knowledge base.
• HOW should the evaluation be performed? To perform the evaluation we conducted experiments in the form of workshops. We conducted a number of workshops with the participation of 26 ESI experts. In order to determine whether a hypothesis could be validated or not, we adopted the use of evaluation criteria and metrics [12, 13, 14]. We identified evaluation criteria and defined metrics in order to measure each hypothesis in a given experiment against a given criterion. This means that the evaluation frame is a 5-tuple of the form <experiment, feature, hypothesis, criterion, metric>. There is one result for each 5-tuple, which represents an atomic evaluation. These atomic results form the basis of more global evaluations and measurements. A detailed presentation of the evaluation process and results is available in [5]. We limit ourselves here to a summary of both the evaluation process and the obtained results.
Feature 1: ESI Knowledge Embedded in Patterns
Global hypothesis: “The ESI knowledge base is potentially useful for solving organisational problems within the Distribution and HRM domain in the context of deregulation”.
Criterion – Hypothesis:
• Usefulness – H1: Usage of the pattern provides a substantial contribution in the context of a real problem-solving application.
• Relevance – H2: The pattern addresses a significant problem in the ESI sector.
• Usability – H3: The pattern can be used in the context of a real application.
• Adaptability – H4: The solution advocated by the pattern can be modified to reflect a particular situation.
• Adoptability – H5: Domain experts are likely to use the pattern for resolving a particular problem of interest.
• Completeness – H6: The pattern offers a comprehensive and complete view of the problem under consideration and of the proposed solution.
• Coherence – H7: The pattern constitutes a coherent unit including correct relationships with other patterns.
• Consistency – H8: The pattern conforms to existing knowledge and vocabulary used in the ESI sector.
• Prescriptiveness – H9: The pattern offers a concrete and tangible proposal for solving a problem, in particular with respect to the steps necessary for its implementation as described in the guideline.
• Granularity – H10: The pattern addresses the given problem at an appropriate level of detail.

Feature 2: The Pattern Language
Global hypothesis: “The pattern language permits an effective knowledge capture and transfer”.
Criterion – Hypothesis:
• Usefulness – H11: The language captures and conveys the relevant knowledge for describing patterns.
• Comprehensiveness – H12: The different elements of the pattern (formal signature, informal signature and body) are adequate for understanding its purpose.
• Richness – H13: The language is able to describe the different aspects of a pattern one is expecting in such a description.
• Ease of use – H14: The language eases knowledge capture in patterns.
• Relevance – H15: The conceptual primitives chosen are appropriate for expressing the respective parts of pattern knowledge.

Feature 3: The Method to Develop Patterns
Global hypothesis: “The method is an adequate means for guiding the development of the ELEKTRA patterns”.
Criterion – Hypothesis:
• Completeness – H16: The method offers a comprehensive and complete view of the activities to be performed for developing patterns.
• Coherence – H17: The method is described in a coherent way.
• Prescriptiveness – H18: The method offers a concrete and tangible proposal for developing patterns, in particular with respect to the steps necessary for its implementation.
• Relevance – H19: The method helps in organising and guiding pattern development.
• Usability – H20: The method can be used in the context of a real application.
• Usefulness – H21: The method offers an adequate means for understanding how patterns shall be developed.
4.1
Hypotheses
The three features of the ELEKTRA pattern approach that were selected for the evaluation process are as follows:
1. The ESI knowledge embedded in the patterns
2. The pattern language used to express the knowledge
3. The method used to develop patterns
For each of these features, we defined a global hypothesis. Each global hypothesis is further refined into a number of more precise hypotheses that correspond to the evaluation criteria shown in the table above.
4.2
Experiments
The evaluation was conducted in the form of workshops. Each workshop was devoted to the evaluation of one feature. The workshops for the evaluation of feature 1, “ESI knowledge embedded in patterns”, were held separately for the Distribution patterns and for the HRM patterns.

Participants
In total, 26 evaluators were involved in the evaluation task. They were equally divided between the Distribution and HRM cases – 13 for each case. The evaluators were experienced professionals with an extensive amount of knowledge in their respective areas of expertise. In their majority (24 out of 26), they represented the two electricity companies of the ELEKTRA project; two evaluators were independent consultants in the Distribution area. Some of them had some general knowledge about the ELEKTRA project and had been involved in pattern development within the project; no other specific preparations were carried out prior to the evaluation workshops.

Procedure
The evaluation workshops were conducted according to an agenda comprising a common part for all workshops and a specific part for each individual feature evaluated. The common part included the following items:
• presentation of the objectives of the evaluation
• presentation of the ELEKTRA evaluation approach
• background to the patterns work and the EKD notation used for documenting patterns
• presentation of the questionnaire(s)
• tutored completion of the questionnaire(s)
• general discussion about the possible use of the ESI knowledge base
The specific part of the workshops devoted to the evaluation of the knowledge embedded in patterns included:
• presentation of the pattern language
• short presentation of the pattern clusters and of each pattern of each cluster
The specific part of the workshop devoted to the evaluation of the method used for pattern development included:
• presentation of the pattern development method
The evaluators were asked to respond to questions in a questionnaire. The questions cover all the hypotheses to be tested against the set of criteria for each. The response was a grading from 1 to 5 for each criterion. Thus, each atomic evaluation is a value from 1 to 5 associated with a 5-tuple <experiment, feature, hypothesis, criterion, metric>. For example, the average value of the 5-tuple <Workshop 1, Knowledge embedded in patterns, The knowledge embedded in patterns provides a substantial contribution for an ESI company to resolve an existing problem, Usefulness, 1 to 5> was 4.3. Evaluators were also given an opportunity to give additional comments when they felt that these were necessary. The workshops ended with an open discussion about the overall usability of the ESI knowledge base and possible ways in which it could be improved.
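The aggregation of such atomic evaluations into the average marks reported below can be illustrated by the following minimal, hypothetical Python sketch; the example records and their grades are invented for illustration and are not the project's actual data.

# Hypothetical sketch: averaging atomic evaluations (grades 1-5) per hypothesis.
from collections import defaultdict

# Each atomic evaluation: (experiment, feature, hypothesis, criterion, grade).
# The grades below are invented examples, not the ELEKTRA data.
atomic_evaluations = [
    ("Workshop 1", "ESI knowledge", "H1", "Usefulness", 4),
    ("Workshop 1", "ESI knowledge", "H1", "Usefulness", 5),
    ("Workshop 1", "ESI knowledge", "H10", "Granularity", 3),
    ("Workshop 1", "ESI knowledge", "H10", "Granularity", 4),
]

def average_per_hypothesis(evaluations):
    grades = defaultdict(list)
    for _experiment, _feature, hypothesis, _criterion, grade in evaluations:
        grades[hypothesis].append(grade)
    return {h: sum(g) / len(g) for h, g in grades.items()}

for hypothesis, avg in sorted(average_per_hypothesis(atomic_evaluations).items()):
    print(f"{hypothesis}: {avg:.2f}")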
5
Evaluation Results
This section presents the results of the evaluation process. We divided the discussion into three parts according to the three features of the pattern approach that we have evaluated.
5.1
Evaluation Results of the Knowledge Embedded in Patterns
As the two parts of the ESI knowledge base – the Distribution patterns and the HRM patterns – constitute coherent units, the evaluation of these hypotheses was accordingly divided into two parts. The evaluation process for both parts was similar, but the evaluators were different. For each case, a cluster of patterns from the entire pattern hierarchy was selected for evaluation. The selection of pattern clusters for validation was made with the following goals in mind:
• The selected patterns should contain a representative sample of information from the pattern library;
• they should form coherent clusters addressing the most important problems among those included in the knowledge base;
• they should include both change process patterns and product patterns.
We will briefly outline these pattern clusters along with the respective evaluation results.

Evaluation Results for Distribution Patterns

Two clusters of patterns were selected from the original hierarchy of Distribution patterns, as illustrated in Figure 3:
• One cluster addressing the problem of performing structural change in the Distribution business area. The cluster consists of the change process patterns “Introduce structural unbundling”, “Introduce new services based on network assets”, and “Introduce the buying and selling of electricity”. These three patterns address to a great extent the problem of dealing with structural change in the transition from a monopolistic environment to an unbundled, competitive market.
Fig. 3. Distribution patterns selected for validation as part of the original Distribution pattern hierarchy. Patterns chosen are underlined.
Related to the last of these three change process patterns is the one product pattern of this cluster, namely “Trading of electricity through Pool”. This pattern complements the solution to the problem of introducing the trading of electricity by describing one of the possible ways of organising an electricity market.
• One cluster addressing the problem of performing changes in customer servicing in the Distribution business area. This cluster includes the change process pattern “Improve handling of customer requests”, and the product patterns “Respond to customer requests” and “Customer request servicing” (as well as its refinement through the associated pattern “Customer request servicing by phone”). This group of patterns addresses the problem of handling customer requests and possible ways of improving the services already offered.
The full description of the Distribution patterns that were evaluated during the validation process can be found in [2] and [3]. As shown in Figure 4, the average markings achieved by the Distribution patterns in both clusters are encouraging. All patterns achieved an average above 3.50, most of them standing close to or above 4 (see Fig. 4). Change process patterns achieved higher overall averages than product patterns.
Fig. 4. Average values scored by Distribution patterns.
Figure 5 sums up the average marks for each of the 10 hypotheses for this feature. Overall, the Distribution patterns achieved their highest markings in the questions related to usefulness and relevance, achieving an average of 4.34 for H1 and H2
respectively (see Fig. 5). This is, to some degree, an expected outcome: the understanding of what constitutes an important issue in a domain (which produces the candidate patterns and their problem descriptions) is much more likely to be unanimous than the proposed solutions to each problem.
Fig. 5. Average values per hypothesis scored by all Distribution patterns.
The Distribution patterns were also rated well for their consistency with the knowledge and vocabulary used in the ESI sector (average 4.14 for H8). This result is encouraging with respect to the process followed to develop the patterns, as it was the involvement of the domain experts in all steps of the process that assured this consistency. A further high average mark was achieved for the usability of patterns in real applications (average 4.1 for H3). The patterns received their lowest markings for granularity (average 3.49 for H10), prescriptiveness (average 3.56 for H9) and completeness (average 3.77 for H7). This reflects the evaluators' view that some patterns did not tackle the respective problems in enough depth. In particular, the evaluators noted a need for more detailed solution descriptions as well as a more complete coverage of the options available for solving each problem. Therefore, we can assume that hypotheses H7, H9 and H10 are only partially verified and that improvements have to be made with regard to them before the Distribution patterns can be effectively used. Overall we concluded that the hypothesis “the Distribution part of the ESI knowledge base is useful for solving organisational problems within the Distribution domain” is verified.

Evaluation Results for HRM Patterns

Three pattern clusters were selected for evaluation (see Fig. 6). They serve as a representative sample and cover the most important aspects of the HRM part of the knowledge base.

C5: Managing Individual Human Resources – This cluster of Change Process Patterns aims at improving the management of human resources at the individual level. The proposed solutions include increasing the responsibility of individuals for their own competency development, improving the organisation's knowledge sharing culture, creating a knowledge sharing infrastructure, and transferring individual competence to organisational competence.
Fig. 6. HRM patterns selected for evaluation are grey-shaded.
C2: Employee Indicators Individual Level – This cluster of Product Patterns provides a proposal for how to measure employee-related Human Resource properties. The pattern cluster selects types of data (measurable variables) that can be used to formulate goals for individuals. Three types of data are distinguished, each of which is presented in a separate sub-pattern: Employee attitude indicators, Employee assessment indicators with regard to knowledge, and Employee experience data. Clusters C5 and C2 are related in the sense that intentions expressed in C5 require certain ways of measuring HR-related properties of employees, expressed in C2.

C4: Managing Organisational Human Resources – This cluster of Change Process Patterns aims at improving the management of human resources at the organisational level. The proposed solutions include increasing the alignment of competency management with business strategy (further refined in a number of sub-patterns) and improving the attractiveness of the company as an employer in the ESI sector. Due to the limited time for evaluation, only the two top-level patterns in cluster C4 were evaluated. These patterns are “Business Aligned Competency” and “Attract ESI Competency”.

As shown in Figure 7, the average values of the HRM patterns are reasonably high: the average marking for all HRM patterns is above 3, and two thirds of them are above 4 (see Fig. 7).
Fig. 7. Average marks scored by HRM patterns.
The highest scoring HRM patterns based on the average values are the product patterns in cluster C2. The average value of the entire cluster is 4.43, while the patterns in this cluster scored the following values:
Employee indicators individual level – 4.19
Employee assessment indicators – 4.47
Employee attitude indicators individual level – 4.30
These values are higher than those for the remaining change process patterns. Such a high rating of these product patterns can be explained by the fact that they offer more concrete proposals, and therefore they are easier to appreciate. This can also be the reason why product patterns generally scored a higher average value (4.35) than change process patterns (3.89). The patterns of cluster C2 also “stick” well together, since the relationships between them are well described and easy to grasp. In addition, the descriptions of these product patterns offer simpler explanations than the ones for process patterns – the pattern body contains a simple diagram (Employee Indicators Individual Level) and textual descriptions in the form of a bullet list. Considering all HRM patterns and clusters, the highest average values are for overall usefulness (4.56 for H1), relevance to the ESI sector (4.43 for H2) and consistency with domain knowledge (4.28 for H3) – see Figure 8.
Fig. 8. Average values per hypothesis scored by the HRM patterns.
The lowest average values are for completeness of the knowledge embedded in the patterns (3.89 for H6), for prescriptiveness of the proposed solution (3.86 for H9), and for granularity or appropriateness of the level of detail (3.60 for H10). Many evaluators suggested that the level of abstraction is too high and that the suggested solutions are not operational enough to be easy to implement. Such a rating also influenced the overall rating of some patterns. Some hypotheses were refined into a number of more precise hypotheses. For example, the hypothesis regarding the completeness of the knowledge embedded in patterns addressed the following three aspects – completeness of the description of the problem, completeness of the proposed solution, and completeness of relationships with other patterns (see Figure 9). The evaluators also expressed a need to introduce more detailed and precise solutions to the problems addressed. Another contribution towards achieving more complete solutions would be to add specific examples of known cases where similar solutions have been applied. Such patterns would then serve as proposals for organisational designs. It is not surprising that for these criteria the product patterns (cluster C2) scored higher marks than the change process patterns, since by nature they address more concrete and complete solutions. In particular cluster C4, containing two change process patterns at a high abstraction level, received the most criticism in this respect. From these markings we can conclude that the hypotheses regarding completeness of the knowledge (H6), prescriptiveness (H9), and granularity (H10) are only partially verified. The comments received give excellent guidelines towards improving the
knowledge base. This leads us to conclude that the overall hypothesis: “the HRM part of the ESI knowledge base is potentially useful for solving organisational problems within the HRM domain” is verified.
[Fig. 9 data: H6.1 Problem Completeness 3.99; H6.2 Solution Completeness 3.93; H6.3 Relationships Completeness 3.75.]
Fig. 9. Average values of hypotheses regarding completeness
[Fig. 10 data: average marks for H11–H15 range from 3.9 to 4.6, with H11 and H15 scoring highest.]
Fig. 10. Average marks per criterion scored by the Pattern Language.
5.2 Evaluation Results of the Pattern Language
Figure 10 presents the overall results of the evaluation of the pattern language per hypothesis. The pattern language achieved its highest results in the hypotheses concerning “Relevance” (H15) and “Usefulness” (H11). The high rating for these two hypotheses indicates that the evaluators believe the structure of a pattern is adequate for conveying the knowledge one can expect from it. The comprehensiveness and the clarity of the language need some improvement. A study of the evaluation results focused on the different elements of the pattern template indicates that, on average, the guidelines received the highest marking (4.7). The informal signature received the second highest marking, which is not surprising since this element gives a complete description of the problem that the pattern is trying to solve, its context of applicability, its forces and the solution proposed to the problem. Despite its formality, the formal signature gets a reasonably high marking. Considering that its usefulness can only be appreciated through the retrieval process, one can conclude that the language is justified in providing a formal signature. The considerably lower average value (3.5) scored by the pattern body shows that a formal conceptual modelling notation is presumably not clear enough to be easily understood by the large majority of pattern users. On the basis of these data we conclude that the hypothesis “the pattern language permits an effective knowledge capture and transfer” is verified.
5.3 Evaluation Results of Pattern Development Method
The average value for the evaluation of the method used to develop patterns is 3.8 out of 5 (see Figure 11). This is an encouraging result, since not all evaluators were familiar with the problem of pattern development at the beginning of the evaluation process. The method has been well perceived and well understood with regard to its objectives. With the exception of Completeness (H16), all criteria have an average above 3.50, most of them close to 4. Fig. 11 gives the average marking per criterion.
[Fig. 11 data: H16 2.6; H17–H20 between 4.0 and 4.2; H21 4.0.]
Fig. 11. Average values scored by the pattern development method.
"Usefulness" (H21) gets a marking of 4 out of 5. Therefore, evaluators consider that the method offers an adequate means for guiding pattern development. The average marking of “Completeness” (H16) is the lowest of all criteria: 2.6. The evaluators consider that a step is missing to determine the initial pool of candidate patterns. The average marking for the criteria "Coherence" (H17), "Prescriptiveness" (H18), "Relevance" (H19) and "Usability" (H20) is between 4 and 4.2. This means that evaluators found the method consistent (the ordering of the steps as advocated by the method was found correct), relevant and useful. Thus, this hypothesis is partially verified.
6 Conclusion and Discussion
The ELEKTRA pattern evaluation case study was a valuable source for drawing important conclusions about organisational patterns. Below we summarise the most important of them:
• Too high a level of abstraction should be avoided when describing the solution to an organisational problem. The evaluators frequently expressed the opinion that the abstraction level is inappropriate for the kind of problem that is solved, and most often too high. The links between patterns should also be made more visible. This would create a clearer picture of the context in which a pattern is to be used.
• Patterns in clusters are easier to understand and are therefore more appreciated than isolated patterns. The pattern clusters present broader and therefore more complete solutions, so pattern users can more quickly grasp the overall idea of how the proposed solutions can be applied in their situation.
• Patterns should describe concrete solutions instead of guidelines and suggestions on how to tackle the problem in general. The proposed solutions should be illustrated by “best practices” and references to similar cases in real life.
• Patterns describing alternative solutions should include guidelines for choosing an appropriate solution depending on the particular situation in the organisation.
The evaluation confirms that the ESI knowledge base is on average useful for solving organisational problems in the context of a deregulated electricity market. It is also likely that the HRM patterns, due to their relative independence from the particular domain, can be re-used in different organisational contexts, even outside the ESI sector. The evaluation process also gave us a stimulus for further improvements and refinements of the knowledge contained within the patterns, the format used to present patterns to potential users, the coverage of the pattern base, and the method used for developing patterns. The next step will be to broaden the evaluation process and set up a Grand Jury approach, as has been successfully tested in the context of design patterns (see [15] and http://www.cs.clemson.edu/~tmiller/jury/jurorinfo.html).
References
1. Elektra Consortium, ELEKTRA – Electrical Enterprise Knowledge for TRansforming Applications, Project Proposal, ESPRIT Project No. 22927, 1996.
2. ELEKTRA Consortium, Molière: The ESI Knowledge Base Specification, ELEKTRA Project Deliverable Document, ESPRIT Project No. 22927, 1999.
3. Prekas N., Loucopoulos P., Rolland C., Grosz G., Semmak F., Brash D., Developing patterns as a mechanism for assisting the management of knowledge in the context of conducting organisational change, 10th International Conference and Workshop on Database and Expert Systems Applications (DEXA'99), Florence, Italy, 1999.
4. Rolland C., Grosz G., Loucopoulos P., Nurcan S., A Framework for Encapsulating Best Business Practices for the Electricity Supply Industry into Generic Patterns, 2nd IMACS Int. Conf. on Circuits, Systems and Computers – IMACS-CSC '98, Athens, Greece, 1998.
5. ELEKTRA Consortium, Newton: Validated ESI Knowledge Base, ELEKTRA Project Deliverable Document, ESPRIT Project No. 22927, 1999.
6. Alexander C., Ishikawa S., Silverstein M., Jacobson M., Fiksdahl-King I., Angel S., A Pattern Language, Oxford University Press, New York, 1977.
7. Coplien J., Schmidt D. (eds.), Pattern Languages of Program Design, Addison Wesley, Reading, MA, 1995.
8. Gamma E., Helm R., Johnson R., Vlissides J., Design Patterns: Elements of Reusable Object-Oriented Software, Addison Wesley, Reading, MA, 1995.
9. Fowler M., Analysis Patterns: Reusable Object Models, Addison-Wesley, 1997.
10. Bubenko J.A. jr, Stirna J., Brash D., EKD User Guide, Dpt. of Computer and Systems Sciences, Royal Institute of Technology, Stockholm, Sweden, 1997, available at http://www.dsv.su.se/~js/ekd_user_guide.html
11. Loucopoulos P., Kavakli V., Prekas N., Rolland C., Grosz G., Nurcan S., Using the EKD Approach: The Modelling Component, UMIST, Manchester, UK, 1997.
12. Fenton N., Software Metrics: A Rigorous Approach, Chapman and Hall, NY, 1991.
13. Oman P., Hagemeister J., Metrics for Assessing Software System Maintainability, 1992 IEEE Conference on Software Maintenance (Orlando, FL, Nov. 1992), IEEE Computer Society Press.
14. Fonash P., Metrics for Reusable Code Components, Ph.D. Dissertation, George Mason University, 1993.
15. McGregor J.D., Using a Juried Approach for Design Pattern Validation, Proc. of the 7th International Conference on Software Quality (7ICSQ), Montgomery, Alabama, USA, October 6–8, 1997.
Specifying Processes with Dynamic Life Cycles Rick van Rein University of Twente, Dept INF PO Box 217, NL-7500 AE Enschede, the Netherlands [email protected]
Abstract. We propose an alternative notation and semantics for process models in object analysis, to resolve problems with current diagram languages. Our dynamic life cycles are communicating state diagrams. Our life cycles support polymorphic creation, and they are straightforwardly composed. We provide an operational semantics, and demonstrate how to interact with a system of life cycles. Keywords. object orientation, polymorphic creation and deletion, conceptual modelling, communication, state diagrams, life cycles, process algebra, component composition.
Introduction
Modern systems analysis and conceptual modelling are performed with object oriented methods, which offer rich notation to capture the constraints on implementations. An often-used notation is OMT, increasingly replaced with UML. The most-used models in these languages are class diagrams (which express classes with associations between them) and state diagrams (which express dynamic aspects of classes). An example of a class diagram in OMT notation is found in figure 1, which shows a library (example taken from the analysis course on Catalysis [5]; Catalysis is an analysis and design method exploiting the notation of UML). This library registers books, which may be lent by members. A book has a title, against which reservations can be made. Class diagrams have evolved from entity relationship diagrams [7], which explains why they represent data aspects but no process aspects. A result of the emphasis on data aspects in object oriented analysis is that associations lack a deeper meaning than “two connected classes.” Many designs do not even name associations! In the design in figure 1, the lentby association optionally refers to a member, but no knowledge is added on when a member is found over that association. Furthermore, the meaning of the association can only be found by interpreting the word ‘lentby’ — a common source of misunderstanding. Similarly, the class diagram does not reveal whether the lentby and heldby links can lead to an object at the same time.
Fig. 1. Class diagram for a library system.
Fig. 2. State chart for a book in the library.
To express such dynamic aspects of the design in an object oriented way, a state diagram is usually drawn, like the one in figure 2, and some methods (like Catalysis) add mutually exclusive predicate expressions to the states, to hold for all b:book:

b.Available = (b.lentby = nil) ∧ (b.heldby = nil)
b.Borrowed = (b.lentby ≠ nil) ∧ (b.heldby = nil)
b.OnHold = (b.lentby = nil) ∧ (b.heldby ≠ nil)

This does answer our previous questions. A caveat here is the use of the value nil, which always introduces the danger of loss of referential integrity; luckily, the relation between nil values and states makes it possible to check the models for nil dereferencing. The state diagram in figure 2 demonstrates another place where a predicate expression can be of use, namely when the return operation is applied to a book; in this case, some condition (mentioned as res here) specifies whether or not the book is put on hold for some reservation. The disadvantage of this approach is that the book must be aware of the existence of reservations. It would be better for extensible designs when books are just entities with an Available and a Borrowed state, processing ckout and return operations, and not minding about reservations. Unfortunately this is not possible, because a ckout for a reservation (distinguished here by naming it ckoutres) poses additional constraints and also terminates the resolved reservation.
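Purely as an illustration (not part of the original design), the following Haskell sketch encodes these state predicates over a book record in which Nothing plays the role of nil; the type and field names, and the assumption that Borrowed and OnHold test for a non-nil lentby and heldby reference respectively, are ours.

  import Data.Maybe (isJust, isNothing)

  newtype MemberId = MemberId Int deriving (Eq, Show)
  newtype ReservationId = ReservationId Int deriving (Eq, Show)

  -- A book with optional lentby/heldby references; Nothing stands for nil.
  data Book = Book { lentby :: Maybe MemberId, heldby :: Maybe ReservationId }

  available, borrowed, onHold :: Book -> Bool
  available b = isNothing (lentby b) && isNothing (heldby b)
  borrowed  b = isJust    (lentby b) && isNothing (heldby b)
  onHold    b = isNothing (lentby b) && isJust    (heldby b)

  -- The three predicates are mutually exclusive; e.g. available (Book Nothing Nothing) == True.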
Another class of process related problems in object oriented models is how to express synchronisation; for example, not enabling a member to leave the library (membership) before his lends set is empty. These problems are solved with preconditions on the leave operation in Catalysis. Predicate logic expressions are powerful tools for object modelling, but not without a downside; free-form quantifications over sets make automatic model checking very hard and in general impossible, since all possible runtime occurrences of sets cannot be foreseen at design time. Design-specific (‘free form’) predicate expressions introduce the need for runtime checking, which has a negative impact on runtime performance. The remainder of this paper is dedicated to introducing an (object) process model that contains more information than current life cycle models; the presentation is structured as follows: section 1 introduces and defines the concepts of life cycles by showing solutions to typical problems; section 2 shows a number of practical applications of the life cycle concept; finally, we draw conclusions and look forward to future research. This work has been performed in the scope of the Quantum project, in which Compuware's Uniface lab and the University of Twente cooperate. Uniface is a leading component-based development tool for mission-critical applications.
Our approach. In this paper we introduce our notion of (dynamic) life cycles, as a variation on state diagrams. Life cycles can be used to express the process view on a class or component. They are capable of communicating with each other, and of creating and destructing themselves on the fly. Life cycles pass around identities of life cycle instances (called lifes, the process view on objects). We choose not to model nesting, direction, or to fixate the number of participants for operations; instead, we model an event-like scheme with the possibility to extend the set of event participants in several ways. Our events interact synchronously, so that lifes have the capability to block events. We define the operational semantics for our life cycles in the functional programming language Miranda [3] (Miranda is a trademark of Research Software Ltd.); it is a precise, high-level language that hardly forces us to over-specify, and it is executable and relatively easy to read.
1 The Concept of Life Cycles
This section introduces the concept of (dynamic) life cycles, by showing which problems exist, and how they are solved by life cycles. Each problem is introduced with an example, and guides the definition of the graphical syntax and an operational semantics for life cycles.
1.1 Undirected Events
Problem. Events mentioned in state diagrams can be related to ‘the real world’ in a number of ways. State charts by Harel [9] and in UML [16] usually have a
notion of events coming from outside the state charts' domain. To provide state charts with the ability of initiating further actions, they can send out events. This approach is also taken in ROOM [19] and in our own protocol checking work [17]. This approach is not free of problems. Firstly, it is often debatable [12] [2] what the semantics are if a multitude of events arrives at the same time, and whether a nesting of events is allowed. It seems that no reasonable choice covers all cases of practical interest. We believe that the problem is that the event-sends-event structure reveals too many details of implementation in programming languages, notably C++.
Discussion. We believe that this problem is caused by the imperative idea of events calling each other. To model all aspects of these calls well, the models either contain a lot of complicating detail, or fail to be sufficiently precise for verification by automatic tools. In contrast, process-algebras [1] are simpler and express similar information. In these models however, events are just atomic actions that communicate, with no distinction between caller and callee.
Solution. Our choice is to simplify the semantics of events in state charts, by removing knowledge of the direction of event communication. Instead, we observe something happening in the ‘real world’ (what we will henceforth call an event occurrence), and our life cycles define events, which are system-perspectives of event occurrences. We define an event as a name plus a list of parameters (introduced in subsection 1.4). We define event occurrences evocc as a set of events (Miranda does not support sets as language primitives, so we use lists without taking order or multiplicity of elements into account). This makes it possible to model complicated event occurrences that can be matched by different events in different places:

event ≡ (name, [param])
evocc ≡ [event]

Based on these definitions, we define a lifecycle as a triple containing a name, a set of states (which we define as simple names, unique within a lifecycle), and a set of transitions (which we introduce in subsection 1.6):

lifecycle ≡ (name, [state], [transition])

We leave the types name and state unspecified, and defer the definition of transition to later.
Example. In figure 3, we show a simple system comprising two life cycles. An event occurrence in this system is [acq], which matches the event acq in book but is ignored in member. Another event occurrence is [ckout], which is matched by an event ckout in both member and book life cycles and which therefore is a
step that the two life cycles make at the same time. Though not useful in this application, it would be correct to consider [acq, leave] as an event occurrence, which is matched by acq in book and by leave in member. Instead of a nested calling structure between events, the events communicate by observing the same event occurrences at the same time.
Fig. 3. Library in normal state diagram notation.
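To make this undirected reading concrete, here is a small Haskell sketch of ours (it is not the paper's Miranda code, and parameters are left out, so an event is only a name): a life cycle simply observes those events of its alphabet that appear in an event occurrence.

  type Event = String            -- parameters are introduced later; here an event is only a name
  type EvOcc = [Event]           -- an event occurrence is a set of events, represented as a list

  -- The events of a life cycle's alphabet that are observed in a given occurrence.
  observed :: [Event] -> EvOcc -> [Event]
  observed alphabet occ = [e | e <- alphabet, e `elem` occ]

  -- observed ["acq","discard","ckout","return"] ["ckout"] == ["ckout"]
  -- observed ["join","leave"]                   ["acq"]   == []   -- the member life cycle ignores acq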
1.2 Dynamic Communication Structures
Problem. Modern software development aims for modular (or ‘component-based’) development, in the hope that modules can be composed after they have been separately built. This makes it necessary to construct modules that can dynamically modify their patterns of interaction with the environment. A problem encountered in implementing this wish is the strict pairing of caller to callee in most implementation languages. This style of specification makes it hard to add or remove a party without adding or removing its communication partners at the same time.
Discussion. An elegant and general scheme for communication is one where any number of parties may participate in an event. When a party wants to participate in an event, it cannot be ignored just because it currently is in a wrong state for participation. So, when a life cycle (like book in figure 3) contains the operation return, it can block that operation on other life cycles if it is not yet ready to participate in it. However, if a life cycle (like member) does not contain the return event, it simply ignores occurrences of that event. The ideas of synchronisation (or ‘blocking’) described here are well known in concurrency theory. This form of synchronisation is found in LOTOS [6] [4] as ||, and in ACP [1, §4.2], CSP [10] and CCS [14] as ||. Collaborative processing of events is known in programming languages as rendez-vous synchronisation. It is found in the languages Ada [11] and Occam [13] (Occam is a registered trademark of INMOS Limited). Furthermore, there seems to be a correspondence between event occurrences and distributed transactions [21].
Solution. To be able to talk about acceptance or blocking of event occurrences, we introduce a small algebra in which the interesting values are:
– Ignore to denote that an event (occurrence) is ignored;
– Match xs to denote that an event (occurrence) is accepted provided that the unifications in xs are established;
– Block to denote that an event (occurrence) is blocked and therefore may not take place.
The algebra provides a type acceptance with these values and a compose operation to compose lists of such values:

acceptance ::= Ignore | Match [(arg, arg)] | Block

isIgnore = (= Ignore)
isMatch (Match b) = True
isMatch x = False
isBlock = (= Block)

compose :: [acceptance] → acceptance
compose xs = Block, if ∃x ∈ xs @ isBlock x
           = Match [v | (Match vs) ← xs; v ← vs], if ∃x ∈ xs @ isMatch x
           = Ignore, otherwise

Using this acceptance algebra, we can define a function that indicates how two events match. This function uses the notion of parameters in events, which are introduced in subsection 1.4; in short, a parameter can either be an In or Out value with the usual meaning, or it can be a Force value, which means that an event with a different value for that parameter Blocks if it matches on the non-Forced parameters. The eventmatch function returns Ignore if there is a discrepancy between the types of the two events. Note that Out parameters are treated as part of the event name. If the events do not Ignore each other, they should Match or Block; the latter case occurs when a Force parameter (introduced in subsection 1.4) has a wrong value. When Match is returned, variable bindings due to parameters are returned as well. So:

eventmatch :: event → event → acceptance
eventmatch (n, p) (n′, p′) = Ignore, if n ≠ n′ ∨ #p ≠ #p′
                           = Ignore, if ∃x ∈ matchers @ mismatch x
                           = Block, if ∃x ∈ unification @ blocking x
                           = Match unification, otherwise
  where
  matchers = zip (p, p′)
  mismatch ((Out, Bound v), (Out, Bound v′)) = v ≠ v′
  mismatch ((d, w), (d′, w′)) = False
  unification = [(w, w′) | ((d, w), (d′, w′)) ← matchers]
  blocking ((Bound v), (Bound v′)) = v ≠ v′
  blocking (w, w′) = False
Example. With the above definition of eventmatch, it is possible to verify the examples in the previous subsection. Since we have not yet considered parameters, we will set the lists of parameters to [], the empty list. The event acq is modelled formally as (“acq”, []) and we find that eventmatch (“acq”, []) (“acq”, []) = Match [] and eventmatch (“acq”, []) (“ckout”, []) = Ignore. We generalise this notion to the matching with event occurrences in subsection 1.6. The problem of allowing dynamically changing numbers of collaborators on an event occurrence can be solved with the eventmatch definition: those events that Ignore an occurring event will not take part in it; those that Block cause the whole event occurrence to Block (thanks to the compose operator); and if this does not happen, all events that Match take part in the occurring events.
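The parameterless example can be reproduced with the following Haskell sketch of ours; it transcribes only the name-and-arity part of eventmatch and drops the unifications carried by Match, so it is a simplification of the Miranda definitions above, not the authors' code.

  data Acceptance = Ignore | Match | Block deriving (Eq, Show)

  -- Blocking dominates, then matching; otherwise everything was ignored.
  compose :: [Acceptance] -> Acceptance
  compose xs
    | Block `elem` xs = Block
    | Match `elem` xs = Match
    | otherwise       = Ignore

  type Event = (String, [Int])   -- a name plus a parameter list (identities only)

  -- Simplified eventmatch: same name and arity match, different names are ignored.
  eventMatch :: Event -> Event -> Acceptance
  eventMatch (n, ps) (n', ps')
    | n /= n' || length ps /= length ps' = Ignore
    | otherwise                          = Match

  -- eventMatch ("acq", []) ("acq", [])   == Match
  -- eventMatch ("acq", []) ("ckout", []) == Ignore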
1.3 Creation and Destruction of Lifes
Problem. A running life cycle system must support instances, which we call lifes: some books in our library will be Borrowed, while others are Available. To keep track of different books, life cycles must be instantiated to lifes. Only for a life is it sensible to speak of its current state, and a life is what responds to event occurrences, or blocks them. Some common problems related to creation and destruction of lifes or objects are polymorphic creation of a new instance (often solved with factories [8]) and when to destruct a life or object (often solved with garbage collection [22]).
Discussion. We prefer to solve creation and deletion of lifes without explicit operations. We prefer creation and destruction to be a response of a life cycle system to a plain event occurrence, because this hardly requires additional life cycle concepts. Process algebras such as ACP [1, table 35], which model concurrent processes, silently remove a finished process; this is part of the design of the concurrency operators. There is also a process algebra, namely the π-calculus [15], which introduces a similarly elegant creation construct: a process P may be prefixed with a replication operator, with the interesting property that !P = P || !P, which is any number of P processes interleaved. This appears like an infinite-sized pool of instances of P being ready to get created at the first communication with one of the Ps. The π-calculus approach to process creation is generally considered elegant because it eliminates the requirement of explicit creating operations. Creation is just an effect of normal communication in the π-calculus.
Solution. To handle instances of life cycles, which we call lifes, we define a life to be a quadruple with its lifecycle, an identity that is unique for that life cycle, a current state taken from the life cycle, and a binding relation for currently known variables (treated in subsection 1.4):

life ≡ (lifecycle, lifeID, state, binding)

The type lifeID is only required to support an equivalence relation; in our examples, we assume that lifeID is represented as num. This paper works only with
life cycles with one creating transition, and these are mutually exclusive, so that for each life cycle, at most one instance is created during any event occurrence. We define a global system state sys as a set of life cycles and a set of lifes that instantiate those life cycles:

sys ≡ ([lifecycle], [life])

We define creation and deletion as actions on such a sys. Later restrictions will ensure that only one instance at a time can be created for each life cycle. We already exploit this knowledge here and create one life for every lifecycle in a system, exploiting a special state nirwana that we reserve to represent the state of a life before its creating transition(s) or after its destroying transition(s):

nirwana :: state
nirwana = "Nirwana"
inNirwana :: life → bool
inNirwana (lc, lid, st, b) = (st = nirwana)

The nirwana state is useful, because it enables the creation of non-existent lifes in a pre-natal state in the following birth function, and destroys outlived and never-used lifes in the death function:

birth, death :: sys → sys
birth (lcs, ls) = (lcs, ls ++ [(lc, nid lc, nirwana, [(this, nid lc)]) | lc ← lcs])
  where nid lc = id not in [lid′ | (lc′, lid′, y, z) ← ls; lc′ = lc]
death (lcs, ls) = (lcs, ls −− [l | l ← ls; inNirwana l])

We do not specify id not in, except that for all xs it holds that (id not in xs) ∉ xs.
Example. The birth and death functions are applied just before and after a basic step of a life cycle system. This means that for every lifecycle, there is always a life ready to be instantiated in response to an event occurrence. Similarly, a life can engage in an event occurrence which terminates its life, and it would be implicitly cleaned up by the death function. To demonstrate the conceptual simplicity of our approach to creation, imagine adding a (redundant) constraint to our library, to express that every ckout of a book must be followed by a return of that book. This can be done by adding the life cycle in figure 4. This creates a new life on the fly, as a side-effect of the occurrence of a plain ckout event. The ckout event receives no special treatment anywhere else than on the creating arrow in this new life cycle; deletion is similar.
Fig. 4. The checkout life cycle.
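The same creation scheme can be rendered in Haskell as follows; this is our own sketch with deliberately simplified types (a life cycle is identified by its name only), not the paper's definitions.

  data State = Nirwana | Named String deriving (Eq, Show)

  type LifeId    = Int
  type LifeCycle = String
  data Life = Life { cycleOf :: LifeCycle, lifeId :: LifeId, current :: State }
    deriving (Eq, Show)

  type Sys = ([LifeCycle], [Life])

  inNirwana :: Life -> Bool
  inNirwana l = current l == Nirwana

  -- Before a step: add one fresh, pre-natal life per life cycle.
  birth :: Sys -> Sys
  birth (lcs, ls) = (lcs, ls ++ [Life lc (freshId lc) Nirwana | lc <- lcs])
    where freshId lc = 1 + maximum (0 : [lifeId l | l <- ls, cycleOf l == lc])

  -- After a step: drop lifes that are (still or again) in nirwana.
  death :: Sys -> Sys
  death (lcs, ls) = (lcs, filter (not . inNirwana) ls)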
1.4 Parameters and Variables
Problem. If we instantiate numerous lifes from the book life cycle, we do not wish them to jointly transit from the Available state to the Borrowed state, but
rather one at a time. However, it is at the same time useful to let events be globally visible, to enable any life to participate in any event occurrence.
Discussion. We need to give every life its own identity, with a this notation, just like the keyword representing the ‘current object’ in object-oriented programming languages. The this keyword makes it possible to distinguish lifes from each other; but the events should also distinguish between different lifes involved, rather than life cycles. Therefore, we introduce parameters on events. Normal object-oriented programming languages often support overloading of operations, where two operation names are only equivalent when the number and types of parameters somehow match. We take this same approach, and go one step further, by not only looking at the types of parameters, but also at the identities of the parameters. This is like multi-methods, defined from a process perspective. To make it possible to give out parameters other than this, a life cycle can define variables to hold references to (other) lifes. These variables can be read or written as event parameters.
Solution. Every life can define a set of variables, which hold references to (other) lifes. These values are passed around in parameters to events and event occurrences. Two important properties of parameters are their direction and whether or not they are (already) bound to a value. The direction of parameters may be:
– In, denoted with ? graphically, for variables whose value is read from the parameter. The value for these parameters is not constrained by this parameter.
– Out, the graphical default, for variables that are exposed to a match along with the name of the event. The third clause of eventmatch exists because of this matching on Out parameters.
– Force, denoted with ! graphically, to enforce that an event's parameter matches a variable's value. An example of this will be introduced in subsection 1.5. If the rest of an event e matches but this parameter does not, then e is Blocked.
The parameter may either be an Unbound reference to a variable (mentioning that variable by name) or it may be Bound to some lifeID value:

param ≡ (dir, arg)
dir ::= In | Out | Force
arg ::= Unbound var | Bound lifeID
var ≡ name

Variable names are unique within a life cycle. Furthermore, every life cycle defines a variable named "this", that already holds the lifeID of a life on creating transitions; all other variables can only get values by obtaining them as In parameters from some event occurrence:

this :: var
this = "this"

A life cycle definition only contains Unbound parameters, but after a life is instantiated, variables are filled in. We define a binding to be a mapping from Unbound to Bound parameters; this is the way variable values are represented in a life. The bind function applies a binding by mapping it over a transition, whose introduction we defer to subsection 1.6:

binding ≡ [(var, lifeID)]
bind :: binding → transition → transition
bind bnd (f, t, eves) = (f, t, map (eventbind bnd) eves)
  where
  eventbind bnd (nm, parms) = (nm, map (parmbind bnd) parms)
  parmbind ((nm, val) : nms) (d, Unbound nm) = (d, Bound val)
  parmbind (nv : nms) x = parmbind nms x
  parmbind y x = x

A variable v may only be used as an Out or Force parameter in transitions leaving a state s if either v is this or v is mentioned at least once as an In parameter on each possible path leading to state s. This is a syntactical constraint on life cycles.
Example. Figure 5 presents a redesign of the library of figure 3, where the events have parameters. For example, the event occurrence ckout(b,m) communicates with book b and member m; therefore, a single book b is checked out, and other books remain in the state in which they were; also, if some book b′ is in Borrowed state, then it does not block the ckout(b,m) event occurrence, provided that b′ differs from b.
Fig. 5. Two library life cycles with parameters.
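The effect of applying a binding to event parameters can be illustrated with the Haskell sketch below; it is ours, with simplified counterparts of the Miranda types above, and is not meant as the paper's definition.

  type Var    = String
  type LifeId = Int

  data Dir = In | Out | Force deriving (Eq, Show)
  data Arg = Unbound Var | Bound LifeId deriving (Eq, Show)
  type Param = (Dir, Arg)
  type Event = (String, [Param])

  type Binding = [(Var, LifeId)]

  -- Replace every variable that the binding knows by its value; leave the rest untouched.
  bindEvent :: Binding -> Event -> Event
  bindEvent bnd (nm, ps) = (nm, map bindParam ps)
    where
      bindParam (d, Unbound v) | Just val <- lookup v bnd = (d, Bound val)
      bindParam p = p

  -- bindEvent [("this", 1)] ("ckout", [(Out, Unbound "this"), (In, Unbound "m")])
  --   == ("ckout", [(Out, Bound 1), (In, Unbound "m")])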
1.5 Enforcing Synchronisation
Problem. Imagine extending the library in figure 5 with reservations (on books rather than titles for simplicity). Then, whenever a reserved book is returned,
it must be put on hold. In many object methods, this is done by splitting the return transition like in figure 2. In our life cycle approach, we prefer to exploit communication to add a constraint, stating that the next member to check out the book b must be the member for whom b was put on hold.
Discussion. With only In and Out parameters, life cycles could not express this kind of ‘identity enforcement.’ An In parameter matches with anything, and an Out parameter simply Ignores event occurrences that have different identities for that variable. Therefore, we need to introduce another kind of parameter, which disallows (or Blocks) event occurrences with non-matching parameters.
Solution. We will graphically annotate such parameters with a !, which is pronounced ‘(en)force.’ In our formal models, we use the direction Force for these parameters. If an event occurrence eo matches an event on name, parameter count and all Out parameters, then there still is the question whether eo should be Matched or Blocked; the latter is chosen in case a Force parameter has a different value from the value in eo. This is what has been formalised in clause 5 of the definition of eventmatch.
Example. Figure 6 defines the reservation life cycle. This life cycle is created when a reserve(?b,?m) event occurs, and awaits the return of book b. When this book is returned, the reservation life claims it, and adds a constraint on the next ckout of b, stating that it may only be checked out by member m, who made the reservation initially. The result is a dynamic extension to the communication structure which deals with the added constraint due to the reservation; the book need not be aware of this additional constraint. Note how the asymmetry between the book and the member in ckout corresponds to a conceptual asymmetry: member m should be able to check out other books than b, but book b may not be checked out by another member than m.
Fig. 6. Enforcing a certain member parameter value.
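A way to see what the Force direction adds, in isolation: the Haskell fragment below (our sketch, using the simplified types of the earlier sketches) compares one declared parameter against the value found in an occurrence; only the Force case turns a disagreement into Block.

  data Dir = In | Out | Force deriving (Eq, Show)
  data Arg = Unbound String | Bound Int deriving (Eq, Show)
  type Param = (Dir, Arg)
  data Acceptance = Ignore | Match | Block deriving (Eq, Show)

  -- Compare one parameter of a transition's event against the value in the occurrence.
  paramAccept :: Param -> Int -> Acceptance
  paramAccept (In,    _)         _ = Match                            -- reads whatever value occurs
  paramAccept (Out,   Bound v)   w = if v == w then Match else Ignore -- a different identity is none of our business
  paramAccept (Force, Bound v)   w = if v == w then Match else Block  -- a different identity is refused
  paramAccept (_,     Unbound _) _ = Match                            -- nothing to compare against yet

  -- With ckout(b,!m): another member checking out book b yields Block, so the whole occurrence
  -- is refused; a ckout of a different book mismatches the Out parameter b and is merely ignored.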
1.6 Multiple Perspectives on Event Occurrences
Problem. In the previous solution, we introduced a reservation on books. It is better to reserve titles than books, to support the situation where two copies (books) of the same title are owned by the library. Such a reservation life cycle that awaits a title cannot be made without requiring the title in the parameter list of the events that make a book available, thus also on return. The title has thus far not been a parameter to the return event, and it would be a demonstration of bad extensibility of life cycles if we had to add it now.
Discussion. This is a common problem; from one perspective, there should be certain parameters to an event occurrence, and from another perspective there should be other parameters. This is caused by different perspectives on the same event occurrence. From the perspective of a book, only the book returned is of interest. From the perspective of a reservation, the title is also of interest. We want to support observing event occurrences from different perspectives, much like in step failures semantics [20]. We therefore allow multiple events per transition. Each of these events can represent a different perspective on the transition. To ensure that all perspectives occur at the same time, the events on a transition must all occur together. Note how this approach replaces that of nested states, as used in state charts. Nested states have a hierarchical ordering, and that hierarchy is for one perspective only. Another perspective usually shows up as a cross-level transition, which indeed is part of the syntax of state charts. We therefore believe that an extension of life cycles with hierarchical states would weaken the concept of life cycles rather than improve it.
Solution. A transition in a life cycle is annotated with any number of events:

transition ≡ (state, state, [event])
transfrom :: lifecycle → state → [transition]
transfrom (n, x, ts) from = [(f, t, es) | (f, t, es) ← ts; f = from]

Now that we have seen that event occurrences as well as transitions are represented with sets of events, it is possible to define when they match; informally, this is true when for every event in the transition, there is a matching event in the event occurrence:

evoccmatch :: evocc → transition → acceptance
evoccmatch eo (f, t, es) = Block, if blocked
                         = compose [eventmatch e e′ | e ← eo; e′ ← es], otherwise
  where
  matchdom = [e | e ← es; ∃x ∈ eo @ (isMatch ◦ eventmatch e) x]
  blocked = ([] ≠ matchdom) ∧ (matchdom ≠ es)

In general, a match is found if no inconsistencies exist in the required Match unifications:

match :: evocc → transition → acceptance
match eo tr = consistent (evoccmatch eo tr)
  where
  consistent Ignore = Ignore
  consistent (Match xs) = Match xs, if and [a = a′ | (x, a) ← xs; (x′, a′) ← xs; x = x′]
  consistent x = Block

Example. To notate multiple events on one transition, we separate the events with a comma, pronounced ‘and.’ For example, in the life cycle for a book, it is sensible to introduce an event avail(this,t) along the transition that now holds the event return(this) only. The avail event can be caught in the reservation life cycle. Moreover, from the perspective of books becoming available, we can see the acquisition of a new book as an avail(this,t) event too. The resulting life cycles for book and reservation are therefore as given in figure 7. It is sufficient to mention the existence of a title; we need not define its life cycle.
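The all-or-nothing rule for the events on a transition can be sketched as follows; this is our Haskell simplification, in which events are plain names, so parameter unification is ignored.

  data Acceptance = Ignore | Match | Block deriving (Eq, Show)
  type Event = String
  type EvOcc = [Event]
  type Transition = (String, String, [Event])   -- (from state, to state, events)

  -- Every event of the transition must occur: a partial overlap blocks,
  -- no overlap at all is ignored.
  evOccMatch :: EvOcc -> Transition -> Acceptance
  evOccMatch occ (_, _, es)
    | all (`elem` occ) es = Match
    | any (`elem` occ) es = Block
    | otherwise           = Ignore

  -- evOccMatch ["return","avail"] ("Borrowed","Available",["return","avail"]) == Match
  -- evOccMatch ["return"]         ("Borrowed","Available",["return","avail"]) == Block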
1.7 Step Semantics
This subsection is dedicated to describing precisely what the semantics of a step of a complete life cycle system are. The general approach is to first find out what transitions a single life can make given some event occurrence, then what happens if these are applied to a single life, and then map this approach over all life cycles in the system.
Fig. 7. The use of multiple events per transition.
The lifesteps function lists every possible transition that a life can engage in, given its current state and an event occurrence. The result lists the acceptance for every possible transition. The result is a set (implemented here as a list); an empty set means that no non-blocking transitions are possible; one result means that a unique transition can be made; multiple results mean that a non-deterministic choice can be made between the outcomes.

lifesteps :: evocc → life → [(transition, acceptance)]
lifesteps eo (lc, lid, st, b) = matched else ignored
  where
  transitions = [(tr, match eo tr) | tr ← map (bind b) (transfrom lc st)]
  nonblocked = [(tr, acc) | (tr, acc) ← transitions; ¬ isBlock acc]
  matched = [(tr, acc) | (tr, acc) ← nonblocked; isMatch acc]
  ignored = [(tr, acc) | (tr, acc) ← nonblocked; isIgnore acc]

[] else ys = ys
xs else ys = xs

This makes it possible to construct a function steplife that constructs a list of possible future lifes from a current life and an evocc happening to it:

steplife :: evocc → life → [life]
steplife eo l = map (actualstep l) (lifesteps eo l)

With the actualstep function, a transition is applied to a life; this influences the state in which the life resides. Furthermore, old bindings bnd old are removed as far as they are read by this step, and new bindings bnd new are added instead for read-in variables:

actualstep :: life → (transition, acceptance) → life
actualstep (lc, lid, st, bnd) (tr, Ignore)
  = (lc, lid, st, bnd)
actualstep (lc, lid, st, bnd) ((f, t, es), Match ms)
  = (lc, lid, t, mkset ((bnd −− bnd old) ++ bnd new))
  where
  bnd old = [(var, val) | (var, val) ← bnd; ∃x ∈ ms @ ((Unbound var =) ◦ snd) x]
  bnd new = [(var, val) | (Bound val, Unbound var) ← ms]

A life can block the execution of an evocc that it can accept in some possible future; this is a consequence of the dynamic communication principle explained in subsection 1.2. The function futureblock determines if the given sys blocks the given evocc because of some future transition of which all variables are currently bound:

futureblock :: evocc → sys → bool
futureblock eo (lcs, ls) = ∃x ∈ ls @ futureblockinglife x
  where
  futureblockinglife l = ∃x ∈ futurematches l @ ((¬) ◦ isIgnore) x
  futurematches l = [acc | (tr, acc) ← futurelifesteps [nirwana] eo l; allbound tr]
  allbound (f, t, es) = ([var | (nm, parms) ← es; (d, Unbound var) ← parms; d = In] = [])

This uses a function futurelifesteps, which lists, once again for each possible transition, the possible acceptance value, only this time taking the future after that step into account in the acceptance value. This blocks on events that are Ignored or Matched in the current state, but that Block in some future state.

futurelifesteps :: [state] → evocc → life → [(transition, acceptance)]
futurelifesteps states eo l = [(tr, compose (acc (tr, a) : accs (tr, a))) | (tr, a) ← stepstodo]
  where
  stepstodo = lifesteps eo l
  stepsdone ((f, t, es), a) = filter otherstate [actualstep l ((f, t, es), Match [])]
  otherstate (lc, lid, s, b) = s ∉ states
  acc = compose ◦ map snd ◦ concat ◦ map (futurelifesteps (states′ l) eo) ◦ stepsdone
  accs (tr, a) = [], if #states = 1
               = [a], otherwise
  states′ (lc, lid, s, b) = s : states

When a sys makes a step, every life decides independently which of its possible futures to select; therefore, the whole sys has as its possible futures the cartesian product of the possible futures of all its lifes. Note how we apply the functions
birth and death to obtain the desired effect of creation and destruction as side-effects of communication:

step :: evocc → sys → [sys]
step eo sys = [], if futureblock eo (birth sys)
            = ((map death) ◦ sysstep ◦ birth) sys, otherwise
  where
  sysstep (lcs, ls) = [(lcs, ls′) | ls′ ← cartesian (map (mkset ◦ steplife eo) ls)]

The step function is the interface through which users of a life cycle system can make the system perform steps.
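The cartesian product at the heart of step can be seen in isolation in the following Haskell fragment (ours, purely for illustration): each life contributes a list of possible successors, and sequence over the list monad enumerates every combination.

  -- Possible successors per life, labelled by strings for this illustration.
  perLife :: [[String]]
  perLife = [ ["book1:Borrowed"]                  -- a unique outcome
            , ["member7:Member"]                  -- also unique
            , ["res3:Waiting", "res3:OnHold"]     -- a non-deterministic choice
            ]

  -- All system-level futures: one entry chosen from every inner list.
  systemFutures :: [[String]]
  systemFutures = sequence perLife
  -- == [ ["book1:Borrowed","member7:Member","res3:Waiting"]
  --    , ["book1:Borrowed","member7:Member","res3:OnHold"] ]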
2 Interacting with a Life Cycle System
The previous section introduced our life cycle concept. In this section we demonstrate how commonly requested information is available in a life cycle system. We describe the demonstrations in this example in our semantics language Miranda, to achieve sufficient precision, also outside the boundaries of our life cycles.
2.1 A Component with a Book LifeCycle
The book life cycle in figure 7 is translated into the formal notation by introducing a number of transitions with sets of events on them, and building the book life cycle from it:

book :: lifecycle
book = ("book", [nirwana, "Available", "Borrowed"], [bookT1, bookT2, bookT3, bookT4])
  where
  bookT1 = (nirwana, "Available",
            [("acq", [(Out, Unbound this), (In, Unbound "t")]),
             ("avail", [(Out, Unbound this), (Out, Unbound "t")])])
  bookT2 = . . .

It is possible to apply event occurrences, including things like members joining the library, to this system (or component) of one life cycle; it simply ignores such event occurrences because no (future) transition of the book communicates with a join event. This is useful, since it means that an additional life cycle (or component) that does take the join event occurrence into account can process it. The book does not care about joins, but is not blocking them either; this is the idea of compositionality behind life cycles. Assuming a library with a book life cycle present (and no reservation life cycle), it is possible to acquire a new book with event occurrences like acq:

acq, ckout, return :: lifeID → lifeID → sys → [sys]
discard :: lifeID → sys → [sys]
acq b t = step [("acq", [(Out, Bound b), (Out, Bound t)]),
                ("avail", [(Out, Bound b), (Out, Bound t)])]
ckout b m = . . .
return b t = . . .
discard b = . . .

The acq event occurrence matches the bookT2 transition in the book life cycle when the right values for b and m are filled in. Assume a library system with no instances: ([book], []). Applying acq 1 3 to this system returns a list with one future library system, in which a book with lifeID 1 and title 3 is registered in state In. Applying ckout 1 7 to this result leads to a library system with that same book in Borrowed state.

exlib 0 = ([book], [])
[exlib 1] = acq 1 3 exlib 0
[exlib 2] = ckout 1 7 exlib 1
nolibs = discard 1 exlib 2
[exlib 3] = return 1 3 exlib 2
[exlib 4] = discard 1 exlib 3

It is impossible to discard that book from the resulting library exlib 2; it should first be returned. This is the kind of synchronisation constraint that we strive for with life cycles, and which is inspired by the communication primitives in process algebras. Note how the step function for the ckout and acq functions is used in the same way; the fact that acq constructs a new object is not visible here, meaning that we succeeded in hiding the creation (and destruction) of objects entirely.
2.2 Deriving Association Information
In the introduction of this paper, we argued that associations contain too little temporal information, and often require further constraints. In this subsection, we demonstrate how association information can be obtained from a life cycle system. The vagueness of associations means that several definitions are thinkable; for example, in which states of the ‘from’ object (or life) the association is defined. We introduce the following helper function to define such associations:

assoc :: lifecycle → [state] → var → sys → [(lifeID, lifeID)]
assoc lc ss v (lcs, ls) = [(f, t) | (lc′, f, s, b) ← ls; lc = lc′; s ∈ ss; (v′, t) ← b; v = v′]
Given a life cycle lc, the states ss in which the association should hold, and a variable v in lc (which must be defined in all the states ss) that points to the ‘to’ object (or life), this function returns a list of links (association instances) between ‘from’ and ‘to’. An example application of this function is the lentby association from the introduction:

lentby :: sys → [(lifeID, lifeID)]
lentby = assoc book ["Borrowed"] "m"

For example, lentby exlib 1 = lentby exlib 3 = [] but lentby exlib 2 = [(1, 7)], denoting that in exlib 2 a book 1 is Borrowed by member 7. Note how life cycle models need no nil values to model absence of links; instead, process state information is used to say which variables are defined.
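A Haskell counterpart of this derivation (our sketch, with simplified types that are not the paper's) collects one link per life of the requested cycle that is in one of the given states and has the variable bound.

  type LifeId = Int
  type Var    = String
  type State  = String

  data Life = Life { cycleOf :: String, lifeIdOf :: LifeId
                   , current :: State, binding :: [(Var, LifeId)] }

  type Sys = ([String], [Life])

  assoc :: String -> [State] -> Var -> Sys -> [(LifeId, LifeId)]
  assoc lc ss v (_, ls) =
    [ (lifeIdOf l, target)
    | l <- ls, cycleOf l == lc, current l `elem` ss
    , Just target <- [lookup v (binding l)] ]

  lentby :: Sys -> [(LifeId, LifeId)]
  lentby = assoc "book" ["Borrowed"] "m"

  -- On a system where book 1 is Borrowed with binding [("m",7)], lentby yields [(1,7)].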
2.3 Global Process Analysis
Unlike UML state charts, our life cycles communicate in a precisely defined way. The semantics of life cycles thus expresses sufficient information to infer global system behaviour. Consequently, we see possibilities to perform a number of interesting global process checks on a system of life cycles. We intend to perform research on these process checks [18], in spite of the unbounded number of lifes that can be created.
Deadlock avoidance. One way to avoid deadlock is to ensure the following: given a life cycle system s ∈ S, there must be a trace of event occurrences that leads from s to a system without lifes. The smallest set S to avoid deadlock contains precisely those states that are reachable from a life cycle system without lifes. If a deadlock avoiding checker accepts a life cycle system, an important class of problems has been avoided; furthermore, in the attempts to come to such a system design, the checker can provide valuable feedback to the designer.
Referential integrity. In systems that deal with pointers (like variables in our life cycles), it is important to ensure referential integrity. In systems that use nil pointer values, it is desirable to avoid the possibility of runtime exceptions caused by nil dereferences. In general, these problems are hard to solve before runtime. For life cycles, this is simpler, since they specify global process knowledge more directly. We intend to make a checker that ensures the following: if a life l is in Nirwana state, it is not referenced anywhere. In other words, in Nirwana state it is safe to delete a life. This resolves the debate [22] on whether to garbage collect or to explicitly delete, since the moment of deletion is precisely known.
Non-determinism. We explicitly allow non-determinism caused by more than one transition from the same life cycle state matching the same event occurrence. This is because non-determinism has a useful interpretation: something
outside the current scope of reference (e.g. some data-related issue) decides on the transition to follow. Checker tools must treat non-determinism as a choice that they cannot influence or count on. In our previous work on protocol checking [17], we used non-determinism in a similar way, namely to model choices made during the execution of an atomic action.
3 Conclusions and Future Work
This paper introduced a variation on state diagrams called (dynamic) life cycles. We presented a graphical notation and a precise semantics. Our life cycles present some features that we did not find in comparable work: life cycles support polymorphic creation and deletion of new instances as a side-effect of normal steps. Life cycle systems are straightforwardly composed into larger systems. Life cycles can express dynamically changing synchronisation constraints. Life cycles allow redundancy and non-determinism in models. Life cycles can express multiple perspectives on events as a means of dynamic binding; we expect this to be a good replacement for hierarchical state charts, whose precise meaning is not trivial. Life cycles contain more information than traditional models. Associations can be inferred from life cycles. Information on which steps can be performed next, and which instances can act as parameters to those steps, can be inferred from life cycles. Finally, life cycles describe how the process aspects of a total system behave, which makes us believe that several global correctness properties become testable; think of deadlock avoidance and referential integrity.
In our future work, we hope to perform such global correctness checks in spite of the unbounded number of lifes in a life cycle system. We intend to gain a better understanding of how processes in workflow and implementation objects cooperate. We also wish to investigate the use of refinement in workflow models. We are also interested in relating this work to models like CSP and LOTOS to see if life cycles introduce new formal aspects (possibly, dynamically evolving communication alphabets). Our eventual goal with this work is to come to a simple yet powerful process notion and accompanying verification tools for object oriented models.
Acknowledgements. I wish to thank Maarten Fokkinga for providing constant feedback and input on the ideas behind life cycles. I wish to thank Compuware, in particular Wim Bast, Tom Brus and Edwin Hautus, for providing the challenging practical environment in which this work evolved.
References
1. J.C.M. Baeten and W.P. Weijland. Process Algebra. Cambridge University Press, 1990.
2. M. von der Beeck. A comparison of Statecharts variants. In H. Langmaack, W.P. de Roever, and J. Vytopil, editors, Formal Techniques in Real-Time and Fault-Tolerant Systems, pages 128–148. Springer, 1994. Lecture Notes in Computer Science 863.
3. R. Bird and P. Wadler. Introduction to Functional Programming. Prentice Hall, 1988.
4. T. Bolognesi and E. Brinksma. Introduction to the ISO specification language LOTOS. Computer Networks and ISDN Systems, 14(1):25–29, 1988.
5. D. D'Souza and A. Wills. Catalysis: Practical Rigor and Refinement. Addison-Wesley, 1998.
6. P.H.J. van Eijk, C.A. Vissers, and M. Diaz, editors. The Formal Description Technique LOTOS. Elsevier Science Publishers B.V., 1989.
7. R. Elmasri and S.B. Navathe. Fundamentals of Database Systems. Benjamin/Cummings, 1994.
8. E. Gamma, R. Helm, R. Johnson, and J. Vlissides. Design Patterns: Elements of Reusable Object-Oriented Software. Addison Wesley, Reading, 1996.
9. D. Harel. Statecharts: A visual formalism for complex systems. Science of Computer Programming, 8:231–274, 1987.
10. C.A.R. Hoare. Communicating Sequential Processes. Prentice Hall, 1985.
11. ISO, editor. Ada 95 Reference Manual, Language and Standard Libraries. 1994. ISO/IEC 8652:1995(E).
12. D. Latella, I. Majzik, and M. Massink. Towards a formal operational semantics of UML statechart diagrams. Formal Methods for Open Object-Based Distributed Systems, pages 331–347, 1999.
13. Inmos Ltd. Occam 2 Reference Manual. Prentice-Hall, 1988.
14. R. Milner. A Calculus of Communicating Systems. LNCS. Springer-Verlag, 1980.
15. R. Milner. The polyadic π-calculus: a tutorial. Springer-Verlag, Aug 1991.
16. Rational Software Corporation. UML Semantics. Rational Software Corporation, 1997.
17. R. van Rein and M. Fokkinga. Protocol assuring universal language. Formal Methods for Open Object-Based Distributed Systems, pages 241–258, 1999.
18. R. van Rein and M.M. Fokkinga. Static checking of dynamic protocols. 1999. Submitted to CONCUR'99.
19. B. Selic, G. Geullekson, and P.T. Ward. Real-Time Object-Oriented Modeling. John Wiley & Sons, Inc., 1994.
20. D. Taubner and W. Vogler. Step failures semantics and a complete proof system. Acta Informatica, 27:125–156, 1989.
21. X/Open, editor. Distributed Transaction Processing: Reference Model, Version 3. Feb 1996.
22. B. Zorn. The measured cost of conservative garbage collection. Software, Practice and Experience, 23(7):733–756, July 1993.
Design Principles for Application Integration Paul Johannesson and Erik Perjons Department of Information and Systems Sciences, Stockholm University/Royal Institute of Technology, Electrum 230, 164 40 Kista, Sweden {pajo, perjons}@dsv.su.se
Abstract. Application integration is a major trend in information technology today. In this paper we present a number of principles for the design, validation and presentation of process models which align the applications of an organisation to its business processes. The design principles are divided into two groups. The first group consists of guidelines for obtaining different views of the models, thereby making it easier for different stakeholders, e.g. business managers, designers and operators, to use common models and process languages. The second group of principles consists of guidelines for checking the completeness of the process models. The paper also presents a process description language, BML (Business Model Language), which is tailored for modelling application integration.
1 Background
Three of the major trends in information technology today are the Web, enterprise software packages, and application integration. The Web provides an environment that can link a company's customers, suppliers, partners, and its internal users. Enterprise software packages offer an integrated environment to support business processes across the functional divisions in organisations. Some packages, like enterprise resource planning (ERP), for example SAP R/3 and BaanERP, manage back-office requirements, while other packages provide front-office capabilities, e.g. customer services. Common to Web applications as well as enterprise software packages is the need for application integration. Application integration is required to connect front office systems with back office systems, to transfer business processes to the Web, and to create extended supply chains involving customers, partners, and suppliers. Application integration is also needed for wrapping legacy systems and for migrating to new environments. The demand for application integration is also fuelled by the move to process orientation in many organisations. Traditionally, organisations have been functionally divided, i.e. companies have been separated into departments such as market, production, and service. However, the functional organisation has been shown to have a number of weaknesses. In particular, it requires a huge administration to handle issues crossing functional borders, and considerable resources are allocated to tasks that do not create value. In order to overcome the problems of a functional organisation, companies have been concentrating on business processes, i.e. the set of related activities that create value for the customers. These processes cross the
Fig. 1. The traditional function oriented structure, to the left, with the “stovepipe” like relation between business functions and IT systems. To the right the process oriented organisation which requires an integration of the IT systems.
internal borders of an organisation and also sometimes the external borders to other organisations, [6], [22], [27]. Supporting cross-functional processes raises new demands on the IT systems or applications. Traditionally, the applications have been built around departments or functions in the companies. The result has been a “stovepipe” like relation between the functions and the applications, where every function in the company is supported by its own IT-system or applications. This architecture is not satisfactory for process oriented organisations; to support the business processes in full the applications must be integrated, [18], see Fig. 1. To handle this integration of applications in an efficient way, technologies, tools, and methodologies are required. One main technology is the Process Brokers, also called Process Management Systems, which aim at aligning the applications of an organisation to its business processes. A Process Broker provides an integrated, graphical environment in which all process logic for connecting applications can be encapsulated. The Process Broker enables users to visualise, construct, analyse, simulate and execute processes for application integration. Utilising the Process Broker technology for application integration is a complex design activity. Therefore, it requires adequate methodological support so that well-structured and easily understandable models can be produced. The purpose of this paper is to contribute to such methodological support by introducing a number of principles for the design, validation, and presentation of process models aligning the applications of an organisation to its business processes. The paper is a result of a joint project between the Royal Institute of Technology and Viewlocity, which aims at further developing technology, methods and Viewlocity’s modelling language BML for application and process integration, [20], [27]. The remainder of the paper is organised as follows. Section 2 provides a brief overview of different architectures for application integration, in particular the Process Broker architecture. The section also discusses characteristics and problems of application integration. Section 3 describes related research about process modelling languages, and in Section 4 we describe a process modelling language BML (Business Model Language) which is used in the remainder of the paper. A classification of messages and processes, which is the base of the proposed design principles, is described in Section 5. In Section 6, we present our design principles with modelling examples. Finally, in Section 7, we summarise the paper and give suggestions for further research.
2 Architecture and Problems 2.1 Architectures for Application Integration Integration of applications can be supported by many different architectures. One architecture for integrating applications is the point-to-point solution where every application is directly connected to every other application, see Fig. 2 (left). This solution could work for a small number of applications, but as the number of applications increases, the number of connections quickly becomes overwhelming. The Message Broker architecture reduces this complexity, see Fig. 2 (middle). The main idea is to reduce the number of interfaces by introducing a central Message Broker and thereby make it easier to support the interfaces. If one of the applications changes format, only one connection has to be changed: the one to the Message Broker, [26].
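To make the scale argument concrete, the following minimal sketch (not taken from the paper) compares the number of interfaces that must be built and maintained under the two architectures: with n applications, the point-to-point solution needs on the order of n(n-1)/2 connections, whereas the Message Broker needs only n.

```python
# Illustrative sketch: interface counts under the two integration architectures.

def point_to_point_interfaces(n_applications: int) -> int:
    """Every application is connected directly to every other application."""
    return n_applications * (n_applications - 1) // 2

def broker_interfaces(n_applications: int) -> int:
    """Every application is connected only to the central Message Broker."""
    return n_applications

for n in (4, 10, 50):
    print(n, point_to_point_interfaces(n), broker_interfaces(n))
# 4 applications:  6 point-to-point interfaces vs. 4 broker interfaces
# 10 applications: 45 vs. 10
# 50 applications: 1225 vs. 50
```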
Fig. 2. The point-to-point strategy to integrate applications, to the left. In the middle the Message Broker architecture, which reduces the number of interfaces. To the right the Process Broker, which collects all process logic in the Broker.
The Process Broker, see Fig. 2 (right), is an extension of the Message Broker. In addition to handling format conversions, the Process Broker also encapsulates the process logic for connecting applications. When all process logic resides in one place, it becomes possible to study, analyse, and change the processes using a graphical interface, [5], [15], [27]. This visualisation reduces the complexity and enables different categories of people to take part in the process design. 2.2. Problems in Application Integration The Process Broker technology requires methodological support so that designers can construct models that align the applications to the business processes. Experiences of a real world modelling study at a European telecommunication company demonstrate several problems when modelling application integration. The telecom company intended to handle an integration of applications by means of a Process Broker, which aimed at facilitating the complex interaction between administrative and technical applications. The problems we noticed can be summarised as follows:
Unstructured and complex models. Application integration often results in highly unstructured and complex models. One reason for this is that exception handling, which describes what to do when applications and human users do not respond in an expected way, makes up a large part of an application integration specification and thereby easily obscures the main business logic. Furthermore, there is often extensive communication between the Process Broker and different applications, which also tends to conceal the essential business logic.
Redundancy. The Process Broker does not maintain control over external applications, which means that these applications can be updated without the Broker being notified. As a consequence, it is often desirable to maintain redundant information that duplicates parts of the information in the external applications. This redundant information enables the Process Broker to maintain a complete, correct, and easily available record of its interactions with customers. However, this duplication of information requires mechanisms for handling possible inconsistencies.
Incomplete models. Since models for application integration tend to become large and complex, there is a risk that designers overlook parts of the models that are needed to maintain completeness and consistency.
Communication among stakeholders. It is possible to distinguish among four kinds of stakeholders: domain experts such as business managers, owners of external applications, business and technical designers, and operators that handle the day-to-day operations. Different stakeholders require different views of the system, while at the same time they need to be able to communicate with each other using common models and languages.
In the rest of the paper, we try to address these problems by proposing a set of design principles for application integration. We also introduce a process language called BML and argue that it facilitates communication between stakeholders. Table 1 summarises how the problems identified are addressed.
Table 1. The sections of the paper in which each identified problem is addressed.
Problems                          | Section 4: Choice of language: BML | Section 5.2: Process Classification | Section 6.1: View guidelines | Section 6.2: Completeness Guidelines
Unstructured and complex models   |                                    | X                                   | X                            |
Redundancy                        |                                    | X                                   |                              |
Incomplete models                 |                                    |                                     |                              | X
Communication among stakeholders  | X                                  |                                     | X                            |
3 Related Work In the beginning of the 90’s process orientation became one of the most important trends in management practice as well as research. Authors such as Hammer and Davenport, [6], [14], advocated a radical change from a functional perspective to a process focussed perspective in order to improve customer satisfaction, shorten lead times, increase productivity, and handle technological development. Initially, process
orientation achieved most attention in the manufacturing discipline, but in recent years it has also gained prominence in the information systems community, [12]. A number of languages and methodologies for process specification and design have been proposed. Many of these languages are based on Petri nets, [21], e.g. UML activity diagrams. A distinction can be drawn between activity oriented and communication oriented process languages. An activity oriented process language, e.g. UML activity diagrams or EPC, [23], is intended for handling arbitrary processes including material processes involving physical actions. Therefore, the activity oriented language diagrams usually represent a mix of automated and manual actions. A communication oriented language, on the other hand, focuses on communicative processes describing the interaction between people and systems in terms of sending and receiving messages, which provides an opportunity to support the communication by means of information technology. Communication oriented languages have been heavily influenced by speech act theory, [24]. One of the first systems based on a communication and speech act oriented approach was the Coordinator, developed by Winograd and Flores, [25], which supported the communication process in the work place. The idea of applying a speech act based approach to information systems analysis and design was also employed by the SAMPO (Speech-Act-based office Modelling aPprOach) project in the middle of the eighties, [3]. These ideas were further developed in recent work on Business Action Theory, [13], [17], and in the DEMO (Dynamic Essential Modelling of Organisations) approach, [7], [8]. We believe that a communication oriented approach is particularly suitable for application integration and Process Brokers, as application integration basically consists of the interchange of messages between systems and people. A technology related to the Process Broker is the Workflow Management System, [16]. The first generation of Workflow systems, during the 80’s and the early 90’s, was supporting communication between people, concentrating on document routing. The next generation of Workflow technologies put the business processes in focus. By also involving automatic actors, the automation of the processes could be facilitated further, [18]. The next step for the Workflow systems should be to implement enterprise wide workflow solutions and provide tools for managing the processes themselves. This process management includes process modelling, process reengineering as well as process implementation and automation, [11], [27]. The Process Broker can be seen as this next step to process management, by providing modelling and simulation capabilities, but the Process Broker also enables rapid modifications of the business processes thanks to its flexible way of handling application integration. In the marketplace, a new breed of middleware technologies focused on enterprise application integration has emerged. Message Broker vendors provide functionality that enable applications to interoperate with a minimum of custom coding. Some of the major products here are Viewlocity’s AMTrix and SmartSync, [1], IBM’s MQSeries Integrator, [19], and Entire Broker from Software AG, [9]. Several of the Message Broker vendors are adding process modelling and simulation capabilities to their products, thereby moving into the Process Broker market. 
Some of the major products in this market are: Viewlocity’s SmartSync Model Manager [27], Extricity Software’s AllianceSeries [10], Vitria Technology’s BusinessWare [2], and HP’s Changeengine [15].
4 BML – A Language for Application Integration To visualise the application integration there is a need for a process description language. This section briefly introduces such a language, BML (Business Model Language), which is developed by Viewlocity, [27]. The language has similarities to SDL (Specification and Description Language), [4], but is more adapted to application integration. BML is a communication oriented process language, see Section 3, which means that it focuses on describing interaction between systems through the sending and receiving of messages. This makes the language suitable for application integration and Process Brokers. Another important advantage of BML is that the language can be used for the business specification and design as well as operation of systems. This means that the same language can be used in different phases of a system’s life cycle: in feasibility analysis, in requirements specification, in the design and implementation phases, and even in the operation phase. This enables different categories of stakeholders to use the same language for different purposes. The language can also be used directly as an implementation language and to some extent replace ordinary programming language. Further advantages with BML are its capability to describe and partition the interaction and interfaces between processes that work concurrently. Concurrency is common in application integration, when for example several applications are to be updated in parallel. The possibility of partitioning in BML reduces the complexity of handling large systems, through creating manageable and understandable parts with limited dependencies. BML can describe the structure as well as the behaviour of a system by using two kinds of graphical diagrams. The structure of the system is visualised by a static diagram, see Fig. 3, which describes the processes in a static mode. The static diagram describes the messages sent between the processes and between the processes and the environment, i. e. the external applications and people. The dynamic behaviour of a system is described by using process diagrams, see Fig. 4. These diagrams can be seen as templates, visualising the order in which the messages shall be sent and received. For each process diagram there is a number of process instances, that are created during runtime. The process instances execute independently of each other, but can communicate by sending and receiving messages asynchronously. Each instance has an input queue, see down to the left in Fig. 4, where received messages are stored. A process instance can either be waiting in a stable state or perform a transition from one state to another. A transition is initiated when a message in the input queue is chosen and consumed. Following the example in Fig. 4, the process instance starts in a Start state (circle without a name). Only the messages m1 from process a and m2 from process c can initiate a transition. The message m1 is first in the queue and is therefore consumed, and the process instance performs a transition to the state Wait for Event 1. During the transition a message m3 is sent to process c. Thereafter the message m9 is first in the queue. Since only message m5 can initiate a further transition from Wait for Event 1, the message m9 is discarded. The next message in the queue is then m5, which can initiate a transition from Wait for Event 1 to some other Wait for Event state (not specified in the example). 
During the transition data can be manipulated, decisions can be made, new process instances can be created and messages can be sent to other process instances or to the process instance itself.
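As a rough illustration of the execution semantics just described, the following sketch (our own construction, not part of BML or Viewlocity's tooling) models a process instance as a state machine with an input queue: the first queued message that can trigger a transition from the current state is consumed, messages that cannot are discarded, and a transition may send messages to other instances. The transition table shown for process b is hypothetical and only mirrors the example above.

```python
# Minimal sketch of a BML-like process instance: a state machine with an
# asynchronous input queue, mirroring the consume/discard behaviour above.
from collections import deque

class ProcessInstance:
    def __init__(self, name, transitions, start_state="Start"):
        # transitions: {(state, message): (next_state, [(receiver, message), ...])}
        self.name = name
        self.transitions = transitions
        self.state = start_state
        self.queue = deque()

    def receive(self, message):
        """Messages arrive asynchronously and are stored in the input queue."""
        self.queue.append(message)

    def step(self, network):
        """Consume one message; messages that cannot fire a transition are discarded."""
        while self.queue:
            msg = self.queue.popleft()
            key = (self.state, msg)
            if key not in self.transitions:
                continue  # discarded, as with m9 in the example above
            next_state, sends = self.transitions[key]
            for receiver, out_msg in sends:
                network[receiver].receive(out_msg)
            self.state = next_state
            return msg
        return None

# Hypothetical transition table loosely following "process b" in the example.
process_b = ProcessInstance("process b", {
    ("Start", "m1"): ("Wait for Event 1", [("process c", "m3")]),
    ("Wait for Event 1", "m5"): ("Wait for Event 2", []),
})
process_c = ProcessInstance("process c", {})
network = {"process b": process_b, "process c": process_c}

for m in ("m1", "m9", "m5"):
    process_b.receive(m)
process_b.step(network)   # consumes m1, sends m3 to process c
process_b.step(network)   # discards m9, then consumes m5
```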
Fig. 3. The static diagram in BML visualises the structure of the processes in a system.
Fig. 4. A process instance with the input queue.
An important feature of BML is also the data model, which is not used in the examples of this paper. Each process diagram has a data model that describes the structure, type and meaning of the data that is handled in the diagram. The data model also describes the structure of the data in the different messages. The main BML symbols are the following, see also Fig. 5: Wait for Event and Start. The process instance is waiting in the Wait for Event state until a message is received or a timer has expired. A Wait for Event symbol without a name is the starting state. End. Describes the end of the flow of the process instance. Receive Message. Describes the consumption of a message from the input queue.
Send Message. Describes the sending of a message. Automated Business Activity. Describes operations that will be performed on the process instance. Automated Business Decision. The control flow is dynamically changed depending on different business rules. Start Timer and Expire Timer. In application integration, the notion of time is important and timers occur frequently to obtain delays and supervision. When a timer is started it will be provided with a timeout value. The starting is represented by an hourglass “full of time”, and the timeouts by an hourglass “out of time”. Application and Human actor. Both are symbols of external actors.
Fig. 5. Symbols used in BML
5 Classifications of Messages and Processes 5.1 A Classification of Messages In this section, we introduce a classification of messages as a basis for formulating the design principles in Section 6. The classification is based on speech act theory. However, it is our experience that users often find it difficult to classify messages directly according to the basic speech act types. Therefore, we introduce a set of message types which are more closely related to the messages typically found in application integration. The study of speech acts has been an active research area in analytical philosophy since World War II, and the most influential approach to date is speech act theory as developed by John Searle, [24]. Searle proposes a taxonomy for speech acts consisting of five classes: assertives, commissives, directives, declaratives, and expressives. These are also called the illocutionary points of a speech act. An assertive is a speech act, the purpose of which is to convey information about some state of affairs of the world from one agent, the speaker, to another, the hearer. An example is “It is raining”. A commissive is a speech act, the purpose of which is to commit the speaker to carry out some action or to bring about some state of affairs. An example is “I promise to be at home before nine o’clock”. A directive is a speech
act, where the speaker requests the hearer to carry out some action or to bring about some state of affairs. An example is “Please bring me the salt”. A declarative is a speech act, where the speaker brings about some state of affairs by the mere performance of the speech act. An example is “I hereby pronounce you husband and wife”. An expressive is a speech act, the purpose of which is to express the speaker’s attitude about some state of affairs. An example is “I like coffee”. Based on Searle’s classification of speech acts, we give a list of message types below which frequently occur in application integration. The message types are requests for information and services and the responses to these requests. We also identify messages for reserving, booking, and releasing resources. The difference between reserving and booking is that reserving is a preliminary stage to booking. A reservation could either be followed by a booking or a cancelation of the reserved resource. The distinction is important if the system automatically should cancel reserved resources that not have been booked after a certain time. For example, a person can reserve several telephone numbers for a certain time, so that he or she can choose to book one or more numbers from them. The reserved numbers which are not booked after a time limit are automatically released so that other persons can reserve or book the numbers. The message types are the following: Information request. An information request is a directive speech act in which the sender asks the receiver for a piece of information. Example: What is the telephone number to the help desk? Service request. A service request is a directive speech act in which the sender asks the receiver to carry out a service. Typical examples of services are to deliver a product, get an authorisation, and booking a resource. In contrast to an information request, a service request does not ask for information about a state of affairs – it requires a state of affairs to be changed. Example: Provide me with a new telephone subscription. Reservation request. A reservation request is a special service request in which the sender asks the receiver to reserve a resource for a period of time, meaning that the resource cannot be reserved or booked by anyone else during this period of time. Example: Reserve five different telephone numbers (which the customer can choose from). Booking request. A booking request is a special service request in which the sender asks the receiver to make a resource available for the sender. Example: Book the telephone number that the customer has chosen. Information confirmation. An information confirmation is an assertive speech act in which the sender, in response to an information request, provides the receiver with a piece of information. Example: The telephone number to the help desk is 07-70 70 70. Service confirmation. A service confirmation is an assertive speech act in which the sender, in response to a service request, informs the receiver that the required service has been carried out. Example: You have been provided with a new telephone subscription.
Reservation confirmation. A reservation confirmation is a special service confirmation in which the sender, in response to a reservation request, informs the receiver that the required reservation has been made. Example: The telephone number is reserved (until the customer has chosen to book the number or a certain time limit has passed). Booking confirmation. A booking confirmation is a special service confirmation in which the sender, in response to a booking request, informs the receiver that the required booking has been made. Example: The telephone number is booked. Service promise. A service promise is a commissive speech act in which the sender, in response to a service request, commits itself to carry out the required service. Example: The delivery department promises to send the ordered telephone. Notification. A notification is an assertive speech act in which the sender informs the receiver about the changes of some state of affairs. Example: The customer has started to use the subscription. Cancel reservation. A cancel reservation is a directive speech act in which the sender asks the receiver to cancel a previous reservation. Example: Release a reserved number. Cancel booking. A cancel booking is a directive speech act in which the sender asks the receiver to cancel a previous booking. Example: Release a booked number. 5.2 A Classification of Processes In this section, we introduce a classification of processes. The purpose is to support the designer in building well-structured and easily understandable application integration models. The classification identifies types of processes which are largely independent of each other and which can be combined with clear and simple interfaces. This makes it possible to partition a system of processes into manageable and understandable parts. A starting point of the classification is the customer, an actor for whom a process is to create value. By emphasising the customer, we can distinguish between customer oriented processes that directly interact with the customer, processes that support the customer oriented processes in various ways, and processes that manage more technical and informational aspects. The classification identifies the following types of processes: Customer process. A customer process focuses on the interaction with the customer. A customer process may contain messages to or from a customer or another process types, but not to and from external actors, i.e. applications or people (except the customer). The purpose of a customer process is to show the business logic from the customer’s point of view. Interface process. An interface process handles the interaction with the external applications or people (except the customer). An interface process may contain messages to and from all other types of processes as well as to and from external applications and people. An interface process interacts with exactly one external application or person. The purpose of the interface processes is to insulate the
interfaces of external applications from the main business logic. For example, when the format of messages sent from an external application changes, only the data model in the interface process has to be modified while the customer processes can be left untouched. There are two subtypes of interface processes: Request process. A request process handles information or service requests from other processes. Release process. A release process handles cancel reservations or cancel bookings from other processes. Synchronisation process. A synchronisation process synchronises a number of interface processes. It may contain messages to and from different types of processes, but not from external applications and people. The purpose of a synchronisation process is to encapsulate a piece of business logic – typically a synchronisation process takes care of a request from a customer process by invoking and synchronising a number of interface processes. Maintenance process. A maintenance process handles the internal storage of information that duplicates the information in external applications, see Redundancy in Section 2.2. There are two subtypes of maintenance processes: Update process. An update process takes care of a notification from a customer or synchronisation process and stores the information carried by the notification. Consistency process. A consistency process is a process that checks whether there is any inconsistency between internally stored information and information in external applications. A consistency process also takes appropriate action when an inconsistency is detected. A typical structure of a model using the process classification is shown in Fig. 11. The customer process contains the main business logic with respect to the customer. It interacts with synchronisation processes and interface processes in order to take care of the customer’s requests. It also sends notifications about the customer interaction to maintenance processes, not shown in Fig. 11. We believe that this structure supports flexibility, stability, and understandability by separating main business logic from more technical and informational aspects. By using the design principles of Section 6, designer will automatically arrive at an application integration model with the proposed structure.
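The two classifications of Section 5 can be summarised as simple enumerations. The sketch below uses our own naming, not an API from the paper; the speech-act class recorded for each message type follows the text above.

```python
# A sketch of the Section 5 classifications: message types with their
# speech-act class, and the process types including their subtypes.
from enum import Enum, auto

class SpeechAct(Enum):
    ASSERTIVE = auto()
    COMMISSIVE = auto()
    DIRECTIVE = auto()
    DECLARATIVE = auto()
    EXPRESSIVE = auto()

class MessageType(Enum):
    INFORMATION_REQUEST = auto()
    SERVICE_REQUEST = auto()
    RESERVATION_REQUEST = auto()
    BOOKING_REQUEST = auto()
    INFORMATION_CONFIRMATION = auto()
    SERVICE_CONFIRMATION = auto()
    RESERVATION_CONFIRMATION = auto()
    BOOKING_CONFIRMATION = auto()
    SERVICE_PROMISE = auto()
    NOTIFICATION = auto()
    CANCEL_RESERVATION = auto()
    CANCEL_BOOKING = auto()

# Speech-act class of each message type, as stated in the text above.
SPEECH_ACT = {
    MessageType.INFORMATION_REQUEST: SpeechAct.DIRECTIVE,
    MessageType.SERVICE_REQUEST: SpeechAct.DIRECTIVE,
    MessageType.RESERVATION_REQUEST: SpeechAct.DIRECTIVE,
    MessageType.BOOKING_REQUEST: SpeechAct.DIRECTIVE,
    MessageType.INFORMATION_CONFIRMATION: SpeechAct.ASSERTIVE,
    MessageType.SERVICE_CONFIRMATION: SpeechAct.ASSERTIVE,
    MessageType.RESERVATION_CONFIRMATION: SpeechAct.ASSERTIVE,
    MessageType.BOOKING_CONFIRMATION: SpeechAct.ASSERTIVE,
    MessageType.SERVICE_PROMISE: SpeechAct.COMMISSIVE,
    MessageType.NOTIFICATION: SpeechAct.ASSERTIVE,
    MessageType.CANCEL_RESERVATION: SpeechAct.DIRECTIVE,
    MessageType.CANCEL_BOOKING: SpeechAct.DIRECTIVE,
}

class ProcessType(Enum):
    CUSTOMER = auto()
    INTERFACE_REQUEST = auto()        # interface subtype: request process
    INTERFACE_RELEASE = auto()        # interface subtype: release process
    SYNCHRONISATION = auto()
    MAINTENANCE_UPDATE = auto()       # maintenance subtype: update process
    MAINTENANCE_CONSISTENCY = auto()  # maintenance subtype: consistency process
```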
6 Design Principles In this section, we introduce a number of design principles in the form of guidelines for the design, validation, and presentation of application integration models. These guidelines are divided into two groups. The first group consists of guidelines to obtain different views of process models, while the second group consists of guidelines to check the completeness of process diagrams. The main idea behind the guidelines in the first group, the view guidelines, is to obtain a series of views of processes starting with a customer oriented view on the business level. This first view means that the Process Broker can be seen as a mediator between the customer, i.e. the one for whom value is to be created, and a set
of applications and people, see Fig. 6. Note that the customer does not communicate directly with the applications and other people, but only through the Process Broker.
Fig. 6. The customer does not communicate directly with the applications and other people, but only through the Process Broker.
The succeeding views add more and more details moving from a business perspective to a more technical perspective. Each view is an extension of the previous one, either through adding subprocesses or through introducing new components into the existing diagrams. Note that the Process Broker contains all the modelled processes in the example, i. e. the Process Broker is the internal system, while the external system is the one represented by applications and people. The purpose of the guidelines in the second group, the completeness guidelines, is to support the designer in creating processes that include complete discourse structures and not only fragments. In particular, the completeness guidelines can be used to ensure that requests are always handled in an appropriate way, and that outstanding bookings and reservations are taken care of in cases of exception. 6.1 View Guidelines In this section, we present a number of views supporting a top-down approach. Each view is illustrated by means of a telephony case, in which a customer wants to order a subscription. View 1. Customer interaction. This view models the interactions between the Process Broker and the customer, i.e. the messages exchanged by the customer and the Broker as well as the flow of control. In this view, there is only one process diagram. The first tasks of the designer in this view are to clarify how the process is initiated, what messages the customer sends to the Process Broker, and what messages the Process Broker sends to the customer. Based on this information, the designer constructs a static diagram, see Fig. 7. The corresponding process diagram is shown in Fig. 8. The Order subscription process is initiated when it receives a message called Order initiated from the customer. Furthermore, the customer asks for a number of telephone number suggestions, Request number proposals. The answer from the Process Broker, is the message Number proposals. The customer can now
choose one of the numbers or ask for further telephone number suggestions if he or she is not satisfied with those suggested. The message received from the customer, Customer response, is therefore evaluated in a decision point, Nr chosen. If the customer has not chosen a number, the process instance follows the “false” path back and the customer can ask for more numbers. If the customer has chosen a number, the order is taken care of. The Process Broker will then inform the customer about the status of the order, Reporting of the order’s state. Finally, if the order is approved, the customer will get further information about the subscription, Information delivery.
Fig. 7. The static diagram in view 1
Fig. 8. The Order subscription process (view 1).
View 2. Broker requests. This view is an extension of view 1, which describes how the Process Broker produces the messages it sends to the customer. For each Send Message symbol from the Broker in view 1, a pair of one Send Message symbol and one Receive Message symbol is added.
Fig. 9. The Order subscription process (view 2).
The first thing the designer must do when creating this view is to determine how the messages sent by the Process Broker are to be produced, which means that Send Message symbols in view 1 should be identified and analysed. Before the Send Message symbol Number proposals in Fig. 9, one Send Message symbol, Request number proposals, and one Receive Message symbol, Number proposals, are added, see symbols surrounded by the upper dotted box in Fig. 9. (Note that the dotted boxes are not part of the BML notation; they are used in the examples to help the reader identify the extensions for every new view introduced.) The Send Message and Receive Message symbols in the upper dotted box represent the messages sent to and received from a new subprocess, Number handling process. In view 2, the applications to be integrated are still not visible. They will become visible in the next view, where the introduced subprocesses are modelled. View 3. External system interaction. Each subprocess introduced in view 2 is specified here. Only information and service requests and the responses to these are included in this view.
Fig. 10. Some of the process diagrams in view 3.
This view models the interaction with the applications to be integrated. In the simplest case, a subprocess is an interface process, i.e. it communicates with exactly one application, see Section 5.2. An example of an interface process is Number handling process at the top in Fig. 10, which makes a call to one application, Nr application, which returns an answer, Number proposals. The Number handling process finally forwards the answer to the process that invoked the subprocess, i.e. Order subscription process.
In some cases, it is convenient to introduce two or more levels of subprocesses. If the subprocess to be specified requires interaction with several applications, the designer should first construct a synchronisation process, see Section 5.2. The synchronisation process Update applications process, at the bottom in Fig. 10, invokes its own subprocesses and synchronises these. These subprocesses, Update Nr application process and Update Reg application process, and their relation to the synchronisation process, Update applications process, are shown in the static diagram in Fig. 11. Each of the invoked subprocesses is an interface process and looks like the interface process Number handling process in Fig. 10.
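As an illustration of how a synchronisation process coordinates its interface processes, the following sketch (hypothetical helper functions, not the actual Update applications implementation) invokes one interface process per external application in parallel and answers "OK" only when all of them have confirmed.

```python
# A sketch of a synchronisation process coordinating two interface processes.
import concurrent.futures

def book_number(number: str) -> bool:
    """Interface process towards the Nr application (stubbed here)."""
    return True

def register_customer(customer_id: str) -> bool:
    """Interface process towards the Reg application (stubbed here)."""
    return True

def update_applications(number: str, customer_id: str) -> str:
    """Synchronisation process: collect Book nr answer and Register customer answer."""
    with concurrent.futures.ThreadPoolExecutor() as pool:
        answers = [pool.submit(book_number, number),
                   pool.submit(register_customer, customer_id)]
        all_ok = all(f.result() for f in answers)
    return "OK" if all_ok else "Not OK"   # the Order answer sent back upwards
```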
Fig. 11. The static diagram in view 3.
View 4. Exception handling. This view specifies the exception handling. For each Receive Message symbol, whose sender is an external application or a human actor, a Start timer and an Expire timer are added as well as the behaviour when the exception is raised, see dotted boxes a) and c) in Fig. 12. Views 1 – 3 specify only the normal course of events. In view 4, the designer specifies how to handle exceptions, i.e. situations where an actor has not replied to a request within a pre-specified time limit. This means that the designer first has to extend all interface process diagrams by adding Start timers and Expire timers, as well as the behaviour after a timer has expired. Furthermore, the designer may have to extend the process diagrams at a higher level, i.e. the synchronisation or the customer processes, in order to describe how to handle error messages from the interface
processes. An example of this is shown in Fig. 12, describing the interface process Number handling process which returns a message, Number Proposals=“No answer in time”, to the Order subscription process when the timer has expired. The Order subscription process then has to handle the “No answer in time” message. Note that the timer symbols can only be found in the interface processes and sometimes in the customer process. The latter situation occurs if the designer wants the system to handle situations where the customer does not answer in time.
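The timer pattern of view 4 can be sketched as follows; this is our own queue-based stand-in for BML timers, with names loosely following the Number handling example, and the send helper is a stub.

```python
# A sketch of the view 4 exception pattern: wait for the external reply,
# but only until timer T1 expires; otherwise report "No answer in time".
import queue

def send_to_nr_application(request):
    """Stub standing in for the Send Message step towards the Nr application."""
    pass

def number_handling(request, replies: "queue.Queue", timeout_seconds: float = 5.0):
    """Interface process guarded by a timer."""
    send_to_nr_application(request)
    try:
        proposals = replies.get(timeout=timeout_seconds)      # Wait for Event 1
        return {"Number proposals": proposals}                # normal reply
    except queue.Empty:                                       # timer T1 expired
        return {"Number proposals": "No answer in time"}      # error reported upwards
```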
Fig. 12. The Number handling process (view 4).
In this view Wait for Event state symbols must also be added before every Receive Message symbol, see dotted box b) in Fig. 12. Note that before the first Receive Message symbol, Request number proposal, there is already a state symbol, the Start symbol. The Wait for Event and Start symbols are described in section 4. View 5. Resource releasing. This view adds all messages of the type Cancel reservation and Cancel booking. When a process cannot be completed as intended, it becomes necessary to release the resources that have been reserved or booked during the process. This releasing of resources is handled in view 5 by extending the process diagrams accordingly by introducing certain release interface processes, see Section 5.2. In Fig. 13 the messages Delete booking and Delete registration are sent to the release interface processes Delete Nr application process and Delete Reg application process, see dotted box. View 6. Notifications. This view adds all messages of the type Notification. There are two main types of situations where notifications are required. First, when an exception has occurred, it is customary to inform an operator about this event so that he or she may take appropriate action. Secondly, a notification may be sent to a maintenance process, see Section 5.2, which redundantly stores information about essential states of affairs. 6.2 Completeness Guidelines The completeness guidelines below exploit the fact that messages typically occur in certain dialogue structures. A very simple dialogue structure consists of a pair: a question followed by an answer. Another well-known dialogues structure is the
conversation for basic action, introduced by Winograd and Flores in [25], which consists of the four steps: request, negotiation, fulfilment, and acknowledgement.
Fig. 13. The Update application process (view 5).
The following guidelines are a preliminary result of our research. Further research will produce additional guidelines.
1. In a process diagram, every information request and service request should be followed by a corresponding information confirmation or service confirmation, respectively. This pair of request and confirmation is optionally followed by a notification.
2. In a process diagram, every reservation request should be followed by a corresponding booking request by the same actor.
3. In a process diagram, every reservation request should be followed by a corresponding cancel reservation. A process instance typically takes this path when some intermediate request has not been satisfied.
4. In a process diagram, every booking request should be followed by a corresponding cancel booking. A process instance typically takes this path when some intermediate request has not been satisfied.
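A mechanical check along the lines of these guidelines could look like the following sketch. It is a simplification: guidelines 2 to 4 concern alternative paths of a diagram, whereas this function checks a single path, given as a list of message types.

```python
# A sketch of a completeness check over one path of a process diagram.
FOLLOW_UPS = {
    "information request": {"information confirmation"},
    "service request": {"service confirmation"},
    "reservation request": {"booking request", "cancel reservation"},
    "booking request": {"booking confirmation", "cancel booking"},
}

def check_completeness(path):
    """Return the guideline violations found on one path of a process diagram."""
    violations = []
    for i, msg in enumerate(path):
        expected = FOLLOW_UPS.get(msg)
        if expected and not expected.intersection(path[i + 1:]):
            violations.append(f"'{msg}' is not followed by any of {sorted(expected)}")
    return violations

# A reservation that is neither booked nor cancelled is flagged:
print(check_completeness(["service request", "service confirmation",
                          "reservation request"]))
```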
7 Concluding Remarks and Further Research In this paper we have addressed methodological support for modelling the alignment of application integration with business processes. The main contribution is the set of guidelines introduced above, which can be used in several ways. First, they can be used in design.
A designer could utilise the view guidelines by first constructing a process diagram according to view 1 and then gradually refining it until a set of diagrams in view 6 is obtained. Furthermore, the designer can use the completeness guidelines to guide the design of each individual process diagram. Secondly, the guidelines can be a support for validation and verification. By checking the completeness guidelines, a designer can ensure that essential parts of discourse structures and exception handling are not omitted. Thirdly, the guidelines can be used for presentation purposes. Business oriented users can choose to see only the top view or views, while technical designers and implementers can proceed to lower views. Even for the latter category of stakeholders, the layered views can help to understand a system by allowing them to focus on an essential business perspective first and thereafter to proceed to a technical perspective. Different categories of users, for example customers, business and technical designers, have the possibility to suggest input at the right level in the modelling process. Business designers probably want to concentrate on the important parts of the business processes to clarify how they want the main business logic to work. We intend to follow up the work presented in this paper by further research which goes in several directions. One is to compare designers following the suggested view guidelines with designers who are not following them. There is also a possibility to let different kinds of stakeholders design the models. Such an empirical study could give input to refined or additional design principles. Another direction is to find further dialogue structures to produce additional completeness guidelines for validation of the models. To obtain a more complete methodology there is also a need for guidelines helping to design the data models, which describe the structure, type and meaning of the data being handled in the process. Acknowledgements This work was performed as part of the NUTEK (Swedish National Board for Industrial and Technical Development) sponsored project Process Broker [20]. The authors are grateful to Jan-Owe Halldén, Mikael Nilsson and Christer Wåhlander at Viewlocity for their valuable suggestions and knowledge. We also thank our colleagues at the Royal Institute of Technology, especially Birger Andersson, S.J. Paheerathan, Prasad Jayaweera, Nasrin Shakeri and Benkt Wangler, for commenting on earlier versions of this paper.
References
1. AMTrix. Viewlocity. URL: http://www.viewlocity.com/solutions/, 1999-11-25
2. Atwood, R.: Bringing Process Automation to Bear on the Task of Business Integration. Vitria Technology (1999). URL: http://www.vitria.com/products/whitepapers/seyboldwp.html, 1999-11-25
3. Auramäki, E., Lehtinen, E., Lyytinen, K.: A Speech Act Based Office Modelling Approach. In: ACM Transactions on Office Information Systems. Vol. 6, no. 2 (1988) 126-152
4. Belina, F., Hogrefe, D., Amardeo, S.: SDL with Applications from Protocol Specification. Carl Hanser Verlag and Prentice Hall International, UK (1991)
5. Butterworth, P.: Automating the Business Processes of Mission-Critical Distributed Applications. Forté Software (1997). URL: http://www.forte.com/product/downloads.html, 1999-10-04
6. Davenport, T.: Process Innovation: Reengineering work through information technology. Business School Press, Boston (1993)
7. Dietz, J.: Modelling Communication in Organizations. In: Riet, R. v. d. (ed): Linguistic Instruments in Knowledge Engineering. Elsevier Science Publishers (1992) 131-142
8. Dietz, J.: Business Modeling for Business Redesign. In: proceedings of the 27th Hawaii International Conference on System Sciences. IEE Computer Society Press (1994) 723-732
9. Entire Broker. Software AG. URL: http://www.softwareag.com/corporat/solutions/applintegr/default.htm#b1, 1999-11-25
10. Extricity AllianceSeries. Extricity Software. URL: http://www.extricity.com/products/alli_series_over.html, 2000-02-21
11. Georgakopoulos, D., Hornick, M.: An Overview of Workflow Management: From Process Modeling to Workflow Automation Infrastructure. In: Distributed and Parallel Databases, 3 (1995) 119-153
12. Green, P., Rosemann, M.: An Ontological Analysis of Integrated Process Modelling. In: proceedings of the 11th International Conference, CAiSE'99. Springer-Verlag, Heidelberg (1999) 225-240
13. Goldkuhl, G.: Generic Business Frameworks and Action Modelling. In: proceedings of conference Language/Action Perspective'96. Springer-Verlag (1996)
14. Hammer, M., Champy, J.: Reengineering the Corporation. A Manifesto for Business Revolution. New York (1993)
15. HP Changengine Overview. Hewlett Packard Company (1998). URL: http://www.ice.hp.com/cyc/af/00/101-0110.dir/aovm.pdf, 1999-10-04
16. Jablonski, S., Bussler, C.: Workflow Management. Thomson, London (1996)
17. Lind, M., Goldkuhl, G.: Reconstruction of Different Business Processes – A Theory and Method Driven Analysis. In: proceedings of Conference on Language/Action Perspective '97. Veldhoven (1997)
18. Makey, P. (ed): Workflow: Integrating the Enterprise. Butler Group report. Hessle (1996)
19. MQSeries Integrator, IBM. URL: http://www-4.ibm.com/software/ts/mqseries/, 1999-11-25
20. Process Broker architecture for system integration, Homepage of the Process Broker Project (1999). URL: http://www.dsv.su.se/~pajo/arrange/index.html, 1999-11-25
21. Reisig, W.: Petri Nets: An Introduction. Springer-Verlag, Berlin (1985)
22. Riempp, G.: Wide Area Workflow Management: Creating Partnership for the 21st Century. Springer-Verlag (1998)
23. Sheer, A.: ARIS-Business Process Modelling. Springer-Verlag, Berlin (1998)
24. Searle, J.R.: Speech Acts – An Essay in the Philosophy of Language. Cambridge University Press (1969)
25. Winograd, T., Flores, F.: Understanding Computers and Cognition: A New Foundation for Design. Ablex, Norwood, N.J. (1986)
26. Yeamans, L.: Enterprise Application Integration. NSAG Inc. URL: http://www.messageq.com/EAI_survival.html, 1999-10-04
27. Wåhlander, C., Nilsson, M., Skoog, A.: Introduction to Business Model Language & SmartSync Model Manager, Copyright Viewlocity (1998)
CHAOS: An Active Security Mediation System
David Liu (1), Kincho Law (2), and Gio Wiederhold (3)
(1) Electrical Engineering Department, Stanford University, Stanford, CA, [email protected]
(2) Civil and Environmental Engineering Department, Stanford University, Stanford, CA, [email protected]
(3) Computer Science Department, Stanford University, Stanford, CA, [email protected]
Abstract. With the emergence of the Internet, collaborative computing has become more feasible than ever. Organizations can share valuable information among each other. However, certain users should only access certain portions of source data. The CHAOS (Configurable Heterogeneous Active Object System) project addresses security issues that arise when information is shared among collaborating enterprises. It provides a framework for integrating security policy specification with source data maintenance. In CHAOS, security policies are incorporated into the data objects as active nodes to form active objects. When active objects are queried, their active nodes are dynamically loaded by the active security mediator and executed. The active nodes, based on the security policy incorporated, can locate and operate on all the elements within the active object, modifying the content as well as the structure of the object. A set of API’s is provided to construct more complex security policies, which can be tailored for different enterprise settings. This model moves the responsibility of security to the source data provider, rather than through a central authority. The design provides enterprises with a flexible mechanism to protect sensitive information in a collaborative computing environment.
1 Introduction 1.1 Security in Collaborative Systems The emergence of the Internet has greatly extended the scope of collaborative computing. Businesses share information to shorten their product development time; hospitals share information to provide better care to their patients [Rin+97]. However, collaborations pose extensive security problems. In fact, protecting proprietary data from unauthorized access is recognized as one of the most significant barriers to collaborative computing [HSRM96]. Software engineers have attempted to apply traditional security approaches to their specific collaborative computing paradigms. Encryption, firewalls, and passwords are used for secure transmission and storage of information [Den83]. User access rights are used in file systems to protect directories and files from unauthorized accesses [GS91]. These systems rely on domain access control for the security of their data and
focus on protecting systems from adversaries. However, they do not properly address the security issues in collaborative computing environments, where information needs to be selectively shared among different domains [JST95]. The following characteristics can be observed in a collaborative computing environment:
1. There is no clear enemy. Users access parts of the information sources. Unless information sources can be broken into small autonomous units, firewalls and passwords cannot provide the functionality needed. If the data sources are finely partitioned, their management becomes complex and difficult.
2. Typically, the information stored in an organization is not organized according to the needs of external accesses. It is in rare cases that security requirements can be properly aligned with organizational needs. For example, medical records are created and organized according to the patients in a hospital rather than according to the doctors and staff on whom security clearance needs to be placed.
3. It is impossible to rigorously classify the data by potential recipients. For instance, a medical record on a cardiac patient can include notations that would reveal a diagnosis of HIV, so that this record should be withheld from cardiology researchers. A product specification may include the cost of the components provided by suppliers, a competitive advantage that should be withheld from customers.
Ideally, collaborating enterprises would integrate their multiple existing relevant data sources and access them for specific collaborations as a single system. Such seamless interoperation is inhibited today by the different protection requirements of the participating systems. Different systems, autonomously developed and managed, implement different access control policies and will impose different constraints to be satisfied before allowing participants access to data. 1.2 Security Mediator Previous proposals address the problem within a federated database context, where a global schema, possibly under control of a central authority, is defined on the local data sources [Bel95, JD94, ON95, VS97]. Moreover, access control is generally assumed to be regulated by discretionary policies, where access decisions are taken with respect to authorizations stated by users. Mandatory security policies in distributed systems have been investigated, and some interoperation aspects have been addressed [GQ96, TR92]. Unfortunately, the protection capabilities of current systems provide limited and little, if any, support for security of dynamic information. First of all, current DBMSs work under the assumption that data are classified upon insertion, by assigning them the security level of the inserting subject. They provide no support for the re-classification of existing databases, when a different classification lattice and different classification criteria need to be applied [CFM+95, Lun+93]. Most approaches to managing security are static, where data structures, such as columns and rows in relational databases, are pre-classified to have certain types of access privileges. These systems presuppose a central model, in the hands of a database administrator [JL90].
To cope with security issues in dynamic collaborative computing environments, security mediators are introduced. Mediators [WG97] are intelligent middleware that sit between information system clients and sources. They perform functions such as integrating domain-specific data from multiple sources, reducing data to an appropriate level and restructuring the results into object-oriented structures. The mediators that are applied to security management are called security mediators [WBSQ96b]. An example of a security mediation system is the TIHI project [WBSQ96a], in which a rule system is used to automate the process of controlling access and release of information. Applicable rules are combined to form security policies, which are enforced by the mediator for every user. Results are released only if their contents pass all tests. This model (Fig. 1) formalizes the role of a mediation layer, which has the responsibility and the authority to assure that no inappropriate information leaves an enterprise domain.
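The rule-based filtering described above can be pictured as a simple pipeline in which every result row must pass all rules registered for the requesting user before it is released. The sketch below is our own illustration, not the TIHI implementation; the example rules and field names echo the medical and product scenarios mentioned earlier and are hypothetical.

```python
# A sketch of static, rule-based result filtering in a security mediator:
# a row is released only if its contents pass all applicable rules.
from typing import Callable, Dict, List

Row = Dict[str, object]
Rule = Callable[[Row, str], bool]        # (row, user_role) -> may be released?

def withhold_hiv_from_researchers(row: Row, role: str) -> bool:
    return not (role == "researcher" and "HIV" in str(row.get("notes", "")))

def withhold_cost_from_customers(row: Row, role: str) -> bool:
    return not (role == "customer" and "component_cost" in row)

def filter_result(rows: List[Row], role: str, rules: List[Rule]) -> List[Row]:
    """Release only the rows whose contents pass all tests."""
    return [row for row in rows if all(rule(row, role) for rule in rules)]
```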
Fig. 1: Static Security Mediation (client interface application; mediation engine with query filtering, result filtering and security policies; heterogeneous information source foundation)
Security rules act like meta-data in a database. They are predefined by the security expert for the system and are applied to data items that are returned from the queries. Since all rules are statically specified and checked, we call this type of system static security mediation system. In such systems, there is a security officer whose responsibility is to implement and control the enterprise policies set for the security mediator. Databases and files within the domain provide services and meta-data to help the activities of the security mediator. While static security mediation addresses a broad range of security issues in collaborative computing, it suffers certain shortcomings that motivate the proposed
approach to move security policies from the mediation layer to the foundation layer and to give more flexibility in specifying security policies.

First of all, in many scenarios it is natural to have the information source set and manage its own security policy. A heterogeneous information system may organize its source data as information islands, each island being maintained separately from the others. This organization is becoming more pervasive for Internet services. We observe that source data maintenance and security policy specification are tightly related in these situations. When source data get updated, especially when their data structure changes, the related security policies may need to be modified accordingly.

Secondly, it is difficult to design a rule-based security mediator that fits a broad range of heterogeneous information systems. Enterprise security policies are specified in terms of the primitive rules predefined for the static mediation system, making it difficult to develop a comprehensive set of rules that can be effectively combined to satisfy a very broad range of security needs. Generally, rules are best applied to relational databases, since they are defined on table schemas. In the case of unstructured data that lack a predefined schema, rules are difficult to apply. Furthermore, acting as meta-data in a database, rules act on tables. They are well suited to filtering out rows of data entries, but lack the capability to prune the structure of the result entries to allow partial access to the data. A traditional view-based access control system [GW76] could be used to remedy this deficiency: separate views can be constructed for each partial structure, and appropriate access rights can be assigned to each view. However, this approach is similar to that of domain access control. Managing views and maintaining their secret labels becomes very complex as the system grows [WBSQ96b].

1.3 Active Security Mediation

We propose a solution to these problems in CHAOS. We define a special type of object, the active object, which incorporates security policies into data objects as active nodes. Rather than treating rules as meta-data acting on tables, we enforce security by invoking functions contained in active nodes that act on data objects. The design of CHAOS is shown schematically in Fig. 2. In CHAOS, each information source is treated as an information island that has its own access control policies. An incoming client query request is first checked by a Query Filtering module, where unauthorized requests to the heterogeneous system are denied. The Query Planner and Query Dispatcher modules are in place to decompose a client query into source queries that individual heterogeneous sources can answer. The methods of query transformation belong to the separate topic of schema integration and hence are not discussed in detail here. Upon receiving query requests from the mediation layer, the foundation layer sources fetch the query results, wrap them as active objects, and pass the active objects on to the mediation layer. The Result Filtering module interprets the encapsulated active nodes and translates active objects into regular data objects before passing them on to the client.

In the TIHI model [WBSQ96a], it is assumed that the people controlling the sources do not care much about security. That is true for many medical doctors, who willingly share data and do not realize how far the data might spread and embarrass the patients.
When private information gets leaked, it is the institution, as the holder
of the data, who assumes the responsibility. In the CHAOS model the assumption is that the owners of the data care about the security of the data, often for competitive business reasons, sometimes perhaps even being competitive within an institution. This model fits those institutions that delegate much authority to enterprise units.
[Figure content not reproduced. Labeled components: Client Interface, Application, Admin; Security Mediator with Reference Schema, Mapping Rules, Security Policies, Query Filtering, Result Filtering, Query Planner, and Query Dispatcher / Result Integrator (Mediation layer); Active Object Wrappers with Security Policies A, B, C and Access Control over DB A, DB B, DB C (Foundation layer).]
Fig. 2: CHAOS Active Security Mediation
By incorporating active nodes into data objects, we provide a tight integration between security policy specification and source data maintenance. Each data object has a clear view of all policies that are applicable to it. Furthermore, security policies can be applied to individual data objects, providing fine-grained control. We use Java as the active node specification language, giving greater expressive power to the security system. For ease of system configuration and maintenance, we provide an extensible set of APIs that allow more complex policies to be composed. At the same time, unlike a static security mediation system, where policies are based solely on primitive rules, CHAOS does not place any restriction on whether active nodes use these APIs to manipulate their objects.
2 CHAOS System Design

2.1 Active Object

Objects are used as the basic data model to describe source data in CHAOS. Most clients are best served by information in object-oriented form that may integrate multiple heterogeneous sources [PAGM96]. Specifically, in CHAOS, data are represented in XML¹. This choice is motivated by XML's extensibility and its built-in support for structure and validation. However, the concept and our system design can easily be extended to other data models. In a subsequent section we show a sample application of the CHAOS system architecture that uses a relational database as the source data repository.

XML is a meta-markup language that consists of a set of rules for creating semantic tags used to describe data. An XML element is made up of a start tag, an end tag, and content in between. The start and end tags describe the content within the tags, which is considered the value of the element. In addition to tags and values, attributes are provided to annotate elements. In essence, XML provides the mechanism to describe a hierarchy of elements that forms the object.

An active object is a special type of XML object. In active objects, two types of elements are defined: data elements and active elements. A data element, like any regular XML element, describes the content of an object; an active element, on the other hand, does not describe the content of an object but rather contains the name of an active node that operates on the object and generates the content. We identify active elements by setting their active-node attribute to true.

2.2 Active Element

Each active element contains one active node, a Java class that will be interpreted by the mediator runtime environment. Java² is chosen as the function description language because of Java's support for portability, its flexibility as a high-level language, and its support for dynamic linking/loading, multi-threading, and standard libraries. All active nodes are derived classes of ActiveNode (see Appendix A.1), and they overload the execute function to provide specific functionality. The execute function takes three parameters: the current active element handle, the root element handle, and the client environment information. The mediator runtime environment fills in these three parameters when it loads the active nodes at run time.

Java Project X³, a Java-based XML service library package, is preloaded into the CHAOS security mediator runtime environment. The package provides core XML capabilities, including a fast XML parser with optional validation and an in-memory object model tree that supports the W3C DOM Level 1 recommendation⁴.
¹ For details about XML, go to http://www.w3.org/XML/.
² For details about Java, go to http://www.javasoft.com.
³ For details about Java Project X, go to http://developer.java.sun.com/developer/products/xml.
⁴ For details about the Document Object Model, go to http://www.w3.org/DOM.
Using the APIs provided by the package, we can parse XML documents, query elements in an XML object, and modify the content and structure of the object.

In order for active nodes to interact with data elements in an active object, a mechanism is needed to locate all elements. We employ the concept of label path [GW97] from the LORE project [MAG+97] and define the tag path:

Definition: A tag path of an element e0 is a sequence of one or more dot-separated tags, t1(s1).t2(s2)…tn(sn), such that we can traverse a path of n elements e1, e2, …, en from e0, where node ei is the child of ei-1 and is the si-th child that has the tag ti. In cases where si is not specified, its default value is 1.

With the tag path definition, active nodes can uniquely locate an element e by specifying the root element of the object and a tag path that traverses from the root element to e; for instance, a path such as CPU.Cost would locate the Cost element of the first CPU child of the root. All elements within an active object can be reached and manipulated by the active nodes that are contained in the object. In cases where multiple active objects are to be manipulated by a common active node, the active objects can be combined to form a larger object so that a common root element can be provided to the execute function.

The CHAOS system provides a set of ActiveNode APIs. Elements within an active object can be queried, the structure of an active object can be altered, and statistical information about an active object or an element can be generated. Based on these APIs, more complex functionality can be constructed. As opposed to a static mediation system, where policies are constructed from primitive rules, the ActiveNode APIs place no limitation on how policies can be constructed. The APIs are provided merely for convenience, not as a restriction. With multiple active nodes in an active object, the order in which they are executed may affect the final mediated result. CHAOS adopts a depth-first ordering approach in loading and interpreting the active elements within an active object.

2.3 Security Mediator

The security mediator is the component in CHAOS where the client source query is parsed and certified, active objects are queried and interpreted, and mediated results are returned. As shown in Fig. 3, a security mediator is composed of two main modules: Query Filtering and Result Filtering. In addition, an exception-handling module is provided in case abnormal system behavior occurs.

Query Filtering Module
A Query Filtering module deals with parsing and certifying incoming query requests to the heterogeneous source. The Client Environment Handling component customizes the active security runtime environment depending on the specific client. Client environment information is put into a system-defined ClientEnv object, which is passed on to the Query Certification and Active Node Invocation components. Similar to the TIHI system, the mediator processes the incoming query and checks its validity. The Query Certification component can look up a static table of rules. Based on the incoming query request and client environment information, it
restructures and forwards the query to the underlying heterogeneous information source.
[Figure content not reproduced. Labeled components: Client; Source Query and Mediated Result; Client Environment Handling, ClientEnv, Query Certification (Query Filtering module); Active Node Invocation, Active Node API, Exception Handling (Result Filtering module); Certified Query and Active Objects; Heterogeneous Information Source (Active Object Repository).]
Fig. 3: Active Security Mediator Architecture
Result Filtering Module
The core of the Result Filtering module is the Active Node Invocation component. For all incoming active objects, the component identifies the active nodes in an object and dynamically loads the appropriate encapsulated active nodes. Active nodes are invoked by calling their execute function with parameters assembled by the mediator. Active nodes operate on the active objects that contain them. The resulting objects are forwarded to the client.

An Active Node API library is provided to facilitate security policy construction. As indicated in the previous section, all active nodes are derived from the ActiveNode class. Useful functions and class definitions are put in the ActiveNode class. They can be accessed as APIs through ActiveNode's method interface. The library is preloaded into the mediator for dynamic linking and invocation by active nodes.

Exception Handling
It is critical for the system to have a comprehensive exception handling policy. Our current implementation prohibits any results from getting through the mediator in the case of an exception. In addition, the conditions are logged for future maintenance.
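To give a flavor of how the ActiveNode API library might be used to construct a policy richer than a single primitive rule, the following sketch combines a clique check with a cost threshold. The class name, the clique value, the threshold, and the logging behavior are illustrative assumptions, not part of the CHAOS implementation; only the API calls themselves come from Appendix A.1.

public class costThreshold extends ActiveNode {
    public String execute(Element current, Element root, ClientEnv env) {
        /* Compute the total component cost before any pruning. */
        int totalCost = sumAllNodes(root, "Cost");

        /* Hide per-component costs from anyone outside the internal clique. */
        if (!env.Clique().equals("internal")) {
            removeAllNodes(root, "Cost");
        }

        /* Illustrative audit step: log releases of high-value objects. */
        if (totalCost > 5000) {
            initLog(true);
            log("high-value object released to clique " + env.Clique());
        }
        return "cost policy applied for " + env.Clique();
    }
}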
3 System Implementation

In this section, we describe our implementation of a sample business inventory system using the CHAOS system architecture. We study the quality of our design by comparing it with other alternatives that could be chosen to achieve the same objectives.

In order to compete better in the marketplace, businesses need to streamline their procurement and distribution processes, which requires integration of all relevant data. Our example considers a PC company, for which it is important to deliver product design and pricing information to its distributor in a timely and convenient manner. At the same time, it is critical to protect its cost structure from competitors. The source data are originally stored in ORACLE, a relational database, on top of which a CHAOS system is built to provide integrated product information. The schemas are shown in Fig. 4:

Table PC: CPU-Id, Memory-Id, HD-Id, Video-Id
Table CPU: Id, Brand, Speed, Cost
Table Memory: Id, Brand, Capacity, Cost
Table Video: Id, Brand, Capacity, Function, Cost
Table HD: Id, Brand, Capacity, Cost
Fig. 4: Relational Schemas for an Inventory Database
In a heterogeneous information system, no particular data model can be assumed for the original information source. Using XML, however, we can pack heterogeneous data into uniform logical objects. Integrating heterogeneous information is a research issue that is addressed in [PAGM96, GW97]. Therefore, we treat the heterogeneous information source in CHAOS as an active object repository. All source data are converted into active objects before they are exchanged between the foundation layer and the mediation layer of the information system. Necessary mediators are added to perform the data conversion. For our example, relational data stored in the ORACLE database are converted into active objects. Together with regular data elements, active elements are assembled to form active objects. Fig. 5 shows a sample active object that is assembled by a source mediator that queries the relational information source. There are two active nodes contained in the active object: price and security1, which will be dynamically loaded and interpreted when the object is passed through the security mediator. The price information will be generated by the active node price, and security will be enforced by the active node security1. Both price and security1 are written in Java (see Appendix A.2) and are based on the ActiveNode APIs provided by the system.
Fig. 5: Sample Active Object
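The content of Fig. 5 is not reproduced here. As a purely hypothetical sketch, based on the relational schemas in Fig. 4 and the description above, an assembled active object for one PC might look roughly as follows (element names, nesting, and values are assumptions, not the actual figure):

<PC>
  <CPU> <Brand>...</Brand> <Speed>...</Speed> <Cost>300</Cost> </CPU>
  <Memory> <Brand>...</Brand> <Capacity>...</Capacity> <Cost>120</Cost> </Memory>
  <HD> <Brand>...</Brand> <Capacity>...</Capacity> <Cost>150</Cost> </HD>
  <Video> <Brand>...</Brand> <Capacity>...</Capacity> <Function>...</Function> <Cost>90</Cost> </Video>
  <Price active-node="true">price</Price>
  <Security active-node="true">security1</Security>
</PC>

Here, Price and Security would be the active elements: their active-node attribute is set to true and their content names the active nodes to be loaded; after the mediator interprets them, the generated content (e.g., the computed price) takes their place in the object delivered to the client.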
The active node price sums up the cost of all elements and then multiplies the cost by a factor (here specified as 1.2). Effectively, active nodes provide a simple mechanism to specify derived data, thereby maintaining data dependencies among the various components of an object.

To achieve the same objective with traditional database system designs, a few other alternatives can be considered, each with certain drawbacks. A database update program can be run on top of a relational database to maintain the data dependency. However, it is extremely difficult, if not impossible, to determine the optimal update frequency. As an alternative, every client application can embed a procedure that updates the database on queries. Obviously, this approach is a software engineering nightmare. Active database systems [Day88] could also be considered to address this issue. These systems integrate a production rule facility into conventional database systems. Production rules can be considered as the bonding between the data and the functions in a database system. However, data are required to be migrated into a single active database, a very difficult process for heterogeneous data sources. With active objects, such migration is not needed.

The responsibility of the active node security1 for the active object shown in Fig. 5 is to filter the information and reshape the structure of the active object based on the client that makes the query. For different clients, certain information needs to be withheld. Only internal users are granted access to the cost structure of any product. As specified in the active node security1, if the client is not an internal user, the Cost elements of all components are trimmed, thereby withholding the confidential information.
Compared to a view-based access control system, an alternative for addressing the partial data access issue, CHAOS is cleaner and more flexible. In a view-based access control system, different views need to be specified, each with a different schema. For two different user groups, that may not seem to be a great challenge. However, maintenance of views and their secret labels can quickly become a significant management problem as the complexity of the security needs grows. For example, if we want to change the security policy to allow each procurement department access to the cost information of its own components but not the cost information of the other components, four more views need to be specified and maintained. In CHAOS, by contrast, no separate views need to be constructed. The information source can specify the policy for partial access to its content by adding a few more clauses to the active node security1 (an illustrative sketch of such an extension is given below, after Fig. 7).

The results for different client queries are shown in Figure 6 and Figure 7. The internal result is generated for a query submitted by a client who belongs to the internal clique. All component information and pricing information are returned in the result object. In addition, the active node security1 adds a time-stamp element to the object. The external result is generated for an external client. Compared to the internal result, the external result does not contain any component cost information, which is pruned by the active node.
Fig. 6: internal Result
Fig. 7: external Result
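As an illustration of the "few more clauses" mentioned above, a variant of security1 that grants a hypothetical CPU procurement clique access to CPU costs only might look like the following sketch. The clique names and tag paths are assumptions tied to the illustrative object structure sketched after Fig. 5; only the API calls come from Appendix A.1 and A.2.

public class security1 extends ActiveNode {
    public String execute(Element current, Element root, ClientEnv env) {
        String clique = env.Clique();
        if (clique.equals("internal")) {
            /* Internal users keep full access to all Cost elements. */
        } else if (clique.equals("cpu-procurement")) {
            /* Hypothetical clique: keep the CPU cost, prune the others. */
            removeNode(root, "Memory.Cost");
            removeNode(root, "HD.Cost");
            removeNode(root, "Video.Cost");
        } else {
            /* Everyone else sees no cost information at all. */
            removeAllNodes(root, "Cost");
        }
        /* Time-stamp the object, as in the original security1. */
        createTextElement(root, "", "TimeStamp", new Date().toString());
        return "checked for " + clique;
    }
}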
4 Conclusion

The CHAOS system provides a framework for integrating security policy maintenance with data source maintenance. Active nodes are incorporated into the data objects whose security policy they control. They can locate and operate on all elements within the active object, modifying the data content and the structure of
the object. This approach moves the responsibility for security to the source provider, rather than to a central authority.

We would like to emphasize that there are many fundamental differences between CHAOS and the view-based access control approach:
1. The view-based approach is mostly adopted in the presence of structured data sources, in particular relational data sources. In the case of unstructured data that lack a predefined schema, the view-based approach is not applicable. CHAOS, on the other hand, does not depend on a predefined schema. It is applicable to all types of data sources.
2. In the view-based approach, policies are specified on tables, defining actions on columns of data. In CHAOS, policies are specified at the level of individual data objects, providing a finer grain of control.
3. The view-based approach predefines the structure of a view. The structure of a view is not modifiable once it is defined. CHAOS allows active nodes to dynamically modify the structure of an active object.
4. By incorporating active nodes into data objects, CHAOS provides a tight integration between security policy specification and source data maintenance. Each data object has a clear view of all policies that are applicable to it.

Unlike rule-based security systems, policies in CHAOS are specified in a general programming language. The system does not need to rely on an initial set of primitive rules to be functional. We provide a set of APIs that can be used to construct more complex and powerful policies. These APIs are provided merely for convenience and can be expanded. This approach offers much greater flexibility.

Like most security measures, active security mediation does not offer a 100% guarantee. It is constrained by the quality of the system design and the implementation of the policies. However, it provides a clear, simple, and powerful mechanism to carry out enterprise policies effectively.

Acknowledgements

This work is partly supported by the Center for Integrated Facility Engineering at Stanford University, by the National Science Foundation under grant ECS-94-22688, and by DARPA/Rome Laboratory under contract F30602-96-C-0337. The authors would also like to acknowledge a “Technology for Education 2000” equipment grant from Intel Corporation in support of the research.
References

[Bel95] Steven M. Bellovin. Security and Software Engineering. In B. Krishnamurthy, editor: Practical Reusable UNIX Software. John Wiley & Sons, 1995.
[CFM+95] S. Castano, M.G. Fugini, G. Martella, and P. Samarati. Database Security. Addison-Wesley, 1995.
[Day88] U. Dayal. Active Database Management Systems. In Proceedings of the Third International Conference on Data and Knowledge Bases, 1988.
[Den83] Dorothy E. R. Denning. Cryptography and Data Security. Addison-Wesley, Reading, MA, 1983.
[GS91] Simson Garfinkel and Gene Spafford. Practical Unix Security. O'Reilly and Associates, Inc., 1991.
[GQ96] L. Gong and X. Qian. Computational Issues in Secure Interoperation. IEEE Transactions on Software Engineering, IEEE, January 1996.
[GW76] P. P. Griffiths and B. W. Wade. An Authorization Mechanism for a Relational Database System. ACM Transactions on Database Systems, 1(3):243-255, Sept. 1976.
[GW97] Roy Goldman and Jennifer Widom. DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases. VLDB Conference, 1997.
[HSRM96] Martin Hardwick, David L. Spooner, Tom Rando, and K.C. Morris. Sharing Manufacturing Information in Virtual Enterprises. Comm. ACM, 39(2):46-54, Feb. 1996.
[JD94] D. Jonscher and K.R. Dittrich. An Approach for Building Secure Database Federations. In Proc. of the 20th VLDB Conference, 1994.
[JL90] Sushil Jajodia and Carl E. Landwehr. Database Security IV: Status and Prospects. North-Holland, 1990.
[JST95] D. Randolph Johnson, Fay F. Sayjdari, and John P. Van Tassell. Missi Security Policy: A Formal Approach. Technical Report R2SPO-TR001-95, National Security Agency Central Service, July 1995.
[Lun+93] A. Luniewski et al. Information Organization Using Rufus. ACM SIGMOD, Washington DC, May 1993, pp. 560-561.
[MAG+97] J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom. Lore: A Database Management System for Semistructured Data. SIGMOD Record, 26(3):54-66, Sept. 1997.
[ON95] YongChul Oh and Shamkant Navathe. SEER: Security Enhanced Entity-Relationship Model for Secure Relational Databases. In Papazoglou (ed.): OOER'95, Springer LNCS 1021, 1995, pp. 170-180.
[PAGM96] Y. Papakonstantinou, S. Abiteboul, and H. Garcia-Molina. Object Fusion in Mediator Systems. VLDB Conference, 1996.
[Rin+97] David M. Rind et al. Maintaining the Confidentiality of Medical Records Shared over the Internet and the World Wide Web. Annals of Internal Medicine, Vol. 15, No. 127, July 1997, pp. 138-141.
[TR92] B. Thuraisingham and H.H. Rubinovitz. Multilevel Security Issues in Distributed DBMS III. Computers & Security, 11:661-674, 1992.
[WBSQ96a] Gio Wiederhold, Michel Bilello, Vatsala Sarathy, and XiaoLei Qian. Protecting Collaboration. In Proceedings of the NISSC'96 National Information Systems Security Conference, pages 561-569, Oct. 1996.
[WBSQ96b] Gio Wiederhold, Michel Bilello, Vatsala Sarathy, and XiaoLei Qian. A Security Mediator for Healthcare Information. In Proceedings of the 1996 AMIA Conference, pages 120-124, Oct. 1996.
[WG97] Gio Wiederhold and Michael Genesereth. The Conceptual Basis for Mediation Services. IEEE Expert, Intelligent Systems and their Applications, 12(5), Oct. 1997.
[VS97] S. De Capitani di Vimercati and P. Samarati. Authorization Specification and Enforcement in Federated Database Systems. Journal of Computer Security, 5(2):155-188, 1997.
Appendix

A.1 Sample ActiveNode API

public class ActiveNode {
    /*
     * Entry point, needs to be overloaded
     */
    public String execute(Element current, Element root, ClientEnv env);

    /*
     * Query elements within an active object
     */
    protected Node getNode(Element root, String path);
    protected String getString(Element root, String path);
    protected int getInt(Element root, String path);

    /*
     * Manipulate structure of an active object
     */
    protected Node removeNode(Element root, String path);
    protected void removeAllNodes(Element root, String tag);
    protected void appendNode(Element root, String path, Node child);

    /*
     * Miscellaneous statistical functions
     */
    protected int sumAllNodes(Element root, String tag);
    protected int getNumChildren(Element root, String tag);

    /*
     * Utility functions
     */
    protected void initLog(boolean onoff);
    protected void log(String msg);
}
A.2 Sample Active Nodes

Active Node price

public class price extends ActiveNode {
    public String execute(Element current, Element root, ClientEnv env) {
        int cost = sumAllNodes(root, "Cost");
        return String.valueOf(1.2 * cost);
    }
}
Active Node security1

public class security1 extends ActiveNode {
    public String execute(Element current, Element root, ClientEnv env) {
        /* Check the clearance of the client */
        if (!env.Clique().equals("internal")) {
            removeAllNodes(root, "Cost");
        }
        /* Add a time stamp to the object. */
        createTextElement(root, "", "TimeStamp", new Date().toString());
        return "checked for " + env.Clique();
    }
}
Modeling and Composing Service-Based and Reference Process-Based Multi-enterprise Processes

Hans Schuster, Dimitrios Georgakopoulos, Andrzej Cichocki, and Donald Baker

MCC, 3500 West Balcones Center Drive, Austin, Texas 78759
{schuster, dimitris, andrzej, dbaker}@mcc.com
Abstract. Multi-enterprise processes (MEPs) are workflows consisting of a set of activities that are implemented by different enterprises. Tightly coupled Virtual Enterprises (VEs) typically agree on abstract MEPs (reference MEPs), to which each enterprise contributes single-enterprise processes (SEPs) that implement and refine the activities in the reference MEP. On the other end of the spectrum, loosely coupled VEs use service-based MEPs that fuse together heterogeneous services implemented and provided by different enterprises. Existing process models usually couple activities with their implementation. Therefore, they cannot effectively support such MEPs. In this paper, we introduce a Polymorphic Process Model (PPM) that supports both reference process- and service-based MEPs. To accomplish this, PPM decouples activity interface from activity implementation, and provides process polymorphism to support their mapping. In particular, PPM determines activity types from the activity interfaces, permits activity interface subtyping, and provides for the mapping of MEP activity types to concrete implementations via interface matching. We illustrate that these key PPM capabilities permit the late binding and use of multiple activity implementations within a MEP without modifying the MEP at run time or enumerating the alternative implementation at specification time.
1
Introduction
A Virtual Enterprise (VE) appears like a traditional enterprise to its customers but provides services and products that rely on the core business processes and the resources of multiple constituent enterprises. The enterprises that comprise a VE may be participating in a long-term strategic alliance or collaborate only for the duration of one electronic commerce transaction. Just like traditional enterprises, VEs realize business objectives by defining and implementing (multi-enterprise) business processes. These are abstract process descriptions, consisting of a set of activities that must be performed by the members of the VE. Multi-enterprise business processes are implemented by multi-enterprise workflow processes we refer to as multi-enterprise processes (MEPs). Unlike traditional workflow processes where all activities are implemented by the same enterprise, each enterprise in a VE implements only a subset of the MEP activities. Furthermore, the same MEP activity may be implemented by multiple enterprises, and the provider of the implementation of this MEP activity may vary in different instances of the same MEP. Finally, each participating enterprise may have multiple implementations of the same MEP activity. For example, a telecommunications service provider implements
an activity informing the customer that the service has been established. The provider may have multiple implementation alternatives for this activity, including notifying the customer by phone, mailing a letter, or sending a fax or email.

Current workflow models [10, 13, 16] couple activities with their implementation. This is similar to a procedure call in a traditional programming language. Therefore, when designing a process consisting of activities that have multiple implementations, the process designer has to choose one of the alternative implementations. However, this precludes the use of any other implementation of the same activity that may be more appropriate in another process instance or in another part of the process that repeats this activity. For example, the method of notifying a client that the requested service has been performed may depend on the client's location, the time of day, cost, etc. To permit alternative implementations, the designers of traditional workflow processes often expand each activity into a subprocess that captures all possible implementation alternatives as separate activities and allows the choice of implementation via decision activities, normal control, and data flow. This solution introduces additional activities in the original process, and the structure of the resulting process will differ significantly from the business process it implements. Therefore, this solution complicates the maintenance of the business process as it evolves and additional activity implementations become available.

These problems are alleviated in VEs that use reference processes. Such processes typically capture standard business activities within a certain industry, such as telecommunications [15]. A reference process is public and its purpose is to specify how multiple enterprises can work together. Therefore, reference processes provide activity decomposition and coordination “guidelines” that improve the ability of enterprises to form effective VEs. At the same time, each participating enterprise maintains considerable flexibility in determining how to implement activities in a reference process. The implementation of such a reference MEP must be able to hook up the various heterogeneous implementations of its activities as provided by the participating enterprises.

In situations where the enterprises that participate in a VE maintain their autonomy to a level at which reference MEPs cannot be used, the participating enterprises usually provide services that are not targeted to a specific MEP, i.e., they are implemented and maintained in isolation. In this case, service-based MEPs must integrate these heterogeneous services, e.g., as proposed in [4].

In this paper we introduce a Polymorphic Process Model (PPM) that generalizes the process model presented in [4]. PPM supports both service- and reference process-based MEPs. To accomplish this, PPM decouples activity interface from activity implementation. This permits PPM activity types to be determined by the activity interfaces. Activity interfaces are modeled as state machines that include application-specific operations and states, and have input/output parameters. Application-specific operations provide an abstraction of an activity's behavior, while application-specific states provide an abstraction of the activity state resulting from the execution of the activity operations or internal processing.
In addition, activity interfaces allow (abstract) activity types to be mapped to concrete activity implementations by matching their interfaces. To support such mappings PPM introduces process polymorphism. This includes a mechanism for activity interface subtyping and a mechanism for late binding that enables the use of multiple implementations within a process without modifying this process at run time or enumerating the alternative implementation at
specification time. These permit supporting MEPs in both reference process- and service-based VEs. The rest of this paper is organized as follows: In the following section, we present examples of MEPs using a reference process and services and discuss key requirements for their modeling, implementation, and enactment. Section 3 introduces activity types and interfaces. The mapping of activity interfaces that provides the basis for developing interchangeable implementations of abstract activities is discussed in Section 4. Activity polymorphism is discussed in Section 5. Section 6 gives an overview of related work. Conclusions are presented in Section 7.
2
Multi-enterprise Processes
To illustrate key requirements in supporting multi-enterprise processes (MEPs), we describe two alternative MEPs for universal telecommunications service provisioning: reference process-based MEPs and service-based MEPs. We illustrate that these MEPs can satisfy the requirements of virtual telecommunications enterprises using contrasting business models. In particular, we first introduce a reference process-based MEP. Such MEPs are appropriate for tightly coupled VEs where participating enterprises internally employ business models and corresponding core processes (single-enterprise processes, or SEPs) that are designed to accommodate each other. This is accomplished by agreeing on abstract VE processes, or reference MEPs, to which each enterprise contributes SEPs that implement and refine its (abstract) activities. Reference process-based MEPs are appropriate for relatively long-lived VEs with cooperating partners that are willing to adapt their SEPs to implement the parts of the MEP reference processes they are supposed to perform. An example of a reference process-based MEP is discussed in Section 2.1.

The contrasting solution to reference process-based MEPs is service-based MEPs. These are appropriate for loosely coupled VEs, i.e., VEs consisting of enterprises that want to hide their internal SEPs from their partners in the VE and cannot (or choose not to) introduce changes to their SEPs. Therefore, service-based MEPs are appropriate for relatively short-lived VEs, such as those that last only for the duration of a single electronic commerce transaction, and VEs consisting of competitors that occasionally join forces. An example of a service-based MEP is described in Section 2.2.

In this paper we do not discuss VEs using MEPs that combine both reference processes and services. Such MEPs must address the combination of the problems described in this paper.

2.1
Reference Process-Based MEP
A reference MEP is a process that consists of abstract subprocesses, i.e., subprocesses that lack implementation. The abstract subprocesses and the MEP process itself are implemented by SEPs provided by the enterprises in the VE. As an example of a reference MEP consider a universal telecommunications service provisioning process that allows clients to request local, long distance, wireless, and internet service from a universal service provider. This reference MEP is depicted
in Figure 1. It specifies that the process starts when a customer of the universal service provider requests a universal telecommunications service. Subprocess Exchange Info involves an operator collecting information from the customer, or a customer directly providing the information via a web browser. When sufficient customer data are collected, subprocess Order Service is performed to (1) verify that the information provided by the customer is accurate, and (2) create a corresponding universal service order record. When Order Service is completed, the top process starts Provide Local Exchange Service, Provide Long Distance Service, Provide Wireless Service, and Provide Internet Service. These abstract subprocesses are implemented by SEPs provided by different enterprises, i.e., the local, long distance, wireless, internet, and universal service providers. When all selected subprocesses are completed, subprocess Bill is performed to create a single bill for all telecommunications services in the universal service. Finally, Care for Customer involves a human operator who verifies that the provided service meets the customer's needs.
Fig. 1. Reference MEP for universal telecommunications service provisioning
The universal service provider in Figure 1 may be a distinct enterprise from the local, long distance, wireless, and internet providers. However, in our example VE the universal service provider may be any of the providers of the component services. In particular, the provider of the universal service is determined by the customer, who decides which enterprise to contact for universal service. All enterprises in our example VE have a top-level SEP that implements the reference MEP in Figure 1, i.e., they can provision universal service. In addition, large VEs may have multiple enterprises that implement and provide the same component service.

Reference processes are emerging in both industry [15] and academia [9]. Industrial reference processes capture standard business processes within a certain industry branch; e.g., reference processes for the telecommunications industry are proposed by the TeleManagement Forum [15]. Reference processes developed by academic research, such as the process handbook project [9], also stay at a declarative business process level. Therefore, existing reference process concepts provide no solutions for the implementation and enactment of reference process-based MEPs.
2.2
Service-Based MEP
A service-based MEP is a process that fuses together services provided by different enterprises. Such services may encapsulate SEPs implemented by commercial workflow management systems (WfMSs), CORBA servers, basic programs, legacy information systems, etc. However, from the perspective of the MEP that integrates them, services are black boxes that provide only a set of service operations to control them and determine their state. To integrate a service, a service-based MEP must use service activities. These are service proxies that convert the operations and/or states of each service to the behavior of an activity that is incorporated in the MEP. In VEs with multiple providers of the same service, a service-based MEP must provide abstract activities that can have multiple implementations. The execution of an abstract activity involves selecting one particular implementation during run time, binding it to the abstract activity, and running it. An example of a service-based MEP for universal telecommunications service provisioning is depicted in Figure 2.
Fig. 2. Service-based MEP for universal telecommunications service provisioning
Service-based MEPs must deal with service autonomy and heterogeneity. In reference process-based MEPs, this problem is mitigated by the agreement of the participating enterprises to comply with reference MEPs. However, in service-based MEPs no such agreement exists. Furthermore, service designers typically have no direct knowledge of the service models and implementations used by other enterprises. Therefore, service-based MEP requirements include:
1. service activities that can effectively model and control heterogeneous services, and
2. activity polymorphism, i.e., the ability to associate and bind service activities at various degrees of specialization to the MEP activities/processes they implement.

Modeling heterogeneous services involves enhancing traditional workflow activity types [16, 6] with application-specific operations and states. Controlling a service encapsulated in a service activity requires support for conversations between the activity that needs to control a service activity (the service client) and the service activity itself. Such conversations involve multiple interactions where clients perform multi-
ple invocations and receive intermediate results that may be used in further invocations.

The Local Exchange Service illustrated in Figure 2 is a simplified service that provides operations to request local exchange service installation, determine its status, and cancel it. The service activity that serves as the proxy of the Local Exchange Service is a conversational activity. In particular, suppose that the Order Service in Figure 2 needs to query the Local Exchange Service to determine whether it has failed to complete installing the service within the agreed time frame. In this case, the Order Service may cancel the Local Exchange Service or make necessary adjustments, e.g., notify the universal service customer. To support such a conversation, the service activity for the Provide Local Exchange Service provides the following application-specific operations: request_service(), status(), and cancel(). In addition, the service activity has application-specific activity states that indicate the progress, success, or failure of the service installation.

Conversational activities cannot be captured directly by existing service activities/proxies [13, 10] and existing process models [16, 6], which assume that activities are invoked once, enter their running state, and produce no data until they are completed or terminated (stopped). For example, if we use an existing process model to capture the cancellation of the local exchange service, we must add a Cancel Local Exchange Service activity after the Provide Local Exchange Service activity in the MEP in Figure 2. Therefore, existing process models have the following limitations in capturing service request and cancellation: (1) the cancellation activity can only be invoked after the Local Exchange Service request operation is completed, i.e., cancellation of a service request in progress is not possible, and (2) the Local Exchange Service and corresponding service operations in Figure 2 cannot be modeled as a single traditional activity.

The model we propose in this paper, i.e., PPM, allows implementation of activities in a MEP with services that require conversations as described above. PPM also provides activity polymorphism that permits subtyping and late binding of service activities to MEP activities/processes. PPM activity interfaces, implementations, and polymorphism are discussed in the following sections.
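As a rough sketch of what such a conversational service activity might expose to its client activities, consider the following interface. The Java rendering, method signatures, and state names are illustrative assumptions; PPM itself specifies such interfaces as activity state machines, as described in the next section.

// Hypothetical proxy interface for the Local Exchange Service of Figure 2.
interface LocalExchangeServiceActivity {

    // Application-specific states indicating progress, success, or failure.
    enum InstallationState { IN_PROGRESS, INSTALLED, FAILED, CANCELED }

    // Start the conversation by requesting service installation.
    void request_service();

    // Intermediate result: may be queried repeatedly while the request runs.
    InstallationState status();

    // Allowed while the request is still in progress.
    void cancel();
}

Section 3 formalizes this kind of interface as an activity state machine type with application-specific operations and states.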
3
Activity Types and Interfaces
To allow application modeling, process models, e.g., [16], provide types for activities, resources, and dependencies. Activity types are either basic activity types or process (composite activity) types. These types are used to develop application models that are instantiated during enactment. To support the requirements of service- and reference process-based MEPs discussed in Section 2, we have developed a Polymorphic Process Model (PPM) that decouples activity types from activity implementations. Just like in CORBA™ [11] and Java™ [14], the type of an activity in PPM is determined by the activity interface. Therefore, to avoid confusion by using the term “activity” with different meanings, in the rest of this paper we distinguish the activity type (represented by an activity interface), the activity implementation, and the use of an activity within a process, called an activity variable. This is analogous to object-oriented programming languages, e.g., Java™ [14], which also distinguish object interfaces and abstract classes (corresponding to activity types), concrete classes (corresponding to activity implementations), and typed variables (such as activity variables) that may hold
references to object (activity) instances.

In PPM, a process activity type consists of an activity interface, activity variables, resource variables, and dependency variables. Activity variables represent the subactivities of a process. Resource variables describe the resources needed during process execution. Dependency variables define the data and control flow rules for the subactivities of the process. In contrast, basic activity types are restricted to an activity interface and resource variables.

PPM requires that both activity types and implementations have activity interfaces. In PPM, activity interfaces are used to map activity types to activity implementations. The interface of an activity type captures its implementation requirements at an abstract level, while the interface of an implementation captures the capabilities of the implementation at a concrete level. In PPM, both abstract and concrete activity interfaces are state machines that capture:
• application-specific activity states and operations that cause activity state transitions
• input and output parameters
These are discussed in Sections 3.1 and 3.2.

3.1
Activity State Machine Types
In the initial paragraphs of Section 3, we noted that in PPM activity types are determined by their interfaces, which consist of an activity state machine type (ASMT) and activity input/output. An ASMT determines the possible activity states for instances of the respective activity interface and the corresponding state transitions. Formally, an ASMT is a tuple (S, T), whereby S is the set of states and T is the set of transitions t = (s1, s2), where s1, s2 ∈ S and t ∈ T. State transitions may be caused by invoking explicit activity operations. In the following sections we describe ASMTs in detail.

Activity Operations. Activity operations, such as start and terminate, drive the execution of activities and the corresponding state transitions in a process. Our PPM supports two types of explicit activity operations: generic operations and application-specific operations. Generic operations include the application-independent operations that are provided by the WfMC standard: create(), start(), pause(), resume(), finish(), and terminate(). PPM complements these standard operations with a generic state_change_of_subactivity() operation. This operation is discussed further in Section 4.1. In addition to generic operations, PPM permits application-specific operations that are defined by the process designer as needed to model a specific activity type. They represent control operations that are available on running activities. For example, an activity that implements ordering of local exchange service may offer application-specific operations to check the status of the order, add additional items to the order (e.g., call waiting, caller ID, etc.), and remove items from the order. Application-specific activity operations are supported by only a few WfMSs, e.g., CMI [4] and the MOBILE WfMS [7].
Activity states. To enable interoperability with standard WfMC activities [16], PPM provides a set of generic activity states, i.e., Uninitialized, Ready, Running, Suspended, and Closed with Completed or Terminated substates. These generic activity states and state transitions constitute the generic ASMT depicted in Figure 3.
Fig. 3. Generic Activity State Type
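The following sketch encodes an ASMT as the tuple (S, T) defined above and instantiates it with the generic states and WfMC operations listed in the text. Since the figure itself is not reproduced here, the exact transition set below follows the usual WfMC-style lifecycle and is an assumption; the authoritative version is the one depicted in Fig. 3.

import java.util.List;
import java.util.Set;

// An ASMT as a tuple (S, T): a set of states and a set of labeled transitions.
record Transition(String from, String to, String operation) {}
record ActivityStateMachineType(Set<String> states, List<Transition> transitions) {}

class GenericAsmt {
    // Completed and Terminated are the substates of Closed.
    static final ActivityStateMachineType GENERIC = new ActivityStateMachineType(
        Set.of("Uninitialized", "Ready", "Running", "Suspended",
               "Completed", "Terminated"),
        List.of(new Transition("Uninitialized", "Ready", "create"),
                new Transition("Ready", "Running", "start"),
                new Transition("Running", "Suspended", "pause"),
                new Transition("Suspended", "Running", "resume"),
                new Transition("Running", "Completed", "finish"),
                new Transition("Running", "Terminated", "terminate")));
}

Application-specific ASMTs, such as the one in Figure 4, would extend the state set and refine Running into substates.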
In addition to generic states, PPM supports application-specific states. For example, consider the ASMT of the Provide Local Exchange Service activity in the telecommunications MEP in Figure 1. Suppose that this activity has two application-specific states, Provisioning and Fulfilling. Figure 4 shows the corresponding ASMT. The Provide Local Exchange Service activity enters its Provisioning state when it starts running and the service provider starts allocating resources needed to provide the service, such as lines and slots in telecommunication switches. The activity state changes to Fulfilling when all necessary resources have been allocated and the activation of the service begins.
Fig. 4. Example: ASMT for Provide Local Exchange Service activity
The Provisioning and Fulfilling states in Figure 4 are a refinement of the Running state. The Provisioning to Fulfilling transition is labeled as internal in Figure 4. Internal is a special operation indicating that this state transition is not under the control of the process enactment system and it is caused solely by the activity implementation.
Internal state changes need special treatment in a process specification (this is discussed further in Section 4.2). After service installation has been confirmed, the activity enters its Completed state. An order can be canceled as long as it has not been fulfilled. Cancellation is represented by a transition from the Provisioning state to the Terminated state. To keep the example simple, we omit the activities necessary to cancel a service order.

Activity state machine subtyping. The ASMT shown in Figure 4 is a subtype of the generic ASMT in Figure 3. The refined ASMT is called a subtype of the original ASMT (referred to as the father type). When an activity state is refined, all state transitions of the refined activity state have to be replaced by transitions involving one or more of its substates. Such transitions are depicted as grey arcs in Figure 4. Suppose that C is a subtype of an ASMT F. C is a valid activity state machine subtype (ASMsubT) of F if and only if every transition in C either happens between two substates of the same parent state or has a corresponding transition between the parent states in the father type.

Formally, the valid states and transitions of an activity state subtype can be defined by considering that for each subtype C of an ASMT father type F there is a function refine_state. This function defines how the states of the father type F are refined by the subtype C by mapping each state of F to the corresponding set of C's states. Thus, assuming that SF is the set of F's states, SF is the domain of refine_state. Similarly, if SC is the set of C's states, SC is the range of refine_state.

Condition for valid states: To enable an unambiguous state refinement, each state of C must be a refinement of exactly one state of F, i.e.,
∀ s1, s2 ∈ SF: (refine_state(s1) ∩ refine_state(s2) ≠ ∅ ⇔ s1 = s2)  AND  SC = ∪s∈SF refine_state(s)
Conditions for valid transitions: Assuming that TC and TF are the sets of transitions in C and F, respectively, and C is a subtype of F, one of the following must hold for each transition (s1, s2) ∈ TC:
• both s1 and s2 are refinements of the same state in F, i.e., ∃ s ∈ SF: s1 ∈ refine_state(s) AND s2 ∈ refine_state(s),
• or there is a corresponding transition (sa, sb) in the father type F, i.e., ∃ sa, sb ∈ SF: (sa, sb) ∈ TF AND s1 ∈ refine_state(sa) AND s2 ∈ refine_state(sb).
These conditions on building activity types ensure that application-specific extensions can always be generalized in a meaningful way to a father type.
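A programmatic reading of these two conditions may make them easier to check. The following sketch, written against a simple (S, T) encoding with transitions held as (from, to) pairs, is illustrative only; the names and representation are assumptions, not part of PPM.

import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

class AsmSubtypeCheck {

    // refineState maps each father state to the set of child states refining it.
    static boolean isValidSubtype(Set<String> fatherStates,
                                  Set<List<String>> fatherTransitions,
                                  Set<String> childStates,
                                  Set<List<String>> childTransitions,
                                  Map<String, Set<String>> refineState) {
        // Condition for valid states: refinements of distinct father states are
        // disjoint, and together they cover exactly the child states.
        Set<String> union = new HashSet<>();
        for (String sf : fatherStates) {
            for (String sc : refineState.getOrDefault(sf, Set.of())) {
                if (!union.add(sc)) {
                    return false;   // some child state refines two father states
                }
            }
        }
        if (!union.equals(childStates)) {
            return false;
        }

        // Conditions for valid transitions: each child transition either stays
        // within the refinement of one father state or lifts to a father transition.
        for (List<String> t : childTransitions) {
            String f1 = fatherOf(t.get(0), refineState);
            String f2 = fatherOf(t.get(1), refineState);
            if (f1 == null || f2 == null) {
                return false;       // transition references an unrefined state
            }
            boolean sameParent = f1.equals(f2);
            boolean lifted = fatherTransitions.contains(List.of(f1, f2));
            if (!sameParent && !lifted) {
                return false;
            }
        }
        return true;
    }

    private static String fatherOf(String childState, Map<String, Set<String>> refineState) {
        for (Map.Entry<String, Set<String>> e : refineState.entrySet()) {
            if (e.getValue().contains(childState)) {
                return e.getKey();
            }
        }
        return null;
    }
}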
3.2
Input/Output Parameters
In addition to an ASMT, an activity interface includes the activity input and output parameters. The combination of ASMTs and parameters is required by PPM to define the activity type. PPM adopts the existing data types from standard workflow process models [16]. In particular, in PPM the input (output) parameters of an activity A are a set of typed data variables, denoted as IA (OA). When an activity is started, its input parameters are assigned values as specified by the dataflow dependencies of the enclosing process. In traditional workflow models, activity output is produced when
256
H. Schuster et al.
the activity ends. PPM extends this by permitting output parameter values to be made available when the activity enters a specific state. As a consequence, activities can output data while they are still running. Formally, the output behavior of an activity interface A can be defined as a function obA: OA → SA that maps each output parameter o of A to the state s of A that produces o (SA is the set of states of A). Thus, obA(o) = s means that the value of the output parameter o is available after the activity has reached the state s for the first time. If an activity writes to an output parameter after the activity has already reached the state in which this parameter becomes available, readers of the output parameter will read the result of the latest write. For example, the invoice amount of an order is different after an additional item has been added to the order.
4
Activity Implementation and Control
Activity implementation in PPM mainly involves mapping abstract activities to SEPs and/or services that implement them. To map abstract activities (basic activities or subprocesses) to concrete processes and services, we introduce the notion of concrete and abstract activity types. An activity type is concrete if it has a corresponding implementation, and all state transitions in its ASMT are labeled with operations supported by this implementation. An abstract activity type does not have a corresponding implementation, and may contain one or more unlabeled state transitions in its ASMT. The distinction between abstract and concrete activity types is necessary to decouple the activity interface from the activity implementation.

For example, consider the abstract Provide Local Exchange Service activity in both Figure 1 and Figure 2. Consider that this activity has the abstract ASMT illustrated in Figure 4. The abstract Provide Local Exchange Service may be implemented by a SEP (e.g., as in the reference MEP in Figure 1) or by a service activity (e.g., as in the service-based MEP in Figure 2). These alternative implementations of Provide Local Exchange Service are interchangeable at run-time.

In the following sections we discuss how abstract activity types are implemented by mapping them to concrete activity types. Section 4.1 discusses abstract activity implementation by a process (e.g., a SEP). Section 4.2 presents the implementation of an activity by a service activity and the implementation of service activities themselves.

4.1
Mapping Abstract Activities to Concrete Process Activities
The mapping of an abstract activity type to a concrete SEP depends on whether the SEP provides generic or application-specific operations in its concrete ASMT.
Mapping generic operations. In the following paragraphs we consider the mapping of an abstract MEP activity to a concrete SEP type that provides only generic operations. If an abstract MEP activity is implemented by a concrete SEP, each operation of the abstract activity is implemented by a concrete subactivity in the SEP. This is accomplished by mapping the operations and empty transition labels in the ASMT of the abstract activity to generic operations of the concrete ASMT.

For example, consider the abstract Provide Local Exchange Service activity whose ASMT is illustrated in Figure 4. This abstract MEP activity defines get_service_status() and add_service_feature() operations that query the service status and add additional service features, such as call waiting. Each of these operations is implemented by a subactivity in the concrete SEP whose ASMT is depicted in Figure 5. This is accomplished by mapping the operations get_service_status() and add_service_feature() of the abstract ASMT to appropriate state_change_of_subactivity() (or, shorter, sc_of_sub()) operations. In particular, while the concrete process is in the Provisioning state, additional items may be added to an order by invoking sc_of_sub(add_Item, any). The keyword any indicates that any state change of the add_Item subactivity is permitted in this state of the concrete process. Assuming that the add_Item subactivity has been enabled by the normal control flow, sc_of_sub(add_Item, any) causes the start of the add_Item subactivity.
Fig. 5. Example: Generic ASMT type of Provide Local Exchange Service activity
Mapping application-specific operations. Unlike generic activity operations, such as create() and start(), application-specific operations must be specified by the process designer. In contrast to methods in object-oriented programming languages and distributed object systems, application-specific activity operations cannot be implemented by arbitrary code. In process models such as PPM, the process subactivities and their dependency net form a lower-level specification that constitutes the process implementation. Since application-specific activity operations provide an abstraction of this implementation, each application-specific process activity operation is implemented by a script that invokes one or more activity operations on the subactivities of this process. As an example, consider an abstract Provide Local Exchange Service activity with the ASMT illustrated in Figure 4. Suppose that this abstract activity is implemented
by a concrete process that includes two application-specific operations: info() and item(). Assume that the ASMT of this concrete process is depicted in Figure 6.
Fig. 6. Example: Application-specific ASMT of Provide Local Exchange Service activity
In this example, the application-specific operations info() and item() in Figure 6 are implemented by subactivities of the concrete Provide Local Exchange Service process. In particular, info() and item() are implemented by the get_local_status and add_local_item subactivities and the integration script illustrated in Figure 7.
Fig. 7. Example: Fragment of a Provide Local Exchange Service Process
The info() and item() operations of the concrete ASMT are shown on the right side of Figure 7. Info() just creates and starts an instance of the get_local_status activity. Item() starts an instance of the add_local_item activity, waits for its termination, and then invokes the info() operation to inform the caller about the outcome. Note that the scripts do not move data between the activities. Data flow is covered by the process net.
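A rough Java rendering of these two integration scripts, as they are described above, is given below. The paper does not show PPM's scripting notation, so the ProcessApi and ScriptActivity helper types and their method names are assumptions made only for illustration.

// Hypothetical integration scripts for the application-specific operations of the
// concrete Provide Local Exchange Service process. ProcessApi and ScriptActivity
// are assumed helper types; PPM's real scripting interface may differ.
interface ProcessApi { ScriptActivity createInstance(String subactivityType); }
interface ScriptActivity { void start(); void waitForTermination(); }

class ProvideLocalExchangeServiceScripts {
    private final ProcessApi process;

    ProvideLocalExchangeServiceScripts(ProcessApi process) {
        this.process = process;
    }

    // info(): create and start an instance of the get_local_status subactivity.
    void info() {
        ScriptActivity status = process.createInstance("get_local_status");
        status.start();
    }

    // item(): start add_local_item, wait until it terminates, then report via info().
    // The scripts only coordinate operations; data flow is handled by the process net.
    void item() {
        ScriptActivity addItem = process.createInstance("add_local_item");
        addItem.start();
        addItem.waitForTermination();
        info();
    }
}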
4.2
Mapping Abstract Activities to Concrete Service Activities
In the following paragraphs we present how an abstract activity is implemented by a concrete service activity, and then we discuss the implementation of the service activity itself.
Mapping abstract activities to concrete service activities. If the abstract Provide Local Exchange Service activity in Figure 4 is implemented by an external service, the abstract activity is implemented by a service activity that by definition has no subactivities. In this case, the get_service_status() and add_service_feature() operations of the abstract activity are implemented by service-provided operations, say info() and item(), that retrieve status information and add additional items, e.g., item(CallWaiting). Figure 6 shows the concrete ASMT of such a service activity. Implementing concrete service activities. As we discussed in Section 2.2, service activities are proxies that convert the operations and/or states of each service to the behavior of an activity that is incorporated in the MEP. The implementation of an activity interface by a service activity is accomplished by implementing a corresponding service activity proxy program (SAPP). Note that services that support only a single start() invocation degenerate to basic or program activities [6, 16]. In contrast to activity interfaces and process specifications, which are descriptive specifications, the SAPP is to be coded in a traditional programming language. The service activity program uses the process system's API to access its input and output, to trigger internal state transitions, and to publish its activity operations to the process system. Hence, the SAPP is also an advanced wrapper program [12] for integrating external software systems. The deployment of a real program for this purpose is necessary because the SAPP has to deal with the (possibly proprietary) communication protocols and formats of the software system and translate them to the format used by the process management system. However, generic SAPPs are provided to deal with traditional basic activities that have only generic states and operations. By permitting application-specific states and operations, the service activities are far more powerful than SWAP [13] and the Workflow Management Facility [10]. Both of these propose a variant of a proxy mechanism that permits only the generic states and operations defined by the WfMC standard [16]. Therefore, they are not capable of capturing the application semantics of services.
4.3
State-Dependent Control and Data Flow
PPM provides control flow dependencies/transitions that are similar to those provided by traditional WfMSs, e.g., [6], and the WfMC standard [16]. PPM extends the traditional notion of control flow transitions to permit application-specific states to be used in control flow. Such control flow transitions are denoted as (av_source, s_source) → (av_target, s_target). They indicate that a state transition of an activity av_target to the s_target state is permitted after activity av_source enters state s_source. To ensure that the PPM dependencies can be enforced, only application-specific extensions of the Running state may be enabled by PPM control flow transitions. Additionally, transitions to Running substates that may be entered by internal operations (state transitions) are not allowed, because such transitions cannot be controlled by the execution system. Just like control flow, PPM extends the traditional data flow dependencies to consider the states of the involved activities. In particular, in a traditional dataflow dependency (av_source, out_source) → (av_target, in_target) the value of the output parameter out_source of an activity av_source flows to the input parameter in_target of an activity av_target after av_source is
closed (ends). PPM extends the semantics of a (av_source, out_source) → (av_target, in_target) dataflow dependency by allowing the data to flow as soon as av_source has reached the ob_source(out_source) state (the function ob was defined in Section 3.2). To avoid inconsistencies in the source dependency net, we adopt the strategy that dataflow has to follow control flow [8]. That is, for each dataflow dependency (av_source, out_source) → (av_target, in_target) there must be a control flow path from av_source to av_target that starts at av_source from:
• state ob_source(out_source), or
• a state that is reached later than ob_source(out_source) in av_source's state machine.
State-dependent control and dataflow are powerful primitives. In the activity scenario used in the previous sections, these dependencies enable the provisioning activity to output service activation information as soon as it has reached the Fulfilling state. A process that uses this activity can initiate further activities (using state-dependent control flow) as soon as this info becomes available.
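As an illustration of these state-dependent primitives, the following Java sketch encodes the enabling rule for a control flow transition and the timing rule for a data flow dependency. The type and method names are assumptions made for illustration, not PPM's implementation.

import java.util.Map;
import java.util.Set;

// Illustrative sketch of PPM-style state-dependent dependencies; the type and
// method names are assumptions, not the paper's implementation.
class StateDependentDependencies {

    // (av_source, s_source) -> (av_target, s_target): av_target may enter s_target
    // once av_source has entered s_source.
    static boolean controlTransitionEnabled(String sSource, Set<String> statesReachedBySource) {
        return statesReachedBySource.contains(sSource);
    }

    // (av_source, out_source) -> (av_target, in_target): the value of out_source may
    // flow as soon as av_source has reached ob_source(out_source).
    static boolean dataMayFlow(String outSource,
                               Map<String, String> obSource,   // output parameter -> producing state
                               Set<String> statesReachedBySource) {
        return statesReachedBySource.contains(obSource.get(outSource));
    }
}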
5
Activity Polymorphism
Activity polymorphism permits an activity variable to (1) use (i.e., hold references to) activity instances whose type is more specialized than the type of the activity variable, and (2) use different implementations for the same activity variable at process execution time. Polymorphism allows flexibility in matching and using activity types and activity implementations provided by different enterprises. In Section 5.1, we present the PPM solution for subtyping/specializing activity interfaces, focusing on input/output subtyping and ASMT subtyping. In Section 5.2, we introduce activity placeholders. This primitive adds the late binding needed to select an activity implementation at runtime.
5.1
Activity Interface Subtyping
This extends activity state machine subtyping in Section 3.1 by considering activity inputs and outputs. Analogously to programming languages, an activity type should be regarded as a subtype A_C of a father activity type A_F if instances of A_C show at least all the behavioral characteristics that instances of A_F show. This ensures that wherever an instance of A_F is expected, an instance of A_C can be used instead. However, A_C may show additional behavior that cannot be directly observed at A_F, as long as this additional behavior can be generalized to a behavior of A_F. In terms of activity interfaces as defined in Section 3, this means that an activity interface A_C with activity operations aop_C, activity state type asmt_C, and input/output parameters (I_C, O_C, ob_C) is an activity interface subtype of A_F with activity operations aop_F, activity state type asmt_F, and input/output parameters (I_F, O_F, ob_F) if:
1. asmt_C is an activity state subtype of asmt_F.
2. I_C ⊆ I_F.
3. O_C ⊇ O_F.
4. ∀ v ∈ O_F: ob_F(v) = ob_C(v), or there is no path in the state machine of asmt_C where ob_F(v) or a substate of ob_F(v) can be reached before ob_C(v) is reached.
Condition (1) ensures that the observable states of the child type can be generalized to the states of the father type. This is guaranteed by the definition of activity state subtypes. Condition (2) prevents an interface subtype from expecting more input parameters than the father type. This way, a process expecting an instance of the father type also provides sufficient input for any subtype. Condition (3) states that a subtype has to provide at least the output expected to be produced by the father type. The subtype may have additional outputs that are discarded by someone expecting an instance of the father type. Condition (4) places an additional restriction on the output behavior of activity subtypes. Because a process can rely on an activity variable of type A_F to produce output in a particular state (this is needed by state-dependent dataflow, discussed in Section 4.3), an activity subtype must not produce its output later than the father type, i.e., in a state that can be reached after the output-producing state of the father type. Please note that there is no mandatory condition on A_C's and A_F's activity operation sets aop_C and aop_F. If we assume that the process enactment system provides a worklist that informs each client about the operations currently available on a specific work item, a human client, and also a flexible software agent that acts as a client, can perform the right operations on every activity. However, if static programs are used as clients that cannot analyze the operation information on a work item and rely on the father's operation set aop_F, aop_C = aop_F must hold.
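A minimal sketch of how a type checker might test these four conditions is shown below. The interface representation and the reachability helper are assumptions made for illustration; the paper does not describe PPM's type checking at this level of detail.

import java.util.Map;
import java.util.Set;

// Illustrative description of an activity interface; field names are assumptions.
class ActivityInterfaceType {
    AsmtType asmt;                // activity state machine type
    Set<String> inputs;           // I
    Set<String> outputs;          // O
    Map<String, String> ob;       // ob: output parameter -> state that produces it
}

interface AsmtType {
    boolean isStateSubtypeOf(AsmtType father);                 // condition (1), assumed given
    // true if some path reaches `state` (or one of its substates) before `beforeState`
    boolean canReachBefore(String state, String beforeState);  // helper for condition (4)
}

class SubtypeChecker {
    // Tests whether candidate aC is an activity interface subtype of father aF.
    static boolean isInterfaceSubtype(ActivityInterfaceType aC, ActivityInterfaceType aF) {
        if (!aC.asmt.isStateSubtypeOf(aF.asmt)) return false;     // (1) asmt_C subtype of asmt_F
        if (!aF.inputs.containsAll(aC.inputs)) return false;      // (2) I_C subset of I_F
        if (!aC.outputs.containsAll(aF.outputs)) return false;    // (3) O_C superset of O_F
        for (String v : aF.outputs) {                             // (4) output not produced later
            String sF = aF.ob.get(v);
            String sC = aC.ob.get(v);
            if (!sF.equals(sC) && aC.asmt.canReachBefore(sF, sC)) return false;
        }
        return true;
    }
}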
5.2
Activity Placeholders
Activity placeholders allow for runtime assignment (late binding) of an activity implementation to a given activity variable. The type of the placeholder is determined by its interface. The placeholder is used instead of a concrete activity implementation in the declaration of an activity variable within a process. An activity placeholder has a selection policy attached, which is used at runtime to select the actual activity type for the placeholder's activity variable. To ensure the consistency of the process within which an activity placeholder is used, the set of possible activity types from which the selection policy chooses one is the set of subtypes of the placeholder's activity interface type. Activity placeholders and the corresponding selection policies can be used in many application scenarios. For example, a generic reference process can be configured to use the activity implementations available in a given enterprise by providing a selection policy that plugs in the enterprise's implementation for each activity interface. If multiple enterprises provide implementations for an activity interface, the selection policy may use a broker to choose the implementation that offers the best quality of service [4]. In application areas where the process designer does not have the expertise to choose a particular activity implementation, e.g., in crisis mitigation applications where only expert process participants know which concrete tests to perform, the selection policy may permit such a participant to select which implementation to use [5].
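Building on the subtyping sketch above, the following hypothetical Java fragment illustrates late binding through a placeholder with a pluggable selection policy. The interfaces shown are assumptions consistent with the description, not PPM's actual API.

import java.util.ArrayList;
import java.util.List;

// Hypothetical types illustrating placeholder-based late binding; not PPM's API.
// ActivityInterfaceType and SubtypeChecker are reused from the subtyping sketch above.
interface ActivityImplementation {
    ActivityInterfaceType interfaceType();
    void start();
}

// Chooses one implementation among the eligible candidates, e.g., via a QoS broker
// or by asking an expert participant.
interface SelectionPolicy {
    ActivityImplementation select(List<ActivityImplementation> candidates);
}

class ActivityPlaceholder {
    private final ActivityInterfaceType declaredInterface;   // determines the placeholder's type
    private final SelectionPolicy policy;

    ActivityPlaceholder(ActivityInterfaceType declaredInterface, SelectionPolicy policy) {
        this.declaredInterface = declaredInterface;
        this.policy = policy;
    }

    // At runtime, only implementations whose interface is a subtype of the declared
    // interface are eligible; the selection policy picks one of them (late binding).
    ActivityImplementation resolve(List<ActivityImplementation> available) {
        List<ActivityImplementation> eligible = new ArrayList<>();
        for (ActivityImplementation impl : available) {
            if (SubtypeChecker.isInterfaceSubtype(impl.interfaceType(), declaredInterface)) {
                eligible.add(impl);
            }
        }
        return policy.select(eligible);
    }
}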
6
Related Work
Separation of interface and implementation as well as interface subtyping and polymorphism are well-known concepts in programming languages, e.g., Java™ [14], and distributed (object) systems, e.g., CORBA [11]. They are particularly useful in heterogeneous environments because they enable the client of an object to be implemented and maintained independently from the implementation(s) of an object. We have adopted these ideas in the process world. Although object-oriented programming languages provide useful abstractions, they are not suitable for process modeling, since they do not include types and classes for capturing and implementing processes. Implementing such types and classes requires considerable effort, as indicated by current WfMSs. To the best of our knowledge, current work in workflow management and process models does not provide abstraction mechanisms such as the ones presented in this paper. The process handbook project [9] has the notions of inheritance and specialization of processes. However, this approach stays at a declarative business process level, i.e., implementation and enactment of the proposed concepts remain open in [9]. Van der Aalst et al. [1, 2] propose an inheritance mechanism for process implementations with the objective of dealing with dynamic change and evolution in workflows. Neither approach separates activity interfaces from implementations, which is essential to our approach.
7
Conclusion
In this paper we presented a novel model for multi-enterprise processes (MEPs), namely, the Polymorphic Process Model (PPM). The main advantage of this model lies in its ability to support both service-based MEPs and MEPs using a reference process specification. PPM accomplishes this task by separating the activity interfaces from activity implementations, and defining the activity interfaces in terms of activity state machines and their input/output parameters. This in turn makes it possible to use multiple implementations for the same activity in MEPs without the need for modifications of the specification or the running process. Together with strict rules of activity type subclassing, these characteristics allow for seamless integration of services and processes provided by multiple independent enterprises. A prototype of a process management system based on a subset of this model has been successfully implemented in MCC's Collaboration Management Infrastructure System [3]. A set of sample applications, including health care-related crisis management processes, military operation coordination, and a telecommunication provisioning service, have been designed, implemented, and demonstrated, proving that the PPM fulfills its promise. However, not all aspects of the PPM are fully developed yet. For example, the runtime choice of a particular service to be executed as the resolution of a placeholder is a demanding problem. We have done some initial work in this area, but many problems considered crucial for the success of Virtual Enterprises (whose core business is choosing and composing external services) are left to be solved. Key remaining problems include the semantic description of the services, the description of the quality of
the services (necessary for a meaningful service comparison), and the development of service contracts beyond the activity types we discussed in this paper.
References
1. van der Aalst, W.M.P.: Generic Workflow Models: How to handle dynamic change and capture management information. In: Proc. of the Fourth IFCIS Conf. on Cooperative Information Systems (CoopIS'99), Edinburgh, Scotland, September 1999.
2. van der Aalst, W.M.P.; Basten, T.; Verbeek, H.M.W.; Verkoulen, P.A.C.; Voorhoeve, M.: Adaptive Workflow: An Approach Based on Inheritance. In: Ibrahim, M.; Drabble, B. (Eds.): Proc. Workshop Intelligent Workflow and Process Management: The New Frontier for AI in Business, 16th Int. Joint Conf. on Artificial Intelligence (IJCAI'99), Stockholm, Sweden, August 1999.
3. Collaboration Management Infrastructure, http://www.mcc.com/projects/cmi, 2000.
4. Georgakopoulos, D.; Schuster, H.; Baker, D.; Cichocki, A.: Managing Process and Service Fusion in Virtual Enterprises. In: Information Systems, Special Issue on Information Systems Support for Electronic Commerce, 24(6), 1999.
5. Georgakopoulos, D.; Schuster, H.; Cichocki, A.; Baker, D.: Collaboration Management Infrastructure in Crisis Response Situations. In: Proc. 16th Int. Conference on Data Engineering (ICDE'2000), San Diego, March 2000.
6. IBM FlowMark - Managing Your Workflow. Version 2.3, IBM, 1996.
7. Jablonski, S.; Bußler, C.: Workflow Management - Modeling Concepts, Architecture and Implementation. International Thomson Computer Press, 1996.
8. Leymann, F.; Altenhuber, W.: Managing Business Processes as an Information Resource. In: IBM Systems Journal, 33(2), 1994.
9. Malone, T.W.; Crowston, K.; Lee, J.; Pentland, B.: Tools for Inventing Organizations: Toward a Handbook of Organizational Processes. In: Proc. of the 2nd IEEE Workshop on Enabling Technologies Infrastructure for Collaborative Enterprises, Morgantown, WV, April 20-22, 1993.
10. Object Management Group: Workflow Management Facility. Revised Submission, OMG Document Number bom/98-06-07, July 1998.
11. Object Management Group: The Common Object Request Broker: Architecture and Specification. Revision 2.3.1, October 1999.
12. Schuster, H.; Jablonski, S.; Heinl, P.; Bußler, C.: A General Framework for the Execution of Heterogeneous Programs in Workflow Management Systems. In: Proc. First IFCIS Int. Conf. on Cooperative Information Systems (CoopIS 96), Brussels, 1996.
13. Simple Workflow Access Protocol (SWAP). http://www.ics.uci.edu/~ietfswap/, 1999.
14. Sun Microsystems: Java™. http://java.sun.com/, 2000.
15. TeleManagement Forum: http://www.tmforum.org/, 1999.
16. Workflow Management Coalition: Interface 1: Process Definition Interchange Process Model. Document Number TC-1016-P, Document Status – Version 1.1, October 1999.
Gossip: An Awareness Engine for Increasing Product Awareness in Distributed Development Projects
Babak A. Farshchian
Dept. of Computer and Information Science, Norwegian University of Science and Technology, Trondheim
http://www.idi.ntnu.no/~baf
Abstract. More and more product development projects involve geographically distributed groups of developers. One problem in such groups is the long term lack of awareness of the activities in remote sites. In this paper we discuss the importance of awareness in distributed product development projects. We argue that generic services are needed in development environments for providing continuous awareness of remote sites. We introduce a product awareness model that puts focus on a shared composite product and the propagation of awareness in it. We describe the design and implementation of this awareness model in form of an awareness engine called Gossip.
1
Introduction
Information systems development is an area of intensive human collaboration [BJ75]. A normal practice for managing collaboration in large scale IS development projects has been to divide the system into parts and have the parts be developed separately by groups of developers, in this way reducing the amount of ad hoc communication and dependencies [Par72,Con68]. However, many recent case studies of project groups have revealed that this is not an easy task. No matter how rational the division of the system into parts is, each group will still need a large amount of information about what is happening within and across the groups in order to coordinate its work [HG99,Gri98,KS95]. Access to this ad hoc information becomes particularly problematic when the developers are geographically distributed. The effect of geographical distances on long term collaboration has been well documented in the literature. Herbsleb and Grinter [HG99], Kraut and Streeter [KS95], and Krasner et al. [KCI87] document the occurrences of communication breakdowns when there are organizational or geographical barriers among the group members. In a study of researchers from 70 research labs, Kraut and Egido [KE88] found that there was considerably higher collaboration frequency among researchers having offices in the same corridor than among those who did not. The reason, according to the authors, is that physical proximity increases the frequency and the quality of communication, and decreases the cost of initiating communication. As members of co–located groups, developers have the advantage of
constantly being aware of the status of the product being developed by simply using their social abilities, such as “looking over the shoulder” of each other, being involved in chance encounters in the corridors, and having a much greater opportunity for “keeping in touch” with each other. In distributed cooperation, both the amount and the quality of this information decreases. In addition, providing and consuming this awareness information becomes an explicit burden on the co–workers simply because most of the natural social channels of communication are eliminated. In the recent years there has been a large amount of research conducted with the aim of developing tools to support distributed development groups in performing specific and recurring types of tasks. Tools are developed for supporting collaborative modeling [DLN97], JAD sessions [CGN95], programming [HW98], and more. In addition to these specific tools and environments, we believe there is a need for more generic tools for simulating long–term physical proximity of the distributed project members, making it easy and less costly for the developers to initiate collaboration when they need it. In this way strict geographical divisions can be relaxed, and the collaboration can proceed in a more natural and flexible way. One essential step in doing this is to develop support systems that increase the amount of long term awareness provided to project members about the activities of remote co–workers. As a part of our research on product development environments, we have been developing a framework for supporting collaboration in distributed product development projects. The framework consists of three parts (Fig. 1). An underlying product layer is in charge of providing awareness of a distributed virtual product consisting of parts. A cluster layer implements mechanisms for grouping product parts into collaboration aware clusters, such as diagrams. An application layer assists the integration and use of various development tools. In this paper we describe Gossip. Gossip implements the product layer, and is in charge of keeping all the other parts of the framework constantly aware of user activities involving different parts of the product, in this way providing a shared collaboration space for distributed development groups. The structure of the paper is as follows: In Sect. 2 we will look at different approaches to awareness support. In Sect. 3 we introduce our product awareness model. Section 4 describes Gossip, which implements this awareness model in form of an awareness engine. Section 5 provides a discussion and directions for future work.
2
Approaches to Awareness Support
Awareness is the information that human beings exchange with their environment in order to coordinate their work with others. Providing computer–based mechanisms for supporting the exchange of awareness information has been shown to be of central importance to the design of collaboration support systems [GG98]. For the purpose of this paper, we will organize the research in awareness support along two dimensions (Fig. 2). The first dimension focuses on the quantity of the exchanged awareness information compared to the natural co–located situations.
Fig. 1. A framework for supporting product awareness in distributed product development environments.
The second dimension is the level of organizational awareness provided by the awareness mechanism. By organizational awareness we mean awareness of the overall organization of the work. High quantities of awareness information are normally important in order to increase the “naturalness” of the collaboration, while access to proper organizational awareness facilitates collaboration in large groups. The quantity of transmitted awareness information is largest in co–located situations, where all the available social channels are used to their full capacity. Media spaces are probably the closest imitations of co–located situations regarding the quantity of awareness information. A typical media space consists of permanent video and audio connections between geographically distributed sites. In addition to the large amount of awareness information, the permanence of the
Fig. 2. A comparison of different awareness models.
connections reduces the cost of initiating collaboration, contributing further to the creation of a common social space in the long run [BHI93]. Smaller amounts of awareness information are typically transmitted through software based collaboration support systems. This is partly because of the difficulty of collecting this information from the group's natural context, but also because of the difficulty of visualizing this information in a proper way. A challenging part of implementing awareness support is thus to find the proper awareness model for the domain to be supported. This model will decide what awareness information will be delivered to whom, in this way both making better use of the limited information bandwidth, and ensuring that those who need the awareness information will get it. Several awareness models have been proposed in the literature. Normally, awareness models have a notion of shared space, presence and focus. Shared space is where collaboration objects have to reside in order to be able to produce and consume awareness information. An object makes its presence known to the shared space by registering itself with a set of parameters. Once present in the shared space, the objects commit to update their presence information regularly. Objects can in addition focus on other objects by providing to the shared space a set of parameters that defines their focus. The focus can also be updated regularly. In the spatial model of Benford and Fahlén [BF93], for instance, each object declares its presence in the shared space by its coordinates according to some spatial measure (e.g. geographical location). By changing its coordinates, an object can "move" in the shared space. The focus of each object is defined in form of an "aura"; each object will be able to receive awareness about other objects that are inside its aura, e.g. are close to it. Moving in the space will also move the object's aura.
The spatial model is mainly used in virtual reality settings. A similar awareness model is the room–based model, where awareness of the activities in a virtual room is propagated to all the inhabitants of the room, while the “walls” limit the propagation of awareness information to the outside. This model is used in many shared workspace applications, notably TeamRooms [RG96]. A third model is the activity–based model. Here, awareness information delivery is not based on spatial relations, but on the activities the user is currently involved in [FPBP95]. Social awareness models, on the other hand, put more focus on human participants and interactions among these. One such model is the locale model used in Orbit [MKF+ 97]. The locale model provides awareness based on “social worlds,” e.g. centers of social activities. Awareness can be connected to the immediate surroundings of a user, or to a broader scope of happenings in an organizational context (the organizational awareness axis in Fig. 2). The activity–based model used in GroupDesk [FPBP95] provides a good example of high organizational support. GroupDesk is used in conjunction with a workflow application, where a user involved in a step in a workflow is made aware of the activities in other related steps. Spatial and room–based awareness models are more limited in the organizational support dimension since they strictly focus on the physical space surrounding a user. Social models support well the informal organizational structure, which may or may not mirror the formal structure. Though CASE and similar product–based tools provide access to an organizational context in form of a large shared product, the degree of awareness information exchange is low [VS95]. Central repositories used in product–centered development tools provide a first technological step for supporting collaboration, but more is needed. These repositories are normally developed in form of time–sharing systems. In particular, they are based on the assumption that conflicts among developers should be delayed as long as possible [JMR92], with the consequence of isolating developers from each other. Despite this attempt to isolate, Vessey and Sravanapudi [VS95] found that having access to a shared product and being able to observe the changes done to it by others had an implicit coordinative role in CASE tools. In the next section we will introduce a new awareness model that tries to address this problem by supporting exchange of explicit product awareness among developers.
3
A Product Awareness Model for Distributed Product Development
In this paper we introduce an awareness model that uses the product, in particular its structure, as a basis for providing awareness to the developers in a distributed project. Our assumption, as discussed in the introduction, is that a developer needs to have constant awareness of the activities related to a shared product. These activities are first and foremost related to the parts of the product that a developer is directly working on, but a developer may also need to be informed about the activities related to other parts of the product that are some-
how related to "his" parts. We believe this model can be useful in a distributed product development context because the product is normally the main focus of work for the developers, and the shared product is something the developers can "talk about." Being kept updated about what is happening to the product will reduce the risk of double–work and integration problems. In addition, since the product is normally used throughout the project's life independent of the way the developers work and the way they are combined in groups, the product can be used as a permanent information basis for coordinating the work. Our product awareness model will increase the quantity of product awareness, and will make it easy for developers to exchange this awareness information. The next section describes the details of the model. We emphasize that product awareness is only one of several forms of awareness in a product development environment [HG99]. Therefore the model introduced here should be used in combination with other models in order to support the true simulation of physical proximity motivated in the introduction.
3.1
The Core Concepts of the Model
The product awareness model introduces a virtual space for exchanging awareness of activities involving a shared product. The model also supports propagation of awareness; awareness information can propagate from one part of the product to another, possibly through several intermediate parts. In this way, a group of developers working on one part of the product can get the proper awareness from any other part, in addition to the part they directly work with. This propagation property is important because of the often large size and complex structure of the products being developed. The core concepts of the model are product model, product object, awareness relation, direct awareness, mediated awareness, awareness producer, and awareness consumer. These concepts and the associated relations are shown in the ER diagram in Fig. 3. The product model is a representation of the shared product being developed by a group of developers. The product model constitutes the shared space where different parts of a product can register their presence in form of product objects. Product objects may represent documents, diagrams, source code, etc. The product parts may already be stored in a CASE repository, on the Internet, or in a database elsewhere. For each such part, only one product object may exist in the product model, but the granularity of the parts is not limited by the awareness model. Each product object is capable of generating awareness information as a result of being manipulated. The model supports a notion of focus for product objects. Each product object's focus is defined by creating awareness relations from those product objects it focuses on, and to the product object itself. Each awareness relation therefore has a source and a destination product object, indicating the direction of the flow of awareness information. Criteria for deciding what each product object should focus on, i.e. how awareness relations should be created among product objects, are not defined by the model. Awareness relations may be defined based on product architecture, or other criteria such as the
Fig. 3. The main concepts of the product–based awareness model and the relationships among them.
specific usage patterns of a project group. These criteria are mainly defined by the cluster and application layers in the framework shown in Fig. 1. The awareness model and its implementation only focus on providing easy mechanisms for creating awareness relations as needed. A product model is in this way created in order to mirror the real product and the various relations among its parts. An example of a product model is shown in Fig. 4. It represents a product consisting of three main modules, database, middleware and user interface. In addition, each module consists of other sub–modules. In this example, product objects are created for each module or sub–module in the product. Awareness relations are created among the product objects, reflecting the paths through which one would like product awareness to be propagated. In this case an awareness relation exists from the database module to the middleware module, and one from the middleware module to the user interface module. For each of these modules, all the sub– modules are also connected to their parent module using awareness relations. This means, for instance, that if some object in the database module is modified, the middleware module will get awareness information about the modifications. As the product development process proceeds, product parts may be manipulated in different ways. Manipulation activities include those that are normally supported by development tools, such as creating, accessing and deleting product parts. It is assumed that each manipulation of a product part is simulated in the product model by manipulating the part’s corresponding product object. The various types of manipulation are not defined by the awareness model. Groups of developers can define and share their own manipulation types in the application layer of our framework (shown in Fig. 1). The awareness model represents developers and their tools as awareness producers. Each manipulation activity performed by an awareness producer will produce a unit of direct awareness in the product model. This information indicates a change in the
Fig. 4. An example Gossip product model. The rectangles are product objects and the arrows are awareness relations.
product object's presence in the shared space. Each direct awareness contains information about the awareness producer who did the manipulation, the product object that was manipulated, an identifier for the manipulation type, and other user–defined values depending on the type of the manipulation. Developers sharing the same product object can exchange direct awareness related to these objects. Moreover, direct awareness can be propagated to other product objects. Propagation of awareness inside the product model happens through the mediation mechanism. Each product object broadcasts all its awareness to all the product objects in the product model. Each product object that is focusing on the broadcasting product object will generate a new mediated awareness based on the broadcasted awareness. This mediated awareness contains the same information as the original awareness, except that its originator product object is changed to the receiving product object. However, the mediated awareness also contains a pointer to the originally manipulated product object (the "is mediator of" relation in Fig. 3). Mediation from a product object X to a product object Y is thus possible only if a path consisting of (possibly several) awareness relations exists from X to Y. Awareness consumers represent developers or other applications that make use of the awareness generated by the product model and its product objects. In the framework of Fig. 1 awareness consumers are the clusters in the cluster layer, but awareness may also be consumed directly by any application. Awareness consumers have access to both direct awareness (produced by the product objects they work with directly) and mediated awareness (produced by other product objects, related to the product objects they work
with directly). In Fig. 4 developer A is an awareness producer who is manipulating the product part presented by “Sub–module 2” in the product model. As a result of his manipulation, a direct awareness is generated. This direct awareness is used by developer B, who is an awareness consumer working with the same product object. In addition, the direct awareness is mediated by product objects “Database module” and “Middleware module.” This causes awareness consumers C and D to receive a mediated awareness from their corresponding product objects. An actor (a developer or an application) can be both awareness producer (producing awareness related to own activities) and awareness consumer (consuming awareness of others’ activities). Using this product awareness model, each developer produces awareness information based on his own activities related to the shared product, for instance by manipulating a set of product objects in his workspace. Each developer can also have access to relevant product awareness by simply having the relevant product objects in his workspace. In addition, each unit of awareness information is directly connected to the developer who did the manipulation, in this way opening for social interaction among developers with related interests in the shared product. Another advantage of the model is that each developer only needs to focus on those product objects that are of direct importance to his own work, disregarding the rest. The product objects he is working with will inform him about peripheral changes in the product that might be of interest to him, at the same time hiding the irrelevant part of the awareness information. This can greatly reduce the effort needed for keeping an eye on everything that might be important to the developer’s work.
4
Gossip: An Awareness Engine for Supporting Product Awareness
In recent years the need for generic awareness services has increased. A generic awareness service typically provides an interface for different kinds of clients to produce and consume awareness information in an easy way. Notification servers have been used widely as a technical solution for providing awareness services [RDR98]. Using a notification server, a client can generate events, which are captured by the server and redistributed to other clients. In addition, a notification server also has to decide, based on an awareness model, which clients need to receive which notifications. A notification server that implements an awareness model may be called an awareness engine. We have developed an awareness engine called Gossip for implementing the product awareness model described in the previous section. The functionality and the internal architecture of Gossip are described in the next sections.
4.1
The Functionality of Gossip
Gossip provides a set of uniform network–based services for creating and maintaining a product model in an evolutionary manner, and for delivering notifications based on the clients’ activities related to this model. The product model
Fig. 5. Clients can use the services provided by Gossip to register their product objects and create awareness relations among them.
manipulation operations may be integrated into the various tools of the developers, making it possible to automatically maintain the product model while each developer is working with his local product part. In addition, a uniform notification delivery service makes it easy to receive and consume notifications of other developers' manipulations. Figure 5 shows a usage scenario for Gossip. Different clients communicate with Gossip in order to register a subset of their product parts (objects with black color) as product objects in Gossip (objects with gray color). In addition, awareness relations are created among the product objects registered in Gossip independently of which client owns which product objects. In this way, all the clients are integrated in a shared collaboration space based on a shared product model. Gossip enforces its own name space for the registered product objects, and implements mechanisms for keeping the product model consistent. The clients, on the other hand, are responsible for correct updates to the product model using Gossip's network protocol. Manipulation mechanisms are accessible in form of operations on the shared product model. There are two groups of manipulation operations, product object manipulation and awareness relation manipulation. In addition, Gossip produces different notification events as a result of product object manipulations. Query mechanisms are also provided to the clients in order to ask Gossip about various product model information. The following describes manipulation and notification functions provided by Gossip. Product Object Manipulation. Clients can issue requests for product object manipulation. Besides product object registration and deletion, Gossip supports operations such as adding, changing, reading and deleting attribute=value pairs for each product object. The attribute sets for each product object type are defined by client groups, and will resemble the attributes of the real product parts that are to be shared within the client group. Note that product parts (e.g. the actual files, documents, diagrams, descriptive attributes, etc.) need not
be stored in Gossip, but a pointer to each part can be registered. This policy requires that product manipulation operations precisely simulate the actual manipulations done on the product parts at each client site. This implies that the clients can still use their local repositories, and at the same time use Gossip to exchange product awareness. They will use product object manipulation operations to inform Gossip about how they manipulate their product parts. In this way Gossip will help its clients to keep updated about each other’s activities related to a shared (distributed) product. Each client can choose to register in Gossip only a sub–set of its product parts (in Fig. 5, product parts with white color are not shared among clients). Awareness Relation Manipulation. In addition to object manipulation, clients can also request awareness relation manipulation in order to create and manipulate awareness relations among existing product objects. Awareness relations are stored fully inside Gossip. Each awareness relation is represented in form of a set of operation=strength pairs, where operation decides what kind of awareness information the relation will mediate, and strength decides how many product objects each awareness information can be mediated through. For instance, if an awareness relation has an “updateAttribute” field with a strength of 2, the relation will mediate awareness related to all “updateAttribute” operations that have not already been mediated twice. Using operation=strength values for each awareness relation one can filter most of the information that is considered unnecessary. For instance, awareness of read operations on an attribute from a product object that is “three objects away” is normally considered not so important, while it would be interesting to get information about who is currently reading the objects in one’s own workspace. Gossip supports operations for adding and deleting awareness relations, and adding, deleting, and updating operation=strength pairs on the existing awareness relations. Notifications. Notifications are sent only for product object manipulations. There are two types of notifications, direct and mediated. Direct notifications have a pointer to the client who did the manipulation, and a pointer to the product object that was manipulated. Mediated notifications have in addition a list of pointers to all other product objects that the notification has passed through (recall that for mediated awareness, the manipulated product object is different from the product object that provides the awareness, as shown in Fig. 4). Additional information can be contained in each notification event for allowing the receiving client synchronize its own state. For instance, a “createNewObject” notification will contain the identifier of the new product object, and an “updateObject” notification will have the name and the new value of the updated attribute. The set of available operations is extendable. In particular, there is no limit on which attributes can be manipulated for each product object, and the manipulation types that can be simulated are not predefined. All this can be decided among the clients sharing the product model. Gossip will only propagate the awareness and produce the proper notification events.
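The operation=strength filtering can be sketched as follows in Java: a relation mediates a notification for a given operation only while the notification's mediation count is below the relation's strength for that operation. The representation is an assumption made for illustration; Gossip's internal data structures are not given in the paper.

import java.util.HashMap;
import java.util.Map;

// Illustrative sketch of strength-based mediation filtering; names are assumptions.
class MediationNotification {
    String operation;        // e.g. "updateAttribute"
    int timesMediated;       // how often this notification has already been mediated
}

class StrengthBasedRelation {
    // operation=strength pairs: how many mediation hops are allowed per operation type
    private final Map<String, Integer> strengths = new HashMap<>();

    void setStrength(String operation, int strength) {
        strengths.put(operation, strength);
    }

    // Mediate only if the relation covers the operation and the hop budget is not used up.
    boolean shouldMediate(MediationNotification n) {
        Integer strength = strengths.get(n.operation);
        return strength != null && n.timesMediated < strength;
    }
}

For example, a relation configured with setStrength("updateAttribute", 2) would still mediate an updateAttribute notification that has been mediated once, but not one that has already been mediated twice, matching the behaviour described above.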
Fig. 6. The internal architecture of Gossip.
4.2
The Architecture of Gossip
Figure 6 shows the internal architecture of Gossip. There are two types of Gossip clients. Producer clients can send requests to Gossip for performing an operation on the product model (product object or awareness relation manipulation). The product model is stored in two registers, product object register and awareness relation register. Each product object manipulation operation may change the product object register, and will eventually generate one or more notification events (awareness relation manipulations do not produce notifications). Consumer clients can receive notification events from the server. A client can choose to be a producer, a consumer, or both. When a request is sent to Gossip by a producer client, Receiver receives the request and is responsible for performing the requested operation on the product model. After an operation is performed, the producer client will get an acknowledgement event, and a new direct notification event is created. This direct notification is then sent to an internal notification bus. Each awareness relation is implemented in Gossip in form of an awareness agent with a source and a destination product object. Awareness agents are responsible for generating mediated notification events on behalf of their destination product objects, and therefore listen to the internal notification bus to monitor product object manipulations. Each awareness agent will check the notification bus for events that have originated from the agent’s source product object. For each such event, the agent will generate a new mediated notification event on behalf of its destination product object. These mediated notification events are again sent to the notification bus, and other awareness agents may in turn generate new events based on them. Sender receives all the notification events from the notification bus. It then checks each event against a filter register before sending them out. Each con-
sumer client has the possibility of registering a filter that will prevent certain events from being sent to it. The filter uses a set of criteria for filtering out events. One important criterion is to check for the owner of the event. Different clients, or groups of clients, may create their own awareness relations in the shared product model, in this way specifying a particular configuration of awareness information that is needed by that client or group. Each mediated notification event therefore contains a field indicating which such configuration the event belongs to.
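Assuming that filters are simple per-consumer predicates, the Sender's filtering step could be sketched as below. The event, client, and sender types are hypothetical; the paper does not specify this interface.

import java.util.List;
import java.util.Map;
import java.util.function.Predicate;

// Hypothetical sketch of the Sender checking events against per-consumer filters.
interface NotificationEvent { String configuration(); }
interface ConsumerClient { String id(); void deliver(NotificationEvent event); }

class GossipSender {
    // filter register: consumer id -> predicate returning true for events to suppress
    private final Map<String, Predicate<NotificationEvent>> filterRegister;

    GossipSender(Map<String, Predicate<NotificationEvent>> filterRegister) {
        this.filterRegister = filterRegister;
    }

    void dispatch(NotificationEvent event, List<ConsumerClient> consumers) {
        for (ConsumerClient consumer : consumers) {
            Predicate<NotificationEvent> filter = filterRegister.get(consumer.id());
            boolean suppressed = filter != null && filter.test(event);
            if (!suppressed) {
                consumer.deliver(event);
            }
        }
    }
}

A consumer interested only in its own group's awareness configuration would, for instance, register a predicate that suppresses events whose configuration field names a different group.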
4.3
The Implementation of Gossip
Gossip is implemented in the Java programming language. The server provides a small set of public interface classes. These classes hide most of the network–related details from clients. Making the interface as simple as possible has been one of our main goals in order to motivate developers to develop Gossip–enabled clients, or extend existing clients with Gossip–related functionality. Using the interface classes the clients can easily send requests to, and receive notifications from, Gossip. Network communication in Gossip, both towards clients and internally in the notification bus, is based on JSDT (Java Shared Data Toolkit, http://java.sun.com/products/java-media/jsdt/). JSDT is a flexible toolkit provided by Sun as an extension to the Java Development Toolkit. This toolkit implements useful groupware abstractions such as sessions and channels. The connection between the clients and Gossip is through a JSDT session. Inside this session, two channels are used for communication between Gossip and the producer and consumer clients. The internal notification bus is implemented in form of a network channel that is accessible only by other Gossip servers. Several Gossip servers can share the same notification bus, in this way creating a Gossip network. This is useful for scalability and performance reasons. Our experience with using Gossip shows that inside a high–speed local network notifications are distributed in real time. We have in fact used Gossip in a synchronous graphical group editor where the notifications are used to synchronize the screens of the clients. The notifications are distributed at a much slower speed on the Internet. Having one local server for each local group can help to build a more optimized Gossip infrastructure.
5
Conclusions
In this paper we have discussed the importance of awareness in product development projects where the developers cooperate across geographical distances for long periods of time. We have discussed product awareness, and have introduced an awareness model that puts focus on the shared product and the propagation of awareness in large composite products. This awareness model is implemented
in form of an awareness engine called Gossip, which is used as a part of a framework for collaboration support in distributed product development. The design and implementation of Gossip are described. In addition to Gossip, and as a part of the discussed framework, we have implemented two test applications. One is a graphical editor for creating simple Entity–Relation models. This editor is used to demonstrate the capability of Gossip to support synchronization of screens in real time. The editor also demonstrates how the various operations in a tool can be configured to automatically trigger operations in Gossip. The other application is a shared workspace application that demonstrates how different awareness models can be used in combination. There are several issues that constitute our future research and development agenda. One important issue is privacy. The high specialization of labor among developers makes it unrealistic to expect that they will expose information about their activities without a second thought. The name "Gossip" is chosen deliberately to emphasize this. Gossip provides easy mechanisms for inserting and removing product parts from a shared space, but more advanced mechanisms, such as access control, are needed. The current version of Gossip does not fully implement the filtering mechanisms discussed in the previous section. We need more usage data in order to find out what filter mechanisms and criteria are needed. With large scale deployment of Gossip, filter mechanisms will become a necessity not only for having a higher degree of individual tailoring, but also because of technical issues regarding network bandwidth usage.
Acknowledgements
I thank Arne Sølvberg, Monica Divitini, and the anonymous reviewers for their comments on this paper. Part of this research is supported by a grant from Andersen Consulting Forskningsfond, Norway.
References
BF93. Steve Benford and Lennart Fahlén. A Spatial Model of Interaction in Large Virtual Environments. In Proceedings of ECSCW, Milano, Italy, pages 109–124, September 1993. Kluwer Academic Publishers.
BHI93. Sara Bly, Steve R. Harrison, and Susan Irwin. Media Spaces: Video, Audio, and Computing. Communications of the ACM, 36(1):28–47, January 1993.
BJ75. Frederick P. Brooks Jr. The Mythical Man–Month – Essays on Software Engineering. Addison–Wesley, Reading, MA, 1975.
CGN95. Erran Carmel, Joey F. George, and Jay F. Nunamaker, Jr. Examining the Process of Electronic JAD. End User Computing, 7(1):13–22, 1995.
Con68. Melvin E. Conway. How do committees invent? Datamation, 14(4):28–31, April 1968.
CSC98. CSCW'98, editor. Proceedings of the Conference on CSCW, Seattle, Washington, USA, November 1998. ACM Press.
DB92. Paul Dourish and Victoria Bellotti. Awareness and Coordination in Shared Workspaces. In Proceedings of the Conference on CSCW, Toronto, Canada, pages 107–114, October 1992. ACM Press.
DLN97. Douglas L. Dean, James D. Lee, and Jay F. Nunamaker, Jr. Group Tools and Methods to Support Data Model Development, Standardization, and Review. In Proceedings of the 30th Hawaii Int'l Conf. on System Sciences. IEEE Computer Society Press, 1997.
FPBP95. Ludwin Fuchs, Uta Pankoke-Babatz, and Wolfgang Prinz. Supporting Cooperative Awareness with Local Event Mechanisms. In Proceedings of ECSCW, Stockholm, Sweden, pages 247–262. Kluwer Academic Publishers, September 1995.
GG98. Carl Gutwin and Saul Greenberg. Effects of Awareness Support on Groupware Usability. In Proceedings of CHI, Los Angeles, CA, USA, pages 511–518, April 1998. ACM Press.
Gri98. Rebecca E. Grinter. Recomposition: Putting It All Back Together Again. In CSCW'98 [CSC98], pages 393–402.
HG99. James D. Herbsleb and Rebecca E. Grinter. Splitting the Organization and Integrating the Code: Conway's Law Revisited. In Proceedings of ICSE'99, Los Angeles, California, USA, May 1999. ACM Press.
HW98. Chung-Hua Hu and Feng-Jian Wang. A Multi–User Visual Object–Oriented Programming Environment. In Proceedings of COMPSAC'98, Vienna, Austria, pages 262–268, 1998. IEEE Computer Society Press.
JMR92. Matthias Jarke, Carlos Maltzahn, and Thomas Rose. Sharing Processes: Team Coordination in Design Repositories. Intelligent and Cooperative Information Systems, 1(1):145–167, March 1992.
KCI87. Herb Krasner, Bill Curtis, and Neil Iscoe. Communication Breakdowns and Boundary Spanning Activities on Large Programming Projects. In Proceedings of Empirical Studies of Programmers, Washington D.C., USA, pages 47–64. Ablex Publishing Corporation, December 1987.
KE88. Robert Kraut and Carmen Egido. Patterns of Contact and Communication in Scientific Research Collaboration. In Proceedings of the Conference on CSCW, Portland, OR, USA, pages 1–12. ACM, September 1988.
KS95. Robert E. Kraut and Lynn Streeter. Coordination in Software Development. Communications of the ACM, 38(3):69–81, March 1995.
MKF+97. Tim Mansfield, Simon Kaplan, Geraldine Fitzpatrick, Ted Phelps, Mark Fitzpatrick, and Richard Taylor. Evolving Orbit: a progress report on building locales. In Proceedings of Group'97, Phoenix, USA, pages 241–250, November 1997. ACM Press.
Par72. D. L. Parnas. On the Criteria To Be Used in Decomposing Systems into Modules. Communications of the ACM, 15(12):1053–1058, 1972.
RDR98. Devina Ramduny, Alan Dix, and Tom Rodden. Exploring the design space for notification servers. In CSCW'98 [CSC98], pages 227–235.
RG96. Mark Roseman and Saul Greenberg. TeamRooms: Network Places for Collaboration. In Proceedings of the Conference on CSCW, Cambridge, Mass., USA, pages 325–333, November 1996. ACM Press.
TW97. Hilda Tellioğlu and Ina Wagner. Negotiating Boundaries – Configuration Management in Software Development Teams. Computer Supported Cooperative Work, 6(4):251–274, 1997.
VS95. Iris Vessey and Ajay Paul Sravanapudi. CASE Tools as Collaborative Support Technologies. Communications of the ACM, 38(1):83–95, 1995.
How Culture Might Impact on the Implementation of Enterprise Resource Planning Packages
Marina Krumbholz and Neil Arthur McDougall Maiden
Centre for Human-Computer Interaction Design, City University, Northampton Square, London, EC1V 0HB
Tel: +44-171-477-8412, Fax: +44-171-477-8859, [email protected]
Abstract. ERP (Enterprise Resource Planning) packages provide generic off-the-shelf business and software solutions to customers. However, these are implemented in companies with different corporate and national cultures, and there is growing evidence that failure to adapt ERP packages to fit these cultures leads to projects which are expensive and overdue. This paper describes research which synthesises social science theories of culture to be able to model and predict the impact of culture on ERP package implementation. It describes a knowledge meta-schema for modelling the surface and the deeper manifestations of culture, and for integrating these models with more common business concepts such as processes, events and information flows. It reports data from an analysis of a recent ERP implementation in a higher education institution to validate the knowledge meta-schema. It concludes with an outline of a method for ERP implementation to ensure a fit with the customer's corporate and national culture.
1. The Impact of Culture on ERP Implementation
ERP (Enterprise Resource Planning) software packages are an essential part of enterprise-wide information systems. An ERP package, such as SAP's R/3, is a large off-the-shelf software solution which provides integrated business and software systems to a customer. Unlike the traditional software development approach, which promotes building systems from scratch, ERP packages encapsulate reusable best business processes and software. Customers purchase the package then configure its business processes and software systems to meet their requirements. At the end of 1997, Business Week estimated the value of the ERP market at $10 UK billion and growing. Indeed, ERP packages are a significant part of a total software package market set to surge ahead with total licence revenues growing from $8.5 UK bn worldwide in 1998 to $20.8 UK bn in 2003, and predictions that over 60% of Europe's business solutions will be developed from packages by 2003 (source: Forrester research).
However, a recent Standish Group report on ERP implementation projects reveals that these projects were, on average, 178% over budget, took 2.5 times as long as intended and delivered only 30% of promised benefit. A survey of 12 recent projects revealed that adapting the implementation to the prevailing cultural style was one important cause of this project under-performance [7]. This importance of culture is hardly surprising. A customer who implements an ERP package has to change its business processes to the ERP supplier's best-practice processes [4]. The change both impacts on the customer's corporate culture (i.e. the ways that things are done in the organisation) and is constrained by it. In Europe, the picture is even more complex because companies also have diverse national cultures which influence this corporate culture and make the successful implementation of multi-national ERP solutions difficult. A recent such ERP implementation encountered major problems due to differences between the different participating national partners [10]. Indeed, evidence suggests that ERP implementations in North America have been more effective than those in Europe because of the more complex European corporate and national cultures [22]. If more ERP implementations are to deliver their promised benefits within budget, we need to understand how corporate and national culture impact on ERP implementations, and how this understanding will deliver better methods for implementation partners and customers to use. In contrast to the lack of research in computer science, social and system science has researched the influence of corporate and national culture on organisational behaviour. Unfortunately this research neither addresses issues which are specific to information systems development, nor does it have a tradition of model-theoretic approaches which are familiar in information systems research to describe and predict problems and their solutions. If we are to implement ERP solutions which recognise corporate and national cultures, we need at least to model culture to describe and to predict its impact on an ERP implementation. This is the focus of a current research programme based in the Centre for HCI Design and supported by the world's largest ERP vendor SAP. This paper reports first results from the programme. The model of culture presented in this paper synthesises social and system science theories of corporate [e.g. 20] and national [e.g. 21] culture. A method then applies the resulting model to design and evaluate methods for adapting ERP implementations to fit the corporate culture. It assumes that the customer's national culture is manifest through the corporate culture. As shown in Figure 1, current ERP package implementations reveal 3 types of culture-related problem which impact on a successful implementation: (i) the current corporate culture clashes with the planned future culture; (ii) the supplier's culture, implicit in the ERP package, clashes with the customer's corporate culture; and (iii) the new business processes (configured using the ERP solution) clash with the existing corporate culture. All of these types of clash necessitate customisation of the business processes and software to resolve the clash. However, such customisation is expensive: on average 10-20% of an ERP implementation budget is spent on ERP customisation (source: Forrester Research).
Our research posits that to detect these 3 types of culture-related clash we need to be able to model: (i) the customer’s business processes and solution systems; (ii) facets of the corporate and national culture which impact on the implementation of
[Figure 1 (not reproduced): the ASIS, TOBE, ERP and MIGHTBE models of current and new software, processes and culture, with the three culture clashes between them.]
Fig. 1: How culture impacts on ERP implementation. During an ideal ERP implementation process there will be 4 different models of the organisation’s culture, processes and software systems: (i) the ASIS model, which describes the organisation’s current culture, processes and software; (ii) the TOBE model, which describes the required culture, processes and software; (iii) the ERP model, which describes the ERP supplier’s processes, software and culture; (iv) the MIGHTBE model, which describes how future processes and software will operate in the current culture.
these business processes; (iii) how these facets of corporate and national culture impact on the business and software solutions. The research also posits a method to exploit these models during ERP implementation. The implementation team will use current ERP modelling approaches to model the current and future business processes. It will then extend these models with features of corporate and national culture. The team will discover potential problems by analysing the differences between the two business process models, using theories of corporate and national culture to predict possible problems. The method will also deliver advice to the implementation team to resolve these problems through a set of Culture Impact Patterns. The outline method is shown in Figure 2. This paper reports research into the first stage of the method, to model the essential characteristics of corporate culture which impact on ERP package implementation.
[Figure 2 (not reproduced): the current and new business process and software solutions, the current corporate and national culture, the analysis that predicts possible problems, and the application of CIPs.]
Fig. 2: An outline of the extended ERP implementation method for culture-sensitive ERP implementation.
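To make the comparison of these models more tangible, the following minimal Python sketch (our own illustration, not part of the reported method or of any SAP tooling) represents the four models as labelled collections of descriptors and flags the three clash types by pairwise comparison of their culture views. All names, descriptors and the set-based representation are assumptions made purely for illustration.

```python
from dataclasses import dataclass

@dataclass
class OrganisationModel:
    """One of the ASIS / TOBE / ERP / MIGHTBE models: software, process and culture views."""
    name: str
    software: set
    processes: set
    culture: set   # surface descriptors of the prevailing culture

def culture_clash(a: OrganisationModel, b: OrganisationModel) -> set:
    """Descriptors present in one model's culture view but not in the other's."""
    return a.culture ^ b.culture   # symmetric difference

asis = OrganisationModel("ASIS", {"legacy ledger"}, {"manual invoicing"},
                         {"job-oriented", "loose control"})
tobe = OrganisationModel("TOBE", {"ERP client"}, {"online invoicing"},
                         {"results-oriented", "tight control"})
erp = OrganisationModel("ERP", {"ERP client"}, {"best-practice invoicing"},
                        {"process-oriented", "tight control"})
mightbe = OrganisationModel("MIGHTBE", tobe.software, tobe.processes, asis.culture)

# The three clash types shown in Figure 1:
print("1. current/new culture clash:", culture_clash(asis, tobe))
print("2. ERP/current culture clash:", culture_clash(erp, asis))
print("3. new process/old culture clash:", culture_clash(mightbe, tobe))
```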
The remainder of this paper is in 5 sections. Section 2 summarises existing social science theories of culture. Section 3 presents the first-draft model of culture. Section 4 describes the results from studies of the impact of culture on an ERP implementation at a UK customer. Section 5 outlines cultural impact patterns, their link to the model and how implementation teams might use them. The paper ends with a brief discussion and presentation of future research.
2. Social Science Research on Corporate and National Culture
Most social science research on culture can be divided into two camps [8]. The first camp claims that culture is something tacit which arises naturally. The other, more common view is that culture is something explicit, which arises from social interaction. Social science research also divides the study of culture into national cultures, business cultures and corporate cultures. A good starting point is Schein (1992), who provides the following definition of corporate culture: "a pattern of basic assumptions – invented, discovered or developed by a given group as it learns to cope with its problems of external adaptation and internal integration – that has worked well enough to be considered valid and therefore to be taught to new members as the correct way to perceive, think and feel in relation to those problems". He argues that corporate culture can be divided into three layers. In the outer layer there are values, which are written-down statements about the strategies, missions and objectives of the organisation. In the middle layer there are beliefs, which are the issues that the employees of an organisation talk about. In the inner layer there are the "taken for granted" assumptions, which are those aspects of organisational life which people find difficult to recall and explain. Schein also describes 10 dimensions that he uses to differentiate between corporate cultures in different organisations. These are the observed behavioural regularities of human interaction; the group norms; the espoused corporate values; the formal philosophy; the rules of the game; the climate; the habits of thinking, mental models and/or linguistic paradigms; their shared meanings; their embedded skills; and the organisation's root metaphors or integration symbols. These dimensions indicate important classes and attributes of culture to model in order to understand its impact on ERP implementation. Hofstede (1994) also investigated corporate and national culture. He argues that there are 4 manifestations of culture, and that the differences between national and corporate culture are due to their different uses. These manifestations are also placed in layers similar to the layers from Schein (1992). Hofstede differentiates between layers that have symbols, which represent the most superficial culture often described as practice, layers which have values, which represent the deepest manifestations of culture, and intermediate layers which describe heroes and rituals indicative of the corporate culture. He claims that national culture differences reside more in values and less in practices, and organisational culture differences reside more in practices
and less in values. Furthermore, he claims that we can detect national and corporate culture differences using a set of dimensions similar to those from Schein (1992). From extensive empirical studies Hofstede provides 4 dimensions which differentiate between national cultures: power distance, individualism-collectivism, masculinity-femininity, and uncertainty avoidance [11]. Likewise, he detects 6 dimensions to differentiate between organisational cultures: process- versus results-oriented; employee- versus job-oriented; parochial- versus professional-dependent; open versus closed systems of communication; loose versus tight control; and normative versus pragmatic organisations [11]. As with Schein's findings, Hofstede's dimensions indicate important elements of culture which we need to model in order to understand the impact on ERP implementation. Trompenaars (1994) argues that national culture can be described with 3 layers similar to those from Hofstede. A central theme of Trompenaars' argument is that people organise themselves in such a way as to increase the effectiveness of their problem-solving processes, and so have formed different sets of implicit logical assumptions to enable this to happen. Each culture distinguishes itself from others in terms of its solutions to these problems. These problems can also be classified to differentiate between national cultures, in a similar way to Hofstede's dimensions. These classes are how people relate to each other (sub-divided into universalism versus particularism, individualism versus collectivism, neutral versus emotional, specific versus diffuse, achievement versus ascription), people's attitudes to time, and people's attitudes to the environment [21]. Again, these classes provide a basis for modelling the critical determinants of corporate and national cultures. Our review of the key social science research into culture summarised here reveals 4 conclusions, which we might be able to exploit to improve ERP implementation:
• theories of corporate and national culture have similar definitions of culture and share important concepts which include values, beliefs and norms;
• these theories distinguish between the deep manifestations and the superficial features of corporate and of national culture;
• critical determinants of a corporate culture reside more in observable practices, whereas critical determinants of a national culture reside more in the nation's deeper set of values;
• corporate and national culture can be described using multiple dimensions which give us a set of overlapping facets with which to describe aspects of culture.
Our wider research programme uses these conclusions to posit 4 key research hypotheses which drive the programme. First, it posits that corporate and national culture has a direct impact on the ERP implementation problems which customer organisations experience during implementation. Second, it posits that social science theories can describe the problems observed in current ERP implementations. Third, it posits that a synthesis of social science theories of culture can be applied to explain culture-related problems which arise during ERP implementations. Fourth, the programme posits that these theories can be applied to predict such problems in future ERP implementations. In the next section we report research which explores the first and second hypotheses. We apply social science research to develop a first-draft
model of corporate and national culture which can be applied to describe culture-related problems during ERP implementations. It is the main focus of this paper.
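As a simple, hypothetical illustration of how such overlapping dimensions could be recorded and compared (this encoding is ours and is not taken from Hofstede, Schein or Trompenaars), the sketch below scores Hofstede's six organisational-culture dimensions on a 0 to 1 scale and reports where two corporate cultures differ most; the scale and the example scores are assumptions.

```python
# Hypothetical encoding of Hofstede's six organisational-culture dimensions;
# 0 and 1 stand for the two poles of each dimension.
DIMENSIONS = [
    "process_vs_results", "employee_vs_job", "parochial_vs_professional",
    "open_vs_closed", "loose_vs_tight", "normative_vs_pragmatic",
]

def culture_profile(**scores):
    """Build a profile, defaulting unspecified dimensions to the midpoint 0.5."""
    unknown = set(scores) - set(DIMENSIONS)
    if unknown:
        raise ValueError(f"unknown dimensions: {unknown}")
    return {dim: float(scores.get(dim, 0.5)) for dim in DIMENSIONS}

def largest_differences(a, b, top=3):
    """Dimensions on which two corporate cultures differ most."""
    gaps = {dim: abs(a[dim] - b[dim]) for dim in DIMENSIONS}
    return sorted(gaps.items(), key=lambda item: item[1], reverse=True)[:top]

customer = culture_profile(process_vs_results=0.2, loose_vs_tight=0.3)
supplier = culture_profile(process_vs_results=0.9, loose_vs_tight=0.8)
print(largest_differences(customer, supplier))
```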
3. A Model of Culture for ERP Implementation
The enhanced ERP implementation method outlined in section 1 integrates current business process approaches with models of corporate and national culture to discover potential problems, using predictions derived from social science theories of corporate and national culture. The basis of the model is a knowledge meta-schema for modelling the critical elements of corporate and national culture. This, in itself, is an innovative advance. Social science does not have a tradition of conceptual modelling to describe and analyse systems. Indeed, non-computer science disciplines resist conceptual modelling because it is too difficult to capture and describe the knowledge without losing its essential context. However, the common definitions of culture from social science researchers offer exciting opportunities. We have synthesised and extended these social science theories to model the problems observed in current ERP implementations. The first stage of the research was to design a knowledge meta-schema capable of representing both the deeper manifestations and the superficial attributes of corporate and national culture, and the critical causal associations between them. The knowledge meta-schema also integrates these attributes with standard business process models such as SAP's EPC models, so that the knowledge meta-schema can be used in methods such as SAP's ASAP method. For readers unfamiliar with these models, a segment of an EPC model for invoice processing taken from version 4.5 of SAP's R/3 includes events, processes and event flows for business processes. The full first-draft knowledge meta-schema is shown in Figure 3 below. Space does not allow us to give a full definition of the knowledge meta-schema. Instead we provide key definitions, drawn first of all from the software and business process models common in current ERP implementation methods such as ASAP. An agent is a type of object which processes actions [18]. Agents perform actions to achieve goals. With respect to culture, agents have beliefs, values and norms that govern their actions. One instance of an agent can be one individual person, a collection of people, one machine or a collection of machines. An action is the process of doing something with the intention of achieving a desired goal [17]. Actions are constrained by pre-conditions and post-conditions. Pre- and post-conditions are conditions which must occur for an action to begin or end. An event is a moment in time when something happens. In the knowledge meta-schema events start and end actions. An object is something which is manipulated for the attainment of a goal. An object can be a physical object (e.g. a radio), an infological object (e.g. information about an incident) or an object with both physical and infological elements (e.g. an incident report). A goal is a high-level objective that the system should meet. Goals are achieved by actions performed by agents manipulating objects [5]. These concepts are common in most current ERP business modelling approaches
[Figure 3 (not reproduced): the meta-schema concepts (Goal, Object, Event, Action, Pre- and Postconditions, Agent, Role, Responsibility, Hero, Social interaction, Social environment, Hidden assumptions, Beliefs, Values, Norms, Customs, Rituals, Myths, Scenarios, Symbols and Style) connected by associations such as 'influences', 'reflects', 'determines' and 'has'.]
Fig. 3: The knowledge meta-schema that describes the elements of corporate and national culture which impact on ERP implementation.
and represent many of the more observable indicators of corporate culture according to Hofstede (1994). The rest of the knowledge meta-schema is drawn from social science research into culture which is summarised in the previous section. All are critical determinants of corporate and national culture according to the theories reported in the previous section. A role defines the obligations of an agent. An agent can fulfil one or more role types. Responsibilities define the liabilities of the agents associated with the role they perform. Agents are responsible for fulfilling a role and initiating, controlling or undertaking actions related to their role [17]. A hero is a human agent admired by other agents in the organisation [6]. Heroes are agents that, through their role and responsibilities, undertake actions which reflect the organisational beliefs, norms and values. A social interaction describes an interaction between agents. It is a specialisation of an action. Social interaction can influence agents and their beliefs. A belief is an agent’s feeling that something is true, or good or exists [11,20]. Beliefs influence agents and their actions, values and norms, and the customs and rituals adopted within an organisation [8]. Each belief is influenced and formed by hidden assumptions. A value is a standard, a principle or an ideal about the worth or importance of something. Each value can exist on spectrums such as good versus evil and normal versus abnormal [11,20]. Each value also reflects a belief and so influences agents and their actions. A norm is a standard expectation of what is normal and what is right or wrong [11,21]. A norm influences values which reflect beliefs and, in turn, influence agents and actions.
A hidden assumption is an implicit assumption about the rationale for something. It reveals important information about the source and the nature of business processes and the organisation which, in turn, form the organisational beliefs [21]. A scenario is a sequence of events which start and end actions, which describe current or future business processes and/or ERP software use [17]. Scenarios can describe norms, and can be embellished with contextual information to create stories. A myth is a scenario, either factual or invented, which encapsulates the organisation's and/or agent's beliefs, norms and values [12]. A custom is an established behaviour expressed as actions, codes or rules of behaviour [6]. Customs are created and influenced by norms, values and beliefs, and influence behaviour in the form of actions and events. A ritual is a repeated action or scenario that expresses the goals and values of the organisation and dictates behaviour in the form of actions [11,6]. Symbols are objects explicit to people outside the organisation, such as buildings and logos, which are manifestations of the organisation's hidden assumptions, beliefs, norms and values [11]. Finally, the style is the way people deal with other people within an organisation, such as the way they talk to each other. Symbols form the style of the organisation [19]. The knowledge meta-schema defines the elements which define corporate and national culture and the critical associations between them needed to link the surface and deeper manifestations of the culture. The next section reports initial findings to validate the knowledge meta-schema's effectiveness for describing these critical elements of culture which influence ERP implementations.
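One possible object-oriented reading of a subset of the meta-schema is sketched below in Python. It is our own simplification for illustration only (the paper does not prescribe an implementation), and every class and attribute name is an assumption; the remaining concepts (Goal, Object, Event, Hero, Custom, Ritual, Symbol, and so on) would be modelled analogously.

```python
from dataclasses import dataclass, field

@dataclass
class Belief:
    statement: str                 # the agent's feeling that something is true, good or exists

@dataclass
class Value:
    statement: str
    reflects: Belief               # each value reflects a belief

@dataclass
class Norm:
    expectation: str               # a standard expectation of what is normal or right
    determines: Value              # norms influence values, which reflect beliefs

@dataclass
class Action:
    name: str
    preconditions: list = field(default_factory=list)
    postconditions: list = field(default_factory=list)

@dataclass
class Agent:
    """A person, group or machine that performs actions to achieve goals and
    that holds the beliefs, values and norms governing those actions."""
    name: str
    roles: list = field(default_factory=list)            # obligations of the agent
    responsibilities: list = field(default_factory=list)
    beliefs: list = field(default_factory=list)
    values: list = field(default_factory=list)
    norms: list = field(default_factory=list)
    performs: list = field(default_factory=list)          # actions
```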
4. Validating the Knowledge Meta-schema: The Impact of Culture on an ERP Implementation
In order to validate whether the knowledge meta-schema can describe critical elements of culture, we undertook field studies in a UK higher education establishment whose finance department had recently implemented SAP's R/3. We undertook two interviews each with three different stakeholders of the implemented ERP package. In the first interview, each stakeholder was asked to describe his/her job, department, background information and problems which had arisen due to the implementation. In the second interview, the stakeholders were asked questions to elicit evidence about the corporate culture of the institution. A prepared set of questions was asked to elicit underlying causes, including possible cultural causes, for those problems mentioned in the first interview. The questioning method was derived from a synthesis of culture dimensions undertaken by the authors, based on dimensions from Hofstede, Schein and Trompenaars reported in section 2. These dimensions enable us to describe and compare critical manifestations of corporate and national culture. The application of a "neutral" method from social science research counters claims that the studies "found what they were looking for" in terms of the posited knowledge meta-schema.
1. How long have you been working in the department?
2. What is your position in the organisation?
3. How well do you believe you understand the culture of the university and department?
4. Do you have any contact with other departments?
5. Do you believe that you fit the university or the departmental culture?
6. If so, how have your roles and responsibilities changed as a result of the new system?
7. Were partner consultants involved with you during the new system implementation?
8. How has your work changed as a result of the new system?
9. What problems have arisen as a result of the new system?
10. Have these problems impacted on how work is done? How?
11. Have there been any security implications from these problems?
12. Do you and your colleagues talk about these problems?
13. What do you see as the source of these problems and why?
14. Have problems arisen from poor implementation of the new system?
15. Have these problems been supported by high-level management?
16. Were you happier with the old system?
17. What was your first impression of the new system, and has it changed?
18. Have problems experienced by you and your colleagues led to stress?
19. How would you characterise the general climate prevailing in the department now, after the ERP implementation?
Fig. 4: Questions asked during the first interview.
4.1. The First Interviews
The first interview elicited information about stakeholders, the corporate culture, the ERP implementation and the problems or issues that it created. The interviewer asked open-ended questions followed by more specific ones to elicit reasons and rationale about the culture and the ERP implementation-related problems. This provided data about the current system, the ERP implementation, the problems which arose during the implementation and the climate resulting from the implementation. A sample set of questions is shown in Figure 4. All interviewees responded to these questions.
4.2. The Second Interviews
The purpose of the second interview was to elicit surface manifestations of corporate culture, then use a variation of the laddering technique from knowledge engineering [16] to elicit deeper determinants of culture which were possible causes of these surface manifestations and of the problems reported in the first interview. The elements of culture found on the outer layers of the social science models are the observable manifestations of culture. Elements of culture found in the inner layer are tacit, more important determinants of national and, to a lesser degree, corporate culture. Our questions were designed to elicit the observable manifestations of culture described on the outer layers, then use precise verbal probes to elicit the tacit rationale for these manifestations. This use of external manifestations of culture to elicit deeper underlying causes was the principal reason for two interviews rather than one.
Fig. 5: A synthesis of the layers of the elements of culture taken from social science research to provide a basis for the questioning method.
The specific questioning approach was based on a synthesis of these layers of culture from social science research, which is shown in Figure 5. Questions asked to elicit the external manifestations of the culture included: Is management more concerned with getting work done than with the people who do the work? Is good work rewarded? Are there "rules" that a newcomer must learn to be accepted in the department? The laddering technique was then applied to "drill down" through the layers of culture shown in Figure 5 through repeated asking of "why?" and "how?" questions. All interview data was analysed to determine key culture-related phenomena relevant to the ERP implementation. Relevant segments were transcribed and analysed by the authors. Each of these segments was then analysed further to determine critical elements of culture and causal associations between them, using the knowledge meta-schema definitions outlined in section 3 and the detailed questions asked in the second interview.
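A minimal sketch of the laddering idea is given below, assuming an interactive console session; the layer names, prompts and stopping rule are illustrative assumptions rather than the exact protocol used in the study.

```python
# Illustrative layers, from surface manifestations down to deeper determinants.
LAYERS = ["observable practices", "rituals and heroes", "beliefs and norms", "values"]

def ladder(surface_statement):
    """Drill down from an observable manifestation of culture towards deeper
    determinants by repeatedly asking 'why?' and recording the answers."""
    trail = [surface_statement]
    for layer in LAYERS[1:]:
        reply = input(f"Why is that? (probing: {layer}) ").strip()
        if not reply:          # the interviewee cannot go any deeper
            break
        trail.append(reply)
    return trail

# Example use (interactive):
# trail = ladder("Management is more concerned with the work than with the people doing it")
```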
4.3. Preliminary Results
Results from the first interview reveal that most ERP implementation problems were related to procedures, workloads and the lack of training made available to the end-users of the new system. The interviewees offered numerous opinions about why this was. Reasons included the lack of access to official training courses and the time to go on the training courses. This data provided the basis for the second interviews. The results from the second interviews were more complex. We are still completing a full analysis of the data to determine the degree of fit between the elicited data and the posited knowledge meta-schema. This paper samples this data, and presents 3 result segments which demonstrate the patterns detected in the results so far. The first transcript segment reveals the problem of an uneven allocation of extra work resulting from the implementation of the ERP system. The accounts services manager believed that his work would be easier and more fun because of the implementation of R/3. He believed that he would learn a new system and make his work better and more efficient.
[Figure 6 (not reproduced): the Agent 'Accounts Services Manager' with the Norm 'new system will work and be fun to use', the Value 'irrational to work more and have no fun', the Belief 'not good that work increases and is no fun', and the influenced Action 'analyse online enquiries'.]
Fig. 6: A partial instantiation of the knowledge meta-schema using data from the second interview with the accounts services manager.
However, after implementation, work such as analysing online enquiries increased and became more complex, which he believed was irrational. The increased work had a direct effect on his satisfaction and motivation to complete the work, which led to more mistakes. The definitions of culture were applied to the transcript data to determine elements of culture, and then a simple model was developed. This instantiation of the knowledge meta-schema to model the elements of culture identifiable from the data is shown in Figure 6. The application of the knowledge meta-schema identifies the importance of the agent's norms, values and beliefs on problems arising from a process (shown as an action in the meta-schema). The critical element of this model is the violation of the agent's expectation expressed as a norm, which in turn reflects the agent's beliefs, which influence the agent's behaviour in the process. This model-driven analysis of the problem is a richer explanation of the observed problem than just saying that the agent's increased workload impacts on the analysis of online enquiries. Indeed, the identification of an important agent norm enables us to explain and predict other possible problems with other processes undertaken by the same agent or similar agents if the norm does not occur. In short, the use of the knowledge meta-schema enables us to identify, describe and model more fundamental drivers which influence ERP implementations. In another transcript segment, the accounts assistant believed that the implementation of, and training on, the new system that he would receive would enable him to do his work faster and more accurately. However, he believed that the training received on the new system was inadequate. This individual valued high-quality training to use such a new system successfully. Instead the lack of training increased the number of mistakes and he believed that he was unable to perform his roles and responsibilities effectively. This was also a view held by other colleagues in the department. Again the definitions of culture were applied to the transcript data to determine elements of culture, and a simple model which instantiates the knowledge meta-schema was developed. This model is shown in Figure 7. Again the knowledge meta-schema highlights the influence of the agent's norms, values and beliefs on a process. The accounts assistant values training and expects training to be available for the new system, but he believes that the insufficient training influences the performance of processes such as issuing invoices.
[Figure 7 (not reproduced): the Agent 'Accounts Assistant' with the Norm 'training will be available for new systems', the Value 'training is good', the Belief 'not good that had insufficient training', and the influenced Action 'issue an invoice'.]
Fig. 7: A partial instantiation of the knowledge meta-schema using data from the second interview with the accounts assistant.
Again, the critical element of this model is the violation of the agent's expectation expressed as a norm (training will be available), which in turn reflects the agent's beliefs, which influence the agent's behaviour in the process. As with the first model, this model-driven analysis provides us with a richer explanation of the observed problem than a simple statement of inadequate training. Indeed, the identification of an important agent norm again enables us to explain and predict other possible problems with other processes undertaken by the same agent or similar agents if the norm does not occur. In a third transcript segment the accounts payable manager believed that staff levels should match the new work levels across the department. However, the introduction of the new system meant that he had to import information from the old system into the new system on top of his other work activities, whereas other sections in the department received extra staff to manage the implementation of the new system. The accounts payable manager felt that this was unfair. His increased workload led to less interest, less motivation and more mistakes. The model derived from this segment of transcript data is shown in Figure 8.
[Figure 8 (not reproduced): the Agent 'Accounts Payable Manager' with the Norm 'staff increases will match work increases', the Value 'irrational to increase work but not staff', the Belief 'not good that work increases but not staff', and the influenced Action 'importing information into the new system'.]
Fig. 8: A partial instantiation of the knowledge meta-schema using data from the second interview with the accounts payable manager.
This model reveals an emerging pattern in agents' norms, values and beliefs relevant to this ERP implementation. The accounts payable manager expected staff levels to match work levels resulting from the new system, and believes that it is not
good if this does not happen, which in turn influenced his performance on business processes. The critical element of the model is the violation of the agent's norm, which determines values and beliefs which influence his actions. Again the model-driven analysis provides us with a richer explanation of the observed problem. The identification of another important agent norm again enables us to explain and predict other possible problems with other processes undertaken by the same agent or similar agents if the norm does not occur.
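The recurring structure of the three models can be written down compactly; the sketch below re-encodes the data from Figure 6 as a simple dictionary (our own simplification of the meta-schema, not the authors' tooling) and prints the shared pattern of a violated norm propagating, via values and beliefs, into the agent's behaviour on a business process. The 'observed' text is an invented paraphrase.

```python
# Data taken from Figure 6 (accounts services manager), encoded as a flat dictionary.
case_accounts_services_manager = {
    "agent":  "Accounts Services Manager",
    "norm":   "New system will work and be fun to use",
    "value":  "Irrational to work more and have no fun",
    "belief": "Not good that work increases and is no fun",
    "action": "Analyse online enquiries",
}

def norm_violation_report(case, observed):
    """Summarise the pattern shared by Figures 6, 7 and 8: a violated norm,
    reflected in beliefs and values, influencing the agent's action."""
    return (f"Agent '{case['agent']}' expected: '{case['norm']}'. "
            f"Observed instead: {observed}. "
            f"This reflects the belief '{case['belief']}' and is likely to "
            f"influence the action '{case['action']}'.")

print(norm_violation_report(case_accounts_services_manager,
                            "work increased and became more complex"))
```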
4.4. Preliminary Conclusions
Preliminary evidence from the studies reported here and elsewhere reveals 5 conclusions regarding the impact of the corporate culture on ERP implementation:
• social science techniques are effective for eliciting culture-related data about ERP implementations, albeit after the implementation has taken place;
• the posited knowledge meta-schema can be applied to model this data;
• the application of the knowledge meta-schema introduces more precision to the description of ERP implementation problems;
• the knowledge meta-schema can be applied to model critical elements of culture which influence implementation problems: in the 3 models shown the meta-schema was applied to link the violation of an agent's norm or expectation to observable problems linked to business processes;
• these implementation problems are modelled as complex patterns of knowledge about agents, norms, values, beliefs and actions, as shown in Figures 6, 7 and 8.
The posited ERP implementation method will only be effective if it offers consultants and customers sufficient leverage over the implementation problems. The main benefit of the knowledge meta-schema shown in these preliminary results is that the meta-schema enables us to model more important but deeper influences which can affect all business processes which agents are involved in. The final section explores one solution for exploiting this leverage which builds on the fifth conclusion, that implementation problems are modelled as complex patterns of knowledge about agents, norms, values, beliefs and actions.
5. Delivering Benefits: Cultural Impact Patterns
One criticism of existing management science theories of culture is their lack of application to improve organisational change. We aim to deliver this application through patterns which describe culture-related impacts. A pattern is an abstraction of a problem, a solution to the problem AND the context(s) in which this solution is applicable [9]. Patterns have come to IT design from architectural design. Alexander (1977) posits that pattern languages are collections of patterns used by a process to generate artefacts that are complexes of patterns.
Pattern detection and application necessitates a more formal expression of the problem in context. This is where the knowledge meta-schema comes in. The models which instantiate the knowledge meta-schema, shown in Figures 6, 7 and 8, enable us to model this problem and context so that solutions to improve ERP implementations can be applied to the right problem at the right time. However, the successful identification and application of these patterns remains research conjecture which we will examine later in the research programme.
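Purely as a speculative sketch of what such a pattern might look like when expressed over these models (the paper explicitly leaves the identification and application of the patterns to future research), the code below pairs a context predicate with advice; the pattern name, predicate and advice text are invented for illustration.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class CultureImpactPattern:
    name: str
    problem: str                                    # abstraction of the problem
    applies_to: Callable[[dict], bool]              # context in which the solution is applicable
    advice: str                                     # proposed solution

training_gap = CultureImpactPattern(
    name="Violated training expectation",
    problem="Agents expect training for a new system but receive too little",
    applies_to=lambda case: "training" in case.get("norm", "").lower(),
    advice="Schedule role-specific training before go-live and budget time for it",
)

case = {"agent": "Accounts Assistant",
        "norm": "Training will be available for new systems",
        "action": "Issue an invoice"}

for pattern in (training_gap,):
    if pattern.applies_to(case):
        print(f"{pattern.name}: {pattern.advice}")
```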
6. Conclusions and Future Work
This paper reports the results of preliminary research into the implementation of ERP packages in organisations with diverse corporate and national cultures. It exploits social science theories [11,21] to posit a model of corporate and national culture characteristics which, we hypothesise, impact on ERP implementations. It describes a knowledge meta-schema with which to model these characteristics. Post-implementation studies of an ERP implementation in a higher education institution were undertaken to provide data to validate the knowledge meta-schema's power of description. Preliminary findings are positive: the knowledge meta-schema was applied to detect the importance of deviations from agents' expected norms which influence how the agent undertakes business processes. The elicitation of these cultural norms is an example of how the identification of the deeper cultural characteristics can offer greater leverage to customers and consultants during ERP implementation. The results provide the foundation for accepting the first and second research hypotheses of this research programme stated in the final paragraph of section 2. However, the data reported here only provide weak validation of parts of the knowledge meta-schema and no validation at all of social science theories to explain and predict the impact of different corporate and national cultures on ERP implementations. Instead, the reported scientific results raise our awareness of the potential importance of culture on ERP implementation, so far only hinted at anecdotally in Gulla & Mollan (1999) and Densley (1999). The paper also advocates the possible use of patterns to detect and resolve common, recurring problems during ERP implementations. To provide more complete validation the authors are currently undertaking on-site studies of the implementation of several ERP packages in the same organisations across Europe. Data elicited from these studies will provide more extensive validation of the knowledge meta-schema and, more importantly, of predictions from social science theories which will deliver real benefits to the proposed ERP implementation method. We look forward to reporting these more complete results when they are available.
References
1. Alexander C., 1977, 'A Pattern Language', Oxford University Press.
2. Anton A. & Potts C., 1998, 'The Use of Goals to Surface Requirements for Evolving Systems', Proceedings 19th IEEE International Conference on Software Engineering, IEEE Computer Society Press.
3. Cartwright S. & Cooper G.L., 1996, 'Managing Mergers, Acquisitions and Strategic Alliances: Integrating People and Cultures', Butterworth-Heinemann, Oxford.
4. Curran T.A. & Ladd A., 1999, 'SAP R/3 Business Blueprint', Prentice-Hall.
5. Darimont R. & Van Lamsweerde A., 199*, 'Formal Refinement Patterns for Goal-Driven Requirements Elaboration', Proceedings 4th ACM Symposium on Foundations of Software Engineering, ACM Press, 179-190.
6. Deal T. & Kennedy A., 1982, 'Corporate Cultures: The Rites and Rituals of Corporate Life', Penguin Books, London.
7. Densley B., 1999, 'The Magnificent Seven: Getting the Biggest Bang from the ERP Buck', Proceedings 1st Intl Workshop EMRPS99, eds J. Eder, N. Maiden & M. Missikoff, Istituto di Analisi dei Sistemi ed Informatica, CNR Roma, 59-65.
8. Dobson P., Williams A. & Walters M., 1993, 'Changing Culture: New Organisational Approaches', Institute of Personnel Management, London, 2nd edition.
9. Gabriel R., 1995, 'Patterns of Software', Oxford University Press.
10. Gulla J.A. & Mollan R., 1999, 'Implementing SAP R/3 in a Multi-Cultural Organisation', Proceedings 1st Intl Workshop EMRPS99, eds J. Eder, N. Maiden & M. Missikoff, Istituto di Analisi dei Sistemi ed Informatica, CNR Roma, 127-134.
11. Hofstede G., 1990 & 1994, 'Cultures and Organisations: Software of the Mind. Intercultural Co-operation and Its Importance for Survival', Harper Collins, London.
12. Johnson G., 1992, 'Managing Strategic Change: Strategy, Culture and Action', Long Range Planning 25(1), pp??
13. Johnson G. & Scholes K., 1997, 'Exploring Corporate Strategy: Text and Cases', Prentice Hall Europe.
14. Kotter J.P. & Heskett J.L., 1992, 'Corporate Culture and Performance', The Free Press, New York, USA.
15. Maiden N.A.M., Minocha S. & Manning K., 1996, 'Socio-Technical System Scenarios: A Template-based Approach', Technical Report, Centre for HCI Design, City University, London.
16. Maiden N.A.M. & Rugg G., 1996, 'ACRE: A Framework for Acquiring Requirements', Software Engineering Journal 11(1).
17. Maiden N.A.M., 1998, 'SAVRE: Scenarios for Acquiring & Validating Requirements', Journal of Automated Software Engineering 5, 419-446.
18. Ncube C. & Maiden N.A.M., 1999, 'Guidance for Parallel Requirements Acquisition and COTS Software Selection', Proceedings 4th IEEE Symposium on Requirements Engineering, IEEE Computer Society Press, xx-xx.
19. Peters & Waterman, 1982, 'In Search of Excellence', Harper & Row, New York.
20. Schein E.H., 1992, 'Organisational Culture and Leadership', Jossey-Bass Publishers, San Francisco.
21. Trompenaars F., 1994, 'Riding the Waves of Culture: Understanding Cultural Diversity in Business', Nicholas Brealey Publishing, London.
22. Vernon M., 1999, 'ERP endangered species?', Computer Weekly, 4/11/99, p.32.
Capture and Dissemination of Experience about the Construction of Engineering Processes
Christian Rupprecht, Martin Fünffinger, Holger Knublauch, Thomas Rose
Research Institute for Applied Knowledge Processing (FAW), Helmholtzstraße 16, D-89081 Ulm, Germany
{rupprech, fuenffin, knublauc, rose}@faw.uni-ulm.de
Abstract. Process know-how is instrumental to govern engineering processes in a network of engineering departments as well as to migrate process changes due to emerging technological or other opportunities. In this paper we present a process construction kit for the capture and dissemination of process knowledge at the level of process structures as well as process construction experience. Our approach ranges from the formal capture and maintenance of know-how about processes to the implementation of process adaptations in terms of object-oriented concepts. Requirements are drawn from specific reference applications in the automotive engineering and the plant construction domain.
1 Introduction
For the past years, process management has been characterised by terms such as business process re-engineering [10,13], workflow support [11], or even process innovation [3]. The prime focus has been the (re-)design of process flows with respect to efficiency, quality or customer satisfaction. That is, optimisation of enterprise resources has been the focal point. Only recently has attention begun to shift from an enterprise resource planning stance towards a business enabling stance to generate expertise. Rather than optimising processes, the prime goal is to generate business opportunities. Process knowledge, be it at the level of process structures or experience about process construction, emerges as a critical factor for an enterprise's performance. As a matter of fact, many aspects of process experience that used to be implicit in the organisational culture need to be made explicit and even transferable. The prime objective of our research is to elevate the capability to capture and disseminate experience in the construction of engineering processes. The question arises of how to "construct" processes in light of best practice and technological opportunities, e.g. employing simulation methods instead of crash tests with prototypes. In our approach, the construction of a process refers to the intellectual task of specifying the activities and their relationships. This task is further compounded by the nature of engineering processes, which are predominantly characterised by attributes such as innovative, individual, dynamic, interdisciplinary, strongly interrelated, strongly parallel, iterative, communication intensive, anticipatory, planning intensive, uncertain, risky, etc. [18]. Modelling such processes is necessary for reasons of transparency, documentation, execution, communication, co-ordination, co-operation, planning, what-if analysis, etc.
Original services of a pro-active process construction approach should include:
• initial construction of processes based on best practice, technological opportunities, and quality guidelines;
• on-the-fly adaptations along the dimensions of extension, refinement, individualisation, and configuration;
• online alarms during execution.
Our approach is founded on a process construction kit. The process construction kit consists of a repository for managing basic building blocks and structures of processes, and an array of operators for adapting pieces of processes as well as orchestrating processes from smaller pieces. Based on experiences in two large European companies in the automotive and utility plant construction sectors, we defined a process engineering environment for the construction, assessment, synchronisation, and individualisation of processes [21,22]. The specific intention is to provide dedicated support to process engineers who themselves are experts in the engineering domain, i.e. there are no third-party champions involved that formally represent and adapt processes following a sequence of interviews. The process construction kit aims at the conceptual design of processes in terms of ontologies while at the same time offering automated support for individualising processes. This paper is organised as follows. In section 2, we elaborate the essentials of processes, process models, and their use with regard to process construction. Section 3 presents our approach for managing process experience at a conceptual and system level. The main objective is to establish a comprehensive framework for coping with developing experience in process construction, ranging from a conceptual framework "down to" an ontology-based formal representation of process construction methods. Related approaches are discussed in section 4, while future extensions of the overall framework are discussed in section 5.
2 Processware
2.1 Processes
In the field of process management, many definitions of the term process can be found, each carrying a different focus. In order to gain a common understanding, for this paper, of what a process is, we define a process as a set of temporally or logically ordered activities intended to reach a goal involving resources. In ordinary usage, a task is often synonymous with a process. In this paper, we understand a task as the definition of a goal together with the information about data, objects, resources, rules and other constraints needed for achieving this goal, whereas working on the task is called a process. Activities have no externally visible substructure and they are performed by agents that can be humans or machines (cf. [12]). The order of activities can be derived either from their temporal constraints or from the logical relations among each other. Temporal constraints can be specified, e.g. by assigning starting and ending times to activities. In this case, a temporal order of activities in terms of a time schedule (e.g. a Gantt chart) can be derived without
knowing anything about logical dependencies. Logical relations between activities capture knowledge about the sequence of activities on a more generic level, independent of a specific time schedule. In both cases, the result is the ordered sequence of activities, which emphasises the dynamic character of processes.
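The two ways of deriving an order can be illustrated with a short Python sketch, under the assumption that activities carry either start times or explicit logical predecessors; the activity names are invented.

```python
from __future__ import annotations
from dataclasses import dataclass, field
from graphlib import TopologicalSorter

@dataclass
class Activity:
    name: str
    start: float | None = None                          # temporal constraint (e.g. hours from project start)
    depends_on: set[str] = field(default_factory=set)   # logical relations to other activities

def temporal_order(activities: list[Activity]) -> list[str]:
    """Order by assigned starting time, as in a Gantt chart."""
    timed = [a for a in activities if a.start is not None]
    return [a.name for a in sorted(timed, key=lambda a: a.start)]

def logical_order(activities: list[Activity]) -> list[str]:
    """Order derived from logical dependencies, independent of a time schedule."""
    graph = {a.name: a.depends_on for a in activities}
    return list(TopologicalSorter(graph).static_order())

acts = [Activity("design", start=0.0),
        Activity("build", start=10.0, depends_on={"design"}),
        Activity("test", start=20.0, depends_on={"build"})]
assert temporal_order(acts) == logical_order(acts) == ["design", "build", "test"]
```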
2.2 Process Models
A process can be regarded as a system where the elements are activities and resources and the relations are the sequential or logical dependencies between those elements. The set of relations describes the process structure. Unlike many other definitions, we understand the design of a process model as a constructive process rather than a mere mapping of a process structure given in reality. A process structure is not per se existent in reality and cannot be perceived as such; it is created by subjective interpretation of the real world and mental construction out of a world of experience and imaginations [23]. Regarding a process as a system already means creating a mental model of the original process. An original process does not necessarily have to be a "real" process that has occurred in the past or is observed in the present, but it can also be a potential solution of how things could be done in the future. Thus, we define a process model as a mental or explicit representation of original processes. In general, directed graphs are used for the explicit representation of processes. When we speak of process models in this paper, we mean semi-formal, computational representations in symbolic notation, i.e. general process elements like activities and their relations are represented by formal symbols (boxes and vectors) and additional information is attached non-formally, e.g. by naming the symbols in natural language. Practically all process representation techniques use the notion of decomposition [16]: process models can be decomposed into different sub-processes, which again can be made up of other sub-processes. Sub-processes on the lowest level of decomposition are called activities. The decomposition of processes results in an aggregation hierarchy. Every formal process representation technique (e.g. SADT diagrams, Petri nets, event-driven process chains, etc.) is based on a specific modelling method and a meta-model, which provides guidelines for the construction of process models. Depending on the intended goal of the representation form, other elements of interest besides activities and logical dependencies can be represented in a process model. Those most frequently mentioned are [2]:
• Agent – an actor (human or machine) who performs an activity;
• Role – a coherent set of activities assigned to an agent as a unit of functional responsibility;
• Artefact – the output of an activity.
In complex system development processes, most of the output is manifested in documents. As a matter of fact, documents constitute processes. Therefore, in our approach, we take into account the representation of documents as artefacts created or modified by the enactment of activities.
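A minimal sketch of the decomposition idea, assuming a simple recursive structure in Python; the class and field names are our own and only mirror the elements listed above.

```python
from dataclasses import dataclass, field

@dataclass
class ProcessModel:
    """A node in the aggregation hierarchy: a process is decomposed into
    sub-processes; nodes without a substructure are activities."""
    name: str
    agent: str = ""                                    # who performs the activity (human or machine)
    role: str = ""                                     # unit of functional responsibility
    artefacts: list = field(default_factory=list)      # e.g. documents produced by the activity
    subprocesses: list = field(default_factory=list)

    def is_activity(self) -> bool:
        return not self.subprocesses

    def activities(self) -> list:
        """The leaves of the aggregation hierarchy."""
        if self.is_activity():
            return [self]
        return [leaf for sub in self.subprocesses for leaf in sub.activities()]

invoicing = ProcessModel("invoice processing", subprocesses=[
    ProcessModel("check order", agent="clerk", role="accounts", artefacts=["checked order"]),
    ProcessModel("issue invoice", agent="clerk", role="accounts", artefacts=["invoice"]),
])
print([a.name for a in invoicing.activities()])        # ['check order', 'issue invoice']
```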
2.3 Use of Process Models
Process models are created for a specific purpose. Essentially, process modelling is the first step towards a formalisation of processes, which serves as the basic platform to build upon [15]. Fricke et al. put forward several reasons why modelling development processes is necessary and worth the effort [7]. Process modelling objectives can be classified into five basic categories [2]:
• Facilitate human understanding and communication – process transparency helps people (including the customer) to communicate on the work to be done and to understand what part they play in the game. Process models may serve as a prerequisite for audits and as an excellent learning aid for employees [18].
• Support process improvement – process improvements are based on process model assessments, be it in terms of formal reasoning (e.g. static or dynamic analyses) or visual assessment (cf. [19]).
• Support process management – a process model provides a sound basis for detailed planning and easier monitoring, measurement and co-ordination of an actual development project.
• Automate process guidance – the documentation of process structures enables the capturing and later reuse of process know-how. The objective is to provide "guidance, suggestions, and reference material to facilitate human performance of the intended process" [2].
• Automate process execution support – this aims at the derivation of data structures for the development or adaptation of information systems supporting the enactment of processes. Parts of the process can be automated, e.g. by distributing and supplying information and documents electronically with the help of a workflow management system [11].
In the following, we introduce an approach which basically follows the line of automated process guidance.
3 Approach
Process management constitutes a prime competence for value generation. Process models capture know-how about ways of working in the past or intended in the future. However, a static process model contains no information about why work had or has to be done in a certain way. Prevailing process models and supporting tools lack support for the adaptation of actual processes. That is, there are sophisticated design, analysis, and management methods and tools, but methods for customising processes in the design and even execution phase are sparse. With our approach, we aim at creating generic, reusable representations of process construction knowledge in terms of an ontology (cf. [6,8]) that can be applied across a variety of future process cases by means of a process construction kit.
3.1 Conceptual Framework
3.1.1 Process Individualisation
Due to the non-repetitive project character of complex system engineering processes, process planning can be performed at the modelling level of process instances. These process instances must be individually tailored according to the current project-specific context (requirements and constraints), in order to serve as a useful basis for process execution and automation at an operational level [22]. We call the process of adapting a process model according to a specific context process individualisation. It is necessary when generating new process instances at the beginning of a project as well as during project run-time, when constraints and requirements change. In general, instantiation and individualisation refer to two different dimensions of process modelling (figure 1). In the sense of object-oriented modelling, process models can be on different levels of information modelling, i.e. a meta model gives guidelines for creating a model on the type level, which can be instantiated for execution (cf. [12]). But this instantiation does not necessarily include the adaptation of the process model to a specific context. That is why the dimension of individualising process models can be regarded as orthogonal to the levels of information modelling. A meta model is always meant to give guidelines for many cases; thus, it does not make sense to develop a project-specific meta model.
[Figure 1 shows a matrix of information modelling levels (meta level, type level, instance level) against levels of individualisation (specific domain, e.g. a specific product; specific company, e.g. specific legal constraints; specific project, e.g. a specific customer). Vertical arrows denote instantiation, horizontal arrows denote individualisation, leading from a reference meta model via domain-, company- and project-specific process models to project-specific process model instances.]
Fig. 1. Levels of modelling and individualisation
3.1.2 Process Construction Kit
In our approach, the specific context of a process model is described explicitly by a model of constraints and requirements. Together with the corresponding individual
process instance, this model forms a process case (see figure 2). These process cases can be reused as best-practice patterns for other cases by copying them. The likelihood of a process case “fitting” a hundred percent onto another project-specific context without any manual adaptation is rather low. Therefore, reference process models are needed on a generic level, i.e. they are automatically adaptable to a specific context via defined operators. For example, reference process models (cf. [20]) can refer to a certain product type or a certain procedure (like ISO 9000 certification). New process instances can be generated by automated individualisation.
For complex and innovative engineering processes, it is not sufficient to document a best-practice process once and follow this pattern forever in all future process cases. Nor can a single, all-encompassing generic reference process be modelled that is adaptable to all future process cases. For reasons of higher reusability and flexible connectivity, an extensive process model can be stripped down to temporally and logically isolated units called process building blocks. Such building blocks should have only few interfaces to other process models or building blocks, and they should “know” how they may be connected to other building blocks. Similar to reference process models, process building blocks should be generic, in order to allow an automated individualisation and configuration of a new project-specific process instance. Generic reference process models comprise generic building blocks and are augmented with additional, non-generic process structures.
Constraints and requirements influence the design of processes. They can be used to describe the context of a process. Values of such constraints and requirements come from a set of discrete values (e.g. a product type or boolean values) or from a continuous interval (e.g. a numeric value for the project-specific order value). Constraints and requirements and their relationships are captured at a generic level in terms of an ontology, which provides guidance in modelling an actual context. Experience in process construction is broken down into single design decisions that are captured in terms of construction rules. A construction rule consists of a condition part, which refers to constraints, and an execution part, which triggers construction operators. Construction rules are represented as dependencies among constraints and process building blocks in the ontology. By applying these construction rules to a specific model of constraints and requirements, proposals for adaptations to the process model are generated. Documents and other resources are connected to the process models in terms of informational dependencies.
The knowledge base of the process construction kit is extendable, that is, users can define new constraints, requirements, reference process models, process building blocks, and construction rules at any time on a generic level and add them to the ontology. By collecting and maintaining such knowledge from different engineers, process modelling experience (i.e. the reasons for process design decisions in a specific context) can be captured and reused for future process modelling cases.
3.1.3 Example for a Process Individualisation
This approach allows one to tailor process models in accordance with project-specific requirements, which is particularly relevant for processes in the domain of complex system engineering [18].
Figure 3 demonstrates the basic principle of advisor-based guidance with an example of the project-specific adaptation of a plant construction
process model. It shows how an advisor derives the suggestion to instantiate, individualise and insert a generic process building block (“apply for export permission”) before another process building block (“decide: bid or no bid”) due to a specific legal constraint (“export certification: necessary”). In figure 3, dashed arrows represent dependencies and solid arrows represent guided system operations.
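The adaptation illustrated in figure 3 can be read as a single construction rule: a condition part over the constraint model and an execution part that triggers a construction operator. The following Python sketch is only our own illustration of that idea under stated assumptions; the names (ConstructionRule, insert_before, etc.) are hypothetical and are not taken from the authors' implementation.

```python
# Illustrative sketch only: a construction rule with a condition part over the
# constraint model and an execution part that triggers a construction operator.
# All names are hypothetical; they do not reflect the authors' actual system.

def insert_before(process, new_block, anchor_block):
    """Construction operator: insert a building block before an anchor block."""
    idx = process.index(anchor_block)
    process.insert(idx, new_block)

class ConstructionRule:
    def __init__(self, condition, action, explanation):
        self.condition = condition      # predicate over the constraint model
        self.action = action            # construction operator to trigger
        self.explanation = explanation  # textual advice shown by the advisor

    def apply(self, constraints, process):
        if self.condition(constraints):
            self.action(process)
            return self.explanation     # advisor proposes this adaptation
        return None

# Specific model of constraints and requirements (project-specific context).
constraints = {"country": "Argentina", "export certification": "necessary"}

# Project-specific process instance as an ordered list of building blocks.
process = ["prepare offer", "decide: bid or no bid", "negotiate contract"]

rule = ConstructionRule(
    condition=lambda c: c.get("export certification") == "necessary",
    action=lambda p: insert_before(p, "apply for export permission",
                                   "decide: bid or no bid"),
    explanation="Export certification is necessary for the chosen site location.",
)

print(rule.apply(constraints, process))
print(process)  # building block inserted before "decide: bid or no bid"
```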
[Figure 2 shows the process construction kit: a generic constraint ontology together with generic reference process models, generic process building blocks and construction rules; a specific model of constraints and requirements obtained by actualising the context; configuration and individualisation steps that yield an individual process instance; process cases that can be copied for reuse; and a document and resource repository (documents, organizational units, tools, products) linked to the process models via informational dependencies.]
Fig. 2. The Process construction kit
During process modelling, an engineer selects constraints from the constraint ontology. Each selection of, or later change to, a constraint may trigger an advisor, which highlights the parts of the process where potential alterations arise from the change, generates advice on how to carry out the alterations, and provides a textual explanation.
3.2 Technical Approach
3.2.1 Basic Concepts
Technically, we understand the construction of processes as the application of specific operators on a process model to transform that model into an adequate representation of a specific process. From this point of view, processes and activities are objects with certain attributes (e.g. title, status and duration) [1] and associated operators. The set of operators includes operators for instantiation, adaptation and configuration. These operators are context-sensitive, i.e. an operator takes the specific constraints of a process case into account. Objects, operators and constraints together form a network of interrelated concepts.
[Figure 3 shows how generic knowledge (the constraint ontology with, e.g., regulations by law, site location, country, export certification, and the generic process building blocks “apply for export permission” and “decide: bid or no bid”, related by “requires” and “comes before” dependencies) is applied to specific knowledge: setting the constraints country = Argentina and export certification = necessary in the specific model of constraints and requirements leads the advisor to instantiate, individualise and insert the building block “apply for export permission” into the specific process instance.]
Fig. 3. Basic principle of a process model adaptation
Operators may either be triggered by user interaction (e.g. a user inserts an activity into a process by applying the AddActivityOperator) or by the system itself, responding to a change in the context of a process. In the latter case, the system acts as an agent by proposing an adaptation of the process model which reflects the altered context. Some of the process adaptation steps may require additional input (e.g. the users responsible for an activity), which cannot be provided by the system alone. In this case, the system provides assistance to gather the necessary data in a formalised dialog with the user.
By associating applied operator instances with the affected object instances, it is possible to follow the history of changes carried out during the construction of a process case. Moreover, by associating operator instances with the constraint instances that led to their application, it is also possible to record why a change has occurred. In this way, the construction of an explicit process model is recorded in the form of a history of changes made to a process instance and the reasons that led to each change. Thoroughly analysing the history of process cases, be it by human effort or with computer support, may lead to the identification of new or more efficient construction rules.
Dependencies between constraints, operators, and process objects are stored on a generic level in an ontology. The structure of generic reference process models and process building blocks is recorded in a construction plan, which is made up of a collection of operators. Operators in turn may also trigger other operators. By recursively traversing and applying the network of operators that describes generic process models, one ends up at basic operators like createActivity or addActivityToProcess. By these means, the system is capable of applying the explicit knowledge about process construction in terms of operators and of generating proposals for the adaptation of the process model which are suitable for the current situation. Process cases, the
generic process models, constraints and operators together form a knowledge base, which is used both for the construction of new cases and for the derivation of new generic process models, constraints and operators. In the following, we focus on applying constraints and operators for the construction of process models.
3.2.2 Design and Implementation of an Ontology for Process Construction
Knowledge-based systems, such as the process construction kit, should contain an explicit model of the underlying knowledge, so that the system's behaviour can be visualised, analysed and edited on a high level of abstraction. The notion of ontologies is fundamental in this context. An ontology [24] is an explicit specification of concepts and the relationships among them. Ontologies can be represented either by a network of interrelated terms or, as demonstrated in [14], as a hierarchy of object-oriented classes and instances. In the latter case, the classes define the types of entities in the knowledge base, whereas the instances represent specific entities.
We have designed a class-based ontology for process construction. The basic principles of this ontology can be compared with an event-driven, learning neural network. The ontology consists of nodes which are connected by directed edges, forming a network of interacting concepts. Nodes represent either process model objects (such as activities, documents and their relations), operators or evaluations. Evaluable nodes have a state of a given type, such as boolean, discrete, string or numeric. Constraints are represented in the states of these nodes. The state of the network is defined by the initial state of its nodes and the sequence of incoming events, such as user actions or state changes. Whenever a node changes, it notifies its predecessors, which then in turn may change their state. Thus, events may propagate through the net, finally reaching condition and operator nodes. If an operator node is reached, the associated action is applied in the given context. Operators might instantiate new processes or building blocks, update attributes of a given activity, or insert a building block into a process model instance.
An excerpt of the ontology's class hierarchy is presented in figure 4. All classes are derived from the abstract base class Node, which provides services for the event flow between adjacent nodes. The abstract class ProcessObject describes entities from the process model. Activities are basic ProcessObjects, representing executable real-world process steps. Processes can have a number of child ProcessObjects. Instances of the classes BooleanNode, DiscreteNode, StringNode and NumericNode can be evaluated to a value of their respective type. They either deliver constant values (as in the DiscreteValue class), variable values (as in the DiscreteVariable class) or derived values (as in the Equality class, which is linked to two EvaluableNodes and evaluates to true if the values of these adjacent nodes are equal). Boolean nodes, such as instances of the Equality class, can trigger Operator nodes, such as the AddActivityOperator, which constructs a new Activity with given properties. Operators can also be nested into complex operator sequences. Thus, complex process models can be generated and modified. The applied Operators are stored together with the ProcessObjects, serving as a backward reference which can be used to analyse the development process leading to a process model.
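The backward reference just described can be captured with a very simple record structure. The sketch below is our own illustration under stated assumptions; the names (AppliedOperatorRecord and its fields) are hypothetical and merely show how an applied operator, the affected process object and the triggering constraint could be linked so that both the what and the why of a change remain traceable.

```python
# Illustrative sketch (hypothetical names): recording which operator was applied
# to which process object, and which constraint triggered it, so that the
# history of a process case can later be analysed for new construction rules.
from dataclasses import dataclass, field
from datetime import datetime
from typing import Optional, List

@dataclass
class AppliedOperatorRecord:
    operator: str                         # e.g. "AddActivityOperator"
    affected_object: str                  # e.g. activity "Apply for export permission"
    triggering_constraint: Optional[str]  # e.g. "export certification = necessary"
    timestamp: datetime = field(default_factory=datetime.now)

history: List[AppliedOperatorRecord] = []
history.append(AppliedOperatorRecord(
    operator="AddActivityOperator",
    affected_object="Apply for export permission",
    triggering_constraint="export certification = necessary",
))

# Analysing the history (by human effort or computer support) may reveal
# recurring constraint/operator pairs, i.e. candidates for new construction rules.
for record in history:
    print(record.operator, "->", record.affected_object,
          "because", record.triggering_constraint)
```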
[Figure 4 shows a UML class diagram: the abstract base class Node; ProcessObject (name, status, priority, endDate) with the subclasses Process (holding child ProcessObjects) and Activity; Operator with apply(Context) and the subclass AddActivityOperator; EvaluableNode with the subclasses BooleanNode (evaluate: boolean) and DiscreteNode (evaluate: DiscreteValue); BooleanVariable, Equality (value1, value2: EvaluableNode), DiscreteVariable and DiscreteValue; boolean nodes trigger operators via newValue/trigger associations.]
Fig. 4. A simplified excerpt of the classes implementing the ontology for process construction (in UML notation)
An ontology built from these simple types of nodes represents the generic process models, the knowledge about constructing a process, and the current process case. At the same time, the ontology is executable and can be used to create or modify an existing process model. To do so, the concepts (or classes) from the ontology are linked to object-oriented methods which can be applied to them. Using the framework for implementing ontologies presented in [14], it is possible to attach methods to ontologies without sacrificing the abstract high-level view of the ontology. This implementation technique is based on object-oriented reflection, i.e. the ability to analyse the structure of a class hierarchy and the state of an object network at run time. This allows one to display and modify the ontology with generic tools, e.g. to visualise the ontology's reasoning processes at run time. The knowledge encoded in the ontology can also be used by various problem solving engines, such as the advisor that generates proposals for possible process adaptations. Such engines can traverse the network of adjacent concepts in order to identify related entities. Furthermore, the ontology implicitly contains the generic construction rules, which can be extracted and presented in a user-readable way. These construction rules can be generated automatically by learning from the operators applied in the given contexts.
3.2.3 Example Scenario
Figure 5 presents an excerpt of the ontology instances for the example presented in section 3.1.3. In this ontology, a given process has an attribute “DestinationCountry”, which is modelled in a DiscreteVariable of the same name. This variable can take one of its possible values, as shown below. From an abstract point of view, the knowledge encoded in this ontology specifies that the AddActivityOperator for the “Apply for export permission” activity shall be executed if the destination country variable is set to “Argentina”.
[Figure 5 shows the ontology instances for the example: a DiscreteVariable “DestinationCountry” with the possible DiscreteValues “Argentina” and “China”; an Equality node comparing the variable's current value with “Argentina”; a BooleanVariable “Export certification necessary?”; and an AddActivityOperator “Apply for export permission” which, when triggered, creates the Activity “Apply for export permission”.]
Fig. 5. A small ontology representing knowledge about when an application for export permission is necessary and what to do in this case
A sample use case of this ontology is shown in figure 6. The scenario starts with the user creating a new process on the GUI. The GUI creates a new Process instance, displays it on the screen and selects it. This sets the new Process to be the GUI's current editing context. Then the user specifies the destination country of the process to be “Argentina”. The GUI assigns this new value to the DestinationCountry variable. This variable notifies its predecessors in the ontology network about its state change. Thus, the Equality is evaluated and in turn notifies its predecessor node, i.e. the ExportPermission variable. This finally triggers the AddActivityOperator, which creates a new Activity and uses the GUI context to add it to the selected process.
[Figure 6 shows the corresponding sequence diagram: the GUI creates and selects a new Process (setting the GUI context), then sets the destination to “Argentina” on the DiscreteVariable DestinationCountry; the change notifications propagate via the Equality node to the BooleanVariable ExportPermission, which triggers the AddActivityOperator; the operator creates the Activity “Apply for export permission” and adds it as a child of the selected process.]
Fig. 6. A simple scenario based on the ontology from figure 5
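To make the mechanism of figures 4–6 concrete, the following Python sketch re-implements a minimal version of the node network. It is our own simplified rendering, not the authors' implementation (which, per [14], is a reflection-based object-oriented framework): class names follow figure 4, but the method signatures, the notification scheme and the GuiContext stand-in are assumptions made for the sake of a runnable example, and the intermediate BooleanVariable of figure 6 is omitted for brevity.

```python
# Minimal, simplified sketch of the ontology node network from figures 4-6.
# Class names follow figure 4; everything else (signatures, notification
# scheme, the GuiContext stand-in) is hypothetical and for illustration only.

class Node:
    """Abstract base class providing the event flow between adjacent nodes."""
    def __init__(self):
        self.predecessors = []          # nodes to notify on a state change

    def notify_predecessors(self):
        for node in self.predecessors:
            node.on_change(self)

    def on_change(self, source):        # overridden by reactive nodes
        pass

class ProcessObject(Node):
    def __init__(self, name):
        super().__init__()
        self.name = name

class Activity(ProcessObject):
    pass

class Process(ProcessObject):
    def __init__(self, name):
        super().__init__(name)
        self.children = []              # child ProcessObjects

    def add_child(self, child):
        self.children.append(child)

class DiscreteVariable(Node):
    def __init__(self, name, values):
        super().__init__()
        self.name, self.values, self.current = name, values, None

    def set_value(self, value):
        assert value in self.values
        self.current = value
        self.notify_predecessors()      # propagate the state change

class Equality(Node):
    """Evaluates to true if the variable's current value equals a constant."""
    def __init__(self, variable, constant):
        super().__init__()
        self.variable, self.constant = variable, constant
        variable.predecessors.append(self)

    def on_change(self, source):
        if self.variable.current == self.constant:
            self.notify_predecessors()  # event reaches the operator node

class AddActivityOperator(Node):
    def __init__(self, activity_name, context):
        super().__init__()
        self.activity_name, self.context = activity_name, context

    def on_change(self, source):
        self.apply(self.context)

    def apply(self, context):
        activity = Activity(self.activity_name)
        context.process.add_child(activity)

class GuiContext:                        # stand-in for the GUI's editing context
    def __init__(self, process):
        self.process = process

# Wiring the instances of figure 5 and replaying the scenario of figure 6.
process = Process("Plant construction bid")
context = GuiContext(process)
country = DiscreteVariable("DestinationCountry", ["Argentina", "China"])
export_needed = Equality(country, "Argentina")
operator = AddActivityOperator("Apply for export permission", context)
export_needed.predecessors.append(operator)

country.set_value("Argentina")           # triggers the operator via the network
print([child.name for child in process.children])
# -> ['Apply for export permission']
```

Setting the destination country is the only explicit user action; the insertion of the new activity emerges from event propagation through the node network, which is the point of the event-driven design described above.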
4
Related Work
Most of the commercially available process modelling tools use non-generic methods for process description. Non-generic reference process models can only serve as static templates that need to be adapted to specific requirements manually. However, some research approaches are related to our work.
With the ARIS Process Generator, an approach for automated customising of process models to a specific company has been developed and implemented in a prototype [9]. The context is described by answering pre-defined, semi-structured interviews. Such a tool is meant to be used by third-party champions such as organisation consultants for the configuration of standard software. However, there are no concepts for expanding and maintaining knowledge about process modelling, as is necessary for capturing experience from the process participants.
ProTail is a tool prototype for the automated adaptation of the V-model, a standard for software development processes [17]. Here, a process model is individualised according to specific project constraints. However, the description of the process context is limited to the pre-defined constraints of the V-model. Mutual dependencies between the constraints are not considered.
The aim of CoMo-Kit is to capture and reuse knowledge about the execution of processes by recording the selection of process variants during run-time [4]. However, the approach does not support the description of a project-specific context by constraints and the automated derivation of process model adaptations.
The Handbook of Organisational Processes is meant to serve as a handbook which users can browse for alternative process models at various levels of specialisation and decomposition. The approach “allows people to explicitly represent the similarities (and differences) among related processes and to easily find or generate sensible alternatives for how a given process could be performed” [16]. Alternatives can be compared only within a bundle of related alternatives by trade-off matrices on multiple dimensions. The specialisation of a generic process model is performed according to defined goals and the type of organisation. This approach is not suitable for the automated adaptation of a process model to an individual, project-specific context.
With the approach of WEGA, process modelling know-how is captured in generic design patterns and reused for the construction of process models [5]. The approach is based on the notion of software reuse by design patterns. A pattern manager supports the retrieval and application of appropriate design patterns. The design patterns can be adapted to a specific situation, but the adaptation is limited to a manual selection of pre-defined parts.
5
Conclusion
We have presented an approach for the project-specific individualisation of engineering processes. Know-how about engineering processes is a crucial ingredient for the successful operation of any virtual organisation. Large companies in fact already “live” the concept of virtual organisations in their internal operations. Our approach is
founded on the assumption that engineering experts will capture and maintain their know-how about engineering processes. Initial pilot applications have confirmed this approach, i.e. moving process modelling tasks from consulting experts towards engineering experts and process engineers. The process construction kit is operational in that it allows the formal representation of processes and operators at an abstract level while compiling modifications to the implementational level for execution. Knowledge about process design is represented in terms of operator sequences related to constraints. Each constraint is represented as a Boolean node. We have developed a dedicated ontology editor that allows one to specify and modify process models at a generic as well as at an instance level [14].
Currently, we are designing an ontology for classifying engineering processes and refining operators for individualisation in project-specific contexts. Particular emphasis will be put on a domain-specific ontology for the support of simultaneous engineering teams in the automotive sector. The main intention is to support the co-ordination of process tasks in a network of engineers, the identification of critical process structures and states, and advice about options for process amendments. In order to do so, the ontology will be founded upon basic patterns of construction processes derived from empirical studies of automotive development processes.
In general, the question arises of how to learn process knowledge, be it success stories or lessons learned in worst cases. Co-operation with engineering experts is one side of the coin for capturing experience about the construction of engineering processes. In an enterprise-wide perspective, communities of practice might even serve as organisational means to additionally extract and disseminate best practice and lessons learned from worst cases. Automatic mining or case-based reasoning might be another solution. We will explore metrics to assess processes based on various reference numbers for quality, goal compliance, etc.
So far, the process modelling component of our system has been tested at user sites for the capture of processes, in particular emphasising the notion of graphical process presentation for visual assessment [21]. The assessment of processes draws upon the duality of formal reasoning and mental perception [19]. Advisors attend to specific features of processes that are based upon formal theories. Visualisation techniques are employed to support process engineers in cases where formal theory about process characteristics is missing. That is, automated layout algorithms are used to generate different perspectives on processes. Moreover, we call for intuitive interaction means that organise processes in spatial regions and allow engineers to walk along process chains and navigate to neighbouring regions, be it a neighbourhood with regard to time, process or information dependencies.
Acknowledgements
Part of this work is being supported by the German Federal Ministry of Education and Research (BMB+F) as part of INVITE (Research Grant No. 01 IL 901 B 4). For further information on INVITE: http://www.invite.de.
References
1. Bider, I., Khomyakov, M.: Business Process Modeling: Motivation, Requirements, Implementation. In: Demeyer, S., Bosch, J. (eds.): Object-Oriented Technology: ECOOP’98 Workshop Reader, LNCS 1543, Springer (1997) 217-218
2. Curtis, B., Kellner, M.I., Over, J.: Process Modeling. In: Communications of the ACM 35 (1992) 9, 75-90
3. Davenport, T.H.: Process Innovation: Reengineering Work through Information Technology, Harvard Business School Press, Boston (1993)
4. Dellen, B., Maurer, F., Pews, G.: Knowledge Based Techniques to Increase the Flexibility of Workflow Management. In: Data & Knowledge Engineering Journal, North Holland (1997)
5. Ferstl, O.K., Hammel, C., Pfister, A., Popp, K., Schlitt, M., Sinz, E.J., Wolf, S.: Verbundprojekt WEGA: Wiederverwendbare und Erweiterbare Geschäftsprozess- und Anwendungssystem-Architekturen. In: Statusband des BMBF Softwaretechnologie, Berlin (1998) 3-21
6. Fox, M.S., Gruninger, M.: Ontologies for Enterprise Integration. In: Proc. 2nd Conf. on Cooperative Information Systems, Toronto, Ontario (1994)
7. Fricke, E., Negele, H., Schrepfer, L., Dick, A., Gebhard, B., Härtlein, N.: Modeling of Concurrent Engineering Processes for Integrated Systems Development. In: Proc. 17th Digital Avionics Systems Conf.: Electronic in Motion, Bellevue, WA (1998)
8. Gruninger, M., Fox, M.S.: The Role of Competency Questions in Enterprise Engineering. In: Proc. IFIP WG5.7 Workshop on Benchmarking - Theory and Practice, Trondheim, Norway (1994)
9. Hagemeyer, J., Rolles, R., Scheer, A.-W.: Der Schnelle Weg zum Sollkonzept: Modellgestützte Standardsoftwareeinführung mit dem ARIS Process Generator. In: Scheer, A.-W. (ed.): Publications of the Institut für Wirtschaftsinformatik at the University of Saarbrücken, Germany, Report No. 152, Saarbrücken (1999)
10. Hammer, M., Champy, J.: Reengineering the Corporation – a Manifesto for Business Revolution, Harper Collins Publishers, New York (1993)
11. Jablonski, S.: Process Modelling and Execution in Workflow Management Systems. In: Proc. Int. Working Conf. on Dynamic Modelling and Information Systems, Nordwijkerhout, Netherlands (1994)
12. Jennings, N.R., Norman, T.J., Faratin, P., O’Brian, P., Odgers, B.: Autonomous Agents for Business Process Management, Int. Journal of Applied Artificial Intelligence (to appear) (2000). http://www.elec.qmw.ac.uk/dai/pubs/
13. Johansson, H.J., McHugh, P., Pendlebury, A.J., Wheeler III, W.A.: Business Process Reengineering: Breakpoint Strategies for Market Dominance, Chichester (1993)
14. Knublauch, H., Rose, T.: Reflection-enabled Rapid Prototyping of Knowledge-based Systems. In: Proc. OOPSLA'99 Workshop on Object-oriented Reflection and Software Engineering, Denver, CO (1999)
15. Kratz, N., Rose, T.: Modelling and Analyzing Processes in Production and Administration. In: Tzafestas, S.G. (ed.): Management and Control in Manufacturing Systems, Springer, Berlin (1997) 118-142
16. Malone, T.W., Crowston, K., Lee, J., Pentland, B., Dellarocas, C., Wyner, G., Quimby, J., Osborne, C., Bernstein, A.: Tools for Inventing Organisations: Towards a Handbook of Organizational Processes. Technical Report of the Center for Coordination Science, Massachusetts Institute of Technology, Cambridge (1997)
17. Münch, J., Schmitz, M., Verlage, M.: Tailoring Großer Prozessmodelle auf der Basis von MVP-L. In: Montenegro, S., Kneuper, R., Müller-Luschnat, G. (eds.): Vorgehensmodelle – Einführung, Betrieblicher Einsatz, Werkzeug-Unterstützung und Migration: Papers of the 4th Workshop, GMD-Report No. 311, Berlin-Adlershof, Germany (1997) 63-72
18. Negele, H., Fricke, E., Schrepfer, L., Härtlein, N.: Modeling of Integrated Product Development Processes. In: Proc. 9th Annual Int. Symposium of INCOSE Systems Engineering: Sharing the Future, Brighton, UK (1999)
19. Peter, G., Rose, T., Rupprecht, C.: Towards Reducing the Complexity of Process Modeling by Advisors, Explicit Context Modeling, and Visualisation Techniques. In: Proc. 10th Mini EURO Conf. on Human Centered Processes (HCP '99), Brest, France (1999) 315-320
20. Remme, M., Allweyer, T., Scheer, A.-W.: Implementing Organizational Structures in Process Industry Supported by Tool-Based Reference Models. In: Proc. Conf. on Computer Integrated Manufacturing in Process Industries CIMPRO, East Brunswick, New Jersey, USA (1994) 233-247
21. Rose, T.: Visual Assessment of Engineering Processes in Virtual Enterprises. In: Communications of the ACM 41 (1999) 12, 45-52
22. Rupprecht, C., Peter, G., Rose, T.: Ein Modellgestützter Ansatz zur kontext-spezifischen Individualisierung von Prozessmodellen. In: Wirtschaftsinformatik 41 (1999) 3, 226-236. An earlier version of this article appeared in: Scheer, A.-W., Nüttgens, M. (eds.): Electronic Business Engineering – Proc. 4th Int. Conf. on Business Informatics in Saarbrücken, Germany, Physica, Heidelberg (1999) 353-373
23. Schütte, R.: Grundsätze Ordnungsmäßiger Referenzmodellierung: Konstruktion Konfigurations- und Anpassungsorientierter Modelle. Gabler, Wiesbaden (1998)
24. Studer, R., Fensel, D., Decker, S., Benjamins, V.R.: Knowledge Engineering: Survey and Future Directions. In: Proc. 5th German Conf. on Knowledge-based Systems, Würzburg, Germany (1999)
Practical Reuse Measurement in ERP Requirements Engineering Maya Daneva Clearnet Communications, 200 Consilium Place, Suite 1600 Toronto, Ontario M1H 3J3, Canada [email protected]
Abstract. Component-based, architecture-centric requirements engineering processes have been applied in Enterprise Resource Planning projects for a decade, but have only relatively recently achieved significant recognition in the broader software engineering research community. This sub-area is now entering a new level of maturity by beginning to address the reuse aspects in the ERP RE process. The present article makes a first contribution to applied ERP requirements reuse measurement by incorporating reuse metrics planning as part of the implementation of metrics on an ERP project. Relevant process integration issues are tackled in the context of SAP R/3 implementation projects.
1 Introduction Requirements Engineering (RE) forms the major concern in the implementation of any Enterprise Resource Planning (ERP) software package. It is an expensive process referring to the creation, the analysis, the adaptation, and the management of a large number of artifacts, or descriptions. It begins with the identification and the documentation of the company’s organizational units, their business processes and data needs and continues throughout the entire implementation cycle in the form of tracking of the life history of any particular requirement and business issue. The better the resulting business requirements are formulated, the faster the progress in subsequent phases, because the necessary decisions concerning the future ERP solution have been taken and agreed upon [19]. To make the RE process more efficient and to assure high quality results, the ERP community has adopted systematic requirements reuse approaches, developed an infrastructure of processes, people and tools for customers to reuse and, most recently, delivered component-based solutions to common business process and data requirements derived from numerous industry-specific business cases. These represent domain-specific frameworks with three major features [13]: an architecture defining the structure of integrated information systems within the business problem domain, a set of business application components engineered to fit the architecture, and a set of tools that assist the consultant in building component-based solutions using the domain knowledge within the architecture.
Currently, in spite of the increased attention that ERP requirements reuse has attracted, the wide application of reuse process methods and techniques, and the awareness of the benefits they have brought to ERP-clients, very few approaches have emerged to quantitatively measure the results from requirement reuse the customers have achieved [2]. As Pfleeger points out, we ‘can not do effective reuse without proper measurement and planning’ [14]. The present paper tackles this issue from measurement planning perspective. We present a requirements reuse measurement plan that links the reuse measurement needs to the ERP reuse goals and action items to be taken in the RE process. Our objective is to provide a sound and consistent basis for incorporating reuse metrics planning as part of the implementation of metrics on an ERP project. For the purpose of this research, we place the requirements reuse measurement activities in the context of implementing the SAP R/3 System, a leading product in the ERP software market [19]. However, our approach is generic enough and could easily be applied to any other ERP project implementation. In what follows, we first explain the motivation for our research effort. Then, we discuss the components of our reuse measurement plan and provide some illustrative examples of how measurements are useful. Section 5 generalizes our experience and reports on early lessons learnt and critical success factors. Section 6 concludes the paper and suggests directions for future work.
2 Motivation
An ERP requirements reuse measurement process is a systematic method of adopting or adapting standard reuse counting practices in ERP RE, measuring the ERP reuse goals, and indicating reuse levels targeted at the beginning and achieved at the end of each stage of the ERP implementation cycle. The main purpose of this process for ERP customers is to learn about their own business, technological and environmental opportunities by learning how much reuse their ERP-supported business processes could practice. The motivation behind the integration of the reuse measurement process in the RE process includes the following:
• Reuse measurement data available early in the project will enable the reuse process to be planned and reuse planning to be done as part of the RE process.
• Reuse metrics will provide a foundation for building and reinforcing partnerships, increasing customers’ understanding of the ERP functionality, re-prioritizing the business requirements, and communicating the value of ERP reuse.
• Reuse measurement will reduce the probability of errors and accidental omissions in the business process requirements.
• The reuse measurement process will serve as a vehicle for faster resolution of requirements problems and conflicts. Metrics data will help focus requirements elicitation and negotiation meetings and resolve architectural problems.
• Reuse data will serve as an input to an effort estimation model.
3 Components of a Requirements Reuse Measurement Plan
The requirements reuse measurement process should be documented in the form of a reuse measurement plan [5]. Its purpose is to establish a reuse measurement practice as part of a larger organizational process, namely the ERP RE process. Moreover, it represents a communication vehicle to ensure that all the team members agree with the approach, and serves as the on-going reference model to manage the implementation of reuse metrics. The plan defines the measurement process with exact information on the stakeholders involved, measurement frequency, sources of metrics data, counting rules, metrics data interpretation rules, tool support, reports to be produced, and action items that can be taken based on the metrics data (Fig. 1).
[Figure 1 shows the components of the reuse measurement plan: organizational objectives, stakeholders, the RE process model and RE process deliverables, the reuse measurement process model, standards, tools, reports, a data store, and action items.]
Fig.1. The components of the SAP reuse plan
Stakeholders and their objectives define what is to be achieved by running a reuse measurement process. Next, as per the recommendations provided by software metrics researchers and practitioners [14, 15], a model of the RE process is needed to capture the ERP reuse activities and to understand where measurements fit in. It should provide sufficient knowledge of (i) how to map reuse measures to RE activities, (ii) where and when in the RE process measurements could be taken, and (iii) how measurement activities could be integrated into the larger process. Given this context, a reuse measurement process model is required to specify what to count as requirements reuse, what units of measure to use, and how to count it. Furthermore, tools, data stores and standards for data collection, processing and packaging are to be selected to ensure the quality of the reuse metrics data. Finally, the plan concludes
with strategies for using the reuse data. These are presented in terms of metrics data reports to be created and action items that can be formulated based on the reported data. The components of our ERP reuse measurement plan are discussed in great detail in the next sections.
3.1 Understanding Stakeholders and Their Roles
Adequate and timely consultation of the parties interested in ERP is of great importance to the planning of reuse metrics. It helps us (i) make sure that the definitions of our metrics are based on our SAP team members’ goals, (ii) eliminate misunderstandings about how metrics data is expected to be used, and (iii) define relevant procedures for packaging, cataloguing, publishing and reporting reuse metrics data. To identify the stakeholders, we applied the approach developed by Sharp et al. in [17]. Based on early SAP project documentation, we developed stakeholder interaction diagrams that captured three important aspects of our team working environment: the relationships between stakeholders, the relationship of each stakeholder to the system, and the priority to be given to each stakeholder’s view. The organizational knowledge represented in the diagrams is needed to manage, interpret, balance and process stakeholders’ input into the SAP requirements reuse measurement process. It was used to structure the SAP project team members into four groups: (i) business decision makers, corporate executives from the steering committee who are responsible for the optimization, standardization and harmonization of the business processes across multiple locations, define the concept of ownership over the SAP R/3 system and are most interested in learning about the business benefits from SAP reuse; (ii) business process owners, department managers responsible for the project in specific business areas, who contribute the necessary line know-how, design new processes and procedures to be supported by the R/3 business application components and provide the project with the appropriate authority and resources; (iii) technical decision makers, SAP project managers responsible for planning, organizing, coordinating and controlling the implementation project; and (iv) configurators, both internal IT team members and external consultants involved in various work packages, e.g. process and data analysts, configuration specialists, ABAP programmers, system testers and documentation specialists.
Each stakeholder group has its own questions that should be answered by using the metrics data. Business decision makers would like to know:
• What level of standardization could be achieved by reusing ERP software assets?
• What competitive advantages does the team get from ERP reuse?
• What are the implications of reusing ERP processes in a constantly changing business environment?
• How can business processes be aligned across locations so that ERP reuse can yield significant cost reductions and enterprise-wide benefits?
• What implementation strategy fits best with the project?
Business process owners might ask:
• How does ERP reuse work with volatile process requirements?
• How much customization effort is required to implement minor/major changes in the business application components?
• What reuse expectations are realistic?
• What processes have the greatest potential for practicing reuse?
• What activities in our processes prevent us from reusing more?
Technical decision-makers need to know:
• How much effort is required to produce the user and training documentation associated with the customized components?
• How much reuse did the team do?
Configurators might ask:
• Are there any rejected requirements that should be re-analyzed because of reuse concerns?
• What implementation alternative fits best?
• Which segments of the requirements are likely to cause difficulties later in the implementation process?
The questions relevant to each group have been documented and attached to the stakeholder interaction diagrams.
3.2 RE Process Model Outline
The standard methodology for rapid R/3 implementation, called AcceleratedSAP (ASAP), provides a disciplined, reuse-driven, architecture-centric process for coordinating, controlling, configuring and managing changes of the R/3 business application components [1,11]. To investigate the ASAP RE process, we modelled it as a spiral (Fig. 2). Its radial arms represent the increasing collection of information by three types of activities: (i) requirements elicitation activities, which deliver the foundation for the business blueprint and are concerned with the finding, communication and validation of facts and rules about the business; (ii) enterprise modelling activities, which are concerned with business process and data analysis and representation; and (iii) requirements negotiation activities, which are concerned with the resolution of business process and data issues, the validation of process and data architectures and the prioritization of the requirements.
The ASAP methodology suggests four iterations of the spiral. The level 0 iteration aims at developing a clear picture of the company’s organizational structure based on the pre-defined organization units in the R/3 System. Next, the main objective of the level 1 iteration is to define aims and scope for business process standardization based on the R/3 application components. The level 2 iteration aims at deriving a company-specific business process architecture based on scenarios from the standard SAP process and data architecture components. Finally, the level 3 iteration refers to the specification of data conversion, reporting and interface requirements. The major actors in these activities are business process owners, who are actively supported by the SAP consultants and the internal SAP process and data architects.
The ASAP RE process is supported by the following tools: (i) the ASAP Implementation Assistant [11], which provides reusable questionnaires, project plans, cost estimates, blueprint presentations, blueprint templates, project reports and checklists, as well as manages the documentation base; (ii) the SAP Business
Engineer, a platform including a wide range of business engineering tools fully integrated into the R/3 System [1]; and (iii) enterprise modelling tools (ARIS-Toolset, LiveModel and Visio), which have rich model management capabilities and assist in analyzing, building and validating customer-specific process and data architectures based on the reusable reference process and data models.
[Figure 2 shows the ASAP RE process as a spiral with four iterations (level 0 to level 3). Each iteration passes through requirements elicitation (supported by questionnaires), requirements modelling (business scenario and data models) and requirements negotiation (business blueprint), drawing on reuse techniques such as the R/3 Reference Model, on requirements management, and on standards and tools.]
Fig.2. The SAP requirements engineering process
The ASAP RE process begins with reuse, ends with reuse and includes reuse in all the tasks in between. It is based on proven reuse practices and techniques and it ensures that the requirements are correct, consistent, complete, realistic, well prioritized, verifiable, traceable and testable. This is achieved by using the R/3 Reference Model, a comprehensive architectural description of the R/3 System including four views: business process view, function view, data view and organizational view. Specifically, the R/3 Reference Process Models represent integrated and function-spanning collections of business processes that occur often in practice and can be handled, to the greatest extent possible, automatically if a corporation implements the complete R/3 System [11]. Instead of building an integrated information system from scratch, with the R/3 Reference Model we build a solution from reusable process and data architectures based on SAP’s business experience collected on a large scale. Our analysis indicates that the R/3 Reference Model supports the RE process in multiple ways: (i) in requirements elicitation, it provides a way for process owners and consultants to agree on what the SAP business application components are to do; (ii) in requirements modelling, it serves two separate but related purposes. It helps to
quickly develop a requirement definition document that shows the business owners the process flow the solution is expected to support. Beyond that, it can be seen as a design specification document that restates the business specification in terms of R/3 transactions to be implemented; and (iii) in requirements negotiation, the R/3 Reference Model serves as a validation tool. It makes sure that the solution will meet the owners’ needs, is technically implementable and is maintainable in future releases. Reusing architectural components in the RE process saves both time and money. As business process requirements analysis is the most expensive consulting service in a business engineering exercise, the reuse of the R/3 Reference Model definitely provides the greatest savings.
3.3 Process Integration Model
This section describes exactly how reuse measurement can be integrated with the RE activities and where in the RE process reuse measurement data will be taken (Fig. 3). It involves several assumptions:
• Reuse data are extracted by an SAP process analyst on the basis of two major RE deliverables: business scenario models and business object models;
• reuse metrics data analysis is based on quantitative indicators;
• reuse metrics data is used to support stakeholders’ decisions during requirements negotiation and elicitation;
• reuse metrics data is reused at a later stage to support decision making in planning for future releases, upgrades and major enhancements.
[Figure 3 shows requirements reuse measurement embedded in the RE cycle: process and object models produced by requirements elicitation and modelling feed the reuse measurement activity; its reuse data reports flow back into requirements management (make reuse recommendations, set reuse goals and expectations, define the scope for practicing reuse, rethink rejected requirements, prioritize requirements) and into requirements negotiation (identify reuse constraints, clarify the motivation for customization, analyze options for process standardization, collect input to modify models).]
Fig. 3. Integration of requirements reuse measurement in RE
We suggest that reuse measurement be applied once the modelling activities of the level 2 iteration are completed and the customer-specific process and data architectures are built (Fig. 3). Given the reuse metrics data, the SAP process analyst may decide which negotiation or elicitation activities should take place next. The use of the metrics data is discussed in more detail in Section 4.
Our integration model implies that reuse measurement activities support the RE process in five areas: (i) definition of measurable reuse goals and expectations, (ii) quantitative analysis of process and data architecture reuse prior to solution design, (iii) assessment of the requirements specification, (iv) better understanding of the technical risks early in the ERP implementation cycle, and (v) definition of the scope of ERP reuse and how it fits into the business environment.
3.4 The Measurement Process
As Pfleeger [14] recommends, we have to choose reuse metrics based on what is visible to the SAP project team in the requirements modelling process of the level 2 iteration. Our approach uses the results of our previous research on the derivation of reuse indicators from SAP scenario process models and business object models [2]. It is based on the notion of “reuse percents” [18] and suggests a reuse indicator that expresses reused requirements as a percentage of the total requirements delivered [2]:
SAP_Reuse = (RR / TR) * 100%
where RR represents the reused requirements and TR represents the total requirements delivered. In this paper, a requirement borrowed from the R/3 Reference Model is classified as reusable if it does not require modification. If a borrowed requirement does require minor or major enhancement before use, we term it a ‘customized requirement’.
To build well-defined and valid metrics [7], we selected a consistent and reliable means for structuring and collecting the data that make up the metrics. A standard functional size measurement methodology, namely Function Point Analysis (FPA) [8], was applied to size the total and the reused requirements in the project. It was chosen because of its appropriateness to the software artifact being measured [8,16] and its proven usage and applicability in software reuse studies [10,15]. However, we needed to adapt FPA to the SAP requirements. This has been achieved in [3] by defining rules for mapping SAP business process models and data object models to the FPA counting components: we mapped SAP data entities to FPA data types, and SAP process components to FPA transaction types. As a result, the size of a scenario process model is assumed to be a function of the process components included in the model and the data objects defining the data that support the process. The step-by-step procedure for counting Function Points (FP) from scenario process models and business object models is described in [2] and [3] in terms of inputs, outputs and deliverables. Generally, it involves three stages: analysis of the process and data components, assignment of complexity values to the components, and calculation of the final FP value.
Based on the analysis of the changes [11] that could be applied to the R/3 Reference Model throughout the reuse-based process modelling exercise, the measurement data collected throughout the FP sizing procedure [3], and the modes of component reuse investigated by Karllson [10], we have defined three levels of requirements reuse [2]:
• Level 3: refers to process and data components that are reused without any changes. This category of reuse brings the greatest benefits to the SAP customer’s organization. Scenarios with a higher reuse rate at this level have a greater potential for practicing reuse.
• Level 2: refers to minor enhancements applied to reference processes and data components. A minor enhancement is defined as a change of a certain parameter of a business process or a data component that does not result in a change of the process logic. This category of reuse refers to those processes and data components of the R/3 Reference Model that logically match the business requirements but whose parameters need to be changed at code level to achieve their business purpose. Level 2 reuse is as desirable as Level 3 reuse.
• Level 1: refers to major enhancements applied to reference processes and data components. A major enhancement is any considerable modification in the definition of a process or a data component that affects the process logic from the business user’s point of view. This category of reuse refers to those processes and data components that do not match the business requirements and require changes at conceptual level, as well as at design and code level, to achieve their business purpose. Level 1 reuse is the least desirable.
In these definitions, the term process (component) refers to the functional units of any SAP scenario process model, and the term data component means a data entity, a relationship or an attribute from the data model describing the SAP business data requirements. Furthermore, we introduce a level of new requirements, No_Reuse, to acknowledge the fact that reuse is not practiced at all. It refers to newly introduced processes and data components. This is not a reuse category; it just helps us to partition the overall requirements and to get an understanding of how many requirements are not covered by the standard scenario processes and business objects.
Given our definition of what to count as reuse and how to count it, we have derived three reuse indicators [2]:
Level_i SAP_Reuse = (RR_i / TR) * 100%
where i = {1, 2, 3}, RR_i represents the requirements reused at level i, and TR represents the total requirements delivered. The indicator
No_Reuse = (NR / TR) * 100%
where NR represents the new requirements and TR has the above meaning, reports the percentage of requirements that cannot be met by the R/3 application package unless customer-specific extensions are developed.
Currently, case studies are being carried out to empirically validate our counting model and its application procedure [4]. This exercise is being done on the basis of Jauqet’s and Abran’s framework [9] for investigating measure validation issues.
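As a small illustration of how these indicators are computed, the sketch below derives the Level 1-3 and No_Reuse percentages from the FP counts of one scenario; the sample input is taken from the External Service Management row of Table 2. This is only our illustrative calculation under stated assumptions; the function and variable names are hypothetical and are not taken from the authors' tooling.

```python
# Illustrative computation of the reuse indicators defined above.
# Input: FP sizes per reuse level for one scenario (sample numbers taken from
# Table 2, External Service Management). Names are hypothetical.

def reuse_indicators(fp_by_level):
    """Return Level 1-3 reuse and No_Reuse as percentages of total FP."""
    total = sum(fp_by_level.values())          # TR, total requirements delivered
    return {level: round(100.0 * fp / total)   # (RR_i / TR) * 100%
            for level, fp in fp_by_level.items()}

external_service_mgmt = {
    "Level 3": 96,   # reused without changes
    "Level 2": 14,   # minor enhancements
    "Level 1": 0,    # major enhancements
    "No_Reuse": 31,  # newly introduced components
}

print(reuse_indicators(external_service_mgmt))
# -> {'Level 3': 68, 'Level 2': 10, 'Level 1': 0, 'No_Reuse': 22}, cf. Table 3
```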
3.5 Assembling a Toolset for Data Collection
Reuse measurements are only as good as the data that is collected and analyzed [7]. To assure the quality of the reuse data, at least three tools are needed: (i) a form for recording all the counting details, (ii) a reuse metrics database, and (iii) a process knowledge repository. We extended the FP counting form suggested in [8] by including the information needed for calculating the reuse indicators. Based on our FP counting model [3], we devised a counting form usage procedure that indicates at exactly what point each piece of data should be collected. The information has been stored and processed in Excel spreadsheet software. Summarized and detailed reports have been extracted from the Excel tables. For example, Table 2 shows the size numbers for four SAP business scenarios and Table 3 presents the summarized results from measuring reuse. Since reuse metrics provide knowledge about the business processes, reports on metrics data should be considered as part of the SAP process documentation. They can be stored, packaged, catalogued and published by using a corporate intranet repository as well as standard process modelling tools and the ASAP Implementation Assistant. In this way, the data is made available for review and analysis to all interested parties. Users of SAP documentation can easily navigate from scenario process models to functional size and reuse metrics data.
Table 2. Reuse levels for five SAP scenarios

Business Scenarios                              Level 3 FP   Level 2 FP   Level 1 FP   No Reuse FP
Project-related Engineer-to-Order Production        141           57           68           128
Procurement of Stock Materials                        74          198           32           127
Procurement of Consumable Materials                   86          165            0            30
External Service Management                           96           14            0            31
Table 3. Reuse levels for five SAP scenarios

Business Scenarios                              Level 3 Reuse   Level 2 Reuse   Level 1 Reuse   No Reuse
Project-related Engineer-to-Order Production          36%             15%             17%          32%
Procurement of Stock Materials                         19%             50%              8%          22%
Procurement of Consumable Materials                    31%             59%              0%          10%
External Service Management                            68%             10%              0%          22%
3.6 How to Link Reuse Data to Action Items
Measurements are considered useful if they help stakeholders (i) understand what is happening during the ERP RE process, and (ii) control what is happening on the ERP project [7]. Typically, two types of reuse profiles can be derived from a requirements reuse measurement table (Table 3): scenario-specific profiles, which present the levels of reuse pertinent to a given scenario, and level-specific profiles, which show how the requirements are reused at a specific level within a project. Business decision-makers can use both types of profiles in at least three ways: (i) multiple reuse profiles of two or more different ERP products (SAP, BAAN, PeopleSoft) can be compared to determine which package best serves the needs of the company and offers the greatest opportunity for reuse; (ii) multiple reuse profiles of different releases (SAP R/3 3.1, 4.0B, 4.5, 4.6) of one ERP package can be compared to determine which release brings the biggest benefits to the company; (iii) multiple reuse profiles of a single ERP package (e.g. SAP R/3) can build an assessment of the overall level of standardization of the ERP solution in the organization. Reuse profiles of a single ERP package (e.g. SAP R/3) can be used by technical decision-makers to plan and control the reuse levels in the later phases of the ASAP implementation process. Business process owners and configurators can track requirements reuse levels over time to control the changes in overall reuse during the iterations of the RE process. Furthermore, the specific use of each profile is systematically documented by using a Reuse Data Usage Table. It is built to characterize four aspects of a reuse profile: who needs to read the profile data, what the profile can help us understand, what the profile can help us control, and what action items are likely to be taken based on the reuse profile. Table 4 and Table 5 report on the current usage of the scenario-specific and level-specific profiles, respectively. (BDM, PO, TDM and C stand for business decision-makers, process owners, technical decision-makers and configurators, respectively.)
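To illustrate how such profiles can drive action items, the sketch below builds scenario-specific and level-specific profiles from the Table 2 FP counts and applies the implementation strategy heuristic of Table 5 (step-by-step when Level 1 reuse dominates, big-bang when Level 3 reuse dominates). The function names, the dominance comparison and the per-scenario application of the heuristic are our own illustrative assumptions, not part of the measurement plan itself.

```python
# Illustrative sketch: deriving reuse profiles (cf. Tables 2 and 3) and mapping
# them to action items (cf. Table 5). Function names and the dominance
# heuristic are hypothetical simplifications for demonstration purposes.

FP_BY_SCENARIO = {  # sample FP counts per reuse level, taken from Table 2
    "Procurement of Consumable Materials": {"Level 3": 86, "Level 2": 165, "Level 1": 0, "No_Reuse": 30},
    "External Service Management": {"Level 3": 96, "Level 2": 14, "Level 1": 0, "No_Reuse": 31},
}

def scenario_profile(fp_by_level):
    """Scenario-specific profile: reuse percentages for one scenario."""
    total = sum(fp_by_level.values())
    # simple rounding; the published Table 3 rounds slightly differently in places
    return {level: round(100.0 * fp / total) for level, fp in fp_by_level.items()}

def level_profile(scenarios, level):
    """Level-specific profile: one reuse level compared across scenarios."""
    return {name: scenario_profile(fp)[level] for name, fp in scenarios.items()}

def implementation_strategy(profile):
    """Table 5 heuristic: step-by-step if Level 1 dominates, big-bang if Level 3 does."""
    if profile["Level 1"] > profile["Level 3"]:
        return "consider a step-by-step, sequenced implementation"
    if profile["Level 3"] > profile["Level 1"]:
        return "consider a big-bang implementation of multiple components"
    return "review scenarios function by function before deciding"

for name, fp in FP_BY_SCENARIO.items():
    print(name, scenario_profile(fp))

print("Level 3 reuse across scenarios:", level_profile(FP_BY_SCENARIO, "Level 3"))
print("Suggested strategy:", implementation_strategy(
    scenario_profile(FP_BY_SCENARIO["External Service Management"])))
```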
4 Discussion on the Reuse Data Usage The examples of Table 2 and 3 show scenarios referring to two SAP components: Project System and Materials Management. The Project-related-Engineer-to-Order scenario describes how to manage projects for building cell sites for a country-wide mobile telephone network. Its Level 1 Reuse and No_Reuse ratings are relatively high due to significant customization and numerous external interfaces required by the process owners. Next, the scenarios of Procuring Stock Materials and Procuring Consumable Materials refer to the business workflow in purchasing, inventory management, and warehouse operations. They cover the processes of material requirements planning, purchase requisition processing, purchase order processing and goods receipt processing. Furthermore, the External Service Management scenario documents the workflow for service provisioning and handling contractors in cell site construction. This scenario is the one, which practices most Level 3 reuse.
320
M. Daneva
Table 4. Reuse data usage table for scenario-specific profiles

Usage: Understand the customization risk for upgrade projects.
Read by: BDM, PO, TDM, C
Action items: 1. Assess the difficulty in the migration of processes with low reuse rates. 2. Reengineer the business requirements. 3. Budget and plan resources for extra gap analysis for the processes with low reuse rates.

Usage: Understand how much reuse the team did.
Read by: BDM, PO, TDM
Action items: 1. Set reuse expectations for later stages. 2. Define scope for practicing reuse. 3. Make process reuse recommendations.

Usage: Understand reuse constraints / Assess the level of standardization.
Read by: PO, TDM, C
Action items: 1. Elaborate alternative process flows to eliminate the need for customization. 2. Re-assess reuse levels. 3. Compare processes to select the best alternative.
Table 5. Reuse data usage table for level-specific profiles

Usage: Define focus for negotiation meetings.
Read by: PO, TDM, C
Action items: 1. Review scenarios on a function-by-function basis to justify why customization is necessary. 2. Structure requirements in three categories: must-have, nice-to-have and possible-but-could-be-eliminated.

Usage: Select an implementation strategy.
Read by: BDM, TDM
Action items: 1. Consider a step-by-step approach to a sequenced implementation, if Level 1 reuse dominates. 2. Consider a big-bang approach, if Level 3 reuse dominates.
The scenario-specific data usage table suggests what benefits the reuse measurements bring to those team members who are responsible for planning for reuse and assigning target reuse levels to each scenario to be achieved throughout the R/3 implementation project. Some examples of how these profiles are helpful include the following:
• The data can be used in level 3 requirements elicitation to understand what prevents the team from reusing more. In the Project-related-Engineer-to-Order scenario, the low level of reuse is due to three reasons: (i) the standard R/3 functionality does not offer sufficient support to the business practices specific to a mobile telecommunication services provider, (ii) numerous external interfaces to legacy systems should be built, and (iii) business processes in cell site construction have not been standardized across the locations: different regional cell site groups wanted to use the SAP system differently.
• The data are useful in planning for both new implementations and upgrades. In the first case, documentation development risks that were not foreseen may appear for processes with high Level 1 Reuse or No_Reuse rates. They are likely to need additional resources (e.g. business process owners, internal training specialists, and documentation analysts) to get documented. In the case of upgrades, reuse profiles help the team assess the degree of difficulty involved in the migration to the new release. For example, Table 2 suggests that the process of Project-related Engineer-to-Order Production should be migrated with extra caution.
Next, the level-specific usage table is important to requirements negotiation activities. Two illustrative examples refer to the requirements prioritization activities and the implementation strategy selection activities:
• The reuse data help the team decide what to focus the negotiation efforts on. As the process owners get a better understanding of SAP reuse and recognize that R/3 customization is one of the most risky matters to deal with in the package implementation, they become more conscious of avoiding unnecessary adaptation and are willing to re-prioritize the requirements.
• The level-specific profiles can help both business and technical decision-makers determine what SAP implementation strategy fits best with the organizational objectives. If Level 1 reuse dominates and much customization effort is anticipated, the team is likely to adopt a step-by-step approach to a sequenced implementation of the SAP components. If Level 3 reuse rates are the highest ones, the customization risks are reduced and a big-bang approach to implementing multiple components seems to be reasonable.
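The strategy-selection heuristic from the last bullet can be written as a simple decision rule. The sketch below is an interpretation only: the "dominant level" test and the treatment of No_Reuse are assumptions of this example, since the heuristic is stated only qualitatively above.

```python
def suggest_strategy(profile):
    """profile maps reuse levels to percentages, e.g. {'Level 3': 68, ...}."""
    dominant = max(profile, key=profile.get)
    if dominant == "Level 3":
        return "big-bang implementation of multiple components"
    if dominant in ("Level 1", "No Reuse"):
        return "step-by-step, sequenced implementation"
    return "no clear recommendation from reuse data alone"

print(suggest_strategy({"Level 3": 68, "Level 2": 10, "Level 1": 0, "No Reuse": 22}))
# -> big-bang implementation of multiple components
```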
5 Packaging Experiences SAP requirements sizing and reuse counting has been practiced in four SAP projects: three new implementations and one upgrade. While applying the process, we collected and documented some facts and observations about the context of reuse measurement. Thus, we obtained a set of experience packages that suggested explanations of how and why the measurement process worked as part of the RE cycle [1]. Each package consists of characteristics of the project context, a logical conclusion about specific aspects of the measurement process and a set of facts and observations that support this conclusion. The conclusions represent either early lessons learnt that tell us what and how worked in the process or critical success factors that suggest why it worked. A summary of our early lessons learnt is given as follows: • Requirements reuse measurement helps us understand in both qualitative and quantitative terms the role of the pre-defined process models in ERP RE. • The measurement process must be focused on defining action items based on the reuse data metrics, not on collecting and reporting data. • The process leads to consistent traceability information being maintained for all the business processes.
• It increases the probability of finding poorly prioritized requirements.
• Reuse data is a central record of all the process-specific reuse information.
• Reuse data helps to focus the validation process.
• Reuse measurement should not be practiced as a short-term process that would be dropped at the end of the SAP implementation cycle. Measuring requirements reuse should not be considered as one-sided gathering of information. It is neither a one-time process knowledge capture, nor a quick fix.
Moreover, 10 critical success factors have been identified:
• Apply a stakeholder identification method to the SAP project organization. This made sure that all important stakeholders have been captured, and yet that irrelevant actors have not been included.
• Use the ASAP standard processes, deliverables and tools. This significantly shortened the time needed to model the RE process and to spot where in this process measurements could be taken, analyzed and used.
• Adopt (if possible) or adapt a standard methodology for sizing the business requirements. FPA proved its usefulness and applicability in ERP RE.
• Integrate the reuse measurement process incrementally. Pilot it by applying it to the business scenarios pertinent to a selected ERP component.
• Consider the metrics data reports as a supplement to the business blueprint. The business process owners should review them like the other deliverables included in the business blueprint.
• Take extra effort to experiment with the reuse measurement process and to collect and document the series of action items the team members suggest based on the metrics data.
• Understand the role of the reusable components and the reuse techniques in the ERP RE process.
• Maintain a limited number of requirements reuse measurement process documents: it is sufficient to start with a reuse measurement plan, a FP form and a customizable report template for presenting the results.
• Think out a strategy of how to maximize the benefits of the business engineering tools the team uses in the course of the ERP implementation. These can be of great support to the measurement process.
• Use the data for planning action items.
6
Conclusions
This paper has looked in depth at the most important planning aspects of requirements reuse measurement in ERP projects. We blended stakeholder interaction analysis with a process integration model that ensures the visibility of both reuse measurement and RE activities. This resulted in a practical requirements reuse measurement plan that could be applied incrementally to selected portion of the business requirements as well as to the entire project. The plan documents the components of a consistent measurement process: relevant stakeholders, a RE process model, a process integration model, counting rules, tools and reuse data usage tables. The process is
kept reasonably simple so that team members can concentrate on their requirements elicitation and negotiation activities, with functional size and reuse counting and data report generation playing a supporting role. Experiences of practicing the reuse measurement process have been packaged to derive early lessons learnt and critical success factors for an on-going ERP reuse measurement initiative. It is the belief of the author that the plan is applicable beyond the ERP RE context of SAP R/3 implementation projects. The reuse measurements work best where there exist reference process and data models of the business application components being implemented. Thus, the approach could find applicability in implementation projects based on other ERP packages, for example BAAN and PeopleSoft. In addition, it can be used in business applications where business process modelling has captured the interactions in the business workflow and the data objects from the business area. Next, different company-specific or project-specific aspects, for example project communication [20], process maturity and stability, might lead to shortening or eliminating some stages in reuse metrics planning. Basically, the decision for how to handle reuse at the requirements level is a risk-based one and depends on the assessment of the risk of having the customization of a standard package out of control versus the costs and the residual risk of each possible reuse handling option. We consider the work reported in this article as only the beginning of an ongoing effort to develop better requirements reuse measurement practices. Further work will take several directions. One is to build a lessons learnt architecture by using product/process dependency models [7]. Second, it would be interesting to investigate some ERP effort estimation models based on our measures. Lastly, we plan to develop an approach to evaluating the added-value benefits from ERP requirements reuse. The author would like to thank the anonymous reviewers whose comments led to the improvement of this paper.
References 1. Curran, T., A. Ladd, SAP R/3 Business Blueprint, Understanding Enterprise Supply Chain Management, 2nd. Edition, Prentice Hall, Upper Saddle River, NJ (1999) 2. Daneva M.: Mesuring Reuse of SAP Requirements: a Model-based Approach, Proc. Of 5th Symposium on Software Reuse, ACM Press, New York (1999) 3. Daneva, M., Deriving Function Points from SAP Business Processes and Business Objects, Journal of Information and Software Technologies (1999). Accepted for publication 4. Daneva M., Empirical Validation of Requirements Reuse Metrics. In preparation. 5. Desharnais, J.-M., A. Abran, How to Successfully Implement a Measurement Program: From Theory to Practice. In: Müllerburg, M., Abran A. (eds.): Metrics in Software Evolution, R. Oldenbourg Verlag, Oldenburg (1995), 11-38. 6. ESPRIT Project PROFES, URL: http://www.ele.vtt.fi/profes. 7. Fenton, N., Pfleeger, S.L.: Software Metrics: Rigorous and Practical Approach, PWS Publishing, Boston Massachusetts (1997) 8. Garmus D., D. Herron, Measuring the Software Process, Prentice Hall, Upper Saddle River, New Jersey (1996).
9. Jacquet, J.-P., Abran, A.: Metrics Validation Proposals: a Structured Analysis. In: Dumke, R., Abran, A. (eds.): Software Measurement, Gabler, Wiesbaden (1999), 43-60. 10. Karlsson, E.-A. (ed.): Software Reuse, John Wiley & Sons, Chichester (1998) 11. Keller, G., Teufel, T.: SAP R/3 Process Oriented Implementation, Addison-Wesley Longman, Harlow (1998) 12. Lozinsky, S.: Enterprise-wide Software Solutions, Addison-Wesley, Reading MA (1998) 13. McClure, C.: Reuse Engineering: Adding Reuse to the Software Development Process, Prentice-Hall, Upper Saddle River, NJ (1997) 14. Pfleeger, S.L.: Measuring Reuse: a Cautionary Tale, IEEE Software, June (1997) 15. Poulin, J. Measuring Software Reuse: Principles, Practices, and Economic Models, Addison-Wesley, Reading, MA (1997) 16. Robinson, S. , J. Robinson, Mastering the Requirements Process, Addison- Wesley, Readings, MA (1999) 17. Sharp, H., A. Finkelstein, G. Galal, Shakeholder Identification in the Requirements Engineering Process, Proceeding of the 1st Intl. Workshop on RE Processes/ 10th Intl Conf. on DEXA, 1-3 Sept., 1999, Florence, Italy. 18. Statz, J., Leverage Your Lessons, IEEE Software, March/April (1999), 30-33. 19. Welti, N., Sussessful R/3 Implementation, Practical Management of ERP Projects, AddisonWesley, Harlow, England (1999).
Building Extensible Workflow Systems Using an Event-Based Infrastructure Dimitrios Tombros and Andreas Geppert Department of Information Technology, University of Zurich {tombros, geppert}@ifi.unizh.ch
Abstract. This paper describes an approach towards the systematic composition of workflow systems using an event-based approach. The complex nature of workflow systems and the heterogeneity of the application systems which are integrated require powerful general purpose composition mechanisms. Furthermore, it is advantageous if the functionality of the underlying workflow management system can be adapted to the type of workflow applications for which a system is intended. We propose an extensible event-based architectural framework for workflow systems which allows the composition of workflow systems by reuse and customization of reactive components representing workflow system processing entities. We also consider the structure of a build-time repository to support this architectural framework.
1
Introduction
Traditional software development methods do not meet the evolving requirements of heterogeneous and process-oriented systems. A principal open problem remains the systematic development of workflow systems (WFS) based on appropriate abstractions. The current state of WFS technology has not yet reached consensus on the proper component abstractions for the workflow domain. Instead, there is a proliferation of research proposing the benefits of particular workflow management concepts. Some efforts, such as those of the Workflow Management Coalition and the OMG Workflow Workgroup [16], attempt to converge various ideas in the workflow management community. Still, however, WFS development remains a largely ad hoc effort done on a case-by-case basis. A methodical approach to WFS development utilizes component-based engineering, in which new WFS are composed out of already existing, reusable artifacts. Component-based WFS development is however in a very early stage. This is not only an inherent problem of workflow management technology but can also be explained in the general context of system composition research. Based on general observations concerning software composition [12], we consider the following facts as especially relevant for WFS development:
– It is not clear how domain knowledge should be captured and formalized to support component-oriented development. Especially in the domain of workflow management, there has been little or no research on this issue. The definition of reuse-oriented domain-specific models is still an ongoing process.
– There is no synergy between analysis and design. The relation between workflow specification and workflow implementation has not been considered under the perspective of reusability. For example, most workflow specification languages provide reuse mechanisms for workflow artifacts, but do not consider the reuse of processing entity (PE) implementations. The mapping of workflow specifications to WFS implementation is ad hoc. – There are no generally accepted methods for the design of frameworks supporting component-based WFS development. In the domain of workflow management an architectural perspective and the resulting framework-based development approach is in a very early stage. – There are no software tools which facilitate component-oriented WFS development. The support provided by workflow management systems (WFMS) in this respect is limited to mere workflow specification. We follow a composition-based approach for WFS built around an appropriate architectural framework. The approach supports the architecture-centric development and extension of WFS. It consists of the following elements: – a lightweight but extensible generic event-based workflow execution and application integration platform; – an approach for the analysis of the WFS architecture and the classification of WFS components. We consider components as self-contained units of abstraction, with defined connection interfaces, and an individual life cycle (see, e.g., [12,19] for more elaborate discussions of the nature of components); – a domain-specific metamodel for the description of the architecture and functionality of the intended WFS; – a framework for the composition of WFS out of pre-existing parameterized component templates; and – support for the reuse-based composition of WFS through an architecture artifact repository and the event-based execution platform. This infrastructure provides a workflow specification execution system. In this paper, we concentrate on the domain-specific metamodel and the compositional framework for WFS. Due to space considerations, we describe only briefly the underlying analytical approach and the repository-based reuse of WFS components. The remainder of this paper is organized as follows. The next section introduces the WFS development life cycle. Sections 3 and 4 discuss PE as reusable components. Section 5 presents repository support for WFS construction, section 6 discusses relevant related work, and section 7 concludes the paper.
2
Workflow System Development Life Cycle
In this section, we introduce a WFS development life cycle based on the composition of reusable components. A WFS consists of heterogeneous real-world PE, each of which contributes to the tasks of the WFS in some way. Typical PE
Fig. 1. Workflow system development lifecycle
include application software, WFS interface software (e.g., worklist interfaces, or administration tools such as a workflow execution monitor and visualization tool used as an example in this paper), and human entities. Depending on the specific workflow application, a different set of PE may be required. These are represented by appropriate WFS components as described below. In a nutshell, the aim of the WFS construction process is to compose reusable representations of the PE together into a coherent system. Any WFS provides minimal functionality for workflow management. This core WFMS comprises an event composition, routing and notification subsystem (described in more detail in [8]), a subsystem for executing event-condition-action rules (ECA rules) [22], buildtime and runtime repositories, as well as a set of component design and instantiation tools. Additional components can be added to the WFS as required, organizational aspects of the PE can be defined, workflow specifications can be designed, and eventually, workflow instances can be created and enacted. The lifecycle consists of the following groups of activities (see Fig. 1).
We assume that a project initiation phase precedes the entire lifecycle. In this phase, the goals with respect to the developed WFS are set. The project management team determines which business processes will be automated by the WFS. An appropriate business process modeling (BPM) formalism should be selected at this point. It is important however, that a mapping is provided by the BPM environment to the target workflow specification language. Processing Entity Analysis and Integration. PE are integrated in a WFS; they either extend the functionality of the core WFS or are the resources that carry out the business-related workflow tasks. For example, in the course of our work we were able to augment the core WFS with workflow monitoring and event auditing components. The analysis and integration of PE includes the composition of a reactive component template which represents the PE based on the analysis of its characteristics. These characteristics determine the properties of the component that represents the PE. For each PE it is defined which workflow tasks (services) it provides to its clients, which are in general other PE. Organizational Modeling. Once new PE (i.e., their representing component templates) have been added to the WFS, organizational aspects such as specific attributes (e.g. costs per execution time unit) and relationships between PE (e.g. is-supervisor-of relationships among human PE) may have to be modeled to support the definition of task assignment rules refering to organizational aspects. Workflow Modeling. New workflow specifications can be constructed by referencing and using information available about PE and other workflow specifications. The use of an appropriate workflow specification language which supports the reuse of process definition artifacts is required. For the monitoring component mentioned above, its behavior was specified such that it should be notified of the occurrence of the events which the WFS administrator wanted to monitor. Workflow Architecture Definition. The next phase involves the development of a homogeneous event-based architecture model of the WFS. The architectural elements are summarized in Table 1, where the correspondence between relevant facts from the real world, their respective aspect in workflow specification, and finally the provided architectural abstraction is depicted. Workflow System Instantiation and Enactment. New instances of the defined workflow component templates can be created. These create and enact the defined workflow models. Multiple iterations of these above steps may be required to compose a WFS.
3
Analysis of Workflow System Architecture
Successful composition-based construction of WFS requires appropriate abstractions as well as the maintenance of sufficient information about their functionality and semantics. To that end, we represent explicitly the integration-related properties characterizing a PE type with respect to the WFS under construction. These properties are termed processing entity connotations (see Fig. 2). Components and connectors [1] are implementation-level constructs defining the structure and communication paths of a WFS. PE connotations associate the
Table 1. Workflow system architecture definition elements

Business process model aspect | Workflow modeling aspect | Architectural abstraction
information system components and connectors | workflow system architectural structure | reactive component templates, event type registrations
organizational constraints under which the workflow is executed | organizational model | organizational relations
PE functionality used in business process | workflow tasks | services
activity execution sequence | control flow | ECA-rules
activity data dependencies | data flow | event parameters
business rules | task assignment | suitability, dispatching
human role, application functionality | PE behavior | ECA-rules, rule packages
properties of the PE type with the characteristics of the components and connectors needed to adequately represent PE in the WFS.

Fig. 2. Relationships between workflow system composition elements (in UML)
The definition of PE connotations is based on a faceted classification scheme [15] and consists of so-called traits (properties of PE). Unlike the pure faceted classification, leafs of the classification trait trees may be either simple terms or expressions. Simple terms are chosen from the standard vocabulary of the domain, while expressions describe a dependency to other elements of the WFS architecture—elements which are not necessarily components. Such elements may be defined outside the scope of the classification scheme (e.g., in the workflow model). The classification serves three main purposes: (i) determination of the required properties of the components used for PE representation, (ii) analysis of the integration issues facing the composer of a WFS, and (iii) architecturecentric based development and reuse [3].
Table 2. Connotation of the EVE monitoring interface component

Participation
  Optionality: optional
  Multiplicity: max(num of(EVE server))
  Dependency: local EVE server, defined(MonitorWF)
  Functionality: user interaction, display, notification
Implementation
  Automation: interactive
  State: yes
  Server type: concurrent
  Guarantee: none
  Access point: in, out: implicit-triggering (EVE messages)
Participation traits characterize a PE’s role in a particular WFS. They define architectural constraints on the WFS: – Optionality refers to the requirements concerning the existence of the PE. The implications of the trait concern the existential properties of component instances representing the PE in the WFS. – Multiplicity refers to the number of instances of the PE type which may be concurrently active. This affects the required task assignment protocols. – Dependency refers to the assumptions made by the PE for its operation with respect to the existence of other PE and provided services. – Functionality provides an implementation-independent characterization of the PE’s role in the WFS. Implementation traits constrain the components of the WFS which represent/implement the PE. They include its degree of automation, its ability to maintain information about its process execution context and across workflows, server-related properties, execution guarantees and transaction support, and access points, i.e., the mechanisms of interaction that the PE supports (the incoming interface) and the mechanisms of interaction it uses (the outgoing interface). Additional traits may be required and even within traits which seem complete, i.e., all except those concerning interface and service, additional terms may be required. Such modifications and additions are not excluded by the classification scheme. However, the implications and effects they have on the architectural framework will have to be considered. An example of a connotation for a monitoring interface component is presented in Table 2. The participation traits define that the component is not required for system operation and may be instantiated at most once for each server in the system. Its operation depends on the operation of the server for which it is instantiated and on the definition of a monitoring workflow type defining monitoring services. The component provides user interaction, display, and notification functionality.
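As an illustration of how such a connotation could be recorded, the sketch below encodes the traits of Table 2 as a plain data structure. The class layout and field names are assumptions made for this example and do not correspond to the repository schema described later in the paper.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class Connotation:
    # participation traits
    optionality: str
    multiplicity: str
    dependency: List[str]
    functionality: List[str]
    # implementation traits
    automation: str
    state: bool
    server_type: str
    guarantee: str
    access_point: str

# The EVE monitoring interface component of Table 2, encoded with this sketch.
monitor = Connotation(
    optionality="optional",
    multiplicity="max(num of(EVE server))",
    dependency=["local EVE server", "defined(MonitorWF)"],
    functionality=["user interaction", "display", "notification"],
    automation="interactive",
    state=True,                  # the component maintains state
    server_type="concurrent",
    guarantee="none",
    access_point="in, out: implicit-triggering (EVE messages)",
)
```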
4
Reusable Workflow System Components
Processing entities are represented on the architectural level by instances of composite reactive components called event occurrence brokers (EOB). They execute workflow tasks as a reaction to workflow-relevant situations in their environment. The occurrence of these situations is manifested by events. EOB generate and react to WFS events, and map their meaning to concepts present in the vocabulary of the respective PE. In other words, they provide a homogeneous representation of heterogeneous real-world PE and provide the PE functionality to the WFS-internal miniworld. 4.1
Events and Services in Workflow Systems
The most flexible and loose model of integration is asynchronous interaction based on events. An event-based approach to data, control, and process integration in a WFS architecture has many advantages [2]. Hence, in our approach events are the sole component integration and interaction mechanism, i.e, they are the only available type of connectors between EOB in a WFS. Events are used for the following purposes: – signaling of relevant workflow situations and definition of control flow, – invocation of functionality provided by PE, and – exchange of data between PE. The event typology is a tree subdivided into primitive and composite events. Primitive events belong to the following types: time events, interaction events, and EOB internal events. EOB internal events are relevant only for the implementation of EOB-internal communication. Time events can be absolute, relative, or periodic. Absolute time events express real-time points and are defined in terms of a time specification expressing a date and time recorded by a local clock site. The exchange of coordination information between EOB takes place by the signaling and reaction to EOB interaction events. Thus the semantics of EOB interaction events are associated with execution states of workflows. The manifestation of these events are event messages which are forwarded by the underlying communication infrastructure to EOB which have a registered interest in these events, called the event listeners. Interaction events are service requests, confirmations, replies, and exceptions. These events contain system-provided (implicit) and user-provided parameters. Through the use of service request events the execution of services by EOB can be triggered. The initiation of service execution is signalled by request confirmation events. Finally, the results of service execution can be communicated to other EOB by service reply events. The special meaning of exception event types with respect to the other event types consist in the fact that for each such type defined, a corresponding exception handling component must be defined, i.e., an EOB which has some predefined reaction on the event occurrence.
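A minimal sketch of the interaction event types just described (request, confirmation, reply, exception); the class names and fields here are illustrative assumptions rather than the system's actual definitions.

```python
from dataclasses import dataclass
from typing import Any, Dict

@dataclass
class InteractionEvent:
    service: str            # the service the event refers to
    origin: str             # the EOB that raised the event
    params: Dict[str, Any]  # implicit (system-provided) and user-provided parameters

class ServiceRequest(InteractionEvent): pass       # triggers execution of a service
class RequestConfirmation(InteractionEvent): pass  # signals that execution has been initiated
class ServiceReply(InteractionEvent): pass         # communicates the result of the execution
class ServiceException(InteractionEvent): pass     # failure; requires a handler EOB

req = ServiceRequest(service="monitor", origin="admin_ui_eob",
                     params={"workflow": "MonitorWF"})
```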
Composite events are used to express complex workflow situations. As with primitive events, we distinguish between event types and occurrences of these types. Composite event types are defined by applying unary and binary event operators to event types. The event operators include conjunction, sequence, exclusive-or and inclusive-or disjunction, repetition, iteration, and negation. The main extension is the concurrency operator which is meaningful only in the context of distributed systems. Composite events have formal semantics described in detail in [20]. It is important to mention that workflow execution based on events produces an event history. Services are defined based on the function traits of PE. A service is provided by one or more EOB in the WFS called the server(s) in the context of a service execution. It can be requested by other EOB which are the clients in the context of that execution. Services define the PE capabilities assumed for a given WFS, independent of the EOB defined in the system. They represent the interface between workflow specification and WFS architecture and can be leveraged to workflow task types declared in a workflow schema. In other words, workflow specification makes use of the service abstraction in order to define the functionality of desired processing steps. A service identifies an operational interface with a predefined set of parameters which are provided by the client EOB during service request. It also identifies a set of replies which express the possible outcomes of a service execution. These replies may have various parameters provided by the server EOB. 4.2
Event Occurrence Brokers
An EOB is an instance of a composite component template built from reusable subcomponents which implement parts of the EOB functionality and communicate through an internal message bus. The type of the EOB determines its subcomponents, their properties, and the events they understand. Every EOB includes at least the following subcomponents: an event delivery (EDI) and an event posting (EPI) interface, a persistent state management subcomponent, an ECA rule management subcomponent, and a PE management subcomponent. This structure is depicted in Fig. 3. The component instances composing a WFS communicate by broadcasting event messages over a reliable shared message bus which provides a single message broadcast operation. Thus, all component instances connected to the message bus are notified of an event message and react accordingly. The set of messages each component understands is fixed for each subcomponent type. The general EOB-internal message structure is the following: (message name, message origin, message parameters). The message parameters are typed using the OMG Interface Definition Language [13]. Messages can be sent either in a synchronous or asynchronous mode; in the first case the message sender expects a response message with the output parameters, in the second case the message sender can continue processing. An EOB contains a state manager (STATEMAN) implemented over some storage system not further specified. The STATEMAN is responsible for the
Fig. 3. General structure of an EOB and its subcomponents
persistent storage of the EOB state objects. Persistence is achieved by a systemprovided persistent root whose constituents, i.e., composite objects declared in the EOB state persist automatically. Every EOB contains a rule management (ECAMAN) subsystem responsible for management and execution of ECA rules expressing the behavior of EOB. ECA rules are used to implement the following functionality: Initiation of service execution by PE. Workflow service specification does not always correspond directly to the operations or services provided by PE. For example, a workflow service may be implemented by a series of database queries and the subsequent evaluation of the results. This processing logic is described in ECA rules which bridge the semantic gap between application operations and services provided (by EOB) in the WFS. Enforcement of task execution ordering and guarding of task execution conditions. The workflow schema expresses various dependencies among workflow tasks. These dependencies are expressed by composite events and by the rule actions which generate new events which eventually trigger the subsequent tasks. ECA rule conditions can be used to express task execution constraints which can be evaluated against task execution results available through event parameters. Exception and failure handling. ECA rules are used to specify failure handling. They can also express recovery policies. Rule execution by the ECAMAN is performed as follows: 1. An event arrives at the delivery interface of the ECAMAN. 2. At some point in time, the ECAMAN process examines the oldest event in the delivery queue by sending EDI the message peek(), and determines the set of active rules that are fired by this event.
3. All fired rules are inserted into a persistent pending evaluation queue, where the relative order of the rules depends on the defined rule priorities. If no rule priorities are defined, the queueing order depends on the specification order or rule creation timestamp (starting from the oldest rule first). 4. The conditions of the rules in the queue are evaluated sequentially. For each rule whose condition is true, the action will have to be executed. 5. A reference to such rules is inserted into a persistent pending execution queue. Rules whose conditions are evaluated to false are removed from the pending evaluation queue. 6. The event is removed from the EDI by sending it a consume() message. Each rule in the pending execution queue is then examined as follows: if the event is a request and the rule action produces a reply event, then a confirmation event is posted to the WFS through the EPI with the message put(). This actually guarantees that a reply or exception must be generated by the EOB at some later time. Although only a single rule can generate a reply, multiple rules may be triggered by the same request, without however producing a subsequent reply. 7. The rule actions in the pending execution queue are executed sequentially. After action execution is completed, each rule is removed from the pending execution queue. If an action produces a reply event, this is posted to the WFS through the EPI. 8. When the pending execution queue is empty, the next event can be examined from the delivery interface. Interaction with PE is implemented by the PE management (PEMAN) subsystem. In general, the PEMAN implements the access to the functionality of PE by using some wrapping technique to access the external systems; it may alternatively implement the desired functionality itself in some programming language without aceesing other software. The PEMAN however understands a set of operations which are called in ECA rule actions. It implements the automation and access point traits of the PE connotation (see above). As already mentioned, the PEMAN may wrap an external system (e.g., a database application), implement system functionality (e.g., store an event log), or provide a user interface (e.g., the monitoring interface). When the PEMAN directly implements desired functionality, the WFS infrastructure plays the role of a white-box PE for a workflow task. The implementation of different wrapping techniques is a problem which has been addressed by a large body of research; in our work we concentrate on the issues relevant to the use of these wrapping techniques within a reactive component-based WFS architecture. In the case of the monitoring EOB, the component has to be registered as a listener to all events which denote workflow situations to be monitored (e.g. all service request events). It reacts to occurrences of these events by changing its internal state to visualize the execution state in a defined way (e.g. change the color of a graphical screen element).
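The ECAMAN processing loop of steps 1-8 can be condensed into the following sketch. It is deliberately simplified: in-memory queues replace the persistent pending-evaluation and pending-execution queues, rule priorities are ignored, and the EDI/EPI interfaces are reduced to plain function calls, so this illustrates the control flow rather than the actual implementation.

```python
from collections import deque

class ServiceRequest:
    """Minimal stand-in for a service request event."""
    pass

class ECAManager:
    """Condensed sketch of the ECAMAN rule execution loop."""

    def __init__(self, rules, post_event):
        self.rules = rules            # each rule: event_type, condition, action, produces_reply
        self.post_event = post_event  # stands in for posting through the EPI
        self.pending_eval = deque()   # simplified, non-persistent queues
        self.pending_exec = deque()

    def process(self, event):
        # steps 2-3: determine the fired rules and queue them (priorities ignored here)
        for rule in self.rules:
            if rule["event_type"] == type(event).__name__:
                self.pending_eval.append(rule)
        # steps 4-5: evaluate conditions sequentially; keep rules whose condition holds
        while self.pending_eval:
            rule = self.pending_eval.popleft()
            if rule["condition"](event):
                self.pending_exec.append(rule)
        # step 6: confirm a request if some queued action will produce a reply
        if isinstance(event, ServiceRequest) and any(r["produces_reply"] for r in self.pending_exec):
            self.post_event(("confirmation", event))
        # step 7: execute actions sequentially, posting replies through the EPI
        while self.pending_exec:
            rule = self.pending_exec.popleft()
            result = rule["action"](event)
            if rule["produces_reply"]:
                self.post_event(("reply", result, event))
        # step 8: the next event can now be taken from the delivery interface

eca = ECAManager(
    rules=[{"event_type": "ServiceRequest",
            "condition": lambda e: True,          # guard on task execution conditions
            "action": lambda e: "service done",   # a real EOB would call the PEMAN here
            "produces_reply": True}],
    post_event=print,
)
eca.process(ServiceRequest())   # prints a confirmation, then a reply
```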
Table 3. The EOB typology based on the ECAMAN and PEMAN properties

EOB type | ECAMAN rule action execution | PEMAN functionality
caller | single-threaded | execution of envelope
uni-server | single-threaded | implementation of a wrapping method for an external server system
multi-server | multi-threaded | implementation of a wrapping method for an external server system
user | single-threaded | user worklist management and interaction
group | single-threaded |
uni-extender | single-threaded | implementation
multi-extender | multi-threaded | implementation
Depending on the properties of EOB subcomponents, different types of EOB can be composed. The typology—as determined from the properties of participating ECAMAN and PEMAN—is presented in Table 3. A complete WFS assumes the existence of a specialized EOB called an event engine (EVE). EVE is a distributed glue system which—among other functions— provides event composition and notification functionality to EOB according to the interest they have registered. EVE is described in more detail in [8] and enforces the semantics described in [20]. EOB in a WFS consume and produce events. The correctness of their behavior and thus the correctness of the operation of PE in the WFS can be examined over the resulting event history. An EOB has behaved correctly iff: – it has reacted to all requests it is responsible for; – it has reacted to all situations for which it defines workflow-specific rules and it is responsible for; and – it has produced a reply or an exception for each request it has confirmed. Finally, we note at this point that the correctness of the composed WFS depends on the maintenance of certain invariants which can be derived from participation traits of the involved PE. For example, dependency traits describe completeness constraints on the WFS architecture which can be examined at the EOB-implementation level. A detailed discussion of such invariants is omitted due to space considerations.
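The third correctness condition (every confirmed request must eventually receive a reply or an exception) is straightforward to check over an event history. The sketch below assumes a simple history format of (kind, EOB, request id) tuples, which is a simplification of the event messages described earlier.

```python
def unanswered_confirmations(history, eob):
    """Return the request ids the given EOB confirmed but never answered."""
    confirmed, answered = set(), set()
    for kind, who, request_id in history:
        if who != eob:
            continue
        if kind == "confirmation":
            confirmed.add(request_id)
        elif kind in ("reply", "exception"):
            answered.add(request_id)
    return confirmed - answered   # empty set iff the condition holds for this EOB

history = [
    ("request", "client", "r1"),
    ("confirmation", "monitor", "r1"),
    ("reply", "monitor", "r1"),
    ("request", "client", "r2"),
    ("confirmation", "monitor", "r2"),
]
print(unanswered_confirmations(history, "monitor"))   # -> {'r2'}
```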
5
Repository-Based Composition
The abstractions previously described support the composition of WFS. The main abstractions—EOB types and their services—can be most closely compared to software schema abstractions [10]. Software schemas are formal extensions to reusable components which, however, emphasize the reuse of abstract algorithms and data structures. The abstraction specification of the software schema is the
Fig. 4. Example of participation traits and PE in the buildtime repository
formal exposition of the algorithms and data structures, in service definitions, behavior and state of the EOB types. The abstraction realization corresponds to the source code produced when EOB types are instantiated. The development of effective and usable repositories for software artifacts represent an active field of research (e.g., [5,9]). In our work we have attempted to utilize the experience and results gained from this research within the context of WFS composition. The object-oriented build-time repository schema represents a complete mapping of WFS artifacts as discussed next. For each participation trait an object of type Participation Trait exists in the build-time repository. Its attributes are the terms—which can be composite objects in themselves—defined for the trait. A term contains a collection of references to objects of the type Processing Entity, each of which can be associated with an EOB object. This structure is exemplified for two traits in Fig. 4. Organizational units defined in the organizational schema are represented by Organizational Unit objects. Organizational relationship types are represented as OrgRel Type interface types. Each EOB is represented by a composite EOB object, whose structure and implementation depends on the EOB type. An appropriate interface type is defined for each EOB type. Subcomponents of an EOB are instances of STATEMAN, PEMAN, and ECAMAN types. These are collections of references to State Variable Type objects, Operation Type objects, and ECA rule objects. Implementations of PEMAN subcomponents are stored as symbolic links to object files. Services are represented by Service Interface objects. For each service definition a corresponding Request and Confirmation event type object is created. For each reply and exception event type a corresponding Reply and Exception event type object is created. Confirmation, reply, and exception event type objects have bidirectional references to their request event type object. Event type parameter objects are created for each parameter based on corresponding interface types. An interface type ECA Rule for ECA rules is defined. It contains references to the event type objects in its event clause. Composite event types implicitly defined in the rule event clause are represented by objects of the appropriate interface type. These are created when the rule object is stored in the repository. An ECA Rule also contains references to conditions. Conditions are sequences of clauses containing object references of the form ({AND|OR|NOT}, name, type,
logical operator) where name and type are references to corresponding objects defined in the repository, the logical operator must be defined for the type, and the boolean operators determine the relationship to the previous element. The precedence order of the expression determines the order of sequence elements. Actions are sequences of operation name clauses where operation name is a name of an operation type defined for the EOB, or a raise-operation for an event type. The corresponding objects can be retrieved through predefined queries. Additionally, actions of extender rules can contain symbolic links to source and object code files. During the creation of objects in the build-time repository, some objects can be in the in-development state (e.g., because they reference names of as yet undefined objects). This is allowed during the composition phase of a new WFS. However, when the WFS is instantiated, every participating object must be in the in-production state. This is ensured by the transactions which create objects in the runtime system and ensure that the WFS invariants are respected. Additional states can be defined if required by the development process.
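The clause sequence used for rule conditions can be pictured as follows; plain strings stand in for the repository object references, and the example condition itself is invented for illustration.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class ConditionClause:
    connector: Optional[str]   # "AND", "OR" or "NOT"; None for the first clause
    name: str                  # reference to a named object in the repository
    type_: str                 # reference to that object's interface type
    operator: str              # logical operator defined for the type

# Condition: amount > 1000 AND approved = false  (illustrative content only)
condition = [
    ConditionClause(None,  "amount",   "Integer", "> 1000"),
    ConditionClause("AND", "approved", "Boolean", "= false"),
]
```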
6
Related Work
The systematic development of workflow management applications has been recently identified as an important research issue. Consequently, some relevant publications have appeared which advance the issue beyond initial attempts focussing primarily on the expressiveness of workflow modeling. In [14], an object-oriented environment for the development of transactionbased workflow applications is provided whose emphasis lies on the reuse of process objects with predefined scheduling and transactional behavior. The—rather strong—assumption of the existence of a mapping between external operations and native operations is made. A stronger emphasis on application integration and the definition of a workflow application system architecture is presented in [17] where a type library for wrapping different applications is provided which are subsequently implemented by different execution objects. The provided support is focused on the implementation aspect and does not consider in depth the systematic reuse of software components. Similarly, in [21] a framework for the systematic integration of software tools in process-centered environments by using specialized wrappers is described. Although the authors only consider software development tools, their work is applicable to WFS. Compared to our work however, the emphasis lies on the implementation of the integration mechanisms and not on the definition of a reusable component-based system architecture. APEL [7] is a platform-independent environment for the development of process-oriented systems. It provides modeling abstractions for various aspects of such systems and maps them to low-level executable formalisms. The initial emphasis has been in covering a large spectrum of design activities; the authors state that support for systematic reuse is part of their intended future work.
Event-based approaches have been advocated by several authors [4,6,11] for workflow execution and cooperative information systems. In contrast to these efforts, events in our approach are conceptual and architectural constructs (instead of execution level constructs). In contrast to CoopWARE [11] (which considers services as the major abstraction), PE and their representation as EOBs are the most important abstractions on which the composition of WFS is based. Finally, we refer at this point to emerging component standards and especially to Enterprise JavaBeans [18] which proposes a generic component model for server-side components. This model is bound to a particular programming language and considers application-neutral implementation aspects. Our approach, in contrast, adresses domain-specific issues, attempts to provide support for the development of a specific class of applications, and is explicitly kept independent from a particular implementation language.
7
Conclusions and Evaluation
The systematic composition of WFS provides an efficient way to construct the large complex application systems that are typical in the workflow management domain. In this paper we described an approach for the reuse-based development of such applications: – we implement WFS by extending the functionality of a core event-based integration platform; – we provide mechanisms for the characterization and classification of PE; – we provide a component-based architectural framework for WFS; and – we support a repository-based development process which can benefit from extensive large-grain component reuse. Despite the promising initial experiences we have made with our system, it still remains to be seen if a reuse-based WFS development approach can become part of an industrial environment. As the main problems during the introduction of the approach we anticipate the initial population of the repository with process entities which actually have a reuse potential and the implementation of PEMAN components for various applications. Especially the second task is complex and requires the use of developer resources without an immediately visible returnon-investment. Furthermore, in our future work we plan to extend the system to support WFS evolution through workflow component versioning and a set of well-defined modification operations on EOB. These operations should respect the invariants defined in the WFS framework.
References 1. R. Allen and D. Garlan. A Formal Basis for Architectural Connection. ACM Trans. on Software Engineering and Methodology, 6(3), July 1997. 2. D.J. Barrett, L.A. Clarke, P.L. Tarr, and A.E. Wise. A Framework for Event-based Software Integration. ACM Trans. on Software Engineering and Methodology, 5(4):378–421, October 1996.
3. B. Boehm and W. Sherlis. Megaprogramming. In Proc. DARPA Software Technology Conference, Arlington, VA, April 1992. 4. F. Casati, S. Ceri, B. Pernici, and G. Pozzi. Deriving Active Rules for Workflow Management. In Proc. 7th DEXA, Zurich, Switzerland, September 1996. 5. P. Constantopoulos, M. Jarke, J. Mylopoulos, and Y. Vassiliou. The Software Information Base: A Server for Reuse. VLDB Journal, 4(1):1–43, 1995. 6. G. Cugola, E. Di Nitto, and A. Fuggetta. The JEDI Event-based Infrastructure and its Application to the Development of the OPSS WFMS. Technical report, CEFRIEL, Politecnico di Milano, 1998. 7. S. Dami, J. Estublier, and M. Amiour. APEL: a Graphical yet Executable Formalism for Process Modeling. In E. Di Nitto and A. Fugetta, editors, Process Technology. Kluwer Academic Publishers, 1997. 8. A. Geppert and D. Tombros. Event-Based Distributed Workflow Execution with EVE. In Proc. IFIP Int’l Conf. on Distributed Systems Platforms and Open Distributed Processing (Middleware ’98), Lake District, England, September 1998. 9. S. Henninger. An Evolutionary Approach to Constructing Effective Software Reuse Repositories. ACM Trans. on Software Engineering and Methodology, 6(2), April 1997. 10. C.W. Krueger. Software Reuse. ACM Computing Surveys, 24(2):131–183, 1992. 11. J. Mylopoulos, A. Gal, K. Kontogiannis, and M. Stanley. A Generic Integration Architecture for Cooperative Information Systems. In Proc. 1st CoopIS, Brussels, Belgium, June 1996. 12. O. Nierstrasz and L. Dami. Component-Oriented Software Technology. In O. Nierstrasz and D. Tsichritzis, editors, Object-Oriented Software Composition, pages 3–28. Prentice Hall, London, 1995. 13. The Common Object Request Broker: Architecture and Specification. Revision 2.0. Object Management Group, July 1995. 14. M. Papazoglou, A. Delis, A. Bouguettaya, and M. Haghjoo. Class Library Support for Workflow Environments and Applications. IEEE Transactions on Computers, 46(6), June 1997. 15. R. Prieto-Diaz and P. Freeman. Classifying Software for Reusability. IEEE Software, 4(1), 1987. 16. M.T. Schmidt. The Evolution of Workflow Standards. IEEE Concurrency, June 1999. 17. H. Schuster, S. Jablonski, P. Heinl, and C. Bussler. A General Framework for the Execution of Heterogenous Programs in Workflow Management Systems. In Proc. 1st CoopIS, Brussels, Belgium, June 1996. 18. Sun Microsystems. Enterprise JavaBeans Specification Version 1.0, March 1998. 19. C. Szyperski. Component Software: Beyond Object-Oriented Programming. Addison-Wesley, 1997. 20. D. Tombros, A. Geppert, and K.R. Dittrich. Semantics of Reactive Components in Event-Driven Workflow Execution. In Proc. CAiSE, Barcelona, Spain, June 1997. 21. G. Valetto and G. Kaiser. Enveloping Sophisticated Tools into Computer Aided Software Engineering Environments. Journal of Automated Software Engineering, 3(3-4), 1996. 22. J. Widom and S. Ceri. Active Database Systems. Morgan Kaufmann, 1996.
Defining Components in a MetaCASE Environment Zheying Zhang Department of Computer Science and Information Systems University of Jyväskylä, PL 35, FIN-40351 Jyväskylä, Finland [email protected]
Abstract. In this paper we describe how to improve method reusability in a metaCASE environment called MetaEdit+. The suggested component based approach helps unify design artefacts into components with explicit interfaces and meaningful context descriptions. We describe a method artefact from three perspectives: concept, content, and context. We create a component concept by using a hierarchical facet-based schema, and represent contextual relationship types by using definitional and reuse dependency, usage context, and implementation context links. This is the first attempt to explicitly define components into a metaCASE environment.
1
Introduction
Manufacturing industries learnt long ago the benefits of moving from custom development to an assembly of pre-fabricated components to speed up time-to-market and reduce costs. In the same manner, information system development can be significantly improved if applications can be quickly assembled from pre-fabricated software components. This strategy, called component-based development (CBD), has been a goal for nearly two decades. The recent focus on component-based systems has shown that CBD allows better quality, faster development, and effective change management [1]. It represents a further step in the industrialisation of information system development, and will become the next wave in application development. Introducing components in information system development offers potential to reuse code and other design artefacts, and will lead to a faster assembly of applications. Some component-based CASE tools, e.g. SoftModeler [2], ObjectiF [3], and Rational Rose 98 [4], provide support for component based analysis and design. These tools expand beyond the functionality of current metaCASE environments that were originally designed without component thinking. Because metaCASE environments can be used for both methodology specifications and methodology supported system design activities, they manipulate more diverse artefacts than CASE tools. Most of these artefacts are used as independent task units, and relate to the others by semantic definitions or dependencies. These artefacts and the "knowledge" embedded in them form good sources for reuse, especially across projects. It is therefore feasible to introduce component thinking into a metaCASE environment. To our knowledge, there are no studies of how to introduce components in environments supporting method engineering. This is the first attempt to study component definition
for all design artefacts. It will bring the benefits of CBD into the metaCASE environment. The most significant one is that application development can be improved by quickly assembling components [5]. Although some metaCASE environments, e.g. MetaEdit+ [6], provide limited facilities to support reuse activities in the method engineering and software design process, the application of the component concept will further improve reuse. In this paper, we take MetaEdit+ as the target environment of our study, but the same principles apply to other environments as well. This paper describes a framework for scalable tools and techniques that increase artefact reusability. We study features of design artefacts and the possibility of packaging them into "components". These artefacts are then further defined as components from three perspectives: concept, content, and context. These perspectives are based on the 3C model [7]. In the process of defining components, possible solutions to component representation in MetaEdit+ are outlined.
2 Components
A component can be thought of as a unit of independent deployment that can be reused by a third party. ECOOP'96 (the European Conference on Object-Oriented Programming) offers the following definition of a software component: "A software component is a unit of composition with contractually specified interfaces and explicit context dependencies only. A software component can be deployed independently and is subject to composition by third parties." A component thus has three basic features: explicit interfaces, a context of services, and the design for reuse. Among these, the explicit interfaces, the context of services, and the unit of composition form a complete component; the design for reuse is the purpose and the result of applying a component. In industry, a component forms a coherent package of software that can be independently developed and delivered, while in a metaCASE environment a component can be any design artefact, e.g. a metamodel or a chunk of source code. The metamodel component is different from the method fragment as discussed in the method engineering language (MEL) [8]. A method fragment can be used to describe every aspect of a method, but it does not integrate the concept and the relationships among method fragments into a unified whole. The metamodel component, in contrast, has an explicitly specified interface and relationships with the environment. Such a structure can ease the management of metamodels and improve reusability. A good specification of a component helps users understand it and reuse it [9]. One of the attempts to characterise reusable software components is the 3C model [10]. It provides a metamodel to describe a software component using three distinct aspects of a reusable component: concept, content and context. The concept forms the abstraction captured in a component; the content is the implementation of the concept; and the context describes the environment in which a component interacts. The context includes the conceptual context, the operational context and the implementation context. These three aspects form the basic elements and features of software components currently used in industry, e.g. object components in CORBA.
3 Components in MetaEdit+
A metaCASE environment can be used to build system development tools. Instead of providing one fixed software engineering environment, as most traditional CASE (computer aided software engineering) tools do, a metaCASE environment also provides facilities for method engineering, which involves the design, construction and adaptation of methods, techniques and tools for various system development contingencies [11]. Therefore, a metaCASE tool can generate a variety of CASE tools, depending on the methodology used. Several metaCASE tools have been developed during the last decade, e.g. commercial products such as MetaEdit+ (MetaCASE Consulting), MethodMaker (Mark V) and ToolBuilder (Sunderland/IPSYS/Lincoln), and research prototypes such as MetaView (Alberta) and MetaGen (Paris). MetaEdit+ is a fully configurable metaCASE environment that provides functionality for dual processes: CASE and CAME (computer aided method engineering) [6, 12-15]. MetaEdit+ offers the GOPRR [12] metamodelling language to implement the CAME functionality. It provides concepts, rules, and semantics to specify, generate, and customise methodologies as sets of metamodels that can be further instantiated to models that present solutions for an application domain. GOPRR is an acronym for Graph, Object, Property, Relationship, and Role. These types form the primary meta data types that can be used to model artefacts for methodologies. The conceptual GOPRR modelling constructs are shown in Table 1.
Table 1. GOPRR metamodelling language (meta data types and their descriptions)
Graph (G): A graph is a specification of a method (technique). It is an aggregation concept that contains all other GOPRR meta types. E.g. the definition of a State Transition Diagram is a graph.
Object (O): An object is a conceptual thing in the universe around us. E.g. State, Start, and Stop are objects in the definition of a State Transition Diagram.
Property (P): Properties describe/qualify characteristics associated with other meta types. E.g. State name is a property of State.
Relationship (R): A relationship forms an association between two or more objects. It connects objects through roles. E.g. Transition is a relationship used to connect two States.
Role (R): A role is a link between an object and a relationship, specifying how an object participates in a relationship. E.g. From and To are two roles indicating that a Transition relationship is from one State to another State.
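As an illustration only (MetaEdit+ itself is not implemented this way), the GOPRR meta data types of Table 1 can be sketched as a small set of Python classes; the State/Transition example is taken from the State Transition Diagram description above.

```python
# Hypothetical sketch of the GOPRR meta data types of Table 1;
# an illustration, not MetaEdit+'s actual implementation.

class Property:                       # P: characteristic of another meta type
    def __init__(self, name):
        self.name = name

class Object:                         # O: conceptual thing, e.g. State
    def __init__(self, name, properties=()):
        self.name = name
        self.properties = list(properties)

class Role:                           # R: how an object participates in a relationship
    def __init__(self, name, obj):
        self.name = name
        self.object = obj

class Relationship:                   # R: association connecting objects through roles
    def __init__(self, name, roles=()):
        self.name = name
        self.roles = list(roles)

class Graph:                          # G: aggregation of all other GOPRR meta types
    def __init__(self, name, objects=(), relationships=()):
        self.name = name
        self.objects = list(objects)
        self.relationships = list(relationships)

# The State Transition Diagram example from Table 1:
state = Object("State", [Property("State name")])
transition = Relationship("Transition",
                          [Role("From", state), Role("To", state)])
std = Graph("State Transition Diagram", [state], [transition])
```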
Based on the GOPRR metamodelling language, MetaEdit+ accordingly involves a diversity of artefacts for methodology specification and information system design. For example, in the methodology specification process, the object State, the relationship Transition, and other types of data are artefacts that form a larger artefact in the UML methodology called a State Diagram. In the same manner, in the system design process, a Phone call state diagram has different states such as Busy, Dialling, DialTone, and so on, which are independent artefacts with their own features. These artefacts can be regarded as encapsulated units, but normally without complete interfaces and contextual relationship representations. They are a sort of "semi-component". We temporarily call them components, although they are not yet components in the strict sense. In MetaEdit+, what we call components are sets of design "models", represented in graphic "notations"; metamodel artefacts of methodology specifications; and chunks of code that represent services. Meanwhile, due to the symmetric processes supported in MetaEdit+, they differ in terms of information level, granularity, and representational features.
3.1 Information Levels
Artefacts in MetaEdit+ can be grouped into two levels: the model level and the metamodel level. The model level information covers a project development lifecycle. It consists of design artefacts with different forms of representation for different development stages, such as diagrams for system design and source code for the final implementation. Some examples of design artefacts in MetaEdit+ are shown in Fig. 1, which represents a Phone call state diagram in the form of a diagram, a table, and a matrix. In addition, the state DialTone is represented as a composite artefact.
Fig. 1. Model level artefacts to specify a Phone Call State Diagram in MetaEdit+
The metamodel level information forms a semantic specification of system development methodologies by using the GOPRR metamodelling language. These metamodels are specified in form-based tools, and their graphic representations are specified in the Symbol Editor. As shown in Fig. 2, the Object State and its symbol definition, as well as other artefacts and semantics to construct the Graph State Diagram, such as the binding definition, decomposition definition, and explosion definition, are specified in different windows. State Diagram is part of the specification of the UML methodology. We can see from the Graph Tool that State Diagram belongs to the project UML.
Fig. 2. Form-based tools to specify State Diagram in UML in MetaEdit+
It is known that a methodology provides particular disciplines and techniques to support information systems development. A system development process cannot deviate from the methodologies that support it. The artefacts on these two information levels thereby have a close type-instance relationship: design artefacts on the model level are instances of metamodels. The metamodel provides the type of meta data from which design artefacts are instantiated. For example, the Phone call state diagram is an instance of the State diagram of the UML methodology.
3.2 GOPRR Based Component Granularity
Components in MetaEdit+ have different sizes. A part of the definition of the state DialTone can be taken as one component. An instance of a state diagram, e.g. the Phone call state diagram, can be taken as a component as well. Accordingly, components can be grouped into different granularity levels. Granularity specifies the size of the "manipulation" or "retrieval" units under consideration. It is derived from representational or operational semantics that is based on the underlying modelling concepts and their representations. A fine-grained component includes a detailed and formal specification, such as a chunk of source code. A coarse-grained component includes more general information about a sub-system, or the whole of an application, and hides away the detailed specification and implementation, such as a diagram. A coarse-grained component can be detailed into one or several fine-grained sub-components with a coherent syntax and semantics. Generally, the component granularity implies how
effective CBD can be. As components become coarser grained, the reuse opportunities decrease, while the productivity increases [5]. In MetaEdit+, the basic meta data types are applied to model various methodologies. These can be further instantiated to specify an information system development project either in part or as a whole. We thereby distinguish between a component unit, a graph, and a project (a containment sketch follows the list below).
• Component units are primary data types of MetaEdit+. They are non-property data types, like an object, a relationship, a role, or their instances. For example, State in Fig. 2 is an object type component, and DialTone in Fig. 1 is an instance of State in the project Phone call design. Component units form the smallest, atomic level of components in MetaEdit+. Any further decomposition would not be worth the effort of storage, retrieval, and manipulation.
• A graph forms a collection of objects, relationships, roles and properties. It provides a representation of a technique on the metamodel level, and results in diagrams describing specific tasks within a specific application domain. For example, a specification of a state diagram is a graph level component.
• A project is a design product, or a plan to produce it. Like a graph, it has a dual meaning. On the metamodel level, a project is a methodology including a set of techniques and rules that guide system development, such as the UML methodology. On the model level, it is represented as a system development project consisting of design models and code.
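Continuing the illustrative Python sketch above (again hypothetical, not MetaEdit+ code), the three granularity levels can be seen as a simple containment hierarchy, with model-level components as instances of metamodel-level ones:

```python
# Hypothetical containment sketch of the three granularity levels.
# A project aggregates graphs; a graph aggregates component units.
# Names such as "Phone call design" follow the running example in the text.

class ComponentUnit:                  # object, relationship, role, or an instance of one
    def __init__(self, name, of_type=None):
        self.name = name
        self.of_type = of_type        # type-instance link to a metamodel-level unit

class GraphComponent:
    def __init__(self, name, units=(), of_type=None):
        self.name = name
        self.units = list(units)
        self.of_type = of_type        # a model-level diagram instantiates a technique

class Project:
    def __init__(self, name, graphs=()):
        self.name = name
        self.graphs = list(graphs)

# Metamodel level: the State Diagram technique of the UML methodology.
meta_state = ComponentUnit("State")
state_diagram = GraphComponent("State Diagram", [meta_state])
uml = Project("UML", [state_diagram])

# Model level: the Phone call state diagram is an instance of State Diagram.
dial_tone = ComponentUnit("DialTone", of_type=meta_state)
phone_call = GraphComponent("Phone call state diagram", [dial_tone],
                            of_type=state_diagram)
phone_project = Project("Phone call design", [phone_call])
```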
4 Describing Components in MetaEdit+
To benefit from CBD, we should define a uniform interface to represent design artefacts in MetaEdit+ and find a reasonable way to display the contexts among components, even though the original design of MetaEdit+ does not include the component idea. Since the 3C model [7] provides the basic elements of components used in industry, we apply it to describe components. The model is represented in Fig. 3. According to the 3C model, a component in MetaEdit+ can be described using three aspects: concept, content, and context.
Fig. 3. Component model
The concept is realised by an interface specification. It provides abstract information about a component's functionality and associated semantics. Since a component interface defines the component's access point and indicates its intended usage, which is what users are concerned with, the way the interface is specified plays an important role in the component representation. The content forms an implementation of the concept. It is the substantial part of a component and realises the features and behaviours described in the interface. Following the specification on the concept and content level, the context defines the contextual dependencies among components and specifies the "domain of applicability". It is the most beneficial and most complicated part of a component specification, since the context provides more detailed information from a wider perspective that helps users understand and reuse a component. The three aspects together represent a software component as an organic whole. Since components are not limited to source code, the representation of concept, content, and context needs to be adapted to the metaCASE environment and its artefacts. In the following, we discuss these three aspects in the context of MetaEdit+.
4.1 Concept
The component concept provides abstract information about its functionality and associated semantics, like pre- or post-conditions. In the same environment, components normally have the same concept schema. In MetaEdit+, however, since components at different granularity levels represent information on different abstraction levels, it is difficult to outline a uniform interface. Generally, besides general information such as the name, author, version and so on, a component unit has no explicit service to be represented in its interface. For example, DialTone is a state with information such as the name, the creation date, or the version. It is difficult to describe its service, although DialTone is a component in the Phone call state diagram that specifies a call session. On the other hand, graph level components can express a concrete function and imply the semantics of their usability. Their interface thereby includes properties and an explicit functionality specification, such as the Phone call state diagram that specifies the state transitions in a call process. Project level components represent organisational information and application domain information. Their interface thereby includes a project purpose and its development strategies. On each granularity level, components have different abstraction schemata to represent their interface. Moreover, since components in the same project have close dependencies, they can share some common conceptual descriptions in the interface. To effectively represent the component concept and to avoid redundancy, we need a hierarchical concept description schema. A hierarchical concept description schema forms an extension of the faceted schema proposed by Prieto-Díaz [16, 17]. It includes several faceted schemata. A faceted schema uses a number of predetermined perspectives called facets to represent the concept. A facet forms a viewpoint of a component. The viewpoint definition may include functions that components perform, their outputs, the application domain, and so on. A value for a facet is a facet value. The set of facet values for a component
forms a facet descriptor, which represents the component concept. Generally, the facet-based schema works effectively in a repository when the collection of components is very large and growing continuously, and when there are larger groups of similar components [18]. This is exactly the form that a MetaEdit+ repository has. The hierarchical structure between the schemata derives from the distinct component concept representations at different granularity levels. The same schema specifies the concept of components on the same granularity level, but is distinguished from the schemata of components on other granularity levels. By using the hierarchical concept description schema, the integration relationship between a fine-grained sub-component and its interrelated coarse-grained super-components becomes tractable. The hierarchical concept description schema is represented in Fig. 4. The faceted schema of components on both the graph level and the component unit level includes a facet "SuperComp" which is used to trace their coarse-grained super-components and thus to access the shared conceptual information.
Fig. 4. The traceability in a hierarchical concept description schema
Such a hierarchical concept description schema not only represents the component concept, but also avoids information redundancy. The facet descriptor provides a possible schema with which to specify component query criteria. In the following, we define facets that specify the concept of components on the project level, the graph level, and the component unit level.
Project Level. A project level component is coarse-grained. Like a framework, it helps organise a way to deliver a methodology specification, or a service in a specific application domain. Normally, the component forms an assembly of (graph level) components with associated semantic definitions, which bind them together. The component interface includes a unique name and a general specification of its functionality and semantics. The facets of a project are defined as a set:
{Name, Application domain, Methodology, NoOfGraph}
E.g.
{UML, method engineering, GOPRR, 9}
{Phone call design, User interface, , _}
In the facet descriptor, the application domain illustrates the field(s) to which a project belongs and implies its services; the methodology represents the disciplines and semantics applied in a project; and NoOfGraph indicates the scale and complexity of a project. Generally, the more graphs involved in one project, the more
complicated the project will be. Following the facet descriptor, there are two examples: one describes an implementation of the UML methodology by using the GOPRR metamodelling language, and the other describes a phone call design project, which uses both the UML methodology and an in-house methodology "A" for system design.
Graph Level. A graph is smaller in scope than a project, but it is a complete specification of services, such as the specification of static objects and their relationships, the input-output transformations, and the activities concerned with time and state changes. These specifications cover the three basic aspects of information modelling ontology: structure, function, and dynamic behaviour [19-21]. They thus form an important facet to describe the abstraction dimensions. Meanwhile, since system development goes through several stages which may be supported by different techniques, the graph components must distinguish between different abstraction levels, e.g. analysis, design, and implementation. The abstraction level and the technique can be taken as facets as well. The facets of a graph level component can be defined as a set:
{Name, SuperComp, Type, Abstraction aspect, Abstraction level, NoOfObject}
E.g.
{State Diagram, UML, GOPRR, Behaviour, Design, 7}
{Phone call state diagram, Phone call design, State Diagram, Behaviour, Design, 11}
Each graph has a unique name. SuperComp forms a necessary facet to trace the integration relationship among components that are in the same project, but on different granularity levels. Examples of State Diagram and Phone call state diagram are represented above. The Phone call state diagram is the model level instance component of the metamodel level component State Diagram.
Component Unit Level. A component unit is a definition of an object, a relationship, a role, or an instance of such a meta data type. They are the primary data in the GOPRR specification. Their facets can thereby be defined using GOPRR properties, as shown in the following set:
{Name, SuperComp, Type, Property, Behaviours}
E.g.
{State(UML), State diagram, Object, <State name, Variables, Internal actions, Symbol transparent?>, _ }
{DialTone, Phone call state diagram, State(UML), _, play dial tone}
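To make the facet descriptors concrete, the following sketch (hypothetical; MetaEdit+ does not store facets this way) represents them as Python dictionaries and shows how a component could be retrieved by facet values and traced to its super-components via the SuperComp facet:

```python
# Hypothetical sketch of facet descriptors on the three granularity levels
# and of a simple faceted retrieval, using the examples given in the text.

components = [
    # project level: {Name, Application domain, Methodology, NoOfGraph}
    {"Name": "UML", "Application domain": "method engineering",
     "Methodology": "GOPRR", "NoOfGraph": 9},
    # graph level: {Name, SuperComp, Type, Abstraction aspect, Abstraction level, NoOfObject}
    {"Name": "State Diagram", "SuperComp": ["UML"], "Type": "GOPRR",
     "Abstraction aspect": "Behaviour", "Abstraction level": "Design",
     "NoOfObject": 7},
    # component unit level: {Name, SuperComp, Type, Property, Behaviours}
    {"Name": "State(UML)", "SuperComp": ["State Diagram"], "Type": "Object",
     "Property": ["State name", "Variables", "Internal actions"],
     "Behaviours": []},
]

def retrieve(repository, **facet_values):
    """Return all components whose facets match the given facet values."""
    return [c for c in repository
            if all(c.get(f) == v for f, v in facet_values.items())]

def super_components(repository, name):
    """Follow the SuperComp facet upwards to the coarse-grained components."""
    by_name = {c["Name"]: c for c in repository}
    result, todo = [], list(by_name.get(name, {}).get("SuperComp", []))
    while todo:
        parent = todo.pop()
        result.append(parent)
        todo.extend(by_name.get(parent, {}).get("SuperComp", []))
    return result

print(retrieve(components, Type="GOPRR"))          # -> the State Diagram descriptor
print(super_components(components, "State(UML)"))  # -> ['State Diagram', 'UML']
```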
As can be seen, State(UML) is an object type component with properties such as State name, Variables, and so on. DialTone is an instance of the type component State(UML); it is part of the Phone call state diagram and has the behaviour of playing a dial tone.
Summary. In the last three sub-sections, we have defined the facets. For a detailed explanation of the facets and their values, please refer to the appendix. We should note that it is impossible to predefine all facets and values for components, since the number and type of components is extensible as more projects are specified in MetaEdit+. Therefore, the facets and their values are not exhaustive, and can be extended at any time. Meanwhile, a common problem with the facet-based approach lies in mishandling synonyms and misinterpreting words that have some lexical ambiguity [22]. To solve this problem in the component interface
representation in MetaEdit+, the facets can be specified using a limited vocabulary, and the specifications can be chosen from a controlled vocabulary list.
4.2 Content
The content is the substantial part of a component; e.g., the content of a software component includes an implementation of the functionality or service specified in its concept part. In MetaEdit+, the component content varies from one context to another. On the metamodel level, the content is a collection of GOPRR based concepts and syntax specified in a method. On the model level, a component can be represented as a diagram, matrix, table, or source code.
4.3 Context
The context forms specific relationship types among components and with the development environment in which the component is designed. It is more complicated than the concept and the content representation, but it largely benefits component reuse. In [7], the context of a reusable software component is further divided into:
• Conceptual context – the conceptual relationships between the component and other components,
• Operational context – the characteristics of the manipulated data, and
• Implementation context – the implementation dependency relationships between the component and other components.
Each aspect of a component context in MetaEdit+ is a little different from a software component context. It can be represented as follows.
Conceptual Context. With the growing number of concepts and the increasing interdependency in an information system development methodology, models for system analysis and design become more and more complex, and conceptual dependencies between components exist. There are varied conceptual dependencies in different contexts, e.g. a definitional dependency in the component definition description, and a reuse dependency in the reuse context. A definitional dependency from a concept CPx to a concept CPy is created if CPx is used in the definition of CPy [23]. In short, it represents integration relationships among components. MetaEdit+ uses Info Tools to represent such definitional dependencies between selected components or graphs (see Fig. 5). The left-side figure represents a definitional dependency of the metamodel level component State, which is part of the graph level component State Diagram. It includes property type component units, e.g. State name and Internal action, in its definition. The right-side figure represents a definitional dependency of one instance of State on the model level, called Dialling. As shown in the list, Dialling is defined in the project mcc in three representations (a graph, a matrix, and a table) of the model Phone call state diagram.
Fig. 5. Definitional dependency representation for the selected type/instance
Fig. 5 represents definitional dependencies of graph level and component unit level components. The same representation can be extended to project level components as well. It specifies a definitional relationship by relating a component's interdependent super- or sub-components. Such a relationship enhances users' understanding of a component as a whole, and thereby helps them select coherent sub-sets of a component and choose suitable ways of reuse (e.g. a deep copy for components with significant changes or a shallow copy for minor changes). When a component is built by reuse, a reuse dependency specifies the reuse context between components. A reuse dependency relates a component with its original component(s) by specifying reuse-associated information, such as the type of reuse, the way of reuse, modifications to the component concept and content, as well as the experience collected in reuse processes. This contextual information effectively helps users study component features and outlines the possibility of reuse. For example, by inspecting the reuse dependency of the object component State(UML) in the UML methodology, we learn that it is a shallow copy of the component State in the OMT methodology, that further properties such as Internal actions were added in State(UML), and that it is a horizontal reuse [24]. The reuse context implies that State(UML) refers to State, and any modifications on State will affect State(UML) as well.
Operational Context and Usage Context. An operational context defines characteristics of the manipulated data, such as the types and operations available on the content of a component. It applies to code level components and is organised to derive executable code. Since components in MetaEdit+ are mainly design artefacts, an operational context defined only on source code would limit the component information representation in MetaEdit+. We thereby define a usage context to specify the usage information among models. Besides the concise functionality and the associated semantics abstracted in the component concept, a usage context records more detailed component usage information. It includes functional interdependencies among graph level components, suitable application domains and accumulated experience, such as problems that occurred and possible solutions. The usage context thus provides enriched information on using a component for the user's reference.
Implementation Context. An implementation context describes how a component depends on other components for its implementation [9]. It provides a traceable relationship between components at the system analysis, design, and implementation stages, and helps users trace a component's implementation in both the forward and the backward direction. We also call it a traceability feature. In MetaEdit+, the metamodel level components are responsible for methodology specification and construction; there is no implementation context between metamodel components. In contrast, the model level components constitute the system development from the analysis stage to the implementation stage. Based on such a continuous process, we can define the component implementation context from its origins, through its development and specification, to its subsequent deployment and use, and through periods of ongoing refinement and iteration in any of these phases. Such information provides a lifecycle view of a component, helps users control the project as a whole, and finally improves the opportunity for reuse.
Summary. A component context is a complementary, but important part of a component description, providing enriched contextual information: definitional dependencies, reuse dependencies, the usage context and the implementation context. Without specifying the component context, the concept and the reuse of components remain ambiguous. In MetaEdit+, Info Tools exist to represent definitional dependencies among components; other contextual information may be represented in a similar manner. Moreover, we have hyperlink tools that represent the design rationale. The rationale provides a basic understanding of why an artefact has been designed in a specific way, and may include information on, e.g., requirements, assumptions, decisions, and alternative solutions [24]. Such hyperlink-based tools allow free hypertext linking of components. In the same way, we can define different types of contextual links to provide the access mechanism. By using the hyperlink functionality, component creators and users can easily specify and navigate through the contextual information.
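As a rough illustration of how such contextual information might be attached to components (the actual hyperlink tools of MetaEdit+ work differently in detail), the four kinds of context discussed in this section can be modelled as typed links between named components:

```python
# Hypothetical sketch of typed contextual links between components.
# Link types follow Section 4.3: definitional, reuse, usage, implementation.

LINK_TYPES = {"definitional", "reuse", "usage", "implementation"}

class ContextLink:
    def __init__(self, kind, source, target, note=""):
        assert kind in LINK_TYPES
        self.kind, self.source, self.target, self.note = kind, source, target, note

links = [
    # definitional dependency: State is used in the definition of State Diagram
    ContextLink("definitional", "State", "State Diagram"),
    # reuse dependency: State(UML) is a shallow copy of State in OMT (horizontal reuse)
    ContextLink("reuse", "State(OMT)", "State(UML)",
                note="shallow copy; Internal actions added"),
]

def context_of(component, kind=None):
    """Navigate the contextual links of a component, optionally by link type."""
    return [l for l in links
            if component in (l.source, l.target) and (kind is None or l.kind == kind)]

for link in context_of("State(UML)", kind="reuse"):
    print(link.kind, link.source, "->", link.target, "|", link.note)
```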
5 Conclusion
It is commonly accepted that components bring value to information system development. In this paper, we have tried to extend component thinking to metaCASE environments by defining components in MetaEdit+. Accordingly, all types of design artefacts can be used as components after the necessary re-specialisation. Component thinking brings several advantages to metaCASE tools; we identified three in particular. First, the definition of a hierarchical concept description makes components more comprehensible. The components have clearly defined interfaces based on predefined facet descriptors, which ease component search. Features of the project components can be shared across interfaces through a "SuperComp" facet, which avoids information loss and repetition. Second, the component context definition enhances component reuse. Different types of context represent conceptual relationships between the component and its environment, which makes it easy to
capture the complementary status information in a project, its dependence on other components, and its reusability. Such descriptions enhance component reusability in a metaCASE environment. Finally, component-based thinking enhances the usefulness of a metaCASE environment. MetaEdit+ has diverse facilities to describe the contexts among components and to support reuse activities. However, none of them are organised and applied in a systematic manner. Introducing components lets us rethink the existing facilities and improve them systematically. Although the recent wide-scale emergence of the component concept and the accompanying CBD has fostered increased attention to components in industry as well as in academia, CBD in metaCASE environments is new. Besides the potential benefits, several problems arise when introducing components. Although we defined an interface for each component, the distinction between interface and content is sometimes ambiguous for unit level components: the content of a component unit may be as simple as the features represented in its interface. Meanwhile, although we have already planned to implement the component framework in MetaEdit+, systematic technical support remains a challenge. Components are not limited to chunks of source code and have versatile forms and characteristics. Therefore, we still need more studies on how to apply different types of components in the same environment.
References
1. Slootman, F.: A Blueprint for Component-Based Applications. Compuware Corporation. URL: http://www.compuware.com/products/uniface/station/reading/ind_blue.htm (1999)
2. Softera: SoftModeler. Softera Ltd. URL: http://www.softera.com/products.htm (1997)
3. microTOOL GmbH: microTOOL - making IT better. URL: http://www.microtool.de/e_index.htm (1999)
4. Garone, S., Kehlenbeck, A.: Rational Rose98: Another Major Step Toward Component-Based Development. Rational Software Corporation. URL: http://www.rational.com/products/rose/reviews/analysts/index.jtmpl (1998)
5. Short, K.: Component Based Development and Object Modeling. Sterling Software (1997)
6. Kelly, S., Lyytinen, K., Rossi, M.: MetaEdit+: A Fully Configurable Multi-User and Multi-Tool CASE and CAME Environment. In: Proceedings of the 8th International Conference CAiSE'96. Springer-Verlag (1996) 1-21
7. Tracz, W.: Implementation Working Group Summary. In: Reuse in Practice Workshop Summary (1990) 10-19
8. Harmsen, A.F.: Situational Method Engineering. University of Twente (1997) 310 p.
9. Whittle, B.: Models and Languages for Component Description and Reuse. ACM SIGSOFT Software Engineering Notes 20(2) (1995) 76-87
10. Edwards, S.H.: Towards a Model of Reusable Software Subsystems. In: Proc. of the 5th Annual Workshop on Software Reuse (1992)
11. Brinkkemper, S.: Method Engineering: Engineering of Information Systems Development Methods and Tools. Information & Software Technology 38(6) (1996) 275-280
12. Smolander, K.: GOPRR: A Proposal for a Meta Level Model. University of Jyväskylä, Finland (1993)
13. Lyytinen, K., et al.: MetaPHOR: Metamodelling, Principles, Hypertext, Objects and Repositories. Technical Report TR-7. University of Jyväskylä, Finland (1994)
14. Rossi, M.: Advanced Computer Support for Method Engineering: Implementation of CAME Environment in MetaEdit+. University of Jyväskylä, Finland (1998)
15. Tolvanen, J.-P.: Incremental Method Engineering with Modeling Tools: Theoretical Principles and Empirical Evidence. University of Jyväskylä, Finland (1998)
16. Prieto-Díaz, R., Freeman, P.: Classifying Software for Reusability. IEEE Software 4(1) (1987) 6-16
17. Prieto-Díaz, R., Arango, G. (eds.): Domain Analysis and Software Systems Modeling. IEEE Computer Society Press, Los Alamitos, CA (1991)
18. Liao, H.-C., Wang, F.-J.: Software Reuse Based on a Large Object-Oriented Library. ACM SIGSOFT Software Engineering Notes 18(1) (1993) 74-80
19. Iivari, J.: Levels of Abstraction as a Conceptual Framework for an Information System. In: Falkenberg, E.D. (ed.): Information System Concepts: An In-depth Analysis. North-Holland, Amsterdam (1989) 323-352
20. Olle, T.W., et al.: Information System Methodologies - A Framework for Understanding. Addison-Wesley (1991)
21. Wand, Y.: Ontology as a Foundation for Meta-Modelling and Method Engineering. Information and Software Technology 38(4) (1996) 281-287
22. Isakowitz, T., Kauffman, R.J.: Supporting Search for Reusable Software Objects. IEEE Transactions on Software Engineering 22(6) (1996) 407-423
23. Castellani, X.: Overviews of Models Defined with Charts of Concepts. In: IFIP WG8.1 International Conference on Information System Concepts: An Integrated Discipline Emerging (1999)
24. Oinas-Kukkonen, H.: Debate Browser - A Design Rationale Tool for the MetaEdit+ Environment. Working Paper Series B 40. University of Oulu, Finland (1996)
Appendix: Component Facet Descriptors on the Project Level, the Graph Level, and the Component Unit Level
Project level. The facet descriptor of a project is defined as:
{Name, Application domain, Methodology, NoOfGraph}
where
Name: Project name
Application domain = {Method engineering, User interfaces, Communication and control system, Office management system, Real time system, Business application, ...}
Methodology = {GOPRR, OMT, OOD, OOAD, UML, ...}
NoOfGraph: the number of graph level components involved
Name provides a unique project name. Application domain illustrates the field(s) where a project operates and implies its services. If it is a metamodel level project to specialise and implement a methodology in MetaEdit+, its application domain belongs to method engineering. Otherwise, a model level project can be distinguished between the office management system, real time system, communication and control system, and so on. The value set of the application domain can be augmented depending on the projects involved. Methodology represents the procedures, disciplines, rules and semantics that are applied in a project. A metamodel level project uses the GOPRR metamodelling language as its methodology, while a model level project has more methodologies to choose from. MetaEdit+ offers flexible method support, such as OMT (Rumbaugh et al.), OOD (Booch), OOAD (Coad/Yourdon), UML (Booch et al.), and the company's own methodologies that have been implemented in MetaEdit+. NoOfGraph presents the number of graph level components involved in a project. It indicates the scale and complexity of a project. Generally, the more graphs involved in one project, the more complicated the project will be.
Graph level. The facet descriptor of a graph is defined as:
{Name, SuperComp, Type, Abstraction aspect, Abstraction level, NoOfObject}
where
Name: Graph name
SuperComp: Collection of names of the related project level components
Type: The name of its metamodel (the name of the technique used)
Abstraction aspect = {Structure, Function, Behaviour}
Abstraction level = {Analysis, Design, Implementation, Maintenance}
NoOfObject: the number of objects involved
Name provides a unique graph name. On the metamodel level, a graph is a specification of a method/technique; on the model level, a graph describes one aspect within a specific application domain, e.g. a state diagram, a class diagram, and so on. SuperComp illustrates the collection of projects in which the graph is involved. It forms a necessary facet to trace the contextual relationship among components that are in the same project, but on different granularity levels. Type provides similar information to the methodology facet on the project level: it presents the method/technique a graph uses. On the metamodel level, the type for a method definition is GOPRR, while on the model level there are different types of graphs depending on the methodology used. For example, if a project applies the UML methodology as the development methodology, the types of its graphs can be Class Diagram, Use Case Diagram, Statechart Diagram, Activity Diagram, Sequence Diagram, Collaboration Diagram, Component Diagram and Deployment Diagram. Abstraction aspect represents different scopes of information system modelling. Generally, these scopes can be distinguished between the structure, which specifies static objects and their relationships in information modelling, the function, which indicates data input-output transformation, and the dynamic behaviour, which describes aspects concerned with time and state changes [19-21]. Abstraction level specifies the development stage a graph belongs to. On the metamodel level, it indicates the development stage this method/technique supports; on the model level, it represents the development stage that a diagram represents. NoOfObject presents the number of objects involved in a graph. In the same manner as NoOfGraph, it indicates the scale and complexity of a graph.
Component unit level. The facet descriptor of a component unit is defined as:
{Name, SuperComp, Type, Property, Behaviours}
where
Name: Component unit name
SuperComp: Collection of names of the related graphs
Type = {Object, Property, Relationship, Role, type name}
Property: A collection of attributes of the component unit
Behaviours: A collection of operations included in the component unit
Name provides a unique name of a component unit. SuperComp illustrates the collection of graphs in which the component unit is involved. It forms a necessary facet to trace the contextual relationship among components that are in the same project, but on different granularity levels. Type represents a metamodel level component unit as an object, a property, a relationship, or a role. The name of a metamodel level component forms the type of its instances on the model level. For example, State is an object type component unit in a State diagram specification; DialTone is an instance of the object State, and its type is State. Property represents a collection of attributes that a component unit has. Behaviours represents a collection of operations included in a component unit.
Tracing All Around
Gerald Ebner 1 and Hermann Kaindl 2
1 Significant Software, Zeltgasse 14/11, A-1080 Vienna, Austria
[email protected]
2 Siemens AG Österreich, Geusaugasse 17, A-1030 Vienna, Austria
[email protected]
Abstract. Many information systems are reengineered and redeveloped in practice, since they are legacy software. Typically, no requirements and design specifications exist and, therefore, no traceability information either. While especially the long-term utility of such information is well known, an important question in reengineering is whether installing it can have immediate benefits in the course of the reengineering effort. Are there even special benefits of traceability for reengineering? In this paper, we argue for completely tracing all around, from code over specifications to code, in the course of reverse engineering an existing software system and its subsequent redesign and redevelopment. Experience from a real-world project indicates that it can indeed be useful in practice to provide traceability all around also for the developers, and in terms of short-term benefits already during the development. We found several cases where traceability provided benefits that appear to be specific for reengineering. As a consequence, we recommend special emphasis on traceability during the reengineering of legacy software.
1 Introduction
Given a legacy information system, a usual task in software development is to come up with a new system that can substitute the old one. Most legacy software in practice does not have its requirements or design documented. Consequently, no traceability information is available either. First of all, the requirements and the design of the old information system are to be reverse-engineered [1], in order to understand what the new software should be all about. Is it also useful to spend the extra effort for installing traceability information from the old implementation to the reverse-engineered design and to the requirements? What could be gained from that already in the course of the redevelopment? Shall traceability information be installed also between the requirements and the new design and the new implementation? The thesis of this paper is that it is indeed useful to trace "all around", i.e., to trace from existing code to design to requirements during reverse engineering, and from the modified requirements to the new design and the new code during development of
the successor system. In effect, this means an integration of traceability during reverse engineering with (more standard) traceability during forward engineering. In addition, we argue for having design rationale integrated, where the old and the new design decisions can be traced as well. In support of this thesis, we present a case study from a real-world project of reengineering a legacy information system in the context of stock trading. The experience from this case study suggests that in the course of developing new software based on the code of an existing predecessor system, it provides benefits for the developers even in the short term to establish traces "all around". We found several examples of such benefits from traceability that appear to be specific for reengineering. This paper is organized in the following manner. First, we present both our high-level view and our realization of traceability in reengineering legacy software. Then we show several cases of how it was useful in the course of a real-world case study. Finally, we discuss our approach more generally and relate it to previous work on traceability.
2 Traceability in Reengineering Legacy Software
Since for legacy software little or no documentation exists on the requirements, design and design rationale, usually no traceability information is available either. This makes reengineering such software systems particularly hard. Because reengineering efforts under those conditions are commonplace in the software industry, it is important to investigate how to improve them, e.g., by installing traceability in the course of reengineering. Unfortunately, there are severe time and resource constraints in real-world projects, and establishing traceability takes time and may be costly (unless it can be automated). So, from a practitioner's perspective an important question is whether and how traceability can immediately help in the course of such a reengineering effort. The trade-off between the cost and the short-term benefits of traceability seems not to be well understood yet. We argue in favor of the usefulness of traceability in the course of both reverse engineering the old software and developing the new software, since we found empirical evidence for short-term benefits.
2.1 High-Level View
Our high-level view of traceability in reengineering is illustrated in Fig. 1 in a UML (Unified Modeling Language) class diagram that is enhanced by arrows.1 Tracing all around in reengineering means to trace from the implementation of the old system to design and to requirements during reverse engineering, and from the modified requirements to the new design and the implementation of the new system during development of the successor system.
1 For the standardized specification of UML see http://www.omg.org.
[Figure 1 connects the Requirements Definition, the Design of the Old System, the Design of the New System, and the Implementation of the Old and the New System: RETH hypertext links relate requirements and design, traceability tags relate design and implementation; explicitly assigned traceability information (during reverse or forward engineering) is distinguished from indirect traceability between the two implementations.]
Fig. 1. An illustration of tracing all around
Once traceability is explicitly established from the old to the new implementation via the design and requirements specifications, it is indirectly available between those implementations, as illustrated by the arrow with the broken line. From a theoretical perspective, these relations are included in the transitive closure of the explicitly represented traceability relations. In practice, this means that no explicit relation between the implementations needs to be represented, since it can be derived. In our view, only one definition of the requirements is necessary if the old and the new software are supposed to satisfy more or less the same requirements. Reverse engineering projects in practice will typically have to deal with at least minor changes or additions to the requirements [1]. Still, there can be a single requirements specification in the course of the reengineering effort, defining both the old and the new requirements.
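The derivation of indirect traceability mentioned above can be sketched as a transitive closure over the explicitly represented trace relations; the artifact names below are only placeholders:

```python
# Hypothetical sketch: indirect traces as the transitive closure of explicit traces.
# Each pair (a, b) means "a is explicitly traced to b".

explicit_traces = {
    ("old implementation", "old design"),
    ("old design", "requirements definition"),
    ("requirements definition", "new design"),
    ("new design", "new implementation"),
}

def transitive_closure(pairs):
    closure = set(pairs)
    changed = True
    while changed:
        changed = False
        for (a, b) in list(closure):
            for (c, d) in list(closure):
                if b == c and (a, d) not in closure:
                    closure.add((a, d))
                    changed = True
    return closure

# The old and the new implementation are related without any explicit trace between them.
print(("old implementation", "new implementation") in transitive_closure(explicit_traces))
# -> True
```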
2.2 Our Realization
In the case study reported below, the tool RETH (Requirements Engineering Through Hypertext) was used (for the RETH method and its supporting tool see, e.g., [7]). It was not only used there for capturing the requirements, but also for the software design (both old and new) and the traces. Hyperlinks in the RETH tool served as a means for
installing traceability information in our realization, linking various artifacts in the specification of the requirements with artifacts in the two software designs (also illustrated in Fig. 1). Installing hyperlinks in the RETH tool is inexpensive due to its semi-automatic support for link generation [9]. The mechanism for generating glossary links can also be utilized, e.g., for generating traceability links based on textual references. An immediate advantage of such links is that they can be easily navigated for following traceability paths. For technical reasons, however, the source code of the old and the new implementations had to be kept outside this tool. So, hyperlinks of that sort were infeasible to install, and we chose to use traceability tags in our realization of traceability among design artifacts and source code (see also Fig. 1). Installing traceability tags just means to insert a text string. For generating unique names of tags, we provided rudimentary tool support. Fig. 2 illustrates in a more detailed UML class diagram the metamodel used, i.e., the model of what the models look like during reengineering and forward engineering. Since the traceability metamodels in the literature as discussed below are not sufficiently specific, we developed one ourselves. The top part (showing the object classes in white) is the previously published metamodel of RETH for requirements engineering [7, 8] (aggregated here by Requirements Definition). The other object classes (shown in grey) extend it for covering also design (shown lighter) and implementation artifacts (shown darker), including a simple representation of design rationale (Design Decision, which is part of Design). As indicated by Fig. 1 above, for reengineering those parts of the metamodel are instantiated twice: for the old and the new design and implementation. Associations named Trace in the metamodel represent between which object classes traces are to be installed:
• between Requirements Definition and Design;2
• between several parts of Design and Implementation.
Especially for the latter, the granularity of traceability information is of practical importance. According to our experience, it should not be too coarse, e.g., to a source code module as a whole. But if it is too fine-grained, e.g., to each statement in the source code, then it becomes too expensive to install. As a balance, we chose to have a traceability tag assigned to each procedure of the source code as well as to each table definition, database trigger and stored procedure of the database script. This can be done with reasonable effort by the developers, and it can still be useful (see the experience from the case study reported below). The representation of design rationale is also a practical compromise. Each Design Decision just describes the Design Chosen and the Alternatives Considered, since more elaborate structures like those in [12] would probably not have been maintained under the given time constraints of the project. It was important, however, that the design decisions are traceable.
2 In order not to clutter this diagram, we abstract here from the details of where exactly hyperlinks are installed.
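The following sketch illustrates what such rudimentary tool support could look like; it is not the tool actually used in the project, only an assumed example of generating unique tag strings and checking which parts of a source file carry one (the tag format and the SQL-comment convention are assumptions).

```python
# Hypothetical sketch of traceability tags as plain text strings in source code.
# Tag format and the scanned file are assumptions, not the project's actual tooling.

import itertools
import re

_counter = itertools.count(1)

def new_tag(prefix="TRACE"):
    """Generate a unique traceability tag string, e.g. 'TRACE-0001'."""
    return f"{prefix}-{next(_counter):04d}"

TAG_PATTERN = re.compile(r"--\s*(TRACE-\d{4})")   # tag inside a SQL-style comment

def tags_in(text):
    """Return all traceability tags found in the given source text."""
    return TAG_PATTERN.findall(text)

database_script = """
CREATE PROCEDURE fill_get_table AS   -- TRACE-0001
BEGIN NULL; END;
CREATE PROCEDURE untagged_proc AS
BEGIN NULL; END;
"""

print(new_tag())                 # -> 'TRACE-0001' (first tag generated by the counter)
print(tags_in(database_script))  # -> ['TRACE-0001']
```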
[Figure 2 shows the metamodel as a UML class diagram: the Requirements Definition part aggregates Goal Statement, Domain Object Description, Requirement Statement, Scenario, Functional Requirement and Quality Requirement; the Design part contains Design Goal Statement, Design Object Description, Design Scenario and Design Decision (with Design Chosen and Alternatives Considered); the Implementation part contains Source Code File and Database Script; associations named Trace connect Requirements Definition with Design and parts of Design with Implementation.]
Trace
3 A Case Study
The case study reported here was a real-world reengineering effort of legacy software in the context of stock trading. The installation and use of traceability happened in "real time" during the project. We describe the given task, the various roles in the case study and, in more detail, examples of concrete project experience. However, we focus exclusively on traceability and in particular its benefits for the developers of the new software.
3.1 The Given Task
More precisely, the given task was the reengineering of the Trade Bridge software, which primarily makes continuous checks for consistency of received stock data and distributes those data to other software involved. Fig. 3 illustrates on an abstract level how the Trade Bridge software interfaces with those other software systems. The Front Office System delivers trade data, position data, etc. to Trade Bridge, where they are stored in a repository (implemented as a relational database). Trade Bridge checks those data for consistency and delivers the resulting data to the Mid Office System, the Risk Management System, the Ticket Printer and the Back Office System.
[Figure 3 is a UML collaboration diagram: Trade Bridge sends a query to the Front Office System (control flow, "1: Query") and receives front office data in return (data flow, "2: Front Office Data"); Trade Bridge is further connected to the Mid Office System, the Risk Management System, the Ticket Printer and the Back Office System.]
Fig. 3. Trade Bridge and its collaborations with other software
While for the other collaborations this short sketch should be sufficient in the context of this paper, we need to provide a few details of the collaboration between the Front Office System and Trade Bridge. As shown in the UML collaboration diagram in Fig. 3, the control flow happens from Trade Bridge to the Front Office System, while the data flow is the other way round. In fact, Trade Bridge is polling with queries for trade data, where both the queries and the data are transferred as ASCII text. The queries and the data transfer are asynchronous, and the response is on average in
the order of minutes. It is also important to note that transferring too much data becomes a performance issue. Fig. 4 illustrates the main part of the information model of the Trade Bridge software, i.e., an implementation-independent model of the data as viewed from outside the boundary of this software. The concrete data model is based on it, but optimized for its implementation in a relational database that serves as a repository. The most important part of this model for the purpose of understanding the case study is the association between Trade Data and Additional Trade Data, where the former reference the latter. The Front Office System delivers those data asynchronously.
[Figure 4: Front Office Data comprises Trade Data, Position Data and Additional Trade Data; Trade Data references Additional Trade Data, which comprises Counterpart Data, Instrument Data, Settlement Data and Portfolio Data.]
Fig. 4. Part of the Information Model of Trade Bridge
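The association that matters most for the case study, Trade Data referencing Additional Trade Data that is delivered asynchronously, can be sketched as follows; field and class names are illustrative, not the actual Trade Bridge schema:

```python
# Hypothetical sketch of the Trade Bridge information model of Fig. 4.
# Field and class names are illustrative, not the real repository schema.

from dataclasses import dataclass
from typing import Optional

@dataclass
class AdditionalTradeData:            # counterpart, instrument, settlement or portfolio data
    key: str
    kind: str                         # e.g. "counterpart"

@dataclass
class TradeData:
    trade_id: str
    counterpart_key: str                              # reference delivered with the trade
    counterpart: Optional[AdditionalTradeData] = None # arrives asynchronously, may be missing

trade = TradeData(trade_id="T-1", counterpart_key="CP-42")
print(trade.counterpart is None)      # -> True until the referenced data has been delivered
```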
3.2 Various Roles in This Case Study
As usual in a software development project, several people participated in various roles. Since we describe our case study in terms of these roles, let us define them here:
• users: brokers and risk managers use the overall system, where Trade Bridge is an important part;
• developers of the old Trade Bridge software: they left the company and the project one month after its start;
• project leader for the reengineering effort: she was not involved in the development of the old Trade Bridge software;
• requirements engineer for the reengineering effort: the first author of this paper, who was also not involved originally;
• chief designer for the reengineering effort: same as the requirements engineer;
• developers of the new Trade Bridge software: including the first author of this paper; no single developer of the new software was involved in the development of the old Trade Bridge software.
3.3 Concrete Project Experience
In order to make the concrete project experience understandable, let us first briefly sketch the major approach taken in the course of reengineering Trade Bridge. Immediately after the start of the project, the requirements engineer acquired information about the requirements on the old software from its developers, rather than starting with design recovery based on the given implementation (and its source code). However, after the developers of the old software were not available anymore, everything (including requirements) had to be recovered. The requirements engineer represented the requirements in the RETH tool and, in his role as the chief designer, he tried to figure out the connection of those requirements acquired from the developers with the implementation of the old software. He represented the resulting design information and the design rationale in the RETH tool (according to the metamodel illustrated above). The design rationale was partly acquired from the developers of the old software and partly hypothesized by the chief designer. He also installed traceability information immediately in the form of hypertext links and traceability tags, both during reverse engineering and forward engineering. In the following, we demonstrate the immediate benefits of having this traceability information available through concrete examples.
Traceability in the Context of Additional Requirements. First we present an example where traceability helped in the context of additional requirements. For the new Trade Bridge software, the users required additional functionality, which was captured, e.g., in the following requirement.
Functional Requirement Synchronization of Referenced Counterpart Data:
• Statement: Trade Bridge shall transfer trade data to the Mid Office System only together with the corresponding counterpart data as delivered from the Front Office System, i.e., Trade Bridge must synchronize those data delivered asynchronously from the Front Office System.
As an immediate reaction to this additional requirement, the project leader thought about a rather straightforward solution: to request all those data that belong together in one query. Instead of having to study this approach in detail, it was sufficient to consult the design rationale that had already been reverse-engineered with the help of the developers of the old software. In order to do so, however, it was necessary to locate this information. First of all, the existing functionality related to the additional one was located easily in the hierarchically ordered requirements on the old software:
Automatically Transferring Trades
  Transferring Referenced Additional Trade Data

Then, through the traceability links in the form of hyperlinks, it was easy and efficient to locate the related design information and design rationale:

Design Decision Determine which Additional Trade Data to Request:
• Design Chosen
- Triggers that fire after the insert of new trades into the Trade Bridge database fill special Get tables with the key information of the given data.
- After all trades have been stored in the database, the Trade Reader subsystem asks the Get tables which additional trade data it shall request from the Front Office System (the number of tries is configurable) and processes these transfers.
- The number of tries in the appropriate record of the corresponding Get table is incremented.
- The count of additional trade data items that will be requested from the Front Office System at once is configurable.
- When additional trade data is actually inserted in the Trade Bridge database, triggers delete the corresponding entries in the Get table.
• Alternatives Considered
1. Request trade data and all associated additional trade data at once from the Front Office System by using a join query:
(+) No inconsistent data of the Front Office System is ever stored in the Trade Bridge database.
(-) Inefficient: a given instrument or portfolio is referenced by many trades, so the same data would be requested from the Front Office System many times.
(-) Unreliable: during the daytime (when the Front Office System suffers from a heavy workload) it is likely that huge join queries cannot be processed by the Front Office System.
2. Request all trade data and all additional trade data that was altered or inserted in the Front Office System during a given time frame. A prerequisite is that first all "old" front office data must be stored in the Trade Bridge database.
(+) A consistent state of the Trade Bridge database can be easily maintained without any effort of maintaining Get tables.
(-) Too much data would be transferred to the Trade Bridge system because a lot of the instrument and portfolio data of the Front Office System is not of interest.
(-) The performance of the resulting queries that have to be executed by the Front Office System would be very poor (in the current installation of the Front Office System far too poor) because the information on the update time of each record is not indexed in the database of the Front Office System.

In fact, alternative 1, already recorded here, corresponds to the design solution in question. Due to its main disadvantage given here as well (poor performance), it was easily and quickly rejected, based on this rationale previously acquired from the developers of the old Trade Bridge software.
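The design rationale recorded in the RETH tool follows a simple recurring structure: a chosen design, the alternatives that were considered, and arguments for and against each, linked by hyperlinks to requirements and design objects. Purely as an illustration — this is not the RETH tool's actual data model, and all class and field names below are hypothetical — the following Python sketch shows one way such rationale records and their traceability links could be represented and queried:

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Alternative:
    description: str
    pros: List[str] = field(default_factory=list)
    cons: List[str] = field(default_factory=list)

@dataclass
class DesignDecision:
    name: str
    design_chosen: List[str]                              # the chosen design, as a list of steps
    alternatives: List[Alternative] = field(default_factory=list)
    traces_to: List[str] = field(default_factory=list)    # names of linked requirements

def decisions_for_requirement(decisions, requirement_name):
    """Follow the traceability links from a requirement to its design decisions."""
    return [d for d in decisions if requirement_name in d.traces_to]

# Example: the decision discussed above, linked to the existing requirement.
decision = DesignDecision(
    name="Determine which Additional Trade Data to Request",
    design_chosen=["Triggers fill Get tables", "Trade Reader queries Get tables"],
    alternatives=[Alternative("Join query at once",
                              pros=["No inconsistent data stored"],
                              cons=["Inefficient", "Unreliable under load"])],
    traces_to=["Transferring Referenced Additional Trade Data"],
)
print(decisions_for_requirement([decision], "Transferring Referenced Additional Trade Data"))
```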
Still, another solution had to be found for the new software. Rethinking the chosen design of the old software as given here led to a change and enhancement of the design for the new software, fitting the additional requirement:

Design Decision Synchronizing Counterpart Data:
• Design Chosen
- Triggers that fire after the insert of new trades into the Trade Bridge database fill a special Trade Sync Table with the ID of the given trade and the counterpart synchronization flag set to "N".
- The needed counterpart data is retrieved from the Front Office System via the Get mechanism.
- When counterpart data is actually inserted into the Trade Bridge database, triggers set the counterpart synchronization flag in the Trade Sync Table of all trades referencing the given counterpart data to "Y".
- Trade Bridge applications that need synchronized counterpart data must query the Counterpart Sync Flag of the Trade Sync Table before using the data of a certain trade.

While in the RETH tool there are many similar links, we show here just one example. The underlined string "Trade Sync Table" is in the RETH tool the source of a hyperlink from such a description to a referenced design object:

Design Object Trade Sync Table:
• Description: The Trade Sync Table contains synchronization flags for all kinds of additional trade data (i.e., a counterpart, an instrument, a portfolio and a settlement synchronization flag), although only the first of the four is actually used. Flags may contain "Y" (synchronized) or "N" (not synchronized). The primary key of the table is the trade ID field, which is also a foreign key on the trades table.

Finally, it was also easy to locate the related parts of the implementation with the help of the traceability tags.

Traceability for Improving Completeness
The next example highlights the utility of traceability for improving completeness, in the sense of whether the new source code covers everything covered by the old code. The criterion for coverage of the old code was defined such that traceability tags must be installed at least at the granularity described above. That is, there must be a traceability tag in each procedure of the source code as well as in each table definition, database trigger and stored procedure of the database script (containing some 20,000 lines of code). After the reverse engineering effort, developers of the new software performed a code inspection, which revealed, e.g., that out of hundreds of database triggers, slightly more than a dozen did not yet have traceability tags attached. One of those (for filling a table of instrument data) was particularly important, since its absence in the new software could have led to recommendations of the risk management based on wrong data.
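The coverage criterion above — at least one traceability tag per procedure, table definition, database trigger and stored procedure — lends itself to a simple automated check. The following Python sketch is not part of the case study's tooling; it merely illustrates how such a scan over a database script might look. The °NN° tag syntax is taken from the code excerpt shown further below, while the trigger-definition pattern is an assumption.

```python
import re

TAG = re.compile(r"°\d+°")                                  # traceability tag, e.g. °41°
TRIGGER = re.compile(r"create\s+trigger\s+(\w+)", re.IGNORECASE)

def untagged_triggers(script: str):
    """Return the names of triggers whose definition carries no traceability tag."""
    missing = []
    defs = list(TRIGGER.finditer(script))
    for i, m in enumerate(defs):
        end = defs[i + 1].start() if i + 1 < len(defs) else len(script)
        if not TAG.search(script[m.start():end]):           # scan up to the next trigger
            missing.append(m.group(1))
    return missing

demo = """
create trigger trg_trade_insert   ... °12° ...
create trigger trg_instrument_get ...
"""
print(untagged_triggers(demo))                              # ['trg_instrument_get']
```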
In more detail, the requirement originally reverse-engineered from the incomplete view of the code was the following:

Functional Requirement Transferring Referenced Additional Trade Data:
• Statement: Trade Bridge shall request Additional Trade Data from the Front Office System when Trade Data was captured from the Front Office System which references portfolios, instruments, etc. which are not yet stored in the Trade Bridge repository.

The over-generalized assumption was that, for all kinds of additional trade data, the data transfer from the Front Office System to Trade Bridge can be reduced whenever the data are already in the repository of Trade Bridge. For instrument data, however, the corresponding check was commented out in the code of the old software. This was found in the course of the focused code inspections of those database triggers that did not yet have a traceability tag assigned. The rationale for this implementation was recovered from the risk managers: during the period since the instrument data were stored in the Trade Bridge repository, they may have changed! After the traceability tags had helped to find that out, the wrong requirement was corrected so that it takes this exception into account. Consequently, the design of the new software and its implementation also take it into account, and a major error in the new Trade Bridge software was avoided.

Traceability for Diagnosing Errors
As a final example, let us illustrate the usefulness of traceability for diagnosing errors. While the granularity of the installed traceability tags was useful in the above example, it turned out to be insufficient in another case. After the implementation of the new software, tests revealed a performance problem. The Front Office System interfacing with it was unable to deliver the amount of data requested by the new Trade Bridge software. Why does that not happen with the old Trade Bridge software? The traceability installed all around made it easy to answer that question quickly, since it allowed finding immediately the corresponding parts in the old and the new implementation. In order to avoid this performance problem, the old software requested only five data records per request, which is well hidden in the code of the following old procedure:

function MakeInstrumentQuery(): string;
// determine which instrument data to request from the
// Front Office System and build the "where" part of
// the query string
°41°
begin
  ...
  SqlExecute('select distinct ID into strID from Instrument_Get where tries < 10 order by tries');
  while SqlFetch() and (nLines < 5) do begin
    strQuery = strQuery + strID + ', ';
    nLines = nLines + 1;
  end;
  if (nLines > 0) then
    ...
end;

The critical condition is "and (nLines < 5)". This is clearly below the granularity of the tags, where "°41°" is the traceability tag for this procedure. Still, finding the correct diagnosis for this problem making use of the traceability information was possible within one hour, while it might otherwise have taken days. So, while the granularity of the traceability information was insufficient to guarantee sufficient completeness, this information was still very helpful to solve a problem caused by an incomplete re-implementation.
4 Discussion
So, there is a trade-off in the granularity of traceability information. It relates to the more general trade-off between the cost of installing and maintaining traceability information and the benefits of having it available. Common wisdom suggests that traceability should pay off at least in the long run, including maintenance. Unfortunately, the success or failure of a software development or reengineering project is in practice not usually judged by whether it provides, e.g., traces in order to facilitate maintenance later. The budget often just covers the cost of delivering a running system, and costs arising later from missing traces are outside its scope. This looks very short-sighted and is regrettable from a higher perspective. However, many software systems that are developed are never deployed, for several reasons. So, the focus is on delivering a running system on time so that it will be deployed, rather than on preparing for an uncertain future where other people will have to bear the cost during maintenance, if at all. That is why we argue for a distinction between immediate and long-term benefits. While we cannot provide quantitative data from our case study that would show that traceability already paid off within the reengineering effort, we observed and described cases where it provided benefits already in its course. In fact, we also found additional cases where traceability was useful in the short term. These were particularly related to changes in the requirements and the software design. In all those cases, the developers got a more or less immediate reward for their effort of installing traces. Due to this reward, the motivation of the developers to do this "extra" work was increased, and this may finally result in long-term benefits during maintenance as well.
5 Related Work
In recent years, there has been an increasing focus on pre-requirements traceability and extended requirements traceability, i.e., on where the requirements come from in the first place [5, 6]. While the RETH tool can also support this aspect of traceability through hypermedia, it was not a major focus in the course of this case study. Traceability among requirements, primarily from higher-level to lower-level requirements, has been considered for a long time and is much better supported by tools [4, 10]. The tracing tool TOOR presented in [10] shares with our approach that it treats requirements and relations among them as objects and that it can be applied to any other artifacts produced in a software project as well. In [11], a requirements traceability metamodel is presented that also covers design and implementation. It is more comprehensive than the one presented here, since it also includes stakeholders, etc. However, our metamodel as illustrated in Fig. 2 is more concrete in specifying specialized meta-classes and their relationships for the various artifacts used to represent requirements, design and implementation. For the case study presented in [11], installing all those traces according to their metamodel was reported to be very expensive. They did not report, however, on short-term benefits or on the specific utility of traceability in reengineering. There is much literature available on design rationale (see, e.g., [11, 12]) that presents quite elaborate argumentation structures. While such structures could easily be represented using hypertext in the RETH tool, according to our experience it would be too expensive to maintain them under the conditions of this real-world project. So, we found a practical compromise with the inclusion of a simple representation for design rationale and its integration with the overall traceability supported. Recently, a special section in CACM (edited by Jarke) included several articles dedicated to more advanced proposals for traceability. In [3], it is argued for project-specific trace definitions and guidance of stakeholders in trace capture and usage (including enforcement). Our tool support for installing traces through the automatic link-generation facility of the RETH tool is a small but important step towards making life easier for those who are supposed to install traces in practice. In [2], it is argued for improving requirements traceability beyond the systems facet to the group collaboration facet and to the organizational facet of information systems. So, this sketch of important work shows many aspects and a rather comprehensive view of traceability. Still, we could not find much in the literature about the trade-off between the cost and utility of traceability, and especially not about short-term benefits. In current practice, however, it is very important to show the developers certain rewards to be gained immediately from installing traces. Also, we found no mention of the specific utility of tracing during reverse engineering or even of tracing all around.
6 Conclusion
We observed special benefits of traceability for reengineering that we could not find in the literature. Partly, those benefits require tracing all around. The major lesson
learned from several examples in our real-world case study is, however, that such traceability information can already help the developers (of the new system) in the course of reengineering legacy software. Such short-term benefits for the developers from applying this form of traceability can motivate them to provide traces that may even result in further long-term benefits later.
Acknowledgments
Mario Hailing and Vahan Harput provided useful comments on an earlier draft of this paper. Finally, we acknowledge the cooperation of the project members in the course of this reengineering effort.
References 1. E.J. Chikofsky and J.H. Cross. Reverse engineering and design recovery: A taxonomy. IEEE Software, pages 13–17, January 1990. 2. G. De Michelis, E. Dubois, M. Jarke, F. Matthes, J. Mylopoulos, J.W. Schmidt, C. Woo, and E. Yu. A three-faceted view of information systems. Communications of the ACM, 41(12):64–70, December 1998. 3. R. Dömges and K. Pohl. Adapting traceability environments to project-specific needs. Communications of the ACM, 41(12):54–62, December 1998. 4. R.F. Flynn and M. Dorfman. The automated requirements traceability system (ARTS): An experience of eight years. In R.H. Thayer and M. Dorfman, editors, System and Software Requirements Engineering, pages 423–438. IEEE Computer Society Press, 1990. 5. O. Gotel and A. Finkelstein. Extended requirements traceability: Results of an industrial case study. In Proceedings of the Third IEEE International Symposium on Requirements Engineering (RE'97), pages 169–178, Annapolis, MD, January 1997. 6. P. Haumer, P. Heymans, M. Jarke, and K. Pohl. Bridging the gap between past and future in RE: A scenario-based approach. In Proceedings of the Fourth IEEE International Symposium on Requirements Engineering (RE'99), pages 66–73, Limerick, Ireland, June 1999. 7. H. Kaindl. A practical approach to combining requirements definition and object-oriented analysis. Annals of Software Engineering, 3:319–343, 1997. 8. H. Kaindl. Combining goals and functional requirements in a scenario-based design process. In People and Computers XIII, Proc. Human Computer Interaction '98 (HCI '98), pages 101–121, Sheffield, UK, September 1998. Springer. 9. H. Kaindl, S. Kramer, and P.S.N. Diallo. Semiautomatic generation of glossary links: A practical solution. In Proceedings of the Tenth ACM Conference on Hypertext and Hypermedia (Hypertext '99), pages 3–12, Darmstadt, Germany, February 1999. 10. F.A.C. Pinheiro and J.A. Goguen. An object-oriented tool for tracing requirements. IEEE Software, pages 52–64, March 1996. 11. B. Ramesh, C. Stubbs, T. Powers, and M. Edwards. Requirements traceability: Theory and practice. Annals of Software Engineering, 3:397–415, 1997. 12. B. Ramesh and V. Dhar. Supporting systems development by capturing deliberations during requirements engineering. IEEE Transactions on Software Engineering, 18(6):498– 510, 1992.
A Framework for the Evolution of Temporal Conceptual Schemas of Information Systems

Juan-Ramón López 1,2 and Antoni Olivé 1

1 Universitat Politècnica de Catalunya, Departament de Llenguatges i Sistemes Informàtics, Barcelona - Catalonia. {jrlopez|olive}@lsi.upc.es
2 Universidade da Coruña, Departamento de Computación
Abstract. This paper focuses on the problem of information systems evolution. Ideally, changes in the requirements of information systems should be defined and managed at the conceptual schema level, with an automatic propagation down to the logical database schema(s) and application programs. We propose here a framework for the evolution of temporal conceptual schemas of information systems. Our framework uses a reflective architecture with two levels: meta schema and schema, and two loosely coupled information processors, one for each level. We define a temporal minimal meta schema, and we show, using some examples, how to extend this minimal meta schema to support any usual conceptual modeling construct. We also show how the framework can be used to specify schema changes.
1 Introduction
This paper deals with the evolution of temporal conceptual schemas of information systems. We propose a general framework which allows defining schema evolutions and their effects. We illustrate the framework with a limited albeit representative number of widely used conceptual modeling constructs, define some evolution operations at the meta schema level, and analyze their impact on the conceptual schema and the information base.
In order to characterize our contribution, we find it useful to start with the abstract database evolution framework presented in [11], that we rephrase as follows: Assume that an information system satisfies requirements R0. The information system comprises a conceptual schema CS0, one (or more) database(s) with logical schema(s) LS0 and extension(s) E0, and several application programs P0. When requirements change to R1, the information system must evolve. In the general case, the evolution implies changes leading to a new conceptual schema CS1, new logical schema(s) LS1 and application programs P1. Moreover, the extension(s) E0 may not be valid, and need to be converted to E1.
Ideally, the evolution of information systems should follow the strategy of “forward information system maintenance” [11]: changes should be applied to the conceptual schema, and from here they should propagate automatically down to the
logical schema(s) and application programs. If needed, the extension(s) should also be converted automatically.
In the context of the above framework, the aspect that has received the most research effort is (logical) database schema evolution, including the propagation of changes to the extensional database. Data models that have been studied extensively include the relational and object-oriented models, both non-temporal and temporal. In several cases, research results have been incorporated into commercial or prototype database management systems (e.g., Orion [4], O2 [27], F2 [1]). We refer to [19, 8, 24, 15] for extensive bibliographies and surveys.
We observe that the ideal strategy described above implies that the conceptual schema is the only description to be defined, and the basis for the specification of the evolution. This observation has led us to our framework, which we see as one step more towards that ideal strategy.
Our framework is based on the commonly accepted approach that, from a conceptual point of view, an information system consists of a conceptual schema, an information base and an information processor. The conceptual schema defines all general domain knowledge relevant to the system, which includes entity and relationship types, all kinds of integrity constraints and derivation rules, and the effects of external events. The information processor receives messages from the environment, reporting the occurrence of external events. In response to such events, the information processor changes the information base and/or issues messages making known the contents of the information base, in the way specified in the conceptual schema [12].
In this view, evolution can be accommodated elegantly by using the classical reflective architecture [18, 22], as shown in Figure 1. A distinction is made between an information system and a meta information system. The first consists of a conceptual schema, an information base (IB) and an information processor (IP). The second consists of a conceptual meta schema, a meta information base (MIB) and a meta information processor (MIP). In essence, the MIB of the meta information system describes the conceptual schema of the information system [12]. In our framework, possible changes in the conceptual schema are specified as meta external events in the conceptual meta schema. Occurrences of these events are reported to the MIP, which changes the MIB and indirectly the conceptual schema. In some cases, changes in the IB may be necessary too. These latter changes are not performed directly by the MIP. Instead, the MIP generates occurrences of external events, which are notified to the IP to perform the desired changes. In this way, the two processors are loosely coupled.
[Figure 1 shows the two-level architecture: meta external events flow to the MIP, which works with the conceptual meta schema and the MIB; external events flow to the IP, which works with the conceptual schema and the IB.]
Fig. 1. Framework for the evolution of information systems
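To make the division of labour in Fig. 1 concrete, the following Python sketch — our own illustration, not an implementation described in the paper — wires the two processors together as loosely coupled objects: the MIP applies meta structural events to the MIB and, when instance-level changes are needed, it only notifies the IP of generated external events rather than touching the IB itself. All names are hypothetical.

```python
class InformationProcessor:
    """Maintains the information base (IB) according to the conceptual schema."""
    def __init__(self):
        self.ib = set()                          # facts such as ('Person', 'ann', 5)

    def handle_external_event(self, event, effect_rule):
        effect_rule(self.ib, event)              # apply the event's effect rule to the IB


class MetaInformationProcessor:
    """Maintains the meta information base (MIB); never changes the IB directly."""
    def __init__(self, ip):
        self.mib = set()                         # facts such as ('EntityType', 'Person', 5)
        self.ip = ip                             # loose coupling: interaction only via events

    def handle_meta_external_event(self, meta_effects, induced_events=()):
        for change in meta_effects:              # meta structural events change the MIB ...
            change(self.mib)
        for event, effect_rule in induced_events:  # ... IB changes are delegated to the IP
            self.ip.handle_external_event(event, effect_rule)


ip = InformationProcessor()
mip = MetaInformationProcessor(ip)
# A meta external event that adds entity type Person to the schema at time 5:
mip.handle_meta_external_event([lambda mib: mib.add(('EntityType', 'Person', 5))])
print(mip.mib)
```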
There is a lot of previous research and development work relevant to this framework. We can only give in the following a few comments relating a representative part of this work with our paper. [14] describes SEA, Schema Evolution Assistant, a tool which allows the evolution of schemas in Chimera. Chimera is a rich data model that includes integrity constraints, derivation rules and triggers. SEA uses a reflective architecture [16], as we do. Initially, Chimera was non-temporal, but a temporal extension was published later [5]. SEA was developed for the non-temporal version. [13] describes ConceptBase, a system based on the O-Telos language, a variant of Telos. O-Telos deals uniformly with objects at different levels of abstraction. O-Telos has a kernel, from which all other constructs can be bootstrapped. We have followed the same idea with our minimal meta schema. However, our framework allows that the information and the meta information processors be different. On the other hand, we deal with the temporal dimension and focus more on schema evolution. Finally, we mention the work done in the context of the TIGUKAT temporal ODBMS [22, 23, 10]. The TIGUKAT schema objects include types, classes, behaviours, functions and collections. The model allows the representation of the temporal histories of real-world objects. We also adopt the temporal view, but conceptual schemas of information systems need to include other elements like integrity constraints, derivation rules, external events, etc. Evolution is performed by operations (add, drop and modify) on schema objects. These are similar to our meta external events, but we need many more to allow the evolution of other schema objects. The remainder of this paper is organized as follows. In Section 2, we characterize temporal conceptual schemas and evolution at the temporal IB level. In Section 3 we do the same for the conceptual meta schema and MIB, and define the correspondence between the MIB and the conceptual schema. We also show how usual conceptual modeling constructs can be defined at the conceptual meta schema level. In Section 4 we then define the meta external events that perform schema evolution. Finally, Section 5 gives the conclusions and points out future work.
2 Temporal Conceptual Schema and Information Base
In this section, we briefly characterize the elements of a temporal conceptual schema (or, for short, schema). We wish our work to be of general applicability and language-independent. For this reason, we focus here on a set of minimal core elements of schemas. Any schema can be expressed in terms of these elements. They are entity types, relationship types, derivation rules, integrity constraints and external event types [3]. We represent them (and their instances in the IB) using the language of first-order logic (FOL), with some common extensions.

2.1 Temporal Conceptual Schema
The entities of a domain are instance of entity types at particular time points [7]. An entity may be instance of several entity types at the same time (multiple
classification). On the other hand, an entity may change the entity types of which it is instance (dynamic classification). In a temporal conceptual schema, entity types may be represented by binary predicates, whose first term is a symbol that denotes an entity in the domain, and the second a time point. In the IB, we represent by E(e,t) the fact that entity e is instance of entity type E at time t. We assume that time is discrete, linear and totally ordered. Time points are expressed in a common base time unit (or chronon [26]), such as second or day. We also assume that a schema includes a special unary predicate Time, whose instances are the time points (in the chosen unit) of the information system lifespan. In general, the set of entity types defined in a schema is time-varying. We denote by Et the set of entity types defined in a schema at time t. An entity type may cease to exist in a schema and appear again later. We adopt here the usual typing rule, which in our case requires that, for all entity types E: ∀e,t (E(e,t) → E ∈ Et) .
(1)
That is, we do not allow the IB to represent that e is instance of E at t, if E is not defined in the schema at t. For convenience, we assume that if an entity type E ceases to exist at time t, then all its instances at t-1 (if any) also cease to exist at t. Entity types may be lexical or non-lexical. For the sake of uniformity we represent lexical entity types also by binary predicates, even if lexical entities are usually instance of their types during the whole lifespan of an information system. The relationships of a domain are instance of relationship types at particular time points. In a temporal schema, a relationship type with n participants may be represented by a predicate of degree n+1, where the first n terms are symbols that denote the participant entities, and the n+1 term is a time point. In the IB, we represent by R(e1,...,en,t) the fact that entities e1,...,en participate in a relationship instance of relationship type R at time t. In general, the set of relationship types defined in a schema is time-varying, and we denote by Rt the set of relationship types defined in a schema at time t. The above typing rule also applies to relationship types. Entity and relationship types may be base or derived. Instances of base types must be stated explicitly, as will be seen later, while those of derived types are defined by derivation rules. In a temporal schema, derivation rules must be considered also as time-varying. We denote by DRt the set of derivation rules defined in a schema at time t. Our framework allows derivation rules to be written in the language understood by the IP. In the examples we use the FOL language. Integrity constraints are conditions that the IB must satisfy. We consider them also as time-varying. We denote by ICt the set of integrity constraints defined in a schema at time t. It is assumed that at any time t the IB satisfies all integrity constraints defined in ICt. Our framework could be extended by distinguishing, in the schema, several kinds of integrity constraints (and of derivation rules). However, in this paper we prefer not to make this distinction here, but we do it in the conceptual meta schema. The MIP will "generate" constraints in the language required by the IP. The IB changes due to the occurrence of external events. As we did for entity types, we represent external event types in a schema by binary predicates, but now the
first term denotes an event occurrence. In the IB we represent by Ev(ev,t) the occurrence of event ev of type Ev at time t. In general, the set of external event types defined in a schema is time-varying, and we denote by EEt the set of external event types defined in a schema at time t. The above typing rule also applies to these types. External event types may be base or derived. For each external event type, the schema includes the definition of its preconditions and effect rules (transactions). The preconditions are conditions that must be satisfied when an external event occurs. The effect rules define the changes produced in the IB by the event. Both preconditions and effect rules are formulas expressed in the language understood by the IP. We find it convenient to allow the possibility of defining, for each integrity constraint, a set of compensating effect rules. The IP adds these rules to the effect rules of an event type when an occurrence of this event type leads to a violation of the integrity constraint. This, however, does not mean that violations can be prevented in all cases by compensating rules.

2.2 Information Base Evolution
The effect rules of an external event type define the changes in the IB when an instance of that type occurs. We model possible changes in the IB as structural events, and we then say that an external event induces structural events, which, in turn, change the IB. The structural event types are determined by the entity and relationship types defined in a conceptual schema and, thus, they are not defined by the designer. We represent structural event types by predicates, as follows:
− There is an insertion and a deletion event type for each non-lexical entity type E. We represent them by the binary predicates Ins_E and Del_E, respectively, where the first term is a symbol denoting an entity and the second a time point.
− There is an insertion and a deletion event type for each relationship type R. We represent them by the (n+1)-ary predicates Ins_R and Del_R, respectively, where the first n terms are symbols denoting entities and the (n+1)-th term is a time point.
A structural event type is base (derived) if the corresponding entity or relationship type is base (derived). External events induce directly only base structural events, while derived ones are induced indirectly by the derivation rules. For example, if we have in a schema the derivation rule Young(p,t) ← Person(p,t) ∧ Age(p,a,t) ∧ a ≤ 18, then an increase of a person’s age may induce a Del_Young event.
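As an illustration of how derived structural events follow from base ones, the sketch below (our own, with a hypothetical in-memory representation of the IB) evaluates the Young derivation rule quoted above at two consecutive time points and reports the induced derived structural events:

```python
# Information base: predicate name -> set of argument tuples whose last element is the time point.
ib = {
    'Person': {('ann', 1), ('ann', 2)},
    'Age':    {('ann', 18, 1), ('ann', 19, 2)},   # ann's age increases at t = 2
}

def young(t):
    """Derived entity type: Young(p,t) ← Person(p,t) ∧ Age(p,a,t) ∧ a ≤ 18."""
    return {p for (p, tp) in ib['Person'] if tp == t
              for (q, a, ta) in ib['Age'] if q == p and ta == t and a <= 18}

def derived_structural_events(t):
    """Ins_Young / Del_Young events induced between t-1 and t by the derivation rule."""
    before, after = young(t - 1), young(t)
    return ([('Ins_Young', p, t) for p in sorted(after - before)] +
            [('Del_Young', p, t) for p in sorted(before - after)])

print(derived_structural_events(2))   # [('Del_Young', 'ann', 2)]
```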
3 Temporal Conceptual Meta Schema and Information Base
All conceptual models have a conceptual meta schema (or, for short, meta schema). A meta schema is like an ordinary schema, except that its domain is a schema. The elements of a schema are seen as entities, which are instance of appropriate entity types defined in the meta schema. Similarly, relationships between the elements of a
schema are seen as instances of relationship types of the meta schema. The MIB has a representation of the schema. In this section, we first describe the minimal meta schema necessary to represent the core elements defined in the previous section. We then establish the correspondence rules between the MIB and the schema. In the last part of this section we describe how the meta schema can be enriched with other constructs.

[Figure 2 (UML class diagram) shows the meta entity types Concept (with subtypes BaseConcept and DerivedConcept), EntityType (with subtypes LexicalEntityType and Non-LexicalEntityType), RelationshipType, ExternalEventType and Participant, together with the meta relationship types HasPt, IsETPtOf and IsEvTPtOf.]
Fig. 2. Minimal conceptual meta schema (Part 1 of 2)
3.1 Minimal Meta Schema
Figures 2 and 3 illustrate, using the UML language [25], the minimal meta schema (note that, in the figures, we use partitions and cardinality constraints; however, these constructs will be translated into core elements in what follows). We now briefly define it in terms of the core elements explained in the previous section. For clarity, we sometimes call these elements meta elements, in order to distinguish them from schema elements.
− Entity types. The entity types are shown as rectangles in Figures 2 and 3. All entity types are non-lexical, except Formula, which is lexical. An instance of Formula is a formula in the language understood by the IP. For generality, we use here the FOL language. All entity types are base, except Concept, EntityType and Rule, which are derived. As we did in Section 2.1, in the MIB we represent by E(e,t) the fact that e is instance of meta entity type E at time t. For example, EntityType(Employee,T).
− Relationship types. The meta relationship types are shown as lines linking rectangles in Figures 2 and 3. All of them are binary and base. Note that a schema relationship type R may be of degree 2 or higher. R would be an instance of RelationshipType at T (Figure 2) and, if it has degree n, there would be n instances HasPt(R,Pi,T) in the MIB.
− Structural events. Meta structural events are defined as we did in Section 2.2. For clarity, we will prefix them with M_. For example, M_Del_EntityType, whose intended effect is that an entity type ceases to be instance of EntityType in the MIB.
− Derivation rules. There are three meta derivation rules, which define Concept, EntityType and Rule. We show here the one corresponding to Concept:
Concept(c,t) ↔ EntityType(c,t) ∨ RelationshipType(c,t) ∨ ExternalEventType(c,t)    MDR1
− Integrity constraints. A schema is a complex structure subject to a large number of integrity constraints. This implies that the meta schema must include many meta integrity constraints, usually called invariants in the schema evolution field [4]. We give below an example, which states that only existing rules and formulas can be related in IsExpressedBy. Other examples can be found in [17].
IsExpressedBy(r,f,t) → Rule(r,t) ∧ Formula(f,t)    MIC1
We define a compensating rule for MIC1, which states that when a rule is deleted so is the relationship, instance of IsExpressedBy, in which it participates:
M_Del_IsExpressedBy(r,f,t) ← M_Del_Rule(r,t) ∧ IsExpressedBy(r,f,t-1)
[Figure 3 (UML class diagram) shows the meta entity types Rule, Precondition, EffectRule, IntegrityConstraint, DerivationRule, Formula and ExternalEventType, together with the meta relationship types Requires, HasEffect, IsExpressedBy and IsCompensatedBy.]
Fig. 3. Minimal conceptual meta schema (Part 2 of 2)
− External event types. The minimal meta schema includes meta event types that allow the insertion and deletion of the entity and relationship types shown in Figures 2 and 3. For example, there could be an external event type NewIC. Occurrences of NewIC would create an instance of IntegrityConstraint (Figure 3) and the associated instance of IsExpressedBy. We will discuss them in the next section.

3.2 Correspondence Rules
We can now define the correspondence rules in our framework between the MIB and the schema:
• CR1. There is a one-to-one correspondence between the instances at time t of EntityType in the MIB and the elements of Et in the schema. If an instance of EntityType is also instance of BaseConcept (DerivedConcept) then it is base (resp., derived). If an instance of EntityType is also instance of LexicalEntityType (Non-LexicalEntityType) then it is lexical (resp., non-lexical).
• CR2. There is a one-to-one correspondence between the instances at time t of RelationshipType in the MIB and the elements of Rt in the schema. If an instance of RelationshipType is also instance of BaseConcept (DerivedConcept) then it is base
(resp., derived). If an instance R of RelationshipType is related (through HasPt) to n instances of Participant, then the degree of R in the schema is n+1.
• CR3. There is a one-to-one correspondence between the set of Formula in which IsExpressedBy the instances of DerivationRule at time t in the MIB and the elements of DRt in the schema.
• CR4. There is a one-to-one correspondence between the set of Formula in which IsExpressedBy the instances of IntegrityConstraint at time t in the MIB and the elements of ICt in the schema. If Ic is an instance of IntegrityConstraint, and IsCompensatedBy(Ic,Eff,t) and IsExpressedBy(Eff,F,t), then formula F is a compensating effect rule of Ic in the schema at t.
• CR5. There is a one-to-one correspondence between the instances at time t of ExternalEventType in the MIB and the elements of EEt in the schema. If an instance of ExternalEventType is also instance of BaseConcept (DerivedConcept) then it is base (resp., derived). If Ev is an instance of ExternalEventType, and Requires(Ev,Pre,t) and IsExpressedBy(Pre,F,t), then formula F is a precondition of Ev in the schema at t. Similarly, if Ev is an instance of ExternalEventType, and HasEffect(Ev,Eff,t) and IsExpressedBy(Eff,F,t), then formula F is an effect rule of Ev in the schema at t.
Correspondence rules similar to the above are present in one way or another in all frameworks with a reflective architecture. We emphasize the distinction between the MIP and the IP, make the temporal correspondence explicit, and leave undefined how the correspondence rules are implemented.

3.3 Conceptual Modeling Constructs
The above minimal meta schema is hardly satisfactory, because it does not take into account the large number of special constructs that have been developed in the conceptual modeling field. These constructs are useful because they ease the definition of the schema and, at the same time, allow the development of efficient implementations. In our framework, the modeling constructs are particular types in the meta schema. However, there is no direct correspondence between the "instantiation" of these constructs and the schema, but an indirect one. The meta schema includes a set of derivation rules that "translate" the constructs into elements of the minimal meta schema. The above correspondence rules need not be changed. In what follows we explain how to extend the minimal meta schema to incorporate the constructs in our framework. Additional details can be found in [17]. The general idea is to partition IntegrityConstraint and DerivationRule (Figure 3) into two subtypes: a base and a derived one (Figure 4). The base subtypes correspond to integrity constraints or derivation rules defined explicitly by the designer, while the derived ones are defined by derivation rules of the meta schema.

3.3.1 Cardinality Constraints. Figure 4 shows the conceptualization in the meta schema of the common cardinality constraints. In the MIB, these cardinalities are expressed as instances of the relationship types HasMinCard and HasMaxCard. We will focus only on the minimum cardinality; the maximum one is similar. Each minimum cardinality constraint is an instance of ICMinCard, which is a specialization of
DerivedIntegrityConstraint. A participant may have no minimum cardinality (i.e., zero), in which case there is no corresponding instance of ICMinCard. Entity type ICMinCard is derived. The corresponding derivation rule is:
ICMinCard(s,t) ← HasMinCard(p,int,t) ∧ s = sym(ICMinCard,p)
MDR2
where sym(ICMinCard,p) is a function that denotes a time-independent symbol (s), distinct from any other used in the MIB. The MIP could, for example, create this symbol when symbol p is created. Now, we need to define the Formula in which the constraint IsExpressedBy. We refine the base relationship type between Rule and Formula to a derived one between ICMinCard and Formula. The derivation rule is: IsExpressedBy(s,f,t) ← ICMinCard(s,t) ∧ ICMinFormula(s,f,t)
MDR3
In MDR3, ICMinFormula(s,f,t) is a predicate that gives in f the string representation of the corresponding formula. For a binary relationship type R, the general form of f for one of its participants (E1) is "E1 (e1,t) → min ≤ |{e2 | R(e1,e2,t)}| ", where E1 would be given by IsETPtOf (or IsEvTPtOf), min by HasMinCard and R by HasPt. We omit here the details of predicate ICMinFormula because they are a lengthy sequence of trivial string manipulation operations. Note that, by MDR2 and MDR3, the MIB will have a relationship IsExpressedBy(s,f,t) during the period in which p exists as participant and has a nonzero minimum cardinality. By CR4, during this period f ∈ ICt, that is, f will be a schema integrity constraint, to be enforced by the IP.
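The paper deliberately omits the string-manipulation details of ICMinFormula. Purely as an illustration of what such a translation could look like, the Python sketch below builds the minimum-cardinality constraint formula for one participant of a binary relationship type from a hypothetical dictionary-based MIB fragment (the layout and names are our own assumptions); the generated string follows the general form quoted above.

```python
def ic_min_formula(rel, participant, mib):
    """Build the min-cardinality constraint for one participant of a binary
    relationship type, following the general form quoted in the text:
    "E1(e1,t) → min ≤ |{e2 | R(e1,e2,t)}|"."""
    e1 = mib['IsETPtOf'][participant]            # entity type of this participant
    minimum = mib['HasMinCard'][participant]     # its minimum cardinality
    return f"{e1}(e1,t) → {minimum} ≤ |{{e2 | {rel}(e1,e2,t)}}|"

# Hypothetical MIB fragment for a binary relationship type Teaches(Teacher, Course),
# whose Teacher participant has minimum cardinality 1.
mib = {
    'IsETPtOf':   {'p_teacher': 'Teacher'},
    'HasMinCard': {'p_teacher': 1},
}
print(ic_min_formula('Teaches', 'p_teacher', mib))
# Teacher(e1,t) → 1 ≤ |{e2 | Teaches(e1,e2,t)}|
```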
[Figure 4 (UML class diagram) shows IntegrityConstraint partitioned into BaseIntegrityConstraint and DerivedIntegrityConstraint, with ICMinCard and ICRefInt as derived integrity constraints; Participant carries HasMinCard and HasMaxCard relationships to Integer, and the constraints are related to Formula through IsExpressedBy.]
Fig. 4. Meta schema conceptualization of cardinality and referential integrity constraints
3.3.2 Referential Integrity Constraints. A referential integrity constraint must be defined for each participant of a base relationship type. We assume that for derived relationship types the corresponding derivation rule is correct and that, therefore, so are the types of the participants in its instances. To represent these constraints, we define in the meta schema (see Figure 4) a derived subtype of DerivedIntegrityConstraint, called ICRefInt, with the following derivation rule:
ICRefInt(s,t) ← HasPt(r,p,t) ∧ BaseConcept(r,t) ∧ s = sym(ICRefInt,p)
MDR4
where sym is again a function that gives us a time-independent symbol, that in this case denotes the referential integrity constraint related to a participant. We refine again IsExpressedBy to a derived type, in this case between ICRefInt and Formula: IsExpressedBy(s,f,t) ← ICRefInt(s,t) ∧ ICRefIntFormula(s,f,t)
MDR5
where ICRefIntFormula is a predicate that gives in f the string representation of the corresponding formula (this formula depends on the relationship type of the participant being synchronous or asynchronous, see [21] for more details, but for the sake of simplicity in this paper we are considering only synchronous relationship types). For a binary relationship type R, the general form of f for one of its participants (E1) is "R(e1,e2,t) → E1(e1,t)", where R would be given by HasPt, and E1 by IsETPtOf (or IsEvTPtOf).

3.3.3 Partitions. Partitions are an important modeling construct in many conceptual models [20]. All generalizations and specializations can be transformed into partitions. We are going to see that partitions are represented in the core elements of a schema by zero or one derivation rule, and zero or more integrity constraints. Figure 5 shows the conceptualization of partitions in the meta schema.
[Figure 5 (UML class diagram) shows the meta entity types Partition, Concept (with BaseConcept and DerivedConcept), the derived derivation rule DRPart, the derived integrity constraints ICCompl, ICDisj and ICSubs, and Formula, together with the meta relationship types IsSuperOf, IsTheSup, IsASub, Defines and IsExpressedBy.]
Fig. 5. Meta schema conceptualization of partitions
We allow the partition of any Concept. Thus, we deal also with partitions of relationship types [2]. A concept may be the superconcept (IsTheSup) of several partitions. A partition has only one superconcept, but it may have one or more subconcepts. We assume that a concept may be the subconcept (IsASub) of zero or only one partition. The concepts involved in a partition may be base or derived. Almost all combinations can be acceptable. One of the exceptions is when all concepts are base, because if all subconcepts are base, then the superconcept must be derived. For example, if P1 is a partition with super Person and subs Man and Woman, and these are base, then Person must be derived, since it is the "union" of Man and Woman. In the meta schema, there is a relationship type (Defines) between Partition and DerivedConcept. An instance Defines(P,C,T), where C must be a derived concept and the super of P, indicates that, at time T, concept C is derived as the union of the subconcepts of P. In the above example, we could have Defines(P1,Person,T). Entity type Person may be the super of another partition P2 (for example, into the base entity
types Married and Unmarried) but it is defined only by one of them, because the other definition would be redundant. Partitions require several meta integrity constraints. See [17] for some of them. In the schema, we define a disjointness integrity constraint for each concept that is a sub of a partition (for flexibility, we allow a partition with only one subtype, as in Figure 5; in this particular case, disjointness constraints obviously do not apply). In the above example, a Man cannot be a Woman, and the inverse. These integrity constraints are modeled as instances of ICDisj. Entity type ICDisj is derived, with derivation rule:
ICDisj(s,t) ← IsASub(c,p,t) ∧ s = sym(ICDisj,c) ∧ IsASub(c',p,t) ∧ c ≠ c'
MDR6
The Formula in which ICDisj IsExpressedBy is given by the derivation rule: IsExpressedBy(s,f,t) ← ICDisj(s,t) ∧ ICDisjFormula(s,f,t)
MDR7
where predicate ICDisjFormula gives in f the string representation of the formula. As an example, the general form of f for a base relationship type R1 in a partition of a binary relationship type R into R1,..., Rn is "R1(e1,e2,t) → ¬ (R2(e1,e2,t) ∨ ... ∨ Rn(e1,e2,t))". In the schema, we define also a subset integrity constraint for each subconcept, except when the partition defines a derived superconcept. For example, if Person is a derived concept, defined by the partition into Man and Woman and also partitioned into base concepts Married and Unmarried, then we must ensure that all instances of Married or Unmarried are also instances of Person. The general form of an integrity constraint for a base entity type E1 in a partition of a derived entity type E into E1,..., En would be "E1(e,t) → E(e,t)". These integrity constraints are modeled as instances of the derived entity type ICSubs using two meta derivation rules similar to the preceding ones. Finally, we also include in the schema a completeness integrity constraint for each partition, except when the partition defines a derived superconcept. In the above example, we must ensure that all instances of Person are also instance of Married or Unmarried. The general form of f for a base entity type E partitioned into E1,..., En would be "E(e,t) → (E1(e,t) ∨...∨ En(e,t))". These integrity constraints are modeled as instances of the derived type ICCompl, using other two meta derivation rules. Now, we describe the schema derivation rules implied by partitions. If a derived concept is defined by a partition, then the schema must include its derivation rule. We model it as an instance of DRPart, which is a derived entity type subtype of DerivationRule (Figure 5). Its derivation rule is: DRPart(s,t) ← Defines(p,c,t) ∧ s = sym(DRPart,c)
MDR8
Note again the use of sym(DRPart,c), which denotes a symbol we use to refer to the derivation rule. We refine the base relationship type IsExpressedBy between Rule and Formula to a derived one between DRPart and Formula. The derivation rule is: IsExpressedBy(s,f,t) ← DRPart(s,t) ∧ DRPartFormula(s,f,t)
MDR9
where predicate DRPartFormula gives the string representation of f. The general form of f in a partition of a derived entity type E into E1,..., En is "E(e,t) ← (E1(e,t) ∨ ... ∨ En(e,t))".
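To illustrate the translation of a partition into core schema elements, the following Python sketch (ours; the function and its in-memory partition description are hypothetical) generates the disjointness constraints and either the subset and completeness constraints or the derivation rule of the superconcept, following the general forms quoted in this subsection (the entity-type disjointness form is written by analogy with the relationship-type form given above).

```python
def partition_formulas(sup, subs, defines_sup):
    """Schema formulas implied by a partition of entity type `sup` into `subs`.
    If the partition defines a derived superconcept (defines_sup=True), it yields
    a derivation rule instead of subset and completeness constraints."""
    formulas = []
    for s in subs:                                   # one disjointness constraint per sub
        others = [o for o in subs if o != s]
        if others:
            formulas.append(f"{s}(e,t) → ¬ ({' ∨ '.join(o + '(e,t)' for o in others)})")
    union = ' ∨ '.join(s + '(e,t)' for s in subs)
    if defines_sup:
        formulas.append(f"{sup}(e,t) ← ({union})")   # DRPart: derivation rule of the super
    else:
        formulas += [f"{s}(e,t) → {sup}(e,t)" for s in subs]   # ICSubs
        formulas.append(f"{sup}(e,t) → ({union})")             # ICCompl
    return formulas

# Person derived and defined by the partition into the base types Man and Woman:
for f in partition_formulas('Person', ['Man', 'Woman'], defines_sup=True):
    print(f)
# Man(e,t) → ¬ (Woman(e,t))
# Woman(e,t) → ¬ (Man(e,t))
# Person(e,t) ← (Man(e,t) ∨ Woman(e,t))
```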
4 Schema Evolution
In our framework, schema evolution is performed by meta external events. When the designer wants to change a schema, she generates an instance of the appropriate meta external event type. The corresponding effect rules will induce meta structural events, which change the MIB. By the correspondence rules (Section 3.2), such changes imply changes to the schema. The set of external event types included in the meta schema defines the possible schema evolutions, either applied individually or by means of their composition. There is no other way to evolve a schema. Given that usually a schema may change in many different ways, it is necessary to include many external event types in the meta schema. For example, [1] defines 38 possible F2 schema changes. More are needed for richer schemas. We cannot show here the complete set of needed external event types. Rather, we are going to show how external event types and their effects can be defined in our framework.
Some meta external event types change only the schema, without affecting the IB. For example, M_RemoveIntegrityConstraint could have only the effect rule:
M_Del_BaseIntegrityConstraint(ic,t) ← M_RemoveIntegrityConstraint(rm,t) ∧ RemovesIC(rm,ic,t)
which, by CR4, would remove ic from the set of schema integrity constraints ICt. Note that, in this case, the relationship IsExpressedBy between ic and its formula at t would be deleted by the compensating rule of MIC1 (Section 3.1).
Other meta external event types require changing both the schema and the IB. We can distinguish two cases here: when the required changes to the IB are given by the typing rule (Section 2.1), and when they must be defined by explicit effect rules. An example of the former could be M_RemoveRelType, with appropriate preconditions and the effect rule:
M_Del_RelationshipType(r,t) ← M_RemoveRelType(rm,t) ∧ RemovesRT(rm,r,t)
which, by CR2, would remove r from the set of schema relationship types Rt and, by the typing rule, would delete the existing instances of r at t in the IB. We will show a more complete example of this case in Section 4.1 below.
In other cases, the changes implied by the typing rule are not enough. We need to define an appropriate external event type in the schema, generate an occurrence of it and notify the IP of this occurrence. We generate "on the fly" an instance EvT of ExternalEventType, without parameters, and then we notify the IP of an occurrence of EvT. We will use in the meta effect rules the notation "EvT ←" to mean that the MIP notifies the IP of an occurrence of EvT. We will show a detailed example of this case in Section 4.2 below.
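As a minimal sketch of this event mechanism — assuming a hypothetical dictionary-based MIB in which time stamps are omitted for brevity — the following Python code applies the effect rule of M_RemoveIntegrityConstraint quoted above and the compensating rule of MIC1, and records the induced meta structural events:

```python
# Hypothetical MIB fragment: predicate name -> set of fact tuples (time omitted for brevity).
mib = {
    'BaseIntegrityConstraint': {('ic7',)},
    'IsExpressedBy':           {('ic7', 'f7')},
}

def m_remove_integrity_constraint(ic, t, log):
    """Effect rule: M_Del_BaseIntegrityConstraint(ic,t) ←
    M_RemoveIntegrityConstraint(rm,t) ∧ RemovesIC(rm,ic,t)."""
    mib['BaseIntegrityConstraint'].discard((ic,))
    log.append(('M_Del_BaseIntegrityConstraint', ic, t))
    # Compensating rule of MIC1: a deleted rule also drops the IsExpressedBy link it participates in.
    for (r, f) in list(mib['IsExpressedBy']):
        if r == ic:
            mib['IsExpressedBy'].discard((r, f))
            log.append(('M_Del_IsExpressedBy', r, f, t))

events = []
m_remove_integrity_constraint('ic7', 42, events)
print(events)   # the induced meta structural events
```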
4.1 M_RemoveEntityType
We start with a rather simple external event type, M_RemoveEntityType. It has only one parameter, given by the relationship type Removes between that type and Non-LexicalEntityType. A particular instance of M_RemoveEntityType, occurring at time T and removing E, represents the design decision that, from time T, E ceases to be instance of EntityType and, therefore, the IB must not record the instances of entity type E. However, E may again be instance of EntityType later. E may be base or derived. For illustration purposes, we only deal here with the simple case when E does not participate in any relationship type and is not involved in any partition.
Preconditions. We define first the preconditions. One of them is that E is a non-lexical entity type at time T-1. Formally,
M_RemoveEntityType(rm,t) ∧ Removes(rm,e,t) → Non-LexicalEntityType(e,t-1)
The if-part of the rule serves only to declare that an instance of the event type has occurred and to give names to its parameters. We omit this part in the next rules. Then, the other preconditions are:
¬∃participant IsETPtOf(e,participant,t-1)
¬∃partition IsTheSup(e,partition,t-1)
¬∃partition IsASub(e,partition,t-1)
Effect rules. The first effect rule induces the meta structural event that deletes E from Non-LexicalEntityType:
M_Del_Non-LexicalEntityType(e,t) ← M_RemoveEntityType(rm,t) ∧ Removes(rm,e,t)
As we did before, we omit the if-part of this rule in the next ones. Now, we must remove E from BaseConcept or from DerivedConcept:
M_Del_BaseConcept(e,t) ← BaseConcept(e,t-1)
M_Del_DerivedConcept(e,t) ← DerivedConcept(e,t-1)
Finally, if E is derived, we must delete its derivation rule:
M_Del_DerivationRule(s,t) ← DerivedConcept(e,t-1) ∧ s = sym(DerivationRule,e)
Induced effects on the MIB. The above effect rules are the only ones that need to be defined. We now reason (informally) about the induced effects on the MIB:
− By the MDRs described in Section 3.1 (e.g. MDR1), e ceases to be instance of Concept and EntityType. The derived structural events M_Del_Concept(e,t) and M_Del_EntityType(e,t) are induced. Also, M_Del_DerivationRule induces M_Del_Rule. By the compensating rule of MIC1, M_Del_IsExpressedBy(s,f,t) is induced if IsExpressedBy(s,f,t-1).
Induced effects on the schema and IB. We now reason about the induced effects on the schema and the IB:
− By CR1, e is not an entity type in the schema at t (that is, e ∉ Et).
− By the typing rule, all instances of e at t-1 cease to be instances of e at t.
− By CR3, all instances of DerivationRule that have been deleted at t cease to exist in the schema at t.
Remarks. The removal of an entity type, or any other schema change, can indirectly affect integrity constraints, derivation rules, or external event types defined explicitly in the schema by the designer. Some changes in those elements can be needed. The analysis of those changes is a very complex task [15, 27] that is outside the scope of this paper.

4.2 M_ChangeParticipantToSET
We now deal with a complex external event type, M_ChangeParticipantToSET ("ToSET" stands for "To a Super or a Sub Entity Type"). It has two parameters, given by the relationship types Changes, with Participant, and HasNewType, with EntityType. An occurrence M_ChangeParticipantToSET(CP,T) of this event type, with Changes(CP,P,T) and HasNewType(CP,NT,T), represents the fact that, from time T, the participant P of a relationship type R changes its type to a supertype or a subtype, the entity type NT. This means that, from T, instances of R are only valid if their corresponding participant is instance of NT.
[Figure 6 (UML class diagram) shows the relationship type Teaches between Teacher and Course, partitioned into Lectures and Participates, and the entity type Teacher partitioned into Professor and Assistant, with Lectures relating Professor to Course after the change.]
Fig. 6. An example of changing a participant in a relationship type
We consider the general case in which the relationship type has any degree, is base or derived, is a subtype in zero or one partition, and a supertype in any number of partitions. Changing a participant of a binary relationship type (attribute) has been studied extensively in the literature. However, we have not found a study of this change in our general setting. If the relationship type is base, some of its instances may become invalid after the change. We have two possible strategies here: either to reject the change or to delete the invalid instances explicitly. The two possibilities must be offered to the designer. We could define a different external event type for each strategy, or add a new parameter to the event type, indicating the desired strategy [27]. In the following, we assume the existence of two different external event types, and present here the one that performs the strategy of deleting the invalid instances explicitly. In that case, if the relationship type is involved in a partition, the deletion of some of its instances must be propagated through the partition hierarchy. For base types, the propagation must be
done explicitly. For derived types, the associated derivation rules might have to be corrected in the right way. On the other hand, if the relationship type changed is derived, the corresponding derivation rule needs to be changed too, so the invalid instances will be deleted implicitly. This is a more complex change than the one exposed in Section 4.1, so for clarity we present here only an example. The complete formalization of the general case can be found in [17].
Assume a university information system (see Figure 6) with two base entity types, Teacher and Course, and also a base relationship type Teaches between Teacher and Course, partitioned into two subtypes, Lectures (base) and Participates (derived), with the same participants as the supertype. A teacher participates in a course if she teaches but does not lecture it. Any teacher can lecture courses or participate in them. Assume now that the domain changes, and Teacher is transformed into a derived type, partitioned into Professor and Assistant. Now, among other changes, only a professor can be the lecturer of a course. That implies changing the participant of the relationship type Lectures from Teacher to Professor. The effects of this change, in our framework, would be the following:
The type of the participant must be explicitly changed in the MIB:
M_Del_IsETPtOf(Teacher, P, T)
M_Ins_IsETPtOf(Professor, P, T)
with P being the symbol representing the participant being changed, and T the time instant when the change is done. As Lectures is base, P has a referential integrity constraint that must change. This change is performed automatically by MDR5, which induces:
M_Del_IsExpressedBy(S, 'Lectures(tch,c,t) → Teacher(tch,t)', T)
M_Ins_IsExpressedBy(S, 'Lectures(tch,c,t) → Professor(tch,t)', T)
with S = sym(ICRefInt,P). Some of the instances of Lectures may violate the above new integrity constraint. In that case, they must cease to exist. As Lectures is base, we must define a new external event type in the conceptual schema to explicitly eliminate its invalid instances, and also to eliminate them as instances of Teaches, since it is a base supertype of Lectures. We use the special function newSymbol() to obtain new symbols in the MIB, when needed. The explicitly defined effects in the MIB must be the following:
et = newSymbol()
M_Ins_ExternalEventType(et,T)
er1 = newSymbol()
M_Ins_EffectRule(er1,T)
M_Ins_HasEffect(et,er1,T)
M_Ins_IsExpressedBy(er1, 'Del_Lectures(tch,c,t) ← Lectures(tch,c,t-1) ∧ ¬Professor(tch,t-1)', T)
er2 = newSymbol()
M_Ins_EffectRule(er2,T)
M_Ins_HasEffect(et,er2,T)
M_Ins_IsExpressedBy(er2, 'Del_Teaches(tch,c,t) ← Del_Lectures(tch,c,t)', T)

Finally, we have to notify the IP of an occurrence of the external event type we have just defined: et ←

Remarks. As pointed out at the beginning of this section, this event type can require the change of some derivation rules, corresponding to derived relationship types of the partition hierarchy. Since we think that this kind of change must be modeled as an independent external event type, in this case the composition of instances of both event types would be needed to achieve the whole desired change.
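The two effect rules above are ordinary deductive rules over the temporal information base, so their intended behaviour can be simulated directly. The following Python fragment is a hedged illustration only (the framework itself expresses these rules in the logic language of the conceptual schema); the fact sets and time instants are invented for the example.

```python
# Evaluating the two generated effect rules over example fact sets at time T.
# Del_Lectures(tch,c,t) <- Lectures(tch,c,t-1) and not Professor(tch,t-1)
# Del_Teaches(tch,c,t)  <- Del_Lectures(tch,c,t)
lectures  = {("anna", "databases", 9), ("bob", "logic", 9)}   # Lectures facts
professor = {("anna", 9)}                                     # Professor facts

T = 10
del_lectures = {(tch, c, T) for (tch, c, t) in lectures
                if t == T - 1 and (tch, t) not in professor}
del_teaches = set(del_lectures)

print(del_lectures)   # {('bob', 'logic', 10)} -- bob is not a professor at T-1
print(del_teaches)    # the deletion is propagated to the supertype Teaches
```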
5 Conclusions
We have presented a framework that allows the definition of the evolution of information systems at the conceptual level. The framework is based on a reflective architecture, with two loosely coupled conceptual processors. Our framework allows the definition of all conceptual modeling constructs in the meta schema; this has been illustrated with three representative constructs. It also allows the definition of any evolution operation, which has been illustrated by two particular operations, analyzed in detail.
Our work can be extended in at least five directions. First, we could define, in the meta schema, other modeling constructs (aggregation, materialization, etc.). Second, we could develop the complete (both schema-level and instance-level [8]) set of meta external event types, for the minimal meta schema and for the considered constructs. Other complex external event types could be added to that set to facilitate evolution for the designer [6]; of course, this should be done on the basis of existing work. Third, a prototype implementation could be developed, preferably using two processors of different types. In view of practical applications, the two processors should be loosely coupled, as suggested by our framework. Fourth, that prototype could include a tool to assist and guide the designer in the definition and analysis of evolution operations [27, 9], for instance by warning her about external event types possibly affected by a particular schema change. Finally, building on top of existing knowledge and experience, one could attempt the development of an industrial system, with several possible degrees of ambition.
Acknowledgments
The authors are indebted to Dolors Costal, Cristina Gómez, Camilo Ocampo and Toni Urpí, and to the anonymous referees, for their helpful comments. This work has been partially supported by CICYT program projects TIC99-1048-C02-1 and TEL99-0335C04-02.
References
1. Al-Jadir, L., Léonard, M.: Multiobjects to Ease Schema Evolution in an OODBMS. In: Proc. ER'98, Singapore, LNCS 1507, Springer (1998) 316-333
2. Analyti, A., Spyratos, N., Constantopoulos, P.: Property Covering: A Powerful Construct for Schema Derivations. In: Proc. ER'97, LNCS 1331, Springer-Verlag (1997) 271-284
3. Boman, M., Bubenko jr., J.A., Johannesson, P., Wangler, B.: Conceptual Modelling. Prentice Hall (1997)
4. Banerjee, J., Chou, H-T., Garza, J.F., Kim, W., Woelk, D., Ballou, N.: Data Model Issues for Object-Oriented Applications. In: ACM TOIS Vol. 5, No. 1, January (1987) 3-26
5. Bertino, E., Ferrari, E., Guerrini, G.: T_Chimera: A Temporal Object-Oriented Data Model. TAPOS 3(2) (1997) 103-125
6. Brèche, P.: Advanced Primitives for Changing Schemas of Object Databases. In: 8th Intl. Conf. CAiSE'96, LNCS 1080, Springer (1996) 476-495
7. Bubenko jr., J.A.: The Temporal Dimension in Information Modelling. In: Architecture and Models in Data Base Management Systems, North-Holland (1977)
8. Casais, E.: Managing Class Evolution in Object-Oriented Systems. In: D.C. Tsichritzis (ed.): Object Management (1990) 133-195
9. Erwald, C.A., Orlowska, M.E.: A procedural approach to schema evolution. In: Proc. CAiSE'93, LNCS 685, Springer (1993) 22-38
10. Goralwalla, I., Szafron, D., Özsu, T., Peters, R.: A Temporal Approach to Managing Schema Evolution in Object Database Systems. In: Data & Knowledge Eng. 28(1), October (1998) 73-105
11. Hainaut, J-L., Englebert, V., Henrard, J., Hick, J-M., Roland, D.: Database Evolution: the DB-MAIN Approach. In: 13th Intl. Conf. on the Entity-Relationship Approach - ER'94, LNCS 881, Springer-Verlag (1994) 112-131
12. ISO/TC97/SC5/WG3: Concepts and Terminology for the Conceptual Schema and Information Base. J.J. van Griethuysen (ed.), March (1982)
13. Jarke, M., Gallersdörfer, R., Jeusfeld, M.A., Staudt, M., Eherer, S.: ConceptBase - a deductive object base for meta data management. In: Journal of Intelligent Information Systems, 4(2) (1995) 167-192
14. Lemke, T., Manthey, R.: The Schema Evolution Assistant: Tool Description. IDEA.DE.22.O.004, University of Bonn (1995)
15. Lemke, T.: Schema Evolution in OODBMS: A Selective Overview of Problems and Solutions. IDEA.WP.22.O.002, University of Bonn (1994)
16. Lemke, T.: DDL = DML? An Exercise in Reflective Schema Management for Chimera. IDEA.WP.22.O.003, University of Bonn (1995)
17. López, J.R., Olivé, A.: A Framework for the Evolution of Temporal Conceptual Schemas of Information Systems - Extended Version. LSI-00-14-R, Department of Software (LSI), Universitat Politècnica de Catalunya (2000)
18. Manthey, R.: Beyond Data Dictionaries: Towards a Reflective Architecture of Intelligent Database Systems. In: DOOD'93, Springer-Verlag (1993) 328-339
19. Nguyen, G.T., Rieu, D.: Schema evolution in object-oriented database systems. In: Data & Knowledge Eng. 4 (1989) 43-67
20. Olivé, A., Costal, D., Sancho, M.-R.: Entity Evolution in IsA Hierarchies. In: Proc. ER'99, LNCS 1728, Springer-Verlag (1999) 62-80
21. Olivé, A.: Relationship Reification: A Temporal View. In: Proc. CAiSE'99, LNCS 1626, Springer (1999) 396-410
22. Peters, R.J., Özsu, T.: Reflection in a Uniform Behavioral Object Model. In: Proc. ER'93, Arlington, LNCS 823, Springer-Verlag (1993) 34-45
23. Peters, R.J., Özsu, T.: An Axiomatic Model of Dynamic Schema Evolution in Objectbase Systems. In: ACM TODS Vol. 22, No. 1, March (1997) 75-114
24. Roddick, J.F.: Schema Evolution in DataBase Systems - An Updated Bibliography. In: ACM SIGMOD Rec., 21(4), May (1994) 35-40
25. Rumbaugh, J., Jacobson, I., Booch, G.: The Unified Modeling Language Reference Manual. Addison-Wesley (1999)
26. Tansel, A., Clifford, J., Gadia, S. et al.: Temporal Databases: Theory, Design and Implementation. Benjamin/Cummings (1993)
27. Zicari, R.: A Framework for Schema Updates in Object-Oriented Database System. In: Bancilhon, F., Delobel, C., Kanellakis, P. (eds.): Building an Object-Oriented Database System - The Story of O2. Morgan Kaufmann Pub. (1992) 146-182
Distance Measures for Information System Reengineering

Geert Poels, Stijn Viaene, and Guido Dedene

Management Information Systems Group, Department of Applied Economic Sciences, Katholieke Universiteit Leuven, Naamsestraat 69, B-3000 Leuven, Belgium
{geert.poels, stijn.viaene, guido.dedene}@econ.kuleuven.ac.be
Abstract. We present an approach to assess the magnitude and impact of information system reengineering caused by business process change. This approach is based on two concepts: object-oriented business modeling and distance measurement. The former concept is used to visualize changes in the business layer of an information system architecture. The latter concept is used to quantify these changes. The paper also describes the application of our approach in the context of front-office system design.
1 Introduction
There exists a wide spectrum of reasons for information system reengineering. The evolution of an information system over many years frequently leads to software that is unnecessarily complex and inflexible, making the maintenance and enhancement of the system more and more expensive [1]. Often, legacy systems are reengineered because of a desire to move to new generations of software technology, like component software [2]. Changes in the computer and network infrastructure may also trigger reengineering efforts. The focus of this paper is, however, on a different type of reengineering, i.e. information system reengineering because of changing business processes. Major strategic management decisions regarding business re-positioning and drastic business transformations (i.e. BPR as defined in [3], [4]) require the current business process(es) to be changed and the existing information system to be modified in order to further support the business operations. A characteristic of this type of information system reengineering is that it is fundamental. It affects the enterprise model, which is the core layer in an object-oriented information system architecture [5]. The enterprise model is an abstract image of business reality (either a single business process or a network of interrelated processes), capturing the relevant business entities, events and rules, their static relationships and dynamic interactions. In the implemented software system, the classes of the enterprise model are responsible for the business functionality that is offered by the information system. Reengineering the information system is crucial for the overall success of a business process change. One aspect to consider is the reengineering cost. Promised benefits must be balanced against this cost, preferably before the actual change(s) take(s) place. In this paper we present an approach to assess the magnitude and
impact of information system reengineering, more precisely the amount of changes in the enterprise model. An assessment of the effect of 'business process change'-driven reengineering on the core layer of an information system provides complementary quantitative information (apart from financial measures like ROI, etc.) to support strategic decision making with respect to business re-positioning and business transformation. Measurement allows assessing how 'big' the changes are, and thus provides an objective means to compare alternatives. Our approach is based on two concepts: business modeling and distance measurement. First, a conceptual modeling method, called object-oriented business modeling by contract [6], is used to model the AS-IS enterprise model (i.e. the model before the business process change(s) take(s) place) and the TO-BE enterprise model (i.e. the model as it would look like after the business process change(s)). Comparing these models allows visualizing the changes that would be caused by reengineering. Next, the distance between the models is measured to quantify the magnitude of the changes. The basic assumption underlying our approach is that the larger the distance between the AS-IS and TO-BE models, the larger the amount of changes that must be handled, and thus the larger the impact and cost of information system reengineering. This paper is organized as follows. Section 2 discusses previous work on distance measurement in the context of software reengineering. Section 3 presents the modeling and measurement aspects of our approach. In section 4 we discuss some experiences in using our approach in the context of front-office system design. Conclusions and topics for further research are presented in section 5.
2 Related Work
Distance measurement is of course not a new concept in the software reengineering field. Such measurements have been used to cluster components of a software system into subsystems or modules, for instance, to improve the modularity of the system [7], or to reverse engineer the system (i.e. design recovery) [8]. They have also been used to assess the cohesion of subsystems [9], again with the purpose of reengineering the system. In other software engineering related fields, distance measurement has been proposed as a technique for component reusability assessment [10] and component retrieval [11]. Generally, these approaches use distance measurement to assess the closeness between two software entities in terms of the properties they have in common and the properties that are unique to each of the entities. Most measures of 'closeness' take the form of a normalized similarity (or affinity) measure. Given a set of software entities E (e.g. classes in an object-oriented system), a set of properties P (e.g. the union of class methods in the system), and a function p mapping entities of E into subsets of P (e.g. all methods defined in a class or used by the class), the degree of similarity between A ∈ E and B ∈ E with respect to P is measured by

Sim(A,B) = |p(A) ∩ p(B)| / |p(A) ∪ p(B)| .    (1)
This measure returns the relative amount of common properties between A and B. The higher this value, the more similar A and B are with respect to P.
It is obvious that Sim works fine for purposes like cluster analysis and component matching. However, we need a measure for the distance (and not the closeness) between two software entities. In other words, we are not interested in the properties that A and B have in common, but in the properties in which they are different. A measure that puts more emphasis on the difference between A and B is the dissimilarity measure Dis as proposed, for instance, in [9]:

Dis(A,B) = ( |p(A) ∪ p(B)| − |p(A) ∩ p(B)| ) / |p(A) ∪ p(B)| .    (2)
This measure however does not express the magnitude of the dissimilarity between A and B. The measure is normalized and thus implicitly accounts for the relative amount of common properties between A and B. For our research purpose we need to quantify the absolute amount of differences between entities A and B, regardless of how many properties they have in common. An alternative measure is the distance measure Dist:

Dist(A,B) = |p(A) ∪ p(B)| − |p(A) ∩ p(B)| = |p(A) − p(B)| + |p(B) − p(A)| .    (3)
This measure returns the cardinality of the symmetric difference between p(A) and p(B). It focuses upon the changes that are needed to turn entity A into entity B, and vice versa. Our approach to distance measurement uses measures of the form Dist. The approach is different from previous research in the sense that distance measures satisfy the metric axioms [12]. Working with metrics offers the additional advantage that the measures can be formally validated within the framework of measurement theory, as required by software measurement scientists [13], [14], [15].
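For concreteness, the three measures can be written down in a few lines of code. The sketch below is illustrative only (the paper does not prescribe an implementation), and the property sets used in the example are hypothetical.

```python
# Similarity, dissimilarity and distance over property sets p(A) and p(B),
# following formulas (1)-(3); assumes at least one of the sets is non-empty.
def sim(pa: set, pb: set) -> float:
    return len(pa & pb) / len(pa | pb)

def dis(pa: set, pb: set) -> float:
    return len(pa ^ pb) / len(pa | pb)        # normalized symmetric difference

def dist(pa: set, pb: set) -> int:
    return len(pa ^ pb)                       # = |p(A)-p(B)| + |p(B)-p(A)|

pa = {"borrow", "return", "renew"}            # hypothetical property sets
pb = {"borrow", "return", "reserve", "cancel"}
print(sim(pa, pb), dis(pa, pb), dist(pa, pb)) # 0.4 0.6 3
```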
3 A Distance-Based Change Assessment Approach
We first present a brief introduction to object-oriented business modeling by contract. Next, a generic method to define distance measures for software entities is presented and illustrated for OO business models.

3.1 OO Business Modeling by Contract
Object orientation has proven an excellent paradigm to model business processes [16], [17]. Object-oriented enterprise models improve the communication between business professionals and software engineers. Moreover, many OOAD methods (see e.g. [18], [19]) develop information systems starting from such models. The enterprise model does not only describe the functioning of a business process; it is an integral part of the specifications of the information system that supports the business process. The object classes, identified during business modeling, take a prominent place amongst the classes in the OO information system, once it is implemented. Therefore, a comparison of the AS-IS and TO-BE enterprise models provides insight
into business process changes as well as information system changes. Moreover, this insight is gained before the actual changes take place. The object-oriented business modeling by contract method [6] places much emphasis on the concept of a 'business event', in so far as it has been classified as an event-driven method [20]. At the highest level of abstraction, a business process is seen as a sequence of occurrences of business events. The business entities affected by these occurrences are modeled as business objects. Business objects and events are further classified into types and subtypes (i.e. generalization / specialization). Mathematically, each business object type is defined through the set of business event types it is involved in. Object life cycle models are used to specify sequence constraints on the participation of business objects into business events. They thus model an important class of business rules. The communication and synchronization between objects is modeled by means of common event participation (i.e. event broadcasting instead of message passing). This principle also allows modeling wellknown structural relationships between business object types (e.g. categorization, aggregation, composition, association) in terms of existence dependency associations and/or contract object types.1 The main advantage of the existence dependency concept is that the consistency between the static and dynamic aspects of the business model can be formally verified.2 At lower levels of abstraction, business object types are given a class definition, which can be gradually refined throughout the development process. Generalization / specialization relationships between business object types are specified by means of inheritance relationships between the corresponding classes. Existence dependency associations are implemented through the data abstraction mechanism (e.g. by means of attributes that are used as pointers to existence dependent and/or master classes). The end result is a fully (and formally) specified object-oriented model of the business process that acts at the same time as the kernel of a layered architecture for the information system [22]. As an illustration, part of the specification for a (simplified) library's loan circulation process is shown below. Fig. 1 presents the class diagram in UML notation [23]. Table 1 is an object-event association matrix, showing which business object types / classes are affected by the occurrence of business events. The symbols "C", "M", and "E" indicate respectively that the occurrence of the business event (of the type shown in the row header) creates, modifies or ends the life of a business object (of the type shown in the column header).
1 A business object x is existence dependent on a business object y if x is during its life always associated to y (i.e. x cannot be created before y and it can no longer exist when the life of y has ended). In a library for instance, each LOAN object is always associated to a particular BOOK object. Therefore we say that the object type LOAN is existence dependent on the object type BOOK. For the existence dependent object type (e.g. LOAN), the association with the master object type (e.g. BOOK) is mandatory with a connectivity (cardinality, multiplicity) of one.
2 The semantic integrity between the static business model (e.g. class diagram) and the dynamic business model (e.g. object-event association matrix) is guaranteed when the existence dependent object type is a subset of the master object type. Note that at this level of abstraction business object types are defined as sets of business event types. For a formal proof we refer to [21].
[Class diagram: business object types MEMBER, ITEM, RESERVATION, LOAN, NOT RENEWABLE LOAN, RENEWABLE LOAN, VOLUME and COPY, connected by existence dependency associations with multiplicities 1 and 0..*.]
Fig. 1. Class diagram of a library
3.2 Distance Measures for OO Business Models
Our approach to define distance measures for software entities has been fully described elsewhere [24]. Here, we just present the basic principles. Two software entities can be different in many aspects. For each aspect that is considered, a model or abstraction of the software entity is defined. The distance between software entities A and B with respect to aspect X is then measured by the count of elementary transformations (i.e. atomic changes of a given type) that are minimally needed to transform the model of A (for X) into the model of B (for X). The more elementary transformations that are needed, the larger the distance between A and B, with respect to aspect X. As an example, the distance between business object types VOLUME and COPY can be expressed in terms of their involvement in business event types (cf. Table 1). The relevant model of an object type is here the set of event types it is involved in (i.e. all event types having their entries marked in the column for the object type in the objectevent association matrix). Elementary transformations are of the type ‘adding an event type’ or ‘removing an event type’. It can easily be seen that for VOLUME and COPY it takes minimally 12 such transformations. Hence, their distance with respect to business event type involvement is 12. The measure thus defined is of the form Dist as presented in section 2. The aspect of distance X, i.e. business event type involvement, corresponds here to the set of properties P in the general definition of Dist. The function p referred to in this definition, maps the object types A and B into their mathematical definition. Hence, the sets p(A) - p(B) and p(B) - p(A) contain the business event types that have to be added or removed by means of elementary transformations. Consequently, the sum of the cardinalities of these sets equals the minimum number of elementary transformations that are needed to transform object type A into object type B, or vice versa.
Table 1. Object-event association matrix for a library
[Matrix: the rows are the business event types Acquire, acq_vol., acq_cop., Catalogue, Sell, sell_vol., sell_cop., Reserve, Cancel, Fetch, Register, Leave, Borrow, cr_nrloan, cr_rloan, Return, Lose, Lose_vol., Lose_cop., Renew; the columns are the business object types ITEM, VOLUME, COPY, RESERVATION, MEMBER, LOAN, NOT_RENEWABLE_LOAN and RENEWABLE_LOAN; a cell marked "C", "M" or "E" indicates that an occurrence of the event type creates, modifies or ends the life of an object of the column's type.]
We have defined sets of elementary transformation types for different kinds of models like sets, multi-sets, matrices, state machines, etc. Similar notions for distance between trees, strings, clusters, etc. can be found in the literature [25], [26], [27]. An important result of our research is that this particular way of defining a distance measure results in a function that satisfies the metric axioms. The good news is that metric functions fit into the framework of measurement theory. According to [12], metrics are homomorphic mappings of proximity structures into metric spaces, i.e. they map an empirical notion of distance into a mathematical notion of distance. Another result from measurement theory is that metrics define ordinal scales of distance. They allow distance values to be ranked, which is useful when comparing alternatives.
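The metric claim is easy to sanity-check mechanically. The following sketch is not part of the authors' tooling; it simply samples random set-valued models and verifies non-negativity, identity of indiscernibles, symmetry and the triangle inequality for the symmetric-difference count used above.

```python
# Property check: Dist (a count of elementary set transformations) behaves
# like a metric on randomly sampled set-valued models.
import random

def dist(a: frozenset, b: frozenset) -> int:
    return len(a ^ b)

universe = list(range(10))
models = [frozenset(random.sample(universe, random.randint(0, 10)))
          for _ in range(25)]
for a in models:
    for b in models:
        assert dist(a, b) >= 0 and (dist(a, b) == 0) == (a == b)
        assert dist(a, b) == dist(b, a)
        for c in models:
            assert dist(a, c) <= dist(a, b) + dist(b, c)
print("metric axioms hold on the sampled models")
```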
In [28] we further showed that distance measures based on counts of elementary transformations represent a special type of proximity structure, i.e. segmentally additive proximity structures. Such a representation results in the definition of ratio scales of distance, which allows expressing distance values in ratios and percentages [12]. It should be noted that metrics focus on the difference between two entities, without regard to how much these entities have in common. We believe such a point of view is justified for the type of application described in this paper: to quantify the amount of change in the enterprise model due to business process changes.
4 Applying the Approach to a Reference Framework for Front-Office System Design
We applied our approach to a reference framework for front-office system design [29]. This framework concerns the organization of the front-office, i.e. the part of a service organization where the services required by a customer and offered by the service provider are agreed upon. The framework is based on the concept of service customization, as proposed by the management scientists Lampel and Mintzberg [30]. It distinguishes five types of front-office depending on the level of service customization: pure standardization, segmented standardization, customized standardization, tailored customization, and pure customization. Each type of front-office requires its own specific information system to support its specific information requirements. The framework proposes an object-oriented business model for each type of front-office. These models can be used as reference models for actual front-office system design. The framework of de Vries is useful for companies wishing to introduce a front-office organization and its supporting information system. It is also useful as a strategic management instrument for changing the service specification process. Companies wishing to move to higher levels of service customization can use the framework to reengineer their front-office system. However, before such a move is decided on, companies must have an idea of the impact of the reengineering. This is a question that can be addressed by our distance-based change assessment approach. In a first sub-section we present the generic front-office enterprise models of de Vries. The models in [29] were already specified using the object-oriented business modeling by contract method. For the sake of brevity, we only present here the static business models, i.e. class diagrams in UML notation. In the next sub-section we propose a distance measure for UML class diagrams. In a final sub-section the measurement results are presented and analyzed.

4.1 Generic Front-Office Object Models
Overall, the front-office needs product-information, process-information and information on the customer-relationship. Table 2 shows for each level of service customization and corresponding front-office type the type of information that is needed.
Table 2. Information model of front-office customer interaction of de Vries [29]

Front-office type          Degree of customization      Relation-related                    Product-related            Process-related
Counter                    Pure standardization         Anonymous transactions              End products               Delivery times for products
One stop shop              Segmented standardization    Characteristics of market segments  Assortments                Delivery times for assortments
Field and inside service   Customized standardization   Customer profiles                   Standard components        Available capacity
Control room               Tailored customization       Development of the relationship     Smallest replicating unit  Capacity assignment
Symbiosis                  Pure customization           Opportunities for partnership       Design knowledge           Implementation and outsourcing opportunities

Fig. 2 presents the 'counter' model. The information needed for standardized service transactions can be encapsulated in the SUPPLIER, SERVICE and TRANSACTION classes in the diagram. For instance, descriptive attributes of the SERVICE class include the service functionality description, price and warranty conditions and service procedure descriptions. Transaction amounts and timestamps are descriptive attributes of the TRANSACTION class.
[Class diagram: Supplier, Service, Transaction and Customer, with existence dependency associations (1 to 0..*) linking Supplier-Service, Service-Transaction and Customer-Transaction.]
Fig. 2. The counter model
A 'one-stop-shop' offers a specific assortment of services to customers depending on the market segment to which they belong (Fig. 3). The essential front-office processes are the determination of market segments and assortments. Information like segmentation criteria, assortment discount rates, etc. can readily be encapsulated in the front-office enterprise classes.
[Class diagram: Market Segment, AssortmentSegment, CustomerSegment, Assortment (Trans. Type), Customer, Supplier, Grouping, Transaction, Service, Representative and Customer Contact, connected by 1 : 0..* existence dependency associations.]
Fig. 3. The one-stop-shop model
According to de Vries et al. the 'one-stop-shop' model unfolds from the 'counter' model. The primary effects of customization are the concepts of segmentation and assortments. The 'field and inside service' type of front-office is a clear extension of the 'one-stop-shop' (Fig. 4). To offer customized standardization the front-office is organized into a field service (e.g. sales people) that is supported by an inside service. The former is responsible for the customer relationships, whereas the latter is responsible for profiling, matching, and the bundling and offering of services. Figs. 5 and 6 show the models of the 'control room' and 'symbiosis' types of front-office. The models clearly show that the emphasis shifts from product and process related information to the customer relation. The 'control room' front-office aims to establish a structural link with commercially attractive customers by means of tailored customization. Within the bounds of the standard service design and delivery process, the front-office representative and the customer specify the service to be provided. In the 'symbiosis' model the service provider and the customer collaborate completely in the various steps of designing, acquiring, and producing customized services.

4.2 A Distance Measure
A distance measure is needed for the UML class diagrams of Figs. 2 to 6. Note that these diagrams are built from only two types of elements: classes and existence dependency associations between classes. Note also that all associations are characterised by the same connectivity constraints (i.e. mandatory with a connectivity of 1 on one side and optional with a connectivity of many on the other side).
[Class diagram: Market Segment, Customer Profile, AssortmentSegment, SegmentProfile Match, CustomerProfile, Assortment (Trans. Type), Customer, Supplier, Grouping, Transaction, Standard component, Representative and Customer Contact, connected by 1 : 0..* existence dependency associations.]
Fig. 4. The field and inside service model
A set of elementary transformation types that is sufficient to express all different types of atomic change in the UML class diagrams considered here is the following:
t1: remove a class from the diagram;
t2: add a class to the diagram;
t3: remove an association;
t4: add an association.
The distance between any pair of front-office types, with respect to their static business models (i.e. the UML class diagrams of Figs. 2 to 6), is measured by the minimum number of elementary transformations of the types t1 to t4 that are needed to transform one model into the other. The application of a distance measure for software entities may require syntactic, semantic and/or linguistic rules to decide on two entities being identical or not. We assume here that strict class labeling conventions have been followed, such that classes with the same name can be considered as identical. By convention, if part of the class name is between brackets, then only this part is used for matching classes. Associations are identical across models if the participating classes are also identical: if classes C and C' are associated in model Mi, and class C' is replaced by class C" in model Mj, without changing the connectivity constraints of the association with C, then we also consider this association to be the same in Mi and Mj. Generally however, removing a class implies that all the associations it is involved in are removed as well.
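Under the simplifying assumption that a diagram can be encoded as a set of class names plus a set of (undirected) associations between matched classes, the t1-t4 count reduces to two symmetric differences. The sketch below is illustrative only; the two diagrams are small hypothetical fragments, not transcriptions of the figures in the paper.

```python
# Distance between two class diagrams as the number of t1-t4 transformations:
# |classes added or removed| + |associations added or removed|.
def diagram_distance(d1, d2):
    (c1, a1), (c2, a2) = d1, d2
    return len(c1 ^ c2) + len(a1 ^ a2)

as_is = ({"Supplier", "Service", "Transaction", "Customer"},
         {frozenset({"Supplier", "Service"}),
          frozenset({"Service", "Transaction"}),
          frozenset({"Customer", "Transaction"})})
to_be = ({"Supplier", "Service", "Transaction", "Customer", "Assortment"},
         {frozenset({"Supplier", "Service"}),
          frozenset({"Assortment", "Service"}),
          frozenset({"Assortment", "Transaction"}),
          frozenset({"Customer", "Transaction"})})
print(diagram_distance(as_is, to_be))  # 1 class change + 3 association changes = 4
```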
4.3 Analysis and Discussion of Measurement Results
Table 3 shows the distance measurements between all possible pairs of generic front-office object models. As an example, consider the distance between the 'control room'
model (Fig. 5) and the 'symbiosis' model (Fig. 6). The table shows that 12 atomic changes are needed to transform one model into the other, or vice versa. Starting from the 'control room', the classes STANDARD COMPONENT, STANDARD SERVICE, TRANSACTION, REPRESENTATIVE and CUSTOMER CONTACT, and the associations STANDARD SERVICE - GROUPING, STANDARD SERVICE - TRANSACTION, TRANSACTION - CUSTOMER CONTACT and REPRESENTATIVE - CUSTOMER CONTACT are removed, whereas the classes RESOURCE and ACTION, and the association ACTION - GROUPING are added to obtain the 'symbiosis' front-office model.

[Class diagram: Customer, Control (Link), Standard service (Trans. Type), Supplier, Grouping, Transaction, Standard component, Representative and Customer Contact, connected by 1 : 0..* existence dependency associations.]
Fig. 5. The control room model
Table 3 can be used as a complementary evaluation instrument by companies wishing to reengineer their front-office and supporting front-office information system, in order to move to another level of service customization. Suppose for instance that a service organization with a ‘counter’ office wishes to offer customized services. The values of Table 3 suggest that a move towards segmented or customized standardization requires a far greater impact on the front-office enterprise model than a move towards the highest levels of service customization. The values also suggest that companies that gradually move towards the highest levels of service customization will face more changes in the beginning of this process than in the end. Note further the effect of the ‘triangle inequality’, i.e. one of the metric axioms. For instance, whereas the distance between the ‘one stop shop’ and the ‘control room’ is 14 and the distance between the ‘control room’ and the ‘symbiosis’ model is 12, the distance between the ‘one stop shop’ and the ‘symbiosis’ model is only 24, i.e. less than 14 + 12. The ‘strict’ triangle inequalities observed in Table 3 strongly suggest that it might be sub-optimal to move the level of service customization one step at a time. A more drastic reengineering of the current front-office may pay off in the long term. The values in Table 3 must of course be interpreted with care. Moving to another level of service customization and re-organizing the front-office requires more than changing the front-office information system. Besides, the values only reflect the amount of change required for the enterprise model layer in the architecture of the front-office system. In the absence of further (empirical) studies, we can only assume
that the impact on the other layers of the system architecture is proportional to the amount of enterprise model changes. The same remark holds for the reengineering costs. It is for instance assumed that each type of elementary transformation involves the same reengineering cost, which is of course only an approximation of reality. Finally, note that the generic object models of Figs. 2 to 6 are to be seen as domain models that must be instantiated for individual companies. The actual front-office enterprise models might thus be different from the domain models proposed in the reference framework. As a consequence, the distance values might be different too.

[Class diagram: Customer, Project (Link), Supplier, Grouping, Action and Resource, connected by 1 : 0..* existence dependency associations.]
Fig. 6. The symbiosis model

Table 3. Distance values for the generic front-office object models in the reference framework of de Vries
                           counter   one stop shop   field and inside service   control room   symbiosis
counter                       0           22                  28                     18            14
one stop shop                22            0                  16                     16            24
field and inside service     28           16                   0                     16            28
control room                 18           14                  16                      0            12
symbiosis                    14           24                  28                     12             0
5 Conclusions and Further Research Topics
The approach presented in this paper provides information regarding the impact and magnitude of information system reengineering caused by business process change. Measuring the distance between the AS-IS and TO-BE models of a business process helps quantify the amount of change that is needed to reengineer the supporting information system. We must note that further research, mainly empirical in nature, is needed to relate this modeled and measured amount of change to management variables like reengineering costs, migration costs, and risks (e.g. potential data loss).
We also acknowledge that measuring static enterprise models, as in section 4, gives only one view on the complex problem of 'business process change'-driven information system reengineering. A balanced approach requires measuring a whole array of static and dynamic product models, as well as process models, workflow models, etc. Our distance-based modeling and measurement approach can be applied in other contexts too. In [24] a method is proposed to measure software attributes (e.g. coupling, cohesion, size) in terms of distances between software product models, that emphasize such attributes, and 'reference' models, that represent 'ideal' models for the attributes. In [31] this method has been used to measure the reuse of object-oriented business models. A topic of future research is to use distance measurement in object-oriented business models for the identification of reusable business (software) components [32]. In our opinion and experience, the concepts of distance and metric are both flexible and formal, allowing them to be used in a variety of software (re)engineering contexts.
Acknowledgements
Geert Poels is Postdoctoral Fellow of the Fund for Scientific Research - Flanders (Belgium) (F.W.O.). Stijn Viaene holds the KBC Insurance Research Chair in Management Information Systems at the Katholieke Universiteit Leuven. We wish to thank the anonymous referees for their valuable comments.
References 1. Ciupke, O.: Automatic Detection of Design Problems in Object-Oriented Reengineering. In: Proc. TOOLS'99. Santa Barbara, Calif. (1999) 18-32 2. Sahraoui, H.A., Melo, W., Lounis, H., Dumont, F.: Applying Concept Formation Methods to Object Identification in Procedural Code. In: Proc. 12th Int'l Automated Software Eng. Conf. (ASEC'97). Incline Village, Nev. (1997) 210-218 3. Davenport, T.H., Short, J.E.: The New Industrial Engineering: Information Technology and Business Process Redesign. Sloan Management Review. 31 (1990) 11-27 4. Hammer, M., Champy, J.: Reengineering the Corporation, A Manifesto for Business Revolution. Harper, New York (1993) 5. Zachman, J.A.: A framework for information architecture. IBM Systems J. 26 (1987) 276292 6. Snoeck, M., Dedene, G., Verhelst, M., Depuydt, A.: Object-Oriented Enterprise Modelling with MERODE. University Press, Leuven (1999) 7. Brito e Abreu, F., Peirera, G., Sousa, P.: Reengineering the Modularity of Object-Oriented Systems. In: Proc. ECOOP'98, Workshop Reader, Workshop on Techniques, Tools and Formalisms for Capturing and Assessing the Architectural Quality in Object-Oriented Software. Brussels (1998) 62-63 8. Tzerpos, V., Holt, R.C.: Software Botryology: Automatic Clustering of Software Systems. In: Proc. Int'l Workshop on Large Scale Software Composition. Vienna (1998) 9. Simon, F., Löffler, S., Lewerentz, C.: Distance Based Cohesion Measuring. In: Proc. 2nd European Software Measurement Conf. Amsterdam (1999) 69-83 10.Castano, S., De Antonellis, V., Zonta, B.: Classifying and Reusing Conceptual Schemas. In: Proc. 11th Int’l Conf. on Conceptual Modelling (ER'92). Karlsruhe (1992) 121-138
11.Jilani, L.L., Mili, R., Mili, A.: Approximate Component Retrieval: An Academic Exercise or a Practical Concern? In: Proc. 8th Workshop on Institutionalising Software Reuse (WISR8). Columbus, Ohio (1997) 12.Suppes, P., Krantz, D.M., Luce, R.D., Tversky, A.: Foundations of Measurement: Geometrical, Threshold, and Probabilistic Representations. Academic Press, San Diego, Calif. (1989) 13.Briand, L., El Emam, K., Morasca, S.: Theoretical and Empirical Validation of Software Product Measures. Tech. rep. ISERN-95-03, International Software Engineering Network (1995) 14.Fenton, N., Pfleeger, S.L.,: Software Metrics: A Rigorous and Practical Approach. International Thomson Computer Press, London (1997) 15.Zuse, H.: A Framework for Software Measurement. Walter de Gruyter, Berlin (1998) 16.Wang, S.: OO Modeling of Business Processes: Object-Oriented Systems Analysis. Information Systems Management. (1994) 36-43 17.Dedene, G., Snoeck, M.: Generic Object Models and Business Process (Re)Design. Tech. rep. DTEW 9667, Catholic University of Leuven (1996) 18.Jacobson, I., et al.: Object-Oriented Software Engineering, A Use Case Driven Approach. Addison-Wesley, Reading, Mass. (1992) 19.D’Souza, D.F., Wills, A.C.: Objects, Components, and Frameworks with UML: The Catalysis Approach. Addison-Wesley, Reading, Mass. (1998) 20.Simons, A.J.H., Snoeck, M., Hung, K.S.Y.: Design Patterns as Litmus Paper to Test the Strength of Object-Oriented Methods. In: Proc 5th Int’l Conf. on Object Oriented Information Systems (OOIS’98). Paris (1998) 21.Snoeck, M., Dedene, G.: Existence Dependency: the key to semantic integrity between structural and behavioural aspects of object types. IEEE Trans. Software Eng. 24 (1998) 233-251 22.Poels, G.: Evaluating the Modularity of Model-Driven Object-Oriented Software Architectures, In: Proc. ECOOP’98, Workshop Reader, Workshop on Techniques, Tools and Formalisms for Capturing and Assessing the Architectural Quality in Object-Oriented Software. Brussels (1998) 52-53 23.Booch, G., Rumbaugh, J., Jacobson, I.: The Unified Modeling Language User Guide. Addison-Wesley, Reading, Mass. (1999) 24.Poels, G., Dedene, G.: Distance-based software measurement: necessary and sufficient properties for software measures. Information and Software Technology. 42 (2000) 35-46 25.Zhang, K., Shasha, D.: Simple fast algorithms for the editing distance between trees and related problems. Siam J. on Computing. 18 (1989) 1245-1262 26.Oommen, B.J., Zhang, K., Lee, W.: Numerical Similarity and Dissimilarity Measures Between Two Trees. IEEE Trans. on Computers, 45 (1996) 1426-1434 27.Tzerpos, V., Holt, R.C.: MoJo: A Distance Metric for Software Clusterings, In: Proc. 6th Working Conf. on Reverse Engineering (WCRE’99). Atlanta (1999) 28.Poels, G., Dedene, G.: Modelling and measuring object-oriented software attributes with proximity structures, In: Proc. 3rd Int'l Workshop on Quantitative Approaches in ObjectOriented Software Eng. Lisbon (1999) 1-22 29.de Vries, E.J., Maes, R., Dedene, G., Viaene, S., Poels, G., Snoeck, M.: Object Models for Customer Relations in the Front-Office. Tech. rep. PrimaVera 98-11, University of Amsterdam (1998) 30.Lampel, J., Mintzberg, H.: Customizing Customization, Sloan Management Review. 38 (1996) 31.Snoeck, M., Poels, G., Dedene, G.: Reusing Business Models. Tech. rep. DTEW 9934, Catholic University of Leuven (1999) 32.Poels, G., Dedene, G.: Moving from OOAD to COAD. In: Proc. 8th Object Technology Conf. (OT’2000). Oxford (2000)
StateLator - Behavioral Code Generation as an Instance of a Model Transformation

Thomas Behrens1 and Simon Richards2

1 Johann Wolfgang Goethe-Universität, Datenbanken und Informationssysteme, Robert Mayer-Str. 11-15, D-654 Frankfurt a.M., Germany
[email protected]
2 Alpheus Solutions Ltd., 6, Lombard St., London, EC3V 9EA, England
[email protected]
Abstract. A key issue in software engineering is the repeatable and rapid transition of software models into executable code. We have been faced with the specific problem to produce behavioral code from a UML State Diagram in the context of Enterprise Distributed Object Computing. Rather than associate code templates with the meta model elements of a StateMachine, we represent the StateMachine through a micro architecture based on design patterns represented as newly created model elements. The Object Constraint Language is used to specify these newly created model elements. From this specific solution (StateLator) we infer the generalized mechanism of a Model Transformation that algorithmically captures a transformation from a higher to a lower abstraction level in a software model and hence makes this process flexible and repeatable.
1 Introduction
Significant effort in software engineering is spent on the evolution of a software model into an executable implementation. Evolving a model is necessary because a large number of model constructs on the higher levels of abstraction do not map directly onto low-level programming language constructs. This problem can be addressed from two sides: bottom-up, by extending programming environments through architectural layers [1], frameworks [2], or language extensions [3]; top-down, by either elaborating or transforming the model until it can directly be mapped onto a programming language. The resulting effort has two primary causes. First, the large number of suitable solutions is embedded in an even larger number of unsuitable solutions. In practice, due to the large number of problem parameters, it is often difficult to predict in advance whether a solution is suitable or unsuitable. Second, similar evolution steps have to be applied for a large number of similar constructs. A significant part of this evolution is still of an elaborational nature, based on a "best-practices" approach, resulting in time-consuming and error-prone activities.
In this paper we present a mechanism that we refer to as Model Transformation to support the top-down approach. This mechanism is derived from a solution to the particular problem in the domain of Enterprise Distributed Object Computing (EDOC), namely to generate behavioral code from a StateMachine in a UML model. Behavioral code generation demonstrates the strength of the Model Transformations mechanism in two ways. First, behavioral code generation is not well supported within current modeling tools, specifically not in EDOC. Secondly, the results of applying the transformation are clearly visible within the generated code. Generation of behavioral code is not, per se, the innovative aspect discussed here. Translations from a state diagram to executable code exist for various programming languages and target architectures. The main difference is that in contrast to directly creating a low level implementation target, the presented approach translates StateMachine related model constructs into a semantically equivalent micro architecture on the UML model level. The created model elements in turn can be directly mapped into a program language. This mapping can then be realized through the code generation function of any standard tool. This particular Model Transformation has been implemented as a prototype (StateLator), which allows us to discuss experience gained with this approach. A further aspect of our discussion is dedicated to the representation of the Model Transformation. As described, the result of applying the Model Transformation is a set of newly created model elements, or meta model instances. We show how the Object Constraint Language (OCL), which is part of the UML specification, can be used to specify these model elements. The paper is organized as follows. In the remainder of the introduction, we define some terms and present related work. The main presentation is divided into two major parts. The first part presents StateLator, the specific instance of a Model Transformation, the second part the generalized mechanism. We believe that both abstraction levels are equally important. At the end we summarize our findings and look at future work. The discussion throughout the paper is restricted to UML1 models. Given its evolution into the most widely practically applied modeling language since its acceptance through the OMG, this is not seen as a limitation. The prerequisite for the application of the concept in general, though, is accessibility to the underlying meta model. We use the following terms throughout this paper in the meaning defined here. We refer to the process of creating a lower level abstraction from a higher level abstraction as Evolution or synonymously as Transition. These terms do not imply how this process is performed. Elaboration is then defined as a manual evolution and Transformation, or synonymously Translation, as an automated evolution, captured by an algorithm. The terms alone do not suggest any implementation technique. Architecture generally refers to the system level of design, addressing structural issues of component compositions and mechanisms [4]. We use the term though in the sense of a micro-architecture [5], as we feel that the term object-oriented design (see [6]) does not suffice to address the organization of structures at this level. 1
in Version 1.1, where not explicitly stated otherwise.
[State Diagram: states Initial, Available, OnPromotion, Rented and Final; entry/exit/do actions unregister(), logLeavingPremise() and doPromotion(); events decommission, promote, demote(newRatePerDay:double), rentCarFor(theRentalCarClient:RentalCarClient) with action RentedBy = theRentalCarClient, and returnCar with actions RentedBy = null and ^RentedBy.creditBonusPoints.]
Fig. 1. The StateMachine represents the behavioral specification of the RentalCar class
The work on StateCharts [7] provides the basis for StateLator. We use some of the semantic ambiguities, [8] and [9], to show the flexibility of StateLator. The basic ideas for the application of pattern based program design can be recognized in the work of many other researchers and practitioners. The work of [1] can now be said to have entered mainstream software development [11]. In contrast to the manual evolution of such pattern architectures, we recommend the automated induction through the Model Transformation. [12] provides a description of an automated solution, that is based on design pattern transformation. The focus of this work is the maintenance cycle, it requires the code to be reverse engineered into a class diagram (our model level) in order to retrospectively apply automated primitive transformations. Where we recommend the use of OCL as a standardized language for syntax and semantics, a proprietary transformation language is used, in which only the syntax of the primitives is expressed. The Model Transformation approach allows for traceability, model consistency and integrity of different models, e.g. [13], describes how different software development artifacts can be linked and kept consistent. The need to address these requirements manifests itself in the software development practice and has triggered a number of practical recommendations, specific to UML, in [14] and [15]. In contrast to the latter procedural solutions, we present with the Model Transformation an algorithmic approach on the model level. In a number of places we refer to the work of Shlaer/Mellor [16], [17], [18]. Central to their object oriented approach is the method of Recursive Design. Recursive Design is focusing on an automated transition of an analysis model through a design (architecture) model into an implementation with strong emphasis on the autogeneration of the code as a final step. The automation is achieved through Transformation Rules, which transform the conceptual entities into instances of the architectural objects and further architectural entities into archetype code (instances of code templates).
A typographic note: Words with special meaning in the context of the presented examples are printed in capitalized one-word base font (DefinedClass). General multi-word technical terms are printed capitalized (Model Transformation). Where necessary, elements of the UML meta model are printed in capitalized one-word Courier font (StateMachine) to emphasize their UML semantics.
2 StateLator - Behavioral Code Generation
It is the authors' experience that very little behavioral (or dynamic) modeling takes place within most software development projects, especially in the area of EDOC. Powerful modeling techniques, such as StateChart Diagrams2 as a visual representation of a StateMachine, are seldom used to their full potential. The authors believe that a principal reason is the inability of most CASE tools to generate behavioral code. Further, some software processes, such as the Rational Unified Process [19], indicate, incorrectly in our view, that most entity objects should not be modeled through State Diagrams, declaring them to be dumb and data-heavy objects. Based on our experience, much more attention should be paid to State Diagrams in EDOC. This has been summarized concisely in [2], page 287, as "StateMachines are efficient, simple, adaptable and understandable". A StateMachine can be used to specify the dynamics of instances of classes, use cases, or the entire system. We restrict our approach to the dynamics of a class, which is probably the most common use [9]. In order to encourage this use in our projects, we were investigating flexible possibilities to integrate behavioral code generation with an existing tool. The result is StateLator, a utility that transforms a UML model to provide a low level design abstraction for a StateMachine. In this part we introduce the specific problem that StateLator addresses through an example. This is followed by an explanation of the micro architecture we use to represent a StateMachine. We then show how OCL can be used to specify a Model Transformation, followed by a description of the prototype implementation.
2.1 Example "Rental Car"
This section demonstrates with an example how a StateMachine can be represented on a lower abstraction level through a set of meta model instances, i.e. we present a possible micro architecture for the executability of a StateMachine. Fig. 1 shows the State Diagram of the RentalCar class (DefinedClass3), which is shown in the context with a RentalCarClient in Fig. 2. It is the result of the entity specification of an analyst. This specification would normally be manually translated
2 We abbreviate StateChart Diagrams as State Diagrams.
3 DefinedClass in the sense that it is the class whose behavior is semantically defined by the State Diagram.
into the programming language of choice. StateLator transforms the RentalCar class and the underlying StateMachine into a set of new model elements.
[Class diagram: RentalCar (RatePerDay : double = 100, Id : String = "00000"; - doPromotion(), - logLeavingPremise(), - unregister()) linked to RentalCarClient (CustomerId : String; + creditBonusPoints()) by the "rented by" association (role +RentedBy, multiplicities 0..* and 0..1).]
Fig. 2. A static specification for the RentalCar class
The output of the transformation is shown in Fig. 3 and Fig. 4. We use the StateMachine to derive the protocol, the Operations and the Methods. The latter will be possible, if the modeler has attached Actions and Events to Transitions. We restrict triggering Events to represent CallEvents, which allows us to specify Operations as shown in Fig. 3. We have chosen this approach as most programming languages support the concept of an operation natively. Operations with the Stereotype <<SMGenerated>>4 are created by StateLator (e.g. decommission()). Fig. 4 shows the pattern based architecture for the execution model of the StateMachine. We chose to create the new classes within the scope of the RentalCar class and thereby achieved unobtrusiveness and also addressed potential namespace issues. Model elements created in this way are static and can be easily translated into a programming language. This translation (code generation) is provided by most tools.

[Class diagram: RentalCar enriched with the attributes SC : StateContainer = new StateContainer() and $ theInstances : RentalCarState[], the private operation smInitialize(), a constructor RentalCar(), and the <<SM Generated>> operations decommission(), rentCarFor(), promote(), demote() and returnCar(); the "rented by" association to RentalCarClient (CustomerId : String, creditBonusPoints()) is unchanged.]
Fig. 3. StateLator transforms the static business specification and the StateMachine into an enriched static implementation specification
4 The Stereotype is used here to indicate the Tagged Value of SMGenerated = true. It is actually not used in the implementation, but used here merely for ease of presentation.
The exception, in this case RentalCarStateException, is used to indicate a violation of the protocol, following the semantics that unexpected events are not tolerated [8]. The identification of the model elements that have been created by the Model Transformation is based on the UML concept of Tagged Values. The identification is required in order to be able to identify the created model elements for any subsequent re-application of StateLator, especially, if the State Diagram has changed.
2.2 State Pattern - Codification through Abstractions
We chose to employ the State Pattern as a micro architecture in order to compose the larger architecture of a StateMachine. This architecture is a UML abstraction of a StateMachine in a low-level design target, independent of a programming language. There are several derivatives of the state pattern, as presented in [21]. A very detailed description can be found in [22].
Fig. 4. A micro pattern architecture represents the output of the transformation for StateLator (the classes RentalCarState, StateContainer, the <<Exception>> RentalCarStateException, concrete state classes such as Available_1_2, Available_2_3 and Final_1_7, and OnPromotion_2_4_Thread as the doActivity representation are all created within the scope of RentalCar)
Based on the forces and the design goals, the "Pure State Pattern" has been chosen. It separates the "pure" behavior, placed in the class responsible for the transitions (PureStateClass), from the state information, which remains in the DefinedClass. The state class can therefore be re-used for all instances of the DefinedClass. StateLator uses the State Pattern as a proven architecture and applies it recursively to composite states (StateMachines). The result is an architecture that is both resilient and adaptable, and its widespread application brings consistency and standardization to the overall system design.
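To make the micro architecture concrete, the following Java sketch shows the shape of a "pure" State Pattern for the RentalCar example. The class and method names echo Fig. 3 and Fig. 4, but the sketch is illustrative only and omits the state container, the doActivity threads and the nested states that StateLator actually generates.

// Protocol violations follow the "unexpected events are not tolerated" semantics.
class RentalCarStateException extends RuntimeException {
    RentalCarStateException(String event) { super("Unexpected event: " + event); }
}

// Pure state classes carry behavior only, so one instance serves all cars.
abstract class RentalCarState {
    RentalCarState rentCarFor(RentalCar car) { throw new RentalCarStateException("rentCarFor"); }
    RentalCarState returnCar(RentalCar car)  { throw new RentalCarStateException("returnCar"); }
}

class Available extends RentalCarState {
    static final Available INSTANCE = new Available();
    @Override RentalCarState rentCarFor(RentalCar car) { return Rented.INSTANCE; }
}

class Rented extends RentalCarState {
    static final Rented INSTANCE = new Rented();
    @Override RentalCarState returnCar(RentalCar car) { return Available.INSTANCE; }
}

// The defined class keeps the state information and delegates events to it.
class RentalCar {
    private RentalCarState state = Available.INSTANCE;
    public void rentCarFor() { state = state.rentCarFor(this); }
    public void returnCar()  { state = state.returnCar(this); }
}

Because the state classes are stateless singletons, they can be shared by all RentalCar instances, which is the re-use property exploited by the "pure" variant of the pattern.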
2.3 Model Transformation Specification

In the previous sections we presented the "RentalCar" example and demonstrated the application of our chosen architecture. It now remains for us to define how the new model elements relate to the existing elements. It is necessary to specify the Model Transformation, which captures the creation of this architecture algorithmically. In this section we introduce the vocabulary and techniques that allow us to capture this specification. The focus is primarily the specification of the resulting meta model instances, the declarative aspect, not so much the programmatic aspect. We use OCL [23] as the specification language and demonstrate its usage with an excerpt from StateLator. In section 3.2 we discuss the motivation for OCL as well as its limitations.

Any non-trivial transformation such as StateLator cannot in general be presented as one single large specification; it becomes unmanageable. Hence we first show how this single specification can be split into smaller, manageable specifications. This makes the overall approach scalable and allows for the re-use of specifications. We therefore represent the Model Transformation through sub-transformations. A Model Transformation can be represented as a function

    f: uml → uml,    (1)

where uml is an instance of a UML model represented by its meta model instances. We can choose to represent the Model Transformation by a sequence of functions

    fReferenceName: uml → uml,    (2)

referred to as sub-functions. f can then be represented as the composition

    f = fReferenceName_1 ° ... ° fReferenceName_i,  i ∈ N.    (3)
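The following Java sketch illustrates this purely functional view of a Model Transformation; the Model type is a placeholder for the set of meta model instances of a UML model, not a real UML API.

import java.util.List;
import java.util.function.UnaryOperator;

interface Model { /* placeholder for the meta model instances of a UML model */ }

final class Transformations {
    /** f = f_n ° ... ° f_1, applied to the model in the given order. */
    static UnaryOperator<Model> compose(List<UnaryOperator<Model>> subFunctions) {
        return model -> {
            Model current = model;
            for (UnaryOperator<Model> f : subFunctions) {
                current = f.apply(current); // each sub-function maps a model to a model
            }
            return current;
        };
    }
}

Because input and output have the same type, sub-functions written independently can be chained freely, which is what makes the decomposition into sub-transformations work.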
We represent the UML model uml in a Model Transformation function f and in the sub-functions fReferenceName through a specific meta model element. This element provides the context element for the related OCL expressions. However, it does not limit access to any other model elements, because of the associations defined in the UML meta model and the navigability along those associations. Only a coverage analysis of all individual Model Transformation functions would allow a statement about the subset of the UML model that is required to perform the transformation. For practical purposes this is an important analysis, as larger models, for example, are generally segregated for change management. As indicated above, it is not our goal to present the complete specification of the Model Transformations for StateLator here. Instead, Table 1 provides a high-level overview of the main sub-functions, represented by their context element and the created model elements.
(In the case of StateLator, the context element is a meta model instance of type Class. This Class does not necessarily contain all Classes that represent the type of an AssociationEnd in an Association with the context class, although they are required for the specification.)
Table 1. Model Transformation Summary
Context Element: Transformation Result Elements
- StateMachine: Exception (state exception), Class (storage container class), Attribute (storage space), Class (abstract state pattern class)
- State: Class (state pattern class)
- Nested State: Generalization (between parent state pattern class and nested state pattern class)
- Transition / CallEvent: Operation (on the entity class)
- History (nested PseudoState): Attribute (storage space)
- do-ActionSequence: Class (storage class), Class (thread execution class)
- exit-ActionSequence: Operation (exit action)
- entry-ActionSequence: Operation (entry action)
We present the OCL specification in table format. Table 2 presents a self-contained subset of the transformation function for the context element of a Nested State. The left-hand side of the table shows the model elements that will be added to the model, with their attributes and associations as defined by the UML meta model. The right-hand side shows the OCL expressions that specify the value of each attribute or association. The context for the OCL expressions is the context parameter of the transformation sub-function. A precondition at the top of the table indicates whether the function is applicable for a specific context instance. Multiple output elements of the same meta model type are unambiguously identified by a reference number (Ref.). In order to refer to an output meta model element of another transformation function, we use the following function:

    mme(fReferenceName, metaModelElement, id),    (4)
where fReferenceName is the referenced function, and id is the reference number assigned to the meta model element instance of type metaModelElement in the transformation table of that function. In order to simplify the presentation, the following OCL functions are defined for the presented example:

    State.getLevel(): Integer
    post: result = if parent->isEmpty() then 1 else parent.getLevel() + 1 endif    (5)

    State.getTopState(): State
    post: result = if parent->isEmpty() then self else parent.getTopState() endif    (6)
    Integer.asString(): String
    post: result = "the String representation of the Integer"    (7)

    MTUtility.unique(context: String): Integer
    post: result = "a number unique within the context context"    (8)
The following steps determine a basic algorithm for the Model Transformation, where the dependency between sub-functions is defined by their usage of mme. First, resolve any cyclic dependencies between the sub-functions. Second, order the sub-functions based on the dependency relation to produce an execution order. Third, create the model elements that satisfy the OCL specifications of the sub-functions in that order, iterating over the (potentially multiple) instances of the context elements associated with those OCL specifications. A sketch of the ordering and execution steps follows Table 2.

Table 2. OCL specification for a context element of State
CONTEXT ELEMENT: STATE
fNestedState(S: State)
pre-condition: not parent->isEmpty()

Class - Ref.:
  name: name.concat("_").concat(getLevel().asString()).concat("_").concat(MTUtility.unique("STATES").asString())
  isActive: False
  isRoot: False
  isLeaf: not oclIsTypeOf(CompositeState)
  isAbstract: False
  stereotype: "Singleton"
  feature: feature->append(mme(fNestedState(self), Attribute, ))

Attribute - Ref.:
  name: "theParent"
  changeable: #frozen
  initialValue: mme(fNestedState(self), Generalization, ).supertype
  multiplicity: "1..1"
  type: mme(fState(getTopState()), Class, ), i.e. the abstract state class that is created for the top state of the StateMachine

Generalization - Ref.:
  discriminator: ""
  supertype: mme(fNestedState(parent), Class, )
  subtype: mme(fNestedState(self), Class, )
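The ordering and execution steps (the second and third steps of the basic algorithm described before Table 2) can be sketched in Java as follows. The sketch assumes that step one has already removed cyclic dependencies, and the SubFunction type together with its mme-derived dependency relation is hypothetical.

import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

interface SubFunction {
    Set<SubFunction> dependsOn();          // sub-functions referenced through mme(...)
    void applyToAllContextElements();      // iterate over the matching context elements
}

final class TransformationDriver {
    /** Orders the sub-functions so that every dependency is executed first (Kahn's algorithm). */
    static List<SubFunction> executionOrder(List<SubFunction> fs) {
        Map<SubFunction, Integer> remaining = new HashMap<>();
        for (SubFunction f : fs) remaining.put(f, f.dependsOn().size());
        Deque<SubFunction> ready = new ArrayDeque<>();
        remaining.forEach((f, n) -> { if (n == 0) ready.add(f); });
        List<SubFunction> order = new ArrayList<>();
        while (!ready.isEmpty()) {
            SubFunction next = ready.remove();
            order.add(next);
            for (SubFunction f : fs) {
                if (f.dependsOn().contains(next) && remaining.merge(f, -1, Integer::sum) == 0) {
                    ready.add(f);
                }
            }
        }
        return order; // assumes an acyclic graph whose dependencies are all contained in fs
    }

    static void run(List<SubFunction> fs) {
        for (SubFunction f : executionOrder(fs)) f.applyToAllContextElements();
    }
}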
2.4 StateLator Implementation

The Model Transformation that generates behavioral code from a State Diagram has been implemented for the target programming languages Java and C++ using Rational Rose. The translation of the OCL specification into Rose's proprietary scripting API was done manually. We feel that StateLator provides sufficient functionality to carry out serious behavioral modeling that results in an executable implementation. In the light of the generalization in section 3, we point out the following observations that we made with the prototype. Access to the specification of the transformation and ease of modification are essential. This flexibility makes it possible to provide different transformations with the most appropriate semantics, especially where the semantics are ambiguous ([9] and [8]), where new constructs need to be supported (for example, we added the doActivities introduced in UML 1.3 [24]; the implementation effort was about two person-days, which is insignificant compared to the overhead individual developers would incur re-inventing this functionality), or where we experimented with the existing architecture beyond mere parameterization. Various analysis entity classes have been modeled using State Diagrams. With StateLator it is possible to transform these classes quickly into a proven implementation pattern without individual elaboration. Making changes and reapplying the transformation is simple, quick and repeatable. As we move towards a transformational approach, we believe that the role of reverse and round-trip engineering will diminish in relevance, and probably disappear completely. Code regions or low-level design patterns represent higher abstractions that cannot easily be re-derived without significant effort; in the case of StateLator, a reverse engineering tool would need to be able to re-create a State Diagram from the program code.

3 Model Transformation Mechanism

After some experimentation with StateLator, we believe that the mechanism of Model Transformation has general value for the software development process. A number of characteristics of Model Transformations allow us to speed up the development process by making it repeatable and quickly applicable. The first section enumerates and discusses these characteristics; the majority are demonstrated in the StateLator example. The section starts with a brief discussion of the current views about the evolution of models, illustrating two fundamental approaches, and we discuss our findings in the context of each approach. In the second section we illustrate the benefits of using OCL as the specification language for the Model Transformation. Finally, we enumerate some other applications of the Model Transformation.
3.1 Transformational Approach for Enterprise Distributed Object Computing

[25] summarizes a discussion of two inherently different approaches to evolving a model into executable code. Mellor/Lee present three distinct abstraction levels (analysis, design/architecture, implementation) that exist independently of each other and are related to each other through transformation rules based on proprietary languages. Recursive Design is an instance of this approach. Martin/Booch, on the other hand, advocate a gradual transition from the highest to the lowest abstraction level through multiple models. These models are related to each other through standard object-oriented concepts.

We see the following problem with the former approach. Recursive Design makes the central assumption that 100% of the code is generated automatically from an object-oriented model that captures all semantics. Based on this assumption, the conclusion that object-oriented analysis does not imply object-oriented design and implementation becomes possible, and there are no maintenance requirements for the code (which is compared in [16], pg. 71, to maintaining the output of a compiler). Though we can believe that this holds true for embedded or real-time systems development, where Recursive Design originated, our experience is that the 100% assumption does not hold true for EDOC. The issue is that if the ratio is anything less than 100%, a programmer is required to navigate, modify and maintain the code. Object-oriented concepts have supported this requirement well in the past. Components, such as frameworks or databases, are also often integrated through an object model. Consequently, we conclude that the object-oriented paradigm should apply to all abstraction levels, leading us to support the latter approach of a gradual transition. This approach, though, does not make any statement concerning how the transition is performed. The majority of the evolution of a model is carried out through elaboration, thereby limiting the rate of the process. The Model Transformation complements this approach with the transformational aspect of Recursive Design and accelerates a gradual process through the following characteristics.

Automated Repeatable Process. The Model Transformation represents a specification to evolve a model, or more precisely specific related meta model instances. This is a mechanistic task that can be executed automatically and can be applied to a large number of instances.

Highly Customizable and Adaptable. The Model Transformation is easily adjusted to accommodate new requirements in the system architecture, to experiment with different evolution paths or to fine-tune the transformation based on experience with the current implementation. Any changes can then be propagated quickly.

Increased Reliability. The reliability of the resulting lower-level abstraction is improved because it is generated using an individually verifiable, well-testable pattern. The object-oriented paradigm can be used consistently throughout different abstraction levels.

Capture Experience. Elaborational refinement of models can be captured and "replayed" through a Model Transformation. This leverages the knowledge of experienced roles. Furthermore, it makes it possible to control and maintain policies and standards.
Re-use. Re-use is fostered through the use of Model Transformations within projects and across organizations. The re-use of Model Transformations makes it possible to automate "best practices" and to re-use proven architectures.

Addressing Complexity. The input and the output of a Model Transformation are models. This allows us to use the output of one Model Transformation as the input for another. Such a combination of Model Transformations makes it possible to break complex evolution steps apart and address them individually. On the other hand, it is possible to aggregate and chain independent Model Transformations.

Model Traceability. The resulting model elements are on a lower abstraction level. All required information is already part of the existing model in a different representation. No further trace information, desirable to trace requirements and analyze change impact, is required in addition to the Model Transformation specification. The resulting elements can always be re-generated if the input model and the algorithm are preserved.

In summary, we propose a practical, gradual approach that automates part of the evolution of a model and can be applied incrementally and for subsets of the model (Recursive Modeling). As the Model Transformation is applicable to all abstraction levels of the software development process, it extends the transformational design approach described in [26] to all development stages. New model elements can be created unobtrusively or temporarily, as in the example of StateLator. This is suitable if the Model Transformation results in completely generated executable code (in conjunction with other fully automated steps such as code generation and compilation). On the other hand, it is possible to create new permanent and visible model elements. This is suitable if the model elements need to be further processed through manual elaboration. Hence, the Model Transformation complements the elaborational approach in as far as both can coexist as part of the same process in any order. We point out, though, that the intertwined application of the elaborational and transformational approaches requires the ability a) to identify automatically created model elements, b) to identify invalidated (i.e. inappropriately manually modified) elaborated model elements and c) to preserve elaborated elements that are not invalidated.
3.2 Object Constraint Language as a Specification Language

OCL, which is part of the UML specification [23], is used to specify semantics as constraints that cannot (or cannot easily) be expressed visually through UML diagrams. The goal is to provide the various roles involved in the software development process, from the analyst (who specifies semantics) to the programmer (who provides an implementation that fulfills the specified semantics), with a common language to define sufficiently precise semantics about the modeled domain [27]. We feel that a natural way to look at the output of a Model Transformation is as a set of post-conditions that need to be fulfilled by the resulting model. The primary aspect of the Model Transformation approach is the specification, i.e. the definition of the resulting model, and not the computation, i.e. the execution of the transformation. It is possible, as shown in Table 2 for a sub-function, to extend this
"Design by Contract" view to determine the applicability of a Model Transformation by a set of pre-conditions that must be fulfilled by the input model. From this specification viewpoint, OCL is well suited to describe the transformation. Further, OCL provides easy access and navigability along the model and meta model element associations. In addition, with its standardization as part of UML, it is becoming more widespread in the modeling community. This is important because the flexibility to experiment with different variations of a Model Transformation depends on the effort needed to modify the specifications. OCL is easy to learn and is established as a standard. It is used across the development process by various roles, and therefore every role is able to define the specification for a Model Transformation. Some limitations in the semantics of OCL have been identified [28]. Further, OCL is an expression language and not a programming language. It is not possible to write program logic or flow control in OCL (see also [29]). Although it is possible to represent some computational aspects in OCL, we find that there are more suitable imperative programming languages, which provide features such as iteration, selection, etc. It would be possible to combine or embed the OCL specifications in such a language, comparable to the evolution of SQL. Further, we do not exclude that for very complex Model Transformations the pre- and post-conditions could be related to each other through more powerful techniques such as rule-based engines. None of these considerations, though, eliminates the need for the specification aspect, hence our preferred usage of OCL. Some of the limitations and problems listed have already been addressed in [24]. Further, [30] provides an approach to a formal meta model for OCL, resolving some of the semantic ambiguities. These will probably be reflected in revision 2.0 of the UML.
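As an illustration of such a combination, the following Java sketch wraps an imperative transformation body with declarative, OCL-style pre- and post-conditions evaluated as predicates; the Model type and the predicate representation are placeholders and do not correspond to an existing OCL library.

import java.util.List;
import java.util.function.Predicate;
import java.util.function.UnaryOperator;

interface Model { /* placeholder for the meta model instances of a UML model */ }

final class SpecifiedTransformation {
    final UnaryOperator<Model> body;             // imperative part: how the model is changed
    final Predicate<Model> preCondition;         // declarative part: when the step applies
    final List<Predicate<Model>> postConditions; // declarative part: what must hold afterwards

    SpecifiedTransformation(UnaryOperator<Model> body,
                            Predicate<Model> pre,
                            List<Predicate<Model>> posts) {
        this.body = body;
        this.preCondition = pre;
        this.postConditions = posts;
    }

    Model apply(Model m) {
        if (!preCondition.test(m)) return m;     // not applicable for this context instance
        Model result = body.apply(m);
        for (Predicate<Model> post : postConditions) {
            if (!post.test(result)) throw new IllegalStateException("post-condition violated");
        }
        return result;
    }
}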
3.3 Further Examples for the Model Transformation

In addition to StateLator, we experimented with the Model Transformation approach in a number of areas. In general, we inferred the applicability from the repeatable and mechanistic aspects of a process step. The following paragraphs briefly enumerate other areas of application on different abstraction levels.

"Business Model" to "System Model" Transformation. [31] describes a simple algorithm to identify information system use cases from the business model. If the business model is captured in a UML model, we can use a Model Transformation to perform these steps automatically and create a first set of resulting system use cases from the business model.

"Use Case Realizations" Creation. [19], section "Tool Mentor", describes how to derive use case realizations, a representation of the design perspective of a use case, from a use case. The basic part of this process is fairly mechanistic and lends itself to being captured in a Model Transformation.

Event Distribution Mechanism. Through defined Stereotypes of a Dependency, <<publish>> and <<subscribe>>, we provide a set of base functionality to integrate with the protocol of a messaging service.
These examples point out that the applicability of the Model Transformation is not limited to the abstractions of low-level design and implementation. In addition, its usage is not necessarily related to patterns, as it is in the case of StateLator. Nevertheless, we recognize that the likelihood of applicability and effectiveness is related to the ability to carry out a large number of steps mechanically. This tends to be more often the case for lower abstraction levels.
4 Conclusions and Future Work

Models are created in the software development process as intermediate abstractions on different levels between the domain and the executing code as the final deliverable. The speed and the quality of this evolution relate directly to software project turnaround. Any transformation, i.e. automation of the evolution, of models towards a lower level of abstraction contributes to those requirements. We have shown that the Model Transformation mechanism can be used to perform such a transformation. In contrast to other transformational approaches, it is gradual and can be applied on all abstraction levels captured by the UML. Individual instances of the Model Transformation can be combined to produce more complex ones. The last two characteristics result from the fact that the input and output of the Model Transformation is a UML model. In a UML model a Model Transformation waives the requirement for traceability, as the trace information is represented by the specification of the transformations and the order of their application. However, we have no illusions that our approach will eliminate procedural or tool-supported consistency management, specifically where it is combined with elaboration. As long as elaboration of a model is required, the Model Transformation mechanism is able to coexist with it. We have indicated potential difficulties that need to be resolved.

As a suitable specification language for a Model Transformation we have suggested OCL. The main aspect of the transformation is a specification that defines what the requirements for the resulting model are, not how it is created. OCL provides enough formalism to capture this specification unambiguously, in addition to its ease of use and its practical acceptance as part of the UML standard. We recommend, though, that it should be embedded into a base control flow language.

With StateLator we have presented one instance of the Model Transformation in detail, including a prototype implementation. It provides complete behavioral code generation in Java and C++ for a class that is specified by a StateMachine. This allowed us to verify some of the general aspects of the Model Transformation approach. We further conclude that a translational approach in general requires powerful repository-based modeling tools to capture detailed complex semantics. For StateLator we are looking to complete missing aspects of the UML StateMachine, as tools start to support them, and the integration of inheritance [32]. Further, we are elaborating our examples for higher UML abstraction layers. With the advent of the XMI specification [33], a stream-based model interchange format based on the eXtensible Markup Language (XML), Model Transformation tools can share a standard exchange format with the modeling tools. As the first
vendors start to support this format, we are currently looking to provide an implementation that is independent of a proprietary API, such as Rose, but instead is based on XMI. Direct support for OCL as a specification language for the Model Transformation is part of our target architecture, which is illustrated in Fig. 5.
Fig. 5. A tool architecture that allows the Model Transformation to be integrated into the standard development process (the modelling tool exports a UML model in XMI format; a generator applies the OCL + transformation description, e.g. Java or XSL based, and produces a transformed UML model in XMI format for re-import, proprietary transformations, or non-UML level targets such as code and documentation)
References

1. Object Management Group (ed.): CORBA Specification Version 2.3. Formal Specification formal/98-12-1, Framingham (1998)
2. ObjectSpace, Inc.: Voyager ORB Ver. 3.1 (1999)
3. Batory, Don; Dasari, Sankar; Geraci, Bart e.a.: The GenVoca Model of Software-System Generation. In: IEEE Software, 11 (5) (1994) 89–94
4. Shaw, Mary; Garlan, David: Software Architecture: Perspectives On An Emerging Discipline. Prentice Hall (1996)
5. Booch, Grady: Object Solutions. Addison Wesley (1995)
6. Monroe, Robert T.; Kompanek, Andrew; Melton, Ralph e.a.: Architectural Styles, Design Patterns, and Objects. In: IEEE Software, 14 (1) (1997) 43–52
7. Harel, David: Statecharts: A visual formalism for complex systems. In: Science of Computer Programming, 8 (1987) 231–274
8. Cook, Steve; Daniels, John: Designing Object Systems: Object-Oriented Modelling with Syntropy. Prentice Hall (1994)
9. Barbier, Franck; Briand, Henri; Dano, Benedicte e.a.: The Executability of Object-Oriented Finite State Machines. In: Journal for Object Oriented Programming, 11 (4), July/August (1998) 16–24
10. Gamma, Erich; Helm, Richard; Johnson, Ralph e.a.: Design Patterns: Abstraction and Reuse of Object-Oriented Design. In: ECOOP '93 - Object-Oriented Programming, Proceedings (1993) 406–431
11. Gamma, Erich; Beck, Kent: JUnit: A Cook's Tour. In: Java Report, 4 (5) (1999) 27–38
12. Tokuda, Lance; Batory, Don: Automated Software Evolution via Design Pattern Transformations. In: Proceedings of the 3rd Int. Symposium on Applied Corporate Computing, Monterrey, Mexico (1995)
13. Grundy, John; Hosking, John; Mugridge, Warwick B.: Inconsistency Management for Multiple-View Software Development Environments. In: IEEE Transactions On Software Engineering, 24 (11) (1998) 961–981
14. McGregor, John: The Fifty-Foot Look at Analysis and Design Models. In: Journal for Object Oriented Programming, July/August (1998) 1–15
15. Schultz, Tom: Multiple Models. In: Rose Architect, 2 (1) (1999) 12–15
16. Shlaer, Sally; Mellor, Stephen J.: Recursive Design of an Application-Independent Architecture. In: IEEE Software, 14 (1) (1997) 61–72
17. Shlaer, Sally; Mellor, Stephen J.: Object Lifecycles: Modeling the World in States. Prentice Hall (1992)
18. Shlaer, Sally; Mellor, Stephen J.: Recursive Design. In: Computer Language, 7 (3) (1990) 53ff.
19. Rational Software Corporation: Rational Unified Process Version 5.0. Cupertino (1998)
20. Booch, Grady; Rumbaugh, James; Jacobsen, Ivar: The Unified Modeling Language User Guide. Addison Wesley (1999)
21. Gamma, Erich; Helm, Richard; Johnson, Ralph e.a.: Design Patterns: Elements of Reusable Object-Oriented Software. Addison Wesley (1994)
22. Martin, Robert C.; Riehle, Dirk; Buschmann, Frank (eds.): Pattern Languages Of Program Design 3. Addison Wesley (1998)
23. Object Management Group, Rational, Microsoft, HP, Oracle e.a. (eds.): UML Semantics Version 1.1 (1997)
24. Object Management Group, UML Revision Task Force (ed.): UML Semantics Version 1.3 (1999)
25. Fraser, Steven; Martin, Robert C.; Mellor, Stephen J. e.a.: Translation: Myth or Reality? Summary of the Panel/Debate Session. In: Proceedings of OOPSLA '96 (1996) 441–443
26. Gossain, Sanjiv: Object Modeling and Design Strategies: Tips and Techniques. SIGS Publications (1997)
27. Warmer, Jos; Kleppe, Anneke: The Object Constraint Language - Precise Modeling With UML. Addison Wesley (1999)
28. Richters, Mark; Gogolla, Martin: On Formalizing the UML Object Constraint Language OCL. In: Conceptual Modeling - ER'98, Proceedings, Springer Verlag (1998) 449–464
29. Mandel, Luis; Cengarle, Victoria: On the Expressive Power of OCL. In: FM'99 - Formal Methods, Proceedings, Springer Verlag (1999) 854–874
30. Richters, Mark; Gogolla, Martin: A Metamodel for OCL. In: UML'99 - The Unified Modeling Language, Proceedings, Springer Verlag (1999)
31. Jacobsen, Ivar; Ericson, Maria; Jacobsen, Agneta: The Object Advantage: Business Process Reengineering with Object Technology. Addison Wesley (1995)
32. Harel, David; Kupferman, Orna: On the Inheritance of State-Based Object Behavior. The Weizmann Institute of Science, Rehovot, Israel, Department of Computer Science and Applied Mathematics, Internal Report MCS99-12 (1999)
33. Object Management Group, Fujitsu, IBM, Oracle, Recerca Informatica, Unisys e.a. (eds.): XMI Specification (1998)
An Overview of RoZ: A Tool for Integrating UML and Z Specifications

Sophie Dupuy, Yves Ledru, and Monique Chabre-Peccoud

BP72, 38402 Saint-Martin d'Hères Cedex, France
[email protected]
Abstract. This paper presents an approach and a tool to increase specification quality by using a combination of UML and formal languages. Our approach is based on the expression of the UML class diagram and its annotations as a Z formal specification. Our tool, called RoZ, supports this approach by making the transition between the UML world and the Z world: from an annotated class diagram, it automatically generates a complete Z specification, the specifications of some elementary operations and some proof obligations to validate the model constraints.
1 Introduction

Analysis and design methods for information systems propose graphical representations in order to express the structural, dynamic and functional aspects of a system. These aspects are described using models, each model representing a view of the studied domain. Graphical models facilitate communication with users by presenting a synthetic view of the system. But they are imprecise and incomplete. First, their semantics can be ambiguous. This is for instance the case for composition in object-oriented modelling [1]. Moreover, not all the constraints on a system can be described graphically, so a model is generally complemented by constraints expressed in natural language. This prevents consistency checking between the model and its constraints. Recently, the Unified Modelling Language, UML [2], has been standardised by the OMG (Object Management Group). Like previous object-oriented notations, UML does not make it possible to express all integrity constraints. Thus some complementary proposals have been made to better integrate constraint expressions in UML. The first one [3] proposes to extend the meta-model by adding to each class a part where constraints are written. The second one, the Object Constraint Language (OCL [4]), adds constraints expressed in first-order logic to the diagrams. This language makes it possible to express constraints precisely and without ambiguity. Unfortunately, current OCL tools do not exploit this preciseness. Therefore we propose to take advantage of formal method tools to help verify the constraints of such annotated diagrams. Formal languages have precise notations based on mathematical concepts which make it possible to carry out proofs on specifications. Their use can help to increase information system quality by providing more precise and consistent models. But as writing formal specifications often seems difficult and tedious, we propose an
approach and a tool that automatically generates formal specifications in Z [5] and proof obligations from an annotated UML diagram. Our tool, called RoZ, is integrated in the Rational Rose environment. It allows the integration of data specifications expressed in UML with formal annotations in the Z language. Starting from an annotated specification, the tool automatically produces a formal specification by translating the UML constructs and merging them with the annotations, which express predicates not included in the graphical design. RoZ can also complete the UML class diagram by automatically generating, in the UML environment, Z specifications of elementary operations on the classes (attribute modifications, adding and deleting objects of the class). The tool also makes it possible to record a guard for each operation and can generate corresponding proof obligations to show that the guards are consistent with the actual pre-conditions of these operations. The Z-EVES prover [6] (from ORA) can be used to discharge these proof obligations.

The remainder of the paper is organised as follows. In section 2, an overview of our approach and the principles of our tool RoZ are presented. Section 3 proposes a guided tour of our tool. Section 4 introduces related work. Finally, section 5 draws the conclusions and perspectives of this work.
2 RoZ

2.1 Overview of the Approach

The proposed approach (Fig. 1) uses the complementarity of UML diagrams and the constraints annotating these diagrams to produce a formal specification:
1. From the initial problem, a UML specification is developed. The UML class diagram is edited as usual in the Rational Rose tool ([7, 8]) and complemented with annotations that state several constraints on the data and operation specifications. The annotations must be written in a formalism using first-order logic and set theory in order to be expressible in Z. Actually, they could be written in OCL.
2. This diagram is the starting point of a translation process which produces a formal specification skeleton. This translation gives a Z semantics to the UML diagram. Many translations from object-oriented notations into formal notations ([9, 10, 11, 12]) have already been proposed. But using a tool guarantees the strict correspondence between the UML model and the Z skeleton. In this paper, we present the RoZ tool, which automatically produces Z specification skeletons based on the translation rules for the main concepts (class, operation, association, inheritance, aggregation and composition) of class diagrams presented in [13].
3. The formal skeleton is completed with the annotations. This can also be done automatically by RoZ thanks to a classification of annotations presented in [14].
4. The formal specification is used to perform consistency checks with a Z prover, Z-EVES. First, the mutual consistency of the constraints can be checked. Secondly, operation guards can be designed to verify that operations do not violate constraints ([15]). The theorems to prove the validity of operation guards can be automatically generated by RoZ. Thanks to the translation process, this reasoning done on the formal specification can be carried back to the UML diagram and to its annotations.
Fig. 1. The proposed approach (the user edits the UML class diagram and its annotations in OCL or Z in the Rose tool; RoZ generates the Z formal skeleton and the complete Z formal specifications; the Z-EVES prover performs the consistency checks)
2.2 RoZ Principle

RoZ is an extension of the Rational Rose tool that integrates the UML and Z notations. The Rose environment is used to build the UML models, which are completed by formal annotations in Z. The annotations are expressed in forms. A form should contain all the information needed to complete the Z skeletons, so each form corresponds to a kind of annotation. It is attached to the class diagram element that it complements. In the Rose environment, a pre-defined form is associated with each element of a diagram. We would like to modify these forms in order to add the fields needed to complete formal specifications. But as this is impossible in Rose 4.0, we use the standard Rose forms. From a class diagram and the information contained in its forms, RoZ uses Rose scripts to:
• generate Z specifications. From a UML class diagram and its annotations, a complete Z specification can be generated in a file. We choose to express Z specifications in the Latex style in order to be able to use Z tools such as Z-EVES.
• generate elementary operations. The main idea is that for some concepts, there are operations which always exist. For instance, a class always has operations to modify its attributes. Moreover, if the class is concrete, there are also operations to add or remove an instance to or from the class instances. So for concrete classes, the specification of these operations can be generated automatically.
• generate theorems to validate operation guards. From a UML class diagram, elementary operations and a Z formal specification can be generated automatically. Data must often respect constraints which are not taken into account by the code generated automatically and are only implicitly taken into
account in the specification. It is therefore interesting to "design" guards which will be evaluated before the operation execution and which prevent the execution of an operation that would violate the constraints. From the Z specification produced from the UML diagram, it is possible to prove that the proposed guards guarantee data integrity. So for each identified guard, a theorem is generated ([15]). The theorems are proved with the semi-automatic proof tool Z-EVES.
3 Guided Tour of RoZ

In this section, the functionalities of RoZ are illustrated on a simplified version of an access control system. We show how to describe a class diagram and its annotations in RoZ. Then we use RoZ to generate some elementary operations, to generate a complete Z specification from the diagram and its annotations, and to generate theorems to validate operation guards.

3.1 Construction of a Class Diagram Example in RoZ

To illustrate the use of RoZ, we describe a simplified version of the access control system presented in [16]. This model (Fig. 2) features two classes: "PERSON" and "GROUP". Each person has four attributes: his last and first names, the set of his telephone numbers and the number of his magnetic card. Each group is characterised by a name (e.g. "Accounting Department") and a code (e.g. "AccD"). One can note that the types of the attributes are expressed in the Z style. For instance, the "tel" attribute is multi-valued, i.e. it corresponds to a set of telephone numbers (\finset TEL). The PersonGroupRel association links each person to one group.
Fig. 2. Rose interface - the access control class diagram
This class diagram does not express the full specification of the application data. Three additional constraints complement this diagram:
1. Every person has at least one telephone number.
2. The card number is a key of persons.
3. The telephone numbers of the members of a given group have the same prefix.

The UML class diagram of the access control system is built in a class diagram window of the Rose environment. This diagram can be complemented by the expression of the constraints. The constraints are expressed in the Z Latex syntax in the "Documentation" field of the forms. The first constraint (every person has at least one telephone number) concerns the attribute "tel", so it is written in the "Documentation" field of the attribute "tel" (Fig. 3). It expresses in Z that the set of telephone numbers cannot be empty:

    tel ≠ ∅

which is written in the Z Latex style as: tel \neq \empty

The second constraint (the card number is a key of persons) means that two different persons have distinct card numbers. It is a comparison between existing objects of the class, so it is placed in the "Documentation" field of the class "PERSON" (Fig. 3):

    ∀ p1, p2 : Person | p1 ≠ p2 • p1.cardnb ≠ p2.cardnb
Fig. 3. the forms for “tel” and “Person”
Finally, we have to express the third constraint (the telephone numbers of the members of a given group have the same prefix). This constraint is related to the relationship between "PERSON" and "GROUP", so it is expressed in the "Documentation" field of "PersonGroupRel" (Fig. 4). Let us consider that we have defined a "prefix" function which gives the prefix of a telephone number. The constraint states that for two persons who belong to the same group, all telephone numbers in their sets of telephone numbers have the same prefix:
∀p1,p2 : Person | GroupOfPerson(p1)=GroupOfPerson(p2) • ∀t1 : p1.tel • ∀t2 : p2.tel • prefix(t1) = prefix(t2)
Fig. 4. the form «PersonGroupRel»
As illustrated for constraints on classes and on associations, each form contains one kind of constraint, thereby defining a constraint classification. This classification covers the main concepts of a UML class diagram. In particular, we propose different kinds of constraints for inheritance (constraints on the super-class, on the subclass or on the inheritance relationship) which can be explicitly expressed in some RoZ forms. Our translation into Z then guarantees that constraints on a super-class are inherited by its subclasses. Using the same principle, forms can also be exploited to write operation specifications. This kind of form use is illustrated with the specification of the "ChangeLastname" operation in the next subsection. This annotated UML diagram will be the basis for the generation of a specification (sect. 3.3).

3.2 Generation of Elementary Operations

In information systems, some operations associated with classes or associations are very often used. They make it possible to create or delete an object or a link, or to modify an attribute value. Their systematic generation has been studied in [17, 18].
RoZ uses the same principle to generate the specification of all the elementary operations of the diagram classes: for each attribute of a given class, a modification operation is generated. Moreover, if the class is concrete, the specifications of operations adding or deleting an object to/from the class objects are added. The generation of elementary operations does not take into account the possible constraints on the diagram. In order to check that an operation is consistent with the constraints, we must validate its guard by proving a theorem generated by RoZ (see 3.4). For example, for the class "PERSON", the operations "ChangeLastname", "ChangeFirstname", "ChangeTel", "ChangeCardnb", "AddPerson" and "RemovePerson" are generated automatically by RoZ (Fig. 5). The specification of these operations is contained in their forms. For instance, let us consider "ChangeLastname", which modifies the last name of a person. In the operation part of "PERSON", one can see the operation signature: "ChangeLastname" has an argument "newlastname", which is the input parameter of the operation. In the "PostConditions" tab (Fig. 5) of the "ChangeLastname" operation, the post-condition of the operation is written in the Z Latex style: it states that the new value of the attribute "lastname" is the argument of the operation, "newlastname", and that the other attributes of "PERSON" are unchanged by the operation. RoZ also fills the "Semantics" tab with the keyword intension operation, which signifies that "ChangeLastname" is an operation on the attributes of the class.
Fig. 5. Post-condition for «ChangeLastname»
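As an indication of what such a post-condition amounts to once assembled by RoZ, the following Z fragment (in the Z Latex style used by the tool) sketches the resulting operation schema; the schema name and the exact layout of the generated text are illustrative.

\begin{schema}{PERSONChangeLastname}
  \Delta PERSON \\
  newlastname? : NAME
\where
  lastname' = newlastname? \\
  firstname' = firstname \land tel' = tel \land cardnb' = cardnb
\end{schema}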
3.3 Generation of a Z Specification

RoZ automatically generates the Z schema skeletons corresponding to a UML class diagram. The Z skeletons are produced by translation of the concepts (class, association, inheritance…) of the diagram. In order to complete these specification skeletons, you must add information like the definition of the types of the attributes and the constraints on the class diagram. In sections 3.1 and 3.2, we have shown how constraints and operation specifications can be included in the RoZ environment.
On the contrary, we did not find a place global to the model to express type definitions in the Rose standard forms. So types are not defined in the RoZ environment, but in a separate file containing them. In the example of the access control system, we have to define the types NAME, TEL, DIGIT8, GROUPCODE and GROUPNAME. Moreover, the third constraint (the telephone numbers of the members of a given group have the same prefix) requires the declaration of the prefix function. So from the class diagram and the annotations, RoZ generates a complete Z specification. For example, from the class "PERSON" and its annotations, RoZ generates two Z schemas ("PERSON" and "PersonExt"). A Z schema is a data specification structure composed of two parts: the declarations, which constitute the local variable lexicon, and the predicates, which express constraints on these variables. The "PERSON" schema defines the attributes of the class and the "PersonExt" schema describes the set of existing persons. These schemas include the constraints expressed in the "Documentation" fields of "PERSON" and "tel". The constraint on the telephone numbers (constraint 1) is expressed in the Z schema defining the attributes, while the key constraint (constraint 2) is defined in the Z schema representing the existing objects of "PERSON".
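As an indication of what RoZ produces, the following Z fragments (in the Z Latex style) sketch what the two schemas might look like; the exact layout and the name of the set of existing persons are illustrative, and only the declarations and constraints described above are taken from the example.

\begin{schema}{PERSON}
  lastname, firstname : NAME \\
  tel : \finset TEL \\
  cardnb : DIGIT8
\where
  tel \neq \empty
\end{schema}

\begin{schema}{PersonExt}
  Person : \finset PERSON
\where
  \forall p1, p2 : Person | p1 \neq p2 @ p1.cardnb \neq p2.cardnb
\end{schema}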
3.4 Theorems Generation

At this point, we have produced a Z specification and we can start to investigate the properties of our model. Let us consider the operation "ChangeTel", which changes the value of the "tel" attribute and corresponds to the following instruction:

    tel := newtel?

The automatically generated specification for this operation is:
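A sketch of such a schema, consistent with the description that follows (the exact generated text may differ), is:

\begin{schema}{PERSONChangeTel}
  \Delta PERSON \\
  newtel? : \finset TEL
\where
  lastname' = lastname \land firstname' = firstname \land cardnb' = cardnb \\
  tel' = newtel?
\end{schema}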
In this specification, newtel? is the input parameter. The last predicate constrains the new value of the "tel" attribute (denoted by tel') to be equal to the input parameter. The remaining predicate expresses that the other attributes keep their initial value. The first line of the declaration part, ∆PERSON, expresses that the effects of this operation are limited to changing an object of type "PERSON". Its actual pre-condition can be computed from the text of the specification and the constraints listed in the included schemas. When specifications get complex, it is good software engineering practice to state these pre-conditions more explicitly. For the "PERSONChangeTel" operation, we identify the pre-condition

    newtel? ≠ ∅

This pre-condition depends only on the constraint concerning the "tel" attribute, because the "PERSONChangeTel" operation only changes an object of type "PERSON" and is not promoted to the level of the existing objects of "PERSON". So we do not have to take into account constraints on the existing objects of "PERSON" or on the association with "GROUP". These constraints will be considered for alternate versions of this operation, which promote "PERSONChangeTel" to the level of the existing persons, and for the operations on the "PersonGroupRel" association. For these alternate versions, the pre-condition becomes more complex in order to consider the key of "PERSON" and the telephone number prefix constraint. We conjecture that the pre-condition (newtel? ≠ ∅) is strong enough to imply the actual pre-condition of the operation. This predicate states that the new set of telephone numbers must not be empty. We add it to the field "Pre Conditions" of the operation "ChangeTel": newtel? \neq \empty

Then RoZ is used to generate a theorem to validate the proposed pre-condition of each operation. It uses the information given in the operation form to generate the following proof obligation for "ChangeTel" (Fig. 6):

    \begin{theorem}{PERSONChangeTel\_Pre}
    \forall PERSON; newtel? : \finset TEL | newtel? \neq \empty @ \pre PERSONChangeTel
    \end{theorem}

It means that the proposed pre-condition (newtel? ≠ ∅) is stronger than the computed pre-condition (pre PERSONChangeTel).
Fig. 6. Generation of theorems
To prove the generated theorems, any standard Z prover can be used. We use the Z-EVES theorem prover because it offers a high level of proof automation. For instance, the pre-condition theorem for "ChangeTel" is proved automatically using the following commands:

    try lemma PERSONChangeTel\_Pre;
    prove by reduce;

which demonstrates that the proposed pre-condition is correct.
4 Related Work

Two kinds of environments have been proposed to couple different kinds of notations. In the first one [19], the various specifications are developed independently and some consistency rules are defined to check the consistency between them. These tools have the advantage of being formalism-independent. But the consistency rules do not guarantee that the specifications are semantically equivalent. So if you use such tools to couple UML and formal specifications, you cannot be sure that the formal specifications are equivalent to the UML ones, and the reasoning done on the formal specifications cannot be carried back to the UML ones. The other kind of tool is based on some translation from one formalism to another so as to have equivalent versions of the specifications. As this is the approach we have chosen, the following paragraphs develop the characteristics of such tools and a comparison with ours.

Some tools are based on a meta-CASE such as GraphTalk ([20, 21, 18]). A meta-CASE is a tool intended to build other tools. Others, like ours, propose to extend existing environments. In particular, the Rational Rose tool is used to make the link between UML and VDM++ ([22]) and between UML and Z ([23]). Using a meta-CASE makes it possible to build exactly the desired environment. But its development is much longer
than the extension of an existing tool and rarely reaches the standard of dedicated environments. Moreover, using a "standard" environment makes it possible to stay close to the practice of conceptual modellers. Another criterion of comparison is the way of coupling the different kinds of specifications. [21] and [23] propose multi-modelling environments in which the different kinds of specifications are available in the same modelling tool. Graphical and formal specifications are thus developed in parallel. In this approach, the different formalisms used must be mastered in order to be able to add constraints to formal specifications or to specify operations. Another approach consists in automatically generating the formal specifications into a file. The main advantage is to have a file that can be used by the tools for formal notations. That is what is proposed in [24]. But [24] does not specify how the formal skeleton generated in the file can be complemented, nor how the model is updated to take into account the code added to the skeleton. Our proposal goes a little further by proposing to perform all the modelling work (diagrams and annotations) in the UML environment. Only the strictly necessary formal specifications are exposed, to permit their exploitation. This exploitation depends on the level of mastery of the formal language: it is not necessary to know the details of the formal language to obtain documentation with formal specifications, but the verification of the formal specifications, by proving operation pre-conditions for instance, requires a higher degree of expertise. Moreover, tools like [22, 23] allow modifications of the model from both the UML and the formal method environments. We have chosen to use Rose as the only interface to the model, which means that any update of the Z specifications should be done by modifying the corresponding model or annotations.
5 Conclusion and Perspectives

In this article, we present RoZ, an automated specification tool, and illustrate its use on a simple example. The RoZ tool aims at increasing system quality by adding precision to the system specification. It exploits the standard Rose environment to make UML notations and formal annotations live together: the class diagram provides the structure of the Z formal skeleton while details are expressed in forms attached to the diagram. RoZ offers the following possibilities:
• automatic generation of Z formal specifications from an annotated UML class diagram,
• generation in Rose of the specifications of elementary operations on classes,
• generation of proof obligations to validate operation guards.

5.1 Advantages

An important characteristic of RoZ is that all the modelling work (diagrams and annotations) is carried out in Rose, so as to be as close as possible to the practitioner's usual working environment. So the environment is employed as usual, and some information can complement the diagrams when necessary. Our ambition is that it does not require
a deep understanding of Z to be able to obtain a complete Z specification. In particular, we expect that many constraints can be expressed by adapting similar ones from other RoZ diagrams (see 3.2).

The main advantage of RoZ is that it automatically performs some tedious work. First, it generates formal specifications from annotated UML diagrams. Secondly, it creates elementary operations on classes. Finally, it generates proof obligations. This automation limits the effort designers must spend to write precise specifications and to check them. Moreover, RoZ offers guidance on where to express each kind of annotation. In particular, the definition of the tool gives rise to a classification of integrity constraints on class diagrams, so each kind of constraint is expressed in a specific form. This helps the user to know where to write constraints and how to complement a class diagram. Finally, it is important to have synchronised evolutions of the UML and Z specifications. Our tool makes it possible to always have equivalent versions of the UML and the Z specifications. As all the modelling work is done in the UML environment, a specification change is made in this environment, and the corresponding Z specification is generated again accordingly. The consistency of the constraints may be checked against this new version and corrected. The classification of annotations and the synchronised versions of the UML and Z specifications facilitate specification evolutions by providing a clear framework about where changes must be made.

5.2 Future Work

Currently, RoZ only considers static constraints; it does not make it possible to express dynamic constraints. This is due to the fact that Z is intended to express invariant constraints. In order to take other constraints into account, RoZ must be extended to translate the dynamic aspects of UML diagrams into other formal languages, such as Lustre [25], that are more appropriate for dynamic aspects. Moreover, annotations written in the prototype are currently in the Z Latex style, which is not intuitive. But any annotations expressed in a formalism close to first-order logic could easily be integrated into Z specifications. In particular, using OCL can be a way to get closer to practitioner usage. As OCL and Z are close formalisms, annotations could be written in OCL and automatically translated into Z before their integration in the Z skeletons. RoZ could then be an easily usable tool for people knowing UML and its constraint language, OCL. We can also imagine simplifying the writing of constraints by developing an interface which would propose a list of "standard" constraints. For instance, one could choose the key constraint in the list; one would then just have to give the name of the key attribute(s) and the corresponding constraint would be automatically written in Z. This knowledge would make it possible to produce more relevant elementary operations, for instance by avoiding the generation of operations that modify a key attribute. This could also be avoided by extending Rose to express more characteristics of attributes, such as the modification permission defined in [26]. Another way of improving the generation of elementary operations could be to consider elementary operations of other concepts. As we do for classes, we can generate the elementary operations of associations. This would make it possible to consider
dependencies between classes related to association cardinalities. For example, if the cardinality of the association between the classes A and B is 1, this means that if you create (resp. delete) an object a of A, you must create (resp. delete) an object of B linked to a. Finally, proof is a very important task, even if it is a difficult one, since it can reveal errors or unsuspected problems. Currently RoZ generates proof obligations, but it does not provide any help to prove them. Such help could consist in increasing proof automation by using domain knowledge. We are currently working on the definition of proof patterns for elementary operations in order to help the designer validate the system. Although RoZ is only at its initial development step, we hope that it can bring useful help in improving the quality of specifications.

Acknowledgments

We are grateful to the SIGMA and the PFL research teams at Grenoble, in particular to Olivier Maury for his comments on the use of RoZ. Thanks are also due to José-Celso Freire from UNESP (São Paulo, Brasil) for his useful suggestions on the documentation and the distribution of RoZ. Finally, we would like to thank the anonymous reviewers for their positive and detailed comments and feedback.
References

1. J. Brunet. An enhanced definition of composition and its use for abstraction. The 5th International Conference on Object Oriented Information Systems, Paris, September 1998.
2. Rational Software Corporation. Unified Modeling Language – Notation Guide, Version 1.1. September 1997.
3. Y. Ou. On Using UML Class Diagram for Object-Oriented Database Design – Specification of Integrity Constraints. Proc. UML'98 Beyond the Notation, Mulhouse, France, June 1998.
4. J. Warmer and A. Kleppe. The Object Constraint Language. Addison-Wesley, 1998.
5. J.M. Spivey. The Z Notation – A Reference Manual (second edition). Prentice-Hall International, 1992.
6. M. Saaltink. The Z/EVES system. In J. Bowen, M. Hinchey and D. Till, editors, Proc. 10th Int. Conf. on the Z Formal Method (ZUM), volume 1212 of Lecture Notes in Computer Science, pages 72–88, Reading, UK, April 1997.
7. Rational Software Corporation. Rational Rose – Using Rational Rose 4.0. 1996.
8. Rational Software Corporation. Rational Rose – Using Rational Rose 98. 1998.
9. R. France, J.-M. Bruel, M. Larrondo-Petrie and M. Shroff. Exploring the Semantics of UML Type Structures with Z. Proc. 2nd IFIP Workshop on Formal Methods for Open Object-Based Distributed Systems (FMOODS), 1997.
10. M. Shroff and R.B. France. Towards a Formalization of UML Class Structures in Z. COMPSAC'97, 1997.
11. K. Lano, H. Houghton, P. Wheeler. Integrating Formal and Structured Methods in Object-Oriented System Development. In Formal Methods and Object Technology, chapter 7, Springer, 1996.
12. P. Facon, R. Laleau and H.P. Nguyen. Mapping Object Diagrams into B Specifications. Proc. of the Methods Integration Workshop, Springer-Verlag, Leeds, March 1996.
13. S. Dupuy, Y. Ledru and M. Chabre-Peccoud. Vers une intégration utile de notations semi-formelles et formelles : une expérience en UML et Z. Revue l'Objet, numéro spécial sur les méthodes formelles pour les systèmes à objets, 2000, to appear.
14. S. Dupuy. RoZ version 0.3: an environment for the integration of UML and Z. Technical report, Laboratoire LSR-IMAG, 1999. http://www-lsr.imag.fr/Les.Groupes/PFL/RoZ/index.html
15. Y. Ledru. Identifying pre-conditions with the Z/EVES theorem prover. Proc. Int. IEEE Conference on Automated Software Engineering'98, IEEE Computer Society Press, October 1998.
16. Y. Ledru. Complementing semi-formal specifications with Z. KBSE'96, IEEE Computer Society Press, 1996.
17. N. Hadj-Rabia and H. Habrias. Formal specification from NIAM model: a Bottom-Up Approach. ISCIS XI (11th Int. Symposium on Computer and Information Sciences), Antalya, Turkey, November 1996.
18. H.P. Nguyen. Dérivation de Spécifications Formelles B à Partir de Spécifications Semi-Formelles. PhD thesis, CNAM, December 1998.
19. B. Nuseibeh and A. Finkelstein. ViewPoints: a vehicle for method and tool integration. IEEE Proc. of the 5th Int. Workshop on CASE (CASE'92), pages 50–60, Montreal, Canada, July 1992.
20. R. Laleau, N. Hadj-Rabia. Génération automatique de spécifications VDM à partir d'un schéma conceptuel de données. Dans Actes du 13ème congrès INFORSID, Grenoble, France, June 1995.
21. J.C. Freire Junior, M. Chabre-Peccoud, A. Front and J.-P. Giraudin. A CASE Tool for modeling of methods and information systems. OOIS'98, Int. Conference on Object-Oriented Information Systems, Springer, Paris, France, September 1998.
22. IFAD. The Rose-VDM++ Link. http://www.ifad.dk/Products/rose-vdmpp.htm
23. Headway Software. RoZeLink 1.0. http://www.calgary.shaw.wave.ca./headway/index.htm
24. R.B. France, J.-M. Bruel, M.M. Larrondo-Petrie. An Integrated Object-Oriented and Formal Modeling Environment. Journal of Object Oriented Programming, November/December 1997.
25. P. Caspi, N. Halbwachs, D. Pilaud and J. Plaice. LUSTRE, a declarative language for programming synchronous systems. In 14th Symposium on Principles of Programming Languages (POPL'87), ACM, pages 178–188, Munich, 1987.
26. G. Booch, I. Jacobson and J. Rumbaugh. The Unified Modeling Language User Guide. Addison-Wesley, 1998.
On Structured Workflow Modelling

Bartek Kiepuszewski (1), Arthur Harry Maria ter Hofstede (2), and Christoph J. Bussler (3)

(1) Mincom Limited, GPO Box 1397, Brisbane, Qld 4001, Australia, [email protected]
(2) Cooperative Information Systems Research Centre, Queensland University of Technology, GPO Box 2434, Brisbane, Qld 4001, Australia, [email protected]
(3) Netfish Technologies Inc., 2350 Mission College Blvd., Santa Clara, CA 95054, USA, [email protected]
Abstract. While there are many similarities between the languages of the various workflow management systems, there are also significant differences. One particular area of differences is caused by the fact that different systems impose different syntactic restrictions. In such cases, business analysts have to choose between either conforming to the language in their specifications or transforming these specifications afterwards. The latter option is preferable as this allows for a separation of concerns. In this paper we investigate to what extent such transformations are possible in the context of various syntactical restrictions (the most restrictive of which will be referred to as structured workflows). We also provide a deep insight into the consequences, particularly in terms of expressive power, of imposing such restrictions.
1
Introduction
Despite the interest in workflow management, both from academia and industry, there is still little consensus about its conceptual and formal foundations (see e.g. [7]). While there are similarities between the languages of various commercially available workflow management systems, there are also many differences. However, it is often not clear whether these differences are fundamental in nature. For example, as different systems impose different syntactic restrictions, one may wonder whether this affects the expressive power of the resulting languages. In addition to that, such variations result in business analysts being confronted with the question as to whether to conform to the target language right away when they specify their workflows, or to transform their specifications at a later stage. In this paper the focus is on syntactic variations in workflow specification languages. Different workflow management systems impose different syntactical restrictions. The most restrictive types of workflows will be referred to as structured
This research is supported by an ARC SPIRT grant “Component System Architecture for an Open Distributed Enterprise Management System with Configurable Workflow Support” between QUT and Mincom.
workflows. Systems such as SAP R/3 and Filenet Visual Workflo allow for the specification of structured workflows only. While enforcing restrictions may have certain benefits (e.g. verification and implementation become easier), the price that may have to be paid is that the resulting language is more difficult to use and has less expressive power. In this paper, it will be shown that some syntactic restrictions will lead to a reduction of the expressive power of the language involved, while other restrictions are of a less serious nature and can be overcome by the introduction of equivalence preserving transformation rules. It will also be shown that even though certain workflow models can be transformed into equivalent structured forms, the resulting models are less suitable than the original ones. Nevertheless, the automation of such rules could potentially lead to tools giving business analysts greater freedom in workflow specification without compromising their realisability in terms of commercially available (and preferred) workflow management systems.
2
Structured Workflows: Definitions
In this section the notion of a structured workflow is formally defined and some elementary properties stated. Workflows as used in this paper will employ concepts used in most commercial workflow management systems. Although the graphical notation used for representing workflows is irrelevant in terms of the results presented in this paper, we have to agree on one in order to provide examples to illustrate our arguments. Process elements will be represented by large circles; or-joins and or-splits will correspond to small, white circles, while and-joins and and-splits will correspond to small, shaded circles (the indegree and outdegree will always make it clear whether we are dealing with a join or a split). There are many examples of languages that support the specification of arbitrary workflows, e.g. Staffware (www.staffware.com), Forte Conductor (www.forte.com) and Verve WorkFlow (www.verveinc.com). A structured workflow is a workflow that is syntactically restricted in a number of ways. Intuitively a structured workflow is a workflow where each or-split has a corresponding or-join and each and-split has a corresponding and-join. These restrictions will guarantee certain important properties shown later in the paper and in some cases correspond to restrictions imposed by commercial workflow management systems.

Definition 1. A structured workflow model (SWM) is inductively defined as follows.
1. A workflow consisting of a single activity is a SWM. This activity is both initial and final.
2. Let X and Y be SWMs. The concatenation of these workflows, where the final activity of X has a transition to the initial activity of Y, is then also a SWM. The initial activity of this SWM is the initial activity of X and its final activity is the final activity of Y.
3. Let X1, ..., Xn be SWMs and let j be an or-join and s an or-split. The workflow with as initial activity s and final activity j and transitions between s and the initial activities of Xi, and between the final activities of Xi and j, is then also a SWM. Predicates can be assigned to the outgoing transitions of s. The initial activity of this SWM is s and its final activity is j.
4. Let X1, ..., Xn be SWMs and let j be an and-join and s an and-split. The workflow with as initial activity s and final activity j and transitions between s and the initial activities of Xi, and between the final activities of Xi and j, is then also a SWM. The initial activity of this SWM is s and its final activity is j.
5. Let X and Y be SWMs and let j be an or-join and s an or-split. The workflow with as initial activity j and as final activity s and with transitions between j and the initial activity of X, between the final activity of X and s, between s and the initial activity of Y, and between the final activity of Y and j, is then also a SWM. The initial activity of this SWM is j and its final activity is s.

All commercial WfMSs known to the authors allow for the specification of workflow models that are equivalent to structured models as defined in definition 1. Some of these WfMSs do not allow for the specification of arbitrary models though and they impose certain levels of structuredness by means of syntactical restrictions typically implemented in the graphical process designer. The most restricted workflow modelling languages known to the authors with respect to imposing structuredness are the languages of FileNet's Visual WorkFlo (www.filenet.com) (VW) and SAP R/3 Workflow. In both languages it is possible to design structured models only. These models resemble the definition provided earlier very closely with some minor exceptions such as that in VW the loops can only be of the form "WHILE p DO X". In SAP R/3 Workflow no loops are allowed to be modelled in a direct way. An example of syntactical restrictions in the more general area of data and process modelling can be found in UML's activity diagrams, where business modellers are forced to exclusively specify structured models. The definition of SWMs guarantees these types of workflows to have certain properties. Specifically, by the use of structural induction it can easily be shown that SWMs do not deadlock (see [5]). In addition to that, in SWMs it is not possible to have multiple instances of the same activity active at the same time. This situation is easily modelled in an arbitrary workflow if an and-split is followed by an or-join construct. Similarly, an arbitrary workflow will deadlock if an or-split is followed by an and-join. Since in the following sections we will regularly pay attention to arbitrary workflow models that do not deadlock and do not result in multiple instances, for terminological convenience we introduce the notion of well-behaved workflows.

Definition 2. A workflow model is well-behaved if it can never lead to deadlock nor can it result in multiple active instances of the same activity.

Corollary 1. Every structured workflow model is well-behaved.
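To make the inductive structure of Definition 1 concrete, the following sketch (ours, not taken from the paper; the class names are only illustrative) renders an SWM as an algebraic datatype in Python, with one constructor per rule.

```python
# A minimal sketch of Definition 1 as an inductive datatype (illustrative only).
from dataclasses import dataclass, field
from typing import List

class SWM:
    """Base class: every structured workflow model is built from the rules below."""

@dataclass
class Activity(SWM):        # rule 1: a single activity, both initial and final
    name: str

@dataclass
class Seq(SWM):             # rule 2: concatenation of two SWMs
    first: SWM
    second: SWM

@dataclass
class OrBlock(SWM):         # rule 3: or-split s, branches X1..Xn, or-join j
    branches: List[SWM]
    predicates: List[str] = field(default_factory=list)  # predicates on the outgoing transitions of s

@dataclass
class AndBlock(SWM):        # rule 4: and-split, parallel branches X1..Xn, and-join
    branches: List[SWM]

@dataclass
class Loop(SWM):            # rule 5: or-join j, body X, or-split s, back-edge through Y
    body: SWM
    back: SWM
```

Since any value of this datatype can only be assembled through the five rules, properties such as Corollary 1 can be argued by structural induction over it.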
434
B. Kiepuszewski, A.H.M. ter Hofstede, C.J. Bussler
Instead of requiring workflows to be structured, it is more common for workflow languages to impose restrictions on loops only. For example, IBM MQSeries/Workflow (www.ibm.com/software) and InConcert (www.inconcert.com) do not allow the explicit modelling of loops. Instead, loops have to be modelled by the use of decomposition. This is equivalent to using a "REPEAT X UNTIL p" loop. In the case of MQSeries/Workflow, predicate p is specified as the Exit Condition of the decomposition. Hence, in between arbitrary workflow models and structured workflow models, we recognise a third class of workflow models, referred to as restricted loop models.

Definition 3. A restricted loop workflow model (RLWFM) is inductively defined as follows:
1. An arbitrary workflow model without cycles is an RLWFM.
2. Let X and Y be RLWFMs, each with one initial and one final node. Let j be an or-join and s an or-split. The workflow with as initial activity j and as final activity s and with transitions between j and the initial activity of X, between the final activity of X and s, between s and the initial activity of Y, and between the final activity of Y and j, is then also an RLWFM.

Note that languages that support loops through decomposition are a subset of the class defined by the above definition (in those cases, essentially, Y corresponds to the empty workflow). Naturally, every SWM is an RLWFM and every RLWFM is an arbitrary workflow model.
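Continuing the illustrative datatype above (again our sketch, not part of the paper), Definition 3 can be pictured as follows; the REPEAT-UNTIL loops offered through decomposition in systems such as MQSeries/Workflow correspond to the special case in which the backward branch Y is empty.

```python
# Sketch (ours): restricted-loop workflow models of Definition 3.
from dataclasses import dataclass
from typing import Optional

@dataclass
class AcyclicModel:
    """Stand-in for an arbitrary workflow model without cycles (rule 1)."""
    description: str

@dataclass
class RestrictedLoop:
    """Rule 2: or-join j, forward part X, or-split s, backward part Y."""
    forward: AcyclicModel
    backward: Optional[AcyclicModel] = None   # None models the empty backward branch

def repeat_until(body: AcyclicModel) -> RestrictedLoop:
    # "REPEAT X UNTIL p", as offered through decomposition, is the special case
    # in which the backward branch Y is empty.
    return RestrictedLoop(forward=body, backward=None)
```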
3
Equivalence in the Context of Control Flow
As there exist workflow languages that do not allow for the specification of arbitrary workflows, business analysts are confronted with the option to either restrict their specifications such that they conform to the tool that is used or specify their workflows freely and transform them to the required language in a later stage. From the point of view of separation of concerns, the latter option is preferable. To support such a way of working it would be best to have a set of transformations that could be applied to a workflow specification in order to transform it to a structured workflow in the sense of the previous section. Naturally, these transformations should not alter the semantics of the workflows
Fig. 1. Illustration of the three different workflow model classes (example workflows labelled "Arbitrary specification", "Structured specification" and "Restricted loop workflow model")
to which they are applied; they should be equivalence preserving. However, this immediately raises the question as to what notion of process equivalence is the most applicable in the context of workflows (for an overview of different notions of process equivalence the reader is referred to [4]). One of the most commonly used equivalence notions is that of bisimulation. The formal definition of bisimulation between two different workflow systems, given the fact that they would most likely use different syntax and semantics, would have to be defined using some common formalism that can be applied to both systems. One of the most convenient ways to do this is to define bisimulation formally in terms of their Petri-net representation. That immediately leads to the conclusion that weak bisimulation has to be considered, since Petri-net representations of workflow models may use many internal, non-labelled places. In the context of workflow processes with parallelism, the notion of basic weak bisimulation is not strong enough. Bisimulation is defined in terms of execution sequences, i.e. in terms of arbitrary interleaving. As such, however, bisimulation cannot distinguish between a concurrent system and its sequential simulation. For that reason a stronger equivalence notion is needed. Such a notion is provided in [3], where it is referred to as fully concurrent bisimulation. Given the fact that the formal definition is relatively complex and the details are not particularly useful for the purpose of this paper, we will present fully concurrent bisimulation in the context of workflow specification in terms of the bisimulation game (adapted from [8]):
1. There are two players, Player A and Player B, each of whom has a workflow model specification (Workflow A and Workflow B).
2. Player A starts the initial activities in his workflow model specification. Player B responds by starting the initial activities in his workflow model specification (which should exactly correspond to those of Player A).
3. Player A may choose to finish any of his activities and start a corresponding subsequent activity. Player B responds accordingly by finishing and starting an activity with the same label (possibly performing some internal, non-labelled steps first).
4. If Player B cannot imitate the move of Player A, he loses. By imitating we mean that at any point in time the same set of activities in Workflow B should be completed and started as in Workflow A. Player B wins if he can terminate his workflow once Player A has terminated his workflow. Similarly, Player B wins if he can deadlock his workflow once Player A has deadlocked his workflow. The case of an infinite run of the game is considered to be successful for Player B too.
If there is a strategy for the defending player (Player B) that always prevents Player A from winning, then we say that Workflow B can simulate Workflow A. If the reverse applies as well (Workflow A can simulate Workflow B), then we consider the two workflow specifications to be equivalent.
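To see concretely why an interleaving-based notion is too weak, the following self-contained check (our illustration, not taken from [3] or [8]) computes the execution sequences of an and-split/and-join block running A and B in parallel and compares them with those of an or-split/or-join block that merely chooses one of the two sequential orders; the two sets coincide, so only a notion such as fully concurrent bisimulation can tell the two models apart.

```python
# Illustration (ours): interleavings cannot separate true concurrency from its
# sequential simulation.
def shuffles(xs, ys):
    """All interleavings (shuffles) of two sequential branches run in parallel."""
    if not xs:
        return {tuple(ys)}
    if not ys:
        return {tuple(xs)}
    return {(xs[0],) + rest for rest in shuffles(xs[1:], ys)} | \
           {(ys[0],) + rest for rest in shuffles(xs, ys[1:])}

# and-block running the branches ["A"] and ["B"] concurrently
concurrent_traces = shuffles(["A"], ["B"])

# or-block choosing between the two sequential simulations A;B and B;A
sequential_traces = {("A", "B"), ("B", "A")}

assert concurrent_traces == sequential_traces   # indistinguishable by interleavings alone
```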
4
From Arbitrary Workflow Models to SWMs
In this section we study transformations from arbitrary workflow models to SWMs and examine to what extent such transformations are possible. All transformations presented in this section assume that the workflow patterns they operate on do not contain data dependencies between decisions; in other words, for all intents and purposes all decisions can be treated as nondeterministic. Under this assumption, every execution permitted by the control flow specification can actually occur.

4.1
Simple Workflows without Parallelism
Workflows that do not contain parallelism are simple models indeed. Their semantics is very similar to that of elementary flow charts, which are commonly used for procedural program specification. The or-split corresponds to selection (the if-then-else statement) while the activity corresponds to an instruction in the flow chart. It is well known that any unstructured flow chart can be transformed into a structured one. In this section we will revisit these transformation techniques and present and analyse them in the context of workflow models. Following [11] we will say that the process of reducing a workflow model consists of replacing each occurrence of a base model within the workflow model by a single activity box. This is repeated until no further replacement is possible. A process that can be reduced to a single activity box represents a structured workflow model. Each transformation of an irreducible workflow model should allow us to reduce the model further and in effect reduce the number of activities in the model. The strong similarity of simple workflow models and flow diagrams suggests that if we do not consider parallelism, there are only four basic causes of unstructuredness (see e.g. [11,9]): 1) Entry into a decision structure, 2) Exit from a decision structure, 3) Entry into a loop structure, and 4) Exit from a loop structure. Entry into any structure is modelled in a workflow environment by an or-join construct. Similarly, an exit is modelled by an or-split. Once parallelism is introduced we will also consider synchronised entry and parallel exit, modelled by and-join and and-split constructs respectively. The first transformation (all transformations in this section are based on [9]), depicted in figure 2, can be performed when transforming a diagram containing an exit from a decision structure. This transformation is achieved through the use of auxiliary variables; a small illustrative code sketch of the idea is given after figure 2. It is important to observe that variable Φ is needed since activity D can potentially change the value of β or, if β is a complex expression, it could change the value of one of its components. The transformations depicted in figure 3 are used when a workflow model contains an entry to a decision structure. Here workflow B2 is a transformation of B1 achieved through node duplication, whereas workflow B3 is a transformation of B1 achieved through the use of auxiliary variables. The following two diagrams, depicted in figures 4 and 5, capture transformations to be used when
Fig. 2. Exit from a decision structure (Workflow A1 transformed into Workflow A2 by means of an auxiliary variable)
Fig. 3. Entry into a decision structure (Workflow B1 and its transformations: B2 by node duplication, B3 by auxiliary variables)
a model contains an entry to, or an exit from, a loop structure, respectively. Repeated application of the transformations discussed in this section can remove all forms of unstructuredness from a workflow. Hence the following theorem.

Theorem 1. All unstructured workflows without parallelism have an equivalent structured form.

Finally, it should be remarked that in some cases we have presented alternative transformations (not using auxiliary variables) and in some cases we have not. In later sections, we will show that this has a reason: in the cases where no extra transformations (not using auxiliary variables) are presented, such transformations turn out not to exist.

4.2
Workflows with Parallelism but without Loops
Addition of parallelism immediately introduces problems related to deadlock and multiple instances. As noted in section 2, structured workflow models never
Fig. 4. Entry into a loop structure (Workflow C1 and its transformations C2 and C3)
Fig. 5. Exit from a loop structure (Workflow D1 and its transformation D2)
result in deadlock nor multiple instances of the same activity at the same time. Hence, structured workflow models are less expressive than arbitrary workflow models. This immediately raises the question as to whether well-behaved workflow models can be transformed to structured workflow models. As the next theorem shows, the answer to this question is negative. Theorem 2. There are arbitrary, well-behaved, workflow models that cannot be modelled as structured workflow models.
Fig. 6. Arbitrary workflow and illustration of its essential causal dependencies
Proof. Consider the workflow fragment in figure 6. The first observation is that as activities B and C are causally independent (that is, they can be executed concurrently) they have to be in different branches of some parallel structure in a corresponding structured workflow. As activities C and E are causally dependent (E is always performed after C) there must be a path from C to some activity named E. This activity has to be in the same branch as C, as it cannot be outside the parallel structure since that would make it causally dependent on B. By applying similar reasoning, an activity named D has to be in the same branch of a parallel structure as B. Now we have that as C and D are in different branches of a parallel structure they are causally independent. However, in the original model they are causally dependent. Contradiction. No corresponding structured workflow exists.

To find out which workflow models can be effectively transformed into SWMs, let us concentrate on the causes of unstructuredness that can occur when parallelism is added. If loops are not taken into account, these causes are: 1) Entry to a decision structure, 2) Exit from a decision structure, 3) Entry to a parallel structure, 4) Exit from a parallel structure, 5) Synchronised entry to a decision structure, 6) Parallel exit from a decision structure, 7) Synchronised entry to a parallel structure, and 8) Parallel exit from a parallel structure. In the remainder of this section we will concentrate on which of these structures can be transformed to a structured model. Entries to and exits from decision structures are dealt with in section 4.1 and can obviously be transformed to a structured model. As a synchronised entry to a decision structure and an exit from a parallel structure lead to a potential deadlock (i.e. there are instances of the model that will deadlock), it follows that if the original workflow contains any of these patterns, it cannot be transformed into a SWM. Parallel exits from and synchronised entries to a parallel structure are dealt with in theorem 2. The reasoning of this theorem can be applied to any model that contains these patterns. Hence such models, even though they may be well-behaved, cannot be transformed into SWMs. Before analysing the two remaining structures let us define a syntactical structure called an overlapping structure. This structure has been previously introduced in the context of workflow reduction for verification purposes in [10]. A specific instance of it is shown in figure 7. An overlapping structure consists of an or-split followed by i instances of and-splits, followed by j instances of or-joins and finally by an and-join. The structure of figure 7 has both i and j equal to two. The overlapping structure contains both an entry to a parallel structure and a parallel exit from a decision structure and it never results in a deadlock. It is possible to transform an overlapping structure into a SWM as shown in figure 7. A thorough analysis of the causes of deadlock and multiple instances in workflow models (see e.g. [10]) leads to the conclusion that workflow models containing a parallel exit from a decision or an entry to a parallel structure will cause a potential deadlock unless they form a part of an overlapping structure or the
Fig. 7. Overlapping structure and its transformation into a SWM
exit path from the decision does not join the main execution path. Hence we conclude:
– An entry to a parallel structure can cause a potential deadlock unless it is part of an overlapping structure (in which case it can be transformed as shown).
– Similarly, a parallel exit from a decision structure can cause a potential deadlock and cannot be transformed into a SWM unless it is part of an overlapping structure or if the exit path does not join the main path (figure 8 illustrates the second case and the corresponding transformation).
The observations in this section have led us to the following conjecture:

Conjecture 1. Any arbitrary well-behaved workflow model that does not have loops, when reduced, does not have a parallel exit from a parallel structure, and, when reduced, does not have a synchronised entry into a parallel structure, can be translated to a SWM.
Fig. 8. Exit path not joining main path in parallel exit from decision structure (Workflow C1 and its transformation C2)
4.3
Workflows with Parallelism and Loops
Finding out whether a workflow can deadlock or not in the context of loops is much more complex and conjecture 1 cannot be automatically applied. To expose
potential difficulties, let us concentrate on what kind of loops we can encounter in a workflow model once and-join and and-split constructs are used. Every cycle in a graph has an entry point that can be either an or-join or an and-join and an exit point that can be either an and-split or an or-split. Cycles without an entry point cannot start and cycles without an exit point cannot terminate. The latter case can be represented by a cycle with an exit point where the exit condition on the or-split is set to false. Most cycles will have or-joins and or-splits as entry and exit points respectively (note that there may be many exit and entry points in a cycle), provided that the workflow is well-behaved. The transformation of such cycles is straightforward using the transformations presented earlier in this section. If a cycle has an and-join as an entry point, the workflow will most likely deadlock. Examples of two workflows containing cycles with an and-join as an entry point that do not deadlock are shown in figure 9.
Fig. 9. Two workflow models with arbitrary loops (Workflow A and Workflow B)
Conversely, workflows that have an and-split as an exit point will most likely result in multiple instances. Our previous observation that any workflow resulting in deadlock or multiple instances cannot be modelled as a structured workflow certainly holds whether or not the workflow has loops. The major impact of introducing loops, though, is that finding out whether the workflow deadlocks or results in multiple instances becomes a non-trivial task [6]. In the rare cases when a cycle has an and-join as entry and an and-split as exit point and the workflow involved neither deadlocks nor results in multiple instances, theorem 2 is helpful when determining if such a workflow can be remodelled as a structured workflow. In figure 9 for example, workflow A can be remodelled as a structured workflow whereas workflow B cannot. The equivalent workflow to workflow A is shown in figure 10.
Fig. 10. Structured version of leftmost workflow of figure 9
4.4
Suitability of Transformations
The transformations presented earlier in this section use two major techniques: 1) node duplication and 2) the use of auxiliary variables to control conditions. In this section we will comment on the suitability of these solutions. Suitability in general refers to the relation between concepts offered in the specification technique and concepts required by the problem domain. There are a number of aspects in a workflow specification, e.g. data and control flow, and there are a number of ways in which the same underlying model can be presented, e.g. a data flow and a control flow "view". Yet conceptual models, in general, are required to convey a certain amount of information which should not be split up if the model is to be effective (this corresponds to the Cognitive Sufficiency Principle promulgated by [2]). For example, we believe that a model that conveys all control flow interdependencies between activities in a control view is a better model than one that requires both the control flow view and the data flow view to understand the relationships between activities. Consider for example the three models from figure 3. In models B1 and B2 it is clear that activities B and D are exclusive in the sense that they will never both be executed in any process instance. On the other hand, in model B3, it seems that activity D can follow the execution of activity B. Only close inspection of the or-splits' predicates, as well as implicit knowledge that activity B does not change the value of variable Φ, can lead to the conclusion that activities B and D are indeed exclusive. To retain the suitability of a certain workflow model, transformations should avoid using auxiliary variables to control or-splits through predicates. Unfortunately, this is not always possible.

Theorem 3. There are forms of unstructuredness that cannot be transformed without the use of auxiliary variables.

Proof. Consider the workflow model of figure 5. This workflow model contains multiple exits from a loop and as such is unstructured. Now consider another workflow model equivalent to this model, which is structured. The first observation is that as workflow representations are finite, this structured workflow model needs to contain at least one loop, as the associated language is infinite. On one such loop there has to be an occurrence of both activities A and C. Activities B and F should be outside any loop (as we cannot use predicates anymore to prevent paths containing these activities from being chosen if they are included in the
body of the loop). Playing the bisimulation game yields that after each instance of activity A one should be able to choose to perform either C or B. Since B is outside any loop, there has to be an exit point from the loop sometime after activity A (but before activity C, as one cannot use predicates that guarantee that activity C will be skipped after the decision has been made to exit the loop). Similarly, after each instance of activity C one should be able to choose to perform either activity E or activity F. As F is outside any loop, we also have an exit point from this loop after activity C (but before activity E). Hence, the loop under consideration has at least two exit points and the workflow cannot be structured. Contradiction. Hence a structured equivalent of the workflow of figure 5 that does not use auxiliary variables does not exist. An alternative technique to transform arbitrary models into a structured form requires node duplication. As has been proved earlier, it cannot be used for every model, but even when it can be used, it is not without associated problems. Consider once again the model in figure 3. If activity D in the left model is followed by a large workflow specification, the transformation presented in the right model would need to duplicate the whole workflow specification following activity D. The resulting workflow will be almost twice as big as the original and will therefore be more difficult to comprehend.
5
Restricted Loops
In this section we will focus on languages that impose restrictions on loops only. Examples of such languages are MQSeries/Workflow and InConcert. The main reason these languages impose restrictions on loops is that the introduction of cycles in their workflow specifications would result in an immediate deadlock because of their evaluation strategy. MQSeries/Workflow for example propagates true and false tokens and its synchronizing or-join expects tokens from every incoming branch before execution can resume; this results in deadlock if one of these branches is dependent on execution of the or-join itself. Note that the semantics of the synchronising or-join is different from the semantics of the or-join as presented earlier in this paper, but that does not compromise the obtained results. The approach chosen in MQSeries/Workflow and InConcert guarantees that their specifications are well-behaved (for MQSeries/Workflow this is formally proven in [5]). Even though one may ask the question whether any arbitrary workflow specification can be translated to a specification that uses restricted loops only, the more practical question would be to ask whether any well-behaved arbitrary specification can be translated to a specification using restricted loops only. As the next theorem shows, the answer to this question is negative. Theorem 4. There are well-behaved arbitrary workflow specifications that cannot be expressed as RLWFMs.
Fig. 11. Well-behaved arbitrary workflow
Proof. By showing that the workflow from figure 11 cannot be modelled as an RLWFM. Observe that after completion of the initial activity and as long as α evaluates to true, there will be at least two tokens in the corresponding Petri-net. That means that in an equivalent workflow specification that has restricted loops only, there have to be two concurrent restricted loops running in parallel (if there were only one loop, the moment the exit condition was evaluated there would be only one token in the corresponding Petri-net). One of the restricted loops would have to contain activities A, B, C, and E, and the other loop would have to contain activities D, F, G, and H. In the original workflow specification A is causally dependent on D. That means that there must be a path between A and D, but that is impossible if A belongs to a different restricted loop than D, according to the definition of a restricted loop. The careful reader may have noticed that in the workflow model of figure 11 data is used to make sure that both loops are exited at the same time (otherwise deadlock would occur). It is an open question as to whether there exist well-behaved arbitrary workflow specifications that do not contain decision dependencies and that cannot be transformed into an RLWFM.
6
Conclusions
The transformation of arbitrary workflow models to workflows in a structured form is a necessity typically faced by either an application programmer who has to implement a non-structured workflow specification in an environment supporting structured specifications only (e.g. SAP R/3 workflow or Filenet Visual Workflo), or by a business analyst who is trying to capture real-world requirements in a structured workflow specification technique (e.g. UML’s activity diagrams). In this paper we have shown that even simple transformations require the use of auxiliary variables which results in the introduction of dependencies between decisions in a workflow graph. As a result the transformed workflow specification is typically more difficult to understand for end-users. Moreover, some arbitrary specifications cannot be transformed at all to a structured form. Hence in general, structured models are less expressive and less suitable than
arbitrary models. For these reasons it is our contention that any high-end workflow management system should support the execution of arbitrary workflow specifications. To some, this might seem to contrast with the common consensus of avoiding GOTO statements (and using WHILE loops instead) in procedural programming languages, but, as shown throughout this paper, the presence of parallelism as well as the nature of workflow specifications provide the essential difference. As a consequence, a good workflow modelling environment should be supported by a powerful verification engine that helps process modellers detect syntactical problems such as potential deadlock or unwanted multiple instances. Using sophisticated verification tools for these purposes (incorporating techniques from state-of-the-art Petri-net theory) seems feasible from a practical perspective (see [1]).
References

1. W.M.P. van der Aalst and A.H.M. ter Hofstede. Verification of Workflow Task Structures: A Petri-net-based Approach. Information Systems, 2000 (to appear).
2. A.P. Barros and A.H.M. ter Hofstede. Towards the construction of workflow-suitable conceptual modelling techniques. Information Systems Journal, 8(4):313–337, October 1998.
3. E. Best, R. Devillers, A. Kiehn, and L. Pomello. Concurrent bisimulations in Petri nets. Acta Informatica, 28:231–254, 1991.
4. R.J. van Glabbeek. The linear time-branching time spectrum. In J.C.M. Baeten and J.W. Klop, editors, Proceedings of CONCUR'90. Theories of Concurrency: Unification and Extension, pages 278–297, Berlin, Germany, 1990. Springer-Verlag.
5. A.H.M. ter Hofstede and B. Kiepuszewski. Formal Analysis of Deadlock Behaviour in Workflows. Technical report, Queensland University of Technology/Mincom, Brisbane, Australia, April 1999 (submitted for publication).
6. A.H.M. ter Hofstede and M.E. Orlowska. On the Complexity of Some Verification Problems in Process Control Specifications. Computer Journal, 42(5):349–359, 1999.
7. S. Jablonski and C. Bussler. Workflow Management: Modeling Concepts, Architecture, and Implementation. International Thomson Computer Press, London, United Kingdom, 1996.
8. P. Jančar. Decidability Questions for Bisimilarity of Petri Nets and Some Related Problems. In P. Enjalbert, E.W. Mayr, and K.W. Wagner, editors, STACS 94, 11th Annual Symposium on Theoretical Aspects of Computer Science, volume 775 of Lecture Notes in Computer Science, pages 581–592, Caen, France, February 1994. Springer-Verlag.
9. G. Oulsnam. Unravelling Unstructured Programs. Computer Journal, 25(3):379–387, 1982.
10. W. Sadiq and M.E. Orlowska. Applying Graph Reduction Techniques for Identifying Structural Conflicts in Process Models. In Proceedings of the 11th Conf. on Advanced Information Systems Engineering (CAiSE'99), pages 195–209, Heidelberg, Germany, June 1999.
11. M.H. Williams. Generating structured flow diagrams: the nature of unstructuredness. Computer Journal, 20(1):45–50, 1977.
A Model for Data Warehouse Operational Processes

Panos Vassiliadis (1), Christoph Quix (2), Yannis Vassiliou (1), and Matthias Jarke (2,3)

(1) National Technical University of Athens, Dept. of Electrical and Computer Eng., Computer Science Division, Iroon Polytechniou 9, 157 73, Athens, Greece, {pvassil,yv}@dbnet.ece.ntua.gr
(2) Informatik V (Information Systems), RWTH Aachen, 52056 Aachen, Germany, {quix,jarke}@informatik.rwth-aachen.de
(3) GMD-FIT, 53754 Sankt Augustin, Germany
Abstract. Previous research has provided metadata models that enable the capturing of the static components of a Data Warehouse (DW) architecture, along with information on different quality factors over these components. This paper complements this work with the modeling of the dynamic parts of the DW, i.e., with a metamodel for DW operational processes. The proposed metamodel is capable of modeling complex activities, their interrelationships, and the relationship of activities with data sources and execution details. Finally, the metamodel complements proposed architecture and quality models in a coherent fashion, resulting in a full framework for DW metamodeling, capable of supporting the design, administration and evolution of a DW. We have implemented this metamodel using the language Telos and the metadata repository system ConceptBase.
1
Introduction
Data Warehouses (DW) are complex and data-intensive systems that integrate data from multiple heterogeneous information sources and ultimately transform them into a multidimensional representation, which is useful for decision support applications. Apart from a complex architecture, involving data sources, the operational data store (ODS), the global data warehouse, the client data marts, etc., a DW is also characterized by a complex lifecycle. The DW involves a permanent design phase, where the designer has to produce various modeling constructs accompanied by a detailed physical design for efficiency reasons. The designer must also deal with the DW processes, which are complex in structure, large in number and hard to code at the same time. Viewing the DW as a set of layered, materialized views is thus a very simplistic view. For example, the DW refreshment process can already consist of many different subprocesses like data cleaning, archiving, transformations and aggregations, interconnected through a complex schedule [2]. The administration of the DW is also a complex task, where deadlines must be met for the population of the DW and contingency actions taken in the case of errors. Finally, we must add the evolution phase, which is a combination of design and administration: as time passes,
new data are requested by the end users, new sources of information become available, and the DW architecture must evolve to meet these challenges. All the data warehouse components, processes and data are – or at least should be – tracked and administered from a metadata repository. The metadata repository controls the data warehouse components and is therefore the essential starting point for design and operational optimization. Moreover, the schema of the metadata repository, expressed as the DW architecture model, should also be powerful enough to capture the semantics and structure of the data warehouse processes (design, refreshment, cleaning, etc.). Expressiveness and ease of use of the metadata schema are crucial for data warehouse quality. In [12], we have presented a metadata modeling approach which enables the capturing of the static parts of the architecture of a data warehouse, along with information over different quality dimensions of these components. The linkage of the architecture model to quality parameters (quality model) and its implementation in ConceptBase have been formally described in [13]. [27] presents a methodology for the actual exploitation of the information found in the metadata repository and the quality-oriented evolution of a data warehouse based on the architecture and quality model. In this paper, we complement these approaches with the metamodeling of the dynamic part of the data warehouse environment: the processes. As we will show in this paper, this kind of meta-information can be used to support the design and the evolution of the DW. In these phases of the DW lifecycle, the designers of a DW need information about processes: what they are supposed to do, why they are necessary, how they are implemented and how they affect other processes in the DW. We have identified several requirements for a DW process model. Specifically, the requirements involve: (i) the coverage of the complexity of the structure of DW processes, in terms of tasks executed within a single process, execution coherence, contingency treatment, etc.; (ii) the capturing of the relationship of processes with the involved data; (iii) the tracking of specific executed processes, so that people can relate the DW objects to decisions, tools and the facts which have happened in the real world [10]; (iv) a clear separation of perspectives: what components a process consists of (logical perspective), how they perform (physical perspective) and why these components exist (conceptual perspective); and finally (v) the representation of the linkage of the modeled processes to concrete quality factors which measure the quality of the DW. The contribution of this paper is towards the fulfillment of all the aforementioned requirements. We do not claim that our approach is suitable for any kind of process; rather, we focus our attention on the internals of DW systems. Our model has been implemented in the metadata repository ConceptBase [11]. Its usefulness is demonstrated by the fact that the proposed model enables DW management, design and evolution, as we will show in section 5.
Our model supports the following steps in the DW lifecycle: (i) the design of the DW is supported by extended use of consistency checks, to ensure the correctness of the representation; (ii) the model facilitates the administration of the DW, by enabling the measurement of quality of DW processes and the spotting of inefficiencies; (iii) a specific task of DW administration, namely evolution, is supported by the exploitation of the information
on the interdependencies of DW components, to forecast the possible impact of a change to any of these components. To facilitate these tasks, we derive the description of the DW materialized views from the process definitions used for their population, instead of treating the DW as a set of layers of materialized views. This paper is organized as follows: in section 2, we briefly discuss related work. Section 3 describes the process metamodel, and in section 4, we present its linkage to the quality model. In section 5, we present how the metadata repository can be exploited when enriched with this kind of information. Finally, in section 6 we conclude and present issues for future research. For lack of space, many issues are not thoroughly presented here. We refer the interested reader to [28] for more examples and explanations.
2
Related Work
Our approach is built around the Workflow Reference Model, presented in [29] by the Workflow Management Coalition (WfMC). We have adapted this model to the specific requirements of DW systems. We found the Workflow Reference Model too abstract for the purpose of a repository serving DW operational processes. First, the relationship of an activity with the data it involves is not really covered, although this would provide extensive information on the data flow in the DW. Second, the separation of perspectives is not clear, since the WfMC proposal focuses only on the structure of the workflows. To compensate for this shortcoming, we employ the basic idea of the Actor-Dependency model [31] (and its latest version, the Strategic Dependency model [30]) to add a conceptual perspective to the definition of a process, capturing the reasons behind its structure. In [31], three different ways to view a process are identified: what steps it consists of, how they are to be performed and why these steps exist. This separation is also represented in a software process data model to support software information systems [10]. The model captures the representation of design objects ("what"), design decisions ("why") and design tools ("how"). The idea of mapping the conceptual representation of a workflow to its execution is presented in [3]. The proposed model captures the mapping from workflow specification to workflow execution (in particular concerning exception handling). Importance is attached to the inter-task interaction, the relationship of workflows to external agents and the access to databases. An approach to validate workflow models is presented in [24,16]. The algorithms use a set of graph reduction rules to identify structural conflicts in a control flow specification, and to check the consistency of workflow temporal constraints.
3
Metamodel for Data Warehouse Operational Processes
In [12] a basic metamodel for data warehouse architecture and quality has been presented (Fig. 1). The framework describes a data warehouse in three perspectives: a
conceptual, a logical and a physical perspective. Each perspective is partitioned into the three traditional data warehouse levels: source, data warehouse and client level.
Fig. 1. Framework for Data Warehouse Architecture [12] (conceptual, logical and physical perspectives over the source, DW and client levels on the architecture side; process metamodel, process model and process traces on the process side, aligned with the metamodel, models/metadata and real-world levels)
On the metamodel layer, the framework gives a notation for data warehouse architectures by specifying meta classes for the usual data warehouse objects like data store, relation, view, etc. On the metadata layer, the metamodel is instantiated with the concrete architecture of a data warehouse, involving its schema definition, indexes, tablespaces, etc. The lowest layer in Fig. 1 represents the real world where the actual processes and data reside. The static description of the architecture parts of the DW (left part of Fig. 1) is complemented in this paper with a metamodel of the dynamic parts of the DW, i.e. the DW operational processes. As one can notice on the right side of Fig. 1, we again follow a three-level instantiation: a Process Metamodel deals with generic entities involved in all DW processes (operating on entities found at the DW metamodel level), the Process Model covers the processes of a specific DW by employing instances of the metamodel entities, and the Process Traces capture the execution of the actual DW processes happening in the real world. Our process model (cf. Fig. 2) also has different perspectives covering distinct aspects of a process: the conceptual, logical and physical perspective. This categorization fits naturally with the architecture model, since the perspectives of the process model operate on objects of the respective perspective of the architecture model. As mentioned in [31], there are different ways to view a process: what steps it consists of (logical perspective), how they are to be performed (physical perspective) and why these steps exist (conceptual perspective). Thus, we view a DW process from three perspectives: a central logical part of the model, which captures the basic structure of a process; its physical counterpart, which provides specific details over the actual components that execute the activity; and the conceptual perspective, which abstractly represents the basic interrelationships between DW stakeholders and
processes in a formal way. In the following subsections, we will elaborate on each of the three perspectives, starting from the logical one.
Fig. 2. The DW Operational Process Metamodel (conceptual perspective – why: Concept, Role; logical perspective – what: ProcessElement, CompositeActivity, TransitionElement, Activity, Type, Field; physical perspective – how: Agent (application), DataStore)
3.1
Complexity of the Process Structure
Following [29], the main entity of the logical perspective is an Activity. An activity represents an amount of "work which is processed by a combination of resource and computer applications". Activities are rather complex in nature and this complexity is captured by the specialization of Activity, namely CompositeActivity. We follow here the lessons coming from both repository and workflow management: there must be the possibility of zooming in and out of the repository. Composite activities are composed of ProcessElements, which is a generalization of the entities Activity and TransitionElement. A transition element is the "bridge" between two activities: it is employed for the interconnection of activities participating in a complex activity. The attribute Next of the process elements captures the sequence of events. Formally, a Process Element is characterized by the following attributes:
− Next: a ProcessElement which is next in the sequence of a composite activity. The attribute Next has itself two attributes that characterize it:
− Context: Since two activities can be interrelated in more than one complex DW process, the context is captured by a CompositeActivity instance.
− Semantics: This attribute denotes whether the next activity in a schedule happens on successful termination of the previous activity (COMMIT) or in the case where a contingency action is required (ABORT).
A TransitionElement inherits the attributes of ProcessElement but, most importantly, is used to add more information on the control flow in a composite activity. This is captured by two mechanisms. First, we enrich the Next link with more meaning, by adding a Condition attribute to it. A Condition is a logical expression in Telos denoting that the firing of the next activity is performed when the Condition is met. Second, we specialize the class TransitionElement into four prominent subclasses, capturing the basic connectives of activities, as suggested by [29]: Split_AND, Split_XOR, Join_AND, Join_XOR. Their semantics are obviously the same as those of the WfMC proposal. The proposed model supports two other constructs of the WfMC proposal, namely the dummy activities (modeled as simple Transition Elements) and the LOOP activities, captured as instances of CompositeActivity, with the extra attribute for, expressed as a string.
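The following sketch (ours; the metamodel itself is expressed in Telos within ConceptBase, and the Python names are only illustrative) shows how the logical-perspective entities and the attributes of the Next link described above fit together.

```python
# Hedged sketch (ours): the logical perspective of section 3.1 as plain Python classes.
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class ProcessElement:
    next: List["NextLink"] = field(default_factory=list)

@dataclass
class NextLink:
    target: ProcessElement
    context: "CompositeActivity"          # the complex activity this link belongs to
    semantics: str = "COMMIT"             # COMMIT or ABORT
    condition: Optional[str] = None       # Telos expression guarding the firing (used on transition elements)

@dataclass
class Activity(ProcessElement):
    name: str = ""

@dataclass
class TransitionElement(ProcessElement):  # specialized into Split_AND, Split_XOR, Join_AND, Join_XOR
    kind: str = "DUMMY"

@dataclass
class CompositeActivity(Activity):
    elements: List[ProcessElement] = field(default_factory=list)
    for_clause: Optional[str] = None      # the "for" attribute of LOOP activities
```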
3.2
Relationship with Data
We introduce the entity Type to capture the logical representation of a data store. A Type denotes the schema for all kinds of data stores. Formally, a Type is defined as a specialization of LogicalObject with the following attributes:
− Fields: a multi-value attribute. In other words, each Type has a name and a set of Fields, exactly like a relation in the relational model.
− Stored: a DataStore, i.e., a physical object representing any application used to manipulate stored data (e.g., a DBMS, cf. section 3.3).
Any kind of physical data store (multidimensional arrays, COBOL files, etc.) can be represented by a Type in the logical perspective. For example, the schema of multidimensional cubes is of the form [D1,...,Dn,M1,...,Mm], where the Di represent dimensions (forming the primary key of the cube) and the Mj the measures [26]. Each activity in a DW environment is linked to a set of incoming types as well as to a set of outgoing types. The DW activities are of a data-intensive nature, in their attempt to push data from the data sources to the DW materialized views or client data marts. We capture the outcome of a DW process as a function over the inputs. These semantics are captured through SQL queries, extended with functions wherever richer semantics than SQL are required. Therefore, an Activity is formally characterized by the following attributes:
− Next: inherited from ProcessElement.
− Input: represents all data stores used by the activity to acquire data.
− Output: represents all data stores or reports where the activity outputs data. The Output attribute is further explained by two attributes:
− Semantics: a single value belonging to the set {Insert, Update, Delete, Select}. A process can either add (i.e., append), or delete, or update the data in a data store. Also, it can output some messages to the user (captured by using a "Message" Type and Select semantics).
− Expression: an SQL query (extended with functions) to denote the relationship of the output and the input types.
− ExecutedBy: a physical Agent (i.e., an application program) executing the Activity. More information on agents will be provided in the sequel.
− Role: a conceptual description of the activity. This attribute will be properly explained in the description of the conceptual perspective.
To motivate the discussion, we will use a part of a case study, enriched with extra requirements, to capture the complexity of the model that we want to express. The organization under discussion has to collect various data about the yearly activities of all the hospitals of a particular region. The system relies on operational data from COBOL files. The source of data is a COBOL file dealing with the annual information by class of beds and hospital (here we use only three classes, namely A, B and C). The COBOL file yields a specific attribute for each class of beds. Each year, the COBOL file is transferred from the production system to the data warehouse and stored in a "buffer" table of the data warehouse, acting as a mirror of the file inside the DBMS. Then, the tuples of the buffer table are used by computation procedures to further populate a "fact" table inside the data warehouse. Several materialized views are then populated with aggregate information and used by client tools for querying.
We assume the following four Types: CBL, Buffer, Class_info and V1. The schemata of these types are depicted in Fig. 3. There are four Activities in the DW: Loading, Cleaning, Computation and Aggregation. The Loading activity simply copies the data from the CBL COBOL file to the Buffer type. H_ID is an identifier for the hospital and the last three attributes hold the number of beds per class. The Cleaning activity deletes all the entries violating the primary key constraint. The Computation activity transforms the imported data into a different schema. The date is converted from American to European format and the rest of the attributes are converted to a combination (Class_id, #Beds). The Aggregation activity simply produces the sum of beds by hospital and year. The expressions and semantics for each activity are listed in Fig. 4. All activities append data to the involved types, so they have INS semantics, except for the cleaning process, which deletes data and thus has DEL semantics.
Fig. 3. Motivating Example. [The figure shows the activity chain of the case study: Loading reads CBL(H_ID, DATE, CLASS_A, CLASS_B, CLASS_C) and writes Buffer(H_ID, DATE, CLASS_A, CLASS_B, CLASS_C) with INS semantics; Cleaning reads and writes Buffer with DEL semantics; Computation reads Buffer and writes Class_info(H_ID, EDATE, CLASS_ID, #BEDS) with INS semantics; Aggregation reads Class_info and writes V1(H_ID, EDATE, SUM(#BEDS)) with INS semantics. Consecutive activities are connected by next links.]
Loading.Out (Semantics: INS):
  SELECT * FROM CBL

Cleaning.Out (Semantics: DEL):
  SELECT * FROM BUFFER B1
  WHERE EXISTS (SELECT B2.H_ID, B2.DATE FROM BUFFER B2
                WHERE B1.H_ID = B2.H_ID AND B1.DATE = B2.DATE
                GROUP BY H_ID, DATE HAVING COUNT(*) > 1)

Computation.Out (Semantics: INS):
  SELECT H_ID, EUROPEAN(DATE) AS EDATE, 'A', CLASS_A AS #BEDS FROM BUFFER WHERE CLASS_A <> 0
  UNION
  SELECT H_ID, EUROPEAN(DATE) AS EDATE, 'B', CLASS_B AS #BEDS FROM BUFFER WHERE CLASS_B <> 0
  UNION
  SELECT H_ID, EUROPEAN(DATE) AS EDATE, 'C', CLASS_C AS #BEDS FROM BUFFER WHERE CLASS_C <> 0

Aggregation.Out (Semantics: INS):
  SELECT H_ID, EDATE, SUM(#BEDS) AS SUM_BEDS FROM CLASS_INFO GROUP BY H_ID, EDATE

Fig. 4. Expressions and semantics for the motivating example
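As an illustration of how the logical perspective might be populated for this example, the following sketch records the four activities and their types as plain Python dictionaries. The field names mirror the attributes introduced above (Input, Output, Semantics, Expression), but the dictionary encoding, the shortened SQL strings and the field spellings are assumptions made for illustration; they are not the Telos representation stored in the actual repository.

```python
# Types of the motivating example: name -> list of fields
types = {
    "CBL":        ["H_ID", "DATE", "CLASS_A", "CLASS_B", "CLASS_C"],
    "Buffer":     ["H_ID", "DATE", "CLASS_A", "CLASS_B", "CLASS_C"],
    "Class_info": ["H_ID", "EDATE", "CLASS_ID", "#BEDS"],
    "V1":         ["H_ID", "EDATE", "SUM_BEDS"],
}

# Activities in 'next' order, each with Input, Output, Semantics and (abbreviated) Expression
activities = [
    {"name": "Loading",     "input": ["CBL"],        "output": "Buffer",
     "semantics": "INS", "expression": "SELECT * FROM CBL"},
    {"name": "Cleaning",    "input": ["Buffer"],     "output": "Buffer",
     "semantics": "DEL", "expression": "SELECT * FROM BUFFER B1 WHERE EXISTS (...)"},
    {"name": "Computation", "input": ["Buffer"],     "output": "Class_info",
     "semantics": "INS", "expression": "SELECT H_ID, EUROPEAN(DATE) AS EDATE, ... FROM BUFFER"},
    {"name": "Aggregation", "input": ["Class_info"], "output": "V1",
     "semantics": "INS",
     "expression": "SELECT H_ID, EDATE, SUM(#BEDS) AS SUM_BEDS FROM CLASS_INFO GROUP BY H_ID, EDATE"},
]
```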
3.3 The Physical Perspective
Whereas the logical perspective covers the structure of a process (the "what" in the terminology of [31]), the physical perspective covers the details of its execution (the "how"). Each process is executed by an Agent (i.e., an application program). Each Type is physically stored by a DataStore (providing information on issues like tablespaces, indexes, etc.). An Agent can be formalized as follows:
− State: a value of AgentStateDomain = {In_Progress, Commit, Abort}.
− Init_time, Abort_time, Commit_time: timestamps.
− Execution_parameters: any information about the execution of an agent.
− In, Out: physical DataStores communicating with the Agent. The types used by the respective logical activity must be stored within these data stores.
The information of the physical perspective can be used to trace and monitor the execution of the processes. The relationship between the logical and the physical perspective is established by linking each activity to a specific application program.
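For concreteness, an Agent can be rendered along the same illustrative lines as the logical entities above. The dataclass encoding and the field names in_stores/out_stores are assumptions for illustration, not the repository implementation.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

@dataclass
class Agent:
    name: str
    state: str = "In_Progress"                             # In_Progress, Commit or Abort
    init_time: Optional[str] = None
    commit_time: Optional[str] = None
    abort_time: Optional[str] = None
    execution_parameters: Dict[str, str] = field(default_factory=dict)
    in_stores: List[str] = field(default_factory=list)     # physical DataStores read
    out_stores: List[str] = field(default_factory=list)    # physical DataStores written
```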
3.4 The Conceptual Perspective
A major purpose behind the introduction of the conceptual perspective is to help the interested stakeholder to understand the reasoning behind any decisions on the architecture and characteristics of the DW processes. First of all, each Type (i.e. Relation, Cube, etc.) in the logical perspective is a representation of a Concept in the conceptual perspective. A concept is an abstract entity representing a real world class of objects, in terms of a conceptual metamodel, e.g., the ER model. Both Types and
454
P. Vassiliadis et al.
Concepts are constructed from Fields (representing their attributes), through the attribute fields. The central entity in the conceptual perspective is the Role, which is the conceptual counterpart both of activities and concepts. The Role is basically used to express the interdependencies of these entities, through the attribute RelatesTo. Formally, a Role is defined as follows: − RelatesTo: another Role. − As: a value of the RelationshipDomain = {suitable, dependent}. − Wrt: a multi-value attribute including instances of class ConceptualObject. − dueTo: a string documenting any extra information on the relationship. A role represents any program or data store participating in the environment of a process, charged with a specific task and/or responsibility. The interrelationship between roles is modeled through the RelatesTo relationship. Since both data and processes can be characterized by SQL statements, their interrelationship can be traced in terms of attributes (Fig. 5). An instance of this relationship is a statement about the interrelationship of two roles in the real world, such as ‘View V1 relates to table Class_Info with respect to the attributes Id, Date and number of beds as dependent due to data loading reasons’.
Attribute     Role 1        Role 2        As          Wrt
Example 1     Buffer        CBL           Dependent   CBL.*
Example 2     Aggregation   Class_Info    Dependent   H_ID, Edate, #Beds
Fig. 5. Examples of role interrelationships for the motivating example

The conceptual perspective is influenced by the Actor-Dependency model [31]. In this model, the actors depend on each other for the accomplishment of goals and the delivery of products. The dependency notion is powerful enough to capture the relationship of processes where the outcome of the preceding process is the input for the following one. Still, our approach is more powerful, since it can also capture suitability (e.g., in the case where more than one concept can apply for the population of an aggregation, one concept is suitable to replace the other). In the process of understanding errors or design decisions over the architecture of a DW, the conceptual perspective can be used as a reasoning aid to discover the interdependencies of the actors (possibly in a transitive fashion) and the possible alternatives for different solutions, through a set of suitable candidates. Moreover, the provided links to the logical perspective enable the user to pass from the abstract relationships of roles to the structure of the system. Finally, DW evolution can be designed and influenced by the interdependencies tracked by the Role entities. It can be shown that these interdependencies do not always have to be stored directly; they can also be computed, owing to their transitive nature [28].
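The last remark, that role interdependencies can be derived transitively rather than stored exhaustively, can be illustrated with a small sketch. RelatesTo edges are kept as pairs and the transitive closure is computed on demand; the data layout, and the edges beyond the two rows of Fig. 5, are hypothetical and serve illustration only.

```python
from collections import defaultdict

# RelatesTo edges: role -> roles it directly depends on (illustrative extension of Fig. 5)
relates_to = defaultdict(set)
relates_to["Buffer"].add("CBL")
relates_to["Class_info"].add("Buffer")
relates_to["Aggregation"].add("Class_info")
relates_to["V1"].add("Aggregation")

def all_dependencies(role, graph):
    """Return every role that `role` depends on, directly or transitively."""
    seen, stack = set(), [role]
    while stack:
        for parent in graph[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen

print(all_dependencies("V1", relates_to))   # {'Aggregation', 'Class_info', 'Buffer', 'CBL'}
```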
4 Issues on Process Quality
We adopt the metamodel for the relationship between architecture objects and quality factors presented in [13], which is based on the Goal-Question-Metric approach (GQM) described in [21]. Each object in a DW architecture is linked to a set of quality goals and a set of quality factors (Fig. 6). A quality goal is an abstract requirement, defined on DW objects, and documented by a purpose and the stakeholder interested in it. Quality dimensions are used to classify quality goals and factors into different categories. A Quality Factor represents a quantitative assessment of particular aspect of a DW object, i.e. it relates quality aspects both to actual measurements and expected ranges for these quality values. The bridge between the abstract, subjective quality goals and the specific, objective quality factors is determined through a set of quality queries (or questions), to which quality factor values are provided as possible answers. Quality questions are the outcome of the methodological approach described in [27]. The methodology offers “template” quality factors and dimensions, defined at the metadata level and instantiates them, for the specific DW architecture under examination. As a result of the goal evaluation process, a set of improvements (e.g. design decisions) can be proposed, in order to achieve the expected quality. An extensive list of such “templates” can be found in [12]. Quality goals describe intentions or plans of the DW users with respect to the status of the DW. In contrast, our process model presented in section 3 describes facts about the current status of the DW and what activities are performed in the DW. However, the reason behind the execution of a process is a quality goal which should be achieved or improved by this process. For example, a data cleaning process is executed on the ODS in order to improve the accuracy of the DW. We have represented this dependency between processes and quality goals by extending the relationship between roles and DW objects in the conceptual perspective of the process model (relationship Expressed For). This is shown in the upper part of Fig. 6.
Fig. 6. Relationships between processes and quality. [The figure relates the entities Role, Activity, Agent and DWObject of the process model to the entities QualityGoal, QualityQuery, QualityDimension and QualityFactor of the quality metamodel, via the relationships Expressed for, Defined on, has Role, Executed by, Refers to, Evidence for, Evaluated by, measures and affects.]
The lower part of Fig. 6 represents the relationship between processes and quality on a more operational level. The operation of an agent in the DW will have an impact on quality factors of DW objects. The relationship affects represents both the measured and expected effect of a DW process on DW quality. The effect of a DW process must always be confirmed by new measurements of the quality factors. Unexpected effects of DW processes can be detected by comparing the measurements with the expected behavior of the process. The vocabulary / domain of the quality questions, with respect to the process model is anticipated to be the set of DW activities, which can of course be mapped to reasons (roles) and conditions (of agents) for a specific situation. Thus, the discrimination of logical, conceptual and physical perspectives is verified once more, in the quality management of the DW: the quality goals can express “why” things have happened (or should happen) in the DW, the quality questions try to discover “what” actually happens and finally, the quality factors express “how” this reality is measured. We define a set of generic quality dimensions to classify quality goals and factors of DW processes. It is influenced mainly from the quality criteria for workflows defined in [7] and the quality dimensions for software evaluation presented in [9]: (i) Correctness: a specification exists, describing the conditions under which the process has achieved its aim; (ii) Functionality: the process satisfies specific needs of data warehouse stakeholders; (iii) Efficiency: the process has a good balance between level of performance and amount of used resources; (iv) Maintainability: the degree of easiness with which the process can be modified.
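The affects relationship described above lends itself to a simple operational check: compare the expected effect of a process on a quality factor with the measurement taken after the process has run. The following sketch flags unexpected effects; the record layout, the agent and quality factor names and the tolerance value are assumptions made for illustration.

```python
# Expected and measured effects of agents on quality factors (illustrative records)
expected = {("Cleaning_agent", "Buffer.accuracy"): +0.05}   # expected improvement
measured = {("Cleaning_agent", "Buffer.accuracy"): -0.02}   # change actually observed

def unexpected_effects(expected, measured, tolerance=0.01):
    """Return (agent, factor) pairs whose measured effect deviates from the expectation."""
    return [key for key, exp in expected.items()
            if abs(measured.get(key, 0.0) - exp) > tolerance]

print(unexpected_effects(expected, measured))   # [('Cleaning_agent', 'Buffer.accuracy')]
```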
5 Exploitation of the Metadata Repository
We exploit the metadata repository in all phases of the DW lifecycle. During the design phase, the user can check the consistency of his/her design, to determine any violations of the business logic of the DW or of simple rules over the structure of the DW schema. During the administration phase (i.e., in the everyday usage of the DW), we can use the repository to discover quality problems. A particular task in the DW lifecycle, DW evolution, is supported by the repository in order to determine possible impacts when the schema of a particular object changes. The consistency of the metadata repository should be checked to ensure the validity of the representation of the real world in the repository. For example, the repository can check whether the type definitions of activities, agents and data stores are consistent with each other. The repository can also be used by external programs to support the execution of consistency checking algorithms like the ones proposed in [24, 16]. In addition, the quality information stored in the repository may be used to find deficiencies in the data warehouse. For example, we can search the repository for all the data cleaning activities which have decreased the availability of a data store according to the stored measurements. The significance of such a query is that it can show that the implementation of the data cleaning process has become inefficient.
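As an illustration of the kind of consistency check mentioned above, the sketch below verifies that every type read or written by an activity is stored in a data store that the executing agent actually communicates with. The metadata layout (dictionaries), the stored_in mapping and the store names SourceFS and DW_DBMS are assumptions for illustration, not the ConceptBase implementation.

```python
def check_activity_agent_consistency(activity, agent, stored_in):
    """Report types of `activity` whose data store is not among the agent's In/Out stores.

    activity:  {"name": ..., "input": [...], "output": ..., "executed_by": ...}
    agent:     {"name": ..., "in": [...], "out": [...]}
    stored_in: mapping type name -> data store name
    """
    problems = []
    for t in activity["input"]:
        if stored_in[t] not in agent["in"]:
            problems.append((activity["name"], t, "input store not reachable by agent"))
    t = activity["output"]
    if stored_in[t] not in agent["out"]:
        problems.append((activity["name"], t, "output store not reachable by agent"))
    return problems

# Illustrative instance: the Loading activity and a hypothetical loader agent
stored_in = {"CBL": "SourceFS", "Buffer": "DW_DBMS"}
loading = {"name": "Loading", "input": ["CBL"], "output": "Buffer", "executed_by": "loader"}
loader  = {"name": "loader", "in": ["SourceFS"], "out": ["DW_DBMS"]}
print(check_activity_agent_consistency(loading, loader, stored_in))   # [] -> consistent
```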
We do not elaborate further on these two issues here, but proceed to deal with the problem of DW evolution. We refer the interested reader to [28] for a complete discussion.
Algorithm Extract_Type_Definitions
Input: a list of processes P=[P1,P2,…,Pn] and a set of types T={T1,T2,…,Tm}. Each process P[i] has a type P[i].out, belonging to T, and an expression P[i].expr. Each type t of T has an SQL expression t.expr comprised of a set of "inserted" data (t.i_expr) and "deleted" data (t.d_expr). There is also a subset S of T containing the source types.
Output: a set of SQL definitions for each type of T.
Begin
  Initialize all the expressions of T−S to {}.
  For i := 1 to n
    Case
      P[i].semantics = 'INS': P[i].out.i_expr := P[i].out.i_expr UNION Reduce(P[i].expr)
      P[i].semantics = 'DEL': P[i].out.d_expr := P[i].out.d_expr UNION Reduce(P[i].expr)
    End_case
    P[i].out.expr := P[i].out.i_expr MINUS P[i].out.d_expr
  End_for
End
where Reduce(expr):
1. Use the technique of [17] to represent SQL queries; if self-references exist (e.g., in the case of DEL statements), discriminate between multiple occurrences of the same table.
2. Use the reduction techniques of [22, 14, 15] wherever applicable to reduce the query definition to a compact form.
Fig. 7. Algorithm for extracting the definition of a type in the repository
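A runnable rendering of the algorithm of Fig. 7 might look as follows. The Reduce step is represented by a placeholder that returns the expression unchanged, since the rewriting techniques of [17, 22, 14, 15] are outside the scope of this sketch; the dictionary encoding of processes and types is again an illustrative assumption, not the repository implementation.

```python
def reduce_expr(expr):
    # Placeholder for the Reduce() step of Fig. 7; the real step would apply the
    # query representation of [17] and the reductions of [22, 14, 15].
    return expr

def extract_type_definitions(processes, types, source_types):
    """Derive an SQL-like definition (i_expr MINUS d_expr) for every non-source type."""
    defs = {t: {"i_expr": [], "d_expr": []} for t in types if t not in source_types}
    for p in processes:                      # processes are assumed to be in 'next' order
        out = p["output"]
        if out in source_types:
            continue
        if p["semantics"] == "INS":
            defs[out]["i_expr"].append(reduce_expr(p["expression"]))
        elif p["semantics"] == "DEL":
            defs[out]["d_expr"].append(reduce_expr(p["expression"]))
    return {
        t: "(" + " UNION ".join(d["i_expr"]) + ")"
           + "".join(" MINUS (" + e + ")" for e in d["d_expr"])
        for t, d in defs.items() if d["i_expr"]
    }

# With the `activities` and `types` sketched after Fig. 4:
# extract_type_definitions(activities, types, source_types={"CBL"})
```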
5.1 Interdependencies of Types, Processes and Roles
We suppose that there is a set of types belonging to the set SourceSchema, denoting all the types found in the data sources. We treat the types of SourceSchema as source nodes of a graph: we do not consider any processes affecting them. For the rest of the types, we can derive an SQL expression by using existing view reduction algorithms. Several complementary proposals exist, such as [14], corrected with the results of [5, 18, 19] (to which we will collectively refer as [14] in the sequel), as well as [15, 17, 22]. The proposed algorithm is applicable to graphs of activities that do not involve updates. In most cases, an update operation can be considered as a combination of insertions and deletions, or as the application of the appropriate function to the relevant attributes.
5.2 Repository Support for Data Warehouse Evolution
The problem that arises is to keep all the DW objects and processes consistent with each other in the presence of changes. For example, suppose that the definition of a materialized view in the DW changes. This change triggers a chain reaction in the DW: the update process must evolve (both at the refreshment and the cleaning steps), and the old, historical data must be migrated to the new schema (possibly with respect to the new selection conditions, too). All the data stores of the DW and client level which are populated from this particular view must be examined with respect to their schema, content and population processes. In our approach, we distinguish two kinds of impact:
− Direct impact: the change in the DW object imposes that some action must be taken against an affected object; e.g., if an attribute is deleted from a materialized view, then the activity which populates it must also be changed accordingly.
− Implicit impact: the change in the DW object might change the semantics of another object, without necessarily changing the structure of the latter.
Our model enables us to construct a partially ordered graph (which can be produced by proper queries in Telos): for each Type instance, say t, there is a set of types and activities used for the population of t ("before" t), denoted as B(t). Also, there is another set of objects using t for their population ("after" t), denoted as A(t). In Fig. 7, we showed how we can derive an SQL definition for each type in the repository. Suppose that a type t is characterized by an expression e, which is supported by a set of auxiliary SQL expressions, producing thus the set e = {e1, e2, …, en}. Obviously, some of the expressions belonging to e also belong to B(t). Thus, we extend B(t) as B(t) ∪ e (with set semantics). Suppose, then, that the final SQL expression of a type t, say e, changes into e'. Following the spirit of [8], we can use the following rules for schema evolution in a DW environment (we consider that the changes abide by the SQL syntax and that the new expression is valid):
− If the select clause of e' has an extra attribute with respect to e, then propagate the extra attribute down the line to the base relations: there must be at least one path from a type belonging to a SourceSchema to an activity whose out expression involves the extra attribute. If we delete an attribute from the select clause of a Type, it must no longer appear in the select clause of the processes that directly populate the respective type, or in the subsequent Types and the processes that use this particular Type. In the case of the addition of an attribute, the impact is direct for the previous objects B(t) and implicit for the successor objects A(t). In the case of deletion, the impact is direct for both categories.
− If the where clause of e' is more strict than that of e, then the where clause of at least one process belonging to B(t) must change identically. If this is not possible, a new process can be added just before t, simply deleting the respective tuples through the expression e' − e. If the where clause of e' is less strict than that of e, then we can use well-known subsumption techniques [20, 25] to determine which types can be (re)used to calculate the new expression e' of t. The having clause is treated in the same fashion. The impact is direct for the previous and implicit for the successor objects.
− If an attribute is deleted from the group by clause of e, then at least the last activity performing a group-by query should be adjusted accordingly. All the consequent activities in the population chain of t must change too (as if an attribute had been deleted). If this is not feasible, we can add an aggregating process performing this task exactly before t. If an extra attribute is added to the group by clause of e, then at least the last activity performing a group-by query should be adjusted accordingly. The check is performed recursively for the types populating this particular type, too. If this fails, the subsumption techniques mentioned for the where clause can be used for the same purpose again. The impact is direct both for previous and for successor objects; only in the case of attribute addition is it implicit for the successor objects.
We do not claim that we provide a concrete algorithmic solution to the problem. Rather, we sketch a methodological set of steps, in the form of suggested actions, to perform this kind of evolution. Similar algorithms for the evolution of views in DWs can be found in [1, 8]. A tool could easily visualize this evolution plan and allow the user to react to it.
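The evolution rules above can be read as a simple impact-propagation procedure over the B(t)/A(t) graph. The following sketch classifies affected objects as directly or implicitly impacted for the cases of attribute addition and deletion. It is only a schematic rendering of the suggested steps, with an assumed graph encoding and a hypothetical successor object ClientReport; it is not the algorithmic solution the text deliberately leaves open.

```python
def impact_of_select_change(t, change, before, after):
    """Classify objects affected when an attribute is added to or deleted from type t.

    before, after: mappings object -> set of predecessor/successor objects (B(t), A(t)).
    change: 'add_attribute' or 'delete_attribute'.
    """
    direct, implicit = set(before[t]), set()
    if change == "add_attribute":
        implicit = set(after[t])        # successors keep their structure; semantics may shift
    elif change == "delete_attribute":
        direct |= set(after[t])         # successors must drop the attribute as well
    return {"direct": direct, "implicit": implicit}

# Illustrative B/A sets with V1 as the changed view ("ClientReport" is hypothetical)
before = {"V1": {"Aggregation", "Class_info"}}
after  = {"V1": {"ClientReport"}}
print(impact_of_select_change("V1", "delete_attribute", before, after))
```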
6 Conclusions
This paper describes a metamodel for DW operational processes. This metamodel enables DW management, design and evolution based on a high level conceptual perspective, which can be linked to the actual structural and physical aspects of the DW architecture. The proposed metamodel is also capable of modeling complex activities, their interrelationships, the relationship of activities with data sources and execution details. Finally, the metamodel complements existing architecture and quality models in a coherent fashion, resulting in a full framework for DW metamodeling. We have implemented this metamodel using the language Telos and the metadata repository system ConceptBase. In this paper, we have used the global-as-view approach for the DW definition, i.e., we reduce the definition of the DW materialized views to the data sources. We plan to investigate the possibility of using the local-as-view approach (which means reducing both the view definitions and the data sources to a global enterprise model), as it appears to provide several benefits over the global-as-view approach [4].
Acknowledgments. This research is sponsored in part by the European Esprit Projects "DWQ: Foundations of Data Warehouse Quality", No. 22469, and "MEMO: Mediating and Monitoring Electronic Commerce", No. 26895, and by the Deutsche Forschungsgemeinschaft (DFG) under the Collaborative Research Center IMPROVE (SFB 476). We would like to thank all our DWQ partners who contributed to the progress of this work, and especially Professors Timos Sellis, Mokrane Bouzeghoub, Manfred Jeusfeld and Maurizio Lenzerini.
References
1. Z. Bellahsène. Structural View Maintenance in Data Warehousing Systems. Journées Bases de Données Avancées (BDA '98), Tunis, October 1998.
2. M. Bouzeghoub, F. Fabret, M. Matulovic. Modeling Data Warehouse Refreshment Process as a Workflow Application. Workshop on Design and Management of Data Warehouses (DMDW'99), Heidelberg, Germany, 1999.
3. F. Casati, S. Ceri, B. Pernici, G. Pozzi. Conceptual Modeling of Workflows. Conf. on Object-Oriented and Entity-Relationship Modelling (OOER'95), Australia, 1995.
4. D. Calvanese, G. De Giacomo, M. Lenzerini, D. Nardi, R. Rosati. A principled approach to data integration and reconciliation in data warehousing. Workshop on Design and Management of Data Warehouses (DMDW'99), Heidelberg, 1999.
5. R.A. Ganski, H.K.T. Wong. Optimization of Nested SQL Queries Revisited. In Proc. of ACM SIGMOD Conference, San Francisco, pp. 23-33, 1987.
6. D. Georgakopoulos, M. Hornick, A. Sheth. An overview of workflow management: From process modeling to workflow automation infrastructure. Distributed and Parallel Databases, 3(2):119-153, 1995.
7. D. Georgakopoulos, M. Rusinkiewicz. Workflow management: From business process automation to inter-organizational collaboration. Tutorials of the 23rd International Conference on Very Large Data Bases, Athens, Greece, 1997.
8. A. Gupta, I. Mumick, K. Ross. Adapting Materialized Views after Redefinitions. In Proc. of ACM SIGMOD Conference, San Jose, CA, pp. 211-222, 1995.
9. ISO/IEC 9126 International Standard. Intl. Organization for Standardization, 1991.
10. M. Jarke, M.A. Jeusfeld, T. Rose. A software process data model for knowledge engineering in information systems. Information Systems, 15(1):85-116, 1990.
11. M. Jarke, R. Gallersdörfer, M.A. Jeusfeld, M. Staudt, S. Eherer. ConceptBase - a deductive objectbase for meta data management. J. Intelligent Information Systems, 4(2):167-192, 1995.
12. M. Jarke, M.A. Jeusfeld, C. Quix, P. Vassiliadis. Architecture and quality in data warehouses: An extended repository approach. Information Systems, 24(3):229-253, 1999. (A previous version appeared in CAiSE '98, Pisa, Italy, 1998.)
13. M.A. Jeusfeld, C. Quix, M. Jarke. Design and Analysis of Quality Information for Data Warehouses. 17th Conf. Entity Relationship Approach (ER'98), Singapore, pp. 349-362, 1998.
14. W. Kim. On Optimizing an SQL-like Nested Query. ACM TODS, 7(3):443-469, 1982.
15. A. Levy, I. Mumick, Y. Sagiv. Query Optimization by Predicate Move-Around. In Proc. of 20th VLDB Conference, Chile, pp. 96-107, 1994.
16. O. Marjanovic, M. Orlowska. On Modeling and Verification of Temporal Constraints in Production Workflows. Knowledge and Information Systems, 1(2):157-192, 1999.
17. I. Mumick, S. Finkelstein, H. Pirahesh, R. Ramakrishnan. Magic is Relevant. In Proc. of ACM SIGMOD Conference, Atlantic City, NJ, pp. 247-258, 1990.
18. M. Muralikrishna. Optimization and Dataflow Algorithms for Nested Tree Queries. In Proc. of 15th VLDB Conference, Amsterdam, The Netherlands, pp. 77-85, 1989.
19. M. Muralikrishna. Improved Unnesting Algorithms for Join Aggregate SQL Queries. In Proc. of 18th VLDB Conference, Vancouver, Canada, pp. 91-102, 1992.
20. W. Nutt, Y. Sagiv, S. Shurin. Deciding Equivalences among Aggregate Queries. In Proc. of 17th Symposium on the Principles of Databases (PODS'98), Seattle, pp. 214-223, 1998.
21. M. Oivo, V. Basili. Representing software engineering models: the TAME goal-oriented approach. IEEE Trans. Software Engineering, 18(10):886-898, 1992.
22. H. Pirahesh, J. Hellerstein, W. Hasan. Extensible/Rule Based Query Rewrite Optimization in Starburst. In Proc. of ACM SIGMOD Conference, San Diego, CA, pp. 39-48, 1992.
23. B. Ramesh, V. Dhar. Supporting systems development by capturing deliberations during requirements engineering. IEEE Trans. Software Engineering, 18(6):498-510, 1992.
24. W. Sadiq, M. Orlowska. Applying Graph Reduction Techniques for Identifying Structural Conflicts in Process Models. In Proc. of 11th Intl. Conf. Advanced Information Systems Engineering (CAiSE'99), Heidelberg, Germany, pp. 195-209, 1999.
25. D. Srivastava, S. Dar, H.V. Jagadish, A.Y. Levy. Answering Queries with Aggregation Using Views. In Proc. of 22nd VLDB Conference, Bombay, India, pp. 318-329, 1996.
26. P. Vassiliadis. Modeling Multidimensional Databases, Cubes and Cube Operations. In Proc. of 10th SSDBM Conference, Capri, Italy, pp. 53-62, 1998.
27. P. Vassiliadis, M. Bouzeghoub, C. Quix. Towards Quality-Oriented Data Warehouse Usage and Evolution. In Proc. of 11th Conference of Advanced Information Systems Engineering (CAiSE '99), Heidelberg, Germany, pp. 164-179, 1999.
28. P. Vassiliadis, C. Quix, Y. Vassiliou, M. Jarke. A Model for Data Warehouse Operational Processes (long version). Available at http://www.dblab.ece.ntua.gr/~pvassil/publications/process-model00.gz
29. Workflow Management Coalition. Interface 1: Process Definition Interchange Process Model. Doc. No. WfMC TC-1016-P, 1998. (http://www.wfmc.org)
30. E. Yu. Strategic Modelling for Enterprise Integration. In Proc. 14th World Congress of Int. Federation of Automatic Control, China, 1999.
31. E. Yu, J. Mylopoulos. Understanding 'Why' in Software Process Modelling, Analysis and Design. In Proc. of 16th Intl. Conference Software Engineering, Sorrento, Italy, pp. 159-168, 1994.
Temporally Faithful Execution of Business Transactions

Werner Obermair (1) and Michael Schrefl (2)

(1) Institut für Wirtschaftsinformatik, Universität Linz, Austria, [email protected]
(2) School of Computer and Information Science, University of South Australia, Australia, [email protected]
Abstract. Serializability is a prominent correctness criterion for an interleaved execution of concurrent transactions. Serializability guarantees that the interleaved execution of concurrent transactions corresponds to some serial execution of the same transactions. Many important business applications, however, require the system to impose a partial serialization order between transactions pinned to a specific point in time and conventional transactions that attempt to commit before, at, or after that point in time. This paper introduces temporal faithfulness as a new correctness criterion for such cases. Temporal faithfulness does not require real-time capabilities but ensures that the serialization order of a set of business transactions is not in conflict with precedence requirements between them. The paper also shows how a temporally faithful transaction scheduler can be built by extending proven scheduling techniques.
1 Introduction
Transaction processing, as it is widely discussed in the literature and supported in many commercial systems, follows a prominent principle: concurrent transactions are executed in an interleaved manner, and serializability is used as the correctness criterion for the interleaving. This means that an interleaved execution of concurrent transactions is considered correct if it produces the same output and has the same effect on the database state as some serial execution of the same transactions. Of course, not all serial executions produce the same output and have the same effect. Important business applications are not supported appropriately if transaction processing follows this principle. Consider the following scenario: in a trading system, a trader intends to adjust the selling price of a product at 12:00 noon. The trader intuitively expects that sales transactions performed before 12:00 noon should be performed on the old selling price, and that sales transactions performed after 12:00 noon should be performed on the new selling price. In deciding whether a sales transaction is performed before or after 12:00 noon, the trader considers relevant the point in time when the customer commits to
a sale, i.e., when the customer decides to buy. This point in time corresponds to the "handshake" between the seller and the buyer in a traditional sales transaction; in a computer-based trading system, this point in time corresponds to the attempt to commit the sales transaction. If the attempt to commit the sales transaction is placed before 12:00 noon, the sales transaction has to be performed on the old selling price and, therefore, serialized before the adjusting transaction. If the attempt to commit the sales transaction is placed after 12:00 noon, the sales transaction has to be performed on the new selling price and, therefore, serialized after the adjusting transaction.
We refer to transactions such as those contained in the described scenario as business transactions. Business transactions require a combination of concepts that is not yet appropriately provided by current transaction execution models. From a user's perspective, the required concepts are: First, business transactions behave as if executed instantaneously. Although they may actually be executed over some time period, the end user will assume that each business transaction is performed at a particular point in time. Second, business transactions can be linked to wall-clock time. This point in time indicates when the transaction is performed from the perspective of the end user, regardless of when it is actually executed internally. Third, two types of business transactions can be distinguished: those which are performed (from the user's point of view) at a specific wall-clock time and those which are performed at the wall-clock time of the "handshake", i.e., at the request to commit the business transaction. Fourth, real-time capabilities are not required. It is sufficient that the effect of the execution of a set of business transactions is the same as if each transaction had been executed instantaneously at its specified wall-clock time.
How can this desired semantics of business transactions be achieved in a system that guarantees only that the interleaved execution of concurrent transactions corresponds to some serial execution? In our scenario, starting the adjusting transaction at 12:00 noon is not sufficient. The transaction execution may be delayed due to concurrency conflicts, and there may be sales transactions reading the old price but attempting to commit after 12:00 noon. A solution for this particular case would be to incorporate the price of a product as a timestamped attribute (perhaps in a historical database). The price change could have been recorded proactively, and sales transactions could read the current selling price just before they attempt to commit. This solution, however, is not general, since it may not be applied in cases in which the time-critical update relies on a complex computation. For example, consider the case that the new product price should be determined through an analysis of the market as it is at 12:00 noon. The market at 12:00 noon cannot be anticipated and, thus, no proactive price change can be recorded. Also, using some form of multi-version concurrency control will not be sufficient as such, since multi-version concurrency control is typically based on internal processing time rather than on real-world time.
Business transactions are better supported by a system that provides a general mechanism for influencing the order in which transactions are serialized.
In this paper, we present such a general mechanism. Its key advantage is that it is based on standard concurrency control techniques and is applicable to conventional database systems. As such, it neither relies on sophisticated, special concurrency control mechanisms (such as multi-version concurrency control) nor requires specialized database systems (such as temporal or real-time systems).
We introduce a new category of transactions, so-called pinned transactions. A user who submits a pinned transaction gives the transaction an explicit timestamp. If a pinned transaction needs to be aborted internally during its processing (e.g., because of a deadlock), the transaction is automatically restarted with the same timestamp. In our example scenario, the transaction adjusting the selling price would be a pinned transaction with timestamp 12:00 noon. Conventional transactions, which we refer to as unpinned transactions, also receive timestamps. The timestamp of an unpinned transaction is not given by a user, but is the current reading of the system clock when the transaction attempts to commit. If an unpinned transaction needs to be aborted internally during its processing, it is the user who decides whether to restart or to dismiss the transaction. A user who decides to commit an unpinned transaction relies on the database state as it is at the attempt to commit. It cannot be assumed that the user decides to issue the attempt to commit again if the database state has changed after a restart. In our example scenario, sales transactions would be unpinned transactions receiving as timestamps the reading of the system clock when they attempt to commit.
The timestamps associated with transactions do not mean that the transactions are executed in real time at the indicated points in time. A less stringent criterion than real-time execution is sufficient for many business applications: temporal faithfulness. Timestamps are interpreted as precedence requirements, and temporal faithfulness means that the serialization of transactions follows these precedence requirements. Although possibly lagging behind real time, a system that behaves in a temporally faithful manner guarantees the expected serialization order. Reconsider the transaction adjusting the selling price of a product upon an analysis of the market as it is at 12:00 noon: if this pinned transaction is still running after 12:00 noon, unpinned sales transactions carrying a later timestamp but incorrectly relying on the old selling price are prevented from committing. Moreover, if such an uncommitted sales transaction keeps the adjusting transaction from committing (e.g., because of a lock conflict), the sales transaction is aborted.
We first encountered the need for a temporally faithful execution of transactions when we developed an execution model for rules on business policies in [9]. Rules on business policies express how an external observer, e.g., a swift clerk, could monitor calendar time and the database state to recognize predefined situations and to react to them accordingly in order to meet a company's business policy. They are formulated according to the event-condition-action structure of rules in active database systems [4] and can be classified into two categories: (1) rules that are executed asynchronously to user transactions and (2) rules that are executed "immediately" upon temporal events. The second rule
category is not supported appropriately by the execution models of existing active database systems. Existing active database systems allow reactions to temporal events only by starting independent transactions, without any possibility of predicting when the reaction will actually be committed to the database. A rule of the second category, however, relies on a temporally faithful execution of transactions, where the execution of the rule is reflected by a pinned transaction. For details on rules on business policies and their execution semantics see [9].
The remainder of this paper is organized as follows: Section 2 introduces the concept of temporal faithfulness. Section 3 establishes a theory for a concurrent and temporally faithful execution of transactions. Section 4 outlines a scheduler that provides for a temporally faithful execution of transactions and that is based on strict two-phase locking. Section 5 concludes the paper by summarizing its main achievements and by outlining future work.
2 Temporal Faithfulness
In this section, we introduce the concept of temporal faithfulness. Section 2.1 starts with the presentation of the time model underlying our timestamping mechanism. Section 2.2 discusses the precedence requirements imposed by the timestamps assigned to transactions. Section 2.3 relates temporal faithfulness to concepts presented in the literature.

2.1 Time Model
The timestamps assigned to pinned transactions bear application semantics. Timestamps have to be meaningful for users. Time as it is perceived by users may be of a coarser granularity than the granularity supported in some database system. In other words, the temporal unit that is not further decomposable from a user's perspective can often be coarser than the temporal unit defined by two successive clock ticks of an internal clock of some target system. E.g., timestamps showing only minutes but not seconds are sufficient for many business applications.
We consider time as a finite sequence of chronons. A chronon is the smallest temporal unit that is not further decomposable from a user's perspective. The timestamps that are assigned to transactions (either explicitly, in the case of pinned transactions, or upon their attempt to commit, in the case of unpinned transactions) are of the same granularity as chronons. This means that several transactions will share a common timestamp.

2.2 Precedence Requirements
Timestamps that are associated with transactions impose precedence requirements that induce a partial execution order. For the sake of simplicity, we assume an execution model in which transactions are executed serially. We will relax this assumption in Sect. 3.
If a transaction t has a lower timestamp than another transaction t', temporal faithfulness requires that t be executed before t'. Irrespective of chronon length, there may be several transactions that share a common timestamp. Even if chronons are extremely short, the timestamp given to a pinned transaction by an external user may coincide with the point in time at which an unpinned transaction requests to commit. Several pinned transactions with the same timestamp cannot be further distinguished and, therefore, no precedence requirements can be identified among them. Correspondingly, several unpinned transactions with the same timestamp can also be executed in an arbitrary order. Pinned and unpinned transactions sharing a common timestamp, however, may be related in different ways:
1. Head policy: all pinned transactions are performed before all unpinned transactions. In this policy, the unpinned transactions are performed on a database state that reflects the updates of the pinned transactions. Scenarios can be drawn in which this execution semantics is favourable.
Example 1. Consider a pinned transaction that updates selling prices at 12:00 noon upon market analysis. This pinned transaction is expected to be performed before any unpinned sales transaction attempting to commit during the chronon from 12:00 to 12:01 noon. The sales transactions should be performed on the new price.
2. Tail policy: all pinned transactions are performed after all unpinned transactions. In this policy, the pinned transactions are performed on a database state that reflects the updates of all unpinned transactions. Favourable scenarios can be drawn, too.
Example 2. Consider a transaction pinned to 11:59 a.m. that publishes the selling prices and computes the morning sales figures. This pinned transaction is expected to be performed after all unpinned sales transactions attempting to commit during the chronon from 11:59 a.m. to 12:00 noon. The sales performed during the last chronon of the morning should be reflected in the sales figures.
3. Don't care policy: pinned transactions and unpinned transactions are executed in arbitrary order. If a pinned transaction precedes some unpinned transactions with the same timestamp and succeeds some others, it is not predictable on which database states transactions are executed. We did not encounter any realistic application scenario that is best supported by the don't care policy and believe that this case is probably not relevant in practice.
We support the head policy and the tail policy by introducing two kinds of pinned transactions: transactions that are pinned to the beginning of a chronon (head transactions) and transactions that are pinned to the end of a chronon (tail transactions). One may argue that a tail transaction t pinned to the end of chronon i (case 1) could be replaced by a head transaction t' pinned to the
beginning of chronon i+1 (case 2). This is not true for the following reason: in the first case, all head transactions pinned to the beginning of chronon i+1 work on a database state that reflects the changes performed by t; in the second case, there may be some head transactions pinned to the beginning of chronon i+1 that are executed before t, since transactions of the same kind sharing a common timestamp are performed in any order.
From the viewpoint of a chronon i (lasting from clock-tick i to clock-tick i+1), there are transactions pinned to the beginning of chronon i (head-transactions, H^i), unpinned transactions attempting to commit during i (body-transactions, B^i), and transactions pinned to the end of chronon i (tail-transactions, T^i).
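As a compact summary of these precedence requirements, the sketch below gives a predicate that decides whether one transaction must precede another, based on its chronon and its category (head, body or tail). The tuple representation of transactions and the integer encoding of priorities are assumptions made for illustration.

```python
# Priority within a chronon: head-transactions (H) before body-transactions (B)
# before tail-transactions (T).
PRIORITY = {"H": 0, "B": 1, "T": 2}

def must_precede(t1, t2):
    """True if transaction t1 must be executed before t2.

    A transaction is a pair (chronon, category), e.g. (12, 'H') for a transaction
    pinned to the beginning of chronon 12.
    """
    (c1, k1), (c2, k2) = t1, t2
    if c1 != c2:
        return c1 < c2                       # timestamp observation
    return PRIORITY[k1] < PRIORITY[k2]       # priority observation within a chronon

# Example 1 revisited: a head-transaction and a body-transaction of the same chronon
print(must_precede((12, "H"), (12, "B")))    # True
print(must_precede((12, "B"), (12, "B")))    # False: same timestamp, same category
```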
2.3 Related Work
Timestamps that are associated with transactions have a long tradition in transaction processing: they are employed in timestamp ordering as a concurrency control mechanism, or in deadlock prevention without risking livelocks. The commit time of transactions is often used as transaction time in rollback or bitemporal databases. Commit-time timestamping guarantees transaction-consistent pictures of past states of a database [11]. In all those cases, however, the timestamps assigned to transactions are generated by the database system and do not bear application semantics.
Georgakopoulos et al. [7] introduce transaction timestamps that bear application semantics. They call their timestamps "value-dates". By means of value-dates, Georgakopoulos et al. specify "succession dependencies". Like precedence requirements in our model, succession dependencies do not impose real-time constraints on the execution of transactions, but they influence the ordering of transactions. If a succession dependency is specified between two transactions, the two transactions are serialized in the requested order irrespective of whether they conflict in their operations or not. This is an important difference from temporal faithfulness, where a precedence requirement influences only the ordering of conflicting transactions. Disregarding other transactions, two transactions that do not conflict in their operations can be serialized in an arbitrary order in a temporally faithful setting. We will discuss this in detail in Sect. 3.
Finger and McBrien [6] introduce "perceivedly instantaneous transactions" for valid-time databases. For a perceivedly instantaneous transaction it is guaranteed that "current time" (usually referred to as now) remains constant during its execution. The point in time at which a transaction is submitted is taken as the transaction's value of now. Transactions that conflict in their operations are serialized according to their values of now, i.e., according to their submission points. Contrary to our approach, timestamps assigned to transactions do not bear application semantics in the approach of Finger and McBrien. Further, all transactions are timestamped in the same way. This is an important difference from our approach, where usually a high number of unpinned transactions is serialized around a few pinned transactions.
The specification of temporal transaction dependencies has also been discussed in the literature without using timestamps. For example, Ngu [10] builds
precedence graphs that reflect temporal dependencies. Dayal et al. [5] use rules as they are provided by active database systems to specify the ordering of transactions. They do this by specifying rules that are triggered by transaction events and by exploiting the capabilities of coupling modes (cf. [3]). The basic limitation of these approaches is that all related transactions must be known in advance. Our model is far more modular and general: a transaction can be pinned to a point in time without the need to consider all the transactions that potentially may be executed around the critical point in time.
3 Theory of Concurrent and Temporally Faithful Histories
A theory to analyze the concurrent temporally faithful execution of transactions can be formulated similarly to the classical serializability theory (cf. [1]). We will establish our theory through the following steps: first, we will define the conditions under which a serial history is temporally faithful. Then, we will recall the conditions under which two histories are equivalent, and we will define a history to be temporally faithfully serializable (and thus correct) if it is equivalent to a temporally faithful serial history. Finally, we will show how it can be tested whether a history is temporally faithfully serializable.
A history covers a set of chronons C and a set of transactions T, where T contains all committed transactions t1, t2, ..., tn that are time-stamped with a chronon in C. A history h indicates the order in which the operations of t1, t2, ..., tn are executed. T can be subdivided into disjoint subsets according to two dimensions: (1) for every chronon c (c ∈ C) there is a subset T_c containing the transactions time-stamped with c; (2) according to the transaction categories, T can be subdivided into a set of head-transactions T^h, body-transactions T^b, and tail-transactions T^t. Combining the two dimensions, for every chronon c (c ∈ C) there is a set T^h_c containing the head-transactions that are pinned to the beginning of c, a set T^b_c embracing the body-transactions that attempt to commit during c, and a set T^t_c containing the tail-transactions that are pinned to the end of c.
Definition 1. A history is temporally faithfully serial if it is serial, if it observes timestamps, and if it observes priorities:
1. A history is serial if, for every two transactions ti and tj, either all operations of ti are executed before all operations of tj or vice versa. Thus, a serial history represents an execution in which there is no interleaving of the operations of different transactions.
2. A history observes timestamps if, for every two transactions ti ∈ T_{ck} and tj ∈ T_{cl} where ck < cl, ti is executed before tj. Thus, timestamp observation requires that transactions with different timestamps are executed in timestamp order.
3. A history observes priorities if (1) for every two transactions ti ∈ T^h_{ck} and tj ∈ T^b_{ck} ∪ T^t_{ck}, ti is executed before tj, and if (2) for every two transactions ti ∈ T^b_{ck} and tj ∈ T^t_{ck}, ti is executed before tj. Thus, priority observation
requires that head-transactions are executed before body- and tail-transactions and that body-transactions are executed before tail-transactions if they have the same timestamp.
Example 3. We examine histories over the set of chronons C = {c1, c2} and the set of transactions T = {t1, t2, t3, t4, t5} with¹:
t1 = r1[x] → w1[x]
t2 = r2[z] → w2[z]
t3 = r3[y] → r3[x] → w3[y]
t4 = r4[z] → r4[x] → w4[z]
t5 = r5[y] → w5[y]
where T^h_{c1} = {}, T^b_{c1} = {t1}, T^t_{c1} = {}, T^h_{c2} = {t2}, T^b_{c2} = {t3, t4}, and T^t_{c2} = {t5}. Different timestamps and different priorities impose precedence requirements on the transactions contained in T. Figure 1 shows the imposed precedence requirements graphically. A circle depicts a transaction; an arrow from a transaction ti to a transaction tj indicates that ti has to be performed before tj. In our example, timestamp observation requires for a temporally faithfully serial history that transaction t1 is executed before transactions t2, t3, t4 and t5. Priority observation requires that head-transaction t2 is executed before body-transactions t3 and t4 and before tail-transaction t5, and that body-transactions t3 and t4 are executed before tail-transaction t5.
Fig. 1. Precedence requirements imposed by timestamps and priorities. [Graph over t1-t5 depicting the requirements described in the text: t1 before t2, t3, t4 and t5; t2 before t3, t4 and t5; t3 and t4 before t5.]
Definition 2. Two histories h and h' are equivalent if (1) they cover the same set of transactions performing the same operations, and if (2) they order conflicting operations in the same way. The second condition requires that for any conflicting operations oi belonging to transaction ti and oj belonging to transaction tj, their execution order in h corresponds with their execution order in h'. (This definition follows the basic serializability theory, cf. [1].)
Definition 3. A history is temporally faithfully serializable (TFSR) if it is equivalent to a temporally faithfully serial history.
¹ We denote the ordering of operations within a transaction or history by means of arrows (→).
Whether a history is temporally faithfully serializable can be determined by checking an extended form of a serialization graph (SG), a so-called temporally faithful serialization graph (TFSG), for cycles. Like an SG, a TFSG is a directed graph whose nodes are the transactions covered by the analyzed history. The edges of a TFSG, however, represent not only precedence requirements imposed by conflicts, but also precedence requirements imposed by different timestamps and priorities. Throughout the rest of this paper, we refer to precedence requirements imposed by different timestamps or different priorities as temporal precedence requirements. A TFSG is built in two steps:
1. An SG is built by introducing edges that represent precedence requirements imposed by conflicts. An SG for a history h contains an edge from transaction ti to transaction tj (i ≠ j) if ti issues an operation oi that conflicts with an operation oj of tj and if oi precedes oj in h. An edge from ti to tj expresses that ti has to precede tj in a serial history equivalent to h. According to the classical Serializability Theorem, an equivalent serial history can be found (and thus a history is serializable) iff its serialization graph is acyclic (for the Serializability Theorem and a proof see [1]). An acyclic SG means that the precedence requirements imposed by conflicts do not contradict each other.
Example 4. We continue the above example. Consider the histories:
h1 = r5[y] → w5[y] → r2[z] → r3[y] → r3[x] → w2[z] → r4[z] → w3[y] → r1[x] → r4[x] → w1[x] → w4[z]
h2 = r1[x] → r2[z] → w1[x] → r3[y] → w2[z] → r4[z] → r3[x] → r4[x] → w4[z] → w3[y] → r5[y] → w5[y]
Figure 2 shows the SG for history h1 in part (a) and the SG for history h2 in part (b). The edges representing precedence requirements imposed by conflicts are shown as solid lines. Both graphs are acyclic, i.e., for both histories equivalent serial histories can be found.
2. The SG is extended to a TFSG by adding edges that represent temporal precedence requirements. Since a cyclic SG implies a cyclic TFSG, this second step is applied only if the SG of a history is acyclic, i.e., if the history is serializable. To capture different timestamps, for every pair of transactions (ti, tj) with ti ∈ T_{ck}, tj ∈ T_{cl} and ck < cl, there is an edge from ti to tj. To capture different priorities, for every pair of transactions (ti, tj) with (1) ti ∈ T^h_{ck} and tj ∈ T^b_{ck} ∪ T^t_{ck}, or with (2) ti ∈ T^b_{ck} and tj ∈ T^t_{ck}, there is an edge from ti to tj. If a TFSG is built according to these rules, the TFSG also contains all edges implied by transitivity: if there is an edge from tj to tk and an edge from tk to tl, there is necessarily an edge from tj to tl. We draw edges implied by transitivity since they allow us to decide efficiently whether a TFSG is acyclic (see below).
Example 5. We continue the above examples: Figure 2 (c) shows the TFSG for h1 and Fig. 2 (d) shows the TFSG for h2. The edges representing temporal precedence requirements are shown as dashed lines.
Fig. 2. Building TFSGs (example). [Four panels over the transactions t1-t5: (a) the SG for h1 and (b) the SG for h2, with solid conflict edges; (c) the TFSG for h1 and (d) the TFSG for h2, with the temporal precedence edges added as dashed lines.]
A history is TFSR if there are no contradicting precedence requirements. Precedence requirements contradict, for example, if conflicts require transaction ti to be executed before transaction tj in a serial execution whereas timestamp observation requires tj to be executed before ti. Contradicting precedence requirements become visible in TFSGs as cycles. This can be summarized in a theorem closely related to the classical Serializability Theorem:
Theorem 1. A history is TFSR iff its TFSG is acyclic. Proof. See Appendix.
Example 6. We continue the above examples: Figure 2 shows that the TFSG of h1 contains cycles. History h1 is not TFSR, since the precedence requirements imposed by timestamps and priorities contradict the precedence requirements imposed by conflicts. The TFSG of h2, however, is acyclic. History h2 is TFSR, since the precedence requirements imposed by timestamps and priorities do not contradict the precedence requirements imposed by conflicts.
Cycles can be detected efficiently in a TFSG if their characteristics are exploited:
Lemma 1. Every cycle in a TFSG contains at least one edge representing a precedence requirement imposed by a conflict. Proof. See Appendix.
Cycles that can be detected efficiently are cycles of length two. In fact, it can be shown that the existence of a cycle of arbitrary length in a TFSG of a serializable history implies the existence of a cycle of length two. This is captured by the following theorem:
Theorem 2. A TFSG of a serializable history is acyclic if it does not contain a cycle of length two. Proof. See Appendix.
Corollary 1. A serializable history is TFSR if the serialization of every pair of conflicting transactions (ti, tj) is in accordance with possible temporal precedence requirements between ti and tj. Proof. See Appendix.
This insight reduces the effort of building a TFSG. In order to decide whether a serializable history is TFSR, it is sufficient to introduce an edge representing a temporal precedence requirement only if the two involved transactions conflict. Temporal precedence requirements between transactions that do not conflict can be neglected. If no cycle (of length two) arises, the history is TFSR.
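To make the construction concrete, the following sketch extracts the conflict edges of an SG from a history, adds temporal edges only between conflicting transactions (as licensed by Corollary 1), and then checks for cycles of length two. The history encoding as (transaction, operation, item) triples and the (chronon, category) transaction descriptors are assumptions for illustration; as Corollary 1 requires, the input history is assumed to be serializable (e.g., produced by a 2PL scheduler).

```python
PRIORITY = {"H": 0, "B": 1, "T": 2}

def conflict_edges(history):
    """Edges ti -> tj for conflicting operations (same item, at least one write, ti first)."""
    edges = set()
    for k, (ti, op_i, item_i) in enumerate(history):
        for tj, op_j, item_j in history[k + 1:]:
            if ti != tj and item_i == item_j and "w" in (op_i, op_j):
                edges.add((ti, tj))
    return edges

def temporally_faithfully_serializable(history, stamps):
    """Check the condition of Corollary 1 for a serializable history.

    stamps maps a transaction to (chronon, category), category in {'H', 'B', 'T'}.
    Returns False iff some conflict edge ti -> tj contradicts a temporal
    precedence requirement tj before ti (a cycle of length two in the TFSG).
    """
    def must_precede(a, b):
        (ca, ka), (cb, kb) = stamps[a], stamps[b]
        return ca < cb or (ca == cb and PRIORITY[ka] < PRIORITY[kb])

    return all(not must_precede(tj, ti) for (ti, tj) in conflict_edges(history))

# Example 4 revisited: history h2 of the running example
h2 = [("t1", "r", "x"), ("t2", "r", "z"), ("t1", "w", "x"), ("t3", "r", "y"),
      ("t2", "w", "z"), ("t4", "r", "z"), ("t3", "r", "x"), ("t4", "r", "x"),
      ("t4", "w", "z"), ("t3", "w", "y"), ("t5", "r", "y"), ("t5", "w", "y")]
stamps = {"t1": (1, "B"), "t2": (2, "H"), "t3": (2, "B"), "t4": (2, "B"), "t5": (2, "T")}
print(temporally_faithfully_serializable(h2, stamps))   # True: h2 is TFSR
```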
4 A Temporally Faithful Scheduler
This section presents a scheduler that provides a temporally faithful and, of course, serializable execution of concurrent transactions. First, we select strict two-phase locking as the basis for discussing a temporally faithful scheduler. Then, we present a temporally faithful scheduler built on strict two-phase locking. Finally, we sketch possible enhancements to the presented scheduler.

4.1 Rationale
Just as serialization-graph testing (SG-testing) can be used in conventional systems to ensure serializability, TFSG-testing could be used to ensure temporal faithfulness. SG-testing is rarely used in conventional systems, however, since it suffers from the considerable overhead of maintaining a graph and checking for cycles. Since TFSG-testing implies SG-testing, it suffers from the same problems and does not seem promising.
One of the most prominent scheduling techniques, implemented by many commercial database systems, is strict two-phase locking (strict 2PL). We select strict 2PL as the basis for our considerations since we want to discuss temporally faithful scheduling on a broad and well-known basis. Strict 2PL guarantees serializability of a produced history. An add-on is necessary to achieve temporal faithfulness. The add-on has to ensure that the serialization order imposed by the underlying 2PL scheduler does not contradict the temporal precedence requirements. As we have shown in Corollary 1, a serializable history is TFSR if the serialization of every pair of conflicting transactions (ti, tj) is in accordance with possible temporal precedence requirements between ti and tj. Thus, the add-on has to check only the serialization order of pairs of conflicting transactions.
In strict 2PL, conflicts occur only between two transactions that run concurrently. Only then does a lock conflict arise if the two transactions try to perform
conflicting operations. When a conventional strict 2PL scheduler encounters a lock conflict, the scheduler forces the requesting transaction (requester) to wait until the holding transaction (holder) releases its locks, which temporally coincides with the commit of the holding transaction. The scheduler serializes the requester after the holder. In a temporally faithful setting, this may contradict the temporal precedence requirements between the involved transactions. A temporally faithful scheduler has to behave according to the principle "abort holder if younger": if the holder of a lock has a higher timestamp, or the same timestamp but a lower priority, than the requester of an incompatible lock, the holder has to be aborted. Otherwise, the requester has to be blocked.
If two transactions do not run concurrently, an existing conflict between them does not become visible. A way to avoid missing such conflicts is to delay the commit of transactions. A temporally faithful scheduler built on strict 2PL cannot grant the commit of a transaction t before it is sure that no lock conflict can arise upon which t would have to be aborted. No such lock conflict can arise after all transactions that have a lower timestamp than t, or that have the same timestamp as t but a higher priority, are committed.
Example 7. Consider a tail-transaction with timestamp 11:59 a.m. that publishes the selling prices and computes the morning sales figures. Further, consider a head-transaction with timestamp 12:00 noon that adjusts the selling prices of all items according to market analysis. Obviously, the two transactions conflict, and the tail-transaction has to be serialized before the head-transaction. Assume that the head-transaction attempts to commit already at 11:58 a.m. since it has been pre-scheduled (see below). If the commit is granted and the locks are removed, conflicts between the transactions do not become visible. The commit of the head-transaction has to be delayed until all transactions with a lower timestamp and all transactions with the same timestamp but a higher priority are committed. Only then does a lock conflict arise, and the head-transaction can be aborted (and restarted).
In a system where a commit is not granted immediately, the point in time when the execution of a pinned transaction is started influences the system's behavior significantly. The start time determines whether a pinned transaction may have to wait rather long for its commit or whether other transactions may have to wait for the commit of the pinned transaction. Two extreme approaches to starting pinned transactions are:
1. Full pre-scheduling: A pinned transaction is started immediately when it is submitted. This approach may be inefficient if the transaction is submitted pro-actively long before its timestamp. Then, the transaction has to wait rather long for its commit and has to be aborted every time it runs into a lock conflict with a transaction carrying a lower timestamp.
2. No pre-scheduling: A pinned transaction is started only after all transactions that have to precede it are committed (i.e., the transaction is not "pre-scheduled"). This approach may be inefficient if the transaction performs
time-consuming operations. Then, other transactions will have to wait rather long for the commit of the pinned transaction.
Neither approach is satisfactory for all application scenarios. Different application scenarios need different start times for pinned transactions. Therefore, we assume that the user who submits a pinned transaction specifies the point in time when the transaction has to be started. Depending on the nature of a transaction, the user may use the submission time, the timestamp of the transaction, or any time in between as the start time.
By forcing a transaction to wait upon its attempt to commit, some form of two-phase commit is introduced. In the first phase, the scheduler receives a commit request from a transaction that is ready to commit and registers the request for future handling. In the second phase, the scheduler actually grants the commit request and waits for the commit to be performed.
4.2
The TFSR Scheduler
For our presentation, we need a common understanding of how transaction processing is done in a database system. Similar to Bernstein et al. [1], we suppose a modularized architecture in which a transaction manager, a scheduler, and a data manager contribute to transaction processing: the transaction manager (TM) performs any required preprocessing of transaction operations it receives from applications. It uses the services of the scheduler to schedule transaction operations and to commit or abort transactions. The scheduler provides its functionality by utilizing the services of a data manager (DM), which is responsible for recovery and cache management. In the following, we restrict our discussion to the scheduler, in particular to those scheduling services in which a temporally faithful scheduler differs from a conventional strict 2PL scheduler. We do not further elaborate on the TM and the DM in the scope of this paper.
Like a conventional scheduler, the TFSR scheduler concurrently provides services for scheduling operations, for handling commit requests, and for handling abort requests. In handling abort requests, the TFSR scheduler does not differ from a conventional scheduler. In scheduling operations and in handling commit requests, however, the scheduler deviates from standard strategies. The scheduler cannot grant commit requests immediately and has to deal with lock conflicts in the realm of temporal precedence requirements. Further, the scheduler requires a new transaction to be registered before its operations can be scheduled; a transaction's timestamp has to be accepted. The TFSR scheduler does not act only upon invocation of one of its services but also as time proceeds: the scheduler steps forward from chronon to chronon and grants pending and arriving commit requests of transactions. In the following, we discuss how the TFSR scheduler registers transactions, how it schedules operations, how it handles commit requests, and how it grants the commit of transactions:
Registering Transactions. Before the scheduler may schedule any operation of a transaction, the transaction has to be registered with the scheduler. In the case of a pinned transaction, the scheduler has to check the transaction's timestamp. A pinned transaction could theoretically be registered with timestamp c at any time, even after c has passed. Then, however, a temporally faithful scheduler could never grant the commit of a transaction with a timestamp higher than c, since it would never hold that all transactions with timestamp c had been committed. We therefore allow pinned transactions to be scheduled only pro-actively. We refer to the current reading of the wall-clock time, reduced to chronon granularity, as WCT. In particular, the timestamp assigned to a head-transaction must be greater than the WCT, and the timestamp assigned to a tail-transaction must be greater than or equal to the WCT.
Scheduling Operations. As motivated above, the TFSR scheduler acts according to the principle "abort holder if younger" if a lock conflict arises. The strategy of aborting and blocking transactions relies on transaction timestamps: the timestamp of a pinned transaction is known immediately when the transaction is registered. The timestamp of a body-transaction, however, is unknown before the transaction's commit request. The timestamp of a body-transaction that has not yet requested to commit is resolved dynamically to the WCT, i.e., to the lowest timestamp potentially assigned to the transaction. Alternatively, it would be possible to block a requesting body-transaction irrespective of the WCT. The produced serialization order would still remain valid since the timestamp of the blocked body-transaction would increase as time proceeds. Such blocking, however, usually cannot be accepted.
Example 8. Assume a head-transaction with timestamp 12:00 noon that adjusts selling prices. Further assume that the transaction has been pre-scheduled and is waiting for its commit. Let the WCT be 11:55 a.m. If now a sales transaction (body-transaction) runs into a lock conflict with the head-transaction, it cannot be accepted that the sales transaction is delayed for such a long time. Rather, the timestamp of the sales transaction is resolved to 11:55 a.m., the two timestamps are compared, and the head-transaction is aborted and automatically restarted.
When the timestamp of a body-transaction is resolved dynamically, a decision to force a pinned transaction to wait until a body-transaction has committed may become invalid as time proceeds. This means that lock tables and queues of blocked transactions have to be reconsidered as time proceeds (and the WCT changes).
Example 9. We continue the above example. Assume that the customer hesitates to commit the sales transaction. Let the WCT be 11:58 a.m. in the meantime. If now the restarted head-transaction runs into a lock conflict with the sales transaction again, the head-transaction is forced to wait. If the WCT increases to 12:00 noon, however, the decision to block the head-transaction becomes invalid. The sales transaction has to be aborted, and lock tables and queues have to be adjusted accordingly.
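The decision rule in Examples 8 and 9 can be condensed into a small function. This is only a sketch under assumed data structures (the Txn class and the priority encoding are hypothetical); it is not the paper's implementation.

from dataclasses import dataclass

@dataclass
class Txn:
    kind: str        # 'head', 'body', or 'tail'
    timestamp: int   # assigned chronon; meaningful for pinned transactions

PRIORITY = {'head': 0, 'body': 1, 'tail': 2}   # lower value = higher priority

def effective_ts(txn, wct):
    # a body-transaction that has not requested to commit yet is resolved
    # dynamically to the current WCT; pinned transactions keep their timestamp
    return txn.timestamp if txn.kind in ('head', 'tail') else wct

def on_lock_conflict(requester, holder, wct):
    # "abort holder if younger": abort the holder if it has a higher
    # timestamp, or the same timestamp but a lower priority, than the
    # requester; otherwise block the requester
    h = (effective_ts(holder, wct), PRIORITY[holder.kind])
    r = (effective_ts(requester, wct), PRIORITY[requester.kind])
    return 'abort-holder' if h > r else 'block-requester'

Because the effective timestamp of a body-transaction grows with the WCT, a 'block-requester' decision has to be re-evaluated as chronons pass, exactly as Example 9 illustrates.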
The presented strategy for resolving precedence requirements is only a supplement to existing strategies for detecting and dealing with deadlocks. Conventional deadlocks are outside the scope of this paper; they may still occur among transactions for which no temporal precedence requirements are defined (i.e., among transactions sharing a common timestamp and priority).
If a head- or tail-transaction needs to be aborted, the transaction is restarted automatically with the same timestamp. The transaction manager re-submits the transaction's operations for scheduling. If a body-transaction needs to be aborted, the transaction is not restarted automatically. It is the user who decides whether to restart or dismiss an aborted body-transaction.
Handling Commit Requests. When a transaction requests to commit, the transaction is not committed immediately, but is marked ready-to-commit. In the case of a body-transaction, the transaction additionally is time-stamped with the WCT. Transactions that are marked ready-to-commit are actually committed when the scheduler considers the corresponding chronon (see below).
Granting Commits. The scheduler continuously steps forward from chronon to chronon and grants pending and arriving commit requests of transactions. We refer to the chronon the scheduler currently considers as the scheduler's current processing time (CPT). The scheduler's CPT may lag behind the WCT (CPT ≤ WCT). Figure 3 depicts the scheduler's state "granting commits" as a UML statechart diagram (cf. [2]). After its creation, the scheduler resides in this substate concurrently to other substates in which it registers transactions, schedules operations, and registers commit/abort requests. The diagram shows the steps the scheduler runs through after changing its CPT to chronon ci:
1. Grant commit requests of head-transactions: In this state (named "head"), the assertion holds for a head-transaction t with timestamp ci that all transactions that have to precede t have been committed. First, the scheduler commits head-transactions with timestamp ci that have been marked ready-to-commit in the past. Then, the scheduler waits until all head-transactions with timestamp ci have been committed. If a head-transaction with timestamp ci is marked ready-to-commit in this phase, the commit is granted immediately.
2. Grant commit requests of body-transactions: In this state (named "body"), the assertion holds for a body-transaction t with timestamp ci that all transactions that have to precede t have been committed. First, the scheduler commits body-transactions with timestamp ci that have been marked ready-to-commit in the past. As long as the WCT corresponds to ci, the scheduler commits a body-transaction that is marked ready-to-commit. After the WCT increases beyond ci, no further body-transaction is time-stamped with ci (but with a higher timestamp).
3. Grant commit requests of tail-transactions: In this state (named "tail"), the assertion holds for a tail-transaction t with timestamp ci that all transactions that have to precede t have been committed. First, the scheduler commits tail-transactions with timestamp ci that have been marked ready-to-commit in the past. Then, the scheduler waits until all tail-transactions with timestamp ci have been committed. If a tail-transaction with timestamp ci is marked ready-to-commit in this phase, the commit is granted immediately.
After running through these states, the scheduler has finished processing of chronon ci and changes its CPT to ci+1.
Fig. 3. Granting commits (a substate of the scheduler): states "head", "body" and "tail" grant pending and arriving commit requests of head-, body- and tail-transactions with timestamp CPT; the transition to "body" fires when all head-transactions with timestamp CPT are committed, the transition to "tail" fires when WCT > CPT, and the CPT is incremented once all tail-transactions with timestamp CPT are committed.
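Read operationally, the statechart amounts to a loop per chronon. A rough sketch follows; the scheduler object and its helpers ready, all_committed, commit and wct are assumed and are not part of the original design.

import time

def grant_commits(scheduler):
    cpt = scheduler.cpt
    while True:
        # 1. head-transactions with timestamp CPT: commit pending requests,
        #    then wait until every such transaction has committed
        while not scheduler.all_committed('head', cpt):
            for t in scheduler.ready('head', cpt):
                scheduler.commit(t)
            time.sleep(0.01)
        # 2. body-transactions stamped with CPT: new ones can arrive only
        #    while the wall-clock time (WCT) still equals CPT
        while scheduler.wct() <= cpt:
            for t in scheduler.ready('body', cpt):
                scheduler.commit(t)
            time.sleep(0.01)
        for t in scheduler.ready('body', cpt):   # drain stragglers stamped CPT
            scheduler.commit(t)
        # 3. tail-transactions with timestamp CPT
        while not scheduler.all_committed('tail', cpt):
            for t in scheduler.ready('tail', cpt):
                scheduler.commit(t)
            time.sleep(0.01)
        cpt += 1                                 # chronon CPT fully processed
        scheduler.cpt = cpt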
4.3
Advanced Scheduling Techniques
The performance of a temporally faithful scheduler can be increased if it has more knowledge at hand about what is going to be scheduled. In the following, we sketch advanced scheduling techniques that employ information on predeclared read-sets and write-sets of transactions, on the structure of pinned transactions, or on the expected durations of body-transactions.
– Predeclared read-sets and write-sets of transactions: The commit of a transaction can be granted if there is no conflicting transaction that has to be executed before it. Without additional knowledge, every transaction has to be treated as a potentially conflicting one. All conflicts between a transaction t waiting for its commit and a transaction t′ that has to be executed before t become visible only after t′ has committed. If no conflict arises, the
commit of t has been delayed unnecessarily. Conflicts can be detected earlier, and thus long delays can be avoided, if transactions preclaim all their required locks. This can be achieved by having each transaction predeclare its read-set and write-set. Predeclaring of read-sets and write-sets can be introduced either only for pinned transactions or for all kinds of transactions.
– Structure of pinned transactions: In typical applications, pinned and unpinned transactions more often run into read-write conflicts than into write-write conflicts. Remember the introductory example: assume that the pinned transaction adjusting selling prices is waiting for the scheduler to grant its commit request. The pinned transaction has to be aborted and restarted every time a sales transaction with a lower timestamp tries to read price information that has been updated, and write-locked, by the pinned transaction. Restarts of pinned transactions could be significantly reduced if pinned transactions were clearly separated into a read-phase and a write-phase (see the sketch after this list): when a pinned transaction is started, it enters its read-phase. The transaction obtains the necessary read-locks, reads the database objects it needs, and performs all its time-consuming computations. Then, the transaction waits until the scheduler's CPT corresponds to the transaction's timestamp. While the transaction is waiting, it holds only read-locks and, thus, is less likely to run into lock conflicts. Only when the CPT corresponds to the transaction's timestamp does the transaction enter its write-phase. The transaction obtains the necessary write-locks (or upgrades some of its read-locks) and actually performs write operations.
– Duration of body-transactions: The timestamp of a body-transaction is resolved dynamically to the WCT during its execution. If a body-transaction is long, its associated timestamp increases continuously. If the timestamp of a body-transaction increases beyond the timestamp of a blocked transaction, the body-transaction has to be aborted and the lock tables and the queues of blocked transactions have to be updated. This represents an overhead that can be reduced by associating with every body-transaction an estimate of how long the execution of the transaction will probably take. In the presence of such an estimate, the timestamps assigned to body-transactions would be more realistic and would reduce overhead.
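The read-phase/write-phase separation could look roughly as follows; this is purely illustrative, and the transaction and scheduler interfaces are assumptions, not part of the proposal.

def run_pinned(txn, scheduler):
    # read-phase: acquire read-locks, read inputs, do the expensive work
    txn.acquire_read_locks()
    result = txn.compute()                    # time-consuming computations
    # wait until the scheduler's CPT reaches the transaction's timestamp;
    # only read-locks are held while waiting, so lock conflicts are rarer
    while scheduler.cpt < txn.timestamp:
        scheduler.wait_for_next_chronon()
    # write-phase: obtain (or upgrade to) write-locks and apply the updates
    txn.acquire_write_locks()
    txn.apply(result)
    scheduler.request_commit(txn)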
5
Conclusion
In this paper, we have presented an approach to execute business transactions in a temporally faithful manner. The main characteristics of our approach are: – The approach is general. It relieves the designer from inventing a case-specific solution every time a particular precedence order should be enforced on the execution of transactions. – The approach is modular. By means of pinned transactions, precedence requirements can be imposed without the need to consider all the unpinned transactions that potentially may be executed around the critical point in time.
– The approach is simple. A temporally faithful scheduler can be implemented by extending proven scheduling techniques of conventional database systems. It neither relies on special concurrency control mechanisms nor requires specialized database systems.
Currently, a prototype of a temporally faithful scheduler is being implemented as part of a master's thesis. The prototype is built on top of the active object-oriented database system TriGS [8]. The reason for selecting an active object-oriented database system is that we consider active rules a well-suited concept for enhancing system capabilities without the need to introduce special-purpose mechanisms.
Appendix
Proof (Theorem 1). First, we prove that a history is TFSR if its TFSG is acyclic: We suppose that the TFSG of a history h is acyclic. An acyclic TFSG may be topologically sorted. Consider t1, t2, ..., tm as a topological sort of the TFSG of h, and let hs be the history t1 → t2 → ... → tm. History hs is temporally faithfully serial since (1) it is serial, (2) it observes timestamps, and (3) it observes priorities (cf. Definition 1). Now we show that h is equivalent to hs. If a transaction ti performs an operation oi and another transaction tj performs a conflicting operation oj where oi precedes oj in h, this is reflected by an edge from ti to tj in the TFSG of h. In any topological sort of the TFSG of h, ti must appear before tj. Since hs is serial, all operations of ti appear before any operation of tj in hs. Thus, we have proved that any two conflicting operations are ordered in h in the same way as they are ordered in hs (and that h and hs are equivalent).
Second, we prove that a history is TFSR only if its TFSG is acyclic. We do this by showing that no cyclic TFSG can exist for a history that is TFSR: We suppose a history h that is TFSR. Since h is TFSR, there is a temporally faithfully serial history equivalent to h. We refer to this history as hs. Now we suppose a cycle in the TFSG of h and let the cycle be t1 → t2 → ... → tk → t1. An edge from ti to tj exists in the TFSG of h (1) if ti performs an operation oi and tj performs a conflicting operation oj where oi precedes oj in h, (2) if ti has a lower timestamp than tj, or (3) if ti has the same timestamp as tj but a higher priority. Since hs is serial, an edge from ti to tj in the TFSG of h requires that ti appears before tj in hs. The existence of the cycle implies that each of t1, t2, ..., tk appears before itself in hs, which is a contradiction. So, no cycle can exist for a history that is TFSR.
Proof (Lemma 1). In our setting, time is linear. Transaction timestamps impose a linear order on transactions with different timestamps, while priorities impose a linear order on transactions with the same timestamp. Thus, timestamp observation and priority observation impose acyclic precedence requirements. At least one edge representing a precedence requirement imposed by a conflict is necessary to form a cycle in a TFSG.
Proof (Theorem 2). We prove that the existence of a cycle in a TFSG of a serializable history implies the existence of a cycle of length two. We use the following notational conventions: with ti → tj we denote an edge from ti to tj representing a precedence requirement imposed by a conflict; with ti ⇒ tj we denote an edge from ti to tj representing a temporal precedence requirement; with ti ⇝ tj we denote an arbitrary edge from ti to tj.
We assume a cycle involving the transactions t1, t2, ..., tn (n > 2). The cycle contains at least one edge representing a temporal precedence requirement. This is true since the TFSG of a serializable history is built on an acyclic SG, and thus at least one edge representing a temporal precedence requirement is necessary to form a cycle. We refer to one of these edges as tn ⇒ t1 and let the cycle be tn ⇒ t1 ⇝ t2 ⇝ ... ⇝ tn−1 ⇝ tn. Now, we analyze the precedence requirements among the nodes contained in the cycle. We consider the triple (tn, ti, ti+1), where i initially is 1 and increases with each iteration by 1. Before the i-th iteration, tn ⇒ ti holds. After the i-th iteration, either we have shown the existence of tn ⇒ ti+1 in the TFSG or we have detected a cycle of length two and stop analyzing. Depending on the kind of precedence requirement between ti and ti+1, the precedence requirements among the triple of transactions may follow only one of two alternative patterns:
1. tn ⇒ ti ⇒ ti+1: In this case, also edge tn ⇒ ti+1 exists in the TFSG.
2. tn ⇒ ti → ti+1: In this case, edge tn ⇒ ti indicates that tn either has a lower timestamp than ti or the same timestamp but a higher priority. This means that ti+1 cannot be temporally independent from both tn and ti. One of the following temporal precedence requirements must hold:
(a) ti+1 succeeds ti: Then, edges ti ⇒ ti+1 and tn ⇒ ti+1 exist in the TFSG.
(b) ti+1 succeeds tn and is temporally independent from ti: Then, edge tn ⇒ ti+1 exists in the TFSG.
(c) ti+1 succeeds tn and precedes ti: Then, edge ti+1 ⇒ ti exists in the TFSG, and we detect the cycle of length two ti+1 ⇒ ti → ti+1.
(d) ti+1 precedes ti and is temporally independent from tn: Then, edge ti+1 ⇒ ti exists in the TFSG, and we detect the cycle of length two ti+1 ⇒ ti → ti+1.
(e) ti+1 precedes tn: Then, edges ti+1 ⇒ tn and ti+1 ⇒ ti exist in the TFSG and we detect the cycle of length two ti+1 ⇒ ti → ti+1.
We show that we necessarily detect a cycle of length two after at most n − 1 iterations: The analyzed cycle contains at least one edge representing a precedence requirement imposed by a conflict (see Lemma 1). This means that we find pattern 2 at least once. Now suppose that condition (a) or (b) would hold every time we find pattern 2. Then, after n − 1 iterations, the TFSG would contain an edge tn ⇒ tn, which is clearly never the case. Thus, condition (c), (d), or (e) must hold at least once when we find pattern 2. But then, we detect a cycle of length two. We have shown that if a cycle exists in a TFSG built on an acyclic SG, then there is also a cycle of length two.
Proof (Corollary 1). Every cycle of length two contains an edge representing a precedence requirement imposed by a conflict (cf. Lemma 1). Thus, when a TFSG is checked for cycles, only pairs of conflicting transactions have to be considered. A history is TFSR and its TFSG is acyclic if no pair of conflicting transactions (ti , tj ) can be found where conflicts require ti to precede tj while at the same time different timestamps or priorities require tj to precede ti .
References
1. P.A. Bernstein, V. Hadzilacos, and N. Goodman. Concurrency Control and Recovery in Database Systems. Addison-Wesley, 1987.
2. G. Booch, J. Rumbaugh, and I. Jacobson. The Unified Modeling Language—User Guide. Addison-Wesley, 1999.
3. A.P. Buchmann, J. Zimmermann, J.A. Blakeley, and D.L. Wells. Building an Integrated Active OODBMS: Requirements, Architecture, and Design Decisions. In Proc. of the 11th Intl. Conf. on Data Engineering (ICDE), 1995.
4. U. Dayal. Active Database Management Systems. In Proc. of the 3rd Intl. Conf. on Data and Knowledge Bases, pages 150–167, June 1988.
5. U. Dayal, U. Hsu, and R. Ladin. Organizing Long-running Activities with Triggers and Transactions. In Proc. of the ACM SIGMOD Conf. on Management of Data, 1990.
6. M. Finger and P. McBrien. Concurrency Control for Perceivedly Instantaneous Transactions in Valid-Time Databases. In Proc. of the 4th Intl. Workshop on Temporal Representation and Reasoning. IEEE Comp. Soc. Press, 1997.
7. D. Georgakopoulos, M. Rusinkiewicz, and W. Litwin. Chronological Scheduling of Transactions with Temporal Dependencies. VLDB Journal, 3(3), 1994.
8. G. Kappel, S. Rausch-Schott, and W. Retschitzegger. A Tour on the TriGS Active Database System—Architecture and Implementation. In Proc. of the ACM Symposium on Applied Computing, Atlanta, Georgia, 1998.
9. P. Lang, W. Obermair, and M. Schrefl. Modeling Business Rules with Situation/Activation Diagrams. In A. Gray and P. Larson, editors, Proc. of the 13th Intl. Conf. on Data Engineering (ICDE), pages 455–464. IEEE Computer Society Press, April 1997.
10. A.H.H. Ngu. Specification and Verification of Temporal Relationships in Transaction Modeling. Information Systems, 15(2):5–42, March 1990.
11. B. Salzberg. Timestamping After Commit. In Proc. of the 3rd Intl. Conf. on Parallel and Distributed Information Systems, Austin, Texas, 1994.
Modelling and Optimisation Issues for Multidimensional Databases
Panos Vassiliadis and Spiros Skiadopoulos
National Technical University of Athens, Department of Electrical and Computer Engineering, Computer Science Division, Knowledge and Database Systems Laboratory, Zografou 15773, Athens, Greece
{pvassil,spiros}@dbnet.ece.ntua.gr
Abstract. It is commonly agreed that multidimensional data cubes form the basic logical data model for OLAP applications. Still, there seems to be no agreement on a common model for cubes. In this paper we propose a logical model for cubes based on the key observation that a cube is not a self-existing entity, but rather a view over an underlying data set. We accompany our model with syntactic characterisations for the problem of cube usability. To this end, we have developed algorithms to check whether (a) the marginal conditions of two cubes are appropriate for a rewriting, in the presence of aggregation hierarchies and (b) an implication exists between two selection conditions that involve different levels of aggregation of the same dimension hierarchy. Finally, we present a rewriting algorithm for the cube usability problem.
1
Introduction
On-Line Analytical Processing (OLAP) is a trend in database technology based on the multidimensional view of data. Although multidimensional data cubes form the basic logical data model for OLAP applications, up to now no common agreement has been obtained on the elements of a cube model. Several industrial standards already exist [13,14,15,16], yet, apart from the last one, none of them seems to propose a well-founded model for OLAP databases. In academia, several proposals on the modelling of cubes also exist [1,2,9,10,11,21]. Despite all these efforts, we feel that several key characteristics of a cube model have not been stressed, by either academia or industry (see [19] for a complete discussion). To this end, we present a logical model for cubes. This model extends the proposal of [21] in a more formal and systematic way. It deals with all the commonly encountered entities of a multidimensional model (dimension hierarchies, data cubes and cube operations)
This research is sponsored by the European Esprit Project "DWQ: Foundations of Data Warehouse Quality", No. 22469. We also wish to thank Prof. Timos Sellis and Dr. Dimitri Theodoratos for useful discussions on the topic.
without being restricted from their physical implementation (e.g., ROLAP or MOLAP architectures). One of our key observations is that a cube is not a self-existing entity, but rather a view (materialised or not) over an underlying data set. This property allows us to develop complex operations, not dealt by other models so far (e.g., the drill-down operation and the change of aggregation function). To our knowledge, existing OLAP tools behave in an “extensional” fashion. Cubes are treated simply as sets of tuples, ignoring the fact that they are produced as queries over an underlying detailed data set (e.g., the fact table of a data warehouse). Our framework, instead, suggests a different strategy: we keep the “history” of performed selections and thus, we are able to compute a new cube taking into account its “intentional” description. Therefore, we can define more complex operations (such as drill-down) and sequences of operations, which are not covered by other models. Our model is accompanied by an algebra powerful enough to capture the usual OLAP operations such as (a) selection over a cube, (b) roll-up, which means aggregation over a cube to coarser granularities of information and (c) drill-down, which involves de-aggregation of a specific cube and presentation of more detailed information. The contribution of this paper lies not only in terms of expressiveness, but also we present results on optimisation issues for multidimensional databases. We investigate the cube usability problem, a variant of the relational view usability problem, for multidimensional cubes. We accompany our framework with optimisation techniques for the cube usability problem that enable the exploitation of existing cubes in order to compute new cubes. To handle the cube usability problem, we extend well-known techniques already found in the relational context on the containment of selection conditions [17]. We have observed that although quite a lot of work has been performed in the field of query containment and view usability in the context of relational databases [4,5,8], there exist no results to exploit the information about dimension hierarchies in the context of multidimensional databases. We present results on two major topics. First, we tackle the problem of containment of two selections, taking into account their marginal conditions in the presence of dimension hierarchies. Secondly, we come up with a set of axioms to characterise containment for expressions involving functionally dependent attributes. Although several results already exist to characterise query containment between expressions involving one domain [17], to our knowledge, no results exist for expressions involving different functionally dependent levels. For lack of space, all the proofs, as well as further explanations, are found in [20]. This paper is organised as follows. In Section 2 we present the logical cube model. Section 3 presents optimisation issues. Finally, in Section 4 we discuss our results and present future work.
2
Cubes for Multidimensional Databases
In this section we present the basic entities and operations of our model. Entities involve dimensions, data sets and cubes. Operations involve selections and changes in the granularity of data. This model extends previous proposals of [2,10,21].
One of the main characteristics of OLAP applications is the multidimensional view of data in the perception of the user, which considers that information is stored in a multidimensional array, called Cube or HyperCube. Thus, a Cube is a group of data cells. Each cell is uniquely defined by the corresponding values of the dimensions of the cube. The contents of the cell are named measures and represent the measured values of the real world. Measures are functionally dependent, in the relational sense, on the dimensions of the cube. A dimension is defined in [15] as "a structural attribute of a cube that is a list of members, all of which are of a similar type in the user's perception of the data". Informally, a dimension models all the possible ways in which the user can group the detailed information stored in the multidimensional database with respect to a specific context. Each dimension has an associated hierarchy of levels of aggregated data, i.e., it can be viewed from different levels of detail.
Formally, a dimension D is a lattice (L,≺): L=(L1,…,Ln,ALL). We require that the upper bound of the lattice is always the level ALL, so that we can group all the values of the dimension into the single value 'all'. The lower bound of the lattice is called the detailed level of the dimension. For instance, let us consider the dimension Date of Fig. 2. Levels of dimension Date are Day, Week, Month, Year and ALL. Day is the most detailed level. Level ALL is the most coarse level for all the dimensions. Aggregating to the level ALL of a dimension ignores the respective dimension in the grouping (i.e., practically groups the data with respect to all the other dimensions of the cube, except for this particular one). The relationship between the values of the dimension levels is achieved through the use of the set of anc functions. A function anc^{L2}_{L1} assigns a value of the domain of L2 to a value of the domain of L1. For instance anc^{Year}_{Month}(Feb-97)=1997.
The major multidimensional operations are selection and navigation. Selection is used whereby a criterion is evaluated against the data or levels of a dimension in order to restrict the set of retrieved data. Navigation is a term used to describe the processes employed by users to explore a cube interactively by changing the granularity of the multidimensionally viewed data [15]. Possible navigation operations, which can be applied to a cube, are: (a) Roll-up, which corresponds to the aggregation of data from a lower to a higher level of granularity within a dimension's hierarchy, (b) Drill-Down, which is the inverse of roll-up and allows the de-aggregation of information moving from higher to lower levels of granularity, and (c) Slice, which corresponds to the grouping of data with respect to a subset of the dimensions of a cube. For instance, let us consider the dimension Date; aggregating from Month to Year is a roll-up operation and de-aggregating from Month to Day is a drill-down operation. In our model, the slice operation is modelled as a roll-up to level ALL.
We denote sets of tuples under a specific schema by the term data set. Moreover, we assume the existence of a detailed data set, i.e., a data set that is defined at the finest levels of granularity for all its dimensions. This detailed data set is the central source of data, which will populate any cubes produced during an OLAP session (e.g., a fact table in a data warehouse).
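As a concrete illustration of anc and desc (not part of the model's formal definition), a fragment of the Date dimension of Fig. 2 could be encoded as follows; the table is hard-coded and only covers the days appearing in Fig. 1.

# map each detailed Day value to its ancestors at coarser levels
DATE = {
    '6-Feb-97':  {'Month': 'Feb-97', 'Year': '1997', 'ALL': 'all'},
    '18-Feb-97': {'Month': 'Feb-97', 'Year': '1997', 'ALL': 'all'},
    '11-May-97': {'Month': 'May-97', 'Year': '1997', 'ALL': 'all'},
    '1-Jul-97':  {'Month': 'Jul-97', 'Year': '1997', 'ALL': 'all'},
    '3-Sep-97':  {'Month': 'Sep-97', 'Year': '1997', 'ALL': 'all'},
}

def anc(level, day):
    # anc^{level}_{Day}: roll a detailed Day value up to the given level
    return day if level == 'Day' else DATE[day][level]

def desc(level, value):
    # desc^{level}_{Day}: all Day values that roll up to the given value
    return {d for d in DATE if anc(level, d) == value}

print(anc('Year', '18-Feb-97'))        # '1997'
print(sorted(desc('Month', 'Feb-97')))  # ['18-Feb-97', '6-Feb-97']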
One of our key observations is that a cube is not a self-existing entity (as commonly encountered in the literature), but rather a view over an underlying detailed data set. As usual, a view (and thus a cube) can be either materialised or not. Therefore, a cube can be seen either as a data set or simply a query. In our model, we retain this dual nature formally; a cube is not only a set of tuples, but also has a definition. This definition is a query that reduces the computation of the cube to a set of operations over the initial materialised detailed data set. Formally, a cube c over the schema [L1,…,Ln,M1,…,Mm], is an expression of the form: c=(DS0,φ,[L1,…,Ln,M1,…,Mm],[agg1(M0,1),…,aggm(M0,m)]), where DS0 is a detailed data set over the schema S=[L0,1,…,L0,n,M0,1,…,M0,k], m≤k, φ is a detailed selection condition, M0,1,…,M0,m are detailed measures, M1,…,Mm are aggregated measures, L0,i and Li are levels such that L0,i≺Li, 1≤i≤n and aggi, 1≤i≤m are aggregated functions from the set {sum,min,max,count}.
Intuitively, to compute a cube, first we apply the selection condition to the detailed data set. Then, we replace the values of the levels for the tuples of the result with their respective ancestor values at the levels of the schema of the cube, and group them into a single value for each measure, through the application of the appropriate aggregate function. Note that a detailed data set can be trivially expressed as a cube, having a true selection condition and an arbitrary aggregation function. For instance, the cube of the detailed data set DS0 of Fig. 1 is expressed as: c0=(DS0,true,[day,day,item,salesman,city,sales],sum(sales)). This approach introduces a powerful expression mechanism, able to directly capture operations like drill-down and change of aggregate function and thus aimed towards the modelling of sequences of operations, as normally encountered in OLAP systems. To our knowledge, no other model can capture these operations directly. The reduction of a cube's definition to a normalised form seems to be the only alternative that directly achieves this kind of functionality.
Formally, the model consists of the following elements:
− Each dimension D is a lattice (L,≺) such that L=(L1,…,Ln,ALL) is a finite subset of levels and ≺ is a partial order defined among the levels of L, such that L1≺Li≺ALL for every 1≤i≤n.
− A family of functions anc^{L2}_{L1} satisfying the following conditions (extending [2]):
1. For each pair of levels L1 and L2 such that L1≺L2, the function anc^{L2}_{L1} maps each element of dom(L1) to an element of dom(L2).
2. Given levels L1, L2 and L3 such that L1≺L2≺L3, the function anc^{L3}_{L1} equals the composition of anc^{L2}_{L1} and anc^{L3}_{L2}, i.e., anc^{L3}_{L1}(x)=anc^{L3}_{L2}(anc^{L2}_{L1}(x)).
3. For each pair of levels L1 and L2 such that L1≺L2, the function anc^{L2}_{L1} is monotone, i.e., ∀x,y∈dom(L1), L1≺L2: x<y ⇒ anc^{L2}_{L1}(x) ≤ anc^{L2}_{L1}(y).
4. For each pair of levels L1 and L2, the function anc^{L2}_{L1} determines a set of finite equivalence classes Xi such that: ∀x,y∈dom(L1), L1≺L2: anc^{L2}_{L1}(x)=anc^{L2}_{L1}(y) ⇒ x,y belong to the same Xi.
5. The relationship desc^{L2}_{L1} is the inverse of the anc^{L2}_{L1} function, i.e., desc^{L2}_{L1}(l) = {x∈dom(L1): anc^{L2}_{L1}(x)=l}.
− Each data set DS over a schema S=[L1,…,Ln,M1,…,Mm] is a finite set of tuples over S such that the set [L1,…,Ln] comprises a primary key (in the usual sense).
− Each selection condition φ is a formula in disjunctive normal form. An atom of a selection condition is true, false or an expression of the form x θ y, where θ is an operator from the set (>, <, =, ≥, ≤, ≠) and each of x and y can be one of the following: (a) a level L, (b) a value l, (c) an expression of the form anc^{L2}_{L1}(L1) where L1≺L2, and (d) an expression of the form anc^{L2}_{L1}(l) where L1≺L2 and l∈dom(L1). The detailed equivalent of φ, denoted by φ0, is a selection condition obtained through the following procedure: for each occurrence of a level name L in φ, we substitute it with the equivalent expression anc^{L}_{L0}(L0), where L0 is the detailed level of the dimension to which L belongs. Note that the detailed equivalent of a selection condition is directly applicable to a detailed data set.
− Each cube c over the schema [L1,…,Ln,M1,…,Mm] is an expression of the form c=(DS0,φ,[L1,…,Ln,M1,…,Mm],[agg1(M0,1),…,aggm(M0,m)]), where DS0 is a detailed data set over the schema S=[L0,1,…,L0,n,M0,1,…,M0,k], m≤k, φ is a detailed selection condition, M0,1,…,M0,m are detailed measures, M1,…,Mm are aggregated measures, L0,i and Li are levels such that L0,i≺Li, 1≤i≤n, and aggi, 1≤i≤m, are aggregate functions from the set {sum,min,max,count}. The expression characterising a cube has the following formal semantics:
c = {x∈Tup(L1,…,Ln,M1,…,Mm) | ∃y∈σφ(DS0): x[Li]=anc^{Li}_{L0,i}(y[L0,i]), 1≤i≤n, x[Mj]=aggj({q | ∃z∈σφ(DS0): x[Li]=anc^{Li}_{L0,i}(z[L0,i]), 1≤i≤n, q=z[M0,j]}), 1≤j≤m}.
− The Cube Algebra (CA) is composed of three operations (consider a cube ca=(DS0,φa,[La,1,…,La,n,Ma,1,…,Ma,m],[agga,1(M0,1),…,agga,m(M0,m)]) over which the operations are applied):
1. Navigate: Let S=[L1,…,Ln,M1,…,Mm] be a schema and agg1,…,aggm be aggregate functions. If La,i and Li belong to the same dimension Di and aggi∈{sum,min,max,count}, then navigation is defined as follows: nav(ca,S,agg1,…,aggm)=(DS0,φa,S,[agg1(M0,1),…,aggm(M0,m)]).
2. Selection: Let φ be a selection condition applicable to ca. Then, we define the selection operation as σφ(ca)=(DS0,φa∧φ0,[La,1,…,La,n,Ma,1,…,Ma,m],[agga,1(M0,1),…,agga,m(M0,m)]), where φ0 is the detailed equivalent of the selection condition φ.
3. Split measure: Let M be a measure of the schema of the cube ca. Without loss of generality, let us assume that M is Mm. Then split measure is defined as follows: π_{Mm}(ca)=(DS0,φa,[La,1,…,La,n,Ma,1,…,Ma,m-1],[agga,1(M0,1),…,agga,m-1(M0,m-1)]).
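The cube semantics above translate directly into a small evaluation routine. The sketch below is a simplification, not the paper's algebra: the detailed data set is a list of dictionaries, anc is passed in as a function (level, detailed value) → ancestor value, and only one measure is handled.

from collections import defaultdict

AGG = {'sum': sum, 'min': min, 'max': max, 'count': len}

def eval_cube(ds0, phi, levels, measure, agg, anc):
    # ds0: detailed data set; phi: detailed selection condition (predicate
    # over detailed tuples); levels: {detailed attribute: target level};
    # measure: detailed measure name; agg: 'sum' | 'min' | 'max' | 'count'
    groups = defaultdict(list)
    for row in ds0:
        if not phi(row):
            continue                                   # apply the selection
        key = tuple(anc(lvl, row[attr]) for attr, lvl in levels.items())
        groups[key].append(row[measure])               # roll up to target levels
    return {key: AGG[agg](vals) for key, vals in groups.items()}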
Day        Salesman  Title               Store        Sales
6-Feb-97   Netz      Symposium           Paris        7
18-Feb-97  Netz      Karamazof brothers  Seattle      5
11-May-97  Netz      Ace of Spades       Los Angeles  20
3-Sep-97   Netz      Zarathustra         Nagasaki     50
3-Sep-97   Netz      Report to El Greco  Nagasaki     30
1-Jul-97   Venk      Ace of Spades       Athens       13
1-Jul-97   Venk      Piece of Mind       Athens       34
Fig. 1. Detailed Data Set DS0.
Theorem 1. The Cube Algebra CA is sound (i.e., the result of all the operations is always a cube) and complete (i.e., any valid cube can be computed as the combination of a finite set of CA operations). ■
Example 1. To motivate the discussion we customise the example presented in [14] to an international publishing company with travelling salesmen selling books and CD's to stores all over the world. The database (Fig. 1) stores information about the sales of a title that a salesman achieved on a particular date and city. The dimensions of our example are Person, Location, Product and Date (Fig. 2). Measure Sales is functionally dependent on dimensions Date, Product, Person and Location. ■
Fig. 2. Dimensions: (a) Dimension Location (levels City, Country, Continent, ALL), (b) Dimension Product (levels Title, Type, Category, ALL), (c) Dimension Person (levels Salesman, ALL), (d) Dimension Date (levels Day, Week, Month, Year, ALL).
The organisation of information in different levels of aggregation (i.e., dimensions) comes in handy because OLAP users are unlikely to directly ask questions about the detailed data that are stored in the database. Instead, they are more interested in aggregated information according to the categorisation groupings. In the following, we present three queries and the respective algebraic representation that could have been a typical sequence of operations during an OLAP session.
Query 1. Find the maximum sales by month, category of item, salesman and country.
c1=nav(DS0,[Month,Category,Salesman,Country,Max_val],max(sales))=(DS0,true,[Month,Category,Salesman,Country,Max_val],max(sales))
Query 2. Find the maximum sales outside the American continent by month, category of item, salesman and country. c2=σanccontinent (c1)=(DS0,anccontinent (City)≠'America', city country (country)≠‘America’ [Month,Category,Salesman,Country,Max_val],max(sales)). Query 3. Find the summary of sales outside the continent of America by month, type of title and country of store.
c3=nav(c2,[Month,Type,All,Country,Sum_val],sum(Sales))= (DS ,anccontinent (City)≠'America',[Month,Type,All,Country,Sum_val], city 0
sum(sales)). During this particular OLAP session the user has performed (a) a roll-up from the detailed data set, (b) a selection and (c) a slicing (of dimension Person), a drill down (from Category to Type level) and a change in the aggregation function (from max to sum). In the first operation, one can notice that the semantics of the navigation operation allow us to use an arbitrary name (e.g., Max_val) for the measure that computes the maximum value per group of aggregation. In the second operation, notice that the expression anccontinent country (Country) which is directly applicable to the schema (and (City), that directly data) of the cube c1 is transformed to its equivalent anccontinent city 0 applies to the detailed data set DS , through the use of the definition of the detailed selection condition. The presented model stresses the fact that a cube we can treated both as a query and as a set of tuples. We believe that this aspect of OLAP was neglected in the previous approaches. In this example, the contribution of treating cubes as views over the detailed data set is eminent. Actually, the fact that we have retained the history of selections permits us to be able to drill-down and change the aggregation function. Otherwise, to perform the drill-down operations we should employ a join operation of c2 with DS0. The same also holds for the change in the aggregation function. Using the history of selections we can (a) avoid to perform a costly join operation and (b) possibly further optimise the execution of the operation through the use of already computed cubes. The second possibility will be investigated in Section 3. As we have already stressed, this is a logical model for cubes. We do not advocate that the physical computation of the results of an operation should actually be computed all the way back from the detailed data set. Actually, although drill-down and change of aggregation function can be performed directly, only through the use of the semantics of our model, can the selection and roll-up operations be performed over the original cube, without referring to the detailed data set. In the case of selection, it suffices to simply pass all the tuples of the cube from the filter of the applied selection condition. In the case of roll-up to coarser levels of granularity, it also suffices to group the tuples of the cube and apply the appropriate aggregate function. These simple optimisation strategies are generalised in Section 3 with a more powerful approach, capable of detecting whether any cube can be computed from the data of another cube, simply by comparing their definitions.
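The "keep the definition, not just the tuples" idea behind this session can be sketched as a tiny wrapper over the evaluation routine of Section 2; all names here are hypothetical and anc is the helper assumed earlier.

from dataclasses import dataclass, replace
from typing import Callable, Dict

@dataclass(frozen=True)
class CubeDef:
    # a cube is kept as a definition over the detailed data set DS0
    phi: Callable[[dict], bool]      # detailed selection condition
    levels: Dict[str, str]           # detailed attribute -> target level
    agg: str                         # 'sum' | 'min' | 'max' | 'count'

def select(c, extra_phi):
    # selection conjoins the detailed equivalent of the new condition
    return replace(c, phi=lambda row, p=c.phi: p(row) and extra_phi(row))

def nav(c, levels, agg):
    # navigation (roll-up, drill-down, slice) and change of aggregate
    # function only rewrite the definition; evaluation restarts from DS0
    return replace(c, levels=levels, agg=agg)

# the session of Queries 1-3 (anc as sketched earlier; 'Store' holds cities)
c1 = CubeDef(phi=lambda r: True, agg='max',
             levels={'Day': 'Month', 'Title': 'Category',
                     'Salesman': 'Salesman', 'Store': 'Country'})
c2 = select(c1, lambda r: anc('Continent', r['Store']) != 'America')
c3 = nav(c2, {'Day': 'Month', 'Title': 'Type',
              'Salesman': 'ALL', 'Store': 'Country'}, agg='sum')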
3
The Cube Usability Problem
Problem description. There are several cases where there is the need to decide whether a view can be recomputed from another view. In the case of OLAP, the problem can be stated as follows: the OLAP user selects some data and performs an operation over them. The result of the new query can be computed, of course, from the detailed data. Nevertheless, it is possible that previously computed and cached results, or existing materialised views, could also allow the computation of the requested information. As a general statement, we could say that the problem lies in whether the computation of a new cube can be performed from an intermediate level of aggregation, rather than from the detailed data set. Formally, let DS0 be a detailed data set. Let also c_old and c_new be two cubes defined over DS0. By definition, cubes c_old and c_new can be calculated from DS0. The cube usability problem lies in determining whether the tuples of c_old can be used to compute cube c_new. It is clear that the cube usability problem is a variant of the view subsumption problem, already investigated in the field of relational databases [18].
Shortcomings of current approaches. Much effort has been spent in the past on the problem of view subsumption and query rewriting in the presence of views [4,5,8,22]. Nevertheless, the previous approaches are relational-oriented and fail to deal with specific characteristics of multidimensional modelling. We will use two examples to demonstrate these shortcomings.
Example 2. Intuitively, someone would expect that in order to solve the cube usability problem, the new cube c_new should:
1. be defined over the same dimensions with c_old and at a higher or equal level;
2. be defined over the same measure of DS0. Moreover, the aggregation functions agg_new and agg_old should be the same;
3. have a more restrictive selection condition than c_old, i.e., φ_new is contained in φ_old in the usual relational sense.
Checking conditions 1 and 2 is an easy task. To perform the comparison of Condition 3, we need to transform the selection conditions of the two cubes in order to treat them as conjunctive queries [17]. One could argue that existing relational techniques are adequate to handle this problem. Unfortunately, as we will show, there are cases where those techniques are not sufficient.
        Cube c1              Cube c2              Cube c3
Month   Sales        Month   Sales        Month   Sales
Feb     5            Feb     12           Feb     5
May     20           May     20           May     20
Sep     80           Sep     80
φ1 = 18-Feb-97≤day≤3-Sep-97 ∧ Salesman=Netz
φ2 = 6-Feb-97≤day≤3-Sep-97 ∧ Salesman=Netz
φ3 = 18-Feb-97≤day≤31-May-97 ∧ Salesman=Netz
(a)
(b): day axes for c1, c2 and c3, partitioned into Feb 97, May 97 and Sep 97.
Fig. 3. Cube usability problems with marginal conditions.
Let us consider the detailed data set DS0 of Fig. 1. Let ci, 1≤i≤3, be cubes defined as ci=[DS0, φi, [Month,ALL,ALL,ALL,ALL,ALL,ALL,Sales], sum(sales)]. Fig. 3a presents the Month level, the Sales measure and the selection conditions for each of the cubes. The problem is whether a new cube c3 can be computed using the tuples of one of the existing cubes c1 and c2. Since Conditions 1, 2 and 3 hold, one could argue that this is feasible. Yet, as we can see in Fig. 3a, only c1 can be used to compute c3. The intuitive explanation of the problem is depicted in Fig. 3b. There are three horizontal axes defined at the Day level, one for each of the cubes c1, c2 and c3. Each bold line denotes the set of days participating in the computation of the respective cube. All cubes are defined at the Month level; consequently, we partition the three axes with respect to the function anc^{Month}_{Day}. As we can see, we have three partitions: Feb'97, May'97 and Sep'97. Cube c3 can be computed from c1 because for all the partitions of c3 (i.e., Feb'97, May'97), cubes c1 and c3 cover exactly the same days. This does not hold for c2 and c3. ■
Example 3. Suppose the case where a cube c1 has a selection condition φ1=arr.year<dep.year (where arr denotes the dimension arrival date and dep denotes the dimension departure date). Suppose also that a cube c2 is defined at the month level and has a selection condition φ2=arr.month<dep.month. We can see that cube c1 can be computed from c2. This means that if c2 is materialised we can use its tuples to compute c1. We are able to perform this kind of reasoning because we take advantage of the relationship between months and years, expressed through the dimension hierarchies and the family of anc functions. To our knowledge, there is no effort in the view subsumption literature that uses this kind of knowledge. ■
Contribution. In this section, we will show that the cube usability problem is reduced to simple tests and operations. Different tests apply for different classes of queries. We explore two categories of selection conditions: (a) selection conditions with atoms involving values (i.e., of the form Lθl, Lθanc^{L2}_{L1}(l), etc.) and (b) selection conditions with atoms involving only levels (i.e., of the form L1θL2, Lθanc^{L2}_{L1}(L1), etc.). We examine the optimisation issues for the former in Section 3.1 and for the latter in Section 3.2. Finally, Section 3.3 presents a theorem with sufficient criteria and the corresponding rewriting algorithm for both cases of the cube usability problem under consideration.
In the rest of the paper, for reasons of simplicity, we will deal with cubes having only one measure. All our results can be easily extended to cubes having an arbitrary number of measures [5]. Let c_new=(DS0,φ_new,[L_new,M_new],agg_new(M)) be the new cube and c_old=(DS0,φ_old,[L_old,M_old],agg_old(M)) be the candidate cube, where L_new and L_old are sets of levels coming from dimension sets D_new and D_old respectively, M_new and M_old are measures, and finally, agg_new and agg_old are aggregate functions.
3.1
Equivalent Transformations for Atoms Involving Values
Suppose two levels L_old and L_new, such that L_old≺L_new. Function anc^{L_new}_{L_old} defines a partition over the values of L_old with respect to the values of L_new (e.g., the partitioning of months with respect to years). Suppose now two atoms a1 and a2 over L_old, as in the case of Fig. 3. To perform an aggregation to L_new, the two atoms must hold the same ranges of values for each and every partition that L_new defines over L_old. Generalising this observation, in the case where two selection conditions involve a larger conjunction of atoms, we must:
1. transform the selection conditions to concrete ranges for each dimension;
2. reduce the atoms to the same level, using appropriate transformations (so that they can be compared);
3. check whether the broader selection condition is defined identically for the marginal constraints of the other selection condition.
The following auxiliary definition introduces the notion of dimension interval, which is a concrete range over the domain of a certain dimension level.
Definition 1: A dimension interval (DI) is one of the following: (a) true, (b) false and (c) an expression of the form l1≤L≤l2, where L is a variable ranging over the level of a dimension and l1 and l2 are values. ■
Dimension Interval True False L’ min(descL’ L (l))≤L≤max(descL (l))
ancL’ L (L)
-∞
l
min(descL’ L (next(l)))≤L<+∞
ancL’ L (L)≤l
-∞
l≤ancL’ L (L)
min(descL’ L (l))≤L<+∞
Fig. 4. Transformation from atoms to dimension intervals
Fig. 4 shows how single atoms can be transformed to DI's. Values -∞ and +∞ have the obvious semantics. Moreover, functions prev and next result in the previous and the following value of l in the domain of L respectively. In general, to determine whether a cube cold can be used for the computation of new c , we need to partition the detailed level of each dimension according to the respective level of cnew. If for each partition of cnew, there exists an identical partition of cold, then cold can be used to compute cnew. We formalise this relationship between two cubes, through Definition 2. Definition 2. L-containment: Let D be a set of dimensions and φold, φnew be two selection conditions involving levels only from D. Let L be a set of levels, each belonging to a different dimension of D. Let also the two cubes cnew=(DS0, φnew,[L,M],agg(M)) and cold=(DS0,φold,[L,M],agg(M)), defined over an arbitrary detailed data set DS0. Selection condition φnew is L-contained in φold (denoted by φnew⊆Lφold) if cnew⊆cold for any data set DS0. ■
To tackle the problem of cube usability between cubes of different aggregation granularities, we introduce Algorithm Check_Atoms_Usability, which checks the containment of conjunctions of atoms that involve values. Notice that our analysis does not include ≠. This case will be handled in Section 3.3.

Algorithm Check_Atoms_Usability.
Input: Two conjunctions of atoms a and b involving only values, and a set of levels L'.
Output: true if a ⊆L' b, false otherwise.
1. Write all atoms of a and b as DIs using the transformations of Fig. 4 (where the level L is the detailed level of each dimension).
2. Group all DIs of a and b by dimension level and produce for every set a single DI having the most restrictive boundaries. Let a' and b' be the result, respectively.
3. If there exists a DI false in a' Then Return true.
4. If there exists a DI false in b' Then Return false.
5. For every DI a of a'
6.   If a is defined over a dimension level Di.L0 that does not exist in any DI of b' Then
7.     Introduce DI -∞ ≤ Di.L0 ≤ +∞ to b'.
8. EndFor
9. If b' has more DIs than a' Then Return false.
10. For every DI a=(As,Ae) of a'
11.   Let b=(Bs,Be) be the DI of b' involving the same dimension as a. Let also L' be the respective level in L'.
12.   Case As ...

Fig. 5. Algorithm Check_Atoms_Usability
For Example 1, Algorithm Check_Atoms_Usability deduces that φ1 L-contains φ3 (with respect to level Month), while φ2 does not. Moreover, it is interesting to see that if one considers the year level, neither φ1 nor φ2 L-contains φ3.
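A hedged sketch of the containment test that the algorithm performs once each condition has been reduced to one interval per dimension (Steps 1-2): rather than reproducing the case analysis of Step 12, it checks the partition property of Fig. 3 directly on a small finite domain. The data, the helper names and the illustrative φ intervals are assumptions of ours.

def covered(interval, domain):
    lo, hi = interval
    return {v for v in domain if lo <= v <= hi}

def l_contained(a, b, anc, domain):
    """True if the condition represented by interval a is L-contained in the
    condition represented by interval b, w.r.t. the grouping given by anc:
    on every partition touched by a, the two intervals cover the same values."""
    cov_a, cov_b = covered(a, domain), covered(b, domain)
    for p in {anc[v] for v in cov_a}:
        part = {v for v in domain if anc[v] == p}
        if part & cov_a != part & cov_b:
            return False
    return True

DAYS = range(1, 91)
ANC_MONTH = {d: ("Feb-97" if d <= 28 else "May-97" if d <= 59 else "Sep-97")
             for d in DAYS}

phi3 = (1, 59)    # the new cube's days (all of Feb-97 and May-97), illustrative
phi1 = (1, 59)    # covers exactly the same days on those partitions
phi2 = (1, 45)    # cuts the May-97 partition in the middle

print(l_contained(phi3, phi1, ANC_MONTH, DAYS))   # True:  c1 can compute c3
print(l_contained(phi3, phi2, ANC_MONTH, DAYS))   # False: c2 cannot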
3.2 Equivalent Transformations for Atoms Involving Only Levels
Following [17], we assume the existence of two infinite, totally ordered domains, L and L', isomorphic to the integers. Let also f be a total, monotone function over L, mapping the values of domain L to the values of domain L'. The family of anc functions fulfils these requirements. We assume that we are given a collection of inequalities of the form XθY or f(X)θf(Y), where θ is one of <, ≤, =, ≠. For every equality X=Y, we substitute every occurrence of X with Y. We also eliminate any pair of inequalities f(X)≤f(Y) and f(Y)≤f(X), where X,Y are distinct variables, with f(X)=f(Y). We will use the following set of axioms for these inequalities:
Fig. 6. Axioms A1-A15 for L-containment checking (among them: A1: X≤X; A8: X≤Z, Z≤Y, X≤W, W≤Y and W≠Z imply X≠Y; and a monotonicity axiom, X≤Y implies f(X)≤f(Y)).
We assume that our models are assignments of integers to variables. Expressions of the form f(X) are also treated as variables. For variables of the form X we apply axioms A1 to A9, and for variables of the form f(X) we apply axioms A1 to A15.

Theorem 2. The axioms A1-A15 are sound and complete. ■

In order to check whether one set of inequalities T follows from another set of inequalities S, we compute the closure S+ by applying the axioms A1-A15 until they no longer generate any new inequalities. Then, we check whether T is a subset of S+.
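A sketch of the closure test, assuming inequalities are encoded as (op, x, y) triples and f-terms as strings; only a few representative rules (reflexivity, transitivity, weakening of < to ≤, and the monotonicity rule lifting X≤Y to f(X)≤f(Y)) are encoded here, not the full axiom set of Fig. 6.

def closure(ineqs, variables):
    """Compute a closure of a set of inequalities (op, x, y), op in {'<', '<='},
    under a few representative rules, by iterating to a fixpoint."""
    s = set(ineqs) | {("<=", v, v) for v in variables}          # reflexivity
    while True:
        new = set()
        for op1, a, b in s:
            new.add(("<=", a, b))                               # X<Y implies X<=Y
            if not a.startswith("f(") and not b.startswith("f("):
                new.add(("<=", "f(%s)" % a, "f(%s)" % b))       # monotonicity of f
            for op2, c, d in s:
                if b == c and a != d:                           # transitivity
                    new.add(("<" if "<" in (op1, op2) else "<=", a, d))
        if new <= s:
            return s
        s |= new

# Does arr.month < dep.month entail f(arr.month) <= f(dep.month), e.g. for
# f = anc^Year_Month (the kind of reasoning used in Example 3)?
S = {("<", "arr.month", "dep.month")}
T = {("<=", "f(arr.month)", "f(dep.month)")}
print(T <= closure(S, {"arr.month", "dep.month"}))              # True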
3.3 Testing Cube Usability
In this section, we combine the results of Sections 3.1 and 3.2 to provide a test for several cases of cube usability. One can transform any kind of formula, using logical transformations [6], to an equivalent formula consisting of disjunctions of conjunctions which do not involve ≠ and ¬. Theorem 3 provides sufficient criteria for a cube cold to be used for the computation of another cube cnew. Algorithm Cube_Usability describes the specific steps to be followed for this computation.

Theorem 3. Suppose a detailed data set DS0=[L0,1,…,L0,n,M0] and two cubes cold=(DS0, φold, [Lold_1,…,Lold_n, Mold], aggold(M0)) and cnew=(DS0, φnew, [Lnew_1,…,Lnew_n, Mnew], aggnew(M0)). If
1. aggold = aggnew,
2. Lold_i ≺ Lnew_i, 1≤i≤n, and
3. one of the following two cases holds for φold and φnew:
   • φold and φnew involve conjunctions of atoms only of the form LiθLj, all the levels Li, Lj are higher than the respective levels of the schema of cold (i.e., Lold_i ≺ Li and Lold_j ≺ Lj) and φold belongs to the closure of φnew, or
   • φold and φnew involve conjunctions of atoms of the form Lθl and φnew ⊆[Lnew_1,…,Lnew_n] φold,
then Algorithm Cube_Usability correctly computes cnew from the tuples of cold.
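To illustrate how the preconditions of Theorem 3 might be assembled in code, here is a small sketch; the hierarchy encoding, the dictionaries and the externally supplied containment flag (standing in for Check_Atoms_Usability or the closure test of Section 3.2) are all assumptions of ours.

# The ≺ relation of each dimension encoded as a bottom-up level list.
LEVEL_ORDER = {
    "Date":    ["Day", "Month", "Year", "ALL"],
    "Product": ["Item", "Type", "Category", "ALL"],
}

def at_or_below(dim, l1, l2):
    """l1 is at or below l2 in dimension dim."""
    order = LEVEL_ORDER[dim]
    return order.index(l1) <= order.index(l2)

def usable(c_old, c_new, conditions_contained):
    """c_old, c_new: dicts with 'agg' and 'levels' = {dimension: level}.
    conditions_contained: result of the containment test of Section 3.1/3.2,
    supplied by the caller (e.g. Check_Atoms_Usability)."""
    if c_old["agg"] != c_new["agg"]:                       # condition 1
        return False
    for dim, l_new in c_new["levels"].items():             # condition 2
        if not at_or_below(dim, c_old["levels"][dim], l_new):
            return False
    return conditions_contained                            # condition 3

c_old = {"agg": "sum", "levels": {"Date": "Month", "Product": "Type"}}
c_new = {"agg": "sum", "levels": {"Date": "Month", "Product": "Category"}}
print(usable(c_old, c_new, conditions_contained=True))     # True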
Theorem 3 tests, for usability, pairs of cubes involving conjunctive selection conditions which do not involve ≠ and ¬. Cubes involving disjunctive selection conditions can be treated in the usual way [17].

Example 4. Let cnew and cold be the cubes over DS0 of Fig. 1 defined as follows:
cold=(DS0, φold, [Month,Country,Type,Salesman,Sum_old], sum(Sales))
and
cnew=(DS0, φnew, [Month,Country,Category,Salesman,Sum_new], sum(Sales))
where φold = 18-Feb-97≤day ∧ day≤3-Sep-97 ∧ anc^Category_Item(Item)="Books"
and φnew = 1-Mar-97≤day ∧ day≤3-Sep-97 ∧ "Literature"≤anc^Type_Item(Item) ∧ anc^Type_Item(Item)≤"Philosophy".

Algorithm Cube_Usability.
Input: A detailed data set DS0=[L0,1,…,L0,n,M0] and two cubes cold=(DS0, φold, [Lold_1,…,Lold_n, Mold], aggold(M0)) and cnew=(DS0, φnew, [Lnew_1,…,Lnew_n, Mnew], aggnew(M0)) such that φold and φnew involve either (a) conjunctions of atoms of the form Lθl or (b) conjunctions of atoms of the form LθL', where L and L' are levels and l is a value.
Output: A rewriting that calculates cube cnew from the tuples of cold.
1. If all atoms of φold and φnew involve conjunctions of atoms of the form Lθl Then
2.   For every atom a = anc^L_L0(L0) θ l in φnew (or equivalent to this form)
3.     If Lold is the respective level in the schema of cold and Lold ≺ L Then
4.       Transform a to anc^L_Lold(Lold) θ l
5.     EndIf
6.     ElseIf Lold is the respective level in the schema of cold and L ≺ Lold Then
7.       Transform a to Lold θ' anc^Lold_L(l), where θ'=θ except for two cases: (a) a = anc^L_L0(L0) > l and l ≠ max(desc^Lold_L(anc^Lold_L(l))), where θ' = ≥; and (b) a = anc^L_L0(L0) < l and l ≠ min(desc^Lold_L(anc^Lold_L(l))), where θ' = ≤
8.     EndIf
9.   EndFor
10. EndIf
11. If all atoms of φold and φnew involve conjunctions of atoms of the form a = anc^L_L0(L0) θ anc^L'_L0'(L0') (or equivalent to this form), where both L and L' are higher than the respective levels of cold Then
12.   For every atom a = anc^L_L0(L0) θ anc^L'_L0'(L0') in φnew
15.     Transform a to anc^L_Lold(Lold) θ anc^L'_Lold'(Lold')
16.   EndFor
17. EndIf
18. Apply the transformed selection condition to cold and derive a new data set DS1.
19. Replace all the values of DS1 with their ancestor values at the levels of cnew, resulting in a new data set DS2.
20. Aggregate ("group by" in the relational semantics) on the tuples of DS2, so that we produce cnew.

Fig. 7. Algorithm Cube_Usability
(a) cold:
Month    Type        Salesman  Country  Sum_old
Feb-97   Literature  Netz      USA      5
Sep-97   Philosophy  Netz      Japan    50
Sep-97   Literature  Netz      Japan    30

(b) DS1:
Month    Type        Salesman  Country  Sum_1
Sep-97   Philosophy  Netz      Japan    50
Sep-97   Literature  Netz      Japan    30

(c) DS2:
Month    Category  Salesman  Country  Sum_2
Sep-97   Book      Netz      Japan    50
Sep-97   Book      Netz      Japan    30

(d) cnew:
Month    Category  Salesman  Country  Sum_new
Sep-97   Book      Netz      Japan    80

Fig. 8. Calculating cnew from cold
To check whether cnew can be computed from cold we apply Theorem 3. The schemata and aggregation functions of the two cubes are compatible (conditions 1 and 2 of Theorem 3). Moreover, φnew is L-contained in φold with respect to the levels of cnew. Following Lines 2-10 of Algorithm Cube_Usability, we transform φnew so that it can be applied to the schema of cube cold. The transformations of Lines 3-8 result in
φn@o = Mar-97≤Month ∧ Month≤Sep-97 ∧ "Literature"≤Type ∧ Type≤"Philosophy".
We apply the transformed selection condition to cold (depicted in Fig. 8a) and derive a new data set DS1 (depicted in Fig. 8b). Then, we replace all the values of DS1 with their ancestor values at the levels of cnew (Line 19), resulting in a new data set DS2 (depicted in Fig. 8c). Finally, we aggregate the tuples of DS2 and we produce cnew (depicted in Fig. 8d). ■
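As a concrete illustration of Lines 18-20, the following sketch carries out the three steps on data shaped like Fig. 8; the Python names, the ancestor dictionary and the use of plain string comparison in place of the real Month and Type orderings are illustrative assumptions of ours.

from collections import defaultdict

COLD = [  # Fig. 8a: (Month, Type, Salesman, Country, Sum_old)
    ("Feb-97", "Literature", "Netz", "USA",   5),
    ("Sep-97", "Philosophy", "Netz", "Japan", 50),
    ("Sep-97", "Literature", "Netz", "Japan", 30),
]

ANC_TYPE_TO_CATEGORY = {"Literature": "Book", "Philosophy": "Book"}  # anc^Category_Type

def phi_n_at_o(row):
    """The transformed selection phi_n@o of the example; plain string comparison
    stands in for the real orderings of the Month and Type domains."""
    month, type_, _salesman, _country, _value = row
    return "Mar-97" <= month <= "Sep-97" and "Literature" <= type_ <= "Philosophy"

ds1 = [r for r in COLD if phi_n_at_o(r)]                       # Line 18: selection -> DS1
ds2 = [(m, ANC_TYPE_TO_CATEGORY[t], s, c, v)                   # Line 19: roll up Type -> Category
       for (m, t, s, c, v) in ds1]
cnew = defaultdict(int)                                        # Line 20: group by and aggregate
for m, cat, s, c, v in ds2:
    cnew[(m, cat, s, c)] += v

print(dict(cnew))   # {('Sep-97', 'Book', 'Netz', 'Japan'): 80}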
4 Discussion and Future Work
We have presented a logical model for cubes based on the key observation that a cube is not a self-existing entity, but rather a view over an underlying data set. The proposed model is powerful enough to capture all the commonly encountered OLAP operations, such as selection, roll-up and drill-down, through a sound and complete algebra. We have shown how this model can be used as the basis for processing cube operations and have provided syntactic characterisations for the problems of cube usability.

Theorem 3, which provides these syntactic characterisations, is very important for the usual operations of the model. Two of the most prominent cases are: (a) navigation from a certain cube c to a cube having all its levels higher than (or equal to) the respective levels of c, and (b) selection over a certain cube c where all the levels acting as variables are higher than (or equal to) the levels of c. Of course, the applicability of Theorem 3 is not restricted to these two simple cases. Normally, an OLAP screen contains more than one cube [14]. Thus, an interactive OLAP session produces many cubes which possibly overlap. Computing a new set of cubes can possibly be achieved by using already computed and cached cubes (provided that they fulfil the criteria of Theorem 3). Consequently, the results
on the problem of cube usability can be used both for the query optimisation and the caching processes. The cube usability results can also be applied in the problem of data warehouse design, where the optimal set of views (with respect to query and maintenance cost) has to be derived. Testing for cube usability can avoid redundancy in the final data warehouse schema and improve the run-time of the design algorithm [12]. As future work, we plan to incorporate our results in a system under construction in NTUA. The modelling parts could be extended to take into account aspects of the hierarchy structure (partial ancestor functions, hierarchies that are not well captured as lattices, etc.). The theoretical results over query processing can be extended to handle optimisation issues for a broader set of selection conditions, partial rewritings and optimisation of the physical execution for cube operations. Finally, a challenging issue is how to devise smarter algorithms for the cube usability problems.
References
1. R. Agrawal, A. Gupta, and S. Sarawagi. Modelling multidimensional databases. Technical report, IBM Almaden Research Center, San Jose, California, 1995.
2. L. Cabbibo and R. Torlone. Querying Multidimensional Databases. 6th International Workshop on Database Programming Languages (DBPL6), 1997.
3. S. Chaudhuri, K. Shim. Optimizing Queries with Aggregate Views. In Proceedings of the 5th International Conference on Extending Database Technology (EDBT-96), Avignon, France, March 25-29, 1996.
4. S. Chaudhuri, S. Krishnamurthy, S. Potamianos, and K. Shim. Optimizing queries with materialized views. In Proceedings of the 11th International Conference on Data Engineering (ICDE), IEEE Computer Society, pp. 190-200, Taipei, March 1995.
5. S. Cohen, W. Nutt, A. Serebrenik. Rewriting Aggregate Queries Using Views. In Proceedings of the 18th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS), Philadelphia, Pennsylvania. ACM Press, 1999.
6. H.B. Enderton. A Mathematical Introduction to Logic. Academic Press, 1972.
7. M. Gebhardt, M. Jarke and S. Jacobs. A toolkit for negotiation support on multidimensional data. In Proceedings of the ACM SIGMOD International Conference on Management of Data, Tucson, Arizona, 1997.
8. A. Gupta, V. Harinarayan, and D. Quass. Aggregate query processing in data warehouses. In Proceedings of the 21st International Conference on Very Large Data Bases (VLDB), Zurich, Switzerland, Morgan Kaufmann Publishers, August 1995.
9. M. Gyssens, L.V.S. Lakshmanan. A Foundation for Multi-Dimensional Databases. In Proceedings of the 23rd International Conference on Very Large Databases (VLDB), Athens, August 1997.
10. W. Lehner. Modeling Large Scale OLAP Scenarios. In Proceedings of the 6th International Conference on Extending Database Technology (EDBT-98), 1998.
11. C. Li, X. Sean Wang. A Data Model for Supporting On-Line Analytical Processing. In Proceedings of the International Conference on Information and Knowledge Management (CIKM), 1996.
12. S. Ligoudistianos, T. Sellis, D. Theodoratos, and Y. Vassiliou. Heuristic Algorithms for Designing the Data Warehouse with SPJ Views. In Proceedings of the First International Conference on Data Warehousing and Knowledge Discovery (DaWaK), Lecture Notes in Computer Science, Vol. 1676, Springer, 1999.
13. Metadata Coalition. Metadata Interchange Specification (MDIS v. 1.1). http://www.metadata.org/standards/toc.html, 1997.
14. Microsoft Corp. OLEDB for OLAP, February 1998. Available at http://www.microsoft.com/data/oledb/olap/
15. OLAP Council. The APB-1 Benchmark. 1997. Available at http://www.olapcouncil.org/research/bmarkly.htm
16. TPC. TPC Benchmark H and TPC Benchmark R. Transaction Processing Council, June 1999. Available at http://www.tpc.org/
17. J. Ullman. Principles of Database and Knowledge-Base Systems. Volume II: The New Technologies. Computer Science Press, 1989.
18. J.D. Ullman. Information integration using logical views. In Proceedings of the 6th International Conference on Database Theory (ICDT-97), Lecture Notes in Computer Science, pp. 19-40, Springer-Verlag, 1997.
19. P. Vassiliadis, T. Sellis. A Survey on Logical Models for OLAP Databases. SIGMOD Record, vol. 28, no. 4, December 1999.
20. P. Vassiliadis, S. Skiadopoulos. (Not) Yet Another Model for Multidimensional Databases (Extended version). Technical Report, KDBSL, 1999. Available at http://www.dblab.ece.ntua.gr/~pvassil/publications/cube99.ps.gz
21. P. Vassiliadis. Modeling Multidimensional Databases, Cubes and Cube Operations. In Proceedings of the 10th International Conference on Scientific and Statistical Database Management (SSDBM), Capri, Italy, July 1998.
22. P. Larson, H.Z. Yang. Computing Queries from Derived Relations. In Proceedings of the 11th International Conference on Very Large Data Bases (VLDB), Stockholm, Sweden, Morgan Kaufmann Publishers, August 1985.
Managing the Software Process in the Middle of Rapid Growth: A Metrics Based Experiment Report from Nokia
Tapani Kilpi
Nokia, Elektroniikkatie 3, 90570 Oulu, Finland
[email protected]
Abstract. Software Process Improvement is a well known approach in software organisations these days. Process assessments or audits have been experienced in one form or another by 'almost everybody' in the business. What are the benefits that have been reached through these efforts? An assessment or an audit is often planned and carried out by people who are outsiders in the organisation, and the result of these approaches is normally an improvement plan or a set of process improvement ideas. What happens after an assessment? Who takes the responsibility for ensuring that the improvement plan is really implemented in an organisation? More and more organisations take this challenge seriously and have allocated a remarkable amount of resources to make their process improvement a continuous activity. This paper presents one approach for organising the process management activity in an organisation, and also practical experiences collected in using the approach.
1 Introduction

Process management and improvement are concepts that have entered everybody's consciousness in the software business during the past decade. The aim of these approaches is to teach organisations to carry out their daily routines in a correct and careful way. Quality must not be sacrificed even under the threat of tight deadlines. In an ideal situation the software projects can be planned in a way that resource usage and schedules are in balance and, at the same time, the quality of the products is at a satisfying level. This provides good facilities for keeping customer satisfaction high. All these observations are often heard as arguments when process improvement is offered as a means of increasing the overall effectiveness and quality in an organisation. Do these ideas work in practice then? Certainly, if you manage to implement them successfully in an organisation. Several process improvement methods use assessments as their main source of information in collecting data about a target organisation: ISO/IEC 9000 standards [5, 6], CMM [2, 3, 4, 11], Bootstrap [1], ISO/IEC 15504 [7], etc. However, in order to make an improvement approach a continuous and fundamental activity in an organisation
there is a need to create a capable process management activity. In a large organisation this normally means a separate process management organisation with several full-time process developers belonging to it. How to build this type of sub-organisation then? How to plan its activities? How to create an effective communication mechanism between the process management activity and the rest of the organisation? These are just a few of the questions that need to be solved in an organisation that aims to make a serious attempt to start a process management activity as a part of its daily routines.

The Fixed Switching Research & Development (FSG R&D) unit of Nokia produces software for the DX200 product line of phone switches. As a separate Nokia unit its history started in the year 1995 and today it has approximately 600 employees. The volume of production has, together with the number of employees, grown rapidly in FSG R&D ever since the beginning. All these attributes make FSG R&D an interesting environment from the point of view of Software Process Improvement. This paper presents the experiences gained in FSG R&D in developing a process management activity in a systematic way.
2 FSG R&D

The FSG R&D (Fixed Switching Research and Development) unit of Nokia produces software for the DX200 product line of phone switches. DX200 development was started in the early 70's as a research project and the first customer delivery took place in 1980. Currently the third generation of DX200 is in production and its applications cover all the main areas of the switching business. There are over 15 million source lines of code in the DX200 program library and more than 2000 engineers are committed to carrying out the DX200 development work. Unofficially, DX200 is the largest software project in Finland. FSG R&D produces Fixed Applications on the DX200 System Platform, presented in figure 1. Another important type of application being developed on the DX200 platform is the GSM applications. The phone switch applications typically have a common core part, on top of which a customer-specific part is built for the separate customer deliveries. From the point of view of FSG R&D the DX200 environment means the following challenges to cope with:
• large and complex software,
• large and growing software, and
• a multisite environment.
Ever since year 1995 the number of customer deliveries and projects has grown strongly each year in FSG. Figure 2 presents the trend of increase in the number of projects in FSG R&D from year 1996 to year 1999. Today the number of customer delivery projects is four times as big as it was in year 1996. This fact, together with the increased number of employees, has led FSG R&D to a situation where it has had to develop a high-standard Process Management activity to be able to cope with
the increased demands. High-standard processes make it possible, on the one hand, to create capable facilities for new employees to be trained for their tasks and, on the other hand, to coordinate increased numbers of people and projects – without sacrificing quality.
Fig. 1. DX200 Applications and System Platform [9] (layers: country/customer specific part; GSM Application and Fixed Application as DX200 Applications; common core; Switching Platform and Computing Platform as the DX200 System Platform)
Fig. 2. The relative trend of the number of FSG R&D customer delivery projects in comparison to the year -96 level (y-axis: percentage of the year -96 level, 0%-450%; x-axis: years -96 to -99).
3 Process Management Activity of FSG R&D

Faced with the challenges presented in the previous section, FSG R&D has from the beginning clearly realised that high-standard processes and efficient process improvement have a vital role in the organisation. FSG R&D allocates remarkable effort to reaching these goals: there are fourteen full-time process developers in
the organisation and more than 60 part-time process developers at the different organisational levels. The common objective of all these process developers is taking care of the continuous improvement of FSG R&D processes, methods and tools.

The process improvement activity of FSG R&D, illustrated in figure 3, is steered by the FSG R&D Management Team. The processes of FSG R&D have been divided into five separate process areas: Release, Design, Testing, Maintenance and Quality Assurance Processes. All these process areas are managed by Process Teams, each of which is responsible for one specific area. Each process team is headed by a Process Development Manager, who is a full-time process developer. The remaining members of the teams are part-time process developers, i.e. their primary role in FSG R&D is software development, testing, etc. The Management Team defines the long-term process improvement Road Maps and Action Plans and also accepts and gives resource commitment to the process improvement projects proposed by the Process Teams.

Fig. 3. The Steering Structure of FSG R&D Process Improvement Activity [9] (elements shown: the R&D Management Team with Road Maps, Action Plans, monthly acceptance of the Master Improvement Plan, milestone decisions and resource commitment; the Process Teams for the Release, Design, Testing, Maintenance and Quality Assurance processes, headed by Process Development Managers, handling improvement coordination and improvement portfolio management; improvement project initiatives; the FSG R&D departments implementing improvement projects; other units in Nokia)

All process improvement projects are carried out in the same way as the software projects in FSG R&D, and their progress and resource usage are followed and compared to the project plan. FSG was originally separated from another Nokia unit into a unit of its own in year 1995. It inherited its key process models partly from its Nokia mother unit and partly from the general process models of Nokia. The inherited models were based on the ISO 9000 standard and the general project milestone model of Nokia and offered a firm base for starting productive work in FSG. However, it was obvious already in the beginning that the business of FSG would grow rapidly during the following years. The need for tailoring and improving the process models further to satisfy FSG-specific needs became urgent. The most important process improvement activities carried out in FSG R&D from year 1996 to 1999, illustrated in figure 4, are:
Fig. 4. History of FSG R&D Process Management Development: the highlights from year 1996 to 1999 [9] (timeline from ≤1995 to 2000).
1. In year 1996, the Process Management activity and the model for continuous process improvement were established in FSG R&D. Work was also started on building a Metrics Programme [10, 12] for FSG R&D needs, and the first FSG R&D wide Metrics Programme, concentrating on the Product Process metrics, was started at the end of year 1996.
2. During year 1997 two CMM-based self-assessments were carried out for the first time in FSG R&D, and the first long-term Master Improvement Plans for the Process Management process were defined and implemented in the organisation. The two first main aims in FSG R&D from the point of view of Process Management were reorganising and redefining the Release Management and the Testing processes. The development of the Innovation process was also started during year 1997.
3. In year 1998 the effects of both the renewed process models and the organised, continuous Process Management activity started to become clearly visible as practical results. Visible proof of this came from a full-scale CMM assessment held at the end of year 1998, in which FSG R&D reached a high maturity level. The most important process improvement approaches started in FSG R&D during 1998 were the redefinitions of the SQA, Review and Project Management processes. The renewed SQA included capable facilities for the use of metrics throughout the organisation.
4. Finally, during year 1999, the most important process improvement approaches have been the implementation of a company-wide Quality Management system and the renewals of both the Design and the Product processes.
The overall conclusion on the status of the FSG R&D software process is that it is educated, well-organised and effectively established throughout the organisation. However, FSG R&D has gone through a challenging path during the past four years. Despite the fact that the whole Process Management approach seems to have turned into a success story in FSG R&D, there have also been many difficulties that have been encountered
and that have had to be overcome before reaching the current status. The next section will present some metrics-based findings and analyse the effects of systematic Process Management improvement in FSG R&D from the beginning of year 1996 to the end of year 1999.
4 Metrics Based Findings from FSG R&D

The aim of the first Metrics Programme of FSG R&D (see the previous section) was to measure the status of the Product Process. The programme was based on a set of metrics, each of which was followed four times a year. The programme has been going on for four years now and some trends that show the progress of the FSG processes can be seen. Earlier in this paper, in figure 2, the remarkable increase in the number of customer delivery projects in FSG R&D was presented. This number shows a strongly increasing trend from year 1996 to 1999. In the middle of this type of rapid growth any organisation has to allocate a remarkable amount of resources to developing its processes in order to prevent the accuracy of the projects and the quality of the products from decreasing. In the following, the progress of four metrics included in the Product Process metrics programme of FSG R&D is presented as trend curves from year 1996 to 1999 (the Measuring Period).

Figure 5 shows the relative trend curve of the delayed projects in FSG R&D during the Measuring Period. This chart is based on the Release Accuracy metric of FSG R&D. The Release Accuracy metric measures the absolute number of days of delay in the projects. A project is classified as delayed if the delay exceeds a predefined number of days when checked against the original schedule. The number of delayed customer delivery projects shows a slightly decreasing trend during the Measuring Period.
Fig. 5. Relative trend curve of delayed projects (as a percentage of all projects) in FSG R&D from 1996 to 1999.
This finding shows that keeping to project schedules is today at a slightly better level than it was in year 1996. During this period the number of customer delivery projects has grown by 400% in FSG R&D.

Figure 6 shows the relative trend curve of the number of faults in FSG R&D during the Measuring Period. This chart is based on the Release Maturity metric of FSG R&D. The Release Maturity metric measures the sum of internally found and customer-found faults at major milestones of a project. FSG R&D uses several severity classes for the faults. The faults of the two most severe fault classes, the critical and the important, are counted into the Release Maturity metric value. The curve in figure 6 shows a relative trend of the one-year averages of the absolute numbers of faults in FSG R&D projects during the Measuring Period. The trend of the number of faults shows a remarkable decrease. This finding implies that the average number of faults in FSG R&D projects has strongly decreased during the Measuring Period. The most important factors that have made it possible to reach this result in the middle of the growth have been the renewed Testing and Review processes (mentioned in section 3 of this paper).

Figure 7 shows the relative trend curve of the Fault Fixing Coverage metric in FSG R&D during the Measuring Period. Fault Fixing Coverage measures the ratio of the fixed faults to the total number of faults in a project. The curve in figure 7 shows a relative trend of the one-year averages of the fault fixing coverage percentages in FSG R&D projects during the Measuring Period. The trend of the fault fixing coverage shows a slight decrease from year 1996 to 1997. From year 1997 to 1999 the trend shows a steady increase. It can be concluded that at the same time as the average number of faults has decreased, the effectiveness of fixing the faults has increased. These two findings together show that the quality of software and the handling of faults have improved greatly during the Measuring Period in FSG R&D.
Fig. 6. Relative trend curve of the number of faults in FSG R&D from 1996 to 1999 (as a percentage of the year -96 level).
Fig. 7. Relative trend curve of fault fixing coverage in FSG R&D from 1996 to 1999.
Fig. 8. Relative trend curve of Resource Estimation Accuracy in FSG R&D from 1996 to 1999.
Figure 8 shows the relative trend curve of the Resource Estimation Accuracy metric in FSG R&D during the Measuring Period. The Resource Estimation Accuracy metric measures the deviation between the planned and the realised use of resources in a project. The metric value shows the corresponding coverage percentage. The curve of figure 8 shows a relative trend of the one-year averages of the Resource Estimation Accuracy percentages in FSG R&D projects during the Measuring Period. The trend of the Resource Estimation Accuracy
shows a steady decrease during the Measuring Period. This type of result has to be considered an alarming finding that needs to be analysed and taken care of somehow. Is it the 'price' of the rapidly grown numbers of personnel and projects?
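As an illustration of how the four Product Process metrics discussed above could be computed, the following sketch works on hypothetical per-project records; the field names, the delay threshold and the exact formula used for Resource Estimation Accuracy are our assumptions, not FSG R&D's definitions.

projects = [
    {"delay_days": 0,  "critical_faults": 3, "important_faults": 5,
     "fixed_faults": 7, "total_faults": 9, "planned_effort": 100, "actual_effort": 130},
    {"delay_days": 12, "critical_faults": 1, "important_faults": 2,
     "fixed_faults": 3, "total_faults": 3, "planned_effort": 80,  "actual_effort": 90},
]

DELAY_THRESHOLD = 5   # days of delay after which a project counts as delayed

def release_accuracy(ps):
    """Share of projects delayed beyond the predefined threshold (cf. Fig. 5)."""
    return sum(p["delay_days"] > DELAY_THRESHOLD for p in ps) / len(ps)

def release_maturity(p):
    """Critical plus important faults of a project at a major milestone."""
    return p["critical_faults"] + p["important_faults"]

def fault_fixing_coverage(p):
    """Fixed faults as a ratio of all faults found in the project."""
    return p["fixed_faults"] / p["total_faults"]

def resource_estimation_accuracy(p):
    """One plausible reading: how close the planned effort was to the realised
    effort, as a coverage percentage (1.0 = exact)."""
    return 1 - abs(p["actual_effort"] - p["planned_effort"]) / p["planned_effort"]

print(release_accuracy(projects))
print([release_maturity(p) for p in projects])
print([round(fault_fixing_coverage(p), 2) for p in projects])
print([round(resource_estimation_accuracy(p), 2) for p in projects])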
5 Process Improvement Recommendations for a Rapidly Growing Software Organisation

The recommendations presented in this section are experiment-based findings made in FSG R&D. The worth of these findings has been validated in two CMM self-assessments, one full-scale CMM assessment and an organisation-wide Metrics Programme presented in section 4. The recommendations presented here are especially directed at large and rapidly growing software organisations. However, they are also worth looking through for small organisations, since well-managed process improvement should be based on certain basic elements in all organisations. Management Commitment is not mentioned as a separate bullet in the list because this subject is an underlying requirement for any Process Improvement Programme – there is no process improvement without active and continuous Management Commitment. The other recommendations are as follows:

1. Allocate full-time process developers in charge. These days large organisations make a severe mistake if they decide to rely only on part-time process developers in managing their processes. If nobody in an organisation has full-time responsibility for the processes, the process development activity will definitely not be continuous. Continuous development is the only way to get any real improvement to take place, and actually even the only way to keep the previous level of process maturity. Among the 600 employees of FSG R&D there are 14 full-time process developers.
2. Use part-time process developers for assisting roles. Part-time process developers are necessary in an organisation for supporting and assisting the full-time process developers. Their role is to provide facilities for collecting and transferring information in both directions, i.e. from the software development organisation to the process management organisation and vice versa. In addition to their role as part-time process developers these people also have defined roles as part of the normal processes of the organisation. These people are in an excellent position to set an example to the rest of the people in starting to use new models, tools, etc. FSG R&D has approximately 60 part-time process developers who participate in process management monthly.
3. Plan process improvement approaches as projects. Process improvement approaches should be planned and followed as projects, i.e. they should be treated in the same way as product development projects. This is the only way to ensure that they get the resources they need and that they are fulfilled within the defined schedule. Process improvement is not something that somebody does in his/her 'own' time.
4. Get resource commitment from all the necessary groups in the organisation. Process improvement activities cannot be implemented by 'external forces' alone in an organisation. Unavoidably these approaches also bind a certain amount of product or software development resources. In practice this means that people who are expected to follow the processes are also required to participate in planning them. The commitment to allocating these resources to process improvement projects should be given by the organisation's resource owners in the same way as it is given for product development or software projects. Otherwise it is likely that the planned resources will not be available.
5. Define clear milestones for process improvement projects. Process improvement projects need the same type of follow-up as 'normal' projects. Plans, resource allocations, schedules, etc. are all necessary also in process improvement projects. Clear milestones with a defined set of exit criteria are a vital part of the follow-up.
6. Test new models and ideas by piloting. The costs of spreading a new process or practice to a large organisation are normally high. A number of people need to be trained, a number of installations need to be made, and so on. A good idea is to select a part of the organisation for piloting the new practice first. This way, the errors found need to be fixed only for the people in the scope of the pilot. The rest of the organisation then gets the fixed delivery of the practice at first hand.
7. Plan the roll-out of new processes and practices carefully [8]. A new process model or practice will probably not be popular in an organisation if its roll-out is not planned carefully. Things to be considered in planning the roll-out are, for example: target group, technical questions, technical support, training channel, training events, training material, usage support, information for interest groups, etc.
8. Use metrics for follow-up and planning of continuous improvement. Comparable and valid long-term follow-up results cannot be collected without a proper set of metrics. Definition of the metrics is a broad task that is recommendable to carry out as a Metrics Programme. In FSG R&D there are several Metrics Programmes going on continuously.
6 Conclusions

A well-planned and carefully implemented Process Management Organisation is a vital part of any large software organisation these days. The existing Process Maturity models and the assessments based on them offer a good base for getting started in arranging an effective Process Management activity in an organisation. However, a process assessment is only a start – the practical process improvement efforts need to be planned and carried out after it. In many organisations these days it is not realised clearly enough that reaching good results in process improvement requires capable resources. This is understandable, as the pressure of product development and software projects is hard and often demands 'all possible resources'. However,
resources spent on process improvement efforts will bring back the money spent on them if the efforts are planned well. A long-term plan for process improvement is required in order, on the one hand, to coordinate and fit together the separate process improvement activities in an organisation and, on the other hand, to evaluate the progress made through the years. Learning from history is a vital part of process improvement. It is also important to regard the process improvement approaches as 'normal projects' with proper follow-up, clear milestones and capable resource allocations. Otherwise there will not be any remarkable results. Also, things like piloting the big improvement approaches first in smaller groups and careful planning of the roll-out of new models and tools are worth noting. Finally, the definition and implementation of well-focused metrics give a firm base for evaluating the quality of the processes and the products and offer an effective way to inform the employees of the strategic goals.
References
1. Bicego, A. & Khurana, M. & Kuvaja, P. 1998. Bootstrap 3.0 – Software Process Assessment Methodology. In Proceedings of SQM'98.
2. Curtis, B. & Hefley, W. E. & Miller, S. 1995. People Capability Maturity Model (P-CMM), CMU/SEI-95-MM-02, Software Engineering Institute.
3. Humphrey, W. S. 1989. Managing the Software Process. Addison-Wesley Publishing Company. Pp. 299-359.
4. Humphrey, W. S. 1995. A Discipline for Software Engineering. Addison-Wesley Publishing Company. Pp. 271-306.
5. ISO 9000-3. 1991. Guidelines for the application of ISO 9001 to the development, supply and maintenance of software: Quality Management and Quality Assurance Standards – Part 3. International Organisation for Standardisation.
6. ISO 9001. 1994. Quality Systems – Model for quality assurance in design/development, production, installation and servicing. International Organisation for Standardisation.
7. ISO/IEC. 1998. Information Technology: Software Process Assessment. ISO/IEC Tech. Rep. 15504 Type 2, ISO (Ed.), Geneva, Switzerland.
8. Kiiskilä, J. 1999. Piloting as a Part of Process Improvement of Reviews – a Case Study at Nokia Telecommunications. In the Proceedings of the International Conference on Product Focused Software Process Improvement (PROFES'99), June 22-24, 1999, Oulu, Finland. Technical Research Centre of Finland, Espoo.
9. Marjakangas, H. 1999. Process Management in FSG R&D. Presentations in the Finesse Seminar at the Technical Research Centre of Oulu, Finland, April 13th 1999, and in the Nokia Quality Conference, Helsinki, Finland, June 8th-9th 1999.
10. NASA (National Aeronautics and Space Administration). 1995. Software Measurement Guidebook, Revision 1, June. Goddard Space Flight Center, Greenbelt, Maryland 20771.
11. Paulk, M. C. & Curtis, B. & Chrissis, M. B. & Weber, C. V. 1993. Capability Maturity Model, Version 1.1. IEEE Software, Volume 10, Number 4, July.
12. Solingen, R. & Berghout, E. 1999. The Goal/Question/Metric Method: A Practical Guide for Quality Improvement of Software Development. Cambridge, Great Britain: McGraw-Hill. 199 pages.
The Impacts of Electronic Commerce in the Automobile Industry: An Empirical Study in Western Australia
Peter Marshall, Roger Sor, and Judy McKay
School of Management Information Systems, Edith Cowan University, Perth, Australia
Abstract. While the hype surrounding the promise of electronic commerce seems all pervasive currently, as academics there is a need to conduct empirical studies to establish balanced and credible reviews of the impacts of the Internet and associated technologies on business practice and performance. Also, while the USA is clearly a leader in the uptake of such technologies, it is a moot point as to whether industry trends identified there translate into forecasts for similar industries in different countries and regions. This paper reports findings of a qualitative study of car retailing in Western Australia, aimed at finding out about the impacts of the Internet and electronic commerce from the perspective of senior executives in car retail outlets. Generally speaking, the executives interviewed are experiencing great uncertainty with respect to electronic commerce and its effect on their businesses: they are uncertain about its likely impacts long term (although not much is currently happening), they are uncertain about the ultimate magnitude of electronic commerce in their industry, they are uncertain about the costs and benefits of electronic commerce investments, but they are unwilling not to be involved at all. There appear to be few articulated and carefully thought-out business strategies driving their electronic commerce activities at this stage, nor is there much evidence of internal business processes being reengineered to accommodate the requirements of an electronic commerce presence.
1 Introduction
Electronic commerce is said to have the potential to alter industry structures and affect the way businesses compete for markets [1]. These effects are already apparent in some industries and specific business organisations, in the USA particularly [16], and are just now becoming visible in Australia (see [8]). Arguably, therefore, there is a need for Australian-based empirical research, particularly at the industry and supply chain level, to identify and validate early trends in electronic commerce, and to evaluate and further refine strategies being adopted by business in response to the e-commerce phenomenon. This paper focuses particularly on the anticipated impacts and trends of electronic commerce with respect to the business practices, behaviours and decision making of car dealers. All the car dealers involved in the study to date are well aware of the Internet, are somewhat aware of the trends in the USA, and of the potential of electronic commerce and the Internet to transform their business and to have an immense impact on the way they operate as a business. They
are all represented in Web sites of various types, or are in advanced stages of preparation of a web presence. But while the promises and the potential are vaguely understood, there appears to be comparatively little direct activity, comparatively little evidence of businesses being engineered to support web-based customer interactions of various types, and a pervasive attitude of “This could suddenly go up like a rocket so we can’t ignore it”. The paper which follows will describe the research project and explore some of these themes more fully.
2 The Impacts of Electronic Commerce
While electronic commerce has been subjected to a wide range of definitions in its comparatively short history, for the purposes of this paper it will be regarded as involving the buying and selling of goods and services over the Internet, and related information provision and gathering [20], [21]. Thus defined, electronic commerce would include business-to-business, and business-to-end consumer transactions and information provision over the Internet. Many pundits have forecast enormous impacts from electronic commerce in most industry sectors, predicting momentous changes in the world of commerce as we know it today. The interconnectivity and pervasiveness of the Internet do open new possibilities for the organisation of work, for the management of the supply chain and formation of dynamic trading networks [18], and for entrepreneurs to meet the needs and demands of ever-better informed consumers [4]. These trends clearly impact the retail sector. While some authors go as far as to predict the demise of bricks and mortar, ‘High Street’ shopping [3], others forecast a rapid growth in Internet shopping but still anticipate healthy activity in High Street shops, arguing that such shopping in fact meets important social, cultural and psychological needs [9],[16]. The extent of the impacts of electronic commerce in retailing remains somewhat difficult to predict with accuracy: conservative estimates suggest that Internet shopping may ultimately account for 10 – 15% of the retail sector [15]. Arguably as a particular categories’ on-line sales surpass 10% of total sales, retailers will be forced to change their business models and practices, and take account of these trends towards electronic commerce [16], [17]. Of particular interest for this paper are trends in car sales, with obvious interest placed in the changes to existing High St car yard business practices and models.
3 Impacts of Electronic Commerce on the Car Retail Industry
As there is to date no known published information on the effects of electronic commerce on car retailing in Australia, discussion at this stage must be limited to a consideration of trends and forecasts in the USA. Sales of cars in the USA, both new and used, have not escaped the impacts of electronic commerce. By 1998, for example, it was argued that more than 10% of sales of new and used cars were influenced by the Internet. Thus, 2 million purchases of new cars and 4.2 million purchases of used cars were reliant to some extent on electronic commerce [14]. The exact role of the Internet perhaps need to be clarified however. In some retail
categories, the Internet plays an important role in most, if not all, phases of the business transaction. Thus, information provision, purchase request, financial transaction, and possibly delivery of the good are all facilitated by the Internet, for example. Typically, this is not the case in the car industry, nor is it predicted to be [13]. Currently, some car purchasers are using the Internet to research vehicle types and features, and to identify and locate vehicles potentially of interest. Having made a decision about the make and model of car(s) in which they are interested, typically these consumers then go about transacting the rest of their car purchase in the more traditional manner. Of the 2.8 million Internet-influenced sales of new cars in 1998, approximately 2 million of these used the Internet for exactly this purpose [11]. Most of the remaining Internet-influenced purchasers used the Internet to locate a dealer stocking the particular type of vehicle they are interested in. At the moment, almost no complete sales are made over the Internet, nor are additional arrangements such as finance, insurance and additional warranty options generally made over the Internet. Both these areas however, are anticipated to become available by 2000, and to grow substantially by 2003 [12]. Thus, in America at the moment, comparatively few new car sales are made directly by the car manufacturer to the end consumer. Occupying a far more important role in both the new and used car marketplace are various intermediaries, who each list many dealers and offer the consumer the ability to search on various parameters, such as car type, price range, geographic location, and so on, in order to locate a smaller cohort of potentially suitable cars within reach of their homes or offices to facilitate further personal enquiry. In total, it is estimated that more than 4 million used cars will be listed on these on-line buying services in 1999, some 10% of the used cars that will be purchased this year. These intermediaries offer car dealers a substantial number of referrals each month: the larger and more successful sites may be referring more than 100,000 clients to dealers each month [11]. It could be argued that the US experience with electronic commerce seems several years in advance of Australia with respect to its acceptance and adoption of on-line retailing, and given reasonable similarities in business trends and culture between the USA and Australia, it could provide a useful model for local businesses to consider in order that they can prepare for the future. However, it could also be asserted that there are important differences in the Australian business environment, thus rendering the data and trends from the USA unreliable as predictors of trends in Australia.
4 The Research Project
The research project was designed to consist of a series of qualitative, semistructured, research interviews with senior executives (managers and/or owners) of a variety of car dealerships in the capital city of Western Australia. Each interview was based on a set of interview questions, based on the trends in electronic commerce in the retailing sector, and specifically, in the car industry, and also on the authors’ knowledge of electronic commerce trends in Australia. Some flexibility in approach was adopted to accommodate and be responsive to the varied responses received from interviewees. The interviews conducted lasted between 1.5 and 3 hours, and were crafted to encourage the senior executives interviewed to describe their
understandings of electronic commerce, and to articulate their beliefs about the impacts of electronic commerce and the Internet on their business activities and practices in the future. The interest of the researchers was thus to arrive at an understanding of the impacts of electronic commerce from the CEOs’/owners’ perspectives and to reveal their feelings and beliefs about electronic commerce, indicating that the qualitative research interview was an appropriate vehicle for the conduct of this research [6]. Each of the interviews was transcribed and subject to qualitative content analysis. The specific approach used is detailed elsewhere (see [10]), but involved categorisation of responses into certain themes, some of which were inherent in the interview questions, and some of which emerged through the interviewees’ responses to questions posed. Some nine interviews have been conducted to date. In citing from these transcribed interviews, the convention CY1-CY9 has been adopted (Car Yards 1 – 9) in order to conceal the identity of any specific dealer.
5 Findings of the Research
A number of themes or issues have emerged as a result of the interviews conducted to date. Each of these will be briefly described and discussed below. However, generally speaking, it seems fair to say that the Internet is causing both interest and uncertainty amongst the car dealers. Already, there are some impacts of electronic commerce being felt, such as the reengineering of the industry supply chain (discussed below). However, few of the car dealers feel certain as to how large the impacts of electronic commerce might ultimately be, and few of the businesses seem to be adopting quite deliberate and explicit strategies with respect to electronic commerce. There was generally little evidence of internal changes being made in response to opportunities offered by electronic commerce initiatives. 5.1
Impacts of Electronic Commerce on the Car Industry Supply Chain
The car industry has traditionally relied on a fairly uncomplicated supply chain. Car manufacturers (typically fairly large, prominent organisations) draw raw materials and other component parts from a number of suppliers. Finished new cars are then distributed to a network of car dealers, for on-selling to the end consumer. End consumers may sell used cars back to dealers (perhaps as a trade-in on another car, although not necessarily), or alternatively, may privately organise sales of used cars direct to other end consumers. Figure 1 illustrates these relationships. Proponents of electronic commerce might argue that, amongst other things, the advent of the Internet in the car industry supply chain may result in disintermediation [2], [7]. Long term this may occur, with new car buyers specifying requirements and purchasing direct from manufacturers. However, current industry activity with respect to electronic commerce does not indicate that this is occurring to any great extent as yet. Rather, Figure 2 portrays the changes occurring in the industry.
Fig. 1. Traditional Car Industry Supply Chain (boxes: component/raw material suppliers; car manufacturers; distribution by transport co's; car dealers, new and used; private used car sales)
Fig. 2. The Effects of Electronic Commerce in the Car Industry (as Fig. 1, with the addition of on-line buying services and car manufacturers' Web sites)
Already in the Australian car retailing industry, electronic commerce has seen the advent of on-line listing and buying services, acting as a conduit of information, referrals and expressions of interest between end consumers and new and used car dealers. Thus dealers register with these on-line buying services, regularly providing updates on the range and features of vehicles they have available for sale. In exchange, the on-line service providers facilitate placing interested customers in touch with the relevant dealer. Some of the dealers interviewed believed it was necessary to be represented on at least one of these electronic marketplaces.
CY4: ”I mean the job of a dealership, I believe is to get customers into the door, once you’ve got them in the door, hopefully, sales…it [on-line buying services] is very useful, so soon some of us, we will lose some deals” (if they did not become part of the on-line buying service). Car manufacturers are also developing a presence on the Web, whereby interested end users are directed towards a preferred dealer located close-by [19]. While the CEOs interviewed did not envisage the disappearance of the car dealerships as consumers purchased direct from manufacturers, they did concede that they could not be complacent about their role given the influence of the Internet and changes already occurring in the industry supply chain. Of interest was the fact that some of the car dealers felt pressured by the car manufacturers to establish a web site with links to other sites. CY3: ”Our name’s already on all the manufacturer’s sites. And our address and email address…and when we finish building our web site, you’ll click down to us from the manufacturer…Further to that, there’s pressure on us from the manufacturer to have a web site…(And then in response to a question whether the pressure came from a particular manufacturer) Yes. From all of them. From the manufacturers.” CY5: ”We’re being pushed by the manufacturers to be in there.” 5.2
Awareness and Attitudes to Electronic Commerce
The situation regarding the Internet and car retailing in Australia seemed to be one of great uncertainty and some anxiety on the part of the car dealerships. There is some awareness of the U.S. scene and the increasing importance of the Internet in retailing in general and car retailing in particular in that country. CY4: ”But, eh, there is obviously a lot of information in terms of how much business has actually been done on the net, not people just looking, and making predictions that by 2000, America was about 35% signing contracts without actually being done face to face contact.” However, typically, senior executives are informed through their reading of the popular press and through television reports, but none of those interviewed to date have undertaken serious research into electronic commerce. Very few persons in the car retailing industry in Australia are willing to forecast the future role and importance of the Internet in car retailing in Australia, although they generally suspect that it will be significant. CY8: ”We really are nowhere yet with the Internet other than we know it’s going to work and it’s in the infancy stages. It’s already showing positive signs, and we will move towards it pretty quickly. Should be interesting to see in 12 months time reflect and see.”
However, most of the larger car dealerships, particularly those with annual revenues greater than $10 million, have an Internet presence now or plan to have one in the near future. The smaller players in the industry are also either established on the Web, or are planning to become so in the foreseeable future. The web presence is usually accomplished through the electronic malls or marketplaces which were briefly outlined previously when discussing changes to the car retail industry supply chain.

Overall there is a moderate amount of skepticism about the Internet and its importance in new and used car retailing in Western Australia. Little has happened yet, and there is some feeling that, despite some journalistic stories and government reports heralding great changes, little will happen, at least in the short term.

CY4: "What is happening is that we are getting some initial interest-only enquiries over the Internet…but so far is so small relative to the opening of the site."

But there is great uncertainty also. Interviewees generally acknowledged that electronic commerce and the Internet could become enormously important to their businesses, although they were uncertain as to exactly what impacts it might have, how dramatic these impacts would be, and how these changes might be realised in their specific businesses. Despite the skepticism, none of the interviewees was willing to completely ignore electronic commerce at the moment: there was always the concern that it might suddenly become very significant, and they did not want to risk being 'left behind'.

CY7: "No, no we suspect that we'd better be there."

CY5: "So the whole email thing and Internet is a wild card. This one's from our point of view, but we don't know where the hell it's going."

CY1: "Obviously our business has to be competitive, and we have to keep up with the latest marketing trends…I am only at this stage paying a token gesture I think towards the Internet, we haven't really done it properly."

Also, it must be said that many of the dealers wanted to use the Internet to arouse interest, but then wanted to handle the actual transaction face-to-face.

CY2: "What is happening is that people are searching for products from the Internet…I don't know whether, or how many people would buy from the Internet, if you're talking about a commodity like a motor car, I still believe people still want to touch it."

CY9: "I don't think anyone would spend their money firstly without being able to touch it, see it, and I don't think we want that way either, because that person is important…"

CY4: "The point I was getting to, I don't think anybody would want to buy used cars without touching it and seeing it."
CY8: "I'm certainly not trying to encourage people to buy a motor car on the Internet. I'm trying to initialise enquiries as a result of finding a car on the Internet…The Internet is very useful to provide interesting useful information…I require enquiries from the Internet, I want then to get face-to-face."

It could hardly be asserted that the car dealerships had articulated a clear vision of electronic commerce for their organisation, which they were now pursuing through appropriate strategies. Rather, the articulated strategy was 'We have to be on the Web, because otherwise we might miss out'.

CY3: "Jeez, we've got to have a bloody web site now before everybody else out there does it."

CY4: "I am computer illiterate, emm…effectively the only decision that I had was to keep it simple. By me bringing the vehicle into stock, my car system automatically goes to the system [the on-line buying service] and then links it straight to the Internet."

CY8: "It has been said time enough that the worst thing you could do is wait until the Internet gets big…it would be so much behind…We might as well get in at an early stage."

In terms of the electronic commerce maturity model developed by the Nolan Norton Institute for KPMG, these car dealerships were at the Experimentation Stage, or Stage 1, in their adoption and use of electronic commerce (see Figure 3 below) [5]. If they are to migrate to more mature stages then, according to this model, considerable changes will be required. To achieve Stage 2, they will need to develop policies and have more of a deliberate business focus to their web-based activities. Comparatively few of the dealers interviewed seemed to believe that the car industry would ever achieve Stage 3, where transactions are completed on-line, although some thought that the Internet would over time become more and more integrated with their normal business operations.

5.3 Effect on Business
All of those interviewed said that the Internet had had little effect on sales to date, and they felt that this would remain the case in the short term.
Fig. 3. Electronic Commerce Maturity Model [5:11] [figure; a three-stage model: Stage 1 (Ad Hoc Implementation / Experimentation), Stage 2 (Technological Integration) and Stage 3 (Organisational), with discontinuities between the stages]
CY4: "New car enquiries…zero June, zero July, zero August, zero September, October to November 7 [enquiries], December zero, January 1 [enquiry]…So its zero, zero, zero…"

The Internet, they felt, would not be a medium over which actual sales would take place, but might be a medium which would influence buyers in terms of which car dealerships they approached. The actual selling of cars, or at least the final subprocess of selling cars, would thus remain in the hands of the car dealerships' sales persons, but the flow of customers to car dealerships might well in the future be affected by the Internet. Their sentiments very much echoed the trends that were discussed earlier in this paper with respect to the impact of electronic commerce on the American car industry.

CY1: (reporting on a story he had heard about a customer) "…I would never buy a car that way [over the Internet]…I want to smell it!…Computers can not open and shut the doors, I can't smell the trunk, and I can't smell the car…" (Responding to a question about whether salespeople would disappear because of the Internet) "I don't see that happening for a long, long time in Australia." (Responding to whether customers' decision making would be informed by the Internet) "Yeh, yeh, sure, sure I think in time it will be…I think it will be."

However, a particular point which moderates the comments above needs to be made at this juncture. None of the car yards involved in the interviews to date appeared to have any mechanism to detect whether a customer walked in as a result of newspaper advertising, Internet on-line buying services, referral by a satisfied customer, simple serendipity, or the like. Thus, it appeared that none of these businesses had made any effort to find out whether their customers had conducted an initial search over the Web and had thus been motivated to approach the company by information obtained over the Internet.

5.4 Impact on Business Processes
Despite each of the car dealerships having a web site, the business processes of each dealership were not well integrated with the web site. It was not always clear in each dealership just how Internet-based enquiries were to be dealt with, nor exactly who would deal with them. All the car yards had well-established mechanisms for handling new 'walk/phone-in' customer enquiries, but they were somewhat concerned about how to deal equitably, efficiently and effectively with e-mails generated from on-line buying services. It is one thing to enable e-mail links to the organisation: it is quite another to design internal business processes to ensure the e-mail is responded to appropriately. Nor did the dealerships appear to be geared up to keep good commercial records of web activity, or to be ready to measure important statistics such as look-to-buy ratios. Generally speaking, they had not carefully thought out appropriate business processes to handle Internet enquiries.
CY1: "Can I tell you something that would probably surprise you? I don't know."

CY4: "You've got a job to link the Internet activities to your sales people, something that changes the business a bit, with a lot of stuff made through the Internet…How do you monitor your salespeople, and then reward it?"

5.5 Drivers and Inhibitors of Electronic Commerce
By far the major driver of developing a presence of some sort on the Internet was the fear of being left behind by competitors when (and if) 'something happened'. These business people seemed very concerned not to miss out on a business opportunity should one present itself!

CY8: "It's paramount. Anyone who doesn't get into it [electronic commerce] is bloody mad!"

The presence of competitor car dealerships on the Internet is not the only factor driving car dealerships to consider a web presence in some form. The dealerships that sell new as well as used cars are under pressure from the automobile manufacturers to have a web site. This leads to listings of car dealerships on manufacturers' sites, and these listings often include some web pages with advertisements for used cars from the dealership. However, these alternative web sites are not always linked to the electronic marketplace presence of the dealer, illustrating that it is still early days in the utilisation of the Internet and that a well-planned, coordinated approach to the Web has not yet emerged.

By contrast, others expressed suspicion of the concept of on-line buying and Internet-influenced sales, particularly given that the electronic malls or marketplaces being established were such that each dealer was there as one among a crowd, his/her cars part of a larger pool to be searched by customers. The car dealers interviewed were clearly accustomed to regarding fellow dealers as competitors, and many seemed decidedly uncomfortable at participating in the same on-line buying service as their major rivals. However, they also had enough business acumen to realise that, from a customer's perspective, such on-line buying services facilitating cross-dealer searches were an attractive proposition.

CY1: "So, so, I am anti-the Internet because of its over-information in this particular instance of the car business…I don't want anything to do with other Ford dealers or Mitsubishi dealers…I mean…they are my enemy!"

CY9: "I have heard that in the USA, you can find the price that new cars are sold to dealers by the manufacturers on the Internet…I think that's obscene!"
The other major inhibitor to electronic commerce seemed to stem from the difficulty that the senior executives interviewed were experiencing in evaluating their investment in electronic commerce. Most of these executives were looking for some sort of cost/benefit case for their Web and associated investments. However, at this stage, they generally felt that there were few tangible benefits to speak of other than comparatively low-cost advertising, and they typically struggled to appreciate the 'worth' of so-called intangible benefits. This issue certainly contributed to a feeling of uncertainty about the value and impact of the Internet. Nevertheless, none of the dealers interviewed was prepared to remain totally uninvolved until someone else (one of their competitors!) demonstrated the value of electronic commerce investments by deriving considerable business benefits from the Internet.

CY8: "The cost of advertising is bloody expensive. And at the moment, the majority of used car advertising is in real life…predominantly newspaper…One of the big advantages with the Internet for you is the low cost of actually advertising."

5.6 Web Site Design and Management
Most of the dealers had an ad hoc approach to their web sites. Many talked about feeling pressured by IT consultants into developing web sites without much consideration of a planned business initiative to move into electronic commerce. The care and maintenance of the web site was not, in general, professionally planned and managed. In one case, for example, the finance manager had somewhat reluctantly inherited the task of maintaining the web site, without having any apparent great interest or expertise in the area.

CY8: "We've been interviewed by enough web site manufacturers to sink a ship. It's a bit like 'You need to get involved in our web site.' But once you see the sites, you know…there's room there for someone to say 'Give us $700 and we'll come back next week with your web site.' Yeah, OK. There are con-merchants, yes. There are con-merchants."

It was not always clear what policies and standards had been set for presenting individual cars, and the dealership in general, on the web. Sometimes a number of cars in the dealership's stock did not appear on the web site. Of those that did appear, some had photographs of the car concerned and others did not. There appeared to be neither rhyme nor reason to these and other aspects of the web advertisements. There seemed to be a much more professional attitude and practice in dealing with conventional media such as newspapers and radio.
6
Conclusion
The car industry in Western Australia thus remains only superficially affected by the technology and presence of the Internet. The major car dealerships
interviewed for this study have web sites, albeit on electronic malls which seem very much like the forerunners of the powerful Internet buying services of the USA. Some, again like their US counterparts, have web sites connected to manufacturers' sites. However, their Internet site, detached from the energetic, everyday business of selling cars and making money, is the beginning and end of Internet-based commerce for these dealerships. Cars are not selling over the Internet, and, apparently in the eyes of the car dealers, people's buying behaviour is not being influenced by the Internet. Thus, after a small burst of activity in establishing a web site so as to be prepared for electronic commerce, all has gone quiet. CEOs and senior managers are back to the main job of selling cars as usual. They are, however, quietly anxious about what might happen next week, in the next year or so, or in the next decade on the Internet in their industry.

What is needed is some reflective thinking and some action planning with respect to electronic commerce. If the US experience is a guide to what will happen in Australia, then it would seem timely to plan and enact the reengineering of business processes so that the Internet and associated internal systems are well integrated with the selling processes integral to the business. This will include careful monitoring of web site activity and the establishment of procedures to measure Internet-based referrals and Internet-influenced sales; for example, dealers may need to become familiar with electronic commerce metrics such as look-to-buy ratios (an illustrative sketch is given below). The new position of the car dealerships in relation to the Internet buying service companies must be thought through, as must the other new relationships in the industry supply chain. A time of moderate to significant change is approaching, and some business strategy formulation, together with supporting IT and electronic commerce strategies, would be helpful in charting an appropriate path to the future.
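As a purely illustrative aid, and not part of the study itself, the short sketch below shows how a dealership might compute such metrics from simple counts. All figures, the function name and the working definition of a 'look' (a web session that viewed at least one vehicle listing) are assumptions made for the example, not data from the interviews.

```python
# Illustrative sketch only: hypothetical figures, not data from this study.
# Assumption: a "look" is a web session that viewed at least one vehicle listing.

def web_metrics(web_sessions: int, internet_enquiries: int, internet_sales: int) -> dict:
    """Return crude referral and look-to-buy figures for one reporting period."""
    if web_sessions <= 0:
        raise ValueError("need at least one recorded web session")
    return {
        # Share of web sessions that produced a recorded enquiry (a referral rate).
        "enquiry_rate": internet_enquiries / web_sessions,
        # Looks per completed sale (the look-to-buy ratio); infinite if nothing sold.
        "look_to_buy": web_sessions / internet_sales if internet_sales else float("inf"),
    }

# Example with made-up monthly figures.
print(web_metrics(web_sessions=1200, internet_enquiries=18, internet_sales=2))
```

Tracking even such crude ratios over time would let a dealership tell whether its web presence is generating referrals, which none of the dealerships interviewed could currently do.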
References

1. Applegate, L.M., Holsapple, C.W., Kalakota, R., Radermacher, F., and Whinston, A.B. (1996) Electronic commerce: building blocks of new business opportunity. Journal of Organizational Computing and Electronic Commerce, 6(1): 1-10.
2. Choi, S.Y., Stahl, D. and Whinston, A.B. (1998) Intermediation, contracts and micropayments in electronic commerce. Electronic Markets, 8(1): 20-22.
3. Davidow, W.H. and Malone, M.S. (1992) The Virtual Corporation. HarperCollins, New York.
4. Hoffman, D., Novak, T.P., and Chatterjee, P. (1999) Commercial scenarios for the web: opportunities and challenges. Journal of Computer Mediated Communications, 1(3). URL http://www.ascusc.org/jcmc/vol1/issue3/hoffman.htm
5. KPMG (1999) Electronic Commerce: The Future is Here! URL www.kpmg.com.au
6. Kvale, S. (1996) InterViews: an Introduction to Qualitative Research Interviewing. Sage, Thousand Oaks.
7. Laudon, K.C. and Laudon, J.P. (1999) Management Information Systems: Organization and Technology in the Networked Enterprise. 6th ed. Prentice Hall, New Jersey.
8. Lawrence, E., Corbitt, B., Tidwell, A., Fisher, J. and Lawrence, J.R. (1998) Internet Commerce: Digital Models for Business. Wiley, Brisbane.
9. Maruca, R.F. (1999) Retailing: confronting the challenges that face bricks-and-mortar stores. Harvard Business Review, 77(4): 159-168.
10. McKay, J. (1999) The Application of Soft Management Science/Operational Research to Information Requirements Analysis: a Study using Cognitive Mapping and the SODA Methodology. Unpublished PhD thesis (under preparation), Edith Cowan University.
11. McQuivey, J.L. (1999) Used Car Buying Builds Steam. The Forrester Brief, March 1999. URL http://www.Forrester.com
12. McQuivey, J.L., Delhagen, K. and Ardito, C. (1999) New-Car Buying Races On-line. The Forrester Report, 1(10), January 1999. URL http://www.Forrester.com
13. McQuivey, J.L., Delhagen, K., Levin, K. and LaTour Kadison, M. (1998) Retail's Growth Spiral. The Forrester Report, 1(8), November 1998. URL http://www.Forrester.com
14. Ohlson, K. (1999) 40% of new-car shoppers use net for research. Computerworld Online News, 08/23/99.
15. Reynolds, J. (1987) The Internet as a strategic resource: evidence from the European retail sector. In L. Willcocks et al. (eds.) Managing IT as a Strategic Resource. McGraw Hill, London.
16. Robinson, K.G. (1999) E-commerce: sales and demand suppression. Telecommunications Reports Journal, 3(1): 17-20.
17. Salnoske, K. (1999) E-commerce in 1999: what to look for. Telecommunications Reports Journal, 3(1): 45-49.
18. Scharl, A. and Brandtweiner, R. (1998) A conceptual research framework for analyzing the evolution of electronic markets. Electronic Markets, 8(2): 39-42.
19. Stair, R.M. and Reynolds, G.W. (1999) Principles of Information Systems. 4th ed. Course Technology, Cambridge, MA.
20. Von Versen, K. (1999) The three commercial functions of the internet. In B. Buchet et al. (eds.) EC in the Insurance Industry, Electronic Markets, 8(4), 02/99.
21. Whiteley, D. (1999) Internet commerce – hot cakes and dead ducks. In F. Sudweeks and C.T. Romm (eds.) Doing Business on the Internet: Opportunities and Pitfalls. Springer, London.
Author Index

Baker, D.  247
Bauer, T.  94
Behrens, T.  401
Bussler, C.J.  431
Cachero, C.  79
Casati, F.  13
Chabre-Peccoud, M.  417
Chiu, D.  110
Cichocki, A.  247
Dadam, P.  94
Daneva, M.  309
Dedene, G.  387
Dupuy, S.  417
Ebner, G.  355
Embury, S.M.  47
Erni, A.  64
Farshchian, B.A.  264
Fishlock, N.  47
Fünffinger, M.  294
Georgakopoulos, D.  247
Geppert, A.  325
Gray, W.A.  47
Gómez, J.  79
Grosz, G.  176
Ho, J.  32
ter Hofstede, A.H.M.  431
Ilnicki, S.  13
Jarke, M.  446
Jin, L.  13
Johannesson, P.  212
Kaindl, H.  7, 355
Karlapalem, K.  110
Kiepuszewski, B.  431
Kilpi, T.  498
Klingemann, J.  126
Knublauch, H.  294
Koubarakis, M.  142
Krishnamoorthy, V.  13
Krumbholz, M.  279
Law, K.  232
Ledru, Y.  417
Li, Q.  110
Liu, D.  232
López, J-R.  369
Loucopoulos, P.  176
Lu, J.  32
Maiden, N.A.M.  279
Marshall, P.  509
McKay, J.  509
Mylopoulos, J.  7, 32
Norrie, M.C.  64
Obermair, W.  462
Olivé, A.  369
Pastor, O.  79
Perjons, E.  212
Persson, A.  176
Plexousakis, D.  142
Poels, G.  387
Prekas, N.  176
Quix, C.  446
van Rein, R.  192
Richards, S.  401
Rolland, C.  176
Rose, T.  294
Rupprecht, C.  294
Schrefl, M.  462
Schuster, H.  247
Shan, M-C.  13
Shao, J.  47
Signer, B.  64
Skiadopoulos, S.  482
Sor, R.  509
Stirna, J.  176
Tombros, D.  325
Tsichritzis, D.  1
Vassiliadis, P.  446, 482
Vassiliou, Y.  446
Viaene, S.  387
Wiederhold, G.  232
Wohed, P.  157
Zhang, Z.  340